Research Papers

PUBMED Cancer: unknown Method: deep learning

Geospatial insights and artificial intelligence in West Nile virus disease Epidemiology: The Italy case study.

Amienwanlen Eugene Odigie, Angela Stufano, Violetta Iris Vasinioti, Iniobong Chukwuebuka Ikenna Ugochukwu, Cristiana Catella, Francesco Pellegrini, Paolo Capozza, Grazia Greco, Nicola Decaro, Michele Camero, Piero Lovreglio, Annamaria Pratelli, Hari S Iyer, Maria Tempesta
Published 2026-05-01 00:00

This study investigates the application of artificial intelligence, specifically machine learning, in understanding the epidemiology of West Nile virus (WNV) outbreaks in Italy over a decade. A machine learning framework was developed to predict WNV classification based on environmental indices derived from satellite imagery. The results indicated significant spatial autocorrelation and demonstrated the effectiveness of deep neural networks and gradient boosting machines in predicting WNV cases.

Read abstract

Although artificial intelligence, including machine learning offers transformative potential, its use in infectious disease epidemiology has been understudied. Emerging infectious diseases including WNV infection remains a leading cause of mortality globally. The study aimed to provide a decade-long insight into the global trends on WNV outbreaks. The study offers a ML-powered framework to guide preventive strategies. Environmental indices including neighbourhood greenness, forest cover, and temperature were captured using 250m resolution MODIS satellite imagery. Seasonal human, avian and equine WNV surveillance records and other regional indicators for Italy from 2013 to 2023 were retrieved from surveillance repositories. Spatial trends and interactions were evaluated using spatial autocorrelation analyses. Further, a ML architecture was developed for prediction of high or low WNV classification based on satellite imagery of climatic indices (deep neural networks) compared with non-raster variables (gradient boosting machine). Moran's I estimation, confirmed by the Monte Carlo approximation indicated evidence of significant spatial autocorrelation with Moran's I values of 0.0561 (P < 0.00259) and 0.0561 (P < 0.009) respectively while significant clustering was observed in regions of northern Italy. The ML-based deep neural network yielded an accuracy of 80% and 79.7% for NDVI- and LST-based raster inputs, while GBM model isolated key predictors of human WNV confirmed cases with an AUC of 0.835. This study presents a scalable approach for human WNV outbreak prediction incorporating geospatial analyses and artificial intelligence, providing insights on the disease epidemiology and a framework for other emerging infectious disease investigations.

PUBMED Cancer: breast cancer Method: fusion deep learning model

Artificial intelligence model for cardiovascular disease risk prediction in breast cancer patients using electronic health records and computed tomography scans.

Isha Shah, Sarah Lucas, Reece Walsh, Islam Osman, Mohammad Sami Shehata, Rasika Rajapakshe
Published 2026-05-01 00:00

This study presents a fusion deep learning model designed to predict cardiovascular disease (CVD) mortality in breast cancer patients using electronic health records and computed tomography scans. The model was developed and tested on a large cohort of 23,067 patients, achieving an area under the curve (AUC) of 0.946 and an accuracy of 0.93. The results indicate that the fusion approach effectively combines medical images and text for improved predictive performance.

Read abstract

Cardiovascular disease (CVD) is the leading cause of death globally [1] as well as the leading cause of death among cancer survivors [2]. The outcomes of CVD mortality among cancer patients, particularly those with breast cancer, highlight the need for early detection of CVD at the beginning of cancer treatment as cardiotoxicity can also lead to accelerated development of chronic diseases, especially in the presence of risk factors [3]. A fusion deep learning model was developed and tested using computed tomography (CT) scans and electronic health records (EHR) for CVD mortality prediction in breast cancer patients undergoing radiation therapy. The model utilizes computed tomography (CT) scans and electronic health records (EHR) for CVD mortality prediction in breast cancer patients undergoing radiation therapy. A cohort of 23,067 patients consisting of ∼5 million CT slices and ∼600,000 EHR documents was used for the model development and testing. Performance of the model is assessed using the AUC and accuracy at a 95% confidence level. The fusion model achieves an AUC of 0.946 [0.939---0.950], and accuracy of, 0.93 [0.92 - 0.94] at 95% confidence interval (CI). These results show that fusion models can learn versatile representations from medical images and medical text documents and can effectively be combined for tasks like predicting CVD mortality with higher accuracy when employing the appropriate fusion strategy.

PUBMED Cancer: non-small cell lung cancer Method: transformer

Effective use of PROs for survival prediction: Transformer-based modelling in NSCLC patients.

D Dudas, T J Dilling, H Jim, I El Naqa
Published 2026-05-01 00:00

This study investigates the use of transformer-based models to enhance survival prediction in patients with early-stage non-small cell lung cancer (NSCLC) by leveraging patient-reported outcomes (PROs). The model was developed using data from 475 patients who completed the Edmonton Symptom Assessment Scale (ESAS) and demonstrated improved prognostic accuracy compared to traditional modeling approaches. Key symptoms identified for survival prediction included appetite loss, pain, overall well-being, and shortness of breath.

Read abstract

Accurate survival prediction is a key component to patient quality of life (QoL)-centered treatment. It allows, for example, setting reasonable treatment goals or timely palliative care referral. However, clinical estimates are often too optimistic leading to lower patient QoL before death. Patient-reported outcomes (PROs) have proven to be an important survival predictor improving the overall prognostic accuracy. In this study, transformer architecture was explored to leverage PRO trajectories to improve survival accuracy and identify the most prognostic PRO symptoms for survival prediction. We analyzed 475 (cross-validation discovery set: 380; held-out testing set: 95) early-stage non-small cell lung cancer (NSCLC) patients who underwent SBRT treatment and routinely completed the Edmonton Symptom Assessment Scale (ESAS). A transformer-based model was developed to perform longitudinal modeling of overall survival (OS), incorporating PROs collected at multiple post-treatment follow-ups, as well as clinical and demographic variables. The performance of the proposed model was compared to traditional outcome modeling approaches, including univariate and multivariate (time-varying) Cox proportional hazards regression (CoxPH) and joint probability survival modelling. The best-performing transformer model was interpreted using the SHapley Additive exPlanation (SHAP) values, and the most prognostically relevant ESAS symptoms were identified through a backward elimination procedure guided by concordance index (c-index) and area under the ROC curve (AUC). The best-performing transformer model achieved a cross-validated c-index of 0.753 [95 % CI: 0.742-0.764] and an AUC of 0.862 [95 % CI: 0.846-0.878] on the discovery set. On the heldout test set, the model reached a c-index of 0.694 and an AUC of 0.785, evaluated at the last time point. It significantly outperformed both Cox models and the joint probability model. Model interpretation using SHAP values and backward elimination identified appetite loss, pain, overall well-being, and shortness of breath as the most prognostically relevant symptoms for survival prediction. Transformer-based survival models that integrate longitudinal PROs significantly enhance prognostic accuracy in SBRT-treated NSCLC patients. Loss of appetite and pain emerged as the most predictive symptoms, followed by overall wellbeing and shortness of breath. These findings suggest that targeted, symptom-focused PROs tracking could streamline clinical implementation and improve survival estimation in routine oncology care.

PUBMED Cancer: general cancer Method: machine learning

CRISPR-Cas12a biosensing technology advances and applications in precision diagnostics and cancer research.

Ziyu Zang, Jie Chen, Yi Dong, Linlin Chen, Meihua Yang, Meiya Mu, Lirenhui Zhou, Wei Zhang, Guangrong Zou, Chaoxing Liu
Published 2026-05-01 00:00

The paper discusses the advancements in CRISPR-Cas12a biosensing technology and its applications in precision diagnostics and cancer research. It highlights the system's capabilities in detecting various biomarkers, including cancer mutations, and its integration with machine learning for improved accuracy. The technology shows promise for early-stage cancer biomarker detection and offers innovative solutions for both research and clinical applications.

Read abstract

CRISPR-Cas12a has become a versatile biotechnology platform with important applications in biosensing, diagnostics, and precision genome editing. This system is activated by a single crRNA, exhibits target-responsive trans-cleavage activity, and recognizes programmable PAM sequences. These features provide a robust basis for accurate detection of diverse biomarkers. Its detection capabilities include nucleic acid targets such as viral RNA and cancer mutations, as well as non-nucleic acid molecules like exosomes and proteins. Recent advancements have shown significant advantages, including multi-temperature adaptability, rapid kinetics, and compatibility with both DNA and RNA targets. Technical improvements include machine learning-assisted crRNA design for enhanced prediction accuracy and engineered EnAsCas12a variants that overcome conventional PAM restrictions. Notable achievements involve entropy-driven circuits that achieve attomolar-level sensitivity, smartphone-compatible four-channel quantitative detection systems, and streamlined integrated workflows completed within 30 min. Advances in sensor design, such as metal-organic framework encapsulation and high-performance aptamer-based sensors, have further expanded detection capabilities. In oncology research, CRISPR-Cas12a technology provides powerful tools to comprehensively analyze complex molecular networks within the tumor microenvironment (TME) and facilitate ultrasensitive detection of early-stage cancer biomarkers. Additionally, in genome editing, CRISPR-Cas12a enables precise genomic modifications due to distinct repair pathways, versatile delivery methods, and efficient creation of transgenic models. Thus, it expands its functional scope beyond diagnostics. With ongoing development, this technology is expected to evolve into an integrated platform combining TME research, point-of-care cancer diagnostics, and programmable genome engineering, offering innovative solutions for both biomedical research and clinical translation.

PUBMED Cancer: general cancer Method: deep learning

Explainability-Based Optimized Deep Learning in Histopathological Diagnosis of Multiple Cancers and Development of Mobile Application.

Ritu Tandon, Narendra Pal Singh Rathore, Shweta Agrawal, Shruti Sharma
Published 2026-05-01 00:00

This study presents a novel deep learning model, Complementary Residual Retentive Network with Guided Gaussian Combined Arms Algorithm (C2RN2GC2A), aimed at enhancing efficiency and accuracy in histopathological cancer classification. The model integrates residual learning with optimized Gaussian perturbations and employs Layer-wise DeepLIFT Relevance Propagation for interpretability. Testing results indicate high accuracy rates of 98.02% and 98.54% on different datasets, demonstrating significant improvements in diagnostic reliability and clinical transparency.

Read abstract

Histopathological image analysis is critical for cancer diagnosis, yet many existing models suffer from limited interpretability, high computational demands, and suboptimal classification accuracy. To overcome these limitations, we propose a novel model, Complementary Residual Retentive Network with Guided Gaussian Combined Arms Algorithm (C2RN2GC2A), designed to enhance efficiency and accuracy in cancer classification from histopathological images. C2RN2GC2A is a deep learning model that assimilates residual learning with optimized Gaussian perturbations, thus enhancing both feature extraction and working time in classification tasks. The system merges 2GC2A, a metaheuristic optimization approach motivated by military tactics, for the purpose of improving feature selection, truncating training loss, and speeding up convergence. The Two-stage Guided Chaotic Capuchin Algorithm (2GC2A) brings together Gaussian perturbations with a combined arms tactic, which makes it possible to do a good job of both exploring and exploiting the search space for better parameter tuning. In order to achieve interpretability, Layer-wise DeepLIFT Relevance Propagation (LDLRP) is used to delineate the significant areas of the image that have an impact on the classification, thus making the process more transparent and building up clinical trust. LDLRP is a cutting-edge explainable AI technology that grants relevance ratings to input characteristics and thus allows the model to visually demonstrate the most significant regions in histopathological images, thereby facilitating clinical decision-making. Testing on LC25000 produced a remarkable accuracy of 98.02% along with a minuscule training loss of 0.08, besides which there were 13 false positives and 29 false negatives. On BreakHis, the accuracy was 98.54%, and the validation loss was 0.05, with 98 false positives and 112 false negatives. The proposed framework significantly improves diagnostic reliability, classification accuracy, and clinical transparency in multi-cancer histopathological image analysis.

PUBMED Cancer: non-small cell lung cancer Method: unknown

Five-year outcomes of pembrolizumab versus chemotherapy in Chinese patients with non-small-cell lung cancer and programmed cell death ligand 1 tumor proportion score ≥1%: KEYNOTE-042 China study.

Yi-Long Wu, Li Zhang, Yun Fan, JianYing Zhou, Li Zhang, Qing Zhou, Wei Li, ChengPing Hu, GongYan Chen, Xin Zhang, CaiCun Zhou, Carmen González Arenas, Zhenghong Chen, Wen Cheng Yu, Tony S K Mok
Published 2026-05-01 00:00

The KEYNOTE-042 China study evaluated the efficacy of pembrolizumab compared to chemotherapy in Chinese patients with locally advanced or metastatic non-small-cell lung cancer (NSCLC) and varying levels of programmed cell death ligand 1 (PD-L1) expression. After five years of follow-up, pembrolizumab demonstrated improved overall survival (OS) across all PD-L1 tumor proportion score subgroups. The study supports the use of pembrolizumab as a standard treatment for this patient population.

Read abstract

In the phase 3 KEYNOTE-042 China study of participants enrolled in China in the global KEYNOTE-042 (NCT02220894) and China extension (NCT03850444) studies, pembrolizumab improved overall survival (OS) versus chemotherapy in locally advanced or metastatic non-small-cell lung cancer (NSCLC) with programmed cell death ligand 1 (PD-L1) tumor proportion score (TPS) ≥50% (hazard ratio [HR], 0.63; 95% CI, 0.43-0.94), ≥20% (0.66; 0.47-0.92), and ≥1% (0.67; 0.50-0.89). We present outcomes from this study after 5 years of follow-up. Chinese participants with previously untreated locally advanced or metastatic NSCLC with PD-L1 TPS ≥1% without EGFR or ALK alterations were eligible. Participants were randomized 1:1 to pembrolizumab 200 mg every 3 weeks for up to 35 cycles or carboplatin plus paclitaxel or pemetrexed with optional pemetrexed maintenance (nonsquamous only). Primary endpoints were OS in the PD-L1 TPS ≥50%, ≥20%, and ≥1% subgroups. Median follow-up was 63.7 (range, 56.3-72.6) months among 262 participants (pembrolizumab, n = 128; chemotherapy, n = 134) included in this study. Pembrolizumab prolonged OS versus chemotherapy in participants with PD-L1 TPS ≥50% (HR, 0.65; 95% CI, 0.45-0.93), ≥20% (0.67; 0.49-0.91), and ≥1% (0.66; 0.51-0.87). Grade 3 to 5 treatment-related AEs occurred in 19.5% and 68.8% of participants in the pembrolizumab and chemotherapy groups, respectively. In conclusion, after 5 years of follow-up, pembrolizumab continued to demonstrate improved OS versus chemotherapy with manageable safety in Chinese participants with previously untreated locally advanced or metastatic NSCLC that expressed PD-L1. These data further support pembrolizumab monotherapy as a standard of care for these patients.

PUBMED Cancer: brain tumor Method: meta-guided multi-modal learning

MGML: A plug-and-play meta-guided multi-modal learning framework for incomplete multimodal brain tumor segmentation.

Yulong Zou, Bo Liu, Cun-Jing Zheng, Yuan-Ming Geng, Siyue Li, Qiankun Zuo, Shuihua Wang, Yudong Zhang, Jin Hong
Published 2026-05-01 00:00

This paper presents a novel meta-guided multi-modal learning (MGML) framework aimed at improving brain tumor segmentation using incomplete multimodal MRI data. The framework includes a meta-parameterized adaptive modality fusion component and a consistency regularization module, which together enhance the integration of multimodal information and improve segmentation performance. Extensive experiments on the BraTS2020 and BraTS2023 datasets demonstrate that the proposed method outperforms several state-of-the-art techniques.

Read abstract

Leveraging multimodal information from Magnetic Resonance Imaging (MRI) plays a vital role in lesion segmentation, especially for brain tumors. However, in clinical practice, multimodal MRI data are often incomplete, making it challenging to fully utilize the available information. Therefore, maximizing the utilization of this incomplete multimodal information presents a crucial research challenge. We present a novel meta-guided multi-modal learning (MGML) framework that comprises two components: meta-parameterized adaptive modality fusion and consistency regularization module. The meta-parameterized adaptive modality fusion (Meta-AMF) enables the model to effectively integrate information from multiple modalities under varying input conditions. By generating adaptive soft-label supervision signals based on the available modalities, Meta-AMF explicitly promotes more coherent multimodal fusion. In addition, the consistency regularization module enhances segmentation performance and implicitly reinforces the robustness and generalization of the overall framework. Notably, our approach does not alter the original model architecture and can be conveniently integrated into the training pipeline for end-to-end model optimization. We conducted extensive experiments on the public BraTS2020 and BraTS2023 datasets. Compared to multiple state-of-the-art methods from previous years, our method achieved superior performance. On BraTS2020, for the average Dice scores across fifteen missing modality combinations, building upon the baseline, our method obtained scores of 87.55, 79.36, and 62.67 for the whole tumor (WT), the tumor core (TC), and the enhancing tumor (ET), respectively. We have made our source code publicly available at https://github.com/worldlikerr/MGML.

PUBMED Cancer: breast cancer Method: active learning

MGScreener: A multi-view mammography-based model optimized with active learning for breast cancer diagnosis.

Yao Chen, Peiling Wang, Jun Zeng, Miduo Tan, Ke Shan, Libo Nie, Yaxin Xue, Tong Wang
Published 2026-05-01 00:00

This study introduces MGScreener, a multi-view framework designed for breast cancer diagnosis that integrates dual-view mammography images with clinical data. The model employs an active learning strategy to enhance classification tasks, specifically distinguishing between benign and malignant lesions and identifying invasive ductal carcinoma. Results indicate high accuracy rates in both classification tasks, demonstrating the model's potential to improve diagnostic accuracy for radiologists.

Read abstract

Breast cancer remains the leading cause of cancer-related mortality among women worldwide. Early screening and accurate subtype classification are critical for guiding clinical decision-making. In this study, we present MGScreener, an interpretable multi-view framework that integrates dual-view mammography images, including cranial-caudal (CC) and mediolateral oblique (MLO) views, with patient-level clinical data. The model incorporates an active learning strategy to address two key classification tasks: distinguishing benign from malignant breast lesions and identifying invasive ductal carcinoma (IDC) among malignant subtypes. An entropy-based uncertainty sampling method is employed to select highly informative cases from 210 unlabeled samples for prioritized annotation, substantially reducing manual labeling costs. Across two independent test sets, MGScreener-1 achieved an accuracy of 89.7 % and an ROC-AUC of 0.941 for benign-versus-malignant classification (Task 1), while MGScreener-2 achieved an accuracy of 88.2 % and an ROC-AUC of 0.884 for IDC identification (Task 2). With MGScreener assistance, radiologists improved their diagnostic accuracy by more than 10 % compared with independent reading. Overall, MGScreener offers a scalable and effective solution for precise breast cancer screening and molecular subtype classification.

PUBMED Cancer: lung cancer Method: unknown

Understanding Algorithmic Fairness for Clinical Prediction in Terms of Subgroup Net Benefit and Health Equity.

Jose Benitez-Aurioles, Alice Joules, Irene Brusini, Niels Peek, Matthew Sperrin
Published 2026-05-01 00:00

This paper addresses the fairness of clinical prediction models, particularly in relation to how their performance is influenced by protected attributes. The authors propose a new framework for assessing fairness by expanding the concept of net benefit, allowing for a comparison of clinical impacts across different subgroups. They illustrate their approach through the development of two prediction models, one for type 2 diabetes and another for lung cancer screening, highlighting the trade-offs between health equity and other healthcare objectives.

Read abstract

There are concerns about the fairness of clinical prediction models. "Fair" models are defined as those for which their performance or predictions are not inappropriately influenced by protected attributes such as ethnicity, gender, or socioeconomic status. Researchers have raised concerns that current algorithmic fairness paradigms enforce strict egalitarianism in healthcare, leveling down the performance of models in higher-performing subgroups instead of improving it in lower-performing ones. We propose assessing the fairness of a prediction model by expanding the concept of net benefit, using it to quantify and compare the clinical impact of a model in different subgroups. We use this to explore how a model distributes benefits across a population, its impact on health inequalities, and its role in the achievement of health equity. We show how resource constraints might introduce necessary trade-offs between health equity and other objectives of healthcare systems. We showcase our proposed approach with the development of two clinical prediction models: (1) a prognostic type 2 diabetes model used by clinicians to enroll patients into a preventive care lifestyle intervention programme and (2) a lung cancer screening algorithm used to allocate diagnostic scans across the population. This approach helps modelers better understand if a model upholds health equity by considering its performance in a clinical and social context.

PUBMED Cancer: chondrosarcoma Method: vision-language foundation model

Accessible cartilage tumor malignancy prediction via vision-language foundation model adaptation.

Xingxin He, Zachary E Stewart, Marcos R Gonzalez, Yin P Hung, Tara Shirin Ossiani, Yung Hsin Chen, Joseph Oliver Werenski, Ronald W Mercer, Zhaoye Zhou, Kendall Brown, Santiago A Lozano-Calderon, Fang Liu
Published 2026-05-01 00:00

This study aims to predict the malignancy of cartilage tumors using a vision-language foundation model that integrates radiographic images with non-imaging demographic information. A dataset of 3336 radiographs from patients with enchondroma or chondrosarcoma was utilized, and the model achieved an AUC of 0.94 when incorporating demographic data. The approach demonstrates strong performance and offers a non-invasive solution for cartilage tumor assessment in musculoskeletal oncology.

Read abstract

To predict cartilage tumor malignancy from radiographic images combined with readily available non-imaging information based on a vision-language foundation model. This single-institution study assembled a dataset of 3336 radiographs from 455 patients with enchondroma or chondrosarcoma that was assembled from two sources: (1) patients with histopathology-confirmed diagnoses of enchondroma or chondrosarcoma, and (2) patients with imaging-stable enchondroma without biopsy, confirmed through long-term imaging follow-up. An adapted vision-language foundation model based on the pre-trained CLIP (Contrastive Language-Image Pretraining) architecture was fine-tuned with our proposed Medical Knowledge Adapters and evaluated using 10-fold patient-level cross-validation to predict cartilage tumor malignancy from plain radiographs and demographic information. Using radiographs alone, the model achieved an Areas Under the receiver operating characteristic Curve (AUC) of 0.91 ± 0.04. Incorporating demographics improved the AUC to 0.94 ± 0.02. Subgroup analysis demonstrated robust generalizability across tumor grades with an AUC of 0.91 ± 0.07 in distinguishing atypical cartilaginous tumors (ACT) previously known as low grade chondrosarcomas, and 0.95 ± 0.02 in differentiating high-grade chondrosarcomas from enchondromas. Within the clinically challenging extremity subgroup (enchondroma vs ACT/LGCS), the model achieved an AUC of 0.79 ± 0.14, reflecting diagnostic difficulty observed in clinical practice. This foundation model-based approach demonstrates strong performance using accessible data sources, offering a non-invasive, cost-efficient, and scalable solution for cartilage tumor assessment in musculoskeletal oncology.

Find the papers that actually matter