Research Papers

ARXIV Cancer: oral cancer Method: RPA

Novel Architecture of RPA In Oral Cancer Lesion Detection

Revana Magdy, Joy Naoum, Ali Hamdi
Published 2026-03-11 16:12

This study focuses on the development of two RPA implementations, OC-RPAv1 and OC-RPAv2, for the detection of oral cancer lesions. The results indicate that OC-RPAv2 significantly improves prediction efficiency, processing images in an average of 0.06 seconds compared to 0.29 seconds for OC-RPAv1. The findings suggest that design patterns and batch processing can enhance the scalability and cost-effectiveness of oral cancer detection methods.

Read abstract

Accurate and early detection of oral cancer lesions is crucial for effective diagnosis and treatment. This study evaluates two RPA implementations, OC-RPAv1 and OC-RPAv2, using a test set of 31 images. OC-RPAv1 processes one image per prediction in an average of 0.29 seconds, while OCRPAv2 employs a Singleton design pattern and batch processing, reducing prediction time to just 0.06 seconds per image. This represents a 60-100x efficiency improvement over standard RPA methods, showcasing that design patterns and batch processing can enhance scalability and reduce costs in oral cancer detection

ARXIV Cancer: general cancer Method: large language model

PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation

Yuchen Liu, Wenbo Zhang, Liling Peng, Yichi Zhang, Yu Fu, Xin Guo, Chao Qu, Yuan Qi, Le Xue
Published 2026-03-11 09:08

This paper presents PET-F2I-41K, a benchmark for generating diagnostic impressions from PET/CT reports using large language models (LLMs). The study evaluates 27 models, including proprietary and open-source LLMs, and introduces a fine-tuned model, PET-F2I-7B, which demonstrates significant improvements in generating accurate and reliable diagnostic impressions. The authors propose new metrics for assessing the quality of generated reports, highlighting the need for better performance in clinical settings.

Read abstract

PET/CT imaging is pivotal in oncology and nuclear medicine, yet summarizing complex findings into precise diagnostic impressions is labor-intensive. While LLMs have shown promise in medical text generation, their capability in the highly specialized domain of PET/CT remains underexplored. We introduce PET-F2I-41K (PET Findings-to-Impression Benchmark), a large-scale benchmark for PET/CT impression generation using LLMs, constructed from over 41k real-world reports. Using PET-F2I-41K, we conduct a comprehensive evaluation of 27 models across proprietary frontier LLMs, open-source generalist models, and medical-domain LLMs, and we develop a domain-adapted 7B model (PET-F2I-7B) fine-tuned from Qwen2.5-7B-Instruct via LoRA. Beyond standard NLG metrics (e.g., BLEU-4, ROUGE-L, BERTScore), we propose three clinically grounded metrics - Entity Coverage Rate (ECR), Uncovered Entity Rate (UER), and Factual Consistency Rate (FCR) - to assess diagnostic completeness and factual reliability. Experiments reveal that neither frontier nor medical-domain LLMs perform adequately in zero-shot settings. In contrast, PET-F2I-7B achieves substantial gains (e.g., 0.708 BLEU-4) and a 3.0x improvement in entity coverage over the strongest baseline, while offering advantages in cost, latency, and privacy. Beyond this modeling contribution, PET-F2I-41K establishes a standardized evaluation framework to accelerate the development of reliable and clinically deployable reporting systems for PET/CT.

ARXIV Cancer: general cancer Method: hypernetworks

Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis

Pei Liu, Xiangxiang Zeng, Tengfei Ma, Yucheng Xing, Xuanbai Ren, Yiping Liu
Published 2026-03-11 08:30

This paper presents Sparse Task Vector Mixup with Hypernetworks (STEPH), a novel approach for improving prognosis estimation in cancer patients using Whole-Slide Images (WSIs). The method addresses the challenges of limited training samples and high heterogeneity in tumor samples by efficiently transferring knowledge from multiple cancer types without the need for large-scale joint training. Experimental results demonstrate that STEPH outperforms traditional cancer-specific learning and existing knowledge transfer methods in terms of accuracy and computational efficiency.

Read abstract

Whole-Slide Images (WSIs) are widely used for estimating the prognosis of cancer patients. Current studies generally follow a cancer-specific learning paradigm. However, the available training samples for one cancer type are usually scarce in pathology. Consequently, the model often struggles to learn generalizable knowledge, thus performing worse on the tumor samples with inherent high heterogeneity. Although multi-cancer joint learning and knowledge transfer approaches have been explored recently to address it, they either rely on large-scale joint training or extensive inference across multiple models, posing new challenges in computational efficiency. To this end, this paper proposes a new scheme, Sparse Task Vector Mixup with Hypernetworks (STEPH). Unlike previous ones, it efficiently absorbs generalizable knowledge from other cancers for the target via model merging: i) applying task vector mixup to each source-target pair and then ii) sparsely aggregating task vector mixtures to obtain an improved target model, driven by hypernetworks. Extensive experiments on 13 cancer datasets show that STEPH improves over cancer-specific learning and an existing knowledge transfer baseline by 5.14% and 2.01%, respectively. Moreover, it is a more efficient solution for learning prognostic knowledge from other cancers, without requiring large-scale joint training or extensive multi-model inference. Code is publicly available at https://github.com/liupei101/STEPH.

ARXIV Cancer: general cancer Method: In-Context Diffusion Transformer

Layout-Guided Controllable Pathology Image Generation with In-Context Diffusion Transformers

Yuntao Shou, Xiangyong Cao, Qian Zhao, Deyu Meng
Published 2026-03-11 06:14

This paper presents a novel approach for controllable pathology image synthesis using In-Context Diffusion Transformers (IC-DiT). The method integrates spatial layouts, textual descriptions, and visual embeddings to enhance the generation of high-fidelity pathology images. Extensive experiments demonstrate that IC-DiT outperforms existing models in terms of spatial controllability and diagnostic consistency, making it a valuable resource for cancer classification and survival analysis.

Read abstract

Controllable pathology image synthesis requires reliable regulation of spatial layout, tissue morphology, and semantic detail. However, existing text-guided diffusion models offer only coarse global control and lack the ability to enforce fine-grained structural constraints. Progress is further limited by the absence of large datasets that pair patch-level spatial layouts with detailed diagnostic descriptions, since generating such annotations for gigapixel whole-slide images is prohibitively time-consuming for human experts. To overcome these challenges, we first develop a scalable multi-agent LVLM annotation framework that integrates image description, diagnostic step extraction, and automatic quality judgment into a coordinated pipeline, and we evaluate the reliability of the system through a human verification process. This framework enables efficient construction of fine-grained and clinically aligned supervision at scale. Building on the curated data, we propose In-Context Diffusion Transformer (IC-DiT), a layout-aware generative model that incorporates spatial layouts, textual descriptions, and visual embeddings into a unified diffusion transformer. Through hierarchical multimodal attention, IC-DiT maintains global semantic coherence while accurately preserving structural and morphological details. Extensive experiments on five histopathology datasets show that IC-DiT achieves higher fidelity, stronger spatial controllability, and better diagnostic consistency than existing methods. In addition, the generated images serve as effective data augmentation resources for downstream tasks such as cancer classification and survival analysis.

ARXIV Cancer: colorectal liver metastases Method: radiomics

An Automated Radiomics Framework for Postoperative Survival Prediction in Colorectal Liver Metastases using Preoperative MRI

Muhammad Alberb, Jianan Chen, Hossam El-rewaidy, Paul Karanicolas, Arun Seth, Yutaka Amemiya, Anne Martel, Helen Cheung
Published 2026-03-10 20:34

This study presents an automated AI-based framework for predicting postoperative survival in patients with colorectal liver metastases (CRLM) using preoperative MRI. The framework includes an anatomy-aware segmentation pipeline and a radiomics pipeline, which together facilitate the extraction of tumor features and survival prediction. The proposed method, SurvAMINN, utilizes an autoencoder-based approach for time-to-event survival prediction, achieving a C-index of 0.69. The results indicate the effectiveness of combining segmentation algorithms with radiomics for accurate survival outcome predictions.

Read abstract

While colorectal liver metastasis (CRLM) is potentially curable via hepatectomy, patient outcomes remain highly heterogeneous. Postoperative survival prediction is necessary to avoid non-beneficial surgeries and guide personalized therapy. In this study, we present an automated AI-based framework for postoperative CRLM survival prediction using pre- and post-contrast MRI. We performed a retrospective study of 227 CRLM patients who had gadoxetate-enhanced MRI prior to curative-intent hepatectomy between 2013 and 2020. We developed a survival prediction framework comprising an anatomy-aware segmentation pipeline followed by a radiomics pipeline. The segmentation pipeline learns liver, CRLMs, and spleen segmentation from partially-annotated data, leveraging promptable foundation models to generate pseudo-labels. To support this pipeline, we propose SAMONAI, a prompt propagation algorithm that extends Segment Anything Model to 3D point-based segmentation. Predicted pre- and post-contrast segmentations are then fed into our radiomics pipeline, which extracts per-tumor features and predicts survival using SurvAMINN, an autoencoder-based multiple instance neural network for time-to-event survival prediction. SurvAMINN jointly learns dimensionality reduction and survival prediction from right-censored data, emphasizing high-risk metastases. We compared our framework against established methods and biomarkers using univariate and multivariate Cox regression. Our segmentation pipeline achieves median Dice scores of 0.96 (liver) and 0.93 (spleen), driving a CRLM segmentation Dice score of 0.78 and a detection F1-score of 0.79. Accurate segmentation enables our radiomics pipeline to achieve a survival prediction C-index of 0.69. Our results show the potential of integrating segmentation algorithms with radiomics-based survival analysis to deliver accurate and automated CRLM outcome prediction.

ARXIV Cancer: prostate cancer Method: multiple instance learning

Leveraging whole slide difficulty in Multiple Instance Learning to improve prostate cancer grading

Marie Arrivat, Rémy Peyret, Elsa Angelini, Pietro Gori
Published 2026-03-10 17:49

This paper introduces the concept of Whole Slide Difficulty (WSD) to enhance Multiple Instance Learning (MIL) for the grading of prostate cancer. By addressing the discrepancies in diagnoses between expert and non-expert pathologists, the authors propose two methods that utilize WSD: a multi-task approach and a weighted classification loss approach. The results indicate that incorporating WSD during training significantly improves classification performance, especially for higher Gleason grades.

Read abstract

Multiple Instance Learning (MIL) has been widely applied in histopathology to classify Whole Slide Images (WSIs) with slide-level diagnoses. While the ground truth is established by expert pathologists, the slides can be difficult to diagnose for non-experts and lead to disagreements between the annotators. In this paper, we introduce the notion of Whole Slide Difficulty (WSD), based on the disagreement between an expert and a non-expert pathologist. We propose two different methods to leverage WSD, a multi-task approach and a weighted classification loss approach, and we apply them to Gleason grading of prostate cancer slides. Results show that integrating WSD during training consistently improves the classification performance across different feature encoders and MIL methods, particularly for higher Gleason grades (i.e. worse diagnosis).

ARXIV Cancer: liver cancer Method: unsupervised domain adaptation

Unsupervised Domain Adaptation with Target-Only Margin Disparity Discrepancy

Gauthier Miralles, Loïc Le Folgoc, Vincent Jugnon, Pietro Gori
Published 2026-03-10 17:27

This study addresses the challenge of limited annotated data in interventional Cone-Beam Computed Tomography (CBCT) by employing unsupervised domain adaptation techniques. The proposed method utilizes a proprietary collection of unannotated CBCT scans alongside annotated CT data to enhance liver segmentation performance. Experimental results indicate that the novel framework based on Margin Disparity Discrepancy achieves state-of-the-art results in both domain adaptation and few-shot scenarios.

Read abstract

In interventional radiology, Cone-Beam Computed Tomography (CBCT) is a helpful imaging modality that provides guidance to practicians during minimally invasive procedures. CBCT differs from traditional Computed Tomography (CT) due to its limited reconstructed field of view, specific artefacts, and the intra-arterial administration of contrast medium. While CT benefits from abundant publicly available annotated datasets, interventional CBCT data remain scarce and largely unannotated, with existing datasets focused primarily on radiotherapy applications. To address this limitation, we leverage a proprietary collection of unannotated interventional CBCT scans in conjunction with annotated CT data, employing domain adaptation techniques to bridge the modality gap and enhance liver segmentation performance on CBCT. We propose a novel unsupervised domain adaptation (UDA) framework based on the formalism of Margin Disparity Discrepancy (MDD), which improves target domain performance through a reformulation of the original MDD optimization framework. Experimental results on CT and CBCT datasets for liver segmentation demonstrate that our method achieves state-of-the-art performance in UDA, as well as in the few-shot setting.

ARXIV Cancer: osteosarcoma Method: convolutional neural network

A Computer-aided Framework for Detecting Osteosarcoma in Computed Tomography Scans

Maximo Rodriguez-Herrero, Dante D. Sanchez-Gallegos, Marco Antonio Núñez-Gaona, Heriberto Aguirre-Meneses, Luis Alberto Villalvazo Gutiérrez, Mario Ibrahin Gutiérrez Velasco, J. L. Gonzalez-Compean, Jesus Carretero
Published 2026-03-10 11:45

This study presents a computer-aided framework aimed at automating the diagnosis of osteosarcoma using computed tomography (CT) scans. The framework incorporates preprocessing, detection, postprocessing, and visualization steps, utilizing various convolutional neural network (CNN) models. Evaluation on 12 patients demonstrated the framework's effectiveness, achieving an area under the curve (AUC) of 94.8% and a specificity of 94.6%.

Read abstract

Osteosarcoma is the most common primary bone cancer, mainly affecting the youngest and oldest populations. Its detection at early stages is crucial to reduce the probability of developing bone metastasis. In this context, accurate and fast diagnosis is essential to help physicians during the prognosis process. The research goal is to automate the diagnosis of osteosarcoma through a pipeline that includes the preprocessing, detection, postprocessing, and visualization of computed tomography (CT) scans. Thus, this paper presents a machine learning and visualization framework for classifying CT scans using different convolutional neural network (CNN) models. Preprocessing includes data augmentation and identification of the region of interest in scans. Post-processing includes data visualization to render a 3D bone model that highlights the affected area. An evaluation on 12 patients revealed the effectiveness of our framework, obtaining an area under the curve (AUC) of 94.8\% and a specificity of 94.6\%.

ARXIV Cancer: lung cancer Method: automated algorithm

Association of Progressive PPFE and Mortality in Lung Cancer Screening Cohorts

Shahab Aslani, Mehran Azimbagirad, Daryl Cheng, Daisuke Yamada, Ryoko Egashira, Adam Szmul, Justine Chan-Fook, Robert Chapman, Alfred Chung Pui So, Shanshan Wang, John McCabe, Tianqi Yang, Jose M Brenes, Eyjolfur Gudmundsson, The SUMMIT Consortium, Susan M. Astley, Daniel C. Alexander, Sam M. Janes, Joseph Jacob
Published 2026-03-10 11:37

This study investigates the association between progressive pleuroparenchymal fibroelastosis (PPFE) and mortality in lung cancer screening cohorts. Using longitudinal low-dose CT scans and clinical data, an automated algorithm quantified PPFE volume and assessed its progression. The findings indicate that progressive PPFE is independently associated with increased mortality and adverse clinical outcomes in lung cancer screening populations.

Read abstract

Background: Pleuroparenchymal fibroelastosis (PPFE) is an upper lobe predominant fibrotic lung abnormality associated with increased mortality in established interstitial lung disease. However, the clinical significance of radiologic PPFE progression in lung cancer screening (LCS) populations remains unclear. Methods: We analysed longitudinal low-dose CT scans and clinical data from two LCS studies: National Lung Screening Trial (NLST; n=7,980); SUMMIT study (n=8,561). An automated algorithm quantified PPFE volume on baseline and follow-up scans. Annualised change in PPFE was derived and dichotomised using a distribution-based threshold to define progressive PPFE. Associations between progressive PPFE and mortality were evaluated using Cox proportional hazards models adjusted for demographic and clinical variables. In SUMMIT cohort, associations between progressive PPFE and clinical outcomes were assessed using incidence rate ratios (IRR) and odds ratios (OR). Findings: Progressive PPFE independently associated with mortality in both LCS cohorts (NLST: Hazard Ratio (HR)=1.25, 95% Confidence Interval (CI): 1.01--1.56, p=0.042; SUMMIT: HR=3.14, 95% CI: 1.66--5.97, p<0.001). Within SUMMIT, progressive PPFE was strongly associated with higher respiratory admissions (IRR=2.79, p<0.001), increased antibiotic and steroid use (IRR=1.55, p=0.010), and showed a trend towards higher modified medical research council scores (OR=1.40, p=0.055). Interpretation: Radiologic PPFE progression independently associates with mortality across two large LCS cohorts, and associates with adverse clinical outcomes. Quantitative assessment of PPFE progression may provide a clinically relevant imaging biomarker to identify individuals at increased risk of respiratory morbidity within LCS programmes.

ARXIV Cancer: esophageal cancer Method: deep learning

A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation

Yoon Jo Kim, Wonyoung Cho, Jongmin Lee, Han Joo Chae, Hyunki Park, Sang Hoon Seo, Noh Jae Myung, Kyungmi Yang, Dongryul Oh, Jin Sung Kim
Published 2026-03-10 10:00

This paper presents OncoAgent, a guideline-aware AI agent designed for zero-shot delineation of clinical target volumes in radiotherapy. The framework converts textual clinical guidelines into three-dimensional target contours without the need for retraining, achieving high performance on esophageal cancer cases. The results indicate that OncoAgent is preferred by physicians over traditional supervised methods, demonstrating its effectiveness and adaptability in clinical settings.

Read abstract

Delineating the clinical target volume (CTV) in radiotherapy involves complex margins constrained by tumor location and anatomical barriers. While deep learning models automate this process, their rigid reliance on expert-annotated data requires costly retraining whenever clinical guidelines update. To overcome this limitation, we introduce OncoAgent, a novel guideline-aware AI agent framework that seamlessly converts textual clinical guidelines into three-dimensional target contours in a training-free manner. Evaluated on esophageal cancer cases, the agent achieves a zero-shot Dice similarity coefficient of 0.842 for the CTV and 0.880 for the planning target volume, demonstrating performance highly comparable to a fully supervised nnU-Net baseline. Notably, in a blinded clinical evaluation, physicians strongly preferred OncoAgent over the supervised baseline, rating it higher in guideline compliance, modification effort, and clinical acceptability. Furthermore, the framework generalizes zero-shot to alternative esophageal guidelines and other anatomical sites (e.g., prostate) without any retraining. Beyond mere volumetric overlap, our agent-based paradigm offers near-instantaneous adaptability to alternative guidelines, providing a scalable and transparent pathway toward interpretability in radiotherapy treatment planning.

Find the papers that actually matter