Research Library

Find the papers that actually matter

Search by concept, cancer type, source, or modeling approach. Every result is presented in a cleaner, review-friendly layout with summaries and direct access to the abstract.

Found papers
1201
Matches for your current filters.
Current query
All papers
Semantic ranking when query text is present.
Reset filters

Log in to save searches and build a personal reading queue.

ARXIV Cancer: unknown Method: two-stream deep learning architecture

A two-stream network with global-local feature fusion for bone age assessment

Qiong Lou, Han Yang, Fang Lu
Published 2025-12-20 11:56
This study presents a two-stream deep learning architecture, named BoNet+, for automated bone age assessment (BAA). The model integrates global and local feature extraction channels, utilizing a Transformer module for global features and a RFAConv module for local features. The proposed method demonstrates improved accuracy in BAA, achieving mean absolute errors of 3.81 and 5.65 months on two test datasets, thus reducing clinical workload and enhancing objectivity in assessments.
Read abstract

Bone Age Assessment (BAA) is a widely used clinical technique that can accurately reflect an individual's growth and development level, as well as maturity. In recent years, although deep learning has advanced the field of bone age assessment, existing methods face challenges in efficiently balancing global features and local skeletal details. This study aims to develop an automated bone age assessment system based on a two-stream deep learning architecture to achieve higher accuracy in bone age assessment. We propose the BoNet+ model incorporating global and local feature extraction channels. A Transformer module is introduced into the global feature extraction channel to enhance the ability in extracting global features through multi-head self-attention mechanism. A RFAConv module is incorporated into the local feature extraction channel to generate adaptive attention maps within multiscale receptive fields, enhancing local feature extraction capabilities. Global and local features are concatenated along the channel dimension and optimized by an Inception-V3 network. The proposed method has been validated on the Radiological Society of North America (RSNA) and Radiological Hand Pose Estimation (RHPE) test datasets, achieving mean absolute errors (MAEs) of 3.81 and 5.65 months, respectively. These results are comparable to the state-of-the-art. The BoNet+ model reduces the clinical workload and achieves automatic, high-precision, and more objective bone age assessment.