Multi-View Thermal Breast Imaging for Malignancy Detection: Performance Benchmarking of CNN, Transformer, and Involution Architectures
No Thumbnail Available
Date
2026
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Science and Business Media Deutschland GmbH
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
Breast cancer screening demands accurate, non-invasive, low-cost tools. Infrared thermography is radiation-free and portable, but its utility hinges on robust computer-aided diagnosis (CAD). We benchmark three deep-learning families for static multi-view breast thermography—CNNs, Transformers, and an involution-based model (HarmonyNet-Lite). Experiments use the Breast Thermography dataset (119 patients; 476 manually segmented ROIs from anterior/oblique views). A compact pipeline performs ROI segmentation, RGB conversion, normalization, resizing, and moderate data augmentation; class imbalance is handled with minority oversampling and class-weighted loss. Evaluation follows patient-stratified five-fold cross-validation. HarmonyNet-Lite yields the best results: accuracy 87.47 ± 2.99%, recall 93.33 ± 2.13%, F1 68.43 ± 8.75%, and precision 54.23 ± 8.94%, indicating high sensitivity with an acceptable trade-off in precision for screening. Among CNNs, ResNet50 is strongest (85.59 ± 3.37%; F1 63.16 ± 3.87%), followed by InceptionV3 (83.38 ± 1.41%; F1 59.99 ± 6.72%), while DenseNet121 lags (79.25 ± 2.98%; F1 52.38 ± 5.62%). Transformer performance is mixed: ViT-Tiny is competitive (84.59 ± 4.23%; F1 59.46 ± 4.68%), whereas Swin-Tiny trails (81.30 ± 2.32%; F1 57.14 ± 4.44%) due to lower precision. Despite using only 0.14 M parameters, HarmonyNet-Lite outperforms heavier CNNs (ResNet50: 23.59 M; InceptionV3: 21.81 M) and lighter Transformers (ViT-Tiny: 2.84 M; Swin-Tiny: 11.78 M), demonstrating that content-adaptive, spatially aware involution operators efficiently capture fine thermal gradients. These findings position HarmonyNet-Lite as a strong, deployable CAD candidate. Future work will pursue multi-center validation, automated segmentation, multi-class labeling, hybrid involution–attention/multimodal models, and controlled GAN-based augmentation to mitigate data scarcity. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
Description
Niramai Health Analytix Pvt Ltd.
Keywords
Breast Thermography, Deep Learning, Harmonynet-Lite
Turkish CoHE Thesis Center URL
Fields of Science
Citation
WoS Q
N/A
Scopus Q
Q3

OpenCitations Citation Count
N/A
Source
Lecture Notes in Computer Science -- 4th International Conference on Artificial Intelligence over Infrared Images for Medical Applications, AIIIMA 2025 -- 2025-11-15 through 2025-11-15 -- Virtual, Online -- 343649
Volume
16308 LNCS
Issue
Start Page
20
End Page
35
PlumX Metrics
Citations
Scopus : 0

