ROBUSTNESS OF DEEP LEARNING MODELS TO DOMAIN SHIFT IN AUTOMATIC MELANOMA DETECTION ON DERMATOSCOPIC IMAGES

Main Article Content

Svitlana YURTSUN
Oleksandr BOGATYREV

Abstract

Introduction. The article addresses the problem of automated melanoma detection in
dermoscopic images using deep learning methods under conditions of limited data availability, which
may contribute to the early diagnosis of this life-threatening disease. The main objective is to explore
ways of improving the generalization ability of deep learning models in classifying data from different
sources. The study utilised two sets of dermatoscopic images from the ISIC Archive in various
combinations. A series of experiments was conducted applying transfer learning with CNNs and the
Vision Transformer (ViT) architecture, a custom convolutional model, and model ensembles.
Particular attention is given to the impact of image preprocessing (removal of black borders,
augmentations) on classification quality, the use of data from other sources only for the minority class
and for all classes, as well as the generalization capability of models on cross-domain data. The
results demonstrated the advantage of transfer learning, while the custom model showed lower
accuracy. Among convolutional neural networks, the EfficientNet architecture achieved the highest
classification performance, while Vision Transformer models demonstrated strong potential,
outperforming most CNN architectures. The study revealed that combining data from different sources
may not improve, but rather decrease classification performance due to the model learning domain
differences instead of medical features. This effect was especially evident when supplementing only the
minority class from external datasets to address class imbalance during training. By removing noise in
the form of black borders characteristic of one dataset, the accuracy of disease recognition was
improved. The results of the study indicate the necessity of thorough data preparation and the
application of specialized strategies for cross-domain training to ensure practical applicability.
Purpose. The aim of the article is to investigate the impact of combining cross-domain data and
image preprocessing on the quality of melanoma classification using deep learning methods.
Results. The study reveals that combining images from different sources often leads not to
improvement but to a decline in performance. Mixing cross-domain data during model training can
introduce systematic classification bias, where the algorithm begins to associate image features not
with disease characteristics but with the domain from which the data originate. As a result, the model
learns to distinguish between data sources rather than actual medical indicators of pathology, which
negatively affects its generalization ability and diagnostic accuracy. Despite the increased overall
volume of data, differences in statistical characteristics between domains (such as imaging types,
lighting conditions, image formats, and class balance) complicate model training and may reduce its
ability to discriminate classes within individual domains. This is reflected in a decline in accuracy and
recall, even when AUC value remains high, indicating that the model cannot effectively separate
classes at realistic thresholds–a critical limitation for clinical use. Conclusion. From a practical perspective, the results of this work highlight the importance of
thorough data preparation and the development of adaptive strategies for cross-domain training.
Despite these challenges, developing models capable of reliably operating on heterogeneous data
remains a promising direction, as this approach most closely reflects the conditions of real-world
medical practice.

Article Details

How to Cite
YURTSUN , S., & BOGATYREV, O. (2025). ROBUSTNESS OF DEEP LEARNING MODELS TO DOMAIN SHIFT IN AUTOMATIC MELANOMA DETECTION ON DERMATOSCOPIC IMAGES. Cherkasy University Bulletin: Applied Mathematics. Informatics, (1). https://doi.org/10.31651/2076-5886-2025-1-4-18
Section
Прикладна математика
Author Biographies

Svitlana YURTSUN , Bohdan Khmelnytsky National University of Cherkasy

Student, Department of Applied Mathematics and Informatics, The Bohdan Khmelnytsky National
University of Cherkasy, Ukraine

Oleksandr BOGATYREV, Bohdan Khmelnytsky National University of Cherkasy

Candidate of Physical and Mathematical Sciences, Associate Professor, Department of Applied
Mathematics and Informatics, The Bohdan Khmelnytsky National University of Cherkasy, Ukraine

References

 Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., et al. (2020). An

image is worth 16x16 words: Transformers for image recognition at scale. International Conference on

Learning Representations. https://arxiv.org/pdf/2010.11929.

 Alijani, S., Fayyad, J., & Najjaran, H. (2024). Vision transformers in domain adaptation and domain

generalization: A study of robustness. Neural Computing and Applications. Springer.

https://doi.org/10.1007/s00521-024-10353-5.

 Flosdorf, C., Engelker, J., Keller, I., & Mohr, N. (2024). Skin cancer detection utilizing deep learning:

Classification of skin lesion images using a vision transformer. arXiv.

https://doi.org/10.48550/arXiv.2407.18554.

 Wu, Y., Chen, B., Zeng, A., Pan, D., Wang, R., & Zhao, S. (2022). Skin cancer classification with deep

learning: A systematic review. Frontiers in Oncology, 12. https://doi.org/10.3389/fonc.2022.893972.

 ISIC Challenge Datasets. (2020). International Skin Imaging Collaboration. https://doi.org/10.34970/2020-

ds01.

 International Skin Imaging Collaboration. (2020). SIIM-ISIC 2020 challenge dataset. https://challenge.isicarchive.com/data/#2020.

 Saini, M., & Susan, S. (2023). Tackling class imbalance in computer vision: A contemporary review.

Artificial Intelligence Review, 56, 1279–1335. https://doi.org/10.1007/s10462-023-10557-6.

 Keras Applications. https://keras.io/2/api/applications/.

 Hosny, K., Kassem, M., & Fouad, M. (2018). Skin cancer classification using deep learning and transfer

learning. In 2018 9th Cairo International Biomedical Engineering Conference (CIBEC), 90–93.

https://doi.org/10.1109/CIBEC.2018.8641762.

 Hugging Face. (n.d.). Brian Dunn: vit-base-patch16-224-in21k_brain_tumor_diagnosis.

https://huggingface.co/DunnBC22/vit-base-patch16-224-in21k_brain_tumor_diagnosis.

 Woo, S., Park, J., Lee, J.-Y., & Kweon, I. (2018). CBAM: Convolutional block attention module. In

Proceedings of the European Conference on Computer Vision (ECCV), 3–19. https://doi.org/10.1007/978-3-

-01234-2_1.