Deep CNN Architectures for Medical and General Image Recognition

Comparative Analysis with AI-Enhanced Perspectives

Authors

  • Wang Ruiting, Khant Aung Chain, Tao Jingchu, Asser Tawfik, Yuan Chengwei School of Computing and IT, Taylor’s University (UWE Dual Awards Programme) Subang Jaya, Malaysia

DOI:

https://doi.org/10.54938/ijemdcsai.2026.04.2.634

Keywords:

convolutional neural networks, image recognition, ResNet, VGGNet, GoogLeNet, medical imaging, brain tumor classification, Vision Transformers, deep learning

Abstract

In image recognition, using Convolutional Neural Networks (CNNs), a computer can be trained to recognize images by learning in a hierarchical way from pixel information. In this paper, three CNN landmarks, VGGNet, ResNet and GoogLeNet, are studied and analyzed in terms of forward propagation, categorical cross-entropy loss functions, and gradient flow in back-propagation. In the field of medical imaging, an example of ResNet implementation is used for classification of brain tumors in MRI images. We also delve into the ways in which current AI developments[17-19], such as Vision Transformers, self-supervised learning, neural architecture search, and multimodal foundation models, are reshaping image recognition from traditional CNN methods.

Downloads

Published

2026-06-23

How to Cite

Tao Jingchu, Asser Tawfik, Yuan Chengwei, W. R. K. A. C. . (2026). Deep CNN Architectures for Medical and General Image Recognition: Comparative Analysis with AI-Enhanced Perspectives. International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence, 4(2). https://doi.org/10.54938/ijemdcsai.2026.04.2.634

Issue

Section

Research Article

Categories