AI-Augmented Convolutional Neural Networks for Image Recognition
Architectural Analysis and ResNet50 Implementation with Modern Deep Learning Perspectives
DOI:
https://doi.org/10.54938/ijemdcsai.2026.04.2.636Keywords:
convolutional neural networks, image recognition, ResNet50, transfer learning, VGGNet, GoogLeNet, Vision Transformers, self-supervised learning, deep learningAbstract
The Convolutional Neural Network (CNN) is now the backbone of image recognition technology, powering computers to make accurate classifications and understand visual content. In this paper, we do a detailed analysis of the architecture of three popular CNN models namely ResNet, GoogLeNet (Inception v1) and VGGNet, and discuss the forward propagation, loss functions and back propagation mechanisms. We also explain in detail a practical implementation of ResNet50 on the CIFAR-10 dataset, which involves data preparation, building of the model, optimization of training using the Adam optimizer, and evaluation. In addition to this classic analysis, we discuss the latest developments in artificial intelligence paradigms such as Vision Transformers (ViTs), self-supervised learning, neural architecture search (NAS), and multimodal foundation models, which are transforming the image recognition field and supplementing what was already achieved by CNN models.
Downloads
Published
How to Cite
License
Copyright (c) 2026 International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence

This work is licensed under a Creative Commons Attribution 4.0 International License.






