AI-Augmented Convolutional Neural Networks for Image Recognition

Architectural Analysis and ResNet50 Implementation with Modern Deep Learning Perspectives

Authors

  • Nicolas Lee, William Melvin Sukamto, Jovin Maurelio, Jovan Maurelio School of Computing and IT, Taylor’s University (UWE Dual Awards Programme), Subang Jaya, Malaysia

DOI:

https://doi.org/10.54938/ijemdcsai.2026.04.2.636

Keywords:

convolutional neural networks, image recognition, ResNet50, transfer learning, VGGNet, GoogLeNet, Vision Transformers, self-supervised learning, deep learning

Abstract

The Convolutional Neural Network (CNN) is now the backbone of image recognition technology, powering computers to make accurate classifications and understand visual content. In this paper, we do a detailed analysis of the architecture of three popular CNN models namely ResNet, GoogLeNet (Inception v1) and VGGNet, and discuss the forward propagation, loss functions and back propagation mechanisms. We also explain in detail a practical implementation of ResNet50 on the CIFAR-10 dataset, which involves data preparation, building of the model, optimization of training using the Adam optimizer, and evaluation. In addition to this classic analysis, we discuss the latest developments in artificial intelligence paradigms such as Vision Transformers (ViTs), self-supervised learning, neural architecture search (NAS), and multimodal foundation models, which are transforming the image recognition field and supplementing what was already achieved by CNN models.

Downloads

Published

2026-06-24

How to Cite

Jovin Maurelio, Jovan Maurelio, N. L. W. M. S. (2026). AI-Augmented Convolutional Neural Networks for Image Recognition: Architectural Analysis and ResNet50 Implementation with Modern Deep Learning Perspectives. International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence, 4(2). https://doi.org/10.54938/ijemdcsai.2026.04.2.636

Issue

Section

Research Article

Categories