Author(s)

Harshil Trivedi, Hiral Parmar

  • Manuscript ID: 140022
  • Volume: 1
  • Issue: 1
  • Pages: 149–155

Subject Area: Computer Science

DOI: https://doi.org/10.64643/JATIRV1I1-140022-001
Abstract

Image classification has rapidly advanced due to deep learning. This paper surveys and compares core deep learning techniques used for image classification — convolutional neural networks (CNNs), residual networks, efficient architectures, and Vision Transformers (ViTs) — and presents a reproducible experimental framework for fair comparison. We explain architecture design choices, training procedures, common datasets, and evaluation metrics. We propose a set of controlled experiments (data preprocessing, augmentation, training hyperparameters) and provide a baseline implementation template. Finally, we analyze strengths and weaknesses of each approach and give practical recommendations for researchers and practitioners. All text in this manuscript is original.

Keywords
image classificationconvolutional neural networksResNetEfficientNetVision Transformerdata augmentationreproducible experiments