Comparative Evaluation of Attention Mechanisms Across CNN Architectures and Image Classification Datasets
Convolutional Neural Networks (CNNs) have established themselves as fundamental tools in computer vision, demonstrating strong performance across tasks including image classification, object detection, and medical image analysis. Their key strength lies in the ability to automatically extract hierarchical feature representations from visual inputs while preserving spatial structure. Incorporating attention mechanisms into CNN architectures has led to notable performance gains by enabling networks to focus on the most informative features, inspired by biological visual attention. Among recent attention approaches, Squeeze-and-Excitation (SE), Convolutional Block Attention Module (CBAM), Coordinate Attention (CA), and Efficient Multi-scale Attention (EMA) have shown considerable promise across a range of network designs. This study conducts a systematic comparative evaluation of these four attention mechanisms integrated into four architecturally distinct CNNs (ResNet-18, ResNet-50, EfficientNet-B0, and GoogLeNet), tested on datasets of increasing complexity: MNIST (handwritten digits), CIFAR-10 (natural objects), CIFAR-100 (fine-grained categories), and AppleLeaf9 (plant disease classification). All models were trained under uniform CUDA-accelerated settings and assessed using accuracy or F1-score, chosen according to dataset characteristics. Findings indicate that the benefit of attention mechanisms is closely tied to dataset complexity, yielding substantial improvements on challenging multi-class datasets such as CIFAR-100 while offering limited gains on simpler benchmarks like MNIST. Performance also varied across architectures, with deeper networks and different design paradigms responding distinctly to each attention module. These results offer practical guidance for selecting suitable attention-architecture combinations in both research and applied settings.
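To make the squeeze-excite-scale pattern underlying the evaluated SE module concrete, the following is a minimal NumPy sketch, not the implementation used in this study; the channel count, reduction ratio, and random weights are purely illustrative.

```python
import numpy as np

def se_block(x, w1, b1, w2, b2):
    """Squeeze-and-Excitation applied to one feature map x of shape (C, H, W)."""
    # Squeeze: global average pooling over the spatial dimensions -> (C,)
    z = x.mean(axis=(1, 2))
    # Excitation: bottleneck MLP (ReLU then sigmoid) produces per-channel gates
    h = np.maximum(0.0, w1 @ z + b1)            # (C/r,) hidden descriptor
    s = 1.0 / (1.0 + np.exp(-(w2 @ h + b2)))    # (C,) gates in (0, 1)
    # Scale: reweight each channel map by its learned importance
    return x * s[:, None, None]

# Illustrative example: C = 4 channels, reduction ratio r = 2
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8, 8))
w1, b1 = rng.standard_normal((2, 4)), np.zeros(2)
w2, b2 = rng.standard_normal((4, 2)), np.zeros(4)
y = se_block(x, w1, b1, w2, b2)
```

CBAM extends this idea with a spatial attention map alongside the channel gates, while CA and EMA factorize attention along coordinate directions and scales; all share the same reweight-the-features principle shown above.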