A Comprehensive Review of Convolutional Neural Network Architectures and Evolution

Swarnali Kundu; Biswasri Datta; Tousif Parvej; Pritam Pal; Fakruddin Ali Ahmed

A Comprehensive Review of Convolutional Neural Network Architectures and Evolution

Swarnali Kundu, Biswasri Datta, Tousif Parvej, Pritam Pal, Fakruddin Ali Ahmed | International Journal of Image Processing and Pattern Recognition | Vol 12, Issue 1 | pp. 32-40 | ISSN: 2456-6985

Abstract

Convolutional Neural Networks (CNNs) have become a foundational deep learning framework in computer vision because they can automatically extract layered, increasingly complex features directly from raw image data. CNNs use convolutional, pooling, and activation layers to extract spatial patterns from basic edges to intricate object components, drawing inspiration from the human visual cortex. An overview of CNNs’ basic architecture is given in this study, along with an explanation of important elements such kernels, convolution processes, pooling strategies, activation functions, and fully connected layers. To show how deep learning architectures have evolved in terms of depth, computational efficiency, and feature extraction capabilities, classic CNN models such as LeNet, AlexNet, VGGNet, and GoogLeNet are examined. Advances in large-scale picture categorization and recognition problems have been greatly aided by these structures. CNNs still have drawbacks despite their effectiveness, including high processing costs, the need for large labeled datasets, and interpretability issues. In order to serve real-time and embedded applications, future developments are anticipated to concentrate on lightweight architectures, increased model transparency, and improved training efficiency. All things considered, CNNs continue to be crucial to contemporary artificial intelligence research because they let machines to process and comprehend visual data with ever-increasing precision and resilience

Keywords

deep learning, convolutional neural network (CNN), CNN architecture, CNN models

🔒 This is a subscription article

Full text is available to subscribers and institutional members. Please choose an option below to access it.

Subscribe Purchase this article Institutional / Login access

References

Hubel DH, Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J Physiol. 1962;160:106–154.
Fukushima K. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern. 1980 Apr;36(4):193–202.
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, et al. Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989 Dec;1(4):541–551.
LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 2002 Aug;86(11):2278–2324.
Karayiannis N, Venetsanopoulos AN. Artificial neural networks: Learning algorithms, performance evaluation, and applications. 1st ed. New York (NY): Springer Science & Business Media; 2013. p. 1–450.
Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Commun ACM. 2017 May;60(6):84–90.
Li LJ, Su H, Lim Y, Fei-Fei L. Object bank: An object-level image representation for high-level visual recognition. Int J Comput Vis. 2014 Mar;107(1):20–39.
Jarrett K, Kavukcuoglu K, Ranzato MA, LeCun Y. What is the best multi-stage architecture for object recognition? In: Proceedings of the 2009 IEEE 12th International Conference on Computer Vision; 2009 Sep 29–Oct 2; Kyoto, Japan. IEEE; 2009. p. 2146–2153.
Krizhevsky A, Hinton G. Convolutional deep belief networks on CIFAR-10. Unpublished manuscript. 2010 Aug;40(7):1–9.
Zeiler MD, Taylor GW, Fergus R. Adaptive deconvolutional networks for mid and high level feature learning. In: Proceedings of the 2011 International Conference on Computer Vision; 2011 Nov 6–13; Barcelona, Spain. IEEE; 2011. p. 2018–2025.
Shin JS, Ma J, Choi SJ, Kim S, Hong M. Development of a deep learning model for predicting speech audiometry using pure-tone audiometry data. Appl Sci (Basel). 2024 Oct 15;14(20):9379.
Shawky NE. Convolutional neural network and its applications in artificial intelligence. J ACS Adv Comput Sci. 2021 Jun;12(1):10–26.
Balas VE, Kumar R, Srivastava R, editors. Recent trends and advances in artificial intelligence and internet of things. 1st ed. Cham (CH): Springer International Publishing; 2020. p. 1–520.

How to cite this article

APA

Kundu, S., Datta, B., Parvej, T., Pal, P., & Ahmed, F. A. (2026). A Comprehensive Review of Convolutional Neural Network Architectures and Evolution. International Journal of Image Processing and Pattern Recognition, 12(1), 32-40.

MLA

Kundu, Swarnali, et al. “A Comprehensive Review of Convolutional Neural Network Architectures and Evolution.” International Journal of Image Processing and Pattern Recognition, vol. 12, no. 1, 2026, pp. 32-40.

Chicago

Swarnali Kundu, Biswasri Datta, Tousif Parvej, Pritam Pal, and Fakruddin Ali Ahmed. “A Comprehensive Review of Convolutional Neural Network Architectures and Evolution.” International Journal of Image Processing and Pattern Recognition 12, no. 1 (2026): 32-40.

Vancouver

Kundu S, Datta B, Parvej T, Pal P, Ahmed FA. A Comprehensive Review of Convolutional Neural Network Architectures and Evolution. International Journal of Image Processing and Pattern Recognition. 2026;12(1):32-40.

BibTeX

@article{KunduS2026,
author = {Swarnali Kundu and Biswasri Datta and Tousif Parvej and Pritam Pal and Fakruddin Ali Ahmed},
title = {A Comprehensive Review of Convolutional Neural Network Architectures and Evolution},
journal = {International Journal of Image Processing and Pattern Recognition},
year = {2026},
volume = {12},
number = {1},
pages = {32--40},
issn = {2456-6985},
url = {https://journalspub.com/publication/ijippr/article=26340}
}

► Necessary Cookies Always Active

Necessary cookies enable essential site features like secure log-ins and consent preference adjustments. They do not store personal data.

► Functional Cookies Remark

Functional cookies support features like content sharing on social media, collecting feedback, and enabling third-party tools.

► Analytical Cookies Remark

Analytical cookies track visitor interactions, providing insights on metrics like visitor count, bounce rate, and traffic sources.

► Advertisement Cookies Remark

Advertisement cookies deliver personalized ads based on your previous visits and analyze the effectiveness of ad campaigns.

Swarnali Kundu, Biswasri Datta, Tousif Parvej, Pritam Pal, Fakruddin Ali Ahmed | International Journal of Image Processing and Pattern Recognition | Vol 12, Issue 1 | pp. 32-40 | ISSN: 2456-6985

Abstract

Keywords

deep learning, convolutional neural network (CNN), CNN architecture, CNN models

🔒 This is a subscription article

Full text is available to subscribers and institutional members. Please choose an option below to access it.

Subscribe Purchase this article Institutional / Login access

References

Hubel DH, Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J Physiol. 1962;160:106–154.
Fukushima K. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern. 1980 Apr;36(4):193–202.
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, et al. Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989 Dec;1(4):541–551.
LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 2002 Aug;86(11):2278–2324.
Karayiannis N, Venetsanopoulos AN. Artificial neural networks: Learning algorithms, performance evaluation, and applications. 1st ed. New York (NY): Springer Science & Business Media; 2013. p. 1–450.
Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Commun ACM. 2017 May;60(6):84–90.
Li LJ, Su H, Lim Y, Fei-Fei L. Object bank: An object-level image representation for high-level visual recognition. Int J Comput Vis. 2014 Mar;107(1):20–39.
Jarrett K, Kavukcuoglu K, Ranzato MA, LeCun Y. What is the best multi-stage architecture for object recognition? In: Proceedings of the 2009 IEEE 12th International Conference on Computer Vision; 2009 Sep 29–Oct 2; Kyoto, Japan. IEEE; 2009. p. 2146–2153.
Krizhevsky A, Hinton G. Convolutional deep belief networks on CIFAR-10. Unpublished manuscript. 2010 Aug;40(7):1–9.
Zeiler MD, Taylor GW, Fergus R. Adaptive deconvolutional networks for mid and high level feature learning. In: Proceedings of the 2011 International Conference on Computer Vision; 2011 Nov 6–13; Barcelona, Spain. IEEE; 2011. p. 2018–2025.
Shin JS, Ma J, Choi SJ, Kim S, Hong M. Development of a deep learning model for predicting speech audiometry using pure-tone audiometry data. Appl Sci (Basel). 2024 Oct 15;14(20):9379.
Shawky NE. Convolutional neural network and its applications in artificial intelligence. J ACS Adv Comput Sci. 2021 Jun;12(1):10–26.
Balas VE, Kumar R, Srivastava R, editors. Recent trends and advances in artificial intelligence and internet of things. 1st ed. Cham (CH): Springer International Publishing; 2020. p. 1–520.

How to cite this article

APA

MLA

Chicago

Vancouver

BibTeX

@article{KunduS2026,
author = {Swarnali Kundu and Biswasri Datta and Tousif Parvej and Pritam Pal and Fakruddin Ali Ahmed},
title = {A Comprehensive Review of Convolutional Neural Network Architectures and Evolution},
journal = {International Journal of Image Processing and Pattern Recognition},
year = {2026},
volume = {12},
number = {1},
pages = {32--40},
issn = {2456-6985},
url = {https://journalspub.com/publication/ijippr/article=26340}
}

Swarnali Kundu, Biswasri Datta, Tousif Parvej, Pritam Pal, Fakruddin Ali Ahmed | International Journal of Image Processing and Pattern Recognition | Vol 12, Issue 1 | pp. 32-40 | ISSN: 2456-6985

Abstract

Keywords

deep learning, convolutional neural network (CNN), CNN architecture, CNN models

🔒 This is a subscription article

Full text is available to subscribers and institutional members. Please choose an option below to access it.

Subscribe Purchase this article Institutional / Login access

References

Hubel DH, Wiesel TN. Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J Physiol. 1962;160:106–154.
Fukushima K. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern. 1980 Apr;36(4):193–202.
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, et al. Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989 Dec;1(4):541–551.
LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 2002 Aug;86(11):2278–2324.
Karayiannis N, Venetsanopoulos AN. Artificial neural networks: Learning algorithms, performance evaluation, and applications. 1st ed. New York (NY): Springer Science & Business Media; 2013. p. 1–450.
Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Commun ACM. 2017 May;60(6):84–90.
Li LJ, Su H, Lim Y, Fei-Fei L. Object bank: An object-level image representation for high-level visual recognition. Int J Comput Vis. 2014 Mar;107(1):20–39.
Jarrett K, Kavukcuoglu K, Ranzato MA, LeCun Y. What is the best multi-stage architecture for object recognition? In: Proceedings of the 2009 IEEE 12th International Conference on Computer Vision; 2009 Sep 29–Oct 2; Kyoto, Japan. IEEE; 2009. p. 2146–2153.
Krizhevsky A, Hinton G. Convolutional deep belief networks on CIFAR-10. Unpublished manuscript. 2010 Aug;40(7):1–9.
Zeiler MD, Taylor GW, Fergus R. Adaptive deconvolutional networks for mid and high level feature learning. In: Proceedings of the 2011 International Conference on Computer Vision; 2011 Nov 6–13; Barcelona, Spain. IEEE; 2011. p. 2018–2025.
Shin JS, Ma J, Choi SJ, Kim S, Hong M. Development of a deep learning model for predicting speech audiometry using pure-tone audiometry data. Appl Sci (Basel). 2024 Oct 15;14(20):9379.
Shawky NE. Convolutional neural network and its applications in artificial intelligence. J ACS Adv Comput Sci. 2021 Jun;12(1):10–26.
Balas VE, Kumar R, Srivastava R, editors. Recent trends and advances in artificial intelligence and internet of things. 1st ed. Cham (CH): Springer International Publishing; 2020. p. 1–520.

How to cite this article

APA

MLA

Chicago

Vancouver

BibTeX

@article{KunduS2026,
author = {Swarnali Kundu and Biswasri Datta and Tousif Parvej and Pritam Pal and Fakruddin Ali Ahmed},
title = {A Comprehensive Review of Convolutional Neural Network Architectures and Evolution},
journal = {International Journal of Image Processing and Pattern Recognition},
year = {2026},
volume = {12},
number = {1},
pages = {32--40},
issn = {2456-6985},
url = {https://journalspub.com/publication/ijippr/article=26340}
}

Enter the destination URL

URL

Link Text

Open link in a new tab

Or link to existing content

No search term specified. Showing recent items.