Fine-Grained Image Classification using Particle Swarm Optimization for Hyperparameter Optimization of Convolutional Neural Networks
1Research, Panchvati, 422003, Nashik, India
2Department of Computer Engineering, K K Wagh Institute of Engineering Education and Research, India
*Author to whom correspondence should be addressed:
E-mail: ppvaidya@kkwagh.edu.in (PPV)
E-mail: ppvaidya@kkwagh.edu.in (PPV)
Received: May 26, 2025 | Revised: August 05, 2025 | Accepted: September 02, 2025 | Published: September 2025
Abstract
This study presents a novel approach to fine-grained image classification (FGIC) by employing Particle Swarm Optimization (PSO) for automatic hyperparameter tuning in Convolutional Neural Networks (CNNs). Unlike conventional models with manually fixed architectures, the proposed method optimizes critical parameters—such as filter count, kernel size, pooling size, and stride—tailored to each dataset. The model operates on reduced image resolutions (128×128), yet achieves superior accuracy with significantly fewer training epochs. Specifically, it attains 99.56% accuracy on the Oxford Flowers dataset, 97.45% on Stanford Cars, and 95.95% on Stanford Dogs using only 25 epochs, outperforming deep networks like DenseNet-161 and ResNet-50 trained for 100–150 epochs. This PSO-based tuning framework not only enhances classification performance but also minimizes computational cost, making it a practical and scalable solution for real-world FGIC applications.
Keywords
PSO ; CNN ; Hyperparameters ; Fine grain image classification
Available Repositories
Share Article
Article Metrics
--
Views
--
Downloads
--
Citations
Export Citation
Full Text
References
- 1) X. Yu, Y. Zhao, Y. Gao, and S. Xiong, "MaskCOV: A random mask covariance network for ultra-fine-grained visual categorization," Pattern Recogn., 119 101-112 (2021) doi:10.1016/j.patcog.2021.108067
- 2) M. Zhang, H. Su, and J. Wen, "Classification of flower image based on attention mechanism and multi-loss attention network," Comput Commun., 179 45-59 (2021) doi:10.1016/j.comcom.2021.09.001
- 3) X. Zheng, L. Qi, Y. Ren, and X. Lu, "Fine-Grained Visual Categorization by Localizing Object Parts with Single Image," IEEE Trans Multimed., 23 201-215 (2021) doi:10.1109/TMM.2020.2993960
- 4) F. Zhang, M. Li, G. Zhai, and Y. Liu, "Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization," in Lecture Notes in Comput Sci., 78-89 (2021) doi:10.1007/978-3-030-67832-6_12
- 5) D. Morales, E. Talavera, and B. Remeseiro, "Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques," Neural Comput Appl., 33(24) 1221-1235 (2021) doi:10.1007/s00521-021-06282-2
- 6) R. Thapa, K. Zhang, N. Snavely, S. Belongie, and A. Khan, "The Plant Pathology Challenge 2020 data set to classify foliar disease of apples," Appl Plant Sci., 8(9) 87-99 (2020) doi:10.1002/aps3.11390
- 7) C. Wang, Y. Qian, W. Gong, J. Cheng, Y. Wang, and Y. Wang, "Cross-layer progressive attention bilinear fusion method for fine-grained visual classification," J Vis Commun Image Represent., 82 233-245 (2022) doi:10.1016/j.jvcir.2021.103414
- 8) X. Nie, B. Chai, L. Wang, Q. Liao, and M. Xu, "Learning enhanced features and inferring twice for fine-grained image classification," Multimedia Tools Appl., 82(10) 1110-1125 (2023) doi:10.1007/s11042-022-13619-z
- 9) Y. Cui, Y. Song, C. Sun, A. Howard, and S. Belongie, "Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 12-23 (2018) doi:10.1109/CVPR.2018.00432
- 10) Z. C. Zhang, Z. D. Chen, Y. Wang, X. Luo, and X. S. Xu, "A vision transformer for fine-grained classification by reducing noise and enhancing discriminative information," Pattern Recogn., 145 77-90 (2024) doi:10.1016/j.patcog.2023.109979
- 11) S. Ye, Y. Wang, Q. Peng, X. You, and C. L. P. Chen, "The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking," IEEE Trans Circuits Syst Video Technol., 34(1) 101-114 (2024) doi:10.1109/TCSVT.2023.3284405
- 12) J. Lu, W. Zhang, Y. Zhao, and C. Sun, "Image local structure information learning for fine-grained visual classification," Sci Rep., 12(1) 23-35 (2022) doi:10.1038/s41598-022-23835-0
- 13) Z. Wang, "Research on Flower Image Classification Based on Transfer Learning," Acad J Sci Technol., 4(3) 45-55 (2023) doi:10.54097/ajst.v4i3.5055
- 14) W. Yuan, "Accuracy Comparison of YOLOv7 and YOLOv4 Regarding Image Annotation Quality for Apple Flower Bud Classification," AgriEngineering., 5(1) 12-21 (2023) doi:10.3390/agriengineering5010027
- 15) G. Xu, Z. Zhu, S. Yin, G. Liu, and B. Lei, "Flower Image Classification System Based on Lightweight DCNN," Shuju Caiji Yu Chuli J Data Acquis Process., 36(4) 33-42 (2021) doi:10.16337/j.1004-9037.2021.04.014
- 16) M. V. D. Prasad et al., "An efficient classification of flower images with convolutional neural networks," Int J Eng Technol (UAE)., 7(1.1) 77-85 (2018) doi:10.14419/ijet.v7i1.1.9857
- 17) P. Zhao, Q. Miao, H. Yao, X. Liu, R. Liu, and M. Gong, "CA-PMG: Channel attention and progressive multi-granularity training network for fine-grained visual classification," IET Image Process., 15(14) 99-108 (2021) doi:10.1049/ipr2.12238
- 18) A. Peryanto, A. Yudhana, and R. Umar, "Convolutional Neural Network and Support Vector Machine in Classification of Flower Images," Khazanah Informatika: J Ilmu Komputer dan Informatika., 8(1) 11-19 (2022) doi:10.23917/khif.v8i1.15531
- 19) Z. Zeng, C. Huang, W. Zhu, Z. Wen, and X. Yuan, "Flower image classification based on an improved lightweight neural network with multi-scale feature fusion and attention mechanism," Math Biosci Eng., 20(8) 203-215 (2023) doi:10.3934/mbe.2023619
- 20) F. B. Legesse, A. Medyukhina, S. Heuke, and J. Popp, "Texture analysis and classification in coherent anti-Stokes Raman scattering (CARS) microscopy images for automated detection of skin cancer," Comput Med Imaging Graph., 43 45-55 (2015) doi:10.1016/j.compmedimag.2015.02.010
- 21) A. Sagingalieva et al., "Hybrid quantum ResNet for car classification and its hyperparameter optimization," Quantum Mach Intell., 5(2) 33-44 (2023) doi:10.1007/s42484-023-00123-2
- 22) R. Himilda and R. A. Johan, "Klasifikasi Jenis Kendaraan Menggunakan Metode Extreme Learning Machine," JTIM: J Teknol Inform Multimedia., 2(4) 11-22 (2021) doi:10.35746/jtim.v2i4.118
- 23) F. A. I. A. Putra, F. Utaminingrum, and W. F. Mahmudy, "HOG Feature Extraction and KNN Classification for Detecting Vehicle in The Highway," IJCCS (Indonesian J Comput Cybern Syst)., 14(3) 77-88 (2020) doi:10.22146/ijccs.54050
- 24) T. C. Hsu, H. Abelson, N. Lao, Y. H. Tseng, and Y. T. Lin, "Behavioral-pattern exploration and development of an instructional tool for young children to learn AI," Comput Educ Artif Intell., 2 12-23 (2021) doi:10.1016/j.caeai.2021.100012
- 25) H. Pei, R. Guo, Z. Tan, X. Huang, and Z. Bai, "Fine-grained classification of automobile front face modeling based on Gestalt psychology*," Vis Comput., 39(7) 101-115 (2023) doi:10.1007/s00371-022-02506-1
- 26) M. Abdar et al., "A review of uncertainty quantification in deep learning: Techniques, applications and challenges," Inf Fusion., 76 56-70 (2021) doi:10.1016/j.inffus.2021.05.008
- 27) D. Shah, "Car Image Classification and Recognition," Int J Res Appl Sci Eng Technol., 9(9) 99-108 (2021) doi:10.22214/ijraset.2021.38336
- 28) Y. X. Tan, C. P. Lee, M. Neo, and K. M. Lim, "Text-to-image synthesis with self-supervised learning," Pattern Recogn Lett., 157 33-45 (2022) doi:10.1016/j.patrec.2022.04.010
- 29) L. A. Hendricks, Z. Akata, M. Rohrbach, J. Donahue, B. Schiele, and T. Darrell, "Generating visual explanations," in Lecture Notes Comput Sci., 12-23 (2016) doi:10.1007/978-3-319-46493-0_1
- 30) S. Lin et al., "Local Patch AutoAugment With Multi-Agent Collaboration," IEEE Trans Multimed., 26 67-79 (2024) doi:10.1109/TMM.2023.3270635
- 31) R. Ji, J. Li, and L. Zhang, "Siamese self-supervised learning for fine-grained visual classification," Comput Vis Image Underst., 229 101-112 (2023) doi:10.1016/j.cviu.2023.103658
- 32) Y. Yu, B. Li, Z. Ji, J. Han, and Z. Zhang, "Knowledge Distillation Classifier Generation Network for Zero-Shot Learning," IEEE Trans Neural Netw Learn Syst., 34(6) 201-212 (2023) doi:10.1109/TNNLS.2021.3112229
- 33) W. Xiong, Y. Gong, W. Niu, and R. Wang, "Hierarchically Structured Network with Attention Convolution and Embedding Propagation for Imbalanced Few-Shot Learning," IEEE Access., 9 77-89 (2021) doi:10.1109/ACCESS.2021.3093422
- 34) D. Kang, H. Kwon, J. Min, and M. Cho, "Relational Embedding for Few-Shot Classification," in Proc IEEE Int Conf Comput Vis., 44-55 (2021) doi:10.1109/ICCV48922.2021.00870
- 35) H. Liu et al., "TransIFC: Invariant Cues-aware Feature Concentration Learning for Efficient Fine-grained Bird Image Classification," IEEE Trans Multimed., 25 11-22 (2023) doi:10.1109/TMM.2023.3238548
- 36) X. Zhao, Y. Feng, X. Shi, Y. Wang, and G. Zhang, "A color constancy based flower classification method in the blockchain data lake," Multimedia Tools Appl., 83(10) 121-133 (2024) doi:10.1007/s11042-023-16656-4
- 37) Z. Li, T. Gu, B. Li, W. Xu, X. He, and X. Hui, "ConvNeXt-Based Fine-Grained Image Classification and Bilinear Attention Mechanism Model," Appl Sci (Switz)., 12(18) 101-112 (2022) doi:10.3390/app12189016
- 38) Y. Li and C. Bian, "Few-Shot Fine-Grained Ship Classification With a Foreground-Aware Feature Map Reconstruction Network," IEEE Trans Geosci Remote Sens., 60 77-88 (2022) doi:10.1109/TGRS.2022.3172223
- 39) G. Nguyen, M. R. Taesiri, and A. Nguyen, "Visual correspondence-based explanations improve AI robustness and human-AI team accuracy," in Adv Neural Inf Process Syst., 12-23 (2022) doi:10.48550/arXiv.2208.00780
- 40) X. Hu, S. Zhu, and T. Peng, "Hierarchical attention vision transformer for fine-grained visual classification," J Vis Commun Image Represent., 91 34-45 (2023) doi:10.1016/j.jvcir.2023.103755
- 41) M. E. Sahin, H. Ulutas, E. Yuce, and M. F. Erkoc, "Detection and classification of COVID-19 by using faster R-CNN and mask R-CNN on CT images," Neural Comput Appl., 35(18) 201-212 (2023) doi:10.1007/s00521-023-08450-y
- 42) N. Bold, C. Zhang, and T. Akashi, "Cross-domain deep feature combination for bird species classification with audio-visual data," IEICE Trans Inf Syst., E102D(10) 101-112 (2019) doi:10.1587/transinf.2018EDP7383
- 43) Y. Di, Z. Jiang, and H. Zhang, "A public dataset for fine-grained ship classification in optical remote sensing images," Remote Sens., 13(4) 44-55 (2021) doi:10.3390/rs13040747
- 44) K. Sun and J. Zhu, "Learning Consistency From High-Confidence Pseudo-Labels for Weakly Supervised Object Localization," IEEE Access., 11 67-79 (2023) doi:10.1109/ACCESS.2023.3246259
- 45) S. Fayou, H. C. Ngo, Z. Meng, and Y. W. Sek, "Loop and distillation: Attention weights fusion transformer for fine-grained representation," IET Comput Vis., 17(4) 22-33 (2023) doi:10.1049/cvi2.12181
- 46) J. Guang and J. Liang, "CMSEA: Compound Model Scaling with Efficient Attention for Fine-Grained Image Classification," IEEE Access., 10 45-56 (2022) doi:10.1109/ACCESS.2022.3150320
- 47) W. Xu, Y. Xian, J. Wang, B. Schiele, and Z. Akata, "Attribute Prototype Network for Any-Shot Learning," Int J Comput Vis., 130(7) 101-112 (2022) doi:10.1007/s11263-022-01613-9
- 48) H.-Y. Tseng, "Cross-Domain Few-Shot Classification," Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation, 12-23 (2020)
- 49) C. Zhang, Y. Cai, G. Lin, and C. Shen, "DeepEMD: Few-shot image classification with differentiable earth mover’s distance and structured classifiers," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 78-89 (2020) doi:10.1109/CVPR42600.2020.01222
- 50) H. Cheng, Y. Wang, H. Li, A. C. Kot, and B. Wen, "Disentangled Feature Representation for Few-Shot Image Classification," IEEE Trans Neural Netw Learn Syst., 35(8) 101-112 (2024) doi:10.1109/TNNLS.2023.3241919
- 51) B. Zhao, X. Wu, J. Feng, Q. Peng, and S. Yan, "Diversified Visual Attention Networks for Fine-Grained Object Classification," IEEE Trans Multimed., 19(6) 55-67 (2017) doi:10.1109/TMM.2017.2648498
- 52) M. Tan, G. Wang, J. Zhou, Z. Peng, and M. Zheng, "Fine-Grained Classification via Hierarchical Bilinear Pooling with Aggregated Slack Mask," IEEE Access., 7 88-99 (2019) doi:10.1109/ACCESS.2019.2936118
- 53) J. Lee, E. Kim, J. Mok, and S. Yoon, "Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization," IEEE Trans Pattern Anal Mach Intell., 46(3) 101-115 (2024) doi:10.1109/TPAMI.2022.3166916
- 54) W. Ge, X. Lin, and Y. Yu, "Weakly supervised complementary parts models for fine-grained image classification from the bottom up," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 77-88 (2019) doi:10.1109/CVPR.2019.00315
- 55) P. Zhuang, Y. Wang, and Y. Qiao, "Learning attentive pairwise interaction for fine-grained classification," in AAAI 2020 - 34th AAAI Conf Artif Intell., 33-44 (2020) doi:10.1609/aaai.v34i07.7016
- 56) R. Du et al., "Fine-Grained Visual Classification via Progressive Multi-granularity Training of Jigsaw Patches," in Lecture Notes Comput Sci., 101-112 (2020) doi:10.1007/978-3-030-58565-5_10
- 57) P. Li, J. Xie, Q. Wang, and Z. Gao, "Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 66-77 (2018) doi:10.1109/CVPR.2018.00105
- 58) Z. Wang, S. Wang, S. Yang, H. Li, J. Li, and Z. Li, "Weakly Supervised Fine-Grained Image Classification via Gaussian Mixture Model Oriented Discriminative Learning," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 44-55 (2020) doi:10.1109/CVPR42600.2020.00977
- 59) W. Luo et al., "Cross-X learning for fine-grained visual categorization," in Proc IEEE Int Conf Comput Vis., 22-33 (2019) doi:10.1109/ICCV.2019.00833
- 60) J. Fu, H. Zheng, and T. Mei, "Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition," in Proc 30th IEEE Conf Comput Vis Pattern Recogn, CVPR., 77-88 (2017) doi:10.1109/CVPR.2017.476
- 61) J. Krause, H. Jin, J. Yang, and F. F. Li, "Fine-grained recognition without part annotations," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 55-66 (2015) doi:10.1109/CVPR.2015.7299194
- 62) H. Almogdady, S. Manaseer, and H. Hiary, "A flower recognition system based on image processing and neural networks," Int J Sci Technol Res., 7(11) 101-112 (2018)
- 63) G. Lihua and G. Chenggan, "A Two-Layer Local Constrained Sparse Coding Method for Fine-Grained Visual Categorization," arXiv:1505.02505 [cs]., 12-23 (2015)
- 64) S. Koul and U. Singhania, "Flower Species Detection Over Large Dataset Using Convolution Neural Networks and Image Processing Techniques," Int J Innov Sci Res Technol., 5(8) 77-88 (2020) doi:10.38124/ijisrt20aug006
Other Papers in This Issue
- Qualitative and Quantitative Analyses of Hazardous Compounds from NTPC Rihand, India
P. Kumar et al. (2025) - Microstructural and Mechanical Characterization of Magnesium-AZ31 Alloy Reinforced with Carbon Nanotubes and Nano-Hydroxyapatite
A. Tyagi, P. Kumar (2025) - Rapid Mapping of Morphological Change Following the 2024 Ruang Volcano Eruption Using Multi-sensor Remote Sensing Imagery
D. Monica et al. (2025) - Bioprocess Engineering: Harnessing Microorganisms for Sustainable Production
S. Sivamani et al. (2025) - The Influence of Surface Roughness and Cavitation on Journal Bearings: A Computational Study
M. Sagaf et al. (2025) - Competitive Adsorption of La3+/Ce3+/Nd3+ Ions on Poly (Methyl Methacrylate)-co-Diacrylate/Single-Walled Carbon Nanotube Nanocomposites
N. Jamilah, A. Riswoko, A. B. Cahaya (2025) - Advancements and Future Directions of Shape Memory Alloys in Aerospace Applications-A Comprehensive Review
H. Kaur et al. (2025) - Experimental Investigation Failure Analysis of Polyamide 66 Composite Spur Gear Subjected to Torque and Bending Loads
D. Choudhari et al. (2025) - Optimal Selection of Chromium and Titanium in Iron Alloy Based Coating Materials Deposited via HVOF
R. Sharma et al. (2025) - Synthesis of Catalyst for Aqueous Polymerization: Perform Artificial Neural Network for The Prediction of Maximum Yield of Polymer
D. Agrawal, N.K. Gupta, Y. Shrivastava (2025) - Experimental Study of Adhesively Bonded Single Lap Joint Behaviour in CFRP-to-CFRP, Al-to-Al, and CFRP-to-Al Configurations
K. Abdurohman et al. (2025) - Enhancing Car Safety with Multimodal Emotion Recognition using CNN-LSTM Networks
G.S. Salunkhe, S.N. Joglekar, J.A. Kengale (2025) - Mechanical Properties of Carbon/epoxy-HA Hybrid Composites for Potential External Fixation Bone Plates
H. Sosiati et al. (2025) - Tapping the Potential of Innovative Hi-Tech Services on Hotel Performance using PLS-SEM Approach
M. Sharma et al. (2025) - Enhancing Electricity Consumption Forecasting using Hybrid ANN-ANFIS Models for Smart Grid Applications
S. KUMAR et al. (2025) - Modification of Grey Relational Analysis (GRA) Method for Improved Decision Making
A. Isnain et al. (2025) - Coffee Ground-Based Modified Biochar for Effective Treatment of Nutrient-Rich Swine Wastewater
N. Thuy et al. (2025) - Impact of Illumination, Noise and Thermal Environment on Occupational Health of Handloom Weavers in Assam: An Ergonomics Perspective
S. Das, S. Karmakar, S. Mukhopadhyay (2025) - A Two-Phase Deep Learning Model for Counterfeit Detection of Indian Banknotes using YOLO-NAS and UV Imaging for Visually Impaired People
P. Chhabra, S. Goyal (2025) - A Hybrid Approach with CLAHE and Dark Channel Prior for Enhancing Underwater Images
V. Narla et al. (2025) - Alkaline-activated Materials for CO2 Capture – Literature Review, Own observations, and Future Perspectives
A. Przybek et al. (2025) - Research Status and Development of Aluminium Matrix Composite: State of the Art
M. Deshwal, P. Kumar (2025) - Mobile Testing Device to Determine the Accuracy of Photovoltaic Solar Tracking Systems
H. Zsiborács, N. Hegedűsné Baranyai, A. Vincze (2025) - Efficiency of using Bioorganic Preparations to Protect Pine Stands from Pests and Diseases: Lessons from the Application of Basidiomycetes
A. Hajiyeva et al. (2025) - Performance Evaluation of a Triangular-Finned Absorber Plate Solar Air Heater: A Theoretical and Experimental Study
P. Singh et al. (2025) - Experimental Determination and Theoretical Prediction of Thermal Conductivity in Glass Fabric Reinforced Epoxy Hybrid Composites
B.K. Basavarajappa, R. Hegde (2025) - Advanced Forecasting with AEGRU: A Robust Approach for Stock Market Time Series
J. ARORA, S. Bhardwaj, N. Arora (2025) - Microtremor Measurements for Regional Spatial Planning Based on Seismic Considerations in the Western Mataram City, West Nusa Tenggara Province, Indonesia
S. Faridah et al. (2025) - Efficient Multi-View Clustering via Greedy Automatic View Selection and Diverse Feature Integration
J. Mankar, S. Kamalapur (2025)









Creative Commons Attribution 4.0 International
