EVERGREEN

Joint Journal of Novel Carbon Resource Sciences and Green Asia Strategy

ISSN:2189-0420 (Print until Mar 2020)
ISSN:2432-5953 (Online)

SCImago Journal & Country Rank

Open Access
Scopus
Google Scholar
Crossref
SCImago Journal & Country Rank
4.3
2024CiteScore
 
69th percentile
Powered by Scopus
Metrics by SCOPUS 2024
CiteScore
4.3
SJR
0.391
SNIP
1.192


Fine-Grained Image Classification using Particle Swarm Optimization for Hyperparameter Optimization of Convolutional Neural Networks

Priti Prasad Vaidya1,*, Snehal Kamalapur2
1Research, Panchvati, 422003, Nashik, India
2Department of Computer Engineering, K K Wagh Institute of Engineering Education and Research, India
*Author to whom correspondence should be addressed:
E-mail: ppvaidya@kkwagh.edu.in (PPV)
Received: May 26, 2025 | Revised: August 05, 2025 | Accepted: September 02, 2025 | Published: September 2025
Abstract
This study presents a novel approach to fine-grained image classification (FGIC) by employing Particle Swarm Optimization (PSO) for automatic hyperparameter tuning in Convolutional Neural Networks (CNNs). Unlike conventional models with manually fixed architectures, the proposed method optimizes critical parameters—such as filter count, kernel size, pooling size, and stride—tailored to each dataset. The model operates on reduced image resolutions (128×128), yet achieves superior accuracy with significantly fewer training epochs. Specifically, it attains 99.56% accuracy on the Oxford Flowers dataset, 97.45% on Stanford Cars, and 95.95% on Stanford Dogs using only 25 epochs, outperforming deep networks like DenseNet-161 and ResNet-50 trained for 100–150 epochs. This PSO-based tuning framework not only enhances classification performance but also minimizes computational cost, making it a practical and scalable solution for real-world FGIC applications.
Keywords
PSO ; CNN ; Hyperparameters ; Fine grain image classification
Available Repositories
Share Article
Article Metrics
--
Views
--
Downloads
--
Citations
Full Text
Download PDF
References
  1. 1) X. Yu, Y. Zhao, Y. Gao, and S. Xiong, "MaskCOV: A random mask covariance network for ultra-fine-grained visual categorization," Pattern Recogn., 119 101-112 (2021) doi:10.1016/j.patcog.2021.108067
  2. 2) M. Zhang, H. Su, and J. Wen, "Classification of flower image based on attention mechanism and multi-loss attention network," Comput Commun., 179 45-59 (2021) doi:10.1016/j.comcom.2021.09.001
  3. 3) X. Zheng, L. Qi, Y. Ren, and X. Lu, "Fine-Grained Visual Categorization by Localizing Object Parts with Single Image," IEEE Trans Multimed., 23 201-215 (2021) doi:10.1109/TMM.2020.2993960
  4. 4) F. Zhang, M. Li, G. Zhai, and Y. Liu, "Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization," in Lecture Notes in Comput Sci., 78-89 (2021) doi:10.1007/978-3-030-67832-6_12
  5. 5) D. Morales, E. Talavera, and B. Remeseiro, "Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques," Neural Comput Appl., 33(24) 1221-1235 (2021) doi:10.1007/s00521-021-06282-2
  6. 6) R. Thapa, K. Zhang, N. Snavely, S. Belongie, and A. Khan, "The Plant Pathology Challenge 2020 data set to classify foliar disease of apples," Appl Plant Sci., 8(9) 87-99 (2020) doi:10.1002/aps3.11390
  7. 7) C. Wang, Y. Qian, W. Gong, J. Cheng, Y. Wang, and Y. Wang, "Cross-layer progressive attention bilinear fusion method for fine-grained visual classification," J Vis Commun Image Represent., 82 233-245 (2022) doi:10.1016/j.jvcir.2021.103414
  8. 8) X. Nie, B. Chai, L. Wang, Q. Liao, and M. Xu, "Learning enhanced features and inferring twice for fine-grained image classification," Multimedia Tools Appl., 82(10) 1110-1125 (2023) doi:10.1007/s11042-022-13619-z
  9. 9) Y. Cui, Y. Song, C. Sun, A. Howard, and S. Belongie, "Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 12-23 (2018) doi:10.1109/CVPR.2018.00432
  10. 10) Z. C. Zhang, Z. D. Chen, Y. Wang, X. Luo, and X. S. Xu, "A vision transformer for fine-grained classification by reducing noise and enhancing discriminative information," Pattern Recogn., 145 77-90 (2024) doi:10.1016/j.patcog.2023.109979
  11. 11) S. Ye, Y. Wang, Q. Peng, X. You, and C. L. P. Chen, "The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking," IEEE Trans Circuits Syst Video Technol., 34(1) 101-114 (2024) doi:10.1109/TCSVT.2023.3284405
  12. 12) J. Lu, W. Zhang, Y. Zhao, and C. Sun, "Image local structure information learning for fine-grained visual classification," Sci Rep., 12(1) 23-35 (2022) doi:10.1038/s41598-022-23835-0
  13. 13) Z. Wang, "Research on Flower Image Classification Based on Transfer Learning," Acad J Sci Technol., 4(3) 45-55 (2023) doi:10.54097/ajst.v4i3.5055
  14. 14) W. Yuan, "Accuracy Comparison of YOLOv7 and YOLOv4 Regarding Image Annotation Quality for Apple Flower Bud Classification," AgriEngineering., 5(1) 12-21 (2023) doi:10.3390/agriengineering5010027
  15. 15) G. Xu, Z. Zhu, S. Yin, G. Liu, and B. Lei, "Flower Image Classification System Based on Lightweight DCNN," Shuju Caiji Yu Chuli J Data Acquis Process., 36(4) 33-42 (2021) doi:10.16337/j.1004-9037.2021.04.014
  16. 16) M. V. D. Prasad et al., "An efficient classification of flower images with convolutional neural networks," Int J Eng Technol (UAE)., 7(1.1) 77-85 (2018) doi:10.14419/ijet.v7i1.1.9857
  17. 17) P. Zhao, Q. Miao, H. Yao, X. Liu, R. Liu, and M. Gong, "CA-PMG: Channel attention and progressive multi-granularity training network for fine-grained visual classification," IET Image Process., 15(14) 99-108 (2021) doi:10.1049/ipr2.12238
  18. 18) A. Peryanto, A. Yudhana, and R. Umar, "Convolutional Neural Network and Support Vector Machine in Classification of Flower Images," Khazanah Informatika: J Ilmu Komputer dan Informatika., 8(1) 11-19 (2022) doi:10.23917/khif.v8i1.15531
  19. 19) Z. Zeng, C. Huang, W. Zhu, Z. Wen, and X. Yuan, "Flower image classification based on an improved lightweight neural network with multi-scale feature fusion and attention mechanism," Math Biosci Eng., 20(8) 203-215 (2023) doi:10.3934/mbe.2023619
  20. 20) F. B. Legesse, A. Medyukhina, S. Heuke, and J. Popp, "Texture analysis and classification in coherent anti-Stokes Raman scattering (CARS) microscopy images for automated detection of skin cancer," Comput Med Imaging Graph., 43 45-55 (2015) doi:10.1016/j.compmedimag.2015.02.010
  21. 21) A. Sagingalieva et al., "Hybrid quantum ResNet for car classification and its hyperparameter optimization," Quantum Mach Intell., 5(2) 33-44 (2023) doi:10.1007/s42484-023-00123-2
  22. 22) R. Himilda and R. A. Johan, "Klasifikasi Jenis Kendaraan Menggunakan Metode Extreme Learning Machine," JTIM: J Teknol Inform Multimedia., 2(4) 11-22 (2021) doi:10.35746/jtim.v2i4.118
  23. 23) F. A. I. A. Putra, F. Utaminingrum, and W. F. Mahmudy, "HOG Feature Extraction and KNN Classification for Detecting Vehicle in The Highway," IJCCS (Indonesian J Comput Cybern Syst)., 14(3) 77-88 (2020) doi:10.22146/ijccs.54050
  24. 24) T. C. Hsu, H. Abelson, N. Lao, Y. H. Tseng, and Y. T. Lin, "Behavioral-pattern exploration and development of an instructional tool for young children to learn AI," Comput Educ Artif Intell., 2 12-23 (2021) doi:10.1016/j.caeai.2021.100012
  25. 25) H. Pei, R. Guo, Z. Tan, X. Huang, and Z. Bai, "Fine-grained classification of automobile front face modeling based on Gestalt psychology*," Vis Comput., 39(7) 101-115 (2023) doi:10.1007/s00371-022-02506-1
  26. 26) M. Abdar et al., "A review of uncertainty quantification in deep learning: Techniques, applications and challenges," Inf Fusion., 76 56-70 (2021) doi:10.1016/j.inffus.2021.05.008
  27. 27) D. Shah, "Car Image Classification and Recognition," Int J Res Appl Sci Eng Technol., 9(9) 99-108 (2021) doi:10.22214/ijraset.2021.38336
  28. 28) Y. X. Tan, C. P. Lee, M. Neo, and K. M. Lim, "Text-to-image synthesis with self-supervised learning," Pattern Recogn Lett., 157 33-45 (2022) doi:10.1016/j.patrec.2022.04.010
  29. 29) L. A. Hendricks, Z. Akata, M. Rohrbach, J. Donahue, B. Schiele, and T. Darrell, "Generating visual explanations," in Lecture Notes Comput Sci., 12-23 (2016) doi:10.1007/978-3-319-46493-0_1
  30. 30) S. Lin et al., "Local Patch AutoAugment With Multi-Agent Collaboration," IEEE Trans Multimed., 26 67-79 (2024) doi:10.1109/TMM.2023.3270635
  31. 31) R. Ji, J. Li, and L. Zhang, "Siamese self-supervised learning for fine-grained visual classification," Comput Vis Image Underst., 229 101-112 (2023) doi:10.1016/j.cviu.2023.103658
  32. 32) Y. Yu, B. Li, Z. Ji, J. Han, and Z. Zhang, "Knowledge Distillation Classifier Generation Network for Zero-Shot Learning," IEEE Trans Neural Netw Learn Syst., 34(6) 201-212 (2023) doi:10.1109/TNNLS.2021.3112229
  33. 33) W. Xiong, Y. Gong, W. Niu, and R. Wang, "Hierarchically Structured Network with Attention Convolution and Embedding Propagation for Imbalanced Few-Shot Learning," IEEE Access., 9 77-89 (2021) doi:10.1109/ACCESS.2021.3093422
  34. 34) D. Kang, H. Kwon, J. Min, and M. Cho, "Relational Embedding for Few-Shot Classification," in Proc IEEE Int Conf Comput Vis., 44-55 (2021) doi:10.1109/ICCV48922.2021.00870
  35. 35) H. Liu et al., "TransIFC: Invariant Cues-aware Feature Concentration Learning for Efficient Fine-grained Bird Image Classification," IEEE Trans Multimed., 25 11-22 (2023) doi:10.1109/TMM.2023.3238548
  36. 36) X. Zhao, Y. Feng, X. Shi, Y. Wang, and G. Zhang, "A color constancy based flower classification method in the blockchain data lake," Multimedia Tools Appl., 83(10) 121-133 (2024) doi:10.1007/s11042-023-16656-4
  37. 37) Z. Li, T. Gu, B. Li, W. Xu, X. He, and X. Hui, "ConvNeXt-Based Fine-Grained Image Classification and Bilinear Attention Mechanism Model," Appl Sci (Switz)., 12(18) 101-112 (2022) doi:10.3390/app12189016
  38. 38) Y. Li and C. Bian, "Few-Shot Fine-Grained Ship Classification With a Foreground-Aware Feature Map Reconstruction Network," IEEE Trans Geosci Remote Sens., 60 77-88 (2022) doi:10.1109/TGRS.2022.3172223
  39. 39) G. Nguyen, M. R. Taesiri, and A. Nguyen, "Visual correspondence-based explanations improve AI robustness and human-AI team accuracy," in Adv Neural Inf Process Syst., 12-23 (2022) doi:10.48550/arXiv.2208.00780
  40. 40) X. Hu, S. Zhu, and T. Peng, "Hierarchical attention vision transformer for fine-grained visual classification," J Vis Commun Image Represent., 91 34-45 (2023) doi:10.1016/j.jvcir.2023.103755
  41. 41) M. E. Sahin, H. Ulutas, E. Yuce, and M. F. Erkoc, "Detection and classification of COVID-19 by using faster R-CNN and mask R-CNN on CT images," Neural Comput Appl., 35(18) 201-212 (2023) doi:10.1007/s00521-023-08450-y
  42. 42) N. Bold, C. Zhang, and T. Akashi, "Cross-domain deep feature combination for bird species classification with audio-visual data," IEICE Trans Inf Syst., E102D(10) 101-112 (2019) doi:10.1587/transinf.2018EDP7383
  43. 43) Y. Di, Z. Jiang, and H. Zhang, "A public dataset for fine-grained ship classification in optical remote sensing images," Remote Sens., 13(4) 44-55 (2021) doi:10.3390/rs13040747
  44. 44) K. Sun and J. Zhu, "Learning Consistency From High-Confidence Pseudo-Labels for Weakly Supervised Object Localization," IEEE Access., 11 67-79 (2023) doi:10.1109/ACCESS.2023.3246259
  45. 45) S. Fayou, H. C. Ngo, Z. Meng, and Y. W. Sek, "Loop and distillation: Attention weights fusion transformer for fine-grained representation," IET Comput Vis., 17(4) 22-33 (2023) doi:10.1049/cvi2.12181
  46. 46) J. Guang and J. Liang, "CMSEA: Compound Model Scaling with Efficient Attention for Fine-Grained Image Classification," IEEE Access., 10 45-56 (2022) doi:10.1109/ACCESS.2022.3150320
  47. 47) W. Xu, Y. Xian, J. Wang, B. Schiele, and Z. Akata, "Attribute Prototype Network for Any-Shot Learning," Int J Comput Vis., 130(7) 101-112 (2022) doi:10.1007/s11263-022-01613-9
  48. 48) H.-Y. Tseng, "Cross-Domain Few-Shot Classification," Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation, 12-23 (2020)
  49. 49) C. Zhang, Y. Cai, G. Lin, and C. Shen, "DeepEMD: Few-shot image classification with differentiable earth mover’s distance and structured classifiers," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 78-89 (2020) doi:10.1109/CVPR42600.2020.01222
  50. 50) H. Cheng, Y. Wang, H. Li, A. C. Kot, and B. Wen, "Disentangled Feature Representation for Few-Shot Image Classification," IEEE Trans Neural Netw Learn Syst., 35(8) 101-112 (2024) doi:10.1109/TNNLS.2023.3241919
  51. 51) B. Zhao, X. Wu, J. Feng, Q. Peng, and S. Yan, "Diversified Visual Attention Networks for Fine-Grained Object Classification," IEEE Trans Multimed., 19(6) 55-67 (2017) doi:10.1109/TMM.2017.2648498
  52. 52) M. Tan, G. Wang, J. Zhou, Z. Peng, and M. Zheng, "Fine-Grained Classification via Hierarchical Bilinear Pooling with Aggregated Slack Mask," IEEE Access., 7 88-99 (2019) doi:10.1109/ACCESS.2019.2936118
  53. 53) J. Lee, E. Kim, J. Mok, and S. Yoon, "Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization," IEEE Trans Pattern Anal Mach Intell., 46(3) 101-115 (2024) doi:10.1109/TPAMI.2022.3166916
  54. 54) W. Ge, X. Lin, and Y. Yu, "Weakly supervised complementary parts models for fine-grained image classification from the bottom up," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 77-88 (2019) doi:10.1109/CVPR.2019.00315
  55. 55) P. Zhuang, Y. Wang, and Y. Qiao, "Learning attentive pairwise interaction for fine-grained classification," in AAAI 2020 - 34th AAAI Conf Artif Intell., 33-44 (2020) doi:10.1609/aaai.v34i07.7016
  56. 56) R. Du et al., "Fine-Grained Visual Classification via Progressive Multi-granularity Training of Jigsaw Patches," in Lecture Notes Comput Sci., 101-112 (2020) doi:10.1007/978-3-030-58565-5_10
  57. 57) P. Li, J. Xie, Q. Wang, and Z. Gao, "Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 66-77 (2018) doi:10.1109/CVPR.2018.00105
  58. 58) Z. Wang, S. Wang, S. Yang, H. Li, J. Li, and Z. Li, "Weakly Supervised Fine-Grained Image Classification via Gaussian Mixture Model Oriented Discriminative Learning," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 44-55 (2020) doi:10.1109/CVPR42600.2020.00977
  59. 59) W. Luo et al., "Cross-X learning for fine-grained visual categorization," in Proc IEEE Int Conf Comput Vis., 22-33 (2019) doi:10.1109/ICCV.2019.00833
  60. 60) J. Fu, H. Zheng, and T. Mei, "Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition," in Proc 30th IEEE Conf Comput Vis Pattern Recogn, CVPR., 77-88 (2017) doi:10.1109/CVPR.2017.476
  61. 61) J. Krause, H. Jin, J. Yang, and F. F. Li, "Fine-grained recognition without part annotations," in Proc IEEE Comput Soc Conf Comput Vis Pattern Recogn., 55-66 (2015) doi:10.1109/CVPR.2015.7299194
  62. 62) H. Almogdady, S. Manaseer, and H. Hiary, "A flower recognition system based on image processing and neural networks," Int J Sci Technol Res., 7(11) 101-112 (2018)
  63. 63) G. Lihua and G. Chenggan, "A Two-Layer Local Constrained Sparse Coding Method for Fine-Grained Visual Categorization," arXiv:1505.02505 [cs]., 12-23 (2015)
  64. 64) S. Koul and U. Singhania, "Flower Species Detection Over Large Dataset Using Convolution Neural Networks and Image Processing Techniques," Int J Innov Sci Res Technol., 5(8) 77-88 (2020) doi:10.38124/ijisrt20aug006
Other Papers in This Issue