Research on Image Classification Method Based on Haar Wavelet Pooling and Probabilistic Mixture Model
DOI: 10.23977/jipta.2025.080116 | Downloads: 4 | Views: 175
Author(s)
Xin Yuan 1, Chunhui Liang 1, Jianye An 1
Affiliation(s)
1 School of science, Tianjin University of Commerce, Tianjin, 300134, China
Corresponding Author
Chunhui LiangABSTRACT
In high-dimensional image classification tasks, conventional convolutional neural networks (CNNs) suffer from high-frequency feature loss in pooling layers and limited robustness of Softmax classifiers when handling complex data distributions. In this paper, we propose a novel classification framework that combines multi-scale Haar wavelet pooling with a Gaussian Mixture Model Classifier (GMMC), which can simultaneously retain high-frequency information and optimize multimodal feature distributions. During model training, we integrate Selective Multiscale Wavelet Pooling (SMWP) into the Expectation-Maximization (EM) algorithm to enhance frequency-domain features and jointly improve classification accuracy. Our approach achieved classification accuracies of 97.17% on CIFAR-10 and 99.98% on SVHN, outperforming the MaxPooling + Softmax baseline by 7.32% and 4.93%, respectively. This research proposes a promising framework for fine-grained medical image classification, with potential applicability in low-light image enhancement and cross-modal retrieval tasks.
KEYWORDS
Image Classification, Haar Wavelet Pooling, Gaussian Mixture Model, Frequency-Domain Feature Preservation, Intra-Class MultimodalityCITE THIS PAPER
Xin Yuan, Chunhui Liang, Jianye An, Research on Image Classification Method Based on Haar Wavelet Pooling and Probabilistic Mixture Model. Journal of Image Processing Theory and Applications (2025) Vol. 8: 132-143. DOI: http://dx.doi.org/10.23977/jipta.2025.080116.
REFERENCES
[1] Lowe D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision. 60(2), 91–110 (2004).
[2] Bay H., Tuytelaars T., Van Gool L.: SURF: Speeded up robust features. In: European Conference on Computer Vision, pp. 404–417 (2006).
[3] LeCun Y., Bengio Y., Hinton G.: Deep learning. Nature. 521(7553), 436–444 (2015).
[4] Krizhevsky A., Sutskever I., Hinton G.E. ImageNet classification with deep convolutional neural networks. In: *Proceedings of the 25th International Conference on Neural Information Processing Systems (NIPS)*, 1097-1105. (2012).
[5] Zeiler M.D., Fergus R. Visualizing and understanding convolutional networks. In: *European Conference on Computer Vision (ECCV)*. Springer, 818-833 (2014).
[6] Scherer D., Müller A., Behnke S.: Evaluation of pooling operations in convolutional architectures for object recognition. In: International Conference on Artificial Neural Networks, pp. 92--101 (2010)
[7] Bishop C.M.: Pattern Recognition and Machine Learning. Springer, Boca Raton (2006).
[8] Zhang Y., Li D., Zhang N., Gong Y. Energy-preserving pooling for deep learning. IEEE Transactions on Image Processing , 30: 7609-7620(2021).
[9] Williams T., Li R.: Wavelet pooling for convolutional neural networks. In: International Conference on Learning Representations, pp. 1--10 (2018).
[10] Liu P., Zhao Z., Chen Y.: Multi-level wavelet convolutional neural networks. IEEE Access. 7, 74973–74985 (2019).
[11] Wang Z., Zhang F., Liu J.: Learnable wavelet packet transform for data-efficient deep learning. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12345--12354 (2022).
[12] Guo X., Zhang Y., Li J.: Dynamic wavelet thresholding for image denoising. IEEE Signal Processing Letters. 27, 1234–1238 (2020).
[13] Grave E., Joulin A., Usunier N.: Adaptive softmax for efficient language modeling. In: International Conference on Learning Representations, pp. 1--8 (2017).
[14] Blei D.M., Jordan M.I.: Variational inference for Dirichlet process mixtures. Bayesian Analysis. 1(1), 121–143 (2006).
[15] Reynolds D.A.: Gaussian mixture models. Encyclopedia of Biometrics, pp. 827–832 (2015).
[16] Schwarz G.: Estimating the dimension of a model. The Annals of Statistics. 6(2), 461–464 (1978).
[17] Li H., Zhang Y., Wu X.: Haar wavelet based channel attention for image classification. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1234--1243 (2017).
[18] Dai Z., Chen J., Liu Y.: Wavelet-domain feature pyramid network for object detection. In: IEEE/CVF International Conference on Computer Vision, pp. 5678--5687 (2021).
[19] Dai Y., Chen X., Zhou Z.: Wavelet-Driven Feature Pyramid Network for Multi-Scale Image Representation. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12345--12356 (2021).
Downloads: | 2455 |
---|---|
Visits: | 172060 |
Sponsors, Associates, and Links
-
Power Systems Computation
-
Internet of Things (IoT) and Engineering Applications
-
Computing, Performance and Communication Systems
-
Journal of Artificial Intelligence Practice
-
Advances in Computer, Signals and Systems
-
Journal of Network Computing and Applications
-
Journal of Web Systems and Applications
-
Journal of Electrotechnology, Electrical Engineering and Management
-
Journal of Wireless Sensors and Sensor Networks
-
Mobile Computing and Networking
-
Vehicle Power and Propulsion
-
Frontiers in Computer Vision and Pattern Recognition
-
Knowledge Discovery and Data Mining Letters
-
Big Data Analysis and Cloud Computing
-
Electrical Insulation and Dielectrics
-
Crypto and Information Security
-
Journal of Neural Information Processing
-
Collaborative and Social Computing
-
International Journal of Network and Communication Technology
-
File and Storage Technologies
-
Frontiers in Genetic and Evolutionary Computation
-
Optical Network Design and Modeling
-
Journal of Virtual Reality and Artificial Intelligence
-
Natural Language Processing and Speech Recognition
-
Journal of High-Voltage
-
Programming Languages and Operating Systems
-
Visual Communications and Image Processing
-
Journal of Systems Analysis and Integration
-
Knowledge Representation and Automated Reasoning
-
Review of Information Display Techniques
-
Data and Knowledge Engineering
-
Journal of Database Systems
-
Journal of Cluster and Grid Computing
-
Cloud and Service-Oriented Computing
-
Journal of Networking, Architecture and Storage
-
Journal of Software Engineering and Metrics
-
Visualization Techniques
-
Journal of Parallel and Distributed Processing
-
Journal of Modeling, Analysis and Simulation
-
Journal of Privacy, Trust and Security
-
Journal of Cognitive Informatics and Cognitive Computing
-
Lecture Notes on Wireless Networks and Communications
-
International Journal of Computer and Communications Security
-
Journal of Multimedia Techniques
-
Automation and Machine Learning
-
Computational Linguistics Letters
-
Journal of Computer Architecture and Design
-
Journal of Ubiquitous and Future Networks