Research on Wheat Seed Classification Based on Machine Learning Algorithms and Data Analysis Visualization
DOI: 10.23977/acss.2025.090207
Author(s)
Kaili Sun 1, Wei Bai 1, Jiexin Feng 1, Zhe Yang 1, Yanyan Li 1
Affiliation(s)
1 School of Trade and Economic, Haojing College of Shaanxi University of Science & Technology, Xi'an, Shaanxi, China
Corresponding Author
Kaili Sun
ABSTRACT
This study addresses the problem of wheat seed classification by employing three machine learning algorithms—Random Forest (RF), Naïve Bayes (NB), and Support Vector Machine (SVM)—on the Wheat Seeds Dataset from the UCI database. Through comprehensive data preprocessing, feature analysis, and model construction, the impact of different feature combinations on classification accuracy was systematically investigated. The dataset, comprising 210 samples with seven attributes (e.g., area, perimeter, and kernel groove length), was standardized and split into training and testing sets to ensure robust evaluation. The experimental results demonstrate that RF and SVM significantly outperform NB in classification performance, with SVM achieving the highest accuracy of 97.61% when combining area or width with kernel groove length. Notably, the combination of perimeter and kernel groove length yielded the highest accuracy (96.67%) in RF, while compactness and asymmetry coefficient consistently performed poorly across all algorithms, with accuracy as low as 60.71% in SVM. These findings highlight the critical role of feature selection in classification tasks, with kernel groove length emerging as a key determinant. This research not only provides an effective technical reference for wheat variety classification but also underscores the practical value of machine learning in agricultural applications, offering insights for optimizing efficiency and reducing costs in food security initiatives.
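The feature-pair comparison described in the abstract can be sketched roughly as follows. This is an illustrative outline only, not the authors' code. It uses a synthetic stand-in for the 210-sample dataset, so the accuracies it prints say nothing about real wheat seeds. It also assumes an 80/20 split and assumes the scaler is fitted on the training portion, neither of which the abstract states. It implements only a small Gaussian Naive Bayes by hand to stay dependency-free. The attribute names follow the usual listing of the UCI Seeds data, and the helper names (make_synthetic, standardize, fit_gnb, predict_gnb, evaluate_pairs) are hypothetical.

```python
import itertools
import math
import random
import statistics

# Seven attributes as usually listed for the UCI Seeds data (names assumed here).
FEATURES = ["area", "perimeter", "compactness", "kernel_length",
            "kernel_width", "asymmetry_coefficient", "kernel_groove_length"]


def make_synthetic(n_per_class=70, seed=0):
    """Synthetic stand-in: 3 classes x 70 rows; class means are arbitrary."""
    rng = random.Random(seed)
    rows, labels = [], []
    for cls in range(3):
        for _ in range(n_per_class):
            rows.append([rng.gauss(cls * (j % 3), 1.0) for j in range(len(FEATURES))])
            labels.append(cls)
    return rows, labels


def standardize(train, test):
    """Z-score both sets using the mean and std of the training columns only."""
    cols = list(zip(*train))
    mu = [statistics.fmean(c) for c in cols]
    sd = [statistics.pstdev(c) or 1.0 for c in cols]  # guard constant columns

    def scale(rows):
        return [[(v - m) / s for v, m, s in zip(r, mu, sd)] for r in rows]

    return scale(train), scale(test)


def fit_gnb(X, y):
    """Per class: log prior, per-feature mean, per-feature variance."""
    model = {}
    for cls in sorted(set(y)):
        rows = [x for x, t in zip(X, y) if t == cls]
        cols = list(zip(*rows))
        model[cls] = (
            math.log(len(rows) / len(X)),
            [statistics.fmean(c) for c in cols],
            [statistics.pvariance(c) + 1e-9 for c in cols],
        )
    return model


def predict_gnb(model, x):
    """Return the class with the largest Gaussian log posterior."""
    def log_posterior(cls):
        prior, mu, var = model[cls]
        return prior + sum(
            -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
            for xi, m, v in zip(x, mu, var)
        )
    return max(model, key=log_posterior)


def evaluate_pairs(seed=0, test_fraction=0.2):
    """Test accuracy of Gaussian NB for every pair of features."""
    X, y = make_synthetic(seed=seed)
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    n_test = int(len(idx) * test_fraction)
    test_idx, train_idx = idx[:n_test], idx[n_test:]
    X_train, X_test = standardize([X[i] for i in train_idx],
                                  [X[i] for i in test_idx])
    y_train = [y[i] for i in train_idx]
    y_test = [y[i] for i in test_idx]

    results = {}
    for a, b in itertools.combinations(range(len(FEATURES)), 2):
        train_ab = [[r[a], r[b]] for r in X_train]
        test_ab = [[r[a], r[b]] for r in X_test]
        model = fit_gnb(train_ab, y_train)
        hits = sum(predict_gnb(model, x) == t for x, t in zip(test_ab, y_test))
        results[(FEATURES[a], FEATURES[b])] = hits / len(y_test)
    return results


if __name__ == "__main__":
    scores = evaluate_pairs()
    for pair, acc in sorted(scores.items(), key=lambda kv: -kv[1])[:3]:
        print(pair, f"{acc:.2%}")
```

Seven attributes give 21 distinct pairs, so the loop trains and scores 21 small models. To compare RF and SVM in the same way, one option would be to swap library classifiers such as scikit-learn's RandomForestClassifier and SVC into the loop in place of the hand-written Naive Bayes.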
KEYWORDS
Machine Learning, Data Analysis and Visualization, Feature Combination
CITE THIS PAPER
Kaili Sun, Wei Bai, Jiexin Feng, Zhe Yang, Yanyan Li, Research on Wheat Seed Classification Based on Machine Learning Algorithms and Data Analysis Visualization. Advances in Computer, Signals and Systems (2025) Vol. 9: 53-60. DOI: http://dx.doi.org/10.23977/acss.2025.090207.