Education, Science, Technology, Innovation and Life
Open Access
Sign In

Research on Wheat Seed Classification Based on Machine Learning Algorithms and Data Analysis Visualization

Download as PDF

DOI: 10.23977/acss.2025.090207 | Downloads: 14 | Views: 414

Author(s)

Kaili Sun 1, Wei Bai 1, Jiexin Feng 1, Zhe Yang 1, Yanyan Li 1

Affiliation(s)

1 School of Trade and Economic, Haojing College of Shaanxi University of Science & Technology, Xi'an, Shaanxi, China

Corresponding Author

Kaili Sun

ABSTRACT

This study addresses the problem of wheat seed classification by employing three machine learning algorithms—Random Forest (RF), Naïve Bayes (NB), and Support Vector Machine (SVM)—on the Wheat Seeds Dataset from the UCI database. Through comprehensive data preprocessing, feature analysis, and model construction, the impact of different feature combinations on classification accuracy was systematically investigated. The dataset, comprising 210 samples with seven attributes (e.g., area, perimeter, and kernel groove length), was standardized and split into training and testing sets to ensure robust evaluation. The experimental results demonstrate that RF and SVM significantly outperform NB in classification performance, with SVM achieving the highest accuracy of 97.61% when combining area or width with kernel groove length. Notably, the combination of perimeter and kernel groove length yielded the highest accuracy (96.67%) in RF, while compactness and asymmetry coefficient consistently performed poorly across all algorithms, with accuracy as low as 60.71% in SVM. These findings highlight the critical role of feature selection in classification tasks, with kernel groove length emerging as a key determinant. This research not only provides an effective technical reference for wheat variety classification but also underscores the practical value of machine learning in agricultural applications, offering insights for optimizing efficiency and reducing costs in food security initiatives.

KEYWORDS

Machine Learning, Data Analysis and Visualization, Feature Combination

CITE THIS PAPER

Kaili Sun, Wei Bai, Jiexin Feng, Zhe Yang, Yanyan Li, Research on Wheat Seed Classification Based on Machine Learning Algorithms and Data Analysis Visualization. Advances in Computer, Signals and Systems (2025) Vol. 9: 53-60. DOI: http://dx.doi.org/10.23977/acss.2025.090207.

REFERENCES

[1] Zhao J, Li Q. (2025). Mitigating Distribution Shift in Machine Learning–Augmented Hybrid Simulation [J]. SIAM Journal on Scientific Computing, 47 (2): 475-500.
[2] Sharma, R. (2016). The Role of Big Data Analytics in Business Decision-Making. International Journal of Business Analytics, 14(2), 130-145. 
[3] Vakili S, Mousavi M S. (2025). Investigation of the effect of climatic parameters in machine learning algorithms for streamflow predicting in Hamoon Helmand Catchment, Iran [J]. Arabian Journal of Geosciences, 18(5): 106-108.
[4] Eckelt K, Gadhave K, Lex A, et al. (2024). Loops: Leveraging Provenance and Visualization to Support Exploratory Data Analysis in Notebooks. [J]. IEEE transactions on visualization and computer graphics, 4(2), 30-45.
[5] Chen Y. et al. Investigation of the random forest framework for classification of hyperspectral data [J]. IEEE Transactions on Geoscience and Remote Sensing, 2005, 43 (3): 492-501.
[6] Fadlil A, Riadi I, Putra P D J I. Comparison of Machine Learning Performance Using Naive Bayes and Random Forest Methods to Classify Batik Fabric Patterns [J]. Revue d'Intelligence Artificielle, 2023, 37(2): 56-67.

Downloads: 38554
Visits: 698009

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.