Education, Science, Technology, Innovation and Life
Open Access
Sign In

Research and Application of Statistical Method of Data Reduction Based on Empirical Distribution

Download as PDF

DOI: 10.23977/ferm.2021.040606 | Downloads: 15 | Views: 1129

Author(s)

Jie SUN 1

Affiliation(s)

1 Jinan Jiyang District Government Service Center, Jinan, Shandong 251400, China

Corresponding Author

Jie SUN

ABSTRACT

Data reduction is used to obtain the reduced representation of the data set, which is smaller than the original data, but still maintains the integrity of the original data approximately. Mining on the reduced data set will be more effective and produce the same or almost the same analysis results. A continuous multivariate coupled distribution estimation algorithm with arbitrary distribution is proposed. The distribution is estimated from samples by empirical distribution function, and new individuals are generated by sampling. Secondly, the idea of clustering is introduced into data reduction, and a time dimension reduction method based on clustering is formed. The basic idea of this method is to cluster the time dimension of time series data. In order to verify the feasibility of the two new methods proposed in this paper, a set of simulation experiments are designed in this paper, and representative data are used for data reduction respectively. Experiments show that the two data reduction methods proposed in this paper can not only effectively reduce the amount of data and achieve the purpose of data reduction, but also improve the classification accuracy and have strong practicability.

KEYWORDS

Empirical distribution, Data reduction, Statistics, Cluster

CITE THIS PAPER

Jie SUN. Research and Application of Statistical Method of Data Reduction Based on Empirical Distribution. Financial Engineering and Risk Management (2021) 4: 26-32. DOI: http://dx.doi.org/10.23977/ferm.2021.040606.

REFERENCES

[1] Yuvaraja T, Ramya K. Statistical data analysis for harmonic reduction in 3Ø -fragmented source using novel fuzzy digital logic switching technique. Circuit World, vo. 45, no. 3, pp. 148-155, 2019.
[2] Chen M S, Hwang C P, Ho T Y, et al. Driving behaviors analysis based on feature selection and statistical approach: a preliminary study. Journal of supercomputing, vol. 75, no. 4, pp. 2007-2026, 2019.
[3] Chow C, Andrasik R, Fischer B, et al. Application of statistical techniques to proportional loss data: Evaluating the predictive accuracy of physical vulnerability to hazardous hydro-meteorological events. Journal of Environmental Management, 2019, no. 15, pp. 85-100, 246.
[4] Yang M, Shahramian S, Shakiba H, et al. Statistical BER Analysis of Wireline Links With Non-Binary Linear Block Codes Subject to DFE Error Propagation. Circuits and Systems I: Regular Papers, IEEE Transactions on, no. 99, pp. 1-14, 2019.
[5] Martin T, Drissen L, Prunet S. Data reduction and calibration accuracy of the imaging Fourier transform spectrometer SITELLE. Monthly Notices of the Royal Astronomical Society, no. 4, pp. 4, 2021.
[6] Yi xueyi. research on statistical method system and its application. no. 2016-1, pp. 18-20, 2021.
[7] Gao yuhao, Zhao Yang, cheng yingjin. analysis method of da/dn-δ k curve based on probability and statistics theory . materials development and application, vol. 34, no. 06, pp. 25-32, 2019.
[8] Liu Rui. Research on innovation of big data and statistical methods . Statistics and Consulting, no. 2, pp. 22-25, 2020.

Downloads: 26767
Visits: 540225

All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.