1. 基于数据分布特征的分层无量纲化方法及其均衡性分析.
- Author
-
易平涛, 袁建荣, and 李伟伟
- Subjects
- *
DATA distribution , *PROBLEM solving , *DENSITY , *OBJECTIVITY , *HUMAN beings - Abstract
The hierarchical dimensionless method can effectively remove the effect of different index dimensions, and solve imbalanced data distribution and low discrimination caused by anomalous index values. However, when using this method, it is necessary to artificially specify the number of partition intervals so that the dimensionless results are interfered by human factors and lose objectivity. To solve this problem, a dimensionless method of density hierarchy is proposed considering the distribution characteristics of raw data. This method divides the interval according to the density of data distribution, objectively determines the hierarchical series, and takes into account the advantages of the hierarchical dimensionless method. The calculation is comparatively simple and reduces human factors. In addition, through the stochastic simulation method, it is found that the method has good anti-interference to outliers, and the balance of dimensionless results is affected by the scale of raw data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF