Liu, Weirong; Deng, Kunyuan; Zhang, Xiaoyong; Cheng, Yijun; Zheng, Zhiyong; Jiang, Fu; and Peng, Jun
Driving style recognition plays a key role in ensuring driving safety and improving vehicle traffic efficiency. With the development of sensing technology, data-driven methods are increasingly used to recognize driving style. However, supervised learning methods require adequately labeled data, which is difficult to obtain, while the classification accuracy of unsupervised learning methods is not sufficiently high. This paper proposes a new driving style recognition method based on Tri-CatBoost, which takes CatBoost as the base classifier and uses a semi-supervised learning mechanism to reduce the dependency on data labels and improve recognition ability. First, statistical features are extracted from the velocity, acceleration, and jerk signals to characterize the driving style, and kernel principal component analysis is used to perform nonlinear dimensionality reduction and eliminate feature coupling. CatBoost is an ensemble of symmetric decision trees, whose symmetric structure gives it fewer parameters, faster training and testing, and higher accuracy. Then, a Tri-Training strategy is employed to integrate the base CatBoost classifiers and exploit the unlabeled data to generate pseudo-labels, with which the base CatBoost classifiers are optimized. To verify the effectiveness of the proposed method, extensive experiments are performed on the UAH DriveSet. When the labeling ratio is 50%, the macro precision of Tri-CatBoost is 0.721, which is 15.7% higher than that of unsupervised K-means, 1.6% higher than that of supervised GBDT, 3.7% higher than that of Self-Training, 0.7% higher than that of Co-training, 1.5% higher than that of random forest, 6.7% higher than that of decision tree, and 4.0% higher than that of multilayer perceptron. The macro recall of Tri-CatBoost is 0.744, which is also higher than that of the other methods.
The experimental results demonstrate that the proposed method reduces label dependency and improves recognition performance, which indicates that it has broad application prospects. [ABSTRACT FROM AUTHOR]
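
The pseudo-labeling idea behind the Tri-Training strategy mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a simplified single round in which each of three classifiers is retrained on the labeled data plus the unlabeled samples on which the other two classifiers agree, and it omits the error-rate checks of the full Tri-Training algorithm. The `NearestCentroid` class is a hypothetical stand-in for the CatBoost base classifier, and the two-dimensional feature values and the labels "normal" and "aggressive" are invented for illustration.

```python
import random
from collections import defaultdict


class NearestCentroid:
    """Tiny stand-in base classifier (the paper uses CatBoost instead)."""

    def fit(self, X, y):
        sums = defaultdict(lambda: [0.0] * len(X[0]))
        counts = defaultdict(int)
        for row, label in zip(X, y):
            counts[label] += 1
            for j, value in enumerate(row):
                sums[label][j] += value
        # One centroid (mean feature vector) per class seen in training.
        self.centroids = {c: [s / counts[c] for s in sums[c]] for c in counts}
        return self

    def predict_one(self, row):
        def dist2(label):
            return sum((a - b) ** 2 for a, b in zip(row, self.centroids[label]))
        return min(sorted(self.centroids), key=dist2)


def tri_training_round(X_lab, y_lab, X_unlab, seed=0):
    """One simplified Tri-Training round (assumption: no error-rate checks)."""
    rng = random.Random(seed)
    n = len(X_lab)

    # Step 1: train three classifiers on bootstrap samples of the labeled set.
    models = []
    for _ in range(3):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(
            NearestCentroid().fit([X_lab[i] for i in idx], [y_lab[i] for i in idx])
        )

    # Step 2: each classifier receives pseudo-labels for the unlabeled
    # samples on which the OTHER two classifiers agree, then is retrained.
    preds = [[m.predict_one(x) for x in X_unlab] for m in models]
    refined, pseudo_counts = [], []
    for i in range(3):
        j, k = [m for m in range(3) if m != i]
        extra = [
            (x, preds[j][t])
            for t, x in enumerate(X_unlab)
            if preds[j][t] == preds[k][t]
        ]
        X_new = list(X_lab) + [x for x, _ in extra]
        y_new = list(y_lab) + [label for _, label in extra]
        refined.append(NearestCentroid().fit(X_new, y_new))
        pseudo_counts.append(len(extra))
    return refined, pseudo_counts


def predict_majority(models, row):
    """Final prediction: majority vote of the three refined classifiers."""
    votes = [m.predict_one(row) for m in models]
    return max(sorted(set(votes)), key=votes.count)


# Hypothetical features, e.g. [mean |acceleration|, mean |jerk|] per trip.
X_lab = [[0.2, 0.1], [0.3, 0.2], [0.25, 0.15], [1.5, 1.2], [1.7, 1.4], [1.6, 1.1]]
y_lab = ["normal"] * 3 + ["aggressive"] * 3
X_unlab = [[0.22, 0.12], [1.65, 1.3], [0.28, 0.18], [1.55, 1.25]]

models, pseudo_counts = tri_training_round(X_lab, y_lab, X_unlab)
print(pseudo_counts)
print(predict_majority(models, [0.2, 0.1]), predict_majority(models, [1.7, 1.3]))
```

In the full algorithm this round is repeated until no classifier changes, and a pseudo-labeled set is accepted only when the estimated error of the two agreeing classifiers has decreased. Those conditions are left out here to keep the agreement rule visible.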