1. A semantic embedding methodology for motor vehicle crash records: A case study of traffic safety in Manhattan Borough of New York City.
- Author
-
Wang, Yuxuan, Xiong, Ruoxin, Yu, Hao, Bao, Jie, and Yang, Zhao
- Subjects
- *
TRAFFIC safety , *TRAFFIC accidents , *K-means clustering , *TIMESTAMPS , *BOROUGHS , *POINT set theory , *TRAFFIC patterns - Abstract
This study introduces a hybrid Latent Dirichlet Allocation (LDA) model to excavate hidden crash patterns from the large-scale crash dataset. External semantic descriptions have been attached to raw GPS coordinates of crash events. The K-means clustering algorithm is first applied to determine land use characteristics of crash points by grouping surrounding Points of Interests (POIs). Then, each crash record is transformed into a formalized label consisting of land use, Annual Average Daily Traffic (AADT), and time stamps, allowing the analysis of massive traffic crash data as document corpora. Finally, a data-driven modeling approach based on the LDA is conducted to discover hidden crash patterns from traffic crash records combining the external semantic information. The approach is verified using motor vehicle crash data in Manhattan County of New York City. The novel semantic analysis of crash records provides an effective method to investigate the hidden information in traffic crashes. Identifying spatial-temporal patterns on motor vehicle crashes would provide insights into underlying traffic behaviors for intelligent policy-making and resource allocation. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF