Back to Search
Start Over
A Study on Data Pre-Processing and Accident Prediction Modelling for Occupational Accident Analysis in the Construction Industry
- Source :
- Applied Sciences, Volume 10, Issue 21, Applied Sciences, Vol 10, Iss 7949, p 7949 (2020)
- Publication Year :
- 2020
- Publisher :
- Multidisciplinary Digital Publishing Institute, 2020.
-
Abstract
- In the construction industry, it is difficult to predict occupational accidents because various accident characteristics arise simultaneously and organically in different types of work. Furthermore, even when analyzing occupational accident data, it is difficult to deduce meaningful results because the data recorded by the incident investigator are qualitative and include a wide variety of data types and categories. Recently, numerous studies have used machine learning to analyze the correlations in such complex construction accident data<br />however, heretofore the focus has been on predicting severity with various variables, and several limitations remain when deriving the correlations between features from various variables. Thus, this paper proposes a data processing procedure that can efficiently manipulate accident data using optimal machine learning techniques and derive and systematize meaningful variables to rationally approach such complex problems. In particular, among the various variables, the most influential variables are derived through methods such as clustering, chi-square, Cramer&rsquo<br />s V, and predictor importance<br />then, the analysis is simplified by optimally grouping the variables. For accident data with optimal variables and elements, a predictive model is constructed between variables, using a support vector machine and decision-tree-based ensemble<br />then, the correlation between the dependent and independent variables is analyzed through an alluvial flow diagram for several cases. Therefore, a new processing procedure has been introduced in data preprocessing and accident prediction modelling to overcome difficulties from complex and diverse construction occupational accident data, and effective accident prevention is possible by deriving correlations of construction accidents using this process.
- Subjects :
- Computer science
media_common.quotation_subject
correlation analysis
0211 other engineering and technologies
02 engineering and technology
computer.software_genre
lcsh:Technology
Data type
lcsh:Chemistry
Accident (fallacy)
data preprocessing
021105 building & construction
0202 electrical engineering, electronic engineering, information engineering
General Materials Science
support vector machine
Cluster analysis
lcsh:QH301-705.5
Instrumentation
alluvial flow diagram
media_common
Fluid Flow and Transfer Processes
Data processing
Variables
lcsh:T
Process Chemistry and Technology
ensemble
General Engineering
lcsh:QC1-999
Computer Science Applications
Support vector machine
Data flow diagram
latent class clustering analysis
lcsh:Biology (General)
lcsh:QD1-999
lcsh:TA1-2040
020201 artificial intelligence & image processing
Data mining
Data pre-processing
lcsh:Engineering (General). Civil engineering (General)
computer
lcsh:Physics
occupational accident
Subjects
Details
- Language :
- English
- ISSN :
- 20763417
- Database :
- OpenAIRE
- Journal :
- Applied Sciences
- Accession number :
- edsair.doi.dedup.....33d5fadcc7feb3ef4347ce938dc6535d
- Full Text :
- https://doi.org/10.3390/app10217949