Back to Search Start Over

Research on aviation unsafe incidents classification with improved TF-IDF algorithm.

Authors :
Wang, Yanhua
Zhang, Zhiyuan
Huo, Weigang
Source :
Modern Physics Letters B; May2016, Vol. 30 Issue 12, p1, 10p
Publication Year :
2016

Abstract

The text content of Aviation Safety Confidential Reports contains a large number of valuable information. Term frequency-inverse document frequency algorithm is commonly used in text analysis, but it does not take into account the sequential relationship of the words in the text and its role in semantic expression. According to the seven category labels of civil aviation unsafe incidents, aiming at solving the problems of TF-IDF algorithm, this paper improved TF-IDF algorithm based on co-occurrence network; established feature words extraction and words sequential relations for classified incidents. Aviation domain lexicon was used to improve the accuracy rate of classification. Feature words network model was designed for multi-documents unsafe incidents classification, and it was used in the experiment. Finally, the classification accuracy of improved algorithm was verified by the experiments. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02179849
Volume :
30
Issue :
12
Database :
Complementary Index
Journal :
Modern Physics Letters B
Publication Type :
Academic Journal
Accession number :
115348680
Full Text :
https://doi.org/10.1142/S0217984916501840