1. Automatic zone identification in scientific papers via fusion techniques
- Author
Kambiz Badie, Maryam Mahmoudi, and Nasrin Asadi
- Subjects
Fusion, Computer science, business.industry, 05 social sciences, General Social Sciences, Paper based, Library and Information Sciences, 050905 science studies, computer.software_genre, Automatic summarization, Computer Science Applications, Identification (information), Information extraction, Simple (abstract algebra), Sequential minimal optimization, Artificial intelligence, 0509 other social sciences, 050904 information & library sciences, business, computer, Natural language processing
- Abstract
Zone identification is a text-mining task that helps researchers make better use of the content of scientific papers. Its main aim is to classify the sentences of a scientific text into predefined zone categories, which is useful for summarization as well as information extraction. In this paper, we propose a two-level approach to zone identification. The first level classifies the sentences of a given paper based on semantic and lexical features, using several machine learning algorithms such as Simple Logistics, Logistic Model Trees and Sequential Minimal Optimization. The second level applies fusion to the first-level classification results for consecutive sentences in order to make the final decision. The proposed method is evaluated on two well-known data sets, the ART and DRI corpora. The zone identification accuracies obtained for these corpora are 65.75% and 84.15%, respectively, which appear promising compared to those obtained by previous approaches.
- Published
- 2019
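
The abstract does not say which fusion rule the second level uses, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes the first level outputs a score per zone category for each sentence, and it fuses consecutive sentences by averaging those scores over a small sliding window. The function name `fuse_window`, the `radius` parameter, and the zone labels are all hypothetical.

```python
from typing import Dict, List

# Per-sentence output of a (hypothetical) first-level classifier:
# a mapping from zone label to a score or probability.
Scores = Dict[str, float]


def fuse_window(first_level: List[Scores], radius: int = 1) -> List[str]:
    """Assumed fusion rule: average each sentence's scores with those of
    its neighbours within `radius` positions, then pick the top label."""
    fused_labels = []
    for i in range(len(first_level)):
        lo = max(0, i - radius)
        hi = min(len(first_level), i + radius + 1)
        window = first_level[lo:hi]
        # Collect every label seen in the window; missing labels score 0.
        labels = {label for scores in window for label in scores}
        avg = {
            label: sum(s.get(label, 0.0) for s in window) / len(window)
            for label in labels
        }
        # Sorting first makes tie-breaking deterministic (alphabetical).
        fused_labels.append(max(sorted(avg), key=avg.get))
    return fused_labels


# Made-up first-level scores for four consecutive sentences.
first_level = [
    {"Background": 0.7, "Method": 0.2, "Result": 0.1},
    {"Background": 0.4, "Method": 0.5, "Result": 0.1},
    {"Background": 0.6, "Method": 0.3, "Result": 0.1},
    {"Background": 0.1, "Method": 0.2, "Result": 0.7},
]

print(fuse_window(first_level, radius=0))  # first-level decisions only
print(fuse_window(first_level, radius=1))  # decisions after fusion
```

With `radius=0` the function simply returns the first-level decision for each sentence. With `radius=1` the weakly supported "Method" label on the second sentence is overruled by its neighbours, which is the kind of contextual smoothing a fusion step over consecutive sentences is meant to provide.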