1. Research on the Automatic Subject-Indexing Method of Academic Papers Based on Climate Change Domain Ontology
- Author
Heng Yang, Nan Wang, Lina Yang, Wei Liu, and Sili Wang
- Subjects
climate change; Renewable Energy, Sustainability and the Environment; Geography, Planning and Development; semantic; deep mining; Building and Construction; ontology; Management, Monitoring, Policy and Law; automatic subject indexing
- Abstract
Fine-grained classification of academic papers helps uncover their deeper implicit themes and semantics, which supports semantic retrieval, paper recommendation, research trend prediction, topic analysis, and related functions. Based on a climate change domain ontology, this study takes an unsupervised approach that combines two methods, syntactic structure analysis and semantic modeling, to build a subject-indexing framework for academic papers in the climate change domain. Given a paper's title, abstract and keywords as input, the framework automatically selects a set of concept terms from the domain ontology as the paper's research topics. It does so using natural language processing techniques such as syntactic dependency analysis, text similarity calculation, pre-trained language models, and semantic similarity calculation, together with weighting factors such as word frequency statistics and graph path calculation. We evaluated the proposed method against a gold standard of manually annotated articles and demonstrated significant improvements over five alternative methods in precision, recall and F1-score. Overall, the proposed method identifies the research topics of academic papers more accurately and provides a useful reference for applying domain ontologies and for unsupervised data annotation.
- Published
- 2023
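The general idea in the abstract, matching a paper's text against ontology concept terms and scoring the result against manually annotated topics, can be illustrated with a minimal sketch. This is not the authors' method: it swaps syntactic dependency analysis, pre-trained language models, and graph path weighting for plain token overlap with a word-frequency weight. The function names, the scoring formula, the threshold, and the sample concepts are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def score_concepts(paper_text, concepts):
    """Score each concept label against the paper text.

    Assumption: score = token overlap * (1 + relative frequency).
    This is a stand-in for the similarity and weighting steps the
    abstract describes, not a reproduction of them.
    """
    counts = Counter(tokenize(paper_text))
    total = sum(counts.values()) or 1
    scores = {}
    for concept in concepts:
        tokens = tokenize(concept)
        if not tokens:
            continue
        # Fraction of the concept's tokens that occur in the paper.
        overlap = sum(1 for t in tokens if t in counts) / len(tokens)
        # Word-frequency weight: share of paper tokens matching the concept.
        freq = sum(counts[t] for t in tokens) / total
        scores[concept] = overlap * (1 + freq)
    return scores


def index_paper(paper_text, concepts, threshold=1.0):
    """Return the concepts whose score reaches the (hypothetical) threshold."""
    scores = score_concepts(paper_text, concepts)
    return {c for c, s in scores.items() if s >= threshold}


def precision_recall_f1(predicted, gold):
    """Compute precision, recall and F1 for predicted vs. gold topic sets."""
    tp = len(predicted & gold)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Illustrative inputs only; neither the text nor the labels come from the paper.
paper = "Sea level rise and carbon emissions under climate change"
ontology_concepts = ["sea level rise", "carbon emissions", "ocean acidification"]
gold_topics = {"sea level rise", "ocean acidification"}

predicted_topics = index_paper(paper, ontology_concepts)
metrics = precision_recall_f1(predicted_topics, gold_topics)
```

With the default threshold, a concept is selected only when every token of its label appears in the paper text. A real system would replace this exact-match rule with the semantic similarity the abstract mentions.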