Back to Search Start Over

Exploration of Topic Classification in the Tourism Field with Text Mining Technology—A Case Study of the Academic Journal Papers

Authors :
I-Cheng Chang
Jeou-Shyan Horng
Chih-Hsing Liu
Sheng-Fang Chou
Tai-Yi Yu
Source :
Sustainability; Volume 14; Issue 7; Pages: 4053
Publication Year :
2022
Publisher :
Multidisciplinary Digital Publishing Institute, 2022.

Abstract

This study collects abstracts of SSCI tourism journal papers between 2010 and 2019 from the WoS (Web of Science) database and uses a novel method of topic classification to explore the vocabulary characteristics of the classified articles. The corpora of abstracts are given quantitative Term Frequency–Inverse Document Frequency (TF–IDF) weights. A hierarchical K-means cluster analysis is then performed to automatically classify the articles; co-word analysis techniques are used to show the characteristics of feature words for distinct clusters, titles, and the consistency of the classified articles. Based on the results for 5783 abstracts, cluster analysis classifies the number of K-means clusters into six categories: travel, culture, sustainability, model, behavior, and hotel. A cross-check method is applied to assess the consistency of the topic classifications, list titles and keywords of the documents with the three smallest distances in each category and apply a strategic diagram to present the features of the distinct categories.

Details

Language :
English
ISSN :
20711050
Database :
OpenAIRE
Journal :
Sustainability; Volume 14; Issue 7; Pages: 4053
Accession number :
edsair.doi.dedup.....a8190026bba9bc594f6e28f74949ed5e
Full Text :
https://doi.org/10.3390/su14074053