Back to Search Start Over

Topic Modeling for Mining Opinion Aspects from a Customer Feedback Corpus.

Authors :
Babina, O. I.
Source :
Automatic Documentation & Mathematical Linguistics; Feb2024, Vol. 58 Issue 1, p63-79, 17p
Publication Year :
2024

Abstract

The paper introduces a methodology for extracting opinion aspects from textual content by identifying the customer-evaluated parameters regarding a given object. These parameters form the foundation for shaping the customer's attitudes toward the product or service. The proposed approach leverages topic modeling tools to delineate classes of vocabulary exhibiting semantics aligned with the parameters influencing the customer's opinion about the object. Our study specifically explores the application of the BERTopic model as a topic modeling tool to address this challenge. The outlined methodology encompasses several sequential steps, including the preprocessing of textual data involving the removal of stopwords, conversion to lowercase characters, and lemmatization. Additionally, special consideration is given to the distinct lexical manifestations of opinion aspects, obtained as a result of the extraction of nominal, verbal, and adjectival single- and multicomponent phrases from the corpus. Subsequently, the corpus sentences are represented as vectors in a feature space expressed by the extracted words and phrases. The final step involves the application of topic modeling using the BERTopic model on the customer review corpus, utilizing the vector representations of corpus sentences. The experimental inquiry is conducted on a domain-specific Russian-language corpus comprising customer feedback on airline services gathered from customer review websites. The resultant topic distribution is then juxtaposed against a manually constructed conceptual model of the domain. The comparative analysis reveals that the automatic topic distribution aligns with the conceptual structure of the domain, demonstrating a precision of 0.955 and a recall of 0.875. These findings affirm the efficacy of employing the BERTopic model to address the problem of the corpus-based mining of opinion aspects. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00051055
Volume :
58
Issue :
1
Database :
Complementary Index
Journal :
Automatic Documentation & Mathematical Linguistics
Publication Type :
Academic Journal
Accession number :
176406255
Full Text :
https://doi.org/10.3103/S0005105524010060