Back to Search Start Over

An HMM-based synthetic view generator to improve the efficiency of ensemble systems.

Authors :
Borrajo, L
Vieira, A Seara
Iglesias, E L
Source :
Logic Journal of the IGPL; Feb2020, Vol. 28 Issue 1, p4-18, 15p
Publication Year :
2020

Abstract

One of the most active areas of research in semi-supervised learning has been to study methods for constructing good ensembles of classifiers. Ensemble systems are techniques that create multiple models and then combine them to produce improved results. These systems usually produce more accurate solutions than a single model would. Specially, multi-view ensemble systems improve the accuracy of text classification because they optimize the functions to exploit different views of the same input data. However, despite being more promising than the single-view approaches, document datasets often have no natural multiple views available. This study proposes an algorithm to generate a synthetic view from a standard text dataset. The model generates a new view from the standard bag-of-words approach using an algorithm based on hidden Markov models (HMMs). To show the effectiveness of the proposed HMM-based synthetic view generation method, it has been integrated in a co-training ensemble system and tested with four text corpora: Reuters, 20 Newsgroup, TREC Genomics and OHSUMED. The results obtained are promising, showing a significant increase in the efficiency of the ensemble system compared to a single-view approach. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13670751
Volume :
28
Issue :
1
Database :
Complementary Index
Journal :
Logic Journal of the IGPL
Publication Type :
Academic Journal
Accession number :
141477682
Full Text :
https://doi.org/10.1093/jigpal/jzz067