Back to Search Start Over

Construction Scheme of Training Data using Automated Exploring of Boundary Categories

Authors :
Yun Jeong Choi
Seung Soo Park
Jeong Gyu Jee
Source :
The KIPS Transactions:PartB. :479-488
Publication Year :
2009
Publisher :
Korea Information Processing Society, 2009.

Abstract

This paper shows a reinforced construction scheme of training data for improvement of text classification by automatic search of boundary category. The documents laid on boundary area are usually misclassified as they are including multiple topics and features. which is the main factor that we focus on. In this paper, we propose an automated exploring methodology of optimal boundary category based on previous research. We consider the boundary area among target categories to new category to be required training, which are then added to the target category sementically. In experiments, we applied our method to complex documents by intentionally making errors in training process. The experimental results show that our system has high accuracy and reliability in noisy environment.

Details

ISSN :
1598284X
Database :
OpenAIRE
Journal :
The KIPS Transactions:PartB
Accession number :
edsair.doi...........953efde47d79f1b3ae6d086c4f57e69c
Full Text :
https://doi.org/10.3745/kipstb.2009.16b.6.479