1. A Rough Set Approach to Classifying Web Page Without Negative Examples.
- Author
-
Carbonell, Jaime G., Siekmann, Jörg, Zhi-Hua Zhou, Hang Li, Qiang Yang, Qiguo Duan, Duoqian Miao, and Kaimin Jin
- Abstract
This paper studies the problem of building Web page classifiers using positive and unlabeled examples, and proposes a more principled technique to solving the problem based on tolerance rough set and Support Vector Machine (SVM). It uses tolerance classes to approximate concepts existed in Web pages and enrich the representation of Web pages, draws an initial approximation of negative example. It then iteratively runs SVM to build classifier which maximizes margins to progressively improve the approximation of negative example. Thus, the class boundary eventually converges to the true boundary of the positive class in the feature space. Experimental results show that the novel method outperforms existing methods significantly. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF