
An Automatic Approach to Classify Web Documents Using a Domain Ontology.

Authors :
Pal, Sankar K.
Bandyopadhyay, Sanghamitra
Biswas, Sambhunath
Song, Mu-Hee
Lim, Soo-Yeon
Park, Seong-Bae
Kang, Dong-Jin
Lee, Sang-Jo
Source :
Pattern Recognition & Machine Intelligence; 2005, p666-671, 6p
Publication Year :
2005

Abstract

This paper suggests an automated method for document classification using an ontology, which expresses terminology information and vocabulary contained in Web documents by way of a hierarchical structure. Ontology-based document classification involves determining document features that represent the Web documents most accurately, and classifying them into the most appropriate categories after analyzing their contents by using at least two pre-defined categories per given document features. In this paper, Web documents are classified in real time not with experimental data or a learning process, but by similarity calculations between the terminology information extracted from Web texts and ontology categories. This results in a more accurate document classification since the meanings and relationships unique to each document are determined. Keywords: Document classification, Ontology, Web Page classification. [ABSTRACT FROM AUTHOR]
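
The idea in the abstract, scoring a document against ontology categories without a training phase, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy ONTOLOGY, the function names, the regex tokenizer, and the overlap score are hypothetical, and the ontology is flattened to one level rather than the hierarchical structure the abstract describes.

```python
import re
from collections import Counter

# Hypothetical toy ontology (category -> terms); not the paper's ontology.
ONTOLOGY = {
    "economy": {"market", "stock", "bank", "trade"},
    "sports": {"match", "team", "goal", "league"},
}


def extract_terms(text):
    """Count lowercase alphabetic tokens (a stand-in for real term extraction)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def score(terms, category_terms):
    """Fraction of the document's tokens that appear among the category's terms."""
    total = sum(terms.values())
    if total == 0:
        return 0.0
    matched = sum(n for term, n in terms.items() if term in category_terms)
    return matched / total


def classify(text, ontology=ONTOLOGY):
    """Return (best_category_or_None, scores); no training data is involved."""
    terms = extract_terms(text)
    scores = {cat: score(terms, cat_terms) for cat, cat_terms in ontology.items()}
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores
```

Calling classify("The team scored a late goal to win the match") selects "sports" here, because three of the ten tokens match that category's terms and none match "economy". The paper's actual similarity measure may differ; this overlap ratio is only a placeholder.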

Details

Language :
English
ISBNs :
9783540305064
Database :
Complementary Index
Journal :
Pattern Recognition & Machine Intelligence
Publication Type :
Book
Accession number :
32965722
Full Text :
https://doi.org/10.1007/11590316_107