Back to Search Start Over

An Automatic Code Classification System by Using Memory-Based Learning and Information Retrieval Technique.

Authors :
Lee, Gary Geunbae
Yamada, Akio
Helen Meng
Sung Hyon Myaeng
Heui Seok Lim
Won Kyu Hoon Lee
Hyeon Chul Kim
Soon Young Jeong
Heon Chang Yu
Source :
Information Retrieval Technology (9783540291862); 2005, p577-582, 6p
Publication Year :
2005

Abstract

This paper proposes an automatic code classification for Korean census data by using information retrieval technique and memoory-based learning technique. The purpose of the proposed system is to convert natural language responses on survey questionnaires into corresponding numeric codes according to standard code book from the Census Bureau. The system was trained by memory based learning and experimented with 46,762 industry records and occupation 36,286 records. It was evaluated by using 10-fold cross-validation method. As experimental results, the proposed system showed 99.10% and 92.88% production rates for level 2 and level 5 codes respectively. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540291862
Database :
Supplemental Index
Journal :
Information Retrieval Technology (9783540291862)
Publication Type :
Book
Accession number :
32703448
Full Text :
https://doi.org/10.1007/11562382_53