Back to Search Start Over

Applying Machine Learning for High-Performance Named-Entity Extraction.

Authors :
Baluja, Shumeet
Mittal, Vibhu O.
Sukthankar, Rahul
Source :
Computational Intelligence. Nov2000, Vol. 16 Issue 4, p586. 10p. 2 Charts, 2 Graphs.
Publication Year :
2000

Abstract

This paper describes a machine learning approach to building an efficient and accurate name spotting system. Finding names in free text is an important task in many text-based applications. Most previous approaches were based on hand-crafted modules encoding language and genre-specific knowledge. These approaches had at least two shortcomings: They required large amounts of time and expertise to develop and were not easily portable to new languages and genres. This paper describes an extensible system that automatically combines weak evidence from different, easily available sources: parts-of-speech tags, dictionaries, and surface-level syntactic information such as capitalization and punctuation. Individually, each piece of evidence is insufficient for robust name detection. However, the combination of evidence, through standard machine learning techniques, yields a system that achieves performance equivalent to the best existing hand-crafted approaches. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
08247935
Volume :
16
Issue :
4
Database :
Academic Search Index
Journal :
Computational Intelligence
Publication Type :
Academic Journal
Accession number :
4336578
Full Text :
https://doi.org/10.1111/0824-7935.00129