Back to Search Start Over

The WATCHER Project: Building an Agent for Automatic Extraction of Language Resources from the Internet

Authors :
George E. Londos
George Kokkinakis
Kyriakos N. Sgarbas
Nikolaos D. Fakotakis
Source :
Literary and Linguistic Computing. 18:449-464
Publication Year :
2003
Publisher :
Oxford University Press (OUP), 2003.

Abstract

The WATCHER project aims to automate the extraction of language resources from the Internet via an intelligent agent called the 'WATCHER'. This agent (in its final form) will be able to actively search and collect subject-specific and language-specific texts and build corpora and lexicons from them. Although the resources will still have to be checked for validity after their collection, the proposed method requires the minimum of human interaction. Apart from its ability to collect these resources automatically, the WATCHER will also be able to track the evolution of a target language over time by collecting resources annually and presenting their analysis in annual reports. The WATCHER is still under development. This paper presents an overview of its architecture and functionality, and reports recent progress.

Details

ISSN :
14774615 and 02681145
Volume :
18
Database :
OpenAIRE
Journal :
Literary and Linguistic Computing
Accession number :
edsair.doi...........1390b25ead22cc42a495c7256439ac26
Full Text :
https://doi.org/10.1093/llc/18.4.449