Back to Search
Start Over
The WATCHER Project: Building an Agent for Automatic Extraction of Language Resources from the Internet
- Source :
- Literary and Linguistic Computing. 18:449-464
- Publication Year :
- 2003
- Publisher :
- Oxford University Press (OUP), 2003.
-
Abstract
- The WATCHER project aims to automate the extraction of language resources from the Internet via an intelligent agent called the 'WATCHER'. This agent (in its final form) will be able to actively search and collect subject-specific and language-specific texts and build corpora and lexicons from them. Although the resources will still have to be checked for validity after their collection, the proposed method requires the minimum of human interaction. Apart from its ability to collect these resources automatically, the WATCHER will also be able to track the evolution of a target language over time by collecting resources annually and presenting their analysis in annual reports. The WATCHER is still under development. This paper presents an overview of its architecture and functionality, and reports recent progress.
Details
- ISSN :
- 14774615 and 02681145
- Volume :
- 18
- Database :
- OpenAIRE
- Journal :
- Literary and Linguistic Computing
- Accession number :
- edsair.doi...........1390b25ead22cc42a495c7256439ac26
- Full Text :
- https://doi.org/10.1093/llc/18.4.449