Over het Corpus Gesproken Nederlands [On the Spoken Dutch Corpus].
- Source :
- Dutch Linguistics / Nederlandse Taalkunde. 2007, Vol. 12 Issue 3, p194-215. 22p.
- Publication Year :
- 2007
Abstract
- The construction of the Spoken Dutch Corpus (CGN) resulted in the largest speech corpus of Dutch (approximately 9 million words) and can be considered a unique project. Financed by the Dutch and Flemish governments, the CGN was successfully completed in 2004. This paper presents an overview of the CGN, from the beginning of the project in 1998 until now. First, the project background (motivation and project organisation) is described. Then a detailed description of the corpus contents is presented, followed by an introduction to the corpus exploitation software Corex. A small-scale Corex experiment is carried out for further illustration. In addition, an overview of diverse corpus-based research using the CGN is given. The paper concludes with practical information about distribution by the TST-centrale and future perspectives, such as an online web version of the corpus and the additional JASMIN-CGN project. [ABSTRACT FROM AUTHOR]
Details
- Language :
- Dutch/Flemish
- ISSN :
- 1384-5845
- Volume :
- 12
- Issue :
- 3
- Database :
- Academic Search Index
- Journal :
- Dutch Linguistics / Nederlandse Taalkunde
- Publication Type :
- Academic Journal
- Accession number :
- 125341017