About the Spoken Dutch Corpus (Over het Corpus Gesproken Nederlands).

Authors :
VAN EERTEN, LAURA
Source :
Dutch Linguistics / Nederlandse Taalkunde. 2007, Vol. 12 Issue 3, p194-215. 22p.
Publication Year :
2007

Abstract

The construction of the Spoken Dutch Corpus (CGN) produced the largest speech corpus of Dutch (approximately 9 million words) and can be considered a unique project. Financed by the Dutch and Flemish governments, the CGN was successfully completed in 2004. This paper presents an overview of the CGN, from the start of the project in 1998 to the present. First, the project background (motivation and project organisation) is described. A detailed description of the corpus contents follows, together with an introduction to the corpus exploitation software Corex; a small-scale Corex experiment is carried out for further illustration. In addition, an overview is given of the diverse corpus-based research that has used the CGN. The paper concludes with practical information about distribution by the TST-centrale and about future perspectives, such as an online web version of the corpus and the additional JASMIN-CGN project. [ABSTRACT FROM AUTHOR]

Details

Language :
Dutch/Flemish
ISSN :
1384-5845
Volume :
12
Issue :
3
Database :
Academic Search Index
Journal :
Dutch Linguistics / Nederlandse Taalkunde
Publication Type :
Academic Journal
Accession number :
125341017