Back to Search
Start Over
ArchiMob: Ein multidialektales Korpus schweizerdeutscher Spontansprache
- Source :
- Linguistik Online, Vol 98, Iss 5 (2019)
- Publication Year :
- 2019
- Publisher :
- Bern Open Publishing, 2019.
-
Abstract
- Although Swiss dialects of German are widely used in everyday communication, automatic processing of Swiss German is still a considerable challenge due to the fact that it is mostly a spoken variety and that it is subject to considerable regional variation. This paper presents the ArchiMob corpus, a freely available general-purpose corpus of transcribed spoken Swiss German based on oral history interviews. The corpus is a result of a long design process, intensive manual work and specially adapted computational processing. We first present the modalities of access of the corpus for dialectological, historical and computational research. We then describe how the documents were transcribed, segmented and aligned with the sound source, and summarise a series of experiments that have led to automatically annotated normalisation and part-of-speech tagging layers. Finally, we present several case studies to stimulate the use of the corpus for dialectological research.
- Subjects :
- Language. Linguistic theory. Comparative grammar
P101-410
Modalities
business.industry
Computer science
UFSP13-3 Language and Space
Automatic processing
Subject (documents)
430 German & related languages
10096 Institute of German Studies
Variety (linguistics)
computer.software_genre
language.human_language
German
Swiss German Language
language
Computational linguistics. Natural language processing
6121 Languages
Artificial intelligence
P98-98.5
business
computer
Natural language processing
Subjects
Details
- Language :
- German
- ISSN :
- 16153014
- Volume :
- 98
- Issue :
- 5
- Database :
- OpenAIRE
- Journal :
- Linguistik Online
- Accession number :
- edsair.doi.dedup.....4aea502b33c0a203fbda3a8878e061f5