Constructing a Corpus from the Web: Message Boards

Authors :
Claudia Claridge
Publication Year :
2018

Abstract

This paper investigates the challenges and chances involved in creating a corpus of message board (or internet forum) language, in particular one that also reflects the regional varieties of English. Message boards as an asynchronic and public form of computer-mediated communication function as an ‘electronic agora’ (Largier 2002: 287), in so far as they are used for a variety of functions ranging from the more private to the more public, including the discussion of highly topical socio-political subject-matter. Thus, content orientation, evaluation and interactive argumentation are potential characteristics of this text form. Firstly, the technical aspects of corpus compilation will be highlighted, examining such matters as how to transform the web interface into a suitably annotated corpus, how to adequately represent the sequencing/relatedness of messages and how to establish regional speaker identities. Secondly, a pilot study on interaction and stance markers will examine how these are realized and distributed in this genre, and whether there are any regional differences in their use.

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....9ad368d942b99720977df441bee3b482