Construction and first analysis of a corpus for the evaluation and training of microblog/twitter geoparsers

Authors :
Alan M. MacEachren
Jan Oliver Wallgrün
Morteza Karimzadeh
Yiting Ju
Frank Hardisty
Scott Pezanowski
Source :
GIR
Publication Year :
2014
Publisher :
ACM, 2014.

Abstract

This article presents an approach to building place reference corpora and applies it to a Geo-Microblog Corpus intended to foster research and development in microblog/Twitter geoparsing and geographic information retrieval. The corpus currently consists of 6000 tweets in which place names have been identified and georeferenced; 30% of the tweets contain at least one place name. The corpus is intended to support the evaluation, comparison, and training of geoparsers. We introduce our corpus building framework, which is designed to be generally applicable beyond microblogs, and explain how we use crowdsourcing and geovisual analytics technology to support the construction of relatively large corpora. We then report on the corpus building work and analyze the causes of disagreement among the laypersons who performed place identification in our crowdsourcing approach.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 8th Workshop on Geographic Information Retrieval
Accession number :
edsair.doi...........172f5290f9f30d4757fa930558800262
Full Text :
https://doi.org/10.1145/2675354.2675701