Back to Search
Start Over
A spatially-aware algorithm for location extraction from structured documents.
- Source :
- GeoInformatica; Oct2023, Vol. 27 Issue 4, p645-679, 35p
- Publication Year :
- 2023
-
Abstract
- Place names facilitate locating and distinguishing geographic space where human activities and natural phenomena occur. Extracting place names at multiple spatial resolutions from text is beneficial in several tasks such as identifying the location of events, enriching gazetteers, discovering connections between events and places, etc. Most modern place name extraction approaches generalize the linguistic rules and lexical features as a universal rule and ignore patterns inherent in place names in the geographic contexts. As a result, they lack spatial awareness to effectively identify place names from different geographic contexts, especially the lesser-known place names. In this research, we develop a novel Spatially-Aware Location Extraction (SALE) algorithm for place name extraction from structured documents that uses a hybrid approach comprising of knowledge-driven and data-driven methods. We build a custom named entity recognition (NER) system based on the conditional random field (CRF) and train/ fine-tune it using spatial features extracted from a dataset based on a given geographic region. SALE uses multiple pathways, including the use of the spatially tuned NER to enhance the efficacy in our place names extraction. The experimental results using a large geographic region show that our algorithm outperforms well-known state-of-the-art place name recognizers. [ABSTRACT FROM AUTHOR]
- Subjects :
- GEOGRAPHIC names
RANDOM fields
LINGUISTIC context
SPATIAL resolution
ALGORITHMS
Subjects
Details
- Language :
- English
- ISSN :
- 13846175
- Volume :
- 27
- Issue :
- 4
- Database :
- Complementary Index
- Journal :
- GeoInformatica
- Publication Type :
- Academic Journal
- Accession number :
- 172285754
- Full Text :
- https://doi.org/10.1007/s10707-022-00482-1