Back to Search Start Over

A deep neural network model for Chinese toponym matching with geographic pre-training model

Authors :
Qinjun Qiu
Shiyu Zheng
Miao Tian
Jiali Li
Kai Ma
Liufeng Tao
Zhong Xie
Source :
International Journal of Digital Earth, Vol 17, Iss 1 (2024)
Publication Year :
2024
Publisher :
Taylor & Francis Group, 2024.

Abstract

ABSTRACTMultiple tasks within the field of geographical information retrieval and geographical information sciences necessitate toponym matching, which involves the challenge of aligning toponyms that share a common referent. The multiple string similarity approaches struggle when confronted with the complexities associated with unofficial and/or historical variants of identical toponyms. Also, current state-of-the-art approaches/tools to supervised machine learning rely on labeled samples, and they do not adequately address the intricacies of character replacements either from transliterations or historical shifts in linguistic and cultural norms. To address these issues, this paper proposes a novel matching approach that leverages a deep neural network model empowered by geographic language representation model, known as GeoBERT, which stands for geographic Bidirectional Encoder Representations from Transformers (BERT). This model harnesses the groundbreaking capabilities of the GeoBERT framework by extending a generalized Enhanced Sequential Inference Model architecture and integrating multiple features to enhance the accuracy and robustness of the toponym matching. We present a comprehensive evaluation of the proposed method’s performance using three extensive datasets. The findings clearly illustrate that our approach outperforms the individual similarity metrics used in previous studies.

Details

Language :
English
ISSN :
17538947 and 17538955
Volume :
17
Issue :
1
Database :
Directory of Open Access Journals
Journal :
International Journal of Digital Earth
Publication Type :
Academic Journal
Accession number :
edsdoj.bace0f73d1984b10b9048c69273c0b2b
Document Type :
article
Full Text :
https://doi.org/10.1080/17538947.2024.2353111