Modern Baselines for SPARQL Semantic Parsing

Authors :
Banerjee, Debayan
Nair, Pranav Ajit
Kaur, Jivat Neet
Usbeck, Ricardo
Biemann, Chris
Publication Year :
2022

Abstract

In this work, we focus on the task of generating SPARQL queries from natural language questions, which can then be executed on Knowledge Graphs (KGs). We assume that gold entities and relations have been provided, and the remaining task is to arrange them in the right order, along with SPARQL vocabulary and input tokens, to produce the correct SPARQL query. Pre-trained Language Models (PLMs) have not been explored in depth on this task so far, so we experiment with BART, T5 and PGNs (Pointer Generator Networks) with BERT embeddings, looking for new baselines in the PLM era for this task, on the DBpedia and Wikidata KGs. We show that T5 requires special input tokenisation, but produces state-of-the-art performance on the LC-QuAD 1.0 and LC-QuAD 2.0 datasets, and outperforms task-specific models from previous works. Moreover, the methods enable semantic parsing for questions where a part of the input needs to be copied to the output query, thus enabling a new paradigm in KG semantic parsing.

Comment: 5 pages, short paper, SIGIR 2022

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2204.12793
Document Type :
Working Paper
Full Text :
https://doi.org/10.1145/3477495.3531841