
Towards a General Framework for Effective Solutions to the Data Mapping Problem

Authors :
Fletcher, G.H.L.
Wyss, C.M.
Spaccapietra, S.
Delcambre, L.
Database Group
Source :
Journal on Data Semantics XIV, pp. 37-73. ISBN: 9783642105616
Publication Year :
2009
Publisher :
Springer Berlin Heidelberg, 2009.

Abstract

Automating the discovery of mappings between structured data sources is a long-standing and important problem in data management. We discuss the rich history of the problem and the variety of technical solutions advanced in the database community over the previous four decades. Based on this discussion, we develop a basic statement of the data mapping problem and a general framework for reasoning about the design space of system solutions to the problem. We then concretely illustrate the framework with the Tupelo system for data mapping discovery, focusing on the important common case of relational data sources. Treating mapping discovery as example-driven search in a space of transformations, Tupelo generates queries encompassing the full range of structural and semantic heterogeneities encountered in relational data mapping. Hence, Tupelo is applicable in a wide range of data mapping scenarios. Finally, we present the results of extensive empirical validation on both synthetic and real-world datasets, indicating that the system is both viable and effective.
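
The phrase "example-driven search in a space of transformations" can be made concrete with a small sketch. The Python below is an illustration only and is not the Tupelo implementation: the two operators (rename, drop), the breadth-first strategy, the function names, and the sample data are all assumptions chosen for brevity. The abstract does not specify the operator set or the search algorithm. Given one example source instance and the matching target instance, the search returns a shortest sequence of operators that transforms the first into the second.

```python
from collections import deque

# A relation is modeled as (columns, rows): a tuple of column names and a
# frozenset of row tuples. Both operators are hypothetical stand-ins.

def rename(rel, old, new):
    """Rename column `old` to `new`; return None if the step is not applicable."""
    cols, rows = rel
    if old not in cols or new in cols:
        return None
    return (tuple(new if c == old else c for c in cols), rows)

def drop(rel, col):
    """Project away column `col`; return None if the step is not applicable."""
    cols, rows = rel
    if col not in cols or len(cols) == 1:
        return None
    i = cols.index(col)
    return (cols[:i] + cols[i + 1:], frozenset(r[:i] + r[i + 1:] for r in rows))

def normalize(rel):
    """Sort columns by name so that column order does not affect equality."""
    cols, rows = rel
    order = sorted(range(len(cols)), key=lambda i: cols[i])
    return (tuple(cols[i] for i in order),
            frozenset(tuple(r[i] for i in order) for r in rows))

def discover_mapping(source, target, max_depth=4):
    """Breadth-first search for operator steps turning `source` into `target`."""
    target = normalize(target)
    start = normalize(source)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        rel, path = queue.popleft()
        if rel == target:
            return path
        if len(path) >= max_depth:
            continue
        candidates = []
        for c in rel[0]:
            candidates.append((("drop", c), drop(rel, c)))
            # Only try renaming to names that occur in the target example.
            for t in target[0]:
                candidates.append((("rename", c, t), rename(rel, c, t)))
        for step, nxt in candidates:
            if nxt is None:
                continue
            nxt = normalize(nxt)
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [step]))
    return None  # no mapping found within max_depth

# Made-up example instances: the same data under two different schemas.
source = (("name", "age", "tmp"),
          frozenset({("Ann", 30, "x"), ("Bob", 25, "y")}))
target = (("full_name", "age"),
          frozenset({("Ann", 30), ("Bob", 25)}))

print(discover_mapping(source, target))
```

The example instances play the role of the search goal: a candidate sequence is accepted only when it reproduces the target example exactly, which is what makes the search example-driven. A realistic system would need a richer operator set and a guided search rather than exhaustive breadth-first expansion.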

Details

ISBN :
978-3-642-10561-6
ISBNs :
9783642105616
Database :
OpenAIRE
Journal :
Journal on Data Semantics XIV, pp. 37-73
Accession number :
edsair.doi.dedup.....44047746beab180419a6dc9f2c50a846
Full Text :
https://doi.org/10.1007/978-3-642-10562-3_2