Back to Search Start Over

Sifting Truths from Multiple Low-Quality Data Sources

Authors :
Qizhi Liu
Zhifeng Bao
Zizhe Xie
Source :
Web and Big Data ISBN: 9783319635781, APWeb/WAIM (1)
Publication Year :
2017
Publisher :
Springer International Publishing, 2017.

Abstract

In this paper, we study the problem of assessing the quality of co-reference tuples extracted from multiple low-quality data sources and finding true values from them. It is a critical part of an effective data integration solution. In order to solve this problem, we first propose a model to specify the tuple quality. Then we present a framework to infer the tuple quality based on the concept of quality predicates. In particular, we propose an algorithm underlying the framework to find true values for each attribute. Last, we have conducted extensive experiments on real-life data to verify the effectiveness and efficiency of our methods.

Details

ISBN :
978-3-319-63578-1
ISBNs :
9783319635781
Database :
OpenAIRE
Journal :
Web and Big Data ISBN: 9783319635781, APWeb/WAIM (1)
Accession number :
edsair.doi...........c2a7d58dc8b2935401b410016b12f4fa
Full Text :
https://doi.org/10.1007/978-3-319-63579-8_7