Back to Search
Start Over
Sifting Truths from Multiple Low-Quality Data Sources
- Source :
- Web and Big Data ISBN: 9783319635781, APWeb/WAIM (1)
- Publication Year :
- 2017
- Publisher :
- Springer International Publishing, 2017.
-
Abstract
- In this paper, we study the problem of assessing the quality of co-reference tuples extracted from multiple low-quality data sources and finding true values from them. It is a critical part of an effective data integration solution. In order to solve this problem, we first propose a model to specify the tuple quality. Then we present a framework to infer the tuple quality based on the concept of quality predicates. In particular, we propose an algorithm underlying the framework to find true values for each attribute. Last, we have conducted extensive experiments on real-life data to verify the effectiveness and efficiency of our methods.
- Subjects :
- Computer science
media_common.quotation_subject
02 engineering and technology
computer.software_genre
Sensor fusion
Order (business)
020204 information systems
Data quality
0202 electrical engineering, electronic engineering, information engineering
Quality (business)
Data mining
Tuple
computer
Data integration
media_common
Subjects
Details
- ISBN :
- 978-3-319-63578-1
- ISBNs :
- 9783319635781
- Database :
- OpenAIRE
- Journal :
- Web and Big Data ISBN: 9783319635781, APWeb/WAIM (1)
- Accession number :
- edsair.doi...........c2a7d58dc8b2935401b410016b12f4fa
- Full Text :
- https://doi.org/10.1007/978-3-319-63579-8_7