Back to Search Start Over

Discovering conditional matching rules

Authors :
Wang, Y
Song, S
Chen, L
Yu, JX
Cheng, H
Wang, Y
Song, S
Chen, L
Yu, JX
Cheng, H
Publication Year :
2017

Abstract

Matching dependencies (MDS) have recently been proposed to make data dependencies tolerant to various information representations, and found useful in data quality applications such as record matching. Instead of the strict equality function used in traditional dependency syntax (e.g., functional dependencies), MDS specify constraints based on similarity and identification.However, in practice, MDS may still be too strict and applicable only in a subset of tuples in a relation. Thereby, we study the conditional matching dependencies (CMDS), which bindmatching dependencies only in a certain part of a table, i.e., MDS conditionally applicable in a subset of tuples. Compared to MDS, CMDS have more expressive power that enables them to satisfy wider application needs. In this article, we study several important theoretical and practical issues of CMDS, including irreducible CMDS with respect to the implication, discovery of CMDS from data, reliable CMDS agreed most by a relation, approximate CMDS almost satisfied in a relation, and finally applications of CMDS in record matching and missing value repairing. Through an extensive experimental evaluation in real data sets, we demonstrate the efficiency of proposed CMDS discovery algorithms and effectiveness of CMDS in real applications.

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1197446337
Document Type :
Electronic Resource