1. Conflict-aware historical data fusion
- Author
-
Zadorozhny, V, Hsu, YF, Zadorozhny, V, and Hsu, YF
- Abstract
Historical data reports on numerous events for overlapping time intervals, locations, and names. As a result, it may include severe data conflicts caused by database redundancy that prevent researchers from obtaining the correct answers to queries on an integrated historical database. In this paper, we propose a novel conflict-aware data fusion strategy for historical data sources. We evaluated our approach on a large-scale data warehouse that integrates historical data from approximately 50,000 reports on US epidemiological data for more than 100 years. We demonstrate that our approach significantly reduces data aggregation error in the integrated historical database. © 2011 Springer-Verlag.
- Published
- 2011