Back to Search Start Over

Identifiability and Estimation of Probabilities from Multiple Databases with Incomplete Data and Sampling Selection.

Authors :
Dit-Yan Yeung
Kwok, James T.
Fred, Ana
Roli, Fabio
de Ridder, Dick
Jinzhu Jia
Zhi Geng
Mingfeng Wang
Source :
Structural, Syntactic & Statistical Pattern Recognition; 2006, p792-798, 7p
Publication Year :
2006

Abstract

For an application problem, there may be multiple databases, and each database may not contain complete variables or attributes, that is, some variables are observed but some others are missing. Further, data of a database may be collected conditionally on some designed variables. In this paper, we discuss problems related to data mining from such multiple databases. We propose an approach for detecting identifiability of a joint distribution from multiple databases. For an identifiable joint distribution, we further present the expectation-maximization (EM) algorithm for calculating the maximum likelihood estimates (MLEs) of the joint distribution. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540372363
Database :
Complementary Index
Journal :
Structural, Syntactic & Statistical Pattern Recognition
Publication Type :
Book
Accession number :
32910385
Full Text :
https://doi.org/10.1007/11815921_87