Back to Search
Start Over
Multi-source causal feature selection
- Publication Year :
- 2020
- Publisher :
- US : IEEE, 2020.
-
Abstract
- Causal feature selection has attracted much attention in recent years, as the causal features selected imply the causal mechanism related to the class attribute, leading to more reliable prediction models built using them. Currently there is a need of developing multi-source feature selection methods, since in many applications data for studying the same problem has been collected from various sources, such as multiple gene expression datasets obtained from different experiments for studying the causes of the same disease. However, the state-of-the-art causal feature selection methods generally tackle a single dataset, anda direct application of the methods to multiple datasets will result in unreliable results as the datasets may have different distributions. To address the challenges, by utilizing the concept of causal invariance in causal inference, we firstly formulate the problem of causal feature selection with multiple datasets as a search problem for an invariant set across the datasets, then give the upper and lower bounds of the invariant set, and finally we propose a new Multi-source Causal Feature Selection algorithm, MCFS. Using synthetic and real world datasets and 16 feature selection methods, the extensive experiments have validated the effectiveness of MCFS. Refereed/Peer-reviewed
- Subjects :
- Computer science
multiple datasets
Feature extraction
Markov process
Feature selection
02 engineering and technology
Machine learning
computer.software_genre
causal invariance
symbols.namesake
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
Search problem
causal feature selection
Invariant (mathematics)
Markov blanket
business.industry
Applied Mathematics
Bayesian network
Invariant (physics)
Computational Theory and Mathematics
Causal inference
symbols
020201 artificial intelligence & image processing
Computer Vision and Pattern Recognition
Artificial intelligence
business
computer
Software
Multi-source
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....249a0c9c3b85592caae94edd720f52f2