A New Algorithm to Optimize Maximal Information Coefficient
- Source :
- PLoS ONE, Vol 11, Iss 6, p e0157567 (2016), PLoS ONE
- Publication Year :
- 2016
- Publisher :
- Public Library of Science (PLoS), 2016.
Abstract
- The maximal information coefficient (MIC) captures dependences between paired variables, including both functional and non-functional relationships. In this paper, we develop a new method, ChiMIC, to calculate MIC values. The ChiMIC algorithm uses the chi-square test to terminate grid optimization and thereby removes the maximal grid size restriction of the original ApproxMaxMI algorithm. Computational experiments show that the ChiMIC algorithm maintains the same MIC values for noiseless functional relationships but gives much smaller MIC values for independent variables. For noisy functional relationships, the ChiMIC algorithm reaches the optimal partition much faster. Furthermore, the MCN values based on MIC calculated by ChiMIC better capture the complexity of functional relationships, and the statistical power of MIC calculated by ChiMIC is higher than that of MIC calculated by ApproxMaxMI. Moreover, the computational cost of ChiMIC is much lower than that of ApproxMaxMI. We apply the MIC values to feature selection and obtain better classification accuracy using features selected by the MIC values from ChiMIC.
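The two ingredients named in the abstract, a grid-based mutual information score and a chi-square test on the resulting contingency table, can be illustrated with a minimal sketch. This is not the ChiMIC implementation: the abstract does not say how the test is wired into the grid search, so the function names, the restriction to a 2x2 table, and the 0.05 threshold mentioned in the comments are all assumptions made for illustration. The normalization by the logarithm of the smaller grid dimension follows the usual definition of MIC.

```python
import math
from collections import Counter


def grid_mutual_information(cells):
    """Mutual information (in bits) of points already assigned to grid cells.

    `cells` is a list of (row_bin, col_bin) pairs, one per data point.
    """
    n = len(cells)
    joint = Counter(cells)
    rows = Counter(r for r, _ in cells)
    cols = Counter(c for _, c in cells)
    mi = 0.0
    for (r, c), count in joint.items():
        p = count / n
        # p(r, c) / (p(r) * p(c)) written in terms of raw counts
        mi += p * math.log2(count * n / (rows[r] * cols[c]))
    return mi


def normalized_grid_score(cells):
    """Grid MI divided by log2(min(#rows, #cols)), the usual MIC normalization."""
    n_rows = len({r for r, _ in cells})
    n_cols = len({c for _, c in cells})
    denom = math.log2(min(n_rows, n_cols))
    return grid_mutual_information(cells) / denom if denom > 0 else 0.0


def chi_square_2x2(table):
    """Pearson chi-square statistic and p-value for a 2x2 contingency table.

    With one degree of freedom the p-value has the closed form
    erfc(sqrt(stat / 2)), so no statistics library is needed.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    margins = (a + b) * (c + d) * (a + c) * (b + d)
    if margins == 0:
        return 0.0, 1.0  # an empty row or column carries no evidence
    stat = n * (a * d - b * c) ** 2 / margins
    return stat, math.erfc(math.sqrt(stat / 2))


# Hypothetical usage: accept a candidate split only if the test is significant
# at an assumed 0.05 level; otherwise stop refining the grid.
dependent = [(0, 0), (1, 1)] * 10
independent = [(0, 0), (0, 1), (1, 0), (1, 1)] * 5
stat, p_value = chi_square_2x2([[20, 0], [0, 20]])
keep_split = p_value < 0.05
```

In this toy setting the perfectly dependent cell assignment gets a normalized score of 1 and the balanced independent one gets 0, which mirrors the abstract's distinction between noiseless functional relationships and independent variables.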
- Subjects :
- 0301 basic medicine
Time Factors
Microarrays
Statistics as Topic
Information Theory
lcsh:Medicine
Information theory
Chi Square Tests
01 natural sciences
010104 statistics & probability
Mathematical and Statistical Techniques
Neoplasms
Breast Tumors
Medicine and Health Sciences
lcsh:Science
Mathematics
Contingency table
Multidisciplinary
Noise (signal processing)
Applied Mathematics
Simulation and Modeling
Fractals
Bioassays and Physiological Analysis
Databases as Topic
Oncology
Physical Sciences
Algorithm
Algorithms
Statistics (Mathematics)
Research Article
Optimization
Geometry
Research and Analysis Methods
03 medical and health sciences
Fractal
Breast Cancer
Humans
Partition (number theory)
Statistical Methods
0101 mathematics
Statistical Hypothesis Testing
Selection (genetic algorithm)
Variables
Contingency Tables
lcsh:R
Cancers and Neoplasms
Models, Theoretical
030104 developmental biology
Radii
lcsh:Q
Maximal information coefficient
Details
- ISSN :
- 19326203
- Volume :
- 11
- Database :
- OpenAIRE
- Journal :
- PLOS ONE
- Accession number :
- edsair.doi.dedup.....b42a2ec2b419bd0f18cc8c44fae515ef