Back to Search
Start Over
A machine learning-based approach for estimating and testing associations with multivariate outcomes
- Source :
- The international journal of biostatistics, vol 17, iss 1
- Publication Year :
- 2020
- Publisher :
- eScholarship, University of California, 2020.
-
Abstract
- We propose a method for summarizing the strength of association between a set of variables and a multivariate outcome. Classical summary measures are appropriate when linear relationships exist between covariates and outcomes, while our approach provides an alternative that is useful in situations where complex relationships may be present. We utilize machine learning to detect nonlinear relationships and covariate interactions and propose a measure of association that captures these relationships. A hypothesis test about the proposed associative measure can be used to test the strong null hypothesis of no association between a set of variables and a multivariate outcome. Simulations demonstrate that this hypothesis test has greater power than existing methods against alternatives where covariates have nonlinear relationships with outcomes. We additionally propose measures of variable importance for groups of variables, which summarize each groups’ association with the outcome. We demonstrate our methodology using data from a birth cohort study on childhood health and nutrition in the Philippines.
- Subjects :
- Statistics and Probability
Multivariate statistics
Computer science
Statistics & Probability
Machine learning
computer.software_genre
Cohort Studies
Machine Learning
03 medical and health sciences
canonical correlation
0302 clinical medicine
multivariate outcomes
Rare Diseases
2.5 Research design and methodologies (aetiology)
030225 pediatrics
Covariate
Behavioral and Social Science
Humans
030212 general & internal medicine
Aetiology
Set (psychology)
Child
Statistical hypothesis testing
Pediatric
business.industry
Prevention
Statistics
General Medicine
Outcome (probability)
Variable (computer science)
variable importance
Birth Cohort
epidemiology
Artificial intelligence
Generic health relevance
Statistics, Probability and Uncertainty
Null hypothesis
Canonical correlation
business
computer
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- The international journal of biostatistics, vol 17, iss 1
- Accession number :
- edsair.doi.dedup.....5aeff0069d538f1962088d54e60da2c2