Back to Search
Start Over
The Causal Meaning of Genomic Predictors and How It Affects Construction and Comparison of Genome-Enabled Selection Models
- Source :
- Genetics. 200:483-494
- Publication Year :
- 2015
- Publisher :
- Oxford University Press (OUP), 2015.
-
Abstract
- The additive genetic effect is arguably the most important quantity inferred in animal and plant breeding analyses. The term effect indicates that it represents causal information, which is different from standard statistical concepts as regression coefficient and association. The process of inferring causal information is also different from standard statistical learning, as the former requires causal (i.e. non-statistical) assumptions and involves extra complexities. Remarkably, the task of inferring genetic effects is largely seen as a standard regression/prediction problem, contradicting its label. This widely accepted analysis approach is by itself insufficient for causal learning, suggesting that causality is not the point for selection. Given this incongruence, it is important to verify if genomic predictors need to represent causal effects to be relevant for selection decisions, especially because applying regression studies to answer causal questions may lead to wrong conclusions. The answer to this question defines if genomic selection models should be constructed aiming maximum genomic predictive ability or aiming identifiability of genetic causal effects. Here, we demonstrate that selection relies on a causal effect from genotype to phenotype, and that genomic predictors are only useful for selection if they distinguish such effect from other sources of association. Conversely, genomic predictors capturing non-causal signals provide information that is less relevant for selection regardless of the resulting predictive ability. Focusing on covariate choice decision, simulated examples are used to show that predictive ability, which is the criterion normally used to compare models, may not indicate the quality of genomic predictors for selection. Additionally, we propose using alternative criteria to construct models aiming for the identification of the genetic causal effects.
- Subjects :
- FOS: Computer and information sciences
Computer science
Correlation and dependence
Genomics
Investigations
Biology
Machine learning
computer.software_genre
Quantitative Biology - Quantitative Methods
Statistics - Applications
Linear regression
Covariate
Genetics
Humans
Additive genetic effects
Applications (stat.AP)
Computer Simulation
Selection, Genetic
Quantitative Methods (q-bio.QM)
Selection (genetic algorithm)
Models, Genetic
business.industry
Regression analysis
Causality
Regression
Term (time)
Identification (information)
Genetics, Population
FOS: Biological sciences
Causal inference
Identifiability
Artificial intelligence
business
computer
Algorithms
Subjects
Details
- ISSN :
- 19432631
- Volume :
- 200
- Database :
- OpenAIRE
- Journal :
- Genetics
- Accession number :
- edsair.doi.dedup.....3dd4132e06f4e0cd5c045ac2049e6989
- Full Text :
- https://doi.org/10.1534/genetics.114.169490