Back to Search Start Over

Complex-Trait Prediction in the Era of Big Data.

Authors :
de los Campos, Gustavo
Vazquez, Ana Ines
Hsu, Stephen
Lello, Louis
Source :
Trends in Genetics. Oct2018, Vol. 34 Issue 10, p746-754. 9p.
Publication Year :
2018

Abstract

Accurate prediction of complex traits requires using a large number of DNA variants. Advances in statistical and machine learning methodology enable the identification of complex patterns in high-dimensional settings. However, training these highly parameterized methods requires very large data sets. Until recently, such data sets were not available. But the situation is changing rapidly as very large biomedical data sets comprising individual genotype-phenotype data for hundreds of thousands of individuals become available in public and private domains. We argue that the convergence of advances in methodology and the advent of Big Genomic Data will enable unprecedented improvements in complex-trait prediction; we review theory and evidence supporting our claim and discuss challenges and opportunities that Big Data will bring to complex-trait prediction. Highlights Genome-wide association (GWA) studies have discovered thousands of variants associated with many important human traits and diseases. However, GWA-significant variants explain only a small fraction of the trait heritability. Achieving high genomic prediction accuracy requires using tens or hundreds of thousands of SNPs, including many that do not reach GWA significance. Penalized and Bayesian regressions can be used to fit high-dimensional regressions including hundreds of thousands of predictors. However, training these high-dimensional regressions requires using very large data sets. Until recently, such data sets were not available, but this situation is changing rapidly. We argue that the convergence of advances in methodology and the advent of very large biomedical data sets (comprising hundreds of thousands of genotypes linked to phenotypes) will enable unprecedented improvements in complex-trait prediction. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01689525
Volume :
34
Issue :
10
Database :
Academic Search Index
Journal :
Trends in Genetics
Publication Type :
Academic Journal
Accession number :
131730888
Full Text :
https://doi.org/10.1016/j.tig.2018.07.004