Back to Search Start Over

Modeling Linkage Disequilibrium Increases Accuracy of Polygenic Risk Scores.

Authors :
Vilhjálmsson, Bjarni J
Vilhjálmsson, Bjarni J
Yang, Jian
Finucane, Hilary K
Gusev, Alexander
Lindström, Sara
Ripke, Stephan
Genovese, Giulio
Loh, Po-Ru
Bhatia, Gaurav
Do, Ron
Hayeck, Tristan
Won, Hong-Hee
Schizophrenia Working Group of the Psychiatric Genomics Consortium, Discovery, Biology, and Risk of Inherited Variants in Breast Cancer (DRIVE) study
Kathiresan, Sekar
Pato, Michele
Pato, Carlos
Tamimi, Rulla
Stahl, Eli
Zaitlen, Noah
Pasaniuc, Bogdan
Belbin, Gillian
Kenny, Eimear E
Schierup, Mikkel H
De Jager, Philip
Patsopoulos, Nikolaos A
McCarroll, Steve
Daly, Mark
Purcell, Shaun
Chasman, Daniel
Neale, Benjamin
Goddard, Michael
Visscher, Peter M
Kraft, Peter
Patterson, Nick
Price, Alkes L
Vilhjálmsson, Bjarni J
Vilhjálmsson, Bjarni J
Yang, Jian
Finucane, Hilary K
Gusev, Alexander
Lindström, Sara
Ripke, Stephan
Genovese, Giulio
Loh, Po-Ru
Bhatia, Gaurav
Do, Ron
Hayeck, Tristan
Won, Hong-Hee
Schizophrenia Working Group of the Psychiatric Genomics Consortium, Discovery, Biology, and Risk of Inherited Variants in Breast Cancer (DRIVE) study
Kathiresan, Sekar
Pato, Michele
Pato, Carlos
Tamimi, Rulla
Stahl, Eli
Zaitlen, Noah
Pasaniuc, Bogdan
Belbin, Gillian
Kenny, Eimear E
Schierup, Mikkel H
De Jager, Philip
Patsopoulos, Nikolaos A
McCarroll, Steve
Daly, Mark
Purcell, Shaun
Chasman, Daniel
Neale, Benjamin
Goddard, Michael
Visscher, Peter M
Kraft, Peter
Patterson, Nick
Price, Alkes L
Source :
American journal of human genetics; vol 97, iss 4, 576-592; 0002-9297
Publication Year :
2015

Abstract

Polygenic risk scores have shown great promise in predicting complex disease risk and will become more accurate as training sample sizes increase. The standard approach for calculating risk scores involves linkage disequilibrium (LD)-based marker pruning and applying a p value threshold to association statistics, but this discards information and can reduce predictive accuracy. We introduce LDpred, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel. Theory and simulations show that LDpred outperforms the approach of pruning followed by thresholding, particularly at large sample sizes. Accordingly, predicted R(2) increased from 20.1% to 25.3% in a large schizophrenia dataset and from 9.8% to 12.0% in a large multiple sclerosis dataset. A similar relative improvement in accuracy was observed for three additional large disease datasets and for non-European schizophrenia samples. The advantage of LDpred over existing methods will grow as sample sizes increase.

Details

Database :
OAIster
Journal :
American journal of human genetics; vol 97, iss 4, 576-592; 0002-9297
Notes :
application/pdf, American journal of human genetics vol 97, iss 4, 576-592 0002-9297
Publication Type :
Electronic Resource
Accession number :
edsoai.on1377972247
Document Type :
Electronic Resource