Back to Search Start Over

Inflation of polygenic risk scores caused by sample overlap and relatedness: Examples of a major risk of bias.

Authors :
Ellis CA
Oliver KL
Harris RV
Ottman R
Scheffer IE
Mefford HC
Epstein MP
Berkovic SF
Bahlo M
Source :
American journal of human genetics [Am J Hum Genet] 2024 Sep 05; Vol. 111 (9), pp. 1805-1809. Date of Electronic Publication: 2024 Aug 20.
Publication Year :
2024

Abstract

Polygenic risk scores (PRSs) are an important tool for understanding the role of common genetic variants in human disease. Standard best practices recommend that PRSs be analyzed in cohorts that are independent of the genome-wide association study (GWAS) used to derive the scores without sample overlap or relatedness between the two cohorts. However, identifying sample overlap and relatedness can be challenging in an era of GWASs performed by large biobanks and international research consortia. Although most genomics researchers are aware of best practices and theoretical concerns about sample overlap and relatedness between GWAS and PRS cohorts, the prevailing assumption is that the risk of bias is small for very large GWASs. Here, we present two real-world examples demonstrating that sample overlap and relatedness is not a minor or theoretical concern but an important potential source of bias in PRS studies. Using a recently developed statistical adjustment tool, we found that excluding overlapping and related samples was equal to or more powerful than adjusting for overlap bias. Our goal is to make genomics researchers aware of the magnitude of risk of bias from sample overlap and relatedness and to highlight the need for mitigation tools, including independent validation cohorts in PRS studies, continued development of statistical adjustment methods, and tools for researchers to test their cohorts for overlap and relatedness with GWAS cohorts without sharing individual-level data.<br />Competing Interests: Declaration of interests The authors declare no competing interests.<br /> (Copyright © 2024 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.)

Details

Language :
English
ISSN :
1537-6605
Volume :
111
Issue :
9
Database :
MEDLINE
Journal :
American journal of human genetics
Publication Type :
Academic Journal
Accession number :
39168121
Full Text :
https://doi.org/10.1016/j.ajhg.2024.07.014