Back to Search Start Over

Machine learning models for predicting blood pressure phenotypes by combining multiple polygenic risk scores

Authors :
Yana Hrytsenko
Benjamin Shea
Michael Elgart
Nuzulul Kurniansyah
Genevieve Lyons
Alanna C. Morrison
April P. Carson
Bernhard Haring
Braxton D. Mitchell
Bruce M. Psaty
Byron C. Jaeger
C. Charles Gu
Charles Kooperberg
Daniel Levy
Donald Lloyd-Jones
Eunhee Choi
Jennifer A. Brody
Jennifer A. Smith
Jerome I. Rotter
Matthew Moll
Myriam Fornage
Noah Simon
Peter Castaldi
Ramon Casanova
Ren-Hua Chung
Robert Kaplan
Ruth J. F. Loos
Sharon L. R. Kardia
Stephen S. Rich
Susan Redline
Tanika Kelly
Timothy O’Connor
Wei Zhao
Wonji Kim
Xiuqing Guo
Yii-Der Ida Chen
The Trans-Omics in Precision Medicine Consortium
Tamar Sofer
Source :
Scientific Reports, Vol 14, Iss 1, Pp 1-17 (2024)
Publication Year :
2024
Publisher :
Nature Portfolio, 2024.

Abstract

Abstract We construct non-linear machine learning (ML) prediction models for systolic and diastolic blood pressure (SBP, DBP) using demographic and clinical variables and polygenic risk scores (PRSs). We developed a two-model ensemble, consisting of a baseline model, where prediction is based on demographic and clinical variables only, and a genetic model, where we also include PRSs. We evaluate the use of a linear versus a non-linear model at both the baseline and the genetic model levels and assess the improvement in performance when incorporating multiple PRSs. We report the ensemble model’s performance as percentage variance explained (PVE) on a held-out test dataset. A non-linear baseline model improved the PVEs from 28.1 to 30.1% (SBP) and 14.3% to 17.4% (DBP) compared with a linear baseline model. Including seven PRSs in the genetic model computed based on the largest available GWAS of SBP/DBP improved the genetic model PVE from 4.8 to 5.1% (SBP) and 4.7 to 5% (DBP) compared to using a single PRS. Adding additional 14 PRSs computed based on two independent GWASs further increased the genetic model PVE to 6.3% (SBP) and 5.7% (DBP). PVE differed across self-reported race/ethnicity groups, with primarily all non-White groups benefitting from the inclusion of additional PRSs. In summary, non-linear ML models improves BP prediction in models incorporating diverse populations.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
20452322
Volume :
14
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Scientific Reports
Publication Type :
Academic Journal
Accession number :
edsdoj.322241f378754ea881e603a18375361b
Document Type :
article
Full Text :
https://doi.org/10.1038/s41598-024-62945-9