Back to Search Start Over

Systematic replication of smoking disease associations using survey responses and EHR data in the All of Us Research Program.

Authors :
Schlueter DJ
Sulieman L
Mo H
Keaton JM
Ferrara TM
Williams A
Qian J
Stubblefield O
Zeng C
Tran TC
Bastarache L
Dai J
Babbar A
Ramirez A
Goleva SB
Denny JC
Source :
Journal of the American Medical Informatics Association : JAMIA [J Am Med Inform Assoc] 2023 Dec 22; Vol. 31 (1), pp. 139-153.
Publication Year :
2023

Abstract

Objective: The All of Us Research Program (All of Us) aims to recruit over a million participants to further precision medicine. Essential to the verification of biobanks is a replication of known associations to establish validity. Here, we evaluated how well All of Us data replicated known cigarette smoking associations.<br />Materials and Methods: We defined smoking exposure as follows: (1) an EHR Smoking exposure that used International Classification of Disease codes; (2) participant provided information (PPI) Ever Smoking; and, (3) PPI Current Smoking, both from the lifestyle survey. We performed a phenome-wide association study (PheWAS) for each smoking exposure measurement type. For each, we compared the effect sizes derived from the PheWAS to published meta-analyses that studied cigarette smoking from PubMed. We defined two levels of replication of meta-analyses: (1) nominally replicated: which required agreement of direction of effect size, and (2) fully replicated: which required overlap of confidence intervals.<br />Results: PheWASes with EHR Smoking, PPI Ever Smoking, and PPI Current Smoking revealed 736, 492, and 639 phenome-wide significant associations, respectively. We identified 165 meta-analyses representing 99 distinct phenotypes that could be matched to EHR phenotypes. At Pā€‰<ā€‰.05, 74 were nominally replicated and 55 were fully replicated. At Pā€‰<ā€‰2.68 × 10-5 (Bonferroni threshold), 58 were nominally replicated and 40 were fully replicated.<br />Discussion: Most phenotypes found in published meta-analyses associated with smoking were nominally replicated in All of Us. Both survey and EHR definitions for smoking produced similar results.<br />Conclusion: This study demonstrated the feasibility of studying common exposures using All of Us data.<br /> (Published by Oxford University Press on behalf of the American Medical Informatics Association 2023.)

Details

Language :
English
ISSN :
1527-974X
Volume :
31
Issue :
1
Database :
MEDLINE
Journal :
Journal of the American Medical Informatics Association : JAMIA
Publication Type :
Academic Journal
Accession number :
37885303
Full Text :
https://doi.org/10.1093/jamia/ocad205