Back to Search Start Over

Accurate prediction of cell composition, age, smoking consumption and infection serostatus based on blood DNA methylation profiles

Authors :
Lluis Quintana-Murci
Darragh Duffy
Matthew L. Albert
Etienne Patin
Jacob Bergstedt
Alejandra Urrutia
Lund University [Lund]
Centre de Recherche Translationnelle - Center for Translational Science (CRT)
Institut Pasteur [Paris] (IP)
Immunobiologie des Cellules dendritiques
Institut Pasteur [Paris] (IP)-Institut National de la Santé et de la Recherche Médicale (INSERM)
Genentech, Inc. [San Francisco]
Génétique Evolutive Humaine - Human Evolutionary Genetics
Institut Pasteur [Paris] (IP)-Centre National de la Recherche Scientifique (CNRS)
Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI)
This work benefited from support of the French government’s Program In vestissement d’Avenir, managed by the Agence Nationale de la Recherche (ANR, reference 10-LABX-69-01). J.B. is a member of the LCCC Linnaeus Center and the ELLIIT Excellence Center at Lund University and is supported by the ELLIIT Excellence Center.
ANR-10-LABX-0069,MILIEU INTERIEUR,GENETIC & ENVIRONMENTAL CONTROL OF IMMUNE PHENOTYPE VARIANCE: ESTABLISHING A PATH TOWARDS PERSONALIZED MEDICINE(2010)
Institut Pasteur [Paris]
Institut Pasteur [Paris]-Institut National de la Santé et de la Recherche Médicale (INSERM)
Institut Pasteur [Paris]-Centre National de la Recherche Scientifique (CNRS)
Publication Year :
2021
Publisher :
HAL CCSD, 2021.

Abstract

Posté sur bioRxiv le 30/10/2018; DNA methylation is a stable epigenetic alteration that plays a key role in cellular differentiation and gene regulation, and that has been proposed to mediate environmental effects on disease risk. Epigenome-wide association studies have identified and replicated associations between methylation sites and several disease conditions, which could serve as biomarkers in predictive medicine and forensics. Nevertheless, heterogeneity in cellular proportions between the compared groups could complicate interpretation. Reference-based cell-type deconvolution methods have proven useful in correcting epigenomic studies for cellular heterogeneity, but they rely on reference libraries of sorted cells and only predict a limited number of cell populations. Here we leverage >850,000 methylation sites included in the MethylationEPIC array and use elastic net regularized and stability selected regression models to predict the circulating levels of 70 blood cell subsets, measured by standardized flow cytometry in 962 healthy donors of western European descent. We show that our predictions, based on a hundred of methylation sites or lower, are less error-prone than other existing methods, and extend the number of cell types that can be accurately predicted. Application of the same methods to age, smoking consumption and several serological responses to pathogen antigens also provide accurate estimations. Together, our study substantially improves predictions of blood cell composition based on methylation profiles, which will be critical in the emerging field of medical epigenomics.

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....7b352e3352e2ac0b5df5d140197c1ac7