Back to Search
Start Over
Accurate prediction of cell composition, age, smoking consumption and infection serostatus based on blood DNA methylation profiles
- Publication Year :
- 2021
- Publisher :
- HAL CCSD, 2021.
-
Abstract
- Posté sur bioRxiv le 30/10/2018; DNA methylation is a stable epigenetic alteration that plays a key role in cellular differentiation and gene regulation, and that has been proposed to mediate environmental effects on disease risk. Epigenome-wide association studies have identified and replicated associations between methylation sites and several disease conditions, which could serve as biomarkers in predictive medicine and forensics. Nevertheless, heterogeneity in cellular proportions between the compared groups could complicate interpretation. Reference-based cell-type deconvolution methods have proven useful in correcting epigenomic studies for cellular heterogeneity, but they rely on reference libraries of sorted cells and only predict a limited number of cell populations. Here we leverage >850,000 methylation sites included in the MethylationEPIC array and use elastic net regularized and stability selected regression models to predict the circulating levels of 70 blood cell subsets, measured by standardized flow cytometry in 962 healthy donors of western European descent. We show that our predictions, based on a hundred of methylation sites or lower, are less error-prone than other existing methods, and extend the number of cell types that can be accurately predicted. Application of the same methods to age, smoking consumption and several serological responses to pathogen antigens also provide accurate estimations. Together, our study substantially improves predictions of blood cell composition based on methylation profiles, which will be critical in the emerging field of medical epigenomics.
- Subjects :
- Regulation of gene expression
0303 health sciences
Cellular differentiation
Genomics
Methylation
Computational biology
Biology
01 natural sciences
3. Good health
Predictive medicine
010104 statistics & probability
03 medical and health sciences
DNA methylation
[SDV.IMM]Life Sciences [q-bio]/Immunology
Epigenetics
0101 mathematics
030304 developmental biology
Epigenomics
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....7b352e3352e2ac0b5df5d140197c1ac7