Back to Search Start Over

Technical and biological sources of unreliability of Infinium probes on Illumina methylation microarrays

Authors :
Tatiana Nazarenko
Charlotte Dafni Vavourakis
Allison Jones
Iona Evans
Lena Schreiberhuber
Christine Kastner
Isma Ishaq-Parveen
Elisa Redl
Anthony W. Watson
Kirsten Brandt
Clive Carter
Alexey Zaikin
Chiara Maria Stella Herzog
Martin Widschwendter
Source :
Clinical Epigenetics, Vol 16, Iss 1, Pp 1-14 (2024)
Publication Year :
2024
Publisher :
BMC, 2024.

Abstract

Abstract The Illumina Methylation array platform has facilitated countless epigenetic studies on DNA methylation (DNAme) in health and disease, yet relatively few studies have so studied its reliability, i.e., the consistency of repeated measures. Here we investigate the reliability of both type I and type II Infinium probes. We propose a method for excluding unreliable probes based on dynamic thresholds for mean intensity (MI) and ‘unreliability’, estimated by probe-level simulation of the influence of technical noise on methylation β values using the background intensities of negative control probes. We validate our method in several datasets, including newly generated Illumina MethylationEPIC BeadChip v1.0 data from paired whole blood samples taken six weeks apart and technical replicates spanning multiple sample types. Our analysis revealed that specifically probes with low MI exhibit higher β value variability between repeated samples. MI was associated with the number of C-bases in the respective probe sequence and correlated negatively with unreliability scores. The unreliability scores were substantiated through validation in a new EPIC v1.0 (blood and cervix) and a publicly available 450k (blood) dataset, as they effectively captured the variability observed in β values between technical replicates. Finally, despite promising higher robustness, the newer version v2.0 of the MethylationEPIC BeadChip retained a substantial number of probes with poor unreliability scores. To enhance current pre-processing pipelines, we developed an R package to calculate MI and unreliability scores and provide guidance on establishing optimal dynamic score thresholds for a given dataset.

Subjects

Subjects :
Medicine
Genetics
QH426-470

Details

Language :
English
ISSN :
18687083
Volume :
16
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Clinical Epigenetics
Publication Type :
Academic Journal
Accession number :
edsdoj.95e45820f5743da8101fb63cf823224
Document Type :
article
Full Text :
https://doi.org/10.1186/s13148-024-01739-2