Back to Search Start Over

Accurate multi-population imputation of MICA, MICB, HLA-E, HLA-F and HLA-G alleles from genome SNP data.

Authors :
Tammi, Silja
Koskela, Satu
Biobank, Blood Service
Hyvärinen, Kati
Partanen, Jukka
Ritari, Jarmo
Source :
PLoS Computational Biology; 9/16/2024, Vol. 20 Issue 9, p1-19, 19p
Publication Year :
2024

Abstract

In addition to the classical HLA genes, the major histocompatibility complex (MHC) harbors a high number of other polymorphic genes with less established roles in disease associations and transplantation matching. To facilitate studies of the non-classical and non-HLA genes in large patient and biobank cohorts, we trained imputation models for MICA, MICB, HLA-E, HLA-F and HLA-G alleles on genome SNP array data. We show, using both population-specific and multi-population 1000 Genomes references, that the alleles of these genes can be accurately imputed for screening and research purposes. The best imputation model for MICA, MICB, HLA-E, -F and -G achieved a mean accuracy of 99.3% (min, max: 98.6, 99.9). Furthermore, validation of the 1000 Genomes exome short-read sequencing-based allele calling against a clinical-grade reference data showed an average accuracy of 99.8%, testifying for the quality of the 1000 Genomes data as an imputation reference. We also fitted the models for Infinium Global Screening Array (GSA, Illumina, Inc.) and Axiom Precision Medicine Research Array (PMRA, Thermo Fisher Scientific Inc.) SNP content, with mean accuracies of 99.1% (97.2, 100) and 98.9% (97.4, 100), respectively. Author summary: The major histocompatibility complex (MHC) region on chromosome 6 significantly influences disease risk, particularly in autoimmune conditions. To improve fine-mapping of potentially causal genetic variants within this region, we developed accurate imputation methods for inferring functional allelic variation from MHC SNP data. While existing tools primarily focus on classical HLA genes, our study extends to non-classical HLA genes (HLA-E, HLA-F, and HLA-G) and MHC Class I Chain-Related MIC genes (MICA and MICB) which have specific functions in innate and adaptive immunity. Leveraging population-specific Finnish and multi-population 1000 Genomes references, our imputation models demonstrate high accuracy. Moreover, we tailored models for two widely used genome SNP arrays: the Infinium Global Screening Array (Illumina, Inc.) and the Axiom Precision Medicine Research Array (Thermo Fisher Scientific Inc.). These freely available, multi-population models empower researchers to explore genetic MHC associations in more detail and contribute to our understanding of immune-related disease mechanisms. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1553734X
Volume :
20
Issue :
9
Database :
Complementary Index
Journal :
PLoS Computational Biology
Publication Type :
Academic Journal
Accession number :
179663915
Full Text :
https://doi.org/10.1371/journal.pcbi.1011718