1. Unsupervised Clustering of Missense Variants in HNF1A Using Multidimensional Functional Data Aids Clinical Interpretation.
- Author
-
Althari S, Najmi LA, Bennett AJ, Aukrust I, Rundle JK, Colclough K, Molnes J, Kaci A, Nawaz S, van der Lugt T, Hassanali N, Mahajan A, Molven A, Ellard S, McCarthy MI, Bjørkhaug L, Njølstad PR, and Gloyn AL
- Subjects
- Adolescent, Adult, Alleles, Child, Cluster Analysis, Datasets as Topic, Diabetes Mellitus, Type 2 diagnosis, Diabetes Mellitus, Type 2 epidemiology, Diabetes Mellitus, Type 2 pathology, Female, Gene Expression, Humans, Male, Norway epidemiology, Phenotype, Principal Component Analysis, United Kingdom epidemiology, Exome Sequencing, Young Adult, Diabetes Mellitus, Type 2 genetics, Genetic Predisposition to Disease, Hepatocyte Nuclear Factor 1-alpha genetics, Mutation, Missense, Registries, Unsupervised Machine Learning
- Abstract
Exome sequencing in diabetes presents a diagnostic challenge because depending on frequency, functional impact, and genomic and environmental contexts, HNF1A variants can cause maturity-onset diabetes of the young (MODY), increase type 2 diabetes risk, or be benign. A correct diagnosis matters as it informs on treatment, progression, and family risk. We describe a multi-dimensional functional dataset of 73 HNF1A missense variants identified in exomes of 12,940 individuals. Our aim was to develop an analytical framework for stratifying variants along the HNF1A phenotypic continuum to facilitate diagnostic interpretation. HNF1A variant function was determined by four different molecular assays. Structure of the multi-dimensional dataset was explored using principal component analysis, k-means, and hierarchical clustering. Weights for tissue-specific isoform expression and functional domain were integrated. Functionally annotated variant subgroups were used to re-evaluate genetic diagnoses in national MODY diagnostic registries. HNF1A variants demonstrated a range of behaviors across the assays. The structure of the multi-parametric data was shaped primarily by transactivation. Using unsupervised learning methods, we obtained high-resolution functional clusters of the variants that separated known causal MODY variants from benign and type 2 diabetes risk variants and led to reclassification of 4% and 9% of HNF1A variants identified in the UK and Norway MODY diagnostic registries, respectively. Our proof-of-principle analyses facilitated informative stratification of HNF1A variants along the continuum, allowing improved evaluation of clinical significance, management, and precision medicine in diabetes clinics. Transcriptional activity appears a superior readout supporting pursuit of transactivation-centric experimental designs for high-throughput functional screens., (Copyright © 2020 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF