Back to Search Start Over

A deep learning model for prediction of autism status using whole-exome sequencing data.

Authors :
Wu, Qing
Morrow, Eric M.
Uzun, Ece D. Gamsiz
Source :
PLoS Computational Biology. 11/8/2024, Vol. 20 Issue 11, p1-18. 18p.
Publication Year :
2024

Abstract

Autism is a developmental disability. Research demonstrated that children with autism benefit from early diagnosis and early intervention. Genetic factors are considered major contributors to the development of autism. Machine learning (ML), including deep learning (DL), has been evaluated in phenotype prediction, but this method has been limited in its application to autism. We developed a DL model, the Separate Translated Autism Research Neural Network (STAR-NN) model to predict autism status. The model was trained and tested using whole exome sequencing data from 43,203 individuals (16,809 individuals with autism and 26,394 non-autistic controls). Polygenic scores from common variants and the aggregated count of rare variants on genes were used as input. In STAR-NN, protein truncating variants, possibly damaging missense variants and mild effect missense variants on the same gene were separated at the input level and merged to one gene node. In this way, rare variants with different level of pathogenic effects were treated separately. We further validated the performance of STAR-NN using an independent dataset, including 13,827 individuals with autism and 14,052 non-autistic controls. STAR-NN achieved a modest ROC-AUC of 0.7319 on the testing dataset and 0.7302 on the independent dataset. STAR-NN outperformed other traditional ML models. Gene Ontology analysis on the selected gene features showed an enrichment for potentially informative pathways including potassium ion transport. Author summary: Autism is a developmental disability. Genetic factors are considered as major contributors to the development of autism. Here, we present a deep neural network model, Separate Translated Autism Research Neural Network (STAR-NN), to predict autism status using WES data. STAR-NN showed a modest performance and was validated using an independent dataset. However, ML based models may have advantages in autism status prediction using genomic data and should be further studied. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1553734X
Volume :
20
Issue :
11
Database :
Academic Search Index
Journal :
PLoS Computational Biology
Publication Type :
Academic Journal
Accession number :
180764052
Full Text :
https://doi.org/10.1371/journal.pcbi.1012468