Back to Search
Start Over
How Deepbics Quantifies Intensities of Transcription Factor-DNA Binding and Facilitates Prediction of Single Nucleotide Variant Pathogenicity With a Deep Learning Model Trained On ChIP-Seq Data Sets.
- Source :
-
IEEE/ACM transactions on computational biology and bioinformatics [IEEE/ACM Trans Comput Biol Bioinform] 2023 Mar-Apr; Vol. 20 (2), pp. 1594-1599. Date of Electronic Publication: 2023 Apr 03. - Publication Year :
- 2023
-
Abstract
- The binding of DNA sequences to cell type-specific transcription factors is essential for regulating gene expression in all organisms. Many variants occurring in these binding regions play crucial roles in human disease by disrupting the cis-regulation of gene expression. We first implemented a sequence-based deep learning model called deepBICS to quantify the intensity of transcription factors-DNA binding. The experimental results not only showed the superiority of deepBICS on ChIP-seq data sets but also suggested deepBICS as a language model could help the classification of disease-related and neutral variants. We then built a language model-based method called deepBICS4SNV to predict the pathogenicity of single nucleotide variants. The good performance of deepBICS4SNV on 2 tests related to Mendelian disorders and viral diseases shows the sequence contextual information derived from language models can improve prediction accuracy and generalization capability.
Details
- Language :
- English
- ISSN :
- 1557-9964
- Volume :
- 20
- Issue :
- 2
- Database :
- MEDLINE
- Journal :
- IEEE/ACM transactions on computational biology and bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 35471887
- Full Text :
- https://doi.org/10.1109/TCBB.2022.3170343