32 results on '"deep learning"'
Search Results
2. Accurate prediction of CDR-H3 loop structures of antibodies with deep learning.
- Author
-
Hedi Chen, Xiaoyu Fan, Shuqian Zhu, Yuchan Pei, Xiaochun Zhang, Xiaonan Zhang, Lihang Liu, Feng Qian, and Boxue Tian
- Subjects
- *
DEEP learning , *LANGUAGE models , *DRUG administration routes , *ENGINEERS , *PROTEIN models , *IMMUNOGLOBULINS - Abstract
Accurate prediction of the structurally diverse complementarity determining region heavy chain 3 (CDR-H3) loop structure remains a primary and long-standing challenge for antibody modeling. Here, we present the H3-OPT toolkit for predicting the 3D structures of monoclonal antibodies and nanobodies. H3-OPT combines the strengths of AlphaFold2 with a pre-trained protein language model and provides a 2.24 Å average RMSDCa between predicted and experimentally determined CDR-H3 loops, thus outperforming other current computational methods in our non-redundant high-quality dataset. The model was validated by experimentally solving three structures of anti-VEGF nanobodies predicted by H3-OPT. We examined the potential applications of H3-OPT through analyzing antibody surface properties and antibody-antigen interactions. This structural prediction tool can be used to optimize antibody-antigen binding and engineer therapeutic antibodies with biophysical properties for specialized drug administration route. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Predicting metabolic modules in incomplete bacterial genomes with MetaPathPredict.
- Author
-
Geller-McGrath, David, Konwar, Kishori M., Edgcomb, Virginia P., Pachiadaki, Maria, Roddy, Jack W., Wheeler, Travis J., and McDermott, Jason E.
- Subjects
- *
DEEP learning , *MACHINE learning , *DATABASES , *COMPUTER workstation clusters , *SOFTWARE development tools , *ENVIRONMENTAL sampling , *BACTERIAL genomes - Abstract
The reconstruction of complete microbial metabolic pathways using 'omics data from environmental samples remains challenging. Computational pipelines for pathway reconstruction that utilize machine learning methods to predict the presence or absence of KEGG modules in incomplete genomes are lacking. Here, we present MetaPathPredict, a software tool that incorporates machine learning models to predict the presence of complete KEGG modules within bacterial genomic datasets. Using gene annotation data and information from the KEGG module database, MetaPathPredict employs deep learning models to predict the presence of KEGG modules in a genome. MetaPathPredict can be used as a command line tool or as a Python module, and both options are designed to be run locally or on a compute cluster. Benchmarks show that MetaPathPredict makes robust predictions of KEGG module presence within highly incomplete genomes. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Deep learning insights into the architecture of the mammalian egg-sperm fusion synapse.
- Author
-
Elofsson, Arne, Ling Han, Bianchi, Enrica, Wright, Gavin J., and Jovine, Luca
- Subjects
- *
DEEP learning , *SYNAPSES , *REPRODUCTION , *GAMETES , *SPERMATOZOA , *CYTOSKELETAL proteins , *STRUCTURAL models , *SEMEN analysis - Abstract
A crucial event in sexual reproduction is when haploid sperm and egg fuse to form a new diploid organism at fertilization. In mammals, direct interaction between egg JUNO and sperm IZUMO1 mediates gamete membrane adhesion, yet their role in fusion remains enigmatic. We used AlphaFold to predict the structure of other extracellular proteins essential for fertilization to determine if they could form a complex that may mediate fusion. We first identified TMEM81, whose gene is expressed by mouse and human spermatids, as a protein having structural homologies with both IZUMO1 and another sperm molecule essential for gamete fusion, SPACA6. Using a set of proteins known to be important for fertilization and TMEM81, we then systematically searched for predicted binary interactions using an unguided approach and identified a pentameric complex involving sperm IZUMO1, SPACA6, TMEM81 and egg JUNO, CD9. This complex is structurally consistent with both the expected topology on opposing gamete membranes and the location of predicted N-glycans not modeled by AlphaFold-Multimer, suggesting that its components could organize into a synapse-like assembly at the point of fusion. Finally, the structural modeling approach described here could be more generally useful to gain insights into transient protein complexes difficult to detect experimentally. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Haplotype function score improves biological interpretation and cross-ancestry polygenic prediction of human complex traits.
- Author
-
Weichen Song, Yongyong Shi, and Guan Ning Lin
- Subjects
- *
HAPLOTYPES , *DEEP learning , *GENOTYPES , *SINGLE nucleotide polymorphisms - Abstract
We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS-trait associations with a significance of p < 5 × 10-8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway-trait associations and 153 tissue-trait associations with strong biological interpretability, including 'circadian pathway-chronotype' and 'arachidonic acid-intelligence'. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1-39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Genome-scale annotation of protein binding sites via language model and geometric deep learning.
- Author
-
Qianmu Yuan, Chong Tian, and Yuedong Yang
- Subjects
- *
LANGUAGE models , *BINDING sites , *PROTEIN binding , *DEEP learning , *GEOMETRIC modeling , *INTERNET servers - Abstract
Revealing protein binding sites with other molecules, such as nucleic acids, peptides, or small ligands, sheds light on disease mechanism elucidation and novel drug design. With the explosive growth of proteins in sequence databases, how to accurately and efficiently identify these binding sites from sequences becomes essential. However, current methods mostly rely on expensive multiple sequence alignments or experimental protein structures, limiting their genome-scale applications. Besides, these methods haven't fully explored the geometry of the protein structures. Here, we propose GPSite, a multi-task network for simultaneously predicting binding residues of DNA, RNA, peptide, protein, ATP, HEM, and metal ions on proteins. GPSite was trained on informative sequence embeddings and predicted structures from protein language models, while comprehensively extracting residual and relational geometric contexts in an end-to-end manner. Experiments demonstrate that GPSite substantially surpasses state-of-the-art sequence-based and structure-based approaches on various benchmark datasets, even when the structures are not well-predicted. The low computational cost of GPSite enables rapid genome-scale binding residue annotations for over 568,000 sequences, providing opportunities to unveil unexplored associations of binding sites with molecular functions, biological processes, and genetic variants. The GPSite webserver and annotation database can be freely accessed at https://bio-web1.nscc-gz.cn/app/GPSite. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Transformer-based spatial–temporal detection of apoptotic cell death in livecell imaging.
- Author
-
Pulfer, Alain, Pizzagalli, Diego Ulisse, Gagliardi, Paolo Armando, Hinderling, Lucien, Lopez, Paul, Zayats, Romaniya, Carrillo-Barberà, Pau, Antonello, Paola, Palomino-Segura, Miguel, Grädel, Benjamin, Nicolai, Mariaclaudia, Giusti, Alessandro, Thelen, Marcus, Gambardella, Luca Maria, Murooka, Thomas T., Pertz, Olivier, Krause, Rolf, and Fernandez Gonzalez, Santiago
- Subjects
- *
CELL death , *DEEP learning , *TISSUE viability , *CELL survival , *TASK performance , *APOPTOSIS - Abstract
Intravital microscopy has revolutionized live-cell imaging by allowing the study of spatial–temporal cell dynamics in living animals. However, the complexity of the data generated by this technology has limited the development of effective computational tools to identify and quantify cell processes. Amongst them, apoptosis is a crucial form of regulated cell death involved in tissue homeostasis and host defense. Live-cell imaging enabled the study of apoptosis at the cellular level, enhancing our understanding of its spatial–temporal regulation. However, at present, no computational method can deliver robust detection of apoptosis in microscopy timelapses. To overcome this limitation, we developed ADeS, a deep learning-based apoptosis detection system that employs the principle of activity recognition. We trained ADeS on extensive datasets containing more than 10,000 apoptotic instances collected both in vitro and in vivo, achieving a classification accuracy above 98% and outperforming state-of-the-art solutions. ADeS is the first method capable of detecting the location and duration of multiple apoptotic events in full microscopy timelapses, surpassing human performance in the same task. We demonstrated the effectiveness and robustness of ADeS across various imaging modalities, cell types, and staining techniques. Finally, we employed ADeS to quantify cell survival in vitro and tissue damage in mice, demonstrating its potential application in toxicity assays, treatment evaluation, and inflammatory dynamics. Our findings suggest that ADeS is a valuable tool for the accurate detection and quantification of apoptosis in live-cell imaging and, in particular, intravital microscopy data, providing insights into the complex spatial–temporal regulation of this process. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. Structural assembly of the bacterial essential interactome.
- Author
-
Gómez Borrego, Jordi and Torrent Burgas, Marc
- Subjects
- *
DEEP learning , *PROTEIN folding , *BACTERIAL proteins , *DELETION mutation , *PROTEIN-protein interactions , *STRUCTURAL stability , *SYNTHETIC biology - Abstract
The study of protein interactions in living organisms is fundamental for understanding biological processes and central metabolic pathways. Yet, our knowledge of the bacterial interactome remains limited. Here, we combined gene deletion mutant analysis with deep-learning protein folding using AlphaFold2 to predict the core bacterial essential interactome. We predicted and modeled 1402 interactions between essential proteins in bacteria and generated 146 high-accuracy models. Our analysis reveals previously unknown details about the assembly mechanisms of these complexes, highlighting the importance of specific structural features in their stability and function. Our work provides a framework for predicting the essential interactomes of bacteria and highlight the potential of deep-learning algorithms in advancing our understanding of the complex biology of living organisms. Also, the results presented here offer a promising approach to identify novel antibiotic targets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Design of the HPV-automated visual evaluation (PAVE) study: Validating a novel cervical screening strategy.
- Author
-
de Sanjosé, Silvia, Perkins, Rebecca B., Campos, Nicole, Inturrisi, Federica, Egemen, Didem, Befano, Brian, Rodriguez, Ana Cecilia, Jerónimo, Jose, Cheung, Li C., Desai, Kanan, Han, Paul, Novetsky, Akiva P., Ukwuani, Abigail, Marcus, Jenna, Ahmed, Syed Rakin, Wentzensen, Nicolas, Kalpathy-Cramer, Jayashree, and Schiffman, Mark
- Subjects
- *
RESOURCE-limited settings , *DISEASE risk factors , *MEDICAL communication , *DEEP learning , *CERVICAL cancer , *HUMAN papillomavirus - Abstract
Background: The HPV-automated visual evaluation (PAVE) Study is an extensive, multinational initiative designed to advance cervical cancer prevention in resource-constrained regions. Cervical cancer disproportionally affects regions with limited access to preventive measures. PAVE aims to assess a novel screening-triage-treatment strategy integrating self-sampled HPV testing, deep-learning-based automated visual evaluation (AVE), and targeted therapies. Methods: Phase 1 efficacy involves screening up to 100,000 women aged 25-49 across nine countries, using self-collected vaginal samples for hierarchical HPV evaluation: HPV16, else HPV18/45, else HPV31/33/35/52/58, else HPV39/51/56/59/68 else negative. HPV-positive individuals undergo further evaluation, including pelvic exams, cervical imaging, and biopsies. AVE algorithms analyze images, assigning risk scores for precancer, validated against histologic high-grade precancer. Phase 1, however, does not integrate AVE results into patient management, contrasting them with local standard care. Phase 2 effectiveness focuses on deploying AVE software and HPV genotype data in real-time clinical decision-making, evaluating feasibility, acceptability, cost-effectiveness, and health communication of the PAVE strategy in practice. Results: Currently, sites have commenced fieldwork, and conclusive results are pending. Conclusions: The study aspires to validate a screen-triage-treat protocol utilizing innovative biomarkers to deliver an accurate, feasible, and cost-effective strategy for cervical cancer prevention in resource-limited areas. Should the study validate PAVE, its broader implementation could be recommended, potentially expanding cervical cancer prevention worldwide. Funding: The consortial sites are responsible for their own study costs. Research equipment and supplies, and the NCI-affiliated staff are funded by the National Cancer Institute Intramural Research Program including supplemental funding from the Cancer Cures Moonshot Initiative. No commercial support was obtained. Brian Befano was supported by NCI/NIH under Grant T32CA09168. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. Independent regulation of Z-lines and M-lines during sarcomere assembly in cardiac myocytes revealed by the automatic image analysis software sarcApp.
- Author
-
Neininger-Castro, Abigail C., Hayes, James B., Sanchez, Zachary C., Taneja, Nilay, Fenix, Aidan M., Moparthi, Satish, Vassilopoulos, Stéphane, and Burnette, Dylan Tyler
- Subjects
- *
IMAGE analysis , *DEEP learning , *SARCOMERES , *MYOFIBRILS , *CONNECTIN - Abstract
Sarcomeres are the basic contractile units within cardiac myocytes, and the collective shortening of sarcomeres aligned along myofibrils generates the force driving the heartbeat. The alignment of the individual sarcomeres is important for proper force generation, and misaligned sarcomeres are associated with diseases, including cardiomyopathies and COVID-19. The actin bundling protein, α-actinin-2, localizes to the 'Z-Bodies" of sarcomere precursors and the 'Z-Lines' of sarcomeres, and has been used previously to assess sarcomere assembly and maintenance. Previous measurements of α-actinin-2 organization have been largely accomplished manually, which is time-consuming and has hampered research progress. Here, we introduce sarcApp, an image analysis tool that quantifies several components of the cardiac sarcomere and their alignment in muscle cells and tissue. We first developed sarcApp to utilize deep learning-based segmentation and real space quantification to measure α-actinin-2 structures and determine the organization of both precursors and sarcomeres/myofibrils. We then expanded sarcApp to analyze 'M-Lines' using the localization of myomesin and a protein that connects the Z-Lines to the M-Line (titin). sarcApp produces 33 distinct measurements per cell and 24 per myofibril that allow for precise quantification of changes in sarcomeres, myofibrils, and their precursors. We validated this system with perturbations to sarcomere assembly. We found perturbations that affected Z-Lines and M-Lines differently, suggesting that they may be regulated independently during sarcomere assembly. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. Spatial transformation of multi-omics data unlocks novel insights into cancer biology.
- Author
-
Sokač, Mateo, Kjær, Asbjørn, Dyrskjøt, Lars, Haibe-Kains, Benjamin, Aerts, Hugo J. W. L., and Birkbak, Nicolai J.
- Subjects
- *
DEEP learning , *MULTIOMICS , *SPATIAL orientation , *BIOLOGY , *NUCLEOTIDE sequencing , *IMAGE analysis - Abstract
The application of next-generation sequencing (NGS) has transformed cancer research. As costs have decreased, NGS has increasingly been applied to generate multiple layers of molecular data from the same samples, covering genomics, transcriptomics, and methylomics. Integrating these types of multi-omics data in a combined analysis is now becoming a common issue with no obvious solution, often handled on an ad hoc basis, with multi-omics data arriving in a tabular format and analyzed using computationally intensive statistical methods. These methods particularly ignore the spatial orientation of the genome and often apply stringent p-value corrections that likely result in the loss of true positive associations. Here, we present GENIUS (GEnome traNsformatIon and spatial representation of mUltiomicS data), a framework for integrating multi-omics data using deep learning models developed for advanced image analysis. The GENIUS framework is able to transform multi-omics data into images with genes displayed as spatially connected pixels and successfully extract relevant information with respect to the desired output. We demonstrate the utility of GENIUS by applying the framework to multi-omics datasets from the Cancer Genome Atlas. Our results are focused on predicting the development of metastatic cancer from primary tumors, and demonstrate how through model inference, we are able to extract the genes which are driving the model prediction and are likely associated with metastatic disease progression. We anticipate our framework to be a starting point and strong proof of concept for multi-omics data transformation and analysis without the need for statistical correction. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
12. Rapid protein stability prediction using deep learning representations.
- Author
-
Blaabjerg, Lasse M., Kassem, Maher M., Good, Lydia L., Jonsson, Nicolas, Cagiada, Matteo, Johansson, Kristoffer E., Boomsma, Wouter, Stein, Amelie, and Lindorff-Larsen, Kresten
- Subjects
- *
DEEP learning , *PROTEIN stability , *PROTEIN structure , *PROTEIN engineering , *FORECASTING , *GENETIC disorders , *AMINO acids - Abstract
Predicting the thermodynamic stability of proteins is a common and widely used step in protein engineering, and when elucidating the molecular mechanisms behind evolution and disease. Here, we present RaSP, a method for making rapid and accurate predictions of changes in protein stability by leveraging deep learning representations. RaSP performs on-par with biophysics-based methods and enables saturation mutagenesis stability predictions in less than a second per residue. We use RaSP to calculate ~ 230 million stability changes for nearly all single amino acid changes in the human proteome, and examine variants observed in the human population. We find that variants that are common in the population are substantially depleted for severe destabilization, and that there are substantial differences between benign and pathogenic variants, highlighting the role of protein stability in genetic diseases. RaSP is freely available--including via a Web interface--and enables large-scale analyses of stability in experimental and predicted protein structures. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. Large-scale electrophysiology and deep learning reveal distorted neural signal dynamics after hearing loss.
- Author
-
Sabesan, Shievanie, Fragner, Andreas, Bench, Ciaran, Drakopoulos, Fotios, and Lesica, Nicholas A.
- Subjects
- *
HEARING aids , *HEARING disorders , *DEEP learning , *AUDITORY perception , *NEURAL codes , *ELECTROPHYSIOLOGY , *AUDIOMETRY , *BRAIN anatomy - Abstract
Listeners with hearing loss often struggle to understand speech in noise, even with a hearing aid. To better understand the auditory processing deficits that underlie this problem, we made large-scale brain recordings from gerbils, a common animal model for human hearing, while presenting a large database of speech and noise sounds. We first used manifold learning to identify the neural subspace in which speech is encoded and found that it is low-dimensional and that the dynamics within it are profoundly distorted by hearing loss. We then trained a deep neural network (DNN) to replicate the neural coding of speech with and without hearing loss and analyzed the underlying network dynamics. We found that hearing loss primarily impacts spectral processing, creating nonlinear distortions in cross-frequency interactions that result in a hypersensitivity to background noise that persists even after amplification with a hearing aid. Our results identify a new focus for efforts to design improved hearing aids and demonstrate the power of DNNs as a tool for the study of central brain structures. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
14. Longitudinal fundus imaging and its genome- wide association analysis provide evidence for a human retinal aging clock.
- Author
-
Ahadi, Sara, Wilson, Kenneth A., Babenko, Boris, McLean, Cory Y., Bryant, Drew, Pritchard, Orion, Carrera, Ajay KumarEnrique M., Lamy, Ricardo, Stewart, Jay M., Varadarajan, Avinash, Bernd, Marc, Kapahi, Pankaj, and Bashir, Ali
- Subjects
- *
CLOCKS & watches , *AGE , *CLOCK genes , *AGING , *DEEP learning - Abstract
Biological age, distinct from an individual's chronological age, has been studied extensively through predictive aging clocks. However, these clocks have limited accuracy in short time- scales. Here we trained deep learning models on fundus images from the EyePACS dataset to predict individuals' chronological age. Our retinal aging clocking, 'eyeAge', predicted chronological age more accurately than other aging clocks (mean absolute error of 2.86 and 3.30 years on quality- filtered data from EyePACS and UK Biobank, respectively). Additionally, eyeAge was independent of blood marker- based measures of biological age, maintaining an all- cause mortality hazard ratio of 1.026 even when adjusted for phenotypic age. The individual- specific nature of eyeAge was reinforced via multiple GWAS hits in the UK Biobank cohort. The top GWAS locus was further validated via knockdown of the fly homolog, Alk, which slowed age- related decline in vision in flies. This study demonstrates the potential utility of a retinal aging clock for studying aging and age- related diseases and quantitatively measuring aging on very short time- scales, opening avenues for quick and actionable evaluation of gero- protective therapeutics. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
15. ProteInfer, deep neural networks for protein functional inference.
- Author
-
Sanderson, Theo, Bileschi, Maxwell L., Belanger, David, and Colwell, Lucy J.
- Subjects
- *
ARTIFICIAL neural networks , *NERVE tissue proteins , *AMINO acid sequence , *CONVOLUTIONAL neural networks , *DEEP learning , *SEQUENCE alignment - Abstract
Predicting the function of a protein from its amino acid sequence is a long-standing challenge in bioinformatics. Traditional approaches use sequence alignment to compare a query sequence either to thousands of models of protein families or to large databases of individual protein sequences. Here we introduce ProteInfer, which instead employs deep convolutional neural networks to directly predict a variety of protein functions - Enzyme Commission (EC) numbers and Gene Ontology (GO) terms - directly from an unaligned amino acid sequence. This approach provides precise predictions which complement alignment-based methods, and the computational efficiency of a single neural network permits novel and lightweight software interfaces, which we demonstrate with an in-browser graphical interface for protein function prediction in which all computation is performed on the user's personal computer with no data uploaded to remote servers. Moreover, these models place full-length amino acid sequences into a generalised functional space, facilitating downstream analysis and interpretation. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
16. Gating interactions steer loop conformational changes in the active site of the L1 metallo-β-lactamase.
- Author
-
Zhuoran Zhao, Xiayu Shen, Shuang Chen, Jing Gu, Haun Wang, Mojica, Maria F., Samanta, Moumita, Bhowmik, Debsindhu, Vila, Alejandro J., Bonomo, Robert A., and Haider, Shozeb
- Subjects
- *
MOLECULAR dynamics , *BETA lactam antibiotics , *STENOTROPHOMONAS maltophilia , *ANTIBACTERIAL agents , *DEEP learning , *MARKOV processes - Abstract
β-Lactam antibiotics are the most important and widely used antibacterial agents across the world. However, the widespread dissemination of β-lactamases among pathogenic bacteria limits the efficacy of β-lactam antibiotics. This has created a major public health crisis. The use of β-lactamase inhibitors has proven useful in restoring the activity of β-lactam antibiotics, yet, effective clinically approved inhibitors against class B metallo-β-lactamases are not available. L1, a class B3 enzyme expressed by Stenotrophomonas maltophilia, is a significant contributor to the β-lactam resistance displayed by this opportunistic pathogen. Structurally, L1 is a tetramer with two elongated loops, α3-β7 and β12-α5, present around the active site of each monomer. Residues in these two loops influence substrate/inhibitor binding. To study how the conformational changes of the elongated loops affect the active site in each monomer, enhanced sampling molecular dynamics simulations were performed, Markov State Models were built, and convolutional variational autoencoder-based deep learning was applied. The key identified residues (D150a, H151, P225, Y227, and R236) were mutated and the activity of the generated L1 variants was evaluated in cellbased experiments. The results demonstrate that there are extremely significant gating interactions between α3-β7 and β12-α5 loops. Taken together, the gating interactions with the conformational changes of the key residues play an important role in the structural remodeling of the active site. These observations offer insights into the potential for novel drug development exploiting these gating interactions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
17. Transformer-based deep learning for predicting protein properties in the life sciences.
- Author
-
Chandra, Abel, Tünnermann, Laura, Löfstedt, Tommy, and Gratz, Regina
- Subjects
- *
DEEP learning , *AMINO acid sequence , *LANGUAGE models , *NATURAL language processing , *LIFE sciences , *POST-translational modification - Abstract
Recent developments in deep learning, coupled with an increasing number of sequenced proteins, have led to a breakthrough in life science applications, in particular in protein property prediction. There is hope that deep learning can close the gap between the number of sequenced proteins and proteins with known properties based on lab experiments. Language models from the field of natural language processing have gained popularity for protein property predictions and have led to a new computational revolution in biology, where old prediction results are being improved regularly. Such models can learn useful multipurpose representations of proteins from large open repositories of protein sequences and can be used, for instance, to predict protein properties. The field of natural language processing is growing quickly because of developments in a class of models based on a particular model--the Transformer model. We review recent developments and the use of large-scale Transformer models in applications for predicting protein characteristics and how such models can be used to predict, for example, post-translational modifications. We review shortcomings of other deep learning models and explain how the Transformer models have quickly proven to be a very promising way to unravel information hidden in the sequences of amino acids. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
18. Deep learning-driven insights into super protein complexes for outer membrane protein biogenesis in bacteria.
- Author
-
Mu Gao, An, Davi Nakajima, and Skolnick, Jeffrey
- Subjects
- *
MEMBRANE proteins , *BACTERIAL proteins , *PROTEINS , *PROTEIN structure , *DEEP learning , *MOLECULAR chaperones , *VIRAL envelope proteins - Abstract
To reach their final destinations, outer membrane proteins (OMPs) of gram-negative bacteria undertake an eventful journey beginning in the cytosol. Multiple molecular machines, chaperones, proteases, and other enzymes facilitate the translocation and assembly of OMPs. These helpers usually associate, often transiently, forming large protein assemblies. They are not well understood due to experimental challenges in capturing and characterizing protein-protein interactions (PPIs), especially transient ones. Using AF2Complex, we introduce a high-throughput, deep learning pipeline to identify PPIs within the Escherichia coli cell envelope and apply it to several proteins from an OMP biogenesis pathway. Among the top confident hits obtained from screening ~1500 envelope proteins, we find not only expected interactions but also unexpected ones with profound implications. Subsequently, we predict atomic structures for these protein complexes. These structures, typically of high confidence, explain experimental observations and lead to mechanistic hypotheses for how a chaperone assists a nascent, precursor OMP emerging from a translocon, how another chaperone prevents it from aggregating and docks to a β-barrel assembly port, and how a protease performs quality control. This work presents a general strategy for investigating biological pathways by using structural insights gained from deep learning-based predictions. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
19. Scratch-AID, a deep learning-based system for automatic detection of mouse scratching behavior with high accuracy.
- Author
-
Huasheng Yu, Jingwei Xiong, Yongxin Ye, Adam, Suna Li Cranfill, Cannonier, Tariq, Gautam, Mayank, Marina Zhang, Bilal, Rayan, Jong-Eun Park, Yuji Xue, Polam, Vidhur, Vujovic, Zora, Dai, Daniel, Ong, William, Ip, Jasper, Hsieh, Amanda, Mimouni, Nour, Lozada, Alejandra, Sosale, Medhini, and Ahn, Alex
- Subjects
- *
ITCHING , *DEEP learning , *RECURRENT neural networks , *CONVOLUTIONAL neural networks , *MICE , *ANIMAL models in research , *GENETIC testing - Abstract
Mice are the most commonly used model animals for itch research and for development of anti-itch drugs. Most laboratories manually quantify mouse scratching behavior to assess itch intensity. This process is labor-intensive and limits large-scale genetic or drug screenings. In this study, we developed a new system, Scratch-AID (Automatic Itch Detection), which could automatically identify and quantify mouse scratching behavior with high accuracy. Our system included a custom-designed videotaping box to ensure high-quality and replicable mouse behavior recording and a convolutional recurrent neural network trained with frame-labeled mouse scratching behavior videos, induced by nape injection of chloroquine. The best trained network achieved 97.6% recall and 96.9% precision on previously unseen test videos. Remarkably, Scratch-AID could reliably identify scratching behavior in other major mouse itch models, including the acute cheek model, the histaminergic model, and a chronic itch model. Moreover, our system detected significant differences in scratching behavior between control and mice treated with an anti-itch drug. Taken together, we have established a novel deep learning-based system that could replace manual quantification for mouse scratching behavior in different itch models and for drug screening. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
20. Mandrill mothers associate with infants who look like their own offspring using phenotype matching.
- Author
-
Charpentier, Marie J. E., Poirotte, Clémence, Roura-Torres, Berta, Amblard-Rambert, Paul, Willaume, Eric, Kappeler, Peter M., Rousset, François, and Renoult, Julien P.
- Subjects
- *
INFANTS , *SOCIAL structure , *DEEP learning , *PHENOTYPES , *MOTHERS , *SOCIAL processes - Abstract
Behavioral discrimination of kin is a key process structuring social relationships in animals. In this study, we provide evidence for discrimination towards non-kin by third-parties through a mechanism of phenotype matching. In mandrills, we recently demonstrated increased facial resemblance among paternally related juvenile and adult females indicating adaptive opportunities for paternal kin recognition. Here, we hypothesize that mandrill mothers use offspring’s facial resemblance with other infants to guide offspring’s social opportunities towards similar-looking ones. Using deep learning for face recognition in 80 wild mandrill infants, we first show that infants sired by the same father resemble each other the most, independently of their age, sex or maternal origin, extending previous results to the youngest age class. Using long-term behavioral observations on association patterns, and controlling for matrilineal origin, maternal relatedness and infant age and sex, we then show, as predicted, that mothers are spatially closer to infants that resemble their own offspring more, and that this maternal behavior leads to similar-looking infants being spatially associated. We then discuss the different scenarios explaining this result, arguing that an adaptive maternal behavior is a likely explanation. In support of this mechanism and using theoretical modeling, we finally describe a plausible evolutionary process whereby mothers gain fitness benefits by promoting nepotism among paternally related infants. This mechanism, that we call ‘second-order kin selection’, may extend beyond mother-infant interactions and has the potential to explain cooperative behaviors among non-kin in other social species, including humans. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
21. Deep mutational scanning and machine learning reveal structural and molecular rules governing allosteric hotspots in homologous proteins.
- Author
-
Leander, Megan, Zhuang Liu, Qiang Cui, and Raman, Srivatsan
- Subjects
- *
ALLOSTERIC regulation , *TRANSCRIPTION factors , *PROTEINS , *DEEP learning , *MACHINE learning - Abstract
A fundamental question in protein science is where allosteric hotspots – residues critical for allosteric signaling – are located, and what properties differentiate them. We carried out deep mutational scanning (DMS) of four homologous bacterial allosteric transcription factors (aTFs) to identify hotspots and built a machine learning model with this data to glean the structural and molecular properties of allosteric hotspots. We found hotspots to be distributed protein-wide rather than being restricted to ‘pathways’ linking allosteric and active sites as is commonly assumed. Despite structural homology, the location of hotspots was not superimposable across the aTFs. However, common signatures emerged when comparing hotspots coincident with long-range interactions, suggesting that the allosteric mechanism is conserved among the homologs despite differences in molecular details. Machine learning with our large DMS datasets revealed global structural and dynamic properties to be a strong predictor of whether a residue is a hotspot than local and physicochemical properties. Furthermore, a model trained on one protein can predict hotspots in a homolog. In summary, the overall allosteric mechanism is embedded in the structural fold of the aTF family, but the finer, molecular details are sequence-specific. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
22. Microplankton life histories revealed by holographic microscopy and deep learning.
- Author
-
Bachimanchi, Harshith, Midtvedt, Benjamin, Midtvedt, Daniel, Selander, Erik, and Volpe, Giovanni
- Subjects
- *
DEEP learning , *CARBON cycle , *FOOD chains , *CELL division , *MICROSCOPY , *BIOMASS - Abstract
The marine microbial food web plays a central role in the global carbon cycle. However, our mechanistic understanding of the ocean is biased toward its larger constituents, while rates and biomass fluxes in the microbial food web are mainly inferred from indirect measurements and ensemble averages. Yet, resolution at the level of the individual microplankton is required to advance our understanding of the microbial food web. Here, we demonstrate that, by combining holographic microscopy with deep learning, we can follow microplanktons throughout their lifespan, continuously measuring their three-dimensional position and dry mass. The deep-learning algorithms circumvent the computationally intensive processing of holographic data and allow rapid measurements over extended time periods. This permits us to reliably estimate growth rates, both in terms of dry mass increase and cell divisions, as well as to measure trophic interactions between species such as predation events. The individual resolution provides information about selectivity, individual feeding rates, and handling times for individual microplanktons. The method is particularly useful to detail the rates and routes of organic matter transfer in micro-zooplankton, the most important and least known group of primary consumers in the oceans. Studying individual interactions in idealized small systems provides insights that help us understand microbial food webs and ultimately larger-scale processes. We exemplify this by detailed descriptions of micro-zooplankton feeding events, cell divisions, and long-term monitoring of single cells from division to division. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
23. Deep learning-based feature extraction for prediction and interpretation of sharp-wave ripples in the rodent hippocampus .
- Author
-
Navas-Olive, Andrea, Amaducci, Rodrigo, Jurado-Parras, Maria-Teresa, Sebastian, Enrique R., and de la Prida, Liset M.
- Subjects
- *
FEATURE extraction , *DEEP learning , *HIPPOCAMPUS (Brain) , *AUTOMATIC identification , *RODENTS , *FORECASTING - Abstract
Local field potential (LFP) deflections and oscillations define hippocampal sharp-wave ripples (SWRs), one of the most synchronous events of the brain. SWRs reflect firing and synaptic current sequences emerging from cognitively relevant neuronal ensembles. While spectral analysis have permitted advances, the surge of ultra-dense recordings now call for new automatic detection strategies. Here, we show how one-dimensional convolutional networks operating over highdensity LFP hippocampal recordings allowed for automatic identification of SWR from the rodent hippocampus. When applied without retraining to new datasets and ultra-dense hippocampus-wide recordings, we discovered physiologically relevant processes associated to the emergence of SWR, prompting for novel classification criteria. To gain interpretability, we developed a method to interrogate the operation of the artificial network. We found it relied in feature-based specialization, which permit identification of spatially segregated oscillations and deflections, as well as synchronous population firing typical of replay. Thus, using deep learning-based approaches may change the current heuristic for a better mechanistic interpretation of these relevant neurophysiological events. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
24. DetecDiv, a generalist deep-learning platform for automated cell division tracking and survival analysis.
- Author
-
Aspert, Théo, Hentsch, Didier, and Charvin, Gilles
- Subjects
- *
MICROFLUIDICS , *CELL division , *DEEP learning , *SURVIVAL analysis (Biometry) , *IMAGE segmentation , *CELL cycle , *CELL survival - Abstract
Automating the extraction of meaningful temporal information from sequences of microscopy images represents a major challenge to characterize dynamical biological processes. So far, strong limitations in the ability to quantitatively analyze single-cell trajectories have prevented large-scale investigations to assess the dynamics of entry into replicative senescence in yeast. Here, we have developed DetecDiv, a microfluidic-based image acquisition platform combined with deep learning-based software for high-throughput single-cell division tracking. We show that DetecDiv can automatically reconstruct cellular replicative lifespans with high accuracy and performs similarly with various imaging platforms and geometries of microfluidic traps. In addition, this methodology provides comprehensive temporal cellular metrics using time-series classification and image semantic segmentation. Last, we show that this method can be further applied to automatically quantify the dynamics of cellular adaptation and real-time cell survival upon exposure to environmental stress. Hence, this methodology provides an all-in-one toolbox for high-throughput phenotyping for cell cycle, stress response, and replicative lifespan assays. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
25. Selfee, self- supervised features extraction of animal behaviors.
- Author
-
Yinjun Jia, Shuaishuai Li, Xuan Guo, Bo Lei, Junqiang Hu, Xiao-Hong Xu, and Wei Zhang
- Subjects
- *
ANIMAL behavior , *FEATURE extraction , *SUPERVISED learning , *CONVOLUTIONAL neural networks , *DEEP learning , *TIME series analysis , *BEHAVIORAL assessment - Abstract
Fast and accurately characterizing animal behaviors is crucial for neuroscience research. Deep learning models are efficiently used in laboratories for behavior analysis. However, it has not been achieved to use an end-to-end unsupervised neural network to extract comprehensive and discriminative features directly from social behavior video frames for annotation and analysis purposes. Here, we report a self-supervised feature extraction (Selfee) convolutional neural network with multiple downstream applications to process video frames of animal behavior in an end-to-end way. Visualization and classification of the extracted features (Meta-representations) validate that Selfee processes animal behaviors in a way similar to human perception. We demonstrate that Meta-representations can be efficiently used to detect anomalous behaviors that are indiscernible to human observation and hint in-depth analysis. Furthermore, time-series analyses of Meta-representations reveal the temporal dynamics of animal behaviors. In conclusion, we present a self-supervised learning approach to extract comprehensive and discriminative features directly from raw video recordings of animal behaviors and demonstrate its potential usage for various downstream applications. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
26. Waveform detection by deep learning reveals multi-area spindles that are selectively modulated by memory load.
- Author
-
Mofrad, Maryam H., Gilmore, Greydon, Koller, Dominik, Mirsattari, Seyed M., Burneo, Jorge G., Steven, David A., Khan, Ali R., Marti, Ana Suller, and Muller, Lyle
- Subjects
- *
VISUAL memory , *DEEP learning , *SLEEP spindles , *MACHINE learning , *GRAVITATIONAL waves , *MEMORY - Abstract
Sleep is generally considered to be a state of large-scale synchrony across thalamus and neocortex; however, recent work has challenged this idea by reporting isolated sleep rhythms such as slow oscillations and spindles. What is the spatial scale of sleep rhythms? To answer this question, we adapted deep learning algorithms initially developed for detecting earthquakes and gravitational waves in high-noise settings for analysis of neural recordings in sleep. We then studied sleep spindles in non-human primate electrocorticography (ECoG), human electroencephalogram (EEG), and clinical intracranial electroencephalogram (iEEG) recordings in the human. Within each recording type, we find widespread spindles occur much more frequently than previously reported. We then analyzed the spatiotemporal patterns of these large-scale, multi-area spindles and, in the EEG recordings, how spindle patterns change following a visual memory task. Our results reveal a potential role for widespread, multi-area spindles in consolidation of memories in networks widely distributed across primate cortex. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
27. Sampling alternative conformational states of transporters and receptors with AlphaFold2.
- Author
-
del Alamo, Diego, Sala, Davide, Mchaourab, Hassane S., and Meiler, Jens
- Subjects
- *
SEQUENCE alignment , *PROTEIN structure prediction , *PROTEIN structure , *MEMBRANE proteins , *MACHINE learning , *DEEP learning , *G protein coupled receptors - Abstract
Equilibrium fluctuations and triggered conformational changes often underlie the functional cycles of membrane proteins. For example, transporters mediate the passage of molecules across cell membranes by alternating between inward- and outward-facing states, while receptors undergo intracellular structural rearrangements that initiate signaling cascades. Although the conformational plasticity of these proteins has historically posed a challenge for traditional de novo protein structure prediction pipelines, the recent success of AlphaFold2 (AF2) in CASP14 culminated in the modeling of a transporter in multiple conformations to high accuracy. Given that AF2 was designed to predict static structures of proteins, it remains unclear if this result represents an underexplored capability to accurately predict multiple conformations and/or structural heterogeneity. Here, we present an approach to drive AF2 to sample alternative conformations of topologically diverse transporters and G-protein-coupled receptors that are absent from the AF2 training set. Whereas models of most proteins generated using the default AF2 pipeline are conformationally homogeneous and nearly identical to one another, reducing the depth of the input multiple sequence alignments by stochastic subsampling led to the generation of accurate models in multiple conformations. In our benchmark, these conformations spanned the range between two experimental structures of interest, with models at the extremes of these conformational distributions observed to be among the most accurate (average template modeling score of 0.94). These results suggest a straightforward approach to identifying native-like alternative states, while also highlighting the need for the next generation of deep learning algorithms to be designed to predict ensembles of biophysically relevant states. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
28. Virtual mouse brain histology from multi-contrast MRI via deep learning.
- Author
-
Zifei Liang, Lee, Choong H., Arefin, Tanzil M., Zijun Dong, Walczak, Piotr, Song-Hai Shi, Knoll, Florian, Yulin Ge, Ying, Leslie, and Jiangyang Zhang
- Subjects
- *
DEEP learning , *MAGNETIC resonance imaging , *CONVOLUTIONAL neural networks , *HISTOLOGY , *STAINS & staining (Microscopy) , *BRAIN anatomy - Abstract
1H MRI maps brain structure and function non-invasively through versatile contrasts that exploit inhomogeneity in tissue micro-environments. Inferring histopathological information from magnetic resonance imaging (MRI) findings, however, remains challenging due to absence of direct links between MRI signals and cellular structures. Here, we show that deep convolutional neural networks, developed using co-registered multi-contrast MRI and histological data of the mouse brain, can estimate histological staining intensity directly from MRI signals at each voxel. The results provide three-dimensional maps of axons and myelin with tissue contrasts that closely mimic target histology and enhanced sensitivity and specificity compared to conventional MRI markers. Furthermore, the relative contribution of each MRI contrast within the networks can be used to optimize multi-contrast MRI acquisition. We anticipate our method to be a starting point for translation of MRI results into easy-to-understand virtual histology for neurobiologists and provide resources for validating novel MRI techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
29. Virtual mouse brain histology from multi-contrast MRI via deep learning.
- Author
-
Zifei Liang, Lee, Choong H., Arefin, Tanzil M., Zijun Dong, Walczak, Piotr, Song-Hai Shi, Knoll, Florian, Yulin Ge, Leslie Ying, and Jiangyang Zhang
- Subjects
- *
DEEP learning , *MAGNETIC resonance imaging , *CONVOLUTIONAL neural networks , *HISTOLOGY , *STAINS & staining (Microscopy) , *BRAIN anatomy - Abstract
¹H MRI maps brain structure and function non-invasively through versatile contrasts that exploit inhomogeneity in tissue micro-environments. Inferring histopathological information from magnetic resonance imaging (MRI) findings, however, remains challenging due to absence of direct links between MRI signals and cellular structures. Here, we show that deep convolutional neural networks, developed using co-registered multi-contrast MRI and histological data of the mouse brain, can estimate histological staining intensity directly from MRI signals at each voxel. The results provide three-dimensional maps of axons and myelin with tissue contrasts that closely mimic target histology and enhanced sensitivity and specificity compared to conventional MRI markers. Furthermore, the relative contribution of each MRI contrast within the networks can be used to optimize multi-contrast MRI acquisition. We anticipate our method to be a starting point for translation of MRI results into easy-to-understand virtual histology for neurobiologists and provide resources for validating novel MRI techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
30. A faster way to model neuronal circuitry.
- Author
-
Davison, Andrew P.
- Subjects
- *
ARTIFICIAL neural networks , *DEEP learning , *SUPERCOMPUTERS , *NEURAL circuitry , *RECURRENT neural networks , *LONG-term memory , *WEATHER & climate change , *CONVOLUTIONAL neural networks - Published
- 2022
- Full Text
- View/download PDF
31. Analysis of long and short enhancers in melanoma cell states.
- Author
-
Mauduit, David, Taskiran, Ibrahim Ihsan, Minnoye, Liesbeth, de Waegeneer, Maxime, Christiaens, Valerie, Hulselmans, Gert, Demeulemeester, Jonas, Wouters, Jasper, and Aerts, Stein
- Subjects
- *
MELANOMA , *DEEP learning , *SOX transcription factors - Abstract
Understanding how enhancers drive cell- type specificity and efficiently identifying them is essential for the development of innovative therapeutic strategies. In melanoma, the melanocytic (MEL) and the mesenchymal- like (MES) states present themselves with different responses to therapy, making the identification of specific enhancers highly relevant. Using massively parallel reporter assays (MPRAs) in a panel of patient- derived melanoma lines (MM lines), we set to identify and decipher melanoma enhancers by first focusing on regions with state- specific H3K27 acetylation close to differentially expressed genes. An in-depth evaluation of those regions was then pursued by investigating the activity of overlapping ATAC-seq peaks along with a full tiling of the acetylated regions with 190 bp sequences. Activity was observed in more than 60% of the selected regions, and we were able to precisely locate the active enhancers within ATAC-seq peaks. Comparison of sequence content with activity, using the deep learning model DeepMEL2, revealed that AP-1 alone is responsible for the MES enhancer activity. In contrast, SOX10 and MITF both influence MEL enhancer function with SOX10 being required to achieve high levels of activity. Overall, our MPRAs shed light on the relationship between long and short sequences in terms of their sequence content, enhancer activity, and specificity across melanoma cell states. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
32. PARROT is a flexible recurrent neural network framework for analysis of large protein datasets.
- Author
-
Griffith, Daniel and Holehouse, Alex S.
- Subjects
- *
RECURRENT neural networks , *DEEP learning , *PROTEIN analysis , *PARROTS , *MACHINE learning , *AMINO acid sequence - Abstract
The rise of high- throughput experiments has transformed how scientists approach biological questions. The ubiquity of large- scale assays that can test thousands of samples in a day has necessitated the development of new computational approaches to interpret this data. Among these tools, machine learning approaches are increasingly being utilized due to their ability to infer complex nonlinear patterns from high- dimensional data. Despite their effectiveness, machine learning (and in particular deep learning) approaches are not always accessible or easy to implement for those with limited computational expertise. Here we present PARROT, a general framework for training and applying deep learning- based predictors on large protein datasets. Using an internal recurrent neural network architecture, PARROT is capable of tackling both classification and regression tasks while only requiring raw protein sequences as input. We showcase the potential uses of PARROT on three diverse machine learning tasks: predicting phosphorylation sites, predicting transcriptional activation function of peptides generated by high- throughput reporter assays, and predicting the fibrillization propensity of amyloid beta with data generated by deep mutational scanning. Through these examples, we demonstrate that PARROT is easy to use, performs comparably to state- of- the- art computational tools, and is applicable for a wide array of biological problems. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.