Back to Search
Start Over
Conditional generative modeling for de novo protein design with hierarchical functions
- Source :
- Bioinformatics, 38 (13)
- Publication Year :
- 2022
- Publisher :
- ETH Zurich, 2022.
-
Abstract
- Motivation Protein design has become increasingly important for medical and biotechnological applications. Because of the complex mechanisms underlying protein formation, the creation of a novel protein requires tedious and time-consuming computational or experimental protocols. At the same time, machine learning has enabled the solving of complex problems by leveraging large amounts of available data, more recently with great improvements on the domain of generative modeling. Yet, generative models have mainly been applied to specific sub-problems of protein design. Results Here, we approach the problem of general-purpose protein design conditioned on functional labels of the hierarchical Gene Ontology. Since a canonical way to evaluate generative models in this domain is missing, we devise an evaluation scheme of several biologically and statistically inspired metrics. We then develop the conditional generative adversarial network ProteoGAN and show that it outperforms several classic and more recent deep-learning baselines for protein sequence generation. We further give insights into the model by analyzing hyperparameters and ablation baselines. Lastly, we hypothesize that a functionally conditional model could generate proteins with novel functions by combining labels and provide first steps into this direction of research.<br />Bioinformatics, 38 (13)<br />ISSN:1367-4803<br />ISSN:1460-2059
- Subjects :
- Scheme (programming language)
Hyperparameter
Statistics and Probability
business.industry
Computer science
Deep learning
Protein design
Proteins
Machine learning
computer.software_genre
Biochemistry
Domain (software engineering)
Generative modeling
Computer Science Applications
Machine Learning
Computational Mathematics
Protein sequencing
Gene Ontology
Computational Theory and Mathematics
Artificial intelligence
business
computer
Molecular Biology
Generative grammar
computer.programming_language
Subjects
Details
- Language :
- English
- ISSN :
- 13674803 and 14602059
- Database :
- OpenAIRE
- Journal :
- Bioinformatics, 38 (13)
- Accession number :
- edsair.doi.dedup.....4d3f478f8f0e392e9c20c62cd3445717
- Full Text :
- https://doi.org/10.3929/ethz-b-000552644