Back to Search
Start Over
Predicting the stability of human lysozyme mutants using the tree-based classifier TTOSOM
- Source :
- Chemometrics and Intelligent Laboratory Systems. 162:65-72
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- One of the primary goals of applied proteomics is the development of new computational methods for modeling the properties of the proteins from the primary structure. In this work, we used the concept of semi-supervised learning, which is relatively new machine learning philosophy that combines labeled and unlabeled instances simultaneously, to perform classification of protein mutants according to their physical properties. Unlike more traditional methods, it does not demand the specification of the class labels of every sample. This is particularly useful when many exemplars are available but the actual class membership is only available for only a marginal subset. In spite of its desirable properties, semi-supervised learning has been seldom applied in molecular biology. In the recent years, a novel algorithm capable of performing semi-supervised learning has been proposed. This algorithm, namely the TTOSOM, is a tree-based neural network inspired in the well-known Self-Organizing Maps. In this paper, we use the TTOSOM to predict the stability of human lysozyme mutants. Since it plays a central role in the immunologic system, prediction of its structural stability is of primary importance for molecular biology. Our experimental results show that it is possible to predict the stability with accuracy above 64%, outperforming two well-known classifiers. This prediction is only based on historical data, i.e., without the necessity of expensive chemical substances and human resources.
- Subjects :
- 0301 basic medicine
Self-organizing map
Artificial neural network
business.industry
Process Chemistry and Technology
Machine learning
computer.software_genre
01 natural sciences
0104 chemical sciences
Computer Science Applications
Analytical Chemistry
010404 medicinal & biomolecular chemistry
03 medical and health sciences
030104 developmental biology
Protein stability
Tree based
Artificial intelligence
Class membership
business
Classifier (UML)
computer
Spectroscopy
Software
Mathematics
Subjects
Details
- ISSN :
- 01697439
- Volume :
- 162
- Database :
- OpenAIRE
- Journal :
- Chemometrics and Intelligent Laboratory Systems
- Accession number :
- edsair.doi...........dd91e016edb9ba23035254d6d63d917c