1. Predicting regions prone to protein aggregation based on SVM algorithm
- Author
-
Angélica Nakagawa Lima, Eric Allison Philot, Ana Ligia Scott, and Carlos Alves Moreira
- Subjects
0209 industrial biotechnology ,Computer science ,business.industry ,Applied Mathematics ,Protein Data Bank (RCSB PDB) ,020206 networking & telecommunications ,Pattern recognition ,02 engineering and technology ,Folding (DSP implementation) ,Protein aggregation ,Class (biology) ,Support vector machine ,Computational Mathematics ,020901 industrial engineering & automation ,Protein sequencing ,Sliding window protocol ,0202 electrical engineering, electronic engineering, information engineering ,Data bank ,Artificial intelligence ,business - Abstract
The phenomenon of protein aggregation has been associated with several neurodegenerative diseases, such as Parkinson's and Alzheimer's. Computational tools have been used to predict regions prone to aggregate in proteins with relative success. We have developed a tool called MAGRE for such predictions, based on the machine learning and sliding window techniques. We have applied the Support Vector Machine algorithm to generate classification models. In order to accomplish classification training, we adopted information of primary structure - protein sequence - from the Amyloid Data Bank. We have implemented two predictor categories according to protein structural information: General and Folding Class. We have selected the best performances of the sliding windows method and considered the folding class in order to develop the predictor. We conducted testing with randomly selected protein sequences from the PDB data bank - MAGRE's performance was compared with two predictors from literature: Aggrescan and Zyggregator, being considered satisfactory.
- Published
- 2019