Back to Search
Start Over
Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC.
- Source :
-
Genomics . Dec2019, Vol. 111 Issue 6, p1325-1332. 8p. - Publication Year :
- 2019
-
Abstract
- The emergence of numerous genome projects has made the experimental classification of the protein localization almost impossible due to the exponential increase in the number of protein samples. However, most of the applications are merely developed for single-plex and completely ignored the presence of one protein at two or more locations in a cell. In this regard, few attempts were carried out to target Multi-label protein localizations; consequently, undesirable accuracies are achieved. This paper presents a novel approach, in which a discrete feature extraction method is fused with physicochemical properties of amino acids by using Chou's general form of Pseudo Amino Acid Composition. The technique is tested on two benchmark datasets namely: Gpos-mploc and Virus-mPLoc. The empirical results demonstrated that the proposed method yields better results via two examined classifiers i.e. ML-KNN and Rank-SVM. It is established that the proposed model has improved values in all performance measures considered for the comparison. • A computational model is developed for Multi label prokaryotic subcellular localization. • Physiochemical and discrete methods are utilized for features • SMOTE is applied as oversampling technique • Multi-label KNN and Rank SVM are used as classifiers • Obtained quite promising results than existing methods. [ABSTRACT FROM AUTHOR]
- Subjects :
- *AMINO acid sequence
*PALMITOYLATION
*AMINO acids
*FEATURE extraction
Subjects
Details
- Language :
- English
- ISSN :
- 08887543
- Volume :
- 111
- Issue :
- 6
- Database :
- Academic Search Index
- Journal :
- Genomics
- Publication Type :
- Academic Journal
- Accession number :
- 139631107
- Full Text :
- https://doi.org/10.1016/j.ygeno.2018.09.004