Back to Search Start Over

Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC.

Authors :
Javed, Faisal
Hayat, Maqsood
Source :
Genomics. Dec2019, Vol. 111 Issue 6, p1325-1332. 8p.
Publication Year :
2019

Abstract

The emergence of numerous genome projects has made the experimental classification of the protein localization almost impossible due to the exponential increase in the number of protein samples. However, most of the applications are merely developed for single-plex and completely ignored the presence of one protein at two or more locations in a cell. In this regard, few attempts were carried out to target Multi-label protein localizations; consequently, undesirable accuracies are achieved. This paper presents a novel approach, in which a discrete feature extraction method is fused with physicochemical properties of amino acids by using Chou's general form of Pseudo Amino Acid Composition. The technique is tested on two benchmark datasets namely: Gpos-mploc and Virus-mPLoc. The empirical results demonstrated that the proposed method yields better results via two examined classifiers i.e. ML-KNN and Rank-SVM. It is established that the proposed model has improved values in all performance measures considered for the comparison. • A computational model is developed for Multi label prokaryotic subcellular localization. • Physiochemical and discrete methods are utilized for features • SMOTE is applied as oversampling technique • Multi-label KNN and Rank SVM are used as classifiers • Obtained quite promising results than existing methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
08887543
Volume :
111
Issue :
6
Database :
Academic Search Index
Journal :
Genomics
Publication Type :
Academic Journal
Accession number :
139631107
Full Text :
https://doi.org/10.1016/j.ygeno.2018.09.004