Back to Search Start Over

Unified mRNA Subcellular Localization Predictor based on machine learning techniques

Authors :
Saleh Musleh
Muhammad Arif
Nehad M. Alajez
Tanvir Alam
Source :
BMC Genomics, Vol 25, Iss 1, Pp 1-18 (2024)
Publication Year :
2024
Publisher :
BMC, 2024.

Abstract

Abstract Background The mRNA subcellular localization bears substantial impact in the regulation of gene expression, cellular migration, and adaptation. However, the methods employed for experimental determination of this localization are arduous, time-intensive, and come with a high cost. Methods In this research article, we tackle the essential challenge of predicting the subcellular location of messenger RNAs (mRNAs) through Unified mRNA Subcellular Localization Predictor (UMSLP), a machine learning (ML) based approach. We embrace an in silico strategy that incorporate four distinct feature sets: kmer, pseudo k-tuple nucleotide composition, nucleotide physicochemical attributes, and the 3D sequence depiction achieved via Z-curve transformation for predicting subcellular localization in benchmark dataset across five distinct subcellular locales, encompassing nucleus, cytoplasm, extracellular region (ExR), mitochondria, and endoplasmic reticulum (ER). Results The proposed ML model UMSLP attains cutting-edge outcomes in predicting mRNA subcellular localization. On independent testing dataset, UMSLP ahcieved over 87% precision, 94% specificity, and 94% accuracy. Compared to other existing tools, UMSLP outperformed mRNALocator, mRNALoc, and SubLocEP by 11%, 21%, and 32%, respectively on average prediction accuracy for all five locales. SHapley Additive exPlanations analysis highlights the dominance of k-mer features in predicting cytoplasm, nucleus, ER, and ExR localizations, while Z-curve based features play pivotal roles in mitochondria subcellular localization detection. Availability We have shared datasets, code, Docker API for users in GitHub at: https://github.com/smusleh/UMSLP .

Details

Language :
English
ISSN :
14712164
Volume :
25
Issue :
1
Database :
Directory of Open Access Journals
Journal :
BMC Genomics
Publication Type :
Academic Journal
Accession number :
edsdoj.b2c47b1d954f1887b8bc2c49058bcf
Document Type :
article
Full Text :
https://doi.org/10.1186/s12864-024-10077-9