Back to Search
Start Over
smORFunction: A Tool for Predicting Functions of Small Open Reading Frames and Microproteins
- Source :
- BMC Bioinformatics, BMC Bioinformatics, Vol 21, Iss 1, Pp 1-13 (2020)
- Publication Year :
- 2020
- Publisher :
- Research Square Platform LLC, 2020.
-
Abstract
- Background Small open reading frame (smORF) is open reading frame with a length of less than 100 codons. Microproteins, translated from smORFs, have been found to participate in a variety of biological processes such as muscle formation and contraction, cell proliferation, and immune activation. Although previous studies have collected and annotated a large abundance of smORFs, functions of the vast majority of smORFs are still unknown. It is thus increasingly important to develop computational methods to annotate the functions of these smORFs. Results In this study, we collected 617,462 unique smORFs from three studies. The expression of smORF RNAs was estimated by reannotated microarray probes. Using a speed-optimized correlation algorism, the functions of smORFs were predicted by their correlated genes with known functional annotations. After applying our method to 5 known microproteins from literatures, our method successfully predicted their functions. Further validation from the UniProt database showed that at least one function of 202 out of 270 microproteins was predicted. Conclusions We developed a method, smORFunction, to provide function predictions of smORFs/microproteins in at most 265 models generated from 173 datasets, including 48 tissues/cells, 82 diseases (and normal). The tool can be available at https://www.cuilab.cn/smorfunction.
- Subjects :
- Computational biology
Biology
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
Microprotein
03 medical and health sciences
Open Reading Frames
Function prediction
Structural Biology
Small open reading frame
Humans
Molecular Biology
lcsh:QH301-705.5
030304 developmental biology
0303 health sciences
Internet
Applied Mathematics
Methodology Article
030302 biochemistry & molecular biology
Proteins
Reproducibility of Results
Molecular Sequence Annotation
Microarray Analysis
Computer Science Applications
Open reading frame
lcsh:Biology (General)
Gene Expression Regulation
lcsh:R858-859.7
RNA
Gene expression
UniProt
DNA microarray
Software
Immune activation
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics, BMC Bioinformatics, Vol 21, Iss 1, Pp 1-13 (2020)
- Accession number :
- edsair.doi.dedup.....017a3d83168f4e346ff6dcc3aad0cf3e
- Full Text :
- https://doi.org/10.21203/rs.3.rs-47996/v1