1. Accelerating de novo SINE annotation in plant and animal genomes.
- Author
-
Liao, Herui, Sun, Yanni, and Ou, Shujun
- Subjects
- *
PLANT genomes , *CONSERVED sequences (Genetics) , *GENOMES , *BRACHYDANIO , *ANNOTATIONS - Abstract
Genome annotation is an important but challenging task. Accurate identification of short interspersed nuclear elements (SINEs) is particularly difficult due to their lack of highly conserved sequences. AnnoSINE is state-of-the-art software for annotating SINEs in plant genomes, but it is computationally inefficient for large genomes. Moreover, its applicability to animals is limited due to the absence of animal pHMMs in its HMM library. Therefore, we propose AnnoSINE_v2, which extends accurate SINE annotation for animal genomes with greatly optimized computational efficiency. Our results show that AnnoSINE_v2's annotation of SINEs has over 20% higher F1-score compared to the existing tools on animal genomes and enables the processing of complicated genomes, like human and zebrafish, which were beyond the capabilities of AnnoSINE_v1. AnnoSINE_v2 is freely available on Conda and GitHub: https://github.com/liaoherui/AnnoSINE_v2. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF