101. A Robust Method for Finding the Automated Best Matched Genes Based on Grouping Similar Fragments of Large-Scale References for Genome Assembly
- Author
-
Young-Sik Jeong, Gangman Yi, Jaehee Jung, and Jong Im Kim
- Subjects
0301 basic medicine ,Physics and Astronomy (miscellaneous) ,Computer science ,General Mathematics ,Gene prediction ,Sequence assembly ,Hybrid genome assembly ,02 engineering and technology ,Computational biology ,Bioinformatics ,Genome ,contig ,gene ,gene annotation ,NGS ,organelle genome ,gene search ,03 medical and health sciences ,0202 electrical engineering, electronic engineering, information engineering ,Computer Science (miscellaneous) ,Contig ,lcsh:Mathematics ,Gene Annotation ,Genome project ,lcsh:QA1-939 ,030104 developmental biology ,Chemistry (miscellaneous) ,020201 artificial intelligence & image processing ,Reference genome - Abstract
Big data research on genomic sequence analysis has accelerated considerably with the development of next-generation sequencing. Currently, research on genomic sequencing has been conducted using various methods, ranging from the assembly of reads consisting of fragments to the annotation of genetic information using a database that contains known genome information. According to the development, most tools to analyze the new organelles’ genetic information requires different input formats such as FASTA, GeneBank (GB) and tab separated files. The various data formats should be modified to satisfy the requirements of the gene annotation system after genome assembly. In addition, the currently available tools for the analysis of organelles are usually developed only for specific organisms, thus the need for gene prediction tools, which are useful for any organism, has been increased. The proposed method—termed the genome_search_plotter—is designed for the easy analysis of genome information from the related references without any file format modification. Anyone who is interested in intracellular organelles such as the nucleus, chloroplast, and mitochondria can analyze the genetic information using the assembled contig of an unknown genome and a reference model without any modification of the data from the assembled contig.
- Published
- 2017