Back to Search Start Over

Gene Sequences Parallel Alignment Model Based on Multiple Inputs and Outputs.

Authors :
Feng, X. L.
Gao, J.
Source :
International Journal of Computers, Communications & Control; 2019, Vol. 14 Issue 2, p141-153, 13p, 2 Diagrams, 3 Charts, 3 Graphs
Publication Year :
2019

Abstract

Bioinformatics computing is a kind of big data processing problem, which usually has the characteristics of large data scale, large computational load and long computational time. Therefore, the use of big data technology in bioinformatics computing has gradually become a research hotspot, and using Hadoop for gene sequence alignment is one of it. It is a common way to use various tools to complete a job in the field of Biocomputing. In most studies of parallel alignment of gene sequences using Hadoop, third-party tools are also needed. However, there are few methods using Hadoop independently to complete gene sequences alignment. Adding data processing with other tools to Hadoop workflow not only affects the improvement of computing performance, but also complicates the application. In this paper, a parallel alignment model of gene sequences based on multiple inputs and outputs is proposed, which can independently complete parallel alignment of gene sequences in Hadoop platform without using other tools. This model not only simplifies the process flow of gene sequence alignment, but also improves the performance compared with other methods. This paper describes in detail the method of manipulating gene sequences with multiple inputs and outputs modes on Hadoop platform and the design of a computing model based on this method, and proves the superiority of this model through experiments. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
18419836
Volume :
14
Issue :
2
Database :
Supplemental Index
Journal :
International Journal of Computers, Communications & Control
Publication Type :
Academic Journal
Accession number :
135901860
Full Text :
https://doi.org/10.15837/ijccc.2019.2.3539