Back to Search
Start Over
Efficient String Matching Algorithm for Searching Large DNA and Binary Texts
- Source :
- International Journal on Semantic Web and Information Systems. 13:198-220
- Publication Year :
- 2017
- Publisher :
- IGI Global, 2017.
-
Abstract
- The exact string matching is essential in application areas such as Bioinformatics and Intrusion Detection Systems. Speeding-up the string matching algorithm will therefore result in accelerating the searching process in DNA and binary data. Previously, there are two types of fast algorithms exist, bit-parallel based algorithms and hashing algorithms. The bit-parallel based are efficient when dealing with patterns of short lengths, less than 64, but slow on long patterns. On the other hand, hashing algorithms have optimal sublinear average case on large alphabets and long patterns, but the efficiency not so good on small alphabet such as DNA and binary texts. In this paper, the authors present hybrid algorithm to overcome the shortcomings of those previous algorithms. The proposed algorithm is based on q-gram hashing with guaranteeing the maximal shift in advance. Experimental results on random and complete human genome confirm that the proposed algorithm is efficient on various pattern lengths and small alphabet.
- Subjects :
- Theoretical computer science
Computer Networks and Communications
Computer science
Hash function
Commentz-Walter algorithm
0102 computer and information sciences
02 engineering and technology
String searching algorithm
01 natural sciences
Rabin–Karp algorithm
Hybrid algorithm
law.invention
010201 computation theory & mathematics
law
Binary data
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
String metric
Boyer–Moore string search algorithm
Information Systems
Subjects
Details
- ISSN :
- 15526291 and 15526283
- Volume :
- 13
- Database :
- OpenAIRE
- Journal :
- International Journal on Semantic Web and Information Systems
- Accession number :
- edsair.doi...........023a12e5a3a127297a4c3281f8dd374c
- Full Text :
- https://doi.org/10.4018/ijswis.2017100110