Back to Search
Start Over
Efficient privacy-preserving variable-length substring match for genome sequence.
- Source :
-
Algorithms for molecular biology : AMB [Algorithms Mol Biol] 2022 Apr 26; Vol. 17 (1), pp. 9. Date of Electronic Publication: 2022 Apr 26. - Publication Year :
- 2022
-
Abstract
- The development of a privacy-preserving technology is important for accelerating genome data sharing. This study proposes an algorithm that securely searches a variable-length substring match between a query and a database sequence. Our concept hinges on a technique that efficiently applies FM-index for a secret-sharing scheme. More precisely, we developed an algorithm that can achieve a secure table lookup in such a way that [Formula: see text] is computed for a given depth of recursion where [Formula: see text] is an initial position, and V is a vector. We used the secure table lookup for vectors created based on FM-index. The notable feature of the secure table lookup is that time, communication, and round complexities are not dependent on the table length N, after the query input. Therefore, a substring match by reference to the FM-index-based table can also be conducted independently against the database length, and the entire search time is dramatically improved compared to previous approaches. We conducted an experiment using a human genome sequence with the length of 10 million as the database and a query with the length of 100 and found that the query response time of our protocol was at least three orders of magnitude faster than a non-indexed database search protocol under the realistic computation/network environment.<br /> (© 2022. The Author(s).)
Details
- Language :
- English
- ISSN :
- 1748-7188
- Volume :
- 17
- Issue :
- 1
- Database :
- MEDLINE
- Journal :
- Algorithms for molecular biology : AMB
- Publication Type :
- Academic Journal
- Accession number :
- 35473587
- Full Text :
- https://doi.org/10.1186/s13015-022-00211-1