Back to Search Start Over

Genie: the first open-source ISO/IEC encoder for genomic data

Authors :
Fabian Müntefering
Yeremia Gunawan Adhisantoso
Shubham Chandak
Jörn Ostermann
Mikel Hernaez
Jan Voges
Source :
Communications Biology, Vol 7, Iss 1, Pp 1-10 (2024)
Publication Year :
2024
Publisher :
Nature Portfolio, 2024.

Abstract

Abstract For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.

Subjects

Subjects :
Biology (General)
QH301-705.5

Details

Language :
English
ISSN :
23993642
Volume :
7
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Communications Biology
Publication Type :
Academic Journal
Accession number :
edsdoj.0a852e8ff85b4568b37ab9e6cbb17094
Document Type :
article
Full Text :
https://doi.org/10.1038/s42003-024-06249-8