Disregarding multimappers leads to biases in the functional assessment of NGS data

Authors :
Michelle Almeida da Paz
Sarah Warger
Leila Taher
Source :
BMC Genomics, Vol 25, Iss 1, Pp 1-9 (2024)
Publication Year :
2024
Publisher :
BMC, 2024.

Abstract

Background: Standard ChIP-seq and RNA-seq processing pipelines typically disregard sequencing reads whose origin is ambiguous (“multimappers”). This common practice has potentially important consequences for the functional interpretation of the data: genomic elements belonging to clusters composed of highly similar members are left unexplored. Results: In particular, disregarding multimappers leads to the underrepresentation in epigenetic studies of recently active transposable elements, such as AluYa5, L1HS and SVAs. Furthermore, this common strategy also has implications for transcriptomic analysis: members of repetitive gene families, such as those including major histocompatibility complex (MHC) class I and II genes, are under-quantified. Conclusion: Revealing inherent biases that permeate routine tasks such as functional enrichment analysis, our results underscore the urgency of broadly adopting multimapper-aware bioinformatic pipelines (currently restricted to specific contexts or communities) to ensure the reliability of genomic and transcriptomic studies.
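
The effect described in the abstract can be illustrated with a toy sketch. It is not the authors' pipeline: the read names, locus names, and both helper functions are hypothetical, and equal fractional weighting is only one simple way of keeping multimappers, chosen here for brevity. The sketch compares counting only uniquely mapped reads against splitting each multimapper evenly across its candidate loci.

```python
from collections import defaultdict

def count_unique_only(alignments):
    """Count reads per locus, discarding any read with more than one candidate locus."""
    counts = defaultdict(float)
    for loci in alignments.values():
        if len(loci) == 1:
            counts[loci[0]] += 1.0
    return dict(counts)

def count_fractional(alignments):
    """Count reads per locus, giving each of a read's n candidate loci a weight of 1/n."""
    counts = defaultdict(float)
    for loci in alignments.values():
        if not loci:
            continue  # unmapped read: nothing to assign
        weight = 1.0 / len(loci)
        for locus in loci:
            counts[locus] += weight
    return dict(counts)

# Hypothetical toy data: read name -> candidate loci reported by an aligner.
toy_alignments = {
    "read1": ["GENE_A"],
    "read2": ["GENE_A"],
    "read3": ["REPEAT_1", "REPEAT_2"],  # multimapper
    "read4": ["REPEAT_1", "REPEAT_2"],  # multimapper
    "read5": ["REPEAT_1"],
}

unique_counts = count_unique_only(toy_alignments)
fractional_counts = count_fractional(toy_alignments)

print("unique only:", unique_counts)
print("fractional: ", fractional_counts)
```

In this toy input, the unique-only strategy reports no signal at all for `REPEAT_2` and halves the count for `REPEAT_1`, while the single-copy `GENE_A` is unaffected. This mirrors, in miniature, the under-quantification of highly similar elements that the abstract describes.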

Details

Language :
English
ISSN :
14712164
Volume :
25
Issue :
1
Database :
Directory of Open Access Journals
Journal :
BMC Genomics
Publication Type :
Academic Journal
Accession number :
edsdoj.23a305f5fc44abda5e7bcade8ea4200
Document Type :
article
Full Text :
https://doi.org/10.1186/s12864-024-10344-9