OCRDetector: Accurately Detecting Open Chromatin Regions via Plasma Cell-Free DNA Sequencing Data
- Source :
- International Journal of Molecular Sciences, Vol 22, Iss 11, p 5802 (2021)
- Publication Year :
- 2021
- Publisher :
- MDPI AG, 2021.
Abstract
- Open chromatin regions (OCRs) are special regions of the human genome that can be accessed by DNA regulatory elements. Several studies have reported that a number of OCRs are associated with mechanisms involved in human diseases, such as cancers. Identifying OCRs using ATAC-seq or DNase-seq is often expensive. Detecting OCRs from plasma cell-free DNA (cfDNA) sequencing data has therefore become popular, because both the fragmentation modes of cfDNA and the sequencing coverage in OCRs differ significantly from those in other regions. However, accurately detecting OCRs from plasma cfDNA-seq data is a challenging computational problem, as multiple factors (e.g., sequencing and mapping bias, insufficient read depth) often mislead the computational model. In this paper, we propose a novel bioinformatics pipeline, OCRDetector, for detecting OCRs from whole-genome cfDNA sequencing data. The pipeline calculates the window protection score (WPS) waveform and the cfDNA sequencing coverage. To validate the proposed pipeline, we compared the percentage overlap of our OCRs with those obtained by other methods. The experimental results show that 81% of the TSS regions of housekeeping genes are detected, and that the detected OCRs show clear tissue specificity. In addition, the overlap percentage between our OCRs and the high-confidence OCRs obtained by ATAC-seq or DNase-seq is greater than 70%.
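The abstract names the window protection score (WPS) but does not define it. As a rough illustration, the sketch below uses the commonly cited definition: for a window centered on a position, count the fragments that span the whole window and subtract the fragments that have an endpoint inside it. This is not OCRDetector's code. The function names, the 120 bp default window, the inclusive `(start, end)` fragment coordinates, and the boundary handling are all assumptions made for this example.

```python
from typing import List, Tuple

Fragment = Tuple[int, int]  # (start, end), inclusive coordinates (assumed convention)


def window_protection_score(fragments: List[Fragment], position: int,
                            window: int = 120) -> int:
    """WPS at one position: spanning fragments minus fragments ending in the window.

    The 120 bp default is an assumption; the abstract does not state a window size.
    """
    half = window // 2
    win_start, win_end = position - half, position + half
    spanning = 0
    ending = 0
    for start, end in fragments:
        if start <= win_start and end >= win_end:
            # Fragment covers the whole window: evidence of protection.
            spanning += 1
        elif win_start <= start <= win_end or win_start <= end <= win_end:
            # At least one fragment endpoint falls inside the window.
            ending += 1
    return spanning - ending


def wps_waveform(fragments: List[Fragment], region_start: int, region_end: int,
                 window: int = 120) -> List[int]:
    """WPS for every position in [region_start, region_end], i.e. the waveform."""
    return [window_protection_score(fragments, pos, window)
            for pos in range(region_start, region_end + 1)]


# Toy fragments (hypothetical data, not taken from the paper).
toy_fragments = [(100, 300), (150, 260), (190, 400), (10, 50)]
score_at_200 = window_protection_score(toy_fragments, 200)
waveform = wps_waveform(toy_fragments, 195, 205)
```

On real data the fragment intervals would come from paired-end alignments, and a per-position loop like this would be too slow genome-wide. A practical implementation would more likely use prefix sums or interval arithmetic.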
Details
- Language :
- English
- ISSN :
- 14220067 and 16616596
- Volume :
- 22
- Issue :
- 11
- Database :
- Directory of Open Access Journals
- Journal :
- International Journal of Molecular Sciences
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.8fa88a89b64dd1bc869fa31015738a
- Document Type :
- article
- Full Text :
- https://doi.org/10.3390/ijms22115802