1. cPAS-based sequencing on the BGISEQ-500 to explore small non-coding RNAs
- Author
-
Snezana Drmanac, Su Xiaoshan, Andrei Alexeev, Chongjun Xu, Geng Chunyu, Yongping Li, Ming Ni, Martin Hart, Yuxiang Li, Jian Liu, Xun Xu, Tobias Fehlmann, Matthew Poulter, Andreas Keller, Christina Backes, Eckart Meese, Ao Chen, Cord F. Stähler, Radoje Drmanac, Zhang Chunyan, Zhenzhen Zhu, Nicole Ludwig, Stefanie Reinheimer, and Dan An
- Subjects
0301 basic medicine ,Sequencing data ,Biology ,MiRBase ,DNA sequencing ,03 medical and health sciences ,0302 clinical medicine ,Genetics ,Humans ,Biomarker discovery ,Molecular Biology ,Genetics (clinical) ,Exome sequencing ,Oligonucleotide Array Sequence Analysis ,miRNA ,Brain Chemistry ,Sequence Analysis, RNA ,Gene Expression Profiling ,Myocardium ,Research ,BGISEQ ,High-Throughput Nucleotide Sequencing ,Gene expression profiling ,030104 developmental biology ,Blood ,030220 oncology & carcinogenesis ,Next-generation sequencing ,RNA, Small Untranslated ,Human genome ,DNA microarray ,Developmental Biology - Abstract
Background We present the first sequencing data using the combinatorial probe-anchor synthesis (cPAS)-based BGISEQ-500 sequencer. Applying cPAS, we investigated the repertoire of human small non-coding RNAs and compared it to other techniques. Results Starting with repeated measurements of different specimens including solid tissues (brain and heart) and blood, we generated a median of 30.1 million reads per sample. 24.1 million mapped to the human genome and 23.3 million to the miRBase. Among six technical replicates of brain samples, we observed a median correlation of 0.98. Comparing BGISEQ-500 to HiSeq, we calculated a correlation of 0.75. The comparability to microarrays was similar for both BGISEQ-500 and HiSeq with the first one showing a correlation of 0.58 and the latter one correlation of 0.6. As for a potential bias in the detected expression distribution in blood cells, 98.6% of HiSeq reads versus 93.1% of BGISEQ-500 reads match to the 10 miRNAs with highest read count. After using miRDeep2 and employing stringent selection criteria for predicting new miRNAs, we detected 74 high-likely candidates in the cPAS sequencing reads prevalent in solid tissues and 36 candidates prevalent in blood. Conclusions While there is apparently no ideal platform for all challenges of miRNome analyses, cPAS shows high technical reproducibility and supplements the hitherto available platforms. Electronic supplementary material The online version of this article (doi:10.1186/s13148-016-0287-1) contains supplementary material, which is available to authorized users.
- Published
- 2016