1. Quantitation of next generation sequencing library preparation protocol efficiencies using droplet digital PCR assays - a systematic comparison of DNA library preparation kits for Illumina sequencing
- Author
-
Yong Gu, Louise Aigrain, and Michael A. Quail
- Subjects
0301 basic medicine ,Genetics ,Massive parallel sequencing ,Methodology Article ,2 base encoding ,Illumina sequencing ,High-Throughput Nucleotide Sequencing ,Computational biology ,Biology ,Polymerase Chain Reaction ,DNA sequencing ,03 medical and health sciences ,DNA sequencer ,030104 developmental biology ,Droplet digital PCR ,Next generation sequencing ,NGS ,DNA library preparation ,Genomic library ,Digital polymerase chain reaction ,ABI Solid Sequencing ,Illumina dye sequencing ,Biotechnology ,Gene Library - Abstract
Background The emergence of next-generation sequencing (NGS) technologies in the past decade has allowed the democratization of DNA sequencing both in terms of price per sequenced bases and ease to produce DNA libraries. When it comes to preparing DNA sequencing libraries for Illumina, the current market leader, a plethora of kits are available and it can be difficult for the users to determine which kit is the most appropriate and efficient for their applications; the main concerns being not only cost but also minimal bias, yield and time efficiency. Results We compared 9 commercially available library preparation kits in a systematic manner using the same DNA sample by probing the amount of DNA remaining after each protocol steps using a new droplet digital PCR (ddPCR) assay. This method allows the precise quantification of fragments bearing either adaptors or P5/P7 sequences on both ends just after ligation or PCR enrichment. We also investigated the potential influence of DNA input and DNA fragment size on the final library preparation efficiency. The overall library preparations efficiencies of the libraries show important variations between the different kits with the ones combining several steps into a single one exhibiting some final yields 4 to 7 times higher than the other kits. Detailed ddPCR data also reveal that the adaptor ligation yield itself varies by more than a factor of 10 between kits, certain ligation efficiencies being so low that it could impair the original library complexity and impoverish the sequencing results. When a PCR enrichment step is necessary, lower adaptor-ligated DNA inputs leads to greater amplification yields, hiding the latent disparity between kits. Conclusion We describe a ddPCR assay that allows us to probe the efficiency of the most critical step in the library preparation, ligation, and to draw conclusion on which kits is more likely to preserve the sample heterogeneity and reduce the need of amplification. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2757-4) contains supplementary material, which is available to authorized users.
- Published
- 2016