151. Novel computational methods for increasing PCR primer design effectiveness in directed sequencing
- Author
-
Anushka Brownley, Steve Ferriera, Sean D. Murphy, Tina C McIntosh, Karen Beeson, Dana A. Busam, Kelvin Li, Samuel Levy, and Timothy B. Stockwell
- Subjects
Molecular Sequence Data ,Computational biology ,Biology ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Polymerase Chain Reaction ,Sensitivity and Specificity ,Data sequences ,Structural Biology ,Ensembl ,lcsh:QH301-705.5 ,Molecular Biology ,DNA Primers ,Genetics ,Base Sequence ,Applied Mathematics ,Reproducibility of Results ,Sequence Analysis, DNA ,Amplicon ,Pipeline (software) ,Computer Science Applications ,lcsh:Biology (General) ,Software design ,lcsh:R858-859.7 ,DNA microarray ,Primer (molecular biology) ,Sequence Alignment ,Algorithms ,Software ,In silico PCR - Abstract
Background Polymerase chain reaction (PCR) is used in directed sequencing for the discovery of novel polymorphisms. As the first step in PCR directed sequencing, effective PCR primer design is crucial for obtaining high-quality sequence data for target regions. Since current computational primer design tools are not fully tuned with stable underlying laboratory protocols, researchers may still be forced to iteratively optimize protocols for failed amplifications after the primers have been ordered. Furthermore, potentially identifiable factors which contribute to PCR failures have yet to be elucidated. This inefficient approach to primer design is further intensified in a high-throughput laboratory, where hundreds of genes may be targeted in one experiment. Results We have developed a fully integrated computational PCR primer design pipeline that plays a key role in our high-throughput directed sequencing pipeline. Investigators may specify target regions defined through a rich set of descriptors, such as Ensembl accessions and arbitrary genomic coordinates. Primer pairs are then selected computationally to produce a minimal amplicon set capable of tiling across the specified target regions. As part of the tiling process, primer pairs are computationally screened to meet the criteria for success with one of two PCR amplification protocols. In the process of improving our sequencing success rate, which currently exceeds 95% for exons, we have discovered novel and accurate computational methods capable of identifying primers that may lead to PCR failures. We reveal the laboratory protocols and their associated, empirically determined computational parameters, as well as describe the novel computational methods which may benefit others in future primer design research. Conclusion The high-throughput PCR primer design pipeline has been very successful in providing the basis for high-quality directed sequencing results and for minimizing costs associated with labor and reprocessing. The modular architecture of the primer design software has made it possible to readily integrate additional primer critique tests based on iterative feedback from the laboratory. As a result, the primer design software, coupled with the laboratory protocols, serves as a powerful tool for low and high-throughput primer design to enable successful directed sequencing.
- Published
- 2007