Back to Search Start Over

Context-dependent individualization of nucleotides and virtual genomic hybridization allow the precise location of human SNPs

Authors :
Margarita Flores
José L. Reyes
Laura Gómez-Romero
Margareta Boege
Guillermo Dávila
Alejandro Wences
Kim Palacios-Flores
Delfino García
Luis R. Arriola
Rafael Palacios
Ximena Ibarra-Soria
Source :
Proceedings of the National Academy of Sciences. 108:15294-15299
Publication Year :
2011
Publisher :
Proceedings of the National Academy of Sciences, 2011.

Abstract

We have entered the era of individual genomic sequencing, and can already see exponential progress in the field. It is of utmost importance to exclude false-positive variants from reported datasets. However, because of the nature of the used algorithms, this task has not been optimized to the required level of precision. This study presents a unique strategy for identifying SNPs, called COIN-VGH, that largely minimizes the presence of false-positives in the generated data. The algorithm was developed using the X-chromosome–specific regions from the previously sequenced genomes of Craig Venter and James Watson. The algorithm is based on the concept that a nucleotide can be individualized if it is analyzed in the context of its surrounding genomic sequence. COIN-VGH consists of defining the most comprehensive set of nucleotide strings of a defined length that map with 100% identity to a unique position within the human reference genome (HRG). Such set is used to retrieve sequence reads from a query genome (QG), allowing the production of a genomic landscape that represents a draft HRG-guided assembly of the QG. This landscape is analyzed for specific signatures that indicate the presence of SNPs. The fidelity of the variation signature was assessed using simulation experiments by virtually altering the HRG at defined positions. Finally, the signature regions identified in the HRG and in the QG reads are aligned and the precise nature and position of the corresponding SNPs are detected. The advantages of COIN-VGH over previous algorithms are discussed.

Details

ISSN :
10916490 and 00278424
Volume :
108
Database :
OpenAIRE
Journal :
Proceedings of the National Academy of Sciences
Accession number :
edsair.doi.dedup.....c42d4f953b11b35ca6b35ce4b4aaa72d
Full Text :
https://doi.org/10.1073/pnas.1112567108