23 results on '"M. de la Vega"'
Search Results
2. Ancestry informative marker sets for determining continental origin and admixture proportions in common populations in America
- Author
-
Roman Kosoy, Rami Nassir, Rick A. Kittles, Gabriel A. Silva, Lesley M. Butler, Chao Tian, Francisco M. De La Vega, John W. Belmont, Marta E. Alarcón-Riquelme, Phoebe A. White, Peter K. Gregersen, and Michael F. Seldin
- Subjects
Genotype ,Population ,Black People ,Population genetics ,Ancestry-informative marker ,Biology ,Population stratification ,Article ,White People ,Asian People ,Gene Frequency ,Mexican Americans ,Genetic variation ,Genetics ,Humans ,education ,Allele frequency ,Genotyping ,Genetics (clinical) ,Genetic association ,education.field_of_study ,Geography ,Genome, Human ,Genetic Variation ,Hispanic or Latino ,Genetics, Population ,Evolutionary biology ,Americas - Abstract
To provide a resource for assessing continental ancestry in a wide variety of genetic studies, we identified, validated, and characterized a set of 128 ancestry informative markers (AIMs). The markers were chosen for informativeness, genome-wide distribution, and genotype reproducibility on two platforms (TaqMan assays and Illumina arrays). We analyzed genotyping data from 825 subjects with diverse ancestry, including European, East Asian, Amerindian, African, South Asian, Mexican, and Puerto Rican. A comprehensive set of 128 AIMs and subsets as small as 24 AIMs are shown to be useful tools for ascertaining the origin of subjects from particular continents, and to correct for population stratification in admixed population sample sets. Our findings provide general guidelines for the application of specific AIM subsets as a resource for wide application. We conclude that investigators can use TaqMan assays for the selected AIMs as a simple and cost efficient tool to control for differences in continental ancestry when conducting association studies in ethnically diverse populations.
- Published
- 2009
- Full Text
- View/download PDF
3. Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals
- Author
-
Ines Hellmann, Yuan Mang, Francisco M. De La Vega, Peter W. Li, Zhiping Gu, Rasmus Nielsen, and Andrew G. Clark
- Subjects
Mutation rate ,Letter ,Pan troglodytes ,Population ,Biology ,Polymorphism, Single Nucleotide ,Nucleotide diversity ,Genetics ,Animals ,Humans ,education ,Genetics (clinical) ,Likelihood Functions ,education.field_of_study ,Natural selection ,Models, Genetic ,Genome, Human ,Shotgun sequencing ,Genetic Variation ,Sequence Analysis, DNA ,Genome project ,Background selection ,Genetics, Population ,Evolutionary biology ,Human genome ,human activities - Abstract
We introduce a simple, broadly applicable method for obtaining estimates of nucleotide diversity θ from genomic shotgun sequencing data. The method takes into account the special nature of these data: random sampling of genomic segments from one or more individuals and a relatively high error rate for individual reads. Applying this method to data from the Celera human genome sequencing and SNP discovery project, we obtain estimates of nucleotide diversity in windows spanning the human genome and show that the diversity to divergence ratio is reduced in regions of low recombination. Furthermore, we show that the elevated diversity in telomeric regions is mainly due to elevated mutation rates and not due to decreased levels of background selection. However, we find indications that telomeres as well as centromeres experience greater impact from natural selection than intrachromosomal regions. Finally, we identify a number of genomic regions with increased or reduced diversity compared with the local level of human–chimpanzee divergence and the local recombination rate.
- Published
- 2008
- Full Text
- View/download PDF
4. Validation of the performance of a comprehensive genotyping assay panel of single nucleotide polymorphisms in drug metabolism enzyme genes
- Author
-
Laura Burdett, Daniel Ingber, Robert Welch, Kashif A. Haque, Nianqing Xiao, Katherine D. Lazaruk, Loni Wronka, Meredith Yeager, Fiona Hyland, Francisco M. De La Vega, and Stephen J. Chanock
- Subjects
Genetics ,education.field_of_study ,Genotype ,Population ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Enzymes ,Minor allele frequency ,Pharmaceutical Preparations ,Pharmacogenomics ,Mutation ,Genetic variation ,Humans ,International HapMap Project ,education ,Genotyping ,Genetics (clinical) - Abstract
A class of genes, known as drug metabolism enzymes (DMEs) are responsible for the metabolism and transport of drugs and other xenobiotics. Variation in DME genes most likely accounts for a proportion of the variability in drug response in humans, and may contribute to complex diseases such as cancer (Nebert DW, Dieter MZ. Pharmacology 2000;61:124-135). To date, assessing the extent of this variation has proven difficult, especially because of sequence paralogy issues that cause difficulty when attempting to genotype polymorphisms in very closely-related gene families (Murphy MP. Pharmacogenomics 2000;1:115-123; Ingelman-Sundberg M. Drug Metab Rev 1999;31:449-459). We have developed and genotyped a panel of N=2,325 individual TaqMan genotyping assays for polymorphisms in >200 DME genes; many of the variants in the panel are single nucleotide polymorphisms (SNPs) that are of known or putative function (e.g., missense, nonsense or frameshift). Using these assays, we have examined genetic variation among several groups of populations, including: 1) the two SNP500 Cancer population panels (http://snp500cancer.nci.nih.gov; last accessed: 11 December 2007); and 2) the panel used in the International HapMap Project panel (www.hapmap.org; last accessed: 11 December 2007). We have developed a comprehensive validation strategy to ensure reproducibility and accuracy of the assays and estimated minor allele frequencies. Here, we present the results of these analyses, which strongly suggest that this panel of DME assays are of extremely high quality and produce robust, accurate, and reproducible results.
- Published
- 2008
- Full Text
- View/download PDF
5. Inexpensive and Highly Reproducible Cloud-Based Variant Calling of 2,535 Human Genomes
- Author
-
Carlos Bustamante, Francisco M. De La Vega, Andrew Carroll, and Suyash Shringarpure
- Subjects
Data management ,Distributed computing ,Population ,lcsh:Medicine ,Sample (statistics) ,Genomics ,Cloud computing ,Biology ,03 medical and health sciences ,0302 clinical medicine ,Databases, Genetic ,Humans ,1000 Genomes Project ,lcsh:Science ,education ,030304 developmental biology ,Genetics ,0303 health sciences ,education.field_of_study ,Multidisciplinary ,business.industry ,Genome, Human ,lcsh:R ,Genetic Variation ,High-Throughput Nucleotide Sequencing ,Cloud Computing ,Pipeline (software) ,030220 oncology & carcinogenesis ,Scalability ,lcsh:Q ,business ,Software ,Research Article - Abstract
Population scale sequencing of whole human genomes is becoming economically feasible; however, data management and analysis remains a formidable challenge for many research groups. Large sequencing studies, like the 1000 Genomes Project, have improved our understanding of human demography and the effect of rare genetic variation in disease. Variant calling on datasets of hundreds or thousands of genomes is time-consuming, expensive, and not easily reproducible given the myriad components of a variant calling pipeline. Here, we describe a cloud-based pipeline for joint variant calling in large samples using the Real Time Genomics population caller. We deployed the population caller on the Amazon cloud with the DNAnexus platform in order to achieve low-cost variant calling. Using our pipeline, we were able to identify 68.3 million variants in 2,535 samples from Phase 3 of the 1000 Genomes Project. By performing the variant calling in a parallel manner, the data was processed within 5 days at a compute cost of $7.33 per sample (a total cost of $18,590 for completed jobs and $21,805 for all jobs). Analysis of cost dependence and running time on the data size suggests that, given near linear scalability, cloud computing can be a cheap and efficient platform for analyzing even larger sequencing studies in the future.
- Published
- 2015
6. The linkage disequilibrium maps of three human chromosomes across four populations reflect their demographic history and a common underlying recombination pattern
- Author
-
Lily Xu, Xiaoping Su, Dennis A. Gilbert, Charles R. Scafe, Sheri J. Olson, Yu Wang, Ryan T. Koehler, Ross A. Lippert, Mitsuo Itakura, Marion Laig-Webster, Junko Stevens, Andrew G. Clark, Francis Kalush, Karl J. Guegler, Sorin Istrail, Heinz G. Hemken, Kyle M. Leinen, Hadar Isaac, Janet S. Ziegle, Andrew Collins, Francisco M. De La Vega, Stephen J. O'Brien, Yi Zheng, Lewis T. Wogan, Eugene G. Spier, Xiaoqing You, Bjarni V. Halldorsson, and Michael W. Hunkapiller
- Subjects
Linkage disequilibrium ,Chromosomes, Human, Pair 21 ,Demographic history ,Chromosomes, Human, Pair 22 ,Population ,Black People ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Linkage Disequilibrium ,White People ,Asian People ,Genetics ,Humans ,SNP ,Association mapping ,education ,Genetics (clinical) ,Demography ,Genetic association ,Recombination, Genetic ,education.field_of_study ,Chromosome Mapping ,Chromosome ,Articles ,Black or African American ,Genetics, Population ,Chromosomes, Human, Pair 6 - Abstract
The extent and patterns of linkage disequilibrium (LD) determine the feasibility of association studies to map genes that underlie complex traits. Here we present a comparison of the patterns of LD across four major human populations (African-American, Caucasian, Chinese, and Japanese) with a high-resolution single-nucleotide polymorphism (SNP) map covering almost the entire length of chromosomes 6, 21, and 22. We constructed metric LD maps formulated such that the units measure the extent of useful LD for association mapping. LD reaches almost twice as far in chromosome 6 as in chromosomes 21 or 22, in agreement with their differences in recombination rates. By all measures used, out-of-Africa populations showed over a third more LD than African-Americans, highlighting the role of the population's demography in shaping the patterns of LD. Despite those differences, the long-range contour of the LD maps is remarkably similar across the four populations, presumably reflecting common localization of recombination hot spots. Our results have practical implications for the rational design and selection of SNPs for disease association studies.
- Published
- 2005
- Full Text
- View/download PDF
7. A global reference for human genetic variation
- Author
-
Colonna V. (1000 Genomes Project Consortium) Adam Auton, Gonçalo R Abecasis, David M Altshuler, Richard M Durbin, David R Bentley, Aravinda Chakravarti, Andrew G Clark, Peter Donnelly, Evan E Eichler, Paul Flicek, Stacey B Gabriel, Richard A Gibbs, Eric D Green, Matthew E Hurles, Bartha M Knoppers, Jan O Korbel, Eric S Lander, Charles Lee, Hans Lehrach, Elaine R Mardis, Gabor T Marth, Gil A McVean, Deborah A Nickerson, Jeanette P Schmidt, Stephen T Sherry, Jun Wang, Richard K Wilson, Eric Boerwinkle, Harsha Doddapaneni, Yi Han, Viktoriya Korchina, Christie Kovar, Sandra Lee, Donna Muzny, Jeffrey G Reid, Yiming Zhu, Yuqi Chang, Qiang Feng, Xiaodong Fang, Xiaosen Guo, Min Jian, Hui Jiang, Xin Jin, Tianming Lan, Guoqing Li, Jingxiang Li, Yingrui Li, Shengmao Liu, Xiao Liu, Yao Lu, Xuedi Ma, Meifang Tang, Bo Wang, Guangbiao Wang, Honglong Wu, Renhua Wu, Xun Xu, Ye Yin, Dandan Zhang, Wenwei Zhang, Jiao Zhao, Meiru Zhao, Xiaole Zheng, Namrata Gupta, Neda Gharani, Lorraine H Toji, Norman P Gerry, Alissa M Resch, Jonathan Barker, Laura Clarke, Laurent Gil, Sarah E Hunt, Gavin Kelman, Eugene Kulesha, Rasko Leinonen, William M McLaren, Rajesh Radhakrishnan, Asier Roa, Dmitriy Smirnov, Richard E Smith, Ian Streeter, Anja Thormann, Iliana Toneva, Brendan Vaughan, Xiangqun Zheng-Bradley, Russell Grocock, Sean Humphray, Terena James, Zoya Kingsbury, Ralf Sudbrak, Marcus W Albrecht, Vyacheslav S Amstislavskiy, Tatiana A Borodina, Matthias Lienhard, Florian Mertes, Marc Sultan, Bernd Timmermann, Marie-Laure Yaspo, Lucinda Fulton, Robert Fulton, Victor Ananiev, Zinaida Belaia, Dimitriy Beloslyudtsev, Nathan Bouk, Chao Chen, Deanna Church, Robert Cohen, Charles Cook, John Garner, Timothy Hefferon, Mikhail Kimelman, Chunlei Liu, John Lopez, Peter Meric, Chris O'Sullivan, Yuri Ostapchuk, Lon Phan, Sergiy Ponomarov, Valerie Schneider, Eugene Shekhtman, Karl Sirotkin, Douglas Slotta, Hua Zhang, Senduran Balasubramaniam, John Burton, Petr Danecek, Thomas M Keane, Anja Kolb-Kokocinski, Shane McCarthy, James Stalker, Michael Quail, Christopher J Davies, Jeremy Gollub, Teresa Webster, Brant Wong, Yiping Zhan, Adam Auton, Christopher L Campbell, Yu Kong, Anthony Marcketta, Fuli Yu, Lilian Antunes, Matthew Bainbridge, Aniko Sabo, Zhuoyi Huang, Lachlan J M Coin, Lin Fang, Qibin Li, Zhenyu Li, Haoxiang Lin, Binghang Liu, Ruibang Luo, Haojing Shao, Yinlong Xie, Chen Ye, Chang Yu, Fan Zhang, Hancheng Zheng, Hongmei Zhu, Can Alkan, Elif Dal, Fatma Kahveci, Erik P Garrison, Deniz Kural, Wan-Ping Lee, Wen Fung Leong, Michael Stromberg, Alistair N Ward, Jiantao Wu, Mengyao Zhang, Mark J Daly, Mark A DePristo, Robert E Handsaker, Eric Banks, Gaurav Bhatia, Guillermo Del Angel, Giulio Genovese, Heng Li, Seva Kashin, Steven A McCarroll, James C Nemesh, Ryan E Poplin, Seungtai C Yoon, Jayon Lihm, Vladimir Makarov, Srikanth Gottipati, Alon Keinan, Juan L Rodriguez-Flores, Tobias Rausch, Markus H Fritz, Adrian M Stütz, Kathryn Beal, Avik Datta, Javier Herrero, Graham R S Ritchie, Daniel Zerbino, Pardis C Sabeti, Ilya Shlyakhter, Stephen F Schaffner, Joseph Vitti, David N Cooper, Edward V Ball, Peter D Stenson, Bret Barnes, Markus Bauer, R Keira Cheetham, Anthony Cox, Michael Eberle, Scott Kahn, Lisa Murray, John Peden, Richard Shaw, Eimear E Kenny, Mark A Batzer, Miriam K Konkel, Jerilyn A Walker, Daniel G MacArthur, Monkol Lek, Ralf Herwig, Li Ding, Daniel C Koboldt, David Larson, Kai Ye, Simon Gravel, Anand Swaroop, Emily Chew, Tuuli Lappalainen, Yaniv Erlich, Melissa Gymrek, Thomas Frederick Willems, Jared T Simpson, Mark D Shriver, Jeffrey A Rosenfeld, Carlos D Bustamante, Stephen B Montgomery, Francisco M De La Vega, Jake K Byrnes, Andrew W Carroll, Marianne K DeGorter, Phil Lacroute, Brian K Maples, Alicia R Martin, Andres Moreno-Estrada, Suyash S Shringarpure, Fouad Zakharia, Eran Halperin, Yael Baran, Eliza Cerveira, Jaeho Hwang, Ankit Malhotra, Dariusz Plewczynski, Kamen Radew, Mallory Romanovitch, Chengsheng Zhang, Fiona C L Hyland, David W Craig, Alexis Christoforides, Nils Homer, Tyler Izatt, Ahmet A Kurdoglu, Shripad A Sinari, Kevin Squire, Chunlin Xiao, Jonathan Sebat, Danny Antaki, Madhusudan Gujral, Amina Noor, Kenny Ye, Esteban G Burchard, Ryan D Hernandez, Christopher R Gignoux, David Haussler, Sol J Katzman, W James Kent, Bryan Howie, Andres Ruiz-Linares, Emmanouil T Dermitzakis, Scott E Devine, Hyun Min Kang, Jeffrey M Kidd, Tom Blackwell, Sean Caron, Wei Chen, Sarah Emery, Lars Fritsche, Christian Fuchsberger, Goo Jun, Bingshan Li, Robert Lyons, Chris Scheller, Carlo Sidore, Shiya Song, Elzbieta Sliwerska, Daniel Taliun, Adrian Tan, Ryan Welch, Mary Kate Wing, Xiaowei Zhan, Philip Awadalla, Alan Hodgkinson, Yun Li, Xinghua Shi, Andrew Quitadamo, Gerton Lunter, Jonathan L Marchini, Simon Myers, Claire Churchhouse, Olivier Delaneau, Anjali Gupta-Hinch, Warren Kretzschmar, Zamin Iqbal, Iain Mathieson, Androniki Menelaou, Andy Rimmer, Dionysia K Xifara, Taras K Oleksyk, Yunxin Fu, Xiaoming Liu, Momiao Xiong, Lynn Jorde, David Witherspoon, Jinchuan Xing, Brian L Browning, Sharon R Browning, Fereydoun Hormozdiari, Peter H Sudmant, Ekta Khurana, Chris Tyler-Smith, Cornelis A Albers, Qasim Ayub, Yuan Chen, Vincenza Colonna, Luke Jostins, Klaudia Walter, Yali Xue, Mark B Gerstein, Alexej Abyzov, Suganthi Balasubramanian, Jieming Chen, Declan Clarke, Yao Fu, Arif O Harmanci, Mike Jin, Donghoon Lee, Jeremy Liu, Xinmeng Jasmine Mu, Jing Zhang, Yan Zhang, Chris Hartl, Khalid Shakir, Jeremiah Degenhardt, Sascha Meiers, Benjamin Raeder, Francesco Paolo Casale, Oliver Stegle, Eric-Wubbo Lameijer, Ira Hall, Vineet Bafna, Jacob Michaelson, Eugene J Gardner, Ryan E Mills, Gargi Dayama, Ken Chen, Xian Fan, Zechen Chong, Tenghui Chen, Mark J Chaisson, John Huddleston, Maika Malig, Bradley J Nelson, Nicholas F Parrish, Ben Blackburne, Sarah J Lindsay, Zemin Ning, Yujun Zhang, Hugo Lam, Cristina Sisu, Danny Challis, Uday S Evani, James Lu, Uma Nagaswamy, Jin Yu, Wangshen Li, Lukas Habegger, Haiyuan Yu, Fiona Cunningham, Ian Dunham, Kasper Lage, Jakob Berg Jespersen, Heiko Horn, Donghoon Kim, Rob Desalle, Apurva Narechania, Melissa A Wilson Sayres, Fernando L Mendez, G David Poznik, Peter A Underhill, Lachlan Coin, David Mittelman, Ruby Banerjee, Maria Cerezo, Thomas W Fitzgerald, Sandra Louzada, Andrea Massaia, Graham R Ritchie, Fengtang Yang, Divya Kalra, Walker Hale, Xu Dan, Kathleen C Barnes, Christine Beiswanger, Hongyu Cai, Hongzhi Cao, Brenna Henn, Danielle Jones, Jane S Kaye, Alastair Kent, Angeliki Kerasidou, Rasika Mathias, Pilar N Ossorio, Michael Parker, Charles N Rotimi, Charmaine D Royal, Karla Sandoval, Yeyang Su, Zhongming Tian, Sarah Tishkoff, Marc Via, Yuhong Wang, Huanming Yang, Ling Yang, Jiayong Zhu, Walter Bodmer, Gabriel Bedoya, Zhiming Cai, Yang Gao, Jiayou Chu, Leena Peltonen, Andres Garcia-Montero, Alberto Orfao, Julie Dutil, Juan C Martinez-Cruzado, Rasika A Mathias, Anselm Hennis, Harold Watson, Colin McKenzie, Firdausi Qadri, Regina LaRocque, Xiaoyan Deng, Danny Asogun, Onikepe Folarin, Christian Happi, Omonwunmi Omoniwa, Matt Stremlau, Ridhi Tariyal, Muminatou Jallow, Fatoumatta Sisay Joof, Tumani Corrah, Kirk Rockett, Dominic Kwiatkowski, Jaspal Kooner, Trân T?nh Hiên, Sarah J Dunstan, Nguyen Thuy Hang, Richard Fonnie, Robert Garry, Lansana Kanneh, Lina Moses, John Schieffelin, Donald S Grant, Carla Gallo, Giovanni Poletti, Danish Saleheen, Asif Rasheed, Lisa D Brooks, Adam L Felsenfeld, Jean E McEwen, Yekaterina Vaydylevich, Audrey Duncanson, Michael Dunn, Jeffery A Schloss, 1000 Genomes Project Consortium, Institute for Medical Engineering and Science, Broad Institute of MIT and Harvard, Lincoln Laboratory, Massachusetts Institute of Technology. Department of Biology, Gabriel, Stacey, Lander, Eric Steven, Daly, Mark J, Banks, Eric, Bhatia, Gaurav, Kashin, Seva, McCarroll, Steven A, Nemesh, James, Poplin, Ryan E., Sabeti, Pardis, Shlyakhter, Ilya, Schaffner, Stephen F, Vitti, Joseph, Gymrek, Melissa A, Hartler, Christina M., and Tariyal, Ridhi
- Subjects
demography ,genetic association ,genotype ,Human genomics ,Genome-wide association study ,Review ,SUSCEPTIBILITY ,DISEASE ,polymorphism ,0302 clinical medicine ,quantitative trait locus ,INDEL Mutation ,genetics ,GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries) ,MUTATION ,Exome sequencing ,0303 health sciences ,public health ,Sequence analysis ,High-Throughput Nucleotide Sequencing ,standard ,Genomics ,Reference Standards ,Physical Chromosome Mapping ,3. Good health ,priority journal ,Science & Technology - Other Topics ,BAYES FACTORS ,Molecular Developmental Biology ,Genotype ,Genetics, Medical ,Quantitative Trait Loci ,DNA sequence ,rare disease ,human genetics ,information processing ,Article ,03 medical and health sciences ,SDG 3 - Good Health and Well-being ,POPULATION HISTORY ,human genome ,Humans ,retroposon ,Genetic variability ,human ,GENOME-WIDE ASSOCIATION ,1000 Genomes Project ,Demography ,Science & Technology ,ancestry ,disease predisposition ,Genetic Variation ,MACULAR DEGENERATION ,major clinical study ,gene linkage disequilibrium ,purl.org/pe-repo/ocde/ford#3.01.02 [https] ,Genetics, Population ,030217 neurology & neurosurgery ,haplotype ,Internationality ,VARIANT ,Datasets as Topic ,Human genetic variation ,COMPLEMENT FACTOR-H ,single nucleotide polymorphism ,genetic variability ,Exome ,chromosome map ,Genetics ,Variant Call Format ,Genome ,Multidisciplinary ,1000 Genomes Project Consortium ,international cooperation ,Multidisciplinary Sciences ,standards ,Disease Susceptibility ,medical genetics ,General Science & Technology ,Population ,Computational biology ,Biology ,gene frequency ,Polymorphism, Single Nucleotide ,high throughput sequencing ,Rare Diseases ,promoter region ,MD Multidisciplinary ,Genetic variation ,QH426 ,030304 developmental biology ,Neurodevelopmental disorders Donders Center for Medical Neuroscience [Radboudumc 7] ,Genome, Human ,population genetics ,population structure ,Sequence Analysis, DNA ,gene structure ,INDIVIDUALS ,Haplotypes ,Genome-Wide Association Study ,purl.org/pe-repo/ocde/ford#1.06.07 [https] - Abstract
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies., Wellcome Trust (London, England) (Core Award 090532/Z/09/Z), Wellcome Trust (London, England) (Senior Investigator Award 095552/Z/11/Z ), Wellcome Trust (London, England) (WT095908), Wellcome Trust (London, England) (WT109497), Wellcome Trust (London, England) (WT098051), Wellcome Trust (London, England) (WT086084/Z/08/Z), Wellcome Trust (London, England) (WT100956/Z/13/Z ), Wellcome Trust (London, England) (WT097307), Wellcome Trust (London, England) (WT0855322/Z/08/Z ), Wellcome Trust (London, England) (WT090770/Z/09/Z ), Wellcome Trust (London, England) (Major Overseas program in Vietnam grant 089276/Z.09/Z), Medical Research Council (Great Britain) (grant G0801823), Biotechnology and Biological Sciences Research Council (Great Britain) (grant BB/I02593X/1), Biotechnology and Biological Sciences Research Council (Great Britain) (grant BB/I021213/1), Zhongguo ke xue ji shu qing bao yan jiu suo. Office of 863 Programme of China (2012AA02A201), National Basic Research Program of China (2011CB809201), National Basic Research Program of China (2011CB809202), National Basic Research Program of China (2011CB809203), National Natural Science Foundation of China (31161130357), Shenzhen Municipal Government of China (grant ZYC201105170397A), Canadian Institutes of Health Research (grant 136855), Quebec Ministry of Economic Development, Innovation, and Exports (PSR-SIIRI-195), Germany. Bundesministerium für Bildung und Forschung (0315428A), Germany. Bundesministerium für Bildung und Forschung (01GS08201), Germany. Bundesministerium für Bildung und Forschung (BMBF-EPITREAT grant 0316190A), Deutsche Forschungsgemeinschaft (Emmy Noether Grant KO4037/1-1), Beatriu de Pinos Program (2006 BP-A 10144), Beatriu de Pinos Program (2009 BP-B 00274), Spanish National Institute for Health (grant PRB2 IPT13/0001-ISCIII-SGEFI/FEDER), Japan Society for the Promotion of Science (fellowship number PE13075), Marie Curie Actions Career Integration (grant 303772), Fonds National Suisse del la Recherche, SNSF, Scientifique (31003A_130342), National Center for Biotechnology Information (U.S.) (U54HG3067), National Center for Biotechnology Information (U.S.) (U54HG3273), National Center for Biotechnology Information (U.S.) (U01HG5211), National Center for Biotechnology Information (U.S.) (U54HG3079), National Center for Biotechnology Information (U.S.) (R01HG2898), National Center for Biotechnology Information (U.S.) (R01HG2385), National Center for Biotechnology Information (U.S.) (RC2HG5552), National Center for Biotechnology Information (U.S.) (U01HG6513), National Center for Biotechnology Information (U.S.) (U01HG5214), National Center for Biotechnology Information (U.S.) (U01HG5715), National Center for Biotechnology Information (U.S.) (U01HG5718), National Center for Biotechnology Information (U.S.) (U01HG5728), National Center for Biotechnology Information (U.S.) (U41HG7635), National Center for Biotechnology Information (U.S.) (U41HG7497), National Center for Biotechnology Information (U.S.) (R01HG4960), National Center for Biotechnology Information (U.S.) (R01HG5701), National Center for Biotechnology Information (U.S.) (R01HG5214), National Center for Biotechnology Information (U.S.) (R01HG6855), National Center for Biotechnology Information (U.S.) (R01HG7068), National Center for Biotechnology Information (U.S.) (R01HG7644), National Center for Biotechnology Information (U.S.) (DP2OD6514), National Center for Biotechnology Information (U.S.) (DP5OD9154), National Center for Biotechnology Information (U.S.) (R01CA166661), National Center for Biotechnology Information (U.S.) (R01CA172652), National Center for Biotechnology Information (U.S.) (P01GM99568), National Center for Biotechnology Information (U.S.) (R01GM59290), National Center for Biotechnology Information (U.S.) (R01GM104390), National Center for Biotechnology Information (U.S.) (T32GM7790), National Center for Biotechnology Information (U.S.) (R01HL87699), National Center for Biotechnology Information (U.S.) (R01HL104608), National Center for Biotechnology Information (U.S.) (T32HL94284), National Center for Biotechnology Information (U.S.) (HHSN268201100040C), National Center for Biotechnology Information (U.S.) (HHSN272201000025C), Lundbeck Foundation (grant R170-2014-1039, Simons Foundation (SFARI award SF51), National Science Foundation (U.S.) (Research Fellowship DGE-1147470)
- Published
- 2015
- Full Text
- View/download PDF
8. New Generation Pharmacogenomic Tools: A SNP Linkage Disequilibrium Map, Validated SNP Assay Resource, and High-Throughput Instrumentation System for Large-Scale Genetic Studies
- Author
-
David Dailey, Dawn Madden, Janet S. Ziegle, Dennis A. Gilbert, Francisco M. De La Vega, and Julie Williams
- Subjects
Genetics ,Linkage disequilibrium ,education.field_of_study ,business.industry ,Population ,Genomics ,Single-nucleotide polymorphism ,Computational biology ,Biology ,Tag SNP ,General Biochemistry, Genetics and Molecular Biology ,SNP ,Human genome ,Personalized medicine ,business ,education ,Biotechnology - Abstract
Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies—specifically for candidate-gene, candidate-region, and whole-genome association studies—will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics’ human genome assembly and enormous resource of physically mapped SNPs (approximately 4 000 000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency of the polymorphisms. We also discuss an advanced, high-performance SNP assay chemistry—a new generation of the TaqMan® probe-based, 5′ nuclease assay—and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.
- Published
- 2002
- Full Text
- View/download PDF
9. Joint variant andde novomutation identification on pedigrees from high-throughput sequencing data
- Author
-
John G Cleary, David Ware, Mehul Kamlesh Rathod, Kurt Gaastra, Sahar Nohzadeh-Malakshah, Richard Littin, Francisco M. De La Vega, Minita Shah, S. Inglis, Len Trigg, Ross Braithwaite, Brian S Hilbush, Sean A. Irvine, and Alan Jackson
- Subjects
Genetics ,education.field_of_study ,dbSNP ,Population ,False positive paradox ,Genomics ,Pedigree chart ,Biology ,education ,DNA sequencing ,Exome sequencing ,SNP array - Abstract
The analysis of whole-genome or exome sequencing data from trios and pedigrees has being successfully applied to the identification of disease-causing mutations. However, most methods used to identify and genotype genetic variants from next-generation sequencing data ignore the relationships between samples, resulting in significant Mendelian errors, false positives and negatives. Here we present a Bayesian network framework that jointly analyses data from all members of a pedigree simultaneously using Mendelian segregation priors, yet providing the ability to detectde novomutations in offspring, and is scalable to large pedigrees. We evaluated our method by simulations and analysis of WGS data from a 17 individual, 3-generation CEPH pedigree sequenced to 50X average depth. Compared to singleton calling, our family caller produced more high quality variants and eliminated spurious calls as judged by common quality metrics such as Ti/Tv, Het/Hom ratios, and dbSNP/SNP array data concordance. We developed a ground truth dataset to further evaluate our calls by identifying recombination cross-overs in the pedigree and testing variants for consistency with the inferred phasing, and we show that our method significantly outperforms singleton and population variant calling in pedigrees. We identify all previously validatedde novomutations in NA12878, concurrent with a 7X precision improvement. Our results show that our method is scalable to large genomics and human disease studies and allows cost optimization by rational sequencing capacity distribution.
- Published
- 2014
- Full Text
- View/download PDF
10. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation
- Author
-
Jeffrey M. Kidd, Andrés Moreno-Estrada, Victor Acuña-Alonzo, Stephen F. McLaughlin, Celeste Eng, Martin G. Reese, Larsson Omberg, Rodrigo Barquera-Lozano, Adam Auton, Atul J. Butte, Jake K. Byrnes, Andrew R. Reynolds, Samuel Canizales-Quinteros, Jeremiah D. Degenhardt, Heather E. Peckham, Francisco M. De La Vega, Stephen E. Lincoln, Christina A. Bormann Chung, Rong Chen, Sarah Stanley, Suehelay Acevedo-Acevedo, Simon Gravel, Archie Russell, Alon Keinan, Vrunda Sheth, Esteban G. Burchard, Elizabeth Levandowsky, Kevin A. Pearlstein, Andrew G. Clark, Katarzyna Bryc, Abra Brisbin, Shaila Musharoff, and Carlos Bustamante
- Subjects
Heterozygote ,Population ,Human genetic variation ,Biology ,Polymorphism, Single Nucleotide ,Genome ,Medical and Health Sciences ,Article ,Gene flow ,Loss of heterozygosity ,03 medical and health sciences ,0302 clinical medicine ,Genetics ,Humans ,Genetics(clinical) ,Polymorphism ,education ,Genetics (clinical) ,030304 developmental biology ,Genetics & Heredity ,0303 health sciences ,education.field_of_study ,Continental Population Groups ,Genome, Human ,Haplotype ,Racial Groups ,Human Genome ,Single Nucleotide ,Biological Sciences ,Genetics, Population ,Haplotypes ,Evolutionary biology ,Human genome ,030217 neurology & neurosurgery ,Personal genomics ,Human - Abstract
Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas—70% of the European ancestry in today’s African Americans dates back to European gene flow happening only 7–8 generations ago.
- Published
- 2012
11. Low-pass genome-wide sequencing and variant inference using identity-by-descent in an isolated human population
- Author
-
Markus Stoffel, Arthi Ramachandran, F. M. De La Vega, Jeffrey M. Friedman, T. N. Weaver, Michael R. Lyons, Q. C. Doan, Heather E. Peckham, Alexander Gusev, Eimear E. Kenny, J. Salit, Minita Shah, Vrunda Sheth, Itsik Pe'er, Clarence Lee, Jennifer K. Lowe, Elizabeth Levandowsky, Stephen F. McLaughlin, and Jan L. Breslow
- Subjects
Genotype ,Population ,Genome-wide association study ,Locus (genetics) ,Biology ,Investigations ,Pacific Islands ,Identity by descent ,Polymorphism, Single Nucleotide ,Cohort Studies ,Gene Frequency ,Population Groups ,Genetics ,Humans ,education ,Allele frequency ,Alleles ,education.field_of_study ,Genome, Human ,Reproducibility of Results ,Founder Effect ,Population bottleneck ,Imputation (genetics) ,Algorithms ,Founder effect ,Genome-Wide Association Study - Abstract
Whole-genome sequencing in an isolated population with few founders directly ascertains variants from the population bottleneck that may be rare elsewhere. In such populations, shared haplotypes allow imputation of variants in unsequenced samples without resorting to complex statistical methods as in studies of outbred cohorts. We focus on an isolated population cohort from the Pacific Island of Kosrae, Micronesia, where we previously collected SNP array and rich phenotype data for the majority of the population. We report identification of long regions with haplotypes co-inherited between pairs of individuals and methodology to leverage such shared genetic content for imputation. Our estimates show that sequencing as few as 40 personal genomes allows for inference in up to 60% of the 3000-person cohort at the average locus. We ascertained a pilot data set of whole-genome sequences from seven Kosraean individuals, with average 5× coverage. This assay identified 5,735,306 unique sites of which 1,212,831 were previously unknown. Additionally, these variants are unusually enriched for alleles that are rare in other populations when compared to geographic neighbors (published Korean genome SJK). We used the presence of shared haplotypes between the seven Kosraen individuals to estimate expected imputation accuracy of known and novel homozygous variants at 99.6% and 97.3%, respectively. This study presents whole-genome analysis of a homogenous isolate population with emphasis on optimal rare variant inference.
- Published
- 2011
12. Mapping copy number variation by population-scale genome sequencing
- Author
-
L. McDade, Eric D. Green, Aravinda Chakravarti, Susan Lindsay, Justin Paschall, Aylwyn Scally, Deborah A. Nickerson, Chip Stewart, Stephen T. Sherry, Chunlin Xiao, Alex Reynolds, Carol Scott, H. M. Khouri, Pardis C. Sabeti, Xinmeng Jasmine Mu, Stephen B. Montgomery, Eric Banks, Gabor T. Marth, A. Caprio, Xiaole Zheng, Philip Awadalla, Qunyuan Zhang, Wei Chen, Matthew N. Bainbridge, Donna Muzny, Steven A. McCarroll, Jeffrey M. Kidd, Honglong Wu, Audrey Duncanson, Vladimir Makarov, Lilia M. Iakoucheva, Mark Gerstein, Han-Jun Jin, Can Alkan, Iman Hajirasouliha, T. J. Fennell, C. R. Juenger, J. Kidd, Chris Tyler-Smith, Qasim Ayub, D. Ashworth, Kristian Cibulskis, Yutao Fu, William M. McLaren, Sol Katzman, Yujun Zhang, Rajini R Haraksingh, A. Kebbel, Stuart L. Schreiber, Manual Rivas, Onur Sakarya, Tobias Rausch, Yuan Chen, M. Bachorski, Matthew E. Hurles, N. C. Clemm, Wei Wang, Xiangqun Zheng-Bradley, Adrian M. Sütz, Thomas M. Keane, E. Bank, Stephen F. McLaughlin, Javier Herrero, Jon Keebler, Simon Myers, Aleksandr Morgulis, James Nemesh, Jing Leng, Molly Przeworski, Alon Keinan, Lorraine Toji, Ilya Shlyakhter, Joshua M. Korn, Martine Zilversmit, Luke Jostins, Jun Wang, Jared Maguire, J. M. Korn, Ryan E. Mills, Seungtai Yoon, Bo Wang, F. M. De La Vega, Heng Li, L. Guccione, Laura Clarke, Huisong Zheng, Jeffrey K. Ichikawa, K. Kao, Kirill Rotmistrovsky, L. Gu, David B. Jaffe, David Haussler, Toby Bloom, Tara Skelly, S. Yoon, Gil McVean, Carrie Sougnez, Mark A. Batzer, A. De Witte, Ralf Herwig, Jane Wilkinson, Min Hu, K. Pareja, John V. Pearson, Robert E. Handsaker, Jerilyn A. Walker, Fuli Yu, Anthony A. Philippakis, Aniko Sabo, Jonathan Marchini, Ryan D. Hernandez, Guoqing Li, Peter Donnelly, Eric S. Lander, David J. Dooling, Jun Ding, Lukas Habegger, Pilar N. Ossorio, Andreas Dahl, Wilfried Nietfeld, Miriam F. Moffatt, Alexej Abyzov, Sebastian Zöllner, Ekta Khurana, Jean E. McEwen, Robert S. Fulton, Alexey Soldatov, Fiona Hyland, Philippe Lacroute, Richa Agarwala, Paul Flicek, Weichun Huang, Alison J. Coffey, Tony Cox, John W. Wallis, Robert Sanders, David Neil Cooper, Jason P. Affourtit, Mark A. DePristo, D Wheeler, Christopher Celone, Eugene Kulesha, Craig Elder Mealmaker, B. Desany, Zhengdong D. Zhang, Jonathan M. Manning, Cynthia L. Turcotte, Lisa D Brooks, Xiuqing Zhang, C. Coafra, Rajesh Radhakrishnan, Alan J. Schafer, Jonathan Sebat, Ken Chen, Andrew G. Clark, Alexis Christoforides, Edward V. Ball, Mark S. Guyer, Sharon R. Grossman, Philip Rosenstiel, J. Knowlton, Gonçalo R. Abecasis, Min Jian, James O. Burton, S. Wang, Lucinda Murray, George M. Weinstock, Mark Lathrop, Harold Swerdlow, Michael L. Metzker, Xiaowei Zhan, Yeyang Su, Ruibang Luo, Charles Lee, Huanming Yang, P. Marquardt, Charles N. Rotimi, Lynne V. Nazareth, Michael Snyder, Faheem Niazi, Quan Long, Jane Kaye, Michael Strömberg, Adam Auton, Michael Bauer, Cheng-Sheng Lee, S. Gabriel, Jim Stalker, Heather E. Peckham, D. Conners, Raffaella Smith, Yingrui Li, Niall Anthony Gormley, Megan Hanna, Jinchuan Xing, Hugo Y. K. Lam, S. Giles, Evan E. Eichler, Justin Jee, Loukas Moutsianas, Jiang Du, Hyun Min Kang, Eric F. Tsung, Ni Huang, Kai Ye, Stephen F. Schaffner, Suleyman Cenk Sahinalp, Xinghua Shi, Sean Humphray, Ahmet Kurdoglu, Amy L. McGuire, Sandra J. Lee, Linnea Fulton, Francis S. Collins, Huiqing Liang, S. C. Melton, A. Nawrocki, Aaron R. Quinlan, Tatjana Borodina, Lynn B. Jorde, Leopold Parts, Michael D. McLellan, Adrian M. Stütz, Paul Scheet, Amit Indap, Vyacheslav Amstislavskiy, Waibhav Tembe, S. Attiya, Jin Yu, Dmitri Parkhomchuk, Si Quang Le, Fabian Grubert, E. Buglione, Ruiqiang Li, Yan Zhou, Fiona Cunningham, Gilean McVean, Wan-Ping Lee, W. Song, Richard Durbin, Andrew Kernytsky, Stephen M. Beckstrom-Sternberg, Xin Ma, J. Jeng, Lauren Ambrogio, Carol Churcher, Ryan Poplin, William O.C.M. Cookson, Rasko Leinonen, Alexey N. Davydov, Kenny Ye, Paige Anderson, Alexander E. Urban, Adam Felsenfeld, Jeffrey S. Reid, Cornelis A. Albers, Jan O. Korbel, Senduran Balasubramaniam, Elaine R. Mardis, Gozde Aksay, Peter H. Sudmant, Aaron McKenna, M. Labrecque, Amanda J. Price, Vadim Zalunin, Donald F. Conrad, Florian Mertes, Christie Kovar, Danny Challis, A. D. Ball, Petr Danecek, Kiran V. Garimella, Bryan Howie, Scott Kahn, Shuaishuai Tai, E. P. Garrison, Robert D. Bjornson, Shankar Balasubramanian, Fereydoun Hormozdiari, Geng Tian, S. Clark, Joanna L. Kelley, Asif T. Chinwalla, Ramenani Ravi K, Ralf Sudbrak, Mark Kaganovich, Jeffrey C. Barrett, David Rio Deiros, Jeremiah D. Degenhardt, A. Palotie, Alistair Ward, Gianna Costa, Huyen Dinh, M. Minderman, R. Keira Cheetham, Jingxiang Li, Michael A. Quail, P. Koko-Gonzales, Alastair Kent, Martin Shumway, David R. Bentley, Ferran Casals, Leena Peltonen, Klaudia Walter, Christopher Hartl, Erica Shefler, Zhaolei Zhang, Hans Lehrach, Jessica L. Peterson, Roger Winer, Daniel C. Koboldt, D. Riches, Terena James, Wen Fung Leong, Michael Egholm, Thomas W. Blackwell, Peter D. Stenson, Anthony J. Cox, Andrew D. Kern, David M. Carter, M. Tolzmann, Daniel G. MacArthur, Jiantao Wu, Jennifer Stone, Angie S. Hinrichs, M. Albrecht, Jo Knight, Chang-Yun Lin, Adam R. Boyko, Dan Turner, Xiaodong Fang, Youssef Idaghdour, Liming Liang, Ryan N. Gutenkunst, David Craig, Mark J. Daly, Xiaosen Guo, Neda Gharani, Gerton Lunter, Shuli Kang, A. Burke, Shripad Sinari, Yongming A. Sun, Zoya Kingsbury, Robert M. Kuhn, Miriam K. Konkel, T. Li, Kevin McKernan, Simon Gravel, Brian L. Browning, C Sidore, Zamin Iqbal, Matthew Mort, Afidalina Tumian, Michael C. Wendl, Adam Phillips, Bernd Timmermann, Carlos Bustamante, H. Y. Lam, Deniz Kural, Richard A. Gibbs, Bartha Maria Knoppers, Emmanouil T. Dermitzakis, Lon Phan, Richard K. Wilson, D. L. Altshuler, S. Keenen, Assya Abdallah, Eric A. Stone, Michael A. Eberle, Li Ding, and Broad Institute of MIT and Harvard
- Subjects
DNA Copy Number Variations ,Genotype ,Population ,Genomic Structural Variation ,Genomics ,Computational biology ,Biology ,Genome ,Article ,DNA sequencing ,structural variation segmental duplications short-read rearrangements disorders disease common schizophrenia polymorphism insertions ,03 medical and health sciences ,0302 clinical medicine ,Gene Duplication ,Insertional ,Genetics ,Humans ,Genetic Predisposition to Disease ,Copy-number variation ,1000 Genomes Project ,education ,Sequence Deletion ,030304 developmental biology ,0303 health sciences ,education.field_of_study ,Multidisciplinary ,Genome, Human ,Reproducibility of Results ,Sequence Analysis, DNA ,DNA ,Mutagenesis, Insertional ,Genetics, Population ,Mutagenesis ,Human genome ,Sequence Analysis ,030217 neurology & neurosurgery ,Human - Abstract
Summary Genomic structural variants (SVs) are abundant in humans, differing from other variation classes in extent, origin, and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (i.e., copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analyzing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.
- Published
- 2011
13. Doubled haploid lines of Brassica carinata with modified erucic acid content through mutagenesis by EMS treatment of isolated microspores
- Author
-
J. Fernández-Escobar, M. de la Vega, Francisco Barro, and Antonio Martín
- Subjects
chemistry.chemical_classification ,education.field_of_study ,Population ,Brassica carinata ,food and beverages ,Fatty acid ,Plant Science ,Biology ,biology.organism_classification ,Crop ,chemistry.chemical_compound ,Microspore ,chemistry ,Erucic acid ,Botany ,Genetics ,Microspora ,Doubled haploidy ,education ,Agronomy and Crop Science - Abstract
Brassica carinata is a potential oilseed crop for the Mediterranean area. Chemical mutagenesis has been applied to microspores of B. carinata with the purpose of identifying lines with altered erucic acid content. From a population of nearly 400 doubled haploid plants recovered, nine lines have been identified that exhibit promising useful changes in erucic acid concentration in the seed oil. Three lines showed erucic acid contents below 25%, with a minimum of 17.1%, and in six lines the level of this fatty acid was greater than 52%. Changes in other fatty acids are also described and discussed.
- Published
- 2001
- Full Text
- View/download PDF
14. An ancestry informative marker set for determining continental origin: validation and extension using human genome diversity panels
- Author
-
Marta E. Alarcón-Riquelme, Michael F. Seldin, Lesley M. Butler, Rick A. Kittles, Francisco M. De La Vega, Gabriel A. Silva, Roman Kosoy, John W. Belmont, Rami Nassir, Peter K. Gregersen, Phoebe A. White, and Chao Tian
- Subjects
Genetic Markers ,lcsh:QH426-470 ,Genotype ,media_common.quotation_subject ,Population ,Black People ,Ancestry-informative marker ,Biology ,Population stratification ,Polymorphism, Single Nucleotide ,Genome ,White People ,03 medical and health sciences ,0302 clinical medicine ,Asian People ,Genetics ,Cluster Analysis ,Humans ,SNP ,Genetics(clinical) ,030216 legal & forensic medicine ,education ,Genetics (clinical) ,030304 developmental biology ,media_common ,Principal Component Analysis ,0303 health sciences ,education.field_of_study ,Genome, Human ,lcsh:Genetics ,Genetics, Population ,Genetic marker ,Human genome ,Research Article ,Diversity (politics) - Abstract
Background Case-control genetic studies of complex human diseases can be confounded by population stratification. This issue can be addressed using panels of ancestry informative markers (AIMs) that can provide substantial population substructure information. Previously, we described a panel of 128 SNP AIMs that were designed as a tool for ascertaining the origins of subjects from Europe, Sub-Saharan Africa, Americas, and East Asia. Results In this study, genotypes from Human Genome Diversity Panel populations were used to further evaluate a 93 SNP AIM panel, a subset of the 128 AIMS set, for distinguishing continental origins. Using both model-based and relatively model-independent methods, we here confirm the ability of this AIM set to distinguish diverse population groups that were not previously evaluated. This study included multiple population groups from Oceana, South Asia, East Asia, Sub-Saharan Africa, North and South America, and Europe. In addition, the 93 AIM set provides population substructure information that can, for example, distinguish Arab and Ashkenazi from Northern European population groups and Pygmy from other Sub-Saharan African population groups. Conclusion These data provide additional support for using the 93 AIM set to efficiently identify continental subject groups for genetic studies, to identify study population outliers, and to control for admixture in association studies.
- Published
- 2009
- Full Text
- View/download PDF
15. Identifying selected regions from heterozygosity and divergence using a light-coverage genomic dataset from two human populations
- Author
-
Stephen J. O'Brien, Michael W. Smith, Taras K. Oleksyk, Dennis A. Gilbert, Francisco M. De La Vega, and Kai Zhao
- Subjects
Genetic Markers ,0106 biological sciences ,Heterozygote ,Linkage disequilibrium ,Genotype ,Population ,lcsh:Medicine ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,010603 evolutionary biology ,01 natural sciences ,Linkage Disequilibrium ,White People ,03 medical and health sciences ,Genetics and Genomics/Population Genetics ,Humans ,Computer Simulation ,Selection, Genetic ,Evolutionary Biology/Genomics ,education ,lcsh:Science ,Alleles ,030304 developmental biology ,Genetics ,Evolutionary Biology ,0303 health sciences ,education.field_of_study ,Multidisciplinary ,Genome, Human ,Directional selection ,lcsh:R ,Genetic Variation ,Genetics and Genomics ,Evolutionary Biology/Human Evolution ,Black or African American ,Genetic divergence ,Fixation (population genetics) ,Chromosomal region ,lcsh:Q ,Genetics and Genomics/Comparative Genomics ,Selective sweep ,Research Article - Abstract
When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1) decreased multi-locus heterozygosity in the region; 2) elevated or diminished genetic divergence (F(ST)) of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3) a consequent regional increase in the variance of F(ST) (S(2)F(ST)) for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5-65 SNPs) across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans from smaller studies of humans or other species.
- Published
- 2008
16. Optimal selection of SNP markers for disease association studies
- Author
-
Sorin Istrail, Francisco M. De La Vega, and Bjarni V. Halldorsson
- Subjects
Genetic Markers ,Linkage disequilibrium ,Candidate gene ,Time Factors ,Genotype ,Population ,Single-nucleotide polymorphism ,Computational biology ,Biology ,Polymorphism, Single Nucleotide ,Linkage Disequilibrium ,Genetics ,SNP ,Humans ,Genetic Predisposition to Disease ,education ,Association mapping ,Genetics (clinical) ,Alleles ,Genetic association ,education.field_of_study ,Principal Component Analysis ,Models, Statistical ,Models, Genetic ,Genetic Diseases, Inborn ,Tag SNP ,Genetic Techniques ,Haplotypes ,Multivariate Analysis ,Algorithms ,Software - Abstract
Genetic association studies with population samples hold the promise of uncovering the susceptibility genes underlying the heritability of complex or common disease. Most association studies rely on the use of surrogate markers, single-nucleotide polymorphism (SNP) being the most suitable due to their abundance and ease of scoring. SNP marker selection is aimed to increase the chances that at least one typed SNP would be in linkage disequilibrium (LD) with the disease causative variant, while at the same time controlling the cost of the study in terms of the number of markers genotyped and samples. Empirical studies reporting block-like segments in the genome with high LD and low haplotype diversity have motivated a marker selection strategy whereby subsets of SNPs that ‘tag’ the common haplotypes of a region are picked for genotyping, avoiding typing redundant SNPs. Based on these initial observations, a plethora of ‘tagging’ algorithms for selecting minimum informative subsets of SNPs has recently appeared in the literature. These differ mostly in two major aspects: the quality or correlation measure used to define tagging and the algorithm used for the minimization of the final number of tagging SNPs. In this review we describe the available tagging algorithms utilizing a 3-step unifying framework, point out their methodological and conceptual differences, and make an assessment of their assumptions, performance, and scalability.
- Published
- 2005
17. Mapping genes for common diseases: the case for genetic (LD) maps
- Author
-
Andrew Collins, Winston Lau, and Francisco M. De La Vega
- Subjects
Genetics ,education.field_of_study ,Linkage disequilibrium ,Genome, Human ,Population ,Chromosome Mapping ,Biology ,Tag SNP ,Polymorphism, Single Nucleotide ,Complete linkage ,Linkage Disequilibrium ,Genetics, Population ,Haplotypes ,Genetic linkage ,Evolutionary biology ,Humans ,Genetic Predisposition to Disease ,International HapMap Project ,Association mapping ,education ,Genetics (clinical) ,Genetic association - Abstract
We examine the current effort to develop a haplotype map of the human genome and suggest an alternative approach which represents linkage disequilibrium patterns in the form of a metric LD map. LD maps have some of the useful properties of genetic linkage maps but have a much higher resolution which is optimal for SNP-based association mapping of common diseases. The studies that have been undertaken to date suggest that LD and recombination maps show some close similarities because of abundant, narrow, recombination hot spots. These hot spots are co-localised in all populations but, unlike linkage maps, LD maps differ in scale for different populations because of differences in population history. The prospects for developing optimized panels of SNPs and the use of linkage disequilibrium maps in disease gene localisation are assessed in the light of recent evidence.
- Published
- 2004
18. Patterns of Linkage Disequilibrium across Human Chromosomes 6, 21, AND 22
- Author
-
Francisco M. De La Vega
- Subjects
Minor allele frequency ,Linkage disequilibrium ,education.field_of_study ,Evolutionary biology ,Haplotype ,Population ,Single-nucleotide polymorphism ,Human genome ,Biology ,Association mapping ,education ,Genetic association - Abstract
With the aim of developing a linkage disequilibrium (LD) SNP map to serve as a resource for candidate-gene, candidate-region and whole-genome association studies, we have genotyped >250,000 SNPs on 90 DNA samples (45 African-American, 45 Caucasian, unrelated) selected from the Coriell Human variation collection. The individual genotypes thus generated have enabled us to survey the patterns of LD and haplotype diversity across all gene regions of the human genome. Here I describe the empirical results of the first comparative study of the patterns of LD across three entire human autosomes: Chromosomes 6, 21, and 22. We selected for the study a total of 17,966 SNPs covering more than 209 Mb of chromosomal segments, and overlapping 2,266 predicted gene regions, with a minor allele frequency greater than 10% in either population, and that were in Hardy-Weinberg equilibrium (p>0.01). Several methods to define ?haplotype blocks? were applied to this dataset, including several forms of the D? method and the 4-gamete rule. Haplotypes were then computationally inferred for the markers within each block by the EM algorithm to assess haplotype diversity. In addition, a subset of 277 SNPs spanning 4 Mb across the HLA region on chromosome 6 was genotyped on 550 DNA samples of unrelated individuals of European ancestry from north Germany, 93 samples from Norway, and 77 samples from UK. We analyze the robustness of the different haplotype block definitions, the differences between the population samples, and the effect of sample size on the generalization of haplotype blocks defined in one given population sample. Finally, I present the preliminary results of haplotype-based power calculations for case-control studies across the gene regions of these three chromosomes.
- Published
- 2004
- Full Text
- View/download PDF
19. Patterns of linkage disequilibrium in the MHC region on human chromosome 6p
- Author
-
Simone M. Guenther, Jochen Hampe, W. Andreas Koch, Francisco M. De La Vega, Stefan Schreiber, Annette Stenzel, Timothy T. Lu, and Michael Krawczak
- Subjects
Linkage disequilibrium ,Population ,Single-nucleotide polymorphism ,Major histocompatibility complex ,Polymorphism, Single Nucleotide ,Linkage Disequilibrium ,White People ,Major Histocompatibility Complex ,Germany ,Genetics ,Humans ,education ,Genetics (clinical) ,Recombination, Genetic ,education.field_of_study ,biology ,Norway ,Haplotype ,Tag SNP ,United Kingdom ,United States ,White (mutation) ,Black or African American ,Genetics, Population ,biology.protein ,Human genome ,Chromosomes, Human, Pair 6 - Abstract
Single nucleotide polymorphisms (SNPs) in the human genome are thought to be organised into blocks of high internal linkage disequilibrium (LD), separated by intermittent recombination hotspots. Since understanding haplotype structure is critical for an accurate assessment of inter-individual genetic differences, we investigated up to 968 SNPs from a 10-Mb region on chromosome 6p21, including the human major histocompatibility complex (MHC), in five different population samples (45-550 individuals). Regions of well-defined block structure were found to coexist alongside large areas lacking any clear structure; occasional long-range LD was observed in all five samples. The four white populations analysed were remarkably similar in terms of the extend and spatial distribution of local LD. In US African Americans, the distribution of LD was similar to that in the white populations but the observed haplotype diversity was higher. The existence of large regions without any clear block structure renders the systematic and thorough construction of SNP haplotype maps a crucial prerequisite for disease-association studies.
- Published
- 2003
20. ON THE POWER TO DETECT SNP/PHENOTYPE ASSOCIATION IN CANDIDATE QUANTITATIVE TRAIT LOCI GENOMIC REGIONS: A SIMULATION STUDY
- Author
-
Josep M. Comeron, Francisco M. De La Vega, and Martin Kreitman
- Subjects
Genetics ,education.field_of_study ,Expression quantitative trait loci ,Haplotype ,Population ,Single-nucleotide polymorphism ,Biology ,Quantitative trait locus ,Association mapping ,education ,Genetic architecture ,Coalescent theory - Abstract
We use coalescent methods to investigate the ability of linked neutral "markers" to reveal in simulated population samples the presence of one or more single nucleotide polymorphisms that is contributing to a trait having a complex genetic basis (QTN: quantitative trait nucleotide). Realistic mutation and recombination rates in our simulations allow us to generate SNP data appropriate for analyzing human variation across short chromosomal intervals corresponding to approximately 100 kilobases. We investigate the performance of both single marker and multiple-marker (haplotype) data for several ad hoc procedures. Our results with single SNP markers indicate that (1) the density of SNP markers need not be much higher than 10% in order to achieve near-maximal detection of a QTN; (2) a higher density of markers does not improve much on the ability to localize a QTN within an interval unless the recombination rate is high. Haplotype-based tests were investigated for the case in which more than one QTN is present in the studied interval. Larger sample sizes improve both the probability of detecting the haplotype with the largest number of QTNs, as well as the ability to infer correct haplotypes from genotypic data. Testing a series of short haplotypes across a longer interval can also be beneficial. The rate of false positives (i.e., when the most significant haplotype does not contain the greatest number of QTNs in the sample) can be very high when the contribution of individual QTNs to a trait is small. The elimination of low-frequency haplotypes does not substantially reduce the probability of detecting the haplotype with the largest number of QTNs but it can reduce the rate of false positives.
- Published
- 2002
- Full Text
- View/download PDF
21. HUMAN GENOME VARIATION: DISEASE, DRUG RESPONSE, AND CLINICAL PHENOTYPES
- Author
-
Julie A. Schneider, Francisco M. De La Vega, J. Claiborne Stephens, and Isaac S. Kohane
- Subjects
Genetics ,Gene expression profiling ,Annotation ,education.field_of_study ,Population ,Human genome ,Computational biology ,Disease ,Biology ,education ,Genotyping ,Gene ,Session (web analytics) - Abstract
With the completion of a rough draft of the human genome sequence in sight, researchers are shifting to leverage this new information in the elucidation of the genetic basis of disease susceptibility and drug response. Massive genotyping and gene expression profiling studies are being planned and carried out by both academic/public institutions and industry. Researchers from different disciplines are all interested in the mining of the data coming from those studies; human geneticists, population geneticists, molecular biologists, computational biologists and even clinical practitioners. These communities have different immediate goals, but at the end of the day what is sought is analogous: the connection between variation in a group of genes or in their expression and observed phenotypes. There is an imminent need to link information across the huge data sets these groups are producing independently. However, there are tremendous challenges in the integration of polymorphism and gene expression databases and their clinical phenotypic annotation This is the third session devoted to the computational challenges of human genome variation studies held at the Pacific Symposium on Biocomputing 1,2 . The focus of the session has been the presentation and discussion of new research that promises to facilitate the elucidation of the connections between genotypes and phenotypes using the data generated by high -throughput technologies. Nine accepted manuscripts comprise this year’s original work presented at the conference.
- Published
- 2001
- Full Text
- View/download PDF
22. FRI0566-PC Prevalence of arterial hypertension in outpatients with rheumatoid arthritis
- Author
-
N. Villa, J. Gamba, A. Villamil, E. Chiuzzi, M. de la Vega, A. Riopedre, A. Russo, D. Mata, C. Fecchio, C. Uña, MP Girard Bosch, G. Redondo, Osvaldo D. Messina, Oscar Rillo, Graciela Gómez, and F. Eraña
- Subjects
medicine.medical_specialty ,education.field_of_study ,business.industry ,Immunology ,Population ,medicine.disease ,General Biochemistry, Genetics and Molecular Biology ,Rheumatology ,Surgery ,Blood pressure ,Rheumatoid arthritis ,Internal medicine ,medicine ,Arterial stiffness ,Immunology and Allergy ,Risk factor ,business ,education ,Stroke ,Leflunomide ,medicine.drug - Abstract
Background Rheumatoid arthritis (RA) is associated with increased cardiovascular mortality due to myocardial infarction, stroke and heart failure. Both, chronic inflammation leading to arterial stiffness, and some of the drugs used to treat RA, such as corticosteroids and leflunomide, are risk factors for developing arterial hypertension (HT). Objectives The aim of this study is to determine the prevalence of HT in outpatients with RA at a rheumatology office in Buenos Aires, and describe their relationship with clinical, laboratory and disease activity Methods We evaluated consecutive outpatients with a diagnosis of RA according ACR 90 criteria that attend to the Rheumatology office of a public hospital in Buenos Aires. Blood pressure (BP) was assessed by 3 protocolized measurements, clinical data was collected and disease activity was evaluated by DAS28-ESD. Statistical analysis was performed to establish prevalence of hypertension and to establish association with clinical and laboratory variables using Mann Whitney and Chi Square test. Significance was p≤0.05. Results We analyzed 99 patients (85.9% female and 14.1% male) with a mean age of 51.3 years old (range 26-80). 79.8% had Functional Class I and II. The prevalence of hypertension in our population was 50.5%. The median time to progression of RA: 9.57 years, DAS 28: 4.06 (range: 1.54 to 7.55), 65.7% of individuals take corticosteroids. HT was associated with age (56.16 years (26-80) the HT patients versus 46.35 years (26-81) for non HT patients(p Conclusions Almost a half of RA patients had blood hypertension at the medical office during a standard control. It was significantly related to age and abdominal diameter. It was higher than the prevalence for the general population of Buenos Aires References Solomon D, Karlson E, Rimm E, Cannuscio C, Mandl M. Cardiovascular morbidity and mortality in women diagnosed with Rheumatoid arthritis. Circulation 2003;107:1303–7. Ferrante D, Virgolini M. Encuesta Nacional de Factores de Riesgo 2005: resultados prinicpales. Prevalencia de factores de riesgo de enfermedades cardiovasculares en la Argentina. Rev. Argent Cardiol. 2007; 75; 20-29. Hernandez- Hernandez R, Silva H, Velasco M, et al. Hypertension in seven Latin American cities: the cardiovascular Risk Factor Multiple Evaluation in Latin America (CARMELA) study. J. Hypertens 2010; 28; 24-34. Panoulas V, et. al. Prevalence and associations of hypertension and its control in patients with rheumatoid arthritis. Rheumatology 2007;46:1477–1482 Disclosure of Interest None Declared
- Published
- 2013
- Full Text
- View/download PDF
23. Analyses of a set of 128 ancestry informative single-nucleotide polymorphisms in a global set of 119 population samples
- Author
-
William C. Speed, Françoise R. Friedlaender, Andrew J. Pakstis, Francisco M. De La Vega, Judith R. Kidd, and Kenneth K. Kidd
- Subjects
Genetics ,education.field_of_study ,Research ,Population ,Genetic admixture ,Context (language use) ,Single-nucleotide polymorphism ,Ancestry-informative marker ,Biology ,Pathology and Forensic Medicine ,Sample size determination ,education ,Molecular Biology ,Allele frequency ,Selection (genetic algorithm) - Abstract
Background Using DNA to determine an individual's ancestry from among human populations is generally interesting and useful for many purposes, including admixture mapping, controlling for population structure in disease or trait association studies and forensic ancestry inference. However, to estimate ancestry, including possible admixture within an individual, as well as heterogeneity within a group of individuals, allele frequencies are necessary for what are believed to be the contributing populations. For this purpose, panels of ancestry informative markers (AIMs) have been developed. Results We are presenting our work on one such panel, composed of 128 ancestry informative single-nucleotide polymorphisms (AISNPs) already proposed in the literature. Compared to previous studies of these AISNPs, we have studied three times the number of individuals (4,871) in three times as many population samples (119). We have validated this panel for many ancestry assignment and admixture studies, especially those that were the rationale for the original selection of the 128 SNPs: African Americans and Mexican Americans. At the same time, the limitations of the panel for distinguishing ancestry and quantifying admixture among Eurasian populations are noted. Conclusion We demonstrate the simultaneous importance of the specific set of population samples and their relative sample sizes in the use of the structure program to determine which groups cluster together and consequently influence the ability of a marker panel to infer ancestry. We demonstrate the strengths and weaknesses of this particular panel of AISNPs in a global context.
- Published
- 2011
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.