42 results on '"Michael Feolo"'
Search Results
2. Identifying Datasets for Cross-Study Analysis in dbGaP using PhenX
- Author
-
Huaqin Pan, Vesselina Bakalov, Lisa Cox, Michelle L. Engle, Stephen W. Erickson, Michael Feolo, Yuelong Guo, Wayne Huggins, Stephen Hwang, Masato Kimura, Michelle Krzyzanowski, Josh Levy, Michael Phillips, Ying Qin, David Williams, Erin M. Ramos, and Carol M. Hamilton
- Subjects
Science - Abstract
Abstract Identifying relevant studies and harmonizing datasets are major hurdles for data reuse. Common Data Elements (CDEs) can help identify comparable study datasets and reduce the burden of retrospective data harmonization, but they have not been required, historically. The collaborative team at PhenX and dbGaP developed an approach to use PhenX variables as a set of CDEs to link phenotypic data and identify comparable studies in dbGaP. Variables were identified as either comparable or related, based on the data collection mode used to harmonize data across mapped datasets. We further added a CDE data field in the dbGaP data submission packet to indicate use of PhenX and annotate linkages in the future. Some 13,653 dbGaP variables from 521 studies were linked through PhenX variable mapping. These variable linkages have been made accessible for browsing and searching in the repository through dbGaP CDE-faceted search filter and the PhenX variable search tool. New features in dbGaP and PhenX enable investigators to identify variable linkages among dbGaP studies and reveal opportunities for cross-study analysis.
- Published
- 2022
- Full Text
- View/download PDF
3. GRAF-pop: A Fast Distance-Based Method To Infer Subject Ancestry from Multiple Genotype Datasets Without Principal Components Analysis
- Author
-
Yumi Jin, Alejandro A. Schaffer, Michael Feolo, J. Bradley Holmes, and Brandi L. Kattman
- Subjects
ancestry inference ,population structure ,admixture mapping ,GWAS ,barycentric coordinates ,Genetics ,QH426-470 - Abstract
Inferring subject ancestry using genetic data is an important step in genetic association studies, required for dealing with population stratification. It has become more challenging to infer subject ancestry quickly and accurately since large amounts of genotype data, collected from millions of subjects by thousands of studies using different methods, are accessible to researchers from repositories such as the database of Genotypes and Phenotypes (dbGaP) at the National Center for Biotechnology Information (NCBI). Study-reported populations submitted to dbGaP are often not harmonized across studies or may be missing. Widely-used methods for ancestry prediction assume that most markers are genotyped in all subjects, but this assumption is unrealistic if one wants to combine studies that used different genotyping platforms. To provide ancestry inference and visualization across studies, we developed a new method, GRAF-pop, of ancestry prediction that is robust to missing genotypes and allows researchers to visualize predicted population structure in color and in three dimensions. When genotypes are dense, GRAF-pop is comparable in quality and running time to existing ancestry inference methods EIGENSTRAT, FastPCA, and FlashPCA2, all of which rely on principal components analysis (PCA). When genotypes are not dense, GRAF-pop gives much better ancestry predictions than the PCA-based methods. GRAF-pop employs basic geometric and probabilistic methods; the visualized ancestry predictions have a natural geometric interpretation, which is lacking in PCA-based methods. Since February 2018, GRAF-pop has been successfully incorporated into the dbGaP quality control process to identify inconsistencies between study-reported and computationally predicted populations and to provide harmonized population values in all new dbGaP submissions amenable to population prediction, based on marker genotypes. Plots, produced by GRAF-pop, of summary population predictions are available on dbGaP study pages, and the software, is available at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/Software.cgi.
- Published
- 2019
- Full Text
- View/download PDF
4. Quickly identifying identical and closely related subjects in large databases using genotype data.
- Author
-
Yumi Jin, Alejandro A Schäffer, Stephen T Sherry, and Michael Feolo
- Subjects
Medicine ,Science - Abstract
Genome-wide association studies (GWAS) usually rely on the assumption that different samples are not from closely related individuals. Detection of duplicates and close relatives becomes more difficult both statistically and computationally when one wants to combine datasets that may have been genotyped on different platforms. The dbGaP repository at the National Center of Biotechnology Information (NCBI) contains datasets from hundreds of studies with over one million samples. There are many duplicates and closely related individuals both within and across studies from different submitters. Relationships between studies cannot always be identified by the submitters of individual datasets. To aid in curation of dbGaP, we developed a rapid statistical method called Genetic Relationship and Fingerprinting (GRAF) to detect duplicates and closely related samples, even when the sets of genotyped markers differ and the DNA strand orientations are unknown. GRAF extracts genotypes of 10,000 informative and independent SNPs from genotype datasets obtained using different methods, and implements quick algorithms that enable it to find all of the duplicate pairs from more than 880,000 samples within and across dbGaP studies in less than two hours. In addition, GRAF uses two statistical metrics called All Genotype Mismatch Rate (AGMR) and Homozygous Genotype Mismatch Rate (HGMR) to determine subject relationships directly from the observed genotypes, without estimating probabilities of identity by descent (IBD), or kinship coefficients, and compares the predicted relationships with those reported in the pedigree files. We implemented GRAF in a freely available C++ program of the same name. In this paper, we describe the methods in GRAF and validate the usage of GRAF on samples from the dbGaP repository. Other scientists can use GRAF on their own samples and in combination with samples downloaded from dbGaP.
- Published
- 2017
- Full Text
- View/download PDF
5. HLA diversity in the 1000 genomes dataset.
- Author
-
Pierre-Antoine Gourraud, Pouya Khankhanian, Nezih Cereb, Soo Young Yang, Michael Feolo, Martin Maiers, John D Rioux, Stephen Hauser, and Jorge Oksenberg
- Subjects
Medicine ,Science - Abstract
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC), only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD) decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies.
- Published
- 2014
- Full Text
- View/download PDF
6. Supplementing high-density SNP microarrays for additional coverage of disease-related genes: addiction as a paradigm.
- Author
-
Scott F Saccone, Laura J Bierut, Elissa J Chesler, Peter W Kalivas, Caryn Lerman, Nancy L Saccone, George R Uhl, Chuan-Yun Li, Vivek M Philip, Howard J Edenberg, Stephen T Sherry, Michael Feolo, Robert K Moyzis, and Joni L Rutter
- Subjects
Medicine ,Science - Abstract
Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.
- Published
- 2009
- Full Text
- View/download PDF
7. NCBI's Database of Genotypes and Phenotypes: dbGaP.
- Author
-
Kimberly A. Tryka, Luning Hao, Anne Sturcke, Yumi Jin, Zhen Y. Wang, Lora Ziyabari, Moira Lee, Natalia Popova, Nataliya Sharopova, Masato Kimura, and Michael Feolo
- Published
- 2014
- Full Text
- View/download PDF
8. Database resources of the National Center for Biotechnology Information.
- Author
-
Eric W. Sayers, Tanya Barrett, Dennis A. Benson, Evan Bolton, Stephen H. Bryant, Kathi Canese, Vyacheslav Chetvernin, Deanna M. Church, Michael DiCuccio, Scott Federhen, Michael Feolo, Ian M. Fingerman, Lewis Y. Geer, Wolfgang Helmberg, Yuri Kapustin, Sergey Krasnov, David Landsman, David J. Lipman, Zhiyong Lu, Thomas L. Madden, Tom Madej, Donna R. Maglott, Aron Marchler-Bauer, Vadim Miller, Ilene Karsch-Mizrachi, James Ostell, Anna R. Panchenko, Lon Phan, Kim D. Pruitt, Gregory D. Schuler, Edwin Sequeira, Stephen T. Sherry, Martin Shumway, Karl Sirotkin, Douglas J. Slotta, Alexandre Souvorov, Grigory Starchenko, Tatiana A. Tatusova, Lukas Wagner 0002, Yanli Wang, W. John Wilbur, Eugene Yaschenko, and Jian Ye
- Published
- 2012
- Full Text
- View/download PDF
9. Database resources of the National Center for Biotechnology Information.
- Author
-
Eric W. Sayers, Tanya Barrett, Dennis A. Benson, Evan Bolton, Stephen H. Bryant, Kathi Canese, Vyacheslav Chetvernin, Deanna M. Church, Michael DiCuccio, Scott Federhen, Michael Feolo, Ian M. Fingerman, Lewis Y. Geer, Wolfgang Helmberg, Yuri Kapustin, David Landsman, David J. Lipman, Zhiyong Lu, Thomas L. Madden, Tom Madej, Donna R. Maglott, Aron Marchler-Bauer, Vadim Miller, Ilene Mizrachi, James Ostell, Anna R. Panchenko, Lon Phan, Kim D. Pruitt, Gregory D. Schuler, Edwin Sequeira, Stephen T. Sherry, Martin Shumway, Karl Sirotkin, Douglas J. Slotta, Alexandre Souvorov, Grigory Starchenko, Tatiana A. Tatusova, Lukas Wagner 0002, Yanli Wang, W. John Wilbur, Eugene Yaschenko, and Jian Ye
- Published
- 2011
- Full Text
- View/download PDF
10. Database resources of the National Center for Biotechnology Information.
- Author
-
Eric W. Sayers, Tanya Barrett, Dennis A. Benson, Evan Bolton, Stephen H. Bryant, Kathi Canese, Vyacheslav Chetvernin, Deanna M. Church, Michael DiCuccio, Scott Federhen, Michael Feolo, Lewis Y. Geer, Wolfgang Helmberg, Yuri Kapustin, David Landsman, David J. Lipman, Zhiyong Lu, Thomas L. Madden, Tom Madej, Donna R. Maglott, Aron Marchler-Bauer, Vadim Miller, Ilene Mizrachi, James Ostell, Anna R. Panchenko, Kim D. Pruitt, Gregory D. Schuler, Edwin Sequeira, Stephen T. Sherry, Martin Shumway, Karl Sirotkin, Douglas J. Slotta, Alexandre Souvorov, Grigory Starchenko, Tatiana A. Tatusova, Lukas Wagner 0002, Yanli Wang, W. John Wilbur, Eugene Yaschenko, and Jian Ye
- Published
- 2010
- Full Text
- View/download PDF
11. Database resources of the National Center for Biotechnology Information.
- Author
-
Eric W. Sayers, Tanya Barrett, Dennis A. Benson, Stephen H. Bryant, Kathi Canese, Vyacheslav Chetvernin, Deanna M. Church, Michael DiCuccio, Ron Edgar, Scott Federhen, Michael Feolo, Lewis Y. Geer, Wolfgang Helmberg, Yuri Kapustin, David Landsman, David J. Lipman, Thomas L. Madden, Donna R. Maglott, Vadim Miller, Ilene Karsch-Mizrachi, James Ostell, Kim D. Pruitt, Gregory D. Schuler, Edwin Sequeira, Stephen T. Sherry, Martin Shumway, Karl Sirotkin, Alexandre Souvorov, Grigory Starchenko, Tatiana A. Tatusova, Lukas Wagner 0002, Eugene Yaschenko, and Jian Ye
- Published
- 2009
- Full Text
- View/download PDF
12. Database resources of the National Center for Biotechnology Information.
- Author
-
David L. Wheeler, Tanya Barrett, Dennis A. Benson, Stephen H. Bryant, Kathi Canese, Vyacheslav Chetvernin, Deanna M. Church, Michael DiCuccio, Ron Edgar, Scott Federhen, Michael Feolo, Lewis Y. Geer, Wolfgang Helmberg, Yuri Kapustin, Oleg Khovayko, David Landsman, David J. Lipman, Thomas L. Madden, Donna R. Maglott, Vadim Miller, James Ostell, Kim D. Pruitt, Gregory D. Schuler, Martin Shumway, Edwin Sequeira, Steven T. Sherry, Karl Sirotkin, Alexandre Souvorov, Grigory Starchenko, Roman L. Tatusov, Tatiana A. Tatusova, Lukas Wagner 0002, and Eugene Yaschenko
- Published
- 2008
- Full Text
- View/download PDF
13. GRAF-pop: A Fast Distance-Based Method To Infer Subject Ancestry from Multiple Genotype Datasets Without Principal Components Analysis
- Author
-
Brandi L. Kattman, Alejandro A. Schäffer, Yumi Jin, Michael Feolo, and J. Bradley Holmes
- Subjects
ancestry inference ,barycentric coordinates ,Population ,Inference ,Genetic admixture ,Genome-wide association study ,QH426-470 ,Investigations ,Biology ,computer.software_genre ,Population stratification ,03 medical and health sciences ,0302 clinical medicine ,Databases, Genetic ,Genetics ,Cluster Analysis ,Humans ,GWAS ,education ,Molecular Biology ,Genetic Association Studies ,Genetics (clinical) ,030304 developmental biology ,Genetic association ,Principal Component Analysis ,0303 health sciences ,education.field_of_study ,Reproducibility of Results ,population structure ,Subject (documents) ,Genetics, Population ,admixture mapping ,Principal component analysis ,Data mining ,computer ,Algorithms ,Software ,030217 neurology & neurosurgery ,Genome-Wide Association Study - Abstract
Inferring subject ancestry using genetic data is an important step in genetic association studies, required for dealing with population stratification. It has become more challenging to infer subject ancestry quickly and accurately since large amounts of genotype data, collected from millions of subjects by thousands of studies using different methods, are accessible to researchers from repositories such as the database of Genotypes and Phenotypes (dbGaP) at the National Center for Biotechnology Information (NCBI). Study-reported populations submitted to dbGaP are often not harmonized across studies or may be missing. Widely-used methods for ancestry prediction assume that most markers are genotyped in all subjects, but this assumption is unrealistic if one wants to combine studies that used different genotyping platforms. To provide ancestry inference and visualization across studies, we developed a new method, GRAF-pop, of ancestry prediction that is robust to missing genotypes and allows researchers to visualize predicted population structure in color and in three dimensions. When genotypes are dense, GRAF-pop is comparable in quality and running time to existing ancestry inference methods EIGENSTRAT, FastPCA, and FlashPCA2, all of which rely on principal components analysis (PCA). When genotypes are not dense, GRAF-pop gives much better ancestry predictions than the PCA-based methods. GRAF-pop employs basic geometric and probabilistic methods; the visualized ancestry predictions have a natural geometric interpretation, which is lacking in PCA-based methods. Since February 2018, GRAF-pop has been successfully incorporated into the dbGaP quality control process to identify inconsistencies between study-reported and computationally predicted populations and to provide harmonized population values in all new dbGaP submissions amenable to population prediction, based on marker genotypes. Plots, produced by GRAF-pop, of summary population predictions are available on dbGaP study pages, and the software, is available at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/Software.cgi.
- Published
- 2019
- Full Text
- View/download PDF
14. The sequencing-based typing tool of dbMHC: typing highly polymorphic gene sequences.
- Author
-
Wolfgang Helmberg, Raymond Dunivin, and Michael Feolo
- Published
- 2004
- Full Text
- View/download PDF
15. NCBI’s Database of Genotypes and Phenotypes: dbGaP
- Author
-
Zhen Y Wang, Kimberly A Tryka, Luning Hao, Yumi Jin, Lora Ziyabari, Moira Lee, Masato Kimura, Nataliya Sharopova, Natalia Popova, Michael Feolo, and Anne Sturcke
- Subjects
Internet ,Database ,Genotype ,National Library of Medicine (U.S.) ,Level data ,Biology ,computer.software_genre ,United States ,Metadata ,Genotype-phenotype distinction ,ComputingMethodologies_PATTERNRECOGNITION ,VI. Genomic variation, diseases and drugs ,Phenotype ,Databases, Genetic ,Genetics ,Humans ,computer - Abstract
The Database of Genotypes and Phenotypes (dbGap, http://www.ncbi.nlm.nih.gov/gap) is a National Institutes of Health-sponsored repository charged to archive, curate and distribute information produced by studies investigating the interaction of genotype and phenotype. Information in dbGaP is organized as a hierarchical structure and includes the accessioned objects, phenotypes (as variables and datasets), various molecular assay data (SNP and Expression Array data, Sequence and Epigenomic marks), analyses and documents. Publicly accessible metadata about submitted studies, summary level data, and documents related to studies can be accessed freely on the dbGaP website. Individual-level data are accessible via Controlled Access application to scientists across the globe.
- Published
- 2013
16. Integrated genome-wide analysis of expression quantitative trait loci aids interpretation of genomic association studies
- Author
-
Roby Joehanes, Daniel Levy, Quang Tri Nguyen, Poching Liu, Peter J. Munson, Xiaoling Zhang, Andrew D. Johnson, Josée Dupuis, Nataliya Sharopova, Kahraman Tanriverdi, Michael Feolo, Jane E. Freedman, Nancy L. Heard-Costa, Han Chen, Alejandro A. Schäffer, Richard J. Wang, Saixia Ying, Kimberly Woodhouse, Tianxiao Huan, Anne Sturcke, Chen Yao, Christopher J. O'Donnell, Nalini Raghavachari, and Cumhur Y Demirkale
- Subjects
0301 basic medicine ,Adult ,Male ,Quantitative Trait Loci ,Gene Expression ,Context (language use) ,Genome-wide association study ,Single-nucleotide polymorphism ,Biology ,Quantitative trait locus ,Regulatory Sequences, Nucleic Acid ,Web Browser ,Polymorphism, Single Nucleotide ,03 medical and health sciences ,Gene Frequency ,SNP ,Cluster Analysis ,Humans ,Genetic Predisposition to Disease ,Alleles ,Genetic association ,Aged ,Genetics ,Gene Expression Profiling ,Research ,Reproducibility of Results ,Genomics ,Middle Aged ,3. Good health ,Gene expression profiling ,MicroRNAs ,030104 developmental biology ,Expression quantitative trait loci ,Female ,Genome-Wide Association Study - Abstract
Background Identification of single nucleotide polymorphisms (SNPs) associated with gene expression levels, known as expression quantitative trait loci (eQTLs), may improve understanding of the functional role of phenotype-associated SNPs in genome-wide association studies (GWAS). The small sample sizes of some previous eQTL studies have limited their statistical power. We conducted an eQTL investigation of microarray-based gene and exon expression levels in whole blood in a cohort of 5257 individuals, exceeding the single cohort size of previous studies by more than a factor of 2. Results We detected over 19,000 independent lead cis-eQTLs and over 6000 independent lead trans-eQTLs, targeting over 10,000 gene targets (eGenes), with a false discovery rate (FDR)
- Published
- 2016
17. The International HapMap Project
- Author
-
Jessica Watkin, Stacey Gabriel, Norio Niikawa, Michael Boehnke, Lincoln Stein, Karen Kennedy, Mark Leppert, Renzong Qiu, John Stewart, Peter E. Chen, Panos Deloukas, Wei Huang, Deborah A. Nickerson, Fuli Yu, Sarah E. Hunt, Ming Xiao, Francis S. Collins, Fiona Cunningham, Stephen F. Schaffner, Yoshimitsu Fukushima, Jonathan Marchini, Troy Duster, Jane Peterson, Koki Sorimachi, Michael Feolo, Bruce S. Weir, Paul L'Archevêque, Raymond D. Miller, Hongguang Wang, Toyin Aniagwu, Mildred K. Cho, Darryl Macer, Qingrun Zhang, Paul K.H. Tam, Ardavan Kanani, Guy Bellemare, Thomas D. Willis, Mark Shillito, Martin Leboeuf, Lynn F. Zacharia, Pilar N. Ossorio, Charmaine D.M. Royal, Paul Hardenbol, Yusuke Nakamura, Maria Jasperse, Pui-Yan Kwok, Mark S. Guyer, Bin Liu, Leonid Kruglyak, Huanming Yang, Aravinda Chakravarti, John W. Belmont, Ellen Wright Clayton, Jane Rogers, Arnold Oliphant, Jack Spiegel, Houcan Zhang, Stephen T. Sherry, Vincent Ferretti, Julio Licinio, Toshihiro Tanaka, Richard R. Hudson, Mary M.Y. Waye, Lon R. Cardon, Elke Jordan, Gonçalo R. Abecasis, Kazuto Kato, Vivian Ota Wang, Gilean McVean, Lawrence M. Sung, Don Powell, Patricia A. Marshall, Patricia Spallone, Lan Yang Ch'Ang, Alastair Kent, James C. Mullikin, Eric S. Lander, Lucinda Fulton, Michael S. Phillips, Jeffrey Tze Fei Wong, David Valle, Fanny Chagnon, Semyon Kruglyak, Tatsuhiko Tsunoda, Hua Han, John P. Rice, David J. Cutler, Mark J. Daly, Peter Donnelly, Yan Shen, Jean E. McEwen, Andrew P. Morris, Richard Seabrook, Luana Galver, Thomas J. Hudson, Chibuzor Nkwodimmah, Clement Adebamowo, Lisa D. Brooks, Arthur L. Holden, Robert L. Nussbaum, David R. Bentley, Jeffrey C. Long, Nancy L. Saccone, Michael Dunn, Charles N. Rotimi, Sarah S. Murray, Richard A. Gibbs, Simon Myers, George M. Weinstock, Bartha Maria Knoppers, Takashi Fujita, Julie A. Douglas, Georgia M. Dunston, Richard K. Wilson, Sharon F. Terry, Kazuo Todani, Akihiro Sekine, Barbara Skene, Martin Godbout, David Altshuler, Bruce W. Birren, Lynn B. Jorde, Mark S. Chee, Olayemi Matthew, Erica Sodergren, Lap-Chee Tsui, Changqing Zeng, John C. Wallenburg, Missy Dixon, Gudmundur A. Thorisson, Ichiro Matsuda, Andrei Verner, Carl S. Kashuk, Eiji Yoshino, Patricia Taillon-Miller, Morris W. Foster, Satoshi Tanaka, Alexandre Montpetit, Yoichi Tanaka, Denise L. Lind, Eric H. Lai, Eiko Suda, and Shenghui Duan
- Subjects
Multidisciplinary ,Public Sector ,Base Sequence ,Genome, Human ,International Cooperation ,Racial Groups ,Genetic Variation ,Genomics ,Single-nucleotide polymorphism ,Computational biology ,DNA ,Biology ,Genome ,Polymorphism, Single Nucleotide ,Gene Frequency ,Haplotypes ,Humans ,Human genome ,Copy-number variation ,International HapMap Project ,Haplotype estimation ,Imputation (genetics) - Abstract
The goal of the International HapMap Project is to determine the common patterns of DNA sequence variation in the human genome and to make this information freely available in the public domain. An international consortium is developing a map of these patterns across the genome by determining the genotypes of one million or more sequence variants, their frequencies and the degree of association between them, in DNA samples from populations with ancestry from parts of Africa, Asia and Europe. The HapMap will allow the discovery of sequence variants that affect common disease, will facilitate development of diagnostic tools, and will enhance our ability to choose targets for therapeutic intervention. © 2003 Nature Publishing Group.
- Published
- 2016
18. Novel sequence feature variant type analysis of the HLA genetic association in systemic sclerosis
- Author
-
David S. DeLuca, Nishanth Marthandan, Michael Feolo, Effie Petersdorf, Wolfgang Helmberg, Paula A. Guidry, Frank C. Arnett, Glenys Thomson, Christopher J. Mungall, Barry Smith, John D. Reveille, Darren A. Natale, David R. Karp, Maureen D. Mayes, Steven G.E. Marsh, Richard H. Scheuermann, Karen Eilbeck, Bjoern Peters, Raymond Dunivin, Chul Ahn, Suzanna E. Lewis, Matthew J. Waller, and Alexander D. Diehl
- Subjects
Genetics ,Scleroderma, Systemic ,Association Studies Articles ,Haplotype ,Molecular Conformation ,Genetic Variation ,Peptide binding ,Single-nucleotide polymorphism ,HLA-DR Antigens ,General Medicine ,Human leukocyte antigen ,Biology ,HLA Antigens ,Genotype ,Genetic variation ,Humans ,Allele ,Molecular Biology ,Genetics (clinical) ,HLA-DRB1 Chains ,Genetic association - Abstract
We describe a novel approach to genetic association analyses with proteins sub-divided into biologically relevant smaller sequence features (SFs), and their variant types (VTs). SFVT analyses are particularly informative for study of highly polymorphic proteins such as the human leukocyte antigen (HLA), given the nature of its genetic variation: the high level of polymorphism, the pattern of amino acid variability, and that most HLA variation occurs at functionally important sites, as well as its known role in organ transplant rejection, autoimmune disease development and response to infection. Further, combinations of variable amino acid sites shared by several HLA alleles (shared epitopes) are most likely better descriptors of the actual causative genetic variants. In a cohort of systemic sclerosis patients/controls, SFVT analysis shows that a combination of SFs implicating specific amino acid residues in peptide binding pockets 4 and 7 of HLA-DRB1 explains much of the molecular determinant of risk.
- Published
- 2009
- Full Text
- View/download PDF
19. The NCBI dbGaP database of genotypes and phenotypes
- Author
-
Michael Kimelman, Yumi Jin, Moira Lee, A S Graeff, Masato Kimura, Rinat Bagoutdinov, James Ostell, Stephen T. Sherry, Anne Kiang, Jeffrey M. Beck, Eugene Yaschenko, Minghong Ward, Michael Feolo, Stephanie Pretel, Zhen Y Wang, Matthew D. Mailman, Yu Shao, Sergey Shevelev, Michael Kholodov, Don Preuss, Karl Sirotkin, Natalia Popova, Kerry L. Zbicz, Justin Paschall, Luning Hao, Lora Ziyabari, Kimberly A Tryka, and Lon Phan
- Subjects
Genetics ,Databases, Factual ,Genotype ,National Library of Medicine (U.S.) ,Computational Biology ,Biology ,Public repository ,United States ,Article ,Unique identifier ,Phenotype ,Data sequences ,Genotype-phenotype distinction ,Databases, Genetic ,Trait - Abstract
The National Center for Biotechnology Information has created the dbGaP public repository for individual-level phenotype, exposure, genotype and sequence data and the associations between them. dbGaP assigns stable, unique identifiers to studies and subsets of information from those studies, including documents, individual phenotypic variables, tables of trait data, sets of genotype data, computed phenotype-genotype associations, and groups of study subjects who have given similar consents for use of their data.
- Published
- 2007
- Full Text
- View/download PDF
20. 14th International HLA and Immunogenetics Workshop: Report on the HLA component of type 1 diabetes
- Author
-
Ann R. Steenkiste, Wolfgang Helmberg, G. Schoch, Patrick Concannon, Janelle A. Noble, Ana M. Valdes, Alberto Pugliese, J. S. Dorman, Michael Feolo, John A. Hansen, Glenys Thomson, and Douglas W. Hoffman
- Subjects
Genetics ,Linkage disequilibrium ,Histocompatibility Antigens Class I ,Immunology ,Haplotype ,Histocompatibility Antigens Class II ,General Medicine ,Immunogenetics ,Human leukocyte antigen ,Biology ,Biochemistry ,Histocompatibility ,Diabetes Mellitus, Type 1 ,HLA Antigens ,Humans ,Immunology and Allergy ,Genetic Predisposition to Disease ,Allele ,Age of onset ,HLA Complex - Abstract
The type 1 diabetes (T1D) component of the 13th International Histocompatibility Workshop (IHW) obtained microsatellite (msat) and human leukocyte antigen (HLA)-DR/DQ data on case/control and family samples through an international collaboration. The aim was to detect the effects of susceptibility loci on the HLA complex independent of the primary determinants in the class II region (HLA-DR/DQ). As part of the activity of the 14th International HLA and Immunogenetics Workshop (14th IHIWS), a T1D workshop was held to present analyses of the 13th IHW data and to discuss the current status of knowledge about the genetics of T1D. These data are now available online through dbMHC, a web-based resource established by the National Center for Biotechnology. Continuing work since the 13th IHW has resulted in published work showing heterogeneity of DR3 haplotypes in data sets from the 13th IHW and Human Biological Data Interchange (HBDI). In addition, we identified markers that define DRB1*1501 DQB1*0602 haplotypes conferring reduced protection from diabetes in a Swedish 13th IHW data set. Further analyses of the 13th IHW data set not only showed some significant results but also demonstrated extensive heterogeneity reminiscent of non-HLA genes. The haplotype analysis in HBDI families identified two msats with significant effects on susceptibility and statistically significant age of onset effects at class III markers that are not because of linkage disequilibrium, with class I alleles known to affect age of onset. The above studies underscore the importance of refining our understanding of susceptibility associated with genes in the HLA complex.
- Published
- 2007
- Full Text
- View/download PDF
21. The dbMHC microsatellite portal: a public resource for the storage and display of MHC microsatellite information
- Author
-
Anne Cambon-Thomsen, Wolfgang Helmberg, Michael Feolo, Pierre-Antoine Gourraud, and Douglas W. Hoffman
- Subjects
Genetic Markers ,Genetics ,Internet ,biology ,Sequence analysis ,Immunology ,Single-nucleotide polymorphism ,Locus (genetics) ,General Medicine ,Human leukocyte antigen ,Computational biology ,Major histocompatibility complex ,Biochemistry ,Histocompatibility ,Major Histocompatibility Complex ,Databases, Genetic ,biology.protein ,Animals ,Humans ,Immunology and Allergy ,Microsatellite ,Primer (molecular biology) ,Microsatellite Repeats - Abstract
Major histocompatibility complex (MHC) region Microsatellites (Msat) have been extensively used in various applications, such as disease mapping, forensics, and population genetics. A comprehensive review of HLA Msat primers has been previously published based on literature and sequence analysis, but electronic tools are lacking to make it easily accessible and actually used by the community. We have integrated data from this review, with an overlapping set of 31 Msat markers used in the 13th International Histocompatibility and Immunogenetics Workshop (IHIWS) to create a public archive that will synchronize published descriptions to a common framework. http://www.ncbi.nlm.nih.gov/projects/mhc. Currently, the dbMHC contains 389 primer pairs across the extended MHC targeting 281 distinct repeat regions (approximately 1/45 kb). Literature review and analysis of the primers reveal that over 200 synonymous names have been published for these markers. Users may view or download specific Msat data sets using the portal. Query options include name or partial name, primer sequence, neighboring genes, and/or position. Query results include locus name(s), a graphic showing of the relative location of the marker in relation to the classical HLA genes, a listing of the constituent primer pairs and name, a link to UniSTS, aliases, allele range (bp), overlapping single nucleotide polymorphisms, a link to e-polymerase chain reaction, and physical mapping information. To increase the utility of this resource, researchers using Msat markers in the HLA region are encouraged by the authors to submit new primers to the dbMHC. The minimal Msat submission consists of primers sequences, a submitter's name and contact information. Additional information recommended but not required is the laboratory protocol(s), known allele size range (bp), known aliases, and an exemplar sequence. Assigned UniSTS numbers can be used for primer pair standard identification.
- Published
- 2006
- Full Text
- View/download PDF
22. A haplotype map of the human genome
- Author
-
Mark Leppert, Aravinda Chakravarti, Charmaine D.M. Royal, Sarah S. Murray, Renzong Qiu, Panos Deloukas, Renwu Wang, David A. Hinds, Barbara E. Stranger, Xiaoli Tang, Huanming Yang, John W. Belmont, Nigel P. Carter, Huy Nguyen, William Mak, Kazuto Kato, Shiran Pasternak, Chaohua Li, Jeffrey C. Barrett, Lon R. Cardon, Vincent Ferretti, Atsushi Nagashima, Peter E. Chen, Stephen F. Schaffner, Hongbo Fu, Zhu Chen, Siqi Liu, John Burton, Paul Hardenbol, Gudmundur A. Thorisson, Yusuke Nakamura, Mark Griffiths, Imtiaz Yakub, Eiko Suda, Gonçalo R. Abecasis, Carl S. Kashuk, Qingrun Zhang, Yoshimitsu Fukushima, Karen Kennedy, Sarah E. Hunt, Yi Wang, Norio Niikawa, Ichiro Matsuda, Lynn F. Zacharia, Lalitha Krishnan, Zhen Wang, Stéphanie Roumy, C M Clee, David J. Cutler, Albert V. Smith, Lincoln Stein, Simon Myers, Jane Peterson, Jun Zhou, Yozo Ohnishi, Weihua Guan, Matthew Stephens, Xiaoyan Xiong, Julian Maller, Houcan Zhang, Pui-Yan Kwok, Mark S. Guyer, Liuda Ziaugra, Jonathan Witonsky, Matthew C. Jones, Stacey Gabriel, You-Qiang Song, Daochang An, Haifeng Wang, Gilean McVean, Lawrence M. Sung, Zhijian Yao, Yan Shen, Yangfan Liu, George M. Weinstock, Ludmila Pawlikowska, Erica Sodergren, Mark T. Ross, Andrew Boudreau, Toshihiro Tanaka, Thomas D. Willis, Weitao Hu, Kelly A. Frazer, Li Jin, Robert W. Plumb, Paul I.W. de Bakker, Hongbin Zhao, Wei Lin, Sarah Sims, Richard A. Gibbs, Maura Faggart, Michael Feolo, Dennis G. Ballinger, Xun Chu, Lucinda Fulton, Marcos Delgado, Ellen Winchester, Wei Huang, Fuli Yu, Christianne R. Bird, Shaun Purcell, Jessica Roy, Dongmei Cai, Launa M. Galver, Bartha Maria Knoppers, Emmanouil T. Dermitzakis, Gao Yang, Takashi Morizono, Rachel Barry, Kirsten McLay, Daryl J. Thomas, Steve McCarroll, Jonathan Marchini, Daniel J. Richter, Andy Peiffer, Patricia Taillon-Miller, Richard K. Wilson, Stephen Kwok-Wing Tsui, Jian-Bing Fan, Lisa D. Brooks, Laura L. Stuve, Paul L'Archevêque, David M. Evans, Clémentine Sallée, Peter Donnelly, Hong Xue, Hui Zhao, Charles N. Rotimi, Jean E. McEwen, J. Tze Fei Wong, Hao Pan, Alastair Kent, Brendan Blumenstiel, Qing Li, Weiwei Sun, L. Kang, Colin Freeman, John Stewart, Chibuzor Nkwodimmah, Morris W. Foster, Don Powell, Leonardo Bottolo, Raymond D. Miller, Stephen T. Sherry, Francis S. Collins, Donna M. Muzny, Jun Yu, Ike Ajayi, Hua Han, Pardis C. Sabeti, Hongguang Wang, Takahisa Kawaguchi, Tatsuhiko Tsunoda, Guy Bellemare, Zhaohui S. Qin, H. B. Hu, Jane Rogers, Thomas J. Hudson, Mark J. Daly, Andrew P. Morris, Supriya Gupta, Ming Xiao, Patrick Varilly, Nick Patterson, Akihiro Sekine, Chris C. A. Spencer, Jonathan Morrison, Missy Dixon, Paul K.H. Tam, Jian Wang, Matthew Defelice, Susana Eyheramendy, Michael Shi, Yungang He, Ellen Wright Clayton, Richa Saxena, Heather M. Munro, Arthur L. Holden, Yayun Shen, Christine P. Bird, Bruce W. Birren, Itsik Pe'er, David R. Bentley, Lynne V. Nazareth, Pamela Whittaker, Pak C. Sham, Amy L. Camargo, David A. Wheeler, Koji Saeki, Martin Godbout, David Altshuler, Liang Xu, Ying Wang, David Willey, Alexandre Montpetit, Shin Lin, Michael S. Phillips, Changqing Zeng, Clement Adebamowo, John C. Wallenburg, Mark S. Chee, Ben Fry, Erich Stahl, Melissa Parkin, Rhian Gwilliam, Andrei Verner, Patrick J. Nailer, Lap-Chee Tsui, Bo Zhang, Fanny Chagnon, David R. Cox, Jack Spiegel, Jamie Moore, Vivian Ota Wang, Patricia A. Marshall, Takuya Kitamoto, Bruce S. Weir, Darryl Macer, Geraldine M. Clarke, Robert C. Onofrio, Mary M.Y. Waye, Wei Wang, Suzanne M. Leal, James C. Mullikin, Toyin Aniagwu, Daniel C. Koboldt, Mary Goyette, Martin Leboeuf, Isaac F. Adewole, Ruth Jamieson, Arnold Oliphant, Jessica Watkin, and Jean François Olivier
- Subjects
Linkage disequilibrium ,Biology ,DNA, Mitochondrial ,Polymorphism, Single Nucleotide ,Article ,Linkage Disequilibrium ,Structural variation ,Gene Frequency ,Humans ,Selection, Genetic ,International HapMap Project ,Genetic association ,Haplotypes - genetics ,Recombination, Genetic ,Genetics ,Chromosomes, Human, Y ,Multidisciplinary ,Genome, Human ,DNA, Mitochondrial - genetics ,Haplotype ,Tag SNP ,Polymorphism, Single Nucleotide - genetics ,Haplotypes ,Human genome ,Haplotype estimation ,Chromosomes, Human, Y - genetics - Abstract
Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution. © 2005 Nature Publishing Group., link_to_OA_fulltext
- Published
- 2005
- Full Text
- View/download PDF
23. P1‐350: ALZHEIMER'S DISEASE SEQUENCING PROJECT DATA PORTAL
- Author
-
Michael Feolo, Otto Valladares, Kurt Rodarmer, Daniel Laufer, Li-San Wang, Georgy Godynskiy, Daniel Micah Childress, and Amanda Partch
- Subjects
Psychiatry and Mental health ,Cellular and Molecular Neuroscience ,Data portal ,Developmental Neuroscience ,Epidemiology ,business.industry ,Health Policy ,Medicine ,Neurology (clinical) ,Disease ,Geriatrics and Gerontology ,business ,Bioinformatics - Published
- 2014
- Full Text
- View/download PDF
24. Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol
- Author
-
Leslie A. Lange, Youna Hu, He Zhang, Chenyi Xue, Ellen M. Schmidt, Zheng-Zheng Tang, Chris Bizon, Ethan M. Lange, Joshua D. Smith, Emily H. Turner, Goo Jun, Hyun Min Kang, Gina Peloso, Paul Auer, Kuo-ping Li, Jason Flannick, Ji Zhang, Christian Fuchsberger, Kyle Gaulton, Cecilia Lindgren, Adam Locke, Alisa Manning, Xueling Sim, Manuel A. Rivas, Oddgeir L. Holmen, Omri Gottesman, Yingchang Lu, Douglas Ruderfer, Eli A. Stahl, Qing Duan, Yun Li, Peter Durda, Shuo Jiao, Aaron Isaacs, Albert Hofman, Joshua C. Bis, Adolfo Correa, Michael E. Griswold, Johanna Jakobsdottir, Albert V. Smith, Pamela J. Schreiner, Mary F. Feitosa, Qunyuan Zhang, Jennifer E. Huffman, Jacy Crosby, Christina L. Wassel, Ron Do, Nora Franceschini, Lisa W. Martin, Jennifer G. Robinson, Themistocles L. Assimes, David R. Crosslin, Elisabeth A. Rosenthal, Michael Tsai, Mark J. Rieder, Deborah N. Farlow, Aaron R. Folsom, Thomas Lumley, Ervin R. Fox, Christopher S. Carlson, Ulrike Peters, Rebecca D. Jackson, Cornelia M. van Duijn, André G. Uitterlinden, Daniel Levy, Jerome I. Rotter, Herman A. Taylor, Vilmundur Gudnason, David S. Siscovick, Myriam Fornage, Ingrid B. Borecki, Caroline Hayward, Igor Rudan, Y. Eugene Chen, Erwin P. Bottinger, Ruth J.F. Loos, Pål Sætrom, Kristian Hveem, Michael Boehnke, Leif Groop, Mark McCarthy, Thomas Meitinger, Christie M. Ballantyne, Stacey B. Gabriel, Christopher J. O’Donnell, Wendy S. Post, Kari E. North, Alexander P. Reiner, Eric Boerwinkle, Bruce M. Psaty, David Altshuler, Sekar Kathiresan, Dan-Yu Lin, Gail P. Jarvik, L. Adrienne Cupples, Charles Kooperberg, James G. Wilson, Deborah A. Nickerson, Goncalo R. Abecasis, Stephen S. Rich, Russell P. Tracy, Cristen J. Willer, David M. Altshuler, Gonçalo R. Abecasis, Hooman Allayee, Sharon Cresci, Mark J. Daly, Paul I.W. de Bakker, Mark A. DePristo, Peter Donnelly, Tim Fennell, Kiran Garimella, Stanley L. Hazen, Daniel M. Jordan, Adam Kiezun, Guillaume Lettre, Bingshan Li, Mingyao Li, Christopher H. Newton-Cheh, Sandosh Padmanabhan, Sara Pulit, Daniel J. Rader, David Reich, Muredach P. Reilly, Steve Schwartz, Laura Scott, John A. Spertus, Nathaniel O. Stitziel, Nina Stoletzki, Shamil R. Sunyaev, Benjamin F. Voight, Ermeg Akylbekova, Larry D. Atwood, Maja Barbalic, R. Graham Barr, Emelia J. Benjamin, Joshua Bis, Donald W. Bowden, Jennifer Brody, Matthew Budoff, Greg Burke, Sarah Buxbaum, Jeff Carr, Donna T. Chen, Ida Y. Chen, Wei-Min Chen, Pat Concannon, Ralph D’Agostino, Anita L. DeStefano, Albert Dreisbach, Josée Dupuis, J. Peter Durda, Jaclyn Ellis, Caroline S. Fox, Ervin Fox, Vincent Funari, Santhi K. Ganesh, Julius Gardin, David Goff, Ora Gordon, Wayne Grody, Myron Gross, Xiuqing Guo, Ira M. Hall, Nancy L. Heard-Costa, Susan R. Heckbert, Nicholas Heintz, David M. Herrington, DeMarc Hickson, Jie Huang, Shih-Jen Hwang, David R. Jacobs, Nancy S. Jenny, Andrew D. Johnson, Craig W. Johnson, Steven Kawut, Richard Kronmal, Raluca Kurz, Martin G. Larson, Mark Lawson, Cora E. Lewis, Dalin Li, Honghuang Lin, Chunyu Liu, Jiankang Liu, Kiang Liu, Xiaoming Liu, Yongmei Liu, William T. Longstreth, Cay Loria, Kathryn Lunetta, Aaron J. Mackey, Rachel Mackey, Ani Manichaikul, Taylor Maxwell, Barbara McKnight, James B. Meigs, Alanna C. Morrison, Solomon K. Musani, Josyf C. Mychaleckyj, Jennifer A. Nettleton, Kari North, Daniel O’Leary, Frank Ong, Walter Palmas, James S. Pankow, Nathan D. Pankratz, Shom Paul, Marco Perez, Sharina D. Person, Joseph Polak, Aaron R. Quinlan, Leslie J. Raffel, Vasan S. Ramachandran, Kenneth Rice, Jill P. Sanders, Pamela Schreiner, Sudha Seshadri, Steve Shea, Stephen Sidney, Kevin Silverstein, Nicholas L. Smith, Nona Sotoodehnia, Asoke Srinivasan, Kent Taylor, Fridtjof Thomas, Michael Y. Tsai, Kelly A. Volcik, Chrstina L. Wassel, Karol Watson, Gina Wei, Wendy White, Kerri L. Wiggins, Jemma B. Wilk, O. Dale Williams, Gregory Wilson, Phillip Wolf, Neil A. Zakai, John Hardy, James F. Meschia, Michael Nalls, Andrew Singleton, Brad Worrall, Michael J. Bamshad, Kathleen C. Barnes, Ibrahim Abdulhamid, Frank Accurso, Ran Anbar, Terri Beaty, Abigail Bigham, Phillip Black, Eugene Bleecker, Kati Buckingham, Anne Marie Cairns, Daniel Caplan, Barbara Chatfield, Aaron Chidekel, Michael Cho, David C. Christiani, James D. Crapo, Julia Crouch, Denise Daley, Anthony Dang, Hong Dang, Alicia De Paula, Joan DeCelie-Germana, Allen DozorMitch Drumm, Maynard Dyson, Julia Emerson, Mary J. Emond, Thomas Ferkol, Robert Fink, Cassandra Foster, Deborah Froh, Li Gao, William Gershan, Ronald L. Gibson, Elizabeth Godwin, Magdalen Gondor, Hector Gutierrez, Nadia N. Hansel, Paul M. Hassoun, Peter Hiatt, John E. Hokanson, Michelle Howenstine, Laura K. Hummer, Jamshed Kanga, Yoonhee Kim, Michael R. Knowles, Michael Konstan, Thomas Lahiri, Nan Laird, Christoph Lange, Lin Lin, Xihong Lin, Tin L. Louie, David Lynch, Barry Make, Thomas R. Martin, Steve C. Mathai, Rasika A. Mathias, John McNamara, Sharon McNamara, Deborah Meyers, Susan Millard, Peter Mogayzel, Richard Moss, Tanda Murray, Dennis Nielson, Blakeslee Noyes, Wanda O’Neal, David Orenstein, Brian O’Sullivan, Rhonda Pace, Peter Pare, H. Worth Parker, Mary Ann Passero, Elizabeth Perkett, Adrienne Prestridge, Nicholas M. Rafaels, Bonnie Ramsey, Elizabeth Regan, Clement Ren, George Retsch-Bogart, Michael Rock, Antony Rosen, Margaret Rosenfeld, Ingo Ruczinski, Andrew Sanford, David Schaeffer, Cindy Sell, Daniel Sheehan, Edwin K. Silverman, Don Sin, Terry Spencer, Jackie Stonebraker, Holly K. Tabor, Laurie Varlotta, Candelaria I. Vergara, Robert Weiss, Fred Wigley, Robert A. Wise, Fred A. Wright, Mark M. Wurfel, Robert Zanni, Fei Zou, Phil Green, Jay Shendure, Joshua M. Akey, Carlos D. Bustamante, Evan E. Eichler, P. Keolu Fox, Wenqing Fu, Adam Gordon, Simon Gravel, Jill M. Johnsen, Mengyuan Kan, Eimear E. Kenny, Jeffrey M. Kidd, Fremiet Lara-Garduno, Suzanne M. Leal, Dajiang J. Liu, Sean McGee, Timothy D. O’Connor, Bryan Paeper, Peggy D. Robertson, Jeffrey C. Staples, Jacob A. Tennessen, Gao Wang, Qian Yi, Rebecca Jackson, Garnet Anderson, Hoda Anton-Culver, Paul L. Auer, Shirley Beresford, Henry Black, Robert Brunner, Robert Brzyski, Dale Burwen, Bette Caan, Cara L. Carty, Rowan Chlebowski, Steven Cummings, J. David Curb, Charles B. Eaton, Leslie Ford, Stephanie M. Fullerton, Margery Gass, Nancy Geller, Gerardo Heiss, Barbara V. Howard, Li Hsu, Carolyn M. Hutter, John Ioannidis, Karen C. Johnson, Lewis Kuller, Andrea LaCroix, Kamakshi Lakshminarayan, Dorothy Lane, Norman Lasser, Erin LeBlanc, Kuo-Ping Li, Marian Limacher, Benjamin A. Logsdon, Shari Ludlam, JoAnn E. Manson, Karen Margolis, Lisa Martin, Joan McGowan, Keri L. Monda, Jane Morley Kotchen, Lauren Nathan, Judith Ockene, Mary Jo O’Sullivan, Lawrence S. Phillips, Ross L. Prentice, John Robbins, Jacques E. Rossouw, Haleh Sangi-Haghpeykar, Gloria E. Sarto, Sally Shumaker, Michael S. Simon, Marcia L. Stefanick, Evan Stein, Hua Tang, Kira C. Taylor, Cynthia A. Thomson, Timothy A. Thornton, Linda Van Horn, Mara Vitolins, Jean Wactawski-Wende, Robert Wallace, Sylvia Wassertheil-Smoller, Donglin Zeng, Deborah Applebaum-Bowden, Michael Feolo, Weiniu Gan, Dina N. Paltoo, Phyliss Sholinsky, Anne Sturcke, Epidemiology, and Internal Medicine
- Subjects
Male ,Genome-wide association study ,030204 cardiovascular system & hematology ,Cohort Studies ,0302 clinical medicine ,Gene Frequency ,Receptors ,Genotype ,Dyslipidemias/blood ,Receptors, LDL/genetics ,Genetics(clinical) ,Exome ,Genetics (clinical) ,Exome sequencing ,Genetics ,0303 health sciences ,Serine Endopeptidases ,Single Nucleotide ,Middle Aged ,3. Good health ,Cholesterol ,Phenotype ,Genetic Code ,Cholesterol, LDL/genetics ,Female ,lipids (amino acids, peptides, and proteins) ,Proprotein Convertases ,Proprotein Convertase 9 ,Sequence Analysis ,Adult ,Apolipoproteins E/blood ,LDL/genetics ,Serine Endopeptidases/genetics ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Article ,03 medical and health sciences ,Apolipoproteins E ,SDG 3 - Good Health and Well-being ,Humans ,Polymorphism ,Allele frequency ,030304 developmental biology ,Genetic association ,Aged ,Dyslipidemias ,PCSK9 ,DNA ,Cholesterol, LDL ,Lipase ,Sequence Analysis, DNA ,Receptors, LDL ,Lipase/genetics ,Proprotein Convertases/genetics ,Follow-Up Studies ,Genome-Wide Association Study - Abstract
Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98(th) or
- Published
- 2014
25. HLA diversity in the 1000 genomes dataset
- Author
-
Soo Young Yang, Pierre-Antoine Gourraud, Nezih Cereb, Martin Maiers, Stephen L. Hauser, Pouya Khankhanian, John D. Rioux, Jorge R. Oksenberg, Michael Feolo, and Colombo, Gualtiero I
- Subjects
Linkage disequilibrium ,lcsh:Medicine ,Genome ,Linkage Disequilibrium ,Major Histocompatibility Complex ,0302 clinical medicine ,HLA Antigens ,Databases, Genetic ,Human Genome Project ,Medicine and Health Sciences ,2.1 Biological and endogenous factors ,Aetiology ,lcsh:Science ,Genetics ,Sanger sequencing ,0303 health sciences ,education.field_of_study ,Principal Component Analysis ,Multidisciplinary ,Ecology ,Histocompatibility Testing ,Single Nucleotide ,Genomics ,Genomic Databases ,030220 oncology & carcinogenesis ,symbols ,Human ,Biotechnology ,Research Article ,Genotype ,Ecological Metrics ,General Science & Technology ,Population Size ,Population ,Immunology ,Human leukocyte antigen ,Biology ,Polymorphism, Single Nucleotide ,03 medical and health sciences ,symbols.namesake ,Databases ,Genetic ,Effective Population Size ,Humans ,1000 Genomes Project ,Polymorphism ,education ,Alleles ,030304 developmental biology ,Comparative genomics ,Evolutionary Biology ,Genome, Human ,Haplotype ,lcsh:R ,Human Genome ,Genetic Variation ,Biology and Life Sciences ,Computational Biology ,Genome Analysis ,Haplotypes ,Genetic Polymorphism ,lcsh:Q ,Clinical Immunology ,Population Genetics - Abstract
The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation by sequencing at a level that should allow the genome-wide detection of most variants with frequencies as low as 1%. However, in the major histocompatibility complex (MHC), only the top 10 most frequent haplotypes are in the 1% frequency range whereas thousands of haplotypes are present at lower frequencies. Given the limitation of both the coverage and the read length of the sequences generated by the 1000 Genomes Project, the highly variable positions that define HLA alleles may be difficult to identify. We used classical Sanger sequencing techniques to type the HLA-A, HLA-B, HLA-C, HLA-DRB1 and HLA-DQB1 genes in the available 1000 Genomes samples and combined the results with the 103,310 variants in the MHC region genotyped by the 1000 Genomes Project. Using pairwise identity-by-descent distances between individuals and principal component analysis, we established the relationship between ancestry and genetic diversity in the MHC region. As expected, both the MHC variants and the HLA phenotype can identify the major ancestry lineage, informed mainly by the most frequent HLA haplotypes. To some extent, regions of the genome with similar genetic or similar recombination rate have similar properties. An MHC-centric analysis underlines departures between the ancestral background of the MHC and the genome-wide picture. Our analysis of linkage disequilibrium (LD) decay in these samples suggests that overestimation of pairwise LD occurs due to a limited sampling of the MHC diversity. This collection of HLA-specific MHC variants, available on the dbMHC portal, is a valuable resource for future analyses of the role of MHC in population and disease studies. © 2014 Gourraud et al.
- Published
- 2013
- Full Text
- View/download PDF
26. Assessing and Managing Risk when Sharing Aggregate Genetic Variant Data
- Author
-
David Craig, Teri A. Manolio, Robert M. Goor, Justin Paschall, Zhenyuan Wang, Michael Feolo, Jim Ostell, and Stephen T. Sherry
- Subjects
Genetics ,Data sharing ,Aggregate (data warehouse) ,Genetic variants ,Genetic data ,Biology ,Molecular Biology ,Allele frequency ,Data science ,Genetics (clinical) ,Article ,Genetic association - Abstract
Access to genetic data across studies is an important aspect of identifying new genetic associations through genome-wide association studies (GWASs). Meta-analysis across multiple GWASs with combined cohort sizes of tens of thousands of individuals often uncovers many more genome-wide associated loci than the original individual studies; this emphasizes the importance of tools and mechanisms for data sharing. However, even sharing summary-level data, such as allele frequencies, inherently carries some degree of privacy risk to study participants. Here we discuss mechanisms and resources for sharing data from GWASs, particularly focusing on approaches for assessing and quantifying the privacy risks to participants that result from the sharing of summary-level data.
- Published
- 2011
27. Databases in Human and Medical Genetics
- Author
-
Peter D. Stenson, David Neil Cooper, Donna Maglott, Johan T. den Dunnen, Helen V. Firth, Stephen T. Sherry, Roberta A Pagon, Michael Feolo, and Ada Hamosh
- Subjects
medicine.medical_specialty ,Database ,business.industry ,Entrez Gene ,Genetic counseling ,information science ,Locus (genetics) ,Computational biology ,Gene mutation ,computer.software_genre ,OMIM : Online Mendelian Inheritance in Man ,Medicine ,Medical genetics ,DECIPHER ,Ensembl ,natural sciences ,business ,computer ,health care economics and organizations - Abstract
This chapter provides an introduction to the major, freely available, Internet-accessible databases in human and medical genetics used by healthcare providers in the diagnosis, management, and genetic counseling of persons with inherited disorders and their families, as well as by researchers for gene discovery, recording allelic variants, and cataloging genotype-phenotype relationships. Databases discussed include: GeneTests (view: www.genetests.org); Online Mendelian Inheritance in Man (view: www.ncbi.nlm.nih.gov/Omim); locus specific databases (LSDBs) identified at the Human Genome Variation Society (HGVS) web site (http://www.HGVS.org/dblist.html); DatabasE of Chromosome Imbalance and Phenotype in Humans using Ensembl Resources (view: http://decipher.sanger.ac.uk); Entrez Gene (view: ncbi.nlm.nih.gov/gene); dbGap: Database of Genotype and Phenotype (view: ncbi.nlm.nih.gov/dbgap); and the Human Gene Mutation Database HGMD ® (view: http://www.hgmd.org).
- Published
- 2010
- Full Text
- View/download PDF
28. Supplementing High-Density SNP Microarrays for Additional Coverage of Disease-Related Genes: Addiction as a Paradigm
- Author
-
Chuan-Yun Li, Robert K. Moyzis, Laura J. Bierut, George R. Uhl, Stephen T. Sherry, Peter W. Kalivas, Caryn Lerman, Joni L. Rutter, Nancy L. Saccone, Scott F. Saccone, Michael Feolo, Howard J. Edenberg, Vivek M. Philip, and Elissa J. Chesler
- Subjects
dbSNP ,Quantitative Trait Loci ,lcsh:Medicine ,Genetics and Genomics/Pharmacogenomics ,Single-nucleotide polymorphism ,Genome-wide association study ,Quantitative trait locus ,Biology ,Polymorphism, Single Nucleotide ,03 medical and health sciences ,Mice ,0302 clinical medicine ,SNP ,Animals ,Humans ,International HapMap Project ,lcsh:Science ,Genetics and Genomics/Genetics of Disease ,030304 developmental biology ,Oligonucleotide Array Sequence Analysis ,Genetics ,0303 health sciences ,Multidisciplinary ,lcsh:R ,Genetics and Genomics/Bioinformatics ,3. Good health ,Behavior, Addictive ,lcsh:Q ,Human genome ,DNA microarray ,030217 neurology & neurosurgery ,Research Article - Abstract
Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well represented by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.
- Published
- 2009
29. 14th International HLA and Immunogenetics Workshop: report on mapping microsatellite markers in the major histocompatibility complex region
- Author
-
Douglas W. Hoffman, Anne Cambon-Thomsen, Pierre-Antoine Gourraud, and Michael Feolo
- Subjects
Genetics ,Genetic Markers ,Human leucocyte antigen ,Internet ,Immunology ,Haplotype ,General Medicine ,Human leukocyte antigen ,Immunogenetics ,Computational biology ,Biology ,Major histocompatibility complex ,Biochemistry ,Major Histocompatibility Complex ,Genetic marker ,Databases, Genetic ,biology.protein ,Immunology and Allergy ,Microsatellite ,Humans ,Primer (molecular biology) ,Microsatellite Repeats - Abstract
This paper describes the use of the program e-pcr to localize 687 known major histocompatability complex (MHC) microsatellite primer pairs to their sequence positions in several genomic assemblies across the MHC region. The sequences used were the Sequences of Sanger Institute's MHC Haplotype Project: COX, PGF, QBL, as well as the Celera, and Reference (PGF across extended MHC) sequences from the NCBI genomic build 36. More than 95% (664/687) of the markers mapped unambiguously to the Reference assembly sequence. All primer pairs used in this analysis, and those were previously unknown to UniSTS, have now been assigned permanent public UniSTS identifiers. Mapping and descriptive data for each primer pair are available at the publicly accessible dbMHC microsatellite resource: http://www.ncbi.nlm.nih.gov/projects/mhc/xslcgi.fcgi?cmd=mssearch.
- Published
- 2007
30. Nomenclature for HLA microsatellites
- Author
-
Pierre-Antoine Gourraud, Wolfgang R. Mayr, Richard M. Single, E.M. Dauber, Michael Feolo, Eric Mickelson, Mogens Thomsen, Anne Cambon-Thomsen, and John A. Hansen
- Subjects
Genetics ,biology ,Immunology ,Histocompatibility Antigens Class I ,Histocompatibility Antigens Class II ,Locus (genetics) ,General Medicine ,Human leukocyte antigen ,Major histocompatibility complex ,Biochemistry ,HLA Antigens ,Terminology as Topic ,biology.protein ,Immunology and Allergy ,Microsatellite ,Humans ,Allele ,Nomenclature ,Alleles ,DNA Primers ,Microsatellite Repeats - Abstract
A proposal for a standardized nomenclature for human leukocyte antigen (HLA) microsatellites is presented. It provides recommendations for Microsatellites as regards to locus name, primer names, and denominations for alleles.
- Published
- 2007
31. Inference and analysis of haplotypes from combined genotyping studies deposited in dbSNP
- Author
-
Eleazar Eskin, Eran Halperin, Michael Feolo, Noah Zaitlen, Stephen T. Sherry, and Hyun Min Kang
- Subjects
Genetics ,dbSNP ,Genotype ,Pan troglodytes ,Haplotype ,Computational Biology ,Genomics ,Single-nucleotide polymorphism ,Computational biology ,Biology ,Polymorphism, Single Nucleotide ,Resources ,Haplotypes ,Databases, Genetic ,Animals ,Humans ,Genotyping ,Genetics (clinical) ,Genetic association ,Reference genome - Abstract
In the attempt to understand human variation and the genetic basis of complex disease, a tremendous number of single nucleotide polymorphisms (SNPs) have been discovered and deposited into NCBI's dbSNP public database. More than 2.7 million SNPs in the database have genotype information. This data provides an invaluable resource for understanding the structure of human variation and the design of genetic association studies. The genotypes deposited to dbSNP are unphased, and thus, the haplotype information is unknown. We applied the phasing method HAP to obtain the haplotype information, block partitions, and tag SNPs for all publicly available genotype data and deposited this information into the dbSNP database. We also deposited the orthologous chimpanzee reference sequence for each predicted haplotype block computed using the UCSC BLASTZ alignments of human and chimpanzee. Using dbSNP, researchers can now easily perform analyses using multiple genotype data sets from the same genomic regions. Dense and sparse genotype data sets from the same region were combined to show that the number of common haplotypes is significantly underestimated in whole genome data sets, while the predicted haplotypes over the common SNPs are consistent between studies. To validate the accuracy of the predictions, we benchmarked HAP's running time and phasing accuracy against PHASE. Although HAP is slightly less accurate than PHASE, HAP is over 1000 times faster than PHASE, making it suitable for application to the entire set of genotypes in dbSNP.
- Published
- 2005
32. The reagent database at dbMHC
- Author
-
Michael Feolo, Wolfgang Helmberg, and R Dunivin
- Subjects
Databases, Factual ,Genotype ,Computer science ,computer.internet_protocol ,Immunology ,Information Storage and Retrieval ,computer.software_genre ,Biochemistry ,Polymerase Chain Reaction ,HLA Antigens ,Genetics ,Immunology and Allergy ,Humans ,Typing ,Web site ,Database ,Histocompatibility Testing ,food and beverages ,Computational Biology ,General Medicine ,Sequence Analysis, DNA ,United States ,National Institutes of Health (U.S.) ,Reagent ,computer ,XML - Abstract
The reagent database dbMHC was built by the National Center for Biotechnology Information (NCBI) as an open resource for registration and characterization of HLA DNA-typing kits and reagents. Each reagent is uniquely identified as sequence-specific oligonucleotide (SSO) or primer (SSP), SSO mix, or SSP mix. Computerized prediction of allele reactivities, based on annealing stringency, is performed on all submissions to the reagent database. User-specified allele reactivities may be added or deleted independently of the prediction algorithm. Updates of allele reactivities are performed in synchronization with the IMGT/HLA database, in order to account for newly discovered alleles. Probe and primer sequences aligned to allelic sequences can be displayed at any time. Reagents registered in the reagent database are grouped in typing kits. Each kit or kit batch is uniquely identified. Group-specific amplification of alleles can be specified for an entire kit or for sections of each kit. Kits designed to test multiple loci are supported. Kits can be entered and updated via the web or submitted as batches in extensible markup language (XML) format. A tool for online interpretation of typing results is available. Both the reagent database and the typing kit database have been designed to facilitate the exchange of HLA typing based on raw typing data using the unique identifiers of kits or individual reagents. In addition, batch-wise reinterpretation of previous typing data can be performed either using the NCBI web site or by locally using downloaded allele-reactivity lists. Reinterpretation by the NCBI requires submission of raw typing data in XML format.
- Published
- 2004
33. Major histocompatibility complex class II gene frequencies by serologic and deoxyribonucleic acid genomic typing in idiopathic dilated cardiomyopathy
- Author
-
Richard H. Ward, Jeffrey L. Anderson, Michael Feolo, Dee Husebye, and John F. Carlquist
- Subjects
Cardiomyopathy, Dilated ,HLA-DP Antigens ,Genotype ,business.industry ,Genes, MHC Class II ,Dot blot ,HLA-DR Antigens ,Polymerase Chain Reaction ,Serology ,Restriction enzyme ,Gene Frequency ,HLA-DQ Antigens ,Immunology ,Idiopathic dilated cardiomyopathy ,Etiology ,Humans ,Medicine ,Typing ,Allele ,skin and connective tissue diseases ,Cardiology and Cardiovascular Medicine ,business ,Gene ,Alleles - Abstract
Certain immunologic features associated with idiopathic dilated cardiomyopathy (IDC) suggest an infectious and/or autoimmune etiology. In this regard, an association between the major histocompatibility complex class II allele, DR4, and increased risk for IDC was previously identified. In the present report, 43 additional patients with IDC and 236 control subjects were studied for major histocompatibility class II allele associations. DR alleles were identified by microcytotoxicity. No significant differences between control subjects and patients with IDC were seen, although the frequency of DR4 was increased among patients. DR4 subtyping (n = 9) was performed by “dot blot” hybridization of allele-specific oligonucleotide probes to PCR-amplified genomic deoxyribonucleic acid. The DRB1∗0401 and DRB1∗0404 alleles were each found in 44% (n = 4) of patients with IDC, and DRB1∗0407 was identified in 1 patient (11%). DQ and DP alleles were identified by restriction endonuclease codigestion of polymerase chain reaction-amplified deoxyribonucleic acid. The digested fragments were separated and identified by polyacrylamide gel electrophoresis. Differences between patients and control subjects were observed for DQA1∗0501 (11% of patients vs 28% of control subjects, p < 0.05) and DQB1∗0201 (13% patients vs 25% control subjects, p < 0.05). A modest difference was noted for DQA1∗0301 (35% patients vs 23% control subjects, p = 0.08). These findings suggest a complex immune-related etiology for IDC that cannot be explained solely by the presence or absence of a single class II allele. However, this and other studies continue to implicate genes within the class II region in determining the risk for IDC.
- Published
- 1994
- Full Text
- View/download PDF
34. Linkage analysis of HLA and candidate genes for celiac disease in a North American family-based study
- Author
-
Susan L. Neuhausen, Linda S. Book, John J. Zone, Michael Feolo, and James M. Farnham
- Subjects
Candidate gene ,lcsh:Internal medicine ,lcsh:QH426-470 ,Disease ,Human leukocyte antigen ,Biology ,Pathogenesis ,03 medical and health sciences ,0302 clinical medicine ,Genetic linkage ,Dermatitis herpetiformis ,medicine ,Genetics ,Genetics(clinical) ,lcsh:RC31-1245 ,Genetics (clinical) ,Genetic association ,nutritional and metabolic diseases ,medicine.disease ,digestive system diseases ,Human genetics ,3. Good health ,lcsh:Genetics ,030211 gastroenterology & hepatology ,030215 immunology ,Research Article - Abstract
Background Celiac disease has a strong genetic association with HLA. However, this association only explains approximately half of the sibling risk for celiac disease. Therefore, other genes must be involved in susceptibility to celiac disease. We tested for linkage to genes or loci that could play a role in pathogenesis of celiac disease. Methods DNA samples, from members of 62 families with a minimum of two cases of celiac disease, were genotyped at HLA and at 13 candidate gene regions, including CD4, CTLA4, four T-cell receptor regions, and 7 insulin-dependent diabetes regions. Two-point and multipoint heterogeneity LOD (HLOD) scores were examined. Results The highest two-point and multipoint HLOD scores were obtained in the HLA region, with a two-point HLOD of 3.1 and a multipoint HLOD of 5.0. For the candidate genes, we found no evidence for linkage. Conclusions Our significant evidence of linkage to HLA replicates the known linkage and association of HLA with CD. In our families, likely candidate genes did not explain the susceptibility to celiac disease.
- Published
- 2001
35. A strategy for high throughput HLA-DQ typing
- Author
-
Susan L. Neuhausen, John J. Zone, Michelle Taylor, Michael Feolo, and Thomas C. Fuller
- Subjects
musculoskeletal diseases ,Genetics ,HLA-DQB1 ,endocrine system diseases ,Base Sequence ,Genotype ,Histocompatibility Testing ,Immunology ,nutritional and metabolic diseases ,Locus (genetics) ,Biology ,Polymerase Chain Reaction ,HLA-DQ alpha-Chains ,HLA-DQ Antigens ,HLA-DQ ,Immunology and Allergy ,Multilocus sequence typing ,HLA-DQ beta-Chains ,Humans ,Typing ,Primer (molecular biology) ,Genotyping ,Alleles ,DNA Primers - Abstract
We have developed a high throughput HLA typing methodology that is a modification of the standard sequence-specific primer method. This approach is distinct from other methods using an automated DNA analyzer, as more than one gene is typed in a single lane. We have optimized the method for use on an ABI 373 automated genotyping machine. Primers were designed to preferentially amplify DNA fragments of the generic allelic groups of the DQA1 and DQB1 loci. PCR products representing alleles at the DQA1 locus were amplified using a different fluorescent dye than the PCR products from the DQB1 locus. Only three PCR reactions are required for low resolution typing of DQA1 and DQB1. Use of different labeled primers enables genotyping for both loci in a single gel lane, allowing for 64 samples to be typed at low resolution for both DQA1 and DQB1 on a single gel. Automated allele assignments were determined based on DNA migration distance through a polyacrylamide gel using a standard genotype allele-calling program. Accuracy of this method is greater than 98% for both loci. The strategy described here may be adapted to include more loci or to produce higher resolution typing of alleles encoded by these loci. It can be readily optimized for use on other slab gel or capillary electrophoresis systems.
- Published
- 2001
36. Erratum: Assessing and managing risk when sharing aggregate genetic variant data
- Author
-
Zhenyuan Wang, Teri A. Manolio, David Craig, Jim Ostell, Michael Feolo, Stephen T. Sherry, Robert M. Goor, and Justin Paschall
- Subjects
musculoskeletal diseases ,endocrine system diseases ,GWAS Central ,Aggregate (data warehouse) ,Genetics ,Genetic variants ,nutritional and metabolic diseases ,natural sciences ,Computational biology ,Biology ,Link (knot theory) ,Molecular Biology ,Genetics (clinical) - Abstract
Nature Reviews Genetics 12, 730–736 (2011) In the above article, the incorrect link was provided for GWAS Central. The correct link should have been http://www.gwascentral.org. In the Further Information Box, the link to http://gwas.nih.gov was incorrectly described as 'GWAS Central (includes policy)'.
- Published
- 2011
- Full Text
- View/download PDF
37. 150-P: IDAWG - the Immunogenomic Data-Analysis Working Group
- Author
-
Wolfgang Helmberg, Sge Marsh, Richard M. Single, Alex K. Lancaster, Glenys Thomson, Marcelo Fernandez-Vina, B. Tait, O. Nathalang, D Middleton, Jill A. Hollenbach, U. Kanga, Martin Maiers, P. Kupatawintu, Steve Mack, Henry A. Erlich, Diogo Meyer, Carlheinz Mueller, Michael D. Varney, Michael Feolo, P.-A. Gourrauud, M.H. Park, and H. Maldonado-Torres
- Subjects
medicine.medical_specialty ,Group (periodic table) ,business.industry ,Immunology ,Analysis working ,Physical therapy ,Immunology and Allergy ,Medicine ,General Medicine ,business - Published
- 2009
- Full Text
- View/download PDF
38. HLA genetic association analysis using the sequence feature variant type approach (136.31)
- Author
-
Nishanth Marthandan, David R Karp, Frank Arnett, Matthew J Waller, Paula Guidry, Michael Feolo, Steven GE Marsh, and Richard H Scheuermann
- Subjects
Immunology ,Immunology and Allergy - Abstract
Immunogeneticists have found significant associations between specific HLA alleles and a variety of medical conditions, including autoimmune diseases. Current disease association studies treat each HLA allele as a single complete unit, which does not illuminate the parts of the molecule associated with disease. We have developed a novel approach for genetic association analysis in which HLA genes and proteins are broken down into smaller sequence features. Sequence features are defined based on structural or functional information, and can be overlapping and continuous or discontinuous in the linear sequence. The extent of sequence variation is then assessed for each HLA sequence feature to define all variant types found in the human population. This allows for the independent analysis of disease association with any sequence feature variant type (SFVT). We tested this approach in the analysis of systemic sclerosis patients using HLA-DRB1 and HLA-DQB1 typing data. We identified a region of the HLA-DRB1 protein centered around peptide-binding pocket 7 that appears to be associated with disease risk; aromatic amino acids found in HLA-DRB1*1104 at positions 26, 28, 37 and 67 were associated with increased risk. The SFVT approach is a novel method for identifying the molecular determinants of disease association in highly polymorphic genes like the HLA. For DAIT-DISC; supported by NIAID N01AI40076
- Published
- 2009
- Full Text
- View/download PDF
39. Two new clinical resources at dbMHC: The rheumatoid arthritis project and the diabetes project of the 13. IHWC
- Author
-
Wolfgang Helmberg, Alberto Pugliese, Douglas W. Hoffman, Michael Feolo, and Lee Nelson
- Subjects
medicine.medical_specialty ,business.industry ,Internal medicine ,Rheumatoid arthritis ,Diabetes mellitus ,Immunology ,medicine ,Immunology and Allergy ,General Medicine ,medicine.disease ,business - Published
- 2005
- Full Text
- View/download PDF
40. Population based characterisation of HLA typing kits and ambiguous typing results: The frequency inferred typing (FIT) index
- Author
-
Douglas W. Hoffman, Martin Maiers, Wolfgang Helmberg, and Michael Feolo
- Subjects
Index (economics) ,Immunology ,Immunology and Allergy ,General Medicine ,Human leukocyte antigen ,Population based ,Computational biology ,Typing ,Biology ,Bioinformatics - Published
- 2005
- Full Text
- View/download PDF
41. Use of dbMHC /CN3D to highlight amino acid positions on 3D molecular models
- Author
-
Michael Feolo, Raymond Dunivin, and Wolfgang Helmberg
- Subjects
chemistry.chemical_classification ,Molecular model ,Biochemistry ,Chemistry ,Immunology ,Immunology and Allergy ,General Medicine ,Amino acid - Published
- 2003
- Full Text
- View/download PDF
42. Hla genotype data of the IHWG anthropology project at dbMHC
- Author
-
Steven J. Mack, Michael Feolo, Douglas C. Hoffman, and Wolfgang Helmberg
- Subjects
Genetics ,Hla genotype ,Immunology ,Immunology and Allergy ,General Medicine ,Biology - Published
- 2003
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.