246 results on '"Bruce S. Weir"'
Search Results
52. Characterizing allelic association in the genome era
- Author
-
C. C. Laurie and Bruce S. Weir
- Subjects
Genetics ,Estimation ,education.field_of_study ,Genome ,Population ,Genetic Variation ,General Medicine ,Biology ,Chromosomes ,Article ,Genetics, Population ,Variation (linguistics) ,Evolutionary biology ,Genetic variation ,Multiple comparisons problem ,Humans ,Allele ,education ,Alleles ,Realization (probability) - Abstract
SummaryWhole genome data are allowing the estimation of population genetic parameters with an accuracy not imagined 50 years ago. Variation in these parameters along the genome is being found empirically where once only approximate theoretical values were available. Along with increased information, however, has come the issue of multiple testing and the realization that high values of the coefficients of variation of quantities such as relatedness measures may make it difficult to draw inferences. This review concentrates on measures of allelic association within and between individuals and within and between populations.
- Published
- 2010
- Full Text
- View/download PDF
53. Confidence Interval of the Likelihood Ratio Associated with Mixed Stain DNA Evidence*
- Author
-
Gary W. Beecham and Bruce S. Weir
- Subjects
Score test ,education.field_of_study ,Population ,Variance (accounting) ,Likelihood ratios in diagnostic testing ,Genealogy ,Confidence interval ,Pathology and Forensic Medicine ,symbols.namesake ,Likelihood-ratio test ,Statistics ,Genetics ,Taylor series ,symbols ,education ,Likelihood function ,Mathematics - Abstract
Likelihood ratios are necessary to properly interpret mixed stain DNA evidence. They can flexibly consider alternate hypotheses and can account for population substructure. The likelihood ratio should be seen as an estimate and not a fixed value, because the calculations are functions of allelic frequency estimates that were estimated from a small portion of the population. Current methods do not account for uncertainty in the likelihood ratio estimates and are therefore an incomplete picture of the strength of the evidence. We propose the use of a confidence interval to report the consequent variation of likelihood ratios. The confidence interval is calculated using the standard forensic likelihood ratio formulae and a variance estimate derived using the Taylor expansion. The formula is explained, and a computer program has been made available. Numeric work shows that the evidential strength of DNA profiles decreases as the variation among populations increases.
- Published
- 2010
- Full Text
- View/download PDF
54. Quality control and quality assurance in genotypic data for genome-wide association studies
- Author
-
Laura J. Bierut, Emily L. Harris, Kimberly F. Doheny, Xiuwen Zheng, Frederick J. Boehm, Neil E. Caporaso, Caitlin P. McHugh, John P. Rice, Elizabeth W. Pugh, Frank B. Hu, Maria Teresa Landi, Teri A. Manolio, Thomas Lumley, Peter Kraft, Tushar Bhangale, Howard J. Edenberg, Kevin B. Jacobs, Bruce S. Weir, Daniel B. Mirel, Cathy C. Laurie, S. Gabriel, Marilyn C. Cornelis, Kenneth Rice, Ian Painter, and Justin Paschall
- Subjects
Male ,Quality Control ,Lung Neoplasms ,Genotype ,Substance-Related Disorders ,Epidemiology ,Population ,Genome-wide association study ,Computational biology ,Biology ,Polymorphism, Single Nucleotide ,Chromosome aberration ,Article ,Gene Frequency ,Humans ,education ,Allele frequency ,Genotyping ,Sex Chromosome Aberrations ,Genetics (clinical) ,Genetic association ,Chromosome Aberrations ,Genetics ,education.field_of_study ,business.industry ,Genetic Variation ,Aneuploidy ,Genetics, Population ,Case-Control Studies ,Data quality ,Female ,Artifacts ,business ,Quality assurance ,Genome-Wide Association Study - Abstract
Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies. This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium (HWE) test p-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis (PCA) to SNP selection. The methods are illustrated with examples from the ‘Gene Environment Association Studies’ (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of genome-wide association studies.
- Published
- 2010
- Full Text
- View/download PDF
55. Detecting Coevolution through Allelic Association between Physically Unlinked Loci
- Author
-
Rori V. Rohlfs, Willie J. Swanson, and Bruce S. Weir
- Subjects
Linkage disequilibrium ,Zona pellucida glycoprotein ,Genotype ,Population ,Egg protein ,Receptors, Cell Surface ,Biology ,Zona Pellucida Glycoproteins ,Linkage Disequilibrium ,Article ,03 medical and health sciences ,0302 clinical medicine ,Genetics ,Humans ,Genetics(clinical) ,Allele ,education ,Alleles ,Genetics (clinical) ,Selection (genetic algorithm) ,030304 developmental biology ,Linkage (software) ,0303 health sciences ,education.field_of_study ,Membrane Glycoproteins ,Polymorphism, Genetic ,Egg Proteins ,030217 neurology & neurosurgery - Abstract
Coevolving interacting genes undergo complementary mutations to maintain their interaction. Distinct combinations of alleles in coevolving genes interact differently, conferring varying degrees of fitness. If this fitness differential is adequately large, the resulting selection for allele matching could maintain allelic association, even between physically unlinked loci. Allelic association is often observed in a population with the use of gametic linkage disequilibrium. However, because the coevolving genes are not necessarily in physical linkage, this is not an appropriate measure of coevolution-induced allelic association. Instead, we propose using both composite linkage disequilibrium (CLD) and a measure of association between genotypes, which we call genotype association (GA). Using a simple selective model, we simulated loci and calculated power for tests of CLD and GA, showing that the tests can detect the allelic association expected under realistic selective pressure. We apply CLD and GA tests to the polymorphic, physically unlinked, and putatively coevolving human gamete-recognition genes ZP3 and ZP3R. We observe unusual allelic association, not attributable to population structure, between ZP3 and ZP3R. This study shows that selection for allele matching can drive allelic association between unlinked loci in a contemporary human population, and that selection can be detected with the use of CLD and GA tests. The observation of this selection is surprising, but reasonable in the highly selected system of fertilization. If confirmed, this sort of selection provides an exception to the paradigm of chromosomal independent assortment.
- Published
- 2010
- Full Text
- View/download PDF
56. Case-control association testing in the presence of unknown relationships
- Author
-
Yoonha Choi, Bruce S. Weir, and Ellen M. Wijsman
- Subjects
Genetic Markers ,Risk ,Epidemiology ,Population ,Context (language use) ,Genome-wide association study ,Biology ,Polymorphism, Single Nucleotide ,Article ,Statistics ,Kinship ,Null distribution ,Humans ,Spurious relationship ,education ,Alleles ,Genetics (clinical) ,Genetic association ,Genetics ,Likelihood Functions ,education.field_of_study ,Models, Statistical ,Models, Genetic ,Genome, Human ,Computational Biology ,Pedigree ,Genetics, Population ,Case-Control Studies ,Genome-Wide Association Study ,Type I and type II errors - Abstract
Genome-wide association studies result in inflated false positive results when unrecognized cryptic relatedness exists. A number of methods have been proposed for testing association between markers and disease with a correction for known pedigree-based relationships. However, in most case-control studies, relationships are generally unknown, yet the design is predicated on the assumption of at least ancestral relatedness among cases. Here, we focus on adjusting cryptic relatedness when the genealogy of the sample is unknown, particularly in the context of samples from isolated populations where cryptic relatedness may be problematic. We estimate cryptic relatedness using maximum-likelihood methods and use a corrected chi-square test with estimated kinship coefficients for testing in the context of unknown cryptic relatedness. Estimated kinship coefficients characterize precisely the relatedness between truly related people, but are biased for unrelated pairs. The proposed test substantially reduces spurious positive results, producing a uniform null distribution of p-values. Especially with missing pedigree information, estimated kinship coefficients can still be used to correct non-independence among individuals. The corrected test was applied to real data sets from genetic isolates and created a distribution of p-value that was close to uniform. Thus the proposed test corrects the non-uniform distribution of p-values obtained with the uncorrected test and illustrates the advantage of the approach on real data.
- Published
- 2009
- Full Text
- View/download PDF
57. Genetics in geographically structured populations: defining, estimating and interpreting FST
- Author
-
Bruce S. Weir and Kent E. Holsinger
- Subjects
Statistics as Topic ,Population ,Biology ,Article ,Gene Frequency ,Population Groups ,Genetic variation ,Genetics ,Animals ,Humans ,education ,Molecular Biology ,Allele frequency ,Genetics (clinical) ,education.field_of_study ,Geography ,Human evolutionary genetics ,Extramural ,Genetic Variation ,Data interpretation ,Genetic differentiation ,Genetics, Population ,Evolutionary biology ,Data Interpretation, Statistical ,human activities - Abstract
Wright's F-statistics, and especially F(ST), provide important insights into the evolutionary processes that influence the structure of genetic variation within and among populations, and they are among the most widely used descriptive statistics in population and evolutionary genetics. Estimates of F(ST) can identify regions of the genome that have been the target of selection, and comparisons of F(ST) from different parts of the genome can provide insights into the demographic history of populations. For these reasons and others, F(ST) has a central role in population and evolutionary genetics and has wide applications in fields that range from disease association mapping to forensic science. This Review clarifies how F(ST) is defined, how it should be estimated, how it is related to similar statistics and how estimates of F(ST) should be interpreted.
- Published
- 2009
- Full Text
- View/download PDF
58. Drawing inferences about the coancestry coefficient
- Author
-
Bruce S. Weir, Yi-Ju Li, and Suvajit Samanta
- Subjects
Likelihood Functions ,education.field_of_study ,Population ,Population genetics ,Models, Theoretical ,Biological Evolution ,Identity by descent ,Article ,Gene Frequency ,Genetic distance ,Genetic drift ,Sample size determination ,F-statistics ,Statistics ,Econometrics ,education ,Allele frequency ,Alleles ,Ecology, Evolution, Behavior and Systematics ,Mathematics - Abstract
The coancestry coefficient, also known as the population structure parameter, is of great interest in population genetics. It can be thought of as the intraclass correlation of pairs of alleles within populations and it can serve as a measure of genetic distance between populations. For a general class of evolutionary models it determines the distribution of allele frequencies among populations. Under more restrictive models it can be regarded as the probability of identity by descent of any pair of alleles at a locus within a random mating population. In this paper we review estimation procedures that use the method of moments or are maximum likelihood under the assumption of normally distributed allele frequencies. We then consider the problem of testing hypotheses about this parameter. In addition to parametric and non-parametric bootstrap tests we present an asymptotically-distributed chi-square test. This test reduces to the contingency-table test for equal sample sizes across populations. Our new test appears to be more powerful than previous tests, especially for loci with multiple alleles. We apply our methods to HapMap SNP data to confirm that the coancestry coefficient for humans is strictly positive.
- Published
- 2009
- Full Text
- View/download PDF
59. Correlation-Based Inference for Linkage Disequilibrium With Multiple Alleles
- Author
-
Alexander I. Pudovkin, Dmitri V. Zaykin, and Bruce S. Weir
- Subjects
Linkage disequilibrium ,Genotype ,Investigations ,Biology ,Polymorphism, Single Nucleotide ,Linkage Disequilibrium ,Gene Frequency ,Genetics ,Chi-square test ,Test statistic ,Animals ,Humans ,Alleles ,Statistic ,Contingency table ,Stochastic Processes ,Models, Statistical ,Models, Genetic ,Chromosome Mapping ,Models, Theoretical ,Exact test ,Haplotypes ,Sample size determination ,Total correlation ,Monte Carlo Method ,Software - Abstract
The correlation between alleles at a pair of genetic loci is a measure of linkage disequilibrium. The square of the sample correlation multiplied by sample size provides the usual test statistic for the hypothesis of no disequilibrium for loci with two alleles and this relation has proved useful for study design and marker selection. Nevertheless, this relation holds only in a diallelic case, and an extension to multiple alleles has not been made. Here we introduce a similar statistic, R2, which leads to a correlation-based test for loci with multiple alleles: for a pair of loci with k and m alleles, and a sample of n individuals, the approximate distribution of n(k – 1)(m – 1)/(km)R2 under independence between loci is \documentclass[10pt]{article} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{pmc} \usepackage[Euler]{upgreek} \pagestyle{empty} \oddsidemargin -1.0in \begin{document} \begin{equation*}{\mathrm{{\chi}}}_{(k-1)(m-1)}^{2}\end{equation*}\end{document}. One advantage of this statistic is that it can be interpreted as the total correlation between a pair of loci. When the phase of two-locus genotypes is known, the approach is equivalent to a test for the overall correlation between rows and columns in a contingency table. In the phase-known case, R2 is the sum of the squared sample correlations for all km 2 × 2 subtables formed by collapsing to one allele vs. the rest at each locus. We examine the approximate distribution under the null of independence for R2 and report its close agreement with the exact distribution obtained by permutation. The test for independence using R2 is a strong competitor to approaches such as Pearson's chi square, Fisher's exact test, and a test based on Cressie and Read's power divergence statistic. We combine this approach with our previous composite-disequilibrium measures to address the case when the genotypic phase is unknown. Calculation of the new multiallele test statistic and its P-value is very simple and utilizes the approximate distribution of R2. We provide a computer program that evaluates approximate as well as “exact” permutational P-values.
- Published
- 2008
- Full Text
- View/download PDF
60. A Maximum-Likelihood Method for the Estimation of Pairwise Relatedness in Structured Populations
- Author
-
Amy D. Anderson and Bruce S. Weir
- Subjects
Genetics ,Estimation ,Likelihood Functions ,education.field_of_study ,Polymorphism, Genetic ,Models, Genetic ,Degree (graph theory) ,Siblings ,Maximum likelihood ,Population Dynamics ,Population ,Value (computer science) ,Estimator ,Investigations ,Biology ,Gene Frequency ,Databases, Genetic ,Animals ,Humans ,Pairwise comparison ,education ,human activities ,Allele frequency - Abstract
A maximum-likelihood estimator for pairwise relatedness is presented for the situation in which the individuals under consideration come from a large outbred subpopulation of the population for which allele frequencies are known. We demonstrate via simulations that a variety of commonly used estimators that do not take this kind of misspecification of allele frequencies into account will systematically overestimate the degree of relatedness between two individuals from a subpopulation. A maximum-likelihood estimator that includes FST as a parameter is introduced with the goal of producing the relatedness estimates that would have been obtained if the subpopulation allele frequencies had been known. This estimator is shown to work quite well, even when the value of FST is misspecified. Bootstrap confidence intervals are also examined and shown to exhibit close to nominal coverage when FST is correctly specified.
- Published
- 2007
- Full Text
- View/download PDF
61. Estimating F-statistics: A historical view
- Author
-
Bruce S. Weir
- Subjects
History ,Computer science ,Population structure ,Estimator ,Sewall wright ,Data science ,Article ,Set (abstract data type) ,Philosophy ,Wright ,Human disease ,History and Philosophy of Science ,F-statistics ,Weir ,Mathematical economics - Abstract
Characterizing the genetic structure of populations is of importance to evolutionary biology, to human disease gene mapping, and to forensic science. Sewall Wright introduced a set of F-statistics to describe population structure in 1951, and he emphasized that these quantities were ratios of variances. Responding to uncertainty over the best way to estimate F-statistics, Weir and Cockerham published a method-of-moments set of estimators in 1984. This paper continues to be widely cited, with over 7,000 citations to date. Some background to the publishing history of the Weir and Cockerham paper is given here, along with subsequent developments and a discussion of current uses of Wright's F-statistics.
- Published
- 2015
62. Exact Inference for Hardy-Weinberg Proportions with Missing Genotypes: Single and Multiple Imputation
- Author
-
Stephanie M. Gogarten, Sarah C. Nelson, Jan Graffelman, Bruce S. Weir, Universitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa, and Universitat Politècnica de Catalunya. COSDA-UPC - COmpositional and Spatial Data Analysis
- Subjects
Linkage disequilibrium ,Biometry ,Disequilibrium ,Inference ,imputation ,Biology ,Investigations ,01 natural sciences ,Linkage Disequilibrium ,010104 statistics & probability ,03 medical and health sciences ,missing data ,exact test ,Statistics ,Genetics ,medicine ,Statistical inference ,Inbreeding ,Imputation (statistics) ,0101 mathematics ,Molecular Biology ,Genetics (clinical) ,030304 developmental biology ,0303 health sciences ,Models, Genetic ,Matemàtiques i estadística [Àrees temàtiques de la UPC] ,Hardy−Weinberg equilibrium ,Missing data ,Hardy–Weinberg principle ,Quantitative Biology::Genomics ,Data Accuracy ,Exact test ,Genetics, Population ,medicine.symptom ,Estadística mèdica ,Algorithms - Abstract
This paper addresses the issue of exact-test based statistical inference for Hardy−Weinberg equilibrium in the presence of missing genotype data. Missing genotypes often are discarded when markers are tested for Hardy−Weinberg equilibrium, which can lead to bias in the statistical inference about equilibrium. Single and multiple imputation can improve inference on equilibrium. We develop tests for equilibrium in the presence of missingness by using both inbreeding coefficients (or, equivalently, χ2 statistics) and exact p-values. The analysis of a set of markers with a high missing rate from the GENEVA project on prematurity shows that exact inference on equilibrium can be altered considerably when missingness is taken into account. For markers with a high missing rate (>5%), we found that both single and multiple imputation tend to diminish evidence for Hardy−Weinberg disequilibrium. Depending on the imputation method used, 6−13% of the test results changed qualitatively at the 5% level.
- Published
- 2015
63. Testing for Hardy-Weinberg equilibrium at biallelic genetic markers on the X chromosome
- Author
-
Jan Graffelman, Bruce S. Weir, Universitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa, and Universitat Politècnica de Catalunya. COSDA-UPC - COmpositional and Spatial Data Analysis
- Subjects
Estadística matemàtica ,0301 basic medicine ,Genetic Markers ,Male ,Genotype ,Genotyping Techniques ,Genome-wide association study ,Biology ,01 natural sciences ,Polymorphism, Single Nucleotide ,010104 statistics & probability ,03 medical and health sciences ,Gene Frequency ,Human population genetics ,Genetics ,Humans ,0101 mathematics ,Allele frequency ,Genotyping ,Genetics (clinical) ,Venous Thrombosis ,Chromosomes, Human, X ,Likelihood Functions ,Models, Genetic ,Hardy–Weinberg principle ,Mathematical statistics ,Exact test ,030104 developmental biology ,Matemàtiques i estadística::Estadística aplicada [Àrees temàtiques de la UPC] ,Female ,Original Article ,Genètica ,Type I and type II errors ,Genome-Wide Association Study - Abstract
Testing genetic markers for Hardy–Weinberg equilibrium (HWE) is an important tool for detecting genotyping errors in large-scale genotyping studies. For markers at the X chromosome, typically the ¿2 or exact test is applied to the females only, and the hemizygous males are considered to be uninformative. In this paper we show that the males are relevant, because a difference in allele frequency between males and females may indicate HWE not to hold. The testing of markers on the X chromosome has received little attention, and in this paper we lay down the foundation for testing biallelic X-chromosomal markers for HWE. We develop four frequentist statistical test procedures for X-linked markers that take both males and females into account: the ¿2 test, likelihood ratio test, exact test and permutation test. Exact tests that include males are shown to have a better Type I error rate. Empirical data from the GENEVA project on venous thromboembolism is used to illustrate the proposed tests. Results obtained with the new tests differ substantially from tests that are based on female genotype counts only. The new tests detect differences in allele frequencies and seem able to uncover additional genotyping error that would have gone unnoticed in HWE tests based on females only
- Published
- 2015
64. Impact of dense genetic marker maps on plant population genetic studies
- Author
-
Bruce S. Weir
- Subjects
Genetics ,education.field_of_study ,Genetic diversity ,Population ,Population genetics ,Plant Science ,Horticulture ,Biology ,Gene mapping ,Genetic marker ,Evolutionary biology ,F-statistics ,Human population genetics ,Genetic variability ,education ,Agronomy and Crop Science - Abstract
Recent work and publications in human population genetics involving very dense SNP datasets suggest that the use of these markers will become common in plant breeding. The experience of human geneticists futher suggests that attention will need to be paid to the very high degree of genomic heterogeneity of population genetic measures such as genetic diversity. Dense marker sets will also allow more precision of estimates of relatedness between individual plants or pure-breeding lines.
- Published
- 2006
- Full Text
- View/download PDF
65. Three major haplotypes of the β2 adrenergic receptor define psychological profile, blood pressure, and the risk for development of a common musculoskeletal pain disorder
- Author
-
Amy D. Anderson, Svetlana A. Shabalina, Swetha Sama, Inna Belfer, Luda Diatchenko, Mitchell B. Max, William Maixner, Roger B. Fillingim, Gary D. Slade, Bruce S. Weir, David Goldman, and Tomas J. Higgins
- Subjects
Adult ,medicine.medical_specialty ,Adolescent ,Genotype ,Receptor expression ,Gene Expression ,Blood Pressure ,Bioinformatics ,Article ,Cellular and Molecular Neuroscience ,Musculoskeletal disorder ,Gene Frequency ,Risk Factors ,Surveys and Questionnaires ,Internal medicine ,medicine ,Humans ,Genetic Predisposition to Disease ,Risk factor ,Genetics (clinical) ,Polymorphism, Genetic ,Haplotype ,Chronic pain ,Temporomandibular Joint Disorders ,medicine.disease ,Psychiatry and Mental health ,Endocrinology ,Blood pressure ,Haplotypes ,Linear Models ,Anxiety ,Female ,Receptors, Adrenergic, beta-2 ,medicine.symptom ,Stress, Psychological ,Anxiety disorder - Abstract
Adrenergic receptor beta(2) (ADRB2) is a primary target for epinephrine. It plays a critical role in mediating physiological and psychological responses to environmental stressors. Thus, functional genetic variants of ADRB2 will be associated with a complex array of psychological and physiological phenotypes. These genetic variants should also interact with environmental factors such as physical or emotional stress to produce a phenotype vulnerable to pathological states. In this study, we determined whether common genetic variants of ADRB2 contribute to the development of a common chronic pain condition that is associated with increased levels of psychological distress and low blood pressure, factors which are strongly influenced by the adrenergic system. We genotyped 202 female subjects and examined the relationships between three major ADRB2 haplotypes and psychological factors, resting blood pressure, and the risk of developing a chronic musculoskeletal pain condition-Temporomandibular Joint Disorder (TMD). We propose that the first haplotype codes for lower levels of ADRB2 expression, the second haplotype codes for higher ADRB2 expression, and the third haplotype codes for higher receptor expression and rapid agonist-induced internalization. Individuals who carried one haplotype coding for high and one coding for low ADRB2 expression displayed the highest positive psychological traits, had higher levels of resting arterial pressure, and were about 10 times less likely to develop TMD. Thus, our data suggest that either positive or negative imbalances in ADRB2 function increase the vulnerability to chronic pain conditions such as TMD through different etiological pathways that imply the need for tailored treatment options.
- Published
- 2006
- Full Text
- View/download PDF
66. A haplotype map of the human genome
- Author
-
Mark Leppert, Aravinda Chakravarti, Charmaine D.M. Royal, Sarah S. Murray, Renzong Qiu, Panos Deloukas, Renwu Wang, David A. Hinds, Barbara E. Stranger, Xiaoli Tang, Huanming Yang, John W. Belmont, Nigel P. Carter, Huy Nguyen, William Mak, Kazuto Kato, Shiran Pasternak, Chaohua Li, Jeffrey C. Barrett, Lon R. Cardon, Vincent Ferretti, Atsushi Nagashima, Peter E. Chen, Stephen F. Schaffner, Hongbo Fu, Zhu Chen, Siqi Liu, John Burton, Paul Hardenbol, Gudmundur A. Thorisson, Yusuke Nakamura, Mark Griffiths, Imtiaz Yakub, Eiko Suda, Gonçalo R. Abecasis, Carl S. Kashuk, Qingrun Zhang, Yoshimitsu Fukushima, Karen Kennedy, Sarah E. Hunt, Yi Wang, Norio Niikawa, Ichiro Matsuda, Lynn F. Zacharia, Lalitha Krishnan, Zhen Wang, Stéphanie Roumy, C M Clee, David J. Cutler, Albert V. Smith, Lincoln Stein, Simon Myers, Jane Peterson, Jun Zhou, Yozo Ohnishi, Weihua Guan, Matthew Stephens, Xiaoyan Xiong, Julian Maller, Houcan Zhang, Pui-Yan Kwok, Mark S. Guyer, Liuda Ziaugra, Jonathan Witonsky, Matthew C. Jones, Stacey Gabriel, You-Qiang Song, Daochang An, Haifeng Wang, Gilean McVean, Lawrence M. Sung, Zhijian Yao, Yan Shen, Yangfan Liu, George M. Weinstock, Ludmila Pawlikowska, Erica Sodergren, Mark T. Ross, Andrew Boudreau, Toshihiro Tanaka, Thomas D. Willis, Weitao Hu, Kelly A. Frazer, Li Jin, Robert W. Plumb, Paul I.W. de Bakker, Hongbin Zhao, Wei Lin, Sarah Sims, Richard A. Gibbs, Maura Faggart, Michael Feolo, Dennis G. Ballinger, Xun Chu, Lucinda Fulton, Marcos Delgado, Ellen Winchester, Wei Huang, Fuli Yu, Christianne R. Bird, Shaun Purcell, Jessica Roy, Dongmei Cai, Launa M. Galver, Bartha Maria Knoppers, Emmanouil T. Dermitzakis, Gao Yang, Takashi Morizono, Rachel Barry, Kirsten McLay, Daryl J. Thomas, Steve McCarroll, Jonathan Marchini, Daniel J. Richter, Andy Peiffer, Patricia Taillon-Miller, Richard K. Wilson, Stephen Kwok-Wing Tsui, Jian-Bing Fan, Lisa D. Brooks, Laura L. Stuve, Paul L'Archevêque, David M. Evans, Clémentine Sallée, Peter Donnelly, Hong Xue, Hui Zhao, Charles N. Rotimi, Jean E. McEwen, J. Tze Fei Wong, Hao Pan, Alastair Kent, Brendan Blumenstiel, Qing Li, Weiwei Sun, L. Kang, Colin Freeman, John Stewart, Chibuzor Nkwodimmah, Morris W. Foster, Don Powell, Leonardo Bottolo, Raymond D. Miller, Stephen T. Sherry, Francis S. Collins, Donna M. Muzny, Jun Yu, Ike Ajayi, Hua Han, Pardis C. Sabeti, Hongguang Wang, Takahisa Kawaguchi, Tatsuhiko Tsunoda, Guy Bellemare, Zhaohui S. Qin, H. B. Hu, Jane Rogers, Thomas J. Hudson, Mark J. Daly, Andrew P. Morris, Supriya Gupta, Ming Xiao, Patrick Varilly, Nick Patterson, Akihiro Sekine, Chris C. A. Spencer, Jonathan Morrison, Missy Dixon, Paul K.H. Tam, Jian Wang, Matthew Defelice, Susana Eyheramendy, Michael Shi, Yungang He, Ellen Wright Clayton, Richa Saxena, Heather M. Munro, Arthur L. Holden, Yayun Shen, Christine P. Bird, Bruce W. Birren, Itsik Pe'er, David R. Bentley, Lynne V. Nazareth, Pamela Whittaker, Pak C. Sham, Amy L. Camargo, David A. Wheeler, Koji Saeki, Martin Godbout, David Altshuler, Liang Xu, Ying Wang, David Willey, Alexandre Montpetit, Shin Lin, Michael S. Phillips, Changqing Zeng, Clement Adebamowo, John C. Wallenburg, Mark S. Chee, Ben Fry, Erich Stahl, Melissa Parkin, Rhian Gwilliam, Andrei Verner, Patrick J. Nailer, Lap-Chee Tsui, Bo Zhang, Fanny Chagnon, David R. Cox, Jack Spiegel, Jamie Moore, Vivian Ota Wang, Patricia A. Marshall, Takuya Kitamoto, Bruce S. Weir, Darryl Macer, Geraldine M. Clarke, Robert C. Onofrio, Mary M.Y. Waye, Wei Wang, Suzanne M. Leal, James C. Mullikin, Toyin Aniagwu, Daniel C. Koboldt, Mary Goyette, Martin Leboeuf, Isaac F. Adewole, Ruth Jamieson, Arnold Oliphant, Jessica Watkin, and Jean François Olivier
- Subjects
Linkage disequilibrium ,Biology ,DNA, Mitochondrial ,Polymorphism, Single Nucleotide ,Article ,Linkage Disequilibrium ,Structural variation ,Gene Frequency ,Humans ,Selection, Genetic ,International HapMap Project ,Genetic association ,Haplotypes - genetics ,Recombination, Genetic ,Genetics ,Chromosomes, Human, Y ,Multidisciplinary ,Genome, Human ,DNA, Mitochondrial - genetics ,Haplotype ,Tag SNP ,Polymorphism, Single Nucleotide - genetics ,Haplotypes ,Human genome ,Haplotype estimation ,Chromosomes, Human, Y - genetics - Abstract
Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution. © 2005 Nature Publishing Group., link_to_OA_fulltext
- Published
- 2005
- Full Text
- View/download PDF
67. Genotypic probabilities for pairs of inbred relatives
- Author
-
Bruce S. Weir and Wenlei Liu
- Subjects
Genetics ,education.field_of_study ,Genotype ,Models, Genetic ,Population ,Genetic Variation ,Locus (genetics) ,Breeding ,Biology ,Identity by descent ,General Biochemistry, Genetics and Molecular Biology ,Genetics, Population ,Gene Frequency ,Genetic variation ,Inbreeding ,Allele ,General Agricultural and Biological Sciences ,education ,Allele frequency ,Probability ,Research Article - Abstract
Expressions for the joint genotypic probabilities of two related individuals are used in many population and quantitative genetic analyses. These expressions, resting on a set of 15 probabilities of patterns of identity by descent among the four alleles at a locus carried by the relatives, are generally well known. There has been recent interest in special cases where the two individuals are both related and inbred, although there have been differences among published results. Here, we return to the original 15-probability treatment and show appropriate reductions for relatives when they are drawn from a population that itself is inbred or when the relatives have parents who are related. These results have application in affected-relative tests for linkage, and in methods for interpreting forensic genetic profiles.
- Published
- 2005
- Full Text
- View/download PDF
68. Affected Sib Pair Tests in Inbred Populations
- Author
-
W. Liu and Bruce S. Weir
- Subjects
Genetics ,education.field_of_study ,Linkage disequilibrium ,Population ,Locus (genetics) ,Biology ,Statistics ,Cutoff ,False positive rate ,education ,Null hypothesis ,Inbreeding ,Genetics (clinical) ,Statistical hypothesis testing - Abstract
Summary The affected-sib-pair (ASP) method for detecting linkage between a disease locus and marker loci was first established 50 years ago, and since then numerous modifications have been made. We modify two identity-by-state (IBS) test statistics of Lange (Lange, 1986a, 1986b) to allow for inbreeding in the population. We evaluate the power and false positive rates of the modified tests under three disease models, using simulated data. Before estimating false positive rates, we demonstrate that IBS tests are tests of both linkage and linkage disequilibrium between marker and disease loci. Therefore, the null hypothesis of IBS tests should be no linkage and no LD. When the population inbreeding coefficient is large, the false positive rates of Lange's tests become much larger than the nominal value, while those of our modified tests remain close to the nominal value. To estimate power with a controlled false positive rate, we choose the cutoff values based on simulated datasets under the null hypothesis, so that both Lange's tests and the modified tests generate same false positive rate. The powers of Lange's z-test and our modified z-test are very close and do not change much with increasing inbreeding. The power of the modified chi-square test also stays stable when the inbreeding coefficient increases. However, the power of Lange's chi-square test increases with increasing inbreeding, and is larger than that of our modified chi-square test for large inbreeding coefficients. The power is high under a recessive disease model for both Lange's tests and the modified tests, though the power is low for additive and dominant disease models. Allowing for inbreeding is therefore appropriate, at least for diseases known to be recessive.
- Published
- 2004
- Full Text
- View/download PDF
69. Properties of the Multiallelic Trend Test
- Author
-
Wendy Czika and Bruce S. Weir
- Subjects
Genetic Markers ,Statistics and Probability ,Linkage disequilibrium ,Disease status ,Biometry ,Population ,Multiple alleles ,Biology ,Linkage Disequilibrium ,General Biochemistry, Genetics and Molecular Biology ,Statistics ,Econometrics ,Humans ,Allele ,education ,Alleles ,Disease gene ,education.field_of_study ,Chi-Square Distribution ,Models, Genetic ,General Immunology and Microbiology ,Applied Mathematics ,General Medicine ,Trend analysis ,Genetic marker ,Case-Control Studies ,General Agricultural and Biological Sciences - Abstract
Summary. Disease genes can be mapped on the basis of associations between genetic markers and disease status, with the case–control design having the advantage of not requiring individuals from different generations. When the marker loci have multiple alleles, there has been debate on whether the power of tests for association increases or decreases. We show here that the multiple-allele version of Armitage's trend test has increased power over the two-allele version under the requirement of equifrequent alleles, but not in general. The trend test has the advantage of remaining valid even when the sampled population is not in Hardy–Weinberg equilibrium. A departure from Hardy–Weinberg means that association tests depend on gametic and nongametic linkage disequilibrium between marker and disease loci, and we illustrate the magnitude of these effects with simulated data.
- Published
- 2004
- Full Text
- View/download PDF
70. Allelic association patterns for a dense SNP map
- Author
-
Bruce S. Weir, Lon R. Cardon, and William G. Hill
- Subjects
Linkage disequilibrium ,Genotype ,Epidemiology ,Population ,Chromosomes, Human, Pair 20 ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Linkage Disequilibrium ,White People ,Asian People ,Humans ,Genetic Predisposition to Disease ,Association mapping ,education ,Genotyping ,Alleles ,Genetics (clinical) ,Genetic association ,Genetics ,education.field_of_study ,Models, Statistical ,Models, Genetic ,Genome, Human ,Haplotype ,Chromosome Mapping ,Tag SNP ,Black or African American ,Genetics, Population ,Haplotypes ,Case-Control Studies - Abstract
A dense set of 5,000 SNPs on a 10-Mb region of human chromosome 20 has been typed on samples of African Americans, East Asians, and United Kingdom Caucasians. There are departures from Hardy-Weinberg equilibrium beyond the level at which markers are often discarded because of possible genotyping errors. The observation that markers showing such departures are often close together on the chromosome confirms the result that Hardy-Weinberg tests at two loci are correlated to an extent that depends on the linkage disequilibrium between those two markers. Linkage disequilibrium can be described by the composite linkage disequilibrium coefficient, the parameter that determines the behavior of case-control allelic tests of association. A useful preliminary investigation of datasets of this type is provided by counting the numbers of distinct multi-locus genotypes in windows of a few markers.
- Published
- 2004
- Full Text
- View/download PDF
71. Matching and Partially-Matching DNA Profiles
- Author
-
Bruce S. Weir
- Subjects
Genetics ,education.field_of_study ,DNA profiling ,Population structure ,Population ,Statistics ,Locus (genetics) ,Allele ,education ,Allele frequency ,Pathology and Forensic Medicine ,Mathematics - Abstract
The DNA profiles of two individuals can have 0, 1, or 2 pairs of alleles that are the same at each locus. These events may be called mismatches, partial matches or matches, respectively, and they have probabilities that depend on the population proportions of alleles as well as the population structure parameter theta. The observed and expected numbers of pairs of individuals with various numbers of matching or partially matching loci in FBI and Australian databases are found to be in good agreement provided theta is set equal to some small value greater than zero. The likelihood ratios for two individuals having a specified degree of relationship versus being unrelated also depend on the numbers of matching and partially matching loci, but even unrelated pairs of individuals can have likelihood ratios that support hypotheses of relatedness. Matching probabilities allow predictions to be made for the sizes of databases that are expected to contain a pair of individuals with high numbers of matching loci. It is very likely that two individuals with at least 9 matching loci among the 13 CODIS loci have already been typed.
- Published
- 2004
- Full Text
- View/download PDF
72. Proceedings of the Fourth Seattle Symposium in Biostatistics: Clinical Trials
- Author
-
Thomas R. Fleming, Bruce S. Weir, Thomas R. Fleming, and Bruce S. Weir
- Subjects
- Biometry--Congresses
- Abstract
This volume contains a selection of chapters base on papers presented at the Fourth Seattle Symposium in Biostatistics: Clinical Trials. The symposium was held in 2010 to celebrate the 40th anniversary of the University of Washington School of Public Health and Community Medicine. It featured keynote lectures by David DeMets and Susan Ellenberg and 16 invited presentations by other prominent researchers. The papers contained in this volume encompass recent methodological advances in several important clinical trials research, such as biomarkers, meta-analyses, sequential and adaptive clinical trials, and various genetic bioinformatic techniques. This volume will be a valuable reference for researchers and practitioners in the field of clinical trials.
- Published
- 2013
73. GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies
- Author
-
Caitlin P. McHugh, Rohit Swarnkar, David R. Crosslin, Cathy C. Laurie, Jess Shen, Matthew P. Conomos, Bruce S. Weir, Sarah C. Nelson, Ian Painter, Thomas Lumley, Kenneth Rice, Tushar Bhangale, Xiuwen Zheng, David K. Levine, Stephanie M. Gogarten, and Cecelia A. Laurie
- Subjects
Quality Control ,Statistics and Probability ,Genotype ,Interface (Java) ,Computer science ,Single-nucleotide polymorphism ,Genome-wide association study ,computer.software_genre ,Polymorphism, Single Nucleotide ,Biochemistry ,Computer Science Applications ,Bioconductor ,Applications Note ,Computational Mathematics ,Computational Theory and Mathematics ,Humans ,Data mining ,Molecular Biology ,computer ,Software ,Genome-Wide Association Study ,Genetic association - Abstract
Summary: GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within R’s memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation. Availability and implementation: GWASTools is implemented in R and is available from Bioconductor (http://www.bioconductor.org). An extensive vignette detailing a recommended work flow is included. Contact: sdmorris@uw.edu
- Published
- 2012
- Full Text
- View/download PDF
74. Estimating F-Statistics
- Author
-
Bruce S. Weir and William G. Hill
- Subjects
Genetics ,education.field_of_study ,Models, Genetic ,Population ,Population structure ,Estimator ,Locus (genetics) ,Biology ,Genetics, Population ,F-statistics ,Statistics ,Weir ,Genetic variation ,education ,Inbreeding - Abstract
▪ Abstract A moment estimator of θ, the coancestry coefficient for alleles within a population, was described by Weir & Cockerham in 1984 ( 100 ) and is still widely cited. The estimate is used by population geneticists to characterize population structure, by ecologists to estimate migration rates, by animal breeders to describe genetic variation, and by forensic scientists to quantify the strength of matching DNA profiles. This review extends the work of Weir & Cockerham by allowing different levels of coancestry for different populations, and by allowing non-zero coancestries between pairs of populations. All estimates are relative to the average value of θ between pairs of populations. Moment estimates for within- and between-population θ values are likely to have large sampling variances, although these may be reduced by combining information over loci. Variances also decrease with the numbers of alleles at a locus, and with the numbers of populations sampled. This review also extends the work of Weir & Cockerham by employing maximum likelihood methods under the assumption that allele frequencies follow the normal distribution over populations. For the case of equal θ values within populations and zero θ values between populations, the maximum likelihood estimate is the same as that given by Robertson & Hill in 1984 ( 70 ). The review concludes by relating functions of θ values to times of population divergence under a pure drift model.
- Published
- 2002
- Full Text
- View/download PDF
75. Association Studies under General Disease Models
- Author
-
Bruce S. Weir and Dahlia M. Nielsen
- Subjects
Genetic Markers ,Genetics ,Candidate gene ,Linkage disequilibrium ,Genotype ,Models, Genetic ,Chromosome Mapping ,Locus (genetics) ,Tag SNP ,Biology ,Linkage Disequilibrium ,Apolipoproteins E ,Phenotype ,Cytochrome P-450 CYP2D6 ,Haplotypes ,Humans ,Genetic Predisposition to Disease ,Allele ,Association mapping ,Allele frequency ,Alleles ,Ecology, Evolution, Behavior and Systematics ,Genetic association - Abstract
There is great expectation that the levels of association found between genetic markers and disease status will play a role in the location of disease genes. This expectation follows from regarding association as being proportional to linkage disequilibrium and therefore inversely related to recombination value. For disease genes with more than two alleles, the association measure is instead a weighted average of linkage disequilibria, with the weights depending on allele frequencies and genotype susceptibilities at the disease loci. There is no longer a simple relationship, even in expectation, with recombination. We adopt a general framework to examine association mapping methods which helps to clarify the nature of case-control and transmission/disequilibrium-type tests and reveals the relationship between measures of association and coefficients of linkage disequilibrium. In particular, we can show the consequences of additive and nonadditive effects at the trait locus on the behavior of these tests. These concepts have a natural extension to marker haplotypes. The association of two-locus marker haplotypes with disease phenotype depends on a weighted average of three-locus disequilibria (two markers with each disease locus). It is likely that these two-marker analyses will provide additional information in association mapping studies.
- Published
- 2001
- Full Text
- View/download PDF
76. Challenges Facing Statistical Genetics
- Author
-
Bruce S. Weir
- Subjects
Statistics and Probability ,Statistical genetics ,Sociology ,Statistics, Probability and Uncertainty ,Data science - Abstract
(2000). Challenges Facing Statistical Genetics. Journal of the American Statistical Association: Vol. 95, No. 449, pp. 319-322.
- Published
- 2000
- Full Text
- View/download PDF
77. A Comparative Study of Sibship Tests of Linkage and/or Association
- Author
-
Norman L. Kaplan, Bruce S. Weir, and S.A. Monks
- Subjects
Genetic Markers ,Parents ,Linkage disequilibrium ,Genetic Linkage ,Population ,Context (language use) ,Disease ,Biology ,Nuclear Family ,Association ,Family-based tests ,Genetics ,False Positive Reactions ,Genetics(clinical) ,education ,Nuclear family ,Alleles ,Genetics (clinical) ,Linkage (software) ,education.field_of_study ,Models, Statistical ,Models, Genetic ,Transmission/disequilibrium test ,Linkage ,Genetic Diseases, Inborn ,Case-control study ,Transmission disequilibrium test ,Research Design ,Case-Control Studies ,Power study ,Monte Carlo Method ,Research Article ,Demography - Abstract
SummaryPopulation-based tests of association have used data from either case-control studies or studies based on trios (affected child and parents). Case-control studies are more prone to false-positive results caused by inappropriate controls, which can occur if, for example, there is population admixture or stratification. An advantage of family-based tests is that cases and controls are well matched, but parental data may not always be available, especially for late-onset diseases. Three recent family-based tests of association and linkage utilize unaffected siblings as surrogates for untyped parents. In this paper, we propose an extension of one of these tests. We describe and compare the four tests in the context of a complex disease for both biallelic and multiallelic markers, as well as for sibships of different sizes. We also examine the consequences of having some parental data in the sample.
- Published
- 1998
- Full Text
- View/download PDF
78. Hardy-Weinberg Testing for Continuous Data
- Author
-
Bruce S. Weir and Lauren M. McIntyre
- Subjects
Genetics ,Analysis of Variance ,Total harmonic distortion ,Models, Statistical ,Genotype ,Models, Genetic ,Discretization ,Intraclass correlation ,Kernel density estimation ,Numerical Analysis, Computer-Assisted ,Minisatellite Repeats ,Investigations ,Biology ,Hardy–Weinberg principle ,Continuous data ,Exact test ,Databases as Topic ,Statistics ,Test statistic ,Computer Simulation ,Alleles - Abstract
Estimation of allelic and genotypic distributions for continuous data using kernel density estimation is discussed and illustrated for some variable number of tandem repeat data. These kernel density estimates provide a useful representation of data when only some of the many variants at a locus are present in a sample. Two Hardy-Weinberg test procedures are introduced for continuous data: a continuous chi-square test with test statistic TCCS and a test based on Hellinger's distance with test statistic TCCS. Simulations are used to compare the powers of these tests to each other and to the powers of a test of intraclass correlation TIC, as well as to the power of Fisher's exact test TFET applied to discretized data. Results indicate that the power of TCCS is better than that of THD but neither is as powerful as TFET. The intraclass correlation test does not perform as well as the other tests examined in this article.
- Published
- 1997
- Full Text
- View/download PDF
79. Tests for Linkage and Association in Nuclear Families
- Author
-
Norman L. Kaplan, Bruce S. Weir, and Eden R. Martin
- Subjects
Genetic Markers ,Linkage disequilibrium ,Genetic Linkage ,Locus (genetics) ,Biology ,Linkage Disequilibrium ,Nuclear Family ,03 medical and health sciences ,0302 clinical medicine ,Gene mapping ,Genetic linkage ,Chi-square test ,Genetics ,Humans ,Genetic Predisposition to Disease ,Genetics(clinical) ,Allele ,Child ,Nuclear family ,Alleles ,Genetics (clinical) ,030304 developmental biology ,0303 health sciences ,Chi-Square Distribution ,Models, Genetic ,Genetic Diseases, Inborn ,Reproducibility of Results ,Transmission disequilibrium test ,stomatognathic diseases ,Monte Carlo Method ,030217 neurology & neurosurgery ,Research Article - Abstract
Summary The transmission/disequilibrium test (TDT) originally was introduced to test for linkage between a genetic marker and a disease-susceptibility locus, in the presence of association. Recently, the TDT has been used to test for association in the presence of linkage. The motivation for this is that linkage analysis typically identifies large candidate regions, and further refinement is necessary before a search for the disease gene is begun, on the molecular level. Evidence of association and linkage may indicate which markers in the region are closest to a disease locus. As a test of linkage, transmissions from heterozygous parents to all of their affected children can be included in the TDT; however, the TDT is a valid x 2 test of association only if transmissions to unrelated affected children are used in the analysis. If the sample contains independent nuclear families with multiple affected children, then one procedure that has been used to test for association is to select randomly a single affected child from each sibship and to apply the TDT to those data. As an alternative, we propose two statistics that use data from all of the affected children. The statistics give valid x 2 tests of the null hypothesis of no association or no linkage and generally are more powerful than the TDT with a single, randomly chosen, affected child from each family.
- Published
- 1997
- Full Text
- View/download PDF
80. Interpreting Whole-Genome Marker Data
- Author
-
Bruce S. Weir
- Subjects
Statistics and Probability ,Genetics ,Linkage disequilibrium ,Genetic marker ,Statistical genetics ,Genotype ,Single-nucleotide polymorphism ,Biology ,Association mapping ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,Genome ,Hardy–Weinberg principle ,Article - Abstract
The challenges of whole-genome data, when genotypes are available from hundreds of thousands of genetic markers, are explored for four topics in statistical genetics: Hardy-Weinberg testing, estimating linkage disequilibrium from unphased genotypic data, association mapping and characterizing population structure.
- Published
- 2013
81. Proceedings of the Fourth Seattle Symposium in Biostatistics: Clinical Trials
- Author
-
Bruce S. Weir and Thomas R. Fleming
- Subjects
Clinical trial ,medicine.medical_specialty ,business.industry ,Public health ,education ,medicine ,Library science ,Biostatistics ,business ,humanities - Abstract
This volume contains a selection of chapters base on papers presented at the Fourth Seattle Symposium in Biostatistics: Clinical Trials. The symposium was held in 2010 to celebrate the 40th anniversary of the University of Washington School of Public Health and Community Medicine. It featured keynote lectures by David DeMets and Susan Ellenberg and 16 invited presentations by other prominent researchers. The papers contained in this volume encompass recent methodological advances in several important clinical trials research, such as biomarkers, meta-analyses, sequential and adaptive clinical trials, and various genetic bioinformatic techniques. This volume will be a valuable reference for researchers and practitioners in the field of clinical trials.
- Published
- 2013
- Full Text
- View/download PDF
82. Genetic Variants Associated With Development of TMD and Its Intermediate Phenotypes: The Genetic Architecture of TMD in the OPPERA Prospective Cohort Study
- Author
-
Shad B. Smith, William Maixner, Bruce S. Weir, Roger B. Fillingim, Joel D. Greenspan, Gary D. Slade, Eric Bair, Ronald Dubner, Richard Ohrbach, Ellen Mir, Luda Diatchenko, and Charles Knott
- Subjects
Adult ,Male ,Oncology ,Pathology ,medicine.medical_specialty ,Adolescent ,Polymorphism, Single Nucleotide ,Article ,Risk Factors ,Polymorphism (computer science) ,Internal medicine ,Genetic variation ,Humans ,Medicine ,Genetic Predisposition to Disease ,Prospective Studies ,Prospective cohort study ,Genetic Association Studies ,business.industry ,Hazard ratio ,Chronic pain ,Temporomandibular Joint Disorders ,medicine.disease ,Genetic architecture ,stomatognathic diseases ,Phenotype ,Anesthesiology and Pain Medicine ,Neurology ,Genetic marker ,Multiple comparisons problem ,Female ,Neurology (clinical) ,business - Abstract
Genetic risk factors are believed to combine with environmental exposures and contribute to the risk of developing temporomandibular disorder (TMD). In this prospective cohort study, 2,737 people without TMD were assessed for common genetic variation in 358 genes known to contribute to nociceptive pathways, inflammation, and affective distress. During a median follow-up period of 2.8 years, 260 people developed first-onset TMD. Hazard ratios were computed as measures of association between 2,924 single-nucleotide polymorphisms and TMD incidence. After correction for multiple testing, no single single-nucleotide polymorphism was significantly associated with risk of onset TMD. However, several single-nucleotide polymorphisms exceeded Bonferroni correction for multiple comparison or false discovery rate thresholds (.05, .1, or .2) for association with intermediate phenotypes shown to be predictive of TMD onset. Nonspecific orofacial symptoms were associated with voltage-gated sodium channel, type I, alpha subunit ( SCN1A , rs6432860, P = 2.77 × 10 −5 ) and angiotensin I–converting enzyme 2 ( ACE2 , rs1514280, P = 4.86 × 10 −5 ); global psychological symptoms with prostaglandin-endoperoxide synthase 1 ( PTGS1 , rs3842803, P = 2.79 × 10 −6 ); stress and negative affectivity with amyloid-β (A4) precursor protein ( APP , rs466448, P = 4.29 × 10 −5 ); and heat pain temporal summation with multiple PDZ domain protein ( MPDZ , rs10809907, P = 3.05 × 10 −5 ). The use of intermediate phenotypes for complex pain diseases revealed new genetic pathways influencing risk of TMD. Perspective This article reports the findings of a large candidate gene association study of first-onset TMD and related intermediate phenotypes in the OPPERA Study. Although no genetic markers predicted TMD onset, several genetic risk factors for clinical, psychological, and sensory phenotypes associated with TMD onset were observed.
- Published
- 2013
- Full Text
- View/download PDF
83. Mixed model approaches for diallel analysis based on a bio-model
- Author
-
Jun Zhu and Bruce S. Weir
- Subjects
Mixed model ,Analysis of Variance ,Models, Genetic ,Restricted maximum likelihood ,Genetic Variation ,MINQUE ,Sampling (statistics) ,General Medicine ,Variance (accounting) ,Covariance ,Diallel cross ,Quadratic equation ,Statistics ,Genetics ,Computer Simulation ,Monte Carlo Method ,Alleles ,Crosses, Genetic ,Mathematics - Abstract
SummaryA MINQUE(l) procedure, which is minimum norm quadratic unbiased estimation (MINQUE) method with 1 for all the prior values, is suggested for estimating variance and covariance components in a bio-model for diallel crosses. Unbiasedness and efficiency of estimation were compared for MINQUE(l), restricted maximum likelihood (REML) and MINQUE(θ) which has parameter values for the prior values. MINQUE(l) is almost as efficient as MINQUE(θ) for unbiased estimation of genetic variance and covariance components. The bio-model is efficient and robust for estimating variance and covariance components for maternal and paternal effects as well as for nuclear effects. A procedure of adjusted unbiased prediction (AUP) is proposed for predicting random genetic effects in the bio-model. The jack-knife procedure is suggested for estimation of sampling variances of estimated variance and covariance components and of predicted genetic effects. Worked examples are given for estimation of variance and covariance components and for prediction of genetic merits.
- Published
- 1996
- Full Text
- View/download PDF
84. New Zealand population data at five VNTR loci: validation as databases for forensic identity testing
- Author
-
S. J. Cordiner, Bruce S. Weir, John Buckleton, L. Starling, J. F. Hamilton, D. L. Monahan, and Geoffrey K. Chambers
- Subjects
Genetics ,Genetic diversity ,Databases, Factual ,Database ,common ,Locus (genetics) ,Minisatellite Repeats ,Identity testing ,Biology ,computer.software_genre ,White People ,Pathology and Forensic Medicine ,Forensic science ,Polynesians ,Gene Frequency ,common.group ,Ethnicity ,Population data ,Humans ,Allele ,DNA Probes ,computer ,Allele frequency ,Alleles ,New Zealand - Abstract
Databases were developed for three New Zealand ethnic groups (Caucasian, Maori and Polynesian), at five VNTR loci (D1S7, D2S44, D4S139, D5S110 and D12S11), and validated for interpretation of forensic identity tests. A +/-2.8% sliding window was used to define the alleles at each locus and allelic frequency distributions were obtained for each locus. The conservative nature of the sliding window approach for forensic casework was demonstrated. Tests for independence of alleles within and between loci showed good agreement with the expectation of independence. Although Polynesians are known to have reduced genetic diversity at some VNTR loci, this was found not to be a concern for the present methodology. Procedures for the analysis and reporting of DNA profile results used by New Zealand forensic scientists are therefore appropriate.
- Published
- 1996
- Full Text
- View/download PDF
85. A Genome-Wide Survey of Hybrid Incompatibility Factors by the Introgression of Marked Segments of Drosophila mauritiana Chromosomes into Drosophila simulans
- Author
-
Bruce S. Weir, John R. True, and Cathy C. Laurie
- Subjects
Male ,Genetics ,X Chromosome ,biology ,Sterility ,Hybrid inviability ,Introgression ,Genes, Insect ,Investigations ,biology.organism_classification ,Drosophila mauritiana ,Mutation ,Backcrossing ,Animals ,Haldane's rule ,Drosophila ,Female ,Infertility, Female ,Mauritiana ,Crosses, Genetic ,Infertility, Male ,Hybrid - Abstract
In hybrids between Drosophila simulans and D. mauritiana, males are sterile and females are fertile, in compliance with Haldane's rule. The genetic basis of this phenomenon was investigated by introgression of segments of the mauritiana genome into a simulans background. A total of 87 positions throughout the mauritiana genome were marked with P-element insertions and replicate introgressions were made by repeated backcrossing to simulans for 15 generations. The fraction of hemizgyous X chromosomal introgressions that are male sterile is ~50% greater than the fraction of homozygous autosomal segments. This result suggests that male sterility factors have evolved at a higher rate on the X, but chromosomal differences in segment length cannot be ruled out. The fraction of homozygous autosomal introgressions that are male sterile is several times greater than the fraction that are either female sterile or inviable. This observation strongly indicates that male sterility factors have evolved more rapidly than either female sterility or inviability factors. These results, combined with previous work on these and other species, suggest that Haldane> 's rule has at least two causes: recessivity of incompatibility factors and differential accumulation of sterility factors affecting males and females.
- Published
- 1996
- Full Text
- View/download PDF
86. DNA statistics in the Simpson matter
- Author
-
Bruce S. Weir
- Subjects
Likelihood Functions ,Dna evidence ,Famous Persons ,Football ,Forensic Medicine ,History, 20th Century ,Biology ,DNA Fingerprinting ,California ,DNA profiling ,Statistics ,Genetics ,Spite ,Statistical analysis ,Famous persons ,Homicide ,Expert Testimony - Abstract
On 3 October 1995, O.J. Simpson was acquitted of two murders in spite of very strong DNA evidence linking his blood to the crime. Although numerical statements describing the strength of this evidence were made, the DNA profiles included so many loci that the need for presenting numbers in this case, and in others using similarly high numbers of loci, is probably unnecessary. If numbers are to be presented, however, they should be given in the form of likelihood ratios. One thing the verdict in the Simpson case makes clear is that it is essential that the integrity of DNA evidence (with regard to collection, potential contamination or tampering) be beyond doubt.
- Published
- 1995
- Full Text
- View/download PDF
87. A comparison of tests for independence in the FBI RFLP data bases
- Author
-
P. J. Maiste and Bruce S. Weir
- Subjects
Databases, Factual ,Locus (genetics) ,DNA ,Plant Science ,General Medicine ,Biology ,DNA Fingerprinting ,Hardy–Weinberg principle ,United States ,Social Control, Formal ,Exact test ,Genetics, Population ,Government Agencies ,Data Interpretation, Statistical ,Insect Science ,Statistics ,Ethnicity ,Genetics ,Humans ,Animal Science and Zoology ,Restriction fragment length polymorphism ,Alleles ,Polymorphism, Restriction Fragment Length ,Probability ,Statistical hypothesis testing - Abstract
Several tests of independence of allelic frequencies within and between loci have been compared, and it has been found that Fisher's exact test is the best test to use. When this test is applied to RFLP databases established by the FBI, paying no attention to the single-band problem, there is generally evidence for independence at one locus but not at two loci. When the test is restricted to double-banded entries in the databases; there is overall evidence for independence.
- Published
- 1995
- Full Text
- View/download PDF
88. Genetic Markers in Clinical Trials
- Author
-
Bruce S. Weir and P. J. Heagerty
- Subjects
Linkage disequilibrium ,Genetic marker ,SNP ,Single-nucleotide polymorphism ,Human genome ,Quantitative genetics ,Computational biology ,Biology ,Association mapping ,Regression - Abstract
The current availability of dense sets of marker SNPs for the human genome is having a large impact on genetic studies and offers new possibilities for clinical trials. This chapter offers a unified basis for the analysis of marker and response data, emphasizing the central importance of the correlation, or linkage disequilibrium, between SNP markers and the genes that affect response. It is convenient to phrase the development of association mapping in the language of quantitative genetics, using additive and non-additive components of variance. A novel feature of dense SNP data is that good estimates can be made of actual inbreeding and relatedness. These estimates are more relevant than values predicted from family pedigree, and are all that are available in the absence of family data.The dimensionality of SNP marker datasets has required the development of new methods that are appropriate for a large number of statistical comparisons, and the development of computational methods that allow high-dimensional regression. These methods are reviewed here, as is the use of biological annotation for both viewing the relevance of empirical associations, and to structure analysis in order to focus on those markers with the highest expectation for association with the outcomes under study.
- Published
- 2012
- Full Text
- View/download PDF
89. A high-performance computing toolset for relatedness and principal component analysis of SNP data
- Author
-
Bruce S. Weir, Xiuwen Zheng, David K. Levine, Cathy C. Laurie, Stephanie M. Gogarten, and Jess Shen
- Subjects
Statistics and Probability ,Snp data ,Principal Component Analysis ,Computer science ,computer.software_genre ,Biochemistry ,Polymorphism, Single Nucleotide ,Computer Science Applications ,Computational Mathematics ,Applications Note ,Computational Theory and Mathematics ,Principal component analysis ,Key (cryptography) ,Humans ,Data mining ,Performance computing ,Molecular Biology ,computer ,Algorithms ,Software ,Genome-Wide Association Study - Abstract
Summary: Genome-wide association studies are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed gdsfmt and SNPRelate (R packages for multi-core symmetric multiprocessing computer architectures) to accelerate two key computations on SNP data: principal component analysis (PCA) and relatedness analysis using identity-by-descent measures. The kernels of our algorithms are written in C/C++ and highly optimized. Benchmarks show the uniprocessor implementations of PCA and identity-by-descent are ∼8–50 times faster than the implementations provided in the popular EIGENSTRAT (v3.0) and PLINK (v1.07) programs, respectively, and can be sped up to 30–300-fold by using eight cores. SNPRelate can analyse tens of thousands of samples with millions of SNPs. For example, our package was used to perform PCA on 55 324 subjects from the ‘Gene-Environment Association Studies’ consortium studies. Availability and implementation: gdsfmt and SNPRelate are available from R CRAN (http://cran.r-project.org), including a vignette. A tutorial can be found at https://www.genevastudy.org/Accomplishments/software. Contact: zhengx@u.washington.edu
- Published
- 2012
90. Variation in actual relationship among descendants of inbred individuals
- Author
-
William G. Hill and Bruce S. Weir
- Subjects
Genetic Markers ,Male ,Models, Genetic ,Offspring ,Chromosome Mapping ,Genetic Variation ,General Medicine ,Biology ,Article ,Pedigree ,Variation (linguistics) ,Evolutionary biology ,Genetic marker ,Genetic variation ,Inbreeding depression ,Genetics ,Humans ,Female ,Inbreeding ,Allele ,Mating ,Algorithms ,Alleles - Abstract
SummaryIn previous analyses, the variation in actual, or realized, relationship has been derived as a function of map length of chromosomes and type of relationship, the variation being greater the shorter the total chromosome length and the coefficient of variation being greater the more distant the relationship. Here, the results are extended to allow for the relatives’ ancestor being inbred. Inbreeding of a parent reduces variation in actual relationship among its offspring, by an amount that depends on the inbreeding level and the type of mating that led to that level. For descendants of full-sibs, the variation is reduced in later generations, but for descendants of half-sibs, it is increased.
- Published
- 2012
- Full Text
- View/download PDF
91. DNA commission of the International Society of Forensic Genetics: Recommendations on the evaluation of STR typing results that may include drop-out and/or drop-in using probabilistic methods
- Author
-
Wolfgang R. Mayr, Niels Morling, Lourdes Prieto, Leonor Gusmão, Hinda Haned, Walther Parson, H. Schneider, Peter M. Schneider, Bruce S. Weir, M. Prinz, and Peter Gill
- Subjects
Forensic Genetics ,Societies, Scientific ,Locus (genetics) ,Computational biology ,Biology ,Article ,Pathology and Forensic Medicine ,03 medical and health sciences ,chemistry.chemical_compound ,0302 clinical medicine ,Probabilistic method ,Drop out ,Genetics ,Humans ,030216 legal & forensic medicine ,Allele ,Alleles ,030304 developmental biology ,0303 health sciences ,Likelihood Functions ,DNA ,DNA Fingerprinting ,DNA profiling ,chemistry ,Str typing ,Forensic genetics ,Microsatellite Repeats - Abstract
DNA profiling of biological material from scenes of crimes is often complicated because the amount of DNA is limited and the quality of the DNA may be compromised. Furthermore, the sensitivity of STR typing kits has been continuously improved to detect low level DNA traces. This may lead to (1) partial DNA profiles and (2) detection of additional alleles. There are two key phenomena to consider: allelic or locus ‘drop-out’, i.e. ‘missing’ alleles at one or more genetic loci, while ‘drop-in’ may explain alleles in the DNA profile that are additional to the assumed main contributor(s). The drop-in phenomenon is restricted to 1 or 2 alleles per profile. If multiple alleles are observed at more than two loci then these are considered as alleles from an extra contributor and analysis can proceed as a mixture of two or more contributors. Here, we give recommendations on how to estimate probabilities considering drop-out, Pr(D), and drop-in, Pr(C). For reasons of clarity, we have deliberately restricted the current recommendations considering drop-out and/or drop-in at only one locus. Furthermore, we offer recommendations on how to use Pr(D) and Pr(C) with the likelihood ratio principles that are generally recommended by the International Society of Forensic Genetics (ISFG) as measure of the weight of the evidence in forensic genetics. Examples of calculations are included. An Excel spreadsheet is provided so that scientists and laboratories may explore the models and input their own data.
- Published
- 2012
- Full Text
- View/download PDF
92. Familial Identification: Population Structure and Relationship Distinguishability
- Author
-
Stephanie M. Fullerton, Rori V. Rohlfs, and Bruce S. Weir
- Subjects
Forensic Genetics ,Cancer Research ,lcsh:QH426-470 ,Population ,Population genetics ,Sample (statistics) ,Biology ,03 medical and health sciences ,0302 clinical medicine ,Gene Frequency ,Genetics ,Confidence Intervals ,Crime scene ,Humans ,Family ,030216 legal & forensic medicine ,education ,Molecular Biology ,Allele frequency ,Genetics (clinical) ,Ecology, Evolution, Behavior and Systematics ,Alleles ,030304 developmental biology ,0303 health sciences ,education.field_of_study ,Likelihood Functions ,Statistics ,Criminals ,16. Peace & justice ,Probability Theory ,DNA Fingerprinting ,Confidence interval ,United States ,Forensic identification ,lcsh:Genetics ,Identification (information) ,Biometric Identification ,Data Interpretation, Statistical ,Crime ,Databases, Nucleic Acid ,Population Genetics ,Mathematics ,Demography ,Research Article - Abstract
With the expansion of offender/arrestee DNA profile databases, genetic forensic identification has become commonplace in the United States criminal justice system. Implementation of familial searching has been proposed to extend forensic identification to family members of individuals with profiles in offender/arrestee DNA databases. In familial searching, a partial genetic profile match between a database entrant and a crime scene sample is used to implicate genetic relatives of the database entrant as potential sources of the crime scene sample. In addition to concerns regarding civil liberties, familial searching poses unanswered statistical questions. In this study, we define confidence intervals on estimated likelihood ratios for familial identification. Using these confidence intervals, we consider familial searching in a structured population. We show that relatives and unrelated individuals from population samples with lower gene diversity over the loci considered are less distinguishable. We also consider cases where the most appropriate population sample for individuals considered is unknown. We find that as a less appropriate population sample, and thus allele frequency distribution, is assumed, relatives and unrelated individuals become more difficult to distinguish. In addition, we show that relationship distinguishability increases with the number of markers considered, but decreases for more distant genetic familial relationships. All of these results indicate that caution is warranted in the application of familial searching in structured populations, such as in the United States., Author Summary The forensic identification of criminal suspects through DNA profiling is now common in the United States. Indirect identification by familial DNA profiling is increasingly proposed to extend the utility of DNA databases. In familial searching, a DNA profile from a crime scene partially matches a database profile entry, implicating close relatives of the partial match. While the basic principles behind familial searching methods are simple and elegant, statistical confidence that a partially matched profile belongs to a true genetic relative has not been fully explored. Here, we derive relative identification likelihood ratio statistics and consider how the ability of familial searching to distinguish relatives from unrelated individuals varies over population samples and is affected by inaccurately assumed population background. We observe lower relationship distinguishability for population samples with less identifying information in the genetic loci considered. Additionally, we show that relationship distinguishability decreases with discordance between true and assumed population samples. These results indicate that, if an inappropriate genetic population group is assumed, individuals from certain marginalized groups may be disproportionately more often subject to false familial identification. Our results suggest that care is warranted in the use and interpretation of familial searching forensic techniques.
- Published
- 2012
93. A population genetic database of cat breeds developed in coordination with a domestic cat STR multiplex
- Author
-
Marilyn, Menotti-Raymond, Victor A, David, Bruce S, Weir, and Stephen J, O'Brien
- Subjects
Male ,Heterozygote ,Genotype ,DNA Fingerprinting ,Polymerase Chain Reaction ,Article ,Genetics, Population ,Gene Frequency ,Y Chromosome ,Cats ,Animals ,Female ,Databases, Nucleic Acid ,Microsatellite Repeats - Abstract
A simple tandem repeat (STR) PCR-based typing system developed for the genetic individualization of domestic cat samples has been used to generate a population genetic database of domestic cat breeds. A panel of 10 tetranucleotide STR loci and a gender-identifying sequence tagged site (STS) were co-amplified in genomic DNA of 1043 individuals representing 38 cat breeds. The STR panel exhibits relatively high heterozygosity in cat breeds, with an average 10-locus heterozygosity of 0.71, which represents an average of 38 breed-specific heterozygosities for the 10-member panel. When the entire set of breed individuals was analyzed as a single population, a heterozygosity of 0.87 was observed. Heterozygosities obtained for the 10 loci range from 0.72 to 0.96. The power for genetic individualization of domestic cat samples of the multiplex is high, with a probability of match (p(m)) of 6.2E-14, using a conservative θ = 0.05.
- Published
- 2012
94. THE EFFECTS OF INBREEDING ON FORENSIC CALCULATIONS
- Author
-
Bruce S. Weir
- Subjects
Forensic science ,Consanguinity ,Models, Genetic ,Population structure ,Genetics ,MEDLINE ,Humans ,Forensic Medicine ,Biology ,Inbreeding ,Demography - Published
- 1994
- Full Text
- View/download PDF
95. Genome-partitioning of genetic variation for complex traits using common SNPs
- Author
-
Elizabeth W. Pugh, Teri A. Manolio, Alvaro Alonso, Peter M. Visscher, Julie M. Cunningham, Mads Melbye, Eleanor Feingold, Hua Ling, Rasika A. Mathias, Eric Boerwinkle, Maria Teresa Landi, Michael E. Goddard, Peng Lin, Bjarke Feenstra, Guillaume Lettre, M. Geoffrey Hayes, Neil E. Caporaso, Jian Yang, Bruce S. Weir, William G. Hill, Louis R. Pasquale, Marilyn C. Cornelis, William L. Lowe, and Mariza de Andrade
- Subjects
Genetics ,Chromosomes, Human, X ,Models, Genetic ,Genetic Variation ,Single-nucleotide polymorphism ,Heritability ,Biology ,Population stratification ,Explained variation ,Polymorphism, Single Nucleotide ,Article ,Body Height ,Chromosomes ,Body Mass Index ,Genetics, Population ,Polygene ,X Chromosome Inactivation ,Genetic variation ,von Willebrand Factor ,Humans ,Genetic variability ,Human height ,Genome-Wide Association Study - Abstract
Recently, we reported a method to estimate the proportion of phenotypic variance explained by all SNPs from genome-wide association studies, and estimated that half of the heritability for human height was captured by common SNPs. Here we partition genetic variation for height, body mass index (BMI), von Willebrand factor (vWF) and QT interval (QTi) onto chromosomes and chromosome segments, using 586,898 SNPs genotyped on 11,586 unrelated individuals. We estimate that ~45%, ~17%, ~25% and ~21% of variance in height, BMI, vWF and QTi, respectively, can be explained by considering all autosomal SNPs simultaneously, and a further ~0.5–1% by X-chromosome SNPs for height, BMI and vWF. We show that variance explained by each chromosome for height and QTi is proportional to the total gene length on that chromosome. In genome-wide analyses, common SNPs in or near genes explain more variation than SNPs between genes. We propose a novel approach to estimate variation due to cryptic relatedness and population stratification. Our results provide further evidence that a substantial proportion of heritability is accounted for by causal variants in linkage disequilibrium with common SNPs; that height, BMI and QTi are highly polygenic traits; and that the additive variation explained by a part of the genome is approximately proportional to the total length of DNA contained within genes therein.
- Published
- 2011
96. Potential genetic risk factors for chronic TMD: genetic associations from the OPPERA case control study
- Author
-
Richard Ohrbach, Charles Knott, Shad B. Smith, Dustin G. Gibson, Dmitri V. Zaykin, William Maixner, Eric Bair, Luda Diatchenko, Dylan W. Maixner, Roger B. Fillingim, Bruce S. Weir, Ronald Dubner, Gary D. Slade, and Joel D. Greenspan
- Subjects
Adult ,Male ,medicine.medical_specialty ,Candidate gene ,Orofacial pain ,Adolescent ,Single-nucleotide polymorphism ,Bioinformatics ,Population stratification ,Article ,Cohort Studies ,Young Adult ,Risk Factors ,medicine ,Humans ,Multicenter Studies as Topic ,Prospective Studies ,Prospective cohort study ,Genetic Association Studies ,Aged ,business.industry ,Chronic pain ,Case-control study ,Middle Aged ,Temporomandibular Joint Disorders ,medicine.disease ,stomatognathic diseases ,Anesthesiology and Pain Medicine ,Neurology ,Case-Control Studies ,Physical therapy ,Female ,Neurology (clinical) ,medicine.symptom ,Chronic Pain ,business ,Cohort study - Abstract
Genetic factors play a role in the etiology of persistent pain conditions, putatively by modulating underlying processes such as nociceptive sensitivity, psychological well-being, inflammation, and autonomic response. However, to date, only a few genes have been associated with temporomandibular disorders (TMD). This study evaluated 358 genes involved in pain processes, comparing allelic frequencies between 166 cases with chronic TMD and 1,442 controls enrolled in the OPPERA (Orofacial Pain: Prospective Evaluation and Risk Assessment) study cooperative agreement. To enhance statistical power, 182 TMD cases and 170 controls from a similar study were included in the analysis. Genotyping was performed using the Pain Research Panel, an Affymetrix gene chip representing 3,295 single nucleotide polymorphisms, including ancestry-informative markers that were used to adjust for population stratification. Adjusted associations between genetic markers and TMD case status were evaluated using logistic regression. The OPPERA findings provided evidence supporting previously reported associations between TMD and 2 genes: HTR2A and COMT. Other genes were revealed as potential new genetic risk factors for TMD, including NR3C1, CAMK4, CHRM2, IFRD1, and GRK5. While these findings need to be replicated in independent cohorts, the genes potentially represent important markers of risk for TMD, and they identify potential targets for therapeutic intervention. Perspective Genetic risk factors for TMD pain were explored in the case-control component of the OPPERA cooperative agreement, a large population-based prospective cohort study. Over 350 candidate pain genes were assessed using a candidate gene panel, with several genes displaying preliminary evidence for association with TMD status.
- Published
- 2011
97. The Interpretation of Lineage Markers in Forensic DNA Testing
- Author
-
Bruce S. Weir, John Buckleton, and Michael Krawczak
- Subjects
Genetic Markers ,Mitochondrial DNA ,Lineage (genetic) ,Population ,Biology ,DNA, Mitochondrial ,Article ,Pathology and Forensic Medicine ,Gene Frequency ,Genetics ,Crime scene ,Humans ,education ,education.field_of_study ,Likelihood Functions ,Chromosomes, Human, Y ,Models, Genetic ,Lineage markers ,Haplotype ,DNA Fingerprinting ,Genetics, Population ,DNA profiling ,Haplotypes ,Evolutionary biology ,Genetic marker ,Tandem Repeat Sequences ,Databases, Nucleic Acid - Abstract
Mitochondrial DNA (mtDNA) and the non-recombining portion of the Y-chromosome are inherited matrilinealy and patrilinealy, respectively, and without recombination. Collectively they are termed 'lineage markers'. Lineage markers may be used in forensic testing of an item, such as a hair from a crime scene, against a hypothesised source, or in relationship testing. An estimate of the evidential weight of a match is usually provided by a count of the occurrence in some database of the mtDNA or Y-STR haplotype under consideration. When the factual statement of a count in the database is applied to a case, issues of relevance of the database and sampling uncertainty may arise. In this paper, we re-examine the issues of sampling uncertainty, the relevance of the database, and the combination of autosomal and lineage marker evidence. We also review the recent developments by C.H. Brenner. (C) 2010 Elsevier Ireland Ltd. All rights reserved.
- Published
- 2011
98. Variation in actual relationship as a consequence of Mendelian sampling and linkage
- Author
-
William G. Hill and Bruce S. Weir
- Subjects
Linkage (software) ,Genetics ,Genome ,Models, Genetic ,Genetic Linkage ,Sampling (statistics) ,Chromosome Mapping ,Pedigree chart ,General Medicine ,Variance (accounting) ,Biology ,Article ,Pedigree ,Variation (linguistics) ,Skewness ,Statistics ,Animals ,Computer Simulation ,Inbreeding ,Rabbits ,Sex linkage ,Probability - Abstract
SummaryAlthough the expected relationship or proportion of genome shared by pairs of relatives can be obtained from their pedigrees, the actual quantities deviate as a consequence of Mendelian sampling and depend on the number of chromosomes and map length. Formulae have been published previously for the variance of actual relationship for a number of specific types of relatives but no general formula for non-inbred individuals is available. We provide here a unified framework that enables the variances for distant relatives to be easily computed, showing, for example, how the variance of sharing for great grandparent–great grandchild, great uncle–great nephew, half uncle–nephew and first cousins differ, even though they have the same expected relationship. Results are extended in order to include differences in map length between sexes, no recombination in males and sex linkage. We derive the magnitude of skew in the proportion shared, showing the skew becomes increasingly large the more distant the relationship. The results obtained for variation in actual relationship apply directly to the variation in actual inbreeding as both are functions of genomic coancestry, and we show how to partition the variation in actual inbreeding between and within families. Although the variance of actual relationship falls as individuals become more distant, its coefficient of variation rises, and so, exacerbated by the skewness, it becomes increasingly difficult to distinguish different pedigree relationships from the actual fraction of the genome shared.
- Published
- 2011
- Full Text
- View/download PDF
99. Variances for distances between plant sequences
- Author
-
Bruce S. Weir and Brandon S. Gaut
- Subjects
Phylogenetic tree ,Genetic distance ,Phylogenetics ,Resampling ,Mutation (genetic algorithm) ,Botany ,Pairwise comparison ,Plant Science ,Expected value ,Biology ,Divergence (statistics) ,Ecology, Evolution, Behavior and Systematics - Abstract
Phylogenetic trees may be constructed from pairwise distances among the taxonomic units. When the data consist of DNA sequences, appropriate distances can be defined to reflect the mutation model and to have expected values proportional to the time of divergence of the sequences. With the growing amount of sequence data, it is also necessary to incorporate within-species sequence variation into measures of distance between sequences. This additional feature requires attention to be paid to drift and recombination as well as mutation, and greatly increases the difficulty of estimating variances of estimated distances. Numerical resampling seems appropriate. These concepts are illustrated with some sequence data from the Adhl gene from maize and pearl millet.
- Published
- 1993
- Full Text
- View/download PDF
100. Population Structure With Localized Haplotype Clusters
- Author
-
Bruce S. Weir and Sharon R. Browning
- Subjects
Asia ,Chromosomes, Human, Pair 22 ,Population ,Biology ,Investigations ,Polymorphism, Single Nucleotide ,Divergence ,Modal haplotype ,Genetic variation ,Genetics ,Cluster Analysis ,Humans ,International HapMap Project ,education ,Mexico ,Sampling bias ,education.field_of_study ,Haplotype ,Genetic Variation ,Markov Chains ,Europe ,Genetics, Population ,Haplotypes ,Africa ,Haplotype estimation ,human activities ,Algorithms ,Software - Abstract
We propose a multilocus version of FST and a measure of haplotype diversity using localized haplotype clusters. Specifically, we use haplotype clusters identified with BEAGLE, which is a program implementing a hidden Markov model for localized haplotype clustering and performing several functions including inference of haplotype phase. We apply this methodology to HapMap phase 3 data. With this haplotype-cluster approach, African populations have highest diversity and lowest divergence from the ancestral population, East Asian populations have lowest diversity and highest divergence, and other populations (European, Indian, and Mexican) have intermediate levels of diversity and divergence. These relationships accord with expectation based on other studies and accepted models of human history. In contrast, the population-specific FST estimates obtained directly from single-nucleotide polymorphisms (SNPs) do not reflect such expected relationships. We show that ascertainment bias of SNPs has less impact on the proposed haplotype-cluster-based FST than on the SNP-based version, which provides a potential explanation for these results. Thus, these new measures of FST and haplotype-cluster diversity provide an important new tool for population genetic analysis of high-density SNP data.
- Published
- 2010
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.