328 results on '"HAYES BJ"'
Search Results
2. Genetic Gain and Inbreeding from Genomic Selection in a Simulated Commercial Breeding Program for Perennial Ryegrass
- Author
-
Lin, Z, Cogan, Noel, Pembleton, LW, Spangenberg, German, Forster, The, Hayes, BJ, and Daetwyler, Hans
- Subjects
Uncategorized - Abstract
Genomic selection (GS) provides an attractive option for accelerating genetic gain in perennial ryegrass (Lolium perenne L.) improvement given the long cycle times of most current breeding programs. The present study used simulation to investigate the level of genetic gain and inbreeding obtained from GS breeding strategies compared with traditional breeding strategies for key traits (persistency, yield, and flowering time). Base population genomes were simulated through random mating for 60,000 generations at an effective population size of 10,000. The degree of linkage disequilibrium (LD) in the resulting population was compared with that obtained from empirical studies. Initial parental varieties were simulated to match diversity of current commercial cultivars. Genomic selection was designed to fit into a company breeding program at two selection points in the breeding cycle (spaced plants and miniplot). Genomic estimated breeding values (GEBVs) for productivity traits were trained with phenotypes and genotypes from plots. Accuracy of GEBVs was 0.24 for persistency and 0.36 for yield for single plants, while for plots it was lower (0.17 and 0.19, respectively). Higher accuracy of GEBVs was obtained for flowering time (up to 0.7), partially as a result of the larger reference population size that was available from the clonal row stage. The availability of GEBVs permit a 4-yr reduction in cycle time, which led to at least a doubling and trebling genetic gain for persistency and yield, respectively, than the traditional program. However, a higher rate of inbreeding per cycle among varieties was also observed for the GS strategy.
- Published
- 2023
- Full Text
- View/download PDF
3. Evaluation of IMproving Palliative care Education and Training Using Simulation in Dementia (IMPETUS-D) a staff simulation training intervention to improve palliative care of people with advanced dementia living in nursing homes: a cluster randomised cont
- Author
-
Tropea, J, Nestel, D, Johnson, C, Hayes, BJ, Hutchinson, Ana, Brand, C, Le, BH, Blackberry, I, Caplan, GA, Bicknell, R, Hepworth, G, Lim, WK, Tropea, J, Nestel, D, Johnson, C, Hayes, BJ, Hutchinson, Ana, Brand, C, Le, BH, Blackberry, I, Caplan, GA, Bicknell, R, Hepworth, G, and Lim, WK
- Abstract
BACKGROUND: People with dementia have unique palliative and end-of-life needs. However, access to quality palliative and end-of-life care for people with dementia living in nursing homes is often suboptimal. There is a recognised need for nursing home staff training in dementia-specific palliative care to equip them with knowledge and skills to deliver high quality care. OBJECTIVE: The primary aim was to evaluate the effectiveness of a simulation training intervention (IMPETUS-D) aimed at nursing home staff on reducing unplanned transfers to hospital and/or deaths in hospital among residents living with dementia. DESIGN: Cluster randomised controlled trial of nursing homes with process evaluation conducted alongside. SUBJECTS & SETTING: One thousand three hundred four people with dementia living in 24 nursing homes (12 intervention/12 control) in three Australian cities, their families and direct care staff. METHODS: Randomisation was conducted at the level of the nursing home (cluster). The allocation sequence was generated by an independent statistician using a computer-generated allocation sequence. Staff from intervention nursing homes had access to the IMPETUS-D training intervention, and staff from control nursing homes had access to usual training opportunities. The predicted primary outcome measure was a 20% reduction in the proportion of people with dementia who had an unplanned transfer to hospital and/or death in hospital at 6-months follow-up in the intervention nursing homes compared to the control nursing homes. RESULTS: At 6-months follow-up, 128 (21.1%) people with dementia from the intervention group had an unplanned transfer or death in hospital compared to 132 (19.0%) residents from the control group; odds ratio 1.14 (95% CI, 0.82-1.59). There were suboptimal levels of staff participation in the training intervention and several barriers to participation identified. CONCLUSION: This study of a dementia-specific palliative care staff training i
- Published
- 2022
4. Evaluation of IMproving Palliative care Education and Training Using Simulation in Dementia (IMPETUS-D) a staff simulation training intervention to improve palliative care of people with advanced dementia living in nursing homes: a cluster randomised controlled trial
- Author
-
Tropea, J, Nestel, D, Johnson, C, Hayes, BJ, Hutchinson, AF, Brand, C, Le, BH, Blackberry, I, Caplan, GA, Bicknell, R, Hepworth, G, Lim, WK, Tropea, J, Nestel, D, Johnson, C, Hayes, BJ, Hutchinson, AF, Brand, C, Le, BH, Blackberry, I, Caplan, GA, Bicknell, R, Hepworth, G, and Lim, WK
- Abstract
BACKGROUND: People with dementia have unique palliative and end-of-life needs. However, access to quality palliative and end-of-life care for people with dementia living in nursing homes is often suboptimal. There is a recognised need for nursing home staff training in dementia-specific palliative care to equip them with knowledge and skills to deliver high quality care. OBJECTIVE: The primary aim was to evaluate the effectiveness of a simulation training intervention (IMPETUS-D) aimed at nursing home staff on reducing unplanned transfers to hospital and/or deaths in hospital among residents living with dementia. DESIGN: Cluster randomised controlled trial of nursing homes with process evaluation conducted alongside. SUBJECTS & SETTING: One thousand three hundred four people with dementia living in 24 nursing homes (12 intervention/12 control) in three Australian cities, their families and direct care staff. METHODS: Randomisation was conducted at the level of the nursing home (cluster). The allocation sequence was generated by an independent statistician using a computer-generated allocation sequence. Staff from intervention nursing homes had access to the IMPETUS-D training intervention, and staff from control nursing homes had access to usual training opportunities. The predicted primary outcome measure was a 20% reduction in the proportion of people with dementia who had an unplanned transfer to hospital and/or death in hospital at 6-months follow-up in the intervention nursing homes compared to the control nursing homes. RESULTS: At 6-months follow-up, 128 (21.1%) people with dementia from the intervention group had an unplanned transfer or death in hospital compared to 132 (19.0%) residents from the control group; odds ratio 1.14 (95% CI, 0.82-1.59). There were suboptimal levels of staff participation in the training intervention and several barriers to participation identified. CONCLUSION: This study of a dementia-specific palliative care staff training inter
- Published
- 2022
5. Healthcare providers' experiences with advance care planning and goals of patient care medical treatment orders in residential aged care facilities: an explanatory descriptive study
- Author
-
Martin, RS, Hayes, BJ, Hutchinson, A, Yates, P, Lim, WK, Martin, RS, Hayes, BJ, Hutchinson, A, Yates, P, and Lim, WK
- Abstract
BACKGROUND: Advance care planning (ACP) is a process by which people communicate their healthcare preferences and values, planning for a time when they are unable to voice them. Within residential aged care facilities (RACF), both the completion and the clarity of ACP documents are varied and, internationally, medical treatment orders have been used to address these issues. AIMS: In this study, goals of patient care (GOPC) medical treatment orders were introduced alongside usual ACP in three RACF to improve healthcare decision-making for residents. This study explored the experiences of RACF healthcare providers with ACP and GOPC medical treatment orders. METHODS: The study used an explanatory descriptive approach. Within three RACF where the GOPC medical treatment orders had been introduced, focus groups and interviews with healthcare providers were performed. The transcribed interviews were analysed thematically. RESULTS: Healthcare providers not only reported support for ACP and GOPC but also discussed many problematic issues. Analysis of the data identified four main themes: enablers, barriers, resident autonomy and advance documentation (ACP and GOPC). CONCLUSION: Healthcare providers identified ACP and GOPC as positive tools for assisting with medical decision-making for residents. Although barriers exist in completion and activation of plans, healthcare providers described them as progressing resident-centred care. Willingness to follow ACP instructions was reported to be reduced by lack of trust by clinicians. Families were also reported to change their views from those documented in family-completed ACP, attributed to poor understanding of their purpose. Participants reported that GOPC led to clearer documentation of residents' medical treatment plans rather than relying on ACP documents alone.
- Published
- 2022
6. Additional file 1 of CELPI: trial protocol for a randomised controlled trial of a Carer End of Life Planning Intervention in people dying with dementia
- Author
-
Arendts, G, Chenoweth, L, Hayes, BJ, Campbell, E, Agar, M, Etherton-Beer, C, Spilsbury, K, Howard, K, Braitberg, G, Cubitt, M, Sheehan, C, Magann, L, Sudharshan, T, Schnitker, LM, Pearce, J, Gilmore, I, Cerra, N, duPreez, J, Jaworski, R, Soh, S-C, and Celenza, A
- Abstract
Additional file 1. Carer Needs Assessment.
- Published
- 2022
- Full Text
- View/download PDF
7. Evolution of tissue and developmental specificity of transcription start sites in Bos taurus indicus
- Author
-
Forutan, M, Ross, E, Chamberlain, AJ, Nguyen, L, Mason, B, Moore, S, Garner, JB, Xiang, R, Hayes, BJ, Forutan, M, Ross, E, Chamberlain, AJ, Nguyen, L, Mason, B, Moore, S, Garner, JB, Xiang, R, and Hayes, BJ
- Abstract
To further the understanding of the evolution of transcriptional regulation, we profiled genome-wide transcriptional start sites (TSSs) in two sub-species, Bos taurus taurus and Bos taurus indicus, that diverged approximately 500,000 years ago. Evolutionary and developmental-stage differences in TSSs were detected across the sub-species, including translocation of dominant TSS and changes in TSS distribution. The 16% of all SNPs located in significant differentially used TSS clusters across sub-species had significant shifts in allele frequency (472 SNPs), indicating they may have been subject to selection. In spleen and muscle, a higher relative TSS expression was observed in Bos indicus than Bos taurus for all heat shock protein genes, which may be responsible for the tropical adaptation of Bos indicus.
- Published
- 2021
8. Gene expression of the heat stress response in bovine peripheral white blood cells and milk somatic cells in vivo
- Author
-
Garner, JB, Chamberlain, AJ, Vander Jagt, C, Nguyen, TTT, Mason, BA, Marett, LC, Leury, BJ, Wales, WJ, Hayes, BJ, Garner, JB, Chamberlain, AJ, Vander Jagt, C, Nguyen, TTT, Mason, BA, Marett, LC, Leury, BJ, Wales, WJ, and Hayes, BJ
- Abstract
Heat stress in dairy cattle leads to reduction in feed intake and milk production as well as the induction of many physiological stress responses. The genes implicated in the response to heat stress in vivo are not well characterised. With the aim of identifying such genes, an experiment was conducted to perform differential gene expression in peripheral white blood cells and milk somatic cells in vivo in 6 Holstein Friesian cows in thermoneutral conditions and in 6 Holstein Friesian cows exposed to a short-term moderate heat challenge. RNA sequences from peripheral white blood cells and milk somatic cells were used to quantify full transcriptome gene expression. Genes commonly differentially expressed (DE) in both the peripheral white blood cells and in milk somatic cells were associated with the cellular stress response, apoptosis, oxidative stress and glucose metabolism. Genes DE in peripheral white blood cells of cows exposed to the heat challenge compared to the thermoneutral control were related to inflammation, lipid metabolism, carbohydrate metabolism and the cardiovascular system. Genes DE in milk somatic cells compared to the thermoneutral control were involved in the response to stress, thermoregulation and vasodilation. These findings provide new insights into the cellular adaptations induced during the response to short term moderate heat stress in dairy cattle and identify potential candidate genes (BDKRB1 and SNORA19) for future research.
- Published
- 2020
9. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits
- Author
-
Xiang, R, van den Berg, I, MacLeod, IM, Hayes, BJ, Prowse-Wilkins, CP, Wang, M, Bolormaa, S, Liu, Z, Rochfort, SJ, Reich, CM, Mason, BA, Vander Jagt, CJ, Daetwyler, HD, Lund, MS, Chamberlain, AJ, Goddard, ME, Xiang, R, van den Berg, I, MacLeod, IM, Hayes, BJ, Prowse-Wilkins, CP, Wang, M, Bolormaa, S, Liu, Z, Rochfort, SJ, Reich, CM, Mason, BA, Vander Jagt, CJ, Daetwyler, HD, Lund, MS, Chamberlain, AJ, and Goddard, ME
- Abstract
Many genome variants shaping mammalian phenotype are hypothesized to regulate gene transcription and/or to be under selection. However, most of the evidence to support this hypothesis comes from human studies. Systematic evidence for regulatory and evolutionary signals contributing to complex traits in a different mammalian model is needed. Sequence variants associated with gene expression (expression quantitative trait loci [eQTLs]) and concentration of metabolites (metabolic quantitative trait loci [mQTLs]) and under histone-modification marks in several tissues were discovered from multiomics data of over 400 cattle. Variants under selection and evolutionary constraint were identified using genome databases of multiple species. These analyses defined 30 sets of variants, and for each set, we estimated the genetic variance the set explained across 34 complex traits in 11,923 bulls and 32,347 cows with 17,669,372 imputed variants. The per-variant trait heritability of these sets across traits was highly consistent (r > 0.94) between bulls and cows. Based on the per-variant heritability, conserved sites across 100 vertebrate species and mQTLs ranked the highest, followed by eQTLs, young variants, those under histone-modification marks, and selection signatures. From these results, we defined a Functional-And-Evolutionary Trait Heritability (FAETH) score indicating the functionality and predicted heritability of each variant. In additional 7,551 cattle, the high FAETH-ranking variants had significantly increased genetic variances and genomic prediction accuracies in 3 production traits compared to the low FAETH-ranking variants. The FAETH framework combines the information of gene regulation, evolution, and trait heritability to rank variants, and the publicly available FAETH data provide a set of biological priors for cattle genomic selection worldwide.
- Published
- 2019
10. Overlap between eQTL and QTL associated with production traits and fertility in dairy cattle
- Author
-
van den Berg, I, Hayes, BJ, Chamberlain, AJ, Goddard, ME, van den Berg, I, Hayes, BJ, Chamberlain, AJ, and Goddard, ME
- Abstract
Background Identifying causative mutations or genes through which quantitative trait loci (QTL) act has proven very difficult. Using information such as gene expression may help to identify genes and mutations underlying QTL. Our objective was to identify regions associated both with production traits or fertility and with gene expression, in dairy cattle. We used three different approaches to discover QTL that are also expression QTL (eQTL): 1) estimate the correlation between local genomic estimated breeding values (GEBV) and gene expression, 2) investigate whether the 300 intervals explaining most genetic variance for a trait contain more eQTL than 300 randomly selected intervals, and 3) a colocalisation analysis. Phenotypes and genotypes up to sequence level of 35,775 dairy bulls and cows were used for QTL mapping, and gene expression and genotypes of 131 cows were used to identify eQTL. Results With all three approaches, we identified some overlap between eQTL and QTL, though the majority of QTL in our dataset did not seem to be eQTL. The most significant associations between QTL and eQTL were found for intervals on chromosome 18, where local GEBV for all traits showed a strong association with the expression of the FUK and DDX19B. Intervals whose local GEBV for a trait correlated highly significantly with the expression of a nearby gene explained only a very small part of the genetic variance for that trait. It is likely that part of these correlations were due to linkage disequilibrium (LD) in the interval. While the 300 intervals explaining most genetic variance explained most of the GEBV variance, they contained only slightly more eQTL than 300 randomly selected intervals that explained a minimal portion of the GEBV variance. Furthermore, some variants showed a high colocalisation probability, but this was only the case for few variants. Conclusions Several reasons may have contributed to the low level of overlap between QTL and eQTL detected in our study
- Published
- 2019
11. Fine-mapping sequence mutations with a major effect on oligosaccharide content in bovine milk.
- Author
-
Liu, Z, Wang, T, Pryce, JE, MacLeod, IM, Hayes, BJ, Chamberlain, AJ, Jagt, CV, Reich, CM, Mason, BA, Rochfort, S, Cocks, BG, Liu, Z, Wang, T, Pryce, JE, MacLeod, IM, Hayes, BJ, Chamberlain, AJ, Jagt, CV, Reich, CM, Mason, BA, Rochfort, S, and Cocks, BG
- Abstract
Human milk contains abundant oligosaccharides (OS) which are believed to have strong health benefits for neonates. OS are a minor component of bovine milk and little is known about how the production of OS is regulated in the bovine mammary gland. We have measured the abundance of 12 major OS in milk of 360 cows, which had high density SNP marker genotypes. Most of the OS were found to be highly heritable (h2 between 50 and 84%). A genome-wide association study allowed us to fine-map several QTL and identify candidate genes with major effects on five OS. Among them, a putative causal mutation close to the ABO gene on Chromosome 11 accounted for approximately 80% of genetic variance for two OS, N-acetylgalactosaminyllactose and lacto-N-neotetraose. This mutation lies very close to a variant associated with the expression levels of ABO. A third QTL mapped close to ST3GAL6 on Chromosome 1 explaining 33% of genetic variation of an abundant OS, 3'-sialyllactose. The presence of major gene effects suggests that targeted marker-assisted selection would lead to a significant increase in the level of these OS in milk. This is the first attempt to map candidate genes and causal mutations for bovine milk OS.
- Published
- 2019
12. A multi-trait Bayesian method for mapping QTL and genomic prediction
- Author
-
Kemper, KE, Bowman, PJ, Hayes, BJ, Visscher, PM, Goddard, ME, Kemper, KE, Bowman, PJ, Hayes, BJ, Visscher, PM, and Goddard, ME
- Abstract
BACKGROUND: Genomic prediction and quantitative trait loci (QTL) mapping typically analyze one trait at a time but this may ignore the possibility that one polymorphism affects multiple traits. The aim of this study was to develop a multivariate Bayesian approach that could be used for simultaneously elucidating genetic architecture, QTL mapping, and genomic prediction. Our approach uses information from multiple traits to divide markers into 'unassociated' (no association with any trait) and 'associated' (associated with one or more traits). The effect of associated markers is estimated independently for each trait to avoid the assumption that QTL effects follow a multi-variate normal distribution. RESULTS: Using simulated data, our multivariate method (BayesMV) detected a larger number of true QTL (with a posterior probability > 0.9) and increased the accuracy of genomic prediction compared to an equivalent univariate method (BayesR). With real data, accuracies of genomic prediction in validation sets for milk yield traits with high-density genotypes were approximately equal to those from equivalent single-trait methods. BayesMV tended to select a similar number of single nucleotide polymorphisms (SNPs) per trait for genomic prediction compared to BayesR (i.e. those with non-zero effects), but BayesR selected different sets of SNPs for each trait, whereas BayesMV selected a common set of SNPs across traits. Despite these two dramatically different estimates of genetic architecture (i.e. different SNPs affecting each trait vs. pleiotropic SNPs), both models indicated that 3000 to 4000 SNPs are associated with a trait. The BayesMV approach may be advantageous when the aim is to develop a low-density SNP chip that works well for a number of traits. SNPs for milk yield traits identified by BayesMV and BayesR were also found to be associated with detailed milk composition. CONCLUSIONS: The BayesMV method simultaneously estimates the proportion of SNPs that are associat
- Published
- 2018
13. Putative bovine topological association domains and CTCF binding motifs can reduce the search space for causative regulatory variants of complex traits
- Author
-
Wang, M, Hancock, TP, Chamberlain, AJ, Jagt, CJV, Pryce, JE, Cocks, BG, Goddard, ME, Hayes, BJ, Wang, M, Hancock, TP, Chamberlain, AJ, Jagt, CJV, Pryce, JE, Cocks, BG, Goddard, ME, and Hayes, BJ
- Abstract
BACKGROUND: Topological association domains (TADs) are chromosomal domains characterised by frequent internal DNA-DNA interactions. The transcription factor CTCF binds to conserved DNA sequence patterns called CTCF binding motifs to either prohibit or facilitate chromosomal interactions. TADs and CTCF binding motifs control gene expression, but they are not yet well defined in the bovine genome. In this paper, we sought to improve the annotation of bovine TADs and CTCF binding motifs, and assess whether the new annotation can reduce the search space for cis-regulatory variants. RESULTS: We used genomic synteny to map TADs and CTCF binding motifs from humans, mice, dogs and macaques to the bovine genome. We found that our mapped TADs exhibited the same hallmark properties of those sourced from experimental data, such as housekeeping genes, transfer RNA genes, CTCF binding motifs, short interspersed elements, H3K4me3 and H3K27ac. We showed that runs of genes with the same pattern of allele-specific expression (ASE) (either favouring paternal or maternal allele) were often located in the same TAD or between the same conserved CTCF binding motifs. Analyses of variance showed that when averaged across all bovine tissues tested, TADs explained 14% of ASE variation (standard deviation, SD: 0.056), while CTCF explained 27% (SD: 0.078). Furthermore, we showed that the quantitative trait loci (QTLs) associated with gene expression variation (eQTLs) or ASE variation (aseQTLs), which were identified from mRNA transcripts from 141 lactating cows' white blood and milk cells, were highly enriched at putative bovine CTCF binding motifs. The linearly-furthermost, and most-significant aseQTL and eQTL for each genic target were located within the same TAD as the gene more often than expected (Chi-Squared test P-value < 0.001). CONCLUSIONS: Our results suggest that genomic synteny can be used to functionally annotate conserved transcriptional components, and provides a tool to reduce t
- Published
- 2018
14. Genome variants associated with RNA splicing variations in bovine are extensive shared between tissues
- Author
-
Xiang, R, Hayes, BJ, Vander Jagt, CJ, MacLeod, IM, Khansefid, M, Bowman, PJ, Yuan, Z, Prowse-Wilkins, CP, Reich, CM, Mason, BA, Garner, JB, Marett, LC, Chen, Y, Bolormaa, S, Daetwyler, HD, Chamberlain, AJ, Goddard, ME, Xiang, R, Hayes, BJ, Vander Jagt, CJ, MacLeod, IM, Khansefid, M, Bowman, PJ, Yuan, Z, Prowse-Wilkins, CP, Reich, CM, Mason, BA, Garner, JB, Marett, LC, Chen, Y, Bolormaa, S, Daetwyler, HD, Chamberlain, AJ, and Goddard, ME
- Abstract
BACKGROUND: Mammalian phenotypes are shaped by numerous genome variants, many of which may regulate gene transcription or RNA splicing. To identify variants with regulatory functions in cattle, an important economic and model species, we used sequence variants to map a type of expression quantitative trait loci (expression QTLs) that are associated with variations in the RNA splicing, i.e., sQTLs. To further the understanding of regulatory variants, sQTLs were compare with other two types of expression QTLs, 1) variants associated with variations in gene expression, i.e., geQTLs and 2) variants associated with variations in exon expression, i.e., eeQTLs, in different tissues. RESULTS: Using whole genome and RNA sequence data from four tissues of over 200 cattle, sQTLs identified using exon inclusion ratios were verified by matching their effects on adjacent intron excision ratios. sQTLs contained the highest percentage of variants that are within the intronic region of genes and contained the lowest percentage of variants that are within intergenic regions, compared to eeQTLs and geQTLs. Many geQTLs and sQTLs are also detected as eeQTLs. Many expression QTLs, including sQTLs, were significant in all four tissues and had a similar effect in each tissue. To verify such expression QTL sharing between tissues, variants surrounding (±1 Mb) the exon or gene were used to build local genomic relationship matrices (LGRM) and estimated genetic correlations between tissues. For many exons, the splicing and expression level was determined by the same cis additive genetic variance in different tissues. Thus, an effective but simple-to-implement meta-analysis combining information from three tissues is introduced to increase power to detect and validate sQTLs. sQTLs and eeQTLs together were more enriched for variants associated with cattle complex traits, compared to geQTLs. Several putative causal mutations were identified, including an sQTL at Chr6:87392580 within the 5th exon
- Published
- 2018
15. Genomewide association study of methane emissions in angus beef cattle with validation in dairy cattle
- Author
-
Manzanilla-Pech, CIV, De Haas, Y, Hayes, BJ, Veerkamp, RF, Khansefid, M, Donoghue, KA, Arthur, PF, and Pryce, Jennie
- Subjects
Feed intake ,WIAS ,Fokkerij & Genomica ,Residual methane ,Methane production ,Animal Breeding & Genomics ,Uncategorized - Abstract
Methane (CH4) is a product of enteric fermentation in ruminants, and it represents around 17% of global CH4 emissions. There has been substantial effort from the livestock scientific community toward tools that can help reduce this percentage. One approach is to select for lower emitting animals. To achieve this, accurate genetic parameters and identification of the genomic basis of CH4 traits are required. Therefore, the objectives of this study were 1) to perform a genomewide association study to identify SNP associated with several CH4 traits in Angus beef cattle (1,020 animals) and validate them in a lactating Holstein population (population 1 [POP1]; 205 animals); 2) to validate significant SNP for DMI and weight at test (WT) from a second Holstein population, from a previous study (population 2 [POP2]; 903 animals), in an Angus population; and 3) to evaluate 2 different residual CH4 traits and determine if the genes associated with CH4 also control residual CH4 traits. Phenotypes calculated for the genotyped Angus population included CH4 production (MeP), CH4 yield (MeY), CH4 intensity (MI), DMI, and WT. The Holstein population (POP1) was multiparous, with phenotypes on CH4 traits (MeP, MeY, and MI) plus genotypes. Additionally, 2 CH4 traits, residual genetic CH4 (RGM) and residual phenotypic CH4 (RPM), were calculated by adjusting MeP for DMI and WT. Estimated heritabilities in the Angus population were 0.30, 0.19, and 0.15 for MeP, RGM, and RPM, respectively, and genetic correlations of MeP with DMI and WT were 0.83 and 0.80, respectively. Estimated heritabilities in Holstein POP1 were 0.23, 0.30, and 0.42 for MeP, MeY, and MI, respectively. Strong associations with MeP were found on chromosomes 4, 12, 14, 20, and 30 at P < 0.001, and those chromosomes also had significant SNP for DMI in Holstein POP1. In the Angus population, the number of significant SNP for MeP at P < 0.005 was 3,304, and approximately 630 of those SNP also were important for DMI and WT. When a set (approximately 3,300) of significant SNP for DMI and WT in the Angus population was used to estimate genetic parameters for MeP and MeY in Holstein POP1, the genetic variance and, consequently, the heritability slightly increased, meaning that most of the genetic variation is largely captured by these SNP. Residual traits could be a good option to include in the breeding goal, as this would facilitate selection for lower emitting animals without compromising DMI and WT.
- Published
- 2016
16. Differentially Expressed Genes in Endometrium and Corpus Luteum of Holstein Cows Selected for High and Low Fertility Are Enriched for Sequence Variants Associated with Fertility
- Author
-
Moore, SG, Pryce, Jennie, Hayes, BJ, Chamberlain, Amanda, Kemper, KE, Berry, DP, McCabe, M, Cormican, P, Lonergan, P, Fair, T, Butler, ST, Department of Agriculture, Food and the Marine, Ireland, Teagasc Walsh Fellowship Programme, National Development Plan, Irish Dairy Levy Research Trust, and 13/S/528
- Subjects
Trnscriptomics ,Corpus luteum ,Endometrium ,Fertility ,Genome-wide association ,Differentially expressed genes ,Variants ,Genetic merit ,Uncategorized - Abstract
peer-reviewed Despite the importance of fertility in humans and livestock, there has been little success dissecting the genetic basis of fertility. Our hypothesis was that genes differentially expressed in the endometrium and corpus luteum on Day 13 of the estrous cycle between cows with either good or poor genetic merit for fertility would be enriched for genetic variants associated with fertility. We combined a unique genetic model of fertility (cattle that have been selected for high and low fertility and show substantial difference in fertility) with gene expression data from these cattle and genome-wide association study (GWAS) results in ∼20 000 cattle to identify quantitative trait loci (QTL) regions and sequence variants associated with genetic variation in fertility. Two hundred and forty-five QTL regions and 17 sequence variants associated primarily with prostaglandin F2alpha, steroidogenesis, mRNA processing, energy status, and immune-related processes were identified. Ninety-three of the QTL regions were validated by two independent GWAS, with signals for fertility detected primarily on chromosomes 18, 5, 7, 8, and 29. Plausible causative mutations were identified, including one missense variant significantly associated with fertility and predicted to affect the protein function of EIF4EBP3. The results of this study enhance our understanding of 1) the contribution of the endometrium and corpus luteum transcriptome to phenotypic fertility differences and 2) the genetic architecture of fertility in dairy cattle. Including these variants in predictions of genomic breeding values may improve the rate of genetic gain for this critical trait.
- Published
- 2015
17. Occurrence of Infected Amoebae in Cooling Towers Compared with Natural Aquatic Environments: Implications for Emerging Pathogens
- Author
-
Sharon G. Berk, Anthony L. Farone, Farsian M, Uddin N, Williams El, Gunderson Jh, Anthony L. Newsome, Johnson Ra, Skimmyhorn J, Mary B. Farone, Hayes Bj, Reid A, and Redding Ks
- Subjects
Legionella ,Molecular Sequence Data ,Fresh Water ,chemical and pharmacologic phenomena ,Human pathogen ,macromolecular substances ,Natural (archaeology) ,Legionella pneumophila ,Odds Ratio ,Animals ,Environmental Chemistry ,Air Conditioning ,Cooling tower ,Amoeba ,DNA Primers ,Base Sequence ,biology ,Ecology ,Aquatic ecosystem ,Computational Biology ,Sequence Analysis, DNA ,General Chemistry ,Hydrogen-Ion Concentration ,biochemical phenomena, metabolism, and nutrition ,biology.organism_classification ,Tennessee ,Carbon ,respiratory tract diseases ,Logistic Models ,Aquatic environment ,Protozoa ,Water Microbiology ,Bacteria ,Environmental Monitoring ,circulatory and respiratory physiology - Abstract
Many species of bacteria pathogenic to humans, such as Legionella, are thought to have evolved in association with amoebal hosts. Several novel unculturable bacteria related to Legionella have also been found in amoebae, a few of which have been thought to be causes of nosocomial infections in humans. Because amoebae can be found in cooling towers, we wanted to know whether cooling tower environments might enhance the association between amoebae and bacterial pathogens of amoebae in order to identify potential "hot spots" for emerging human pathogens. To compare occurrence of infected amoebae in natural environments with those in cooling towers, 40 natural aquatic environments and 40 cooling tower samples were examined. Logistic regression analysis determined variables that were significant predictors of the occurrence of infected amoebae, which were found in 22 of 40 cooling tower samples but in only 3 of the 40 natural samples. An odds ratio showed that it is over 16 times more likely to encounter infected amoebae in cooling towers than in natural environments. Environmental data from cooling towers and natural habitats combined revealed dissolved organic carbon (DOC) and pH were predictors of the occurrence of the pathogens, however, when cooling tower data alone were analyzed, no variables accounted for the occurrence. Several bacteria have novel rRNA sequences, and most strains were not culturable outside of amoebae. Such pathogens of amoebae may spread to the environment via aerosols from cooling towers. Studies of emerging infectious diseases should strongly consider cooling towers as a source of amoeba-associated pathogens.
- Published
- 2006
- Full Text
- View/download PDF
18. Multi-breed genomic prediction using Bayes R with sequence data and dropping variants with a small effect
- Author
-
van den Berg, I, Bowman, PJ, MacLeod, IM, Hayes, BJ, Wang, T, Bolormaa, S, Goddard, ME, van den Berg, I, Bowman, PJ, MacLeod, IM, Hayes, BJ, Wang, T, Bolormaa, S, and Goddard, ME
- Abstract
BACKGROUND: The increasing availability of whole-genome sequence data is expected to increase the accuracy of genomic prediction. However, results from simulation studies and analysis of real data do not always show an increase in accuracy from sequence data compared to high-density (HD) single nucleotide polymorphism (SNP) chip genotypes. In addition, the sheer number of variants makes analysis of all variants and accurate estimation of all effects computationally challenging. Our objective was to find a strategy to approximate the analysis of whole-sequence data with a Bayesian variable selection model. Using a simulated dataset, we applied a Bayes R hybrid model to analyse whole-sequence data, test the effect of dropping a proportion of variants during the analysis, and test how the analysis can be split into separate analyses per chromosome to reduce the elapsed computing time. We also investigated the effect of imputation errors on prediction accuracy. Subsequently, we applied the approach to a dataset that contained imputed sequences and records for production and fertility traits for 38,492 Holstein, Jersey, Australian Red and crossbred bulls and cows. RESULTS: With the simulated dataset, we found that prediction accuracy was highly increased for a breed that was not represented in the training population for sequence data compared to HD SNP data. Either dropping part of the variants during the analysis or splitting the analysis into separate analyses per chromosome decreased accuracy compared to analysing whole-sequence data. First, dropping variants from each chromosome and reanalysing the retained variants together resulted in an accuracy similar to that obtained when analysing whole-sequence data. Adding imputation errors decreased prediction accuracy, especially for errors in the validation population. With real data, using sequence variants resulted in accuracies that were similar to those obtained with the HD SNPs. CONCLUSIONS: We present an efficient
- Published
- 2017
19. Genomic Selection Improves Heat Tolerance in Dairy Cattle (vol 6, 34114, 2017)
- Author
-
Garner, JB, Douglas, ML, Williams, SRO, Wales, WJ, Marett, LC, Nguyen, TTT, Reich, CM, Hayes, BJ, Garner, JB, Douglas, ML, Williams, SRO, Wales, WJ, Marett, LC, Nguyen, TTT, Reich, CM, and Hayes, BJ
- Abstract
Scientific Reports 6: Article number: 34114; published online: 29 September 2016; updated: 19 January 2017. The original version of this Article omitted an affiliation for J. B. Garner. The correct affiliations are listed below: Agriculture Victoria, Department of Economic Development, Jobs, Transport and Resources, 1301 Hazeldean Road, Ellinbank, Victoria 3821, Australia.
- Published
- 2017
20. Application of a Bayesian non-linear model hybrid scheme to sequence data for genomic prediction and QTL mapping
- Author
-
Wang, T, Chen, Y-PP, MacLeod, IM, Pryce, JE, Goddard, ME, Hayes, BJ, Wang, T, Chen, Y-PP, MacLeod, IM, Pryce, JE, Goddard, ME, and Hayes, BJ
- Abstract
BACKGROUND: Using whole genome sequence data might improve genomic prediction accuracy, when compared with high-density SNP arrays, and could lead to identification of casual mutations affecting complex traits. For some traits, the most accurate genomic predictions are achieved with non-linear Bayesian methods. However, as the number of variants and the size of the reference population increase, the computational time required to implement these Bayesian methods (typically with Monte Carlo Markov Chain sampling) becomes unfeasibly long. RESULTS: Here, we applied a new method, HyB_BR (for Hybrid BayesR), which implements a mixture model of normal distributions and hybridizes an Expectation-Maximization (EM) algorithm followed by Markov Chain Monte Carlo (MCMC) sampling, to genomic prediction in a large dairy cattle population with imputed whole genome sequence data. The imputed whole genome sequence data included 994,019 variant genotypes of 16,214 Holstein and Jersey bulls and cows. Traits included fat yield, milk volume, protein kg, fat% and protein% in milk, as well as fertility and heat tolerance. HyB_BR achieved genomic prediction accuracies as high as the full MCMC implementation of BayesR, both for predicting a validation set of Holstein and Jersey bulls (multi-breed prediction) and a validation set of Australian Red bulls (across-breed prediction). HyB_BR had a ten fold reduction in compute time, compared with the MCMC implementation of BayesR (48 hours versus 594 hours). We also demonstrate that in many cases HyB_BR identified sequence variants with a high posterior probability of affecting the milk production or fertility traits that were similar to those identified in BayesR. For heat tolerance, both HyB_BR and BayesR found variants in or close to promising candidate genes associated with this trait and not detected by previous studies. CONCLUSIONS: The results demonstrate that HyB_BR is a feasible method for simultaneous genomic prediction and QTL mapping
- Published
- 2017
21. Haplotypes of Single Nucleotide Polymorphisms (SNPs) in casein loci of Girgentana goats explain isoelectrofocusing results
- Author
-
SARDINA, Maria Teresa, FINOCCHIARO, Raffaella, PORTOLANO, Baldassare, Hayes, BJ, van Kaam, JBCHM, Sardina, MT, Hayes, BJ, Finocchiaro, R, van Kaam, JBCHM, and Portolano, B
- Subjects
Settore AGR/17 - Zootecnica Generale E Miglioramento Genetico ,Girgentana goat, SNPs, casein loci - Abstract
Goat milk, cheese and yoghurt offer an alternative to cow milk products. In Sicily one of the important goat breeds reared is the Girgentana. Traditionally the milk from Girgentana goats was used for nourishment of infants and elderly. Recently, the Girgentana breed has been in decline. This of concern, as the Girgentana may carry unique milk protein genetic variants. If health benefits of Girgentana goat milk is proven, then increasing production from this population will be of great interest. The most abundant proteins in goat milk, as in other milks, are the caseins, αS1-, β-, αS2- and κ-casein, coded by the loci CSN1S1, CSN2, CSN1S2 and CSN3 respectively. In other goat breeds, the casein loci have been found to be highly polymorphic, and a number of genetic variants of the casein genes that affect milk production traits have been described. The aim of this study was to quantify the variation of casein loci within the Girgentana breed, and additionally to determine if there was any agreement between DNA polymorphism and isoelectrofocusing (IEF) protein expression of αS1. Forty individuals including goats and bucks were genotyped for 9 SNPs and one deletion (exon 9) in CSN1S1, 7 SNPs in CSN2, 3 SNPs in CSN1S2 and 13 SNPs in CSN3. Genotypes for the deletion in exon 9 were in very good agreement with the IEF, results; goats that were homozygous for this deletion had no or very low levels of αS1 expression, goats that were heterozygous had intermediate levels of expression, and goats that were homozygous for the non deletion allele had full αS1 expression. For each casein, haplotypes of the SNP genotypes were constructed. There were a limited number of haplotypes within each casein locus; 6 for CSN1S1 and CSN2, 3 for CSN1S2, and 7 for CSN3. The limited number of haplotypes indicates strong linkage disequilibrium between SNPs within each locus. Future work will determine if the haplotypes have significant effects on health and production traits, and their potential for use in marker assisted selection.
- Published
- 2006
22. Genomic Selection Improves Heat Tolerance in Dairy Cattle
- Author
-
Garner, JB, Douglas, ML, Williams, SRO, Wales, WJ, Marett, LC, Nguyen, TTT, Reich, CM, Hayes, BJ, Garner, JB, Douglas, ML, Williams, SRO, Wales, WJ, Marett, LC, Nguyen, TTT, Reich, CM, and Hayes, BJ
- Abstract
Dairy products are a key source of valuable proteins and fats for many millions of people worldwide. Dairy cattle are highly susceptible to heat-stress induced decline in milk production, and as the frequency and duration of heat-stress events increases, the long term security of nutrition from dairy products is threatened. Identification of dairy cattle more tolerant of heat stress conditions would be an important progression towards breeding better adapted dairy herds to future climates. Breeding for heat tolerance could be accelerated with genomic selection, using genome wide DNA markers that predict tolerance to heat stress. Here we demonstrate the value of genomic predictions for heat tolerance in cohorts of Holstein cows predicted to be heat tolerant and heat susceptible using controlled-climate chambers simulating a moderate heatwave event. Not only was the heat challenge stimulated decline in milk production less in cows genomically predicted to be heat-tolerant, physiological indicators such as rectal and intra-vaginal temperatures had reduced increases over the 4 day heat challenge. This demonstrates that genomic selection for heat tolerance in dairy cattle is a step towards securing a valuable source of nutrition and improving animal welfare facing a future with predicted increases in heat stress events.
- Published
- 2016
23. Exploiting biological priors and sequence variants enhances QTL discovery and genomic prediction of complex traits
- Author
-
MacLeod, IM, Bowman, PJ, Vander Jagt, CJ, Haile-Mariam, M, Kemper, KE, Chamberlain, AJ, Schrooten, C, Hayes, BJ, Goddard, ME, MacLeod, IM, Bowman, PJ, Vander Jagt, CJ, Haile-Mariam, M, Kemper, KE, Chamberlain, AJ, Schrooten, C, Hayes, BJ, and Goddard, ME
- Abstract
BACKGROUND: Dense SNP genotypes are often combined with complex trait phenotypes to map causal variants, study genetic architecture and provide genomic predictions for individuals with genotypes but no phenotype. A single method of analysis that jointly fits all genotypes in a Bayesian mixture model (BayesR) has been shown to competitively address all 3 purposes simultaneously. However, BayesR and other similar methods ignore prior biological knowledge and assume all genotypes are equally likely to affect the trait. While this assumption is reasonable for SNP array genotypes, it is less sensible if genotypes are whole-genome sequence variants which should include causal variants. RESULTS: We introduce a new method (BayesRC) based on BayesR that incorporates prior biological information in the analysis by defining classes of variants likely to be enriched for causal mutations. The information can be derived from a range of sources, including variant annotation, candidate gene lists and known causal variants. This information is then incorporated objectively in the analysis based on evidence of enrichment in the data. We demonstrate the increased power of BayesRC compared to BayesR using real dairy cattle genotypes with simulated phenotypes. The genotypes were imputed whole-genome sequence variants in coding regions combined with dense SNP markers. BayesRC increased the power to detect causal variants and increased the accuracy of genomic prediction. The relative improvement for genomic prediction was most apparent in validation populations that were not closely related to the reference population. We also applied BayesRC to real milk production phenotypes in dairy cattle using independent biological priors from gene expression analyses. Although current biological knowledge of which genes and variants affect milk production is still very incomplete, our results suggest that the new BayesRC method was equal to or more powerful than BayesR for detecting candidate causa
- Published
- 2016
24. A hybrid expectation maximisation and MCMC sampling algorithm to implement Bayesian mixture model based genomic prediction and QTL mapping
- Author
-
Wang, T, Chen, Y-PP, Bowman, PJ, Goddard, ME, Hayes, BJ, Wang, T, Chen, Y-PP, Bowman, PJ, Goddard, ME, and Hayes, BJ
- Abstract
BACKGROUND: Bayesian mixture models in which the effects of SNP are assumed to come from normal distributions with different variances are attractive for simultaneous genomic prediction and QTL mapping. These models are usually implemented with Monte Carlo Markov Chain (MCMC) sampling, which requires long compute times with large genomic data sets. Here, we present an efficient approach (termed HyB_BR), which is a hybrid of an Expectation-Maximisation algorithm, followed by a limited number of MCMC without the requirement for burn-in. RESULTS: To test prediction accuracy from HyB_BR, dairy cattle and human disease trait data were used. In the dairy cattle data, there were four quantitative traits (milk volume, protein kg, fat% in milk and fertility) measured in 16,214 cattle from two breeds genotyped for 632,002 SNPs. Validation of genomic predictions was in a subset of cattle either from the reference set or in animals from a third breeds that were not in the reference set. In all cases, HyB_BR gave almost identical accuracies to Bayesian mixture models implemented with full MCMC, however computational time was reduced by up to 1/17 of that required by full MCMC. The SNPs with high posterior probability of a non-zero effect were also very similar between full MCMC and HyB_BR, with several known genes affecting milk production in this category, as well as some novel genes. HyB_BR was also applied to seven human diseases with 4890 individuals genotyped for around 300 K SNPs in a case/control design, from the Welcome Trust Case Control Consortium (WTCCC). In this data set, the results demonstrated again that HyB_BR performed as well as Bayesian mixture models with full MCMC for genomic predictions and genetic architecture inference while reducing the computational time from 45 h with full MCMC to 3 h with HyB_BR. CONCLUSIONS: The results for quantitative traits in cattle and disease in humans demonstrate that HyB_BR can perform equally well as Bayesian mixture models
- Published
- 2016
25. Detailed phenotyping identifies genes with pleiotropic effects on body composition
- Author
-
Bolormaa, S, Hayes, BJ, van der Werf, JHJ, Pethick, D, Goddard, ME, Daetwyler, HD, Bolormaa, S, Hayes, BJ, van der Werf, JHJ, Pethick, D, Goddard, ME, and Daetwyler, HD
- Abstract
BACKGROUND: Genetic variation in both the composition and distribution of fat and muscle in the body is important to human health as well as the healthiness and value of meat from cattle and sheep. Here we use detailed phenotyping and a multi-trait approach to identify genes explaining variation in body composition traits. RESULTS: A multi-trait genome wide association analysis of 56 carcass composition traits measured on 10,613 sheep with imputed and real genotypes on 510,174 SNPs was performed. We clustered 71 significant SNPs into five groups based on their pleiotropic effects across the 56 traits. Among these 71 significant SNPs, one group of 11 SNPs affected the fatty acid profile of the muscle and were close to 8 genes involved in fatty acid or triglyceride synthesis. Another group of 23 SNPs had an effect on mature size, based on their pattern of effects across traits, but the genes near this group of SNPs did not share any obvious function. Many of the likely candidate genes near SNPs with significant pleiotropic effects on the 56 traits are involved in intra-cellular signalling pathways. Among the significant SNPs were some with a convincing candidate gene due to the function of the gene (e.g. glycogen synthase affecting glycogen concentration) or because the same gene was associated with similar traits in other species. CONCLUSIONS: Using a multi-trait analysis increased the power to detect associations between SNP and body composition traits compared with the single trait analyses. Detailed phenotypic information helped to identify a convincing candidate in some cases as did information from other species.
- Published
- 2016
26. A computationally efficient algorithm for genomic prediction using a Bayesian model
- Author
-
Wang, T, Chen, Y-PP, Goddard, ME, Meuwissen, THE, Kemper, KE, Hayes, BJ, Wang, T, Chen, Y-PP, Goddard, ME, Meuwissen, THE, Kemper, KE, and Hayes, BJ
- Abstract
BACKGROUND: Genomic prediction of breeding values from dense single nucleotide polymorphisms (SNP) genotypes is used for livestock and crop breeding, and can also be used to predict disease risk in humans. For some traits, the most accurate genomic predictions are achieved with non-linear estimates of SNP effects from Bayesian methods that treat SNP effects as random effects from a heavy tailed prior distribution. These Bayesian methods are usually implemented via Markov chain Monte Carlo (MCMC) schemes to sample from the posterior distribution of SNP effects, which is computationally expensive. Our aim was to develop an efficient expectation-maximisation algorithm (emBayesR) that gives similar estimates of SNP effects and accuracies of genomic prediction than the MCMC implementation of BayesR (a Bayesian method for genomic prediction), but with greatly reduced computation time. METHODS: emBayesR is an approximate EM algorithm that retains the BayesR model assumption with SNP effects sampled from a mixture of normal distributions with increasing variance. emBayesR differs from other proposed non-MCMC implementations of Bayesian methods for genomic prediction in that it estimates the effect of each SNP while allowing for the error associated with estimation of all other SNP effects. emBayesR was compared to BayesR using simulated data, and real dairy cattle data with 632 003 SNPs genotyped, to determine if the MCMC and the expectation-maximisation approaches give similar accuracies of genomic prediction. RESULTS: We were able to demonstrate that allowing for the error associated with estimation of other SNP effects when estimating the effect of each SNP in emBayesR improved the accuracy of genomic prediction over emBayesR without including this error correction, with both simulated and real data. When averaged over nine dairy traits, the accuracy of genomic prediction with emBayesR was only 0.5% lower than that from BayesR. However, emBayesR reduced computing time up
- Published
- 2015
27. Non-additive genetic variation in growth, carcass and fertility traits of beef cattle
- Author
-
Bolormaa, S, Pryce, JE, Zhang, Y, Reverter, A, Barendse, W, Hayes, BJ, Goddard, ME, Bolormaa, S, Pryce, JE, Zhang, Y, Reverter, A, Barendse, W, Hayes, BJ, and Goddard, ME
- Abstract
BACKGROUND: A better understanding of non-additive variance could lead to increased knowledge on the genetic control and physiology of quantitative traits, and to improved prediction of the genetic value and phenotype of individuals. Genome-wide panels of single nucleotide polymorphisms (SNPs) have been mainly used to map additive effects for quantitative traits, but they can also be used to investigate non-additive effects. We estimated dominance and epistatic effects of SNPs on various traits in beef cattle and the variance explained by dominance, and quantified the increase in accuracy of phenotype prediction by including dominance deviations in its estimation. METHODS: Genotype data (729 068 real or imputed SNPs) and phenotypes on up to 16 traits of 10 191 individuals from Bos taurus, Bos indicus and composite breeds were used. A genome-wide association study was performed by fitting the additive and dominance effects of single SNPs. The dominance variance was estimated by fitting a dominance relationship matrix constructed from the 729 068 SNPs. The accuracy of predicted phenotypic values was evaluated by best linear unbiased prediction using the additive and dominance relationship matrices. Epistatic interactions (additive × additive) were tested between each of the 28 SNPs that are known to have additive effects on multiple traits, and each of the other remaining 729 067 SNPs. RESULTS: The number of significant dominance effects was greater than expected by chance and most of them were in the direction that is presumed to increase fitness and in the opposite direction to inbreeding depression. Estimates of dominance variance explained by SNPs varied widely between traits, but had large standard errors. The median dominance variance across the 16 traits was equal to 5% of the phenotypic variance. Including a dominance deviation in the prediction did not significantly increase its accuracy for any of the phenotypes. The number of additive × additive epistatic e
- Published
- 2015
28. Improved precision of QTL mapping using a nonlinear Bayesian method in a multi-breed population leads to greater accuracy of across-breed genomic predictions
- Author
-
Kemper, KE, Reich, CM, Bowman, PJ, vander Jagt, CJ, Chamberlain, AJ, Mason, BA, Hayes, BJ, Goddard, ME, Kemper, KE, Reich, CM, Bowman, PJ, vander Jagt, CJ, Chamberlain, AJ, Mason, BA, Hayes, BJ, and Goddard, ME
- Abstract
BACKGROUND: Genomic selection is increasingly widely practised, particularly in dairy cattle. However, the accuracy of current predictions using GBLUP (genomic best linear unbiased prediction) decays rapidly across generations, and also as selection candidates become less related to the reference population. This is likely caused by the effects of causative mutations being dispersed across many SNPs (single nucleotide polymorphisms) that span large genomic intervals. In this paper, we hypothesise that the use of a nonlinear method (BayesR), combined with a multi-breed (Holstein/Jersey) reference population will map causative mutations with more precision than GBLUP and this, in turn, will increase the accuracy of genomic predictions for selection candidates that are less related to the reference animals. RESULTS: BayesR improved the across-breed prediction accuracy for Australian Red dairy cattle for five milk yield and composition traits by an average of 7% over the GBLUP approach (Australian Red animals were not included in the reference population). Using the multi-breed reference population with BayesR improved accuracy of prediction in Australian Red cattle by 2 - 5% compared to using BayesR with a single breed reference population. Inclusion of 8478 Holstein and 3917 Jersey cows in the reference population improved accuracy of predictions for these breeds by 4 and 5%. However, predictions for Holstein and Jersey cattle were similar using within-breed and multi-breed reference populations. We propose that the improvement in across-breed prediction achieved by BayesR with the multi-breed reference population is due to more precise mapping of quantitative trait loci (QTL), which was demonstrated for several regions. New candidate genes with functional links to milk synthesis were identified using differential gene expression in the mammary gland. CONCLUSIONS: QTL detection and genomic prediction are usually considered independently but persistence of genomic pred
- Published
- 2015
29. Simultaneous Discovery, Estimation and Prediction Analysis of Complex Traits Using a Bayesian Mixture Model
- Author
-
Haley, C, Moser, G, Lee, SH, Hayes, BJ, Goddard, ME, Wray, NR, Visscher, PM, Haley, C, Moser, G, Lee, SH, Hayes, BJ, Goddard, ME, Wray, NR, and Visscher, PM
- Abstract
Gene discovery, estimation of heritability captured by SNP arrays, inference on genetic architecture and prediction analyses of complex traits are usually performed using different statistical models and methods, leading to inefficiency and loss of power. Here we use a Bayesian mixture model that simultaneously allows variant discovery, estimation of genetic variance explained by all variants and prediction of unobserved phenotypes in new samples. We apply the method to simulated data of quantitative traits and Welcome Trust Case Control Consortium (WTCCC) data on disease and show that it provides accurate estimates of SNP-based heritability, produces unbiased estimators of risk in new samples, and that it can estimate genetic architecture by partitioning variation across hundreds to thousands of SNPs. We estimated that, depending on the trait, 2,633 to 9,411 SNPs explain all of the SNP-based heritability in the WTCCC diseases. The majority of those SNPs (>96%) had small effects, confirming a substantial polygenic component to common diseases. The proportion of the SNP-based variance explained by large effects (each SNP explaining 1% of the variance) varied markedly between diseases, ranging from almost zero for bipolar disorder to 72% for type 1 diabetes. Prediction analyses demonstrate that for diseases with major loci, such as type 1 diabetes and rheumatoid arthritis, Bayesian methods outperform profile scoring or mixed model approaches.
- Published
- 2015
30. Extensive variation between tissues in allele specific expression in an outbred mammal
- Author
-
Chamberlain, AJ, Vander Jagt, CJ, Hayes, BJ, Khansefid, M, Marett, LC, Millen, CA, Nguyen, TTT, Goddard, ME, Chamberlain, AJ, Vander Jagt, CJ, Hayes, BJ, Khansefid, M, Marett, LC, Millen, CA, Nguyen, TTT, and Goddard, ME
- Abstract
BACKGROUND: Allele specific gene expression (ASE), with the paternal allele more expressed than the maternal allele or vice versa, appears to be a common phenomenon in humans and mice. In other species the extent of ASE is unknown, and even in humans and mice there are several outstanding questions. These include; to what extent is ASE tissue specific? how often does the direction of allele expression imbalance reverse between tissues? how often is only one of the two alleles expressed? is there a genome wide bias towards expression of the paternal or maternal allele; and finally do genes that are nearby on a chromosome share the same direction of ASE? Here we use gene expression data (RNASeq) from 18 tissues from a single cow to investigate each of these questions in turn, and then validate some of these findings in two tissues from 20 cows. RESULTS: Between 40 and 100 million sequence reads were generated per tissue across three replicate samples for each of the eighteen tissues from the single cow (the discovery dataset). A bovine gene expression atlas was created (the first from RNASeq data), and differentially expressed genes in each tissue were identified. To analyse ASE, we had access to unambiguously phased genotypes for all heterozygous variants in the cow's whole genome sequence, where these variants were homozygous in the whole genome sequence of her sire, and as a result we were able to map reads to parental genomes, to determine SNP and genes showing ASE in each tissue. In total 25,251 heterozygous SNP within 7985 genes were tested for ASE in at least one tissue. ASE was pervasive, 89 % of genes tested had significant ASE in at least one tissue. This large proportion of genes displaying ASE was confirmed in the two tissues in a validation dataset. For individual tissues the proportion of genes showing significant ASE varied from as low as 8-16 % of those tested in thymus to as high as 71-82 % of those tested in lung. There were a number of cases where t
- Published
- 2015
31. Impact of QTL properties on the accuracy of multi-breed genomic prediction
- Author
-
Wientjes, YCJ, Calus, MPL, Goddard, ME, Hayes, BJ, Wientjes, YCJ, Calus, MPL, Goddard, ME, and Hayes, BJ
- Abstract
BACKGROUND: Although simulation studies show that combining multiple breeds in one reference population increases accuracy of genomic prediction, this is not always confirmed in empirical studies. This discrepancy might be due to the assumptions on quantitative trait loci (QTL) properties applied in simulation studies, including number of QTL, spectrum of QTL allele frequencies across breeds, and distribution of allele substitution effects. We investigated the effects of QTL properties and of including a random across- and within-breed animal effect in a genomic best linear unbiased prediction (GBLUP) model on accuracy of multi-breed genomic prediction using genotypes of Holstein-Friesian and Jersey cows. METHODS: Genotypes of three classes of variants obtained from whole-genome sequence data, with moderately low, very low or extremely low average minor allele frequencies (MAF), were imputed in 3000 Holstein-Friesian and 3000 Jersey cows that had real high-density genotypes. Phenotypes of traits controlled by QTL with different properties were simulated by sampling 100 or 1000 QTL from one class of variants and their allele substitution effects either randomly from a gamma distribution, or computed such that each QTL explained the same variance, i.e. rare alleles had a large effect. Genomic breeding values for 1000 selection candidates per breed were estimated using GBLUP modelsincluding a random across- and a within-breed animal effect. RESULTS: For all three classes of QTL allele frequency spectra, accuracies of genomic prediction were not affected by the addition of 2000 individuals of the other breed to a reference population of the same breed as the selection candidates. Accuracies of both single- and multi-breed genomic prediction decreased as MAF of QTL decreased, especially when rare alleles had a large effect. Accuracies of genomic prediction were similar for the models with and without a random within-breed animal effect, probably because of insufficient p
- Published
- 2015
32. Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle
- Author
-
te Pas, MFW, Gonzalez-Recio, O, Daetwyler, HD, MacLeod, IM, Pryce, JE, Bowman, PJ, Hayes, BJ, Goddard, ME, te Pas, MFW, Gonzalez-Recio, O, Daetwyler, HD, MacLeod, IM, Pryce, JE, Bowman, PJ, Hayes, BJ, and Goddard, ME
- Abstract
The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of "missing heritability"--the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01
- Published
- 2015
33. Genetic variants in mammary development, prolactin signalling and involution pathways explain considerable variation in bovine milk production and milk composition
- Author
-
Raven, L-A, Cocks, BG, Goddard, ME, Pryce, JE, Hayes, BJ, Raven, L-A, Cocks, BG, Goddard, ME, Pryce, JE, and Hayes, BJ
- Abstract
BACKGROUND: The maintenance of lactation in mammals is the result of a balance between competing signals from mammary development, prolactin signalling and involution pathways. Dairy cattle are an interesting case study to investigate the effect of polymorphisms that affect the function of genes in these pathways. In dairy cattle, lactation yields and milk composition (for example protein percentage and fat percentage) are routinely recorded, and these vary greatly between individuals. In this study, we test 8058 single nucleotide polymorphisms in or close to genes in these pathways for association with milk production traits and determine the proportion of variance explained by each pathway, using data on 16 812 dairy cattle, including Holstein-Friesian and Jersey bulls and cows. RESULTS: Single nucleotide polymorphisms close to genes in the mammary development, prolactin signalling and involution pathways were significantly associated with milk production traits. The involution pathway explained the largest proportion of genetic variation for production traits. The mammary development pathway also explained additional genetic variation for milk volume, fat percentage and protein percentage. CONCLUSIONS: Genetic variants in the involution pathway explained considerably more genetic variation in milk production traits than expected by chance. Many of the associations for single nucleotide polymorphisms in genes in this pathway have not been detected in conventional genome-wide association studies. The pathway approach used here allowed us to identify some novel candidates for further studies that will be aimed at refining the location of associated genomic regions and identifying polymorphisms contributing to variation in lactation volume and milk composition.
- Published
- 2014
34. A Multi-Trait, Meta-analysis for Detecting Pleiotropic Polymorphisms for Stature, Fatness and Reproduction in Beef Cattle
- Author
-
Flint, J, Bolormaa, S, Pryce, JE, Reverter, A, Zhang, Y, Barendse, W, Kemper, K, Tier, B, Savin, K, Hayes, BJ, Goddard, ME, Flint, J, Bolormaa, S, Pryce, JE, Reverter, A, Zhang, Y, Barendse, W, Kemper, K, Tier, B, Savin, K, Hayes, BJ, and Goddard, ME
- Abstract
Polymorphisms that affect complex traits or quantitative trait loci (QTL) often affect multiple traits. We describe two novel methods (1) for finding single nucleotide polymorphisms (SNPs) significantly associated with one or more traits using a multi-trait, meta-analysis, and (2) for distinguishing between a single pleiotropic QTL and multiple linked QTL. The meta-analysis uses the effect of each SNP on each of n traits, estimated in single trait genome wide association studies (GWAS). These effects are expressed as a vector of signed t-values (t) and the error covariance matrix of these t values is approximated by the correlation matrix of t-values among the traits calculated across the SNP (V). Consequently, t'V-1t is approximately distributed as a chi-squared with n degrees of freedom. An attractive feature of the meta-analysis is that it uses estimated effects of SNPs from single trait GWAS, so it can be applied to published data where individual records are not available. We demonstrate that the multi-trait method can be used to increase the power (numbers of SNPs validated in an independent population) of GWAS in a beef cattle data set including 10,191 animals genotyped for 729,068 SNPs with 32 traits recorded, including growth and reproduction traits. We can distinguish between a single pleiotropic QTL and multiple linked QTL because multiple SNPs tagging the same QTL show the same pattern of effects across traits. We confirm this finding by demonstrating that when one SNP is included in the statistical model the other SNPs have a non-significant effect. In the beef cattle data set, cluster analysis yielded four groups of QTL with similar patterns of effects across traits within a group. A linear index was used to validate SNPs having effects on multiple traits and to identify additional SNPs belonging to these four groups.
- Published
- 2014
35. Identification of genomic regions associated with inbreeding depression in Holstein and Jersey dairy cattle
- Author
-
Pryce, JE, Haile-Mariam, M, Goddard, ME, Hayes, BJ, Pryce, JE, Haile-Mariam, M, Goddard, ME, and Hayes, BJ
- Abstract
BACKGROUND: Inbreeding reduces the fitness of individuals by increasing the frequency of homozygous deleterious recessive alleles. Some insight into the genetic architecture of fitness, and other complex traits, can be gained by using single nucleotide polymorphism (SNP) data to identify regions of the genome which lead to reduction in performance when identical by descent (IBD). Here, we compared the effect of genome-wide and location-specific homozygosity on fertility and milk production traits in dairy cattle. METHODS: Genotype data from more than 43 000 SNPs were available for 8853 Holstein and 4138 Jersey dairy cows that were part of a much larger dataset that had pedigree records (338 696 Holstein and 64 049 Jersey animals). Measures of inbreeding were based on: (1) pedigree data; (2) genotypes to determine the realised proportion of the genome that is IBD; (3) the proportion of the total genome that is homozygous and (4) runs of homozygosity (ROH) which are stretches of the genome that are homozygous. RESULTS: A 1% increase in inbreeding based either on pedigree or genomic data was associated with a decrease in milk, fat and protein yields of around 0.4 to 0.6% of the phenotypic mean, and an increase in calving interval (i.e. a deterioration in fertility) of 0.02 to 0.05% of the phenotypic mean. A genome-wide association study using ROH of more than 50 SNPs revealed genomic regions that resulted in depression of up to 12.5 d and 260 L for calving interval and milk yield, respectively, when completely homozygous. CONCLUSIONS: Genomic measures can be used instead of pedigree-based inbreeding to estimate inbreeding depression. Both the diagonal elements of the genomic relationship matrix and the proportion of homozygous SNPs can be used to measure inbreeding. Longer ROH (>3 Mb) were found to be associated with a reduction in milk yield and captured recent inbreeding independently and in addition to overall homozygosity. Inbreeding depression can be reduced by mi
- Published
- 2014
36. Selection for complex traits leaves little or no classic signatures of selection
- Author
-
Kemper, KE, Saxton, SJ, Bolormaa, S, Hayes, BJ, Goddard, ME, Kemper, KE, Saxton, SJ, Bolormaa, S, Hayes, BJ, and Goddard, ME
- Abstract
BACKGROUND: Selection signatures aim to identify genomic regions underlying recent adaptations in populations. However, the effects of selection in the genome are difficult to distinguish from random processes, such as genetic drift. Often associations between selection signatures and selected variants for complex traits is assumed even though this is rarely (if ever) tested. In this paper, we use 8 breeds of domestic cattle under strong artificial selection to investigate if selection signatures are co-located in genomic regions which are likely to be under selection. RESULTS: Our approaches to identify selection signatures (haplotype heterozygosity, integrated haplotype score and FST) identified strong and recent selection near many loci with mutations affecting simple traits under strong selection, such as coat colour. However, there was little evidence for a genome-wide association between strong selection signatures and regions affecting complex traits under selection, such as milk yield in dairy cattle. Even identifying selection signatures near some major loci was hindered by factors including allelic heterogeneity, selection for ancestral alleles and interactions with nearby selected loci. CONCLUSIONS: Selection signatures detect loci with large effects under strong selection. However, the methodology is often assumed to also detect loci affecting complex traits where the selection pressure at an individual locus is weak. We present empirical evidence to suggests little discernible 'selection signature' for complex traits in the genome of dairy cattle despite very strong and recent artificial selection.
- Published
- 2014
37. Metagenomics of rumen bacteriophage from thirteen lactating dairy cattle
- Author
-
Ross, EM, Petrovski, S, Moate, PJ, Hayes, BJ, Ross, EM, Petrovski, S, Moate, PJ, and Hayes, BJ
- Abstract
BACKGROUND: The bovine rumen hosts a diverse and complex community of Eukarya, Bacteria, Archea and viruses (including bacteriophage). The rumen viral population (the rumen virome) has received little attention compared to the rumen microbial population (the rumen microbiome). We used massively parallel sequencing of virus like particles to investigate the diversity of the rumen virome in thirteen lactating Australian Holstein dairy cattle all housed in the same location, 12 of which were sampled on the same day. RESULTS: Fourteen putative viral sequence fragments over 30 Kbp in length were assembled and annotated. Many of the putative genes in the assembled contigs showed no homology to previously annotated genes, highlighting the large amount of work still required to fully annotate the functions encoded in viral genomes. The abundance of the contig sequences varied widely between animals, even though the cattle were of the same age, stage of lactation and fed the same diets. Additionally the twelve animals which were co-habited shared a number of their dominant viral contigs. We compared the functional characteristics of our bovine viromes with that of other viromes, as well as rumen microbiomes. At the functional level, we found strong similarities between all of the viral samples, which were highly distinct from the rumen microbiome samples. CONCLUSIONS: Our findings suggest a large amount of between animal variation in the bovine rumen virome and that co-habiting animals may have more similar viromes than non co-habited animals. We report the deepest sequencing to date of the rumen virome. This work highlights the enormous amount of novelty and variation present in the rumen virome.
- Published
- 2013
38. Metagenomic Predictions: From Microbiome to Complex Health and Environmental Phenotypes in Humans and Cattle
- Author
-
White, BA, Ross, EM, Moate, PJ, Marett, LC, Cocks, BG, Hayes, BJ, White, BA, Ross, EM, Moate, PJ, Marett, LC, Cocks, BG, and Hayes, BJ
- Abstract
Mammals have a large cohort of endo- and ecto- symbiotic microorganisms (the microbiome) that potentially influence host phenotypes. There have been numerous exploratory studies of these symbiotic organisms in humans and other animals, often with the aim of relating the microbiome to a complex phenotype such as body mass index (BMI) or disease state. Here, we describe an efficient methodology for predicting complex traits from quantitative microbiome profiles. The method was demonstrated by predicting inflammatory bowel disease (IBD) status and BMI from human microbiome data, and enteric greenhouse gas production from dairy cattle rumen microbiome profiles. The method uses unassembled massively parallel sequencing (MPS) data to form metagenomic relationship matrices (analogous to genomic relationship matrices used in genomic predictions) to predict IBD, BMI and methane production phenotypes with useful accuracies (r = 0.423, 0.422 and 0.466 respectively). Our results show that microbiome profiles derived from MPS can be used to predict complex phenotypes of the host. Although the number of biological replicates used here limits the accuracy that can be achieved, preliminary results suggest this approach may surpass current prediction accuracies that are based on the host genome. This is especially likely for traits that are largely influenced by the gut microbiota, for example digestive tract disorders or metabolic functions such as enteric methane production in cattle.
- Published
- 2013
39. Detection of quantitative trait loci in Bos indicus and Bos taurus cattle using genome-wide association studies
- Author
-
Bolormaa, S, Pryce, JE, Kemper, KE, Hayes, BJ, Zhang, Y, Tier, B, Barendse, W, Reverter, A, Goddard, ME, Bolormaa, S, Pryce, JE, Kemper, KE, Hayes, BJ, Zhang, Y, Tier, B, Barendse, W, Reverter, A, and Goddard, ME
- Abstract
BACKGROUND: The apparent effect of a single nucleotide polymorphism (SNP) on phenotype depends on the linkage disequilibrium (LD) between the SNP and a quantitative trait locus (QTL). However, the phase of LD between a SNP and a QTL may differ between Bos indicus and Bos taurus because they diverged at least one hundred thousand years ago. Here, we test the hypothesis that the apparent effect of a SNP on a quantitative trait depends on whether the SNP allele is inherited from a Bos taurus or Bos indicus ancestor. METHODS: Phenotype data on one or more traits and SNP genotype data for 10 181 cattle from Bos taurus, Bos indicus and composite breeds were used. All animals had genotypes for 729 068 SNPs (real or imputed). Chromosome segments were classified as originating from B. indicus or B. taurus on the basis of the haplotype of SNP alleles they contained. Consequently, SNP alleles were classified according to their sub-species origin. Three models were used for the association study: (1) conventional GWAS (genome-wide association study), fitting a single SNP effect regardless of subspecies origin, (2) interaction GWAS, fitting an interaction between SNP and subspecies-origin, and (3) best variable GWAS, fitting the most significant combination of SNP and sub-species origin. RESULTS: Fitting an interaction between SNP and subspecies origin resulted in more significant SNPs (i.e. more power) than a conventional GWAS. Thus, the effect of a SNP depends on the subspecies that the allele originates from. Also, most QTL segregated in only one subspecies, suggesting that many mutations that affect the traits studied occurred after divergence of the subspecies or the mutation became fixed or was lost in one of the subspecies. CONCLUSIONS: The results imply that GWAS and genomic selection could gain power by distinguishing SNP alleles based on their subspecies origin, and that only few QTL segregate in both B. indicus and B. taurus cattle. Thus, the QTL that segregate in cur
- Published
- 2013
40. Genes of the RNASE5 pathway contain SNP associated with milk production traits in dairy cattle
- Author
-
Raven, L-A, Cocks, BG, Pryce, JE, Cottrell, JJ, Hayes, BJ, Raven, L-A, Cocks, BG, Pryce, JE, Cottrell, JJ, and Hayes, BJ
- Abstract
BACKGROUND: Identification of the processes and mutations responsible for the large genetic variation in milk production among dairy cattle has proved challenging. One approach is to identify a biological process potentially involved in milk production and to determine the genetic influence of all the genes included in the process or pathway. Angiogenin encoded by angiogenin, ribonuclease, RNase A family 5 (RNASE5) is relatively abundant in milk, and has been shown to regulate protein synthesis and act as a growth factor in epithelial cells in vitro. However, little is known about the role of angiogenin in the mammary gland or if the polymorphisms present in the bovine RNASE5 gene are associated with lactation and milk production traits in dairy cattle. Given the high economic value of increased protein in milk, we have tested the hypothesis that RNASE5 or genes in the RNASE5 pathway are associated with milk production traits. First, we constructed a "RNASE5 pathway" based on upstream and downstream interacting genes reported in the literature. We then tested SNP in close proximity to the genes of this pathway for association with milk production traits in a large dairy cattle dataset. RESULTS: The constructed RNASE5 pathway consisted of 11 genes. Association analysis between SNP in 1 Mb regions surrounding these genes and milk production traits revealed that more SNP than expected by chance were associated with milk protein percent (P < 0.05 significance). There was no significant association with other traits such as milk fat content or fertility. CONCLUSIONS: These results support a role for the RNASE5 pathway in milk production, specifically milk protein percent, and indicate that polymorphisms in or near these genes explain a proportion of the variation for this trait. This method provides a novel way of understanding the underlying biology of lactation with implications for milk production and can be applied to any pathway or gene set to test whether they are
- Published
- 2013
41. Inferring Demography from Runs of Homozygosity in Whole-Genome Sequence, with Correction for Sequence Errors
- Author
-
MacLeod, IM, Larkin, DM, Lewin, HA, Hayes, BJ, Goddard, ME, MacLeod, IM, Larkin, DM, Lewin, HA, Hayes, BJ, and Goddard, ME
- Abstract
Whole-genome sequence is potentially the richest source of genetic data for inferring ancestral demography. However, full sequence also presents significant challenges to fully utilize such large data sets and to ensure that sequencing errors do not introduce bias into the inferred demography. Using whole-genome sequence data from two Holstein cattle, we demonstrate a new method to correct for bias caused by hidden errors and then infer stepwise changes in ancestral demography up to present. There was a strong upward bias in estimates of recent effective population size (Ne) if the correction method was not applied to the data, both for our method and the Li and Durbin (Inference of human population history from individual whole-genome sequences. Nature 475:493-496) pairwise sequentially Markovian coalescent method. To infer demography, we use an analytical predictor of multiloci linkage disequilibrium (LD) based on a simple coalescent model that allows for changes in Ne. The LD statistic summarizes the distribution of runs of homozygosity for any given demography. We infer a best fit demography as one that predicts a match with the observed distribution of runs of homozygosity in the corrected sequence data. We use multiloci LD because it potentially holds more information about ancestral demography than pairwise LD. The inferred demography indicates a strong reduction in the Ne around 170,000 years ago, possibly related to the divergence of African and European Bos taurus cattle. This is followed by a further reduction coinciding with the period of cattle domestication, with Ne of between 3,500 and 6,000. The most recent reduction of Ne to approximately 100 in the Holstein breed agrees well with estimates from pedigrees. Our approach can be applied to whole-genome sequence from any diploid species and can be scaled up to use sequence from multiple individuals.
- Published
- 2013
42. Whole-genome resequencing of two elite sires for the detection of haplotypes under selection in dairy cattle.
- Author
-
Larkin, DM, Daetwyler, HD, Hernandez, AG, Wright, CL, Hetrick, LA, Boucek, L, Bachman, SL, Brand, MR, Akraiko, TV, Cohen-Zinder, M, Thimmapuram, Jyothi, Macleod, IM, Harkin, TT, McCaque, JE, Goddard, ME, Hayes, BJ, Lewin, HA, Larkin, DM, Daetwyler, HD, Hernandez, AG, Wright, CL, Hetrick, LA, Boucek, L, Bachman, SL, Brand, MR, Akraiko, TV, Cohen-Zinder, M, Thimmapuram, Jyothi, Macleod, IM, Harkin, TT, McCaque, JE, Goddard, ME, Hayes, BJ, and Lewin, HA
- Abstract
Using a combination of whole-genome resequencing and high-density genotyping arrays, genome-wide haplotypes were reconstructed for two of the most important bulls in the history of the dairy cattle industry, Pawnee Farm Arlinda Chief ("Chief") and his son Walkway Chief Mark ("Mark"), each accounting for ∼7% of all current genomes. We aligned 20.5 Gbp (∼7.3× coverage) and 37.9 Gbp (∼13.5× coverage) of the Chief and Mark genomic sequences, respectively. More than 1.3 million high-quality SNPs were detected in Chief and Mark sequences. The genome-wide haplotypes inherited by Mark from Chief were reconstructed using ∼1 million informative SNPs. Comparison of a set of 15,826 SNPs that overlapped in the sequence-based and BovineSNP50 SNPs showed the accuracy of the sequence-based haplotype reconstruction to be as high as 97%. By using the BovineSNP50 genotypes, the frequencies of Chief alleles on his two haplotypes then were determined in 1,149 of his descendants, and the distribution was compared with the frequencies that would be expected assuming no selection. We identified 49 chromosomal segments in which Chief alleles showed strong evidence of selection. Candidate polymorphisms for traits that have been under selection in the dairy cattle population then were identified by referencing Chief's DNA sequence within these selected chromosome blocks. Eleven candidate genes were identified with functions related to milk-production, fertility, and disease-resistance traits. These data demonstrate that haplotype reconstruction of an ancestral proband by whole-genome resequencing in combination with high-density SNP genotyping of descendants can be used for rapid, genome-wide identification of the ancestor's alleles that have been subjected to artificial selection.
- Published
- 2012
43. High throughput whole rumen metagenome profiling using untargeted massively parallel sequencing
- Author
-
Ross, EM, Moate, PJ, Bath, CR, Davidson, SE, Sawbridge, TI, Guthridge, KM, Cocks, BG, Hayes, BJ, Ross, EM, Moate, PJ, Bath, CR, Davidson, SE, Sawbridge, TI, Guthridge, KM, Cocks, BG, and Hayes, BJ
- Abstract
BACKGROUND: Variation of microorganism communities in the rumen of cattle (Bos taurus) is of great interest because of possible links to economically or environmentally important traits, such as feed conversion efficiency or methane emission levels. The resolution of studies investigating this variation may be improved by utilizing untargeted massively parallel sequencing (MPS), that is, sequencing without targeted amplification of genes. The objective of this study was to develop a method which used MPS to generate "rumen metagenome profiles", and to investigate if these profiles were repeatable among samples taken from the same cow. Given faecal samples are much easier to obtain than rumen fluid samples; we also investigated whether rumen metagenome profiles were predictive of faecal metagenome profiles. RESULTS: Rather than focusing on individual organisms within the rumen, our method used MPS data to generate quantitative rumen micro-biome profiles, regardless of taxonomic classifications. The method requires a previously assembled reference metagenome. A number of such reference metagenomes were considered, including two rumen derived metagenomes, a human faecal microflora metagenome and a reference metagenome made up of publically available prokaryote sequences. Sequence reads from each test sample were aligned to these references. The "rumen metagenome profile" was generated from the number of the reads that aligned to each contig in the database. We used this method to test the hypothesis that rumen fluid microbial community profiles vary more between cows than within multiple samples from the same cow. Rumen fluid samples were taken from three cows, at three locations within the rumen. DNA from the samples was sequenced on the Illumina GAIIx. When the reads were aligned to a rumen metagenome reference, the rumen metagenome profiles were repeatable (P < 0.00001) by cow regardless of location of sampling rumen fluid. The repeatability was estimated at 9%, alb
- Published
- 2012
44. Genome position specific priors for genomic prediction
- Author
-
Brondum, RF, Su, G, Lund, MS, Bowman, PJ, Goddard, ME, Hayes, BJ, Brondum, RF, Su, G, Lund, MS, Bowman, PJ, Goddard, ME, and Hayes, BJ
- Abstract
BACKGROUND: The accuracy of genomic prediction is highly dependent on the size of the reference population. For small populations, including information from other populations could improve this accuracy. The usual strategy is to pool data from different populations; however, this has not proven as successful as hoped for with distantly related breeds. BayesRS is a novel approach to share information across populations for genomic predictions. The approach allows information to be captured even where the phase of SNP alleles and casuative mutation alleles are reversed across populations, or the actual casuative mutation is different between the populations but affects the same gene. Proportions of a four-distribution mixture for SNP effects in segments of fixed size along the genome are derived from one population and set as location specific prior proportions of distributions of SNP effects for the target population. The model was tested using dairy cattle populations of different breeds: 540 Australian Jersey bulls, 2297 Australian Holstein bulls and 5214 Nordic Holstein bulls. The traits studied were protein-, fat- and milk yield. Genotypic data was Illumina 777K SNPs, real or imputed. RESULTS: Results showed an increase in accuracy of up to 3.5% for the Jersey population when using BayesRS with a prior derived from Australian Holstein compared to a model without location specific priors. The increase in accuracy was however lower than was achieved when reference populations were combined to estimate SNP effects, except in the case of fat yield. The small size of the Jersey validation set meant that these improvements in accuracy were not significant using a Hotelling-Williams t-test at the 5% level. An increase in accuracy of 1-2% for all traits was observed in the Australian Holstein population when using a prior derived from the Nordic Holstein population compared to using no prior information. These improvements were significant (P<0.05) using the Hotelling W
- Published
- 2012
45. Sensitivity of genomic selection to using different prior distributions.
- Author
-
Verbyla, KL, Bowman, PJ, Hayes, BJ, Goddard, ME, Verbyla, KL, Bowman, PJ, Hayes, BJ, and Goddard, ME
- Abstract
UNLABELLED: Genomic selection describes a selection strategy based on genomic estimated breeding values (GEBV) predicted from dense genetic markers such as single nucleotide polymorphism (SNP) data. Different Bayesian models have been suggested to derive the prediction equation, with the main difference centred around the specification of the prior distributions. METHODS: The simulated dataset of the 13(th) QTL-MAS workshop was analysed using four Bayesian approaches to predict GEBV for animals without phenotypic information. Different prior distributions were assumed to assess their affect on the accuracy of the predicted GEBV. CONCLUSION: All methods produced GEBV that were highly correlated with the true breeding values. The models appear relatively insensitive to the choice of prior distributions for QTL-MAS data set and this is consistent with uniformity of performance of different methods found in real data.
- Published
- 2010
46. Genetic Architecture of Complex Traits and Accuracy of Genomic Prediction: Coat Colour, Milk-Fat Percentage, and Type in Holstein Cattle as Contrasting Model Traits
- Author
-
Georges, M, Hayes, BJ, Pryce, J, Chamberlain, AJ, Bowman, PJ, Goddard, ME, Georges, M, Hayes, BJ, Pryce, J, Chamberlain, AJ, Bowman, PJ, and Goddard, ME
- Abstract
Prediction of genetic merit using dense SNP genotypes can be used for estimation of breeding values for selection of livestock, crops, and forage species; for prediction of disease risk; and for forensics. The accuracy of these genomic predictions depends in part on the genetic architecture of the trait, in particular number of loci affecting the trait and distribution of their effects. Here we investigate the difference among three traits in distribution of effects and the consequences for the accuracy of genomic predictions. Proportion of black coat colour in Holstein cattle was used as one model complex trait. Three loci, KIT, MITF, and a locus on chromosome 8, together explain 24% of the variation of proportion of black. However, a surprisingly large number of loci of small effect are necessary to capture the remaining variation. A second trait, fat concentration in milk, had one locus of large effect and a host of loci with very small effects. Both these distributions of effects were in contrast to that for a third trait, an index of scores for a number of aspects of cow confirmation ("overall type"), which had only loci of small effect. The differences in distribution of effects among the three traits were quantified by estimating the distribution of variance explained by chromosome segments containing 50 SNPs. This approach was taken to account for the imperfect linkage disequilibrium between the SNPs and the QTL affecting the traits. We also show that the accuracy of predicting genetic values is higher for traits with a proportion of large effects (proportion black and fat percentage) than for a trait with no loci of large effect (overall type), provided the method of analysis takes advantage of the distribution of loci effects.
- Published
- 2010
47. A novel predictor of multilocus haplotype homozygosity: comparison with existing predictors
- Author
-
MacLeod, IM, Meuwissen, THE, Hayes, BJ, Goddard, ME, MacLeod, IM, Meuwissen, THE, Hayes, BJ, and Goddard, ME
- Abstract
The patterns of linkage disequilibrium (LD) between dense polymorphic markers are shaped by the ancestral population history. It is therefore possible to use multilocus predictors of LD to infer past population history and to infer sharing of identical alleles in quantitative trait locus (QTL) studies. We develop a multilocus predictor of LD for pairs of haplotypes, which we term haplotype homozygosity (HHn): the probability that any two haplotypes share a given number of n adjacent identical markers or 'runs of homozygosity'. Our method, based on simplified coalescence theory, accounts for recombination and mutation. We compare our HHn predictions, with HHn in simulated populations and with two published predictors of HHn. Our method performs consistently better across a range of population parameters, including populations with a severe bottleneck followed by expansion, compared to two published methods. We demonstrate that we can predict the pattern of HHn observed in dense single nucleotide polymorphisms (SNPs) genotyped in a cattle population, given appropriate historical changes in population size. Our method is practical for use with very large numbers of individuals and dense genome wide polymorphic DNA data. It has potential applications in inferring ancestral population history and QTL mapping studies.
- Published
- 2009
48. Accuracy of genomic breeding values in multi-breed dairy cattle populations
- Author
-
Hayes, BJ, Bowman, PJ, Chamberlain, AC, Verbyla, K, Goddard, ME, Hayes, BJ, Bowman, PJ, Chamberlain, AC, Verbyla, K, and Goddard, ME
- Abstract
BACKGROUND: Two key findings from genomic selection experiments are 1) the reference population used must be very large to subsequently predict accurate genomic estimated breeding values (GEBV), and 2) prediction equations derived in one breed do not predict accurate GEBV when applied to other breeds. Both findings are a problem for breeds where the number of individuals in the reference population is limited. A multi-breed reference population is a potential solution, and here we investigate the accuracies of GEBV in Holstein dairy cattle and Jersey dairy cattle when the reference population is single breed or multi-breed. The accuracies were obtained both as a function of elements of the inverse coefficient matrix and from the realised accuracies of GEBV. METHODS: Best linear unbiased prediction with a multi-breed genomic relationship matrix (GBLUP) and two Bayesian methods (BAYESA and BAYES_SSVS) which estimate individual SNP effects were used to predict GEBV for 400 and 77 young Holstein and Jersey bulls respectively, from a reference population of 781 and 287 Holstein and Jersey bulls, respectively. Genotypes of 39,048 SNP markers were used. Phenotypes in the reference population were de-regressed breeding values for production traits. For the GBLUP method, expected accuracies calculated from the diagonal of the inverse of coefficient matrix were compared to realised accuracies. RESULTS: When GBLUP was used, expected accuracies from a function of elements of the inverse coefficient matrix agreed reasonably well with realised accuracies calculated from the correlation between GEBV and EBV in single breed populations, but not in multi-breed populations. When the Bayesian methods were used, realised accuracies of GEBV were up to 13% higher when the multi-breed reference population was used than when a pure breed reference was used. However no consistent increase in accuracy across traits was obtained. CONCLUSION: Predicting genomic breeding values using a genomic
- Published
- 2009
49. A Validated Genome Wide Association Study to Breed Cattle Adapted to an Environment Altered by Climate Change
- Author
-
Añel, JA, Hayes, BJ, Bowman, PJ, Chamberlain, AJ, Savin, K, van Tassell, CP, Sonstegard, TS, Goddard, ME, Añel, JA, Hayes, BJ, Bowman, PJ, Chamberlain, AJ, Savin, K, van Tassell, CP, Sonstegard, TS, and Goddard, ME
- Abstract
Continued production of food in areas predicted to be most affected by climate change, such as dairy farming regions of Australia, will be a major challenge in coming decades. Along with rising temperatures and water shortages, scarcity of inputs such as high energy feeds is predicted. With the motivation of selecting cattle adapted to these changing environments, we conducted a genome wide association study to detect DNA markers (single nucleotide polymorphisms) associated with the sensitivity of milk production to environmental conditions. To do this we combined historical milk production and weather records with dense marker genotypes on dairy sires with many daughters milking across a wide range of production environments in Australia. Markers associated with sensitivity of milk production to feeding level and sensitivity of milk production to temperature humidity index on chromosome nine and twenty nine respectively were validated in two independent populations, one a different breed of cattle. As the extent of linkage disequilibrium across cattle breeds is limited, the underlying causative mutations have been mapped to a small genomic interval containing two promising candidate genes. The validated marker panels we have reported here will aid selection for high milk production under anticipated climate change scenarios, for example selection of sires whose daughters will be most productive at low levels of feeding.
- Published
- 2009
50. Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds.
- Author
-
Bovine Hap Map, Consortium, Gibbs, Ra, Taylor, Jf, Van Tassel, Cp, Barendse, W, Eversole, Ka, Gill, Ca, Green, Rd, Hamernik, Dl, Kappes, Sm, Lien, S, Matukumalli, Lk, Mcevan, Jc, Mazareth, Lv, Schnabel, Rd, Weinstock, Gm, Wheeler, Da, Ajmone Marsan, Paolo, Boettcher, Pj, Caetano, Ar, Garcia, Jf, Hanotte, O, Mariani, P, Skow, Lc, Sonstegard, T, Williams, Jl, Diallo, B, Hailemariam, L, Martinez, Ml, Morris, Ca, Silva, Lo, Spelman, Rj, Malatu, W, Zhao, K, Abbey, Ca, Agaba, M, Araujo, Fr, Bunch, Rj, Burton, J, Gorni, C, Olivier, H, Harrison, Be, Luff, B, Machado, Ma, Mwakaya, J, Plastow, G, Sim, W, Smith, T, Thomas, Mb, Valentini, A, Williams, P, Womack, J, Wolliams, Ja, Liu, Y, Qin, X, Worley, Kc, Gao, C, Jiang, H, Moore, S, Ren, Y, Song, Xz, Bustamante, Cd, Hernandez, Rd, Muzny, Dm, Patil, S, San Lucas, A, Fu, Q, Kent, Mp, Vega, R, Matukumalli, A, Mcwilliam, S, Sclep, G, Bryc, K, Choi, J, Gao, H, Grefenstette, Jj, Murdoch, B, Stella, A, Villa Angulo, R, Wright, M, Aerts, J, Jann, O, Negrini, Riccardo, Goddard, Me, Hayes, Bj, Bradley, Dg, Lau, Lp, Liu, Ge, Lynn, Dj, Panzitta, F, Dodds, Kg, Ajmone Marsan, Paolo (ORCID:0000-0003-3165-4579), Negrini, Riccardo (ORCID:0000-0002-8735-0286), Bovine Hap Map, Consortium, Gibbs, Ra, Taylor, Jf, Van Tassel, Cp, Barendse, W, Eversole, Ka, Gill, Ca, Green, Rd, Hamernik, Dl, Kappes, Sm, Lien, S, Matukumalli, Lk, Mcevan, Jc, Mazareth, Lv, Schnabel, Rd, Weinstock, Gm, Wheeler, Da, Ajmone Marsan, Paolo, Boettcher, Pj, Caetano, Ar, Garcia, Jf, Hanotte, O, Mariani, P, Skow, Lc, Sonstegard, T, Williams, Jl, Diallo, B, Hailemariam, L, Martinez, Ml, Morris, Ca, Silva, Lo, Spelman, Rj, Malatu, W, Zhao, K, Abbey, Ca, Agaba, M, Araujo, Fr, Bunch, Rj, Burton, J, Gorni, C, Olivier, H, Harrison, Be, Luff, B, Machado, Ma, Mwakaya, J, Plastow, G, Sim, W, Smith, T, Thomas, Mb, Valentini, A, Williams, P, Womack, J, Wolliams, Ja, Liu, Y, Qin, X, Worley, Kc, Gao, C, Jiang, H, Moore, S, Ren, Y, Song, Xz, Bustamante, Cd, Hernandez, Rd, Muzny, Dm, Patil, S, San Lucas, A, Fu, Q, Kent, Mp, Vega, R, Matukumalli, A, Mcwilliam, S, Sclep, G, Bryc, K, Choi, J, Gao, H, Grefenstette, Jj, Murdoch, B, Stella, A, Villa Angulo, R, Wright, M, Aerts, J, Jann, O, Negrini, Riccardo, Goddard, Me, Hayes, Bj, Bradley, Dg, Lau, Lp, Liu, Ge, Lynn, Dj, Panzitta, F, Dodds, Kg, Ajmone Marsan, Paolo (ORCID:0000-0003-3165-4579), and Negrini, Riccardo (ORCID:0000-0002-8735-0286)
- Published
- 2009
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.