37 results on '"Legarra, Andrés"'
Search Results
2. Multiple Trait Bayesian Analysis of Partitioned Genetic Trends Accounting for Uncertainty in Genetic Parameters. An Example With the Pirenaica and Rubia Gallega Beef Cattle Breeds.
- Author
-
López‐Carbonell, David, Legarra, Andrés, Altarriba, Juan, Hervás‐Rivero, Carlos, Sánchez‐Díaz, Manuel, and Varona, Luis
- Subjects
- *
BEEF cattle breeds , *BAYESIAN analysis , *BIRTH weight , *OBESITY , *DAMS - Abstract
ABSTRACT Genetic trends are a valuable tool for analysing the efficiency of breeding programs. They are calculated by averaging the predicted breeding values for all individuals born within a specific time period. Moreover, partitioned genetic trends allow dissecting the contributions of several selection paths to overall genetic progress. These trends are based on the linear relationship between breeding values and the Mendelian sampling terms of ancestors, enabling genetic trends to be split into contributions from different categories of individuals. However, (1) the use of predicted breeding values in calculating partitioned genetic trends depends on the variance components used and (2) a multiple trait analysis allows accounting for selection on correlated traits. These points are often not considered. To overcome these limitations, we present a software called “TM_TRENDS.” This software performs a Bayesian analysis of partitioned genetic trends in a multiple trait model, accounting for uncertainty in the variance components. To illustrate the capabilities of this tool, we analysed the partitioned genetic trends for five traits (Birth Weight, Weight at 210 days, Cold Carcass Weight, Carcass Conformation, and Fatness Conformation) in two Spanish beef cattle breeds, Pirenaica and Rubia Gallega. The global genetic trends showed an increase in Carcass Conformation and a decrease in Birth Weight, Weight at 210 days, Cold Carcass Weight, and Fatness Conformation. These trends were partitioned into six categories: non‐reproductive individuals, dams of females and non‐reproductive individuals, dams of sires, sires with fewer than 20 progeny, sires between 20 and 50 progeny, and sires with more than 50 progeny. The results showed that the main source of genetic progress comes from sires with more than 50 progenies, followed by dams of males. Additionally, two additional features of the Bayesian analysis are presented: the calculation of the posterior probability of the global and partitioned genetic response between two time points, and the calculation of the posterior probability of positive (or negative) genetic progress. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Unfavorable genetic correlations between fecal egg count and milk production traits in the French blond-faced Manech dairy sheep breed
- Author
-
Aguerre, Sophie, Astruc, Jean-Michel, Legarra, Andrés, Bordes, Léa, Prevot, Françoise, Grisez, Christelle, Vial Novella, Corinne, Fidelle, Francis, Jacquiet, Philippe, and Moreno-Romieux, Carole
- Published
- 2022
- Full Text
- View/download PDF
4. Detection of unrecorded environmental challenges in high-frequency recorded traits, and genetic determinism of resilience to challenge, with an application on feed intake in lambs
- Author
-
Garcia-Baccino, Carolina Andrea, Marie-Etancelin, Christel, Tortereau, Flavie, Marcon, Didier, Weisbecker, Jean-Louis, and Legarra, Andrés
- Published
- 2021
- Full Text
- View/download PDF
5. Bias and accuracy of dairy sheep evaluations using BLUP and SSGBLUP with metafounders and unknown parent groups
- Author
-
Macedo, Fernando L., Christensen, Ole F., Astruc, Jean-Michel, Aguilar, Ignacio, Masuda, Yutaka, and Legarra, Andrés
- Published
- 2020
- Full Text
- View/download PDF
6. Comparing estimates of genetic variance across different relationship models
- Author
-
Legarra, Andres
- Published
- 2016
- Full Text
- View/download PDF
7. Single Step, a general approach for genomic selection
- Author
-
Legarra, Andres, Christensen, Ole F., Aguilar, Ignacio, and Misztal, Ignacy
- Published
- 2014
- Full Text
- View/download PDF
8. Impact of interpopulation distance on dominance variance and average heterosis in hybrid populations within species.
- Author
-
Legarra, Andrés, Gonzalez-Dieguez, David Omar, Charcosset, Alain, and Vitezica, Zulma G.
- Subjects
- *
AGRICULTURE , *ALLELES , *GENETIC variation , *GENETIC techniques , *DEMOGRAPHIC characteristics - Abstract
Interpopulation improvement for crosses of close populations in crops and livestock depends on the amount of heterosis and the amount of variance of dominance deviations in the hybrids. It has been intuited that the further the distance between populations, the lower the amount of dominance variation and the higher the heterosis. Although experience in speciation and interspecific crosses shows, however, that this is not the case when populations are so distant--here we confine ourselves to the case of not-too-distant populations typical in crops and livestock. We present equations that relate the distance between 2 populations, expressed as Nei's genetic distance or as correlation of allele frequencies, quadratically to the amount of dominance deviations across all possible crosses and linearly to the expected heterosis averaging all possible crosses. The amount of variation of dominance deviations decreases with genetic distance until the point where allele frequencies are uncorrelated, and then increases for negatively correlated frequencies. Heterosis always increases with Nei's genetic distance. These expressions match well and complete previous theoretical and empirical findings. In practice, and for close enough populations, they mean that unless frequencies are negatively correlated, selection for hybrids will be more efficient when populations are distant. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. The effects of selective breeding against scrapie susceptibility on the genetic variability of the Latxa Black-Faced sheep breed
- Author
-
Legarra Andrés, Parada Analia, Alfonso Leopoldo, Ugarte Eva, and Arana Ana
- Subjects
genetic variability ,scrapie ,Manech ,Latxa ,sheep ,Animal culture ,SF1-1100 ,Genetics ,QH426-470 - Abstract
Abstract Breeding sheep populations for scrapie resistance could result in a loss of genetic variability. In this study, the effect on genetic variability of selection for increasing the ARR allele frequency was estimated in the Latxa breed. Two sources of information were used, pedigree and genetic polymorphisms (fifteen microsatellites). The results based on the genealogical information were conditioned by a low pedigree completeness level that revealed the interest of also using the information provided by the molecular markers. The overall results suggest that no great negative effect on genetic variability can be expected in the short time in the population analysed by selection of only ARR/ARR males. The estimated average relationship of ARR/ARR males with reproductive females was similar to that of all available males whatever its genotype: 0.010 vs. 0.012 for a genealogical relationship and 0.257 vs. 0.296 for molecular coancestry, respectively. However, selection of only ARR/ARR males implied important losses in founder animals (87 percent) and low frequency alleles (30 percent) in the ram population. The evaluation of mild selection strategies against scrapie susceptibility based on the use of some ARR heterozygous males was difficult because the genetic relationships estimated among animals differed when pedigree or molecular information was used, and the use of more molecular markers should be evaluated.
- Published
- 2006
- Full Text
- View/download PDF
10. Implication of the order of blending and tuning when computing the genomic relationship matrix in single‐step GBLUP.
- Author
-
McWhorter, Taylor M., Bermann, Matias, Garcia, Andre L. S., Legarra, Andrés, Aguilar, Ignacio, Misztal, Ignacy, and Lourenco, Daniela
- Subjects
SINGLE nucleotide polymorphisms ,SELF-tuning controllers - Abstract
Single‐step genomic BLUP (ssGBLUP) relies on the combination of the genomic (G$$ \mathbf{G} $$) and pedigree relationship matrices for all (A$$ \mathbf{A} $$) and genotyped (A22$$ {\mathbf{A}}_{22} $$) animals. The procedure ensures G$$ \mathbf{G} $$ and A22$$ {\mathbf{A}}_{22} $$ are compatible so that both matrices refer to the same genetic base ('tuning'). Then G$$ \mathbf{G} $$ is combined with a proportion of A22$$ {\mathbf{A}}_{22} $$ ('blending') to avoid singularity problems and to account for the polygenic component not accounted for by markers. This computational procedure has been implemented in the reverse order (blending before tuning) following the sequential research developments. However, blending before tuning may result in less optimal tuning because the blended matrix already contains a proportion of A22$$ {\mathbf{A}}_{22} $$. In this study, the impact of 'tuning before blending' was compared with 'blending before tuning' on genomic estimated breeding values (GEBV), single nucleotide polymorphism (SNP) effects and indirect predictions (IP) from ssGBLUP using American Angus Association and Holstein Association USA, Inc. data. Two slightly different tuning methods were used; one that adjusts the mean diagonals and off‐diagonals of G$$ \mathbf{G} $$ to be similar to those in A22$$ {\mathbf{A}}_{22} $$ and another one that adjusts based on the average difference between all elements of G$$ \mathbf{G} $$ and A22$$ {\mathbf{A}}_{22} $$. Over 6 million Angus growth records and 5.9 million Holstein udder depth records were available. Genomic information was available on 51,478 Angus and 105,116 Holstein animals. Average realized relationship estimates among groups of animals were similar across scenarios. Scatterplots show that GEBV, SNP effects and IP did not noticeably change for all animals in the evaluation regardless of the order of computations and when using blending parameter of 0.05. Formulas were derived to determine the blending parameter that maximizes changes in the genomic relationship matrix and GEBV when changing the order of blending and tuning. Algebraically, the change is maximized when the blending parameter is equal to 0.5. Overall, tuning G$$ \mathbf{G} $$ before blending, regardless of blending parameter used, had a negligible impact on genomic predictions and SNP effects in this study. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. Multivariate genomic model improves analysis of oil palm (Elaeis guineensis Jacq.) progeny tests
- Author
-
Marchal, Alexandre, Legarra, Andrés, Tisné, Sébastien, Carasco-Lacombe, Catherine, Manez, Aurore, Suryana, Edyana, Omoré, Alphonse, Nouy, Bruno, Durand-Gasselin, Tristan, Sánchez, Leopoldo, Bouvet, Jean-Marc, and Cros, David
- Published
- 2016
- Full Text
- View/download PDF
12. Fine-mapping quantitative trait loci with a medium density marker panel: efficiency of population structures and comparison of linkage disequilibrium linkage analysis models
- Author
-
ROLDAN, DANA L, GILBERT, HÉLÈNE, HENSHALL, JOHN M, LEGARRA, ANDRÉS, and ELSEN, JEAN-MICHEL
- Published
- 2012
13. Fine tuning genomic evaluations in dairy cattle through SNP pre-selection with the Elastic-Net algorithm
- Author
-
CROISEAU, PASCAL, LEGARRA, ANDRÉS, GUILLAUME, FRANÇCOIS, FRITZ, SÉBASTIEN, BAUR, AURÉLIA, COLOMBANI, CARINE, ROBERT-GRANIÉ, CHRISTÈLE, BOICHARD, DIDIER, and DUCROCQ, VINCENT
- Published
- 2011
14. Improved Lasso for genomic selection
- Author
-
LEGARRA, ANDRÉS, ROBERT-GRANIÉ, CHRISTÈLE, CROISEAU, PASCAL, GUILLAUME, FRANÇOIS, and FRITZ, SÈBASTIEN
- Published
- 2011
15. Computing strategies for multi-population genomic evaluation.
- Author
-
Legarra, Andrés, González-Diéguez, David, and Vitezica, Zulma G.
- Subjects
GENOTYPE-environment interaction ,SINGLE nucleotide polymorphisms - Abstract
Background: Multiple breed evaluation using genomic prediction includes the use of data from multiple populations, or from parental breeds and crosses, and is expected to lead to better genomic predictions. Increased complexity comes from the need to fit non-additive effects such as dominance and/or genotype-by-environment interactions. In these models, marker effects (and breeding values) are modelled as correlated between breeds, which leads to multiple trait formulations that are based either on markers [single nucleotide polymorphism best linear unbiased prediction (SNP-BLUP)] or on individuals [genomic(G)BLUP)]. As an alternative, we propose the use of generalized least squares (GLS) followed by backsolving of marker effects using selection index (SI) theory. Results: All investigated options have advantages and inconveniences. The SNP-BLUP yields marker effects directly, which are useful for indirect prediction and for planned matings, but is very large in number of equations and is structured in dense and sparse blocks that do not allow for simple solving. GBLUP uses a multiple trait formulation and is very general, but results in many equations that are not used, which increase memory needs, and is also structured in dense and sparse blocks. An alternative formulation of GBLUP is more compact but requires tailored programming. The alternative of solving by GLS + SI is the least consuming, both in number of operations and in memory, and it uses only single dense blocks. However, it requires dedicated programming. Computational complexity problems are exacerbated when more than additive effects are fitted, e.g. dominance effects or genotype x environment interactions. Conclusions: As multi-breed predictions become more frequent and non-additive effects are more often included, standard equations for genomic prediction based on Henderson's mixed model equations become less practical and may need to be replaced by more efficient (although less general) approaches such as the GLS + SI approach proposed here. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. Joint genomic evaluation of French dairy cattle breeds using multiple-trait models
- Author
-
Karoui Sofiene, Carabaño María Jesús, Díaz Clara, and Legarra Andrés
- Subjects
Animal culture ,SF1-1100 ,Genetics ,QH426-470 - Abstract
Abstract Background Using a multi-breed reference population might be a way of increasing the accuracy of genomic breeding values in small breeds. Models involving mixed-breed data do not take into account the fact that marker effects may differ among breeds. This study was aimed at investigating the impact on accuracy of increasing the number of genotyped candidates in the training set by using a multi-breed reference population, in contrast to single-breed genomic evaluations. Methods Three traits (milk production, fat content and female fertility) were analyzed by genomic mixed linear models and Bayesian methodology. Three breeds of French dairy cattle were used: Holstein, Montbéliarde and Normande with 2976, 950 and 970 bulls in the training population, respectively and 964, 222 and 248 bulls in the validation population, respectively. All animals were genotyped with the Illumina Bovine SNP50 array. Accuracy of genomic breeding values was evaluated under three scenarios for the correlation of genomic breeding values between breeds (rg): uncorrelated (1), rg = 0; estimated rg (2); high, rg = 0.95 (3). Accuracy and bias of predictions obtained in the validation population with the multi-breed training set were assessed by the coefficient of determination (R2) and by the regression coefficient of daughter yield deviations of validation bulls on their predicted genomic breeding values, respectively. Results The genetic variation captured by the markers for each trait was similar to that estimated for routine pedigree-based genetic evaluation. Posterior means for rg ranged from −0.01 for fertility between Montbéliarde and Normande to 0.79 for milk yield between Montbéliarde and Holstein. Differences in R2 between the three scenarios were notable only for fat content in the Montbéliarde breed: from 0.27 in scenario (1) to 0.33 in scenarios (2) and (3). Accuracies for fertility were lower than for other traits. Conclusions Using a multi-breed reference population resulted in small or no increases in accuracy. Only the breed with a small data set and large genetic correlation with the breed with a large data set showed increased accuracy for the traits with moderate (milk) to high (fat content) heritability. No benefit was observed for fertility, a lowly heritable trait.
- Published
- 2012
- Full Text
- View/download PDF
17. A note on the rationale for estimating genealogical coancestry from molecular markers
- Author
-
García-Cortés Luis, Toro Miguel, and Legarra Andrés
- Subjects
Animal culture ,SF1-1100 ,Genetics ,QH426-470 - Abstract
Abstract Background Genetic relatedness or similarity between individuals is a key concept in population, quantitative and conservation genetics. When the pedigree of a population is available and assuming a founder population from which the genealogical records start, genetic relatedness between individuals can be estimated by the coancestry coefficient. If pedigree data is lacking or incomplete, estimation of the genetic similarity between individuals relies on molecular markers, using either molecular coancestry or molecular covariance. Some relationships between genealogical and molecular coancestries and covariances have already been described in the literature. Methods We show how the expected values of the empirical measures of similarity based on molecular marker data are functions of the genealogical coancestry. From these formulas, it is easy to derive estimators of genealogical coancestry from molecular data. We include variation of allelic frequencies in the estimators. Results The estimators are illustrated with simulated examples and with a real dataset from dairy cattle. In general, estimators are accurate and only slightly biased. From the real data set, estimators based on covariances are more compatible with genealogical coancestries than those based on molecular coancestries. A frequently used estimator based on the average of estimated coancestries produced inflated coancestries and numerical instability. The consequences of unknown gene frequencies in the founder population are briefly discussed, along with alternatives to overcome this limitation. Conclusions Estimators of genealogical coancestry based on molecular data are easy to derive. Estimators based on molecular covariance are more accurate than those based on identity by state. A correction considering the random distribution of allelic frequencies improves accuracy of these estimators, especially for populations with very strong drift.
- Published
- 2011
- Full Text
- View/download PDF
18. Does probabilistic modelling of linkage disequilibrium evolution improve the accuracy of QTL location in animal pedigree?
- Author
-
Estivals Delphine, Ytournel Florence, Druet Tom, Gilbert Hélène, Legarra Andrés, Dejean Sébastien, Cierco-Ayrolles Christine, Oumouhou Naïma, and Mangin Brigitte
- Subjects
Animal culture ,SF1-1100 ,Genetics ,QH426-470 - Abstract
Abstract Background Since 2001, the use of more and more dense maps has made researchers aware that combining linkage and linkage disequilibrium enhances the feasibility of fine-mapping genes of interest. So, various method types have been derived to include concepts of population genetics in the analyses. One major drawback of many of these methods is their computational cost, which is very significant when many markers are considered. Recent advances in technology, such as SNP genotyping, have made it possible to deal with huge amount of data. Thus the challenge that remains is to find accurate and efficient methods that are not too time consuming. The study reported here specifically focuses on the half-sib family animal design. Our objective was to determine whether modelling of linkage disequilibrium evolution improved the mapping accuracy of a quantitative trait locus of agricultural interest in these populations. We compared two methods of fine-mapping. The first one was an association analysis. In this method, we did not model linkage disequilibrium evolution. Therefore, the modelling of the evolution of linkage disequilibrium was a deterministic process; it was complete at time 0 and remained complete during the following generations. In the second method, the modelling of the evolution of population allele frequencies was derived from a Wright-Fisher model. We simulated a wide range of scenarios adapted to animal populations and compared these two methods for each scenario. Results Our results indicated that the improvement produced by probabilistic modelling of linkage disequilibrium evolution was not significant. Both methods led to similar results concerning the location accuracy of quantitative trait loci which appeared to be mainly improved by using four flanking markers instead of two. Conclusions Therefore, in animal half-sib designs, modelling linkage disequilibrium evolution using a Wright-Fisher model does not significantly improve the accuracy of the QTL location when compared to a simpler method assuming complete and constant linkage between the QTL and the marker alleles. Finally, given the high marker density available nowadays, the simpler method should be preferred as it gives accurate results in a reasonable computing time.
- Published
- 2010
- Full Text
- View/download PDF
19. Validation of models for analysis of ranks in horse breeding evaluation
- Author
-
Ricard Anne and Legarra Andrés
- Subjects
Animal culture ,SF1-1100 ,Genetics ,QH426-470 - Abstract
Abstract Background Ranks have been used as phenotypes in the genetic evaluation of horses for a long time through the use of earnings, normal score or raw ranks. A model, ("underlying model" of an unobservable underlying variable responsible for ranks) exists. Recently, a full Bayesian analysis using this model was developed. In addition, in reality, competitions are structured into categories according to the technical level of difficulty linked to the technical ability of horses (horses considered to be the "best" meet their peers). The aim of this article was to validate the underlying model through simulations and to propose a more appropriate model with a mixture distribution of horses in the case of a structured competition. The simulations involved 1000 horses with 10 to 50 performances per horse and 4 to 20 horses per event with unstructured and structured competitions. Results The underlying model responsible for ranks performed well with unstructured competitions by drawing liabilities in the Gibbs sampler according to the following rule: the liability of each horse must be drawn in the interval formed by the liabilities of horses ranked before and after the particular horse. The estimated repeatability was the simulated one (0.25) and regression between estimated competing ability of horses and true ability was close to 1. Underestimations of repeatability (0.07 to 0.22) were obtained with other traditional criteria (normal score or raw ranks), but in the case of a structured competition, repeatability was underestimated (0.18 to 0.22). Our results show that the effect of an event, or category of event, is irrelevant in such a situation because ranks are independent of such an effect. The proposed mixture model pools horses according to their participation in different categories of competition during the period observed. This last model gave better results (repeatability 0.25), in particular, it provided an improved estimation of average values of competing ability of the horses in the different categories of events. Conclusions The underlying model was validated. A correct drawing of liabilities for the Gibbs sampler was provided. For a structured competition, the mixture model with a group effect assigned to horses gave the best results.
- Published
- 2010
- Full Text
- View/download PDF
20. Linear models for joint association and linkage QTL mapping
- Author
-
Fernando Rohan L and Legarra Andrés
- Subjects
Animal culture ,SF1-1100 ,Genetics ,QH426-470 - Abstract
Abstract Background Populational linkage disequilibrium and within-family linkage are commonly used for QTL mapping and marker assisted selection. The combination of both results in more robust and accurate locations of the QTL, but models proposed so far have been either single marker, complex in practice or well fit to a particular family structure. Results We herein present linear model theory to come up with additive effects of the QTL alleles in any member of a general pedigree, conditional to observed markers and pedigree, accounting for possible linkage disequilibrium among QTLs and markers. The model is based on association analysis in the founders; further, the additive effect of the QTLs transmitted to the descendants is a weighted (by the probabilities of transmission) average of the substitution effects of founders' haplotypes. The model allows for non-complete linkage disequilibrium QTL-markers in the founders. Two submodels are presented: a simple and easy to implement Haley-Knott type regression for half-sib families, and a general mixed (variance component) model for general pedigrees. The model can use information from all markers. The performance of the regression method is compared by simulation with a more complex IBD method by Meuwissen and Goddard. Numerical examples are provided. Conclusion The linear model theory provides a useful framework for QTL mapping with dense marker maps. Results show similar accuracies but a bias of the IBD method towards the center of the region. Computations for the linear regression model are extremely simple, in contrast with IBD methods. Extensions of the model to genomic selection and multi-QTL mapping are straightforward.
- Published
- 2009
- Full Text
- View/download PDF
21. Genomic prediction of hybrid crops allows disentangling dominance and epistasis.
- Author
-
González-Diéguez, David, Legarra, Andrés, Charcosset, Alain, Moreau, Laurence, Lehermeier, Christina, Teyssédre, Simon, and Vitezica, Zulma G.
- Subjects
- *
AGRICULTURE , *CORN , *GENETIC variation , *GENE expression , *GENOMICS , *GENETIC markers , *HUMAN reproductive technology - Abstract
We revisited, in a genomic context, the theory of hybrid genetic evaluation models of hybrid crosses of pure lines, as the current practice is largely based on infinitesimal model assumptions. Expressions for covariances between hybrids due to additive substitution effects and dominance and epistatic deviations were analytically derived. Using dense markers in a GBLUP analysis, it is possible to split specific combining ability into dominance and across-groups epistatic deviations, and to split general combining ability (GCA) into within-line additive effects and within-line additive by additive (and higher order) epistatic deviations. We analyzed a publicly available maize data set of Dent x Flint hybrids using our new model (called GCA-model) up to additive by additive epistasis. To model higher order interactions within GCAs, we also fitted "residual genetic" line effects. Our new GCA-model was compared with another genomic model which assumes a uniquely defined effect of genes across origins. Most variation in hybrids is accounted by GCA. Variances due to dominance and epistasis have similar magnitudes. Models based on defining effects either differently or identically across heterotic groups resulted in similar predictive abilities for hybrids. The currently used model inflates the estimated additive genetic variance. This is not important for hybrid predictions but has consequences for the breeding scheme--e.g. overestimation of the genetic gain within heterotic group. Therefore, we recommend using GCA-model, which is appropriate for genomic prediction and variance component estimation in hybrid crops using genomic data, and whose results can be practically interpreted and used for breeding purposes. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
22. Genomic selection models for directional dominance: an example for litter size in pigs.
- Author
-
Varona, Luis, Legarra, Andrés, Herring, William, and Vitezica, Zulma G.
- Subjects
SWINE genetics ,GENOMICS ,ANIMAL litters ,INBREEDING ,HOMOZYGOSITY - Abstract
Background: The quantitative genetics theory argues that inbreeding depression and heterosis are founded on the existence of directional dominance. However, most procedures for genomic selection that have included dominance effects assumed prior symmetrical distributions. To address this, two alternatives can be considered: (1) assume the mean of dominance effects different from zero, and (2) use skewed distributions for the regularization of dominance effects. The aim of this study was to compare these approaches using two pig datasets and to confirm the presence of directional dominance. Results: Four alternative models were implemented in two datasets of pig litter size that consisted of 13,449 and 11,581 records from 3631 and 2612 sows genotyped with the Illumina PorcineSNP60 BeadChip. The models evaluated included (1) a model that does not consider directional dominance (Model SN), (2) a model with a covariate b for the average individual homozygosity (Model SC), (3) a model with a parameter λ that reflects asymmetry in the context of skewed Gaussian distributions (Model AN), and (4) a model that includes both b and λ (Model Full). The results of the analysis showed that posterior probabilities of a negative b or a positive λ under Models SC and AN were higher than 0.99, which indicate positive directional dominance. This was confirmed with the predictions of inbreeding depression under Models Full, SC and AN, that were higher than in the SN Model. In spite of differences in posterior estimates of variance components between models, comparison of models based on LogCPO and DIC indicated that Model SC provided the best fit for the two datasets analyzed. Conclusions: Our results confirmed the presence of positive directional dominance for pig litter size and suggested that it should be taken into account when dominance effects are included in genomic evaluation procedures. The consequences of ignoring directional dominance may affect predictions of breeding values and can lead to biased prediction of inbreeding depression and performance of potential mates. A model that assumes Gaussian dominance effects that are centered on a non-zero mean is recommended, at least for datasets with similar features to those analyzed here. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
23. Orthogonal Estimates of Variances for Additive, Dominance, and Epistatic Effects in Populations.
- Author
-
Vitezica, Zulma G., Legarra, Andrés, Toro, Miguel A., and Varona, Luis
- Subjects
- *
GENOMICS , *ESTIMATION theory , *VARIANCES , *HARDY-Weinberg formula , *LINKAGE (Genetics) , *GENETICS - Abstract
Genomic prediction methods based on multiple markers have potential to include nonadditive effects in prediction and analysis of complex traits. However, most developments assume a Hardy-Weinberg equilibrium (HWE). Statistical approaches for genomic selection that account for dominance and epistasis in a general context, without assuming HWE (e.g., crosses or homozygous lines), are therefore needed. Our method expands the natural and orthogonal interactions (NOIA) approach, which builds incidence matrices based on genotypic (not allelic) frequencies, to include genome-wide epistasis for an arbitrary number of interacting loci in a genomic evaluation context. This results in an orthogonal partition of the variances, which is not warranted otherwise. We also present the partition of variance as a function of genotypic values and frequencies following Cockerham's orthogonal contrast approach. Then we prove for the first time that, even not in HWE, the multiple-loci NOIA method is equivalent to construct epistatic genomic relationship matrices for higher-order interactions using Hadamard products of additive and dominant genomic orthogonal relationships. A standardization based on the trace of the relationship matrices is, however, needed. We illustrate these results with two simulated F1 (not in HWE) populations, either in linkage equilibrium (LE), or in linkage disequilibrium (LD) and divergent selection, and pure biological dominant pairwise epistasis. In the LE case, correct and orthogonal estimates of variances were obtained using NOIA genomic relationships but not if relationships were constructed assuming HWE. For the LD simulation, differences were smaller, due to the smaller deviation of the F1 from HWE. Wrongly assuming HWE to build genomic relationships and estimate variance components yields biased estimates, inflates the total genetic variance, and the estimates are not empirically orthogonal. The NOIA method to build genomic relationships, coupled with the use of Hadamard products for epistatic terms, allows the obtaining of correct estimates in populations either in HWE or not in HWE, and extends to any order of epistatic interactions. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
24. Genetic evaluation with major genes and polygenic inheritance when some animals are not genotyped using gene content multiple-trait BLUP.
- Author
-
Legarra, Andrés and Vitezica, Zulma G.
- Subjects
MONOGENIC & polygenic inheritance (Genetics) ,ANIMAL genetics ,GENOTYPES ,PHENOTYPES ,GENES - Abstract
Background: In pedigreed populations with a major gene segregating for a quantitative trait, it is not clear how to use pedigree, genotype and phenotype information when some individuals are not genotyped. We propose to consider gene content at the major gene as a second trait correlated to the quantitative trait, in a gene content multipletrait best linear unbiased prediction (GCMTBLUP) method. Results: The genetic covariance between the trait and gene content at the major gene is a function of the substitution effect of the gene. This genetic covariance can be written in a multiple-trait form that accommodates any pattern of missing values for either genotype or phenotype data. Effects of major gene alleles and the genetic covariance between genotype at the major gene and the phenotype can be estimated using standard EM-REML or Gibbs sampling. Prediction of breeding values with genotypes at the major gene can use multiple-trait BLUP software. Major genes with more than two alleles can be considered by including negative covariances between gene contents at each different allele. We simulated two scenarios: a selected and an unselected trait with heritabilities of 0.05 and 0.5, respectively. In both cases, the major gene explained half the genetic variation. Competing methods used imputed gene contents derived by the method of Gengler et al. or by iterative peeling. Imputed gene contents, in contrast to GCMTBLUP, do not consider information on the quantitative trait for genotype prediction. GCMTBLUP gave unbiased estimates of the gene effect, in contrast to the other methods, with less bias and better or equal accuracy of prediction. GCMTBLUP improved estimation of genotypes in non-genotyped individuals, in particular if these individuals had own phenotype records and the trait had a high heritability. Ignoring the major gene in genetic evaluation led to serious biases and decreased prediction accuracy. Conclusions: CGMTBLUP is the best linear predictor of additive genetic merit including pedigree, phenotype, and genotype information at major genes, since it considers missing genotypes. Simulations confirm that it is a simple, efficient and theoretically sound method for genetic evaluation of traits influenced by polygenic inheritance and one or several major genes. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
25. Sequence- vs. chip-assisted genomic selection: accurate biological information is advised.
- Author
-
Pérez-Enciso, Miguel, Rincón, Juan C., and Legarra, Andrés
- Subjects
GENETIC algorithms ,EVOLUTIONARY algorithms ,PARTICLE swarm optimization ,LEARNING classifier systems ,REINFORCEMENT learning - Abstract
Background: The development of next-generation sequencing technologies (NGS) has made the use of wholegenome sequence data for routine genetic evaluations possible, which has triggered a considerable interest in animal and plant breeding fields. Here, we investigated whether complete or partial sequence data can improve upon existing SNP (single nucleotide polymorphism) array-based selection strategies by simulation using a mixed coalescence - gene-dropping approach. Results: We simulated 20 or 100 causal mutations (quantitative trait nucleotides, QTN) within 65 predefined 'gene' regions, each 10 kb long, within a genome composed of ten 3-Mb chromosomes. We compared prediction accuracy by cross-validation using a medium-density chip (7.5 k SNPs), a high-density (HD, 17 k) and sequence data (335 k). Genetic evaluation was based on a GBLUP method. The simulations showed: (1) a law of diminishing returns with increasing number of SNPs; (2) a modest effect of SNP ascertainment bias in arrays; (3) a small advantage of using whole-genome sequence data vs. HD arrays i.e. ∼4%; (4) a minor effect of NGS errors except when imputation error rates are high (≥20%); and (5) if QTN were known, prediction accuracy approached 1. Since this is obviously unrealistic, we explored milder assumptions. We showed that, if all SNPs within causal genes were included in the prediction model, accuracy could also dramatically increase by ∼40%. However, this criterion was highly sensitive to either misspecification (including wrong genes) or to the use of an incomplete gene list; in these cases, accuracy fell rapidly towards that reached when all SNPs from sequence data were blindly included in the model. Conclusions: Our study shows that, unless an accurate prior estimate on the functionality of SNPs can be included in the predictor, there is a law of diminishing returns with increasing SNP density. As a result, use of whole-genome sequence data may not result in a highly increased selection response over high-density genotyping. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
26. Single-marker and multi-marker mixed models for polygenic score analysis in family-based data.
- Author
-
Bohossian, Nora, Saad, Mohamad, Legarra, Andrés, and Martinez, Maria
- Subjects
GENETIC markers ,MONOGENIC & polygenic inheritance (Genetics) ,MATHEMATICAL models ,FAMILIES ,HUMAN genome ,SINGLE nucleotide polymorphisms ,STATISTICAL methods in genetics - Abstract
Genome-wide association studies have proven successful but they remain underpowered for detecting variants of weaker effect. Alternative methods propose to test for association by using an aggregate score that combines the effects of the most associated variants. The set of variants that are to be aggregated may come from either of two modeling approaches: single-marker or multi-marker. The goal of this paper is to evaluate this alternative strategy by using sets of single-nucleotide polymorphisms identified by the two modeling approaches in the simulated pedigree data set provided for the Genetic Analysis Workshop 18. We focused on quantitative traits association analysis of diastolic blood pressure and of Q1, which served to control the statistical significance of our results. We carried out all analyses with knowledge of the underlying simulation model. We found that the probability to replicate association with the aggregate score depends on the single-nucleotide polymorphism set size and, for smaller sets (≤100), on the modeling approach. Nonetheless, assessing the statistical significance of these results in this data set was challenging, likely because of linkage because we are analyzing pedigree data, and also because the genotypes were the same across the replicates. Further methods need to be developed to facilitate the application of this alternative strategy in pedigree data. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
27. Genomic analysis of dominance effects on milk production and conformation traits in Fleckvieh cattle.
- Author
-
Ertl, Johann, Legarra, Andrés, Vitezica, Zulma G., Varona, Luis, Edel, Christian, Emmerling, Reiner, and Götz, Kay-Uwe
- Subjects
SINGLE nucleotide polymorphisms ,GENETIC polymorphisms ,FLECKVIEH cattle ,CATTLE breeds ,ANIMAL genetics ,CATTLE - Abstract
Estimates of dominance variance in dairy cattle based on pedigree data vary considerably across traits and amount to up to 50% of the total genetic variance for conformation traits and up to 43% for milk production traits. Using bovine SNP (single nucleotide polymorphism) genotypes, dominance variance can be estimated both at the marker level and at the animal level using genomic dominance effect relationship matrices. Yield deviations of high-density genotyped Fleckvieh cows were used to assess cross-validation accuracy of genomic predictions with additive and dominance models. The potential use of dominance variance in planned matings was also investigated. Results Variance components of nine milk production and conformation traits were estimated with additive and dominance models using yield deviations of 1996 Fleckvieh cows and ranged from 3.3% to 50.5% of the total genetic variance. REML and Gibbs sampling estimates showed good concordance. Although standard errors of estimates of dominance variance were rather large, estimates of dominance variance for milk, fat and protein yields, somatic cell score and milkability were significantly different from 0. Cross-validation accuracy of predicted breeding values was higher with genomic models than with the pedigree model. Inclusion of dominance effects did not increase the accuracy of the predicted breeding and total genetic values. Additive and dominance SNP effects for milk yield and protein yield were estimated with a BLUP (best linear unbiased prediction) model and used to calculate expectations of breeding values and total genetic values for putative offspring. Selection on total genetic value instead of breeding value would result in a larger expected total genetic superiority in progeny, i.e. 14.8% for milk yield and 27.8% for protein yield and reduce the expected additive genetic gain only by 4.5% for milk yield and 2.6% for protein yield. Conclusions Estimated dominance variance was substantial for most of the analyzed traits. Due to small dominance effect relationships between cows, predictions of individual dominance deviations were very inaccurate and including dominance in the model did not improve prediction accuracy in the cross-validation study. Exploitation of dominance variance in assortative matings was promising and did not appear to severely compromise additive genetic gain. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
28. Genetic parameters for growth and faecal worm egg count following Haemonchus contortus experimental infestations using pedigree and molecular information.
- Author
-
Assenza, Fabrizio, Elsen, Jean-Michel, Legarra, Andrés, Carré, Clément, Sallé, Guillaume, Robert-Granié1, Christèle, and Moreno, Carole R.
- Subjects
HAEMONCHUS ,PARASITIC diseases ,SHEEP industry ,ANTHELMINTICS ,ANIMAL genetics - Abstract
Background: Haemonchosis is a parasitic disease that causes severe economic losses in sheep industry. In recent years, the increasing resistance of the parasite to anthelmintics has raised the need for alternative control strategies. Genetic selection is a promising alternative but its efficacy depends on the availability of genetic variation and on the occurrence of favourable genetic correlations between the traits included in the breeding goal. The objective of this study was twofold. First, to estimate both the heritability of and the genetic correlations between growth traits and parasite resistance traits, using bivariate linear mixed animal models, from the phenotypes and genotypes of 1004 backcross lambs (considered as a single population), which underwent two subsequent experimental infestations protocols with Haemonchus contortus. Second, to compare the precision of the estimates when using two different relationship matrices: including pedigree information only or including also SNP (single nucleotide polymorphism) information. Results: Heritabilities were low for average daily gain before infestation (0.10 to 0.15) and average daily gain during the first infestation (0.11 to 0.16), moderate for faecal egg counts during the first infestation (0.21 to 0.38) and faecal egg counts during the second infestation (0.48 to 0.55). Genetic correlations between both growth traits and faecal egg count during the naïve infestation were equal to zero but the genetic correlation between faecal egg count during the second infestation and growth was positive in a Haemonchus contortus free environment and negative in a contaminated environment. The standard errors of the estimates obtained by including SNP information were smaller than those obtained by including pedigree information only. Conclusions: The genetic parameters estimates suggest that growth performance can be selected for independently of selection on resistance to naïve infestation. Selection for increased growth in a non-contaminated environment could lead to more susceptible animals with long-term exposure to the infestation but it could be possible to select for increased growth in a contaminated environment while also increasing resistance to the long-term exposure to the parasite. The use of molecular information increases the precision of the estimates. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
29. A note on the rationale for estimating genealogical coancestry from molecular markers.
- Author
-
Toro, Miguel Ángel, García-Cortés, Luis Alberto, and Legarra, Andrés
- Subjects
GENETIC markers ,DAIRY cattle ,ANIMAL population genetics ,GENE expression ,GENETIC regulation ,CATTLE - Abstract
Background: Genetic relatedness or similarity between individuals is a key concept in population, quantitative and conservation genetics. When the pedigree of a population is available and assuming a founder population from which the genealogical records start, genetic relatedness between individuals can be estimated by the coancestry coefficient. If pedigree data is lacking or incomplete, estimation of the genetic similarity between individuals relies on molecular markers, using either molecular coancestry or molecular covariance. Some relationships between genealogical and molecular coancestries and covariances have already been described in the literature. Methods: We show how the expected values of the empirical measures of similarity based on molecular marker data are functions of the genealogical coancestry. From these formulas, it is easy to derive estimators of genealogical coancestry from molecular data. We include variation of allelic frequencies in the estimators. Results: The estimators are illustrated with simulated examples and with a real dataset from dairy cattle. In general, estimators are accurate and only slightly biased. From the real data set, estimators based on covariances are more compatible with genealogical coancestries than those based on molecular coancestries. A frequently used estimator based on the average of estimated coancestries produced inflated coancestries and numerical instability. The consequences of unknown gene frequencies in the founder population are briefly discussed, along with alternatives to overcome this limitation. Conclusions: Estimators of genealogical coancestry based on molecular data are easy to derive. Estimators based on molecular covariance are more accurate than those based on identity by state. A correction considering the random distribution of allelic frequencies improves accuracy of these estimators, especially for populations with very strong drift. [ABSTRACT FROM AUTHOR]
- Published
- 2011
- Full Text
- View/download PDF
30. Does probabilistic modelling of linkage disequilibrium evolution improve the accuracy of QTL location in animal pedigree?
- Author
-
Cierco-Ayrolles, Christine, Dejean, Sébastien, Legarra, Andrés, Druet, Tom, Ytournel, Florence, Estivals, Delphine, Oumouhou, Naïma, and Mangin, Brigitte
- Subjects
LINKAGE disequilibrium ,ANIMAL pedigrees ,ANIMAL breeding ,ANIMAL genome mapping ,LOCUS (Genetics) ,GENE frequency - Abstract
Background: Since 2001, the use of more and more dense maps has made researchers aware that combining linkage and linkage disequilibrium enhances the feasibility of fine-mapping genes of interest. So, various method types have been derived to include concepts of population genetics in the analyses. One major drawback of many of these methods is their computational cost, which is very significant when many markers are considered. Recent advances in technology, such as SNP genotyping, have made it possible to deal with huge amount of data. Thus the challenge that remains is to find accurate and efficient methods that are not too time consuming. The study reported here specifically focuses on the half-sib family animal design. Our objective was to determine whether modelling of linkage disequilibrium evolution improved the mapping accuracy of a quantitative trait locus of agricultural interest in these populations. We compared two methods of fine-mapping. The first one was an association analysis. In this method, we did not model linkage disequilibrium evolution. Therefore, the modelling of the evolution of linkage disequilibrium was a deterministic process; it was complete at time 0 and remained complete during the following generations. In the second method, the modelling of the evolution of population allele frequencies was derived from a Wright-Fisher model. We simulated a wide range of scenarios adapted to animal populations and compared these two methods for each scenario. Results: Our results indicated that the improvement produced by probabilistic modelling of linkage disequilibrium evolution was not significant. Both methods led to similar results concerning the location accuracy of quantitative trait loci which appeared to be mainly improved by using four flanking markers instead of two. Conclusions: Therefore, in animal half-sib designs, modelling linkage disequilibrium evolution using a Wright-Fisher model does not significantly improve the accuracy of the QTL location when compared to a simpler method assuming complete and constant linkage between the QTL and the marker alleles. Finally, given the high marker density available nowadays, the simpler method should be preferred as it gives accurate results in a reasonable computing time. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
31. Linear models for joint association and linkage QTL mapping.
- Author
-
Legarra, Andrés and Fernando, Rohan L.
- Subjects
LINEAR models (Communication) ,LINKAGE disequilibrium ,MATHEMATICAL statistics ,MATHEMATICAL logic ,REGRESSION analysis - Abstract
Background: Populational linkage disequilibrium and within-family linkage are commonly used for QTL mapping and marker assisted selection. The combination of both results in more robust and accurate locations of the QTL, but models proposed so far have been either single marker, complex in practice or well fit to a particular family structure. Results: We herein present linear model theory to come up with additive effects of the QTL alleles in any member of a general pedigree, conditional to observed markers and pedigree, accounting for possible linkage disequilibrium among QTLs and markers. The model is based on association analysis in the founders; further, the additive effect of the QTLs transmitted to the descendants is a weighted (by the probabilities of transmission) average of the substitution effects of founders' haplotypes. The model allows for non-complete linkage disequilibrium QTL-markers in the founders. Two submodels are presented: a simple and easy to implement Haley-Knott type regression for half-sib families, and a general mixed (variance component) model for general pedigrees. The model can use information from all markers. The performance of the regression method is compared by simulation with a more complex IBD method by Meuwissen and Goddard. Numerical examples are provided. Conclusion: The linear model theory provides a useful framework for QTL mapping with dense marker maps. Results show similar accuracies but a bias of the IBD method towards the center of the region. Computations for the linear regression model are extremely simple, in contrast with IBD methods. Extensions of the model to genomic selection and multi-QTL mapping are straightforward. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
32. Performance of Genomic Selection in Mice.
- Author
-
Legarra, Andrés, Robert-Granié, Christële, Manfredi, Eduardo, and Elsen, Jean-Michel
- Subjects
- *
LABORATORY mice , *GENETIC markers , *ANIMAL breeding , *BIOMARKERS , *MEDICAL research - Abstract
Selection plans in plant and animal breeding are driven by genetic evaluation. Recent developments suggest using massive genetic marker information, known as "genomic selection." There is little evidence of its performance, though. We empirically compared three strategies for selection: (1) use of pedigree and phenotypic information, (2) use of genomewide markers and phenotypic information, and (3) the combination of both. We analyzed four traits from a heterogeneous mouse population (http://gscan.well.ox.ac.uk/), including 1884 individuals and 10,946 SNP markers. We used linear mixed models, using extensions of association analysis. Cross-validation techniques were used, providing assumption-free estimates of predictive ability. Sampling of validation and training data sets was carried out across and within families, which allows comparing across-and within-family information. Use of genomewide genetic markers increased predictive ability up to 0.22 across families and up to 0.03 within families. The latter is not statistically significant. These values are roughly comparable to increases of up to 0.57 (across family) and 0.14 (within family) in accuracy of prediction of genetic value. In this data set, within-family information was more accurate than across-family information, and populational linkage disequilibrium was nota completely accurate source of information for genetic evaluation. This fact questions some applications of genomic selection. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
33. GIBBSTHUR: Software for Estimating Variance Components and Predicting Breeding Values for Ranking Traits Based on a Thurstonian Model.
- Author
-
Varona, Luis and Legarra, Andrés
- Subjects
- *
FISH breeding , *GIBBS sampling , *COMPETITION horses , *MARKOV processes , *VARIANCES , *FORECASTING - Abstract
Simple Summary: This article describes a new software (GIBBSTHUR) that provides Bayesian estimation of variance components and predictions of breeding values for ranking traits generated from equine competitions based on a Thurstonian approach. The GIBBSTHUR software was developed in FORTRAN 90 and can be executed in UNIX, OSX, or WINDOWS environments, and is freely available in a public repository (https://github.com/lvaronaunizar/Gibbsthur). (1) Background: Ranking traits are used commonly for breeding purposes in several equine populations; however, implementation is complex, because the position of a horse in a competition event is discontinuous and is influenced by the performance of its competitors. One approach to overcoming these limitations is to assume an underlying Gaussian liability that represents a horse's performance and dictates the observed classification in a competition event. That approach can be implemented using Montecarlo Markov Chain (McMC) techniques with a procedure known as the Thurstonian model. (2) Methods: We have developed software (GIBBSTHUR) that analyses ranking traits along with other continuous or threshold traits. The software implements a Gibbs Sampler scheme with a data-augmentation step for the liability of the ranking traits and provides estimates of the variance and covariance components and predictions of the breeding values and the average performance of the competitors in competition events. (3) Results: The results of a simple example are presented, in which it is shown that the procedure can recover the simulated variance and covariance components. In addition, the correlation between the simulated and predicted breeding values and between the estimates of the event effects and the average additive genetic effect of the competitors demonstrates the ability of the software to produce useful predictions for breeding purposes. (4) Conclusions: the GIBBSTHUR software provides a useful tool for the breeding evaluation of ranking traits in horses and is freely available in a public repository (https://github.com/lvaronaunizar/Gibbsthur). [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
34. Association analysis of loci implied in "buffering" epistasis.
- Author
-
Reverter, Antonio, Vitezica, Zulma G, Naval-Sánchez, Marina, Henshall, John, Raidan, Fernanda S S, Li, Yutao, Meyer, Karin, Hudson, Nicholas J, Porto-Neto, Laercio R, and Legarra, Andrés
- Subjects
CATTLE breeds ,BREEDING ,LIKELIHOOD ratio tests ,NATURAL selection ,BASE pairs ,BIOLOGICAL networks - Abstract
The existence of buffering mechanisms is an emerging property of biological networks, and this results in the buildup of robustness through evolution. So far, there are no explicit methods to find loci implied in buffering mechanisms. However, buffering can be seen as interaction with genetic background. Here we develop this idea into a tractable model for quantitative genetics, in which the buffering effect of one locus with many other loci is condensed into a single statistical effect, multiplicative on the total additive genetic effect. This allows easier interpretation of the results and simplifies the problem of detecting epistasis from quadratic to linear in the number of loci. Using this formulation, we construct a linear model for genome-wide association studies that estimates and declares the significance of multiplicative epistatic effects at single loci. The model has the form of a variance components, norm reaction model and likelihood ratio tests are used for significance. This model is a generalization and explanation of previous ones. We test our model using bovine data: Brahman and Tropical Composite animals, phenotyped for body weight at yearling and genotyped at high density. After association analysis, we find a number of loci with buffering action in one, the other, or both breeds; these loci do not have a significant statistical additive effect. Most of these loci have been reported in previous studies, either with an additive effect or as footprints of selection. We identify buffering epistatic SNPs present in or near genes reported in the context of signatures of selection in multi-breed cattle population studies. Prominent among these genes are those associated with fertility (INHBA , TSHR , ESRRG , PRLR , and PPARG), growth (MSTN , GHR), coat characteristics (KIT , MITF , PRLR), and heat resistance (HSPA6 and HSPA1A). In these populations, we found loci that have a nonsignificant statistical additive effect but a significant epistatic effect. We argue that the discovery and study of loci associated with buffering effects allow attacking the difficult problems, among others, of the release of maintenance variance in artificial and natural selection, of quick adaptation to the environment, and of opposite signs of marker effects in different backgrounds. We conclude that our method and our results generate promising new perspectives for research in evolutionary and quantitative genetics based on the study of loci that buffer effect of other loci. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
35. Short communication: Accounting for new mutations in genomic prediction models.
- Author
-
Casellas, Joaquim, Esquivelzeta, Cecilia, and Legarra, Andrés
- Subjects
- *
GENETIC mutation , *GENOMICS , *ANIMAL genetics research , *PHENOTYPES , *STATISTICAL correlation - Abstract
Genomic evaluation models so far do not allow for accounting of newly generated genetic variation due to mutation. The main target of this research was to extend current genomic BLUP models with mutational relationships (model AM), and compare them against standard genomic BLUP models (model A) by analyzing simulated data. Model performance and precision of the predicted breeding values were evaluated under different population structures and heritabilities. The deviance information criterion (DIC) clearly favored the mutational relationship model under large heritabilities or populations with moderate-to-deep pedigrees contributing phenotypic data (i.e., differences equal or larger than 10 DIC units); this model provided slightly higher correlation coefficients between simulated and predicted genomic breeding values. On the other hand, null DIC differences, or even relevant advantages for the standard genomic BLUP model, were reported under small heritabilities and shallow pedigrees, although precision of the genomic breeding values did not differ across models at a significant level. This method allows for slightly more accurate genomic predictions and handling of newly created variation; moreover, this approach does not require additional genotyping or phenotyping efforts, but a more accurate handing of available data. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
36. Evaluating Sequence-Based Genomic Prediction with an Efficient New Simulator.
- Author
-
Pérez-Enciso, Miguel, Forneris, Natalia, de los Campos, Gustavo, and Legarra, Andrés
- Subjects
- *
GENOMIC information retrieval , *NUCLEOTIDE sequence , *SINGLE nucleotide polymorphisms , *GENOTYPES , *BIOINFORMATICS software - Abstract
The vast amount of sequence data generated to analyze complex traits is posing new challenges in terms of the analysis and interpretation of the results. Although simulation is a fundamental tool to investigate the reliability of genomic analyses and to optimize experimental design, existing software cannot realistically simulate complete genomes. To remedy this, we have developed a new strategy (Sequence-Based Virtual Breeding, SBVB) that uses real sequence data and simulates new offspring genomes and phenotypes in a very efficient and flexible manner. Using this tool, we studied the efficiency of full sequence in genomic prediction compared to SNP arrays. We used real porcine sequences from three breeds as founder genomes of a 2500-animal pedigree and two genetic architectures: "neutral" and "selective." In the neutral architecture, frequencies and allele effects were sampled independently whereas, in the selective case, SNPs were sites putatively under selection after domestication and a negative correlation between effect and frequency was induced. We compared the effectiveness of different genotyping strategies for genomic selection, including the use of full sequence commercial arrays or randomly chosen SNP sets in both outbred and crossbred experimental designs. We found that accuracy increases using sequence instead of commercial chips but modestly, perhaps by ≤ 4%. This result was robust to extreme genetic architectures. We conclude that full sequence is unlikely to offset commercial arrays for predicting genetic value when the number of loci is relatively large and the prior given to each SNP is uniform. Using sequence to improve selection thus requires optimized prior information and, likely, increased population sizes. The code and manual for SBVB are available at https://github.com/mperezenciso/sbvb0. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
37. Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree.
- Author
-
de los Campos, Gustavo, Naya, Hugo, Gianola, Daniel, Crossa, José, Legarra, Andrés, Manfredi, Eduardo, Weigel, Kent, and Cotes, José Miguel
- Subjects
- *
REGRESSION analysis , *MULTIVARIATE analysis , *GENETIC markers , *BIOMARKERS , *SIMULATION methods & models , *GENETICS - Abstract
The availability of genomewide dense markers brings opportunities and challenges to breeding programs. An important question concerns the ways in which dense markers and pedigrees, together with phenotypic records, should be used to arrive at predictions of genetic values for complex traits. If a large number of markers are included in a regression model, marker-specific shrinkage of regression coefficients may be needed. For this reason, the Bayesian least absolute shrinkage and selection operator (LASSO) (BL) appears to be an interesting approach for fitting marker effects in a regression model. This article adapts the BL to arrive at a regression model where markers, pedigrees, and covariates other than markers are considered jointly. Connections between BL and other marker-based regression models are discussed, and the sensitivity of BL with respect to the choice of prior distributions assigned to key parameters is evaluated using simulation. The proposed model was fitted to two data sets from wheat and mouse populations, and evaluated using cross-validation methods. Results indicate that inclusion of markers in the regression further improved the predictive ability of models. An R program that implements the proposed model is freely available. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.