Back to Search Start Over

A Semi-Automated SNP-Based Approach for Contaminant Identification in Biparental Polyploid Populations of Tropical Forage Grasses

Authors :
Felipe Bitencourt Martins
Aline Costa Lima Moraes
Alexandre Hild Aono
Rebecca Caroline Ulbricht Ferreira
Lucimara Chiari
Rosangela Maria Simeão
Sanzio Carvalho Lima Barrios
Mateus Figueiredo Santos
Liana Jank
Cacilda Borges do Valle
Bianca Baccili Zanotto Vigna
Anete Pereira de Souza
CACILDA BORGES DO VALLE, CNPGC
BIANCA BACCILI ZANOTTO VIGNA, CPPSE
ANETE PEREIRA DE SOUZA, Center for Molecular Biology and Genetic Engineering
UNICAMP.
ALEXANDRE HILD AONO, Center for Molecular Biology and Genetic Engineering
FELIPE BITENCOURT MARTINS, Center for Molecular Biology and Genetic Engineering
ALINE COSTA LIMA MORAES, Center for Molecular Biology and Genetic Engineering
REBECCA CAROLINE ULBRICHT FERREIRA, Center for Molecular Biology and Genetic Engineering
LUCIMARA CHIARI, CNPGC
ROSANGELA MARIA SIMEAO, CNPGC
SANZIO CARVALHO LIMA BARRIOS, CNPGC
MATEUS FIGUEIREDO SANTOS, CNPGC
LIANA JANK, CNPGC
Source :
Frontiers in Plant Science, Vol 12 (2021), Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA-Alice), Empresa Brasileira de Pesquisa Agropecuária (Embrapa), instacron:EMBRAPA, Frontiers in Plant Science
Publication Year :
2021
Publisher :
Frontiers Media S.A., 2021.

Abstract

Artificial hybridization plays a fundamental role in plant breeding programs since it generates new genotypic combinations that can result in desirable phenotypes. Depending on the species and mode of reproduction, controlled crosses may be challenging, and contaminating individuals can be introduced accidentally. In this context, the identification of such contaminants is important to avoid compromising further selection cycles, as well as genetic and genomic studies. The main objective of this work was to propose an automated multivariate methodology for the detection and classification of putative contaminants, including apomictic clones (ACs), self-fertilized individuals, half-siblings (HSs), and full contaminants (FCs), in biparental polyploid progenies of tropical forage grasses. We established a pipeline to identify contaminants in genotyping-by-sequencing (GBS) data encoded as allele dosages of single nucleotide polymorphism (SNP) markers by integrating principal component analysis (PCA), genotypic analysis (GA) measures based on Mendelian segregation, and clustering analysis (CA). The combination of these methods allowed for the correct identification of all contaminants in all simulated progenies and the detection of putative contaminants in three real progenies of tropical forage grasses, providing an easy and promising methodology for the identification of contaminants in biparental progenies of tetraploid and hexaploid species. The proposed pipeline was made available through the polyCID Shiny app and can be easily coupled with traditional genetic approaches, such as linkage map construction, thereby increasing the efficiency of breeding programs. Made available in DSpace on 2021-12-07T14:00:42Z (GMT). No. of bitstreams: 1 SemiAutomatedSNP.pdf: 2844859 bytes, checksum: e7cb8fd79620358847aaa86717df754b (MD5) Previous issue date: 2021

Details

Language :
English
Volume :
12
Database :
OpenAIRE
Journal :
Frontiers in Plant Science
Accession number :
edsair.doi.dedup.....1f41e77976e9f3d417b06ede12acb3b5
Full Text :
https://doi.org/10.3389/fpls.2021.737919/full