1. Building the Embrapa rice breeding dataset for efficient data reuse
- Author
-
D. M. Soares, Julian Pietragalla, G. B. Abreu, Péricles de Carvalho Ferreira Neves, Marley Marico Utumi, José Manoel Colombari Filho, Raquel Neves de Mello, Mateo Vargas Hernández, Austrelino Silveira Filho, Flavio Breseghello, Elcio Perpetuo Guimaraes, I. V. Furtini, José Almeida Pereira, Sergio Lopes Júnior, Adriano Pereira de Castro, José Crossa, Paulo Ricardo Reis Fagundes, Paulo Hideo Nakano Rangel, Patricia Valle Pinheiro, Francisco Pereira Moura Neto, Antônio Carlos Centeno Cordeiro, Ariano Martins de Magalhães Júnior, FLAVIO BRESEGHELLO, CNPAF, RAQUEL NEVES DE MELLO, CNPAF, PATRICIA VALLE PINHEIRO, CNPAF, DINO MAGALHAES SOARES, CNPAF, SERGIO LOPES JUNIOR, CNPAF, PAULO HIDEO NAKANO RANGEL, CNPAF, ELCIO PERPETUO GUIMARAES, CNPAF, ADRIANO PEREIRA DE CASTRO, CNPAF, JOSE MANOEL COLOMBARI FILHO, CNPAF, ARIANO MARTINS DE MAGALHAES JUNIOR, CPACT, PAULO RICARDO REIS FAGUNDES, CPACT, PERICLES DE CARVALHO FERREIRA NEVES, CNPAF, ISABELA VOLPI FURTINI, CNPAF, MARLEY MARICO UTUMI, CPAF-RO, JOSE ALMEIDA PEREIRA, CPAMN, ANTONIO CARLOS CENTENO CORDEIRO, CPAF-RR, AUSTRELINO SILVEIRA FILHO, CPATU, GUILHERME BARBOSA ABREU, CPACP, FRANCISCO PEREIRA MOURA NETO, CNPAF, JULIAN PIETRAGALLA, INTEGRATED BREEDING PLATFORM, Texcoco, Mexico, MATEO VARGAS HERNÁNDEZ, CIMMYT, Texcoco-Mexico, and JOSE CROSSA, CIMMYT, Texcoco-Mexico.
- Subjects
Canopy ,Oryza Sativa ,Oryza sativa ,Fenótipo ,food and beverages ,Phenotypic trait ,Melhoramento Genético Vegetal ,Biology ,Upland rice ,Heritability ,Plant breeding ,Databases ,Agronomy ,Arroz ,Genetic gain ,Genetics ,Rice ,Banco de dados ,Agronomy and Crop Science ,Cropping ,Panicle - Abstract
Embrapa has led breeding programs for irrigated and upland rice (Oryza sativa L.) since 1977, generating a large amount of pedigree and phenotypic data. However, there were no systematic standards for data recording nor long-term data preservation and reuse strategies. With the new aim of making data reuse practical, we recovered all data available and structured it into the Embrapa Rice Breeding Dataset (ERBD). In its current version, the ERBD includes 20,504 crosses involving 9,974 parents, the pedigrees of most of the 4,532 inbred lines that took part in advanced field trials, and phenotypic data from 2,711 field trials (1,118 irrigated, 1,593 upland trials), representing 226,458 field plots. Those trials were conducted over 38 years (1982-2019), in 247 locations, in latitudes ranging from 3°N to 33°S. Phenotypic traits included grain yield, days to flowering, plant height, canopy lodging, and five important fungal diseases: leaf blast, panicle blast, brown spot, leaf scald, and grain discoloration. The total number of data points surpasses 1.27 million. Descriptive statistics were computed over the dataset, split by cropping systems (irrigated or upland). The mean heritability of grain yield was high for both systems, at around .7, whereas the mean coefficient of variation was 13.9% for irrigated trials and 18.7% for upland trials. The ERBD offers the possibility of conducting studies on different aspects of rice breeding and genetics, including genetic gain, G×E analysis, genome-wide association studies and genomic prediction. Made available in DSpace on 2021-11-30T12:00:26Z (GMT). No. of bitstreams: 1 cropscience-2021.pdf: 1682169 bytes, checksum: e47084fb3060926dd37ca8419b8936da (MD5) Previous issue date: 2021
- Published
- 2021
- Full Text
- View/download PDF