1. Whole-Genome Sequencing of the NARO World Rice Core Collection (WRC) as the Basis for Diversity and Association Studies
- Author
-
Masao Ishimoto, N Tanaka, Shuichi Fukuoka, Hiroyuki Kanamori, Hiroaki Sakai, Kazuhiko Sugimoto, Kaworu Ebana, Jun-ichi Yonemaru, Masahiko Kumagai, Jianzhong Wu, Yoshihiro Kawahara, and Matthew Shenton
- Subjects
0106 biological sciences ,0301 basic medicine ,Genome-wide association study ,Physiology ,Population ,Flowers ,Plant Science ,Genome browser ,Computational biology ,Biology ,Quantitative trait locus ,Genes, Plant ,01 natural sciences ,Genome ,Genetic diversity ,03 medical and health sciences ,Quantitative Trait, Heritable ,education ,Phylogeny ,Genetic association ,Ecotype ,Whole genome sequencing ,Principal Component Analysis ,education.field_of_study ,Whole Genome Sequencing ,Haplotype ,Regular Papers ,Genetic Variation ,food and beverages ,Oryza ,Cell Biology ,General Medicine ,Phenotype ,030104 developmental biology ,Haplotypes ,Core collection ,Mutation ,Genome, Plant ,010606 plant biology & botany - Abstract
Genebanks provide access to diverse materials for crop improvement. To utilize and evaluate them effectively, core collections, such as the World Rice Core Collection (WRC) in the Genebank at the National Agriculture and Food Research Organization, have been developed. Because the WRC consists of 69 accessions with a high degree of genetic diversity, it has been used for >300 projects. To allow deeper investigation of existing WRC data and to further promote research using Genebank rice accessions, we performed whole-genome resequencing of these 69 accessions, examining their sequence variation by mapping against the Oryza sativa ssp. japonica Nipponbare genome. We obtained a total of 2,805,329 single nucleotide polymorphisms (SNPs) and 357,639 insertion–deletions. Based on the principal component analysis and population structure analysis of these data, the WRC can be classified into three major groups. We applied TASUKE, a multiple genome browser to visualize the different WRC genome sequences, and classified haplotype groups of genes affecting seed characteristics and heading date. TASUKE thus provides access to WRC genotypes as a tool for reverse genetics. We examined the suitability of the compact WRC population for genome-wide association studies (GWASs). Heading date, affected by a large number of quantitative trait loci (QTLs), was not associated with known genes, but several seed-related phenotypes were associated with known genes. Thus, for QTLs of strong effect, the compact WRC performed well in GWAS. This information enables us to understand genetic diversity in 37,000 rice accessions maintained in the Genebank and to find genes associated with different phenotypes.The sequence data have been deposited in DNA Data Bank of Japan Sequence Read Archive (DRA) (Supplementary Table S1).
- Published
- 2020
- Full Text
- View/download PDF