Janet Higgins, Bruno Santos, Tran Dang Khanh, Khuat Huu Trung, Tran Duy Duong, Nguyen Thi Phuong Doai, Nguyen Truong Khoa, Dang Thi Thanh Ha, Nguyen Thuy Diep, Kieu Thi Dung, Cong Nguyen Phi, Tran Thi Thuy, Nguyen Thanh Tuan, Hoang Dung Tran, Nguyen Thanh Trung, Hoang Thi Giang, Ta Kim Nhung, Cuong Duy Tran, Son Vi Lang, La Tuan Nghia, Nguyen Van Giang, Tran Dang Xuan, Anthony Hall, Sarah Dyer, Le Huy Ham, Mario Caccamo, and Jose J. De Vega
Abstract Background Vietnam possesses a vast diversity of rice landraces due to its geographical situation, latitudinal range, and a variety of ecosystems. This genetic diversity constitutes a highly valuable resource at a time when the highest rice production areas in the low-lying Mekong and Red River Deltas are enduring increasing threats from climate changes, particularly in rainfall and temperature patterns. Results We analysed 672 Vietnamese rice genomes, 616 newly sequenced, that encompass the range of rice varieties grown in the diverse ecosystems found throughout Vietnam. We described four Japonica and five Indica subpopulations within Vietnam likely adapted to the region of origin. We compared the population structure and genetic diversity of these Vietnamese rice genomes to the 3000 genomes of Asian cultivated rice. The named Indica-5 (I5) subpopulation was expanded in Vietnam and contained lowland Indica accessions, which had very low shared ancestry with accessions from any other subpopulation and were previously overlooked as admixtures. We scored phenotypic measurements for nineteen traits and identified 453 unique genotype-phenotype significant associations comprising twenty-one QTLs (quantitative trait loci). The strongest associations were observed for grain size traits, while weaker associations were observed for a range of characteristics, including panicle length, heading date and leaf width. Conclusions We showed how the rice diversity within Vietnam relates to the wider Asian rice diversity by using a number of approaches to provide a clear picture of the novel diversity present within Vietnam, mainly around the Indica-5 subpopulation. Our results highlight differences in genome composition and trait associations among traditional Vietnamese rice accessions, which are likely the product of adaption to multiple environmental conditions and regional preferences in a very diverse country. Our results highlighted traits and their associated genomic regions that are a potential source of novel loci and alleles to breed a new generation of low input sustainable and climate resilient rice.