1. New insights into the wheat chromosome 4D structure and virtual gene order, revealed by survey pyrosequencing
- Author
-
Catherine Feuillet, Miroslav Valárik, Bernardo J. Clavijo, Ingrid Garbus, Jeroslav Doležel, Maximo Rivarola, G. Tranquilli, Marcelo Helguera, Leonardo Sebastián Vanzetti, Mario Caccamo, Norma Paniego, Mihaela Martis, Viviana Echenique, Klaus F. X. Mayer, Hana Šimková, Phillippe Leroy, Sergio Alberto González, Estación Experimental Agropecuaria Marcos Juáre, Instituto Nacional de Tecnología Agropecuaria (INTA), Instituto de Biotecnología, Centro Investigación en Ciencias Veterinarias y Agronómicas (CICVyA), Consejo Nacional de Investigaciones Científicas y Técnicas [Buenos Aires] (CONICET), The Genome Analysis Centre (TGAC), MIPS/IBIS, Helmholtz-Zentrum München (HZM), Estación Experimental Agropecuaria Marcos Juárez, Instituto de Biotecnología, Centro Investigación en Ciencias Veterinarias y Agronómicas (CICVyA), Centro de Recursos Naturales Renovables de la Zona Semiárida [Bahía Blanca] (CERZOS), Universidad Nacional del Sur [Argentina] (UNS)-Consejo Nacional de Investigaciones Científicas y Técnicas [Buenos Aires] (CONICET), Universidad Nacional del Sur [Argentina] (UNS), Génétique Diversité et Ecophysiologie des Céréales (GDEC), Institut National de la Recherche Agronomique (INRA)-Université Blaise Pascal - Clermont-Ferrand 2 (UBP), Centre of the Region Haná for Biotechnological and Agricultural Research, Institute of Experimental Botany, Norwich Research Park, (TGAC), Centro de Investigación de Recursos Naturales (CIRN), CONICET (International Cooperation Grant) D456, AEBIO 245001.711.732 902 PNBIO 1131041.43 PNCYO 1127041, Universidad Nacional del Sur (PGI-TIR) CSU-142/14, Agencia Espanola de Cooperacion Internacional y Desarrollo AECID-A1/041041/11, Czech Science Foundation P501/12/G090, National Program of Sustainability I LO1204, Institute of Experimental Botany of the Czech Academy of Sciences (IEB / CAS), Czech Academy of Sciences [Prague] (CAS)-Czech Academy of Sciences [Prague] (CAS), Helmholtz Zentrum München = German Research Center for Environmental Health, Consejo Nacional de Investigaciones Científicas y Técnicas [Buenos Aires] (CONICET)-Universidad Nacional del Sur [Argentina] (UNS), and Université Blaise Pascal - Clermont-Ferrand 2 (UBP)-Institut National de la Recherche Agronomique (INRA)
- Subjects
0106 biological sciences ,Cromosoma 4 D ,Gene annotation ,Virtual gene order ,Plant Science ,01 natural sciences ,Genome ,Chromosome 4d Survey Sequence ,Gene Annotation ,Gene Content ,Synteny ,Triticum Aestivum ,Virtual Gene Order ,Gene Order ,Triticum ,Expressed Sequence Tags ,2. Zero hunger ,Genetics ,0303 health sciences ,Expressed sequence tag ,virtual gene order ,biology ,Contig ,synteny ,Chromosome Mapping ,High-Throughput Nucleotide Sequencing ,food and beverages ,General Medicine ,Gene content ,SNP, single nucleotide polymorphism ,dN/dS, non-synonymous-to-synonymous substitutions ,EST, expressed sequence tag ,ISBP, insertion site-based polymorphism ,Biotecnología Agrícola y Biotecnología Alimentaria ,Os, Oryza sativa – rice ,Orthologous Gene ,Chromosome 4D survey sequence ,LMP, long mate pair ,Molecular Sequence Data ,gene content ,Biotecnología Agropecuaria ,Triticum aestivum ,CDS, coding DNA sequences ,Trigo ,IWGSC, International Wheat Genome Sequencing Consortium ,Bd, Brachypodium distachyon ,Chromosomes ,Chromosomes, Plant ,Article ,03 medical and health sciences ,chromosome 4D survey sequence ,SE, single end ,Aegilops tauschii ,[SDV.BV]Life Sciences [q-bio]/Vegetal Biology ,030304 developmental biology ,Sb, Sorghum bicolor – sorghum ,Cromosomas ,NGS, next generation sequencing ,Sequence Analysis, DNA ,biology.organism_classification ,gene annotation ,Chromosome 4 ,Genes ,CIENCIAS AGRÍCOLAS ,MDA, multiple displacement amplification ,SSR, single sequence repeat ,Agronomy and Crop Science ,010606 plant biology & botany - Abstract
Highlights • Survey sequence of T. aestivum chromosome 4D was obtained by pyrosequencing. • Near 5700 genes were predicted on 4D chromosome, ∼2200 on 4DS and ∼3500 on 4DL. • A 4D virtual gene order based on synteny with orthologous gene loci is proposed. • Among group 4, higher collinearity exists between 4D and 4B as compared to 4A. • Complementary data to that provided by IWGSC is presented, available at NCBI., Survey sequencing of the bread wheat (Triticum aestivum L.) genome (AABBDD) has been approached through different strategies delivering important information. However, the current wheat sequence knowledge is not complete. The aim of our study is to provide different and complementary set of data for chromosome 4D. A survey sequence was obtained by pyrosequencing of flow-sorted 4DS (7.2×) and 4DL (4.1×) arms. Single ends (SE) and long mate pairs (LMP) reads were assembled into contigs (223 Mb) and scaffolds (65 Mb) that were aligned to Aegilops tauschii draft genome (DD), anchoring 34 Mb to chromosome 4. Scaffolds annotation rendered 822 gene models. A virtual gene order comprising 1973 wheat orthologous gene loci and 381 wheat gene models was built. This order was largely consistent with the scaffold order determined based on a published high density map from the Ae. tauschii chromosome 4, using bin-mapped 4D ESTs as a common reference. The virtual order showed a higher collinearity with homeologous 4B compared to 4A. Additionally, a virtual map was constructed and ∼5700 genes (∼2200 on 4DS and ∼3500 on 4DL) predicted. The sequence and virtual order obtained here using the 454 platform were compared with the Illumina one used by the IWGSC, giving complementary information.
- Published
- 2015
- Full Text
- View/download PDF