38 results on '"Marcela K Tello-Ruiz"'
Search Results
2. An improved reference of the grapevine genome reasserts the origin of the PN40024 highly homozygous genotype
- Author
-
Amandine Velt, Bianca Frommer, Sophie Blanc, Daniela Holtgräwe, Éric Duchêne, Vincent Dumas, Jérôme Grimplet, Philippe Hugueney, Catherine Kim, Marie Lahaye, José Tomás Matus, David Navarro-Payá, Luis Orduña, Marcela K Tello-Ruiz, Nicola Vitulo, Doreen Ware, and Camille Rustenholz
- Subjects
Genetics ,QH426-470 - Abstract
AbstractThe genome sequence of the diploid and highly homozygous Vitis viniferaVitis.
- Published
- 2023
- Full Text
- View/download PDF
3. Double triage to identify poorly annotated genes in maize: The missing link in community curation.
- Author
-
Marcela K Tello-Ruiz, Cristina F Marco, Fei-Man Hsu, Rajdeep S Khangura, Pengfei Qiao, Sirjan Sapkota, Michelle C Stitzer, Rachael Wasikowski, Hao Wu, Junpeng Zhan, Kapeel Chougule, Lindsay C Barone, Cornel Ghiban, Demitri Muna, Andrew C Olson, Liya Wang, Doreen Ware, and David A Micklos
- Subjects
Medicine ,Science - Abstract
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the maize genome may suggest that manual curation of gene models is no longer necessary. However, quality metrics generated by the MAKER-P gene annotation pipeline identified 17,225 of 130,330 (13%) protein-coding transcripts in the B73 Reference Genome V4 gene set with models of low concordance to available biological evidence. Working with eight graduate students, we used the Apollo annotation editor to curate 86 transcript models flagged by quality metrics and a complimentary method using the Gramene gene tree visualizer. All of the triaged models had significant errors-including missing or extra exons, non-canonical splice sites, and incorrect UTRs. A correct transcript model existed for about 60% of genes (or transcripts) flagged by quality metrics; we attribute this to the convention of elevating the transcript with the longest coding sequence (CDS) to the canonical, or first, position. The remaining 40% of flagged genes resulted in novel annotations and represent a manual curation space of about 10% of the maize genome (~4,000 protein-coding genes). MAKER-P metrics have a specificity of 100%, and a sensitivity of 85%; the gene tree visualizer has a specificity of 100%. Together with the Apollo graphical editor, our double triage provides an infrastructure to support the community curation of eukaryotic genomes by scientists, students, and potentially even citizen scientists.
- Published
- 2019
- Full Text
- View/download PDF
4. Ensembl Genomes 2022: an expanding genome resource for non-vertebrates.
- Author
-
Andrew D. Yates, James E. Allen, M. Ridwan Amode, Andrey G. Azov, Matthieu Barba, Andrés Becerra, Jyothish Bhai, Lahcen I. Campbell, Manuel Carbajo Martinez, Marc Chakiachvili, Kapeel Chougule, Mikkel B. Christensen, Bruno Contreras-Moreira, Alayne Cuzick, Luca Da Rin Fioretto, Paul Davis 0001, Nishadi De Silva, Stavros Diamantakis, Sarah Dyer, Justin Elser, Carla V. Filippi, Astrid Gall, Dionysios Grigoriadis, Cristina Guijarro-Clarke, Parul Gupta, Kim E. Hammond-Kosack, Kevin L. Howe, Pankaj Jaiswal, Vinay Kaikala, Vivek Kumar, Sunita Kumari, Nick Langridge, Tuan Le, Manuel Luypaert, Gareth Maslen, Thomas Maurel, Benjamin Moore, Matthieu Muffato, Aleena Mushtaq, Guy Naamati, Sushma Naithani, Andrew Olson, Anne Parker, Michael Paulini, Helder Pedro, Emily Perry, Justin Preece, Mark Quinton-Tulloch, Faye Rodgers, Marc Rosello, Magali Ruffier, James Seager, Vasily Sitnik, Michal Szpak, John G. Tate, Marcela K. Tello-Ruiz, Stephen J. Trevanion, Martin Urban, Doreen Ware, Sharon Wei, Gary Williams, Andrea Winterbottom, Magdalena Zarowiecki, Robert D. Finn, and Paul Flicek
- Published
- 2022
- Full Text
- View/download PDF
5. Gramene 2021: harnessing the power of comparative genomics and pathways for plant research.
- Author
-
Marcela K. Tello-Ruiz, Sushma Naithani, Parul Gupta, Andrew Olson, Sharon Wei, Justin Preece, Yinping Jiao, Bo Wang, Kapeel Chougule, Priyanka Garg, Justin Elser, Sunita Kumari, Vivek Kumar, Bruno Contreras-Moreira, Guy Naamati, Nancy George, Justin Cook, Dan M. Bolser, Peter D'Eustachio, Lincoln D. Stein, Amit Gupta, Weijia Xu, Jennifer Regala, Irene Papatheodorou, Paul J. Kersey, Paul Flicek, Crispin Taylor, Pankaj Jaiswal, and Doreen Ware
- Published
- 2021
- Full Text
- View/download PDF
6. Ensembl Genomes 2020 - enabling non-vertebrate genomic research.
- Author
-
Kevin L. Howe, Bruno Contreras-Moreira, Nishadi De Silva, Gareth Maslen, Wasiu A. Akanni, James E. Allen, Jorge álvarez-Jarreta, Matthieu Barba, Dan M. Bolser, Lahcen Cambell, Manuel Carbajo, Marc Chakiachvili, Mikkel B. Christensen, Carla A. Cummins, Alayne Cuzick, Paul Davis 0001, Silvie Fexova, Astrid Gall, Nancy George, Laurent Gil, Parul Gupta, Kim E. Hammond-Kosack, Erin Haskell, Sarah E. Hunt, Pankaj Jaiswal, Sophie H. Janacek, Paul J. Kersey, Nick Langridge, Uma Maheswari, Thomas Maurel, Mark D. McDowall, Benjamin Moore, Matthieu Muffato, Guy Naamati, Sushma Naithani, Andrew Olson, Irene Papatheodorou, Mateus Patricio, Michael Paulini, Helder Pedro, Emily Perry, Justin Preece, Marc Rosello, Matthew Russell, Vasily Sitnik, Daniel M. Staines, Joshua C. Stein, Marcela K. Tello-Ruiz, Stephen J. Trevanion, Martin Urban, Sharon Wei, Doreen Ware, Gary Williams, Andrew D. Yates, and Paul Flicek
- Published
- 2020
- Full Text
- View/download PDF
7. Plant Reactome: a knowledgebase and resource for comparative pathway analysis.
- Author
-
Sushma Naithani, Parul Gupta, Justin Preece, Peter D'Eustachio, Justin Elser, Priyanka Garg, Daemon A. Dikeman, Jason Kiff, Justin Cook, Andrew Olson, Sharon Wei, Marcela K. Tello-Ruiz, Antonio Fabregat Mundo, Alfonso Muñoz-Pomer Fuentes, Suhaib Mohammed, Tiejun Cheng, Evan Bolton, Irene Papatheodorou, Lincoln Stein, Doreen Ware, and Pankaj Jaiswal
- Published
- 2020
- Full Text
- View/download PDF
8. Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.
- Author
-
Paul Julian Kersey, James E. Allen, Alexis Allot, Matthieu Barba, Sanjay Boddu, Bruce J. Bolt, Denise Carvalho-Silva, Mikkel B. Christensen, Paul Davis 0001, Christoph Grabmueller, Navin Kumar, Zicheng Liu 0004, Thomas Maurel, Benjamin Moore, Mark D. McDowall, Uma Maheswari, Guy Naamati, Victoria Newman, Chuang Kee Ong, Michael Paulini, Helder Pedro, Emily Perry, Matthew Russell, Helen Sparrow, Electra Tapanari, Kieron R. Taylor, Alessandro Vullo, Gareth Williams, Amonida Zadissa, Andrew Olson, Joshua C. Stein, Sharon Wei, Marcela K. Tello-Ruiz, Doreen Ware, Aurelien Luciani, Simon C. Potter, Robert D. Finn, Martin Urban, Kim E. Hammond-Kosack, Dan M. Bolser, Nishadi De Silva, Kevin L. Howe, Nicholas Langridge, Gareth Maslen, Daniel Michael Staines, and Andrew D. Yates
- Published
- 2018
- Full Text
- View/download PDF
9. Gramene 2018: unifying comparative genomics and pathway resources for plant research.
- Author
-
Marcela K. Tello-Ruiz, Sushma Naithani, Joshua C. Stein, Parul Gupta, Michael Campbell, Andrew Olson, Sharon Wei, Justin Preece, Matthew J. Geniza, Yinping Jiao, Young Koung Lee, Bo Wang, Joseph Mulvaney, Kapeel Chougule, Justin Elser, Noor Al-Bader, Sunita Kumari, James Thomason, Vivek Kumar, Daniel M. Bolser, Guy Naamati, Electra Tapanari, Nuno A. Fonseca, Laura Huerta, Haider Iqbal, Maria Keays, Alfonso Muñoz-Pomer Fuentes, Y. Amy Tang, Antonio Fabregat, Peter D'Eustachio, Joel Weiser, Lincoln D. Stein, Robert Petryszak, Irene Papatheodorou, Paul J. Kersey, Patti Lockhart, Crispin Taylor, Pankaj Jaiswal, and Doreen Ware
- Published
- 2018
- Full Text
- View/download PDF
10. Ensembl Genomes 2016: more genomes, more complexity.
- Author
-
Paul Julian Kersey, James E. Allen, Irina M. Armean, Sanjay Boddu, Bruce J. Bolt, Denise Carvalho-Silva, Mikkel B. Christensen, Paul Davis 0001, Lee J. Falin, Christoph Grabmueller, Jay C. Humphrey, Arnaud Kerhornou, Julia Khobova, Naveen K. Aranganathan, Nicholas Langridge, Ernesto Lowy-Gallego, Mark D. McDowall, Uma Maheswari, Michael Nuhn, Chuang Kee Ong, Bert Overduin, Michael Paulini, Helder Pedro, Emily Perry, Giulietta Spudich, Electra Tapanari, Brandon Walts, Gareth Williams, Marcela K. Tello-Ruiz, Joshua C. Stein, Sharon Wei, Doreen Ware, Daniel M. Bolser, Kevin L. Howe, Eugene Kulesha, Daniel Lawson, Gareth Maslen, and Daniel M. Staines
- Published
- 2016
- Full Text
- View/download PDF
11. Gramene 2016: comparative plant genomics and pathway resources.
- Author
-
Marcela K. Tello-Ruiz, Joshua C. Stein, Sharon Wei, Justin Preece, Andrew Olson, Sushma Naithani, Vindhya Amarasinghe, Palitha Dharmawardhana, Yinping Jiao, Joseph Mulvaney, Sunita Kumari, Kapeel Chougule, Justin Elser, Bo Wang, James Thomason, Daniel M. Bolser, Arnaud Kerhornou, Brandon Walts, Nuno A. Fonseca, Laura Huerta, Maria Keays, Y. Amy Tang, Helen E. Parkinson, Antonio Fabregat, Sheldon J. McKay, Joel Weiser, Peter D'Eustachio, Lincoln Stein, Robert Petryszak, Paul J. Kersey, Pankaj Jaiswal, and Doreen Ware
- Published
- 2016
- Full Text
- View/download PDF
12. NAPPN Annual Conference Abstract: Can we make phenomic, genetic, and genomic data FAIR? The AgBioData road trip towards best practices of data sharing and management in agricultural research and education
- Author
-
Annarita Marrano, Darwin Campbell, Jaqueline Campbell, Ethalinda Cannon, Laurel Cooper, Peter Harrison, Lisa Harper6, Eva Huala, Sook Jung, Sunita Kumari, John Mcnamara, Sushma Naithani, Monica Poelchau, Leonore Reiser, Margaret Staton, and Marcela K Tello-Ruiz
- Published
- 2023
- Full Text
- View/download PDF
13. An improved reference of the grapevine genome supports reasserting the origin of the PN40024 highly-homozygous genotype
- Author
-
Amandine Velt, Bianca Frommer, Sophie Blanc, Daniela Holtgräwe, Éric Duchêne, Vincent Dumas, Jérôme Grimplet, Philippe Hugueney, Marie Lahaye, Catherine Kim, José Tomás Matus, David Navarro-Payá, Luis Orduña, Marcela K. Tello-Ruiz, Nicola Vitulo, Doreen Ware, and Camille Rustenholz
- Subjects
Vitis vinífera ,Genomas - Abstract
The genome sequence assembly of the diploid and highly homozygousV. viniferagenotype PN40024 serves as the reference for many grapevine studies. Despite several improvements of the PN40024 genome assembly, its current version PN12X.v2 is quite fragmented and only represents the haploid state of the genome with mixed haplotypes. In fact, despite the PN40024 genome is nearly homozygous, it still contains various heterozygous regions. Taking the opportunity of the improvements that long-read sequencing technologies offer to fully discriminate haplotype sequences and considering that severalVitissp. genomes have recently been assembled with these approaches, an improved version of the reference, called PN40024.v4, was generated.Through incorporating long genomic sequencing reads to the assembly, the continuity of the 12X.v2 scaffolds was highly increased. The number of scaffolds decreased from 2,059 to 640 and the number of N bases was reduced by 88%. Additionally, the full alternative haplotype sequence was built for the first time, the chromosome anchoring was improved and the amount of unplaced scaffolds were reduced by half. To obtain a high-quality gene annotation that outperforms previous versions, a liftover approach was complemented with an optimized annotation workflow forVitis. Integration of the gene reference catalogue and its manual curation have also assisted in improving the annotation, while defining the most reliable estimation to date of 35,230 genes. Finally, we demonstrate that PN40024 resulted from selfings of cv. ‘Helfensteiner’ (cross of cv. ‘Pinot noir’ and ‘Schiava grossa’) instead of a single ‘Pinot noir’. These advances will help maintaining the PN40024 genome as a gold-standard reference also contributing in the eventual elaboration of the grapevine pangenome.
- Published
- 2022
- Full Text
- View/download PDF
14. Gramene 2021: harnessing the power of comparative genomics and pathways for plant research
- Author
-
Jennifer Regala, Lincoln Stein, Vivek Kumar, Irene Papatheodorou, Andrew Olson, Amit Gupta, Justin Cook, Bruno Contreras-Moreira, Justin Preece, Weijia Xu, Nancy George, Parul Gupta, Dan Bolser, Crispin B. Taylor, Peter D'Eustachio, Marcela K. Tello-Ruiz, Doreen Ware, Paul J. Kersey, Paul Flicek, Sharon Wei, Yinping Jiao, Justin Elser, Bo Wang, Sushma Naithani, Priyanka Garg, Guy Naamati, Sunita Kumari, Kapeel Chougule, and Pankaj Jaiswal
- Subjects
Crops, Agricultural ,0106 biological sciences ,Gene Organization ,Knowledge Bases ,Genomics ,Genome browser ,Computational biology ,Ontology (information science) ,Biology ,Zea mays ,01 natural sciences ,Genome ,Polyploidy ,03 medical and health sciences ,Annotation ,Gene Expression Regulation, Plant ,Gene Duplication ,Databases, Genetic ,Protein Interaction Mapping ,Genetics ,Database Issue ,Gene Regulatory Networks ,Gene ,Plant Proteins ,030304 developmental biology ,Comparative genomics ,Internet ,0303 health sciences ,food and beverages ,Molecular Sequence Annotation ,Oryza ,Plants ,Gene Ontology ,DNA Transposable Elements ,Genome, Plant ,Metabolic Networks and Pathways ,Software ,010606 plant biology & botany - Abstract
Gramene (http://www.gramene.org), a knowledgebase founded on comparative functional analyses of genomic and pathway data for model plants and major crops, supports agricultural researchers worldwide. The resource is committed to open access and reproducible science based on the FAIR data principles. Since the last NAR update, we made nine releases; doubled the genome portal's content; expanded curated genes, pathways and expression sets; and implemented the Domain Informational Vocabulary Extraction (DIVE) algorithm for extracting gene function information from publications. The current release, #63 (October 2020), hosts 93 reference genomes—over 3.9 million genes in 122 947 families with orthologous and paralogous classifications. Plant Reactome portrays pathway networks using a combination of manual biocuration in rice (320 reference pathways) and orthology-based projections to 106 species. The Reactome platform facilitates comparison between reference and projected pathways, gene expression analyses and overlays of gene–gene interactions. Gramene integrates ontology-based protein structure–function annotation; information on genetic, epigenetic, expression, and phenotypic diversity; and gene functional annotations extracted from plant-focused journals using DIVE. We train plant researchers in biocuration of genes and pathways; host curated maize gene structures as tracks in the maize genome browser; and integrate curated rice genes and pathways in the Plant Reactome.
- Published
- 2020
- Full Text
- View/download PDF
15. Expression Atlas update: gene and protein expression in multiple species
- Author
-
John C. Marioni, Jana Eliasova, Alfonso Munoz-Pomer, Upendra Kumbham, David García-Seisdedos, Sunita Kumari, Zhichiao Miao, Doreen Ware, Marcela K. Tello-Ruiz, Anja Füllgrabe, Sarah A. Teichmann, Tony Burdett, Suhaib Mohammed, Mathias Walzer, David Osumi-Sutherland, Juan Antonio Vizcaíno, Kerstin B. Meyer, Natassja Bush, Jonathan R. Manning, Lingyun Zhao, Deepti J. Kundu, Laura Clarke, Irene Papatheodorou, Alvis Brazma, Pablo Moreno, Haider Iqbal, Nancy George, Ananth Prakash, Silvie Fexova, Shengbo Wang, Andrey Solovyev, Martijn C. Nawijn, Yalan Bi, Mark J. Arends, Moreno, Pablo [0000-0002-9856-1679], Miao, Zhichiao [0000-0002-5777-9815], Fullgrabe, Anja [0000-0002-8674-0039], Clarke, Laura [0000-0002-5989-6898], Tello-Ruiz, Marcela Karey [0000-0002-7499-5368], Burdett, Tony [0000-0002-2513-5396], Marioni, John [0000-0001-9092-0852], Vizcaíno, Juan Antonio [0000-0002-3905-4335], Papatheodorou, Irene [0000-0001-7270-5470], Apollo - University of Cambridge Repository, and Groningen Research Institute for Asthma and COPD (GRIAC)
- Subjects
Proteomics ,Developmental stage ,AcademicSubjects/SCI00010 ,Atlas (topology) ,Sequence Analysis, RNA ,Gene Expression Profiling ,Computational Biology ,Proteins ,Computational biology ,Biology ,Expression (computer science) ,Multiple species ,Metadata ,Databases, Genetic ,Genetics ,Database Issue ,Gene and protein expression ,Humans ,RNA-Seq ,Single-Cell Analysis ,Gene ,Software - Abstract
The EMBL-EBI Expression Atlas is an added value knowledge base that enables researchers to answer the question of where (tissue, organism part, developmental stage, cell type) and under which conditions (disease, treatment, gender, etc) a gene or protein of interest is expressed. Expression Atlas brings together data from >4500 expression studies from >65 different species, across different conditions and tissues. It makes these data freely available in an easy to visualise form, after expert curation to accurately represent the intended experimental design, re-analysed via standardised pipelines that rely on open-source community developed tools. Each study's metadata are annotated using ontologies. The data are re-analyzed with the aim of reproducing the original conclusions of the underlying experiments. Expression Atlas is currently divided into Bulk Expression Atlas and Single Cell Expression Atlas. Expression Atlas contains data from differential studies (microarray and bulk RNA-Seq) and baseline studies (bulk RNA-Seq and proteomics), whereas Single Cell Expression Atlas is currently dedicated to Single Cell RNA-Sequencing (scRNA-Seq) studies. The resource has been in continuous development since 2009 and it is available at https://www.ebi.ac.uk/gxa.
- Published
- 2022
- Full Text
- View/download PDF
16. WormBase: a multi-species resource for nematode biology and genomics.
- Author
-
Todd W. Harris, Nansheng Chen, Fiona Cunningham, Marcela K. Tello-Ruiz, Igor Antoshechkin, Carol Bastiani, Tamberlyn Bieri, Darin Blasiar, Keith Bradnam, Juancarlos Chan, Chao-Kung Chen, Wen J. Chen, Paul Davis 0001, Eimear Kenny, Ranjana Kishore, Daniel Lawson, Raymond Y. N. Lee, Hans-Michael Müller, Cecilia Nakamura, Philip Ozersky, Andrei Petcherski, Anthony Rogers, Aniko Sabo, Erich M. Schwarz, Kimberly Van Auken, Qinghua Wang 0003, Richard Durbin, John Spieth, Paul W. Sternberg, and Lincoln D. Stein
- Published
- 2004
- Full Text
- View/download PDF
17. Plant Reactome: a knowledgebase and resource for comparative pathway analysis
- Author
-
Pankaj Jaiswal, Doreen Ware, Lincoln Stein, Marcela K. Tello-Ruiz, Antonio Fabregat Mundo, Jason Kiff, Alfonso Munoz-Pomer, Irene Papatheodorou, Sushma Naithani, Priyanka Garg, Parul Gupta, Evan E Bolton, Peter D'Eustachio, Justin Cook, Suhaib Mohammed, Andrew Olson, Tiejun Cheng, Sharon Wei, Daemon A Dikeman, Justin Elser, and Justin Preece
- Subjects
0106 biological sciences ,Proteomics ,Transport pathways ,Computational biology ,Biology ,Web Browser ,01 natural sciences ,03 medical and health sciences ,Upload ,Resource (project management) ,Databases, Genetic ,Genetics ,Database Issue ,Humans ,Metabolomics ,Gene Regulatory Networks ,Hormone signaling ,030304 developmental biology ,2. Zero hunger ,0303 health sciences ,Transcriptional Networks ,food and beverages ,Computational Biology ,Genomics ,15. Life on land ,Plants ,Pathway analysis ,Data model ,Plant species ,Metabolic Networks and Pathways ,010606 plant biology & botany ,Signal Transduction - Abstract
Plant Reactome (https://plantreactome.gramene.org) is an open-source, comparative plant pathway knowledgebase of the Gramene project. It uses Oryza sativa (rice) as a reference species for manual curation of pathways and extends pathway knowledge to another 82 plant species via gene-orthology projection using the Reactome data model and framework. It currently hosts 298 reference pathways, including metabolic and transport pathways, transcriptional networks, hormone signaling pathways, and plant developmental processes. In addition to browsing plant pathways, users can upload and analyze their omics data, such as the gene-expression data, and overlay curated or experimental gene-gene interaction data to extend pathway knowledge. The curation team actively engages researchers and students on gene and pathway curation by offering workshops and online tutorials. The Plant Reactome supports, implements and collaborates with the wider community to make data and tools related to genes, genomes, and pathways Findable, Accessible, Interoperable and Re-usable (FAIR).
- Published
- 2019
18. Ensembl Genomes 2020—enabling non-vertebrate genomic research
- Author
-
Bruno Contreras-Moreira, Laurent Gil, Uma Maheswari, Erin Haskell, Paul Flicek, Kim E. Hammond-Kosack, Stephen J. Trevanion, Matthieu Barba, Sushma Naithani, Matthew Russell, Mark D. McDowall, Nancy George, Andrew D. Yates, Emily Perry, Vasily Sitnik, Dan Bolser, Michael Paulini, Paul J. Kersey, Pankaj Jaiswal, Mikkel B. Christensen, Helder Pedro, Gareth Maslen, Lahcen Cambell, Sophie Helen Janacek, Daniel M. Staines, Gary Williams, Matthieu Muffato, Carla Cummins, Wasiu Akanni, Marc Rosello, James E. Allen, Irene Papatheodorou, Astrid Gall, Nick Langridge, Marc Chakiachvili, Joshua C. Stein, Mateus Patricio, Silvie Fexova, Nishadi De Silva, Sarah E. Hunt, Benjamin Moore, Martin Urban, Justin Preece, Marcela K. Tello-Ruiz, Guy Naamati, Andrew Olson, Parul Gupta, Thomas Maurel, Jorge Alvarez-Jarreta, Doreen Ware, Paul Davis, Manuel Carbajo, Alayne Cuzick, Kevin L. Howe, and Sharon Wei
- Subjects
0106 biological sciences ,Tree of life ,Genomics ,Context (language use) ,Computational biology ,Biology ,01 natural sciences ,Genome ,User-Computer Interface ,03 medical and health sciences ,Resource (project management) ,Reference Values ,Ensembl Genomes ,Databases, Genetic ,Genetics ,Database Issue ,Animals ,Ensembl ,Caenorhabditis elegans ,030304 developmental biology ,Internet ,0303 health sciences ,Computational Biology ,Genetic Variation ,Molecular Sequence Annotation ,Plants ,Phenotype ,Genome, Fungal ,Algorithms ,Genome, Bacterial ,Genome, Plant ,Software ,010606 plant biology & botany - Abstract
Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of interfaces to genomic data across the tree of life, including reference genome sequence, gene models, transcriptional data, genetic variation and comparative analysis. Data may be accessed via our website, online tools platform and programmatic interfaces, with updates made four times per year (in synchrony with Ensembl). Here, we provide an overview of Ensembl Genomes, with a focus on recent developments. These include the continued growth, more robust and reproducible sets of orthologues and paralogues, and enriched views of gene expression and gene function in plants. Finally, we report on our continued deeper integration with the Ensembl project, which forms a key part of our future strategy for dealing with the increasing quantity of available genome-scale data across the tree of life.
- Published
- 2019
- Full Text
- View/download PDF
19. Author response: Vision, challenges and opportunities for a Plant Cell Atlas
- Author
-
Luis C. Romero, Ai My Luong, Jenny C Mortimer, Nicolas L. Taylor, Sergio Alan Cervantes-Pérez, David W. Ehrhardt, Yana Kazachkova, Adrien Burlaocot, Rajiv K. Tripathi, Alfredo Cruz-Ramírez, Nicholas J. Provart, Uwe John, Shou-Ling Xu, Renate A Weizbauer, Mathew G. Lewsey, José M. Palma, R. Glen Uhrig, Asela J. Wijeratne, Maria J. Harrison, William P Dwyer, Alexander T. Borowsky, Yuling Jiao, Kaushal Kumar Bhati, Edoardo Bertolini, Anna Stepanova, Francisco J. Corpas, Fabio Zanini, Pubudu P. Handakumbura, Dominique C. Bergmann, Devang Mehta, Saroj K Sah, Naomi Nakayama, Claire D McWhite, Jahed Ahmed, Dhruv Lavania, Gazala Ameen, Mather A Khan, Marc Libault, Gergo Palfalvi, Seung Y. Rhee, Laura E. Bartley, Vaishali Arora, Cesar L. Cuevas-Velazquez, Josh T. Cuperus, Benjamin Buer, Amir H. Ahkami, Lachezar A. Nikolov, Selena L Rice, Feng Zhao, Ronelle Roth, Ajay Kumar, Atique ur Rehman, Andrew Farmer, Maida Romera-Branchat, Zhi-Yong Wang, Tuan M Tran, Lydia-Marie Joubert, Le Liu, Julia Bailey-Serres, Fabio Gomez-Cano, Ramin Yadegari, Sanjay Joshi, James Whelan, Batthula Vijaya Lakshmi Vadde, Rachel Shahan, Houlin Yu, Bao-Hua Song, Andrey V Malkovskiy, Arun Kumar, Aaron J. Ogden, Javier Brumos, Xiaohong Zhuang, Oluwafemi Alaba, Harmanpreet Kaur, Tatsuya Nobori, Marisa S. Otegui, Peter H Denolf, Miguel Miñambres Martín, Sakil Mahmud, Tingting Xiang, Lisa I David, Justin W. Walley, Purva Karia, Maite Saura-Sanchez, Pankaj Kumar, Jamie Waese, Ansul Lokdarshi, Suryatapa Ghosh Jha, Sagar Kumar, Matthew M. S. Evans, Hai Ying Yuan, Rajveer Singh, Puneet Paul, Carly A Martin, Robert E. Jinkerson, Dianyi Liu, Rajdeep S. Khangura, Dae Kwan Ko, Tedrick Thomas Salim Lew, Jennifer A N Brophy, Ari Pekka Mähönen, Marija Vidović, Mark-Christoph Ott, Alok Arun, Pinky Agarwal, Pradeep Kumar, Alexandre P. Marand, R. Clay Wright, Moises Exposito-Alonso, Rosangela Sozzani, Tamas Varga, Luigi Di Costanzo, Shyam Solanki, Sixue Chen, Chien-Yuan Lin, Iain C. Macaulay, Tie Liu, Elsa H Quezada-Rodríguez, Trevor M. Nolan, Peter Denolf, Stefania Giacomello, Elizabeth S. Haswell, Nancy George, Noel Blanco-Touriñán, Bruno Contreras-Moreira, Benjamin J. Cole, Abhishek Joshi, Steven P. Briggs, Toshihiro Obata, Kerstin Kaufmann, Kenneth D. Birnbaum, Klaas J. van Wijk, Noah Fahlgren, Kamal Kumar Malukani, Ramesh Katam, Pingtao Ding, Mario A. Arteaga-Vazquez, Marcela K. Tello-Ruiz, Shao-shan Carol Huang, Sunil Kumar Kenchanmane Raju, Venura Herath, George W. Bassel, Christopher R. Anderton, Stefan de Folter, Gary Stacey, and Jie Zhu
- Subjects
Engineering ,Atlas (topology) ,business.industry ,business ,Data science - Published
- 2021
- Full Text
- View/download PDF
20. Comparative Analysis of Sorghum EMS Mutants and Natural Populations
- Author
-
Marcela K. Tello-Ruiz, Jenny Mortimer, Anna Lipzen, Zhanguo Xin, Liya Wang, Junping Chen, Doreen Ware, Zhenyuan Lu, Xiaofei Wang, and Kerrie Barry
- Subjects
Genetics ,Functional validation ,education.field_of_study ,Mutation ,Population ,Mutant ,Biology ,Sorghum ,biology.organism_classification ,medicine.disease_cause ,Natural population growth ,medicine ,education ,Gene - Abstract
To build a large-scale genomic resource for functional validation of sorghum genes through EMS-mutagenized BTx623 seeds, we deep sequenced (30-60X) an additional 445 phenotyped EMS mutant lines. 4.2 million EMS mutations are called with nearly 36,800 mutations that could have a disruptive effect on functions of over 15,500 genes. Combining variants carried by both the natural population and previous EMS efforts, over 69% of sorghum coding genes (23644) are now presented with one or more mutations that are, or are predicted to be, disruptive to their functions. Our results show that the EMS population carries more significant mutations but less in each sample than the natural population, which makes it more powerful in elucidating sorghum gene functions on a large scale and requiring less work in validation of candidate causal genes. We have made the data available through two ways, one is the integration with the BSAseq workflow that supports retrieving independent EMS samples carrying the same genes with significant mutation for complementary testing, and the other is a web application for directly querying genes with significant mutations on SciApps.org.
- Published
- 2021
- Full Text
- View/download PDF
21. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes
- Author
-
Qiuhan Jiang, David E. Hufnagel, Rebecca D. Piri, Jianming Yu, John L. Portwood, Carson M. Andorf, Shujun Ou, Matthew B. Hufford, Doreen Ware, Margaret R. Woodhouse, Xianran Li, Jonathan I. Gent, William A. Ricci, Erin Baggs, Dong Won Kim, Na Wang, Samantha J. Snodgrass, Silas Tittes, Tingting Guo, Sharon Wei, Andrew Olson, Ethalinda K. S. Cannon, Amanda M. Gilbert, Kevin Fengler, Michael Syring, Arun S. Seetharam, Rafael Della Coletta, Zhenyuan Lu, David Kudrna, Victor Llaca, R. Kelly Dawe, Ksenia V. Krasileva, Jeffrey Ross-Ibarra, Robert J. Schmitz, Nancy Manchanda, Yinjie Qiu, Alexandre P. Marand, Bo Wang, Sarah Pedersen, Kapeel Chougule, Marcela K. Tello-Ruiz, Christine H. O’Connor, Yibing Zeng, Candice N. Hirsch, Asher I. Hudson, and Jianing Liu
- Subjects
2. Zero hunger ,0106 biological sciences ,0303 health sciences ,education.field_of_study ,Population ,Sequence assembly ,Computational biology ,Quantitative trait locus ,Biology ,01 natural sciences ,Genome ,Structural variation ,03 medical and health sciences ,DNA methylation ,Nested association mapping ,education ,Gene ,030304 developmental biology ,010606 plant biology & botany - Abstract
We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The data indicate that the number of pan-genes exceeds 103,000 and that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres further reveal the locations and internal structures of major cytological landmarks. We show that combining structural variation with SNPs can improve the power of quantitative mapping studies. Finally, we document variation at the level of DNA methylation, and demonstrate that unmethylated regions are enriched for cis-regulatory elements that overlap QTL and contribute to changes in gene expression.One sentence summaryA multi-genome analysis of maize reveals previously unknown variation in gene content, genome structure, and methylation.
- Published
- 2021
- Full Text
- View/download PDF
22. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes
- Author
-
Xianran Li, Marcela K. Tello-Ruiz, Doreen Ware, Qiuhan Jiang, Robert J. Schmitz, Margaret R. Woodhouse, Carson M. Andorf, Dong Won Kim, Zhenyuan Lu, R. Kelly Dawe, Ethalinda K. S. Cannon, Arun S. Seetharam, Kevin Fengler, Ksenia V. Krasileva, David Kudrna, Amanda M. Gilbert, Jeffrey Ross-Ibarra, Rebecca D. Piri, Alexandre P. Marand, Yinjie Qiu, Nancy Manchanda, Christine H. O’Connor, Jonathan I. Gent, Silas Tittes, Michael Syring, Sharon Wei, Rafael Della Coletta, Candice N. Hirsch, Shujun Ou, Matthew B. Hufford, Jianing Liu, Na Wang, David E. Hufnagel, Samantha J. Snodgrass, Erin Baggs, Kapeel Chougule, Tingting Guo, Jianming Yu, Andrew Olson, John L. Portwood, Bo Wang, Sarah Pedersen, Yibing Zeng, Asher I. Hudson, William A. Ricci, and Victor Llaca
- Subjects
0106 biological sciences ,Multifactorial Inheritance ,Genotype ,General Science & Technology ,Population ,Centromere ,Sequence assembly ,Computational biology ,Biology ,Regulatory Sequences, Nucleic Acid ,Genes, Plant ,01 natural sciences ,Genome ,Polymorphism, Single Nucleotide ,Zea mays ,Chromosomes, Plant ,Article ,Structural variation ,03 medical and health sciences ,Genetic variation ,Nested association mapping ,education ,030304 developmental biology ,Disease Resistance ,Plant Diseases ,2. Zero hunger ,Comparative genomics ,0303 health sciences ,education.field_of_study ,Multidisciplinary ,Whole Genome Sequencing ,Chromosome Mapping ,Genetic Variation ,High-Throughput Nucleotide Sequencing ,Molecular Sequence Annotation ,Genome project ,Sequence Analysis, DNA ,DNA Methylation ,Tetraploidy ,Phenotype ,Transcriptome ,Genome, Plant ,010606 plant biology & botany - Abstract
An a-maize-ing set of genomes Maize is an important crop cultivated worldwide. As maize spread across the world, selection for local environments resulted in variation, but the impact on differences between the genome has not been quantified. By producing high-quality genomic sequences of the 26 lines used in the maize nested association mapping panel, Hufford et al . map important traits and demonstrate the diversity of maize. Examining RNA and methylation of genes across accessions, the authors identified a core set of maize genes. Beyond this core set, comparative analysis across lines identified high levels of variation in the total set of genes, the maize pan-genome. The value of this resource was further exemplified by mapping quantitative traits of interest, including those related to pathogen resistance. —LMZ
- Published
- 2021
23. Vision, challenges and opportunities for a Plant Cell Atlas
- Author
-
George W. Bassel, Claire D McWhite, Dhruv Lavania, Gazala Ameen, Christopher R. Anderton, Rajiv K. Tripathi, Maria J. Harrison, Josh T. Cuperus, Amir H. Ahkami, William P Dwyer, Bao-Hua Song, Fabio Zanini, Miguel Miñambres Martín, Atique ur Rehman, Cesar L. Cuevas-Velazquez, Ari Pekka Mähönen, Tamas Varga, Gergo Palfalvi, Andrew Farmer, Matthew M. S. Evans, Vaishali Arora, Uwe John, Mathew G. Lewsey, Dominique C. Bergmann, Selena L Rice, Mario A. Arteaga-Vazquez, Dae Kwan Ko, Tedrick Thomas Salim Lew, Jennifer A N Brophy, Jenny C Mortimer, Marc Libault, Bruno Contreras-Moreira, Benjamin J. Cole, Naomi Nakayama, Marcela K. Tello-Ruiz, Ronelle Roth, Laura E. Bartley, Tingting Xiang, Benjamin Buer, Shyam Solanki, Nicolas L. Taylor, Feng Zhao, Shao-shan Carol Huang, Alok Arun, Pinky Agarwal, Marisa S. Otegui, Arun Kumar, Marija Vidović, Pankaj Kumar, Aaron J. Ogden, Sagar Kumar, Puneet Paul, Sergio Alan Cervantes-Pérez, Purva Karia, Stefan de Folter, Kerstin Kaufmann, Gary Stacey, Le Liu, Robert E. Jinkerson, Javier Brumos, Harmanpreet Kaur, Tatsuya Nobori, David W. Ehrhardt, Francisco J. Corpas, Steven P. Briggs, James Whelan, Batthula Vijaya Lakshmi Vadde, Peter H Denolf, Tie Liu, Kamal Kumar Malukani, Elsa H Quezada-Rodríguez, Jahed Ahmed, Hai Ying Yuan, Rajveer Singh, Trevor M. Nolan, Ramesh Katam, Mather A Khan, Jamie Waese, Toshihiro Obata, Ramin Yadegari, Lachezar A. Nikolov, Seung Y. Rhee, Luis C. Romero, Ajay Kumar, Kenneth D. Birnbaum, Nicholas J. Provart, Tuan M Tran, Sakil Mahmud, Maida Romera-Branchat, Pradeep Kumar, Saroj K Sah, Ai My Luong, Alexandre P. Marand, R. Clay Wright, Yana Kazachkova, Moises Exposito-Alonso, Klaas J. van Wijk, Noah Fahlgren, Peter Denolf, Fabio Gomez-Cano, Houlin Yu, Luigi Di Costanzo, Adrien Burlaocot, Alfredo Cruz-Ramírez, Pingtao Ding, Dianyi Liu, Renate A Weizbauer, Suryatapa Ghosh Jha, Jie Zhu, Pubudu P. Handakumbura, Kaushal Kumar Bhati, Edoardo Bertolini, Anna Stepanova, Rachel Shahan, Lisa I David, Justin W. Walley, Lydia-Marie Joubert, Nancy George, Sanjay Joshi, José M. Palma, Rosangela Sozzani, Mark-Christoph Ott, Sixue Chen, Ansul Lokdarshi, Sunil Kumar Kenchanmane Raju, Chien-Yuan Lin, Iain C. Macaulay, Venura Herath, Noel Blanco-Touriñán, Rajdeep S. Khangura, Zhi-Yong Wang, Alexander T. Borowsky, Julia Bailey-Serres, Andrey V Malkovskiy, Xiaohong Zhuang, Oluwafemi Alaba, Yuling Jiao, Abhishek Joshi, Devang Mehta, Maite Saura-Sanchez, Carly A Martin, Stefania Giacomello, Elizabeth S. Haswell, Shou-Ling Xu, R. Glen Uhrig, Asela J. Wijeratne, National Science Foundation (US), Jha, S. G., Borowsky, A. T., Cole, B. J., Fahlgren, N., Farmer, A., Huang, S. C., Karia, P., Libault, M., Provart, N. J., Rice, S. L., Saura-Sanchez, M., Agarwal, P., Ahkami, A. H., Anderton, C. R., Briggs, S. P., Brophy, J. A., Denolf, P., Di Costanzo, L., Exposito-Alonso, M., Giacomello, S., Gomez-Cano, F., Kaufmann, K., Ko, D. K., Kumar, S., Malkovskiy, A. V., Nakayama, N., Obata, T., Otegui, M. S., Palfalvi, G., Quezada-Rodriguez, E. H., Singh, R., Uhrig, R. G., Waese, J., VAN WIJK, K., Wright, R. C., Ehrhardt, D. W., Birnbaum, K. D., Rhee, S. Y., Helsinki Institute of Life Science HiLIFE, and Institute of Biotechnology
- Subjects
Life Sciences & Biomedicine - Other Topics ,0106 biological sciences ,Engineering ,chlamydomonas reinhardtii ,Chloroplasts ,Plant Cell Atla ,0601 Biochemistry and Cell Biology ,maize ,01 natural sciences ,Zea may ,Plant science ,Molecular level ,cell biology ,Plant Cell Atlas Consortium ,Image Processing, Computer-Assisted ,Biology (General) ,single-cell omic ,2. Zero hunger ,0303 health sciences ,Atlas (topology) ,General Neuroscience ,Agriculture ,General Medicine ,Plants ,ARABIDOPSIS ,C-4 PHOTOSYNTHESIS ,Plant Cell Atlas ,single-cell omics ,Plant development ,VOCABULARY ,SYSTEMS BIOLOGY ,Medicine ,location-to-function ,Life Sciences & Biomedicine ,4D imaging ,QH301-705.5 ,DATABASE ,Science ,Plant Development ,Translational research ,Cellular level ,Environmental stewardship ,Zea mays ,Chloroplast ,General Biochemistry, Genetics and Molecular Biology ,MECHANISMS ,03 medical and health sciences ,Component (UML) ,Plant Cells ,Biology ,030304 developmental biology ,General Immunology and Microbiology ,business.industry ,Feature Article ,Computational Biology ,Plant ,15. Life on land ,11831 Plant biology ,GENE ,Data science ,science forum ,translational research ,13. Climate action ,A. thaliana ,PLASTIDS ,Biochemistry and Cell Biology ,business ,GENERATION ,010606 plant biology & botany - Abstract
With growing populations and pressing environmental problems, future economies will be increasingly plant-based. Now is the time to reimagine plant science as a critical component of fundamental science, agriculture, environmental stewardship, energy, technology and healthcare. This effort requires a conceptual and technological framework to identify and map all cell types, and to comprehensively annotate the localization and organization of molecules at cellular and tissue levels. This framework, called the Plant Cell Atlas (PCA), will be critical for understanding and engineering plant development, physiology and environmental responses. A workshop was convened to discuss the purpose and utility of such an initiative, resulting in a roadmap that acknowledges the current knowledge gaps and technical challenges, and underscores how the PCA initiative can help to overcome them., National Science Foundation 1916797 David W Ehrhardt, Kenneth D Birnbaum, Seung Yon Rhee; National Science Foundation 2052590 Seung Yon Rhee
- Published
- 2021
24. Exploring the fluid gene space of plants using the Gramene comparative gemics and pathway resource
- Author
-
Marcela K. Tello-Ruiz
- Subjects
Resource (biology) ,Computer science ,Space (commercial competition) ,Data science - Published
- 2020
- Full Text
- View/download PDF
25. Finding and fixing antation errors through community curation
- Author
-
Marcela K. Tello-Ruiz
- Published
- 2020
- Full Text
- View/download PDF
26. Gramene database: Navigating plant comparative genomics resources
- Author
-
Paul J. Kersey, Yinping Jiao, Robert Petryszak, Joshua C. Stein, Lincoln D. Stein, Doreen Ware, Antonio Fabregat, Sunita Kumari, Kapeel Chougule, Justin Preece, Laura Huerta, Peter D'Eustachio, Sharon Wei, Parul Gupta, Sushma Naithani, Andrew Olson, Pankaj Jaiswal, Maria Keays, Joel Weiser, Young Koung Lee, Marcela K. Tello-Ruiz, and Joseph Mulvaney
- Subjects
0301 basic medicine ,Systems biology ,Genomic data ,Plant Science ,Biology ,computer.software_genre ,Biochemistry ,Genome ,Article ,03 medical and health sciences ,Annotation ,lcsh:Botany ,Genetics ,2. Zero hunger ,Comparative genomics ,Phylogenetic tree ,Database ,food and beverages ,Cell Biology ,15. Life on land ,Pathway analysis ,Data science ,lcsh:QK1-989 ,030104 developmental biology ,Expression data ,computer ,Developmental Biology - Abstract
Gramene (http://www.gramene.org) is an online, open source, curated resource for plant comparative genomics and pathway analysis designed to support researchers working in plant genomics, breeding, evolutionary biology, system biology, and metabolic engineering. It exploits phylogenetic relationships to enrich the annotation of genomic data and provides tools to perform powerful comparative analyses across a wide spectrum of plant species. It consists of an integrated portal for querying, visualizing and analyzing data for 44 plant reference genomes, genetic variation data sets for 12 species, expression data for 16 species, curated rice pathways and orthology-based pathway projections for 66 plant species including various crops. Here we briefly describe the functions and uses of the Gramene database.
- Published
- 2016
- Full Text
- View/download PDF
27. Double triage to identify poorly annotated genes in maize: The missing link in community curation
- Author
-
Cornel Ghiban, Cristina F. Marco, Junpeng Zhan, Marcela K. Tello-Ruiz, Demitri Muna, Fei-Man Hsu, Michelle C. Stitzer, Rachael Wasikowski, Pengfei Qiao, Lindsay Barone, Doreen Ware, Sirjan Sapkota, Liya Wang, David A. Micklos, Rajdeep S. Khangura, Andrew Olson, Kapeel Chougule, and Hao Wu
- Subjects
0106 biological sciences ,Critical Care and Emergency Medicine ,Plant Science ,Plant Genetics ,01 natural sciences ,Genome ,Database and Informatics Methods ,Exon ,Databases, Genetic ,Invertebrate Genomics ,Plant Genomics ,Medicine and Health Sciences ,Coding region ,Data Curation ,Plant Proteins ,Data Management ,2. Zero hunger ,0303 health sciences ,Multidisciplinary ,Eukaryota ,Phylogenetic Analysis ,Genome project ,Genomics ,Plants ,Functional Genomics ,Phylogenetics ,Molecular Sequence Annotation ,Experimental Organism Systems ,Engineering and Technology ,Medicine ,Sequence Analysis ,Algorithms ,Research Article ,Biotechnology ,Computer and Information Sciences ,Bioinformatics ,Gene prediction ,Science ,Bioengineering ,Computational biology ,Biology ,Research and Analysis Methods ,Zea mays ,03 medical and health sciences ,Annotation ,Model Organisms ,Plant and Algal Models ,Genetics ,Humans ,Evolutionary Systematics ,Education, Graduate ,Grasses ,Students ,Gene ,Taxonomy ,030304 developmental biology ,Evolutionary Biology ,Models, Genetic ,Organisms ,Biology and Life Sciences ,Computational Biology ,Gene Annotation ,Genome Analysis ,Genome Annotation ,Maize ,Animal Genomics ,Animal Studies ,Plant Biotechnology ,Triage ,Sequence Alignment ,010606 plant biology & botany ,Reference genome - Abstract
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the maize genome may suggest that manual curation of gene models is no longer necessary. However, quality metrics generated by the MAKER-P gene annotation pipeline identified 17,225 of 130,330 (13%) protein-coding transcripts in the B73 Reference Genome V4 gene set with models of low concordance to available biological evidence. Working with eight graduate students, we used the Apollo annotation editor to curate 86 transcript models flagged by quality metrics and a complimentary method using the Gramene gene tree visualizer. All of the triaged models had significant errors – including missing or extra exons, non-canonical splice sites, and incorrect UTRs. A correct transcript model existed for about 60% of genes (or transcripts) flagged by quality metrics; we attribute this to the convention of elevating the transcript with the longest coding sequence (CDS) to the canonical, or first, position. The remaining 40% of flagged genes resulted in novel annotations and represent a manual curation space of about 10% of the maize genome (~4,000 protein-coding genes). MAKER-P metrics have a specificity of 100%, and a sensitivity of 85%; the gene tree visualizer has a specificity of 100%. Together with the Apollo graphical editor, our double triage provides an infrastructure to support the community curation of eukaryotic genomes by scientists, students, and potentially even citizen scientists.
- Published
- 2019
28. AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture
- Author
-
Christopher J. Mungall, Ethalinda K. S. Cannon, Rex T. Nelson, Pankaj Jaiswal, Daureen Nesdill, Marie-Angélique Laporte, Steven B. Cannon, Margaret R. Woodhouse, James P. Carson, David Grant, Margaret Staton, Jacqueline D. Campbell, Marcela K. Tello-Ruiz, Ramona Walls, Carson M. Andorf, Monica Munoz-Torres, Laurel Cooper, Jill L. Wegrzyn, Victor P. Unda, Sabarinath Subramaniam, Stephen P. Ficklin, Sook Jung, Pierre Larmande, James M. Reecy, Christine G. Elsik, Tanya Z. Berardini, Nathan Dunn, Fiona M. McCarthy, Monica F. Poelchau, Liya Wang, Elizabeth Arnaud, Clement Jonquet, Jodi L. Humann, Nic Herndon, Doreen Ware, Deepak Unni, Emily S. Grau, Leonore Reiser, Clayton Birkett, Doreen Main, Taner Z. Sen, Zhi-Liang Hu, Jing Yu, Gerard R. Lazo, Jason Williams, Carissa A. Park, Bradford Condon, Lacey-Anne Sanderson, Lisa C. Harper, Naama Menda, Andrew Farmer, Sushma Naithani, Dept Hort & Landscape Architecture, Washington State University (WSU), Georgetown University [Washington] (GU), Department of Botany and Plant Pathology, Oregon State University (OSU), University of Missouri [Columbia] (Mizzou), University of Missouri System, Energy and Sustainability Research Division, University of Nottingham, UK (UON), Department of Animal Science and Center for Integrated Animal Genomics, Iowa State University (ISU), Plastes et différenciation cellulaire (PDC), Université Joseph Fourier - Grenoble 1 (UJF)-Centre National de la Recherche Scientifique (CNRS), WEB-CUBE, Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM), Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM), Stanford School of Medicine [Stanford], Stanford Medicine, Stanford University-Stanford University, German Centre for Integrative Biodiversity Research, Diversité, adaptation, développement des plantes (UMR DIADE), Institut de Recherche pour le Développement (IRD [France-Sud])-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad), Lawrence Berkeley National Laboratory [Berkeley] (LBNL), Department of Animal Science, Department of Entomology and Plant Pathology, The University of Tennessee [Knoxville], Department of Anatomy, Cold Spring Harbor Laboratory, Novartis Institutes for BioMedical Research (NIBR), WEB Architecture x Semantic WEB x WEB of Data (WEB3), Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS), and Centre de Coopération Internationale en Recherche Agronomique pour le Développement (Cirad)-Université de Montpellier (UM)-Centre National de la Recherche Scientifique (CNRS)-Institut de Recherche pour le Développement (IRD [France-Sud])
- Subjects
[SDV.SA]Life Sciences [q-bio]/Agricultural sciences ,0301 basic medicine ,Computer science ,Best practice ,Data management ,Interoperability ,Biological database ,Breeding ,computer.software_genre ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Databases ,0302 clinical medicine ,Genetic ,Library and Information Studies ,Surveys and Questionnaires ,Databases, Genetic ,2. Zero hunger ,Biological data ,Metadata ,[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB] ,Data curation ,Database ,business.industry ,[INFO.INFO-WB]Computer Science [cs]/Web ,Agriculture ,Genomics ,Data Format ,[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing ,030104 developmental biology ,Data access ,Gene Ontology ,[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR] ,Zero Hunger ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,General Agricultural and Biological Sciences ,business ,computer ,030217 neurology & neurosurgery ,Information Systems - Abstract
© The Author(s) 2018. Published by Oxford University Press. The future of agricultural research depends on data. The sheer volume of agricultural biological data being produced today makes excellent data management essential. Governmental agencies, publishers and science funders require data management plans for publicly funded research. Furthermore, the value of data increases exponentially when they are properly stored, described, integrated and shared, so that they can be easily utilized in future analyses. AgBioData (https://www.agbiodata.org) is a consortium of people working at agricultural biological databases, data archives and knowledgbases who strive to identify common issues in database development, curation and management, with the goal of creating database products that are more Findable, Accessible, Interoperable and Reusable. We strive to promote authentic, detailed, accurate and explicit communication between all parties involved in scientific data. As a step toward this goal, we present the current state of biocuration, ontologies, metadata and persistence, database platforms, programmatic (machine) access to data, communication and sustainability with regard to data curation. Each section describes challenges and opportunities for these topics, along with recommendations and best practices.
- Published
- 2018
- Full Text
- View/download PDF
29. Gramene 2018: unifying comparative genomics and pathway resources for plant research
- Author
-
Lincoln D. Stein, Pankaj Jaiswal, Dan Bolser, Parul Gupta, Guy Naamati, Sunita Kumari, Crispin B. Taylor, Kapeel Chougule, Antonio Fabregat, Nuno A. Fonseca, Alfonso Muñoz-Pomer Fuentes, Joseph Mulvaney, James Thomason, Noor Al-Bader, Yinping Jiao, Robert Petryszak, Electra Tapanari, Y. Amy Tang, Paul J. Kersey, Justin Preece, Matthew Geniza, Haider Iqbal, Patti Lockhart, Bo Wang, Young Koung Lee, Doreen Ware, Sushma Naithani, Peter D'Eustachio, Laura Huerta, Vivek Kumar, Sharon Wei, Justin Elser, Marcela K. Tello-Ruiz, Irene Papatheodorou, Andrew Olson, Maria Keays, Joel Weiser, Michael S. Campbell, and Joshua C. Stein
- Subjects
0301 basic medicine ,Genetic Research ,Knowledge Bases ,Genomics ,Genome browser ,Computational biology ,Paralogous Gene ,Biology ,Bioinformatics ,Genome ,Epigenesis, Genetic ,03 medical and health sciences ,User-Computer Interface ,Gene Expression Regulation, Plant ,Databases, Genetic ,Genetics ,Ensembl ,Database Issue ,Synteny ,2. Zero hunger ,Comparative genomics ,Genetic Variation ,Molecular Sequence Annotation ,Gene Annotation ,15. Life on land ,Plants ,030104 developmental biology ,Gene Ontology ,Genome, Plant ,Metabolic Networks and Pathways ,Software - Abstract
Gramene (http://www.gramene.org) is a knowledgebase for comparative functional analysis in major crops and model plant species. The current release, #54, includes over 1.7 million genes from 44 reference genomes, most of which were organized into 62,367 gene families through orthologous and paralogous gene classification, whole-genome alignments, and synteny. Additional gene annotations include ontology-based protein structure and function; genetic, epigenetic, and phenotypic diversity; and pathway associations. Gramene's Plant Reactome provides a knowledgebase of cellular-level plant pathway networks. Specifically, it uses curated rice reference pathways to derive pathway projections for an additional 66 species based on gene orthology, and facilitates display of gene expression, gene–gene interactions, and user-defined omics data in the context of these pathways. As a community portal, Gramene integrates best-of-class software and infrastructure components including the Ensembl genome browser, Reactome pathway browser, and Expression Atlas widgets, and undergoes periodic data and software upgrades. Via powerful, intuitive search interfaces, users can easily query across various portals and interactively analyze search results by clicking on diverse features such as genomic context, highly augmented gene trees, gene expression anatomograms, associated pathways, and external informatics resources. All data in Gramene are accessible through both visual and programmatic interfaces.
- Published
- 2017
30. Gramene: A Resource for Comparative Analysis of Plants Genomes and Pathways
- Author
-
Ken Youens-Clark, Pankaj Jaiswal, Marcela K. Tello-Ruiz, Sharon Wei, Joshua C. Stein, and Doreen Ware
- Subjects
0301 basic medicine ,Comparative genomics ,Phylogenetic tree ,Genome browser ,Computational biology ,Biology ,Genome ,Structural variation ,03 medical and health sciences ,Upload ,030104 developmental biology ,Phylogenetics ,Botany ,Synteny - Abstract
Gramene is an integrated informatics resource for accessing, visualizing, and comparing plant genomes and biological pathways. Originally targeting grasses, Gramene has grown to host annotations for economically important and research model crops, including wheat, potato, tomato, banana, grape, poplar, and Chlamydomonas. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. This chapter outlines system requirements for end users and database hosting, data types and basic navigation within Gramene, and provides examples of how to (1) view a phylogenetic tree for a family of transcription factors, (2) explore genetic variation in the orthologues of a gene with a known trait association, and (3) upload, visualize, and privately share end user data into a new genome browser track.Moreover, this is the first publication describing Gramene's new web interface-intended to provide a simplified portal to the most complete and up-to-date set of plant genome and pathway annotations.
- Published
- 2016
- Full Text
- View/download PDF
31. Gramene 2016: comparative plant genomics and pathway resources
- Author
-
Sharon Wei, Justin Preece, Andrew Olson, Y. Amy Tang, James Thomason, Bo Wang, Palitha Dharmawardhana, Paul J. Kersey, Dan Bolser, Lincoln D. Stein, Helen Parkinson, Joseph Mulvaney, Vindhya Amarasinghe, Sheldon J. McKay, Robert Petryszak, Yinping Jiao, Pankaj Jaiswal, Brandon Walts, Maria Keays, Sushma Naithani, Antonio Fabregat, Joshua C. Stein, Justin Elser, Nuno A. Fonseca, Joel Weiser, Marcela K. Tello-Ruiz, Doreen Ware, Sunita Kumari, Kapeel Chougule, Peter D'Eustachio, Arnaud Kerhornou, and Laura Huerta
- Subjects
0301 basic medicine ,Gene Expression ,Genomics ,Computational biology ,Biology ,Bioinformatics ,Genome ,03 medical and health sciences ,Annotation ,Databases, Genetic ,Genetics ,Ensembl ,Database Issue ,2. Zero hunger ,Comparative genomics ,Internet ,food and beverages ,Genetic Variation ,Molecular Sequence Annotation ,15. Life on land ,Plants ,030104 developmental biology ,Functional genomics ,Plant genomics ,Genome, Plant ,Metabolic Networks and Pathways - Abstract
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼ 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.
- Published
- 2015
32. Strong Association of Socioeconomic Status and Genetic Ancestry in Latinos
- Author
-
Carlos A. Aguilar-Salinas, G. Bedoya, Desmond Campbell, Jose C. Florez, Marcela K. Tello-Ruiz, Liliana Franco, Andres Ruiz-Linares, María Victoria Parra, Laura Riba, Fuli Yu, Alberto Villegas, Richa Saxena, M. RodrÃguez-Torres, Alkes L. Price, Constanza Duque, M. T. Tusié-Luna, Natalia Gallego, and David Reich
- Subjects
Genetic genealogy ,Biology ,Association (psychology) ,Socioeconomic status ,Demography - Published
- 2014
- Full Text
- View/download PDF
33. Linkage disequilibrium mapping of the replicated type 2 diabetes linkage signal on chromosome 1q
- Author
-
Leslie J. Baier, Swapan K Das, Clifton Bogardus, Mark I. McCarthy, Alan R. Shuldiner, Cheng Hu, Braxton D. Mitchell, Panos Deloukas, M Vaxillaire, Lincoln Stein, Eleftheria Zeggini, Juliana C.N. Chan, Steven C. Elbein, Philippe Froguel, N. William Rayner, Robert L. Hanson, Colin N. A. Palmer, Congrong Wang, Maggie C.Y. Ng, Mao Fu, Timothy M. Frayling, Inga Prokopenko, Rhian Gwilliam, Benjamin F. Voight, Laura J. Scott, Christopher J. Groves, Andrew T. Hattersley, Pelin Akan, Andrew D. Morris, Marcela K. Tello-Ruiz, Weiping Jia, Katherine S. Elliott, and Medical Research Council (MRC)
- Subjects
Linkage disequilibrium ,SAMPLES ,Endocrinology, Diabetes and Metabolism ,LOCI ,Genome-wide association study ,SUSCEPTIBILITY ,VARIANTS ,Linkage Disequilibrium ,0302 clinical medicine ,Gene Frequency ,Reference Values ,Ethnicity ,International Type 2 Diabetes 1q Consortium ,Association mapping ,POPULATION ,RISK ,Genetics ,0303 health sciences ,GK RAT ,Linkage Disequilibrium Mapping ,NUCLEAR FACTOR-4-ALPHA GENE ,Chromosome Mapping ,11 Medical And Health Sciences ,Tag SNP ,DNA-Binding Proteins ,Chromosomes, Human, Pair 1 ,DISEASES ,Original Article ,Life Sciences & Biomedicine ,ENDOCRINOLOGY & METABOLISM ,European Continental Ancestry Group ,Ethnic Groups ,030209 endocrinology & metabolism ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Risk Assessment ,White People ,03 medical and health sciences ,Genetic linkage ,Internal Medicine ,Humans ,GENOME-WIDE ASSOCIATION ,030304 developmental biology ,Adaptor Proteins, Signal Transducing ,Science & Technology ,Genetic Variation ,Histone-Lysine N-Methyltransferase ,Diabetes Mellitus, Type 2 ,Imputation (genetics) ,Transcription Factors - Abstract
OBJECTIVE Linkage of the chromosome 1q21–25 region to type 2 diabetes has been demonstrated in multiple ethnic groups. We performed common variant fine-mapping across a 23-Mb interval in a multiethnic sample to search for variants responsible for this linkage signal. RESEARCH DESIGN AND METHODS In all, 5,290 single nucleotide polymorphisms (SNPs) were successfully genotyped in 3,179 type 2 diabetes case and control subjects from eight populations with evidence of 1q linkage. Samples were ascertained using strategies designed to enhance power to detect variants causal for 1q linkage. After imputation, we estimate ∼80% coverage of common variation across the region (r 2 > 0.8, Europeans). Association signals of interest were evaluated through in silico replication and de novo genotyping in ∼8,500 case subjects and 12,400 control subjects. RESULTS Association mapping of the 23-Mb region identified two strong signals, both of which were restricted to the subset of European-descent samples. The first mapped to the NOS1AP (CAPON) gene region (lead SNP: rs7538490, odds ratio 1.38 [95% CI 1.21–1.57], P = 1.4 × 10−6, in 999 case subjects and 1,190 control subjects); the second mapped within an extensive region of linkage disequilibrium that includes the ASH1L and PKLR genes (lead SNP: rs11264371, odds ratio 1.48 [1.18–1.76], P = 1.0 × 10−5, under a dominant model). However, there was no evidence for association at either signal on replication, and, across all data (>24,000 subjects), there was no indication that these variants were causally related to type 2 diabetes status. CONCLUSIONS Detailed fine-mapping of the 23-Mb region of replicated linkage has failed to identify common variant signals contributing to the observed signal. Future studies should focus on identification of causal alleles of lower frequency and higher penetrance.
- Published
- 2009
- Full Text
- View/download PDF
34. Strong association of socioeconomic status with genetic ancestry in Latinos: implications for admixture studies of type 2 diabetes
- Author
-
Liliana Franco, David Reich, Andres Ruiz-Linares, Constanza Duque, Marcela K. Tello-Ruiz, Jose C. Florez, Maribel Rodríguez-Torres, Fuli Yu, Laura Riba, Natalia Gallego, Carlos A. Aguilar-Salinas, Gabriel Bedoya, Alkes L. Price, María Victoria Parra, Ma. Teresa Tusié-Luna, Alberto Villegas, Richa Saxena, and Desmond Campbell
- Subjects
Gerontology ,Diabetes risk ,Endocrinology, Diabetes and Metabolism ,Genetic genealogy ,Population ,Ethnic group ,Genetic admixture ,Type 2 diabetes ,Colombia ,White People ,Article ,Internal Medicine ,medicine ,Humans ,education ,Socioeconomic status ,Mexico ,Genetic association ,education.field_of_study ,business.industry ,Racial Groups ,Hispanic or Latino ,medicine.disease ,United States ,Diabetes Mellitus, Type 2 ,Socioeconomic Factors ,business ,Demography - Abstract
Type 2 diabetes is more prevalent in US American minority populations of African or Native American descent than it is in European Americans. However, the proportion of this epidemiological difference that can be ascribed to genetic or environmental factors is unknown. To determine whether genetic ancestry is correlated with diabetes risk in Latinos, we estimated the proportion of European ancestry in case-control samples from Mexico and Colombia in whom socioeconomic status had been carefully ascertained.We genotyped 67 ancestry-informative markers in 499 participants with type 2 diabetes and 197 controls from Medellín (Colombia), as well as in 163 participants with type 2 diabetes and 72 controls from central Mexico. Each participant was assigned a socioeconomic status scale via various measures.Although European ancestry was associated with lower diabetes risk in Mexicans (OR [95% CI] 0.06 [0.02-0.21], p = 2.0 x 10(-5)) and Colombians (OR 0.26 [0.08-0.78], p = 0.02), adjustment for socioeconomic status eliminated the association in the Colombian sample (OR 0.64 [0.19-2.12], p = 0.46) and significantly attenuated it in the Mexican sample (OR 0.17 [0.04-0.71], p = 0.02). Adjustment for BMI did not change the results.The proportion of non-European ancestry is associated with both type 2 diabetes and lower socioeconomic status in admixed Latino populations from North and South America. We conclude that ancestry-directed search for genetic markers associated with type 2 diabetes in Latinos may benefit from information involving social factors, as these factors have a quantitatively important effect on type 2 diabetes risk relative to ancestry effects.
- Published
- 2009
35. Gastroenterologic and Hepatic Diseases
- Author
-
Marcela K. Tello-Ruiz, Emily C. Walsh, and John D. Rioux
- Subjects
education.field_of_study ,Population ,Prevalence ,Autoimmune hepatitis ,Human leukocyte antigen ,Disease ,Biology ,medicine.disease ,Ulcerative colitis ,Inflammatory bowel disease ,digestive system diseases ,Immunology ,medicine ,Etiology ,education - Abstract
Celiac disease, autoimmune hepatitis (AIH), and the inflammatory bowel diseases (IBDs), Crohn’s disease and ulcerative colitis (UC), are chronic inflammatory diseases of unknown etiology. They are considered complex genetic diseases because both inherited and environmental influences appear to be important in determining risk Complex genetic diseases are more common than Mendelian diseases in the population. Prevalence ranges for AIH, IBD and celiac disease are given in Table 1. Generally, accepted average prevalences are 1 in 1,000 persons for Crohn’s disease and UC, and 3 in 1,000 for celiac disease. However, the prevalence rate of Crohn’s, UC and celiac disease are much lower in some populations (e.g., 1.25 in 100,000 for Crohn’s disease in Hong Kong1). The average prevalence of AIH is 10-fold lower at 1 in 10,000.
- Published
- 2007
- Full Text
- View/download PDF
36. WormBase: a multi-species resource for nematode biology and genomics
- Author
-
Raymond Lee, Lincoln Stein, Chao-Kung Chen, Aniko Sabo, John Spieth, Todd W. Harris, Carol Bastiani, Wen J. Chen, Marcela K. Tello-Ruiz, Hans-Michael Müller, Philip Ozersky, Richard Durbin, Eimear E. Kenny, Igor Antoshechkin, Qinghua Wang, Tamberlyn Bieri, Juancarlos Chan, Keith Bradnam, Andrei Petcherski, Darin Blasiar, Paul H. Davis, Anthony Rogers, Fiona Cunningham, Ranjana Kishore, Paul W. Sternberg, Cecilia Nakamura, Daniel Lawson, Nansheng Chen, Kimberly Van Auken, and Erich M. Schwarz
- Subjects
Comparative genomics ,Caenorhabditis briggsae ,Genetics ,Internet ,biology ,WormBook ,Computational Biology ,Information Storage and Retrieval ,Context (language use) ,Genomics ,Computational biology ,Articles ,biology.organism_classification ,User-Computer Interface ,Databases, Genetic ,Caenorhabditis ,Animals ,WormBase ,Caenorhabditis elegans ,Caltech Library Services ,Synteny - Abstract
WormBase (http://www.wormbase.org/) is the central data repository for information about Caenorhabditis elegans and related nematodes. As a model organism database, WormBase extends beyond the genomic sequence, integrating experimental results with extensively annotated views of the genome. The WormBase Consortium continues to expand the biological scope and utility of WormBase with the inclusion of large-scale genomic analyses, through active data and literature curation, through new analysis and visualization tools, and through refinement of the user interface. Over the past year, the nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species. Extensive site-wide refinement of the user interface now provides quick access to the most frequently accessed resources and a consistent browsing experience across the site. Unified single-page views now provide complete summaries of commonly accessed entries like genes. These advances continue to increase the utility of WormBase for C.elegans researchers, as well as for those researchers exploring problems in functional and comparative genomics in the context of a powerful genetic system.
- Published
- 2004
37. A Genomewide Admixture Map for Latino Populations
- Author
-
Laura Riba, Alberto Villegas, Carlos A. Aguilar-Salinas, Gavin J. McDonald, Arti Tandon, Andres Ruiz-Linares, Alicja Waliszewska, Samuel Canizales-Quinteros, Gabriel Bedoya, Christopher A. Haiman, Cheryl A. Winkler, Francisco M. Salzano, Carla Gallo, Marcela K. Tello-Ruiz, Christine Schirmer, Nick Patterson, Fuli Yu, Julie Neubauer, David Cox, Maria Cátira Bortolini, Constanza Duque, Guido Mazzotti, Brian E. Henderson, Alkes L. Price, Teresa Tusié-Luna, Marta Menjivar, David Reich, and William Klitz
- Subjects
Linkage disequilibrium ,Cromosomas Humanos ,0302 clinical medicine ,Databases, Genetic ,Chromosomes, Human ,Genetics(clinical) ,Genetics (clinical) ,African Continental Ancestry Group ,Genetics ,0303 health sciences ,education.field_of_study ,medicine.diagnostic_test ,Estudios de Casos y Controles ,Chromosome Mapping ,Hispanic or Latino ,Marcadores Genéticos ,Grupo de Ascendencia Continental Africana ,Alelos ,Genetic Markers ,Population ,Genetic admixture ,Black People ,Predisposición Genética a la Enfermedad ,Locus (genetics) ,Biology ,Article ,White People ,03 medical and health sciences ,Gene mapping ,medicine ,Humans ,Computer Simulation ,Genetic Predisposition to Disease ,Genetic Testing ,Genética de Población ,education ,Alleles ,030304 developmental biology ,Genetic testing ,Genetic association ,Bases de Datos Genéticas ,Genome, Human ,Reproducibility of Results ,Mapeo Cromosómico ,Indios Norteamericanos ,Genetics, Population ,Genetic marker ,Case-Control Studies ,Indians, North American ,030217 neurology & neurosurgery - Abstract
Admixture mapping is an economical and powerful approach for localizing disease genes in populations of recently mixed ancestry and has proven successful in African Americans. The method holds equal promise for Latinos, who typically inherit a mix of European, Native American, and African ancestry. However, admixture mapping in Latinos has not been practical because of the lack of a map of ancestry-informative markers validated in Native American and other populations. To address this, we screened multiple databases, containing millions of markers, to identify 4,186 markers that were putatively informative for determining the ancestry of chromosomal segments in Latino populations. We experimentally validated each of these markers in at least 232 new Latino, European, Native American, and African samples, and we selected a subset of 1,649 markers to form an admixture map. An advantage of our strategy is that we focused our map on markers distinguishing Native American from other ancestries and restricted it to markers with very similar frequencies in Europeans and Africans, which decreased the number of markers needed and minimized the possibility of false disease associations. We evaluated the effectiveness of our map for localizing disease genes in four Latino populations from both North and South America. COL0006723
- Full Text
- View/download PDF
38. Ensembl Genomes 2022: an expanding genome resource for non-vertebrates
- Author
-
Andrew D. Yates, Michael Paulini, Anne Parker, Kim E. Hammond-Kosack, Marcela K. Tello-Ruiz, Justin Elser, Jyothish Bhai, Manuel Luypaert, Faye H. Rodgers, Stavros Diamantakis, James Seager, Pankaj Jaiswal, Robert D. Finn, Thomas Maurel, Michal Szpak, Marc Rosello, James E. Allen, Parul Gupta, Matthieu Barba, Lahcen I. Campbell, Marc Chakiachvili, Vivek Kumar, Manuel Carbajo Martinez, Benjamin Moore, Nishadi De Silva, John Tate, Ridwan M Amode, Andrew Olson, Astrid Gall, Magali Ruffier, Paul Davis, Mikkel B. Christensen, Mark Quinton-Tulloch, Alayne Cuzick, Kevin L. Howe, Martin Urban, Justin Preece, Nick Langridge, Sunita Kumari, Kapeel Chougule, Emily Perry, Tuan Le, Gareth Maslen, Sharon Wei, Dionysios Grigoriadis, Carla Valeria Filippi, Andrea Winterbottom, Doreen Ware, Aleena Mushtaq, Vinay Kaikala, Andrey G Azov, Helder Pedro, Luca Da Rin Fioretto, Gary Williams, Magdalena Zarowiecki, Cristina Guijarro-Clarke, Guy Naamati, Sushma Naithani, Bruno Contreras-Moreira, Sarah Dyer, Andrés Becerra, Paul Flicek, Matthieu Muffato, Vasily Sitnik, and Stephen J. Trevanion
- Subjects
Whole genome sequencing ,Internet ,AcademicSubjects/SCI00010 ,Computational Biology ,Genomics ,Genome project ,Gene Annotation ,Computational biology ,Plants ,Biology ,Genome ,Annotation ,Resource (project management) ,ComputingMethodologies_PATTERNRECOGNITION ,Ensembl Genomes ,Databases, Genetic ,Vertebrates ,Genetics ,Database Issue ,Animals ,Ensembl ,Genome, Fungal ,Genome, Bacterial ,Genome, Plant ,Software - Abstract
Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-vertebrate genomes and analysis complementing vertebrate resources developed by the Ensembl project (https://www.ensembl.org). The two resources collectively present genome annotation through a consistent set of interfaces spanning the tree of life presenting genome sequence, annotation, variation, transcriptomic data and comparative analysis. Here, we present our largest increase in plant, metazoan and fungal genomes since the project's inception creating one of the world's most comprehensive genomic resources and describe our efforts to reduce genome redundancy in our Bacteria portal. We detail our new efforts in gene annotation, our emerging support for pangenome analysis, our efforts to accelerate data dissemination through the Ensembl Rapid Release resource and our new AlphaFold visualization. Finally, we present details of our future plans including updates on our integration with Ensembl, and how we plan to improve our support for the microbial research community. Software and data are made available without restriction via our website, online tools platform and programmatic interfaces (available under an Apache 2.0 license). Data updates are synchronised with Ensembl's release cycle.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.