8 results on '"Cornel Ghiban"'
Search Results
2. Management, Analyses, and Distribution of the MaizeCODE Data on the Cloud
- Author
-
Liya Wang, Zhenyuan Lu, Melissa delaBastide, Peter Van Buren, Xiaofei Wang, Cornel Ghiban, Michael Regulski, Jorg Drenkow, Xiaosa Xu, Carlos Ortiz-Ramirez, Cristina F. Marco, Sara Goodwin, Alexander Dobin, Kenneth D. Birnbaum, David P. Jackson, Robert A. Martienssen, William R. McCombie, David A. Micklos, Michael C. Schatz, Doreen H. Ware, and Thomas R. Gingeras
- Subjects
bioinformatics ,functional annotations ,cloud computing ,ENCODE ,workflows ,Plant culture ,SB1-1110 - Abstract
MaizeCODE is a project aimed at identifying and analyzing functional elements in the maize genome. In its initial phase, MaizeCODE assayed up to five tissues from four maize strains (B73, NC350, W22, TIL11) by RNA-Seq, Chip-Seq, RAMPAGE, and small RNA sequencing. To facilitate reproducible science and provide both human and machine access to the MaizeCODE data, we enhanced SciApps, a cloud-based portal, for analysis and distribution of both raw data and analysis results. Based on the SciApps workflow platform, we generated new components to support the complete cycle of MaizeCODE data management. These include publicly accessible scientific workflows for the reproducible and shareable analysis of various functional data, a RESTful API for batch processing and distribution of data and metadata, a searchable data page that lists each MaizeCODE experiment as a reproducible workflow, and integrated JBrowse genome browser tracks linked with workflows and metadata. The SciApps portal is a flexible platform that allows the integration of new analysis tools, workflows, and genomic data from multiple projects. Through metadata and a ready-to-compute cloud-based platform, the portal experience improves access to the MaizeCODE data and facilitates its analysis.
- Published
- 2020
- Full Text
- View/download PDF
3. Double triage to identify poorly annotated genes in maize: The missing link in community curation.
- Author
-
Marcela K Tello-Ruiz, Cristina F Marco, Fei-Man Hsu, Rajdeep S Khangura, Pengfei Qiao, Sirjan Sapkota, Michelle C Stitzer, Rachael Wasikowski, Hao Wu, Junpeng Zhan, Kapeel Chougule, Lindsay C Barone, Cornel Ghiban, Demitri Muna, Andrew C Olson, Liya Wang, Doreen Ware, and David A Micklos
- Subjects
Medicine ,Science - Abstract
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the maize genome may suggest that manual curation of gene models is no longer necessary. However, quality metrics generated by the MAKER-P gene annotation pipeline identified 17,225 of 130,330 (13%) protein-coding transcripts in the B73 Reference Genome V4 gene set with models of low concordance to available biological evidence. Working with eight graduate students, we used the Apollo annotation editor to curate 86 transcript models flagged by quality metrics and a complimentary method using the Gramene gene tree visualizer. All of the triaged models had significant errors-including missing or extra exons, non-canonical splice sites, and incorrect UTRs. A correct transcript model existed for about 60% of genes (or transcripts) flagged by quality metrics; we attribute this to the convention of elevating the transcript with the longest coding sequence (CDS) to the canonical, or first, position. The remaining 40% of flagged genes resulted in novel annotations and represent a manual curation space of about 10% of the maize genome (~4,000 protein-coding genes). MAKER-P metrics have a specificity of 100%, and a sensitivity of 85%; the gene tree visualizer has a specificity of 100%. Together with the Apollo graphical editor, our double triage provides an infrastructure to support the community curation of eukaryotic genomes by scientists, students, and potentially even citizen scientists.
- Published
- 2019
- Full Text
- View/download PDF
4. DNA barcoding Brooklyn (New York): A first assessment of biodiversity in Marine Park by citizen scientists.
- Author
-
Christine Marizzi, Antonia Florio, Melissa Lee, Mohammed Khalfan, Cornel Ghiban, Bruce Nash, Jenna Dorey, Sean McKenzie, Christine Mazza, Fabiana Cellini, Carlo Baria, Ron Bepat, Lena Cosentino, Alexander Dvorak, Amina Gacevic, Cristina Guzman-Moumtzis, Francesca Heller, Nicholas Alexander Holt, Jeffrey Horenstein, Vincent Joralemon, Manveer Kaur, Tanveer Kaur, Armani Khan, Jessica Kuppan, Scott Laverty, Camila Lock, Marianne Pena, Ilona Petrychyn, Indu Puthenkalam, Daval Ram, Arlene Ramos, Noelle Scoca, Rachel Sin, Izabel Gonzalez, Akansha Thakur, Husan Usmanov, Karen Han, Andy Wu, Tiger Zhu, and David Andrew Micklos
- Subjects
Medicine ,Science - Abstract
DNA barcoding is both an important research and science education tool. The technique allows for quick and accurate species identification using only minimal amounts of tissue samples taken from any organism at any developmental phase. DNA barcoding has many practical applications including furthering the study of taxonomy and monitoring biodiversity. In addition to these uses, DNA barcoding is a powerful tool to empower, engage, and educate students in the scientific method while conducting productive and creative research. The study presented here provides the first assessment of Marine Park (Brooklyn, New York, USA) biodiversity using DNA barcoding. New York City citizen scientists (high school students and their teachers) were trained to identify species using DNA barcoding during a two-week long institute. By performing NCBI GenBank BLAST searches, students taxonomically identified 187 samples (1 fungus, 70 animals and 116 plants) and also published 12 novel DNA barcodes on GenBank. Students also identified 7 ant species and demonstrated the potential of DNA barcoding for identification of this especially diverse group when coupled with traditional taxonomy using morphology. Here we outline how DNA barcoding allows citizen scientists to make preliminary taxonomic identifications and contribute to modern biodiversity research.
- Published
- 2018
- Full Text
- View/download PDF
5. DNA Subway: Making Genome Analysis Egalitarian.
- Author
-
Uwe Hilgert, Sheldon J. McKay, Mohammed Khalfan, Jason Williams 0003, Cornel Ghiban, and David Micklos
- Published
- 2014
- Full Text
- View/download PDF
6. Double triage to identify poorly annotated genes in maize: The missing link in community curation
- Author
-
Cornel Ghiban, Cristina F. Marco, Junpeng Zhan, Marcela K. Tello-Ruiz, Demitri Muna, Fei-Man Hsu, Michelle C. Stitzer, Rachael Wasikowski, Pengfei Qiao, Lindsay Barone, Doreen Ware, Sirjan Sapkota, Liya Wang, David A. Micklos, Rajdeep S. Khangura, Andrew Olson, Kapeel Chougule, and Hao Wu
- Subjects
0106 biological sciences ,Critical Care and Emergency Medicine ,Plant Science ,Plant Genetics ,01 natural sciences ,Genome ,Database and Informatics Methods ,Exon ,Databases, Genetic ,Invertebrate Genomics ,Plant Genomics ,Medicine and Health Sciences ,Coding region ,Data Curation ,Plant Proteins ,Data Management ,2. Zero hunger ,0303 health sciences ,Multidisciplinary ,Eukaryota ,Phylogenetic Analysis ,Genome project ,Genomics ,Plants ,Functional Genomics ,Phylogenetics ,Molecular Sequence Annotation ,Experimental Organism Systems ,Engineering and Technology ,Medicine ,Sequence Analysis ,Algorithms ,Research Article ,Biotechnology ,Computer and Information Sciences ,Bioinformatics ,Gene prediction ,Science ,Bioengineering ,Computational biology ,Biology ,Research and Analysis Methods ,Zea mays ,03 medical and health sciences ,Annotation ,Model Organisms ,Plant and Algal Models ,Genetics ,Humans ,Evolutionary Systematics ,Education, Graduate ,Grasses ,Students ,Gene ,Taxonomy ,030304 developmental biology ,Evolutionary Biology ,Models, Genetic ,Organisms ,Biology and Life Sciences ,Computational Biology ,Gene Annotation ,Genome Analysis ,Genome Annotation ,Maize ,Animal Genomics ,Animal Studies ,Plant Biotechnology ,Triage ,Sequence Alignment ,010606 plant biology & botany ,Reference genome - Abstract
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the maize genome may suggest that manual curation of gene models is no longer necessary. However, quality metrics generated by the MAKER-P gene annotation pipeline identified 17,225 of 130,330 (13%) protein-coding transcripts in the B73 Reference Genome V4 gene set with models of low concordance to available biological evidence. Working with eight graduate students, we used the Apollo annotation editor to curate 86 transcript models flagged by quality metrics and a complimentary method using the Gramene gene tree visualizer. All of the triaged models had significant errors – including missing or extra exons, non-canonical splice sites, and incorrect UTRs. A correct transcript model existed for about 60% of genes (or transcripts) flagged by quality metrics; we attribute this to the convention of elevating the transcript with the longest coding sequence (CDS) to the canonical, or first, position. The remaining 40% of flagged genes resulted in novel annotations and represent a manual curation space of about 10% of the maize genome (~4,000 protein-coding genes). MAKER-P metrics have a specificity of 100%, and a sensitivity of 85%; the gene tree visualizer has a specificity of 100%. Together with the Apollo graphical editor, our double triage provides an infrastructure to support the community curation of eukaryotic genomes by scientists, students, and potentially even citizen scientists.
- Published
- 2019
7. DNA barcoding Brooklyn (New York): A first assessment of biodiversity in Marine Park by citizen scientists
- Author
-
Ilona Petrychyn, Christine Mazza, Cristina Guzman-Moumtzis, Daval Ram, Camila Lock, Ron Bepat, Indu Puthenkalam, Manveer Kaur, Husan Usmanov, Fabiana Cellini, Izabel Gonzalez, Andy Wu, Marianne Pena, Jeffrey Horenstein, Christine Marizzi, Tiger Zhu, Amina Gacevic, Arlene Ramos, Sean K. McKenzie, Melissa Lee, Akansha Thakur, Carlo Baria, Armani Khan, Jessica Kuppan, David A. Micklos, Alexander Dvorak, Tanveer Kaur, Antonia M. Florio, Scott Laverty, Vincent Joralemon, Jenna E. Dorey, Mohammed Khalfan, Cornel Ghiban, Nicholas Alexander Holt, Bruce Nash, Karen Han, Lena Cosentino, Rachel Sin, Francesca Heller, and Noelle Scoca
- Subjects
0301 basic medicine ,Molecular biology ,Biodiversity ,Invasive Species ,Social Sciences ,lcsh:Medicine ,Evolutionary biology ,Plant Science ,DNA barcoding ,Science education ,Sociology ,Citizen science ,Leukocytes ,lcsh:Science ,Organism ,Molecular systematics ,Multidisciplinary ,Computer and information sciences ,Schools ,4. Education ,Database and informatics methods ,Sequence analysis ,Academies and Institutes ,Eukaryota ,Plants ,Insects ,Geography ,Dna barcodes ,GenBank ,Marine park ,Databases, Nucleic Acid ,Research Article ,Arthropoda ,Bioinformatics ,Evolutionary systematics ,education ,Data management ,Education ,03 medical and health sciences ,Species Colonization ,Animals ,DNA Barcoding, Taxonomic ,14. Life underwater ,Students ,DNA sequence analysis ,Taxonomy ,Biology and life sciences ,Ants ,Diagnostic Tests, Routine ,Ecology and Environmental Sciences ,lcsh:R ,Organisms ,Plant Taxonomy ,DNA ,Data science ,Invertebrates ,Hymenoptera ,Research and analysis methods ,030104 developmental biology ,Molecular biology techniques ,Animal Taxonomy ,New York City ,lcsh:Q ,Zoology - Abstract
DNA barcoding is both an important research and science education tool. The technique allows for quick and accurate species identification using only minimal amounts of tissue samples taken from any organism at any developmental phase. DNA barcoding has many practical applications including furthering the study of taxonomy and monitoring biodiversity. In addition to these uses, DNA barcoding is a powerful tool to empower, engage, and educate students in the scientific method while conducting productive and creative research. The study presented here provides the first assessment of Marine Park (Brooklyn, New York, USA) biodiversity using DNA barcoding. New York City citizen scientists (high school students and their teachers) were trained to identify species using DNA barcoding during a two–week long institute. By performing NCBI GenBank BLAST searches, students taxonomically identified 187 samples (1 fungus, 70 animals and 116 plants) and also published 12 novel DNA barcodes on GenBank. Students also identified 7 ant species and demonstrated the potential of DNA barcoding for identification of this especially diverse group when coupled with traditional taxonomy using morphology. Here we outline how DNA barcoding allows citizen scientists to make preliminary taxonomic identifications and contribute to modern biodiversity research.
- Published
- 2018
8. A tool to evaluate correspondence between extraction ion chromatographic peaks and peptide-spectrum matches in shotgun proteomics experiments
- Author
-
Samantha Peacock, Peter Leopold, Keith Rivera, Cristian I. Ruse, Cornel Ghiban, and Darryl J. Pappin
- Subjects
chemistry.chemical_classification ,Proteomics ,Chromatography ,Models, Statistical ,Peptide ,Tandem mass spectrometry ,Orbitrap ,Biochemistry ,Article ,Ion ,law.invention ,HEK293 Cells ,chemistry ,law ,Tandem Mass Spectrometry ,Humans ,Database search engine ,Quadrupole ion trap ,Shotgun proteomics ,Databases, Protein ,Peptides ,Molecular Biology ,Software - Abstract
Chromatographed peptide signals form the basis of further data processing that eventually results in functional information derived from data-dependent bottom-up proteomics assays. We seek to rank LC/MS parent ions by the quality of their extracted ion chromatograms (ExICs.) Ranked extracted ion chromatograms (RExICs) act as an intuitive physical/chemical preselection filter to improve the quality of MS/MS fragment scans submitted for database search. We identify more than 4,900 proteins when considering detector shifts of less than 7ppm. High quality parent ions for which the database search yields no hits become candidates for subsequent unrestricted analysis for post-translational modifications. Following this rational approach, we prioritize identification of more than 5,000 spectrum matches from modified peptides and confirmed the presence of acetylaldehyde modified His/Lys. We present a logical workflow that scores data-dependent selected ion chromatograms and leverage information about semi-analytical LC/LC dimension prior to mass spectrometry. Our method can be successfully used to identify unexpected modifications in peptides with excellent chromatography characteristics, independent of fragmentation pattern and activation methods. We illustrate analysis of ion chromatograms detected in two different modes by RF linear ion trap and electrostatic field orbitrap.
- Published
- 2013
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.