29 results on '"Lawrence-Dill CJ"'
Search Results
2. Genome-wide association studies from spoken phenotypic descriptions: a proof of concept from maize field studies.
- Author
-
Yanarella CF, Fattel L, and Lawrence-Dill CJ
- Subjects
- Genome, Plant, Quantitative Trait Loci, Quantitative Trait, Heritable, Polymorphism, Single Nucleotide, Zea mays genetics, Genome-Wide Association Study methods, Phenotype
- Abstract
We present a novel approach to genome-wide association studies (GWAS) by leveraging unstructured, spoken phenotypic descriptions to identify genomic regions associated with maize traits. Utilizing the Wisconsin Diversity panel, we collected spoken descriptions of Zea mays ssp. mays traits, converting these qualitative observations into quantitative data amenable to GWAS analysis. First, we determined that visually striking phenotypes could be detected from unstructured spoken phenotypic descriptions. Next, we developed two methods to process the same descriptions to derive the trait plant height, a well-characterized phenotypic feature in maize: (1) a semantic similarity metric that assigns a score based on the resemblance of each observation to the concept of 'tallness' and (2) a manual scoring system that categorizes and assigns values to phrases related to plant height. Our analysis successfully corroborated known genomic associations and uncovered novel candidate genes potentially linked to plant height. Some of these genes are associated with gene ontology terms that suggest a plausible involvement in determining plant stature. This proof-of-concept demonstrates the viability of spoken phenotypic descriptions in GWAS and introduces a scalable framework for incorporating unstructured language data into genetic association studies. This methodology has the potential not only to enrich the phenotypic data used in GWAS and to enhance the discovery of genetic elements linked to complex traits but also to expand the repertoire of phenotype data collection methods available for use in the field environment., Competing Interests: Conflicts of interest The author(s) declare no conflicts of interest., (© The Author(s) 2024. Published by Oxford University Press on behalf of The Genetics Society of America.)
- Published
- 2024
- Full Text
- View/download PDF
3. Wisconsin diversity panel phenotypes: spoken descriptions of plants and supporting data.
- Author
-
Yanarella CF, Fattel L, Kristmundsdóttir ÁÝ, Lopez MD, Edwards JW, Campbell DA, Abel CA, and Lawrence-Dill CJ
- Subjects
- Humans, Wisconsin, Data Collection, Farms, Phenotype, Agriculture
- Abstract
Objectives: Phenotyping plants in a field environment can involve a variety of methods including the use of automated instruments and labor-intensive manual measurement and scoring. Researchers also collect language-based phenotypic descriptions and use controlled vocabularies and structures such as ontologies to enable computation on descriptive phenotype data, including methods to determine phenotypic similarities. In this study, spoken descriptions of plants were collected and observers were instructed to use their own vocabulary to describe plant features that were present and visible. Further, these plants were measured and scored manually as part of a larger study to investigate whether spoken plant descriptions can be used to recover known biological phenomena., Data Description: Data comprise phenotypic observations of 686 accessions of the maize Wisconsin Diversity panel, and 25 positive control accessions that carry visible, dramatic phenotypes. The data include the list of accessions planted, field layout, data collection procedures, student participants' (whose personal data are protected for ethical reasons) and volunteers' observation transcripts, volunteers' audio data files, terrestrial and aerial images of the plants, Amazon Web Services method selection experimental data, and manually collected phenotypes (e.g., plant height, ear and tassel features, etc.; measurements and scores). Data were collected during the summer of 2021 at Iowa State University's Agricultural Engineering and Agronomy Research Farms., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
4. Current challenges and future of agricultural genomes to phenomes in the USA.
- Author
-
Tuggle CK, Clarke JL, Murdoch BM, Lyons E, Scott NM, Beneš B, Campbell JD, Chung H, Daigle CL, Das Choudhury S, Dekkers JCM, Dórea JRR, Ertl DS, Feldman M, Fragomeni BO, Fulton JE, Guadagno CR, Hagen DE, Hess AS, Kramer LM, Lawrence-Dill CJ, Lipka AE, Lübberstedt T, McCarthy FM, McKay SD, Murray SC, Riggs PK, Rowan TN, Sheehan MJ, Steibel JP, Thompson AM, Thornton KJ, Van Tassell CP, and Schnable PS
- Subjects
- United States, Genomics, Phenomics, Agriculture
- Abstract
Dramatic improvements in measuring genetic variation across agriculturally relevant populations (genomics) must be matched by improvements in identifying and measuring relevant trait variation in such populations across many environments (phenomics). Identifying the most critical opportunities and challenges in genome to phenome (G2P) research is the focus of this paper. Previously (Genome Biol, 23(1):1-11, 2022), we laid out how Agricultural Genome to Phenome Initiative (AG2PI) will coordinate activities with USA federal government agencies expand public-private partnerships, and engage with external stakeholders to achieve a shared vision of future the AG2PI. Acting on this latter step, AG2PI organized the "Thinking Big: Visualizing the Future of AG2PI" two-day workshop held September 9-10, 2022, in Ames, Iowa, co-hosted with the United State Department of Agriculture's National Institute of Food and Agriculture (USDA NIFA). During the meeting, attendees were asked to use their experience and curiosity to review the current status of agricultural genome to phenome (AG2P) work and envision the future of the AG2P field. The topic summaries composing this paper are distilled from two 1.5-h small group discussions. Challenges and solutions identified across multiple topics at the workshop were explored. We end our discussion with a vision for the future of agricultural progress, identifying two areas of innovation needed: (1) innovate in genetic improvement methods development and evaluation and (2) innovate in agricultural research processes to solve societal problems. To address these needs, we then provide six specific goals that we recommend be implemented immediately in support of advancing AG2P research., (© 2023. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
5. Gene function annotations for the maize NAM founder lines.
- Author
-
Fattel L, Yanarella CF, Ngara B, Johnson OT, Campbell DA, Wimalanathan K, and Lawrence-Dill CJ
- Subjects
- Phenotype, Zea mays genetics, Genome, Plant genetics
- Abstract
Objectives: We annotated the latest published sequences of the 26 Zea mays Nested Association Mapping (NAM) founder lines using GOMAP, the Gene Ontology Meta Annotator for Plants. The maize NAM panel enables researchers to understand and identify the genetic basis of complex traits. Annotations of predicted functions for genes can help researchers investigate gene-phenotype associations, prioritize candidate genes for phenotypes of interest, and formulate testable hypotheses about gene function/phenotype associations. The creation and release of high-confidence, high-coverage gene function annotation sets for the NAM founder lines is critical to accelerate the generation of knowledge in maize genetics research. GOMAP is a high-throughput computational pipeline that annotates gene functions genome-wide in plant genomes using Gene Ontology functional class terms. Here we report and share GOMAP-generated functional annotations for the NAM founder lines., Data Description: Datasets include the protein sequences used as input, GOMAP-generated annotation files, scripts used to update obsolete terms, and GAF-formatted tab-delimited text files of gene function annotations along with README files that describe formatting, content, and how files relate to each other., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
6. Standardized genome-wide function prediction enables comparative functional genomics: a new application area for Gene Ontologies in plants.
- Author
-
Fattel L, Psaroudakis D, Yanarella CF, Chiteri KO, Dostalik HA, Joshi P, Starr DC, Vu H, Wimalanathan K, and Lawrence-Dill CJ
- Subjects
- Databases, Genetic, Gene Ontology, Molecular Sequence Annotation, Phylogeny, Plants genetics, Genome, Plant, Genomics
- Abstract
Background: Genome-wide gene function annotations are useful for hypothesis generation and for prioritizing candidate genes potentially responsible for phenotypes of interest. We functionally annotated the genes of 18 crop plant genomes across 14 species using the GOMAP pipeline., Results: By comparison to existing GO annotation datasets, GOMAP-generated datasets cover more genes, contain more GO terms, and are similar in quality (based on precision and recall metrics using existing gold standards as the basis for comparison). From there, we sought to determine whether the datasets across multiple species could be used together to carry out comparative functional genomics analyses in plants. To test the idea and as a proof of concept, we created dendrograms of functional relatedness based on terms assigned for all 18 genomes. These dendrograms were compared to well-established species-level evolutionary phylogenies to determine whether trees derived were in agreement with known evolutionary relationships, which they largely are. Where discrepancies were observed, we determined branch support based on jackknifing then removed individual annotation sets by genome to identify the annotation sets causing unexpected relationships., Conclusions: GOMAP-derived functional annotations used together across multiple species generally retain sufficient biological signal to recover known phylogenetic relationships based on genome-wide functional similarities, indicating that comparative functional genomics across species based on GO data holds promise for generating novel hypotheses about comparative gene function and traits., (© The Author(s) 2022. Published by Oxford University Press GigaScience.)
- Published
- 2022
- Full Text
- View/download PDF
7. Ten simple rules to ruin a collaborative environment.
- Author
-
Lawrence-Dill CJ, Allscheid RL, Boaitey A, Bauman T, Buckler ES 4th, Clarke JL, Cullis C, Dekkers J, Dorius CJ, Dorius SF, Ertl D, Homann M, Hu G, Losch M, Lyons E, Murdoch B, Navabi ZK, Punnuri S, Rafiq F, Reecy JM, Schnable PS, Scott NM, Sheehan M, Sirault X, Staton M, Tuggle CK, Van Eenennaam A, and Voas R
- Abstract
Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2022
- Full Text
- View/download PDF
8. The Agricultural Genome to Phenome Initiative (AG2PI): creating a shared vision across crop and livestock research communities.
- Author
-
Tuggle CK, Clarke J, Dekkers JCM, Ertl D, Lawrence-Dill CJ, Lyons E, Murdoch BM, Scott NM, and Schnable PS
- Subjects
- Animals, Crops, Agricultural genetics, Agriculture, Livestock genetics
- Published
- 2022
- Full Text
- View/download PDF
9. Gene Ontology Meta Annotator for Plants (GOMAP).
- Author
-
Wimalanathan K and Lawrence-Dill CJ
- Abstract
Annotating gene structures and functions to genome assemblies is necessary to make assembly resources useful for biological inference. Gene Ontology (GO) term assignment is the most used functional annotation system, and new methods for GO assignment have improved the quality of GO-based function predictions. The Gene Ontology Meta Annotator for Plants (GOMAP) is an optimized, high-throughput, and reproducible pipeline for genome-scale GO annotation of plants. We containerized GOMAP to increase portability and reproducibility and also optimized its performance for HPC environments. Here we report on the pipeline's availability and performance for annotating large, repetitive plant genomes and describe how GOMAP was used to annotate multiple maize genomes as a test case. Assessment shows that GOMAP expands and improves the number of genes annotated and annotations assigned per gene as well as the quality (based on [Formula: see text]) of GO assignments in maize. GOMAP has been deployed to annotate other species including wheat, rice, barley, cotton, and soy. Instructions and access to the GOMAP Singularity container are freely available online at https://bioinformapping.com/gomap/ . A list of annotated genomes and links to data is maintained at https://dill-picl.org/projects/gomap/ .
- Published
- 2021
- Full Text
- View/download PDF
10. Utility of Climatic Information via Combining Ability Models to Improve Genomic Prediction for Yield Within the Genomes to Fields Maize Project.
- Author
-
Jarquin D, de Leon N, Romay C, Bohn M, Buckler ES, Ciampitti I, Edwards J, Ertl D, Flint-Garcia S, Gore MA, Graham C, Hirsch CN, Holland JB, Hooker D, Kaeppler SM, Knoll J, Lee EC, Lawrence-Dill CJ, Lynch JP, Moose SP, Murray SC, Nelson R, Rocheford T, Schnable JC, Schnable PS, Smith M, Springer N, Thomison P, Tuinstra M, Wisser RJ, Xu W, Yu J, and Lorenz A
- Abstract
Genomic prediction provides an efficient alternative to conventional phenotypic selection for developing improved cultivars with desirable characteristics. New and improved methods to genomic prediction are continually being developed that attempt to deal with the integration of data types beyond genomic information. Modern automated weather systems offer the opportunity to capture continuous data on a range of environmental parameters at specific field locations. In principle, this information could characterize training and target environments and enhance predictive ability by incorporating weather characteristics as part of the genotype-by-environment (G×E) interaction component in prediction models. We assessed the usefulness of including weather data variables in genomic prediction models using a naïve environmental kinship model across 30 environments comprising the Genomes to Fields (G2F) initiative in 2014 and 2015. Specifically four different prediction scenarios were evaluated (i) tested genotypes in observed environments; (ii) untested genotypes in observed environments; (iii) tested genotypes in unobserved environments; and (iv) untested genotypes in unobserved environments. A set of 1,481 unique hybrids were evaluated for grain yield. Evaluations were conducted using five different models including main effect of environments; general combining ability (GCA) effects of the maternal and paternal parents modeled using the genomic relationship matrix; specific combining ability (SCA) effects between maternal and paternal parents; interactions between genetic (GCA and SCA) effects and environmental effects; and finally interactions between the genetics effects and environmental covariates. Incorporation of the genotype-by-environment interaction term improved predictive ability across all scenarios. However, predictive ability was not improved through inclusion of naive environmental covariates in G×E models. More research should be conducted to link the observed weather conditions with important physiological aspects in plant development to improve predictive ability through the inclusion of weather data., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Jarquin, de Leon, Romay, Bohn, Buckler, Ciampitti, Edwards, Ertl, Flint-Garcia, Gore, Graham, Hirsch, Holland, Hooker, Kaeppler, Knoll, Lee, Lawrence-Dill, Lynch, Moose, Murray, Nelson, Rocheford, Schnable, Schnable, Smith, Springer, Thomison, Tuinstra, Wisser, Xu, Yu and Lorenz.)
- Published
- 2021
- Full Text
- View/download PDF
11. Editorial: Phenotyping; From Plant, to Data, to Impact and Highlights of the International Plant Phenotyping Symposium - IPPS 2018.
- Author
-
Pommier C, Garnett T, Lawrence-Dill CJ, Pridmore T, Watt M, Pieruschka R, and Ghamkhar K
- Abstract
Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
- Published
- 2020
- Full Text
- View/download PDF
12. Computing on Phenotypic Descriptions for Candidate Gene Discovery and Crop Improvement.
- Author
-
Braun IR, Yanarella CF, and Lawrence-Dill CJ
- Abstract
Many newly observed phenotypes are first described, then experimentally manipulated. These language-based descriptions appear in both the literature and in community datastores. To standardize phenotypic descriptions and enable simple data aggregation and analysis, controlled vocabularies and specific data architectures have been developed. Such simplified descriptions have several advantages over natural language: they can be rigorously defined for a particular context or problem, they can be assigned and interpreted programmatically, and they can be organized in a way that allows for semantic reasoning (inference of implicit facts). Because researchers generally report phenotypes in the literature using natural language, curators have been translating phenotypic descriptions into controlled vocabularies for decades to make the information computable. Unfortunately, this methodology is highly dependent on human curation, which does not scale to the scope of all publications available across all of plant biology. Simultaneously, researchers in other domains have been working to enable computation on natural language. This has resulted in new, automated methods for computing on language that are now available, with early analyses showing great promise. Natural language processing (NLP) coupled with machine learning (ML) allows for the use of unstructured language for direct analysis of phenotypic descriptions. Indeed, we have found that these automated methods can be used to create data structures that perform as well or better than those generated by human curators on tasks such as predicting gene function and biochemical pathway membership. Here, we describe current and ongoing efforts to provide tools for the plant phenomics community to explore novel predictions that can be generated using these techniques. We also describe how these methods could be used along with mobile speech-to-text tools to collect and analyze in-field spoken phenotypic descriptions for association genetics and breeding applications., Competing Interests: The authors declare that there are no conflicts of interest regarding the contents of this manuscript or its publication., (Copyright © 2020 Ian R. Braun et al.)
- Published
- 2020
- Full Text
- View/download PDF
13. GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations.
- Author
-
Manchanda N, Portwood JL 2nd, Woodhouse MR, Seetharam AS, Lawrence-Dill CJ, Andorf CM, and Hufford MB
- Subjects
- Chromosome Mapping, Computational Biology methods, High-Throughput Nucleotide Sequencing, Humans, Molecular Sequence Annotation, Sequence Analysis, DNA, Software, Genomics methods
- Abstract
Background: Genome assemblies are foundational for understanding the biology of a species. They provide a physical framework for mapping additional sequences, thereby enabling characterization of, for example, genomic diversity and differences in gene expression across individuals and tissue types. Quality metrics for genome assemblies gauge both the completeness and contiguity of an assembly and help provide confidence in downstream biological insights. To compare quality across multiple assemblies, a set of common metrics are typically calculated and then compared to one or more gold standard reference genomes. While several tools exist for calculating individual metrics, applications providing comprehensive evaluations of multiple assembly features are, perhaps surprisingly, lacking. Here, we describe a new toolkit that integrates multiple metrics to characterize both assembly and gene annotation quality in a way that enables comparison across multiple assemblies and assembly types., Results: Our application, named GenomeQC, is an easy-to-use and interactive web framework that integrates various quantitative measures to characterize genome assemblies and annotations. GenomeQC provides researchers with a comprehensive summary of these statistics and allows for benchmarking against gold standard reference assemblies., Conclusions: The GenomeQC web application is implemented in R/Shiny version 1.5.9 and Python 3.6 and is freely available at https://genomeqc.maizegdb.org/ under the GPL license. All source code and a containerized version of the GenomeQC pipeline is available in the GitHub repository https://github.com/HuffordLab/GenomeQC.
- Published
- 2020
- Full Text
- View/download PDF
14. Automated Methods Enable Direct Computation on Phenotypic Descriptions for Novel Candidate Gene Prediction.
- Author
-
Braun IR and Lawrence-Dill CJ
- Abstract
Natural language descriptions of plant phenotypes are a rich source of information for genetics and genomics research. We computationally translated descriptions of plant phenotypes into structured representations that can be analyzed to identify biologically meaningful associations. These representations include the entity-quality (EQ) formalism, which uses terms from biological ontologies to represent phenotypes in a standardized, semantically rich format, as well as numerical vector representations generated using natural language processing (NLP) methods (such as the bag-of-words approach and document embedding). We compared resulting phenotype similarity measures to those derived from manually curated data to determine the performance of each method. Computationally derived EQ and vector representations were comparably successful in recapitulating biological truth to representations created through manual EQ statement curation. Moreover, NLP methods for generating vector representations of phenotypes are scalable to large quantities of text because they require no human input. These results indicate that it is now possible to computationally and automatically produce and populate large-scale information resources that enable researchers to query phenotypic descriptions directly., (Copyright © 2020 Braun and Lawrence-Dill.)
- Published
- 2020
- Full Text
- View/download PDF
15. High-frequency random DNA insertions upon co-delivery of CRISPR-Cas9 ribonucleoprotein and selectable marker plasmid in rice.
- Author
-
Banakar R, Eggenberger AL, Lee K, Wright DA, Murugan K, Zarecor S, Lawrence-Dill CJ, Sashital DG, and Wang K
- Subjects
- Agrobacterium genetics, DNA Fragmentation, Ribonucleoproteins genetics, Ribonucleoproteins metabolism, CRISPR-Cas Systems genetics, Oryza genetics, Plasmids genetics, RNA, Plant genetics
- Abstract
An important advantage of delivering CRISPR reagents into cells as a ribonucleoprotein (RNP) complex is the ability to edit genes without reagents being integrated into the genome. Transient presence of RNP molecules in cells can reduce undesirable off-target effects. One method for RNP delivery into plant cells is the use of a biolistic gun. To facilitate selection of transformed cells during RNP delivery, a plasmid carrying a selectable marker gene can be co-delivered with the RNP to enrich for transformed/edited cells. In this work, we compare targeted mutagenesis in rice using three different delivery platforms: biolistic RNP/DNA co-delivery; biolistic DNA delivery; and Agrobacterium-mediated delivery. All three platforms were successful in generating desired mutations at the target sites. However, we observed a high frequency (over 14%) of random plasmid or chromosomal DNA fragment insertion at the target sites in transgenic events generated from both biolistic delivery platforms. In contrast, integration of random DNA fragments was not observed in transgenic events generated from the Agrobacterium-mediated method. These data reveal important insights that must be considered when selecting the method for genome-editing reagent delivery in plants, and emphasize the importance of employing appropriate molecular screening methods to detect unintended alterations following genome engineering.
- Published
- 2019
- Full Text
- View/download PDF
16. Assessing plant performance in the Enviratron.
- Author
-
Bao Y, Zarecor S, Shah D, Tuel T, Campbell DA, Chapman AVE, Imberti D, Kiekhaefer D, Imberti H, Lübberstedt T, Yin Y, Nettleton D, Lawrence-Dill CJ, Whitham SA, Tang L, and Howell SH
- Abstract
Background: Assessing the impact of the environment on plant performance requires growing plants under controlled environmental conditions. Plant phenotypes are a product of genotype × environment (G × E), and the Enviratron at Iowa State University is a facility for testing under controlled conditions the effects of the environment on plant growth and development. Crop plants (including maize) can be grown to maturity in the Enviratron, and the performance of plants under different environmental conditions can be monitored 24 h per day, 7 days per week throughout the growth cycle., Results: The Enviratron is an array of custom-designed plant growth chambers that simulate different environmental conditions coupled with precise sensor-based phenotypic measurements carried out by a robotic rover. The rover has workflow instructions to periodically visit plants growing in the different chambers where it measures various growth and physiological parameters. The rover consists of an unmanned ground vehicle, an industrial robotic arm and an array of sensors including RGB, visible and near infrared (VNIR) hyperspectral, thermal, and time-of-flight (ToF) cameras, laser profilometer and pulse-amplitude modulated (PAM) fluorometer. The sensors are autonomously positioned for detecting leaves in the plant canopy, collecting various physiological measurements based on computer vision algorithms and planning motion via "eye-in-hand" movement control of the robotic arm. In particular, the automated leaf probing function that allows the precise placement of sensor probes on leaf surfaces presents a unique advantage of the Enviratron system over other types of plant phenotyping systems., Conclusions: The Enviratron offers a new level of control over plant growth parameters and optimizes positioning and timing of sensor-based phenotypic measurements. Plant phenotypes in the Enviratron are measured in situ-in that the rover takes sensors to the plants rather than moving plants to the sensors., Competing Interests: Competing interestsThe authors declare no competing interests. David Imberti, Daniel Kiekhaefer and Henry Imberti are employees of Percival Scientific, Inc., the manufacturer of the Enviratron’s plant growth chambers., (© The Author(s) 2019.)
- Published
- 2019
- Full Text
- View/download PDF
17. MaizeDIG: Maize Database of Images and Genomes.
- Author
-
Cho KT, Portwood JL 2nd, Gardiner JM, Harper LC, Lawrence-Dill CJ, Friedberg I, and Andorf CM
- Abstract
Background: An organism can be described by its observable features (phenotypes) and the genes and genomic information (genotypes) that cause these phenotypes. For many decades, researchers have tried to find relationships between genotypes and phenotypes, and great strides have been made. However, improved methods and tools for discovering and visualizing these phenotypic relationships are still needed. The maize genetics and genomics database (MaizeGDB, www.maizegdb.org) provides an array of useful resources for diverse data types including thousands of images related to mutant phenotypes in Zea mays ssp. mays (maize). To integrate mutant phenotype images with genomics information, we implemented and enhanced the web-based software package BioDIG (Biological Database of Images and Genomes). Findings: We developed a genotype-phenotype database for maize called MaizeDIG. MaizeDIG has several enhancements over the original BioDIG package. MaizeDIG, which supports multiple reference genome assemblies, is seamlessly integrated with genome browsers to accommodate custom tracks showing tagged mutant phenotypes images in their genomic context and allows for custom tagging of images to highlight the phenotype. This is accomplished through an updated interface allowing users to create image-to-gene links and is accessible via the image search tool. Conclusions: We have created a user-friendly and extensible web-based resource called MaizeDIG. MaizeDIG is preloaded with 2,396 images that are available on genome browsers for 10 different maize reference genomes. Approximately 90 images of classically defined maize genes have been manually annotated. MaizeDIG is available at http://maizedig.maizegdb.org/. The code is free and open source and can be found at https://github.com/Maize-Genetics-and-Genomics-Database/maizedig.
- Published
- 2019
- Full Text
- View/download PDF
18. Activities and specificities of CRISPR/Cas9 and Cas12a nucleases for targeted mutagenesis in maize.
- Author
-
Lee K, Zhang Y, Kleinstiver BP, Guo JA, Aryee MJ, Miller J, Malzahn A, Zarecor S, Lawrence-Dill CJ, Joung JK, Qi Y, and Wang K
- Subjects
- Agrobacterium, Endonucleases genetics, Gene Targeting methods, Mutagenesis, Mutation, RNA, Guide, CRISPR-Cas Systems genetics, Sequence Alignment, Zea mays genetics, CRISPR-Cas Systems, Endonucleases metabolism, Gene Editing methods, Genome, Plant genetics, Zea mays enzymology
- Abstract
CRISPR/Cas9 and Cas12a (Cpf1) nucleases are two of the most powerful genome editing tools in plants. In this work, we compared their activities by targeting maize glossy2 gene coding region that has overlapping sequences recognized by both nucleases. We introduced constructs carrying SpCas9-guide RNA (gRNA) and LbCas12a-CRISPR RNA (crRNA) into maize inbred B104 embryos using Agrobacterium-mediated transformation. On-target mutation analysis showed that 90%-100% of the Cas9-edited T0 plants carried indel mutations and 63%-77% of them were homozygous or biallelic mutants. In contrast, 0%-60% of Cas12a-edited T0 plants had on-target mutations. We then conducted CIRCLE-seq analysis to identify genome-wide potential off-target sites for Cas9. A total of 18 and 67 potential off-targets were identified for the two gRNAs, respectively, with an average of five mismatches compared to the target sites. Sequencing analysis of a selected subset of the off-target sites revealed no detectable level of mutations in the T1 plants, which constitutively express Cas9 nuclease and gRNAs. In conclusion, our results suggest that the CRISPR/Cas9 system used in this study is highly efficient and specific for genome editing in maize, while CRISPR/Cas12a needs further optimization for improved editing efficiency., (© 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.)
- Published
- 2019
- Full Text
- View/download PDF
19. Crowdsourcing image analysis for plant phenomics to generate ground truth data for machine learning.
- Author
-
Zhou N, Siegel ZD, Zarecor S, Lee N, Campbell DA, Andorf CM, Nettleton D, Lawrence-Dill CJ, Ganapathysubramanian B, Kelly JW, and Friedberg I
- Subjects
- Algorithms, Data Accuracy, Food Supply, Humans, Internet, Phenotype, Pilot Projects, Crops, Agricultural physiology, Crowdsourcing methods, Image Processing, Computer-Assisted methods, Machine Learning
- Abstract
The accuracy of machine learning tasks critically depends on high quality ground truth data. Therefore, in many cases, producing good ground truth data typically involves trained professionals; however, this can be costly in time, effort, and money. Here we explore the use of crowdsourcing to generate a large number of training data of good quality. We explore an image analysis task involving the segmentation of corn tassels from images taken in a field setting. We investigate the accuracy, speed and other quality metrics when this task is performed by students for academic credit, Amazon MTurk workers, and Master Amazon MTurk workers. We conclude that the Amazon MTurk and Master Mturk workers perform significantly better than the for-credit students, but with no significant difference between the two MTurk worker types. Furthermore, the quality of the segmentation produced by Amazon MTurk workers rivals that of an expert worker. We provide best practices to assess the quality of ground truth data, and to compare data quality produced by different sources. We conclude that properly managed crowdsourcing can be used to establish large volumes of viable ground truth data at a low cost and high quality, especially in the context of high throughput plant phenotyping. We also provide several metrics for assessing the quality of the generated datasets., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2018
- Full Text
- View/download PDF
20. Maize Genomes to Fields: 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets.
- Author
-
AlKhalifah N, Campbell DA, Falcon CM, Gardiner JM, Miller ND, Romay MC, Walls R, Walton R, Yeh CT, Bohn M, Bubert J, Buckler ES, Ciampitti I, Flint-Garcia S, Gore MA, Graham C, Hirsch C, Holland JB, Hooker D, Kaeppler S, Knoll J, Lauter N, Lee EC, Lorenz A, Lynch JP, Moose SP, Murray SC, Nelson R, Rocheford T, Rodriguez O, Schnable JC, Scully B, Smith M, Springer N, Thomison P, Tuinstra M, Wisser RJ, Xu W, Ertl D, Schnable PS, De Leon N, Spalding EP, Edwards J, and Lawrence-Dill CJ
- Subjects
- Environment, Genome, Plant, Inbreeding, Plant Breeding, Seasons, Sequence Analysis, DNA, Datasets as Topic, Genotype, Phenotype, Zea mays genetics
- Abstract
Objectives: Crop improvement relies on analysis of phenotypic, genotypic, and environmental data. Given large, well-integrated, multi-year datasets, diverse queries can be made: Which lines perform best in hot, dry environments? Which alleles of specific genes are required for optimal performance in each environment? Such datasets also can be leveraged to predict cultivar performance, even in uncharacterized environments. The maize Genomes to Fields (G2F) Initiative is a multi-institutional organization of scientists working to generate and analyze such datasets from existing, publicly available inbred lines and hybrids. G2F's genotype by environment project has released 2014 and 2015 datasets to the public, with 2016 and 2017 collected and soon to be made available., Data Description: Datasets include DNA sequences; traditional phenotype descriptions, as well as detailed ear, cob, and kernel phenotypes quantified by image analysis; weather station measurements; and soil characterizations by site. Data are released as comma separated value spreadsheets accompanied by extensive README text descriptions. For genotypic and phenotypic data, both raw data and a version with outliers removed are reported. For weather data, two versions are reported: a full dataset calibrated against nearby National Weather Service sites and a second calibrated set with outliers and apparent artifacts removed.
- Published
- 2018
- Full Text
- View/download PDF
21. Response to Persistent ER Stress in Plants: A Multiphasic Process That Transitions Cells from Prosurvival Activities to Cell Death.
- Author
-
Srivastava R, Li Z, Russo G, Tang J, Bi R, Muppirala U, Chudalayandi S, Severin A, He M, Vaitkevicius SI, Lawrence-Dill CJ, Liu P, Stapleton AE, Bassham DC, Brandizzi F, and Howell SH
- Subjects
- Cell Death genetics, Endoplasmic Reticulum Stress genetics, RNA, Messenger genetics, RNA, Messenger metabolism, Unfolded Protein Response genetics, Zea mays genetics, Cell Death physiology, Endoplasmic Reticulum Stress physiology, Unfolded Protein Response physiology, Zea mays cytology, Zea mays metabolism
- Abstract
The unfolded protein response (UPR) is a highly conserved response that protects plants from adverse environmental conditions. The UPR is elicited by endoplasmic reticulum (ER) stress, in which unfolded and misfolded proteins accumulate within the ER. Here, we induced the UPR in maize ( Zea mays ) seedlings to characterize the molecular events that occur over time during persistent ER stress. We found that a multiphasic program of gene expression was interwoven among other cellular events, including the induction of autophagy. One of the earliest phases involved the degradation by regulated IRE1-dependent RNA degradation (RIDD) of RNA transcripts derived from a family of peroxidase genes. RIDD resulted from the activation of the promiscuous ribonuclease activity of ZmIRE1 that attacks the mRNAs of secreted proteins. This was followed by an upsurge in expression of the canonical UPR genes indirectly driven by ZmIRE1 due to its splicing of Zmbzip60 mRNA to make an active transcription factor that directly upregulates many of the UPR genes. At the peak of UPR gene expression, a global wave of RNA processing led to the production of many aberrant UPR gene transcripts, likely tempering the ER stress response. During later stages of ER stress, ZmIRE1's activity declined, as did the expression of survival modulating genes, Bax inhibitor1 and Bcl-2-associated athanogene7 , amid a rising tide of cell death. Thus, in response to persistent ER stress, maize seedlings embark on a course of gene expression and cellular events progressing from adaptive responses to cell death., (© 2018 American Society of Plant Biologists. All rights reserved.)
- Published
- 2018
- Full Text
- View/download PDF
22. Maize GO Annotation-Methods, Evaluation, and Review (maize-GAMER).
- Author
-
Wimalanathan K, Friedberg I, Andorf CM, and Lawrence-Dill CJ
- Abstract
We created a new high-coverage, robust, and reproducible functional annotation of maize protein-coding genes based on Gene Ontology (GO) term assignments. Whereas the existing Phytozome and Gramene maize GO annotation sets only cover 41% and 56% of maize protein-coding genes, respectively, this study provides annotations for 100% of the genes. We also compared the quality of our newly derived annotations with the existing Gramene and Phytozome functional annotation sets by comparing all three to a manually annotated gold standard set of 1,619 genes where annotations were primarily inferred from direct assay or mutant phenotype. Evaluations based on the gold standard indicate that our new annotation set is measurably more accurate than those from Phytozome and Gramene. To derive this new high-coverage, high-confidence annotation set, we used sequence similarity and protein domain presence methods as well as mixed-method pipelines that were developed for the Critical Assessment of Function Annotation (CAFA) challenge. Our project to improve maize annotations is called maize-GAMER (GO Annotation Method, Evaluation, and Review), and the newly derived annotations are accessible via MaizeGDB (http://download.maizegdb.org/maize-GAMER) and CyVerse (B73 RefGen_v3 5b+ at doi.org/10.7946/P2S62P and B73 RefGen_v4 Zm00001d.2 at doi.org/10.7946/P2M925)., Competing Interests: None declared.
- Published
- 2018
- Full Text
- View/download PDF
23. Sowing the seeds of skepticism: Russian state news and anti-GMO sentiment.
- Author
-
Dorius SF and Lawrence-Dill CJ
- Subjects
- Agriculture, Russia, Social Media, Organisms, Genetically Modified metabolism
- Abstract
Biotech news coverage in English-language Russian media fits the profile of the Russian information warfare strategy described in recent military reports. This raises the question of whether Russia views the dissemination of anti-GMO information as just one of many divisive issues it can exploit as part of its information war, or if GMOs serve more expansive disruptive purposes. Distinctive patterns in Russian news provide evidence of a coordinated information campaign that could turn public opinion against genetic engineering. The recent branding of Russian agriculture as the ecologically clean alternative to genetically engineered foods is suggestive of an economic motive behind the information campaign against western biotechnologies.
- Published
- 2018
- Full Text
- View/download PDF
24. The effect of artificial selection on phenotypic plasticity in maize.
- Author
-
Gage JL, Jarquin D, Romay C, Lorenz A, Buckler ES, Kaeppler S, Alkhalifah N, Bohn M, Campbell DA, Edwards J, Ertl D, Flint-Garcia S, Gardiner J, Good B, Hirsch CN, Holland J, Hooker DC, Knoll J, Kolkman J, Kruger G, Lauter N, Lawrence-Dill CJ, Lee E, Lynch J, Murray SC, Nelson R, Petzoldt J, Rocheford T, Schnable J, Schnable PS, Scully B, Smith M, Springer NM, Srinivasan S, Walton R, Weldekidan T, Wisser RJ, Xu W, Yu J, and de Leon N
- Subjects
- Chimera, Gene Frequency, Genetic Variation, Phenotype, Plant Breeding, Selection, Genetic, Tropical Climate, Zea mays genetics, Genome, Plant, Polymorphism, Single Nucleotide, Zea mays physiology
- Abstract
Remarkable productivity has been achieved in crop species through artificial selection and adaptation to modern agronomic practices. Whether intensive selection has changed the ability of improved cultivars to maintain high productivity across variable environments is unknown. Understanding the genetic control of phenotypic plasticity and genotype by environment (G × E) interaction will enhance crop performance predictions across diverse environments. Here we use data generated from the Genomes to Fields (G2F) Maize G × E project to assess the effect of selection on G × E variation and characterize polymorphisms associated with plasticity. Genomic regions putatively selected during modern temperate maize breeding explain less variability for yield G × E than unselected regions, indicating that improvement by breeding may have reduced G × E of modern temperate cultivars. Trends in genomic position of variants associated with stability reveal fewer genic associations and enrichment of variants 0-5000 base pairs upstream of genes, hypothetically due to control of plasticity by short-range regulatory elements.
- Published
- 2017
- Full Text
- View/download PDF
25. Achieving Plant CRISPR Targeting that Limits Off-Target Effects.
- Author
-
Wolt JD, Wang K, Sashital D, and Lawrence-Dill CJ
- Subjects
- Mutation, CRISPR-Cas Systems genetics, Gene Editing methods, Genome, Plant genetics
- Abstract
The CRISPR-Cas9 system (clustered regularly interspaced short palindromic repeats with associated Cas9 protein) has been used to generate targeted changes for direct modification of endogenous genes in an increasing number of plant species; but development of plant genome editing has not yet fully considered potential off-target mismatches that may lead to unintended changes within the genome. Assessing the specificity of CRISPR-Cas9 for increasing editing efficiency as well as the potential for unanticipated downstream effects from off-target mutations is an important regulatory consideration for agricultural applications. Increasing genome-editing specificity entails developing improved design methods that better predict the prevalence of off-target mutations as a function of genome composition and design of the engineered ribonucleoprotein (RNP). Early results from CRISPR-Cas9 genome editing in plant systems indicate that the incidence of off-target mutation frequencies is quite low; however, by analyzing CRISPR-edited plant lines and improving both computational tools and reagent design, it may be possible to further decrease unanticipated effects at potential mismatch sites within the genome. This will provide assurance that CRISPR-Cas9 reagents can be designed and targeted with a high degree of specificity. Improved and experimentally validated design tools for discriminating target and potential off-target positions that incorporate consideration of the designed nuclease fidelity and selectivity will help to increase confidence for regulatory decision making for genome-edited plants., (Copyright © 2016 Crop Science Society of America.)
- Published
- 2016
- Full Text
- View/download PDF
26. The Quest for Understanding Phenotypic Variation via Integrated Approaches in the Field Environment.
- Author
-
Pauli D, Chapman SC, Bart R, Topp CN, Lawrence-Dill CJ, Poland J, and Gore MA
- Subjects
- Crops, Agricultural, DNA, Plant, Genetic Variation, Molecular Sequence Data, Phenotype, Plant Development, Plant Physiological Phenomena, Plant Roots, Plants genetics, Biological Variation, Population genetics, Environment
- Published
- 2016
- Full Text
- View/download PDF
27. MaizeGDB update: new tools, data and interface for the maize model organism database.
- Author
-
Andorf CM, Cannon EK, Portwood JL 2nd, Gardiner JM, Harper LC, Schaeffer ML, Braun BL, Campbell DA, Vinnakota AG, Sribalusu VV, Huerta M, Cho KT, Wimalanathan K, Richter JD, Mauch ED, Rao BS, Birkett SM, Sen TZ, and Lawrence-Dill CJ
- Subjects
- Gene Expression, Genes, Plant, Genetic Variation, Genome, Plant, Metabolic Networks and Pathways, Models, Genetic, Software, User-Computer Interface, Zea mays metabolism, Databases, Genetic, Zea mays genetics
- Abstract
MaizeGDB is a highly curated, community-oriented database and informatics service to researchers focused on the crop plant and model organism Zea mays ssp. mays. Although some form of the maize community database has existed over the last 25 years, there have only been two major releases. In 1991, the original maize genetics database MaizeDB was created. In 2003, the combined contents of MaizeDB and the sequence data from ZmDB were made accessible as a single resource named MaizeGDB. Over the next decade, MaizeGDB became more sequence driven while still maintaining traditional maize genetics datasets. This enabled the project to meet the continued growing and evolving needs of the maize research community, yet the interface and underlying infrastructure remained unchanged. In 2015, the MaizeGDB team completed a multi-year effort to update the MaizeGDB resource by reorganizing existing data, upgrading hardware and infrastructure, creating new tools, incorporating new data types (including diversity data, expression data, gene models, and metabolic pathways), and developing and deploying a modern interface. In addition to coordinating a data resource, the MaizeGDB team coordinates activities and provides technical support to the maize research community. MaizeGDB is accessible online at http://www.maizegdb.org., (© Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.)
- Published
- 2016
- Full Text
- View/download PDF
28. Emerging semantics to link phenotype and environment.
- Author
-
Thessen AE, Bunker DE, Buttigieg PL, Cooper LD, Dahdul WM, Domisch S, Franz NM, Jaiswal P, Lawrence-Dill CJ, Midford PE, Mungall CJ, Ramírez MJ, Specht CD, Vogt L, Vos RA, Walls RL, White JW, Zhang G, Deans AR, Huala E, Lewis SE, and Mabee PM
- Abstract
Understanding the interplay between environmental conditions and phenotypes is a fundamental goal of biology. Unfortunately, data that include observations on phenotype and environment are highly heterogeneous and thus difficult to find and integrate. One approach that is likely to improve the status quo involves the use of ontologies to standardize and link data about phenotypes and environments. Specifying and linking data through ontologies will allow researchers to increase the scope and flexibility of large-scale analyses aided by modern computing methods. Investments in this area would advance diverse fields such as ecology, phylogenetics, and conservation biology. While several biological ontologies are well-developed, using them to link phenotypes and environments is rare because of gaps in ontological coverage and limits to interoperability among ontologies and disciplines. In this manuscript, we present (1) use cases from diverse disciplines to illustrate questions that could be answered more efficiently using a robust linkage between phenotypes and environments, (2) two proof-of-concept analyses that show the value of linking phenotypes to environments in fishes and amphibians, and (3) two proposed example data models for linking phenotypes and environments using the extensible observation ontology (OBOE) and the Biological Collections Ontology (BCO); these provide a starting point for the development of a data model linking phenotypes and environments.
- Published
- 2015
- Full Text
- View/download PDF
29. A quick guide to CRISPR sgRNA design tools.
- Author
-
Brazelton VA Jr, Zarecor S, Wright DA, Wang Y, Liu J, Chen K, Yang B, and Lawrence-Dill CJ
- Subjects
- Animals, Computational Biology, Genetic Engineering methods, Genome, RNA chemistry, Sequence Analysis, RNA, Species Specificity, CRISPR-Cas Systems, Internet, Software
- Abstract
Targeted genome editing is now possible in nearly any organism and is widely acknowledged as a biotech game-changer. Among available gene editing techniques, the CRISPR-Cas9 system is the current favorite because it has been shown to work in many species, does not necessarily result in the addition of foreign DNA at the target site, and follows a set of simple design rules for target selection. Use of the CRISPR-Cas9 system is facilitated by the availability of an array of CRISPR design tools that vary in design specifications and parameter choices, available genomes, graphical visualization, and downstream analysis functionality. To help researchers choose a tool that best suits their specific research needs, we review the functionality of various CRISPR design tools including our own, the CRISPR Genome Analysis Tool (CGAT; http://cropbioengineering.iastate.edu/cgat ).
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.