193 results on '"Zhiwu Zhang"'
Search Results
2. Linking Phenotypes to Protein Characteristics in 3D Structures Predicted by Alphafold
- Author
-
Atit Parajuli, Robert Brueggeman, Steven Wagner, Marilyn Warburton, Michael Peel, Longxi Yu, Deven See, and Zhiwu Zhang
- Abstract
Plant breeding aims to develop elite crop varieties appropriate for various environments with higher quality and quantity of production. Researchers use quantitative trait loci (QTL) mapping and association studies to identify regions in the genome responsible for the variation of the quantitative traits of interest. However, mapped regions do not always translate to functional proteins, which makes it challenging to identify genes associated with traits of interest. The biological functions of proteins are strongly dependent on their 3D structure. Alternatively, if proteins can be directly linked with the phenotypes, the effect of mutations on phenotypic changes can be assessed. Innovation of deep learning models in biology opens new avenues of exploration. AlphaFold is an AI system that predicts the 3D structure of a protein from its amino acid sequence with near experimental accuracy and was used in this study. Point mutations with a significant influence on the 3D structure of a protein can capture the effect on phenotypes through association study, and this provides insights into the regions that are of functional importance. In the current study, 534 plants were selected based on plant vigor, and 154 missense variants that change amino acid sequences, including 5 significant hits from previous study, were included. The changes in protein 3D structure were assessed by association with the phenotype. The analysis identified five significant associations, four of which were also identified in previous study of SNPs GWAS, however, a new fifth association was also identified which was annotated as disease resistance gene in Medicago truncatula. This study helps to associate SNPs that could be missed by GWAS due to stringent Bonferroni corrected p-values by providing a more robust filter for SNPs using features from predicted protein 3D structures.
- Published
- 2023
3. Bulked Target Capture Sequencing Identified Numerous Genetic Loci Associated with Alfalfa Growth Vigor During Inbreeding
- Author
-
Atit Parajuli, Robert Brueggeman, Steven Wagner, Marilyn Warburton, Michael Peel, Longxi Yu, Deven See, and Zhiwu Zhang
- Abstract
Alfalfa primarily produces seeds through cross-pollination among different individuals. Self-pollination results in severe inbreeding depression, such as weak seedlings and termination of growth. Identifying the genetic loci associated with vigorous plant growth could enable the elimination of deleterious alleles and eventually develop inbred alfalfa lines for hybrid production. In this study, 215 alfalfa accessions were self-pollinated for three generations. Within accessions, pairs of weak and strong plants were sampled and bulked for exome sequencing. We extracted individual DNA from 534 plants that included parental clones, strong and weak pairs of plants, and plants selected based on the number of seeds produced. Among them, we formed 42 pools, including 16 with strong plants and 17 with weak plants, 3 top-seeded plants, 3 low-seeded plants, and 3 no-seeded plants. Along with 79 individuals, these pools were sequenced in the target regions covering 112,626 contigs across the entire alfalfa genome. From the 121 samples, (42+79) genotyped, 13.2 million SNPs including indels were generated. After filtering for MAF (>5%), depth (
- Published
- 2023
4. Streamline unsupervised machine learning to survey and graph indel-based haplotypes from pan-genomes
- Author
-
Bosen Zhang, Haiyan Huang, Laura E. Tibbs-Cortes, Adam Vanous, Zhiwu Zhang, Karen Sanguinet, Kimberly A. Garland-Campbell, Jianming Yu, and Xianran Li
- Subjects
General Medicine - Published
- 2023
5. In Silico Analysis Identified bZIP Transcription Factors under Abiotic Stress in Alfalfa (Medicago sativa L.)
- Author
-
Atit Parajuli, Bhabesh Borphukan, Karen A. Sanguinet, and Zhiwu Zhang
- Subjects
plant_sciences - Abstract
Background Alfalfa (Medicago sativa L.) is the most cultivated forage legume around the world. Under a variety of growing conditions, forage yield in alfalfa is stymied by biotic and abiotic stresses including heat, salt, drought, and disease. Given the sessile nature of plants, they use strategies including, but not limited to, differential gene expression to respond to environmental cues. Transcription factors control the expression of genes that contribute to or enable tolerance and survival during periods of stress. Basic-leucine zipper (bZIP) transcription factors have been demonstrated to play a critical role in regulating plant growth and development as well as mediate the responses to abiotic stress in several species, including Arabidopsis thaliana, Oryza sativa, Lotus japonicus and Medicago truncatula. However, there is little information about bZIP transcription factors in cultivated alfalfa.Result In the present study, 237 bZIP genes were identified in alfalfa from publicly available sequencing data. Multiple sequence alignments showed the presence of intact bZIP motifs in the identified sequences. Based on previous phylogenetic analyses in A. thaliana, alfalfa bZIPs were similarly divided and fell into 10 groups. The physico-chemical properties, motif analysis and phylogenetic study of the alfalfa bZIPs revealed high specificity within groups. The differential expression of alfalfa bZIPs in a suite of tissues indicates that bZIP genes are specifically expressed at different developmental stages in alfalfa. Similarly, expression analysis in response to ABA, cold, drought and salt stresses, indicates that a subset of bZIP genes are also differentially expressed and likely play a role in abiotic stress signaling and/or tolerance. RT-qPCR analysis on selected genes further verified these differential expression patterns.Conclusions Taken together, this work provides a framework for the future study of bZIPs in alfalfa and presents candidate bZIPs involved in stress-response signaling.
- Published
- 2023
6. In Silico Analysis Identified bZIP Transcription Factors under Abiotic Stress in Alfalfa (Medicago sativa L.)
- Author
-
Atit Parajuli, Bhabesh Borphukan, Karen Sanguinet, and Zhiwu Zhang
- Abstract
Alfalfa (Medicago sativa L.) is the most cultivated forage legume around the world. Under a variety of growing conditions, forage yield in alfalfa is stymied by biotic and abiotic stresses including heat, salt, drought, and disease. Given the sessile nature of plants, they use strategies such as differential gene expression to respond to environmental cues. Transcription factors control the expression of genes that contribute to or enable tolerance and survival during periods of stress. Basic-leucine zipper (bZIP) transcription factors have been demonstrated to play a critical role in regulating plant growth and development as well as mediate the responses to abiotic stress in several species, including Arabidopsis thaliana, Oryza sativa, Lotus japonicus, and Medicago truncatula. However, there is little information about bZIP transcription factors in cultivated alfalfa. In the present study, 237 bZIP genes were identified in alfalfa from publicly available sequencing data. Multiple sequence alignments showed the presence of intact bZIP motifs in the identified sequences. Based on previous phylogenetic analyses in Arabidopsis thaliana, alfalfa bZIPs were similarly divided and fell into 10 groups. The physicochemical properties, motif analysis, and phylogenetic study of the alfalfa bZIPs revealed high specificity within groups. The differential expression of alfalfa bZIPs in a suite of tissues indicates that particular bZIP genes are specifically expressed at different developmental stages in alfalfa. Similarly, expression analysis in response to ABA, cold, drought, and salt stresses, indicates that a subset of bZIP genes are also differentially expressed and likely play a role in abiotic stress signaling and/or tolerance. These expression patterns were further verified by qRT-PCR. However, further functional characterization of bZIP transcription factors in alfalfa will help illuminate the role they play in stress tolerance mechanisms in legumes and facilitate the molecular breeding of stress tolerance in alfalfa.
- Published
- 2023
7. Genetics-inspired data-driven approaches explain and predict crop performance fluctuations attributed to changing climatic conditions
- Author
-
Xianran Li, Tingting Guo, Guihua Bai, Zhiwu Zhang, Deven See, Juliet Marshall, Kimberly A. Garland-Campbell, and Jianming Yu
- Subjects
Crops, Agricultural ,Climate Change ,Plant Science ,Molecular Biology - Published
- 2022
8. Fruit cracking and firmness DNA test development and evaluation in sweet cherry
- Author
-
W. Wesley Crump, Cameron Peace, Zhiwu Zhang, and Per McCord
- Published
- 2022
9. Streamline unsupervised machine learning to survey and graph indel-based haplotypes from pan-genomes
- Author
-
Bosen Zhang, Haiyan Huang, Laura E. Tibbs-Cortes, Adam Vanous, Zhiwu Zhang, Karen Sanguinet, Kimberly A. Garland-Campbell, Jianming Yu, and Xianran Li
- Abstract
Identification and visualization of large insertion and deletion (indel) polymorphisms, which contribute significantly to natural phenotypic variation, are challenge from a pan-genome. Here, through streamlining two unsupervised machine learning algorithms, we developed a BRIDGEcereal webapp for surveying and graphing indel-based haplotypes for genes of interest from publicly accessible pangenomes. Over hundreds of assemblies from five major cereals were compiled. We demonstrated the potential of BRIDGEcereal in exploring natural variation with wheat candidate genes within QTLs and GWAS intervals. BRIDGEcereal is available fromhttps://bridgecereal.scinet.usda.gov.
- Published
- 2023
10. Researching language learning motivation: A concise guide
- Author
-
Ran Pei and Zhiwu Zhang
- Subjects
General Psychology - Published
- 2023
11. An independent validation reveals the potential to predict Hagberg–Perten falling number using spectrometers
- Author
-
Chun‐Peng James Chen, Yang Hu, Xianran Li, Craig F. Morris, Stephen Delwiche, Arron H. Carter, Camille Steber, and Zhiwu Zhang
- Subjects
Plant Science ,Agronomy and Crop Science - Published
- 2023
12. Ideas in Genomic Selection with the Potential to Transform Plant Molecular Breeding
- Author
-
Matthew T McGowan, Alexander E. Lipka, Hiroyoshi Iwata, Xiangfeng Wang, Xiaolei Liu, Zhiwu Zhang, Yutao Li, Haixiao Dong, Jiabo Wang, and Yi Jia
- Subjects
2. Zero hunger ,Molecular breeding ,0303 health sciences ,04 agricultural and veterinary sciences ,Computational biology ,Best linear unbiased prediction ,Biology ,03 medical and health sciences ,Bayes' theorem ,040103 agronomy & agriculture ,anatomy_morphology ,0401 agriculture, forestry, and fisheries ,Plant breeding ,Genomic selection ,030304 developmental biology - Abstract
Estimation of breeding values through Best Linear Unbiased Prediction (BLUP) using pedigree-based kinship and Marker-Assisted Selection (MAS) are the two fundamental breeding methods used before and after the introduction of genetic markers, respectively. The emergence of high-density genome-wide markers has led to the development of two parallel series of approaches inspired by BLUP and MAS, which are collectively referred to as Genomic Selection (GS). The first series of GS methods alters pedigree-based BLUP by replacing pedigree-based kinship with marker-based kinship in a variety of ways, including weighting markers by their effects in genome-wide association study (GWAS), joining both pedigree and marker-based kinship together in a single-step BLUP, and substituting individuals with groups in a compressed BLUP. The second series of GS methods estimates the effects for all genetic markers simultaneously. For the second series methods, the marker effects are summed together regardless of their individual significance. Instead of fitting individuals as random effects like in the BLUP series, the second series fits markers as random effects. Differing assumptions regarding the underlying distribution of these marker effects have resulted in the development of many Bayesian-based GS methods. This review highlights critical concept developments for both of these series and explores ongoing GS developments in machine learning, multiple trait selection, and adaptation for hybrid breeding. Furthermore, considering the increasing use and variety of GS methods in plant breeding programs, this review addresses important concerns for future GS development and application, such as the use of GWAS-assisted GS, the long-term effectiveness of GS methods, and the valid assessment of prediction accuracy.
- Published
- 2021
13. GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction
- Author
-
Jiabo Wang and Zhiwu Zhang
- Subjects
Mixed model ,Boosting (machine learning) ,Source code ,Genotype ,Computer science ,media_common.quotation_subject ,Best linear unbiased prediction ,computer.software_genre ,Polymorphism, Single Nucleotide ,Biochemistry ,Linkage Disequilibrium ,Statistical power ,Software ,Genetics ,Molecular Biology ,Selection (genetic algorithm) ,media_common ,General linear model ,Genome ,Models, Genetic ,business.industry ,Bayes Theorem ,Genomics ,Computational Mathematics ,Phenotype ,Data mining ,business ,computer ,Genome-Wide Association Study - Abstract
Genome-wide association study (GWAS) and genomic prediction/selection (GP/GS) are the two essential enterprises in genomic research. Due to the great magnitude and complexity of genomic and phenotypic data, analytical methods and their associated software packages are frequently advanced. GAPIT is a widely-used genomic association and prediction integrated tool as an R package. The first version was released to the public in 2012 with the implementation of the general linear model (GLM), mixed linear model (MLM), compressed MLM (CMLM), and genomic best linear unbiased prediction (gBLUP). The second version was released in 2016 with several new implementations, including enriched CMLM (ECMLM) and settlement of MLMs under progressively exclusive relationship (SUPER). All the GWAS methods are based on the single-locus test. For the first time, in the current release of GAPIT, version 3 implemented three multi-locus test methods, including multiple loci mixed model (MLMM), fixed and random model circulating probability unification (FarmCPU), and Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK). Additionally, two GP/GS methods were implemented based on CMLM (named compressed BLUP; cBLUP) and SUPER (named SUPER BLUP; sBLUP). These new implementations not only boost statistical power for GWAS and prediction accuracy for GP/GS, but also improve computing speed and increase the capacity to analyze big genomic data. Here, we document the current upgrade of GAPIT by describing the selection of the recently developed methods, their implementations, and potential impact. All documents, including source code, user manual, demo data, and tutorials, are freely available at the GAPIT website (http://zzlab.net/GAPIT).
- Published
- 2021
14. Investigation of Quench Sensitivity and Microstructure Evolution During Isothermal Treatment in 2195 Al–Li Alloy
- Author
-
Shiquan Huang, Hailin He, Zhiwu Zhang, Youping Yi, Yonglin Guo, and Wen You
- Subjects
Quenching ,Materials science ,Precipitation (chemistry) ,Metals and Alloys ,Nucleation ,Atmospheric temperature range ,Condensed Matter Physics ,Microstructure ,Isothermal process ,Precipitation hardening ,Differential scanning calorimetry ,Mechanics of Materials ,Materials Chemistry ,Composite material - Abstract
To investigate the quenching sensitivity of the 2195 Al–Li alloy rolled sheet and guide the design of the quenching process, the time–temperature-property (TTP) curves of this material were researched through interrupted quenching experiments. The differential scanning calorimetry (DSC) and transmission electron microscope (TEM) were used to characterize the evolution of precipitates during isothermal treatment. The results of this essay demonstrated that the nose temperature of 2195 Al–Li alloy is around 370 °C and the temperature range of quenching sensitivity is 340 °C to 400 °C. The microstructure observation revealed that the T1 particles precipitate and grow rapidly at the temperature from 340 to 400 °C, which is due to the high nucleation rate of phase and fast solute diffusion kinetics, especially at the nose temperature. The needle-shaped θ′/θ″ and T1 particles grow up quickly as the isothermal preservation time prolonged, leading to the decrease of the supersaturated solid solution of the matrix. This will reduce the number of the age-induced precipitate and weaken the subsequent age hardening effect. Therefore, the rate of cooling should be increased in the quenching sensitivity range (340–400 °C) to inhibit the precipitation of the second phase and obtain excellent mechanical properties. While in other temperature ranges, the cooling rate should be decreased appropriately to reduce residual stress. The appropriate average cooling rate is recommended to be around 13 °C s−1 at the temperature from 340 to 400 °C.
- Published
- 2021
15. Joint analysis of days to flowering reveals independent temperate adaptations in maize
- Author
-
Lynn Johnson, Yongxiang Li, Cinta Romay, Zachary R. Miller, Tianyu Wang, Chris-Carolin Schön, Tiffany Ho, Eva Bauer, Zhiwu Zhang, Peter J. Bradbury, Edward S. Buckler, Yu Li, Kelly Swarts, and Jeffrey C. Glaubitz
- Subjects
0106 biological sciences ,0301 basic medicine ,Population ,Climate change ,Flowers ,Biology ,Zea mays ,010603 evolutionary biology ,01 natural sciences ,Article ,03 medical and health sciences ,Genetics ,Temperate climate ,Nested association mapping ,Domestication ,education ,Alleles ,Genetic Association Studies ,Genetics (clinical) ,2. Zero hunger ,education.field_of_study ,Ecology ,15. Life on land ,Adaptation, Physiological ,Genetic architecture ,030104 developmental biology ,13. Climate action ,Trait ,Adaptation - Abstract
Domesticates are an excellent model for understanding biological consequences of rapid climate change. Maize (Zea mays ssp. mays) was domesticated from a tropical grass yet is widespread across temperate regions today. We investigate the biological basis of temperate adaptation in diverse structured nested association mapping (NAM) populations from China, Europe (Dent and Flint) and the United States as well as in the Ames inbred diversity panel, using days to flowering as a proxy. Using cross-population prediction, where high prediction accuracy derives from overall genomic relatedness, shared genetic architecture, and sufficient diversity in the training population, we identify patterns in predictive ability across the five populations. To identify the source of temperate adapted alleles in these populations, we predict top associated genome-wide association study (GWAS) identified loci in a Random Forest Classifier using independent temperate–tropical North American populations based on lines selected from Hapmap3 as predictors. We find that North American populations are well predicted (AUC equals 0.89 and 0.85 for Ames and USNAM, respectively), European populations somewhat well predicted (AUC equals 0.59 and 0.67 for the Dent and Flint panels, respectively) and that the Chinese population is not predicted well at all (AUC is 0.47), suggesting an independent adaptation process for early flowering in China. Multiple adaptations for the complex trait days to flowering in maize provide hope for similar natural systems under climate change.
- Published
- 2021
16. Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies
- Author
-
Jiabo, Wang, Jianming, Yu, Alexander E, Lipka, and Zhiwu, Zhang
- Subjects
Genome ,Chromosome Mapping ,Humans ,Linkage Disequilibrium ,Software ,Genome-Wide Association Study - Abstract
With increasing marker density, estimation of recombination rate between a marker and a causal mutation using linkage analysis becomes less important. Instead, linkage disequilibrium (LD) becomes the major indicator for gene mapping through genome-wide association studies (GWAS). In addition to the linkage between the marker and the causal mutation, many other factors may contribute to the LD, including population structure and cryptic relationships among individuals. As statistical methods and software evolve to improve statistical power and computing speed in GWAS, the corresponding outputs must also evolve to facilitate the interpretation of input data, the analytical process, and final association results. In this chapter, our descriptions focus on (1) considerations in creating a Manhattan plot displaying the strength of LD and locations of markers across a genome; (2) criteria for genome-wide significance threshold and the different appearance of Manhattan plots in single-locus and multiple-locus models; (3) exploration of population structure and kinship among individuals; (4) quantile-quantile (QQ) plot; (5) LD decay across the genome and LD between the associated markers and their neighbors; (6) exploration of individual and marker information on Manhattan and QQ plots via interactive visualization using HTML. The ultimate objective of this chapter is to help users to connect input data to GWAS outputs to balance power and false positives, and connect GWAS outputs to the selection of candidate genes using LD extent.
- Published
- 2022
17. Performing Genome-Wide Association Studies with Multiple Models Using GAPIT
- Author
-
Jiabo, Wang, You, Tang, and Zhiwu, Zhang
- Subjects
Genetic Markers ,Genome ,Bayes Theorem ,Genomics ,Genome-Wide Association Study - Abstract
Genome-wide association study (GWAS) is based on the linkage disequilibrium (LD) between phenotypes and genetic markers covering the whole genome. Besides the genetic linkage between the genetic markers and the causal mutations, many other factors contribute to the LD, including selection and nonrandom mating formatting population structure. Many methods have been developed with accompany of corresponding software such as multiple loci mixed model (MLMM). There are software packages that implement multiple methods to reduce the learning curve. One of them is the Genomic Association and Prediction Integrated Tool (GAPIT), which implemented eight models including GLM (General Linear Model), Mixed Linear Model (MLM), Compressed MLM, MLMM, SUPER (Settlement of mixed linear models Under Progressively Exclusive Relationship), FarmCPU (Fixed and random model Circulating Probability Unification), and BLINK (Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway). Besides the availability of multiple models, GAPIT provides comprehensive functions for data quality control, data visualization, and publication-ready quality graphic outputs, such as Manhattan plots in rectangle and circle formats, quantile-quantile (QQ) plots, principal component plots, scatter plot of minor allele frequency against GWAS signals, plots of LD between associated markers and the adjacent markers. GAPIT developers and users established a community through the GAPIT forum ( https://groups.google.com/g/gapit-forum ) with over 600 members for asking questions, making comments, and sharing experiences. In this chapter, we detail the GAPIT functions, input data frame, output files, and example codes for each GWAS model. We also interpret parameters, functional algorithms, and modules of GAPIT implementation.
- Published
- 2022
18. Affordable High Throughput Field Detection of Wheat Stripe Rust Using Deep Learning with Semi-Automated Image Labeling
- Author
-
Zhou Tang, Meinan Wang, Michael Schirrmann, Karl-Heinz Dammer, Xianran Li, Robert Brueggeman, Sindhuja Sankaran, Arron H. Carter, Michael O. Pumphrey, Yang Hu, Xianming Chen, and Zhiwu Zhang
- Subjects
agricultural_sciences_agronomy ,Forestry ,Horticulture ,Agronomy and Crop Science ,Computer Science Applications - Abstract
Stripe rust (caused by Puccinia striiformis f. sp. tritici) is one of the most devastating diseases of wheat and causes large-scale epidemics and severe yield loss. Applying fungicides during early epidemic development is crucial to controlling the disease but is often challenged by resource-limited human visual scouting. Deep learning has the potential to process images and videos captured from affordable devices to empower high-throughput phenotyping for early detection of stripe rust for timely application of fungicides and improve control efficiency. Here, we developed RustNet, a neural network-based image classifier, for efficiently monitoring fields for stripe rust. RustNet was built on a ResNet-18 architecture pre-trained with ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) dataset using transfer learning. RGB images and videos of multiple wheat fields with different wheat types (winter and spring wheat), conditions (irrigated and non-irrigated), and locations were acquired using smartphones or unmanned aerial vehicles near the canopy. A semi-automated image labeling approach was conducted to improve labeling efficiency by combining automated machine labeling and human correction. Cross-validations across multiple categories (sensor platforms, wheat types, and locations) achieved Area Under Curve from 0.72 to 0.87. Independent validation on a published dataset from Germany achieved accuracies ranging from 0.79 to 0.86. The visualization of the last convolutional layer of RustNet demonstrated the identification of pixels with stripe rust. RustNet is freely available at https://zzlab.net/RustNet.
- Published
- 2022
19. Additive and Dominant Loci Jointly Pyramiding the Grain Quality of Hybrid Rice
- Author
-
Lanzhi Li, Xingfei Zheng, Jiabo Wang, Xueli Zhang, Xiaogang He, Liwen Xiong, Shufeng Song, Jing Su, Ying Diao, Zheming Yuan, Zhiwu Zhang, and Zhongli Hu
- Subjects
food and beverages - Abstract
Hybrid rice has an advantage in its heterosis, resulting in high yield and resistance to biotic and abiotic stress. However, genetic improvement of grain quality is more challenging in hybrid rice than in inbred rice due to additional complexity, such as the dominant effect. It is critical to identify a path to efficiently develop inbreds and identify their superior crosses for hybrid production. Here, we developed a pipeline for joint analysis of phenotypes, effects, and generations (JPEG) and analysed 113 inbred varieties as male parents, five tester varieties as female parents, and their 565 (113×5) hybrid testcrosses for 12 grain quality traits, including grain length and width, chalkiness, and amylose content. A total of 1,619,588 single nucleotide polymorphisms were obtained for the parent varieties and inferred for the hybrids using whole-genome sequencing with an average sequencing depth of 11×. Genome-wide association studies with JPEG identified 128 loci associated with at least one of the 12 traits. There were 44 and 97 loci with additive and dominant effects, respectively, including 13 overlaps. Among the 128 associated loci, 42 loci are located in or near (
- Published
- 2022
20. A new strategy for using historical imbalanced yield data to conduct genome-wide association studies and develop genomic prediction models for wheat breeding
- Author
-
Shannon Baker, Richard P. Metz, Kele Hui, Bryan E. Simoneaux, Qingwu Xue, Charles D. Johnson, Chenggen Chu, Amir M. H. Ibrahim, Kirk E. Jessup, Jackie C. Rudd, Shuyu Liu, Ming-Shun Chen, Geraldine Opena, Jason A. Baker, Ravindra N. Devkota, Shichen Wang, Haixiao Dong, Xiaoxiao Liu, and Zhiwu Zhang
- Subjects
Yield (finance) ,Genetics ,Genome-wide association study ,Computational biology ,Plant Science ,Biology ,Agronomy and Crop Science ,Molecular Biology ,Article ,Predictive modelling ,Biotechnology - Abstract
Using imbalanced historical yield data to predict performance and select new lines is an arduous breeding task. Genome-wide association studies (GWAS) and high throughput genotyping based on sequencing techniques can increase prediction accuracy. An association mapping panel of 227 Texas elite (TXE) wheat breeding lines was used for GWAS and a training population to develop prediction models for grain yield selection. An imbalanced set of yield data collected from 102 environments (year-by-location) over 10 years, through testing yield in 40–66 lines each year at 6–14 locations with 38–41 lines repeated in the test in any two consecutive years, was used. Based on correlations among data from different environments within two adjacent years and heritability estimated in each environment, yield data from 87 environments were selected and assigned to two correlation-based groups. The yield best linear unbiased estimation (BLUE) from each group, along with reaction to greenbug and Hessian fly in each line, was used for GWAS to reveal genomic regions associated with yield and insect resistance. A total of 74 genomic regions were associated with grain yield and two of them were commonly detected in both correlation-based groups. Greenbug resistance in TXE lines was mainly controlled by Gb3 on chromosome 7DL in addition to two novel regions on 3DL and 6DS, and Hessian fly resistance was conferred by the region on 1AS. Genomic prediction models developed in two correlation-based groups were validated using a set of 105 new advanced breeding lines and the model from correlation-based group G2 was more reliable for prediction. This research not only identified genomic regions associated with yield and insect resistance but also established the method of using historical imbalanced breeding data to develop a genomic prediction model for crop improvement. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11032-022-01287-8.
- Published
- 2022
21. As the number falls, alternatives to the Hagberg-Perten falling number method: A review
- Author
-
Yang Hu, Stephanie M. Sjoberg, Chunpen (James) Chen, Amber L. Hauvermale, Craig F. Morris, Stephen R. Delwiche, Ashley E. Cannon, Camille M. Steber, and Zhiwu Zhang
- Subjects
Seeds ,Starch ,Bread ,alpha-Amylases ,Edible Grain ,Triticum ,Food Science - Abstract
This review examines the application, limitations, and potential alternatives to the Hagberg-Perten falling number (FN) method used in the global wheat industry for detecting the risk of poor end-product quality mainly due to starch degradation by the enzyme α-amylase. By viscometry, the FN test indirectly detects the presence of α-amylase, the primary enzyme that digests starch. Elevated α-amylase results in low FN and damages wheat product quality resulting in cakes that fall, and sticky bread and noodles. Low FN can occur from preharvest sprouting (PHS) and late maturity α-amylase (LMA). Moist or rainy conditions before harvest cause PHS on the mother plant. Continuously cool or fluctuating temperatures during the grain filling stage cause LMA. Due to the expression of additional hydrolytic enzymes, PHS has a stronger negative impact than LMA. Wheat grain with low FN/high α-amylase results in serious losses for farmers, traders, millers, and bakers worldwide. Although blending of low FN grain with sound wheat may be used as a means of moving affected grain through the marketplace, care must be taken to avoid grain lots from falling below contract-specified FN. A large amount of sound wheat can be ruined if mixed with a small amount of sprouted wheat. The FN method is widely employed to detect α-amylase after harvest. However, it has several limitations, including sampling variability, high cost, labor intensiveness, the destructive nature of the test, and an inability to differentiate between LMA and PHS. Faster, cheaper, and more accurate alternatives could improve breeding for resistance to PHS and LMA and could preserve the value of wheat grain by avoiding inadvertent mixing of high- and low-FN grain by enabling testing at more stages of the value stream including at harvest, delivery, transport, storage, and milling. Alternatives to the FN method explored here include the Rapid Visco Analyzer, enzyme assays, immunoassays, near-infrared spectroscopy, and hyperspectral imaging.
- Published
- 2022
22. Detection of Breeding-Relevant Fruit Cracking and Fruit Firmness Quantitative Trait Loci in Sweet Cherry via Pedigree-Based and Genome-Wide Association Approaches
- Author
-
William Wesley Crump, Cameron Peace, Zhiwu Zhang, and Per McCord
- Subjects
food and beverages ,Plant Science - Abstract
Breeding for decreased fruit cracking incidence and increased fruit firmness in sweet cherry creates an attractive alternative to variable results from cultural management practices. DNA-informed breeding increases its efficiency, yet upstream research is needed to identify the genomic regions associated with the trait variation of a breeding-relevant magnitude, as well as to identify the parental sources of favorable alleles. The objectives of this research were to identify the quantitative trait loci (QTLs) associated with fruit cracking incidence and firmness, estimate the effects of single nucleotide polymorphism (SNP) haplotypes at the detected QTLs, and identify the ancestral source(s) of functional haplotypes. Fruit cracking incidence and firmness were evaluated for multiple years on 259 unselected seedlings representing 22 important breeding parents. Phenotypic data, in conjunction with genome-wide genotypic data from the RosBREED cherry 6K SNP array, were used in the QTL analysis performed via Pedigree-Based Analysis using the FlexQTL™ software, supplemented by a Genome-Wide Association Study using the BLINK software. Haplotype analysis was conducted on the QTLs to identify the functional SNP haplotypes and estimate their phenotypic effects, and the haplotypes were tracked through the pedigree. Four QTLs (two per trait) were consistent across the years and/or both analysis methods and validated the previously reported QTLs. qCrack-LG1.1m (the label given to a consistent QTL for cracking incidence on chromosome 1) explained 2–15.1% of the phenotypic variance, while qCrack-LG5.1m, qFirm-LG1.2m, and qFirm-LG3.2m explained 7.6–13.8, 8.8–21.8, and 1.7–10.1% of the phenotypic variance, respectively. At each QTL, at least two SNP haplotypes had significant effects and were considered putative functional SNP haplotypes. Putative low-cracking SNP haplotypes were tracked to an unnamed parent of ‘Emperor Francis’ and ‘Schmidt’ and unnamed parents of ‘Napoleon’ and ‘Hedelfingen,’ among others, and putative high-firmness haplotypes were tracked to an unnamed parent of ‘Emperor Francis’ and ‘Schmidt,’ an unnamed grandparent of ‘Black Republican,’ ‘Rube,’ and an unknown parent of ‘Napoleon.’ These four stable QTLs can now be targeted for DNA test development, with the goal of translating information discovered here into usable tools to aid in breeding decisions.
- Published
- 2022
23. Assessment of the Potential for Genomic Selection To Improve Husk Traits in Maize
- Author
-
Ao Zhang, Yan He, Haixiao Dong, Zhiwu Zhang, Zhenhai Cui, and Yanye Ruan
- Subjects
0106 biological sciences ,Quantitative Trait Loci ,Population ,QH426-470 ,Best linear unbiased prediction ,Biology ,Quantitative trait locus ,maize ,Polymorphism, Single Nucleotide ,Zea mays ,01 natural sciences ,Husk ,genomic selection ,genomic ,03 medical and health sciences ,Inbred strain ,shared data resources ,prediction accuracy ,Genetics ,Humans ,genpred ,Selection, Genetic ,education ,Molecular Biology ,Genetics (clinical) ,Selection (genetic algorithm) ,gapit ,030304 developmental biology ,0303 health sciences ,education.field_of_study ,Models, Genetic ,gblup ,business.industry ,population structure ,Genomics ,prediction ,Marker-assisted selection ,husk ,Biotechnology ,Phenotype ,Stalk ,Genomic Prediction ,breeding ,marker assisted selection ,rrblup ,business ,010606 plant biology & botany - Abstract
Husk has multiple functions such as protecting ears from diseases, infection, and dehydration during development. Additionally, husks comprised of fewer, shorter, thinner, and narrower layers allow faster moisture evaporation of kernels prior to harvest. Intensive studies have been conducted to identify appropriate husk architecture by understanding the genetic basis of related traits, including husk length, husk layer number, husk thickness, and husk width. However, marker-assisted selection is inefficient because the identified quantitative trait loci and associated genetic loci could only explain a small proportion of total phenotypic variation. Genomic selection (GS) has been used successfully on many species including maize on other traits. Thus, the potential of using GS for husk traits to directly identify superior inbred lines, without knowing the specific underlying genetic loci, is well worth exploring. In this study, we compared four GS models on a maize association population with 498 inbred lines belonging to four subpopulations, including 27 lines in stiff stalk, 67 lines in non-stiff stalk, 193 lines in tropical-subtropical, and 211 lines in mixture subpopulations. Genomic Best Linear Unbiased Prediction with principal components as cofactor, performed the best and was selected to examine the impact of interaction between sampling proportions and subpopulations. We found that predictions on inbred lines in a subpopulation were benefited from excluding individuals from other subpopulations for training if the training population within the subpopulation was large enough. Husk thickness exhibited the highest prediction accuracy among all husk traits. These results gave strategic insight to improve husk architecture.
- Published
- 2020
24. Identification of Stripe Rust Resistance Loci in U.S. Spring Wheat Cultivars and Breeding Lines Using Genome-Wide Association Mapping and Yr Gene Markers
- Author
-
Zhiwu Zhang, Lu Liu, Meinan Wang, Xianming Chen, and Deven R. See
- Subjects
0106 biological sciences ,0301 basic medicine ,Genetics ,education.field_of_study ,fungi ,Population ,food and beverages ,Genome-wide association study ,Plant Science ,Biology ,Quantitative trait locus ,01 natural sciences ,Genome ,03 medical and health sciences ,030104 developmental biology ,Genetic marker ,Genotype ,Cultivar ,education ,Agronomy and Crop Science ,Genotyping ,010606 plant biology & botany - Abstract
Stripe rust, caused by Puccinia striiformis f. sp. tritici (Pst), poses a major threat to wheat production worldwide, especially in the United States. To identify loci for effective stripe rust resistance in U.S. wheat, a genome-wide association study (GWAS) was conducted using a panel of 616 spring wheat cultivars and breeding lines. The accessions in this panel were phenotyped for stripe rust response in the greenhouse at seedling stage with five predominant and highly virulent races of Pst and in different field environments at adult-plant stage in 2017 and 2018. In total, 2,029 single-nucleotide polymorphism markers that cover the whole genome were generated with genotyping by multiplexed sequencing and used in GWAS. In addition, 23 markers of previously reported resistance genes or quantitative trait loci (QTLs) were used to genotype the population. This spring panel was grouped into three subpopulations based on principal component analysis. A total of 37 genes or QTLs including 10 potentially new QTLs for resistance to stripe rust were detected by GWAS and linked marker tests. The frequencies of the resistance genes or QTLs in various nurseries were determined, indicating different intensities of these genes or QTLs used in breeding programs of different regions. These resistance loci and the information on their markers, effectiveness, and distributions should be useful for improving stripe rust resistance in wheat cultivars.
- Published
- 2020
25. Genome-Wide Identification and Expression Analysis of bZIP Transcription Factors under Abiotic Stress in Alfalfa (Medicago sativa L.)
- Author
-
Atit Parajuli, Karen Sanguinet, and Zhiwu Zhang
- Subjects
fungi ,food and beverages - Abstract
Alfalfa (Medicago sativa L.) is the most cultivated forage legume around the world. Under a variety of growing conditions, forage yield in alfalfa is stymied by biotic and abiotic stresses including heat, salt, drought, and disease. Given the sessile nature of plants, they use strategies such as differential gene expression to respond to environmental cues. Transcription factors control the expression of genes that contribute to or enable tolerance and survival during periods of stress. Basic-leucine zipper (bZIP) transcription factors have been demonstrated to play a critical role in regulating plant growth and development as well as mediate the responses to abiotic stress in several species, including Arabidopsis thaliana, Oryza sativa, Lotus japonicus and Medicago truncatula. However, there is little information about bZIP transcription factors in cultivated alfalfa. In the present study, 237 bZIP genes were identified in alfalfa from publicly available sequencing data. Multiple sequence alignments showed the presence of intact bZIP motifs in the identified sequences. Based on previous phylogenetic analyses in Arabidopsis thaliana, alfalfa bZIPs were similarly divided and fell into 10 groups. The physico-chemical properties, motif analysis and phylogenetic study of the alfalfa bZIPs revealed high specificity within groups. The differential expression of alfalfa bZIPs in a suite of tissues indicates that particular bZIP genes are specifically expressed at different developmental stages in alfalfa. Similarly, expression analysis in response to ABA, cold, drought and salt stresses, indicates that a subset of bZIP genes are also differentially expressed and likely play a role in abiotic stress signaling and/or tolerance. However, further functional characterization of bZIP transcription factors in alfalfa will help illuminate the role they play in stress tolerance mechanisms in legumes and facilitate the molecular breeding of stress tolerance in alfalfa.
- Published
- 2022
26. Genome-Wide Identification and Expression Analysis of bZIP Transcription Factors under Abiotic Stress in Alfalfa (Medicago sativa L.)
- Author
-
Atit Parajuli, Karen A. Sanguinet, and Zhiwu Zhang
- Subjects
fungi ,food and beverages ,plant_sciences - Abstract
Background Alfalfa (Medicago sativa L.) is the most cultivated forage legume around the world. Under a variety of growing conditions, forage yield in alfalfa is stymied by biotic and abiotic stresses including heat, salt, drought, and disease. Given the sessile nature of plants, they use strategies such as differential gene expression to respond to environmental cues. Transcription factors control the expression of genes that contribute to or enable tolerance and survival during periods of stress. Basic-leucine zipper (bZIP) transcription factors have been demonstrated to play a critical role in regulating plant growth and development as well as mediate the responses to abiotic stress in several species, including Arabidopsis thaliana, Oryza sativa, Lotus japonicus and Medicago truncatula. However, there is little information about bZIP transcription factors in cultivated alfalfa. ResultIn the present study, 237 bZIP genes were identified in alfalfa from publicly available sequencing data. Multiple sequence alignments showed the presence of intact bZIP motifs in the identified sequences. Based on previous phylogenetic analyses in Arabidopsis thaliana, alfalfa bZIPs were similarly divided and fell into 10 groups. The physico-chemical properties, motif analysis and phylogenetic study of the alfalfa bZIPs revealed high specificity within groups. The differential expression of alfalfa bZIPs in a suite of tissues indicates that bZIP genes are specifically expressed at different developmental stages in alfalfa. Similarly, expression analysis in response to ABA, cold, drought and salt stresses, indicates that a subset of bZIP genes are also differentially expressed and likely play a role in abiotic stress signaling and/or tolerance.ConclusionsTaken together, this work provides a framework for the future study of bZIPs in alfalfa and presents candidate bZIPs involved in stress-response signaling.
- Published
- 2022
27. Performing Genome-Wide Association Studies with Multiple Models Using GAPIT
- Author
-
Jiabo Wang, You Tang, and Zhiwu Zhang
- Published
- 2022
28. Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies
- Author
-
Jiabo Wang, Jianming Yu, Alexander E. Lipka, and Zhiwu Zhang
- Published
- 2022
29. Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs
- Author
-
Karansher S. Sandhu, Lance F. Merrick, Sindhuja Sankaran, Zhiwu Zhang, and Arron H. Carter
- Subjects
genetic gain ,machine and deep learning ,root phenomics ,Genetics ,genomics ,high throughput phenotyping ,plant breeding ,Molecular Medicine ,Review ,QH426-470 ,Genetics (clinical) - Abstract
The last decade witnessed an unprecedented increase in the adoption of genomic selection (GS) and phenomics tools in plant breeding programs, especially in major cereal crops. GS has demonstrated the potential for selecting superior genotypes with high precision and accelerating the breeding cycle. Phenomics is a rapidly advancing domain to alleviate phenotyping bottlenecks and explores new large-scale phenotyping and data acquisition methods. In this review, we discuss the lesson learned from GS and phenomics in six self-pollinated crops, primarily focusing on rice, wheat, soybean, common bean, chickpea, and groundnut, and their implementation schemes are discussed after assessing their impact in the breeding programs. Here, the status of the adoption of genomics and phenomics is provided for those crops, with a complete GS overview. GS’s progress until 2020 is discussed in detail, and relevant information and links to the source codes are provided for implementing this technology into plant breeding programs, with most of the examples from wheat breeding programs. Detailed information about various phenotyping tools is provided to strengthen the field of phenomics for a plant breeder in the coming years. Finally, we highlight the benefits of merging genomic selection, phenomics, and machine and deep learning that have resulted in extraordinary results during recent years in wheat, rice, and soybean. Hence, there is a potential for adopting these technologies into crops like the common bean, chickpea, and groundnut. The adoption of phenomics and GS into different breeding programs will accelerate genetic gain that would create an impact on food security, realizing the need to feed an ever-growing population.
- Published
- 2022
30. ROOSTER: An image labeler and classifier through interactive recurrent annotation
- Author
-
Yang Hu, Zhiwu Zhang, and Zhou Tang
- Subjects
General Immunology and Microbiology ,General Medicine ,General Pharmacology, Toxicology and Pharmaceutics ,General Biochemistry, Genetics and Molecular Biology - Abstract
A large amount of training data is usually lacking at the beginning of system development and labeling such a large number of RGB (red, green, blue) images is laborious. Interactive recurrent annotation is beneficial to incrementally gain training images in the stream of the system development and provides an opportunity to reduce human workload. We developed a software package, ROOSTER, to integrate both labeling and prediction in a single user-friendly graphic user interface with interactive deep learning to reduce the laborious human labeling for fast development of machine vision systems. Predictions can be performed under both single-image mode and batch mode for multiple images. The prediction results can be used as the initial image labeling and manually adjusted under a single image mode. Human labeling and machine predictions are visualized on the same image. ROOSTER provides fully automatic labeling for abundantly available initial images of wheat stripe rust to gain essential predictability. The navigation of integrating prediction with labeling benefits human adjustment to iteratively improve predictability. The development of a detection system for wheat stripe rust was presented as a use case to demonstrate the efficiency of using interactive deep learning to develop machine vision systems.
- Published
- 2023
31. The study of the Spatholobus decoction on improving the maximal oxygen uptake of track and field athletes
- Author
-
Jiacheng Feng, Qimeng Niu, and Zhiwu Zhang
- Subjects
Environmental Engineering ,Industrial and Manufacturing Engineering - Abstract
Objective: It aims to observe the effect of Spatholobus decoction on the maximum oxygen uptake of athletes.This is of great significance to improve the endurance of athletes. Methods: Male sprint athletes in sports schools were randomly divided into a control group, and three experimental groups. The experiment period lasted for four weeks, and four groups ran 5,000 meters once a day, including warm-up and training. The second and fourth groups were given suberect Spatholobus decoction at a ratio of 2ml/kg according to the body weight of the participants. The control group and the third group received the same dose of water. Fasting blood biochemical indexes were determined at 8 a.m. the day before and the day after the experiment. The VO2 maximum load test was carried out on the same day. Half a year later, the same experiment was done by switching the control group and the fourth group, and the second group switched to the third group. Statistical data are collected and analyzed using SPSS analysis systemResults -(1) The correlation values of red blood cells in the second group had an obvious decreasing trend (P
- Published
- 2023
32. VTag: a semi-supervised pipeline for tracking pig activity with a single top-view camera
- Author
-
Chun-Peng J Chen, Gota Morota, Kiho Lee, Zhiwu Zhang, and Hao Cheng
- Subjects
Farms ,Agricultural and Veterinary Sciences ,Dairy & Animal Science ,Swine ,Video Recording ,General Medicine ,RGB camera ,Biological Sciences ,computer vision ,pig activity ,Artificial Intelligence ,Genetics ,Animals ,Animal Science and Zoology ,object tracking ,Software ,Food Science - Abstract
Precision livestock farming has become an important research focus with the rising demand of meat production in the swine industry. Currently, the farming practice is widely conducted by the technology of computer vision (CV), which automates monitoring pig activity solely based on video recordings. Automation is fulfilled by deriving imagery features that can guide CV systems to recognize animals' body contours, positions, and behavioral categories. Nevertheless, the performance of the CV systems is sensitive to the quality of imagery features. When the CV system is deployed in a variable environment, its performance may decrease as the features are not generalized enough under different illumination conditions. Moreover, most CV systems are established by supervised learning, in which intensive effort in labeling ground truths for the training process is required. Hence, a semi-supervised pipeline, VTag, is developed in this study. The pipeline focuses on long-term tracking of pig activity without requesting any pre-labeled video but a few human supervisions to build a CV system. The pipeline can be rapidly deployed as only one top-view RGB camera is needed for the tracking task. Additionally, the pipeline was released as a software tool with a friendly graphical interface available to general users. Among the presented datasets, the average tracking error was 17.99 cm. Besides, with the prediction results, the pig moving distance per unit time can be estimated for activity studies. Finally, as the motion is monitored, a heat map showing spatial hot spots visited by the pigs can be useful guidance for farming management. The presented pipeline saves massive laborious work in preparing training dataset. The rapid deployment of the tracking system paves the way for pig behavior monitoring. Lay Summary Collecting detailed measurements of animals through cameras has become an important focus with the rising demand for meat production in the swine industry. Currently, researchers use computational approaches to train models to recognize pig morphological features and monitor pig behaviors automatically. Though little human effort is needed after model training, current solutions require a large amount of pre-selected images for the training process, and the expensive preparation work is difficult for many farms to implement such practice. Hence, a pipeline, VTag, is presented to address these challenges in our study. With few supervisions, VTag can automatically track positions of multiple pigs from one single top-view RGB camera. No pre-labeled images are required to establish a robust pig tracking system. Additionally, the pipeline was released as a software tool with a friendly graphical user interface, that is easy to learn for general users. Among the presented datasets, the average tracking error is 17.99 cm, which is shorter than one-third of the pig body length in the study. The estimated pig activity from VTag can serve as useful farming guidance. The presented strategy saves massive laborious work in preparing training datasets and setting up monitoring environments. The rapid deployment of the tracking system paves the way for pig behavior monitoring. The presented pipeline, VTag, saves massive laborious work in preparing labeled training datasets and setting up environment for pig tracking tasks. VTag can be deployed rapidly and paves the way for pig behavior monitoring. USDA-NIFA [202067030-31339, 2020-67021-32460] Published version This study was supported by the USDA-NIFA grant 202067030-31339 and 2020-67021-32460
- Published
- 2021
33. Detection of Breeding-Relevant Fruit Cracking and Fruit Firmness Quantitative Trait Loci in Sweet Cherry
- Author
-
William Wesley, Crump, Cameron, Peace, Zhiwu, Zhang, and Per, McCord
- Abstract
Breeding for decreased fruit cracking incidence and increased fruit firmness in sweet cherry creates an attractive alternative to variable results from cultural management practices. DNA-informed breeding increases its efficiency, yet upstream research is needed to identify the genomic regions associated with the trait variation of a breeding-relevant magnitude, as well as to identify the parental sources of favorable alleles. The objectives of this research were to identify the quantitative trait loci (QTLs) associated with fruit cracking incidence and firmness, estimate the effects of single nucleotide polymorphism (SNP) haplotypes at the detected QTLs, and identify the ancestral source(s) of functional haplotypes. Fruit cracking incidence and firmness were evaluated for multiple years on 259 unselected seedlings representing 22 important breeding parents. Phenotypic data, in conjunction with genome-wide genotypic data from the RosBREED cherry 6K SNP array, were used in the QTL analysis performed
- Published
- 2021
34. Human Breast Extracellular Matrix Microstructures and Protein Hydrogel 3D Cultures of Mammary Epithelial Cells
- Author
-
Chandler R. Keller, Roland K. Chen, Weimin Li, Barry T. Kahn, Zhiwu Zhang, Anika E. VanDeen, Steve R. Martinez, Kelsey F. Ruud, and Yang Hu
- Subjects
3D culture ,Cancer Research ,extracellular matrix ,Immunofluorescence staining ,Article ,Extracellular matrix ,Breast cancer ,breast cancer ,breast tissue ,structure ,hydrogel ,microenvironment ,machine learning ,acini ,morphology ,medicine ,RC254-282 ,Chemistry ,Neoplasms. Tumors. Oncology. Including cancer and carcinogens ,medicine.disease ,Invasive ductal carcinoma ,Cell biology ,Oncology ,Native tissue ,Self-healing hydrogels ,Human breast ,Type I collagen - Abstract
Simple Summary Human breast tissue extracellular matrix (ECM) is a microenvironment essential for the survival and biological activities of mammary epithelial cells. The ECM structural features of human breast tissues remain poorly defined. In this study, we identified the structural and mechanical properties of human normal breast and invasive ductal carcinoma tissue ECM using histological methods and atomic force microscopy. Additionally, a protein hydrogel was generated using human breast tissue ECM and defined for its microstructural features using immunofluorescence imaging and machine learning. Furthermore, we examined the three-dimensional growth of normal mammary epithelial cells or breast cancer cells cultured on the ECM protein hydrogel, where the cells exhibited biological phenotypes like those seen in native breast tissues. Our data provide novel insights into cancer cell biology, tissue microenvironment mimicry and engineering, and native tissue ECM-based biomedical and pharmaceutical applications. Abstract Tissue extracellular matrix (ECM) is a structurally and compositionally unique microenvironment within which native cells can perform their natural biological activities. Cells grown on artificial substrata differ biologically and phenotypically from those grown within their native tissue microenvironment. Studies examining human tissue ECM structures and the biology of human tissue cells in their corresponding tissue ECM are lacking. Such investigations will improve our understanding about human pathophysiological conditions for better clinical care. We report here human normal breast tissue and invasive ductal carcinoma tissue ECM structural features. For the first time, a hydrogel was successfully fabricated using whole protein extracts of human normal breast ECM. Using immunofluorescence staining of type I collagen (Col I) and machine learning of its fibrous patterns in the polymerized human breast ECM hydrogel, we have defined the microstructural characteristics of the hydrogel and compared the microstructures with those of other native ECM hydrogels. Importantly, the ECM hydrogel supported 3D growth and cell-ECM interaction of both normal and cancerous mammary epithelial cells. This work represents further advancement toward full reconstitution of the human breast tissue microenvironment, an accomplishment that will accelerate the use of human pathophysiological tissue-derived matrices for individualized biomedical research and therapeutic development.
- Published
- 2021
- Full Text
- View/download PDF
35. Trait Association and Prediction Through Integrative K-mer Analysis
- Author
-
Yangfan Hao, Jinliang Yang, Jacob D. Washburn, Sanzhen Liu, Cheng He, and Zhiwu Zhang
- Subjects
Candidate gene ,k-mer ,Kernel (statistics) ,food and beverages ,SNP ,Single-nucleotide polymorphism ,Genome-wide association study ,Phenotypic trait ,Computational biology ,Biology ,Gene - Abstract
Genome-wide association study (GWAS) with single nucleotide polymorphisms (SNPs) has been widely used to explore genetic controls of phenotypic traits. Here we employed an GWAS approach using k-mers, short substrings from sequencing reads. Using maize cob and kernel color traits, we demonstrated that k-mer GWAS can effectively identify associated k-mers. Co-expression analysis of kernel color k-mers and pathway genes directly found k-mers from causal genes. Analyzing complex traits of kernel oil and leaf angle resulted in k-mers from both known and candidate genes. Evolution analysis revealed most k-mers positively correlated with kernel oil were strongly selected against in maize populations, while most k-mers for upright leaf angle were positively selected. In addition, phenotypic prediction of kernel oil, leaf angle, and flowering time using k-mer data showed at least a similarly high prediction accuracy to the standard SNP-based method. Collectively, our results demonstrated the bridging role of k-mers for data integration and functional gene discovery.
- Published
- 2021
36. Whole-Genome Sequencing on 220 Alfalfa (Medicago sativa L.) Accessions Identified Loci Associated with Fall Dormancy
- Author
-
Fan Zhang, Yan Sun, Qingchuan Yang, Mingna Li, Changfu Yang, Jie Kong, Xueqian Jiang, Xijiang Yang, Junmei Kang, Fei He, Ruicai Long, Zhen Wang, and Zhiwu Zhang
- Subjects
Whole genome sequencing ,Genetics ,fungi ,food and beverages ,Dormancy ,Medicago sativa ,Biology - Abstract
Fall dormancy (FD) is one of the most important traits of alfalfa (Medicago sativa) for cultivar selection to overcome winter damage. Regrowth plant height following autumn clipping is an indirect way to evaluate FD. Although transcriptomics, proteomics analysis, and QTL mapping have revealed some important genes correlated with FD, the genetic architecture of this trait is still unclear. There are no applicable genes or markers for selection, which hinders progress in the genetic research and molecular breeding for the trait. We conducted whole-genome sequencing (WGS) on 220 alfalfa accessions at 10x depth. Among the 875,023 SNPs, seven of them were associated with FD using genome-wide association study (GWAS). One SNP located on chromosome 6 is in linkage disequilibrium with dehydration-responsive element-binding protein 1C (DREB1C). Furthermore, seven DREB genes are clustered in this region, one of which has previously been shown to enhance freezing tolerance in the model plant Medicago truncatula. The candidate genes uncovered by our research will benefit the transgenic and CRISPR-Cas9 research of FD in alfalfa. This gene will also be useful for marker development and assisted selection of FD for alfalfa.
- Published
- 2021
37. Comparison of Single-Trait and Multi-Trait Genome-Wide Association Models and Inclusion of Correlated Traits in the Dissection of the Genetic Architecture of a Complex Trait in a Breeding Program
- Author
-
A. B. Burke, Zhiwu Zhang, Arron H. Carter, and Lance F. Merrick
- Subjects
education.field_of_study ,Candidate gene ,Breeding program ,Population ,Chromosome ,Plant culture ,Genome-wide association study ,pleiotropic effects ,Plant Science ,Biology ,covariates ,Genetic architecture ,multi-locus model ,SB1-1110 ,Evolutionary biology ,reduced height alleles ,seedling emergence ,Trait ,single-locus model ,Association mapping ,education - Abstract
Traits with an unknown genetic architecture make it difficult to create a useful bi-parental mapping population to characterize the genetic basis of the trait due to a combination of complex and pleiotropic effects. Seedling emergence of wheat (Triticum aestivum L.) from deep planting is a vital factor affecting stand establishment and grain yield, has a poorly understood genetic architecture, and is historically correlated with coleoptile length. The creation of bi-parental mapping populations can be overcome by using genome-wide association studies (GWAS). This study aimed to dissect the genetic architecture of seedling emergence while accounting for correlated traits using one multi-trait GWAS model (MT-GWAS) and three single-trait GWAS models (ST-GWAS) with the inclusion of covariates for correlated traits. The ST-GWAS models included one single locus model (MLM), and two multiple loci models (FarmCPU and BLINK). We conducted the GWAS using two populations, the first consisting of 473 varieties from a diverse association mapping panel (DP) phenotyped from 2015-2019, and the other population used as a validation population consisting of 279 breeding lines (BL) phenotyped in 2015 in Lind, WA, with 40,368 markers. We also compared the inclusion of coleoptile length and markers associated with reduced height as covariates in our ST-GWAS models for the DP. ST-GWAS found 107 significant markers across 19 chromosomes, while MT-GWAS found 82 significant markers across 14 chromosomes. MT-GWAS models were able to identify large-effect markers on chromosome 5A. FarmCPU and BLINK models were able to identify many small effect markers, and the inclusion of covariates helped to identify the large effect markers on chromosome 5A. Therefore, by using multi-locus models combined with pleiotropic covariates, breeding programs can uncover the complex nature of traits to help identify candidate genes and the underlying architecture of a trait, such as seedling emergence of deep-sown winter wheat.
- Published
- 2021
38. Book Review: International Perspectives on CLIL
- Author
-
Xiaolan He and Zhiwu Zhang
- Subjects
multilingualism ,CLIL ,Professional development ,bilingualism ,Book Review ,BF1-990 ,Content and language integrated learning ,Pedagogy ,Psychology ,Multilingualism ,content and language integrated learning ,Neuroscience of multilingualism ,General Psychology ,professional development - Published
- 2021
39. Book Review: Positive Psychology in Second and Foreign Language Education
- Author
-
Zhiwu Zhang and Jinfen Xu
- Subjects
foreign language education ,second language education ,positive psychology ,Foreign language ,positive emotions ,Second-language acquisition ,positive language education ,Linguistics ,Book Review ,BF1-990 ,Psychology ,second language acquisition ,Positive psychology ,General Psychology - Published
- 2021
40. Book Review: Positive Psychology: The Basics
- Author
-
Zhiwu Zhang
- Subjects
Psychoanalysis ,media_common.quotation_subject ,positive psychology ,positive emotions ,Book Review ,optimism ,BF1-990 ,Optimism ,well-being ,positive relations ,flow ,Psychology ,Positive psychology ,Grit ,grit ,General Psychology ,media_common - Published
- 2021
41. E183K Mutation in Chalcone Synthase C2 Causes Protein Aggregation and Maize Colorless
- Author
-
Shengzhong Su, Xiaohui Shan, Hongkui Liu, Nan Jiang, He Li, Haixiao Dong, Xuyang Wu, Yaping Yuan, Shipeng Li, Yingjie Xue, and Zhiwu Zhang
- Subjects
0106 biological sciences ,Chalcone synthase ,prokaryotic expression ,Mutant ,Plant Science ,medicine.disease_cause ,maize ,01 natural sciences ,transient expression ,SB1-1110 ,03 medical and health sciences ,Arabidopsis ,medicine ,flavonoid biosynthesis ,Gene ,030304 developmental biology ,Original Research ,0303 health sciences ,Mutation ,ethyl methyl sulfone ,biology ,Mutagenesis ,fungi ,food and beverages ,Plant culture ,biology.organism_classification ,colorless mutant ,Transformation (genetics) ,Flavonoid biosynthesis ,Biochemistry ,biology.protein ,MutMap ,010606 plant biology & botany - Abstract
Flavonoids give plants their rich colors and play roles in a number of physiological processes. In this study, we identified a novel colorless maize mutant showing reduced pigmentation throughout the whole life cycle by EMS mutagenesis. E183K mutation in maize chalcone synthase C2 (ZmC2) was mapped using MutMap strategy as the causal for colorless, which was further validated by transformation in Arabidopsis. We evaluated transcriptomic and metabolic changes in maize first sheaths caused by the mutation. The downstream biosynthesis was blocked while very few genes changed their expression pattern. ZmC2-E183 site is highly conserved in chalcone synthase among Plantae kingdom and within species’ different varieties. Through prokaryotic expression, transient expression in maize leaf protoplasts and stable expression in Arabidopsis, we observed that E183K and other mutations on E183 could cause almost complete protein aggregation of chalcone synthase. Our findings will benefit the characterization of flavonoid biosynthesis and contribute to the body of knowledge on protein aggregation in plants.
- Published
- 2021
42. Identification of loci controlling adaptation in Chinese soya bean landraces via a combination of conventional and bioclimatic GWAS
- Author
-
Zhangxiong Liu, Yu Tian, Patrick S. Schnable, Ruzhen Chang, Delin Li, Jochen C. Reif, Huihui Li, Xinan Zhou, Liu Bin, Yinghui Li, Rongxia Guan, James C. Schnable, Lijuan Qiu, Yanfei Li, Zhiwu Zhang, Yong Guo, Qingbo You, Huilong Hong, Huai-zhu Chen, Lijuan Zhang, Yong‐qing Jiao, and Ting Zhang
- Subjects
0106 biological sciences ,0301 basic medicine ,Linkage disequilibrium ,Genotype ,bioclimatic variable ,Population ,Locus (genetics) ,adaptation ,Plant Science ,Quantitative trait locus ,Genes, Plant ,Polymorphism, Single Nucleotide ,01 natural sciences ,Linkage Disequilibrium ,03 medical and health sciences ,soya bean landrace ,Domestication ,education ,Alleles ,Research Articles ,Local adaptation ,Genetic diversity ,education.field_of_study ,biology ,food and beverages ,flowering time ,biology.organism_classification ,Adaptation, Physiological ,associated SNP ,030104 developmental biology ,Evolutionary biology ,Soybeans ,Glycine soja ,Agronomy and Crop Science ,Genome-Wide Association Study ,Research Article ,010606 plant biology & botany ,Biotechnology - Abstract
Summary Landraces often contain genetic diversity that has been lost in modern cultivars, including alleles that confer enhanced local adaptation. To comprehensively identify loci associated with adaptive traits in soya bean landraces, for example flowering time, a population of 1938 diverse landraces and 97 accessions of the wild progenitor of cultivated soya bean, Glycine soja was genotyped using tGBS®. Based on 99 085 high‐quality SNPs, landraces were classified into three sub‐populations which exhibit geographical genetic differentiation. Clustering was inferred from STRUCTURE, principal component analyses and neighbour‐joining tree analyses. Using phenotypic data collected at two locations separated by 10 degrees of latitude, 17 trait‐associated SNPs (TASs) for flowering time were identified, including a stable locus Chr12:5914898 and previously undetected candidate QTL/genes for flowering time in the vicinity of the previously cloned flowering genes, E1 and E2. Using passport data associated with the collection sites of the landraces, 27 SNPs associated with adaptation to three bioclimatic variables (temperature, daylength, and precipitation) were identified. A series of candidate flowering genes were detected within linkage disequilibrium (LD) blocks surrounding 12 bioclimatic TASs. Nine of these TASs exhibit significant differences in flowering time between alleles within one or more of the three individual sub‐populations. Signals of selection during domestication and/or subsequent landrace diversification and adaptation were detected at 38 of the 44 flowering and bioclimatic TASs. Hence, this study lays the groundwork to begin breeding for novel environments predicted to arise following global climate change.
- Published
- 2019
43. Screening of NogoA/NTR-related differential genes in rat sciatic nerve injury signal pathway
- Author
-
Shengjun Yu, Zhiwu Zhang, Mingchun Luan, Junjie Jiang, Fei Gao, Zhenjie Ma, and Yuanchen Yu
- Subjects
0301 basic medicine ,MAPK/ERK pathway ,Nogo Proteins ,Apoptosis ,Receptors, Nerve Growth Factor ,030105 genetics & heredity ,Receptor, Nerve Growth Factor ,Rats, Sprague-Dawley ,03 medical and health sciences ,0302 clinical medicine ,Downregulation and upregulation ,Peripheral Nerve Injuries ,Neurotrophic factors ,medicine ,Animals ,Receptor, trkB ,Pharmacology ,biology ,General Medicine ,Sciatic nerve injury ,medicine.disease ,Sciatic Nerve ,Nerve Regeneration ,Rats ,Cell biology ,030220 oncology & carcinogenesis ,LDL receptor ,biology.protein ,Neuralgia ,Molecular Medicine ,Schwann Cells ,Sciatic Neuropathy ,Transcriptome ,Interleukin 1 receptor, type I ,Signal Transduction ,Neurotrophin - Abstract
Aim: To screen the differential genes in NogoA/NTR-related pathways that associate with sciatic nerve injury. Results: There was no difference in the expression of NogoA, NTR and Ntrk2. Differential genes existed in 11 differential pathways that include NogoA, NTR and Ntrk2. Pathways closely related to sciatic nerve injury are MAPK, endophagocytosis, apoptosis, neurotrophin signaling and inflammatory mediators. NTRK1, FASLG, LDLR ADRB1 and HTR2A in model rats were downregulated compared with control rats, IL1R1, CSF1R, BCL2L1 and HRH1 in model rats were upregulated compared with control rats. Conclusion: MAPK, endophagocytic, apoptotic, neurotrophic factor and inflammatory mediators of ductal mediators may be involved in the sciatic nerve injury in rats. The differentially expressed genes in these pathways may play important roles in sciatic nerve injury.
- Published
- 2019
44. GridFree: a python package of imageanalysis for interactive grain counting and measuring
- Author
-
Yang Hu and Zhiwu Zhang
- Subjects
0106 biological sciences ,0301 basic medicine ,Physiology ,Computer science ,Plant Science ,01 natural sciences ,Plot (graphics) ,Set (abstract data type) ,03 medical and health sciences ,Software ,Genetics ,Image Processing, Computer-Assisted ,Research Articles ,computer.programming_language ,business.industry ,Botany ,Pattern recognition ,Python (programming language) ,Crop Production ,030104 developmental biology ,Kernel (statistics) ,Outlier ,Seeds ,Unsupervised learning ,Artificial intelligence ,Noise (video) ,business ,Edible Grain ,computer ,010606 plant biology & botany - Abstract
Grain characteristics, including kernel length, kernel width, and thousand kernel weight, are critical component traits for grain yield. Manual measurements and counting are expensive, forming the bottleneck for dissecting these traits’ genetic architectures toward ultimate yield improvement. High-throughput phenotyping methods have been developed by analyzing images of kernels. However, segmenting kernels from the image background and noise artifacts or from other kernels positioned in close proximity remain as challenges. In this study, we developed a software package, named GridFree, to overcome these challenges. GridFree uses an unsupervised machine learning approach, K-Means, to segment kernels from the background by using principal component analysis on both raw image channels and their color indices. GridFree incorporates users’ experiences as a dynamic criterion to set thresholds for a divide-and-combine strategy that effectively segments adjacent kernels. When adjacent multiple kernels are incorrectly segmented as a single object, they form an outlier on the distribution plot of kernel area, length, and width. GridFree uses the dynamic threshold settings for splitting and merging. In addition to counting, GridFree measures kernel length, width, and area with the option of scaling with a reference object. Evaluations against existing software programs demonstrated that GridFree had the smallest error on counting seeds for multiple crop species. GridFree was implemented in Python with a friendly graphical user interface to allow users to easily visualize the outcomes and make decisions, which ultimately eliminates time-consuming and repetitive manual labor. GridFree is freely available at the GridFree website (https://zzlab.net/GridFree).
- Published
- 2021
45. Harnessing Agronomics Through Genomics and Phenomics in Plant Breeding: A Review
- Author
-
Chunpeng James Chen, Ben J. Hayes, Zhiwu Zhang, Jessica Rutkoski, Xiuliang Jin, Seth C. Murray, Wang L, Benjamin Stich, José Crossa, and James C. Schnable
- Subjects
Phenomics ,anatomy_morphology ,Genomics ,Plant breeding ,Computational biology ,Biology ,Genomic selection - Abstract
Plant breeding primarily focuses on improving conventional agronomic traits, e.g. yield, quality, and resistance to biotic and abiotic stress; however, genetic improvement methods are being rapidly enhanced through genomics and phenomics. In the Genomics-Phenomics-Agronomics (GPA) paradigm, diverse research approaches have been conducted to bridge any two of these elements, and recently, all of them together. This review first highlights the progress to link i) genomics to agronomics; ii) genomics to phenomics; and iii) phenomics to agronomics. Secondly, the GPA domain is dissected into different layers, each addressing the three elements simultaneously. These dissected layers include genetic dissection through gene mapping using genome-wide association studies and genomic selection using Best Linear Unbiased Prediction, Bayesian approaches, and machine learning. The objective of the review is to help readers to grasp the core developments among the exponentially growing literature in each of these fields. Through this review, the connections among the three elements of the GPA paradigm are coherently integrated toward the prospect of sustainable development of agronomic traits through both genomics and phenomics.
- Published
- 2021
46. Assembly of chromosome-scale and allele-aware autotetraploid genome of the Chinese alfalfa cultivar Zhongmu-4 and identification of SNP loci associated with 27 agronomic traits
- Author
-
Wenwen Liu, Qingchuan Yang, Long-Xi Yu, Ruicai Long, Mingna Li, Lin Chen, Zhiwu Zhang, Zhen Wang, Junmei Kang, Changfu Yang, Fei He, Xue Wang, Tiejun Zhang, Xijiang Yang, Fan Zhang, and Xueqian Jiang
- Subjects
Whole genome sequencing ,Germplasm ,Genetics ,fungi ,food and beverages ,Genome-wide association study ,Single-nucleotide polymorphism ,Allele ,Biology ,Genome ,Single molecule real time sequencing ,Genetic association - Abstract
Alfalfa (Medicago sativaL.), the most valuable perennial legume crop, referred to as “Queen of the Forages” for its high nutritional value and yield production among forage crops. Comprehensive genomic information of germplasm resources from different ecological regions and modern breeding strategies, such as molecular-marker assisted breeding are of great importance to breed new alfalfa varieties with environmental resilience. Here, we report assembly of the genome sequence of Zhongmu-4 (ZM-4), one of the most planted cultivars in China, and identification of SNPs associated with alfalfa agronomic traits by Genome-wide Association Studies (GWAS). Sequence of 32 allelic chromosomes was assembled successfully by single molecule real time sequencing and Hi-C technique with ALLHiC algorithm. About 2.74 Gbp contigs, accounting for 88.39% of the estimated genome, were assembled with 2.56 Gbp contigs anchored to 32 pseudo-chromosomes. In comparison withM. truncatulaA17, distinctive inversion and translocation on chromosome 1, and between chromosome 4 and 8, respectively, were detected. Moreover, we conducted resequencing of 220 alfalfa accessions collected globally and performed GWAS analysis based on our assembled genome. Population structure analysis demonstrated that alfalfa has a complex genetic relationship among germplasm with different geographic origins. GWAS identified 101 SNPs associated with 27 out of 93 agronomic traits. The updated chromosome-scale and allele-aware genome sequence, coupled with the resequencing data of most global alfalfa germplasm, provides valuable information for alfalfa genetic research, and further analysis of major SNP loci will accelerate unravelling the molecular basis of important agronomic traits and facilitate genetic improvement of alfalfa.
- Published
- 2021
47. Validation of UAV-based alfalfa biomass predictability using photogrammetry with fully automatic plot segmentation
- Author
-
Atit Parajuli, Long-Xi Yu, Samuel R. Revolinski, Cesar Augusto Medina, Sen Lin, Zhiwu Zhang, Chunpeng James Chen, Zhou Tang, and Yang Hu
- Subjects
0106 biological sciences ,Biomass (ecology) ,Plant genetics ,Multidisciplinary ,Coefficient of determination ,010504 meteorology & atmospheric sciences ,Science ,Multispectral image ,Red edge ,Forage ,01 natural sciences ,Article ,Plant breeding ,Normalized Difference Vegetation Index ,Photogrammetry ,Medicine ,Cultivar ,010606 plant biology & botany ,0105 earth and related environmental sciences ,Mathematics ,Remote sensing - Abstract
Alfalfa is the most widely cultivated forage legume, with approximately 30 million hectares planted worldwide. Genetic improvements in alfalfa have been highly successful in developing cultivars with exceptional winter hardiness and disease resistance traits. However, genetic improvements have been limited for complex economically important traits such as biomass. One of the major bottlenecks is the labor-intensive phenotyping burden for biomass selection. In this study, we employed two alfalfa fields to pave a path to overcome the challenge by using UAV images with fully automatic field plot segmentation for high-throughput phenotyping. The first field was used to develop the prediction model and the second field to validate the predictions. The first and second fields had 808 and 1025 plots, respectively. The first field had three harvests with biomass measured in May, July, and September of 2019. The second had one harvest with biomass measured in September of 2019. These two fields were imaged one day before harvesting with a DJI Phantom 4 pro UAV carrying an additional Sentera multispectral camera. Alfalfa plot images were extracted by GRID software to quantify vegetative area based on the Normalized Difference Vegetation Index. The prediction model developed from the first field explained 50–70% (R Square) of biomass variation in the second field by incorporating four features from UAV images: vegetative area, plant height, Normalized Green–Red Difference Index, and Normalized Difference Red Edge Index. This result suggests that UAV-based, high-throughput phenotyping could be used to improve the efficiency of the biomass selection process in alfalfa breeding programs.
- Published
- 2021
48. Genome Assembly of Alfalfa Cultivar Zhongmu-4 and Identification of SNPs Associated with Agronomic Traits
- Author
-
Ruicai Long, Fan Zhang, Zhiwu Zhang, Mingna Li, Lin Chen, Xue Wang, Wenwen Liu, Tiejun Zhang, Long-Xi Yu, Fei He, Xueqian Jiang, Xijiang Yang, Changfu Yang, Zhen Wang, Junmei Kang, and Qingchuan Yang
- Subjects
Computational Mathematics ,Nitrogen ,Genetics ,DNA Transposable Elements ,Molecular Biology ,Biochemistry ,Polymorphism, Single Nucleotide ,Genome-Wide Association Study ,Medicago sativa - Abstract
Alfalfa (Medicago sativa L.) is the most important legume forage crop worldwide with high nutritional value and yield. For a long time, the breeding of alfalfa was hampered by lacking reliable information on the autotetraploid genome and molecular markers linked to important agronomic traits. We herein reported the de novo assembly of the allele-aware chromosome-level genome of Zhongmu-4, a cultivar widely cultivated in China, and a comprehensive database of genomic variations based on resequencing of 220 germplasms. Approximate 2.74 Gb contigs (N50 of 2.06 Mb), accounting for 88.39% of the estimated genome, were assembled, and 2.56 Gb contigs were anchored to 32 pseudo-chromosomes. A total of 34,922 allelic genes were identified from the allele-aware genome. We observed the expansion of gene families, especially those related to the nitrogen metabolism, and the increase of repetitive elements including transposable elements, which probably resulted in the increase of Zhongmu-4 genome compared with Medicago truncatula. Population structure analysis revealed that the accessions from Asia and South America had relatively lower genetic diversity than those from Europe, suggesting that geography may influence alfalfa genetic divergence during local adaption. Genome-wide association studies identified 101 single nucleotide polymorphisms (SNPs) associated with 27 agronomic traits. Two candidate genes were predicted to be correlated with fall dormancy and salt response. We believe that the allele-aware chromosome-level genome sequence of Zhongmu-4 combined with the resequencing data of the diverse alfalfa germplasms will facilitate genetic research and genomics-assisted breeding in variety improvement of alfalfa.
- Published
- 2021
49. Chromosomal Characteristics of Salt Stress Heritable Gene Expression in the Rice Genome
- Author
-
Matthew T McGowan, Stephen P. Ficklin, and Zhiwu Zhang
- Subjects
0106 biological sciences ,0301 basic medicine ,Quantitative Trait Loci ,Population ,Health Informatics ,Quantitative trait locus ,Biology ,01 natural sciences ,Salt Stress ,Genome ,Chromosomes, Plant ,Heritability ,03 medical and health sciences ,Gene Expression Regulation, Plant ,Genetic variation ,Gene expression ,Genetics ,Transcriptomics ,education ,Gene ,education.field_of_study ,Research ,Oryza ,RNAseq ,Phenotype ,Agronomy ,030104 developmental biology ,Genome, Plant ,010606 plant biology & botany - Abstract
BackgroundGene expression is potentially an important heritable quantitative trait that mediates between genetic variation and higher-level complex phenotypes through time and condition-dependent regulatory interactions. Increasing quantities of high-throughput DNA and RNA sequencing and standardization of research populations has resulted in the accumulation of overlapping -omics data allowing for deeper investigation into the genomic structure and conditional stability of gene expression traits. Therefore, we sought to explore both the genomic and condition-specific characteristics of gene expression heritability within the context of chromosomal structure, using a diverse, 84-line, Oryza sativa (rice) population under optimal and salt-stressed conditions.ResultsHeritability was estimated for the 84 genotypes with common tools and an approach using biological gene expression replicates similar to a twin study in humans. Overall, 5,936 genes were found to have heritable expression regardless of condition and 1,377 genes were found to have heritable expression only during salt stress. These genes with salt-specific heritable expression are enriched for functional terms associated with response to stimulus and transcription factor activity. Additionally, we discovered that highly and lowly expressed genes, and genes with heritable expression are distributed differently along the chromosomes in patterns that follow previously identified chromosomal conformation capture (Hi-C) A/B chromatin compartments. Furthermore, multiple genomic hot-spots enriched for genes with salt-specific heritability were identified on chromosomes 1, 4, 6, and 8. These hotspots were found to contain genes functionally enriched for transcriptional regulation and overlaps with a previously identified major QTL for salt-tolerance in rice.ConclusionsInvestigating the heritability of traits, and in-particular gene expression traits, is important towards a basic understanding of how regulatory networks behave across a population. Additionally, heritable gene expression architecture can be used for further exploration of gene-trait relationships, assist in interpretation and analysis of eQTLs, used as priors for approaches seeking to identification of potential biomarkers, or used in genomic selection modules with potential applications in plant breeding. This work provides insights into patterns of expression and spatial patterns at the chromosomal level.
- Published
- 2021
50. Deep Learning for Predicting Complex Traits in Spring Wheat Breeding Program
- Author
-
Dennis N. Lozada, Arron H. Carter, Michael O. Pumphrey, Karansher S. Sandhu, and Zhiwu Zhang
- Subjects
0106 biological sciences ,0301 basic medicine ,Breeding program ,Population ,convolutional neural network ,Plant Science ,lcsh:Plant culture ,Overfitting ,Machine learning ,computer.software_genre ,01 natural sciences ,Convolutional neural network ,genomic selection ,03 medical and health sciences ,lcsh:SB1-1110 ,multilayer perceptron ,wheat breeding ,education ,Dropout (neural networks) ,Original Research ,Mathematics ,education.field_of_study ,Artificial neural network ,business.industry ,Deep learning ,deep learning ,artificial intelligence ,neural networks ,030104 developmental biology ,Multilayer perceptron ,Artificial intelligence ,business ,computer ,010606 plant biology & botany - Abstract
Genomic selection (GS) is transforming the field of plant breeding and implementing models that improve prediction accuracy for complex traits is needed. Analytical methods for complex datasets traditionally used in other disciplines represent an opportunity for improving prediction accuracy in GS. Deep learning (DL) is a branch of machine learning (ML) which focuses on densely connected networks using artificial neural networks for training the models. The objective of this research was to evaluate the potential of DL models in the Washington State University spring wheat breeding program. We compared the performance of two DL algorithms, namely multilayer perceptron (MLP) and convolutional neural network (CNN), with ridge regression best linear unbiased predictor (rrBLUP), a commonly used GS model. The dataset consisted of 650 recombinant inbred lines (RILs) from a spring wheat nested association mapping (NAM) population planted from 2014–2016 growing seasons. We predicted five different quantitative traits with varying genetic architecture using cross-validations (CVs), independent validations, and different sets of SNP markers. Hyperparameters were optimized for DL models by lowering the root mean square in the training set, avoiding model overfitting using dropout and regularization. DL models gave 0 to 5% higher prediction accuracy than rrBLUP model under both cross and independent validations for all five traits used in this study. Furthermore, MLP produces 5% higher prediction accuracy than CNN for grain yield and grain protein content. Altogether, DL approaches obtained better prediction accuracy for each trait, and should be incorporated into a plant breeder’s toolkit for use in large scale breeding programs.
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.