1. Panax ginseng genome examination for ginsenoside biosynthesis
- Author
-
Zhengwei Jia, Baosheng Liao, Zhenzhan Chang, Linlin Dong, Mingli Wu, Jingjing Zhang, Zhixiang Liu, Wei Rao, He Su, Ruiyang Cheng, Qinggang Yin, Jiang Xu, Yuebin Zhang, Lei Zhang, Lirui Qiao, Guangwei Zhu, Yujun Zhang, Shilin Chen, Yung-Chi Cheng, Jun Qian, Daniel Afreh, Jie Zhang, Yang Chu, Shuiming Xiao, Xiaoyan Zhang, Ruth Nahurira, Yingjie Zhu, Rui Bai, Guozheng Li, Xiwen Li, Lianjuan Zhang, Chao Zhou, and Zhihai Huang
- Subjects
0106 biological sciences ,0301 basic medicine ,Ginsenosides ,Mevalonic Acid ,Panax ,Health Informatics ,mass spectrometry imaging ,Computational biology ,Biology ,complex mixtures ,01 natural sciences ,Genome ,03 medical and health sciences ,chemistry.chemical_compound ,Ginseng ,Tandem repeat ,Gene family ,genome ,Gene ,Whole genome sequencing ,Research ,Panax ginseng ,Nucleic acid sequence ,Glycosyltransferases ,food and beverages ,Molecular Sequence Annotation ,3. Good health ,Computer Science Applications ,030104 developmental biology ,chemistry ,Ginsenoside ,Hydroxymethylglutaryl CoA Reductases ,Genome, Plant ,010606 plant biology & botany - Abstract
Ginseng, which contains ginsenosides as bioactive compounds, has been regarded as an important traditional medicine for several millennia. However, the genetic background of ginseng remains poorly understood, partly because of the plant's large and complex genome composition. We report the entire genome sequence of Panax ginseng using next-generation sequencing. The 3.5-Gb nucleotide sequence contains more than 60% repeats and encodes 42 006 predicted genes. Twenty-two transcriptome datasets and mass spectrometry images of ginseng roots were adopted to precisely quantify the functional genes. Thirty-one genes were identified to be involved in the mevalonic acid pathway. Eight of these genes were annotated as 3-hydroxy-3-methylglutaryl-CoA reductases, which displayed diverse structures and expression characteristics. A total of 225 UDP-glycosyltransferases (UGTs) were identified, and these UGTs accounted for one of the largest gene families of ginseng. Tandem repeats contributed to the duplication and divergence of UGTs. Molecular modeling of UGTs in the 71st, 74th, and 94th families revealed a regiospecific conserved motif located at the N-terminus. Molecular docking predicted that this motif captures ginsenoside precursors. The ginseng genome represents a valuable resource for understanding and improving the breeding, cultivation, and synthesis biology of this key herb.
- Published
- 2017
- Full Text
- View/download PDF