1. 1001 Proteomes: a functional proteomics portal for the analysis of Arabidopsis thaliana accessions
- Author
-
Joffrey Fitz, Jun Cao, Anna Lipzen, Wendy Schackwitz, A. Michelle Smith-Moritz, Hiren J. Joshi, Detlef Weigel, Len A. Pennacchio, Joshua L. Heazlewood, Katy M. Christiansen, and Joel Martin
- Subjects
Statistics and Probability ,Proteomics ,DNA, Plant ,Proteome ,Sequence analysis ,Arabidopsis ,Computational biology ,Biochemistry ,Genome ,Polymorphism, Single Nucleotide ,Arabidopsis thaliana ,Protein phosphorylation ,Phosphorylation ,Databases, Protein ,Molecular Biology ,biology ,business.industry ,Arabidopsis Proteins ,fungi ,food and beverages ,Sequence Analysis, DNA ,biology.organism_classification ,Computer Science Applications ,Biotechnology ,Computational Mathematics ,Computational Theory and Mathematics ,Amino Acid Substitution ,business ,Protein Processing, Post-Translational ,Reference genome - Abstract
Motivation: The sequencing of over a thousand natural strains of the model plant Arabidopsis thaliana is producing unparalleled information at the genetic level for plant researchers. To enable the rapid exploitation of these data for functional proteomics studies, we have created a resource for the visualization of protein information and proteomic datasets for sequenced natural strains of A. thaliana. Results: The 1001 Proteomes portal can be used to visualize amino acid substitutions or non-synonymous single-nucleotide polymorphisms in individual proteins of A. thaliana based on the reference genome Col-0. We have used the available processed sequence information to analyze the conservation of known residues subject to protein phosphorylation among these natural strains. The substitution of amino acids in A. thaliana natural strains is heavily constrained and is likely a result of the conservation of functional attributes within proteins. At a practical level, we demonstrate that this information can be used to clarify ambiguously defined phosphorylation sites from phosphoproteomic studies. Protein sets of available natural variants are available for download to enable proteomic studies on these accessions. Together this information can be used to uncover the possible roles of specific amino acids in determining the structure and function of proteins in the model plant A. thaliana. An online portal to enable the community to exploit these data can be accessed at http://1001proteomes.masc-proteomics.org/ Contact: jlheazlewood@lbl.gov Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2012