Back to Search
Start Over
Small molecule annotation for the Protein Data Bank
- Source :
- Database: The Journal of Biological Databases and Curation
- Publication Year :
- 2014
-
Abstract
- The Protein Data Bank (PDB) is the single global repository for three-dimensional structures of biological macromolecules and their complexes, and its more than 100 000 structures contain more than 20 000 distinct ligands or small molecules bound to proteins and nucleic acids. Information about these small molecules and their interactions with proteins and nucleic acids is crucial for our understanding of biochemical processes and vital for structure-based drug design. Small molecules present in a deposited structure may be attached to a polymer or may occur as a separate, non-covalently linked ligand. During curation of a newly deposited structure by wwPDB annotation staff, each molecule is cross-referenced to the PDB Chemical Component Dictionary (CCD). If the molecule is new to the PDB, a dictionary description is created for it. The information about all small molecule components found in the PDB is distributed via the ftp archive as an external reference file. Small molecule annotation in the PDB also includes information about ligand-binding sites and about covalent and other linkages between ligands and macromolecules. During the remediation of the peptide-like antibiotics and inhibitors present in the PDB archive in 2011, it became clear that additional annotation was required for consistent representation of these molecules, which are quite often composed of several sequential subcomponents including modified amino acids and other chemical groups. The connectivity information of the modified amino acids is necessary for correct representation of these biologically interesting molecules. The combined information is made available via a new resource called the Biologically Interesting molecules Reference Dictionary, which is complementary to the CCD and is now routinely used for annotation of peptide-like antibiotics and inhibitors.
- Subjects :
- Models, Molecular
Computer science
Protein Data Bank (RCSB PDB)
Computational biology
Ligands
General Biochemistry, Genetics and Molecular Biology
Small Molecule Libraries
Molecule
Data Mining
Binding site
Databases, Protein
chemistry.chemical_classification
Information retrieval
Binding Sites
Glycopeptides
Reproducibility of Results
computer.file_format
Protein Data Bank
Small molecule
Amino acid
Anti-Bacterial Agents
Glucose
chemistry
Nucleic acid
Original Article
General Agricultural and Biological Sciences
computer
Databases, Chemical
Information Systems
Macromolecule
Subjects
Details
- ISSN :
- 17580463
- Volume :
- 2014
- Database :
- OpenAIRE
- Journal :
- Database : the journal of biological databases and curation
- Accession number :
- edsair.doi.dedup.....dd5a7fff69a51c252644b8839980870b