Back to Search Start Over

RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

Authors :
Pablo Lorenzano Menna
Martina Bevilacqua
Mariane Gonçalves Kulik
Alexander Miguel Monzon
Lisanna Paladin
José Luis López
Martin Gonzalez Buitron
Javier Rios
Marco Necci
Sara Errigo
Layla Hirsh
Ivan Mičetić
Juliet F. Nilsson
Andrey V. Kajava
María Silvina Fornasari
Antonio Lagares
Damiano Piovesan
Sebastian Fernandez-Alberti
Maia Diana Eliana Cabrera
Gustavo Parisi
María Laura Fabre
Miguel A. Andrade-Navarro
Silvio C. E. Tosatto
Centre de recherche en Biologie Cellulaire (CRBM)
Université Montpellier 2 - Sciences et Techniques (UM2)-Centre National de la Recherche Scientifique (CNRS)-Université de Montpellier (UM)-Université Montpellier 1 (UM1)
Source :
Nucleic Acids Research, SEDICI (UNLP), Universidad Nacional de La Plata, instacron:UNLP, Nucleic Acids Research, Oxford University Press, 2020, ⟨10.1093/nar/gkaa1097⟩
Publication Year :
2020
Publisher :
Oxford University Press, 2020.

Abstract

The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increasing the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and presents an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification combining top levels based solely on structural similarity (Class > Topology > Fold) with two new levels (Clan > Family) requiring sequence similarity and describing repeat motifs in collaboration with Pfam. Data growth has been addressed with improved mechanisms for browsing the classification hierarchy. A new UniProt-centric view unifies the increasingly frequent annotation of structures from identical or similar sequences. This update of RepeatsDB aligns with our commitment to develop a resource that extracts, organizes and distributes specialized information on tandem repeat protein structures.<br />Facultad de Ciencias Exactas<br />Instituto de Biotecnologia y Biologia Molecular

Details

Language :
English
ISSN :
13624962 and 03051048
Volume :
49
Issue :
D1
Database :
OpenAIRE
Journal :
Nucleic Acids Research
Accession number :
edsair.doi.dedup.....588bf86faec4b1890cf5cf27d81eb86a