Back to Search
Start Over
InterPro, progress and status in 2005
- Source :
- Nucleic Acids Research, Nucleic Acids Research Database issue (33), D201-D205. (2005), Nucleic Acids Research, Vol. 33, No Database issue (2005) pp. D201-5, Nucleic Acids Research, Oxford University Press, 2005, 33 Database Issue (Database issue), pp.D201-D205. ⟨10.1093/nar/gki106⟩, Nucleic Acids Research, 2005, 33 Database Issue (Database issue), pp.D201-D205. ⟨10.1093/nar/gki106⟩, Nucleic Acids Research 33 (Database Issue): D201-D205.
- Publication Year :
- 2005
-
Abstract
- International audience; InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is provided in an abstract, Gene Ontology mapping and links to specialized databases. New features of InterPro include extended protein match views, taxonomic range information and protein 3D structure data. One of the new match views is the InterPro Domain Architecture view, which shows the domain composition of protein matches. Two new entry types were introduced to better describe InterPro entries: these are active site and binding site. PIRSF and the structure-based SUPERFAMILY are the latest member databases to join InterPro, and CATH and PANTHER are soon to be integrated. InterPro release 8.0 contains 11 007 entries, representing 2573 domains, 8166 families, 201 repeats, 26 active sites, 21 binding sites and 20 post-translational modification sites. InterPro covers over 78% of all proteins in the Swiss-Prot and TrEMBL components of UniProt. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).
- Subjects :
- InterPro
[SDV.OT]Life Sciences [q-bio]/Other [q-bio.OT]
Architecture domain
Protein family
Simple Modular Architecture Research Tool
Protein Sequence Analysis
Databases, Protein/trends
570 Life Sciences
Computational biology
PROSITE
Biology
Bioinformatics
610 Medical Sciences, Medicine
03 medical and health sciences
Annotation
TIGRFAMs
Sequence Analysis, Protein
Genetics
Humans
Protein Databases
ddc:576
Tertiary Protein Structure
Databases, Protein
030304 developmental biology
Proteins/chemistry/classification
0303 health sciences
030302 biochemistry & molecular biology
Proteins
Articles
Protein Structure, Tertiary
Systems Integration
Cardiovascular and Metabolic Diseases
UniProt
Sequence Alignment
Autre (Sciences du Vivant)
Subjects
Details
- Language :
- English
- ISSN :
- 13624962 and 03051048
- Volume :
- 33
- Database :
- OpenAIRE
- Journal :
- Nucleic acids research
- Accession number :
- edsair.doi.dedup.....5b5f275513ed12ce0d0038631f445fa2
- Full Text :
- https://doi.org/10.1093/nar/gki106⟩