
ProRepeat: an integrated repository for studying amino acid tandem repeats in proteins

Authors :
Audrey David
Harm Nijveen
Ke Lin
Jack A. M. Leunissen
Hong Luo
Source :
Nucleic Acids Research, 40(D1), D394-D399 (2012)
Publication Year :
2012

Abstract

ProRepeat (http://prorepeat.bioinformatics.nl/) is an integrated, curated repository and analysis platform for in-depth research on the biological characteristics of amino acid tandem repeats. ProRepeat collects repeats from all proteins included in the UniProt knowledgebase, together with 85 completely sequenced eukaryotic proteomes contained within the RefSeq collection. It contains non-redundant perfect tandem repeats, approximate tandem repeats and simple, low-complexity sequences, covering the majority of the amino acid tandem repeat patterns found in proteins. The ProRepeat web interface allows querying the repeat database by repeat characteristics such as repeat unit and length, number of repetitions of the repeat unit and position of the repeat in the protein. Users can also search for repeats by the characteristics of repeat-containing proteins, such as entry ID, protein description, sequence length, gene name and taxon. ProRepeat offers powerful analysis tools for finding biologically interesting properties of repeats, such as the strong position bias of leucine repeats in the N-terminus of eukaryotic protein sequences, the differences in repeat abundance among proteomes, the functional classification of repeat-containing proteins and GC content constraints of repeats' corresponding codons.
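
To make the repeat characteristics named in the abstract concrete (repeat unit, number of repetitions, position in the protein), the following is a minimal Python sketch of how perfect tandem repeats could be located in a single protein sequence. It is an illustration only and is not ProRepeat's detection method, which the abstract does not describe. The names `TandemRepeat` and `find_perfect_repeats`, the parameters `max_unit_len` and `min_copies`, and the example sequence are all assumptions made for this sketch. Approximate repeats and low-complexity regions are not handled.

```python
import re
from typing import List, NamedTuple


class TandemRepeat(NamedTuple):
    """Hypothetical record for one perfect tandem repeat."""
    unit: str    # the repeated unit, e.g. "Q" or "SP"
    copies: int  # number of consecutive copies of the unit
    start: int   # 0-based start position in the protein sequence


def is_periodic(unit: str) -> bool:
    """True if the unit is itself a repetition of a shorter string (e.g. "AA")."""
    return unit in (unit + unit)[1:-1]


def find_perfect_repeats(seq: str, max_unit_len: int = 3,
                         min_copies: int = 3) -> List[TandemRepeat]:
    """Find perfect tandem repeats with unit length 1..max_unit_len.

    Illustrative only: a regex back-reference matches a unit followed by
    at least (min_copies - 1) further exact copies of itself.
    """
    hits = []
    for unit_len in range(1, max_unit_len + 1):
        pattern = re.compile(r"(.{%d})\1{%d,}" % (unit_len, min_copies - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            # Skip units such as "AA" that are already covered by "A".
            if is_periodic(unit):
                continue
            copies = len(match.group(0)) // unit_len
            hits.append(TandemRepeat(unit, copies, match.start()))
    return sorted(hits, key=lambda r: r.start)


# Made-up example sequence, not taken from any database.
example = "MLLLLLAGQQQQSPSPSPK"
repeats = find_perfect_repeats(example)
for r in repeats:
    print(r.unit, r.copies, r.start)
```

Dividing `start` by `len(seq)` would give a relative position, which is one simple way to look at the kind of N-terminal position bias the abstract mentions for leucine repeats.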

Details

Language :
English
ISSN :
0305-1048
Volume :
40
Issue :
D1
Database :
OpenAIRE
Journal :
Nucleic acids research
Accession number :
edsair.doi.dedup.....839e9b8f47defb45bda9b143a90343dd