Back to Search
Start Over
ProRepeat: an integrated repository for studying amino acid tandem repeats in proteins
- Source :
- Nucleic acids research, 40(D1), D394-D399, Nucleic Acids Research, Nucleic acids research 40 (2012) D1
- Publication Year :
- 2012
-
Abstract
- ProRepeat (http://prorepeat.bioinformatics.nl/) is an integrated curated repository and analysis platform for in-depth research on the biological characteristics of amino acid tandem repeats. ProRepeat collects repeats from all proteins included in the UniProt knowledgebase, together with 85 completely sequenced eukaryotic proteomes contained within the RefSeq collection. It contains non-redundant perfect tandem repeats, approximate tandem repeats and simple, low-complexity sequences, covering the majority of the amino acid tandem repeat patterns found in proteins. The ProRepeat web interface allows querying the repeat database using repeat characteristics like repeat unit and length, number of repetitions of the repeat unit and position of the repeat in the protein. Users can also search for repeats by the characteristics of repeat containing proteins, such as entry ID, protein description, sequence length, gene name and taxon. ProRepeat offers powerful analysis tools for finding biological interesting properties of repeats, such as the strong position bias of leucine repeats in the N-terminus of eukaryotic protein sequences, the differences of repeat abundance among proteomes, the functional classification of repeat containing proteins and GC content constrains of repeats’ corresponding codons.
- Subjects :
- Repetitive Sequences, Amino Acid
Bioinformatics
selection
Biology
annotation resource
Pentapeptide repeat
Genome
User-Computer Interface
Tandem repeat
Sequence Analysis, Protein
evolution
Bioinformatica
Genetics
RefSeq
Direct repeat
Databases, Protein
database
Repeat unit
algorithm
codon usage
EPS-4
Proteins
Articles
Variable number tandem repeat
proteomes
Armadillo repeats
sequences
alanine
genomes
Subjects
Details
- Language :
- English
- ISSN :
- 03051048
- Volume :
- 40
- Issue :
- D1
- Database :
- OpenAIRE
- Journal :
- Nucleic acids research
- Accession number :
- edsair.doi.dedup.....839e9b8f47defb45bda9b143a90343dd