Back to Search
Start Over
Structural homology guided alignment of cysteine rich proteins
- Source :
- SpringerPlus
- Publisher :
- Springer Nature
-
Abstract
- Background Cysteine rich protein families are notoriously difficult to align due to low sequence identity and frequent insertions and deletions. Results Here we present an alignment method that ensures homologous cysteines align by assigning a unique 10 amino acid barcode to those identified as structurally homologous by the DALI webserver. The free inter-cysteine regions of the barcoded sequences can then be aligned using any standard algorithm. Finally the barcodes are replaced with the original columns to yield an alignment which requires the minimum of manual refinement. Conclusions Using structural homology information to constrain sequence alignments allows the alignment of highly divergent, repetitive sequences that are poorly dealt with by existing algorithms. Tools are provided to perform this method online using the CysBar web-tool (http://CysBar.science.latrobe.edu.au) and offline (python script available from http://github.com/ts404/CysBar). Electronic supplementary material The online version of this article (doi:10.1186/s40064-015-1609-z) contains supplementary material, which is available to authorized users.
- Subjects :
- 0301 basic medicine
Computer science
Computational biology
computer.software_genre
Barcode
law.invention
Cysteine rich protein
03 medical and health sciences
law
computer.programming_language
Sequence (medicine)
Alignment
Multidisciplinary
030102 biochemistry & molecular biology
Python (programming language)
Cysteine-rich proteins
Sequence identity
Structural homology
ComputingMethodologies_PATTERNRECOGNITION
030104 developmental biology
Defensin
Standard algorithms
Data mining
computer
Software
Cysteine
Subjects
Details
- Language :
- English
- ISSN :
- 21931801
- Volume :
- 5
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- SpringerPlus
- Accession number :
- edsair.doi.dedup.....98dcac6859f0a705ebf78293368c4475
- Full Text :
- https://doi.org/10.1186/s40064-015-1609-z