Back to Search
Start Over
Spectral density ratio based clustering methods for the binary segmentation of protein sequences: A comparative study
- Source :
- BioSystems
- Publication Year :
- 2010
- Publisher :
- Elsevier BV, 2010.
-
Abstract
- We compare several spectral domain based clustering methods for partitioning protein sequence data. The main instrument for this exercise is the spectral density ratio model, which specifies that the logarithmic ratio of two or more unknown spectral density functions has a parametric linear combination of cosines. Maximum likelihood inference is worked out in detail and it is shown that its output yields several distance measures among independent stationary time series. These similarity indices are suitable for clustering time series data based on their second order properties. Other spectral domain based distances are investigated as well and we compare all methods and distances to the problem of producing segmentations of bacterial outer membrane proteins consistent with their transmembrane topology. Protein sequences are transformed to time series data by employing numerical scales of physicochemical parameters. We also present interesting results on the prediction of transmembrane β-strands, based on the clustering outcome, for a representative set of bacterial outer membrane proteins with given three-dimensional structure. © 2010 Elsevier Ireland Ltd. 100 2 132 143 Cited By :2
- Subjects :
- sequence analysis
Neisseria meningitidis
Distance measures
Sequence Analysis, Protein
Cluster Analysis
Databases, Protein
Linear combination
Bacteria (microorganisms)
Periodogram
comparative study
Parametric statistics
Mathematics
Quantitative Biology::Biomolecules
maximum likelihood analysis
Applied Mathematics
article
protein domain
protein processing
General Medicine
simulation
Modeling and Simulation
OMP topology prediction
Biological system
Bacterial Outer Membrane Proteins
Statistics and Probability
Time series
residue analysis
protein localization
Spectral analysis
outer membrane protein
General Biochemistry, Genetics and Molecular Biology
spectrometry
Quantitative Biology::Subcellular Processes
Similarity (network science)
physical chemistry
biochemistry
Protein sequence segmentation
Amino Acid Sequence
protein structure
Cluster analysis
hydrophobicity
nonhuman
Series (mathematics)
business.industry
physicochemical property
Spectral density
Pattern recognition
prediction
sequence homology
Physicochemical parameters
time series analysis
Artificial intelligence
numerical model
protein
business
Sequence Alignment
cluster analysis
Subjects
Details
- ISSN :
- 03032647
- Volume :
- 100
- Database :
- OpenAIRE
- Journal :
- Biosystems
- Accession number :
- edsair.doi.dedup.....610e8bf981b067167c5e5e4c4ae3286f
- Full Text :
- https://doi.org/10.1016/j.biosystems.2010.02.008