1. Support vector machine-based prediction of pore-forming toxins (PFT) using distributed representation of reduced alphabets
- Author
-
Hrushikesh Bhosale, Valadi K Jayaraman, and Vigneshwar Ramakrishnan
- Subjects
Sequence ,Pore-forming toxin ,Support Vector Machine ,Bacteria ,Computer science ,Virulence ,Computational biology ,Biochemistry ,Distributed representation ,Computer Science Applications ,Support vector machine ,Protein sequencing ,Similarity (network science) ,Amino Acid Sequence ,Molecular Biology ,Hydrophobicity scales - Abstract
Bacterial virulence can be attributed to a wide variety of factors including toxins that harm the host. Pore-forming toxins are one class of toxins that confer virulence to the bacteria and are one of the promising targets for therapeutic intervention. In this work, we develop a sequence-based machine learning framework for the prediction of pore-forming toxins. For this, we have used distributed representation of the protein sequence encoded by reduced alphabet schemes based on conformational similarity and hydropathy index as input features to Support Vector Machines (SVMs). The choice of conformational similarity and hydropathy indices is based on the functional mechanism of pore-forming toxins. Our methodology achieves about 81% accuracy indicating that conformational similarity, an indicator of the flexibility of amino acids, along with hydrophobic index can capture the intrinsic features of pore-forming toxins that distinguish it from other types of transporter proteins. Increased understanding of the mechanisms of pore-forming toxins can further contribute to the use of such “mechanism-informed” features that may increase the prediction accuracy further.
- Published
- 2021