The Helitron family classification using SVM based on Fourier transform features applied on an unbalanced dataset.

Authors :
Touati, Rabeb
Oueslati, Afef Elloumi
Messaoudi, Imen
Lachiri, Zied
Source :
Medical & Biological Engineering & Computing. Oct2019, Vol. 57 Issue 10, p2289-2304. 16p. 3 Diagrams, 10 Charts, 4 Graphs.
Publication Year :
2019

Abstract

Helitrons are mobile sequences that belong to class 2 of eukaryotic transposons. Their distinguishing feature is their mechanism of transposition: the rolling-circle mechanism. They play an important role in remodeling proteomes because they can modify existing genes and introduce new ones. A major difficulty in identifying and classifying Helitron families comes from their complex structure, their unspecified length, and the unbalanced number of occurrences of each Helitron type. Helitron recognition remains unsolved in the literature. The purpose of this paper is to characterize and classify Helitron types using spectral features and the support vector machine (SVM) classification technique. To this end, the helitronic DNA is transformed into a numerical form using the FCGS2 coding technique. Then, a set of spectral features is extracted from the smoothed Fourier transform applied to the FCGS2 signals. Based on the spectral signature and the classification's confusion matrix, we demonstrated that some specific classes that do not show similarities, such as HelitronY2 and NDNAX3, are easily discriminated, with high accuracy rates exceeding 90%. However, some Helitron types have great similarities, namely Helitron1, HelitronY1, HelitronY1A, and HelitronY4. Our system is nevertheless able to predict them with promising values reaching 70%. Graphical abstract: The Helitron recognizer based on features extracted from the smoothed Fourier transform. [ABSTRACT FROM AUTHOR]
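
The feature-extraction steps named in the abstract (numerical coding, Fourier transform, smoothing) can be illustrated with a minimal sketch. It is not the authors' implementation, and several points are assumptions: FCGS2 is taken here to mean that each overlapping dinucleotide is replaced by its relative frequency within the sequence; the signal is mean-centered before the transform; the smoothing is a simple moving average of width 3; and the function names (`fcgs2_signal`, `dft_magnitude`, `smooth`, `spectral_features`) and the choice of summary features are hypothetical.

```python
import cmath
from collections import Counter


def fcgs2_signal(seq):
    """Code a DNA string as a numeric signal.

    Assumption: each overlapping dinucleotide is replaced by its
    relative frequency of occurrence in the same sequence.
    """
    seq = seq.upper()
    if len(seq) < 2:
        raise ValueError("sequence must contain at least two nucleotides")
    dimers = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = Counter(dimers)
    total = len(dimers)
    return [counts[d] / total for d in dimers]


def dft_magnitude(signal):
    """Magnitude of the discrete Fourier transform for bins 0..n//2.

    The signal is mean-centered first (an assumption of this sketch)
    so that the zero-frequency bin does not dominate the spectrum.
    """
    n = len(signal)
    mean = sum(signal) / n
    centered = [x - mean for x in signal]
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        spectrum.append(abs(coeff))
    return spectrum


def smooth(values, width=3):
    """Centered moving average; the window is truncated at the edges."""
    half = width // 2
    smoothed = []
    for i in range(len(values)):
        window = values[max(0, i - half): i + half + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


def spectral_features(seq, width=3):
    """Return a few illustrative features of the smoothed spectrum."""
    spectrum = smooth(dft_magnitude(fcgs2_signal(seq)), width)
    peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
    return {
        "peak_bin": peak_bin,
        "peak_value": spectrum[peak_bin],
        "mean_value": sum(spectrum) / len(spectrum),
    }
```

Feature vectors of this kind, one per sequence, would then be passed to an SVM classifier; the abstract does not state the kernel, the exact feature set, or how the class imbalance was handled, so those choices are left open here.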

Details

Language :
English
ISSN :
01400118
Volume :
57
Issue :
10
Database :
Academic Search Index
Journal :
Medical & Biological Engineering & Computing
Publication Type :
Academic Journal
Accession number :
139126448
Full Text :
https://doi.org/10.1007/s11517-019-02027-5