1. Éclair—a web service for unravelling species origin of sequences sampled from mixed host interfaces
- Author
-
Igor V. Tetko and Stephen Rudd
- Subjects
0106 biological sciences ,Sequence analysis ,Computational biology ,Biology ,Bioinformatics ,01 natural sciences ,Genome ,Article ,Host-Parasite Interactions ,03 medical and health sciences ,User-Computer Interface ,Species Specificity ,Artificial Intelligence ,Complementary DNA ,Genetics ,Software Review ,Genomic library ,Symbiosis ,030304 developmental biology ,Sequence (medicine) ,Gene Library ,Expressed Sequence Tags ,0303 health sciences ,Expressed sequence tag ,Internet ,cDNA library ,Computational Biology ,Sequence Analysis, DNA ,Classification ,Identification (biology) ,Disease Susceptibility ,Software ,010606 plant biology & botany - Abstract
The identification of the genes that participate at the biological interface of two species remains critical to our understanding of the mechanisms of disease resistance, disease susceptibility and symbiosis. The sequencing of complementary DNA (cDNA) libraries prepared from the biological interface between two organisms provides an inexpensive way to identify the novel genes that may be expressed as a cause or consequence of compatible or incompatible interactions. Sequence classification and annotation of species origin typically use an orthology-based approach and require access to large portions of either genome, or a close relative. Novel species- or clade-specific sequences may have no counterpart within existing databases and remain ambiguous features. Here we present a web-service, Eclair, which utilizes support vector machines for the classification of the origin of expressed sequence tags stemming from mixed host cDNA libraries. In addition to providing an interface for the classification of sequences, users are presented with the opportunity to train a model to suit their preferred species pair. Eclair is freely available at http://eclair.btk.fi.
- Published
- 2005