Back to Search
Start Over
Active and semi-supervised data domain description
- Source :
- Görnitz, N, Kloft, M & Brefeld, U 2009, Active and semi-supervised data domain description . in Machine Learning and Knowledge Discovery in Databases . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 5781 LNAI, pp. 407-422, European Conference, ECML PKDD 2009, Bled, Slovenia, 07.09.09 . DOI: 10.1007/978-3-642-04180-8_44, Görnitz, N, Kloft, M & Brefeld, U 2009, Active and semi-supervised data domain description . in W Buntine, M Grobelnik, D Mladenic & J Shawe-Taylor (eds), Machine Learning and Knowledge Discovery in Databases : European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 5781 LNAI, Springer, Berlin, Heidelberg, pp. 407-422, European Conference on Machine Learning and Knowledge Discovery in Databases-2009, Bled, Slovenia, 07.09.09 . https://doi.org/10.1007/978-3-642-04180-8_44
- Publication Year :
- 2009
-
Abstract
- Data domain description techniques aim at deriving concise descriptions of objects belonging to a category of interest. For instance, the support vector domain description (SVDD) learns a hypersphere enclosing the bulk of provided unlabeled data such that points lying outside of the ball are considered anomalous. However, relevant information such as expert and background knowledge remain unused in the unsupervised setting. In this paper, we rephrase data domain description as a semi-supervised learning task, that is, we propose a semi-supervised generalization of data domain description (SSSVDD) to process unlabeled and labeled examples. The corresponding optimization problem is non-convex. We translate it into an unconstraint, continuous problem that can be optimized accurately by gradient-based techniques. Furthermore, we devise an effective active learning strategy to query low-confidence observations. Our empirical evaluation on network intrusion detection and object recognition tasks shows that our SSSVDDs consistently outperform baseline methods in relevant learning settings. Data domain description techniques aim at deriving concise descriptions of objects belonging to a category of interest. For instance, the support vector domain description (SVDD) learns a hypersphere enclosing the bulk of provided unlabeled data such that points lying outside of the ball are considered anomalous. However, relevant information such as expert and background knowledge remain unused in the unsupervised setting. In this paper, we rephrase data domain description as a semi-supervised learning task, that is, we propose a semi-supervised generalization of data domain description (SSSVDD) to process unlabeled and labeled examples. The corresponding optimization problem is non-convex. We translate it into an unconstraint, continuous problem that can be optimized accurately by gradient-based techniques. Furthermore, we devise an effective active learning strategy to query low-confidence observations. Our empirical evaluation on network intrusion detection and object recognition tasks shows that our SSSVDDs consistently outperform baseline methods in relevant learning settings.
- Subjects :
- Learning settings
Informatics
Continuous problems
Active Learning
Business informatics
Background knowledge
Baseline methods
Data domain description
Empirical evaluations
upport vector domain description
Semi-supervised learning
Gradient based
Network intrusion detection
Unlabeled data
Optimization problems
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Görnitz, N, Kloft, M & Brefeld, U 2009, Active and semi-supervised data domain description . in Machine Learning and Knowledge Discovery in Databases . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 5781 LNAI, pp. 407-422, European Conference, ECML PKDD 2009, Bled, Slovenia, 07.09.09 . DOI: 10.1007/978-3-642-04180-8_44, Görnitz, N, Kloft, M & Brefeld, U 2009, Active and semi-supervised data domain description . in W Buntine, M Grobelnik, D Mladenic & J Shawe-Taylor (eds), Machine Learning and Knowledge Discovery in Databases : European Conference, ECML PKDD 2009, Bled, Slovenia, September 7-11, 2009, Proceedings, Part I . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), no. PART 1, vol. 5781 LNAI, Springer, Berlin, Heidelberg, pp. 407-422, European Conference on Machine Learning and Knowledge Discovery in Databases-2009, Bled, Slovenia, 07.09.09 . https://doi.org/10.1007/978-3-642-04180-8_44
- Accession number :
- edsair.dedup.wf.001..b9ff0e4ccf5a302b2592fe7b09b1e923