Back to Search
Start Over
Interactive textual feature selection for consensus clustering
- Source :
- Repositório Institucional da USP (Biblioteca Digital da Produção Intelectual), Universidade de São Paulo (USP), instacron:USP
- Publication Year :
- 2015
- Publisher :
- Elsevier BV, 2015.
-
Abstract
- An approach to incorporate users' experience into consensus clustering is proposed.The approach relies on interactive feature selection from textual data.We model an additional (high-level) text representation using the selected features.We explore high-level features to improve the consensus clustering accuracy.Our approach is competitive even when only few features are selected by the users. Consensus clustering and interactive feature selection are very useful methods to extract and manage knowledge from texts. While consensus clustering allows the aggregation of different clustering solutions into a single robust clustering solution, the interactive feature selection facilitates the incorporation of the users' experience in the clustering tasks by selecting a set of textual features, i.e., including user's supervision at the term-level. We propose an approach for incorporating interactive textual feature selection into consensus clustering. Experimental results on several text collections demonstrate that our approach significantly improves consensus clustering accuracy, even when only few textual features are selected by the users.
- Subjects :
- Fuzzy clustering
Information retrieval
Brown clustering
business.industry
Computer science
Conceptual clustering
Feature selection
MINERAÇÃO DE DADOS
Set (abstract data type)
Text mining
Artificial Intelligence
Signal Processing
Consensus clustering
Computer Vision and Pattern Recognition
Representation (mathematics)
business
Cluster analysis
Software
Subjects
Details
- ISSN :
- 01678655
- Volume :
- 52
- Database :
- OpenAIRE
- Journal :
- Pattern Recognition Letters
- Accession number :
- edsair.doi.dedup.....f287624cc95364feb45252c001cd9d9c
- Full Text :
- https://doi.org/10.1016/j.patrec.2014.09.008