Back to Search
Start Over
Semi-Supervised Clustering Algorithms for Grouping Scientific Articles
- Source :
- ICCS
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- Creating sessions in scientific conferences consists in grouping papers with common topics taking into account the size restrictions imposed by the conference schedule. Therefore, this problem can be considered as semi-supervised clustering of documents based on their content. This paper aims to propose modifications in traditional clustering algorithms to incorporate size constraints in each cluster. Specifically, two new algorithms are proposed to semi-supervised clustering, based on: binary integer linear programming with cannot-link constraints and a variation of the K-Medoids algorithm, respectively. The applicability of the proposed semi-supervised clustering methods is illustrated by addressing the problem of automatic configuration of conference schedules by clustering articles by similarity. We include experiments, applying the new techniques, over real conferences datasets: ICMLA-2014, AAAI-2013 and AAAI-2014. The results of these experiments show that the new methods are able to solve practical and real problems.
- Subjects :
- 0301 basic medicine
Clustering high-dimensional data
DBSCAN
Fuzzy clustering
Computer science
Correlation clustering
Conceptual clustering
02 engineering and technology
Machine learning
computer.software_genre
Biclustering
03 medical and health sciences
CURE data clustering algorithm
Consensus clustering
0202 electrical engineering, electronic engineering, information engineering
Cluster analysis
General Environmental Science
Brown clustering
k-medoids
business.industry
Constrained clustering
ComputingMethodologies_PATTERNRECOGNITION
030104 developmental biology
Data stream clustering
Canopy clustering algorithm
General Earth and Planetary Sciences
FLAME clustering
020201 artificial intelligence & image processing
Artificial intelligence
Data mining
business
computer
Algorithm
Subjects
Details
- ISSN :
- 18770509
- Volume :
- 108
- Database :
- OpenAIRE
- Journal :
- Procedia Computer Science
- Accession number :
- edsair.doi...........d816c164290a3134b01740e77c78ef8c
- Full Text :
- https://doi.org/10.1016/j.procs.2017.05.206