Back to Search
Start Over
Unsupervised Contextual Keyword Relevance Learning and Measurement using PLSA
- Source :
- 2006 Annual IEEE India Conference.
- Publication Year :
- 2006
- Publisher :
- IEEE, 2006.
-
Abstract
- In this paper, we have developed a probabilistic approach using PLSA for the discovery and analysis of contextual keyword relevance based on the distribution of keywords across a training text corpus. We have shown experimentally, the flexibility of this approach in classifying keywords into different domains based on their context. We have developed a prototype system that allows us to project keyword queries on the loaded PLSA model and returns keywords that are closely correlated. The keyword query is vectorized using the PLSA model in the reduce aspect space and correlation is derived by calculating a dot product. We also discuss the parameters that control PLSA performance including a) number of aspects, b) number of EM iterations c) weighting functions on TDM (pre-weighting). We have estimated the quality through computation of precision-recall scores. We have presented our experiments on PLSA application towards document classification.
- Subjects :
- Text corpus
Probabilistic latent semantic analysis
business.industry
Computer science
Document classification
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
Probabilistic logic
Context (language use)
computer.software_genre
Weighting
ComputingMethodologies_PATTERNRECOGNITION
Unsupervised learning
Relevance (information retrieval)
Artificial intelligence
Data mining
business
computer
Natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2006 Annual IEEE India Conference
- Accession number :
- edsair.doi...........9792ff33c872c4606d5ab025c13a63dd
- Full Text :
- https://doi.org/10.1109/indcon.2006.302787