151. An improved algorithm for the calculation of exact term discrimination values
- Author
- Abdelmoula El-Hamdouchi and Peter Willett
- Subjects
Term Discrimination, Computer science, Search engine indexing, Pattern recognition, Library and Information Sciences, Management Science and Operations Research, Similarity measure, Linear discriminant analysis, Computer Science Applications, Automatic indexing, Media Technology, Trigonometric functions, Artificial intelligence, Document retrieval, Computational linguistics, Information Systems
- Abstract
The term discrimination model provides a means of evaluating indexing terms in automatic document retrieval systems. This article describes an efficient algorithm for the calculation of term discrimination values that may be used when the interdocument similarity measure is the cosine coefficient and when the document representatives have been weighted using one particular term-weighting scheme. The algorithm has an expected running time proportional to Nn² for a collection of N documents, each of which has been assigned an average of n terms.
- Published
- 1988
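
To make the quantity named in the abstract concrete, the following is a minimal brute-force sketch of exact term discrimination values under the cosine coefficient. It is not the article's algorithm, which the abstract does not spell out, and it runs far slower than the Nn² bound quoted there. It assumes the common convention that a term's discrimination value is the collection density (mean pairwise similarity) with the term removed minus the density with the term present, so good discriminators score positive. The function names and the dictionary representation of documents are illustrative assumptions.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine coefficient between two sparse term-weight dicts."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    # An empty document has zero norm; treat its similarity as 0.
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def density(docs):
    """Mean cosine similarity over all distinct document pairs."""
    pairs = list(combinations(docs, 2))
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


def discrimination_values(docs):
    """Assumed convention: DV(t) = density without t - density with t."""
    base = density(docs)
    terms = set().union(*docs)  # iterating a dict yields its keys
    values = {}
    for t in terms:
        # Rebuild the collection with term t deleted from every document.
        reduced = [{k: w for k, w in d.items() if k != t} for d in docs]
        values[t] = density(reduced) - base
    return values
```

In a toy collection where one term occurs in every document and each other term occurs in exactly one, the shared term receives a negative value (removing it makes the documents less alike) and the rare terms receive positive values.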