1. Multivalued Co-Citation Measure Based on Semantic Distance between Co-Cited Papers in a Citing Paper: A Case Study Focused on Enumeration of Citations.
- Author
-
Eto, Masaki
- Subjects
- *
CITATION analysis , *SEMANTICS , *CHARTERS , *INFORMATION retrieval , *DOCUMENTATION , *COLLECTIONS - Abstract
Purpose: One typical document retrieval method is to use to co-citation. The method is based on the premise that the degree of similarity among co-cited papers is equal in a particular paper. The degree is calculated with binary values: "co-cited" or "not-cited". To improve upon this method, the author proposes a multivalued co-citation measure based on semantic distance between co-cited papers. Methods: To determine the distance between citations, the author measured two machine parseable relationships (location and citing words) between places where papers are cited. In order to evaluate the proposed method, we identified two categories of co-citation: a group with strong relationships indicating "enumerated co-citation" (papers cited within one statement) and a group with weak relationships showing "non enumerated co-citation". Similarities within each group were calculated and compared using the CiteSeer dataset and 6 major similarity indicators. Results: All of the similarity indicators showed that degree of "enumerated co-citation" is higher than "non enumerated co-citation". Consequently, it became clear that the proposed co-citation measure can be used to distinguish the strength of co-citation more precisely and the it can be applied to large-scale document collections. [ABSTRACT FROM AUTHOR]
- Published
- 2007