1. STATISTICAL TECHNIQUES FOR TEXT CLASSIFICATION BASED ON WORD RECURRENCE INTERVALS
- Author
-
Andrew Allison, Matthew J. Berryman, Derek Abbott, Berryman, Matthew John, Allison, Andrew, and Abbott, D
- Subjects
business.industry ,Computer science ,Applied Mathematics ,General Mathematics ,Stylography ,text authorship ,statistics of text ,keyword extraction ,Keyword extraction ,General Physics and Astronomy ,Interval (mathematics) ,computer.software_genre ,Set (abstract data type) ,Statistical analysis ,Artificial intelligence ,business ,computer ,Natural language processing ,Word (computer architecture) ,Hebrews - Abstract
We present a method for characterizing text based on a statistical analysis of word recurrence interval. This method can be used for extracting keywords from text, and also for comparing texts by an unknown author against a set of known authors. We also use these methods to comment on the controversial question of who wrote the letter to the Hebrews in the New Testament.
- Published
- 2003
- Full Text
- View/download PDF