Back to Search Start Over

Shakespeare Machine: New AI-Based Technologies for Textual Analysis

Authors :
Ehrett, Carl
Ghita, Lucian
Ranwala, Dillon
Menezes, Alison
Source :
Literary and Linguistic Computing; June 2024, Vol. 39 Issue: 2 p522-531, 10p
Publication Year :
2024

Abstract

This article demonstrates a method using tools from the field of Natural Language Processing (NLP) to aid in analyzing theatrical texts and similar works. The method deploys pre-trained large language model neural networks to gather metadata for a text that is amenable to downstream statistical analyses surfacing patterns of interest in character dialogue. We specifically focus on Shakespeare’s works, collecting metadata in the form of sentiment and emotion scores for each line of his plays. In addition to sentiment and emotion scores produced by NLP models, we also directly gather metadata such as genre, line length, and character gender. We show how these metadata may be used to illuminate a number of interesting patterns in Shakespearean character which may be difficult to detect from a direct reading of the texts. We use these metadata to expose statistically significant relationships in Shakespeare between character gender and the emotional content of that character’s dialogue, controlling for genre. We also present here the publicly available dataset that we have compiled to perform these analyses. The data collects text from Shakespeare’s plays along with a variety of metadata useful for this and other forms of analysis of Shakespeare’s works. The methodology demonstrated here may be extended to other varieties of metadata provided by large NLP models.

Details

Language :
English
ISSN :
02681145 and 14774615
Volume :
39
Issue :
2
Database :
Supplemental Index
Journal :
Literary and Linguistic Computing
Publication Type :
Periodical
Accession number :
ejs66662823
Full Text :
https://doi.org/10.1093/llc/fqae021