Linguistic annotation of Byzantine book epigrams.
- Source :
- Language Resources & Evaluation. Dec2023, p1-26.
- Publication Year :
- 2023
Abstract
- In this paper, we explore the feasibility of developing a part-of-speech tagger for non-normalised Byzantine Greek epigrams. To this end, we compared three different transformer-based models with embedding representations, which were then fine-tuned on a fine-grained part-of-speech tagging task. To train the language models, we compiled two data sets: the first consisting of Ancient and Byzantine Greek texts, the second of Ancient, Byzantine and Modern Greek. This allowed us to ascertain whether Modern Greek contributes to the modelling of Byzantine Greek. For the supervised task of part-of-speech tagging, we collected a training set of existing, annotated (Ancient) Greek texts. For evaluation, a gold standard containing 10,000 tokens of unedited Byzantine Greek poems was manually annotated and validated through an inter-annotator agreement study. The experimental results look very promising, with the BERT model trained on all Greek data achieving the best performance for fine-grained part-of-speech tagging. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 1574020X
- Database :
- Academic Search Index
- Journal :
- Language Resources & Evaluation
- Publication Type :
- Academic Journal
- Accession number :
- 174195533
- Full Text :
- https://doi.org/10.1007/s10579-023-09703-x