1. D-Terminer: Online Demo for Monolingual and Bilingual Automatic Term Extraction
- Author
-
Rigouts Terryn, Ayla, Hoste, Veronique, Lefever, Els, Costa, Rute, Carvalho, Sara, Ostroski Anic, Ana, and Khan, Anas Fahad
- Subjects
ComputingMethodologies_PATTERNRECOGNITION ,automatic term extraction ,InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL ,multilingual term extraction ,terminology ,term extraction ,demo ,Languages and Literatures - Abstract
This contribution presents D-Terminer: an open access, online demo for monolingual and multilingual automatic term extraction from parallel corpora. The monolingual term extraction is based on a recurrent neural network, with a supervised methodology that relies on pretrained embeddings. Candidate terms can be tagged in their original context and there is no need for a large corpus, as the methodology will work even for single sentences. With the bilingual term extraction from parallel corpora, potentially equivalent candidate term pairs are extracted from translation memories and manual annotation of the results shows that good equivalents are found for most candidate terms. Accompanying the release of the demo is an updated version of the ACTER Annotated Corpora for Term Extraction Research (version 1.5). ispartof: pages:33-40 ispartof: Proceedings of the Workshop on Terminology in the 21st Century : Many Faces, Many Places, co-located with the LREC 2022 conference pages:33-40 ispartof: Workshop on Terminology in the 21st Century : Many Faces, Many Places, co-located with the LREC 2022 conference location:Marseille, France date:20 Jun - 25 Jun 2022 status: published
- Published
- 2022