User-centred Views on Terminology Extraction Tools: Usage Scenarios and Integration into MT and CAT Tools

Authors :
Blancafort, Helena
Heid, Ulrich
Gornostay, Tatiana
Méchoulam, Claude
Daille, Béatrice
Sharoff, Serge
Syllabs
IMS, University of Stuttgart
University of Stuttgart
TILDE
SOGITEC
SOGITEC Industries
TALN
Laboratoire d'Informatique de Nantes Atlantique (LINA)
Mines Nantes-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS)
University of Leeds
European Project: 248005,EC:FP7:ICT,FP7-ICT-2009-4,TTC(2010)
Traitement Automatique du Langage Naturel (LS2N - équipe TALN )
Laboratoire des Sciences du Numérique de Nantes (LS2N)
Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique
Institut Mines-Télécom [Paris] (IMT)
Source :
Actes du colloque Tralogy : Anticiper les technologies pour la traduction, Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l'avenir ?, Mar 2011, Paris, France. 10 p
Publication Year :
2011
Publisher :
HAL CCSD, 2011.

Abstract

http://lodel.irevues.inist.fr/tralogy/index.php?id=91

This paper presents usage scenarios for the platform being developed within the TTC project (Terminology Extraction, Translation Tools and Comparable Corpora), along with initial feedback from potential users. The TTC project aims at leveraging translation tools, computer-assisted translation (CAT) tools, and terminology management tools by automatically generating bilingual terminologies from comparable corpora in several languages of the European Union (English, French, German, Latvian and Spanish), as well as in Chinese and Russian. The TTC platform includes a web crawler and a corpus management tool, as well as tools for monolingual term extraction, bilingual terminology alignment, online terminology management, and terminology export into CAT tools and machine translation (MT) systems. The paper focuses on the language activities to be carried out with the TTC tools, on the availability of the required language resources and linguistic knowledge, and on the different user profiles and their needs. To identify those needs, we discuss the results of an online questionnaire-based survey on terminology and corpora issues conducted in the translation and localization industry. We then present the envisaged usage scenarios and the first feedback from potential users, and outline the expected TTC inputs and outputs. Finally, because the amount of available data and resources will not be the same for all languages, we discuss technical solutions for achieving language coverage: the TTC tools will offer different approaches depending on the amount and type of linguistic knowledge available.

Details

Language :
English
Database :
OpenAIRE
Journal :
Actes du colloque Tralogy : Anticiper les technologies pour la traduction, Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l'avenir ?, Mar 2011, Paris, France. 10 p
Accession number :
edsair.dedup.wf.001..c401d1bf3073be7360631a438ed82a56