1. Automatic annotation of medical reports using SNOMED-CT: a flexible approach based on medical knowledge databases
- Author
-
UCL - SSH/ILC/PLIN - Pôle de recherche en linguistique, UCL - SSH/ILC - Institut Langage et Communication, UCL - SSH/TALN - Centre de traitement automatique du langage, De Meyere, Damien, Klein, Thierry, François, Thomas, Debongnie, Jean-Claude, Radulescu, Cristina, Mbengo, Nicole, Ouro Koura, Maliki, Coppieters 't Wallant, Yves, Fairon, Cédrick, 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, UCL - SSH/ILC/PLIN - Pôle de recherche en linguistique, UCL - SSH/ILC - Institut Langage et Communication, UCL - SSH/TALN - Centre de traitement automatique du langage, De Meyere, Damien, Klein, Thierry, François, Thomas, Debongnie, Jean-Claude, Radulescu, Cristina, Mbengo, Nicole, Ouro Koura, Maliki, Coppieters 't Wallant, Yves, Fairon, Cédrick, and 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics
- Abstract
This paper presents a rule-based method for the detection and normalization of medical entities using SNOMED-CT which, although based on knowledge stored in terminological resources, allows some flexibility in order to account for the language variation typical of medical texts. Our system is based on the software Unitex and is one of the few to code French medical texts with SNOMED-CT concept identifiers. Our evaluation quantifies the benefits of such a flexible approach, but also emphasizes terminological resource shortcomings for the processing of medical reports written in French. Finally, our methodology is an interesting alternative to supervised training, as the extraction rules require limited development.
- Published
- 2015