1. Methodology of event extraction from unstructured medical texts on the example of the Russian language.
- Author
-
Isakov, Tim and Kovalchuk, Sergey
- Subjects
DEEP learning ,RUSSIAN language ,NATURAL language processing ,MACHINE learning ,PUBLIC domain (Copyright law) - Abstract
Processing and structuring medical texts today are still difficult tasks due to specific terminology and abbreviations, a large number of errors and a lack of labeled data for high-quality model training. One of the important tasks is event extraction from unstructured texts. The most efficient event extraction methods to date are deep learning methods, however, they require labeled data to train them. Manual marking is labor-intensive, and expert knowledge is required for high-quality marking. A good alternative is automated pre-labeling. In the public domain, it is difficult to find works on preliminary data markup for the task of extracting events. In this paper, we propose a methodology for rude event extraction based on rules that evaluate the elements of a syntax tree. This methodology can also be used for pre-labeling data for training machine learning and deep learning models. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF