Back to Search Start Over

Application of text mining technologies in Russian language for solving the problems of primary financial monitoring

Authors :
R.A. Bessonov
Mikhail Ivanov
I.V. Osliakova
D. Yu. Kupriyanov
V.Yu. Radygin
Source :
BICA
Publication Year :
2021
Publisher :
Elsevier BV, 2021.

Abstract

The paper deals with the issues of text mining based on a vector classification model. The term frequency and inverse document frequency function (TF IDF) is used as a measure to evaluate the selection of terms. The text preprocessing stage performs tokenization and normalization of the text. The algorithms of stemming and lemmatization of the Russian-language text are used during normalization. A set of programs has been created that makes it possible to analyze the selected subject area, create a thesaurus for thematic search for violations of the public procurement Federal Law of the Russian Federation, as well as to implement functionality for automated search and analysis of documents of arbitration courts within the framework of this law.

Details

ISSN :
18770509
Volume :
190
Database :
OpenAIRE
Journal :
Procedia Computer Science
Accession number :
edsair.doi...........1b8d30dea9d1cf72ddbf4df9d372ef0d
Full Text :
https://doi.org/10.1016/j.procs.2021.06.078