51. Semantic super networks: A case analysis of Wikipedia papers
- Author
-
Taisiya Lebedeva, Alexander Goritov, and Evgeny Kostyuchenko
- Subjects
Power graph analysis ,Theoretical computer science ,Parsing ,business.industry ,Computer science ,Graph based ,Library science ,computer.software_genre ,Semantic network ,Encyclopedia ,Graph (abstract data type) ,The Internet ,business ,computer ,Case analysis - Abstract
An algorithm for constructing super-large semantic networks has been developed in current work. Algorithm was tested using the “Cosmos” category of the Internet encyclopedia “Wikipedia” as an example. During the implementation, a parser for the syntax analysis of Wikipedia pages was developed. A graph based on list of articles and categories was formed. On the basis of the obtained graph analysis, algorithms for finding domains of high connectivity in a graph were proposed and tested. Algorithms for constructing a domain based on the number of links and the number of articles in the current subject area is considered. The shortcomings of these algorithms are shown and explained, an algorithm is developed on their joint use. The possibility of applying a combined algorithm for obtaining the final domain is shown. The problem of instability of the received domain was discovered when starting an algorithm from two neighboring vertices related to the domain.
- Published
- 2017