1. Open Data Categorization Based on Formal Concept Analysis
- Author
-
Milena Frtunić Gligorijević, Leonid Stoimenov, Nataša Veljković, and Miloš Bogdanović
- Subjects
Open government ,Information retrieval ,Conceptualization ,Computer science ,business.industry ,Data structure ,Computer Science Applications ,Human-Computer Interaction ,Metadata ,Open data ,Categorization ,Knowledge base ,Computer Science (miscellaneous) ,Formal concept analysis ,business ,Information Systems - Abstract
Government institutions have released a large number of datasets on their open data portals, which are in line with the data transparency and open government initiatives. With the purpose of making it more accessible and visible, these portals categorize datasets based on different criteria like publishers, categories, formats, and descriptions. However, some of this information is often missing, making it impossible to find datasets in all of these ways. As a result, with the number of datasets growing further on the portals, it is getting harder to obtain the desired information. This paper addresses this issue by introducing EODClassifier framework that suggests the best match for the category where a dataset should belong to. It relies on formal concept analysis as a means to generate a data structure that will reveal shared conceptualization originating from tags' usage and utilize it as a knowledge base to categorize uncategorized open datasets.
- Published
- 2021
- Full Text
- View/download PDF