Journal: information sciences / Publisher: elsevier science inc. / Topic: computer science - Searchworks@Jio Institute Digital Library Search Results

1. Data quality measures based on granular computing for multi-label classification

Author: Koen Vanhoof, Marilyn Bello, Rafael Bello, Gonzalo Nápoles, and Cognitive Science & AI
Subjects: Information Systems and Management, Computer science, media_common.quotation_subject, Multi-label classification, 02 engineering and technology, computer.software_genre, Theoretical Computer Science, Data quality measures, Consistency (database systems), Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Quality (business), Granular computing, media_common, Rough set theory, 05 social sciences, 050301 education, Class (biology), Computer Science Applications, Control and Systems Engineering, Data quality, 020201 artificial intelligence & image processing, Data mining, Rough set, 0503 education, computer, Software
Abstract: Rough set theory is a granular computing formalism that allows analyzing a given dataset through well-defined measures. Some of these measures aim to characterize datasets used to discover knowledge, mostly in traditional classification problems. Measuring the data quality is pivotal to estimate beforehand the problem’s difficulty since a classification model’s accuracy heavily depends on the data quality. However, to the best of our knowledge, there are no measures devoted to analyzing the quality of multi-label datasets. In this paper, we propose six data quality measures for multi-label problems, which are based on different granular approaches. Some of these measures redefine the decision class concept, while others redefine the consistency concept. Moreover, we study the impact of the similarity threshold parameters and the distance functions on the behavior of these measures. The numerical simulations show a statistical correlation between the measures that redefine the consistency concept and the performance of the ML- k NN classifier .
Published: 2021

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

1 results

1. Data quality measures based on granular computing for multi-label classification

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Language

Database

1 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources