Back to Search
Start Over
SELECTING EFFECTIVE FEATURES AND RELATIONS FOR EFFICIENT MULTI-RELATIONAL CLASSIFICATION
- Source :
- Computational Intelligence. 26:258-281
- Publication Year :
- 2010
- Publisher :
- Wiley, 2010.
-
Abstract
- Feature selection is an essential data processing step to remove irrelevant and redundant attributes for shorter learning time, better accuracy, and better comprehensibility. A number of algorithms have been proposed in both data mining and machine learning areas. These algorithms are usually used in a single table environment, where data are stored in one relational table or one flat file. They are not suitable for a multi-relational environment, where data are stored in multiple tables joined to one another by semantic relationships. To address this problem, in this article, we propose a novel approach called FARS to conduct both Feature And Relation Selection for efficient multi-relational classification. Through this approach, we not only extend the traditional feature selection method to select relevant features from multi-relations, but also develop a new method to reconstruct the multi-relational database schema and eliminate irrelevant tables to improve classification performance further. The results of the experiments conducted on both real and synthetic databases show that FARS can effectively choose a small set of relevant features, thereby enhancing classification efficiency and prediction accuracy significantly.
Details
- ISSN :
- 08247935
- Volume :
- 26
- Database :
- OpenAIRE
- Journal :
- Computational Intelligence
- Accession number :
- edsair.doi...........cd1189e9be4183ce9fdca4d4876340d6
- Full Text :
- https://doi.org/10.1111/j.1467-8640.2010.00359.x