Back to Search
Start Over
On the discovery of significant statistical quantitative rules
- Source :
- KDD
- Publication Year :
- 2004
- Publisher :
- ACM, 2004.
-
Abstract
- In this paper we study market share rules, rules that have a certain market share statistic associated with them. Such rules are particularly relevant for decision making from a business perspective. Motivated by market share rules, in this paper we consider statistical quantitative rules (SQ rules) that are quantitative rules in which the RHS can be any statistic that is computed for the segment satisfying the LHS of the rule. Building on prior work, we present a statistical approach for learning all significant SQ rules, i.e., SQ rules for which a desired statistic lies outside a confidence interval computed for this rule. In particular we show how resampling techniques can be effectively used to learn significant rules. Since our method considers the significance of a large number of rules in parallel, it is susceptible to learning a certain number of "false" rules. To address this, we present a technique that can determine the number of significant SQ rules that can be expected by chance alone, and suggest that this number can be used to determine a "false discovery rate" for the learning procedure. We apply our methods to online consumer purchase data and report the results.
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
- Accession number :
- edsair.doi...........53a1f9c33204953cec2187eef47aa66b
- Full Text :
- https://doi.org/10.1145/1014052.1014094