Back to Search
Start Over
Subjectively Interesting Component Analysis:Data Projections that Contrast with Prior Expectations
- Source :
- Kang, B, Lijffijt, J, Santos-Rodriguez, R & De Bie, T 2016, Subjectively Interesting Component Analysis : Data Projections that Contrast with Prior Expectations . in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining-KDD '16 . vol. August 2016, Association for Computing Machinery (ACM), New York, NY, USA, pp. 1615-1624, ACM KDD 2016, San Francisco, United States, 13/08/16 . https://doi.org/10.1145/2939672.2939840, KDD'16 : PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD
- Publication Year :
- 2016
- Publisher :
- Association for Computing Machinery (ACM), 2016.
-
Abstract
- Methods that find insightful low-dimensional projections are essential to effectively explore high-dimensional data. Principal Component Analysis is used pervasively to find low dimensional projections, not only because it is straightforward to use, but it is also often effective, because the variance in data is often dominated by relevant structure. However, even if the projections highlight real structure in the data, not all structure is interesting to every user. If a user is already aware of, or not interested in the dominant structure, Principal Component Analysis is less effective for finding interesting components. We introduce a new method called Subjectively Interesting Component Analysis (SICA), designed to find data projections that are subjectively interesting, i.e, projections that truly surprise the end-user. It isrooted in information theory and employs an explicit model of a user's prior expectations about the data. The corresponding optimization problem is a simple eigenvalue problem, and the result is a trade-o between explained variance and novelty. We present five case studies on synthetic data, images, time-series, and spatial data, to illustrate how SICA enables users to find (subjectively) interesting projections.
- Subjects :
- Technology and Engineering
Information Theory
02 engineering and technology
Information theory
Machine learning
computer.software_genre
Synthetic data
FACE RECOGNITION
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Spatial analysis
Dimensionality Reduction
Subjective Interestingness
Mathematics
Structure (mathematical logic)
business.industry
Exploratory Data Mining
Dimensionality reduction
Contrast (statistics)
Variance (accounting)
REDUCTION
Principal component analysis
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
Subjects
Details
- Language :
- English
- ISBN :
- 978-1-4503-4232-2
- ISBNs :
- 9781450342322
- Database :
- OpenAIRE
- Journal :
- Kang, B, Lijffijt, J, Santos-Rodriguez, R & De Bie, T 2016, Subjectively Interesting Component Analysis : Data Projections that Contrast with Prior Expectations . in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining-KDD '16 . vol. August 2016, Association for Computing Machinery (ACM), New York, NY, USA, pp. 1615-1624, ACM KDD 2016, San Francisco, United States, 13/08/16 . https://doi.org/10.1145/2939672.2939840, KDD'16 : PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD
- Accession number :
- edsair.doi.dedup.....003df44db499003a864cceda845d20b7