Back to Search Start Over

Toward a generic representation of random variables for machine learning.

Authors :
Donnat, Philippe
Marti, Gautier
Very, Philippe
Source :
Pattern Recognition Letters. Jan2016, Vol. 70, p24-31. 8p.
Publication Year :
2016

Abstract

This paper presents a pre-processing and a distance which improve the performance of machine learning algorithms working on independent and identically distributed stochastic processes. We introduce a novel non-parametric approach to represent random variables which splits apart dependency and distribution without losing any information. We also propound an associated metric leveraging this representation and its statistical estimate. Besides experiments on synthetic datasets, the benefits of our contribution is illustrated through the example of clustering financial time series, for instance prices from the credit default swaps market. Results are available on the website http://www.datagrapple.com and an IPython Notebook tutorial is available at http://www.datagrapple.com/Tech for reproducible research. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01678655
Volume :
70
Database :
Academic Search Index
Journal :
Pattern Recognition Letters
Publication Type :
Academic Journal
Accession number :
112675784
Full Text :
https://doi.org/10.1016/j.patrec.2015.11.004