Back to Search
Start Over
Probabilistic modelling of general noisy multi-manifold data sets
- Source :
- ARTIFICIAL INTELLIGENCE, Artificial Intelligence
- Publication Year :
- 2022
-
Abstract
- The intrinsic nature of noisy and complex data sets is often concealed in low-dimensional structures embedded in a higher dimensional space. Number of methodologies have been developed to extract and represent such structures in the form of manifolds (i.e. geometric structures that locally resemble continuously deformable intervals of R-j1). Usually apriori knowledge of the manifold's intrinsic dimensionality is required. Additionally, their performance can often be hampered by the presence of a significant high-dimensional noise aligned along the low-dimensional core manifold. In real-world applications, the data can contain several low-dimensional structures of different dimensionalities. We propose a framework for dimensionality estimation and reconstruction of multiple noisy manifolds embedded in a noisy environment. To the best of our knowledge, this work represents the first attempt at detection and modelling of a set of coexisting general noisy manifolds by uniting two aspects of multi-manifold learning: the recovery and approximation of core noiseless manifolds and the construction of their probabilistic models. The easy-to-understand hyper-parameters can be manipulated to obtain an emerging picture of the multi-manifold structure of the data. We demonstrate the workings of the framework on two synthetic data sets, presenting challenging features for state-of-the-art techniques in Multi-Manifold learning. The first data set consists of multiple sampled noisy manifolds of different intrinsic dimensionalities, such as Mobius strip, toroid and spiral arm. The second one is a topologically complex set of three interlocked toroids. Given the absence of such unified methodologies in the literature, the comparison with existing techniques is organized along the two separate aspects of our approach mentioned above, namely manifold approximation and probabilistic modelling. The framework is then applied to a complex data set containing simulated gas volume particles from a particle simulation of a dwarf galaxy interacting with its host galaxy cluster. Detailed analysis of the recovered 1D and 2D manifolds can help us to understand the nature of Star Formation in such complex systems. (C) 2021 The Author(s). Published by Elsevier B.V.
- Subjects :
- Linguistics and Language
Technology and Engineering
Computer science
Complex system
02 engineering and technology
Space (mathematics)
01 natural sciences
Language and Linguistics
Set (abstract data type)
Riemannian manifolds
Artificial Intelligence
0103 physical sciences
0202 electrical engineering, electronic engineering, information engineering
EIGENMAPS
NONLINEAR DIMENSIONALITY REDUCTION
Dimensionality estimation
010303 astronomy & astrophysics
Latent variable models
Complex data type
Probabilistic modelling
Probabilistic logic
Manifold
Data set
Density estimation
Physics and Astronomy
GTM
020201 artificial intelligence & image processing
Multi-manifold Learning
Algorithm
Generative topographic mapping
Curse of dimensionality
Subjects
Details
- Language :
- English
- ISSN :
- 00043702 and 18727921
- Database :
- OpenAIRE
- Journal :
- ARTIFICIAL INTELLIGENCE, Artificial Intelligence
- Accession number :
- edsair.doi.dedup.....df5c3998d24f5c6e224fd57aabcdc057