1. Resampling and averaging coordinates on data
- Author
-
Blumberg, Andrew J., Carriere, Mathieu, Fung, Jun Hou, and Mandell, Michael A.
- Subjects
Statistics - Machine Learning ,Computer Science - Computational Geometry ,Computer Science - Machine Learning - Abstract
We introduce algorithms for robustly computing intrinsic coordinates on point clouds. Our approach relies on generating many candidate coordinates by subsampling the data and varying hyperparameters of the embedding algorithm (e.g., manifold learning). We then identify a subset of representative embeddings by clustering the collection of candidate coordinates and using shape descriptors from topological data analysis. The final output is the embedding obtained as an average of the representative embeddings using generalized Procrustes analysis. We validate our algorithm on both synthetic data and experimental measurements from genomics, demonstrating robustness to noise and outliers.
- Published
- 2024