15 results on '"Mankovich, Nathan"'
Search Results
2. Recovering Latent Confounders from High-dimensional Proxy Variables
- Author
-
Mankovich, Nathan, Durand, Homer, Diaz, Emiliano, Varando, Gherardo, and Camps-Valls, Gustau
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
Detecting latent confounders from proxy variables is an essential problem in causal effect estimation. Previous approaches are limited to low-dimensional proxies, sorted proxies, and binary treatments. We remove these assumptions and present a novel Proxy Confounder Factorization (PCF) framework for continuous treatment effect estimation when latent confounders manifest through high-dimensional, mixed proxy variables. For specific sample sizes, our two-step PCF implementation, using Independent Component Analysis (ICA-PCF), and the end-to-end implementation, using Gradient Descent (GD-PCF), achieve high correlation with the latent confounder and low absolute error in causal effect estimation with synthetic datasets in the high sample size regime. Even when faced with climate data, ICA-PCF recovers four components that explain $75.9\%$ of the variance in the North Atlantic Oscillation, a known confounder of precipitation patterns in Europe. Code for our PCF implementations and experiments can be found here: https://github.com/IPL-UV/confound_it. The proposed methodology constitutes a stepping stone towards discovering latent confounders and can be applied to many problems in disciplines dealing with high-dimensional observed proxies, e.g., spatiotemporal fields.
- Published
- 2024
3. Improving generalisation via anchor multivariate analysis
- Author
-
Durand, Homer, Varando, Gherardo, Mankovich, Nathan, and Camps-Valls, Gustau
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,Statistics - Applications ,Statistics - Methodology ,62Hxx - Abstract
We introduce a causal regularisation extension to anchor regression (AR) for improved out-of-distribution (OOD) generalisation. We present anchor-compatible losses, aligning with the anchor framework to ensure robustness against distribution shifts. Various multivariate analysis (MVA) algorithms, such as (Orthonormalized) PLS, RRR, and MLR, fall within the anchor framework. We observe that simple regularisation enhances robustness in OOD settings. Estimators for selected algorithms are provided, showcasing consistency and efficacy in synthetic and real-world climate science problems. The empirical validation highlights the versatility of anchor regularisation, emphasizing its compatibility with MVA approaches and its role in enhancing replicability while guarding against distribution shifts. The extended AR framework advances causal inference methodologies, addressing the need for reliable OOD generalisation., Comment: 21 pages, 15 figures
- Published
- 2024
4. Fun with Flags: Robust Principal Directions via Flag Manifolds
- Author
-
Mankovich, Nathan, Camps-Valls, Gustau, and Birdal, Tolga
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Mathematics - Differential Geometry ,Mathematics - Optimization and Control ,Statistics - Machine Learning - Abstract
Principal component analysis (PCA), along with its extensions to manifolds and outlier contaminated data, have been indispensable in computer vision and machine learning. In this work, we present a unifying formalism for PCA and its variants, and introduce a framework based on the flags of linear subspaces, ie a hierarchy of nested linear subspaces of increasing dimension, which not only allows for a common implementation but also yields novel variants, not explored previously. We begin by generalizing traditional PCA methods that either maximize variance or minimize reconstruction error. We expand these interpretations to develop a wide array of new dimensionality reduction algorithms by accounting for outliers and the data manifold. To devise a common computational approach, we recast robust and dual forms of PCA as optimization problems on flag manifolds. We then integrate tangent space approximations of principal geodesic analysis (tangent-PCA) into this flag-based framework, creating novel robust and dual geodesic PCA variations. The remarkable flexibility offered by the 'flagification' introduced here enables even more algorithmic variants identified by specific flag types. Last but not least, we propose an effective convergent solver for these flag-formulations employing the Stiefel manifold. Our empirical results on both real-world and synthetic scenarios, demonstrate the superiority of our novel algorithms, especially in terms of robustness to outliers on manifolds.
- Published
- 2024
5. Featurizing Koopman Mode Decomposition For Robust Forecasting
- Author
-
Aristoff, David, Copperman, Jeremy, Mankovich, Nathan, and Davies, Alexander
- Subjects
Mathematics - Dynamical Systems ,Mathematical Physics ,Statistics - Machine Learning ,37M22, 37M05, 37M10, 37M15, 37N25 - Abstract
This article introduces an advanced Koopman mode decomposition (KMD) technique -- coined Featurized Koopman Mode Decomposition (FKMD) -- that uses delay embedding and a learned Mahalanobis distance to enhance analysis and prediction of high dimensional dynamical systems. The delay embedding expands the observation space to better capture underlying manifold structure, while the Mahalanobis distance adjusts observations based on the system's dynamics. This aids in featurizing KMD in cases where good features are not a priori known. We show that FKMD improves predictions for a high-dimensional linear oscillator, a high-dimensional Lorenz attractor that is partially observed, and a cell signaling problem from cancer research., Comment: 11 pages, 8 figures
- Published
- 2023
6. Chordal Averaging on Flag Manifolds and Its Applications
- Author
-
Mankovich, Nathan and Birdal, Tolga
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Mathematics - Differential Geometry ,Mathematics - Optimization and Control ,Statistics - Machine Learning - Abstract
This paper presents a new, provably-convergent algorithm for computing the flag-mean and flag-median of a set of points on a flag manifold under the chordal metric. The flag manifold is a mathematical space consisting of flags, which are sequences of nested subspaces of a vector space that increase in dimension. The flag manifold is a superset of a wide range of known matrix spaces, including Stiefel and Grassmanians, making it a general object that is useful in a wide variety computer vision problems. To tackle the challenge of computing first order flag statistics, we first transform the problem into one that involves auxiliary variables constrained to the Stiefel manifold. The Stiefel manifold is a space of orthogonal frames, and leveraging the numerical stability and efficiency of Stiefel-manifold optimization enables us to compute the flag-mean effectively. Through a series of experiments, we show the competence of our method in Grassmann and rotation averaging, as well as principal component analysis. We release our source code under https://github.com/nmank/FlagAveraging., Comment: Appears at ICCV 2023
- Published
- 2023
7. The Flag Median and FlagIRLS
- Author
-
Mankovich, Nathan, King, Emily, Peterson, Chris, and Kirby, Michael
- Subjects
Statistics - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Mathematics - Metric Geometry ,Mathematics - Optimization and Control - Abstract
Finding prototypes (e.g., mean and median) for a dataset is central to a number of common machine learning algorithms. Subspaces have been shown to provide useful, robust representations for datasets of images, videos and more. Since subspaces correspond to points on a Grassmann manifold, one is led to consider the idea of a subspace prototype for a Grassmann-valued dataset. While a number of different subspace prototypes have been described, the calculation of some of these prototypes has proven to be computationally expensive while other prototypes are affected by outliers and produce highly imperfect clustering on noisy data. This work proposes a new subspace prototype, the flag median, and introduces the FlagIRLS algorithm for its calculation. We provide evidence that the flag median is robust to outliers and can be used effectively in algorithms like Linde-Buzo-Grey (LBG) to produce improved clusterings on Grassmannians. Numerical experiments include a synthetic dataset, the MNIST handwritten digits dataset, the Mind's Eye video dataset and the UCF YouTube action dataset. The flag median is compared the other leading algorithms for computing prototypes on the Grassmannian, namely, the $\ell_2$-median and to the flag mean. We find that using FlagIRLS to compute the flag median converges in $4$ iterations on a synthetic dataset. We also see that Grassmannian LBG with a codebook size of $20$ and using the flag median produces at least a $10\%$ improvement in cluster purity over Grassmannian LBG using the flag mean or $\ell_2$-median on the Mind's Eye dataset.
- Published
- 2022
8. Featurizing Koopman mode decomposition for robust forecasting.
- Author
-
Aristoff, David, Copperman, Jeremy, Mankovich, Nathan, and Davies, Alexander
- Subjects
HARMONIC oscillators ,CELL communication ,SYSTEM dynamics ,CANCER research ,A priori - Abstract
This article introduces an advanced Koopman mode decomposition (KMD) technique—coined Featurized Koopman Mode Decomposition (FKMD)—that uses delay embedding and a learned Mahalanobis distance to enhance analysis and prediction of high-dimensional dynamical systems. The delay embedding expands the observation space to better capture underlying manifold structures, while the Mahalanobis distance adjusts observations based on the system's dynamics. This aids in featurizing KMD in cases where good features are not a priori known. We show that FKMD improves predictions for a high-dimensional linear oscillator, a high-dimensional Lorenz attractor that is partially observed, and a cell signaling problem from cancer research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Randomized Extended Kaczmarz is a Limit Point of Sketch-and-Project
- Author
-
Jarman, Benjamin, Mankovich, Nathan, and Moorman, Jacob D.
- Subjects
Mathematics - Numerical Analysis ,15A06, 15B52, 65F10, 65F20, 65Y20, 68Q25, 68W10, 68W20, 68W40 - Abstract
The sketch-and-project (SAP) framework for solving systems of linear equations has unified the theory behind popular projective iterative methods such as randomized Kaczmarz, randomized coordinate descent, and variants thereof. The randomized extended Kaczmarz (REK) method is a popular extension of randomized Kaczmarz for solving inconsistent systems, which has not yet been shown to lie within the SAP framework. In this work we show that, in a certain sense, REK may be expressed as the limit point of a family of SAP methods, but we argue that it is unlikely that REK can be translated into a SAP method itself. We provide an extensive theoretical analysis of the family of methods comprising said limit, including convergence guarantees and further connections to REK. We follow this with an array of experiments demonstrating these methods and their connections in practice., Comment: 18 pages, 14 figures
- Published
- 2021
10. Pathway expression analysis
- Author
-
Mankovich, Nathan, Kehoe, Eric, Peterson, Amy, and Kirby, Michael
- Published
- 2022
- Full Text
- View/download PDF
11. Chordal Averaging on Flag Manifolds and Its Applications
- Author
-
Mankovich, Nathan, primary and Birdal, Tolga, additional
- Published
- 2023
- Full Text
- View/download PDF
12. Module representatives for refining gene co-expression modules
- Author
-
Mankovich, Nathan, primary, Andrews-Polymenis, Helene, additional, Threadgill, David, additional, and Kirby, Michael, additional
- Published
- 2023
- Full Text
- View/download PDF
13. Pathway Expression Analysis
- Author
-
Mankovich, Nathan, primary, Kirby, Michael, additional, Kehoe, Eric, additional, and Peterson, Amy, additional
- Published
- 2022
- Full Text
- View/download PDF
14. The Flag Median and FlagIRLS
- Author
-
Mankovich, Nathan, primary, King, Emily J., additional, Peterson, Chris, additional, and Kirby, Michael, additional
- Published
- 2022
- Full Text
- View/download PDF
15. Searching for a Lost Plane: A Neighborhood-Based Probabilistic Model.
- Author
-
Jay, Melissa, Mankovich, Nathan, and Campbell, Eleanore
- Subjects
TRANSOCEANIC flights ,PROBABILITY theory ,AIR traffic control ,REMOTE-sensing images ,RADIO (Medium) - Abstract
We propose a model that determines the most probable locations of a missing transoceanic flight. Since transoceanic flights are not detected by radar when they are more than 200 miles from a coast, radio communication and the occasional satellite imagery is essential to an effective search plan. Our search plan is built on the last distance in miles on the intended flight path (IFP) where air traffic control heard from the pilot. We create a quadratic distance-to-casualty probability density function to predict the distance fromthe point of last contact to where the plane likely fell. Through trajectories and the possibility of veering off the IFP, we determine a two-dimensional search region and discretize it into smaller areas. The trajectoriesaccount foranymodelof aircraft, since parameterssuch as cruising altitude and glide ratio are assessed. Probabilities of containment (POC) (probability that the lost plane is in the given region) are then assigned to each search region. We next consider the probabilities of detection (POD) during the search. We apply Koopman's equation for the probability of detecting a missing boat, which provides flexibility for the model of a search plane and its instrumentation. Upon determining POD, we create an algorithm to search the bounded area. Our algorithm applies the POD to the POCs and redistributes probabilities if the lost plane is not found in a region while still considering the chance that the plane was actually in the region searched. If the plane is not found in a given cell, the algorithm outputs a neighboring region to search next. This customized neighborhood search plan maximizes resources while remaining time-efficient. An example of a hypothetical missing Transatlantic Flight ABCD is included as a demonstration of our methodology. Our search plan is built on assumptions, include that the IFP is linear and that the plane disassembled only upon impacting the water. Our model is sensitive to specific parameter changes but accommodates different plane types. Strengths of the model include non-uniform probability distributions modeling distance to casualty and impact and a neighborhood-based, continuouslyupdating search mechanism. [ABSTRACT FROM AUTHOR]
- Published
- 2015
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.