Author: "Khamaru, Koulik" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Khamaru, Koulik"' showing total 28 results

Start Over Author "Khamaru, Koulik"

28 results on '"Khamaru, Koulik"'

1. Inference with the Upper Confidence Bound Algorithm

Author: Khamaru, Koulik and Zhang, Cun-Hui
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Systems and Control, Mathematics - Statistics Theory
Abstract: In this paper, we discuss the asymptotic behavior of the Upper Confidence Bound (UCB) algorithm in the context of multiarmed bandit problems and discuss its implication in downstream inferential tasks. While inferential tasks become challenging when data is collected in a sequential manner, we argue that this problem can be alleviated when the sequential algorithm at hand satisfies certain stability property. This notion of stability is motivated from the seminal work of Lai and Wei (1982). Our first main result shows that such a stability property is always satisfied for the UCB algorithm, and as a result the sample means for each arm are asymptotically normal. Next, we examine the stability properties of the UCB algorithm when the number of arms $K$ is allowed to grow with the number of arm pulls $T$. We show that in such a case the arms are stable when $\frac{\log K}{\log T} \rightarrow 0$, and the number of near-optimal arms are large., Comment: 17 pages, 1 figure
Published: 2024

2. Informativeness of Weighted Conformal Prediction

Author: Ying, Mufang, Guo, Wenge, Khamaru, Koulik, and Hung, Ying
Subjects: Statistics - Methodology, Statistics - Machine Learning
Abstract: Weighted conformal prediction (WCP), a recently proposed framework, provides uncertainty quantification with the flexibility to accommodate different covariate distributions between training and test data. However, it is pointed out in this paper that the effectiveness of WCP heavily relies on the overlap between covariate distributions; insufficient overlap can lead to uninformative prediction intervals. To enhance the informativeness of WCP, we propose two methods for scenarios involving multiple sources with varied covariate distributions. We establish theoretical guarantees for our proposed methods and demonstrate their efficacy through simulations., Comment: 25 pages
Published: 2024

3. Stochastic Optimization with Constraints: A Non-asymptotic Instance-Dependent Analysis

Author: Khamaru, Koulik
Subjects: Mathematics - Optimization and Control, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We consider the problem of stochastic convex optimization under convex constraints. We analyze the behavior of a natural variance reduced proximal gradient (VRPG) algorithm for this problem. Our main result is a non-asymptotic guarantee for VRPG algorithm. Contrary to minimax worst case guarantees, our result is instance-dependent in nature. This means that our guarantee captures the complexity of the loss function, the variability of the noise, and the geometry of the constraint set. We show that the non-asymptotic performance of the VRPG algorithm is governed by the scaled distance (scaled by $\sqrt{N}$) between the solutions of the given problem and that of a certain small perturbation of the given problem -- both solved under the given convex constraints; here, $N$ denotes the number of samples. Leveraging a well-established connection between local minimax lower bounds and solutions to perturbed problems, we show that as $N \rightarrow \infty$, the VRPG algorithm achieves the renowned local minimax lower bound by H\`{a}jek and Le Cam up to universal constants and a logarithmic factor of the sample size., Comment: 18 pages
Published: 2024

4. Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference

Author: Lin, Licong, Ying, Mufang, Ghosh, Suvrojit, Khamaru, Koulik, and Zhang, Cun-Hui
Subjects: Mathematics - Statistics Theory, Computer Science - Machine Learning
Abstract: Estimation and inference in statistics pose significant challenges when data are collected adaptively. Even in linear models, the Ordinary Least Squares (OLS) estimator may fail to exhibit asymptotic normality for single coordinate estimation and have inflated error. This issue is highlighted by a recent minimax lower bound, which shows that the error of estimating a single coordinate can be enlarged by a multiple of $\sqrt{d}$ when data are allowed to be arbitrarily adaptive, compared with the case when they are i.i.d. Our work explores this striking difference in estimation performance between utilizing i.i.d. and adaptive data. We investigate how the degree of adaptivity in data collection impacts the performance of estimating a low-dimensional parameter component in high-dimensional linear models. We identify conditions on the data collection mechanism under which the estimation error for a low-dimensional parameter component matches its counterpart in the i.i.d. setting, up to a factor that depends on the degree of adaptivity. We show that OLS or OLS on centered data can achieve this matching error. In addition, we propose a novel estimator for single coordinate inference via solving a Two-stage Adaptive Linear Estimating equation (TALE). Under a weaker form of adaptivity in data collection, we establish an asymptotic normality property of the proposed estimator., Comment: This paper is accepted at NeurIPS 2023
Published: 2023

5. Adaptive Linear Estimating Equations

Author: Ying, Mufang, Khamaru, Koulik, and Zhang, Cun-Hui
Subjects: Mathematics - Statistics Theory, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Sequential data collection has emerged as a widely adopted technique for enhancing the efficiency of data gathering processes. Despite its advantages, such data collection mechanism often introduces complexities to the statistical inference procedure. For instance, the ordinary least squares (OLS) estimator in an adaptive linear regression model can exhibit non-normal asymptotic behavior, posing challenges for accurate inference and interpretation. In this paper, we propose a general method for constructing debiased estimator which remedies this issue. It makes use of the idea of adaptive linear estimating equations, and we establish theoretical guarantees of asymptotic normality, supplemented by discussions on achieving near-optimal asymptotic variance. A salient feature of our estimator is that in the context of multi-armed bandits, our estimator retains the non-asymptotic performance of the least square estimator while obtaining asymptotic normality property. Consequently, this work helps connect two fruitful paradigms of adaptive inference: a) non-asymptotic inference using concentration inequalities and b) asymptotic inference via asymptotic normality., Comment: Paper is accepted at NeurIPS 2023
Published: 2023

6. Semi-parametric inference based on adaptively collected data

Author: Lin, Licong, Khamaru, Koulik, and Wainwright, Martin J.
Subjects: Mathematics - Statistics Theory, Computer Science - Machine Learning, Statistics - Methodology, Statistics - Machine Learning
Abstract: Many standard estimators, when applied to adaptively collected data, fail to be asymptotically normal, thereby complicating the construction of confidence intervals. We address this challenge in a semi-parametric context: estimating the parameter vector of a generalized linear regression model contaminated by a non-parametric nuisance component. We construct suitably weighted estimating equations that account for adaptivity in data collection, and provide conditions under which the associated estimates are asymptotically normal. Our results characterize the degree of "explorability" required for asymptotic normality to hold. For the simpler problem of estimating a linear functional, we provide similar guarantees under much weaker assumptions. We illustrate our general theory with concrete consequences for various problems, including standard linear bandits and sparse generalized bandits, and compare with other methods via simulation studies.
Published: 2023

7. Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

Author: Khamaru, Koulik, Xia, Eric, Wainwright, Martin J., and Jordan, Michael I.
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Various algorithms for reinforcement learning (RL) exhibit dramatic variation in their convergence rates as a function of problem structure. Such problem-dependent behavior is not captured by worst-case analyses and has accordingly inspired a growing effort in obtaining instance-dependent guarantees and deriving instance-optimal algorithms for RL problems. This research has been carried out, however, primarily within the confines of theory, providing guarantees that explain \textit{ex post} the performance differences observed. A natural next step is to convert these theoretical guarantees into guidelines that are useful in practice. We address the problem of obtaining sharp instance-dependent confidence regions for the policy evaluation problem and the optimal value estimation problem of an MDP, given access to an instance-optimal algorithm. As a consequence, we propose a data-dependent stopping rule for instance-optimal algorithms. The proposed stopping rule adapts to the instance-specific difficulty of the problem and allows for early termination for problems with favorable structure.
Published: 2022

8. Optimal variance-reduced stochastic approximation in Banach spaces

Author: Mou, Wenlong, Khamaru, Koulik, Wainwright, Martin J., Bartlett, Peter L., and Jordan, Michael I.
Subjects: Mathematics - Statistics Theory, Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: We study the problem of estimating the fixed point of a contractive operator defined on a separable Banach space. Focusing on a stochastic query model that provides noisy evaluations of the operator, we analyze a variance-reduced stochastic approximation scheme, and establish non-asymptotic bounds for both the operator defect and the estimation error, measured in an arbitrary semi-norm. In contrast to worst-case guarantees, our bounds are instance-dependent, and achieve the local asymptotic minimax risk non-asymptotically. For linear operators, contractivity can be relaxed to multi-step contractivity, so that the theory can be applied to problems like average reward policy evaluation problem in reinforcement learning. We illustrate the theory via applications to stochastic shortest path problems, two-player zero-sum Markov games, as well as policy evaluation and $Q$-learning for tabular Markov decision processes.
Published: 2022

9. Near-optimal inference in adaptive linear regression

Author: Khamaru, Koulik, Deshpande, Yash, Lattimore, Tor, Mackey, Lester, and Wainwright, Martin J.
Subjects: Mathematics - Statistics Theory, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: When data is collected in an adaptive manner, even simple methods like ordinary least squares can exhibit non-normal asymptotic behavior. As an undesirable consequence, hypothesis tests and confidence intervals based on asymptotic normality can lead to erroneous results. We propose a family of online debiasing estimators to correct these distributional anomalies in least squares estimation. Our proposed methods take advantage of the covariance structure present in the dataset and provide sharper estimates in directions for which more information has accrued. We establish an asymptotic normality property for our proposed online debiasing estimators under mild conditions on the data collection process and provide asymptotically exact confidence intervals. We additionally prove a minimax lower bound for the adaptive linear regression problem, thereby providing a baseline by which to compare estimators. There are various conditions under which our proposed estimators achieve the minimax lower bound. We demonstrate the usefulness of our theory via applications to multi-armed bandit, autoregressive time series estimation, and active learning with exploration., Comment: 51 pages, 7 figures
Published: 2021

10. Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

Author: Khamaru, Koulik, Xia, Eric, Wainwright, Martin J., and Jordan, Michael I.
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Various algorithms in reinforcement learning exhibit dramatic variability in their convergence rates and ultimate accuracy as a function of the problem structure. Such instance-specific behavior is not captured by existing global minimax bounds, which are worst-case in nature. We analyze the problem of estimating optimal $Q$-value functions for a discounted Markov decision process with discrete states and actions and identify an instance-dependent functional that controls the difficulty of estimation in the $\ell_\infty$-norm. Using a local minimax framework, we show that this functional arises in lower bounds on the accuracy on any estimation procedure. In the other direction, we establish the sharpness of our lower bounds, up to factors logarithmic in the state and action spaces, by analyzing a variance-reduced version of $Q$-learning. Our theory provides a precise way of distinguishing "easy" problems from "hard" ones in the context of $Q$-learning, as illustrated by an ensemble with a continuum of difficulty.
Published: 2021

11. Instability, Computational Efficiency and Statistical Accuracy

Author: Ho, Nhat, Khamaru, Koulik, Dwivedi, Raaz, Wainwright, Martin J., Jordan, Michael I., and Yu, Bin
Subjects: Computer Science - Machine Learning, Mathematics - Statistics Theory, Statistics - Machine Learning
Abstract: Many statistical estimators are defined as the fixed point of a data-dependent operator, with estimators based on minimizing a cost function being an important special case. The limiting performance of such estimators depends on the properties of the population-level operator in the idealized limit of infinitely many samples. We develop a general framework that yields bounds on statistical accuracy based on the interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (in)stability when applied to an empirical object based on $n$ samples. Using this framework, we analyze both stable forms of gradient descent and some higher-order and unstable algorithms, including Newton's method and its cubic-regularized variant, as well as the EM algorithm. We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models. We exhibit cases in which an unstable algorithm can achieve the same statistical accuracy as a stable algorithm in exponentially fewer steps -- namely, with the number of iterations being reduced from polynomial to logarithmic in sample size $n$., Comment: 68 pages, 6 Figures, 2 Tables. First three authors contributed equally
Published: 2020

12. Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

Author: Khamaru, Koulik, Pananjady, Ashwin, Ruan, Feng, Wainwright, Martin J., and Jordan, Michael I.
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, Mathematics - Optimization and Control
Abstract: We address the problem of policy evaluation in discounted Markov decision processes, and provide instance-dependent guarantees on the $\ell_\infty$-error under a generative model. We establish both asymptotic and non-asymptotic versions of local minimax lower bounds for policy evaluation, thereby providing an instance-dependent baseline by which to compare algorithms. Theory-inspired simulations show that the widely-used temporal difference (TD) algorithm is strictly suboptimal when evaluated in a non-asymptotic setting, even when combined with Polyak-Ruppert iterate averaging. We remedy this issue by introducing and analyzing variance-reduced forms of stochastic approximation, showing that they achieve non-asymptotic, instance-dependent optimality up to logarithmic factors., Comment: 38 pages, 3 figures
Published: 2020

13. Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models

Author: Dwivedi, Raaz, Ho, Nhat, Khamaru, Koulik, Wainwright, Martin J., Jordan, Michael I., and Yu, Bin
Subjects: Mathematics - Statistics Theory, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on $n$ i.i.d. samples are known to have lower accuracy than the classical $n^{- \frac{1}{2}}$ error. We investigate whether the Expectation-Maximization (EM) algorithm also converges slowly for these models. We provide a rigorous characterization of EM for fitting a weakly identifiable Gaussian mixture in a univariate setting where we prove that the EM algorithm converges in order $n^{\frac{3}{4}}$ steps and returns estimates that are at a Euclidean distance of order ${ n^{- \frac{1}{8}}}$ and ${ n^{-\frac{1} {4}}}$ from the true location and scale parameter respectively. Establishing the slow rates in the univariate setting requires a novel localization argument with two stages, with each stage involving an epoch-based argument applied to a different surrogate EM operator at the population level. We demonstrate several multivariate ($d \geq 2$) examples that exhibit the same slow rates as the univariate case. We also prove slow statistical rates in higher dimensions in a special case, when the fitted covariance is constrained to be a multiple of the identity., Comment: 30 pages, 4 figures. The first three authors contributed equally to this work. To appear in AISTATS 2020
Published: 2019

14. Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

Author: Malik, Dhruv, Pananjady, Ashwin, Bhatia, Kush, Khamaru, Koulik, Bartlett, Peter L., and Wainwright, Martin J.
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: We study derivative-free methods for policy optimization over the class of linear policies. We focus on characterizing the convergence rate of these methods when applied to linear-quadratic systems, and study various settings of driving noise and reward feedback. We show that these methods provably converge to within any pre-specified tolerance of the optimal policy with a number of zero-order evaluations that is an explicit polynomial of the error tolerance, dimension, and curvature properties of the problem. Our analysis reveals some interesting differences between the settings of additive driving noise and random initialization, as well as the settings of one-point and two-point reward feedback. Our theory is corroborated by extensive simulations of derivative-free methods on these systems. Along the way, we derive convergence rates for stochastic zero-order optimization algorithms when applied to a certain class of non-convex problems., Comment: Version v3 consistent with paper appearing in JMLR
Published: 2018

15. Singularity, Misspecification, and the Convergence Rate of EM

Author: Dwivedi, Raaz, Ho, Nhat, Khamaru, Koulik, Jordan, Michael I., Wainwright, Martin J., and Yu, Bin
Subjects: Mathematics - Statistics Theory, Statistics - Machine Learning, Primary 62F15, 62G05, secondary 62G20
Abstract: A line of recent work has analyzed the behavior of the Expectation-Maximization (EM) algorithm in the well-specified setting, in which the population likelihood is locally strongly concave around its maximizing argument. Examples include suitably separated Gaussian mixture models and mixtures of linear regressions. We consider over-specified settings in which the number of fitted components is larger than the number of components in the true distribution. Such misspecified settings can lead to singularity in the Fisher information matrix, and moreover, the maximum likelihood estimator based on $n$ i.i.d. samples in $d$ dimensions can have a non-standard $\mathcal{O}((d/n)^{\frac{1}{4}})$ rate of convergence. Focusing on the simple setting of two-component mixtures fit to a $d$-dimensional Gaussian distribution, we study the behavior of the EM algorithm both when the mixture weights are different (unbalanced case), and are equal (balanced case). Our analysis reveals a sharp distinction between these two cases: in the former, the EM algorithm converges geometrically to a point at Euclidean distance of $\mathcal{O}((d/n)^{\frac{1}{2}})$ from the true parameter, whereas in the latter case, the convergence rate is exponentially slower, and the fixed point has a much lower $\mathcal{O}((d/n)^{\frac{1}{4}})$ accuracy. Analysis of this singular case requires the introduction of some novel techniques: in particular, we make use of a careful form of localization in the associated empirical process, and develop a recursive argument to progressively sharpen the statistical rate., Comment: 63 pages, 12 figures. The first three authors contributed equally to this work. To appear in Annals of Statistics
Published: 2018

16. Convergence guarantees for a class of non-convex and non-smooth optimization problems

Author: Khamaru, Koulik and Wainwright, Martin J.
Subjects: Statistics - Machine Learning, Computer Science - Learning, Mathematics - Optimization and Control
Abstract: We consider the problem of finding critical points of functions that are non-convex and non-smooth. Studying a fairly broad class of such problems, we analyze the behavior of three gradient-based methods (gradient descent, proximal update, and Frank-Wolfe update). For each of these methods, we establish rates of convergence for general problems, and also prove faster rates for continuous sub-analytic functions. We also show that our algorithms can escape strict saddle points for a class of non-smooth functions, thereby generalizing known results for smooth functions. Our analysis leads to a simplification of the popular CCCP algorithm, used for optimizing functions that can be written as a difference of two convex functions. Our simplified algorithm retains all the convergence properties of CCCP, along with a significantly lower cost per iteration. We illustrate our methods and theory via applications to the problems of best subset selection, robust estimation, mixture density estimation, and shape-from-shading reconstruction., Comment: 50 pages, 2 figures
Published: 2018

17. Computation of the Maximum Likelihood estimator in low-rank Factor Analysis

Author: Khamaru, Koulik and Mazumder, Rahul
Subjects: Mathematics - Optimization and Control, Statistics - Computation, Statistics - Machine Learning
Abstract: Factor analysis, a classical multivariate statistical technique is popularly used as a fundamental tool for dimensionality reduction in statistics, econometrics and data science. Estimation is often carried out via the Maximum Likelihood (ML) principle, which seeks to maximize the likelihood under the assumption that the positive definite covariance matrix can be decomposed as the sum of a low rank positive semidefinite matrix and a diagonal matrix with nonnegative entries. This leads to a challenging rank constrained nonconvex optimization problem. We reformulate the low rank ML Factor Analysis problem as a nonlinear nonsmooth semidefinite optimization problem, study various structural properties of this reformulation and propose fast and scalable algorithms based on difference of convex (DC) optimization. Our approach has computational guarantees, gracefully scales to large problems, is applicable to situations where the sample covariance matrix is rank deficient and adapts to variants of the ML problem with additional constraints on the problem parameters. Our numerical experiments demonstrate the significant usefulness of our approach over existing state-of-the-art approaches., Comment: 22 pages, 4 figures
Published: 2018

18. Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

Author: Xia, Eric, primary, Khamaru, Koulik, additional, Wainwright, Martin J., additional, and Jordan, Michael I., additional
Published: 2024
Full Text: View/download PDF

19. A Peak Synchronization Measure for Multiple Signals

Author: Biswas, Rahul, Khamaru, Koulik, and Majumdar, Kaushik
Subjects: Statistics - Methodology, Statistics - Applications
Abstract: Peaks signify important events in a signal. In a pair of signals how peaks are occurring with mutual correspondence may offer us significant insights into the mutual interdependence between the two signals based on important events. In this work we proposed a novel synchronization measure between two signals, called peak synchronization, which measures the simultaneity of occurrence of peaks in the signals. We subsequently generalized it to more than two signals. We showed that our measure of synchronization is largely independent of the underlying parameter values. A time complexity analysis of the algorithm has also been presented. We applied the measure on intracranial EEG signals of epileptic patients and found that the enhanced synchronization during an epileptic seizure can be modeled better by the new peak synchronization measure than the classical amplitude correlation method.
Published: 2014
Full Text: View/download PDF

20. Guarantees for a few structured statistical problems

Author: Khamaru, Koulik
Subjects: Statistics
Abstract: In recent years, we have seen a tremendous interest in applying statistics and machine learning methods in various areas of science: health, education, drug design, public policy design to name a few. This immense popularity of statistical methods comes with challenging new questions which lie in the boundary of theoretical and methodological aspects of statistics, machine learning and optimization. The aim of this dissertation is to address some of the these challenges that arise in modern reinforcement learning, and in modern data science practice and provide new insights that are helpful to practitioners. The dissertation is divided into four parts. In Part I we discuss principled ways of designing fast algorithms for various reinforcement learning problems. The Part II of the dissertation is devoted to problems that arise due to model misspecification. In Part III we discuss how can we perform inference when the data set is collected in a sequential manner; i.e. the helpful iid structure is not present in the data. Finally, Part IV focuses on deriving fast algorithms for structured non-convex problems.
Published: 2022

21. Computation of the maximum likelihood estimator in low-rank factor analysis

Author: Khamaru, Koulik and Mazumder, Rahul
Published: 2019
Full Text: View/download PDF

22. Instance-Dependent Confidence and Early Stopping for Reinforcement Learning.

Author: Xia, Eric, Khamaru, Koulik, Wainwright, Martin J., and Jordan, Michael I.
Subjects: *OPTIMAL stopping (Mathematical statistics), *MACHINE learning, *CONFIDENCE regions (Mathematics), *MARKOV processes, *REINFORCEMENT learning, *CONFIDENCE
Abstract: Reinforcement learning algorithms are known to exhibit a variety of convergence rates depending on the problem structure. Recent years have witnessed considerable progress in developing theory that is instance-dependent, along with algorithms that achieve such instance-optimal guarantees. However, important questions remain in how to utilize such notions for inferential purposes, or for early stopping, so that data and computational resources can be saved for "easy" problems. This paper develops data-dependent procedures that output instance-dependent confidence regions for evaluating and optimizing policies in a Markov decision process. Notably, our procedures require only black-box access to an instance-optimal algorithm, and re-use the samples used in the estimation algorithm itself. The resulting data-dependent stopping rule adapts instance-specific difficulty of the problem and allows for early termination for problems with favorable structure. We highlight benefit of such early stopping rules via some numerical studies. [ABSTRACT FROM AUTHOR]
Published: 2023

23. Computation of the maximum likelihood estimator in low-rank factor analysis

Author: Sloan School of Management, Massachusetts Institute of Technology. Operations Research Center, Khamaru, Koulik, Mazumder, Rahul, Sloan School of Management, Massachusetts Institute of Technology. Operations Research Center, Khamaru, Koulik, and Mazumder, Rahul
Abstract: Factor analysis is a classical multivariate dimensionality reduction technique popularly used in statistics, econometrics and data science. Estimation for factor analysis is often carried out via the maximum likelihood principle, which seeks to maximize the Gaussian likelihood under the assumption that the positive definite covariance matrix can be decomposed as the sum of a low-rank positive semidefinite matrix and a diagonal matrix with nonnegative entries. This leads to a challenging rank constrained nonconvex optimization problem, for which very few reliable computational algorithms are available. We reformulate the low-rank maximum likelihood factor analysis task as a nonlinear nonsmooth semidefinite optimization problem, study various structural properties of this reformulation; and propose fast and scalable algorithms based on difference of convex optimization. Our approach has computational guarantees, gracefully scales to large problems, is applicable to situations where the sample covariance matrix is rank deficient and adapts to variants of the maximum likelihood problem with additional constraints on the model parameters. Our numerical experiments validate the usefulness of our approach over existing state-of-the-art approaches for maximum likelihood factor analysis.
Published: 2021

24. Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

Author: Khamaru, Koulik, primary, Pananjady, Ashwin, additional, Ruan, Feng, additional, Wainwright, Martin J., additional, and Jordan, Michael I., additional
Published: 2021
Full Text: View/download PDF

25. Singularity, misspecification and the convergence rate of EM

Author: Dwivedi, Raaz, primary, Ho, Nhat, additional, Khamaru, Koulik, additional, Wainwright, Martin J., additional, Jordan, Michael I., additional, and Yu, Bin, additional
Published: 2020
Full Text: View/download PDF

26. Convergence guarantees for a class of non-convex and non-smooth optimization problems.

Author: Khamaru, Koulik and Wainwright, Martin J.
Subjects: *SMOOTHNESS of functions, *SUBSET selection, *CONVEX functions, *CONTINUOUS functions, *SURETYSHIP & guaranty
Abstract: We consider the problem of finding critical points of functions that are non-convex and non- smooth. Studying a fairly broad class of such problems, we analyze the behavior of three gradient-based methods (gradient descent, proximal update, and Frank-Wolfe update). For each of these methods, we establish rates of convergence for general problems, and also prove faster rates for continuous sub-analytic functions. We also show that our algorithms can escape strict saddle points for a class of non-smooth functions, thereby generalizing known results for smooth functions. Our analysis leads to a simplification of the popular CCCP algorithm, used for optimizing functions that can be written as a difference of two convex functions. Our simplified algorithm retains all the convergence properties of CCCP, along with a significantly lower cost per iteration. We illustrate our methods and theory via applications to the problems of best subset selection, robust estimation, mixture density estimation, and shape-from-shading reconstruction. [ABSTRACT FROM AUTHOR]
Published: 2019

27. A Peak Synchronization Measure for Multiple Signals

Author: Biswas, Rahul, primary, Khamaru, Koulik, additional, and Majumdar, Kaushik K., additional
Published: 2014
Full Text: View/download PDF

28. Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems.

Author: Malik, Dhruv, Pananjady, Ashwin, Bhatia, Kush, Khamaru, Koulik, Bartlett, Peter L., and Wainwrighty, Martin J.
Subjects: *LINEAR systems, *RANDOM noise theory, *STOCHASTIC convergence, *SURETYSHIP & guaranty, *PROCESS optimization
Abstract: We study derivative-free methods for policy optimization over the class of linear policies. We focus on characterizing the convergence rate of these methods when applied to linear- quadratic systems, and study various settings of driving noise and reward feedback. Our main theoretical result provides an explicit bound on the sample or evaluation complexity: we show that these methods are guaranteed to converge to within any pre-specified tolerance of the optimal policy with a number of zero-order evaluations that is an explicit polynomial of the error tolerance, dimension, and curvature properties of the problem. Our analysis reveals some interesting differences between the settings of additive driving noise and random initialization, as well as the settings of one-point and two-point reward feedback. Our theory is corroborated by simulations of derivative-free methods in application to these systems. Along the way, we derive convergence rates for stochastic zero-order optimization algorithms when applied to a certain class of non-convex problems. [ABSTRACT FROM AUTHOR]
Published: 2020

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

28 results on '"Khamaru, Koulik"'

1. Inference with the Upper Confidence Bound Algorithm

2. Informativeness of Weighted Conformal Prediction

3. Stochastic Optimization with Constraints: A Non-asymptotic Instance-Dependent Analysis

4. Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference

5. Adaptive Linear Estimating Equations

6. Semi-parametric inference based on adaptively collected data

7. Instance-Dependent Confidence and Early Stopping for Reinforcement Learning

8. Optimal variance-reduced stochastic approximation in Banach spaces

9. Near-optimal inference in adaptive linear regression

10. Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

11. Instability, Computational Efficiency and Statistical Accuracy

12. Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

13. Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models

14. Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

15. Singularity, Misspecification, and the Convergence Rate of EM

16. Convergence guarantees for a class of non-convex and non-smooth optimization problems

17. Computation of the Maximum Likelihood estimator in low-rank Factor Analysis

18. Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning

19. A Peak Synchronization Measure for Multiple Signals

20. Guarantees for a few structured statistical problems

21. Computation of the maximum likelihood estimator in low-rank factor analysis

22. Instance-Dependent Confidence and Early Stopping for Reinforcement Learning.

23. Computation of the maximum likelihood estimator in low-rank factor analysis

24. Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

25. Singularity, misspecification and the convergence rate of EM

26. Convergence guarantees for a class of non-convex and non-smooth optimization problems.

27. A Peak Synchronization Measure for Multiple Signals

28. Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

28 results on '"Khamaru, Koulik"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources