151. A Computational Framework for Multivariate Convex Regression and Its Variants
- Author
Rahul Mazumder, Arkopal Choudhury, Garud Iyengar, Bodhisattva Sen
- Subjects
Statistics and Probability, Multivariate statistics, Regression function, Nonparametric statistics, Quadratic programming, Augmented Lagrangian method, Applied mathematics, Mathematics - Optimization and Control (math.OC), Statistics - Computation (stat.CO), Statistics - Methodology (stat.ME)
We study the nonparametric least squares estimator (LSE) of a multivariate convex regression function. The LSE, given as the solution to a quadratic program with O(n²) linear constraints (n being the sample size), is difficult to compute for large problems. Exploiting problem-specific structure, we propose a scalable algorithmic framework based on the augmented Lagrangian method to compute the LSE. We develop a novel approach to obtain smooth convex approximations to the fitted (piecewise affine) convex LSE and provide formal bounds on the quality of approximation. When the number of samples is not too large compared to the dimension of the predictor, we propose a regularization scheme—Lipschitz convex regression—where we constrain the norm of the subgradients, and study the rates of convergence of the obtained LSE. Our algorithmic framework is simple and flexible and can be easily adapted to handle variants: estimation of a nondecreasing/nonincreasing convex/concave (with or without a Lipschitz bound) function. We perform numerical studies illustrating the scalability of the proposed algorithm—on some instances our proposal leads to more than a 10,000-fold improvement in runtime when compared to off-the-shelf interior point solvers for problems with n = 500.
- Keywords
Augmented Lagrangian method; Lipschitz convex regression; Nonparametric least squares estimator; Scalable quadratic programming; Smooth convex regression
- Funding
United States Office of Naval Research (Grant N00014-15-1-2342)
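To make the shape-constrained QP in the abstract concrete, here is a hypothetical toy sketch in one dimension, where the convex LSE reduces to requiring nondecreasing fitted slopes. This is only an illustration solved with a generic solver (`scipy.optimize.minimize`), not the paper's scalable augmented Lagrangian algorithm; in d ≥ 2 dimensions one also optimizes subgradients ξᵢ and imposes the O(n²) constraints θⱼ ≥ θᵢ + ξᵢᵀ(xⱼ − xᵢ) for all i, j.

```python
import numpy as np
from scipy.optimize import minimize

# Toy 1-D convex least squares regression (illustrative only).
# Fit theta minimizing sum_i (y_i - theta_i)^2 subject to the
# slopes (theta_{i+1}-theta_i)/(x_{i+1}-x_i) being nondecreasing,
# which characterizes convexity of the piecewise-affine fit in 1-D.
rng = np.random.default_rng(0)
n = 25
x = np.sort(rng.uniform(-2.0, 2.0, n))
y = x**2 + rng.normal(scale=0.3, size=n)  # convex truth plus noise

def objective(theta):
    return np.sum((y - theta) ** 2)

# One inequality per interior point: slope of segment i+1 >= slope of segment i.
constraints = [
    {"type": "ineq",
     "fun": (lambda th, i=i:
             (th[i + 2] - th[i + 1]) / (x[i + 2] - x[i + 1])
             - (th[i + 1] - th[i]) / (x[i + 1] - x[i]))}
    for i in range(n - 2)
]

res = minimize(objective, y.copy(), method="SLSQP",
               constraints=constraints,
               options={"maxiter": 500, "ftol": 1e-10})
theta = res.x
```

The paper's contribution is precisely that generic solvers like this one do not scale: the augmented Lagrangian framework exploits the structure of the constraint set to handle much larger n.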
- Published
- 2019