Author: "Maity, Raj Kumar" / Publisher: arxiv - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Maity, Raj Kumar"' showing total 3 results

1. Escaping Saddle Points in Distributed Newton's Method with Communication Efficiency and Byzantine Resilience

Author: Ghosh, Avishek; Maity, Raj Kumar; Mazumdar, Arya; and Ramchandran, Kannan
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing, Statistics - Machine Learning, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Distributed, Parallel, and Cluster Computing (cs.DC), Mathematics - Optimization and Control, Machine Learning (cs.LG)
Abstract: Saddle-point avoidance in non-convex optimization is quite challenging in large-scale distributed learning frameworks such as Federated Learning, especially in the presence of Byzantine workers. The celebrated cubic-regularized Newton method of Nesterov and Polyak is one of the most elegant ways to avoid saddle points in the standard centralized (non-distributed) setup. In this paper, we extend the cubic-regularized Newton method to a distributed framework and simultaneously address several practical challenges such as the communication bottleneck and Byzantine attacks. Saddle-point avoidance becomes more crucial in the presence of Byzantine machines, since rogue machines may create fake local minima near the saddle points of the loss function, a strategy known as the saddle-point attack. Because our algorithm is second order, its iteration complexity is much lower than that of its first-order counterparts. Furthermore, we use compression (or sparsification) techniques such as δ-approximate compression for communication efficiency. We obtain theoretical guarantees for our proposed scheme under several settings, including approximate (sub-sampled) gradients and Hessians. Moreover, we validate our theoretical findings with experiments using standard datasets and several types of Byzantine attacks, and obtain a 25% improvement in iteration complexity over first-order methods.
Published: 2021
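
As a rough illustration of why cubic regularization helps at a saddle point, the Python sketch below solves the cubic-regularized model in one dimension. It is not the distributed algorithm from the paper: the names `cubic_model` and `cubic_step`, the penalty weight `M`, and the scalar closed form are assumptions chosen for illustration only.

```python
import math

def cubic_model(s, g, h, M):
    """Cubic-regularized model m(s) = g*s + 0.5*h*s**2 + (M/6)*|s|**3."""
    return g * s + 0.5 * h * s * s + (M / 6.0) * abs(s) ** 3

def cubic_step(g, h, M):
    """Global minimizer of cubic_model over s, for scalar g, h and M > 0."""
    if M <= 0:
        raise ValueError("M must be positive")
    # The minimizer points against the gradient; its length t >= 0 solves
    # (M/2)*t**2 + h*t - |g| = 0, so take the nonnegative root.
    t = (-h + math.sqrt(h * h + 2.0 * M * abs(g))) / M
    # At g == 0 both signs are equally good; pick the positive one.
    return -t if g > 0 else t

# Saddle-like point: zero gradient, negative curvature (hypothetical values).
step = cubic_step(g=0.0, h=-2.0, M=4.0)
model_value = cubic_model(step, g=0.0, h=-2.0, M=4.0)
```

With a zero gradient, a plain gradient step would not move at all, whereas the cubic model still returns a nonzero step with a negative model value because it uses the negative curvature.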

2. Robust Gradient Descent via Moment Encoding with LDPC Codes

Author: Maity, Raj Kumar; Rawat, Ankit Singh; and Mazumdar, Arya
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Distributed, Parallel, and Cluster Computing, Statistics - Machine Learning, Computer Science - Information Theory, Information Theory (cs.IT), Machine Learning (stat.ML), Data: Coding and Information Theory, Distributed, Parallel, and Cluster Computing (cs.DC), Computer Science::Information Theory, Machine Learning (cs.LG)
Abstract: This paper considers the problem of implementing large-scale gradient descent algorithms in a distributed computing setting in the presence of straggling processors. To mitigate the effect of the stragglers, it has previously been proposed to encode the data with an erasure-correcting code and decode at the master server at the end of the computation. We instead propose to encode the second moment of the data with a low-density parity-check (LDPC) code. The iterative decoding algorithms for LDPC codes have very low computational overhead, and the number of decoding iterations can be made to adjust automatically to the number of stragglers in the system. We show that, for a random model of stragglers, the proposed moment-encoding-based gradient descent method can be viewed as a stochastic gradient descent method. This allows us to obtain convergence guarantees for the proposed solution. Furthermore, the proposed moment-encoding-based method is shown to outperform the existing schemes in a real distributed computing setup.
Published: 2018
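
The idea of encoding the second moment can be sketched in a few lines of Python. For least squares, the gradient involves the product of the second-moment matrix `M` (that is, X^T X) with the current parameter vector `theta`, so each worker can return one inner product and the master can fill in missing ones from parity relations. This sketch is an assumption-laden toy: it swaps the LDPC code for a single parity check, and the names `encode_rows`, `worker_products`, and `peel` are hypothetical, not taken from the paper.

```python
def encode_rows(M):
    """Append one parity row so that all coded rows sum to the zero vector."""
    parity = [-sum(col) for col in zip(*M)]
    return M + [parity]

def worker_products(coded_rows, theta, stragglers):
    """Each worker returns <row, theta>; stragglers return None (an erasure)."""
    return [
        None if i in stragglers else sum(a * b for a, b in zip(row, theta))
        for i, row in enumerate(coded_rows)
    ]

def peel(values, checks):
    """Peeling decoder: repeatedly solve any check with exactly one erasure."""
    values = list(values)
    progress = True
    while progress:
        progress = False
        for check in checks:
            missing = [i for i in check if values[i] is None]
            if len(missing) == 1:
                known = sum(values[i] for i in check if values[i] is not None)
                values[missing[0]] = -known  # symbols in a check sum to zero
                progress = True
    return values

# Toy second-moment matrix and parameter vector (hypothetical values).
M = [[2.0, 1.0], [1.0, 3.0]]
theta = [1.0, 2.0]
coded = encode_rows(M)
received = worker_products(coded, theta, stragglers={0})  # worker 0 is slow
decoded = peel(received, checks=[[0, 1, 2]])
M_theta = decoded[:len(M)]  # the first len(M) symbols are the uncoded products
```

A real LDPC code would use many sparse checks, so the same peeling loop could recover several erasures; with more stragglers than the code can handle, some entries would stay `None`.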

3. Shaping Proto-Value Functions via Rewards

Author: Narayanan, Chandrashekar Lakshmi; Maity, Raj Kumar; and Bhatnagar, Shalabh
Subjects: FOS: Computer and information sciences, Computer Science - Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Machine Learning (cs.LG)
Abstract: In this paper, we combine task-dependent reward shaping and task-independent proto-value functions (PVFs) to obtain reward-dependent proto-value functions (RPVFs). In constructing the RPVFs, we make use of the immediate rewards, which are available during the sampling phase but are not used in the PVF construction. We show via experiments that learning with an RPVF-based representation is better than learning with just reward shaping or PVFs. In particular, when the state space is symmetrical and the rewards are asymmetrical, the RPVFs capture the asymmetry better than the PVFs.
Published: 2015
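
For context, task-independent proto-value functions are commonly defined as eigenvectors of the graph Laplacian of the state-space graph. The Python sketch below checks this on a chain of states, where those eigenvectors have a known cosine form. It covers only plain PVFs and does not attempt the reward-dependent construction from the paper; the function names and the chain-shaped state space are assumptions made for illustration.

```python
import math

def path_laplacian(n):
    """Combinatorial Laplacian L = D - A of a chain with n states."""
    L = [[0.0] * n for _ in range(n)]
    for i in range(n - 1):
        L[i][i] += 1.0
        L[i + 1][i + 1] += 1.0
        L[i][i + 1] -= 1.0
        L[i + 1][i] -= 1.0
    return L

def chain_pvf(n, k):
    """k-th Laplacian eigenvector of the chain, evaluated at each state."""
    return [math.cos(math.pi * k * (i + 0.5) / n) for i in range(n)]

def chain_eigenvalue(n, k):
    """Eigenvalue paired with chain_pvf(n, k)."""
    return 2.0 - 2.0 * math.cos(math.pi * k / n)

def matvec(A, v):
    """Dense matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in A]

# Residual of L v = lambda v for the first non-constant basis function.
n, k = 6, 1
v = chain_pvf(n, k)
lam = chain_eigenvalue(n, k)
residual = max(abs(x - lam * y) for x, y in zip(matvec(path_laplacian(n), v), v))
```

The basis depends only on the graph structure, which is symmetric for a chain; rewards play no role in it.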
