Author: "Acharya, Ayan" / Database: OpenAIRE - Searchworks@Jio Institute Digital Library Search Results

1. mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

Author: Behdin, Kayhan, Song, Qingquan, Gupta, Aman, Acharya, Ayan, Durfee, David, Ocejo, Borja, Keerthi, Sathiya, and Mazumder, Rahul
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. To account for this, Sharpness-Aware Minimization (SAM) modifies the underlying loss function to guide descent methods towards flatter minima, which arguably have better generalization abilities. In this paper, we focus on a variant of SAM known as micro-batch SAM (mSAM), which, during training, averages the updates generated by adversarial perturbations across several disjoint shards (micro batches) of a mini-batch. We extend a recently developed and well-studied general framework for flatness analysis to show that distributed gradient computation for sharpness-aware minimization theoretically achieves even flatter minima. In order to support this theoretical superiority, we provide a thorough empirical evaluation on a variety of image classification and natural language processing tasks. We also show that contrary to previous work, mSAM can be implemented in a flexible and parallelizable manner without significantly increasing computational costs. Our practical implementation of mSAM yields superior generalization performance across a wide range of tasks compared to SAM, further supporting our theoretical framework. Comment: arXiv admin note: substantial text overlap with arXiv:2212.04343
Published: 2023
Full Text: View/download PDF
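
The update described in this abstract (split a mini-batch into disjoint shards, take a SAM-style adversarial gradient on each shard, then average) can be illustrated with a minimal sketch. This is not the authors' implementation: the linear least-squares model, the helper names `grad_mse` and `msam_step`, and the values of `m`, `rho` and `lr` are all assumptions chosen for illustration.

```python
import math

def grad_mse(w, xs, ys):
    """Gradient of the mean squared error of a linear model y ~ w . x."""
    g = [0.0] * len(w)
    for x, y in zip(xs, ys):
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        for j, xj in enumerate(x):
            g[j] += 2.0 * err * xj / len(xs)
    return g

def msam_step(w, xs, ys, m=2, rho=0.05, lr=0.01):
    """One mSAM-style update (illustrative): average the SAM gradients
    computed independently on m disjoint shards of the mini-batch."""
    shard = len(xs) // m  # assumption: len(xs) is divisible by m
    avg = [0.0] * len(w)
    for i in range(m):
        sx = xs[i * shard:(i + 1) * shard]
        sy = ys[i * shard:(i + 1) * shard]
        g = grad_mse(w, sx, sy)
        norm = math.sqrt(sum(gi * gi for gi in g)) + 1e-12
        # Adversarial perturbation of radius rho along this shard's gradient.
        w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
        g_adv = grad_mse(w_adv, sx, sy)
        avg = [a + ga / m for a, ga in zip(avg, g_adv)]
    return [wi - lr * a for wi, a in zip(w, avg)]

# Toy mini-batch generated by y = 2x, split into two shards of two points.
xs = [[1.0], [2.0], [3.0], [4.0]]
ys = [2.0, 4.0, 6.0, 8.0]
w = [0.0]
for _ in range(200):
    w = msam_step(w, xs, ys)
```

Setting `m=1` in this sketch reduces it to a plain SAM step on the whole mini-batch, which is the comparison the abstract draws.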

2. Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization

Author: Behdin, Kayhan, Song, Qingquan, Gupta, Aman, Durfee, David, Acharya, Ayan, Keerthi, Sathiya, and Mazumder, Rahul
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Optimization and Control (math.OC), FOS: Mathematics, Mathematics - Optimization and Control, Machine Learning (cs.LG)
Abstract: Modern deep learning models are over-parameterized, where the optimization setup strongly affects the generalization performance. A key element of reliable optimization for these systems is the modification of the loss function. Sharpness-Aware Minimization (SAM) modifies the underlying loss function to guide descent methods towards flatter minima, which arguably have better generalization abilities. In this paper, we focus on a variant of SAM known as mSAM, which, during training, averages the updates generated by adversarial perturbations across several disjoint shards of a mini-batch. Recent work suggests that mSAM can outperform SAM in terms of test accuracy. However, a comprehensive empirical study of mSAM is missing from the literature -- previous results have mostly been limited to specific architectures and datasets. To that end, this paper presents a thorough empirical evaluation of mSAM on various tasks and datasets. We provide a flexible implementation of mSAM and compare the generalization performance of mSAM to the performance of SAM and vanilla training on different image classification and natural language processing tasks. We also conduct careful experiments to understand the computational cost of training with mSAM, its sensitivity to hyperparameters and its correlation with the flatness of the loss landscape. Our analysis reveals that mSAM yields superior generalization performance and flatter minima, compared to SAM, across a wide range of tasks without significantly increasing computational costs.
Published: 2022
Full Text: View/download PDF

3. Isometric Graph Neural Networks

Author: Walker, Matthew, Yan, Bo, Xiao, Yiou, Wang, Yafei, and Acharya, Ayan
Subjects: Social and Information Networks (cs.SI), FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Computer Science - Social and Information Networks, Machine Learning (cs.LG)
Abstract: Many tasks that rely on representations of nodes in graphs would benefit if those representations were faithful to distances between nodes in the graph. Geometric techniques to extract such representations have poor scaling over large graph size, and recent advances in Graph Neural Network (GNN) algorithms have limited ability to reflect graph distance information beyond the first degree neighborhood. To enable this highly desired capability, we propose a technique to learn Isometric Graph Neural Networks (IGNN), which requires changing the input representation space and loss function to enable any GNN algorithm to generate representations that reflect distances between nodes. We experiment with the isometric technique on several GNN architectures for modeling multiple prediction tasks on multiple datasets. In addition to an improvement in AUC-ROC as high as 43% in these experiments, we observe a consistent and substantial improvement as high as 400% in Kendall's Tau (KT), a measure that directly reflects distance information, demonstrating that the learned embeddings do account for graph distances.
Published: 2020

4. REVIEW OF ANTICHOLINERGIC DRUGS

Author: Acharya, Ayan
Published: 2020
Full Text: View/download PDF

5. RESEALED ERYTHROCYTE : A NOVEL DRUG DELIVERY SYSTEM

Author: Acharya, Ayan
Published: 2018
Full Text: View/download PDF

6. Scalable Variational Bayesian Factorization Machine

Author: Saha, Avijit, Misra, Rishabh, Acharya, Ayan, and Ravindran, Balaraman
Published: 2017
Full Text: View/download PDF

7. FORMULATION AND EVALUATION OF PHENYTOIN SODIUM SUSTAINED RELEASE MATRIX BASED TABLET NSHM KNOWLEDGE CAMPUS, KOLKATA-GROUP OF INSTITUTIONS CERTIFICATE OF ORIGINALITY OF WORK

Author: Acharya, Ayan
Published: 2017
Full Text: View/download PDF

8. Nonparametric Bayesian Factor Analysis for Dynamic Count Matrices

Author: Acharya, Ayan, Ghosh, Joydeep, and Zhou, Mingyuan
Subjects: Methodology (stat.ME), FOS: Computer and information sciences, Statistics - Machine Learning, Machine Learning (stat.ML), Applications (stat.AP), Statistics - Applications, Statistics - Methodology
Abstract: A gamma process dynamic Poisson factor analysis model is proposed to factorize a dynamic count matrix, whose columns are sequentially observed count vectors. The model builds a novel Markov chain that sends the latent gamma random variables at time $(t-1)$ as the shape parameters of those at time $t$, which are linked to observed or latent counts under the Poisson likelihood. The significant challenge of inferring the gamma shape parameters is fully addressed, using unique data augmentation and marginalization techniques for the negative binomial distribution. The same nonparametric Bayesian model also applies to the factorization of a dynamic binary matrix, via a Bernoulli-Poisson link that connects a binary observation to a latent count, with closed-form conditional posteriors for the latent counts and efficient computation for sparse observations. We apply the model to text and music analysis, with state-of-the-art results. Comment: Appeared in Artificial Intelligence and Statistics (AISTATS), May 2015. The ArXiv version fixes a typo in (8), the equation right above Section 3.2 in Page 4 of http://www.jmlr.org/proceedings/papers/v38/acharya15.pdf
Published: 2015
Full Text: View/download PDF

9. An Optimization Framework for Semi-Supervised and Transfer Learning using Multiple Classifiers and Clusterers

Author: Acharya, Ayan, Hruschka, Eduardo R., Ghosh, Joydeep, and Acharyya, Sreangsu
Subjects: FOS: Computer and information sciences, Computer Science - Learning, ComputingMethodologies_PATTERNRECOGNITION, I.5.2, I.5.3, I.5.4, Machine Learning (cs.LG)
Abstract: Unsupervised models can provide supplementary soft constraints to help classify new, "target" data since similar instances in the target set are more likely to share the same class label. Such models can also help detect possible differences between training and target distributions, which is useful in applications where concept drift may take place, as in transfer learning settings. This paper describes a general optimization framework that takes as input class membership estimates from existing classifiers learnt on previously encountered "source" data, as well as a similarity matrix from a cluster ensemble operating solely on the target data to be classified, and yields a consensus labeling of the target data. This framework admits a wide range of loss functions and classification/clustering methods. It exploits properties of Bregman divergences in conjunction with Legendre duality to yield a principled and scalable approach. A variety of experiments show that the proposed framework can yield results substantially superior to those provided by popular transductive learning techniques or by naively applying classifiers learnt on the original task to the target data.
Published: 2012

10. Balancing Exploration and Exploitation by an Elitist Ant System with Exponential Pheromone Deposition Rule

Author: Acharya, Ayan, Maiti, Deepyaman, Banerjee, Aritra, and Konar, Amit
Subjects: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence
Abstract: The paper presents an exponential pheromone deposition rule to modify the basic ant system algorithm which employs constant deposition rule. A stability analysis using differential equation is carried out to find out the values of parameters that make the ant system dynamics stable for both kinds of deposition rule. A roadmap of connected cities is chosen as the problem environment where the shortest route between two given cities is required to be discovered. Simulations performed with both forms of deposition approach using Elitist Ant System model reveal that the exponential deposition approach outperforms the classical one by a large extent. Exhaustive experiments are also carried out to find out the optimum setting of different controlling parameters for exponential deposition approach and an empirical relationship between the major controlling parameters of the algorithm and some features of problem environment. 2008 IEEE Region 10 Colloquium and the Third ICIIS, Kharagpur, INDIA. Paper ID: 250
Published: 2008

11. A Swarm Intelligence Based Scheme for Complete and Fault-tolerant Identification of a Dynamical Fractional Order Process

Author: Maiti, Deepyaman, Acharya, Ayan, and Konar, Amit
Subjects: FOS: Computer and information sciences, Computer Science - Other Computer Science, Other Computer Science (cs.OH)
Abstract: System identification refers to estimation of process parameters and is a necessity in control theory. Physical systems usually have varying parameters. For such processes, accurate identification is particularly important. Online identification schemes are also needed for designing adaptive controllers. Real processes are usually of fractional order as opposed to the ideal integral order models. In this paper, we propose a simple and elegant scheme of estimating the parameters for such a fractional order process. A population of process models is generated and updated by particle swarm optimization (PSO) technique, the fitness function being the sum of squared deviations from the actual set of observations. Results show that the proposed scheme offers a high degree of accuracy even when the observations are corrupted to a significant degree. Additional schemes to improve the accuracy still further are also proposed and analyzed. 2008 IEEE Region 10 Colloquium and the Third ICIIS, Kharagpur, INDIA. Paper Identification Number 239
Published: 2008

12. Tuning PID and FOPID Controllers using the Integral Time Absolute Error Criterion

Author: Maiti, Deepyaman, Acharya, Ayan, Chakraborty, Mithun, Konar, Amit, and Janarthanan, Ramadoss
Subjects: FOS: Computer and information sciences, Computer Science - Other Computer Science, Other Computer Science (cs.OH)
Abstract: Particle swarm optimization (PSO) is extensively used for real parameter optimization in diverse fields of study. This paper describes an application of PSO to the problem of designing a fractional-order proportional-integral-derivative (FOPID) controller whose parameters comprise proportionality constant, integral constant, derivative constant, integral order (lambda) and derivative order (delta). The presence of five optimizable parameters makes the task of designing a FOPID controller more challenging than conventional PID controller design. Our design method focuses on minimizing the Integral Time Absolute Error (ITAE) criterion. The digital realization of the designed system utilizes the Tustin operator-based continued fraction expansion scheme. We carry out a simulation that illustrates the effectiveness of the proposed approach especially for realizing fractional-order plants. This paper also attempts to study the behavior of fractional PID controller vis-a-vis that of its integer order counterpart and demonstrates the superiority of the former to the latter. Comment: 4th IEEE International Conference on Information and Automation for Sustainability, 2008
Published: 2008
Full Text: View/download PDF
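
The ITAE criterion named in this abstract is conventionally defined as the integral of t·|e(t)| over the simulation horizon, where e(t) is the tracking error. A minimal sketch of evaluating it from sampled data follows; the function name `itae`, the trapezoidal rule and the sample values are assumptions for illustration and are not taken from the paper.

```python
def itae(times, errors):
    """Approximate ITAE = integral of t * |e(t)| dt with the trapezoidal rule.

    times  -- increasing sample instants
    errors -- tracking error e(t) at each instant
    """
    total = 0.0
    for k in range(1, len(times)):
        dt = times[k] - times[k - 1]
        f0 = times[k - 1] * abs(errors[k - 1])
        f1 = times[k] * abs(errors[k])
        total += 0.5 * (f0 + f1) * dt
    return total

# Constant unit-magnitude error on [0, 2]: the integral of t dt is 2.
score = itae([0.0, 1.0, 2.0], [1.0, -1.0, 1.0])
```

In a PSO-based tuner such as the one the abstract describes, a value like `score` would serve as the fitness to be minimized for each candidate set of controller parameters.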
