Author: "Bhattacharya, Sourangshu" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Bhattacharya, Sourangshu"' showing total 142 results

Start Over Author "Bhattacharya, Sourangshu"

142 results on '"Bhattacharya, Sourangshu"'

1. A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs

Author: Purohit, Kiran, Parvathgari, Anurag Reddy, and Bhattacharya, Sourangshu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Deep convolutional neural networks (CNNs) have achieved impressive performance in many computer vision tasks. However, their large model sizes require heavy computational resources, making pruning redundant filters from existing pre-trained CNNs an essential task in developing efficient models for resource-constrained devices. Whole-network filter pruning algorithms prune varying fractions of filters from each layer, hence providing greater flexibility. Current whole-network pruning methods are either computationally expensive due to the need to calculate the loss for each pruned filter using a training dataset, or use various heuristic / learned criteria for determining the pruning fractions for each layer. This paper proposes a two-level hierarchical approach for whole-network filter pruning which is efficient and uses the classification loss as the final criterion. The lower-level algorithm (called filter-pruning) uses a sparse-approximation formulation based on linear approximation of filter weights. We explore two algorithms: orthogonal matching pursuit-based greedy selection and a greedy backward pruning approach. The backward pruning algorithm uses a novel closed-form error criterion for efficiently selecting the optimal filter at each stage, thus making the whole algorithm much faster. The higher-level algorithm (called layer-selection) greedily selects the best-pruned layer (pruning using the filter-selection algorithm) using a global pruning criterion. We propose algorithms for two different global-pruning criteria: (1) layer-wise relative error (HBGS), and (2) final classification error (HBGTS). Our suite of algorithms outperforms state-of-the-art pruning methods on ResNet18, ResNet32, ResNet56, VGG16, and ResNext101. Our method reduces the RAM requirement for ResNext101 from 7.6 GB to 1.5 GB and achieves a 94% reduction in FLOPS without losing accuracy on CIFAR-10., Comment: Accepted in TMLR 2024
Published: 2024

2. VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI

Author: Das, Soumi, Nag, Shubhadip, Sharma, Shreyyash, Bhattacharya, Suparna, and Bhattacharya, Sourangshu
Subjects: Computer Science - Machine Learning
Abstract: Trustworthy AI is crucial to the widespread adoption of AI in high-stakes applications with fairness, robustness, and accuracy being some of the key trustworthiness metrics. In this work, we propose a controllable framework for data-centric trustworthy AI (DCTAI)- VTruST, that allows users to control the trade-offs between the different trustworthiness metrics of the constructed training datasets. A key challenge in implementing an efficient DCTAI framework is to design an online value-function-based training data subset selection algorithm. We pose the training data valuation and subset selection problem as an online sparse approximation formulation. We propose a novel online version of the Orthogonal Matching Pursuit (OMP) algorithm for solving this problem. Experimental results show that VTruST outperforms the state-of-the-art baselines on social, image, and scientific datasets. We also show that the data values generated by VTruST can provide effective data-centric explanations for different trustworthiness metrics., Comment: Accepted in ICLR 2024 DMLR workshop
Published: 2024

3. In-Context Ability Transfer for Question Decomposition in Complex QA

Author: V, Venktesh, Bhattacharya, Sourangshu, and Anand, Avishek
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Answering complex questions is a challenging task that requires question decomposition and multistep reasoning for arriving at the solution. While existing supervised and unsupervised approaches are specialized to a certain task and involve training, recently proposed prompt-based approaches offer generalizable solutions to tackle a wide variety of complex question-answering (QA) tasks. However, existing prompt-based approaches that are effective for complex QA tasks involve expensive hand annotations from experts in the form of rationales and are not generalizable to newer complex QA scenarios and tasks. We propose, icat (In-Context Ability Transfer) which induces reasoning capabilities in LLMs without any LLM fine-tuning or manual annotation of in-context samples. We transfer the ability to decompose complex questions to simpler questions or generate step-by-step rationales to LLMs, by careful selection from available data sources of related tasks. We also propose an automated uncertainty-aware exemplar selection approach for selecting examples from transfer data sources. Finally, we conduct large-scale experiments on a variety of complex QA tasks involving numerical reasoning, compositional complex QA, and heterogeneous complex QA which require decomposed reasoning. We show that ICAT convincingly outperforms existing prompt-based solutions without involving any model training, showcasing the benefits of re-using existing abilities., Comment: 10 pages
Published: 2023

4. A Data-Driven Defense against Edge-case Model Poisoning Attacks on Federated Learning

Author: Purohit, Kiran, Das, Soumi, Bhattacharya, Sourangshu, and Rana, Santu
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security
Abstract: Federated Learning systems are increasingly subjected to a multitude of model poisoning attacks from clients. Among these, edge-case attacks that target a small fraction of the input space are nearly impossible to detect using existing defenses, leading to a high attack success rate. We propose an effective defense using an external defense dataset, which provides information about the attack target. The defense dataset contains a mix of poisoned and clean examples, with only a few known to be clean. The proposed method, DataDefense, uses this dataset to learn a poisoned data detector model which marks each example in the defense dataset as poisoned or clean. It also learns a client importance model that estimates the probability of a client update being malicious. The global model is then updated as a weighted average of the client models' updates. The poisoned data detector and the client importance model parameters are updated using an alternating minimization strategy over the Federated Learning rounds. Extensive experiments on standard attack scenarios demonstrate that DataDefense can defend against model poisoning attacks where other state-of-the-art defenses fail. In particular, DataDefense is able to reduce the attack success rate by at least ~ 40% on standard attack setups and by more than 80% on some setups. Furthermore, DataDefense requires very few defense examples (as few as five) to achieve a near-optimal reduction in attack success rate.
Published: 2023

5. Opponent-aware Role-based Learning in Team Competitive Markov Games

Author: Koley, Paramita, Maiti, Aurghya, Ganguly, Niloy, and Bhattacharya, Sourangshu
Subjects: Computer Science - Multiagent Systems, I.2.6
Abstract: Team competition in multi-agent Markov games is an increasingly important setting for multi-agent reinforcement learning, due to its general applicability in modeling many real-life situations. Multi-agent actor-critic methods are the most suitable class of techniques for learning optimal policies in the team competition setting, due to their flexibility in learning agent-specific critic functions, which can also learn from other agents. In many real-world team competitive scenarios, the roles of the agents naturally emerge, in order to aid in coordination and collaboration within members of the teams. However, existing methods for learning emergent roles rely heavily on the Q-learning setup which does not allow learning of agent-specific Q-functions. In this paper, we propose RAC, a novel technique for learning the emergent roles of agents within a team that are diverse and dynamic. In the proposed method, agents also benefit from predicting the roles of the agents in the opponent team. RAC uses the actor-critic framework with role encoder and opponent role predictors for learning an optimal policy. Experimentation using 2 games demonstrates that the policies learned by RAC achieve higher rewards than those learned using state-of-the-art baselines. Moreover, experiments suggest that the agents in a team learn diverse and opponent-aware policies., Comment: 9 pages, 6 figures
Published: 2023

6. Modeling Continuous Time Sequences with Intermittent Observations using Marked Temporal Point Processes

Author: Gupta, Vinayak, Bedathur, Srikanta, Bhattacharya, Sourangshu, and De, Abir
Subjects: Computer Science - Machine Learning
Abstract: A large fraction of data generated via human activities such as online purchases, health records, spatial mobility etc. can be represented as a sequence of events over a continuous-time. Learning deep learning models over these continuous-time event sequences is a non-trivial task as it involves modeling the ever-increasing event timestamps, inter-event time gaps, event types, and the influences between different events within and across different sequences. In recent years neural enhancements to marked temporal point processes (MTPP) have emerged as a powerful framework to model the underlying generative mechanism of asynchronous events localized in continuous time. However, most existing models and inference methods in the MTPP framework consider only the complete observation scenario i.e. the event sequence being modeled is completely observed with no missing events -- an ideal setting that is rarely applicable in real-world applications. A recent line of work which considers missing events while training MTPP utilizes supervised learning techniques that require additional knowledge of missing or observed label for each event in a sequence, which further restricts its practicability as in several scenarios the details of missing events is not known apriori. In this work, we provide a novel unsupervised model and inference method for learning MTPP in presence of event sequences with missing events. Specifically, we first model the generative processes of observed events and missing events using two MTPP, where the missing events are represented as latent random variables. Then, we devise an unsupervised training method that jointly learns both the MTPP by means of variational inference. Such a formulation can effectively impute the missing data among the observed events and can identify the optimal position of missing events in a sequence., Comment: ACM TIST
Published: 2022
Full Text: View/download PDF

7. A Comparative Analysis of Machine-learning Models for Solar Flare Forecasting: Identifying High-performing Active Region Flare Indicators

Author: Sinha, Suvadip, Gupta, Om, Singh, Vishal, Lekshmi, B., Nandy, Dibyendu, Mitra, Dhrubaditya, Chatterjee, Saikat, Bhattacharya, Sourangshu, Chatterjee, Saptarshi, Srivastava, Nandita, Brandenburg, Axel, and Pal, Sanchita
Subjects: Astrophysics - Solar and Stellar Astrophysics
Abstract: Solar flares create adverse space weather impacting space and Earth-based technologies. However, the difficulty of forecasting flares, and by extension severe space weather, is accentuated by the lack of any unique flare trigger or a single physical pathway. Studies indicate that multiple physical properties contribute to active region flare potential, compounding the challenge. Recent developments in machine learning (ML) have enabled analysis of higher-dimensional data leading to increasingly better flare forecasting techniques. However, consensus on high-performing flare predictors remains elusive. In the most comprehensive study to date, we conduct a comparative analysis of four popular ML techniques (k-nearest neighbor, logistic regression, random forest classifier, and support vector machine) by training these on magnetic parameters obtained from the Helioseismic and Magnetic Imager (HMI) on board the Solar Dynamics Observatory (SDO) for the entirety of solar cycle 24. We demonstrate that the logistic regression and support vector machine algorithms perform extremely well in forecasting active region flaring potential. The logistic regression algorithm returns the highest true skill score of $0.967 \pm 0.018$, possibly the highest classification performance achieved with any strictly parametric study. From a comparative assessment, we establish that the magnetic properties like total current helicity, total vertical current density, total unsigned flux, R_VALUE, and total absolute twist are the top-performing flare indicators. We also introduce and analyze two new performance metrics, namely, severe and clear space weather indicators. Our analysis constrains the most successful ML algorithms and identifies physical parameters that contribute most to active region flare productivity.
Published: 2022
Full Text: View/download PDF

8. CheckSel: Efficient and Accurate Data-valuation Through Online Checkpoint Selection

Author: Das, Soumi, Sagarkar, Manasvi, Bhattacharya, Suparna, and Bhattacharya, Sourangshu
Subjects: Computer Science - Machine Learning
Abstract: Data valuation and subset selection have emerged as valuable tools for application-specific selection of important training data. However, the efficiency-accuracy tradeoffs of state-of-the-art methods hinder their widespread application to many AI workflows. In this paper, we propose a novel 2-phase solution to this problem. Phase 1 selects representative checkpoints from an SGD-like training algorithm, which are used in phase-2 to estimate the approximate training data values, e.g. decrease in validation loss due to each training point. A key contribution of this paper is CheckSel, an Orthogonal Matching Pursuit-inspired online sparse approximation algorithm for checkpoint selection in the online setting, where the features are revealed one at a time. Another key contribution is the study of data valuation in the domain adaptation setting, where a data value estimator obtained using checkpoints from training trajectory in the source domain training dataset is used for data valuation in a target domain training dataset. Experimental results on benchmark datasets show the proposed algorithm outperforms recent baseline methods by up to 30% in terms of test accuracy while incurring a similar computational burden, for both standalone and domain adaptation settings.
Published: 2022

9. Offsetting Unequal Competition through RL-assisted Incentive Schemes

Author: Koley, Paramita, Maiti, Aurghya, Bhattacharya, Sourangshu, and Ganguly, Niloy
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: This paper investigates the dynamics of competition among organizations with unequal expertise. Multi-agent reinforcement learning has been used to simulate and understand the impact of various incentive schemes designed to offset such inequality. We design Touch-Mark, a game based on well-known multi-agent-particle-environment, where two teams (weak, strong) with unequal but changing skill levels compete against each other. For training such a game, we propose a novel controller assisted multi-agent reinforcement learning algorithm \our\, which empowers each agent with an ensemble of policies along with a supervised controller that by selectively partitioning the sample space, triggers intelligent role division among the teammates. Using C-MADDPG as an underlying framework, we propose an incentive scheme for the weak team such that the final rewards of both teams become the same. We find that in spite of the incentive, the final reward of the weak team falls short of the strong team. On inspecting, we realize that an overall incentive scheme for the weak team does not incentivize the weaker agents within that team to learn and improve. To offset this, we now specially incentivize the weaker player to learn and as a result, observe that the weak team beyond an initial phase performs at par with the stronger team. The final goal of the paper has been to formulate a dynamic incentive scheme that continuously balances the reward of the two teams. This is achieved by devising an incentive scheme enriched with an RL agent which takes minimum information from the environment.
Published: 2022

10. MTLTS: A Multi-Task Framework To Obtain Trustworthy Summaries From Crisis-Related Microblogs

Author: Mukherjee, Rajdeep, Vishnu, Uppada, Peruri, Hari Chandana, Bhattacharya, Sourangshu, Rudra, Koustav, Goyal, Pawan, and Ganguly, Niloy
Subjects: Computer Science - Information Retrieval, H.3.3
Abstract: Occurrences of catastrophes such as natural or man-made disasters trigger the spread of rumours over social media at a rapid pace. Presenting a trustworthy and summarized account of the unfolding event in near real-time to the consumers of such potentially unreliable information thus becomes an important task. In this work, we propose MTLTS, the first end-to-end solution for the task that jointly determines the credibility and summary-worthiness of tweets. Our credibility verifier is designed to recursively learn the structural properties of a Twitter conversation cascade, along with the stances of replies towards the source tweet. We then take a hierarchical multi-task learning approach, where the verifier is trained at a lower layer, and the summarizer is trained at a deeper layer where it utilizes the verifier predictions to determine the salience of a tweet. Different from existing disaster-specific summarizers, we model tweet summarization as a supervised task. Such an approach can automatically learn summary-worthy features, and can therefore generalize well across domains. When trained on the PHEME dataset [29], not only do we outperform the strongest baselines for the auxiliary task of verification/rumour detection, we also achieve 21 - 35% gains in the verified ratio of summary tweets, and 16 - 20% gains in ROUGE1-F1 scores over the existing state-of-the-art solutions for the primary task of trustworthy summarization., Comment: Accepted as a Full Paper at WSDM 2022; 9 pages; Codes: https://github.com/rajdeep345/MTLTS
Published: 2021
Full Text: View/download PDF

11. PASTE: A Tagging-Free Decoding Framework Using Pointer Networks for Aspect Sentiment Triplet Extraction

Author: Mukherjee, Rajdeep, Nayak, Tapas, Butala, Yash, Bhattacharya, Sourangshu, and Goyal, Pawan
Subjects: Computer Science - Computation and Language, I.2.7
Abstract: Aspect Sentiment Triplet Extraction (ASTE) deals with extracting opinion triplets, consisting of an opinion target or aspect, its associated sentiment, and the corresponding opinion term/span explaining the rationale behind the sentiment. Existing research efforts are majorly tagging-based. Among the methods taking a sequence tagging approach, some fail to capture the strong interdependence between the three opinion factors, whereas others fall short of identifying triplets with overlapping aspect/opinion spans. A recent grid tagging approach on the other hand fails to capture the span-level semantics while predicting the sentiment between an aspect-opinion pair. Different from these, we present a tagging-free solution for the task, while addressing the limitations of the existing works. We adapt an encoder-decoder architecture with a Pointer Network-based decoding framework that generates an entire opinion triplet at each time step thereby making our solution end-to-end. Interactions between the aspects and opinions are effectively captured by the decoder by considering their entire detected spans while predicting their connecting sentiment. Extensive experiments on several benchmark datasets establish the better efficacy of our proposed approach, especially in the recall, and in predicting multiple and aspect/opinion-overlapped triplets from the same review sentence. We report our results both with and without BERT and also demonstrate the utility of domain-specific BERT post-training for the task., Comment: Accepted as a Long Paper at EMNLP 2021 (Main Conference); 13 pages; Codes: https://github.com/rajdeep345/PASTE
Published: 2021

12. AR-BERT: Aspect-relation enhanced Aspect-level Sentiment Classification with Multi-modal Explanations

Author: Islam, Sk Mainul and Bhattacharya, Sourangshu
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: Aspect level sentiment classification (ALSC) is a difficult problem with state-of-the-art models showing less than 80% macro-F1 score on benchmark datasets. Existing models do not incorporate information on aspect-aspect relations in knowledge graphs (KGs), e.g. DBpedia. Two main challenges stem from inaccurate disambiguation of aspects to KG entities, and the inability to learn aspect representations from the large KGs in joint training with ALSC models. We propose AR-BERT, a novel two-level global-local entity embedding scheme that allows efficient joint training of KG-based aspect embeddings and ALSC models. A novel incorrect disambiguation detection technique addresses the problem of inaccuracy in aspect disambiguation. We also introduce the problem of determining mode significance in multi-modal explanation generation, and propose a two step solution. The proposed methods show a consistent improvement of 2.5 - 4.1 percentage points, over the recent BERT-based baselines on benchmark datasets. The code is available at https://github.com/mainuliitkgp/AR-BERT.git., Comment: Accepted at: The Webconf 2022
Published: 2021

13. Finding High-Value Training Data Subset through Differentiable Convex Programming

Author: Das, Soumi, Singh, Arshdeep, Chatterjee, Saptarshi, Bhattacharya, Suparna, and Bhattacharya, Sourangshu
Subjects: Computer Science - Machine Learning
Abstract: Finding valuable training data points for deep neural networks has been a core research challenge with many applications. In recent years, various techniques for calculating the "value" of individual training datapoints have been proposed for explaining trained models. However, the value of a training datapoint also depends on other selected training datapoints - a notion that is not explicitly captured by existing methods. In this paper, we study the problem of selecting high-value subsets of training data. The key idea is to design a learnable framework for online subset selection, which can be learned using mini-batches of training data, thus making our method scalable. This results in a parameterized convex subset selection problem that is amenable to a differentiable convex programming paradigm, thus allowing us to learn the parameters of the selection model in end-to-end training. Using this framework, we design an online alternating minimization-based algorithm for jointly learning the parameters of the selection model and ML model. Extensive evaluation on a synthetic dataset, and three standard datasets, show that our algorithm finds consistently higher value subsets of training data, compared to the recent state-of-the-art methods, sometimes ~20% higher value than existing methods. The subsets are also useful in finding mislabelled training data. Our algorithm takes running time comparable to the existing valuation functions.
Published: 2021

14. Convex Online Video Frame Subset Selection using Multiple Criteria for Data Efficient Autonomous Driving

Author: Das, Soumi, Patibandla, Harikrishna, Bhattacharya, Suparna, Bera, Kshounis, Ganguly, Niloy, and Bhattacharya, Sourangshu
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, I.2.9, G.1.6
Abstract: Training vision-based Urban Autonomous driving models is a challenging problem, which is highly researched in recent times. Training such models is a data-intensive task requiring the storage and processing of vast volumes of (possibly redundant) driving video data. In this paper, we study the problem of developing data-efficient autonomous driving systems. In this context, we study the problem of multi-criteria online video frame subset selection. We study convex optimization-based solutions and show that they are unable to provide solutions with high weightage to the loss of selected video frames. We design a novel convex optimization-based multi-criteria online subset selection algorithm that uses a thresholded concave function of selection variables. We also propose and study a submodular optimization-based algorithm. Extensive experiments using the driving simulator CARLA show that we are able to drop 80% of the frames while succeeding to complete 100% of the episodes w.r.t. the model trained on 100% data, in the most difficult task of taking turns. This results in a training time of less than 30% compared to training on the whole dataset. We also perform detailed experiments on prediction performances of various affordances used by the Conditional Affordance Learning (CAL) model and show that our subset selection improves performance on the crucial affordance "Relative Angle" during turns., Comment: Submitted to CVPR 2020
Published: 2021

15. Demarcating Endogenous and Exogenous Opinion Dynamics: An Experimental Design Approach

Author: Koley, Paramita, Saha, Avirup, Bhattacharya, Sourangshu, Ganguly, Niloy, and De, Abir
Subjects: Computer Science - Social and Information Networks, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The networked opinion diffusion in online social networks (OSN) is often governed by the two genres of opinions - endogenous opinions that are driven by the influence of social contacts among users, and exogenous opinions which are formed by external effects like news, feeds etc. Accurate demarcation of endogenous and exogenous messages offers an important cue to opinion modeling, thereby enhancing its predictive performance. In this paper, we design a suite of unsupervised classification methods based on experimental design approaches, in which, we aim to select the subsets of events which minimize different measures of mean estimation error. In more detail, we first show that these subset selection tasks are NP-Hard. Then we show that the associated objective functions are weakly submodular, which allows us to cast efficient approximation algorithms with guarantees. Finally, we validate the efficacy of our proposal on various real-world datasets crawled from Twitter as well as diverse synthetic datasets. Our experiments range from validating prediction performance on unsanitized and sanitized events to checking the effect of selecting optimal subsets of various sizes. Through various experiments, we have found that our method offers a significant improvement in accuracy in terms of opinion forecasting, against several competitors., Comment: 25 Pages, Accepted in ACM TKDD, 2021
Published: 2021
Full Text: View/download PDF

16. Scalable Backdoor Detection in Neural Networks

Author: Harikumar, Haripriya, Le, Vuong, Rana, Santu, Bhattacharya, Sourangshu, Gupta, Sunil, and Venkatesh, Svetha
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, it has been shown that deep learning models are vulnerable to Trojan attacks, where an attacker can install a backdoor during training time to make the resultant model misidentify samples contaminated with a small trigger patch. Current backdoor detection methods fail to achieve good detection performance and are computationally expensive. In this paper, we propose a novel trigger reverse-engineering based approach whose computational complexity does not scale with the number of labels, and is based on a measure that is both interpretable and universal across different network and patch types. In experiments, we observe that our method achieves a perfect score in separating Trojaned models from pure models, which is an improvement over the current state-of-the art method.
Published: 2020

17. Read what you need: Controllable Aspect-based Opinion Summarization of Tourist Reviews

Author: Mukherjee, Rajdeep, Peruri, Hari Chandana, Vishnu, Uppada, Goyal, Pawan, Bhattacharya, Sourangshu, and Ganguly, Niloy
Subjects: Computer Science - Information Retrieval, Computer Science - Computation and Language, H.3.3
Abstract: Manually extracting relevant aspects and opinions from large volumes of user-generated text is a time-consuming process. Summaries, on the other hand, help readers with limited time budgets to quickly consume the key ideas from the data. State-of-the-art approaches for multi-document summarization, however, do not consider user preferences while generating summaries. In this work, we argue the need and propose a solution for generating personalized aspect-based opinion summaries from large collections of online tourist reviews. We let our readers decide and control several attributes of the summary such as the length and specific aspects of interest among others. Specifically, we take an unsupervised approach to extract coherent aspects from tourist reviews posted on TripAdvisor. We then propose an Integer Linear Programming (ILP) based extractive technique to select an informative subset of opinions around the identified aspects while respecting the user-specified values for various control parameters. Finally, we evaluate and compare our summaries using crowdsourcing and ROUGE-based metrics and obtain competitive results., Comment: 4 pages, accepted in the Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020
Published: 2020
Full Text: View/download PDF

18. Map Enhanced Route Travel Time Prediction using Deep Neural Networks

Author: Das, Soumi, Kalava, Rajath Nandan, Kumar, Kolli Kiran, Kandregula, Akhil, Suhaas, Kalpam, Bhattacharya, Sourangshu, and Ganguly, Niloy
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Travel time estimation is a fundamental problem in transportation science with extensive literature. The study of these techniques has intensified due to availability of many publicly available large trip datasets. Recently developed deep learning based models have improved the generality and performance and have focused on estimating times for individual sub-trajectories and aggregating them to predict the travel time of the entire trajectory. However, these techniques ignore the road network information. In this work, we propose and study techniques for incorporating road networks along with historical trips' data into travel time prediction. We incorporate both node embeddings as well as road distance into the existing model. Experiments on large real-world benchmark datasets suggest improved performance, especially when the train data is small. As expected, the proposed method performs better than the baseline when there is a larger difference between road distance and Vincenty distance between start and end points.
Published: 2019

19. Stark: Fast and Scalable Strassen's Matrix Multiplication using Apache Spark

Author: Misra, Chandan, Bhattacharya, Sourangshu, and Ghosh, Soumya K.
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: This paper presents a new fast, highly scalable distributed matrix multiplication algorithm on Apache Spark, called Stark, based on Strassen's matrix multiplication algorithm. Stark preserves Strassen's 7 multiplications scheme in a distributed environment and thus achieves faster execution. It is based on two new ideas; it creates a recursion tree of computation where each level of such tree corresponds to division and combination of distributed matrix blocks in the form of Resilient Distributed Datasets(RDDs); It processes each divide and combine step in parallel and memorize the sub-matrices by intelligently tagging matrix blocks in it. To the best of our knowledge, Stark is the first Strassen's implementation in Spark platform. We show experimentally that Stark has a strong scalability with increasing matrix size enabling us to multiply two (16384 x 16384) matrices with 28% and 36% less wall clock time than Marlin and MLLib respectively, state-of-the-art matrix multiplication approaches based on Spark.
Published: 2018

20. SPIN: A Fast and Scalable Matrix Inversion Method in Apache Spark

Author: Misra, Chandan, Bhattacharya, Sourangshu, and Ghosh, Soumya K.
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: The growth of big data in domains such as Earth Sciences, Social Networks, Physical Sciences, etc. has lead to an immense need for efficient and scalable linear algebra operations, e.g. Matrix inversion. Existing methods for efficient and distributed matrix inversion using big data platforms rely on LU decomposition based block-recursive algorithms. However, these algorithms are complex and require a lot of side calculations, e.g. matrix multiplication, at various levels of recursion. In this paper, we propose a different scheme based on Strassen's matrix inversion algorithm (mentioned in Strassen's original paper in 1969), which uses far fewer operations at each level of recursion. We implement the proposed algorithm, and through extensive experimentation, show that it is more efficient than the state of the art methods. Furthermore, we provide a detailed theoretical analysis of the proposed algorithm, and derive theoretical running times which match closely with the empirically observed wall clock running times, thus explaining the U-shaped behaviour w.r.t. block-sizes.
Published: 2018

21. Scalable Backdoor Detection in Neural Networks

Author: Harikumar, Haripriya, Le, Vuong, Rana, Santu, Bhattacharya, Sourangshu, Gupta, Sunil, Venkatesh, Svetha, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Hutter, Frank, editor, Kersting, Kristian, editor, Lijffijt, Jefrey, editor, and Valera, Isabel, editor
Published: 2021
Full Text: View/download PDF

22. A Methodology for Customizing Clinical Tests for Esophageal Cancer based on Patient Preferences

Author: Roy, Asis, Bhattacharya, Sourangshu, and Guin, Kalyan
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: Tests for Esophageal cancer can be expensive, uncomfortable and can have side effects. For many patients, we can predict non-existence of disease with 100% certainty, just using demographics, lifestyle, and medical history information. Our objective is to devise a general methodology for customizing tests using user preferences so that expensive or uncomfortable tests can be avoided. We propose to use classifiers trained from electronic health records (EHR) for selection of tests. The key idea is to design classifiers with 100% false normal rates, possibly at the cost higher false abnormals. We compare Naive Bayes classification (NB), Random Forests (RF), Support Vector Machines (SVM) and Logistic Regression (LR), and find kernel Logistic regression to be most suitable for the task. We propose an algorithm for finding the best probability threshold for kernel LR, based on test set accuracy. Using the proposed algorithm, we describe schemes for selecting tests, which appear as features in the automatic classification algorithm, using preferences on costs and discomfort of the users. We test our methodology with EHRs collected for more than 3000 patients, as a part of project carried out by a reputed hospital in Mumbai, India. Kernel SVM and kernel LR with a polynomial kernel of degree 3, yields an accuracy of 99.8% and sensitivity 100%, without the MP features, i.e. using only clinical tests. We demonstrate our test selection algorithm using two case studies, one using cost of clinical tests, and other using "discomfort" values for clinical tests. We compute the test sets corresponding to the lowest false abnormals for each criterion described above, using exhaustive enumeration of 15 clinical tests. The sets turn out to different, substantiating our claim that one can customize test sets based on user preferences.
Published: 2016

23. Distributed Weighted Parameter Averaging for SVM Training on Big Data

Author: Das, Ayan and Bhattacharya, Sourangshu
Subjects: Computer Science - Learning
Abstract: Two popular approaches for distributed training of SVMs on big data are parameter averaging and ADMM. Parameter averaging is efficient but suffers from loss of accuracy with increase in number of partitions, while ADMM in the feature space is accurate but suffers from slow convergence. In this paper, we report a hybrid approach called weighted parameter averaging (WPA), which optimizes the regularized hinge loss with respect to weights on parameters. The problem is shown to be same as solving SVM in a projected space. We also demonstrate an $O(\frac{1}{N})$ stability bound on final hypothesis given by WPA, using novel proof techniques. Experimental results on a variety of toy and real world datasets show that our approach is significantly more accurate than parameter averaging for high number of partitions. It is also seen the proposed method enjoys much faster convergence compared to ADMM in features space.
Published: 2015

24. Learning and Forecasting Opinion Dynamics in Social Networks

Author: De, Abir, Valera, Isabel, Ganguly, Niloy, Bhattacharya, Sourangshu, and Rodriguez, Manuel Gomez
Subjects: Computer Science - Social and Information Networks, Physics - Physics and Society
Abstract: Social media and social networking sites have become a global pinboard for exposition and discussion of news, topics, and ideas, where social media users often update their opinions about a particular topic by learning from the opinions shared by their friends. In this context, can we learn a data-driven model of opinion dynamics that is able to accurately forecast opinions from users? In this paper, we introduce SLANT, a probabilistic modeling framework of opinion dynamics, which represents users opinions over time by means of marked jump diffusion stochastic differential equations, and allows for efficient model simulation and parameter estimation from historical fine grained event data. We then leverage our framework to derive a set of efficient predictive formulas for opinion forecasting and identify conditions under which opinions converge to a steady state. Experiments on data gathered from Twitter show that our model provides a good fit to the data and our formulas achieve more accurate forecasting than alternatives.
Published: 2015

25. Learning Information Dynamics in Online Social Media: A Temporal Point Process Perspective

Author: Samanta, Bidisha, Saha, Avirup, Ganguly, Niloy, Bhattacharya, Sourangshu, De, Abir, Abarbanel, Henry, Series Editor, Braha, Dan, Series Editor, Érdi, Péter, Series Editor, Friston, Karl, Series Editor, Haken, Hermann, Series Editor, Jirsa, Viktor, Series Editor, Kacprzyk, Janusz, Series Editor, Kaneko, Kunihiko, Series Editor, Kelso, Scott, Series Editor, Kirkilionis, Markus, Series Editor, Kurths, Jürgen, Series Editor, Nowak, Andrzej, Series Editor, Qudrat-Ullah, Hassan, Series Editor, Reichl, Linda, Series Editor, Schuster, Peter, Series Editor, Schweitzer, Frank, Series Editor, Sornette, Didier, Series Editor, Thurner, Stefan, Series Editor, Ghanbarnejad, Fakhteh, editor, Saha Roy, Rishiraj, editor, Karimi, Fariba, editor, Delvenne, Jean-Charles, editor, and Mitra, Bivas, editor
Published: 2019
Full Text: View/download PDF

26. A Comparative Data-driven Study of Intensity-based Categorical Emotion Representations for MER

Author: Chaki, Sanga, primary, Bhattacharya, Sourangshu, additional, Borgohain, Junmoni, additional, Patnaik, Priyadarshi, additional, Mullick, Raju, additional, and Karambelkar, Gouri, additional
Published: 2024
Full Text: View/download PDF

27. A fast scalable distributed kriging algorithm using Spark framework

Author: Misra, Chandan, Bhattacharya, Sourangshu, and Ghosh, Soumya K.
Published: 2020
Full Text: View/download PDF

28. SprIntMap: A System for Visualizing Interpolating Surface Using Apache Spark

Author: Misra, Chandan, Bhattacharya, Sourangshu, Ghosh, Soumya K., Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, Sa, Pankaj Kumar, editor, Bakshi, Sambit, editor, Hatzilygeroudis, Ioannis K., editor, and Sahoo, Manmath Narayan, editor
Published: 2018
Full Text: View/download PDF

29. Analyzing Music to Music Perceptual Contagion of Emotion in Clusters of Survey-Takers, Using a Novel Contagion Interface: A Case Study of Hindustani Classical Music

Author: Chaki, Sanga, Bhattacharya, Sourangshu, Mullick, Raju, Patnaik, Priyadarshi, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Aramaki, Mitsuko, editor, Davies, Matthew E. P., editor, Kronland-Martinet, Richard, editor, and Ystad, Sølvi, editor
Published: 2018
Full Text: View/download PDF

30. Scalable Backdoor Detection in Neural Networks

Author: Harikumar, Haripriya, primary, Le, Vuong, additional, Rana, Santu, additional, Bhattacharya, Sourangshu, additional, Gupta, Sunil, additional, and Venkatesh, Svetha, additional
Published: 2021
Full Text: View/download PDF

31. Finding High-Value Training Data Subset Through Differentiable Convex Programming

Author: Das, Soumi, primary, Singh, Arshdeep, additional, Chatterjee, Saptarshi, additional, Bhattacharya, Suparna, additional, and Bhattacharya, Sourangshu, additional
Published: 2021
Full Text: View/download PDF

32. A methodology for customizing clinical tests for esophageal cancer based on patient preferences

Author: Roy, Asis, Bhattacharya, Sourangshu, and Guin, Kalyan
Published: 2019
Full Text: View/download PDF

33. Mechanism Design for Cost Optimal PAC Learning in the Presence of Strategic Noisy Annotators

Author: Garg, Dinesh, Bhattacharya, Sourangshu, Sundararajan, S., and Shevade, Shirish
Subjects: Computer Science - Learning, Computer Science - Computer Science and Game Theory, Statistics - Machine Learning
Abstract: We consider the problem of Probably Approximate Correct (PAC) learning of a binary classifier from noisy labeled examples acquired from multiple annotators (each characterized by a respective classification noise rate). First, we consider the complete information scenario, where the learner knows the noise rates of all the annotators. For this scenario, we derive sample complexity bound for the Minimum Disagreement Algorithm (MDA) on the number of labeled examples to be obtained from each annotator. Next, we consider the incomplete information scenario, where each annotator is strategic and holds the respective noise rate as a private information. For this scenario, we design a cost optimal procurement auction mechanism along the lines of Myerson's optimal auction design framework in a non-trivial manner. This mechanism satisfies incentive compatibility property, thereby facilitating the learner to elicit true noise rates of all the annotators., Comment: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)
Published: 2012

34. Facial Indicators of Extraversion and Neuroticism: A Computational Approach

Author: Mullick, Raju, primary, Pramanik, Suchitra, additional, Parnaik, Priyadarshi, additional, and Bhattacharya, Sourangshu, additional
Published: 2023
Full Text: View/download PDF

35. Learning Information Dynamics in Online Social Media: A Temporal Point Process Perspective

Author: Samanta, Bidisha, primary, Saha, Avirup, additional, Ganguly, Niloy, additional, Bhattacharya, Sourangshu, additional, and De, Abir, additional
Published: 2019
Full Text: View/download PDF

36. Prediction of esophageal cancer using demographic, lifestyle, patient history, and basic clinical tests

Author: Roy, Asis, Bhattacharya, Sourangshu, and Guin, Kalyan
Published: 2017
Full Text: View/download PDF

37. LearnDefend: Learning to Defend against Targeted Model-Poisoning Attacks on Federated Learning

Author: Purohit, Kiran, Das, Soumi, Bhattacharya, Sourangshu, Rana, Santu, Purohit, Kiran, Das, Soumi, Bhattacharya, Sourangshu, and Rana, Santu
Abstract: Targeted model poisoning attacks pose a significant threat to federated learning systems. Recent studies show that edge-case targeted attacks, which target a small fraction of the input space are nearly impossible to counter using existing fixed defense strategies. In this paper, we strive to design a learned-defense strategy against such attacks, using a small defense dataset. The defense dataset can be collected by the central authority of the federated learning task, and should contain a mix of poisoned and clean examples. The proposed framework, LearnDefend, estimates the probability of a client update being malicious. The examples in defense dataset need not be pre-marked as poisoned or clean. We also learn a poisoned data detector model which can be used to mark each example in the defense dataset as clean or poisoned. We estimate the poisoned data detector and the client importance models in a coupled optimization approach. Our experiments demonstrate that LearnDefend is capable of defending against state-of-the-art attacks where existing fixed defense strategies fail. We also show that LearnDefend is robust to size and noise in the marking of clean examples in the defense dataset.
Published: 2023

38. A fast scalable distributed kriging algorithm using Spark framework

Author: Misra, Chandan, Bhattacharya, Sourangshu, and Ghosh, Soumya K.
Abstract: Environmental and climate models used for weather prediction require evenly spaced meteorological datasets at a very high spatial and temporal resolution to facilitate the analysis of recent climatic changes. However, due to the small number of weather stations available, often the data collected from them are scattered and inadequate for such model creation. For this reason, very high-resolution gridded meteorological surface is developed by interpolating the available scattered data points to fulfill the need of various ecological and climatic applications. Among various interpolation techniques, Ordinary Kriging (OK) is one of the most popular and widely used gridding methodologies with a sound statistical basis providing a possibility to obtain highly accurate results. However, OK interpolation on large unevenly spaced data points is computationally demanding and has a computational cost that scales as the cube of the number of data points as it involves multiplication and inversion of matrices of large cardinalities infeasible for computation on a single node. Additionally, its standard implementation involves complex model fitting and function minimization steps which make automatic kriging analysis from raw data a considerable challenge. Meanwhile, Apache Spark has emerged as a large-scale data processing engine with a dedicated Machine Learning Library (MLLib) for processing large matrices and thereby can be used for large-scale kriging analysis with considerable time. In this paper, we present a new fast distributed OK algorithm on Apache Spark framework and provide an efficient and simple distributed matrix inversion scheme to accelerate the execution of distributed OK algorithm. We have employed Strassen’s direct method for matrix inversion and the acceleration is achieved by exploiting the symmetry nature of the variance–covariance matrix of the OK equation to invert the matrix. We show experimentally that our distributed inversion scheme enables us to invert a 16,000×16,000matrix with 51% and 38% less wall clock time than distributed Spark-based LUand Strassen’s inversion scheme, respectively.
Published: 2024
Full Text: View/download PDF

39. Analyzing Music to Music Perceptual Contagion of Emotion in Clusters of Survey-Takers, Using a Novel Contagion Interface: A Case Study of Hindustani Classical Music

Author: Chaki, Sanga, primary, Bhattacharya, Sourangshu, additional, Mullick, Raju, additional, and Patnaik, Priyadarshi, additional
Published: 2018
Full Text: View/download PDF

40. Offsetting Unequal Competition Through RL-Assisted Incentive Schemes

Author: Koley, Paramita, primary, Maiti, Aurghya, additional, Bhattacharya, Sourangshu, additional, and Ganguly, Niloy, additional
Published: 2023
Full Text: View/download PDF

41. Accurate and Efficient Channel pruning via Orthogonal Matching Pursuit

Author: Purohit, Kiran, primary, Parvathgari, Anurag, additional, Das, Soumi, additional, and Bhattacharya, Sourangshu, additional
Published: 2022
Full Text: View/download PDF

42. Modeling Continuous Time Sequences with Intermittent Observations using Marked Temporal Point Processes

Author: Gupta, Vinayak, primary, Bedathur, Srikanta, additional, Bhattacharya, Sourangshu, additional, and De, Abir, additional
Published: 2022
Full Text: View/download PDF

43. A Comparative Analysis of Machine-learning Models for Solar Flare Forecasting: Identifying High-performing Active Region Flare Indicators

Author: Sinha, Suvadip, primary, Gupta, Om, additional, Singh, Vishal, additional, Lekshmi, B., additional, Nandy, Dibyendu, additional, Mitra, Dhrubaditya, additional, Chatterjee, Saikat, additional, Bhattacharya, Sourangshu, additional, Chatterjee, Saptarshi, additional, Srivastava, Nandita, additional, Brandenburg, Axel, additional, and Pal, Sanchita, additional
Published: 2022
Full Text: View/download PDF

44. Stark: Fast and Scalable Strassen’s Matrix Multiplication Using Apache Spark

Author: Misra, Chandan, primary, Bhattacharya, Sourangshu, additional, and Ghosh, Soumya K., additional
Published: 2022
Full Text: View/download PDF

45. AR-BERT: Aspect-relation enhanced Aspect-level Sentiment Classification with Multi-modal Explanations

Author: Islam, Sk Mainul, primary and Bhattacharya, Sourangshu, additional
Published: 2022
Full Text: View/download PDF

46. MTLTS: A Multi-Task Framework To Obtain Trustworthy Summaries From Crisis-Related Microblogs

Author: Mukherjee, Rajdeep, primary, Vishnu, Uppada, additional, Peruri, Hari Chandana, additional, Bhattacharya, Sourangshu, additional, Rudra, Koustav, additional, Goyal, Pawan, additional, and Ganguly, Niloy, additional
Published: 2022
Full Text: View/download PDF

47. Rac: Multi-Agent Actor-Critic Framework for Learning Emergent Roles In Team-Competitive Games

Author: Koley, Paramita, primary, Ganguly, Niloy, additional, Bhattacharya, Sourangshu, additional, and Maity, Aurghya, additional
Published: 2022
Full Text: View/download PDF

48. TMCOSS: Thresholded Multi-Criteria Online Subset Selection for Data-Efficient Autonomous Driving

Author: Das, Soumi, primary, Patibandla, Harikrishna, additional, Bhattacharya, Suparna, additional, Bera, Kshounis, additional, Ganguly, Niloy, additional, and Bhattacharya, Sourangshu, additional
Published: 2021
Full Text: View/download PDF

49. Demarcating Endogenous and Exogenous Opinion Dynamics: An Experimental Design Approach

Author: Koley, Paramita, primary, Saha, Avirup, additional, Bhattacharya, Sourangshu, additional, Ganguly, Niloy, additional, and De, Abir, additional
Published: 2021
Full Text: View/download PDF

50. Multi-criteria online frame-subset selection for autonomous vehicle videos

Author: Das, Soumi, Mandal, Sayan, Bhoyar, Ashwin, Bharde, Madhumita, Ganguly, Niloy, Bhattacharya, Suparna, and Bhattacharya, Sourangshu
Published: 2020
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

142 results on '"Bhattacharya, Sourangshu"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources