Author: "Budhathoki, Kailash" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Budhathoki, Kailash"' showing total 25 results

Start Over Author "Budhathoki, Kailash"

25 results on '"Budhathoki, Kailash"'

1. LLM-Rank: A Graph Theoretical Approach to Pruning Large Language Models

Author: Hoffmann, David, Budhathoki, Kailash, and Kleindessner, Matthaeus
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The evolving capabilities of large language models are accompanied by growing sizes and deployment costs, necessitating effective inference optimisation techniques. We propose a novel pruning method utilising centrality measures from graph theory, reducing both the computational requirements and the memory footprint of these models. Specifically, we devise a method for creating a weighted directed acyclical graph representation of multilayer perceptrons to which we apply a modified version of the weighted PageRank centrality measure to compute node importance scores. In combination with uniform pruning this leads to structured sparsity. We call this pruning method MLPRank. Furthermore we introduce an extension to decoder-only transformer models and call it LLMRank. For both variants we demonstrate a strong performance. With MLPRank on average leading to 6.09 % higher accuracy retention than three popular baselines and 13.42 % with LLMRank compared to two popular baselines.
Published: 2024

2. Inference Optimization of Foundation Models on AI Accelerators

Author: Park, Youngsuk, Budhathoki, Kailash, Chen, Liangfu, Kübler, Jonas, Huang, Jiaji, Kleindessner, Matthäus, Huan, Jun, Cevher, Volkan, Wang, Yida, and Karypis, George
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Powerful foundation models, including large language models (LLMs), with Transformer architectures have ushered in a new era of Generative AI across various industries. Industry and research community have witnessed a large number of new applications, based on those foundation models. Such applications include question and answer, customer services, image and video generation, and code completions, among others. However, as the number of model parameters reaches to hundreds of billions, their deployment incurs prohibitive inference costs and high latency in real-world scenarios. As a result, the demand for cost-effective and fast inference using AI accelerators is ever more higher. To this end, our tutorial offers a comprehensive discussion on complementary inference optimization techniques using AI accelerators. Beginning with an overview of basic Transformer architectures and deep learning system frameworks, we deep dive into system optimization techniques for fast and memory-efficient attention computations and discuss how they can be implemented efficiently on AI accelerators. Next, we describe architectural elements that are key for fast transformer inference. Finally, we examine various model compression and fast decoding strategies in the same context., Comment: [v2] Tutorial website added [v1] Tutorial published at KDD 2024. Camera-ready version
Published: 2024

3. Evaluating the Fairness of Discriminative Foundation Models in Computer Vision

Author: Ali, Junaid, Kleindessner, Matthaeus, Wenzel, Florian, Budhathoki, Kailash, Cevher, Volkan, and Russell, Chris
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computers and Society, Computer Science - Machine Learning
Abstract: We propose a novel taxonomy for bias evaluation of discriminative foundation models, such as Contrastive Language-Pretraining (CLIP), that are used for labeling tasks. We then systematically evaluate existing methods for mitigating bias in these models with respect to our taxonomy. Specifically, we evaluate OpenAI's CLIP and OpenCLIP models for key applications, such as zero-shot classification, image retrieval and image captioning. We categorize desired behaviors based around three axes: (i) if the task concerns humans; (ii) how subjective the task is (i.e., how likely it is that people from a diverse range of backgrounds would agree on a labeling); and (iii) the intended purpose of the task and if fairness is better served by impartiality (i.e., making decisions independent of the protected attributes) or representation (i.e., making decisions to maximize diversity). Finally, we provide quantitative fairness evaluations for both binary-valued and multi-valued protected attributes over ten diverse datasets. We find that fair PCA, a post-processing method for fair representations, works very well for debiasing in most of the aforementioned tasks while incurring only minor loss of performance. However, different debiasing approaches vary in their effectiveness depending on the task. Hence, one should choose the debiasing approach depending on the specific use case., Comment: Accepted at AIES'23
Published: 2023
Full Text: View/download PDF

4. Meaningful Causal Aggregation and Paradoxical Confounding

Author: Zhu, Yuchen, Budhathoki, Kailash, Kuebler, Jonas, and Janzing, Dominik
Subjects: Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: In aggregated variables the impact of interventions is typically ill-defined because different micro-realizations of the same macro-intervention can result in different changes of downstream macro-variables. We show that this ill-definedness of causality on aggregated variables can turn unconfounded causal relations into confounded ones and vice versa, depending on the respective micro-realization. We argue that it is practically infeasible to only use aggregated causal systems when we are free from this ill-definedness. Instead, we need to accept that macro causal relations are typically defined only with reference to the micro states. On the positive side, we show that cause-effect relations can be aggregated when the macro interventions are such that the distribution of micro states is the same as in the observational distribution; we term this natural macro interventions. We also discuss generalizations of this observation., Comment: CLeaR 2024
Published: 2023

5. Explaining the root causes of unit-level changes

Author: Budhathoki, Kailash, Michailidis, George, and Janzing, Dominik
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Existing methods of explainable AI and interpretable ML cannot explain change in the values of an output variable for a statistical unit in terms of the change in the input values and the change in the "mechanism" (the function transforming input to output). We propose two methods based on counterfactuals for explaining unit-level changes at various input granularities using the concept of Shapley values from game theory. These methods satisfy two key axioms desirable for any unit-level change attribution method. Through simulations, we study the reliability and the scalability of the proposed methods. We get sensible results from a case study on identifying the drivers of the change in the earnings for individuals in the US., Comment: Under review
Published: 2022

6. DoWhy-GCM: An extension of DoWhy for causal inference in graphical causal models

Author: Blöbaum, Patrick, Götz, Peter, Budhathoki, Kailash, Mastakouri, Atalanti A., and Janzing, Dominik
Subjects: Statistics - Methodology, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: We present DoWhy-GCM, an extension of the DoWhy Python library, which leverages graphical causal models. Unlike existing causality libraries, which mainly focus on effect estimation, DoWhy-GCM addresses diverse causal queries, such as identifying the root causes of outliers and distributional changes, attributing causal influences to the data generating process of each node, or diagnosis of causal structures. With DoWhy-GCM, users typically specify cause-effect relations via a causal graph, fit causal mechanisms, and pose causal queries -- all with just a few lines of code. The general documentation is available at https://www.pywhy.org/dowhy and the DoWhy-GCM specific code at https://github.com/py-why/dowhy/tree/main/dowhy/gcm.
Published: 2022

7. Why did the distribution change?

Author: Budhathoki, Kailash, Janzing, Dominik, Bloebaum, Patrick, and Ng, Hoiyi
Subjects: Statistics - Methodology, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: We describe a formal approach based on graphical causal models to identify the "root causes" of the change in the probability distribution of variables. After factorizing the joint distribution into conditional distributions of each variable, given its parents (the "causal mechanisms"), we attribute the change to changes of these causal mechanisms. This attribution analysis accounts for the fact that mechanisms often change independently and sometimes only some of them change. Through simulations, we study the performance of our distribution change attribution method. We then present a real-world case study identifying the drivers of the difference in the income distribution between men and women., Comment: Proceedings of the Twenty Fourth International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Published: 2021

8. Discovering Reliable Causal Rules

Author: Budhathoki, Kailash, Boley, Mario, and Vreeken, Jilles
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: We study the problem of deriving policies, or rules, that when enacted on a complex system, cause a desired outcome. Absent the ability to perform controlled experiments, such rules have to be inferred from past observations of the system's behaviour. This is a challenging problem for two reasons: First, observational effects are often unrepresentative of the underlying causal effect because they are skewed by the presence of confounding factors. Second, naive empirical estimations of a rule's effect have a high variance, and, hence, their maximisation can lead to random results. To address these issues, first we measure the causal effect of a rule from observational data---adjusting for the effect of potential confounders. Importantly, we provide a graphical criteria under which causal rule discovery is possible. Moreover, to discover reliable causal rules from a sample, we propose a conservative and consistent estimator of the causal effect, and derive an efficient and exact algorithm that maximises the estimator. On synthetic data, the proposed estimator converges faster to the ground truth than the naive estimator and recovers relevant causal rules even at small sample sizes. Extensive experiments on a variety of real-world datasets show that the proposed algorithm is efficient and discovers meaningful rules., Comment: Poster presented in NeurIPS 2018 Workshop on Causal Learning
Published: 2020

9. Quantifying intrinsic causal contributions via structure preserving interventions

Author: Janzing, Dominik, Blöbaum, Patrick, Mastakouri, Atalanti A., Faller, Philipp M., Minorics, Lenon, and Budhathoki, Kailash
Subjects: Computer Science - Artificial Intelligence, Computer Science - Information Theory, Statistics - Machine Learning
Abstract: We propose a notion of causal influence that describes the `intrinsic' part of the contribution of a node on a target node in a DAG. By recursively writing each node as a function of the upstream noise terms, we separate the intrinsic information added by each node from the one obtained from its ancestors. To interpret the intrinsic information as a {\it causal} contribution, we consider `structure-preserving interventions' that randomize each node in a way that mimics the usual dependence on the parents and does not perturb the observed joint distribution. To get a measure that is invariant with respect to relabelling nodes we use Shapley based symmetrization and show that it reduces in the linear case to simple ANOVA after resolving the target node into noise variables. We describe our contribution analysis for variance and entropy, but contributions for other target metrics can be defined analogously. The code is available in the package gcm of the open source library DoWhy., Comment: to appear at AISTATS 2024
Published: 2020

10. Causal structure based root cause analysis of outliers

Author: Janzing, Dominik, Budhathoki, Kailash, Minorics, Lenon, and Blöbaum, Patrick
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, Mathematics - Statistics Theory
Abstract: We describe a formal approach to identify 'root causes' of outliers observed in $n$ variables $X_1,\dots,X_n$ in a scenario where the causal relation between the variables is a known directed acyclic graph (DAG). To this end, we first introduce a systematic way to define outlier scores. Further, we introduce the concept of 'conditional outlier score' which measures whether a value of some variable is unexpected *given the value of its parents* in the DAG, if one were to assume that the causal structure and the corresponding conditional distributions are also valid for the anomaly. Finally, we quantify to what extent the high outlier score of some target variable can be attributed to outliers of its ancestors. This quantification is defined via Shapley values from cooperative game theory., Comment: 11 pages, 9 Figures
Published: 2019

11. Causal Inference by Stochastic Complexity

Author: Budhathoki, Kailash and Vreeken, Jilles
Subjects: Computer Science - Learning, Computer Science - Artificial Intelligence
Abstract: The algorithmic Markov condition states that the most likely causal direction between two random variables X and Y can be identified as that direction with the lowest Kolmogorov complexity. Due to the halting problem, however, this notion is not computable. We hence propose to do causal inference by stochastic complexity. That is, we propose to approximate Kolmogorov complexity via the Minimum Description Length (MDL) principle, using a score that is mini-max optimal with regard to the model class under consideration. This means that even in an adversarial setting, such as when the true distribution is not in this class, we still obtain the optimal encoding for the data relative to the class. We instantiate this framework, which we call CISC, for pairs of univariate discrete variables, using the class of multinomial distributions. Experiments show that CISC is highly accurate on synthetic, benchmark, as well as real-world data, outperforming the state of the art by a margin, and scales extremely well with regard to sample and domain sizes.
Published: 2017

12. Ranking the Teams in European Football Leagues with Agony

Author: Neumann, Stefan, Ritter, Julian, Budhathoki, Kailash, Hutchison, David, Editorial Board Member, Kanade, Takeo, Editorial Board Member, Kittler, Josef, Editorial Board Member, Kleinberg, Jon M., Editorial Board Member, Mattern, Friedemann, Editorial Board Member, Mitchell, John C., Editorial Board Member, Naor, Moni, Editorial Board Member, Pandu Rangan, C., Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Terzopoulos, Demetri, Editorial Board Member, Tygar, Doug, Editorial Board Member, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Brefeld, Ulf, editor, Davis, Jesse, editor, Van Haaren, Jan, editor, and Zimmermann, Albrecht, editor
Published: 2019
Full Text: View/download PDF

13. Discovering Reliable Causal Rules

Author: Budhathoki, Kailash, primary, Boley, Mario, additional, and Vreeken, Jilles, additional
Published: 2021
Full Text: View/download PDF

14. Evaluating the Fairness of Discriminative Foundation Models in Computer Vision

Author: Ali, Junaid, primary, Kleindessner, Matthäus, additional, Wenzel, Florian, additional, Budhathoki, Kailash, additional, Cevher, Volkan, additional, and Russell, Chris, additional
Published: 2023
Full Text: View/download PDF

15. Ranking the Teams in European Football Leagues with Agony

Author: Neumann, Stefan, primary, Ritter, Julian, additional, and Budhathoki, Kailash, additional
Published: 2019
Full Text: View/download PDF

16. The Difference and the Norm — Characterising Similarities and Differences Between Databases

Author: Budhathoki, Kailash, Vreeken, Jilles, Goebel, Randy, Series editor, Tanaka, Yuzuru, Series editor, Wahlster, Wolfgang, Series editor, Appice, Annalisa, editor, Rodrigues, Pedro Pereira, editor, Santos Costa, Vítor, editor, Gama, João, editor, Jorge, Alípio, editor, and Soares, Carlos, editor
Published: 2015
Full Text: View/download PDF

17. Origo: causal inference by compression

Author: Budhathoki, Kailash and Vreeken, Jilles
Published: 2018
Full Text: View/download PDF

18. Causal Inference on Event Sequences

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2018
Full Text: View/download PDF

19. Correlation by Compression

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2017
Full Text: View/download PDF

20. The Difference and the Norm — Characterising Similarities and Differences Between Databases

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2015
Full Text: View/download PDF

21. Causal Inference on Discrete Data

Author: Budhathoki, Kailash
Published: 2020

22. Accurate Causal Inference on Discrete Data

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2018
Full Text: View/download PDF

23. Origo: causal inference by compression

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2017
Full Text: View/download PDF

24. MDL for Causal Inference on Discrete Data

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2017
Full Text: View/download PDF

25. Causal Inference by Compression

Author: Budhathoki, Kailash, primary and Vreeken, Jilles, additional
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

25 results on '"Budhathoki, Kailash"'

1. LLM-Rank: A Graph Theoretical Approach to Pruning Large Language Models

2. Inference Optimization of Foundation Models on AI Accelerators

3. Evaluating the Fairness of Discriminative Foundation Models in Computer Vision

4. Meaningful Causal Aggregation and Paradoxical Confounding

5. Explaining the root causes of unit-level changes

6. DoWhy-GCM: An extension of DoWhy for causal inference in graphical causal models

7. Why did the distribution change?

8. Discovering Reliable Causal Rules

9. Quantifying intrinsic causal contributions via structure preserving interventions

10. Causal structure based root cause analysis of outliers

11. Causal Inference by Stochastic Complexity

12. Ranking the Teams in European Football Leagues with Agony

13. Discovering Reliable Causal Rules

14. Evaluating the Fairness of Discriminative Foundation Models in Computer Vision

15. Ranking the Teams in European Football Leagues with Agony

16. The Difference and the Norm — Characterising Similarities and Differences Between Databases

17. Origo: causal inference by compression

18. Causal Inference on Event Sequences

19. Correlation by Compression

20. The Difference and the Norm — Characterising Similarities and Differences Between Databases

21. Causal Inference on Discrete Data

22. Accurate Causal Inference on Discrete Data

23. Origo: causal inference by compression

24. MDL for Causal Inference on Discrete Data

25. Causal Inference by Compression

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

25 results on '"Budhathoki, Kailash"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources