Author: "Murali, Nihal" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Murali, Nihal"' showing total 9 results

1. Beyond Distribution Shift: Spurious Features Through the Lens of Training Dynamics

Author: Murali, Nihal, Puli, Aahlad, Yu, Ke, Ranganath, Rajesh, and Batmanghelich, Kayhan
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label during training but are irrelevant to the learning problem. This hurts model generalization and poses problems when deploying them in safety-critical applications. This paper aims to better understand the effects of spurious features through the lens of the learning dynamics of the internal neurons during the training process. We make the following observations: (1) While previous works highlight the harmful effects of spurious features on the generalization ability of DNNs, we emphasize that not all spurious features are harmful. Spurious features can be "benign" or "harmful" depending on whether they are "harder" or "easier" to learn than the core features for a given model. This definition is model and dataset-dependent. (2) We build upon this premise and use instance difficulty methods (like Prediction Depth (Baldock et al., 2021)) to quantify "easiness" for a given model and to identify this behavior during the training phase. (3) We empirically show that the harmful spurious features can be detected by observing the learning dynamics of the DNN's early layers. In other words, easy features learned by the initial layers of a DNN early during the training can (potentially) hurt model generalization. We verify our claims on medical and vision datasets, both simulated and real, and justify the empirical success of our hypothesis by showing the theoretical connections between Prediction Depth and information-theoretic concepts like V-usable information (Ethayarajh et al., 2021). Lastly, our experiments show that monitoring only accuracy during training (as is common in machine learning pipelines) is insufficient to detect spurious features. We, therefore, highlight the need for monitoring early training dynamics using suitable instance difficulty metrics.
Comment: Main paper: 12 pages, 2 tables, and 10 figures. Supplementary: 10 pages and 9 figures. Accepted in TMLR23 (https://openreview.net/pdf?id=Tkvmt9nDmB)
Published: 2023

2. Augmentation by Counterfactual Explanation -- Fixing an Overconfident Classifier

Author: Singla, Sumedha, Murali, Nihal, Arabshahi, Forough, Triantafyllou, Sofia, and Batmanghelich, Kayhan
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: A highly accurate but overconfident model is ill-suited for deployment in critical applications such as healthcare and autonomous driving. The classification outcome should reflect a high uncertainty on ambiguous in-distribution samples that lie close to the decision boundary. The model should also refrain from making overconfident decisions on samples that lie far outside its training distribution, far-out-of-distribution (far-OOD), or on unseen samples from novel classes that lie near its training distribution (near-OOD). This paper proposes an application of counterfactual explanations in fixing an over-confident classifier. Specifically, we propose to fine-tune a given pre-trained classifier using augmentations from a counterfactual explainer (ACE) to fix its uncertainty characteristics while retaining its predictive performance. We perform extensive experiments with detecting far-OOD, near-OOD, and ambiguous samples. Our empirical results show that the revised model has improved uncertainty measures, and its performance is competitive with the state-of-the-art methods.
Comment: Accepted in WACV 2023
Published: 2022

3. Augmentation by Counterfactual Explanation - Fixing an Overconfident Classifier

Author: Singla, Sumedha, primary, Murali, Nihal, additional, Arabshahi, Forough, additional, Triantafyllou, Sofia, additional, and Batmanghelich, Kayhan, additional
Published: 2023
Full Text: View/download PDF

4. Shortcut Learning Through the Lens of Early Training Dynamics

Author: Murali, Nihal, Puli, Aahlad Manas, Yu, Ke, Ranganath, Rajesh, and Batmanghelich, Kayhan
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)
Abstract: Deep Neural Networks (DNNs) are prone to learn shortcut patterns that damage the generalization of the DNN during deployment. Shortcut Learning is concerning, particularly when the DNNs are applied to safety-critical domains. This paper aims to better understand shortcut learning through the lens of the learning dynamics of the internal neurons during the training process. More specifically, we make the following observations: (1) While previous works treat shortcuts as synonymous with spurious correlations, we emphasize that not all spurious correlations are shortcuts. We show that shortcuts are only those spurious features that are "easier" than the core features. (2) We build upon this premise and use instance difficulty methods (like Prediction Depth) to quantify "easy" and to identify this behavior during the training phase. (3) We empirically show that shortcut learning can be detected by observing the learning dynamics of the DNN's early layers, irrespective of the network architecture used. In other words, easy features learned by the initial layers of a DNN early during the training are potential shortcuts. We verify our claims on simulated and real medical imaging data and justify the empirical success of our hypothesis by showing the theoretical connections between Prediction Depth and information-theoretic concepts like V-usable information. Lastly, our experiments show the insufficiency of monitoring only accuracy plots during training (as is common in machine learning pipelines), and we highlight the need for monitoring early training dynamics using example difficulty metrics.
Comment: Main paper: 10 pages and 8 figures. Supplementary: 6 pages and 6 figures. Preprint. Under review
Published: 2023
Full Text: View/download PDF

5. Classification and Re-Identification of Fruit Fly Individuals Across Days With Convolutional Neural Networks

Author: Murali, Nihal, primary, Schneider, Jon, additional, Levine, Joel, additional, and Taylor, Graham, additional
Published: 2019
Full Text: View/download PDF

6. Can Drosophila melanogaster tell who’s who?

Author: Schneider, Jonathan, primary, Murali, Nihal, additional, Taylor, Graham W., additional, and Levine, Joel D., additional
Published: 2018
Full Text: View/download PDF

7. Analysis of Q-learning on ANNs for robot control using live video feed

Author: Murali, Nihal, primary, Gupta, Kunal, additional, and Bhanot, Surekha, additional
Published: 2017
Full Text: View/download PDF

8. Beyond Distribution Shift: Spurious Features Through the Lens of Training Dynamics.

Author: Murali N, Puli A, Yu K, Ranganath R, and Batmanghelich K
Abstract: Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label during training but are irrelevant to the learning problem. This hurts model generalization and poses problems when deploying them in safety-critical applications. This paper aims to better understand the effects of spurious features through the lens of the learning dynamics of the internal neurons during the training process. We make the following observations: (1) While previous works highlight the harmful effects of spurious features on the generalization ability of DNNs, we emphasize that not all spurious features are harmful. Spurious features can be "benign" or "harmful" depending on whether they are "harder" or "easier" to learn than the core features for a given model. This definition is model and dataset dependent. (2) We build upon this premise and use instance difficulty methods (like Prediction Depth (Baldock et al., 2021)) to quantify "easiness" for a given model and to identify this behavior during the training phase. (3) We empirically show that the harmful spurious features can be detected by observing the learning dynamics of the DNN's early layers. In other words, easy features learned by the initial layers of a DNN early during the training can (potentially) hurt model generalization. We verify our claims on medical and vision datasets, both simulated and real, and justify the empirical success of our hypothesis by showing the theoretical connections between Prediction Depth and information-theoretic concepts like 𝒱-usable information (Ethayarajh et al., 2021). Lastly, our experiments show that monitoring only accuracy during training (as is common in machine learning pipelines) is insufficient to detect spurious features. We, therefore, highlight the need for monitoring early training dynamics using suitable instance difficulty metrics.
Published: 2023

9. Augmentation by Counterfactual Explanation - Fixing an Overconfident Classifier.

Author: Singla S, Murali N, Arabshahi F, Triantafyllou S, and Batmanghelich K
Abstract: A highly accurate but overconfident model is ill-suited for deployment in critical applications such as healthcare and autonomous driving. The classification outcome should reflect a high uncertainty on ambiguous in-distribution samples that lie close to the decision boundary. The model should also refrain from making overconfident decisions on samples that lie far outside its training distribution, far-out-of-distribution (far-OOD), or on unseen samples from novel classes that lie near its training distribution (near-OOD). This paper proposes an application of counterfactual explanations in fixing an over-confident classifier. Specifically, we propose to fine-tune a given pre-trained classifier using augmentations from a counterfactual explainer (ACE) to fix its uncertainty characteristics while retaining its predictive performance. We perform extensive experiments with detecting far-OOD, near-OOD, and ambiguous samples. Our empirical results show that the revised model has improved uncertainty measures, and its performance is competitive with the state-of-the-art methods.
Published: 2023
Full Text: View/download PDF
