Author: "Padhy, Suchismita" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Padhy, Suchismita"' showing total 5 results

Start Over Author "Padhy, Suchismita"

5 results on '"Padhy, Suchismita"'

1. On the geometry of generalization and memorization in deep neural networks

Author: Stephenson, Cory, Padhy, Suchismita, Ganesh, Abhinav, Hui, Yue, Tang, Hanlin, and Chung, SueYeon
Subjects: Computer Science - Machine Learning, Condensed Matter - Disordered Systems and Neural Networks, Statistics - Machine Learning
Abstract: Understanding how large neural networks avoid memorizing training data is key to explaining their high generalization performance. To examine the structure of when and where memorization occurs in a deep network, we use a recently developed replica-based mean field theoretic geometric analysis method. We find that all layers preferentially learn from examples which share features, and link this behavior to generalization performance. Memorization predominately occurs in the deeper layers, due to decreasing object manifolds' radius and dimension, whereas early layers are minimally affected. This predicts that generalization can be restored by reverting the final few layer weights to earlier epochs before significant memorization occurred, which is confirmed by the experiments. Additionally, by studying generalization under different model sizes, we reveal the connection between the double descent phenomenon and the underlying model geometry. Finally, analytical analysis shows that networks avoid memorization early in training because close to initialization, the gradient contribution from permuted examples are small. These findings provide quantitative evidence for the structure of memorization across layers of a deep neural network, the drivers for such structure, and its connection to manifold geometric properties., Comment: ICLR 2021
Published: 2021

2. Untangling in Invariant Speech Recognition

Author: Stephenson, Cory, Feather, Jenelle, Padhy, Suchismita, Elibol, Oguz, Tang, Hanlin, McDermott, Josh, and Chung, SueYeon
Subjects: Computer Science - Machine Learning, Condensed Matter - Disordered Systems and Neural Networks, Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Encouraged by the success of deep neural networks on a variety of visual tasks, much theoretical and experimental work has been aimed at understanding and interpreting how vision networks operate. Meanwhile, deep neural networks have also achieved impressive performance in audio processing applications, both as sub-components of larger systems and as complete end-to-end systems by themselves. Despite their empirical successes, comparatively little is understood about how these audio models accomplish these tasks. In this work, we employ a recently developed statistical mechanical theory that connects geometric properties of network representations and the separability of classes to probe how information is untangled within neural networks trained to recognize speech. We observe that speaker-specific nuisance variations are discarded by the network's hierarchy, whereas task-relevant properties such as words and phonemes are untangled in later layers. Higher level concepts such as parts-of-speech and context dependence also emerge in the later layers of the network. Finally, we find that the deep representations carry out significant temporal untangling by efficiently extracting task-relevant features at each time step of the computation. Taken together, these findings shed light on how deep auditory models process time dependent input signals to achieve invariant speech recognition, and show how different concepts emerge through the layers of the network., Comment: Advances in Neural Information Processing Systems. 2019
Published: 2020

3. Label-efficient audio classification through multitask learning and self-supervision

Author: Lee, Tyler, Gong, Ting, Padhy, Suchismita, Rouditchenko, Andrew, and Ndirango, Anthony
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Machine Learning, Computer Science - Sound, Statistics - Machine Learning
Abstract: While deep learning has been incredibly successful in modeling tasks with large, carefully curated labeled datasets, its application to problems with limited labeled data remains a challenge. The aim of the present work is to improve the label efficiency of large neural networks operating on audio data through a combination of multitask learning and self-supervised learning on unlabeled data. We trained an end-to-end audio feature extractor based on WaveNet that feeds into simple, yet versatile task-specific neural networks. We describe several easily implemented self-supervised learning tasks that can operate on any large, unlabeled audio corpus. We demonstrate that, in scenarios with limited labeled training data, one can significantly improve the performance of three different supervised classification tasks individually by up to 6% through simultaneous training with these additional self-supervised tasks. We also show that incorporating data augmentation into our multitask setting leads to even further gains in performance., Comment: Presented at ICLR 2019 Limited Labeled Data (LLD) Workshop
Published: 2019

4. Untangling in Invariant Speech Recognition

Author: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, Center for Brains, Minds, and Machines, Stephenson, Cory, Feather, Jenelle, Padhy, Suchismita, Elibol, Oguz, Tang, Hanlin, McDermott, Josh, Chung, SueYeon, Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences, Center for Brains, Minds, and Machines, Stephenson, Cory, Feather, Jenelle, Padhy, Suchismita, Elibol, Oguz, Tang, Hanlin, McDermott, Josh, and Chung, SueYeon
Abstract: © 2019 Neural information processing systems foundation. All rights reserved. Encouraged by the success of deep neural networks on a variety of visual tasks, much theoretical and experimental work has been aimed at understanding and interpreting how vision networks operate. Meanwhile, deep neural networks have also achieved impressive performance in audio processing applications, both as sub-components of larger systems and as complete end-to-end systems by themselves. Despite their empirical successes, comparatively little is understood about how these audio models accomplish these tasks. In this work, we employ a recently developed statistical mechanical theory that connects geometric properties of network representations and the separability of classes to probe how information is untangled within neural networks trained to recognize speech. We observe that speaker-specific nuisance variations are discarded by the network's hierarchy, whereas task-relevant properties such as words and phonemes are untangled in later layers. Higher level concepts such as parts-of-speech and context dependence also emerge in the later layers of the network. Finally, we find that the deep representations carry out significant temporal untangling by efficiently extracting task-relevant features at each time step of the computation. Taken together, these findings shed light on how deep auditory models process time dependent input signals to achieve invariant speech recognition, and show how different concepts emerge through the layers of the network.
Published: 2021

5. A Comparison of Loss Weighting Strategies for Multi task Learning in Deep Neural Networks

Author: Gong, Ting, primary, Lee, Tyler, additional, Stephenson, Cory, additional, Renduchintala, Venkata, additional, Padhy, Suchismita, additional, Ndirango, Anthony, additional, Keskin, Gokce, additional, and Elibol, Oguz H., additional
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

5 results on '"Padhy, Suchismita"'

1. On the geometry of generalization and memorization in deep neural networks

2. Untangling in Invariant Speech Recognition

3. Label-efficient audio classification through multitask learning and self-supervision

4. Untangling in Invariant Speech Recognition

5. A Comparison of Loss Weighting Strategies for Multi task Learning in Deep Neural Networks

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

5 results on '"Padhy, Suchismita"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources