Author: "Harma A" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Harma A"' showing total 3 results

Start Over Author "Harma A" Publication Type Reports

3 results on '"Harma A"'

1. Effective Interplay between Sparsity and Quantization: From Theory to Practice

Author: Harma, Simla Burcu, Chakraborty, Ayan, Kostenok, Elizaveta, Mishin, Danila, Ha, Dongho, Falsafi, Babak, Jaggi, Martin, Liu, Ming, Oh, Yunho, Subramanian, Suvinay, and Yazdanbakhsh, Amir
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The increasing size of deep neural networks necessitates effective model compression to improve computational efficiency and reduce their memory footprint. Sparsity and quantization are two prominent compression methods that have individually demonstrated significant reduction in computational and memory footprints while preserving model accuracy. While effective, the interplay between these two methods remains an open question. In this paper, we investigate the interaction between these two methods and assess whether their combination impacts final model accuracy. We mathematically prove that applying sparsity before quantization is the optimal sequence for these operations, minimizing error in computation. Our empirical studies across a wide range of models, including OPT and Llama model families (125M-8B) and ViT corroborate these theoretical findings. In addition, through rigorous analysis, we demonstrate that sparsity and quantization are not orthogonal; their interaction can significantly harm model accuracy, with quantization error playing a dominant role in this degradation. Our findings extend to the efficient deployment of large models in resource-limited compute platforms and reduce serving cost, offering insights into best practices for applying these compression methods to maximize efficacy without compromising accuracy.
Published: 2024

2. Accuracy Booster: Enabling 4-bit Fixed-point Arithmetic for DNN Training

Author: Harma, Simla Burcu, Chakraborty, Ayan, Sperry, Nicholas, Falsafi, Babak, Jaggi, Martin, and Oh, Yunho
Subjects: Computer Science - Machine Learning
Abstract: The unprecedented demand for computing resources to train DNN models has led to a search for minimal numerical encoding. Recent state-of-the-art (SOTA) proposals advocate for multi-level scaled narrow bitwidth numerical formats. In this paper, we show that single-level scaling is sufficient to maintain training accuracy while maximizing arithmetic density. We identify a previously proposed single-level scaled format for 8-bit training, Hybrid Block Floating Point (HBFP), as the optimal candidate to minimize. We perform a full-scale exploration of the HBFP design space using mathematical tools to study the interplay among various parameters and identify opportunities for even smaller encodings across layers and epochs. Based on our findings, we propose Accuracy Booster, a mixed-mantissa HBFP technique that uses 4-bit mantissas for over 99% of all arithmetic operations in training and 6-bit mantissas only in the last epoch and first/last layers. We show Accuracy Booster enables increasing arithmetic density over all other SOTA formats by at least 2.3x while achieving state-of-the-art accuracies in 4-bit training.
Published: 2022

3. School Choice for the Poor? The Limits of Marketisation of Primary Education in Rural India. CREATE Pathways to Access. Research Monograph No. 23

Author: Consortium for Research on Educational Access, Transitions and Equity (CREATE) and Harma, Joanna
Abstract: In recent years India has seen an explosion in low-fee private (LFP) schooling aimed at the poorer strata of society. This marketisation of primary education is a reaction to the well-documented failings of the government system. This paper looks at LFP schooling in one rural district of Uttar Pradesh, and compares government to low cost private schools in this area. It explores whether LFPs are affordable to the rural poor and marginalised by examining the key factors in parental decision making and ultimately discovering whether equity considerations are served. CREATE's zones of exclusion model is appended. (Contains 17 footnotes, 23 tables, and 1 figure.)
Published: 2010

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Harma A"'

1. Effective Interplay between Sparsity and Quantization: From Theory to Practice

2. Accuracy Booster: Enabling 4-bit Fixed-point Arithmetic for DNN Training

3. School Choice for the Poor? The Limits of Marketisation of Primary Education in Rural India. CREATE Pathways to Access. Research Monograph No. 23

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

3 results on '"Harma A"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources