Author: "Szankin, Maciej" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Szankin, Maciej"' showing total 26 results

1. LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

Author: Sarah, Anthony, Sridhar, Sharath Nittur, Szankin, Maciej, and Sundaresan, Sairam
Subjects: Computer Science - Artificial Intelligence
Abstract: The abilities of modern large language models (LLMs) in solving natural language processing, complex reasoning, sentiment analysis and other tasks have been extraordinary which has prompted their extensive adoption. Unfortunately, these abilities come with very high memory and computational costs which precludes the use of LLMs on most hardware platforms. To mitigate this, we propose an effective method of finding Pareto-optimal network architectures based on LLaMA2-7B using one-shot NAS. In particular, we fine-tune LLaMA2-7B only once and then apply genetic algorithm-based search to find smaller, less computationally complex network architectures. We show that, for certain standard benchmark tasks, the pre-trained LLaMA2-7B network is unnecessarily large and complex. More specifically, we demonstrate a 1.5x reduction in model size and 1.3x speedup in throughput for certain tasks with negligible drop in accuracy. In addition to finding smaller, higher-performing network architectures, our method does so more effectively and efficiently than certain pruning or sparsification techniques. Finally, we demonstrate how quantization is complementary to our method and that the size and complexity of the networks we find can be further decreased using quantization. We believe that our work provides a way to automatically create LLMs which can be used on less expensive and more readily available hardware platforms.
Published: 2024

2. SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search

Author: Sridhar, Sharath Nittur, Szankin, Maciej, Chen, Fang, Sundaresan, Sairam, and Sarah, Anthony
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Recent one-shot Neural Architecture Search algorithms rely on training a hardware-agnostic super-network tailored to a specific task and then extracting efficient sub-networks for different hardware platforms. Popular approaches separate the training of super-networks from the search for sub-networks, often employing predictors to alleviate the computational overhead associated with search. Additionally, certain methods also incorporate the quantization policy within the search space. However, while the quantization policy search for convolutional neural networks is well studied, the extension of these methods to transformers and especially foundation models remains under-explored. In this paper, we demonstrate that by using multi-objective search algorithms paired with lightly trained predictors, we can efficiently search for both the sub-network architecture and the corresponding quantization policy and outperform their respective baselines across different performance objectives such as accuracy, model size, and latency. Specifically, we demonstrate that our approach performs well across both uni-modal (ViT and BERT) and multi-modal (BEiT-3) transformer-based architectures as well as convolutional architectures (ResNet). For certain networks, we demonstrate an improvement of up to 4.80x and 3.44x for latency and model size respectively, without degradation in accuracy compared to the fully quantized INT8 baselines.
Published: 2023

3. InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning

Author: Sridhar, Sharath Nittur, Kundu, Souvik, Sundaresan, Sairam, Szankin, Maciej, and Sarah, Anthony
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: One-Shot Neural Architecture Search (NAS) algorithms often rely on training a hardware agnostic super-network for a domain specific task. Optimal sub-networks are then extracted from the trained super-network for different hardware platforms. However, training super-networks from scratch can be extremely time consuming and compute intensive especially for large models that rely on a two-stage training process of pre-training and fine-tuning. State of the art pre-trained models are available for a wide range of tasks, but their large sizes significantly limits their applicability on various hardware platforms. We propose InstaTune, a method that leverages off-the-shelf pre-trained weights for large models and generates a super-network during the fine-tuning stage. InstaTune has multiple benefits. Firstly, since the process happens during fine-tuning, it minimizes the overall time and compute resources required for NAS. Secondly, the sub-networks extracted are optimized for the target task, unlike prior work that optimizes on the pre-training objective. Finally, InstaTune is easy to "plug and play" in existing frameworks. By using multi-objective evolutionary search algorithms along with lightly trained predictors, we find Pareto-optimal sub-networks that outperform their respective baselines across different performance objectives such as accuracy and MACs. Specifically, we demonstrate that our approach performs well across both unimodal (ViT and BERT) and multi-modal (BEiT-3) transformer based architectures.
Published: 2023

4. Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT

Author: Kundu, Souvik, Sridhar, Sharath Nittur, Szankin, Maciej, and Sundaresan, Sairam
Subjects: Computer Science - Computation and Language
Abstract: Large pre-trained language models have recently gained significant traction due to their improved performance on various down-stream tasks like text classification and question answering, requiring only few epochs of fine-tuning. However, their large model sizes often prohibit their applications on resource-constrained edge devices. Existing solutions of yielding parameter-efficient BERT models largely rely on compute-exhaustive training and fine-tuning. Moreover, they often rely on additional compute heavy models to mitigate the performance gap. In this paper, we present Sensi-BERT, a sensitivity driven efficient fine-tuning of BERT models that can take an off-the-shelf pre-trained BERT model and yield highly parameter-efficient models for downstream tasks. In particular, we perform sensitivity analysis to rank each individual parameter tensor, that then is used to trim them accordingly during fine-tuning for a given parameter or FLOPs budget. Our experiments show the efficacy of Sensi-BERT across different downstream tasks including MNLI, QQP, QNLI, SST-2 and SQuAD, showing better performance at similar or smaller parameter budget compared to various alternatives. Comment: 6 pages, 5 figures, 2 tables
Published: 2023

5. A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities

Author: Cummings, Daniel, Sarah, Anthony, Sridhar, Sharath Nittur, Szankin, Maciej, Munoz, Juan Pablo, and Sundaresan, Sairam
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
Abstract: Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extract specialized hardware-aware sub-network configurations from a task-specific super-network. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still under-explored. Popular methods decouple the super-network training from the sub-network search and use performance predictors to reduce the computational burden of searching on different hardware platforms. We propose a flexible search framework that automatically and efficiently finds optimal sub-networks that are optimized for different performance metrics and hardware configurations. Specifically, we show how evolutionary algorithms can be paired with lightly trained objective predictors in an iterative cycle to accelerate architecture search in a multi-objective setting for various modalities including machine translation and image classification.
Published: 2022

6. A Hardware-Aware System for Accelerating Deep Neural Network Optimization

Author: Sarah, Anthony, Cummings, Daniel, Sridhar, Sharath Nittur, Sundaresan, Sairam, Szankin, Maciej, Webb, Tristan, and Munoz, J. Pablo
Subjects: Computer Science - Artificial Intelligence
Abstract: Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still largely under-explored. For example, some recent network morphism techniques allow a super-network to be trained once and then have hardware-specific networks extracted from it as needed. These methods decouple the super-network training from the sub-network search and thus decrease the computational burden of specializing to different hardware platforms. We propose a comprehensive system that automatically and efficiently finds sub-networks from a pre-trained super-network that are optimized to different performance metrics and hardware configurations. By combining novel search tactics and algorithms with intelligent use of predictors, we significantly decrease the time needed to find optimal sub-networks from a given super-network. Further, our approach does not require the super-network to be refined for the target task a priori, thus allowing it to interface with any super-network. We demonstrate through extensive experiments that our system works seamlessly with existing state-of-the-art super-network training methods in multiple domains. Moreover, we show how novel search tactics paired with evolutionary algorithms can accelerate the search process for ResNet50, MobileNetV3 and Transformer while maintaining objective space Pareto front diversity and demonstrate an 8x faster search result than the state-of-the-art Bayesian optimization WeakNAS approach.
Published: 2022

7. Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms

Author: Cummings, Daniel, Sridhar, Sharath Nittur, Sarah, Anthony, and Szankin, Maciej
Subjects: Computer Science - Neural and Evolutionary Computing
Abstract: Neural architecture search (NAS), the study of automating the discovery of optimal deep neural network architectures for tasks in domains such as computer vision and natural language processing, has seen rapid growth in the machine learning research community. While there have been many recent advancements in NAS, there is still a significant focus on reducing the computational cost incurred when validating discovered architectures by making search more efficient. Evolutionary algorithms, specifically genetic algorithms, have a history of usage in NAS and continue to gain popularity versus other optimization approaches as a highly efficient way to explore the architecture objective space. Most NAS research efforts have centered around computer vision tasks and only recently have other modalities, such as the rapidly growing field of natural language processing, been investigated in depth. In this work, we show how genetic algorithms can be paired with lightly trained objective predictors in an iterative cycle to accelerate multi-objective architectural exploration in a way that works in the modalities of both machine translation and image classification.
Published: 2022

8. Sensi-Bert: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient Language Model

Author: Kundu, Souvik, primary, Sridhar, Sharath Nittur, additional, Szankin, Maciej, additional, and Sundaresan, Sairam, additional
Published: 2024

9. Super-resolved thermal imagery for high-accuracy facial areas detection and analysis

Author: Kwasniewska, Alicja, Ruminski, Jacek, Szankin, Maciej, and Kaczmarek, Mariusz
Published: 2020

10. InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning

Author: Sridhar, Sharath Nittur, primary, Kundu, Souvik, additional, Sundaresan, Sairam, additional, Szankin, Maciej, additional, and Sarah, Anthony, additional
Published: 2023

11. Thermal Image Processing for Respiratory Estimation from Cubical Data with Expandable Depth

Author: Szankin, Maciej, primary, Kwasniewska, Alicja, additional, and Ruminski, Jacek, additional
Published: 2023

12. Can AI See Bias in X-ray Images?

Author: Szankin, Maciej, primary and Kwasniewska, Alicja, additional
Published: 2022

13. Accelerating neural architecture exploration across modalities using genetic algorithms

Author: Cummings, Daniel, primary, Sridhar, Sharath Nittur, additional, Sarah, Anthony, additional, and Szankin, Maciej, additional
Published: 2022

14. Fully Automated AI-powered Contactless Cough Detection based on Pixel Value Dynamics Occurring within Facial Regions

Author: Szankin, Maciej, primary, Kwasniewska, Alicja, additional, Glowacka, Natalia, additional, Ruminski, Jacek, additional, Nicolas, Rey, additional, and Gamba, David, additional
Published: 2021

15. Improving Accuracy of Respiratory Rate Estimation by Restoring High Resolution Features with Transformers and Recursive Convolutional Models

Author: Kwasniewska, Alicja, primary, Szankin, Maciej, additional, Ruminski, Jacek, additional, Sarah, Anthony, additional, and Gamba, David, additional
Published: 2021

16. Deep Learning Optimization for Edge Devices: Analysis of Training Quantization Parameters

Author: Kwasniewska, Alicja, primary, Szankin, Maciej, additional, Ozga, Mateusz, additional, Wolfe, Jason, additional, Das, Arun, additional, Zajac, Adam, additional, Ruminski, Jacek, additional, and Rad, Paul, additional
Published: 2019

17. Evaluation of Facial Pulse Signals using Deep Neural Net Models

Author: Ruminski, Jacek, primary, Kwasniewska, Alicja, additional, Szankin, Maciej, additional, Kocejko, Tomasz, additional, and Mazur-Milecka, Magdalena, additional
Published: 2019

18. Evaluating Accuracy of Respiratory Rate Estimation from Super Resolved Thermal Imagery

Author: Kwasniewska, Alicja, primary, Szankin, Maciej, additional, Ruminski, Jacek, additional, and Kaczmarek, Mariusz, additional
Published: 2019

19. Influence of Thermal Imagery Resolution on Accuracy of Deep Learning based Face Recognition

Author: Szankin, Maciej, primary, Kwasniewska, Alicja, additional, and Ruminski, Jacek, additional
Published: 2019

20. Road Condition Evaluation Using Fusion of Multiple Deep Models on Always-On Vision Processor

Author: Szankin, Maciej, primary, Kwasniewska, Alicja, additional, Ruminski, Jacek, additional, and Nicolas, Rey, additional
Published: 2018

21. Optical Sensor Based Gestures Inference Using Recurrent Neural Network in Mobile Conditions

Author: Czuszynski, Krzysztof, primary, Kwasniewska, Alicja, additional, Szankin, Maciej, additional, and Ruminski, Jacek, additional
Published: 2018

22. Speaker Recognition Using Convolutional Neural Network with Minimal Training Data for Smart Home Solutions

Author: Wang, Mingshan, primary, Sirlapu, Tejaswini, additional, Kwasniewska, Alicja, additional, Szankin, Maciej, additional, Bartscherer, Marko, additional, and Nicolas, Rey, additional
Published: 2018

23. Real-Time Facial Features Detection from Low Resolution Thermal Images with Deep Classification Models

Author: Kwaśniewska, Alicja, primary, Rumiński, Jacek, additional, Czuszyński, Krzysztof, additional, and Szankin, Maciej, additional
Published: 2018

24. Improving Accuracy of Contactless Respiratory Rate Estimation by Enhancing Thermal Sequences with Deep Neural Networks.

Author: Kwasniewska, Alicja, Ruminski, Jacek, and Szankin, Maciej
Subjects: Home wireless technology, Artificial neural networks, Vital signs, Emotion recognition, Image processing, Deep learning
Abstract: Featured Application: The proposed Super Resolution Deep Neural Network allows for improving accuracy of Respiratory Rate (RR) estimation from extremely low resolution thermal sequences, i.e., 40 × 30 pixels. To the best of our knowledge deep learning hasn't been used for telemedicine use cases aimed at vital signs monitoring before. Thus, there are many potential applications where it can be useful, i.e., remote diagnostics using smart home platforms, long-distance vital signs monitoring in difficult to reach areas using cameras mounted on drones, monitoring of driver's and passengers' state of health in self-driving vehicles, emotions recognition from vital signs, or detecting unusual behaviors e.g., abnormal respiratory rate patterns at security checkpoints. Estimation of vital signs using image processing techniques have already been proved to have a potential for supporting remote medical diagnostics and replacing traditional measurements that usually require special hardware and electrodes placed on a body. In this paper, we further extend studies on contactless Respiratory Rate (RR) estimation from extremely low resolution thermal imagery by enhancing acquired sequences using Deep Neural Networks (DNN). To perform extensive benchmark evaluation, we acquired two thermal datasets using FLIR® cameras with a spatial resolution of 80 × 60 and 320 × 240 from 71 volunteers in total. In-depth analysis of the proposed Convolutional-based Super Resolution model showed that for images downscaled with a factor of 2 and then super-resolved using Deep Learning (DL) can lead to better RR estimation accuracy than from original high-resolution sequences. In addition, if an estimator based on a dominating peak in the frequency domain is used, SR can outperform original data for a down-scale factor of 4 and images as small as 20 × 15 pixels. Our study also showed that RR estimation accuracy is better for super-resolved data than for images with color changes magnified using algorithms previously applied in the literature for enhancing vital signs patterns.
Published: 2019

25. Evaluating Accuracy of Respiratory Rate Estimation from Super Resolved Thermal Imagery.

Author: Kwasniewska A, Szankin M, Ruminski J, and Kaczmarek M
Subjects: Algorithms, Image Enhancement, Image Processing, Computer-Assisted, Signal-To-Noise Ratio, Respiratory Rate
Abstract: Non-contact estimation of Respiratory Rate (RR) has revolutionized the process of establishing the measurement by surpassing some issues related to attaching sensors to a body, e.g. epidermal stripping, skin disruption and pain. In this study, we perform further experiments with image processing-based RR estimation by using various image enhancement algorithms. Specifically, we employ Super Resolution (SR) Deep Learning (DL) network to generate hallucinated thermal image sequences that are then analyzed to extract breathing signals. DL-based SR networks have been proved to increase image quality in terms of Peak Signal-to-Noise ratio. However, it hasn't been evaluated yet whether it leads to better RR estimation accuracy, what we address in this study. Our research confirms that for estimator based on the dominated peak in the frequency spectrum Root Mean Squared Error improves by 0.15bpm for 8-bit and by 0.84bpm for 16-bit data comparing to original sequences if hallucinated frames are used. Mean Absolute Error is reduced by 0.63bpm for average aggregator and by 2.06bpm for skewness. This finding can enable various remote monitoring solutions that may suffer from poorer accuracy due to low spatial resolution of utilized thermal cameras.
Published: 2019

26. Evaluation of Facial Pulse Signals using Deep Neural Net Models.

Author: Ruminski J, Kwasniewska A, Szankin M, Kocejko T, and Mazur-Milecka M
Subjects: Algorithms, Humans, Face, Neural Networks, Computer, Photoplethysmography, Pulse, Signal Processing, Computer-Assisted
Abstract: The reliable measurement of the pulse rate using remote photoplethysmography (PPG) is very important for many medical applications. In this paper we present how deep neural networks (DNNs) models can be used in the problem of PPG signal classification and pulse rate estimation. In particular, we show that the DNN-based classification results correspond to parameters describing the PPG signals (e.g. peak energy in the frequency domain, SNR, etc.). The results show that it is possible to identify regions of a face, for which reliable PPG signals can be extracted. The accuracy obtained for the classification task and the mean absolute error achieved for the regression task proved the usefulness of the DNN models.
Published: 2019
