Author: "İslamoğlu, Gamze" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"İslamoğlu, Gamze"' showing total 3 results

Start Over Author "İslamoğlu, Gamze"

Author: Wiese, Philip, İslamoğlu, Gamze, Scherer, Moritz, Macan, Luka, Jung, Victor J. B., Burrello, Alessio, Conti, Francesco, and Benini, Luca
Subjects: Computer Science - Hardware Architecture, Computer Science - Machine Learning
Abstract: One of the challenges for Tiny Machine Learning (tinyML) is keeping up with the evolution of Machine Learning models from Convolutional Neural Networks to Transformers. We address this by leveraging a heterogeneous architectural template coupling RISC-V processors with hardwired accelerators supported by an automated deployment flow. We demonstrate an Attention-based model in a tinyML power envelope with an octa-core cluster coupled with an accelerator for quantized Attention. Our deployment flow enables an end-to-end 8-bit MobileBERT, achieving leading-edge energy efficiency and throughput of 2960 GOp/J and 154 GOp/s at 32.5 Inf/s consuming 52.0 mW (0.65 V, 22 nm FD-SOI technology)., Comment: Pre-print manuscript submitted for review to the IEEE Design and Test Special Issue on tinyML
Published: 2024

Author: İslamoğlu, Gamze, Scherer, Moritz, Paulin, Gianna, Fischer, Tim, Jung, Victor J. B., Garofalo, Angelo, and Benini, Luca
Subjects: Computer Science - Hardware Architecture, Computer Science - Machine Learning
Abstract: Transformer networks have emerged as the state-of-the-art approach for natural language processing tasks and are gaining popularity in other domains such as computer vision and audio processing. However, the efficient hardware acceleration of transformer models poses new challenges due to their high arithmetic intensities, large memory requirements, and complex dataflow dependencies. In this work, we propose ITA, a novel accelerator architecture for transformers and related models that targets efficient inference on embedded systems by exploiting 8-bit quantization and an innovative softmax implementation that operates exclusively on integer values. By computing on-the-fly in streaming mode, our softmax implementation minimizes data movement and energy consumption. ITA achieves competitive energy efficiency with respect to state-of-the-art transformer accelerators with 16.9 TOPS/W, while outperforming them in area efficiency with 5.93 TOPS/mm$^2$ in 22 nm fully-depleted silicon-on-insulator technology at 0.8 V., Comment: Accepted for publication at the 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED)
Published: 2023
Full Text: View/download PDF

Author: İslamoğlu, Gamze, Çakıcı, Tuğberk Oğulcan, Güzelhan, Şeyda Nur, Afacan, Engin, and Dündar, Günhan
Published: 2021
Full Text: View/download PDF

Books, media, physical & digital resources

Searchworks