Author: "Wen-mei W. Hwu" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wen-mei W. Hwu"' showing total 652 results

Start Over Author "Wen-mei W. Hwu"

652 results on '"Wen-mei W. Hwu"'

1. TBA: Faster Large Language Model Training Using SSD-Based Activation Offloading.

Author: Kun Wu 0002, Jeongmin Brian Park, Xiaofan Zhang 0001, Mert Hidayetoglu, Vikram Sharma Mailthody, Sitao Huang, Steven S. Lumetta, and Wen-Mei W. Hwu
Published: 2024
Full Text: View/download PDF

2. HiCCL: A Hierarchical Collective Communication Library.

Author: Mert Hidayetoglu, Simon Garcia de Gonzalo, Elliott Slaughter, Pinku Surana, Wen-Mei W. Hwu, William Gropp, and Alex Aiken
Published: 2024
Full Text: View/download PDF

3. LSM-GNN: Large-scale Storage-based Multi-GPU GNN Training by Optimizing Data Transfer Scheme.

Author: Jeongmin Brian Park, Kun Wu 0002, Vikram Sharma Mailthody, Zaid Qureshi, Scott A. Mahlke, and Wen-Mei W. Hwu
Published: 2024
Full Text: View/download PDF

4. GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture.

Author: Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seungwon Min, Amna Masood, Jeongmin Brian Park, Jinjun Xiong, Chris J. Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William J. Dally, and Wen-Mei W. Hwu
Published: 2023
Full Text: View/download PDF

5. Parallelizing Maximal Clique Enumeration on GPUs.

Author: Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, and Wen-mei W. Hwu
Published: 2023
Full Text: View/download PDF

6. FSSD: FPGA-Based Emulator for SSDs.

Author: Luyang Yu, Yizhen Lu, Meghna Mandava, Edward Richter, Vikram Sharma Mailthody, Seungwon Min, Wen-Mei W. Hwu, and Deming Chen
Published: 2023
Full Text: View/download PDF

7. An efficient GPU implementation and scaling for higher-order 3D stencils.

Author: Omer Anjum, Mohammad Almasri, Simon Garcia de Gonzalo, and Wen-Mei W. Hwu
Published: 2022
Full Text: View/download PDF

8. Exploring HW/SW Co-Design for Video Analysis on CPU-FPGA Heterogeneous Systems.

Author: Xiaofan Zhang 0001, Yuan Ma, Jinjun Xiong, Wen-Mei W. Hwu, Volodymyr V. Kindratenko, and Deming Chen
Published: 2022
Full Text: View/download PDF

9. MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging.

Author: Mert Hidayetoglu, Tekin Biçer, Simon Garcia de Gonzalo, Bin Ren, Doga Gürsoy, Rajkumar Kettimuthu, Ian T. Foster, and Wen-Mei W. Hwu
Published: 2022
Full Text: View/download PDF

10. RackBlox: A Software-Defined Rack-Scale Storage System with Network-Storage Co-Design.

Author: Benjamin Reidys, Yuqi Xue, Daixuan Li, Bharat Sukhwani, Wen-mei W. Hwu, Deming Chen, Sameh W. Asaad, and Jian Huang 0006
Published: 2023
Full Text: View/download PDF

11. IGB: Addressing The Gaps In Labeling, Features, Heterogeneity, and Size of Public Graph Datasets for Deep Learning Research.

Author: Arpandeep Khatua, Vikram Sharma Mailthody, Bhagyashree Taleka, Tengfei Ma 0001, Xiang Song 0003, and Wen-mei W. Hwu
Published: 2023
Full Text: View/download PDF

12. CODAG: Characterizing and Optimizing Decompression Algorithms for GPUs.

Author: Jeongmin Brian Park, Zaid Qureshi, Vikram S. Mailthody, Andrew Gacek, Shunfan Shao, Mohammad Almasri, Isaac Gelado, Jinjun Xiong, Chris J. Newburn, I-Hsin Chung, Michael Garland, Nikolay Sakharnykh, and Wen-Mei W. Hwu
Published: 2023
Full Text: View/download PDF

13. PIGEON: Optimizing CUDA Code Generator for End-to-End Training and Inference of Relational Graph Neural Networks.

Author: Kun Wu 0002, Mert Hidayetoglu, Xiang Song 0003, Sitao Huang, Da Zheng, Israt Nisa, and Wen-Mei W. Hwu
Published: 2023
Full Text: View/download PDF

14. Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles.

Author: Sultan Durrani, Muhammad Saad Chughtai, Mert Hidayetoglu, Rashid Tahir, Abdul Dakkak, Lawrence Rauchwerger, Fareed Zaffar, and Wen-Mei W. Hwu
Published: 2021
Full Text: View/download PDF

15. Mixed Precision Quantization for ReRAM-based DNN Inference Accelerators.

Author: Sitao Huang, Aayush Ankit, Plínio Silveira, Rodrigo Antunes, Sai Rahul Chalamalasetti, Izzat El Hajj, Dong Eun Kim, Glaucimar Aguiar, Pedro Bruel, Sergey Serebryakov, Cong Xu, Can Li, Paolo Faraboschi, John Paul Strachan, Deming Chen, Kaushik Roy 0001, Wen-Mei W. Hwu, and Dejan S. Milojicic
Published: 2021
Full Text: View/download PDF

16. PhraseScope: An Effective and Unsupervised Framework for Mining High Quality Phrases.

Author: Omer Anjum, Mohammad Almasri, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2021
Full Text: View/download PDF

17. Node-Aware Stencil Communication for Heterogeneous Supercomputers.

Author: Carl Pearson, Mert Hidayetoglu, Mohammad Almasri, Omer Anjum, I-Hsin Chung, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2020
Full Text: View/download PDF

18. FReaC Cache: Folded-logic Reconfigurable Computing in the Last Level Cache.

Author: Ashutosh Dhar, Xiaohao Wang, Hubertus Franke, Jinjun Xiong, Jian Huang 0006, Wen-Mei W. Hwu, Nam Sung Kim, and Deming Chen
Published: 2020
Full Text: View/download PDF

19. Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation.

Author: Zhonghao Wang, Yunchao Wei, Rogério Schmidt Feris, Jinjun Xiong, Wen-Mei W. Hwu, Thomas S. Huang, and Honghui Shi
Published: 2020
Full Text: View/download PDF

20. The Design and Implementation of a Scalable Deep Learning Benchmarking Platform.

Author: Cheng Li 0014, Abdul Dakkak, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2020
Full Text: View/download PDF

21. EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions.

Author: Yuhong Li, Cong Hao, Xiaofan Zhang 0001, Xinheng Liu, Yao Chen 0008, Jinjun Xiong, Wen-mei W. Hwu, and Deming Chen
Published: 2020
Full Text: View/download PDF

22. Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture.

Author: Seungwon Min, Kun Wu 0002, Sitao Huang, Mert Hidayetoglu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, and Wen-mei W. Hwu
Published: 2021
Full Text: View/download PDF

23. PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning Inference.

Author: Aayush Ankit, Izzat El Hajj, Sai Rahul Chalamalasetti, Geoffrey Ndu, Martin Foltin, R. Stanley Williams, Paolo Faraboschi, Wen-mei W. Hwu, John Paul Strachan, Kaushik Roy 0001, and Dejan S. Milojicic
Published: 2019
Full Text: View/download PDF

24. FlatFlash: Exploiting the Byte-Accessibility of SSDs within a Unified Memory-Storage Hierarchy.

Author: Ahmed H. M. O. Abulila, Vikram Sharma Mailthody, Zaid Qureshi, Jian Huang 0006, Nam Sung Kim, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

25. Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus.

Author: Hongyu Gong, Suma Bhat, Lingfei Wu 0001, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

26. MemXCT: memory-centric X-ray CT reconstruction with massive parallelization.

Author: Mert Hidayetoglu, Tekin Biçer, Simon Garcia De Gonzalo, Bin Ren, Doga Gürsoy, Rajkumar Kettimuthu, Ian T. Foster, and Wen-mei W. Hwu
Published: 2019
Full Text: View/download PDF

27. Accelerating Sparse Deep Neural Networks on FPGAs.

Author: Sitao Huang, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Deming Chen, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

28. Update on k-truss Decomposition on GPU.

Author: Mohammad Almasri, Omer Anjum, Carl Pearson, Zaid Qureshi, Vikram S. Mailthody, Rakesh Nagi, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

29. Update on Triangle Counting on GPU.

Author: Carl Pearson, Mohammad Almasri, Omer Anjum, Vikram S. Mailthody, Zaid Qureshi, Rakesh Nagi, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

30. Analysis and Modeling of Collaborative Execution Strategies for Heterogeneous CPU-FPGA Architectures.

Author: Sitao Huang, Li-Wen Chang, Izzat El Hajj, Simon Garcia De Gonzalo, Juan Gómez-Luna, Sai Rahul Chalamalasetti, Mohamed El-Hadedy 0001, Dejan S. Milojicic, Onur Mutlu, Deming Chen, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

31. Near-Memory and In-Storage FPGA Acceleration for Emerging Cognitive Computing Workloads.

Author: Ashutosh Dhar, Sitao Huang, Jinjun Xiong, Damir A. Jamsek, Bruno Mesnet, Jian Huang 0006, Nam Sung Kim, Wen-Mei W. Hwu, and Deming Chen
Published: 2019
Full Text: View/download PDF

32. Accelerating reduction and scan using tensor core units.

Author: Abdul Dakkak, Cheng Li 0014, Jinjun Xiong, Isaac Gelado, and Wen-Mei W. Hwu
Published: 2019
Full Text: View/download PDF

33. A Compiler Framework for Optimizing Dynamic Parallelism on GPUs.

Author: Mhd Ghaith Olabi, Juan Gómez-Luna, Onur Mutlu, Wen-Mei W. Hwu, and Izzat El Hajj
Published: 2022

34. BaM: A Case for Enabling Fine-grain High Throughput GPU-Orchestrated Access to Storage.

Author: Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seungwon Min, Amna Masood, Jeongmin Brian Park, Jinjun Xiong, Chris J. Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William J. Dally, and Wen-Mei W. Hwu
Published: 2022
Full Text: View/download PDF

35. Parallelizing Maximal Clique Enumeration on GPUs.

Author: Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, and Wen-mei W. Hwu
Published: 2022
Full Text: View/download PDF

36. DLSpec: A Deep Learning Task Exchange Specification.

Author: Abdul Dakkak, Cheng Li 0014, Jinjun Xiong, and Wen-mei W. Hwu
Published: 2020

37. A Fast and Massively-Parallel Inverse Solver for Multiple-Scattering Tomographic Image Reconstruction.

Author: Mert Hidayetoglu, Carl Pearson, Izzat El Hajj, Levent Gürel, Weng Cho Chew, and Wen-Mei W. Hwu
Published: 2018
Full Text: View/download PDF

38. Application-Transparent Near-Memory Processing Architecture with Memory Channel Network.

Author: Mohammad Alian, Seungwon Min, Hadi Asgharimoghaddam, Ashutosh Dhar, Dong Kai Wang, Thomas Roewer, Adam J. McPadden, Oliver O'Halloran, Deming Chen, Jinjun Xiong, Daehoon Kim, Wen-Mei W. Hwu, and Nam Sung Kim
Published: 2018
Full Text: View/download PDF

39. PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses.

Author: Seungwon Min, Kun Wu 0002, Sitao Huang, Mert Hidayetoglu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, and Wen-Mei W. Hwu
Published: 2021

40. MLHarness: A Scalable Benchmarking System for MLCommons.

Author: Yen-Hsiang Chang, Jianhao Pu, Wen-Mei W. Hwu, and Jinjun Xiong
Published: 2021

41. Graph Neural Network Training with Data Tiering.

Author: Seungwon Min, Kun Wu 0002, Mert Hidayetoglu, Jinjun Xiong, Xiang Song 0003, and Wen-mei W. Hwu
Published: 2021

42. K-Clique Counting on GPUs.

Author: Mohammad Almasri, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2021

43. Open Relation Modeling: Learning to Define Relations between Entities.

Author: Jie Huang 0009, Kevin Chen-Chuan Chang, Jinjun Xiong, and Wen-Mei W. Hwu
Published: 2021

44. Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach.

Author: Jie Huang 0009, Kevin Chen-Chuan Chang, Jinjun Xiong, and Wen-mei W. Hwu
Published: 2021

45. RAI: A Scalable Project Submission System for Parallel Programming Courses.

Author: Abdul Dakkak, Carl Pearson, Cheng Li 0014, and Wen-mei W. Hwu
Published: 2017
Full Text: View/download PDF

46. Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures.

Author: Simon Garcia De Gonzalo, Simon D. Hammond, Christian R. Trott, and Wen-Mei W. Hwu
Published: 2017
Full Text: View/download PDF

47. Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts.

Author: Raymond A. Yeh, Jinjun Xiong, Wen-Mei W. Hwu, Minh N. Do, and Alexander G. Schwing
Published: 2017

48. Generalize or Die: Operating Systems Support for Memristor-Based Accelerators.

Author: Pedro Bruel, Sai Rahul Chalamalasetti, Chris I. Dalton, Izzat El Hajj, Alfredo Goldman, Catherine Graves, Wen-Mei W. Hwu, Phil Laplante, Dejan S. Milojicic, Geoffrey Ndu, and John Paul Strachan
Published: 2017
Full Text: View/download PDF

49. Rebooting the Data Access Hierarchy of Computing Systems.

Author: Wen-mei W. Hwu, Izzat El Hajj, Simon Garcia De Gonzalo, Carl Pearson, Nam Sung Kim, Deming Chen, Jinjun Xiong, and Zehra Sura
Published: 2017
Full Text: View/download PDF

50. Hardware Acceleration of the Pair-HMM Algorithm for DNA Variant Calling.

Author: Sitao Huang, Gowthami Jayashri Manikandan, Anand Ramachandran 0001, Kyle Rupnow, Wen-mei W. Hwu, and Deming Chen
Published: 2017

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Database

Publisher

652 results on '"Wen-mei W. Hwu"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources