Search

Your search keyword '"Pellauer, Michael"' showing total 145 results

Search Constraints

Start Over You searched for: Author "Pellauer, Michael" Remove constraint Author: "Pellauer, Michael"
145 results on '"Pellauer, Michael"'

Search Results

1. FuseMax: Leveraging Extended Einsums to Optimize Attention Accelerator Design

2. Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models

3. TeAAL: A Declarative Framework for Modeling Sparse Tensor Accelerators

4. Exploiting Inter-Operation Data Reuse in Scientific Applications using GOGETA

5. Flexagon: A Multi-Dataflow Sparse-Sparse Matrix Multiplication Accelerator for Efficient DNN Processing

6. Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling

7. A Formalism of DNN Accelerator Flexibility

8. DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators

9. Enabling Flexibility for Sparse Tensor Acceleration via Heterogeneity

10. Self-Adaptive Reconfigurable Arrays (SARA): Using ML to Assist Scaling GEMM Acceleration

11. Marvel: A Data-centric Compiler for DNN Operators on Spatial Accelerators

12. Heterogeneous Dataflow Accelerators for Multi-DNN Workloads

13. Understanding Reuse, Performance, and Hardware Cost of DNN Dataflows: A Data-Centric Approach Using MAESTRO

14. UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition

15. Conclusions

17. Orchestrating Compressed-Sparse Data

18. Buffer Hierarchies

19. Modeling Accelerator Design Space

20. Dataflow and Data Reuse

21. Networks-on-Chip

22. Introduction to Data Orchestration

23. Counterexamples and Proof Loophole for the C/C++ to POWER and ARMv7 Trailing-Sync Compiler Mappings

24. TriCheck: Memory Model Verification at the Trisection of Software, Hardware, and ISA

25. TeAAL: A Declarative Framework for Modeling Sparse Tensor Accelerators

31. Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling (Extended Abstract)

32. Symphony: Orchestrating Sparse and Dense Tensors with Hierarchical Heterogeneous Processing

33. Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling

34. Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling

46. ExTensor

47. Buffets

48. Leveraging latency-insensitivity to ease multiple FPGA design

49. A modular digital VLSI flow for high-productivity SoC design

50. INVITED: A Modular Digital VLSI Flow for High-Productivity SoC Design

Catalog

Books, media, physical & digital resources