Author: "Henri Fraisse" / Topic: 0202 electrical engineering, electronic engineering, information engineering - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Henri Fraisse"' showing total 5 results

1. A Domain-Specific Architecture for Accelerating Sparse Matrix Vector Multiplication on FPGAs

Author: Lisa Liu, Henri Fraisse, Mansimran Benipal, Hossein Omidian, Abhishek Kumar Jain, and Dinesh D. Gaitonde
Subjects: 010302 applied physics, Modularity (networks), Memory hierarchy, Plug and play, Computer science, Sparse matrix-vector multiplication, 02 engineering and technology, Parallel computing, 01 natural sciences, 020202 computer hardware & architecture, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering, Routing (electronic design automation), Field-programmable gate array, Block (data storage), Efficient energy use
Abstract: FPGAs allow custom memory hierarchy and flexible data movement with highly fine-grained control. These capabilities are critical for building high-performance and energy-efficient domain-specific architectures (DSAs), especially for workloads with irregular memory access and data-dependent communication patterns. Sparse linear algebra operations, especially sparse matrix vector multiplication (SpMV), are examples of such workloads and are becoming important due to their use in numerous areas of science and engineering. Existing FPGA-based DSAs for SpMV do not allow customization through plug and play of the building blocks. For example, most of these DSAs require a switching network/crossbar architecture as a building block for routing matrix data to banked vector memory blocks. In this paper, we first present an approach where a custom network is built using simple blocks arranged in a regular fashion to exploit low-level architecture details. Further, we make use of this network to replace the expensive crossbars employed in the GEMX SpMV engine and develop an end-to-end tool-flow around a mixed IP approach (HLS/RTL). Due to the modularity of our design, our tool-flow allows us to insert an additional block in the design to guarantee zero-stall from the accumulation stage. On Alveo U200, we report performance numbers of up to 4.4 GFLOPS (92% peak bandwidth utilization) using our accelerator (attached with one DDR4).
Published: 2020
Full Text: View/download PDF

2. SAT Based Place-And-Route for High-Speed Designs on 2.5D FPGAs

Author: Dinesh D. Gaitonde, Chirag Ravishankar, and Henri Fraisse
Subjects: business.industry, Computer science, Interface (computing), 020208 electrical & electronic engineering, Spec#, 02 engineering and technology, Clock skew, 020202 computer hardware & architecture, Embedded system, Scalability, 0202 electrical engineering, electronic engineering, information engineering, Place and route, Routing (electronic design automation), Field-programmable gate array, business, computer, Communication channel, computer.programming_language
Abstract: 2.5D stacking technology allows us to build high-performance and high-capacity FPGA devices at reasonable costs. Communication between multiple dies happens on a passive silicon interposer at high speed, which poses several interesting challenges. Due to clock skew characteristics across multiple dies and an increase in the min-max spread of delays, place-and-route tools need to address inter-die hold violations and optimize for performance. We implement a tractable SAT-based methodology to achieve this by minimally detouring data paths to meet all hold requirements while optimizing performance. We also confine the solution to a small window around each inter-die (Laguna) channel to reduce routing resource utilization and congestion, and to scale the methodology to any Laguna channel utilization. We improve performance across the interface by 11% compared to a state-of-the-art commercial flow and meet a 500 MHz spec on Xilinx(R) UltraScale+(TM) devices in the 2E speedgrade. We address the scalability concerns of SAT and show how we can use this in practice with negligible runtimes in implementation tools. Our solution paves the way for FPGA-as-a-service platforms where fast inter-die communication that does not interfere with user-specific logic is pivotal to their success.
Published: 2018
Full Text: View/download PDF

3. A SAT-based Timing Driven Place and Route Flow for Critical Soft IP

Author: Dinesh D. Gaitonde and Henri Fraisse
Subjects: Emulation, Computer science, business.industry, Soft IP, 02 engineering and technology, Timing closure, 020202 computer hardware & architecture, Embedded system, Scalability, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Place and route, Boolean satisfiability problem, Field-programmable gate array, business, Hardware_LOGICDESIGN, PCI Express
Abstract: Many FPGA designs contain soft IP tightly connected to hard blocks such as the on-chip processor, PCIe, or I/Os. Generally, these soft IPs pose significant timing closure challenges. In this paper, we propose a timing-driven place-and-route flow based on Boolean Satisfiability (SAT). Its main advantages over previous SAT-based approaches are its improved scalability and its timing awareness. We validate our flow using an IP targeting the emulation market. We demonstrate that our flow can significantly improve the usable bandwidth of FPGA I/Os. Since the proposed flow is SAT-based, its performance does not depend on the specific ways in which more traditional place-and-route flows are usually tuned.
Published: 2018
Full Text: View/download PDF

4. Automated extra pipeline analysis of applications mapped to Xilinx UltraScale+ FPGAs

Author: Henri Fraisse, Ilya K. Ganusov, Rafael Trapani Possignolo, Aaron Ng, and Sabyasachi Das
Subjects: Computer science, business.industry, Pipeline (computing), Overhead (engineering), 0211 other engineering and technologies, 02 engineering and technology, 020202 computer hardware & architecture, Set (abstract data type), Computer architecture, Embedded system, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), Algorithm design, Hardware_ARITHMETICANDLOGICSTRUCTURES, Performance improvement, Heuristics, business, Field-programmable gate array, Hardware_REGISTER-TRANSFER-LEVELIMPLEMENTATION, 021106 design practice & management
Abstract: This paper describes the methodology and algorithms behind the extra pipeline analysis tools released in the Xilinx Vivado Design Suite version 2015.3. Extra pipelining is one of the most effective ways to improve the performance of FPGA applications. Manual pipelining, however, often requires significant effort from FPGA designers, who need to explore various changes in the RTL and re-run the flow iteratively. The automatic pipelining approach described in this paper, in contrast, allows FPGA users to explore latency vs. performance trade-offs of their designs before investing time and effort into modifying RTL. We describe the algorithms behind these tools, which use simple cut heuristics to maximize performance improvement while minimizing additional latency and register overhead. To demonstrate the effectiveness of the proposed approach, we analyse a set of 93 commercial FPGA applications and IP blocks mapped to Xilinx UltraScale+ and UltraScale generations of FPGAs. The results show that extra pipelining can provide from 18% to 29% potential Fmax improvement on average. They also show that the distribution of improvements is bimodal, with almost half of the benchmark suite designs showing no improvement due to the presence of large loops. Finally, we demonstrate that highly pipelined designs map well to UltraScale+ and UltraScale FPGA architectures. Our approach demonstrates 19% and 20% Fmax improvement potential for the UltraScale+ and UltraScale architectures respectively, with the majority of applications reaching their loop limit through pipelining.
Published: 2016
Full Text: View/download PDF

5. Boolean Satisfiability-Based Routing and Its Application to Xilinx UltraScale Clock Network

Author: Alireza S. Kaviani, Abhishek Joshi, Dinesh D. Gaitonde, and Henri Fraisse
Subjects: 010302 applied physics, Computer science, 02 engineering and technology, Parallel computing, 01 natural sciences, 020202 computer hardware & architecture, Clock network, Distance-vector routing protocol, Link-state routing protocol, Robustness (computer science), 0103 physical sciences, Scalability, Multipath routing, Hardware_INTEGRATEDCIRCUITS, 0202 electrical engineering, electronic engineering, information engineering, Boolean satisfiability problem, Field-programmable gate array, Hardware_LOGICDESIGN
Abstract: Boolean Satisfiability (SAT)-based routing offers a unique advantage over conventional routing algorithms by providing an exhaustive approach to finding a solution. Despite that advantage, commercial FPGA CAD tools rarely use SAT-based routers due to scalability issues. In this paper, we revisit SAT-based routing and propose two SAT formulations independent of routing architecture. We then demonstrate that SAT-based routing using either formulation dramatically outperforms conventional routing algorithms in both runtime and robustness for the clock routing of Xilinx UltraScale devices. Finally, we experimentally show that one of the proposed SAT formulations leads to routing that is 18x faster and produces formulas 20x more compact than the other. This framework has been implemented in Vivado and is now used in production.
Published: 2016
Full Text: View/download PDF
