Back to Search Start Over

Use of Computation-Unit Integrated Memories in High-Level Synthesis.

Authors :
Chao Huang
Ravi, Srivaths
Raghunathan, Anand
Jha, Niraj K.
Source :
IEEE Transactions on Computer-Aided Design of Integrated Circuits & Systems. Oct2006, Vol. 25 Issue 10, p1969-1989. 21p. 16 Diagrams, 4 Charts, 1 Graph.
Publication Year :
2006

Abstract

High-level synthesis (HLS) of memory-intensive applications has featured several innovations in terms of enhancements made to the basic memory organization and data layout. However, increasing performance and energy demands faced by application-specific integrated circuits (ASICs) are forcing designers to alter the fundamental architectural template of the HLS output, namely, a controller datapath associated with a memory subsystem (monolithic, partitioned, etc.). An architectural template for the HLS output that consists of a controller-datapath circuit associated with a memory subsystem into which computation units have been integrated is proposed. The enhanced memory subsystem is called computation-unit integrated memory (CIM). A CIM offers higher memory bandwidth (relative to what is offered through the system bus) to computation units present locally within it and reduces the overall communication between the memory subsystem and the controller datapath, thus providing a template highly suitable for deriving efficient implementations of memory-intensive applications. This paper addresses the challenge of providing a systematic synthesis framework for a CIM-based architecture. This framework can analyze the various tradeoffs involved in selecting suitable operations in a behavior for execution using a CIM and generate a high-performance low-overhead implementation. Efficient data reuse of register files have also been fully exploited to further improve system performance. Experiments with several behaviors indicate that an average performance improvement of 2.02× (a maximum of 2.70×) is possible with very low area overheads. The energy-delay product improves by an average of 2.5× (maximum of 3.8×). [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02780070
Volume :
25
Issue :
10
Database :
Academic Search Index
Journal :
IEEE Transactions on Computer-Aided Design of Integrated Circuits & Systems
Publication Type :
Academic Journal
Accession number :
22659720
Full Text :
https://doi.org/10.1109/TCAD.2005.862749