Back to Search Start Over

Distributed Data Cache Designs for Clustered VLIW Processors.

Authors :
Gibert, Enric
Sánchez, Jesús
González, Antonio
Source :
IEEE Transactions on Computers; Oct2005, Vol. 54 Issue 10, p1227-1241, 15p
Publication Year :
2005

Abstract

Wire delays are a major concern for current and forthcoming processors. One approach to deal with this problem is to divide the processor into semi-independent units referred to as clusters. A cluster usually consists of a local register file and a subset of the functional units, while the L1 data cache typically remains centralized in what we call partially distributed architectures. However, as technology evolves, the relative latency of such a centralized cache will increase, leading to an important impact on performance. In this paper, we propose partitioning the L1 data cache among clusters for clustered VLIW processors. We refer to this kind of design as fully distributed processors. In particular, we propose and evaluate three different configurations: a snoop-based cache coherence scheme, a word-interleaved cache, and flexible L0 buffers managed by the compiler. For each alternative, instruction scheduling techniques targeted to cyclic code are developed. Results for the Mediabench suite show that the performance of such fully distributed architectures is always better than the performance of a partially distributed one with the same amount of resources. In addition, the key aspects of each fully distributed configuration are explored. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00189340
Volume :
54
Issue :
10
Database :
Complementary Index
Journal :
IEEE Transactions on Computers
Publication Type :
Academic Journal
Accession number :
18224439
Full Text :
https://doi.org/10.1109/TC.2005.163