[Untitled]

Authors :
Joan-Manuel Parcerisa
Ramon Canal
Antonio González
Source :
International Journal of Parallel Programming. 29:59-79
Publication Year :
2001
Publisher :
Springer Science and Business Media LLC, 2001.

Abstract

Recent work^(1) shows that the delays introduced by the issue and bypass logic will become critical for wide-issue superscalar processors. One of the proposed solutions is to cluster the processor core. Clustered architectures benefit from a less complex, partitioned processor core and thus incur less critical delays. In this paper, we propose a dynamic instruction steering logic for these clustered architectures that decides, at decode time, the cluster in which each instruction is executed. The performance of clustered architectures depends on the inter-cluster communication overhead and on the workload balance. We present a scheme that uses run-time information to optimize the trade-off between these two factors. The evaluation shows that this scheme achieves an average speed-up of 35% over a conventional 8-way issue (4 int + 4 fp) machine and that it outperforms previous proposals, both static and dynamic.
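
The trade-off described in the abstract, between inter-cluster communication and workload balance, can be illustrated with a minimal sketch. This is not the steering logic from the paper, whose details are not given in this record. The function name `steer`, the register-to-cluster map, the per-cluster load counters, and the `imbalance_threshold` parameter are all hypothetical, chosen only to show one plausible way such a decision could be made at decode time.

```python
from typing import Dict, List


def steer(src_regs: List[str],
          reg_cluster: Dict[str, int],
          load: List[int],
          imbalance_threshold: int = 4) -> int:
    """Pick a cluster for one instruction (hypothetical heuristic).

    src_regs:    source registers read by the instruction
    reg_cluster: cluster currently holding each register's value
    load:        number of pending instructions in each cluster
    """
    least_loaded = min(range(len(load)), key=lambda c: load[c])

    # Assumption: when clusters are badly unbalanced, favor balance
    # over operand locality.
    if max(load) - min(load) > imbalance_threshold:
        return least_loaded

    # Otherwise count how many source operands live in each cluster,
    # since executing there avoids inter-cluster copies.
    votes = [0] * len(load)
    for reg in src_regs:
        if reg in reg_cluster:
            votes[reg_cluster[reg]] += 1

    best = max(votes)
    if best == 0:
        # No operand is mapped to a cluster, so only balance matters.
        return least_loaded

    # Among clusters holding the most operands, prefer the lowest load.
    candidates = [c for c, v in enumerate(votes) if v == best]
    return min(candidates, key=lambda c: load[c])
```

For example, under these assumptions, `steer(["r1", "r2"], {"r1": 0, "r2": 0}, [3, 1])` selects cluster 0 because both operands reside there and the load difference is small, whereas `steer(["r1"], {"r1": 0}, [10, 1])` selects cluster 1 because the imbalance exceeds the threshold.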

Details

ISSN :
08857458
Volume :
29
Database :
OpenAIRE
Journal :
International Journal of Parallel Programming
Accession number :
edsair.doi...........63525ac40000e22ad690175e3cbd351e