Back to Search Start Over

ARCHER: a ReRAM-based accelerator for compressed recommendation systems.

Authors :
Shen, Xinyang
Liao, Xiaofei
Zheng, Long
Huang, Yu
Chen, Dan
Jin, Hai
Source :
Frontiers of Computer Science; Oct2024, Vol. 18 Issue 5, p1-14, 14p
Publication Year :
2024

Abstract

Modern recommendation systems are widely used in modern data centers. The random and sparse embedding lookup operations are the main performance bottleneck for processing recommendation systems on traditional platforms as they induce abundant data movements between computing units and memory. ReRAM-based processing-in-memory (PIM) can resolve this problem by processing embedding vectors where they are stored. However, the embedding table can easily exceed the capacity limit of a monolithic ReRAM-based PIM chip, which induces off-chip accesses that may offset the PIM profits. Therefore, we deploy the decomposed model on-chip and leverage the high computing efficiency of ReRAM to compensate for the decompression performance loss. In this paper, we propose ARCHER, a ReRAM-based PIM architecture that implements fully on-chip recommendations under resource constraints. First, we make a full analysis of the computation pattern and access pattern on the decomposed table. Based on the computation pattern, we unify the operations of each layer of the decomposed model in multiply-and-accumulate operations. Based on the access observation, we propose a hierarchical mapping schema and a specialized hardware design to maximize resource utilization. Under the unified computation and mapping strategy, we can coordinate the inter-processing elements pipeline. The evaluation shows that ARCHER outperforms the state-of-the-art GPU-based DLRM system, the state-of-the-art near-memory processing recommendation system RecNMP, and the ReRAM-based recommendation accelerator REREC by 15.79×, 2.21×, and 1.21× in terms of performance and 56.06×, 6.45×, and 1.71× in terms of energy savings, respectively. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20952228
Volume :
18
Issue :
5
Database :
Complementary Index
Journal :
Frontiers of Computer Science
Publication Type :
Academic Journal
Accession number :
174454164
Full Text :
https://doi.org/10.1007/s11704-023-3397-x