Back to Search Start Over

ACC

Authors :
Derui Liu
Xiaoliang Wang
Siyu Yan
Zheng Xiaolong
Deng Weishan
Xia Yinben
Source :
SIGCOMM
Publication Year :
2021
Publisher :
ACM, 2021.

Abstract

For the widely deployed ECN-based congestion control schemes, the marking threshold is the key to deliver high bandwidth and low latency. However, due to traffic dynamics in the high-speed production networks, it is difficult to maintain persistent performance by using the static ECN setting. To meet the operational challenge, in this paper we report the design and implementation of an automatic run-time optimization scheme, ACC, which leverages the multi-agent reinforcement learning technique to dynamically adjust the marking threshold at each switch. The proposed approach works in a distributed fashion and combines offline and online training to adapt to dynamic traffic patterns. It can be easily deployed based on the common features supported by major commodity switching chips. Both testbed experiments and large-scale simulations have shown that ACC achieves low flow completion time (FCT) for both mice flows and elephant flows at line-rate. Under heterogeneous production environments with 300 machines, compared with the well-tuned static ECN settings, ACC achieves up to 20\% improvement on IOPS and 30\% lower FCT for storage service. ACC has been applied in high-speed datacenter networks and significantly simplifies the network operations.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 2021 ACM SIGCOMM 2021 Conference
Accession number :
edsair.doi...........e01e103091298e95247cc92eff9ed342