Toward Scalable RDMA through Resource Prefetching

April 2025

Abstract

RDMA networks are widely deployed in data centers, high-performance computing, and AI clusters. By offloading the network protocol stack to hardware, RDMA bypasses the operating system kernel, enabling high performance and low CPU overhead. However, protocol processing demands substantial communication resources, and because hardware resources are limited, commercial NICs (network interface cards) suffer a significant number of cache misses when the number of connections is large. The resulting performance degradation indicates that RDMA scales poorly. In this paper, we first analyze the characteristics of resource access in RDMA. Based on these characteristics, we propose a hardware mechanism for resource access prediction and prefetching, which preemptively fetches the resources required by the protocol processing pipeline into the on-chip cache and thereby increases the NIC's cache hit ratio. Evaluation results show that our approach improves throughput by 125% and reduces latency by 17.9% in large-scale communication scenarios.

Type

Journal article

Publication

In IEEE International Conference on High Performance Computing and Communications

马振龙

PhD student in Computer Architecture
