Unleashing In-network Computing on Scientific Workloads

arXiv:2009.02457v1 [cs.NI], 5 Sep 2020.

Daehyeok Kim¹, Ankush Jain¹, Zaoxing Liu¹, George Amvrosiadis¹, Damian Hazen², Bradley Settlemyer³, Vyas Sekar¹

¹ Carnegie Mellon University
² Lawrence Berkeley National Laboratory
³ Los Alamos National Laboratory

Many recent efforts have shown that in-network computing can benefit various datacenter applications. In this paper, we examine a comparatively unexplored domain that we argue can also benefit from in-network computing: scientific workloads in high-performance computing (HPC). By analyzing canonical examples of HPC applications, we observe unique opportunities and challenges for exploiting in-network computing to accelerate scientific workloads. In particular, we find that the dynamic and demanding nature of scientific workloads is the major obstacle to adopting existing in-network approaches, which are mostly open-loop and lack runtime feedback. We present NSinC (Network-accelerated ScIeNtific Computing), an architecture for fully unleashing the potential benefits of in-network computing for scientific workloads by providing closed-loop runtime feedback to in-network acceleration services. We outline key challenges in realizing this vision and a preliminary design to enable acceleration for scientific applications.