PARALLEL DATA LAB 

PDL Abstract

Unleashing In-network Computing on Scientific Workloads

arXiv:2009.02457v1 [cs.NI], 5 Sep 2020.

Daehyeok Kim1, Ankush Jain1, Zaoxing Liu1, George Amvrosiadis1, Damian Hazen2, Bradley Settlemyer3, Vyas Sekar1

1Carnegie Mellon University
2Lawrence Berkeley National Laboratory
3Los Alamos National Laboratory

http://www.pdl.cmu.edu

Many recent efforts have shown that in-network computing can benefit various datacenter applications. In this paper, we explore a relatively less-explored domain which we argue can benefit from in-network computing: scientific workloads in high-performance computing. By analyzing canonical examples of HPC applications, we observe unique opportunities and challenges for exploiting in-network computing to accelerate scientific workloads. In particular, we find that the dynamic and demanding nature of scientific workloads is the major obstacle to the adoption of in-network approaches which are mostly open-loop and lack runtime feedback. In this paper, we present NSinC (Network-accelerated ScIeNtific Computing), an architecture for fully unleashing the potential benefits of in-network computing for scientific workloads by providing closed-loop runtime feedback to innetwork acceleration services. We outline key challenges in realizing this vision and a preliminary design to enable acceleration for scientific applications.

FULL PAPER: pdf