Parallel Data Lab

Academia's premiere data systems research center.
Leading research in storage systems, databases, ML systems, cloud computing, data lakes, etc.

Learn More

Parallel Data Lab

Academia's premiere data systems research center.
Leading research in storage systems, databases, ML systems, cloud computing, data lakes, etc.

Learn More

Parallel Data Lab

Academia's premiere data systems research center.
Leading research in storage systems, databases, ML systems, cloud computing, data lakes, etc.

Learn More

Recent PDL Publications

The PDL Packet - Fall 2024 Newsletter

3 Papers at SOSP!

LithOS: An Operating System for Efficient Machine Learning on GPUs

Patrick H. Coppock, Brian Zhang, Eliot H. Solomon, Vasilis Kypriotis, Leon Yang, Bikash Sharma, Dan Schatzberg, Todd C. Mowry, Dimitrios Skarlatos

SOSP ’25, October 13–16, 2025, Seoul, Republic of Korea.

The surging demand for GPUs in datacenters for machine learning (ML) workloads has made efficient GPU utilization crucial. However, meeting the diverse needs of individual ML models while optimizing resource usage is challenging. To enable transparent, fine-grained management of GPU resources that maximizes GPU utilization and energy efficiency while maintaining strong isolation, an operating systems (OS) approach is needed. Hence this paper introduces LithOS, a first step towards a GPU OS.[...more]

COpter: Efficient Large-Scale Resource-Allocation via Continual Optimization

Suhas Jayaram Subramanya, Don Kurian Dennis, Gregory R. Ganger, Virginia Smith

SOSP ’25, October 13–16, 2025, Seoul, Republic of Korea.

Optimization-based resource allocation in large-scale systems often must trade-off responsiveness and allocation quality. Generally, allocations are reconsidered every few minutes (a round) by formulating and solving a new optimization problem. This paper introduces continual optimization, which reframes round-based resource allocation as a sequence of interconnected problems, leveraging the observation that these resource allocation problems often only change by small amounts across successive rounds to reduce solving times. [...more]

Moirai: Optimizing Placement of Data and Compute in Hybrid Clouds

Ziyue Qiu, Hojin Park, Jing Zhao, Yukai Wang, Arnav Balyan, Gurmeet Singh, Yangjun Zhang, Suqiang (Jack) Song, Gregory R. Ganger, George Amvrosiadis

SOSP ’25, October 13–16, 2025, Seoul, Republic of Korea.

The deployment of large-scale data analytics between onpremise and cloud sites, i.e., hybrid clouds, requires careful partitioning of both data and computation to avoid massive networking costs. We present Moirai, a cost-optimization framework that analyzes job accesses and data dependencies and optimizes the placement of both in hybrid clouds. Moirai informs the job scheduler of data location and access predictions [...more]

Parallel Data Lab

Parallel Data Lab

Parallel Data Lab

Recent PDL Publications

Recent PDL News

PDL Alum Jure Leskovec Receives CMU 2025 Alumni Achievement Award

Best Research Paper Runner-up at VLDB'25

Akshitha Sriraman Wins Google ML & Systems Award

Contact us

Recent Events

PDL Retreat 2024

PDL Retreat 2023

PDL Retreat 2022

Social Media