Parallel Data Laboratory

PDL Abstract

The CacheLib Caching Engine: Design and Experiences at Scale

14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), Virtual Event, Nov. 4–6, 2020.

Benjamin Berg¹, Daniel S. Berger^1,3, Sara McAllister¹, Isaac Grosof¹, Sathya Gunasekar², Jimmy Lu², Michael Uhlar², Jim Carrig², Nathan Beckmann¹, Mor Harchol-Balter¹, Gregory R. Ganger¹

1 Carnegie Mellon University
2 Facebook
3 Microsoft Research

http://www.pdl.cmu.edu/

Web services rely on caching at nearly every layer of the system architecture. Commonly, each cache is implemented and maintained independently by a distinct team and is highly specialized to its function. For example, an application-data cache would be independent from a CDN cache. However, this approach ignores the difficult challenges that different caching systems have in common, greatly increasing the overall effort required to deploy, maintain, and scale each cache.

This paper presents a different approach to cache development, successfully employed at Facebook, which extracts a core set of common requirements and functionality from otherwise disjoint caching systems. CacheLib is a generalpurpose caching engine, designed based on experiences with a range of caching use cases at Facebook, that facilitates the easy development and maintenance of caches. CacheLib was first deployed at Facebook in 2017 and today powers over 70 services including CDN, storage, and application-data caches.

This paper describes our experiences during the transition from independent, specialized caches to the widespread adoption of CacheLib. We explain how the characteristics of production workloads and use cases at Facebook drove important design decisions. We describe how caches at Facebook have evolved over time, including the significant benefits seen from deploying CacheLib. We also discuss the implications our experiences have for future caching design and research.

FULL PAPER: pdf / slides / talk video

FURTHER INFO: CacheLib, Facebook’s open source caching engine for web-scale services

PARALLEL DATA LAB

PDL Publications

PDL Abstract

The CacheLib Caching Engine: Design and Experiences at Scale

Contact us

Recent Events

PDL Retreat 2024

PDL Retreat 2023

PDL Retreat 2022

Social Media