Petascale Data Storage at CMU: Projects

Analyzing Failure Data
This project analyzes failure data to better understand what makes systems unreliable, with the aim of building more reliable systems.

FSSTATS: Static Survey of File System Statistics
The goal of this project is to provide tools and services that facilitate worldwide collection of data on static file tree attributes, and to aggregate this data into a large database that anyone can query and view.
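To make "static file tree attributes" concrete, here is a minimal sketch of the kind of survey such a tool might run. It is not the FSSTATS tool itself: the function name `survey_tree`, the chosen attributes (file count, directory count, total bytes, maximum depth), and the power-of-two size histogram are all illustrative assumptions. It reads only metadata, never file contents.

```python
import os
import stat
from collections import Counter


def survey_tree(root):
    """Walk the tree under `root` and summarize static (metadata-only) attributes.

    Hypothetical helper for illustration; not part of FSSTATS.
    """
    summary = {"files": 0, "dirs": 0, "total_bytes": 0, "max_depth": 0}
    # Bucket b counts files whose size has bit length b:
    # bucket 0 holds empty files, bucket b >= 1 holds sizes in [2**(b-1), 2**b).
    size_histogram = Counter()

    root = os.path.abspath(root)
    base_depth = root.rstrip(os.sep).count(os.sep)

    # os.walk does not follow symbolic links to directories by default.
    for dirpath, _dirnames, filenames in os.walk(root):
        summary["dirs"] += 1
        depth = dirpath.rstrip(os.sep).count(os.sep) - base_depth
        summary["max_depth"] = max(summary["max_depth"], depth)

        for name in filenames:
            try:
                st = os.lstat(os.path.join(dirpath, name))
            except OSError:
                continue  # file vanished or is unreadable; skip it
            if not stat.S_ISREG(st.st_mode):
                continue  # count regular files only (skip symlinks, sockets, ...)
            summary["files"] += 1
            summary["total_bytes"] += st.st_size
            size_histogram[st.st_size.bit_length()] += 1

    return summary, dict(size_histogram)
```

Calling `survey_tree(".")` returns a summary dictionary and a size histogram for the current directory. Because the result holds only aggregate counts and no file names, summaries of this shape could in principle be pooled across many sites.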

GIGA+: Scalable Directories for Shared File Systems (or, How to Build Directories with Trillions of Files)
This project explores how to build file system directories that contain billions to trillions of entries and that can grow instantly, with all cores contributing concurrently. The central tenet of our work is extreme scalability through high parallelism.