PARALLEL DATA LAB 

PDL Abstract

Robust and Flexible Power-proportional Storage

ACM Symposium on Cloud Computing (SOCC). June 10-11, 2010, Indianapolis, IN. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-10-106, February 2010.

Hrishikesh Amur†, James Cipar, Varun Gupta, Gregory R. Ganger, Michael A. Kozuch‡,
Karsten Schwan†

Dept. Electrical and Computer Engineering
Carnegie Mellon University
Pittsburgh, PA 15213

†Georgia Institute of Technology
‡Intel Labs Pittsburgh

http://www.pdl.cmu.edu/

Power-proportional cluster-based storage is an important component of an overall cloud computing infrastructure. With it, substantial subsets of nodes in the storage cluster can be turned off to save power during periods of low utilization. Rabbit is a distributed file system that arranges its data-layout to provide ideal power-proportionality down to very low minimum number of powered-up nodes (enough to store a primary replica of available datasets). Rabbit addresses the node failure rates of large-scale clusters with data layouts that minimize the number of nodes that must be powered-up if a primary fails. Rabbit also allows different datasets to use different subsets of nodes as a building block for interference avoidance when the infrastructure is shared by multiple tenants. Experiments with a Rabbit prototype demonstrate its power-proportionality, and simulation experiments demonstrate its properties at scale.

KEYWORDS: Cluster Computing, Power-proportionality, Data-layout

FULL PAPER: pdf
FULL TR: pdf