PDL Abstract

Toward Upgrades-as-a-Service in Distributed Systems

Poster Session at Middleware 2009. 10th International Middleware Conference, Urbana Champaign, Illinois, USA.

T. Dumitras, P. Narasimhan

Parallel Data Laboratory
Carnegie Mellon University
Pittsburgh, PA 15213


Unavailability in distributed enterprise systems is usually the result of planned events, such as upgrades, rather than failures. Major system upgrades entail complex data conversions that are difficult to perform on the fly, in the face of live workloads. Minimizing the downtime imposed by such conversions is a time-intensive and error-prone manual process. We propose upgrades-as-a-service, a novel approach that can eliminate all the causes of planned downtime recorded during the upgrade history of one of the ten most popular websites. Building on the lessons learned from past research on live upgrades in middleware systems, upgrades-as-a-service trades off a need for additional hardware resources during the upgrade for the ability to perform end-to-end upgrades online, with minimal application-specific knowledge.