Monday, August 1, 2011 - NOTE SPECIAL DAY
12:00 pm - 1:00 pm
PLACE: CIC 2101
TITLE: Cluster Management at Google
Cluster management is the term that Google uses to describe how we control the computing infrastructure in our datacenters that supports almost all of our external services. It includes allocating resources to different applications on our fleet of computers, looking after software installations, and monitoring. My goal is to present an overview of these systems, introduce the new cluster-manager tool we are building, and present some of the challenges that we're facing along the way. Many of these challenges represent research opportunities, too, so I'll spend the majority of the time discussing those.
John Wilkes has been at Google since 2008, where he is working on cluster management and service level agreements for infrastructure services. He is interested in far too many aspects of distributed systems, but a recurring theme has been technologies that allow systems to manage themselves. He received a PhD in computer science from the University of Cambridge, joined HP Labs in 1982, and was elected an HP Fellow and an ACM Fellow in 2002 for his work on storage system design. Along the way, he's been PC chair for SOSP, FAST, EuroSys and HotCloud, and served on the steering committees for EuroSys, SoCC, ICAC and FAST. He's listed an inventor on 36 US patents, and has an adjunct Professor appointment at Carnegie-Mellon University. In his spare time he continues, stubbornly, trying to learn how to blow glass.
SDI / LCS Seminar Questions?
Karen Lindenfelser, 86716, or visit www.pdl.cmu.edu/SDI/