DATE: Thursday, December 11, 2008
TIME: 12:00 pm - 1:00 pm
PLACE: Wean Hall 8220

Doug Terry
Microsoft Research Silicon Valley

Content-based Data Replication: Challenges and Techniques

As people increasingly use mobile devices and Internet services to manage and share large data collections, exploiting communication proximity and selectively replicating content is essential. Cimbiosys is a replicated storage platform that permits each device to define its own content-based filtering criteria and to share updates directly with other devices. This talk focuses on the key challenges in providing content-based partial replication in the face of fluid network connectivity, redefinable content filters, and changing content. I will discuss the design of Cimbiosys, which ensures two fundamental properties. First, every device eventually stores precisely those items whose latest version matches its custom filter. Second, every device represents its replication-specific metadata in a compact form, resulting in low data synchronization overhead. This permits ad hoc replication between newly encountered devices with overlapping interests and frequent synchronization between established partners, even over low-bandwidth wireless networks.

Doug Terry is a Principal Researcher in the Microsoft Research Silicon Valley lab. His research focuses on the design and implementation of novel distributed systems and addresses issues such as information management, fault tolerance, and mobility. He is currently serving as Chair of ACM's Special Interest Group on Operating Systems (SIGOPS). Prior to joining Microsoft, Doug was the co-founder and CTO of a start-up company named Cogenia, Chief Scientist of the Computer Science Laboratory at Xerox PARC, and an Adjunct Professor in the Computer Science Division at U. C. Berkeley, where he still occasionally teaches a graduate course on distributed systems. Doug has a Ph.D. in Computer Science from U. C. Berkeley.

