Thursday, November 14, 2013
TIME: Noon - 1:00 pm
PLACE: CIC - 4th floor (ISTC Panther Hollow Room)
SPEAKER: Dmitriy Ryaboy, Engineering Manager, Analytics Infrastructure, Twitter
TITLE: Realtime and Batch Data Processing @ Twitter *
Summingbird, an open-source project recently released by Twitter, allows engineers to easily build data processing pipelines that work both in a streaming context, provided by Twitter Storm, and in an offline batch context, through Apache Hadoop. This talk will cover the practical motivation for building such a system and explain the core Summingbird architecture and components.
Dmitriy Ryaboy (@squarecog) manages the Twitter Analytics Infrastructure team. He's previously worked at Cloudera, Ask.com, and Lawrence Berkeley National Laboratory. He holds a Master's degree in VLIS from CMU and a Bachelor's in EECS from UC Berkeley.
VISITOR HOST: Andy Pavlo
VISITOR COORDINATOR: Jennifer Landefeld
SDI / ISTC SEMINAR QUESTIONS?
Karen Lindenfelser, 86716, or visit www.pdl.cmu.edu/SDI/
Joint with MCDS
*partially funded by