Tuesday, October 6, 2009
Yahoo is the primary developer of Hadoop and the runs the largest Hadoop clusters in the world. I will cover the scale of Hadoop usage at Yahoo and discuss recent challenges we have faced. I will end with a list of open research opportunities in Hadoop.
Eric Baldeschwieler has been a member of Yahoo!s web-search/cloud teams since 1996 and the founder of the Yahoo! team that has taken Hadoop from a 20 node prototype to a 25,000+ node service that powers key production applications across Yahoo! Previously he has worked on video Games, video special effects systems, and 3D rendering products. He holds a BS in Applied Math (CS) from Carnegie Mellon and an MS in CS from UC Berkeley.