Sessions

Large-scale Data Analysis Using the App Engine Pipeline API

Brett Slatkin

The Pipeline API makes it easy to analyze complex data using App Engine. This talk will cover how to build multi-phase Map Reduce workflows; how to merge multiple large data sources with "join" operations; and how to build reusable analysis components. It will also cover the API's concurrency model, how to debug in production, and built-in testing facilities.

Level: 301
Track: App Engine
Time: May 11, 04:15PM – 05:15PM
Room: Room 6

About Brett Slatkin

Brett is the co-creator of the PubSubHubbub protocol and a Software Engineer on the Google App Engine team, joining Google in 2005. He holds a B.S. in Computer Engineering from Columbia University.