Sessions
Large-scale Data Analysis Using the App Engine Pipeline API
The Pipeline API makes it easy to analyze complex data using App Engine. This talk will cover how to build multi-phase MapReduce workflows, how to merge multiple large data sources with "join" operations, and how to build reusable analysis components. It will also cover the API's concurrency model, how to debug in production, and the built-in testing facilities.
About Brett Slatkin
Brett is the co-creator of the PubSubHubbub protocol and a Software Engineer on the Google App Engine team. He joined Google in 2005 and holds a B.S. in Computer Engineering from Columbia University.
