ansaurus

Question

Answer 1

+1 A:

Oozie http://yahoo.github.com/oozie/ is an Open Source server that Yahoo released to manage Hadoop & Pig workflow like you are asking

Cloudera has it in their latest distro with very good documentation https://wiki.cloudera.com/display/DOC/Oozie+Installation

Joe Stein 2010-09-02 20:47:04

Answer 2

A:

You should be able to generate the pig code for this pretty easily using Piglet, the Ruby Pig DSL: http://github.com/iconara/piglet

SquareCog 2010-09-05 06:53:53

Variable/looping sequence of jobs