ansaurus

Question

How can I load a file into a DataBag from within a Yahoo PigLatin UDF?

Answer 1

+1 A:

Cervo, there's a UDF in the piggybank called LookupInFiles that does more or less what you want. Check out the source code; it should be pretty straightforward to adapt to your needs.

http://svn.apache.org/viewvc/hadoop/pig/trunk/contrib/piggybank/java/src/main/java/org/apache/pig/piggybank/evaluation/string/LookupInFiles.java

Please email the list if you have any other issues, documentation suggestions, etc.

SquareCog 2010-05-07 04:16:47

That's basically how I ended up doing it: FileLocalizer.openDFSFile(filename, UDFContext.getUDFContext().getUDFProperties(myudf.class)). To test for existence, I wrapped the call in a try/catch (needed because Hadoop writes the output as part-0000, ..., part-0009). I would have liked to figure out how to get a working DataStorage or PigContext object so I could have used fileExists and so on, but no luck.

Cervo 2010-05-20 03:09:09
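For anyone following along, here is a minimal plain-Java sketch of the try/catch probing described in the comment above. It is not Pig code and not the commenter's actual implementation: the `Opener` interface, the `PartFileLoader` class, and the `loadAllParts` method are hypothetical names made up for illustration, the `List<String>` stands in for a DataBag, and the `part-%04d` pattern simply follows the file names mentioned in the comment. Inside a real UDF, the opener would presumably wrap the FileLocalizer.openDFSFile call quoted above.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Pig or piggybank.
class PartFileLoader {

    // Stand-in for whatever opens a path on the DFS. In a Pig UDF this is
    // where a call such as FileLocalizer.openDFSFile(...) would go (assumption).
    interface Opener {
        InputStream open(String path) throws IOException;
    }

    // Reads part-0000, part-0001, ... under dir until one fails to open,
    // collecting every line. The List<String> stands in for a DataBag.
    static List<String> loadAllParts(String dir, Opener opener) {
        List<String> rows = new ArrayList<>();
        for (int i = 0; ; i++) {
            // Naming pattern assumed from the comment above.
            String path = String.format(Locale.ROOT, "%s/part-%04d", dir, i);
            InputStream in;
            try {
                in = opener.open(path);
            } catch (IOException missing) {
                // A failed open is treated as "no more part files".
                break;
            }
            try (BufferedReader reader = new BufferedReader(
                    new InputStreamReader(in, StandardCharsets.UTF_8))) {
                String line;
                while ((line = reader.readLine()) != null) {
                    rows.add(line);
                }
            } catch (IOException e) {
                // A read error on a file that did open is a real failure.
                throw new UncheckedIOException(e);
            }
        }
        return rows;
    }
}
```

Note that treating every IOException from the open call as "file does not exist" also hides genuine failures such as permission or network errors, which is the weakness of the try/catch approach compared with a proper existence check.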
