How to keep the sequence file created by map in hadoop | ansaurus

tags:

views:

24

answers:

1

Q:

How to keep the sequence file created by map in hadoop

Hi

I am using hadoop and working with a map task that creates files that I want to keep, currently I am passing these files through the collector to the reduce task. The reduce task then passes these files on to its collector, this allows me to retain the files.

My question is how do I reliably and efficiently keep the files created by map?

I know I can turn off the automatic deletion of map's output, but that is frowned upon are they any better approaches?

Thanks

A:

You could split it up into two jobs.

First create a map only job outputting the sequence files you want.

Then, taking your existing job (doing really nothing in the map anymore but you could do some crunching depending on your implementation & use cases) and reducing as you do now inputting the previous map only job through as your input to the second job.

You can wrap this all up in one jar running the 2 jars as such passing the output path as an argument to the second jobs input path.

Joe Stein 2010-08-11 23:44:22

Thanks, but I needed to use the files within the map. For example, I create an image and then extract certain features from the image. I decided to have each tasktracker create a sequence file, and have the map function retrieve a static reference to the sequence file.

akintayo 2010-08-13 23:34:37

related questions

In STL maps, is it better to use map::insert than []?

F#: How do I use Map with a Collection (like Regex Matches)?

Which data structure would you use: TreeMap or HashMap? (Java)

Geographical Data Visualization in a Web Application

Does .NET have a Dictionary implementation that is equivalent to Java's ConcurrentHashMap?

rails map.resources with has_many :through doesn't work?

Why does this map statement in Perl not compile?

Does OCaml have general map()/reduce() functions?

delete a specific entry in the map,but the iterator must point to the next element after the deletion.

Why are Maps returned by a JAX-WS call always empty?

Can I serialize map of STL in MFC using CArchive?

Hashtable/dictionary/map lookup with regular expressions

Is there a map() function in ExtJS?

how do you make a heterogeneous boost::map?

Is there a good way to have a Map<String, ?> get and put ignore case?

Random element in a map

gcc3.3 undefined reference to std::_Rb_tree<>::insert_unique

boost::shared_ptr STL container question

Persistence of std::map in C++

What's the point of Perl's map?

Hashtable in C++?

existance map in c++

Does anyone know of a widget for a desktop toolkit(GTK, Qt, WX) for displaying a map of US states?

Iterate Over Map

Why doesn't my Perl map return anything?