hadoop-plugins

Hadoop task schedulers: Capacity vs Fair sharing or something else?

Background My employer is progressively shifting our resource intensive ETL and backend processing logic from MySQL to Hadoop ( dfs & hive ). At the moment everything is still somewhat small and manageable ( 20 TB over 10 nodes ) but we intend to progressively increase the cluster size. Now that hadoop is being shifted into production...