I have an hadoop job with a pretty long map phase and I want other short jobs to be run in priority. For this I set the priority of my long job with hadoop job -set-priority job_id LOW.
问题在于,就我的长期工作而言,即使我完成30%的地图任务,削减者的复印阶段也开始。
My grid is then kind of bloked as all the reduce slots are taken by the LOW priority job. The other small jobs can do their map phases but they will never get any reducer until my long job is finihed.
Any idea? Thanks. J.