English 中文(简体)
地图缩小框架如何在地图完全完成之前决定进行缩小?
原标题:How map reduce framework can determine to run reduce before map is completely done?

我注意到,通常当HAMOOU集群群集不忙的时候,在地图面完全完成之前,会减少侧面开始进展?这怎么可能?我记得在某个地方读到的降低进度指标有些误导,但不能确切记得。 有人能对此说清楚吗?

最佳回答

33 1 / 3 3 1 1 / 3 1 1 3 3 2 / 3 /sub>。

问题回答

除了已经给出的答案外,该行为受地图属性 mapred. deduce. slowstart.fulled.maps 的控制,该属性是地图任务的百分比(0-1),必须在复制阶段开始之前完成。

这是因为 缩放符必须将地图输出复制到节点, 任务会运行。 如果一个地图任务已完成并承诺了输出, 您可以直接将它传输并合并到已有的缩放符中的数据中 。

这样就可以省下很多时间 而不是等到所有的地图绘制完成。





相关问题
Hadoop - namenode is not starting up

I am trying to run hadoop as a root user, i executed namenode format command hadoop namenode -format when the Hadoop file system is running. After this, when i try to start the name node server, it ...

What default reducers are available in Elastic MapReduce?

I hope I m asking this in the right way. I m learning my way around Elastic MapReduce and I ve seen numerous references to the "Aggregate" reducer that can be used with "Streaming" job flows. In ...

Establishing Eclipse project environment for HadoopDB

I have checked-out a project from SourceForge named HadoopDB. It uses some class in another project named Hive. I have used Eclipse Java build path setting to link source to the Hive project root ...

Hadoop: intervals and JOIN

I m very new to Hadoop and I m currently trying to join two sources of data where the key is an interval (say [date-begin/date-end]). For example: input1: 20091001-20091002 A 20091011-20091104 ...

hadoop- determine if a file is being written to

Is there a way to determine if a file in hadoop is being written to? eg- I have a process that puts logs into hdfs. I have another process that monitors for the existence of new logs in hdfs, but I ...

Building Apache Hive - impossible to resolve dependencies

I am trying out the Apache Hive as per http://wiki.apache.org/hadoop/Hive/GettingStarted and am getting this error from Ivy: Downloaded file size doesn t match expected Content Length for http://...

热门标签