Using Hadoop, are my reducers guaranteed to get all the records with the same key?

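I'm running a Hadoop job (using Hive, actually) that is supposed to uniq lines across many text files. In the reduce step, it chooses the most recent record for each key.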

Does Hadoop guarantee that every record with the same key, as output by the map step, goes to a single reducer, even when many reducers are running across a cluster?

I'm worried that, after the shuffle, the mapper output might be split in the middle of a set of records that share the same key.

Best answer
Answers

Actually, no! You could write a Partitioner that sends the same key to a different reducer each time getPartition is called. That is just not a good idea for most applications.
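To make the point concrete, here is a minimal sketch in plain Python (no Hadoop involved) that simulates the shuffle. It assumes the job uses Hadoop's default HashPartitioner, which picks a reducer as (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks, so equal keys always land on the same reducer. The names java_string_hash, hash_partition, shuffle, reducers_per_key and RoundRobinPartitioner, as well as the sample records, are made up for illustration and are not part of any Hadoop API.

```python
from collections import defaultdict


def java_string_hash(s):
    """Mimic Java's String.hashCode() for simple (BMP) strings as a signed 32-bit int."""
    h = 0
    for ch in s:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF
    # Convert the unsigned 32-bit value to Java's signed int range.
    return h - (1 << 32) if h >= (1 << 31) else h


def hash_partition(key, num_reducers):
    """Same formula as Hadoop's default HashPartitioner: depends only on the key."""
    return (java_string_hash(key) & 0x7FFFFFFF) % num_reducers


class RoundRobinPartitioner:
    """Deliberately broken partitioner: ignores the key, so equal keys get scattered."""

    def __init__(self):
        self.calls = 0

    def __call__(self, key, num_reducers):
        self.calls += 1
        return self.calls % num_reducers


def shuffle(map_outputs, num_reducers, partition=hash_partition):
    """Assign each (key, value) pair from the map step to a reducer id."""
    reducers = defaultdict(list)
    for key, value in map_outputs:
        reducers[partition(key, num_reducers)].append((key, value))
    return reducers


def reducers_per_key(reducers):
    """Return, for each key, the set of reducer ids that received it."""
    seen = defaultdict(set)
    for reducer_id, records in reducers.items():
        for key, _ in records:
            seen[key].add(reducer_id)
    return dict(seen)


if __name__ == "__main__":
    # Hypothetical map output: key "a" appears three times.
    records = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("a", 5)]
    print(reducers_per_key(shuffle(records, 4)))
    print(reducers_per_key(shuffle(records, 4, RoundRobinPartitioner())))
```

With the key-based partitioner every key maps to exactly one reducer, while the round-robin variant spreads key "a" over more than one. Unless a job installs a custom partitioner like that, the reduce step sees all records for a given key together.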




