Question

我有一个3个奴隶聚居区,我正在一个网站上进行爬行。然而,只有1个奴隶在钓鱼(尽管其他奴隶还活着 ) 。如果只爬了1个区域,这是正常的行为吗? 有没有办法迫使其他奴隶爬行?

谢谢

Answer 1

As part of any Hadoop MR job design there is a decision how to split the work between mappers. In Your case nutch splits the fetching process by sites, and as a result only one mapper is used to fetch the data. If you hade more sites, it would split the load.
Here is a good description of the process: How does Nutch work with Hadoop cluster?

友情链接