由于这是一个古老的问题,许多人已经为这些问题提供了答案,我的回答是,像我这样的开端人,他们意外地跳入这一联系,而他们正在寻找一种办法,在Hadoop管理Hello World。
是的,Hadoop在JRV上运行。 但是,仅仅因为你不需要Hadoop来管理这种简单应用。 Hadoop用于分配处理。 这意味着假定你拥有大量数据集,而你的无辜计算机无法处理这一庞大的数据集。 然后,你将做些什么,就是从能够共同完成这项任务的无辜(商品)电脑中获取帮助。
In the Hadoop environment, we are using a framework called Map-Reduce in order to do this kind of task. So obviously if you are not using the Map-Reduce framework in the Hadoop environment, it s like you are using a Space ship to climbing up to your rooftop instead of a ladder.
Even though this is the common hello world code for almost every programming language, this is not the hello world program for Hadoop. Here you have a program called Word-Count, which will count the number of occurrences in each word in a large text file or in an n number of files.
另外,还有三种模式可供你管理。
- Local (Standalone) mode
- Pseudo Distributed mode
- Fully Distributed mode
My advice is to try to run the Word-Count program in Pseudo distributed mode as a beginner.