Notes on Hadoop Tutorial: Introduction


http://developer.yahoo.com/hadoop/tutorial/module1.html

Problem Scope

Hadoop is a large-scale distributed batch processing infrastructure. While it can be used on a single computer, its true power lies in its ability to scale to hundreds or thousands of computers, each with several processor cores.

Challenges at Large Scale

1. The major resources, such as processor time, memory, hard drive space, and network bandwidth, are finite and must be managed.

2.