【读书笔记】Mining of massive datasets

来源:互联网 发布:淘宝店铺页头怎么做 编辑:程序博客网 时间:2024/05/22 06:45

For clustering computing, the file system must also be different from those legacy system, a brand new system which we called DFS(distributed file system) came out.

However there's something we have to pay attention:

1. DFS only matters when the data amount is huge.

2. the system should be rarely refreshed.

otherwise a DFS is not necessary.

*** ***

MapReduce只有当运行它的主控进程的计算节点崩溃时才需要重启,除该节点外其余节点遇到问题都不许要重启整个任务。

*** 未完 ***
0 0
原创粉丝点击