MapReduce是怎么切的

less than 1 minute read

当时学mr的时候有一个地方被卡了挺久，就是怎么分的M个map task和R个reduce task，特别是map完到reduce怎么分的。

我们以paper里的wordcount程序为例…

mr1

Map task还是比较好分的，水平切成M份就完事了。

每台机器Map完之后大概就是这么个形态：

mr1

然后会根据一个function把这个结果文件切成R份，可以用hash类比：

这里M1指的是第一台Map机器，Ri指的是第i个Reduce partition。

mr1

如果我是执行第一个Reduce任务的机器，那我就从每台Map机器上读取对应的R1 partition。因为每个单词只会在一个partition里面，就不会出现一个单词被分到多个reduce服务器的情况。

mr1

Software Engineering - What is “Just Right”?

3 minute read

Lessons Learned

Learning Redis Streams

2 minute read

XREAD [COUNT count] [BLOCK milliseconds] STREAMS key [key …] id [id …] Some details: If COUNT is unset, it reads EVERYTHING. The ID field is an EXCLUSIV...

How to achieve read-write separation using redis replication with Sentinels & the go-redis library?

2 minute read

Problem Want to read from replicas and only write to the master, to reduce master loads. Solution Create read-only clients using NewFailOverClient(opt) with ...

使用gomock和httptest测试api

less than 1 minute read

Mock

Allen Shao

MapReduce是怎么切的

You May Also Enjoy

Software Engineering - What is “Just Right”?

Learning Redis Streams

How to achieve read-write separation using redis replication with Sentinels & the go-redis library?

使用gomock和httptest测试api