MapReduce - Wikipedia, the free encyclopedia
MapReduce is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster.[1][2] A MapReduce program is composed of a Map() procedure that performs filtering and sor...