Tags: Map

Introduction to MapReduce

Introduction to MapReduce

MapReduce is a programming model for processing large data sets with a parallel , distributed algorithm on a cluster (source: Wikipedia). Map Reduce when coupled with HDFS can be used...
Fake data science

Fake data science

Books, certificates and graduate degrees in data science are spreading like mushrooms after the rain. Unfortunately, many are just a mirage: some old guys taking advantage of the new ...
Some exciting stuff

Some exciting stuff

First, let me mention our new selection of resources and articles: 43 blogs that we discovered and liked this week. We found them listed as popular in various social networks - includ...