A word on HDFS (Hadoop Distributed File System)

Last time we introduced Hadoop MapReduce and saw that it relies on its file system, the Hadoop Distributed File System (HDFS). Today we are going to take a look at it. What is HDFS? The goal of HDFS is to store large amounts of data in a distributed and fault-tolerant way. A … Continue reading A word on HDFS (Hadoop Distributed File System)

An introduction to Hadoop MapReduce

Today we are going to talk about a famous Big Data framework called Hadoop MapReduce. In this article, after presenting the framework, we will build a small example using Java and Hadoop MapReduce on Raspbian Linux (yes, I am testing Hadoop on a Raspberry Pi!). What is MapReduce? MapReduce is a framework to process very large … Continue reading An introduction to Hadoop MapReduce