HADOOP

Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.


HADOOP


Basics of Hadoop

Hadoop Modules

HBase

Hive

Pig

Sqoop
')