- Hadoop Distributed File System. The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications.
- HBase is a column-oriented database management system that runs on top of HDFS.
- HIVE
- Sqoop
- Pig
- ZooKeeper
- NOSQL
- Mahout.