Phase #1 Introduction & Architecture

No Comments »


Summary of Independent Study Course - Apache Hadoop

No Comments »

Title of the topic : Apache Hadoop



Details of the research activity to be undertaken and its feasibility:
• Introduction to Hadoop
• Hadoop architecture
• Understanding Design Level Requirements for Hadoop
• Hadoop Distributed File System
• MapReduce
• Installing Hadoop
• Hadoop Ecosystem
• Hadoop Hive
• Hadoop Adavantages and Disadvantages


Outcome:
• Understanding Distributed Processing of Very Large datasets on Commodity Hardware.
• Understanding Map Reduce Framework
• Capable of Installing and Working with Hadoop