Description

Course Name: IHAE – Hadoop Administration Essentials

Duration: 5 Days

Overview:

The purpose of this course is to teach participants the essentials of administering Hadoop systems.

What You Will Learn:

Students will learn how to install, configure, and manage Hadoop systems.

  1. Hadoop Overview
  • MapReduce
  • Hadoop, YARN, and Spark
  • Mahout and MLlib
  • Alternate Frameworks
  2. Hadoop Architecture
  • Hadoop MapReduce
  • YARN
  • HDFS
  • Spark
  • Cassandra
  • HBase
  • Hive
  • Pig
  3. Installing Hadoop
  • Linux Considerations
  • SSH Configuration
  • Hadoop Installation
  • OS Security
  • NameNodes
  • JobTrackers
  4. Test-Running Hadoop Programs
  • Simple MapReduce Test
  • Spark Test
  • Pig Test
  5. Cloud Installations
  • Amazon EC2
  • Amazon Elastic MapReduce
  • Rackspace
  • Installing with Docker
  6. Optimization and Tuning
  • Performance Metrics
  • Node Sizing
  • Kernel Tuning
  7. Installing HBase
  • HBase Installation
  • ZooKeeper
  8. Previewing Hadoop 3