Hadoop Admin Online Training

Hadoop Admin Course Content

Big Data Introduction
  • What is Big Data?
  • Big Data Facts
  • The Three V’s of Big Data
Understanding Hadoop
  • What is Hadoop?
  • Why learn Hadoop?
  • Relational Databases Versus Hadoop
  • Motivation for Hadoop
  • 6 Key Hadoop Data Types
The Hadoop Distributed File System
  • What is HDFS?
  • HDFS components
  • Understanding Block storage
  • The Name Node
  • The Data Nodes
  • Data Node Failures
  • HDFS Commands
  • HDFS File Permissions
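As a rough illustration of the block storage topic above, the sketch below estimates how many blocks a file occupies in HDFS and how much raw disk it consumes once replicated. The function name `hdfs_block_usage` is hypothetical, and the 128 MB block size and replication factor of 3 are assumed defaults; an actual cluster may configure both differently.

```python
def hdfs_block_usage(file_size_bytes, block_size_bytes=128 * 1024**2, replication=3):
    """Estimate block count and raw storage for one file (illustrative only).

    block_size_bytes and replication are assumed defaults, not values
    read from any real cluster configuration.
    """
    if file_size_bytes < 0 or block_size_bytes <= 0 or replication < 1:
        raise ValueError("sizes must be non-negative and replication at least 1")
    # Ceiling division: a partially filled final block still counts as a block.
    blocks = -(-file_size_bytes // block_size_bytes)
    return {
        "blocks": blocks,
        "block_replicas": blocks * replication,
        # The final block only occupies the bytes actually written to it,
        # so raw usage is the file size times the replication factor.
        "raw_bytes": file_size_bytes * replication,
    }


if __name__ == "__main__":
    print(hdfs_block_usage(300 * 1024**2))
```

Under these assumptions a 300 MB file splits into three blocks (two full and one partial), and each block is stored three times across the Data Nodes.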
The MapReduce Framework
  • MapReduce Overview
  • Understanding MapReduce
  • The Map Phase
  • The Reduce Phase
  • WordCount in MapReduce
  • Running MapReduce Job
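The map and reduce phases listed above can be sketched in a few lines of plain Python. This is a single-process illustration of the WordCount idea, not Hadoop's Java API; the names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical and chosen only for this example.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every whitespace-separated word."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key, as the framework would
    do between the map and reduce phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: sum the counts collected for a single word."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data big", "Data"]))
```

On a real cluster the map calls run in parallel on the nodes holding the input blocks, and the shuffle moves data across the network; here everything runs in one process to keep the three stages visible.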
Planning Your Hadoop Cluster
  • Single Node Cluster Configuration
  • Multi Node Cluster Configuration
Cluster Maintenance
  • Checking HDFS Status
  • Cluster Breaking
  • Copying Data Between Clusters
  • Adding & Removing Cluster Nodes
  • Rebalancing the cluster
  • Name Node Metadata Backup
  • Cluster Upgrading
Installing & Managing Hadoop Ecosystem Projects
  • Sqoop
  • Flume
  • Hive
  • Pig
  • HBase
  • Oozie
Managing & Scheduling Jobs
  • Managing Jobs
  • The FIFO Scheduler
  • The Fair Scheduler
  • How to stop & start jobs running on the cluster
Cluster Monitoring, Troubleshooting & Optimizing
  • General System Conditions to Monitor
  • Name Node & Job Tracker Web UIs
  • View & Manage Hadoop’s Log files
  • Ganglia Monitoring Tool
  • Common cluster issues & their resolutions
  • Benchmark your cluster’s performance
Populating HDFS from External Sources
  • How to use Sqoop to import data from RDBMSs to HDFS
  • How to gather logs from multiple systems using Flume
  • Features of Hive, HBase & Pig
  • How to populate HDFS from external Sources