Cloudera Administrator Training for Apache Hadoop

Cloudera Administrator Training for Apache Hadoop Course Content

Installation of the Hadoop Cluster
  • Rationale for a Cluster Management Solution
  • Features of Cloudera Manager
  • Installing Cloudera Manager
  • Hadoop (CDH) Installation

Hadoop Distributed File System (HDFS)
  • Features of HDFS
  • Writing & Reading Files
  • NameNode Memory Considerations
  • Overview of HDFS Security
  • Web UIs for HDFS
  • Using the Hadoop File Shell
MapReduce & Spark on YARN
  • The Role of Computational Frameworks
  • YARN: The Cluster Resource Manager
  • MapReduce Concepts
  • Apache Spark Concepts
  • Running Computational Frameworks on YARN
  • Exploring YARN Applications through the Web UIs & the Shell
  • YARN Application Logs
Hadoop Configuration & Daemon Logs
  • Cloudera Manager Constructs for Managing Configurations
  • Locating Configurations & Applying Configuration Changes
  • Managing Role Instances & Adding Services
  • Configuring the HDFS Service
  • Configuring Hadoop Daemon Logs
  • Configuring the YARN Service
Getting Data into HDFS
  • Ingesting Data From External Sources with Flume
  • Ingesting Data From Relational Databases with Sqoop
  • REST Interfaces
  • Best Practices for Importing Data
Planning Your Hadoop Cluster
  • General Planning Considerations
  • Choosing the Right Hardware
  • Virtualization Options
  • Network Considerations
  • Configuring Nodes
Installing & Configuring Hive, Impala, & Pig
  • Hive
  • Impala
  • Pig
Hadoop Clients Including Hue
  • What Are the Hadoop Clients?
  • Installing & Configuring Hadoop Clients
  • Installing & Configuring Hue
  • Hue Authentication & Authorization
Advanced Cluster Configuration
  • Advanced Configuration Parameters
  • Configuring Hadoop Ports
  • Configuring HDFS for Rack Awareness
  • Configuring HDFS High Availability
Hadoop Security
  • Why Hadoop Security Is Important
  • Hadoop’s Security System Concepts
  • What Kerberos Is & How It Works
  • Securing a Hadoop Cluster with Kerberos
  • Other Security Concepts
Managing Resources
  • Configuring cgroups with Static Service Pools
  • Fair Scheduler
  • Configuring Dynamic Resource Pools
  • YARN Memory & CPU Settings
  • Impala Query Scheduling
Cluster Maintenance
  • Checking HDFS Status
  • Copying Data between Clusters
  • Adding & Removing Cluster Nodes
  • Rebalancing the Cluster
  • Directory Snapshots
  • Upgrading the Cluster
Cluster Monitoring & Troubleshooting
  • Cloudera Manager Monitoring Features
  • Monitoring Hadoop Clusters
  • Troubleshooting Hadoop Clusters
  • Common Misconfigurations