Cloudera Administrator Training for Apache Hadoop Course Content
Installation of the Hadoop Cluster
Rationale for a Cluster Management Solution
-
Features of the Cloud era Manager
-
Installing the Cloud era Manager
-
Hadoop (CDH) Installation
Hadoop Distributed File System (HDFS)
- Features of the HDFS
- Writing & Reading Files
- Name Node Memory Considerations
- Overview of the HDFS Security
- Web UIs for HDFS
- Using the Hadoop File Shell
MapReduce & Spark on the YARN
- The Role of the Computational Frameworks
- YARN: The Cluster Resource Manager
- Concepts of the MapReduce
- Concepts of the Apache Spark
- Running Computational Frameworks on YARN
- Exploring YARN Applications through the Web UIs, & the Shell
- YARN Application Logs
Hadoop Configuration & Daemon Logs
- Cloud era Manager Constructs for the Managing Configurations
- Locating Configurations & Applying Configuration Changes
- Managing Role Instances & Adding the Services
- Configuring the HDFS Service
- Configuring Hadoop Daemon Logs
- Configuring the YARN Service
Getting Data into the HDFS
- Ingesting Data From External Sources with Flume
- Ingesting Data From Relational Databases with Sqoop
- Interfaces of the REST
- Best Practices for the Importing Data
Planning Your Hadoop Cluster
- General Planning Considerations
- Choosing the Right Hardware
- Virtualization Options
- Network Considerations
- Configuring Nodes
Installing & Configuring Hive, Impala, & Pig
- Hive
- Impala
- Pig
Hadoop Clients Including Hue
- What Are the Hadoop Clients?
- Installing & Configuring the Hadoop Clients
- Installing & Configuring Hue
- Hue Authentication & Authorization
Advanced Cluster Configuration
- Advanced Configuration of the Parameters
- Configuring Hadoop Ports
- Configuring HDFS for Rack Awareness
- High Availability of the Configuring HDFS
Hadoop Security
- Why Hadoop Security Is Important
- Hadoop’s Security System Concepts
- What Kerberos Is & How It Works
- Securing a Hadoop Cluster with Kerberos
- Other Security Concepts
Resources of the Managing
- Configuring croups with Static Service Pools
- Fair Scheduler
- Configuring Dynamic Resource Pools
- YARN Memory & CPU Settings
- Scheduling the mpala Query
Cluster Maintenance
- Checking HDFS Status
- Copying Data between Clusters
- Adding & Removing the Cluster Nodes
- Rebalancing the Cluster
- Directory Snapshots
- Cluster Upgrading
Cluster Monitoring & Troubleshooting
- Cloud era Manager Monitoring Features
- Monitoring Hadoop Clusters
- Troubleshooting Hadoop Clusters
- Common Misconfigurations