Big Data Hadoop Administrator - Simplilearn | IT Training & Certification | Info Trek

Big Data Hadoop Administrator - Simplilearn

Starting From: RM 2145.70
  1. 36 Hours
  2. HRDF SBL Claimable
  3. Certificate of Attendance available
  4. 180 days of access from date of purchase
Private classes are customized to your organization's needs.

WHAT YOU WILL LEARN

The Big Data and Hadoop Administrator Certification Training from Simplilearn equips you to take on Hadoop Administrator responsibilities: provisioning, installing, configuring, monitoring, maintaining, and securing Hadoop and Hadoop ecosystem components.

The training is designed to make you job-ready for the role of Hadoop Administrator through real-life Hadoop administration industry projects spanning 3 months.

This training is developed to give you a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster.

AUDIENCE

  • Systems administrators and IT managers
  • IT administrators and operators
  • IT Systems Engineer
  • Data Engineer
  • Data Analytics Administrator
  • Cloud Systems Administrator
  • Web Engineer

PREREQUISITES

Fundamental knowledge of any programming language and of the Linux environment is required. Participants should know how to navigate and modify files within a Linux environment. Prior knowledge of Hadoop and Java is not required.

COURSE OBJECTIVES

With Certification in Hadoop Administration training, you will be able to:

  • Understand Hadoop and the Hadoop administration ecosystem components
  • Plan Hadoop clusters, install and configure Hadoop, and configure single-node and multi-node Hadoop clusters
  • Become proficient in HDFS and Sqoop through demos and hands-on lab exercises
  • Install and configure YARN while gaining an in-depth understanding of MapReduce and YARN architecture
  • Recover from node failures and troubleshoot common Hadoop cluster issues
  • Install and configure Hadoop ecosystem components such as Hive, Pig, Impala, Ganglia, Nagios, and Sqoop
  • Set up, configure, and manage security for Hadoop clusters using Kerberos

Modules

Session I: Lesson 00—Course Overview
  • About Simplilearn’s Big Data and Hadoop Administrator course
Lesson 01—Introduction to Big Data and Hadoop
  • Introduction to Big Data
  • Introduction to Hadoop
  • Why Hadoop
  • Hadoop & Traditional RDBMS
  • Components of Hadoop & Hadoop Architecture
  • History and uses of Hadoop
Lesson 02—Planning Hadoop Cluster
  • Overview of Hadoop Clusters
  • Planning your Hadoop Cluster
  • Overview of Hardware and other Network configurations
  • Network Topology for Hadoop Clusters
  • Overview of Cluster Management
Lesson 03—Hadoop Installation and Configuration
  • Overview of various deployment types
  • Installing and configuring Hadoop
  • Configuring a single node Hadoop Cluster
  • Configuring a multi node Hadoop Cluster
  • Checking the correctness of Hadoop installation
  • Demos:
    • Install Ubuntu Server 12.04
    • Install Hadoop 1.0 in Ubuntu Server 12.04
    • Create a Clone of Hadoop Virtual Machine
    • Perform Clustering of the Hadoop Environment
    • Install Hadoop 2.0 in Ubuntu Server 12.04
Lesson 04—Advanced Cluster Configuration Features
  • Hadoop configuration overview and important configuration files
  • Configuration parameters and values
  • HDFS parameters and MapReduce parameters
  • Hadoop environment setup
  • ‘Include’ and ‘Exclude’ configuration files
  • Demo: Configuration Settings of Hadoop
  • Lab Exercise
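
As an illustration of the configuration parameters and values covered in this lesson, the sketch below writes and reads settings in the `<configuration>`/`<property>` XML layout that Hadoop site files such as `hdfs-site.xml` use. It is not taken from the course material: the helper names `build_site_xml` and `read_site_xml` are hypothetical, and the value 3 for `dfs.replication` is only an example setting.

```python
import xml.etree.ElementTree as ET


def build_site_xml(properties):
    """Render a dict of settings as a Hadoop-style site XML string."""
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = str(value)
    return ET.tostring(root, encoding="unicode")


def read_site_xml(xml_text):
    """Parse a Hadoop-style site XML string back into a dict of strings."""
    root = ET.fromstring(xml_text)
    return {p.findtext("name"): p.findtext("value") for p in root.iter("property")}


# Example: a replication setting as it might appear in hdfs-site.xml.
hdfs_site = build_site_xml({"dfs.replication": 3})
print(hdfs_site)
print(read_site_xml(hdfs_site))
```

All values come back as strings after parsing, so a caller would need to convert numeric settings before using them.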
Lesson 05—Hadoop Distributed File System
  • Introduction to HDFS
  • Overview of HDFS Architecture
  • Overview of HDFS Storage mechanisms
  • Overview of HDFS Rack
  • Writing and reading files from HDFS
  • Understanding the important commands of HDFS
  • Introduction to Sqoop
  • Installing and configuring Sqoop
  • Demos:
    • Install Sqoop
    • HDFS Demo
  • Lab Exercise
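
To make the HDFS storage topics in this lesson concrete, here is a rough sketch of the arithmetic behind block splitting and replication. It is not part of the course material. The function name `hdfs_footprint` is hypothetical, and the defaults (128 MB blocks, replication factor 3) are assumptions that match common Hadoop 2 defaults but may differ on a given cluster.

```python
import math


def hdfs_footprint(file_size_mb, block_size_mb=128, replication=3):
    """Estimate block count and raw storage for one file stored in HDFS.

    Assumes the final block only occupies as much space as the data it
    holds, so raw storage is file size times the replication factor.
    """
    blocks = math.ceil(file_size_mb / block_size_mb)
    raw_storage_mb = file_size_mb * replication
    return blocks, raw_storage_mb


# Example: a 1000 MB file with the assumed defaults.
blocks, raw_mb = hdfs_footprint(1000)
print(blocks, raw_mb)
```

The same function can be called with `block_size_mb=64` to compare against the smaller block size used by older Hadoop releases.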
Lesson 06—Overview of MapReduce and YARN
  • Introduction to MapReduce
  • MapReduce Architecture and working with MapReduce
  • Development and Libraries of MapReduce
  • MapReduce component failures and recoveries
  • Introduction to YARN
  • YARN Architecture
  • Installing and configuring YARN
  • Working with YARN & YARN Web UI
Lesson 07—Important Hadoop Components
  • Understanding Hive
  • Installing and configuring Hive
  • Understanding Pig
  • Installing and configuring Pig
  • Understanding Impala
  • Installing and configuring Impala
  • Demos:
    • Install Hive
    • Install Pig
  • Lab Exercises

Lesson 08—Hadoop Administration and Maintenance
  • Namenode/Datanode directory structures and files
  • File system image and Edit log
  • The Checkpoint Procedure
  • Namenode failure and recovery procedure
  • Safe Mode
  • Metadata and Data backup
  • Potential problems and solutions / what to look for
  • Adding and removing nodes
  • Lab Exercise
Lesson 09—Hadoop Ecosystem Components
  • Ecosystem Component: Ganglia
    • Install and Configure Ganglia on a Cluster
    • Configure and Use Ganglia
    • Use Ganglia for Graphs
  • Ecosystem Component: Nagios
    • Nagios Concepts
    • Install and Configure Nagios on Cluster
    • Use Nagios for Sample Alerts And Monitoring
  • Ecosystem Component: Sqoop
    • Install and Configure Sqoop on Cluster
    • Import Data From Oracle/MySQL to Hive
  • Overview of Other Ecosystem Components:
    • Oozie
    • Avro
    • Thrift
    • REST
    • Mahout
    • Cassandra
    • YARN
    • MR2
  • Hadoop Security
  • Kerberos and Hadoop
  • Why Hadoop Security Is Important
  • Hadoop’s Security System Concepts
  • What Kerberos Is and How It Works
  • Configuring Kerberos Security
  • Securing a Had
