Big Data Hadoop Administrator - Simplilearn | IT Training & Certification | Info Trek
Respect Your Dreams
Follow through on your goals with courses

Big Data Hadoop Administrator - Simplilearn

  • On Demand Class Icon
    On Demand
    • HRDF SBL Claimable
    • Certificate of Attendance available
    • 180 days of access from date of purchase
    Starting From
    RM 1939.46
    36 Hours
  • Private Class Icon
    Private Class
    • All of our private classes are customized to your organization's needs.

      Click on the button below to send us your details and you will be contacted shortly.
    0 Days

Course Details

Expand All

Big Data and Hadoop Administrator Certification Training from Simplilearn equips you to take up Hadoop Administrator responsibilities in provisioning, installing, configuring, monitoring, maintaining and securing Hadoop and Hadoop Eco system components.

Training is designed to ensure that you are job ready for the role of Hadoop Administrator with implementation of real life Hadoop Administration industry projects spanned across 3 months.

This training is developed to give you a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster.

  • Systems administrators and IT managers
  • IT administrators and operators
  • IT Systems Engineer
  • Data Engineer
  • Data Analytics Administrator
  • Cloud Systems Administrator
  • Web Engineer

Fundamental knowledge of any programming language and Linux environment. Participants should know how to navigate and modify files within a Linux environment. Existing knowledge of Hadoop & Java is not required.

With Certification in Hadoop Administration training, you will be able to -

  • Master the understanding of Hadoop and Hadoop Administration eco system components
  • Plan Hadoop Clusters with installation and configuration of Hadoop as well as configuration of single node and multi node Hadoop Clusters
  • Become proficient in HDFS and Sqoop with the help of Demos and hands on Lab exercises.
  • Install & configure YARN with gaining in depth understanding of Map Reduce and YARN architecture
  • Become expert in recovering from node failures and troubleshoot common Hadoop cluster issues
  • Install and configure Hadoop Eco system components such as Hive, Pig, Impala, Ganglia, Nagios, Sqoop
  • Expertise in setup, configuration and management of security for Hadoop clusters using Kerbero

Modules

Expand All
  • About Simplilearn’s Big Data and Hadoop Administrator course
  • Introduction to Big Data
  • Introduction to Hadoop
  • Why Hadoop
  • Hadoop & Traditional RDBMS
  • Components of Hadoop & Hadoop Architecture
  • History and uses of Hadoop
  • Overview of Hadoop Clusters
  • Planning your Hadoop Cluster
  • Overview of Hardware and other Network configurations
  • Network Topology for Hadoop Clusters
  • Overview of Cluster Management
  • Overview of various deployment types
  • Installing and configuring Hadoop
  • Configuring a single node Hadoop Cluster
  • Configuring a multi node Hadoop Cluster
  • Checking the correctness of Hadoop installation
  • Demos:
    • Install Ubuntu Server 12.04
    • Hadoop 1.0 in Ubuntu Server 12.04
    • Create a Clone of Hadoop Virtual Machine
    • Perform Clustering of the Hadoop Environment
    • Install Hadoop 2.0 in Ubuntu Server 12.0
  • Hadoop configuration overview and important configuration file
  • Configuration parameters and values
  • HDFS parameters MapReduce parameters
  • Hadoop environment setup
  • ‘Include’ and ‘Exclude’ configuration files
  • Demo: Configuration Settings of Hadoop
  • Lab Exercise
  • Introduction to HDFS
  • Overview of HDFS Architecture
  • Overview of HDFS Sorage mechanisms
  • Overview of HDFS Rack
  • Writing and reading files from HDFS
  • Understanding the important commands of HDFS
  • Introduction to Squoop
  • Installing and configuring Sqoop
  • Demos:
    • Install Sqoop
    • HDFS Demo
  • Lab Exercise
  • Introduction to MapReduce
  • MapReduce Architecture and working with MapReduce
  • Development and Libraries of Map Reduce
  • MapReduce components failures and recoveries
  • Introduction to YARN
  • YARN Architecture
  • Installing and configuring YARN
  • Working with YARN & YARN Web UI
  • Understanding Hive
  • Installing and configuring Hive
  • Understanding Pig
  • Installing and configuring Pig
  • Understanding Impala
  • Installing and configuring Impala
  • Demos:
    • Install Hive
    • Install Pig
  • Lab Exercises

  • Namenode/Datanode directory structures and files
  • File system image and Edit log
  • The Checkpoint Procedure
  • Namenode failure and recovery procedure
  • Safe Mode
  • Metadata and Data backup
  • Potential problems and solutions / what to look for
  • Adding and removing nodes
  • Lab Exercise
  • Eco system Component: Ganglia
    • Install and Configure Ganglia on a Cluster
    • Configure and Use Ganglia
    • Use Ganglia for Graphs
  • Eco system Component: Nagios
    • Nagios Concepts
    • Install and Configure Nagios on Cluster
    • Use Nagios for Sample Alerts And Monitoring
  • Eco system Component: Sqoop
    • Install and Configure Sqoop on Cluster
    • Import Data From Oracle/Mysql to Hive
  • Overview of Other Eco system Components:
    • Oozie
    • Avro
    • Thrift
    • Rest
    • Mahout
    • Cassandra
    • YARN
    • MR2
  • Hadoop Security
  • Kerberos and Hadoop
  • Why Hadoop Security is Important?
  • Hadoop’s Security System Concepts
  • What Kerberos is and How it Works?
  • Configuring Kerberos Security
  • Securing a Had

Reviews

0
based on 0 ratings reviews