Apache Spark and Scala - Simplilearn | IT Training & Certification | Info Trek
Respect Your Dreams
Follow through on your goals with courses

Apache Spark and Scala - Simplilearn

  • On Demand Class Icon
    On Demand
    • HRDF SBL Claimable
    • Certificate of Attendance available
    • 180 days of access from date of purchase
    Starting From
    RM 2145.70
    32 Hours
  • Private Class Icon
    Private Class
    • All of our private classes are customized to your organization's needs.

      Click on the button below to send us your details and you will be contacted shortly.
    0 Days

Course Details

Expand All

With Simplilearn's Apache Spark and Scala certification training you would advance your expertise in Big Data Hadoop Ecosystem.

With this Apache Spark certification you will master the essential skills such as Spark Streaming, Spark SQL, Machine Learning Programming, GraphX Programming, Shell Scripting Spark.

And with real life industry project coupled with 30 demos you would be ready to take up Hadoop developer job requiring Apache Spark expertise.

Professionals aspiring for a career in field of real time Big data analytics

  • Analytics professionals
  • Research professionals
  • IT developers and testers
  • Data scientists
  • BI and reporting professionals
  • Students who wish to gain a thorough understanding of Apache Spark

With Certification in Apache Spark and Scala training, you will be able to-

  • Get clear understanding of the limitations of MapReduce and role of Spark in overcoming these limitations
  • Understand fundamentals of Scala Programming Language and it's features
  • Explain & master the process of installing Spark as a standalone cluster
  • Expertise in using RDD for creating applications in Spark
  • Mastering SQL queries using SparkSQL
  • Gain thorough understanding of Spark Streaming features
  • Master & describe the features of Spark ML Programming and GraphX Programming


Expand All
  • Course overview
  • Objectives
  • Limitations of MapReduce in Hadoop Objectives
  • Batch vs. Real-time analytics
  • Application of stream processing
  • How to install Spark
  • Spark vs. Hadoop Eco-system
  • Features of Scala
  • Basic data types and literals used
  • List the operators and methods used in Scala
  • Concepts of Scala
  • Features of RDDs
  • How to create RDDs
  • RDD operations and methods
  • How to run a Spark project with SBT
  • Explain RDD functions and describe how to write different codes in Scala
  • Explain the importance and features of SparkSQL
  • Describe methods to convert RDDs to DataFrames
  • Explain concepts of SparkSQL
  • Describe the concept of hive integration
  • Explain a concepts of Spark Streaming
  • Describe basic and advanced sources
  • Explain how stateful operations work
  • Explain window and join operations
  • Explain the use cases and techniques of Machine Learning (ML)
  • Describe the key concepts of Spark ML
  • Explain the concept of an ML Dataset, and ML algorithm, model selection via cross validation
  • Explain the key concepts of Spark GraphX programming
  • Limitations of the Graph Parallel system
  • Describe the operations with a graph
  • Graph system optimizations


based on 0 ratings reviews