Spark

Abstract

Spark is a popular framework for Big Data processing. It can be used for offline batch processing and for near real-time streaming.
In this course, the students will get a deep understanding on how to work with Spark, its various components and how to optimize for performance

Target Audience

Java developers, Team leaders, Product Managers

Prerequisites

Basic knowledge of Scala

Content

Spark Core:

  • What is Spark ?
  •  RDD Basics
  • Map Reduce and Shuffles
  • Caching
  • Web Monitoring
  • Usecases
  • Serialization
  • Troubleshooting

Spark SQL:

  • Introduction to Spark SQL & Data Frames
  • DataSets
  • Integration with Different Data Source
  • Use cases and Hand-on

Spark Streaming:

  • Introduction to Streaming
  • Spark streaming API
  • Use cases and Hands-on

 

 

Duration

2 days

Next public course

26/12, 27/12/2017

Enroll Now
Contact us
Share: