Cloudera

Cloudera Streaming Analytics: Using Apache Flink and SQL Stream Builder on CDP

14 hours
920,00 €
Classroom or Live Virtual Class
Classroom or Live Virtual Class

Description

During this two-day instructor-led training course, participants will learn development and operations for Cloudera Streaming Analytics, a framework for low-latency processing and analytics powered by Apache Flink and Cloudera‘s innovative SQL Stream Builder.

Through extensive hands-on exercises, students will gain experience deploying and managing a Flink cluster, developing and running Flink applications, and using SQL Stream Builder‘s continuous SQL to perform analytics on streaming data. 

PUE, Cloudera Strategic Partner, is authorized by this multinational to provide official training in Cloudera technologies.

PUE is also accredited and recognized to carry out consulting and mentoring services in the implementation of Cloudera solutions in the business field with the added value in the practical and business approach to knowledge that is translated in its official courses.

Audience and prerequisites

This course is designed for those who have experience with administration and application development on the Cloudera platform.

Prerequisites

Students must have at least basic familiarity with Java and Linux.

Cloudera Training for Apache Kafka course, or equivalent experience with Apache Kafka, is a recommend prerequisite.

Objectives

Students who successfully complete this course will be able to:

  • Deploy a Flink cluster using Cloudera Manager
  • Develop Flink batch and streaming applications
  • Run and view Flink jobs
  • Transform data streams
  • Use watermarks and windows to analyze streaming data
  • Analyze data with Cloudera SQL Stream Builder
  • Monitor Flink application metrics

Topics

Module 1: Overview

  • Introduction to Apache Flink and Stream Processing 
  • Typical Use Cases 
  • Related Products

Module 2: Basic Architecture

  • Logical 
  • Physical 
  • Parallelism 
  • Fault Tolerance 
  • Data Storage

Module 3: Service Deployment

  • Planning Requirements 
  • Installation 
  • Flink Dashboard 
  • Exercise: Running a Flink Program 

Module 4: Flink Basics

  • Execution Environment 
  • Flink Application Structure 
  • Create a Flink Project 
  • Build a Flink Program 
  • Exercise: Building a Simple Flink Program

Module 5: DataStream API

  •  Data Types and Serialization 
  • Sources and Sinks 
  • Data Pipelines and ETL 
  • Transformations 
  • Exercise: Batch Processing Using Flink
  • Exercise: Creating a Flink Streaming Application 
  • Using Kafka as a Source and Sink 
  • Exercise: Creating a Streaming Application Using a Kafka Source

Module 6: Flink SQL and Table API

  • Streaming Concepts
  • Programming Options
  • Integrations
  • Exercise: Using Flink SQL and Kafka

Module 7: Stateful Stream Processing

  • Connected Streams 
  • Streaming Analytics 
  • Event Time Processing 
  • Watermarks  
  • Windows 
  • Exercise: Tumbling Windows with Event Time

Module 8: Cloudera SQL Stream Builder

  • Overview 
  • SQL Stream Builder Console 
  • Analytics and Stream Processing 
  • Exercise: Creating SQL Stream Jobs

Module 9: Monitoring

  • Flink Metrics 
  • Checkpointing 
  • Backpressure 
  • Log Files 
  • Exercise: Monitoring and Checkpointing

Open calls