Description
During this two-day instructor-led training course, participants will learn development and operations for Cloudera Streaming Analytics, a framework for low-latency processing and analytics powered by Apache Flink and Cloudera's innovative SQL Stream Builder.
Through extensive hands-on exercises, students will gain experience deploying and managing a Flink cluster, developing and running Flink applications, and using SQL Stream Builder's continuous SQL to perform analytics on streaming data.
PUE, a Cloudera Strategic Partner, is authorized by Cloudera to provide official training in Cloudera technologies.
PUE is also accredited and recognized to provide consulting and mentoring services for implementing Cloudera solutions in business settings, and it brings that practical, business-oriented approach to its official courses.
Audience and prerequisites
This course is designed for those who have experience with administration and application development on the Cloudera platform.
Prerequisites
Students must have at least basic familiarity with Java and Linux.
The Cloudera Training for Apache Kafka course, or equivalent experience with Apache Kafka, is a recommended prerequisite.
Objectives
Students who successfully complete this course will be able to:
- Deploy a Flink cluster using Cloudera Manager
- Develop Flink batch and streaming applications
- Run and view Flink jobs
- Transform data streams
- Use watermarks and windows to analyze streaming data
- Analyze data with Cloudera SQL Stream Builder
- Monitor Flink application metrics
Topics
Module 1: Overview
- Introduction to Apache Flink and Stream Processing
- Typical Use Cases
- Related Products
Module 2: Basic Architecture
- Logical
- Physical
- Parallelism
- Fault Tolerance
- Data Storage
Module 3: Service Deployment
- Planning Requirements
- Installation
- Flink Dashboard
- Exercise: Running a Flink Program
Module 4: Flink Basics
- Execution Environment
- Flink Application Structure
- Create a Flink Project
- Build a Flink Program
- Exercise: Building a Simple Flink Program
Module 5: DataStream API
- Data Types and Serialization
- Sources and Sinks
- Data Pipelines and ETL
- Transformations
- Exercise: Batch Processing Using Flink
- Exercise: Creating a Flink Streaming Application
- Using Kafka as a Source and Sink
- Exercise: Creating a Streaming Application Using a Kafka Source
Module 6: Flink SQL and Table API
- Streaming Concepts
- Programming Options
- Integrations
- Exercise: Using Flink SQL and Kafka
Module 7: Stateful Stream Processing
- Connected Streams
- Streaming Analytics
- Event Time Processing
- Watermarks
- Windows
- Exercise: Tumbling Windows with Event Time
Module 8: Cloudera SQL Stream Builder
- Overview
- SQL Stream Builder Console
- Analytics and Stream Processing
- Exercise: Creating SQL Stream Jobs
Module 9: Monitoring
- Flink Metrics
- Checkpointing
- Backpressure
- Log Files
- Exercise: Monitoring and Checkpointing