Description

Overview

One of the biggest challenges to success with big data has always been how to transport it. Conventional interoperability doesn’t cut it when it comes to integrating data with applications and real-time needs. Yet, needs continue to grow and data availability becomes more critical all the time. Even in scenarios that might not be considered “big data,” the need for services and data integration in the organization may be challenged simply by inadequate messaging and integration architecture. Kafka can serve as a key solution to address these challenges.

This hands-on training workshop gets you up and running with Apache Kafka so you can immediately take advantage of the low latency, massive parallelism and exciting use cases Kafka makes possible. Led by one of our enterprise engineering experts, you’ll get live instruction and coaching on how to be effective when using Kafka in your work or project.

In this Kafka Training Course, You Will:

Explore Apache Kafka Architecture
Learn to configure a distributed messaging broker
Learn the Apache Kafka architecture and data model
Learn about decoupled services and distributed systems
Learn to build robust systems using distributed messaging brokers
Learn best practices for configuring Kafka clusters in production
Write custom Kafka producers and consumers
Build an application that ingests data from a streaming API

Who Should Attend

System architects
Developers
Data engineers
DBAs
Anyone who wants to learn to use the Kafka messaging system for consuming data in their systems.

Course Overview

Part 1: Big Data and Distributed Systems Primer

Distributed Systems
High Availability
Latency and Scalability
Message Brokers and Queues
Decoupling Services
Lambda Architecture
Data Partitioning

Part 2: Introduction to Apache Kala

History
What is Kafka
Why Kafka
Features
Kafka in Production
High-Level Architecture

Part 3: Core Concepts

Kafka Guarantees/Message Ordering
Delivery Semantics
Dumb Broker vs. MOM
Kafka Semantics

Part 4: Kafka Cluster

Installing Cluster
Brokers
Consumers
Producers

Part 5: Apache Zookeeper

cluster management
roles
basic operations

Part 6: Kafka Producers

Role of Producer
Records
Message Durability
Batching and Compression
Create Console Producer
Publishing Data to Topics

Part 7: Kafka Consumers

Role of Consumer
Offsets
Consumers and Logs
Create Console Consumer
Performance tuning
Consumer Groups
Consumer Parallelism
Consumer Rebalancing

Part 8: The Kafka Data Model

Kafka Data Model
Topics
Partitions
Distribution
Reliability
Leaders/Followers
Replication Factor
Persistence

Part 9: The Kafka API

Producer API
Consumer API
Java, Scala, Python APIs
Creating/Modifying Topics
Partitioning Topics
Reading data from Kafka
Writing data to kafka

Part 10: Kafka in Production

Big Data Pipelines
Microservices
Case Study: Netflix
Apache Spark
Storm and Hadoop

Part: 11: Kafka Streams

Stream processing
High-Level Overview
Demo Application

Apache Kafka Data Streaming Boot Camp Virtual Classroom Live May 26, 2020

Price: $2,700

Enroll today to reserve your spot!

Description

Course Overview

Prerequisites

Find a Course

Corporate Services

Contact us