651-905-3729 Microsoft Silver Learning Partner EC Counsel Reseller compTIA Authorized Partner

Apache Kafka Data Streaming Boot Camp Virtual Classroom Live May 26, 2020

Price: $2,700

This course runs for a duration of 3 Days.

The class will run daily from 8:30 am EST to 4:30 pm EST.

Class Location: Virtual LIVE Instructor Led - Virtual Live Classroom.

Enroll today to reserve your spot!

Space is limited. Enroll today.

Enroll Now

Description

Overview

One of the biggest challenges to success with big data has always been how to transport it. Conventional interoperability doesn’t cut it when it comes to integrating data with applications and real-time needs. Yet, needs continue to grow and data availability becomes more critical all the time. Even in scenarios that might not be considered “big data,” the need for services and data integration in the organization may be challenged simply by inadequate messaging and integration architecture. Kafka can serve as a key solution to address these challenges.

This hands-on training workshop gets you up and running with Apache Kafka so you can immediately take advantage of the low latency, massive parallelism and exciting use cases Kafka makes possible. Led by one of our enterprise engineering experts, you’ll get live instruction and coaching on how to be effective when using Kafka in your work or project. 

In this Kafka Training Course, You Will:

  • Explore Apache Kafka Architecture
  • Learn to configure a distributed messaging broker
  • Learn the Apache Kafka architecture and data model
  • Learn about decoupled services and distributed systems
  • Learn to build robust systems using distributed messaging brokers
  • Learn best practices for configuring Kafka clusters in production
  • Write custom Kafka producers and consumers
  • Build an application that ingests data from a streaming API

Who Should Attend

  • System architects
  • Developers
  • Data engineers
  • DBAs
  • Anyone who wants to learn to use the Kafka messaging system for consuming data in their systems.

Course Overview

Part 1: Big Data and Distributed Systems Primer

  • Distributed Systems
  • High Availability
  • Latency and Scalability
  • Message Brokers and Queues
  • Decoupling Services
  • Lambda Architecture
  • Data Partitioning

Part 2: Introduction to Apache Kala

  • History
  • What is Kafka
  • Why Kafka
  • Features
  • Kafka in Production
  • High-Level Architecture

Part 3: Core Concepts

  • Kafka Guarantees/Message Ordering
  • Delivery Semantics
  • Dumb Broker vs. MOM
  • Kafka Semantics

Part 4: Kafka Cluster

  • Installing Cluster
  • Brokers
  • Consumers
  • Producers

Part 5: Apache Zookeeper

  • cluster management
  • roles
  • basic operations

Part 6: Kafka Producers

  • Role of Producer
  • Records
  • Message Durability
  • Batching and Compression
  • Create Console Producer
  • Publishing Data to Topics

Part 7: Kafka Consumers

  • Role of Consumer
  • Offsets
  • Consumers and Logs
  • Create Console Consumer
  • Performance tuning
  • Consumer Groups
  • Consumer Parallelism
  • Consumer Rebalancing

Part 8: The Kafka Data Model

  • Kafka Data Model
  • Topics
  • Partitions
  • Distribution
  • Reliability
  • Leaders/Followers
  • Replication Factor
  • Persistence

Part 9: The Kafka API

  • Producer API
  • Consumer API
  • Java, Scala, Python APIs
  • Creating/Modifying Topics
  • Partitioning Topics
  • Reading data from Kafka
  • Writing data to kafka

Part 10: Kafka in Production

  • Big Data Pipelines
  • Microservices
  • Case Study: Netflix
  • Apache Spark
  • Storm and Hadoop

Part: 11: Kafka Streams

  • Stream processing
  • High-Level Overview
  • Demo Application

Prerequisites

Participants in this workshop should have a working knowledge of at least one programming language (preferably Python, Java, or Scala) and be able to work from the command line in a Linux VM or container.