Stream Processing with Kafka Streams Training Course
Kafka Streams is a client-side library designed for building applications and microservices that exchange data with a Kafka messaging system. Traditionally, Apache Kafka has depended on tools like Apache Spark or Apache Storm to handle data flow between message producers and consumers. By invoking the Kafka Streams API within an application, data can be processed directly inside Kafka, eliminating the need to transfer data to a separate cluster for processing.
In this instructor-led live training, participants will learn how to integrate Kafka Streams into sample Java applications that facilitate data exchange with Apache Kafka for stream processing.
Upon completion of this training, participants will be able to:
- Grasp the features and benefits of Kafka Streams compared to other stream processing frameworks
- Process stream data directly within a Kafka cluster
- Develop a Java or Scala application or microservice that integrates with Kafka and Kafka Streams
- Write efficient code to transform input Kafka topics into output Kafka topics
- Build, package, and deploy the application
Audience
- Developers
Format of the course
- A mix of lectures, discussions, exercises, and extensive hands-on practice
Notes
- To request a customized training for this course, please contact us to arrange
Course Outline
Introduction
- Kafka vs Spark, Flink, and Storm
Overview of Kafka Streams Features
- Stateful and stateless processing, event-time processing, DSL, event-time based windowing operations, etc.
Case Study: Kafka Streams API for Predictive Budgeting
Setting up the Development Environment
Creating a Streams Application
Starting the Kafka Cluster
Preparing the Topics and Input Data
Options for Processing Stream Data
- High-level Kafka Streams DSL
- Lower-level Processor
Transforming the Input Data
Inspecting the Output Data
Stopping the Kafka Cluster
Options for Deploying the Application
- Classic ops tools (Puppet, Chef and Salt)
- Docker
- WAR file
Troubleshooting
Summary and Conclusion
Requirements
- An understanding of Apache Kafka
- Java programming experience
Open Training Courses require 5+ participants.
Stream Processing with Kafka Streams Training Course - Booking
Stream Processing with Kafka Streams Training Course - Enquiry
Stream Processing with Kafka Streams - Consultancy Enquiry
Testimonials (1)
Recalling/reviewing keypoints of the topics discussed.
Paolo Angelo Gaton - SMS Global Technologies Inc.
Course - Building Stream Processing Applications with Kafka Streams
Upcoming Courses
Related Courses
Administration of Confluent Apache Kafka
21 HoursConfluent Apache Kafka is a distributed event streaming platform engineered for high-throughput, fault-tolerant data pipelines and real-time analytics.
This instructor-led live training (available online or onsite) targets intermediate-level system administrators and DevOps professionals aiming to install, configure, monitor, and troubleshoot Confluent Apache Kafka clusters.
Upon completing this training, participants will be able to:
- Grasp the components and architecture of Confluent Kafka.
- Deploy and manage Kafka brokers, Zookeeper quorums, and essential services.
- Configure advanced features such as security, replication, and performance tuning.
- Utilise management tools to monitor and maintain Kafka clusters.
Course Format
- Interactive lectures and discussions.
- Ample exercises and practice sessions.
- Hands-on implementation within a live-lab environment.
Course Customisation Options
- To request customised training for this course, please contact us to arrange.
Apache Kafka Connect
7 HoursThis instructor-led, live training in Malaysia (online or onsite) is aimed at developers who wish to integrate Apache Kafka with existing databases and applications for processing, analysis, etc.
By the end of this training, participants will be able to:
- Use Kafka Connect to ingest large amounts of data from a database into Kafka topics.
- Ingest log data generated by an application servers into Kafka topics.
- Make any collected data available for stream processing.
- Export data from Kafka topics into secondary systems for storage and analysis.
Confluent Apache Kafka: Cluster Operations and Configuration
16 HoursConfluent Apache Kafka is an enterprise-grade distributed event streaming platform built on Apache Kafka. It supports high-throughput, fault-tolerant data pipelines and real-time streaming applications.
This instructor-led, live training (online or onsite) is aimed at intermediate-level engineers and administrators who wish to deploy, configure, and optimize Confluent Kafka clusters in production environments.
By the end of this training, participants will be able to:
- Install, configure, and operate Confluent Kafka clusters with multiple brokers.
- Design high-availability setups using Zookeeper and replication techniques.
- Tune performance, monitor metrics, and apply recovery strategies.
- Secure, scale, and integrate Kafka with enterprise environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Kafka Solutions with Confluent
14 HoursThis instructor-led, live training (available online or onsite) is designed for engineers who want to use Confluent, a distribution of Kafka, to build and manage a real-time data processing platform for their applications.
Upon completion of this training, participants will be able to:
- Install and configure the Confluent Platform.
- Leverage Confluent's management tools and services to simplify Kafka operations.
- Store and process incoming stream data effectively.
- Optimize and manage Kafka clusters efficiently.
- Secure data streams.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- This course is based on the open-source version of Confluent: Confluent Open Source.
- To request customized training for this course, please contact us to arrange.
Building Data Pipelines with Apache Kafka
7 HoursApache Kafka serves as a distributed streaming platform and has emerged as the industry standard for developing data pipelines. It addresses a wide array of data processing use cases, functioning effectively as a message queue, distributed log, stream processor, and more.
Our journey begins with an exploration of the theoretical foundations of data pipelines, followed by a deep dive into the core principles of Kafka. We will also examine essential components such as Kafka Streams and Kafka Connect.
A Practical Introduction to Stream Processing
21 HoursIn this instructor-led live training in Malaysia (onsite or remote), participants will learn how to set up and integrate various Stream Processing frameworks with existing big data storage systems, related software applications, and microservices.
Upon completing this training, participants will be able to:
- Install and configure various Stream Processing frameworks, such as Spark Streaming and Kafka Streaming.
- Understand and select the most suitable framework for specific tasks.
- Process data continuously, concurrently, and record by record.
- Integrate Stream Processing solutions with existing databases, data warehouses, data lakes, and other systems.
- Integrate the most appropriate stream processing library with enterprise applications and microservices.
Distributed Messaging with Apache Kafka
14 HoursThis course is designed for enterprise architects, developers, system administrators, and anyone seeking to understand and leverage a high-throughput distributed messaging system. If you have more specific requirements (e.g., focusing only on system administration aspects), this course can be tailored to better suit your needs.
Kafka for Administrators
21 HoursThis instructor-led, live training in Malaysia (online or onsite) is aimed at beginner-level, intermediate-level, and advanced-level system administrators and operations engineers who wish to use Apache Kafka to deploy, secure, monitor, and troubleshoot Kafka clusters.
By the end of this training, participants will be able to explain Kafka architecture and KRaft mode, operate and secure Kafka clusters, monitor performance and reliability, and resolve common production issues.
Apache Kafka for Developers
21 HoursThis instructor-led, live training in Malaysia (online or onsite) is aimed at intermediate-level developers who wish to develop big data applications with Apache Kafka.
By the end of this training, participants will be able to:
- Develop Kafka producers and consumers to send and read data from Kafka.
- Integrate Kafka with external systems using Kafka Connect.
- Write streaming applications with Kafka Streams & ksqlDB.
- Integrate a Kafka client application with Confluent Cloud for cloud-based Kafka deployments.
- Gain practical experience through hands-on exercises and real-world use cases.
Apache Kafka for Python Programmers
7 HoursThis instructor-led live training in Malaysia (online or onsite) is aimed at data engineers, data scientists, and programmers who wish to apply Apache Kafka features for data streaming with Python.
By the end of this training, participants will be able to use Apache Kafka to monitor and manage conditions in continuous data streams using Python programming.
Kafka Fundamentals for Java Developers
14 HoursThis instructor-led, live training in Malaysia (online or onsite) is aimed at intermediate-level Java developers who wish to integrate Apache Kafka into their applications for reliable, scalable, and high-throughput messaging.
By the end of this training, participants will be able to:
- Understand the architecture and core components of Kafka.
- Set up and configure a Kafka cluster.
- Produce and consume messages using Java.
- Implement Kafka Streams for real-time data processing.
- Ensure fault tolerance and scalability in Kafka applications.
Administration of Kafka Message Queue
14 HoursThis instructor-led, live training in Malaysia (online or onsite) is designed for intermediate-level system administrators who aim to leverage Kafka's message queuing features effectively.
By the conclusion of this training, participants will be able to:
- Comprehend Kafka's message queuing capabilities and architecture.
- Configure Kafka topics for message queuing scenarios.
- Produce and consume messages using Kafka.
- Monitor and manage Kafka as a message queue.
Security for Apache Kafka
7 HoursThis instructor-led live training in Malaysia (online or onsite) is designed for software testers who wish to implement network security measures within Apache Kafka applications.
By the end of this training, participants will be able to:
- Deploy Apache Kafka onto a cloud-based server.
- Implement SSL encryption to prevent security attacks.
- Add ACL authentication to track and control user access.
- Ensure that trusted clients have access to Kafka clusters via SSL and SASL authentication.
Apache Kafka and Spring Boot
7 HoursThis instructor-led, live training in Malaysia (online or onsite) is aimed at intermediate-level developers who wish to learn the fundamentals of Kafka and integrate it with Spring Boot.
By the end of this training, participants will be able to:
- Understand Kafka and its architecture.
- Learn how to install, configure, and set up a basic Kafka environment.
- Integrate Kafka with Spring Boot.
Administration of Kafka Topic
14 HoursThis instructor-led, live training in Malaysia (online or on-site) is designed for beginner to intermediate system administrators who want to learn how to effectively manage Kafka topics for efficient data streaming and processing.
By the end of this training, participants will be able to:
- Understand the fundamentals and architecture of Kafka topics.
- Create, configure, and manage Kafka topics.
- Monitor Kafka topics for health, performance, and availability.
- Implement security measures for Kafka topics.