Building Data Pipelines with Apache Kafka Training Course

Duration

7 hours (usually 1 day, including breaks)


Requirements

Basic knowledge of Java 8 or Scala is recommended. To run the examples locally, please install Docker and Docker Compose beforehand.


Apache Kafka is a distributed streaming platform. It is the de facto standard for building data pipelines and covers many data-processing use cases: it can serve as a message queue, a distributed log, a stream processor, and more.

We'll start with some general theory behind data pipelines, then move on to the fundamental concepts behind Kafka. We'll also explore important components such as Kafka Streams and Kafka Connect.

Course Outline

  • Data pipelines 101: ingestion, storage, processing
  • Kafka fundamentals: topics, partitions, brokers, replication, etc.
  • Producer and Consumer APIs
  • Kafka Streams as a processing layer
  • Kafka Connect for integrating with external systems
  • Kafka best practices and tuning
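
As a warm-up for the fundamentals item in the outline, the idea of a topic split into append-only partitions can be pictured with a toy in-memory model. This is only a sketch and not the Kafka client API: the class ToyTopic and its methods partitionFor, send and poll are hypothetical names chosen for illustration. The hashing is simplified to String.hashCode(), whereas Kafka's default partitioner hashes the serialized key with murmur2. Brokers and replication are left out entirely.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy in-memory model of a topic: a fixed number of append-only partitions. */
final class ToyTopic {
    // One list per partition; a record's index in its list plays the role of the offset.
    private final List<List<String>> partitions = new ArrayList<>();

    ToyTopic(int partitionCount) {
        if (partitionCount <= 0) {
            throw new IllegalArgumentException("partitionCount must be positive");
        }
        for (int i = 0; i < partitionCount; i++) {
            partitions.add(new ArrayList<>());
        }
    }

    /** The same key always maps to the same partition (simplified hash, an assumption). */
    int partitionFor(String key) {
        return Math.floorMod(key.hashCode(), partitions.size());
    }

    /** Appends the value to the key's partition and returns the offset it was written at. */
    long send(String key, String value) {
        List<String> log = partitions.get(partitionFor(key));
        log.add(value);
        return log.size() - 1;
    }

    /** Returns a copy of the records from the given offset onward; reading never removes anything. */
    List<String> poll(int partition, long fromOffset) {
        List<String> log = partitions.get(partition);
        int start = (int) Math.min(Math.max(fromOffset, 0), log.size());
        return new ArrayList<>(log.subList(start, log.size()));
    }

    public static void main(String[] args) {
        ToyTopic topic = new ToyTopic(3);
        topic.send("user-1", "login");
        topic.send("user-2", "login");
        topic.send("user-1", "logout");

        // Records with the same key share a partition, so their relative order is preserved.
        int p = topic.partitionFor("user-1");
        System.out.println("partition " + p + ": " + topic.poll(p, 0));
    }
}
```

Because poll only reads from an offset and never deletes, several independent readers can replay the same partition, which is the property that lets a log double as a message queue.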


