Kafka My Kafka Note v67
Michael Mühlbeyer
Overview
Architecture
Kafka Connect
Kafka Streams
KSQL
Cluster Installation
https://www.quora.com/What-is-the-relation-between-Kafka-the-writer-and-Apache-Kafka-the-distributed-messaging-system
http://kafka.apache.org/documentation.html#messageformat
Figure: topic "mytopic" split into partitions (P0, P1, P2) that are spread across Broker1, Broker2 and Broker3; producers write messages (1 2 3) to the partitions and consumers read them.
Each Partition is stored on the Broker’s disk as one or more log files
– Not to be confused with log4j log files
Each message in the log is identified by its offset number
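The offset idea can be sketched with a toy in-memory log. This is only an illustration, not Kafka code: the class PartitionLog and its methods append and read are hypothetical names, and the list stands in for the log files on the broker's disk. Offsets start at 0 here, as they do in Kafka.

```python
from dataclasses import dataclass, field


@dataclass
class PartitionLog:
    """Toy stand-in for one partition's log (hypothetical, not the Kafka API)."""
    messages: list = field(default_factory=list)

    def append(self, value: bytes) -> int:
        """Append a message at the end of the log and return its offset."""
        offset = len(self.messages)  # next free position in the log
        self.messages.append(value)
        return offset

    def read(self, offset: int, max_messages: int = 10) -> list:
        """Return (offset, value) pairs starting at the given offset."""
        if offset < 0:
            raise ValueError("offset must be >= 0")
        chunk = self.messages[offset:offset + max_messages]
        return list(enumerate(chunk, start=offset))


log = PartitionLog()
first = log.append(b"a")   # gets offset 0
second = log.append(b"b")  # gets offset 1
third = log.append(b"c")   # gets offset 2
records = log.read(1)      # everything from offset 1 onwards
```

A consumer only has to remember the last offset it processed to continue reading where it left off.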
Source: https://www.confluent.io/
Source: https://www.confluent.io/product/connectors/
Figure (open source components): Oracle (Txn log, Goldengate), Kafka Connect, Kafka, Kafka Streams API, KSQL, Elasticsearch.
Confluent Platform
– Data Integration: Database, Hadoop, Data Warehouse, …
– Real-time Applications: Transformations, Custom Apps, Analytics, …
– Monitoring & Administration: Confluent Control Center | Security
– Operations: Replicator | Auto Data Balancing
– Data Compatibility: Schema Registry
– Apache Kafka®: Core | Connect API | Streams API
Install Confluent
sudo vi /etc/yum.repos.d/confluent.repo
[Confluent.dist]
name=Confluent repository (dist)
baseurl=http://packages.confluent.io/rpm/3.3/7
gpgcheck=1
gpgkey=http://packages.confluent.io/rpm/3.3/archive.key
enabled=1
[Confluent]
name=Confluent repository
baseurl=http://packages.confluent.io/rpm/3.3
gpgcheck=1
gpgkey=http://packages.confluent.io/rpm/3.3/archive.key
enabled=1
For a lab environment the above settings should be fine; there is no need to change anything.
Start ZooKeeper