Apache Kafka is a high-throughput distributed messaging system that allows for both streaming and offline log processing. It uses Apache Zookeeper for coordination and supports activity stream processing and real-time pub/sub messaging. Kafka bridges the gaps between pure offline log processing and traditional messaging systems by providing features like batching, transactions, persistence, and support for multiple consumers.
Apache Kafka is introduced as a high-throughput distributed messaging system, emphasizing its architecture and playful comparison to JMS.
Kafka supports activity stream processing, real-time pub/sub messaging, and uses Apache Zookeeper. It addresses common messaging issues with high performance and persistence.
Kafka bridges the gap between offline log processing and traditional messaging, supporting both real-time processing and offline aggregation with frameworks like Hadoop. The slides also reference producer and consumer testing and further emphasize Kafka's capabilities in large-scale data handling and stream processing.
The presentation concludes with gratitude and reiteration of Kafka's advantages in messaging and data processing.
Apache Kafka
A high-throughput distributed messaging system.
“Distributed Publish-Subscribe
messaging system, High
Throughput, Persistent,
Partitioning Messages, Parallel
data load into Hadoop.”
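To make "partitioning messages" concrete, here is a minimal sketch in Python. It is a toy in-memory model, not Kafka's API or its actual partitioner: the names `partition_for` and `PartitionedTopic` are hypothetical, and the CRC32 hash is an assumption chosen only because it is stable across runs.

```python
import zlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a message key to a partition index using a stable hash.

    Assumption: CRC32 is used here for illustration only.
    """
    return zlib.crc32(key.encode("utf-8")) % num_partitions


class PartitionedTopic:
    """Toy topic: one append-only list per partition (hypothetical model)."""

    def __init__(self, num_partitions: int):
        self.partitions = [[] for _ in range(num_partitions)]

    def publish(self, key: str, value: str) -> int:
        # Messages with the same key always land in the same partition,
        # so their relative order is preserved within that partition.
        index = partition_for(key, len(self.partitions))
        self.partitions[index].append(value)
        return index


topic = PartitionedTopic(num_partitions=4)
first = topic.publish("user-1", "login")
second = topic.publish("user-1", "click")
print(first == second)  # same key, same partition
```

Because each partition is an independent list, separate readers could work through different partitions in parallel, which is the idea behind the parallel data load mentioned in the quote.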
Apache Kafka
A high-throughput distributed messaging system.
Pure offline log
processing:
Real-time,
High Performance,
High Throughput,
Lightweight business logic (and
not lots of it) to deliver that.
Apache Kafka
A high-throughput distributed messaging system.
Pure Messaging
issues (ActiveMQ/RabbitMQ):
No API for Batching,
Transactional,
No persistence means multiple
consumers are limited by architecture.
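The contrast with the issues above can be sketched as a persistent log that accepts batches and lets each consumer keep its own read position. This is a simplified in-memory illustration, not Kafka's client API: `Log`, `Consumer`, `append_batch`, and `poll` are hypothetical names, and a real log would be stored on disk.

```python
class Log:
    """Toy append-only log; messages are retained after being read."""

    def __init__(self):
        self._messages = []

    def append_batch(self, messages):
        # Batching: many messages are appended in a single call.
        first_offset = len(self._messages)
        self._messages.extend(messages)
        return first_offset

    def read(self, offset, max_messages):
        return self._messages[offset:offset + max_messages]


class Consumer:
    """Each consumer tracks its own offset, so consumers do not interfere."""

    def __init__(self, log):
        self.log = log
        self.offset = 0

    def poll(self, max_messages=10):
        batch = self.log.read(self.offset, max_messages)
        self.offset += len(batch)
        return batch


log = Log()
log.append_batch(["a", "b", "c"])

realtime = Consumer(log)   # e.g. a real-time subscriber
offline = Consumer(log)    # e.g. a later offline load

print(realtime.poll(2))    # ['a', 'b']
print(offline.poll(10))    # ['a', 'b', 'c']
print(realtime.poll(2))    # ['c']
```

Since reading does not remove messages, a slow or late consumer sees the same data as a fast one, which is what allows one log to serve both real-time and offline use.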
Apache Kafka
A high-throughput distributed messaging system.
Supports *Activity Stream Processing*, like Facebook's Scribe and Apache Flume.
*Activity Stream Processing* => Collecting and aggregating large
amounts of data, very present in social business. Later you often do
offline analysis with Hadoop, a.k.a. offline log aggregation.