Apache Kafka
Introduction
Apache Kafka® is a distributed streaming platform.
A streaming platform has three key capabilities:
Publish and subscribe to streams of records, similar to a message queue or enterprise messaging system.
Store streams of records in a fault-tolerant durable way.
Process streams of records as they occur. Kafka is generally used for two broad classes of applications:
Building real-time streaming data pipelines that reliably get data between systems or applications
Building real-time streaming applications that transform or react to the streams of data
To understand how Kafka does these things, let's dive in and explore Kafka's capabilities from the bottom up.
First a few concepts:
Kafka is run as a cluster on one or more servers that can span multiple datacenters.
The Kafka cluster stores streams of records in categories called topics.
Each record consists of a key, a value, and a timestamp.
Kafka Adoption Stories
References
Tutorial https://www.udemy.com/apache-kafka/
Mastering Apache Kafka https://legacy.gitbook.com/book/jaceklaskowski/apache-kafka/details
Spring Boot and Kafka https://cloud.spring.io/spring-cloud-stream/
Kafka has many connectors https://www.confluent.io/product/connectors/ one for Blockchain.info too
KSQL - enables something really effective: reading, writing and transforming data in real-time and a scale using a semantic already known by the majority of the community working in the data space, the SQL! https://www.confluent.io/product/ksql/ oraz https://www.rittmanmead.com/blog/2017/10/ksql-streaming-sql-for-apache-kafka/
similar to blockchain in some way https://www.rittmanmead.com/blog/2017/10/ksql-streaming-sql-for-apache-kafka/
it is fast https://softwaremill.com/mqperf/
you can keep data forever there ... https://www.confluent.io/blog/okay-store-data-apache-kafka/
Google PubSub vs Kafka https://stackoverflow.com/questions/38572071/i-am-evaluating-google-pub-sub-vs-kafka
Neo4j Use Case: Low Latency Graph Analytics & OLTP - Update 1M Nodes in 90 secs with Kafka and Neo4j Bolt https://gist.github.com/graphadvantage/a148613f75818897e396a64957dc6ef1
What is Apache Kafka? Why is it so popular? Should you use it? https://techbeacon.com/what-apache-kafka-why-it-so-popular-should-you-use-it
Free books bundle https://www.confluent.io/apache-kafka-stream-processing-book-bundle
Rabbit & Kafka microservices https://news.ycombinator.com/item?id=11284765
Rabbit & Kafka https://tech.trello.com/why-we-chose-kafka/
Martin Kleppmann | Kafka Summit SF 2018 Keynote (Is Kafka a Database?) https://www.youtube.com/watch?v=v2RJQELoM6Y
Apache Kafka and Microservices
Apache Kafka and Blockchain
Last updated