Apache Kafka is a real-time data streaming technology that is used to collect, process, and store streaming event data or data that has no discrete beginning or end. Kafka takes streaming data and records exactly what happened and when. This record is called an immutable commit log. It is immutable because it can be appended to, but not otherwise changed. From there, users can subscribe to the log (access the data) and can also publish to it (add more data) from any number of streaming real-time applications, as well as other systems.
Kafka is an open-source system developed by the Apache Software Foundation and is written in Java & Scala. It is primarily used to build real-time streaming data pipelines and applications that adapt to the data streams. It combines messaging, storage, and stream processing to allow storage and analysis of both historical and real-time data.
This information was updated as of 01 January 2022. Changes may have occured since then.
Please help us improve by sharing your feedback to techhorizon@and.digital. Thank you.