Ruby client library for Apache Kafka based on librdkafka
Kafka provides a Ruby client for Apache Kafka that
leverages librdkafka for its
performance and general correctness.
Add this line to your application’s Gemfile:
gem "kafka"
And then execute:
$ bundle install
Or install it yourself as:
$ gem install kafka
For more examples see the examples directory.
For a detailed introduction on librdkafka which would be useful when working
with Kafka::FFI
directly, see
the librdkafka documentation.
require "kafka"
config = Kafka::Config.new("bootstrap.servers": "localhost:9092")
producer = Kafka::Producer.new(config)
# Asynchronously publish a JSON payload to the events topic.
event = { time: Time.now, status: "success" }
result = producer.produce("events", event.to_json)
# Wait for the delivery to confirm that publishing was successful.
result.wait
result.successful?
# Provide a callback to be called when the delivery status is ready.
producer.produce("events", event.to_json) do |result|
StatsD.increment("kafka.total")
if result.error?
StatsD.increment("kafka.errors")
end
end
require "kafka"
config = Kafka::Config.new({
"bootstrap.servers": "localhost:9092",
# Required for consumers to know what consumer group to join.
"group.id": "web.production.eventer",
})
consumer = Kafka::Consumer.new(config)
consumer.subscribe("events")
@run = true
trap("INT") { @run = false }
trap("TERM") { @run = false }
while @run
consumer.poll do |message|
puts message.payload
end
end
Kafka has a lot of potential knobs to turn and dials to tweak. AKafka::Config
uses the same configuration options as librdkafka (and most or
all from the Java client). The defaults are generally good and a fine place to
start.
Kafka::FFI
provides bindings to functions in
librdkafka.
All of the names are the same and mostly have named parameters to help with
calling them. Be aware that you will need to handle some memory management to
call most functions exported in the bindings. See
rdkafka.h
for any questions about usage and semantics.
All classes in Kafka::FFI
provide an object oriented mapping to the functions
exported on Kafka::FFI.rd_kafka_*
. Most will require understanding memory
management but most should be easier to use and safe than calling into
librdkafka directly.
There are already at least two good gems for Kafka:
ruby-kafka and
rdkafka. In fact we’ve used both
of these gems on Dead Man’s Snitch for quite a while and they’ve been great. We
really appreciate all of the work that has gone into them .
Unfortunately, keeping up with Kafka feature and protocol changes can be a full
time job. Development on ruby-kafka has stalled for that reason and many
consumer/producer libraries are migrating away from it.
As a heartbeat and cron job monitoring service, we depend on receiving and
processing reports from jobs reliably and quickly. Failing to receive a report
could mean waking someone up at 3AM or forcing them to take time away from
family or friends to deal with a false alarm. What started as a deep dive into
rdkafka to understand how best to use it reliably, we had ideas we wanted to
implement that probably wouldn’t have been a good fit for rdkafka so we decided
to start from scratch.
Our goal is to provide a stable and easy to maintain Kafka consumer / producer
for Ruby. With time as our biggest constraint it makes sense to leverage
librdkafka as it has full time maintenance and support by the team behind
Kafka. FFI makes it fast and easy to expose new librdkafka APIs as they are
added. A stable test suite means being able to meaningfully spend the limited
amount of time we have available to invest. Embracing memory management and
building clean separations between layers should reduce the burden to implement
new bindings as the rules and responsibilities of each layer are clear.
The Producer
is thread safe for publishing messages but should only be closed
from a single thread. While the Consumer
is thread safe for calls to #poll
only one message can be in flight at a time, causing the threads to serialize.
Instead, create a single consumer for each thread.
Kafka is not fork
safe. Make sure to close any Producers or Consumer before
forking and rebuild them after forking the new process.
Kafka requires Ruby 2.7+ and is tested against Ruby 2.7+ and Kafka 2.8+.
To get started with development make sure to have docker
, docker-compose
, andkafkacat
installed as they make getting
up to speed easier. Some rake tasks depend on ctags
.
Before running the test, start a Kafka broker instance
rake kafka:up
Then run the tests with
rake
When you’re done shut down the Kafka instance by running:
rake kafka:down
Bug reports and pull requests are welcome on GitHub at
https://github.com/deadmanssnitch/kafka. This project is intended to be a safe,
welcoming space for collaboration, and contributors are expected to adhere to
the code of conduct.
The gem is available as open source under the terms of the
MIT License.
Everyone interacting in the Kafka project’s codebases and issue trackers are
expected to follow the
code of conduct.