How can you have multiple event types in a topic and maintain topic-name subject constraints?
This tutorial requires access to an Apache Kafka cluster, and the quickest way to get started free is on Confluent Cloud, which provides Kafka as a fully managed service.
After you log in to Confluent Cloud, click Environments in the lefthand navigation, click on Add cloud environment, and name the environment learn-kafka. Using a new environment keeps your learning resources separate from your other Confluent Cloud resources.
From the Billing & payment section in the menu, apply the promo code CC100KTS to receive an additional $100 of free usage on Confluent Cloud (details). To avoid having to enter a credit card, add an additional promo code, CONFLUENTDEV1. With this promo code, you will not have to enter a credit card for 30 days or until your credits run out.
Click on LEARN and follow the instructions to launch a Kafka cluster and enable Schema Registry.
To get started, make a new directory anywhere you’d like for this project:
mkdir multiple-event-types && cd multiple-event-types
Next, create a directory for configuration data:
mkdir configuration
From the Confluent Cloud Console, navigate to your Kafka cluster and then select Clients in the lefthand navigation. From the Clients view, create a new client and click Java to get the connection information customized to your cluster.
Create new credentials for your Kafka cluster and Schema Registry, writing in appropriate descriptions so that the keys are easy to find and delete later. The Confluent Cloud Console will show a configuration similar to the one below with your new credentials automatically populated (make sure Show API keys is checked). Copy and paste it into a configuration/ccloud.properties file on your machine.
# Required connection configs for Kafka producer, consumer, and admin
bootstrap.servers={{ BOOTSTRAP_SERVERS }}
security.protocol=SASL_SSL
sasl.jaas.config=org.apache.kafka.common.security.plain.PlainLoginModule required username='{{ CLUSTER_API_KEY }}' password='{{ CLUSTER_API_SECRET }}';
sasl.mechanism=PLAIN
# Required for correctness in Apache Kafka clients prior to 2.6
client.dns.lookup=use_all_dns_ips
# Best practice for Kafka producer to prevent data loss
acks=all
# Required connection configs for Confluent Cloud Schema Registry
schema.registry.url={{ SR_URL }}
basic.auth.credentials.source=USER_INFO
basic.auth.user.info={{ SR_API_KEY }}:{{ SR_API_SECRET }}
Do not directly copy and paste the above configuration. You must copy it from the Confluent Cloud Console so that it includes your Confluent Cloud information and credentials.
This tutorial has some steps for Kafka topic management and for producing and consuming events, for which you can use the Confluent Cloud Console or the Confluent CLI. Follow the instructions here to install the Confluent CLI, and then follow these steps to connect the CLI to your Confluent Cloud cluster.
In this step, we’re going to create the topics needed for this tutorial.
Since you are going to produce records using Protobuf and Avro serialization, you’ll need two topics.
Use the following commands to create the topics:
confluent kafka topic create avro-events
confluent kafka topic create proto-events
Create the following Gradle build file, named build.gradle, for the project:
buildscript {
repositories {
mavenCentral()
maven {
url = uri("https://packages.confluent.io/maven/")
}
maven {
url = uri("https://plugins.gradle.org/m2/")
}
maven {
url = uri("https://jitpack.io")
}
}
dependencies {
classpath "gradle.plugin.com.github.jengelman.gradle.plugins:shadow:7.0.0"
}
}
plugins {
id "java"
id "idea"
id "eclipse"
id "com.github.imflog.kafka-schema-registry-gradle-plugin" version "1.9.1"
id "com.google.protobuf" version "0.9.2"
id "com.github.davidmc24.gradle.plugin.avro" version "1.7.0"
}
sourceCompatibility = JavaVersion.VERSION_17
targetCompatibility = JavaVersion.VERSION_17
version = "0.0.1"
repositories {
mavenCentral()
maven {
url = uri("https://packages.confluent.io/maven/")
}
maven {
url = uri("https://jitpack.io")
}
}
apply plugin: "com.github.johnrengelman.shadow"
dependencies {
implementation 'com.google.protobuf:protobuf-java:3.22.2'
implementation 'org.apache.avro:avro:1.11.1'
implementation 'org.slf4j:slf4j-simple:2.0.7'
implementation 'org.apache.kafka:kafka-streams:3.4.0'
implementation ('org.apache.kafka:kafka-clients') {
version {
strictly '3.4.0'
}
}
testImplementation "junit:junit:4.13.2"
testImplementation 'org.hamcrest:hamcrest:2.2'
implementation "io.confluent:kafka-avro-serializer:7.1.4"
implementation "io.confluent:kafka-protobuf-serializer:7.2.2"
implementation "io.confluent:kafka-protobuf-provider:7.3.0"
}
protobuf {
generatedFilesBaseDir = "${project.buildDir}/generated-main-proto-java"
protoc {
artifact = 'com.google.protobuf:protoc:3.22.2'
}
}
test {
testLogging {
outputs.upToDateWhen { false }
showStandardStreams = true
exceptionFormat = "full"
}
}
jar {
manifest {
attributes(
"Class-Path": configurations.compileClasspath.collect { it.getName() }.join(" "),
"Main-Class": "io.confluent.developer.MultiEventProtobufProduceConsumeApp"
)
}
}
shadowJar {
archiveBaseName = "multiple-event-types-standalone"
archiveClassifier = ''
}
schemaRegistry {
def props = new Properties()
def configs = file("configuration/ccloud.properties")
if (configs.exists()) {
configs.withInputStream { props.load(it) }
def srUrl = props.getProperty("schema.registry.url")
def auth = props.getProperty("basic.auth.user.info").split(":")
println "Using Confluent properties Schema Registry endpoint:${srUrl}, username:${auth[0]},password:${auth[1]}"
url = srUrl
credentials {
// username is the characters up to the ':' in the basic.auth.user.info property
username = auth[0]
// password is everything after ':' in the basic.auth.user.info property
password = auth[1]
}
} else if (file("configuration/dev.properties").exists()) {
configs = file("configuration/dev.properties")
configs.withInputStream { props.load(it) }
def srUrl = props.getProperty("schema.registry.url")
println "Using local dev properties Schema Registry endpoint:${srUrl}"
} else {
println "No configs to parse yet"
}
// Possible types are ["JSON", "PROTOBUF", "AVRO"]
register {
subject('pageview', 'src/main/avro/pageview.avsc', 'AVRO')
subject('purchase', 'src/main/avro/purchase.avsc', 'AVRO')
subject('avro-events-value', 'src/main/avro/all-events.avsc', 'AVRO')
.addReference("io.confluent.developer.avro.Pageview", "pageview", 1)
.addReference("io.confluent.developer.avro.Purchase", "purchase", 1)
}
}
And be sure to run the following command to obtain the Gradle wrapper:
gradle wrapper
Then, create a development configuration file at configuration/dev.properties:
max.poll.interval.ms=300000
enable.auto.commit=true
auto.offset.reset=earliest
key.deserializer=org.apache.kafka.common.serialization.StringDeserializer
key.serializer=org.apache.kafka.common.serialization.StringSerializer
# Application specific properties
proto.topic.name=proto-events
proto.topic.partitions=1
proto.topic.replication.factor=3
avro.topic.name=avro-events
avro.topic.partitions=1
avro.topic.replication.factor=3
Using the command below, append the contents of configuration/ccloud.properties (with your Confluent Cloud configuration) to configuration/dev.properties (with the application properties).
cat configuration/ccloud.properties >> configuration/dev.properties
Let’s say you have a microservice for an e-commerce site that tracks both customer pageviews and purchases. Because the pageviews could be highly related to any purchase, you’d like to capture the exact order of both event types as they occur, so producing the events to the same topic makes sense.
Because you use the customer ID as the key, both event types for a given customer land in the same partition, so you are guaranteed to see them in the order they occurred. Even though these two events are similar, you represent them as distinct domain objects, because that fits well with other teams in your organization that need the same data.
For the Protobuf portion of the tutorial, you’ll need to create three protobuf schemas. Two of the schemas represent the domain objects in the example scenario, and the third schema contains references to the other two schemas.
To get started, create a directory to place the schemas:
mkdir -p src/main/proto
Then, create this schema file for the purchase domain object at src/main/proto/purchase.proto:
syntax = "proto3";
package io.confluent.developer.proto;
option java_outer_classname = "PurchaseProto";
message Purchase {
string item = 1;
double amount = 2;
string customer_id = 3;
}
We won’t go into the specifics of Protocol Buffers here, but the Proto 3 language guide and the Protobuf Java tutorial cover details beyond the scope of this tutorial.
Next, create the schema for the pageview object at src/main/proto/pageview.proto:
syntax = "proto3";
package io.confluent.developer.proto;
option java_outer_classname = "PageviewProto";
message Pageview {
string url = 1;
bool is_special = 2;
string customer_id = 3;
}
Now that you have the schemas in place for your two domain objects, you’ll create the schema that references the other two.
Go ahead and create the file src/main/proto/customer-event.proto, and then we’ll review the important parts of it.
syntax = "proto3";
package io.confluent.developer.proto;
import "purchase.proto";
import "pageview.proto"; (1)
option java_outer_classname = "CustomerEventProto";
message CustomerEvent { (2)
oneof action { (3)
Purchase purchase = 1;
Pageview pageview = 2;
}
string id = 3;
}
1 | Importing the other existing proto schema |
2 | The outer "container" event |
3 | A oneof field named action which will contain exactly one of the referenced types |
This is where the "rubber hits the road" regarding schema references. Here you have the CustomerEvent object containing either a Purchase or a Pageview object in the action field. Instead of nesting the schemas for these two objects, we reference existing ones. In addition to providing an effective way to combine multiple event types in the same topic while maintaining the TopicNameStrategy for subject names, references mean there is only one place to go when you need to make a schema update.
Note that, with Protobuf, a oneof can’t be a top-level field; it has to exist inside a "wrapper" class. This has implications when producing and consuming, which we will cover when creating the KafkaProducer and KafkaConsumer for this tutorial.
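To make the wrapper implication concrete, here is a minimal sketch (not part of the tutorial code) of how the generated classes behave once you run the code-generation step later in this tutorial; the class and method names assume standard protobuf-java codegen for the schemas above.
// Setting one member of the oneof replaces any previously set member.
CustomerEventProto.CustomerEvent event = CustomerEventProto.CustomerEvent.newBuilder()
        .setId("wilecoyote")
        .setPageview(PageviewProto.Pageview.newBuilder()
                .setCustomerId("wilecoyote").setUrl("http://acme/anvils").setIsSpecial(false).build())
        .setPurchase(PurchaseProto.Purchase.newBuilder()
                .setCustomerId("wilecoyote").setItem("anvil").setAmount(10.0).build())
        .build();
// The second setter wins: the oneof now holds the Purchase, and the Pageview is cleared.
System.out.println(event.getActionCase());   // PURCHASE
System.out.println(event.hasPageview());     // false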
Now you’ll take a similar step and create the Avro schemas; this is done for comparison and is not strictly required. As with Protobuf, you’ll have two schemas for the domain objects and a third schema that will contain the references. Avro has a distinct difference regarding the reference schema, and you’ll see it as we go through this section.
To get started, create a directory for the Avro schemas:
mkdir -p src/main/avro
Then, create this schema file for the purchase domain object at src/main/avro/purchase.avsc:
{
"type":"record",
"namespace": "io.confluent.developer.avro",
"name":"Purchase",
"fields": [
{"name": "item", "type":"string"},
{"name": "amount", "type": "double"},
{"name": "customer_id", "type": "string"}
]
}
Right away, you’ll notice one difference: you write Avro schemas in JSON, while Protobuf more closely resembles a programming language.
In this tutorial we won’t go into details about Avro. For more information you can read the Apache Avro documentation, Getting Started (Java) guide, and the Avro Specification.
Next, create the schema for the pageview object at src/main/avro/pageview.avsc:
{
"type":"record",
"namespace": "io.confluent.developer.avro",
"name":"Pageview",
"fields": [
{"name": "url", "type":"string"},
{"name": "is_special", "type": "boolean"},
{"name": "customer_id", "type": "string"}
]
}
Now that you have the schemas in place for your two domain objects, you’ll create a third schema that references the other two.
Go ahead and create the file src/main/avro/all-events.avsc now:
[
"io.confluent.developer.avro.Purchase",
"io.confluent.developer.avro.Pageview"
]
The all-events.avsc file contains an Avro union. The union type in Avro is analogous to the Protobuf oneof field in that it indicates that a field might have more than one datatype.
On the other hand, with Avro, a union can be a top-level element, so you don’t have to create a wrapper or container class; the Avro schema itself is a union, and it can represent either of the types listed in it. To be clear, you could create an Avro schema for a wrapper class and provide a union field within that schema, but we’re not going to cover that approach in this tutorial. The GitHub repo for the Multiple Events in Schema Registry Kafka Summit Europe 2021 presentation contains an example of using an outer Avro class containing a union field.
Now that you have created all of the necessary schema files, you need to compile them so that you can work with them in the application. The build.gradle file contains plugins for both Avro and Protobuf, so all you need to do is run the following command to generate the Java code files:
./gradlew build
Next, you’ll need to register some schemas. When you have an Avro schema where the top-level element is a union, you need to register the individual schemas in the union first. Then, you’ll register the parent schema itself along with references to the schemas making up the union element.
Fortunately, the Gradle Schema Registry plugin makes this easy for us. Here’s the configuration that you already have in the build.gradle file:
register {
subject('pageview', 'src/main/avro/pageview.avsc', 'AVRO') (1)
subject('purchase', 'src/main/avro/purchase.avsc', 'AVRO')
subject('avro-events-value', 'src/main/avro/all-events.avsc', 'AVRO') (2)
.addReference("io.confluent.developer.avro.Pageview", "pageview", 1) (3)
.addReference("io.confluent.developer.avro.Purchase", "purchase", 1)
}
1 | Registering the schemas for the referenced objects |
2 | The parent schema containing the references |
3 | Adding the references which point to the schemas registered previously |
To register these Avro schemas, run this in the command line:
./gradlew registerSchemasTask
This task runs quickly, and you should see some text followed by this result in the console:
BUILD SUCCESSFUL
We don’t have a corresponding command to register the Protobuf schemas. Instead, you are going to use the auto-registration feature for the Protobuf schemas, because Protobuf will recursively register any proto files included in the main schema. Using the Confluent Cloud Console, you can view the uploaded schemas by clicking on the Schema Registry tab and then clicking on the individual schemas to inspect them.
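If you’d like to confirm the Avro registrations programmatically rather than through the Console, the Schema Registry Java client (pulled in transitively by the Confluent serializer dependencies) can fetch the latest version of each subject registered by the Gradle task. This is a minimal sketch under those assumptions, not part of the tutorial application; the CheckSubjects class name is hypothetical, and the credentials are read from the configuration/ccloud.properties file you created earlier.
import io.confluent.kafka.schemaregistry.client.CachedSchemaRegistryClient;
import io.confluent.kafka.schemaregistry.client.SchemaMetadata;
import java.io.FileInputStream;
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;
public class CheckSubjects {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        try (FileInputStream fis = new FileInputStream("configuration/ccloud.properties")) {
            props.load(fis);
        }
        // Pass the basic.auth.* properties through so the client can authenticate to Confluent Cloud.
        Map<String, Object> configs = new HashMap<>();
        props.forEach((k, v) -> configs.put((String) k, v));
        CachedSchemaRegistryClient client = new CachedSchemaRegistryClient(
                props.getProperty("schema.registry.url"), 10, configs);
        // The three subjects registered by the registerSchemasTask step above.
        for (String subject : new String[]{"pageview", "purchase", "avro-events-value"}) {
            SchemaMetadata metadata = client.getLatestSchemaMetadata(subject);
            System.out.printf("subject=%s version=%d schema=%s%n", subject, metadata.getVersion(), metadata.getSchema());
        }
    }
}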
We’ll get into some of the details more in the next section.
Create a directory for the Java files in this project:
mkdir -p src/main/java/io/confluent/developer
To complete this tutorial, you’ll build an application that uses a KafkaProducer and a KafkaConsumer instance for producing and consuming both Avro and Protobuf records. The approach you’ll take in this tutorial is not typical of applications you’ll build in a production setting, but by using multiple clients, you can compare how to handle multiple event types for each serialization format.
To that end, the point of this sample application is this: you want to capture pageview and purchase events in the exact order that they occur, and you feel the best option is to have these events produced to the same topic. Since the customer ID will be the message key, you are guaranteed to get per-customer events in the order that they occur.
Let’s go over some of the key parts of the MultiEventProtobufProduceConsumeApp, starting with the producer for the Protobuf events.
Since this is an advanced topic, the tutorial doesn’t go into the basics of using a KafkaProducer. For more details, see the KafkaProducer tutorial.
public void produceProtobufEvents(final Supplier<Producer<String, CustomerEventProto.CustomerEvent>> producerSupplier,
final String topic,
final List<CustomerEventProto.CustomerEvent> protoCustomerEvents) {
try (Producer<String, CustomerEventProto.CustomerEvent> producer = producerSupplier.get()) { (1)
protoCustomerEvents.stream() (2)
.map((event -> new ProducerRecord<>(topic, event.getId(), event))) (3)
.forEach(producerRecord -> producer.send(producerRecord, ((metadata, exception)
//Details left out for clarity
// Relevant configurations
protoProduceConfigs.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, KafkaProtobufSerializer.class);
}
1 | Retrieving the producer instance from the Supplier |
2 | Using a java.util.stream to map each event into a ProducerRecord then send them to the broker |
3 | Creating the ProducerRecord instance. |
There are two points to emphasize here. The first is the type of the producer: it uses CustomerEventProto.CustomerEvent. Since you must use an outer class with Protobuf, the generics on the producer are a concrete type, so to set the key to the customer ID you can call the CustomerEvent#getId method directly. Also note the use of a Supplier to provide the producer; this delays creation until the Supplier.get() method is executed. Using a supplier also makes testing easier by simplifying the process of providing a different implementation, as the condensed sketch below shows.
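Here is that idea condensed from the application and test code later in this tutorial: the same produceProtobufEvents method accepts a lambda that builds a real KafkaProducer in main() and a MockProducer in a test, and neither producer is created until Supplier.get() runs inside the try-with-resources block.
// In the application (from main() further down):
multiEventApp.produceProtobufEvents(
        () -> new KafkaProducer<>(protoProduceConfigs(commonConfigs)), protobufTopic, multiEventApp.protobufEvents());
// In a test (see the MockProducer section later in this tutorial):
MockProducer<String, CustomerEventProto.CustomerEvent> mockProtoProducer =
        new MockProducer<>(true, stringSerializer, protobufSerializer);
produceConsumeApp.produceProtobufEvents(() -> mockProtoProducer, topic, produceConsumeApp.protobufEvents());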
The second point is that you can use the auto-registration feature of Schema Registry with Protobuf, and the referenced schemas get registered recursively.
Next, let’s move on to the KafkaConsumer for the Protobuf application.
consumerRecords.forEach(consumerRec -> {
CustomerEventProto.CustomerEvent customerEvent = consumerRec.value();
switch (customerEvent.getActionCase()) { (1)
case PURCHASE:
eventTracker.add(customerEvent.getPurchase().getItem()); (2)
break;
case PAGEVIEW:
eventTracker.add(customerEvent.getPageview().getUrl()); (3)
break;
// details left out for clarity
1 | Using a switch statement for the different enum types |
2 | Adding the purchased item to the event tracker |
3 | Adding the pageview link to the event tracker |
With Protobuf, when you have a oneof field, the generated code includes an enum with a constant for each message type the field can hold, so you can determine which type to work with using a switch statement. To retrieve the correct enum value, you call a get<field-name>Case method (in this case, getActionCase, since the oneof field is named action).
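One detail worth knowing: in addition to PURCHASE and PAGEVIEW, the generated ActionCase enum (assuming standard protobuf-java codegen for the action oneof above) includes an ACTION_NOT_SET constant, which getActionCase() returns when neither member was populated. That case can be worth handling explicitly rather than relying only on a default branch; a minimal sketch:
if (customerEvent.getActionCase() == CustomerEventProto.CustomerEvent.ActionCase.ACTION_NOT_SET) {
    // Neither purchase nor pageview was set on the producing side.
    LOG.warn("CustomerEvent {} carried no action", customerEvent.getId());
}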
Before you go on to create the application, we should mention the deserialization configurations that you need to set:
protoConsumeConfigs.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, KafkaProtobufDeserializer.class); (1)
protoConsumeConfigs.put(KafkaProtobufDeserializerConfig.SPECIFIC_PROTOBUF_VALUE_TYPE, CustomerEventProto.CustomerEvent.class); (2)
1 | Configurations for the Protobuf consumer to use the Protobuf deserializer |
2 | Setting the specific class type for the Protobuf deserializer |
It should come as no surprise that you need to set the deserializer class to KafkaProtobufDeserializer for the Protobuf consumer. But when working with multiple types, you still need to set the configuration for a specific type. For Protobuf it’s straightforward: setting the specific type to the outer class makes sense, since the Protobuf deserialization process knows how to handle the embedded types due to the schema.
Now go ahead and create the src/main/java/io/confluent/developer/MultiEventProtobufProduceConsumeApp.java file:
package io.confluent.developer;
import io.confluent.developer.proto.CustomerEventProto;
import io.confluent.developer.proto.PageviewProto;
import io.confluent.developer.proto.PurchaseProto;
import io.confluent.kafka.serializers.protobuf.KafkaProtobufDeserializer;
import io.confluent.kafka.serializers.protobuf.KafkaProtobufDeserializerConfig;
import io.confluent.kafka.serializers.protobuf.KafkaProtobufSerializer;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.NewTopic;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.Producer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.jetbrains.annotations.NotNull;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import java.io.FileInputStream;
import java.time.Duration;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Supplier;
public class MultiEventProtobufProduceConsumeApp implements AutoCloseable {
public static final String CUSTOMER_ID = "wilecoyote";
private static final Logger LOG = LoggerFactory.getLogger(MultiEventProtobufProduceConsumeApp.class);
private volatile boolean keepConsumingProto = true;
final ExecutorService executorService = Executors.newFixedThreadPool(2);
public void produceProtobufEvents(final Supplier<Producer<String, CustomerEventProto.CustomerEvent>> producerSupplier,
final String topic,
final List<CustomerEventProto.CustomerEvent> protoCustomerEvents) {
try (Producer<String, CustomerEventProto.CustomerEvent> producer = producerSupplier.get()) {
protoCustomerEvents.stream()
.map((event -> new ProducerRecord<>(topic, event.getId(), event)))
.forEach(producerRecord -> producer.send(producerRecord, ((metadata, exception) -> {
if (exception != null) {
LOG.error("Error Protobuf producing message", exception);
} else {
LOG.debug("Produced Protobuf record offset {} timestamp {}", metadata.offset(), metadata.timestamp());
}
})));
}
}
public void consumeProtoEvents(final Supplier<Consumer<String, CustomerEventProto.CustomerEvent>> consumerSupplier,
final String topic,
final List<String> eventTracker) {
try (Consumer<String, CustomerEventProto.CustomerEvent> eventConsumer = consumerSupplier.get()) {
eventConsumer.subscribe(Collections.singletonList(topic));
while (keepConsumingProto) {
ConsumerRecords<String, CustomerEventProto.CustomerEvent> consumerRecords = eventConsumer.poll(Duration.ofSeconds(1));
consumerRecords.forEach(consumerRec -> {
CustomerEventProto.CustomerEvent customerEvent = consumerRec.value();
switch (customerEvent.getActionCase()) {
case PURCHASE:
eventTracker.add(String.format("Protobuf Purchase event -> %s", customerEvent.getPurchase().getItem()));
break;
case PAGEVIEW:
eventTracker.add(String.format("Protobuf Pageview event -> %s", customerEvent.getPageview().getUrl()));
break;
default:
LOG.error("Unexpected type - this shouldn't happen");
}
});
}
}
}
public List<CustomerEventProto.CustomerEvent> protobufEvents() {
CustomerEventProto.CustomerEvent.Builder customerEventBuilder = CustomerEventProto.CustomerEvent.newBuilder();
PageviewProto.Pageview.Builder pageViewBuilder = PageviewProto.Pageview.newBuilder();
PurchaseProto.Purchase.Builder purchaseBuilder = PurchaseProto.Purchase.newBuilder();
List<CustomerEventProto.CustomerEvent> events = new ArrayList<>();
PageviewProto.Pageview pageView = pageViewBuilder.setCustomerId(CUSTOMER_ID).setUrl("http://acme/traps").setIsSpecial(false).build();
PageviewProto.Pageview pageViewII = pageViewBuilder.setCustomerId(CUSTOMER_ID).setUrl("http://acme/bombs").setIsSpecial(false).build();
PageviewProto.Pageview pageViewIII = pageViewBuilder.setCustomerId(CUSTOMER_ID).setUrl("http://acme/bait").setIsSpecial(true).build();
PurchaseProto.Purchase purchase = purchaseBuilder.setCustomerId(CUSTOMER_ID).setItem("road-runner-bait").setAmount(99.99).build();
events.add(customerEventBuilder.setId(CUSTOMER_ID).setPageview(pageView).build());
events.add(customerEventBuilder.setId(CUSTOMER_ID).setPageview(pageViewII).build());
events.add(customerEventBuilder.setId(CUSTOMER_ID).setPageview(pageViewIII).build());
events.add((customerEventBuilder.setId(CUSTOMER_ID).setPurchase(purchase)).build());
return events;
}
@Override
public void close() {
keepConsumingProto = false;
executorService.shutdown();
}
public void createTopics(final Properties allProps) {
try (final AdminClient client = AdminClient.create(allProps)) {
final List<NewTopic> topics = new ArrayList<>();
topics.add(new NewTopic(
allProps.getProperty("proto.topic.name"),
Integer.parseInt(allProps.getProperty("proto.topic.partitions")),
Short.parseShort(allProps.getProperty("proto.topic.replication.factor"))));
client.createTopics(topics);
}
}
public static void main(String[] args) throws Exception {
if (args.length < 1) {
LOG.error("Must provide the path to the properties");
}
Properties properties = new Properties();
try (FileInputStream fis = new FileInputStream(args[0])) {
properties.load(fis);
}
Map<String, Object> commonConfigs = new HashMap<>();
properties.forEach((key, value) -> commonConfigs.put((String) key, value));
try (MultiEventProtobufProduceConsumeApp multiEventApp = new MultiEventProtobufProduceConsumeApp()) {
multiEventApp.createTopics(properties);
String protobufTopic = (String) commonConfigs.get("proto.topic.name");
LOG.info("Producing Protobuf events now");
multiEventApp.produceProtobufEvents(() -> new KafkaProducer<>(protoProduceConfigs(commonConfigs)), protobufTopic, multiEventApp.protobufEvents());
List<String> protoEvents = new ArrayList<>();
multiEventApp.executorService.submit(() -> multiEventApp.consumeProtoEvents(() -> new KafkaConsumer<>(protoConsumeConfigs(commonConfigs)), protobufTopic, protoEvents));
while (protoEvents.size() < 3) {
Thread.sleep(100);
}
LOG.info("Consumed Proto Events {}", protoEvents);
}
}
@NotNull
static Map<String, Object> protoConsumeConfigs(Map<String, Object> commonConfigs) {
Map<String, Object> protoConsumeConfigs = new HashMap<>(commonConfigs);
protoConsumeConfigs.put(ConsumerConfig.GROUP_ID_CONFIG, "protobuf-consumer-group");
protoConsumeConfigs.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, KafkaProtobufDeserializer.class);
protoConsumeConfigs.put(KafkaProtobufDeserializerConfig.SPECIFIC_PROTOBUF_VALUE_TYPE, CustomerEventProto.CustomerEvent.class);
return protoConsumeConfigs;
}
@NotNull
static Map<String, Object> protoProduceConfigs(Map<String, Object> commonConfigs) {
Map<String, Object> protoProduceConfigs = new HashMap<>(commonConfigs);
protoProduceConfigs.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, KafkaProtobufSerializer.class);
return protoProduceConfigs;
}
}
Next up in the tutorial is creating the multi-event application for Avro.
public void produceAvroEvents(final Supplier<Producer<String, SpecificRecordBase>> producerSupplier,
final String topic,
final List<SpecificRecordBase> avroEvents) {
try (Producer<String, SpecificRecordBase> producer = producerSupplier.get()) { (1)
avroEvents.stream() (2)
.map((event -> new ProducerRecord<>(topic, (String) event.get("customer_id"), event))) (3)
.forEach(producerRecord -> producer.send(producerRecord,
//Details left out for clarity
//Relevant configurations
avroProduceConfigs.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, KafkaAvroSerializer.class);
avroProduceConfigs.put(AbstractKafkaSchemaSerDeConfig.AUTO_REGISTER_SCHEMAS, false); (4)
avroProduceConfigs.put(AbstractKafkaSchemaSerDeConfig.USE_LATEST_VERSION, true); (5)
1 | Getting the producer from the supplier |
2 | Streaming over the provided collection of Avro events to send |
3 | Creating the ProducerRecord instance, note the use of map-like access to get the required field for the key |
4 | Specifying to disable automatic schema registration |
5 | Setting to use the latest schema |
The approach with the Avro producer is very similar to the Protobuf version, but take a look at the type at annotation one: it’s SpecificRecordBase, an abstract class that every Avro generated class inherits from. Because the schema for the Avro multi-event topic uses a union at the top level, you don’t know the concrete type, so to use the customer ID as the key you need to access the field in a map-like fashion, using the field name as it exists in the schema. This is possible because SpecificRecordBase implements the GenericRecord interface, which provides the get method for retrieving a field value by name.
But the biggest difference is the configuration you provide to the producer for the Avro serializer, namely disabling automatic schema registration; otherwise, the serializer would register the individual event schema and override the union schema as the latest version. Additionally, since you’ve set use.latest.version to true, the serializer looks up the latest schema version (the union schema) and uses that for serialization. This blog post by Robert Yokota explains this mechanism in detail.
Next we’ll move on to creating the Avro consumer.
consumerRecords.forEach(consumerRec -> {
SpecificRecord avroRecord = consumerRec.value(); (1)
if (avroRecord instanceof Purchase) { (2)
Purchase purchase = (Purchase) avroRecord; (3)
eventTracker.add(purchase.getItem());
} else if (avroRecord instanceof Pageview) {
Pageview pageView = (Pageview) avroRecord;
eventTracker.add(pageView.getUrl());
// details left out for clarity
1 | Getting the record |
2 | Doing an instanceof check to determine the type |
3 | Casting to the appropriate concrete type |
With the Avro consumer, you’ll need to use the Java instanceof operator to determine the concrete type of each record. Notice that here you’re using the SpecificRecord interface, which every Avro generated object implements. Once you find the correct concrete type, you cast the record to that type and extract the required information.
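Since the build targets Java 17, a small variation (not used in the tutorial code) is pattern matching for instanceof, which folds the type check and the cast into a single step:
consumerRecords.forEach(consumerRec -> {
    SpecificRecord avroRecord = consumerRec.value();
    if (avroRecord instanceof Purchase purchase) {           // binds the cast result to purchase
        eventTracker.add(String.format("Avro Purchase event -> %s", purchase.getItem()));
    } else if (avroRecord instanceof Pageview pageview) {    // binds the cast result to pageview
        eventTracker.add(String.format("Avro Pageview event -> %s", pageview.getUrl()));
    }
});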
Before you go on to create the application, we should briefly mention the deserialization configurations you need to set:
avroConsumeConfigs.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, KafkaAvroDeserializer.class); (1)
avroConsumeConfigs.put(KafkaAvroDeserializerConfig.SPECIFIC_AVRO_READER_CONFIG, true); (2)
1 | Specifying to use the Avro deserializer for the Avro consumer |
2 | Setting the Avro deserializer to use the specific reader |
It should come as no surprise that you need to set the deserializer class to KafkaAvroDeserializer for the Avro consumer. But when working with multiple types, you still need to set the configuration for a specific type. With Avro, even with the union schema, you still need to set SPECIFIC_AVRO_READER_CONFIG to true to get the concrete types.
Go ahead and create the src/main/java/io/confluent/developer/MultiEventAvroProduceConsumeApp.java file:
package io.confluent.developer;
import io.confluent.developer.avro.Pageview;
import io.confluent.developer.avro.Purchase;
import io.confluent.kafka.serializers.AbstractKafkaSchemaSerDeConfig;
import io.confluent.kafka.serializers.KafkaAvroDeserializer;
import io.confluent.kafka.serializers.KafkaAvroDeserializerConfig;
import io.confluent.kafka.serializers.KafkaAvroSerializer;
import org.apache.avro.specific.SpecificRecord;
import org.apache.avro.specific.SpecificRecordBase;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.NewTopic;
import org.apache.kafka.clients.consumer.Consumer;
import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.Producer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.jetbrains.annotations.NotNull;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import java.io.FileInputStream;
import java.time.Duration;
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Supplier;
public class MultiEventAvroProduceConsumeApp implements AutoCloseable{
public static final String CUSTOMER_ID = "wilecoyote";
private static final Logger LOG = LoggerFactory.getLogger(MultiEventAvroProduceConsumeApp.class);
private volatile boolean keepConsumingAvro = true;
final ExecutorService executorService = Executors.newFixedThreadPool(1);
public void produceAvroEvents(final Supplier<Producer<String, SpecificRecordBase>> producerSupplier,
final String topic,
final List<SpecificRecordBase> avroEvents) {
try (Producer<String, SpecificRecordBase> producer = producerSupplier.get()) {
avroEvents.stream()
.map((event -> new ProducerRecord<>(topic, (String) event.get("customer_id"), event)))
.forEach(producerRecord -> producer.send(producerRecord, ((metadata, exception) -> {
if (exception != null) {
LOG.error("Error Avro producing message", exception);
} else {
LOG.debug("Produced Avro record offset {} timestamp {}", metadata.offset(), metadata.timestamp());
}
})));
}
}
public void consumeAvroEvents(final Supplier<Consumer<String, SpecificRecordBase>> consumerSupplier,
final String topic,
final List<String> eventTracker) {
try (Consumer<String, SpecificRecordBase> eventConsumer = consumerSupplier.get()) {
eventConsumer.subscribe(Collections.singletonList(topic));
while (keepConsumingAvro) {
ConsumerRecords<String, SpecificRecordBase> consumerRecords = eventConsumer.poll(Duration.ofSeconds(1));
consumerRecords.forEach(consumerRec -> {
SpecificRecord avroRecord = consumerRec.value();
if (avroRecord instanceof Purchase) {
Purchase purchase = (Purchase) avroRecord;
eventTracker.add(String.format("Avro Purchase event -> %s",purchase.getItem()));
} else if (avroRecord instanceof Pageview) {
Pageview pageView = (Pageview) avroRecord;
eventTracker.add(String.format("Avro Pageview event -> %s",pageView.getUrl()));
} else {
LOG.error("Unexpected type - this shouldn't happen");
}
});
}
}
}
public List<SpecificRecordBase> avroEvents() {
Pageview.Builder pageViewBuilder = Pageview.newBuilder();
Purchase.Builder purchaseBuilder = Purchase.newBuilder();
List<SpecificRecordBase> events = new ArrayList<>();
Pageview pageView = pageViewBuilder.setCustomerId(CUSTOMER_ID).setUrl("http://acme/traps").setIsSpecial(false).build();
Pageview pageViewII = pageViewBuilder.setCustomerId(CUSTOMER_ID).setUrl("http://acme/bombs").setIsSpecial(false).build();
Pageview pageViewIII = pageViewBuilder.setCustomerId(CUSTOMER_ID).setUrl("http://acme/bait").setIsSpecial(true).build();
Purchase purchase = purchaseBuilder.setCustomerId(CUSTOMER_ID).setItem("road-runner-bait").setAmount(99.99).build();
events.add(pageView);
events.add(pageViewII);
events.add(pageViewIII);
events.add(purchase);
return events;
}
@Override
public void close() {
keepConsumingAvro = false;
executorService.shutdown();
}
public void createTopics(final Properties allProps) {
try (final AdminClient client = AdminClient.create(allProps)) {
final List<NewTopic> topics = new ArrayList<>();
topics.add(new NewTopic(
allProps.getProperty("avro.topic.name"),
Integer.parseInt(allProps.getProperty("avro.topic.partitions")),
Short.parseShort(allProps.getProperty("avro.topic.replication.factor"))));
client.createTopics(topics);
}
}
public static void main(String[] args) throws Exception {
if (args.length < 1) {
LOG.error("Must provide the path to the properties");
}
Properties properties = new Properties();
try (FileInputStream fis = new FileInputStream(args[0])) {
properties.load(fis);
}
Map<String, Object> commonConfigs = new HashMap<>();
properties.forEach((key, value) -> commonConfigs.put((String) key, value));
try (MultiEventAvroProduceConsumeApp multiEventApp = new MultiEventAvroProduceConsumeApp()) {
multiEventApp.createTopics(properties);
String avroTopic = (String) commonConfigs.get("avro.topic.name");
LOG.info("Producing Avro events");
multiEventApp.produceAvroEvents(() -> new KafkaProducer<>(avroProduceConfigs(commonConfigs)), avroTopic, multiEventApp.avroEvents());
List<String> avroEvents = new ArrayList<>();
multiEventApp.executorService.submit(() -> multiEventApp.consumeAvroEvents(() -> new KafkaConsumer<>(avroConsumeConfigs(commonConfigs)), avroTopic, avroEvents));
while (avroEvents.size() < 3) {
Thread.sleep(100);
}
LOG.info("Consumed Avro Events {}", avroEvents);
}
}
@NotNull
static Map<String, Object> avroConsumeConfigs(Map<String, Object> commonConfigs) {
Map<String, Object> avroConsumeConfigs = new HashMap<>(commonConfigs);
avroConsumeConfigs.put(ConsumerConfig.GROUP_ID_CONFIG, "avro-consumer-group");
avroConsumeConfigs.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, KafkaAvroDeserializer.class);
avroConsumeConfigs.put(KafkaAvroDeserializerConfig.SPECIFIC_AVRO_READER_CONFIG, true);
return avroConsumeConfigs;
}
@NotNull
static Map<String, Object> avroProduceConfigs(Map<String, Object> commonConfigs) {
Map<String, Object> avroProduceConfigs = new HashMap<>(commonConfigs);
avroProduceConfigs.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, KafkaAvroSerializer.class);
avroProduceConfigs.put(AbstractKafkaSchemaSerDeConfig.AUTO_REGISTER_SCHEMAS, false);
avroProduceConfigs.put(AbstractKafkaSchemaSerDeConfig.USE_LATEST_VERSION, true);
return avroProduceConfigs;
}
}
In your terminal, run:
./gradlew shadowJar
Now that you have an uberjar, you can launch each application locally. When you run the following, you’ll see some output as the producer sends records to the Kafka broker, and you’ll also see the results of the multiple event consumer.
We’ll pipe the output through grep at the end of the command to filter out client logging that would otherwise make the results harder to see.
First run the multi-event application for Protobuf:
java -cp build/libs/multiple-event-types-standalone-0.0.1.jar io.confluent.developer.MultiEventProtobufProduceConsumeApp configuration/dev.properties 2>&1 | grep 'io.confluent.developer'
The output should look something like this:
[main] INFO io.confluent.developer.MultiEventProtobufProduceConsumeApp - Producing Protobuf events now
specific.protobuf.value.type = class io.confluent.developer.proto.CustomerEventProto$CustomerEvent
[main] INFO io.confluent.developer.MultiEventProtobufProduceConsumeApp - Consumed Proto Events [Protobuf Pageview event -> http://acme/traps, Protobuf Pageview event -> http://acme/bombs, Protobuf Pageview event -> http://acme/bait, Protobuf Purchase event -> road-runner-bait]
Then run the multi-event application for Avro:
java -cp build/libs/multiple-event-types-standalone-0.0.1.jar io.confluent.developer.MultiEventAvroProduceConsumeApp configuration/dev.properties 2>&1 | grep 'io.confluent.developer'
The output should look something like this:
[main] INFO io.confluent.developer.MultiEventAvroProduceConsumeApp - Producing Avro events
[main] INFO io.confluent.developer.MultiEventAvroProduceConsumeApp - Consumed Avro Events [Avro Pageview event -> http://acme/traps, Avro Pageview event -> http://acme/bombs, Avro Pageview event -> http://acme/bait, Avro Purchase event -> road-runner-bait]
You may try another tutorial, but if you don’t plan on doing other tutorials, use the Confluent Cloud Console or CLI to destroy all of the resources you created. Verify they are destroyed to avoid unexpected charges.
First, create a test file at configuration/test.properties
:
proto.topic=proto-records
avro.topic=avro-records
schema.registry.url=mock://multi-event-produce-consume-test
Create a directory for the tests to live in:
mkdir -p src/test/java/io/confluent/developer
Testing a KafkaProducer and KafkaConsumer used in an application is fairly easy to accomplish thanks to the MockProducer and the MockConsumer. Since both the KafkaProducer and KafkaConsumer are well tested, we don’t need to test the clients themselves. Instead, we’ll use mocks to verify that our logic executes as expected.
There are two test classes, MultiEventAvroProduceConsumeAppTest and MultiEventProtobufProduceConsumeAppTest (one for the Avro application and one for the Protobuf application). Before you create the tests, let’s look at some of the key parts of using a mock producer and consumer.
// Details left out for clarity
MockProducer<String, CustomerEventProto.CustomerEvent> mockProtoProducer
= new MockProducer<>(true, stringSerializer, protobufSerializer); (1)
List<CustomerEventProto.CustomerEvent> events = produceConsumeApp.protobufEvents();
produceConsumeApp.produceProtobufEvents(() -> mockProtoProducer, (String) commonConfigs.get("proto.topic"), events);(2)
actualKeyValues = mockProtoProducer.history().stream().map(this::toKeyValue).collect(Collectors.toList()); (3)
assertThat(actualKeyValues, equalTo(expectedKeyValues));
1 | Creating the MockProducer |
2 | Executing the produce of Protobuf records with the mock producer |
3 | Replaying the history of the producer |
In annotation 3 above, we can use a mock producer in the test to validate that all the records we expected to be produced were sent to the producer correctly. The test for the Avro producer has identical logic so we won’t review it here, but you can view the full source code if you’d like to see it.
For testing the consumer, it’s a little tricky because the consumer polls for records and will continue polling until you close the application. The MockConsumer provides a schedulePollTask method where you provide the action you want to take at each poll call.
mockConsumer.schedulePollTask(() -> { (1)
addTopicPartitionsAssignment(topic, mockConsumer);
addConsumerRecords(mockConsumer, produceConsumeApp.protobufEvents(), CustomerEventProto.CustomerEvent::getId, topic);
});
mockConsumer.schedulePollTask(() -> produceConsumeApp.close()); (2)
1 | Assigning the topic-partitions and records in the first poll call |
2 | Shutting down the application in the next call |
For the first poll call, we’ll assign the topic partitions and then provide the records to the consumer to process. In the next poll call, we simply shut the application down. Note that the methods in the first schedulePollTask are internal to the test; to fully understand what’s going on, you’ll need to look at the source code for the test. The test for the Avro multi-event application more or less uses the same logic, so we won’t review that test here.
Go ahead and create the following file for the Protobuf application test at src/test/java/io/confluent/developer/MultiEventProtobufProduceConsumeAppTest.java:
package io.confluent.developer;
import io.confluent.developer.proto.CustomerEventProto;
import io.confluent.kafka.serializers.protobuf.KafkaProtobufSerializer;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.MockConsumer;
import org.apache.kafka.clients.consumer.OffsetResetStrategy;
import org.apache.kafka.clients.producer.MockProducer;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.common.serialization.Serializer;
import org.apache.kafka.common.serialization.StringSerializer;
import org.apache.kafka.streams.KeyValue;
import org.junit.Before;
import org.junit.BeforeClass;
import org.junit.Test;
import java.io.FileInputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;
import java.util.stream.Collectors;
import static org.hamcrest.CoreMatchers.equalTo;
import static org.hamcrest.MatcherAssert.assertThat;
public class MultiEventProtobufProduceConsumeAppTest {
private static final Map<String, Object> commonConfigs = new HashMap<>();
private static final Properties properties = new Properties();
private final Serializer<String> stringSerializer = new StringSerializer();
private MultiEventProtobufProduceConsumeApp produceConsumeApp;
@BeforeClass
public static void beforeAllTests() throws IOException {
try (FileInputStream fis = new FileInputStream("configuration/test.properties")) {
properties.load(fis);
properties.forEach((key, value) -> commonConfigs.put((String) key, value));
}
}
@Before
public void setup() {
produceConsumeApp = new MultiEventProtobufProduceConsumeApp();
}
@Test
public void testProduceProtobufMultipleEvents() {
KafkaProtobufSerializer<CustomerEventProto.CustomerEvent> protobufSerializer
= new KafkaProtobufSerializer<>();
protobufSerializer.configure(commonConfigs, false);
MockProducer<String, CustomerEventProto.CustomerEvent> mockProtoProducer
= new MockProducer<>(true, stringSerializer, protobufSerializer);
List<CustomerEventProto.CustomerEvent> events = produceConsumeApp.protobufEvents();
produceConsumeApp.produceProtobufEvents(() -> mockProtoProducer, (String) commonConfigs.get("proto.topic"), events);
List<KeyValue<String, CustomerEventProto.CustomerEvent>> expectedKeyValues =
produceConsumeApp.protobufEvents().stream().map((e -> KeyValue.pair(e.getId(), e))).collect(Collectors.toList());
List<KeyValue<String, CustomerEventProto.CustomerEvent>> actualKeyValues =
mockProtoProducer.history().stream().map(this::toKeyValue).collect(Collectors.toList());
assertThat(actualKeyValues, equalTo(expectedKeyValues));
}
@Test
public void testConsumeProtobufEvents() {
MockConsumer<String, CustomerEventProto.CustomerEvent> mockConsumer = new MockConsumer<>(OffsetResetStrategy.EARLIEST);
String topic = (String) commonConfigs.get("proto.topic");
List<String> expectedProtoResults = Arrays.asList("Protobuf Pageview event -> http://acme/traps", "Protobuf Pageview event -> http://acme/bombs", "Protobuf Pageview event -> http://acme/bait", "Protobuf Purchase event -> road-runner-bait");
List<String> actualProtoResults = new ArrayList<>();
mockConsumer.schedulePollTask(()-> {
addTopicPartitionsAssignment(topic, mockConsumer);
addConsumerRecords(mockConsumer, produceConsumeApp.protobufEvents(), CustomerEventProto.CustomerEvent::getId, topic);
});
mockConsumer.schedulePollTask(() -> produceConsumeApp.close());
produceConsumeApp.consumeProtoEvents(() -> mockConsumer, topic, actualProtoResults);
assertThat(actualProtoResults, equalTo(expectedProtoResults));
}
private <K, V> KeyValue<K, V> toKeyValue(final ProducerRecord<K, V> producerRecord) {
return KeyValue.pair(producerRecord.key(), producerRecord.value());
}
private <V> void addTopicPartitionsAssignment(final String topic,
final MockConsumer<String, V> mockConsumer) {
final TopicPartition topicPartition = new TopicPartition(topic, 0);
final Map<TopicPartition, Long> beginningOffsets = new HashMap<>();
beginningOffsets.put(topicPartition, 0L);
mockConsumer.rebalance(Collections.singletonList(topicPartition));
mockConsumer.updateBeginningOffsets(beginningOffsets);
}
private <V> void addConsumerRecords(final MockConsumer<String, V> mockConsumer,
final List<V> records,
final Function<V, String> keyFunction,
final String topic) {
AtomicInteger offset = new AtomicInteger(0);
records.stream()
.map(r -> new ConsumerRecord<>(topic, 0, offset.getAndIncrement(), keyFunction.apply(r), r))
.forEach(mockConsumer::addRecord);
}
}
Then, create the file for the Avro application test at src/test/java/io/confluent/developer/MultiEventAvroProduceConsumeAppTest.java:
package io.confluent.developer;
import io.confluent.kafka.serializers.KafkaAvroSerializer;
import org.apache.avro.specific.SpecificRecordBase;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.MockConsumer;
import org.apache.kafka.clients.consumer.OffsetResetStrategy;
import org.apache.kafka.clients.producer.MockProducer;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.common.serialization.Serializer;
import org.apache.kafka.common.serialization.StringSerializer;
import org.apache.kafka.streams.KeyValue;
import org.junit.Before;
import org.junit.BeforeClass;
import org.junit.Test;
import java.io.FileInputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Properties;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;
import java.util.stream.Collectors;
import static org.hamcrest.CoreMatchers.equalTo;
import static org.hamcrest.MatcherAssert.assertThat;
public class MultiEventAvroProduceConsumeAppTest {
private static final Map<String, Object> commonConfigs = new HashMap<>();
private static final Properties properties = new Properties();
private final Serializer<String> stringSerializer = new StringSerializer();
private MultiEventAvroProduceConsumeApp produceConsumeApp;
@BeforeClass
public static void beforeAllTests() throws IOException {
try (FileInputStream fis = new FileInputStream("configuration/test.properties")) {
properties.load(fis);
properties.forEach((key, value) -> commonConfigs.put((String) key, value));
}
}
@Before
public void setup() {
produceConsumeApp = new MultiEventAvroProduceConsumeApp();
}
@Test
@SuppressWarnings("unchecked")
public void testProduceAvroMultipleEvents() {
KafkaAvroSerializer avroSerializer
= new KafkaAvroSerializer();
avroSerializer.configure(commonConfigs, false);
MockProducer<String, SpecificRecordBase> mockAvroProducer
= new MockProducer<String, SpecificRecordBase>(true, stringSerializer, (Serializer) avroSerializer);
produceConsumeApp.produceAvroEvents(() -> mockAvroProducer, (String) commonConfigs.get("avro.topic"), produceConsumeApp.avroEvents());
List<KeyValue<String, SpecificRecordBase>> expectedKeyValues =
produceConsumeApp.avroEvents().stream().map((e -> KeyValue.pair((String) e.get("customer_id"), e))).collect(Collectors.toList());
List<KeyValue<String, SpecificRecordBase>> actualKeyValues =
mockAvroProducer.history().stream().map(this::toKeyValue).collect(Collectors.toList());
assertThat(actualKeyValues, equalTo(expectedKeyValues));
}
@Test
public void testConsumeAvroEvents() {
MockConsumer<String, SpecificRecordBase> mockConsumer = new MockConsumer<>(OffsetResetStrategy.EARLIEST);
String topic = (String) commonConfigs.get("avro.topic");
List<String> expectedAvroResults = Arrays.asList("Avro Pageview event -> http://acme/traps", "Avro Pageview event -> http://acme/bombs", "Avro Pageview event -> http://acme/bait", "Avro Purchase event -> road-runner-bait");
List<String> actualAvroResults = new ArrayList<>();
mockConsumer.schedulePollTask(() -> {
addTopicPartitionsAssignment(topic, mockConsumer);
addConsumerRecords(mockConsumer, produceConsumeApp.avroEvents(), (SpecificRecordBase r) -> (String) r.get("customer_id"), topic);
});
mockConsumer.schedulePollTask(() -> produceConsumeApp.close());
produceConsumeApp.consumeAvroEvents(() -> mockConsumer, topic, actualAvroResults);
assertThat(actualAvroResults, equalTo(expectedAvroResults));
}
private <K, V> KeyValue<K, V> toKeyValue(final ProducerRecord<K, V> producerRecord) {
return KeyValue.pair(producerRecord.key(), producerRecord.value());
}
private <V> void addTopicPartitionsAssignment(final String topic,
final MockConsumer<String, V> mockConsumer) {
final TopicPartition topicPartition = new TopicPartition(topic, 0);
final Map<TopicPartition, Long> beginningOffsets = new HashMap<>();
beginningOffsets.put(topicPartition, 0L);
mockConsumer.rebalance(Collections.singletonList(topicPartition));
mockConsumer.updateBeginningOffsets(beginningOffsets);
}
private <V> void addConsumerRecords(final MockConsumer<String, V> mockConsumer,
final List<V> records,
final Function<V, String> keyFunction,
final String topic) {
AtomicInteger offset = new AtomicInteger(0);
records.stream()
.map(r -> new ConsumerRecord<>(topic, 0, offset.getAndIncrement(), keyFunction.apply(r), r))
.forEach(mockConsumer::addRecord);
}
}
Now run the tests, which is as simple as:
./gradlew test