
Dan Weston

Senior Curriculum Developer

Configuring MirrorMaker 2 - Exercise

In this hands-on exercise, we'll look at configuring MirrorMaker 2 and replicating data from one cluster to another.

To follow along, you'll need to have Docker installed. You'll also need to clone the GitHub repo for this course, Learn Hybrid Cloud Exercises.

The first thing we'll do is start our Docker containers. The Compose file for this course defines two ZooKeeper instances, two brokers (our source and target clusters), a Kafka Connect cluster, and a Schema Registry instance. As of the writing of this course, the most recent version of Kafka is 3.4 and Confluent Platform is 7.4.0. By the time you take this course, later releases may have changed how things work. If you can't figure out why something isn't working, please reach out to us using the Confluent Community Forum or Public Slack.

There are two ways to follow along with this exercise: locally, using the Docker Compose file, or in Gitpod.

Locally using Docker: Download or copy the Docker Compose file for this course:

curl --silent --output docker-compose.yml \
https://raw.githubusercontent.com/confluentinc/learn-hybrid-cloud-exercises/master/modules/module-10-files/docker-compose.yml

After you have cloned the GitHub repo, start the cluster by typing (this step is done for you if you are using Gitpod):

docker-compose up -d

Using Gitpod: open the exercise environment with the Open in Gitpod link for this course.

The rest of the instructions are the same whether you are using the local Docker Compose file or Gitpod.

The first time this runs it will take a while, as it has to download all the necessary images. Once it is done, make sure that everything is running by typing:

docker-compose ps
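Each service should show a status of Up. docker-compose ps also accepts service names if you want to check one container at a time; for example, assuming the Connect service is named connect, matching the docker exec commands below:

docker-compose ps connect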

We need some topics and data in the Kafka cluster. To create them, open a new terminal tab or window, enter the broker container, and run the kafka-topics command:

docker exec -it broker /bin/bash

Create the topic:

kafka-topics --create --topic replicate_me --bootstrap-server broker:29092
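Since we didn't pass any partition or replication settings, the topic uses the broker defaults. You can double-check them by describing the topic with the same tool:

kafka-topics --describe --topic replicate_me --bootstrap-server broker:29092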

Now let's produce some messages for MirrorMaker 2 to replicate:

while [[ true ]]; do echo "$RANDOM" | kafka-console-producer --topic replicate_me --bootstrap-server broker:29092; sleep 1; done &
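This loop produces one random number per second to the replicate_me topic, and the trailing & runs it in the background so you get your prompt back. To confirm messages are flowing before we set up replication, you can consume a handful of them (--max-messages tells the console consumer to exit after that many records):

kafka-console-consumer --topic replicate_me --from-beginning --max-messages 5 --bootstrap-server broker:29092

When you're finished with the exercise, kill %1 (assuming the loop is the only background job in this shell) stops the producer.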

Next, we'll enter the Kafka Connect Docker container to run our commands. Open up a new terminal tab or window:

docker exec -it connect /bin/bash

To create a connector you'll want to take a look at the official Kafka Connect documentation. For this example, we'll just use the /connectors endpoint.
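Before creating the connector, it's also worth confirming that the MirrorMaker 2 connector classes are installed on this Connect worker. The /connector-plugins endpoint lists every available connector class, and the output should include org.apache.kafka.connect.mirror.MirrorSourceConnector:

curl localhost:8083/connector-plugins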

Create a file describing the new connector in JSON format:

cat > connectors.json

{
  "name":"test_mirror",
  "config": {
    "connector.class": "org.apache.kafka.connect.mirror.MirrorSourceConnector",
    "name":"test_mirror",
    "source.cluster.alias":"source",
    "topics":"replicate_me",
    "source.cluster.bootstrap.servers":"broker:29092",
    "target.cluster.bootstrap.servers":"broker2:29093",
    "producer.override.bootstrap.servers":"broker2:29093",
    "offset-syncs.topic.replication.factor":"1"
  }
}

Hit Enter and then Ctrl-D (end of input) to save the file.
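If you'd rather not type into cat interactively, a heredoc writes the same file in one non-interactive command (the quoted 'EOF' delimiter stops the shell from expanding anything inside):

cat > connectors.json <<'EOF'
{
  "name":"test_mirror",
  "config": {
    "connector.class": "org.apache.kafka.connect.mirror.MirrorSourceConnector",
    "name":"test_mirror",
    "source.cluster.alias":"source",
    "topics":"replicate_me",
    "source.cluster.bootstrap.servers":"broker:29092",
    "target.cluster.bootstrap.servers":"broker2:29093",
    "producer.override.bootstrap.servers":"broker2:29093",
    "offset-syncs.topic.replication.factor":"1"
  }
}
EOF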

Now let's start mirroring by POSTing the file to the /connectors endpoint to create our new connector:

cat connectors.json | curl -X POST -H 'Content-Type: application/json' localhost:8083/connectors --data-binary @-

You can verify that the new connector has been created:

curl localhost:8083/connectors
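The response should be a JSON array that includes test_mirror. To see whether the connector and its task are actually in the RUNNING state (and to get a stack trace if something has failed), query the status endpoint:

curl localhost:8083/connectors/test_mirror/status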

Open up a new terminal and connect to broker2:

docker exec -it broker2 /bin/bash

Let's look at the topics:

kafka-topics --list --bootstrap-server broker2:29093

You'll see our new topic, source.replicate_me, named for the source.cluster.alias we set in the connector config.

Finally, let's take a look at the messages that are being replicated. First, go back to the terminal window that is connected to our first broker and enter the following command:

kafka-console-consumer --topic replicate_me --bootstrap-server broker:29092

Then place the broker2 terminal window next to that and run this command:

kafka-console-consumer --topic source.replicate_me --bootstrap-server broker2:29093
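By default the console consumer only shows records produced after it starts. If you want to compare the full history on both sides instead, add --from-beginning to each command; for example, on broker2:

kafka-console-consumer --topic source.replicate_me --from-beginning --bootstrap-server broker2:29093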

As you can see, the messages produced to replicate_me on the source cluster are being replicated to source.replicate_me on the target cluster.

As we talked about in the MirrorMaker 2 module, if you plan on replicating data from one cluster to another, a single Kafka Connect cluster is enough for Active/Passive and Aggregation topologies, assuming you are only working with two clusters. For every other scenario, such as Active/Active, Fan-out, Forwarding, or Active/Passive and Aggregation with more than two clusters, you will need a separate Kafka Connect instance for each target cluster; or, said another way, one Kafka Connect instance per target. This is why MirrorMaker 2 can be difficult to scale.
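For example, making this demo Active/Active would mean running a second MirrorSourceConnector in the opposite direction, and in a real deployment that reverse connector would run on a Connect cluster co-located with its target. A hypothetical reverse configuration (the connector name test_mirror_reverse and the topic replicate_back are illustrative, not part of this exercise) would simply swap the two clusters:

{
  "name":"test_mirror_reverse",
  "config": {
    "connector.class": "org.apache.kafka.connect.mirror.MirrorSourceConnector",
    "name":"test_mirror_reverse",
    "source.cluster.alias":"target",
    "topics":"replicate_back",
    "source.cluster.bootstrap.servers":"broker2:29093",
    "target.cluster.bootstrap.servers":"broker:29092",
    "producer.override.bootstrap.servers":"broker:29092",
    "offset-syncs.topic.replication.factor":"1"
  }
}

Topics mirrored by this connector would then appear on the first cluster with a target. prefix, matching its source.cluster.alias.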
