Enhance your career, get your certificate as a Data Streaming Engineer | Get your Certificate

course: Building Data Pipelines with Apache Kafka® and Confluent

How to Choose the Right CDC Method for Getting Data from RDBMS into Kafka

Tim Berglund

VP Developer Relations

How to Choose the Right CDC Method for Getting Data from RDBMS into Kafka

Both change data capture (CDC) methods have their strengths. Which one to use depends on your particular requirements. Here are the pros and cons of each:

Query-Based CDC

✅ Usually easier to set up: It's just a JDBC connection to your database, just like running a JDBC query from your application or favorite database dev tool.
✅ Requires fewer permissions: You're only querying the database, so you just need a regular read-only user with access to the tables.
🛑 Requires specific columns in source schema to track changes: If you don't have the option to modify the schema to include a timestamp or incrementing ID field, then life becomes somewhat difficult.
🛑 Impact of polling the database (or higher-latencies trade-off): If you run the same query too often against the database you're going to (quite rightly) have your DBA on the phone asking what's going on. But set the polling too infrequently, and you end up with data that is potentially less useful because of its age.
🛑 Can't track DELETEs: You can only query a relational database for data that's there right now. If the data's been deleted, you can't query it, and so you can't capture those events into Apache Kafka.
🛑 Can't track multiple events between polling interval: If a row changes several times during the period in which the connector polls, you'll only capture the latest state. In some cases, this may not matter (if you just want the latest state of a table). In others, it can matter a lot (if you're building applications that are driven by changes happening in the database).

Log-Based CDC

✅ Greater data fidelity: Everything is captured—inserts, updates, and even DELETEs. For each of these, we can also get the previous state of the row that changed.
✅ Lower latency and lower impact on the source database: Because we're not polling the database but reading the transaction log, it's lower latency, and we're putting less load on the database too.
🛑 More setup steps and higher system privileges required: The transaction log is a relatively low-level component of the database, so we need greater privileges in the database to access the API for it, and it can be more complicated to set up.

Do you have questions or comments? Join us in the #developer-confluent-io community Slack channel to engage in discussions with the creators of this content.

Use the promo code PIPELINES101 & CONFLUENTDEV1 to get $25 of free Confluent Cloud usage and skip credit card entry.

Get Started

Be the first to get updates and new content

We will only share developer content and updates, including notifications when new content is added. We will never send you sales emails. 🙂 By subscribing, you understand we will process your personal information in accordance with our Privacy Statement.

Apache Iceberg ™

Kafka® 101

Apache Flink® SQL

Apache Flink® Table API: Processing Data Streams in Java

Designing Event-Driven Microservices

Apache Flink® 101

Building Flink® Apps in Java

Kafka® 101

Kafka® Connect 101

Kafka Streams 101

Schema Registry 101

ksqlDB 101

Articles

Patterns

FAQs

Blog

Streamables

Learn More

Language Guides

Tutorials

Demos

Meetups

Community Slack

Community Catalysts

Community Use Cases

Confluent Developer Newsletter

Data Streaming Awards

NEWCurrent 2026

Past Current and Kafka Summit events

How to Choose the Right CDC Method for Getting Data from RDBMS into Kafka

How to Choose the Right CDC Method for Getting Data from RDBMS into Kafka

Query-Based CDC

Log-Based CDC

Use the promo code PIPELINES101 & CONFLUENTDEV1 to get $25 of free Confluent Cloud usage and skip credit card entry.

Get started with Confluent Cloud

Modules: Start from lesson 1Total 13

Related Resources

Be the first to get updates and new content

Modules: Start from lesson 1
Total 13