今日学习之debezium同步sqlserver数据库数据到kafka

郭卫东 · 2025-3-1 11:07:22

Debezium先容

Debezium是一个开源的分布式平台，用于捕获数据库的更改事件，并将这些事件转换为可观察的流。它可以连接到各种不同类型的数据库，包括MySQL、PostgreSQL、MongoDB等，捕获数据库中的数据更改，并将这些更改转发到消息代理系统（如Kafka）中，以便其他应用程序可以实时地消费这些更改事件。
Debezium的主要特点包括：
1.实时数据捕获：Debezium能够实时地捕获数据库中的更改事件，包括插入、更新和删除操作，以便其他应用程序可以实时地获取这些更改。
2.可观察的流：捕获的数据库更改事件会被转换为可观察的流，这意味着其他应用程序可以轻松地订阅和消费这些事件，以便实时地获取数据库中的更改。
3.支持多种数据库：Debezium支持连接到各种不同类型的数据库，包括MySQL、PostgreSQL、MongoDB等，使得它非常灵活和通用。
4.可扩展性：由于Debezium是一个分布式平台，它具有良好的可扩展性，可以处理大规模的数据流，并确保高可用性和容错性。
5.Debezium 运行在 Kafka Connect 之上，这使它能够充分利用 Kafka 的分布式架构

复制代码

Debezium部署方式

复制代码

本文概括

复制代码

准备工具：

kafka集群和sqlserver2019本文不做部署分析，只针对debezium分析
详细步骤

一、下载 debezium-connector-sqlserver
进入debezium官网下载connector组件传送门
二、在kafka目录新建一个 connectors 目录

将下载的debezium connector解压放在该目录下

进入kafka config目录并编辑 connect-distributed.properties文件

在最下面到场plugin.path=刚刚新建的connectors目录

# List of comma-separated URIs the REST API will listen on. The supported protocols are HTTP and HTTPS.
# Specify hostname as 0.0.0.0 to bind to all interfaces.
# Leave hostname empty to bind to default interface.
# Examples of legal listener lists: HTTP://myhost:8083,HTTPS://myhost:8084"
listeners=HTTP://:8083
# The Hostname & Port that will be given out to other workers to connect to i.e. URLs that are routable from other servers.
# If not set, it uses the value for "listeners" if configured.
#rest.advertised.host.name=
#rest.advertised.port=
#rest.advertised.listener=
rest.port=18083
# Set to a list of filesystem paths separated by commas (,) to enable class loading isolation for plugins
# (connectors, converters, transformations). The list should consist of top level directories that include
# any combination of:
# a) directories immediately containing jars with plugins and their dependencies
# b) uber-jars with plugins and their dependencies
# c) directories immediately containing the package directory structure of classes of plugins and their dependencies
# Examples:
# plugin.path=/usr/local/share/java,/usr/local/share/kafka/plugins,/opt/connectors,
#plugin.path=
# 保存connectors的路径fka_2.12-3.5.1
plugin.path=/opt/kafka/kafka_2.12-3.5.1/connectors

复制代码

首先启动kafka实行命令：

[root@test config]# /opt/kafka/kafka_2.12-3.5.1/bin/kafka-server-start.sh /opt/kafka/kafka_2.12-3.5.1/config/server.properties

复制代码

启动分布式模式

../bin/connect-distributed.sh /opt/kafka/kafka_2.12-3.5.1/config/connect-distributed.properties

复制代码

实行命令： curl localhost:8083 初出现下面则体现成功

sqlserver开启CDC脚本（可根据场景更改）

复制代码

接下来创建一个 Source 连接器，此前先要设定好这个连接器的相关设置，请求接口：localhost:8083/connectors/ json内容如下：

复制代码

我们监听kafka topic

./kafka-console-consumer.sh --bootstrap-server localhost:9092 --from-beginning --topic fullfillment.dbo.aaaa

复制代码

在sqlserver新增一条数据：

至此完成监听

免责声明：如果侵犯了您的权益，请联系站长，我们会及时删除侵权内容，谢谢合作！更多信息从访问主页：qidao123.com:ToB企服之家，中国第一个企服评测及商务社交产业平台。

		自动登录	找回密码
密码			立即注册

0 个回复