PostgreSQL的一主一从集群搭建部署 (两同步)
一、实验环境虚拟机名IP身份简称keep-postgres12-node1192.168.122.87主节点node1keep-postgres12-node2192.168.122.89备节点node2keep-postgres12-node3192.168.122.90备节点node3 二、安装数据库
源码包方式(主)
1、创建用户
# groupadd postgres
# useradd -g postgres postgres -m -s /bin/bash
# echo "Database@123" | passwd --stdin postgres
2、修改服务器内核信息
# echo '# 最大共享内存段大小 (默认值68719476736)
kernel.shmmax = 68719476736
# 可以使用的共享内存的总量 (默认值4294967296)
kernel.shmall = 4294967296
# 整个系统共享内存段的最大数目
kernel.shmmni = 4096
# 每个信号对象集的最大信号对象数
kernel.sem = 50100 64128000 50100 1280
# 文件句柄的最大数量
fs.file-max = 7672460
# 应用程序可使用的IPv4端口范围
net.ipv4.ip_local_port_range = 9000 65000
# 套接字接收缓冲区大小的缺省值
net.core.rmem_default = 1048576
# 套接字发送缓冲区大小的缺省值
net.core.wmem_default = 262144
# 套接字发送缓冲区大小的最大值
net.core.wmem_max = 1048576' >> /etc/sysctl.conf
# 使参数生效
# sysctl -p
3、安装依赖包
# yum install gcc gcc-c++ zlib-devel readline-devel perl-ExtUtils-Embed pam-devel openssl openssl-devel cmake libxslt-devel libxml2-devel openldap-devel python-devel tcl tcl-devel bison flex xmlto -y
4、创建数据库目录
# mkdir /data
# mkdir /data/postgres12.2
# chown -R postgres: /data
5、创建环境变量
# su - postgres
$ echo 'export PGPORT=5432
export PG_HOME=/data/postgres12.2
export PATH=$PG_HOME/bin:$PATH
export PGDATA=$PG_HOME/data
export LD_LIBRARY_PATH=$PG_HOME/lib
export LANG=en_US.utf8' >> ~/.bash_profile && source~/.bash_profile
6、下载源码包,并解压
链接1:https://ftp.postgresql.org/pub/source
链接2:https://www.postgresql.org/ftp/source/
# 安装postgresql-12.2版本
# 下载postgresql-12.2源码包
$ cd /data
$ wget https://ftp.postgresql.org/pub/source/v12.2/postgresql-12.2.tar.bz2
# 解压源码包
$ tar -xf postgresql-12.2.tar.bz2
7、编译源码包
$ ./configure --prefix=/data/postgres12.2 --with-pgport=5432 --with-openssl --with-perl \
--with-tcl --with-python --with-pam --without-ldap --with-libxml --with-libxslt \
--enable-thread-safety --with-wal-blocksize=16 --with-blocksize=8
# 包括第三方插件全部编译,包含文档和所有的contirb
$ gmake world
8、安装数据库
$ make && make install
9、设置wal日志目录
$ # 用于保存wal日志
$ mkdir $PG_HOME/arch
10、初始化数据库
$ # 创建目录
$ mkdir $PG_HOME/data
$ # 初始化数据库集簇
$ # --data-checksums 主从复制时需要
$ initdb -D $PGDATA -W --data-checksums
11、启动数据库
$ pg_ctl -D $PGDATA start
12、创建数据库用户和数据库
$ psql -d postgres -p 5432
postgres=# create database testdb;
postgres=# create user keep with Superuser password'keep';
13、设置远程毗连
# 设置毗连
$ cd $PGDATA
$ sed -i "s/#listen_addresses = 'localhost'/listen_addresses = '*'/g" postgresql.conf
$ echo 'host all all 0.0.0.0/0 md5' >> pg_hba.conf
# 重启数据库
$ pg_ctl stop
$ pg_ctl -D $PGDATA start
# 验证远程毗连
$ psql -d testdb -U keep -p 5432 -h 192.168.140.96
14、设置数据库日志
$ echo "# Add settings for extensions here
log_destination='csvlog'
logging_collector=on
log_directory='pg_log'
log_filename='postgresql-%Y-%m-%d.log'
log_truncate_on_rotation=off
log_rotation_age=1d
log_rotation_size=0
log_error_verbosity=verbose" >> $PGDATA/postgresql.conf
源码包方式(备 node2/node3)
1、创建用户
# groupadd postgres
# useradd -g postgres postgres -m -s /bin/bash
# echo "Database@123" | passwd --stdin postgres
2、修改服务器内核信息
# echo '# 最大共享内存段大小 (默认值68719476736)
kernel.shmmax = 68719476736
# 可以使用的共享内存的总量 (默认值4294967296)
kernel.shmall = 4294967296
# 整个系统共享内存段的最大数目
kernel.shmmni = 4096
# 每个信号对象集的最大信号对象数
kernel.sem = 50100 64128000 50100 1280
# 文件句柄的最大数量
fs.file-max = 7672460
# 应用程序可使用的IPv4端口范围
net.ipv4.ip_local_port_range = 9000 65000
# 套接字接收缓冲区大小的缺省值
net.core.rmem_default = 1048576
# 套接字发送缓冲区大小的缺省值
net.core.wmem_default = 262144
# 套接字发送缓冲区大小的最大值
net.core.wmem_max = 1048576' >> /etc/sysctl.conf
# 使参数生效
# sysctl -p
3、安装依赖包
# yum install gcc gcc-c++ zlib-devel readline-devel perl-ExtUtils-Embed pam-devel openssl openssl-devel cmake libxslt-devel libxml2-devel openldap-devel python-devel tcl tcl-devel bison flex xmlto -y
4、创建数据库目录
# mkdir /data
# mkdir /data/postgres12.2
# chown -R postgres: /data
5、创建环境变量
# su - postgres
$ echo 'export PGPORT=5432
export PG_HOME=/data/postgres12.2
export PATH=$PG_HOME/bin:$PATH
export PGDATA=$PG_HOME/data
export LD_LIBRARY_PATH=$PG_HOME/lib
export LANG=en_US.utf8' >> ~/.bash_profile && source~/.bash_profile
6、下载源码包,并解压
链接1:https://ftp.postgresql.org/pub/source
链接2:https://www.postgresql.org/ftp/source/
# 安装postgresql-12.2版本
# 下载postgresql-12.2源码包
$ cd /data
$ wget https://ftp.postgresql.org/pub/source/v12.2/postgresql-12.2.tar.bz2
# 解压源码包
$ tar -xf postgresql-12.2.tar.bz2
7、编译源码包
$ ./configure --prefix=/data/postgres12.2 --with-pgport=5432 --with-openssl --with-perl \
--with-tcl --with-python --with-pam --without-ldap --with-libxml --with-libxslt \
--enable-thread-safety --with-wal-blocksize=16 --with-blocksize=8
# 包括第三方插件全部编译,包含文档和所有的contirb
$ gmake world
8、安装数据库
$ make && make install
9、设置wal日志目录
$ # 用于保存wal日志
$ mkdir $PG_HOME/arch
设置互信
1、设置主机名
# keep-postgres12-node1
# echo '192.168.122.87 keep-postgres12-node1
192.168.122.88 keep-postgres12-node2
192.168.122.90 keep-postgres12-node3' >> /etc/hosts
# keep-postgres12-node2
# echo '192.168.122.87 keep-postgres12-node1
192.168.122.88 keep-postgres12-node2
192.168.122.90 keep-postgres12-node3' >> /etc/hosts
# keep-postgres12-node3
# echo '192.168.122.87 keep-postgres12-node1
192.168.122.88 keep-postgres12-node2
192.168.122.90 keep-postgres12-node3' >> /etc/hosts
2、设置postgres用户互信
# keep-postgres12-node1
$ ssh-keygen
$ ssh-copy-id keep-postgres12-node1
$ ssh-copy-id keep-postgres12-node2
$ ssh-copy-id keep-postgres12-node3
# 验证
$ ssh keep-postgres12-node1
$ ssh keep-postgres12-node2
$ ssh keep-postgres12-node3
# keep-postgres12-node2
$ ssh-keygen
$ ssh-copy-id keep-postgres12-node1
$ ssh-copy-id keep-postgres12-node2
$ ssh-copy-id keep-postgres12-node3
# 验证
$ ssh keep-postgres12-node1
$ ssh keep-postgres12-node2
$ ssh keep-postgres12-node3
# keep-postgres12-node3
$ ssh-keygen
$ ssh-copy-id keep-postgres12-node1
$ ssh-copy-id keep-postgres12-node2
$ ssh-copy-id keep-postgres12-node3
# 验证
$ ssh keep-postgres12-node1
$ ssh keep-postgres12-node2
$ ssh keep-postgres12-node3
三、修改主节点设置
1、创建同步用户
# 数据库需要在线状态
$ psql -c "create role replrole login replication encrypted password 'ReplRole@123';"
2、设置$PGDATA/pg_hba.conf
$ echo 'hostreplicationreplrolekeep-postgres12-node1trust
hostreplicationreplrolekeep-postgres12-node2trust
hostreplicationreplrolekeep-postgres12-node3trust
' >> $PGDATA/pg_hba.conf
3、修改数据库参数
$ echo "wal_level = replica
max_wal_senders=20
wal_keep_segments =64
# 开启归档
archive_mode = on
archive_command = 'cp %p /data/postgres12.2/arch/%f'
restore_command = 'cp /data/postgres12.2/arch/%f %p'
recovery_target_timeline = 'latest'
# full_page_writes是控制是否开启全页写入
full_page_writes = on
# 将每个磁盘页的全部内容写入到WAL
wal_log_hints = on
# 配置同步,此时pg2的优先级比pg3的要高
# 一个节点的sync_state为sync,剩下的是potential
synchronous_standby_names = 'standby_pg2,standby_pg3'
# 默认值为 on
# 可以设置为remote_write,对主库性能有利
# 开启同步模式,此刻需要强同步,数据无法入库
synchronous_commit = on
" >> $PGDATA/postgresql.conf
4、重启数据库
$ pg_ctl restart
5、synchronous_standby_names参数详解
# 该参数指定流复制中需要同步复制的服务器列表,需要配置standby服务器的名字
# 情况一:(无顺序要求)
# 只有这三台备机会被选为同步备机,即其中一个会被选为活动的同步备机,其余两台将处于备用状态。
synchronous_standby_names = 'standby_pg1,standby_pg2,standby_pg3'
# 设置WAL日志强同步至N个节点中的某M个节点
# 情况二:(无顺序要求)
# 只有这三台备机会被选为同步备机,即其中一个会被选为活动的同步备机,其余两台将处于备用状态。
synchronous_standby_names = 'ANY 1 (standby_pg1,standby_pg2,standby_pg3)'
# 情况三:(无顺序要求)
# 只有这三台备机会被选为同步备机,即其中两个会被选为活动的同步备机,其余一台将处于备用状态。
synchronous_standby_names = 'ANY 2 (standby_pg1,standby_pg2,standby_pg3)'
# 情况四:(按照指定节点的顺序)
# 从括号中列出的备机中按顺序选择前两台作为同步备机,选择不了两台节点时,则会无法操作。
synchronous_standby_names = 'FIRST 2 (standby_pg1,standby_pg2,standby_pg3)'
四、修改备节点设置
node2 设置
1、进行数据恢复
$ pg_basebackup -h keep-postgres12-node1 -p 5432 -U replrole -R -F p -P -D $PG_HOME/data
2、修改standby.signal设置
$ cd $PGDATA
$ cat standby.signal
primary_conninfo = 'host=keep-postgres12-node1 port=5432 user=replrole password=ReplRole@123 application_name=standby_pg2 options=''-c wal_sender_timeout=5000'''
restore_command = 'cp /data/postgres12.2/arch/%f %p'
archive_cleanup_command = 'pg_archivecleanup /data/postgres12.2/arch %r'
standby_mode = on
3、修改postgresql.auto.conf设置
# 注意:强同步必要注意在postgresql.auto.conf中加入application_name=standby_pg2
$ cat postgresql.auto.conf
primary_conninfo = 'user=replrole passfile=''/home/postgres/.pgpass'' host=''keep-postgres12-node1'' application_name=standby_pg2 port=5432 sslmode=prefer sslcompression=0 gssencmode=disable krbsrvname=postgres target_session_attrs=any'
node3 设置
1、进行数据恢复
$ pg_basebackup -h keep-postgres12-node1 -p 5432 -U replrole -R -F p -P -D $PG_HOME/data
2、修改standby.signal设置
$ cd $PGDATA
$ cat standby.signal
primary_conninfo = 'host=keep-postgres12-node1 port=5432 user=replrole password=ReplRole@123 application_name=standby_pg3 options=''-c wal_sender_timeout=5000'''
restore_command = 'cp /data/postgres12.2/arch/%f %p'
archive_cleanup_command = 'pg_archivecleanup /data/postgres12.2/arch %r'
standby_mode = on
3、修改postgresql.auto.conf设置
# 注意:强同步必要注意在postgresql.auto.conf中加入application_name=standby_pg3
$ cat postgresql.auto.conf
primary_conninfo = 'user=replrole passfile=''/home/postgres/.pgpass'' host=''keep-postgres12-node1'' application_name=standby_pg3 port=5432 sslmode=prefer sslcompression=0 gssencmode=disable krbsrvname=postgres target_session_attrs=any'
五、启动数据同步
# 注意:主库修改synchronous_standby_names = ‘standby_pg2’ 后,需等候备库吸收主库发送的WAL日志流并写入WAL文件,之后才向客户端返回成功。
1、主库查抄点
$ pg_controldata
pg_control version number: 1201
Catalog version number: 201909212
Database system identifier: 7435168474920575893
Database cluster state: `in production`
pg_control last modified: Tue 10 Dec 2024 04:23:13 PM CST
Latest checkpoint location: 0/2E000060
Latest checkpoint's REDO location: 0/2E000028
Latest checkpoint's REDO WAL file: 00000007000000000000002E
Latest checkpoint's TimeLineID: 7
Latest checkpoint's PrevTimeLineID: 7
Latest checkpoint's full_page_writes: on
Latest checkpoint's NextXID: 0:640
Latest checkpoint's NextOID: 24913
Latest checkpoint's NextMultiXactId:1
Latest checkpoint's NextMultiOffset:0
Latest checkpoint's oldestXID: 480
Latest checkpoint's oldestXID's DB: 1
Latest checkpoint's oldestActiveXID:640
Latest checkpoint's oldestMultiXid: 1
Latest checkpoint's oldestMulti's DB: 1
Latest checkpoint's oldestCommitTsXid:0
Latest checkpoint's newestCommitTsXid:0
Time of latest checkpoint: Tue 10 Dec 2024 04:23:13 PM CST
Fake LSN counter for unlogged rels: 0/3E8
Minimum recovery ending location: 0/0
Min recovery ending loc's timeline: 0
Backup start location: 0/0
Backup end location: 0/0
End-of-backup record required: no
wal_level setting: replica
wal_log_hints setting: on
max_connections setting: 100
max_worker_processes setting: 8
max_wal_senders setting: 20
max_prepared_xacts setting: 0
max_locks_per_xact setting: 64
track_commit_timestamp setting: off
Maximum data alignment: 8
Database block size: 8192
Blocks per segment of large relation: 131072
WAL block size: 16384
Bytes per WAL segment: 16777216
Maximum length of identifiers: 64
Maximum columns in an index: 32
Maximum size of a TOAST chunk: 1996
Size of a large-object chunk: 2048
Date/time type storage: 64-bit integers
Float4 argument passing: by value
Float8 argument passing: by value
Data page checksum version: 1
Mock authentication nonce: 2fd610c9c82cf604c47448c0e7c842aa9ac4944e8fea828fcc0d2425b6f3a6e4
2、启动备库
node2
$ pg_ctl start
node3
$ pg_ctl start
3、查看备库日志
node2
$ tail -f postgresql-2024-12-10.csv
2024-12-10 16:20:45.667 CST,,,24030,,6757f9dd.5dde,1,,2024-12-10 16:20:45 CST,,0,LOG,00000,"ending log output to stderr",,"Future log output will go to log destination ""csvlog"".",,,,,,"PostmasterMain, postmaster.c:1300",""
2024-12-10 16:20:45.672 CST,,,24032,,6757f9dd.5de0,1,,2024-12-10 16:20:45 CST,,0,LOG,00000,"database system was interrupted; last known up at 2024-12-10 16:13:44 CST",,,,,,,,"StartupXLOG, xlog.c:6269",""
2024-12-10 16:20:45.894 CST,,,24032,,6757f9dd.5de0,2,,2024-12-10 16:20:45 CST,,0,LOG,00000,"entering standby mode",,,,,,,,"StartupXLOG, xlog.c:6324",""
2024-12-10 16:20:45.906 CST,,,24032,,6757f9dd.5de0,3,,2024-12-10 16:20:45 CST,1/0,0,LOG,00000,"redo starts at 0/2B000060",,,,,,,,"StartupXLOG, xlog.c:7037",""
2024-12-10 16:20:45.907 CST,,,24032,,6757f9dd.5de0,4,,2024-12-10 16:20:45 CST,1/0,0,LOG,00000,"consistent recovery state reached at 0/2B000138",,,,,,,,"CheckRecoveryConsistency, xlog.c:7880",""
2024-12-10 16:20:45.908 CST,,,24030,,6757f9dd.5dde,2,,2024-12-10 16:20:45 CST,,0,LOG,00000,"database system is ready to accept read only connections",,,,,,,,"sigusr1_handler, postmaster.c:5153",""
2024-12-10 16:20:45.923 CST,,,24046,,6757f9dd.5dee,1,,2024-12-10 16:20:45 CST,,0,LOG,00000,"started streaming WAL from primary at 0/2C000000 on timeline 7",,,,,,,,"WalReceiverMain, walreceiver.c:371",""
node3
$ tail -f postgresql-2024-12-10.csv
2024-12-10 16:24:45.339 CST,,,25094,,6757facd.6206,1,,2024-12-10 16:24:45 CST,,0,LOG,00000,"ending log output to stderr",,"Future log output will go to log destination ""csvlog"".",,,,,,"PostmasterMain, postmaster.c:1300",""
2024-12-10 16:24:45.344 CST,,,25096,,6757facd.6208,1,,2024-12-10 16:24:45 CST,,0,LOG,00000,"database system was interrupted; last known up at 2024-12-10 16:23:13 CST",,,,,,,,"StartupXLOG, xlog.c:6269",""
2024-12-10 16:24:45.637 CST,,,25096,,6757facd.6208,2,,2024-12-10 16:24:45 CST,,0,LOG,00000,"entering standby mode",,,,,,,,"StartupXLOG, xlog.c:6324",""
2024-12-10 16:24:45.649 CST,,,25096,,6757facd.6208,3,,2024-12-10 16:24:45 CST,1/0,0,LOG,00000,"redo starts at 0/2E000028",,,,,,,,"StartupXLOG, xlog.c:7037",""
2024-12-10 16:24:45.651 CST,,,25096,,6757facd.6208,4,,2024-12-10 16:24:45 CST,1/0,0,LOG,00000,"consistent recovery state reached at 0/2E000100",,,,,,,,"CheckRecoveryConsistency, xlog.c:7880",""
2024-12-10 16:24:45.652 CST,,,25094,,6757facd.6206,2,,2024-12-10 16:24:45 CST,,0,LOG,00000,"database system is ready to accept read only connections",,,,,,,,"sigusr1_handler, postmaster.c:5153",""
2024-12-10 16:24:45.672 CST,,,25104,,6757facd.6210,1,,2024-12-10 16:24:45 CST,,0,LOG,00000,"started streaming WAL from primary at 0/2F000000 on timeline 7",,,,,,,,"WalReceiverMain, walreceiver.c:371",""
4、查抄数据库历程
# keep-postgres12-node1
$ ps -ef | grep postgres
postgres 26423 10 16:17 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 26424 264230 16:17 ? 00:00:00 postgres: logger
postgres 26426 264230 16:17 ? 00:00:00 postgres: checkpointer
postgres 26427 264230 16:17 ? 00:00:00 postgres: background writer
postgres 26428 264230 16:17 ? 00:00:00 postgres: walwriter
postgres 26429 264230 16:17 ? 00:00:00 postgres: autovacuum launcher
postgres 26430 264230 16:17 ? 00:00:00 postgres: archiver last was 00000007000000000000002E.00000028.backup
postgres 26431 264230 16:17 ? 00:00:00 postgres: stats collector
postgres 26432 264230 16:17 ? 00:00:00 postgres: logical replication launcher
postgres 27747 264230 16:20 ? 00:00:00` postgres: walsender replrole 192.168.122.89(54134) streaming 0/2F000148`
postgres 28509 264230 16:24 ? 00:00:00 `postgres: walsender replrole 192.168.122.90(62200) streaming 0/2F000148`
# keep-postgres12-node2
$ ps -ef | grep postgres
postgres 24030 10 16:20 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 24031 240300 16:20 ? 00:00:00 postgres: logger
postgres 24032 240300 16:20 ? 00:00:00 postgres: startup recovering 00000007000000000000002F
postgres 24042 240300 16:20 ? 00:00:00 postgres: checkpointer
postgres 24043 240300 16:20 ? 00:00:00 postgres: background writer
postgres 24045 240300 16:20 ? 00:00:00 postgres: stats collector
postgres 24046 240300 16:20 ? 00:00:00 `postgres: walreceiver streaming 0/2F000148`
# keep-postgres12-node3
$ ps -ef | grep postgres
postgres 25094 10 16:24 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 25095 250940 16:24 ? 00:00:00 postgres: logger
postgres 25096 250940 16:24 ? 00:00:00 postgres: startup recovering 00000007000000000000002F
postgres 25100 250940 16:24 ? 00:00:00 postgres: checkpointer
postgres 25101 250940 16:24 ? 00:00:00 postgres: background writer
postgres 25103 250940 16:24 ? 00:00:00 postgres: stats collector
postgres 25104 250940 16:24 ? 00:00:00 `postgres: walreceiver streaming 0/2F000148`
六、验证数据同步
1、主库查抄同步状态
一节点sync_state 为sync,一节点为potential
$ psql
postgres=# \x
Expanded display is on.
postgres=#SELECT * FROM pg_stat_replication;
-[ RECORD 1 ]----+------------------------------
pid | 27747
usesysid | 16642
usename | replrole
application_name | standby_pg2
client_addr | 192.168.122.89
client_hostname| keep-postgres12-node2
client_port | 54134
backend_start | 2024-12-10 16:20:45.918235+08
backend_xmin |
state | streaming
sent_lsn | 0/2F000148
write_lsn | 0/2F000148
flush_lsn | 0/2F000148
replay_lsn | 0/2F000148
write_lag |
flush_lag |
replay_lag |
sync_priority | 1
sync_state | sync
reply_time | 2024-12-10 16:36:26.891383+08
-[ RECORD 2 ]----+------------------------------
pid | 29698
usesysid | 16642
usename | replrole
application_name | standby_pg3
client_addr | 192.168.122.90
client_hostname| keep-postgres12-node3
client_port | 62202
backend_start | 2024-12-10 16:31:09.071909+08
backend_xmin |
state | streaming
sent_lsn | 0/2F000148
write_lsn | 0/2F000148
flush_lsn | 0/2F000148
replay_lsn | 0/2F000148
write_lag |
flush_lag |
replay_lag |
sync_priority | 2
sync_state | potential
reply_time | 2024-12-10 16:36:29.798595+08
2、测试数据同步
-- 主库执行
drop table if exists employees_multi_sync CASCADE;
CREATE TABLE employees_multi_sync (
id SERIAL PRIMARY KEY,
name VARCHAR(100),
position VARCHAR(100),
department VARCHAR(100),
hire_date DATE
);
INSERT INTO employees_multi_sync (name, position, department, hire_date)
SELECT
'Employee ' || (generate_series(1, 200))::text,
'Position ' || (generate_series(1, 200))::text,
'Department ' || (random() *(200-1)+1)::text,
'2010-01-01'::date + (generate_series(1, 200) * interval '1 day');
-- 备库查询数据
select count(1) from employees_multi_sync ;
-- 计算两边的md5值
SELECT md5(
string_agg(
id::text || '-' ||
name || '-' ||
position || '-' ||
department || '-' ||
TO_CHAR(hire_date, 'YYYY-MM-DD'),',')
) AS row_md5
FROM employees_multi_sync;
七、主备切换
1、停主库
$ pg_ctl stop -m fast
2、主从切换
备库升主
# 切换之后,$PGDATA下原有的 standby.signal 文件不存在了
$ pg_ctl promote
查看状态
$ pg_controldata | grep -i cluster
Database cluster state: `in production`
3、查抄原备库的设置(现主)
查抄设置文件postgresql.auto.conf
$ cat postgresql.auto.conf
# Do not edit this file manually!
# It will be overwritten by the ALTER SYSTEM command.
# 检查是否进行注释,配置还是当作备库,自相矛盾
# primary_conninfo = 'user=replrole passfile=''/home/postgres/.pgpass'' host=''keep-postgres12-node1'' application_name=standby_pg2 port=5432 sslmode=prefer sslcompression=0 gssencmode=disable krbsrvname=postgres target_session_attrs=any'
查抄设置文件postgresql.conf
# 修改synchronous_standby_names
$ vim postgresql.conf
synchronous_standby_names = 'standby_pg1,standby_pg3'
查抄状态
# 此处进程应为 walwriter,表示还是备机状态。若修改了postgresql.auto.conf,需重启数据库
$ ps -ef | grep postgres
postgres 25288 10 Dec02 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 25289 252880 Dec02 ? 00:00:00 postgres: logger
postgres 25294 252880 Dec02 ? 00:00:02 postgres: checkpointer
postgres 25295 252880 Dec02 ? 00:00:02 postgres: background writer
postgres 25296 252880 Dec02 ? 00:00:00 postgres: stats collector
postgres 29058 252880 16:11 ? 00:00:00 postgres: `walwriter`
postgres 29059 252880 16:11 ? 00:00:00 postgres: autovacuum launcher
postgres 29060 252880 16:11 ? 00:00:00 postgres: archiver last was 000000030000000000000020.partial
postgres 29061 252880 16:11 ? 00:00:00 postgres: logical replication launcher
$ pg_ctl restart
# 重启数据库后,备机状态为未连接,不会启动walsender
$ ps -ef | grep postgres
postgres 32261 10 16:20 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 32262 322610 16:20 ? 00:00:00 postgres: logger
postgres 32264 322610 16:20 ? 00:00:00 postgres: checkpointer
postgres 32265 322610 16:20 ? 00:00:00 postgres: background writer
postgres 32266 322610 16:20 ? 00:00:00 postgres: walwriter
postgres 32267 322610 16:20 ? 00:00:00 postgres: autovacuum launcher
postgres 32268 322610 16:20 ? 00:00:00 postgres: archiver
postgres 32269 322610 16:20 ? 00:00:00 postgres: stats collector
postgres 32270 322610 16:20 ? 00:00:00 postgres: logical replication launcher
4、备库设置standby.signal文件
node1
# standby.signal文件需要新建
$ cat standby.signal
# 添加以下内容
primary_conninfo = 'host=keep-postgres12-node2 port=5432 user=replrole password=ReplRole@123 application_name=standby_pg1 options=''-c wal_sender_timeout=5000'''
restore_command = 'cp /data/postgres12.2/arch/%f %p'
archive_cleanup_command = 'pg_archivecleanup /data/postgres12.2/arch %r'
standby_mode = on
node3
# standby.signal文件需要新建
$ cat standby.signal
# 修改host为keep-postgres12-node2
primary_conninfo = 'host=keep-postgres12-node2 port=5432 user=replrole password=ReplRole@123 application_name=standby_pg3 options=''-c wal_sender_timeout=5000'''
restore_command = 'cp /data/postgres12.2/arch/%f %p'
archive_cleanup_command = 'pg_archivecleanup /data/postgres12.2/arch %r'
standby_mode = on
5、备库修改设置文件postgresql.auto.conf
node1
$ cat postgresql.auto.conf
# 添加以下内容
primary_conninfo = 'user=replrole passfile=''/home/postgres/.pgpass'' host=''keep-postgres12-node2'' application_name=standby_pg2 port=5432 sslmode=prefer sslcompression=0 gssencmode=disable krbsrvname=postgres target_session_attrs=any'
node3
$ cat postgresql.auto.conf
# 修改host为keep-postgres12-node2
primary_conninfo = 'user=replrole passfile=''/home/postgres/.pgpass'' host=''keep-postgres12-node2'' application_name=standby_pg3 port=5432 sslmode=prefer sslcompression=0 gssencmode=disable krbsrvname=postgres target_session_attrs=any'
6、启动备库
node1
$ pg_ctl restart
node3
$ pg_ctl restart
7、查抄数据库历程
# 备库 node1
$ ps -ef | grep postgres
postgres 440 10 16:52 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 441 4400 16:52 ? 00:00:00 postgres: logger
postgres 442 4400 16:52 ? 00:00:00 postgres: startup recovering 000000080000000000000032
postgres 446 4400 16:52 ? 00:00:00 postgres: checkpointer
postgres 447 4400 16:52 ? 00:00:00 postgres: background writer
postgres 448 4400 16:52 ? 00:00:00 postgres: stats collector
postgres 449 4400 16:52 ? 00:00:00 `postgres: walreceiver streaming 0/320001C0`
# 主库
$ ps -ef | grep postgres
postgres 25853 10 16:47 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 25854 258530 16:47 ? 00:00:00 postgres: logger
postgres 25856 258530 16:47 ? 00:00:00 postgres: checkpointer
postgres 25857 258530 16:47 ? 00:00:00 postgres: background writer
postgres 25858 258530 16:47 ? 00:00:00 postgres: walwriter
postgres 25859 258530 16:47 ? 00:00:00 postgres: autovacuum launcher
postgres 25860 258530 16:47 ? 00:00:00 postgres: archiver
postgres 25861 258530 16:47 ? 00:00:00 postgres: stats collector
postgres 25862 258530 16:47 ? 00:00:00 postgres: logical replication launcher
postgres 26311 258530 16:52 ? 00:00:00 `postgres: walsender replrole 192.168.122.90(16994) streaming 0/320001C0`
postgres 26312 258530 16:52 ? 00:00:00 `postgres: walsender replrole 192.168.122.87(18939) streaming 0/320001C0`
# 备库 node3
$ ps -ef | grep postgres
postgres 31097 10 16:52 ? 00:00:00 /data/postgres12.2/bin/postgres
postgres 31098 310970 16:52 ? 00:00:00 postgres: logger
postgres 31099 310970 16:52 ? 00:00:00 postgres: startup recovering 000000080000000000000032
postgres 31103 310970 16:52 ? 00:00:00 postgres: checkpointer
postgres 31104 310970 16:52 ? 00:00:00 postgres: background writer
postgres 31105 310970 16:52 ? 00:00:00 postgres: stats collector
postgres 31106 310970 16:52 ? 00:00:00 `postgres: walreceiver streaming 0/320001C0`
8、查抄同步状态
$ psql
postgres=# \x
Expanded display is on.
postgres=# SELECT * FROM pg_stat_replication;
-[ RECORD 1 ]----+------------------------------
pid | 26311
usesysid | 16642
usename | replrole
application_name | standby_pg3
client_addr | 192.168.122.90
client_hostname| keep-postgres12-node3
client_port | 16994
backend_start | 2024-12-10 16:52:56.465729+08
backend_xmin |
state | streaming
sent_lsn | 0/32036768
write_lsn | 0/32036768
flush_lsn | 0/32036768
replay_lsn | 0/32036768
write_lag |
flush_lag |
replay_lag |
sync_priority | 2
sync_state | potential
reply_time | 2024-12-10 16:56:27.053456+08
-[ RECORD 2 ]----+------------------------------
pid | 26312
usesysid | 16642
usename | replrole
application_name | standby_pg1
client_addr | 192.168.122.87
client_hostname| keep-postgres12-node1
client_port | 18939
backend_start | 2024-12-10 16:52:58.753138+08
backend_xmin |
state | streaming
sent_lsn | 0/32036768
write_lsn | 0/32036768
flush_lsn | 0/32036768
replay_lsn | 0/32036768
write_lag |
flush_lag |
replay_lag |
sync_priority | 1
sync_state | sync
reply_time | 2024-12-10 16:56:27.047169+08
9、验证是否同步
-- 主库执行
drop table if exists employees_multi_sync1 CASCADE;
CREATE TABLE employees_multi_sync1 (
id SERIAL PRIMARY KEY,
name VARCHAR(100),
position VARCHAR(100),
department VARCHAR(100),
hire_date DATE
);
INSERT INTO employees_multi_sync1 (name, position, department, hire_date)
SELECT
'Employee ' || (generate_series(1, 200))::text,
'Position ' || (generate_series(1, 200))::text,
'Department ' || (random() *(200-1)+1)::text,
'2010-01-01'::date + (generate_series(1, 200) * interval '1 day');
-- 备库查询数据
select count(1) from employees_multi_sync1 ;
-- 计算两边的md5值
SELECT md5(
string_agg(
id::text || '-' ||
name || '-' ||
position || '-' ||
department || '-' ||
TO_CHAR(hire_date, 'YYYY-MM-DD'),',')
) AS row_md5
FROM employees_multi_sync1;
八、pg_rewind 工具
当主备集群中的备库不测瓦解,且颠末长时间,归档日志又被删除了,必要把这段时间的增量数据同步回来,那么就可以用 到pg_rewind 工具进行同步。pg_rewind 使一个PostgreSQL 数据目录与另一个数据目录一致。
$ pg_rewind --target-pgdata $PGDATA --source-server='host=keep-postgres12-node2 port=5432 user=keep password=keep dbname=postgres' -P
pg_rewind帮助下令
$ pg_rewind --help
pg_rewind resynchronizes a PostgreSQL cluster with another copy of the cluster.
Usage:
pg_rewind ...
Options:
-D, --target-pgdata=DIRECTORYexisting data directory to modify
--source-pgdata=DIRECTORYsource data directory to synchronize with
--source-server=CONNSTR source server to synchronize with
-n, --dry-run stop before modifying anything
-N, --no-sync do not wait for changes to be written
safely to disk
-P, --progress write progress messages
--debug write a lot of debug messages
-V, --version output version information, then exit
-?, --help show this help, then exit
免责声明:如果侵犯了您的权益,请联系站长,我们会及时删除侵权内容,谢谢合作!更多信息从访问主页:qidao123.com:ToB企服之家,中国第一个企服评测及商务社交产业平台。
页:
[1]