[pgpool-general: 5943] Split-brain remedy

Wed Feb 28 03:58:48 JST 2018

Hi All,

I have a 10.0.0.1/10.0.0.2 master/hot standby configuration with streaming
replication, where each node runs pgpool with watchdog enabled and postgres.

I shut down the network interface on 10.0.0.1 and wait until 10.0.0.2
triggers failover and promotes itself to master through my failover script.

Now the watchdogs on 10.0.0.1 and 10.0.0.2 are out of sync, have
conflicting views on which node has failed and both think they are master.

When I bring back the network interface on 10.0.0.1, 'show pool_nodes' says
that 10.0.0.1 is master/up and 10.0.0.2 is standby/down.

I want 10.0.0.1 to be standby and 10.0.0.2 to be master.

I've been playing with the failover script.. e.g.

if (default network gateway is pingable) {
    shut down pgpool and postgres
} else if (this node is standby) {
    promote this node to master
    create a job that will run every minute and try to recover failed node
(base backup)
    cancel the job upon successful recovery
}

Can you please help me with this? Any ideas would be highly appreciated.

Regards, Alex
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.sraoss.jp/pipermail/pgpool-general/attachments/20180227/2f5f1908/attachment.html>