[pgpool-general: 5012] PgPool fails to restart while being master

Thu Sep 22 02:49:03 JST 2016

I am finding this trouble in a two nodes cluster setup. Both nodes have the
same configuration (I can post it if needed). I can restart the standby
node as many times I want without problem.

When I try to restart the master node, it stops correctly but then when it
starts again it brings the following error message (redacted)

Sep 21 17:38:45 srv0.net pgpool: 2016-09-21 17:38:45: pid 1947: FATAL:
 failed to create watchdog receive socket
Sep 21 17:38:45 srv0.net pgpool: 2016-09-21 17:38:45: pid 1947: DETAIL:
 bind on "TCP:9009" failed with reason: "Address already in use"
Sep 21 17:38:45 srv0.net pgpool: 2016-09-21 17:38:45: pid 1940: LOG:
 watchdog child process with pid: 1947 exits with status 768
Sep 21 17:38:45 srv0.net pgpool: 2016-09-21 17:38:45: pid 1940: FATAL:
 watchdog child process exit with fatal error. exiting pgpool-II

Linked is the complete error message:
https://gist.github.com/therobot/dc410f0ccc56e302866329c43670ec38

It seems to me that PgPool fails to close the watchdog socket (9009) on
time and when it started it finds the socket still opened.

The init script sends a -INT signal to PgPool in order to stop it. I have
also tried replacing the killproc signal with 'pgpool stop -m smart' in the
init script without too much luck.

Thanks in advance.

-- 
Jacobo García López de Araujo.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.sraoss.jp/pipermail/pgpool-general/attachments/20160921/1ba25733/attachment.html>