[pgpool-general: 1566] Re: Online Recovery failed with unknown reason

Tatsuo Ishii ishii at postgresql.org
Wed Apr 3 17:21:51 JST 2013


Sounds like a hostname resolution problem. I am sure once I faced
similar problem but my memory fading...

Can you please show us PostgreSQL logs (both server0 and server1)?
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp

> Hi folks,
> 
> I have a setup of two nodes with Streaming Replication, pgpool running on
> both nodes monitoring each other backends. Everything is working fine,
> failover work and oneline recovery work like a charm.
> 
> Here is the problem, while server0 failover to server1, and server1 recover
> server0 had no problem; however, when i failover back, server1 failover to
> server0, server0 is not able to recover server1.
> 
>>From the log on Primary:
> Apr  1 16:14:52 se032c-94-30 pgpool[2456]: starting recovering node 1
> Apr  1 16:14:52 se032c-94-30 pgpool[2456]: starting recovery command:
> "SELECT pgpool_recovery('basebackup.sh', 'se032c-94-31',
> '/opt/postgres/9.2/data')"
> Apr  1 16:14:52 se032c-94-30 postgres[3096]: [2-1]
> [unknown]postgres3096ERROR:  pgpool_recovery failed
> Apr  1 16:14:52 se032c-94-30 postgres[3096]: [2-2]
> [unknown]postgres3096STATEMENT:  SELECT pgpool_recovery('basebackup.sh',
> 'se032c-94-31', '/opt/postgres/9.2/data')
> 
> The exact same script used on both node, so why one work but not the others.
> And from the log, I have no idea why it fails, setting debug=[1,2,3] does
> not help on troubleshooting.
> 
> I then went to manually execute the pgpool_recovery statement, i see the
> following:
> 
> template1=# SELECT pgpool_recovery('basebackup.sh', 'se032c-94-31',
> '/opt/postgres/9.2/data');
> ERROR:  pgpool_recovery failed
> template1=# SELECT pgpool_recovery('basebackup.sh', '10.89.94.31',
> '/opt/postgres/9.2/data');
>  pgpool_recovery
> -----------------
>  t
> (1 row)
> 
> 
> If hostname is used, the command will failed, but IP works fine.
> 
> I double check the hostname is pingable from localhost.
> 
> [root at se032c-94-30 ~]# ping se032c-94-31
> PING se032c-94-31.cisco.com (10.89.94.31) 56(84) bytes of data.
> 64 bytes from se032c-94-31.abc.com (10.89.94.31): icmp_seq=1 ttl=64
> time=0.254 ms
> 64 bytes from se032c-94-31.abc.com (10.89.94.31): icmp_seq=2 ttl=64
> time=0.311 ms
> 
> Please help!
> 
> Thanks~
> Ning


More information about the pgpool-general mailing list