[Pgpool-general] [pgPool]

Guillaume Lelarge guillaume at lelarge.info
Thu Feb 24 22:22:04 UTC 2011


Le 24/02/2011 18:17, Micka a écrit :
> Yes it was, Thx !
> 
> Now the replication functionnality work well !
> 
> but when the main database server goes down I got :
> 
> 2011-02-24 18:06:42 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:06:42 DEBUG: pid 1313: health_check: 0 th DB node status: 2
> 2011-02-24 18:06:42 DEBUG: pid 1313: health_check: 1 th DB node status: 2
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: T
> 2011-02-24 18:06:44 DEBUG: pid 1347: num_fileds: 1
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: D
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: C
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: Z
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: T
> 2011-02-24 18:06:44 DEBUG: pid 1347: num_fileds: 1
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: D
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: C
> 2011-02-24 18:06:44 DEBUG: pid 1347: do_query: kind: Z
> 2011-02-24 18:06:52 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:06:52 DEBUG: pid 1313: health_check: 0 th DB node status: 2
> 2011-02-24 18:06:52 DEBUG: pid 1313: health_check: 1 th DB node status: 2
> 2011-02-24 18:06:54 ERROR: pid 1347: pool_flush_it: write failed to backend
> (0). reason: Broken pipe offset: 0 wlen: 39
> 2011-02-24 18:06:54 ERROR: pid 1347: check_replication_time_lag: SELECT
> pg_current_xlog_location() failed
> 2011-02-24 18:07:02 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:07:02 DEBUG: pid 1313: health_check: 0 th DB node status: 2
> 2011-02-24 18:07:02 ERROR: pid 1313: connect_unix_domain_socket_by_port:
> connect() failed: No such file or directory
> 2011-02-24 18:07:02 ERROR: pid 1313: health check failed. 0 th host  at port
> 5432 is down
> 2011-02-24 18:07:02 LOG:   pid 1313: set 0 th backend down status
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler called
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: starting to select
> new master node
> 2011-02-24 18:07:02 LOG:   pid 1313: starting degeneration. shutdown host
> (5432)
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1314
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1315
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1316
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1317
> 
> and :
> 
> 011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1334
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1335
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1336
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1337
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1338
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1339
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1340
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1341
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1342
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1343
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1344
> 2011-02-24 18:07:02 DEBUG: pid 1313: failover_handler: kill 1345
> 2011-02-24 18:07:02 LOG:   pid 1313: execute command:
> /usr/local/etc/failover.sh 0 "" 5432 /usr/local/pgsql/data 1 0 "" 0
> 2011-02-24 18:07:02 DEBUG: pid 1314: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1315: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1316: child received shutdown request signal
> 3
> *touch: cannot touch `/var/log/pgpool/trigger/trigger_file1': Permission
> denied*
> 2011-02-24 18:07:02 LOG:   pid 1313: failover_handler: set new master node:
> 1
> 2011-02-24 18:07:02 DEBUG: pid 1317: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1318: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1319: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1320: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 12178: I am 12178
> 2011-02-24 18:07:02 DEBUG: pid 12179: I am 12179
> 2011-02-24 18:07:02 DEBUG: pid 12180: I am 12180
> 2011-02-24 18:07:02 DEBUG: pid 12177: I am 12177
> 2011-02-24 18:07:02 DEBUG: pid 12181: I am 12181
> 2011-02-24 18:07:02 DEBUG: pid 1322: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 12176: I am 12176
> 
> 011-02-24 18:07:02 DEBUG: pid 12205: I am 12205
> 2011-02-24 18:07:02 DEBUG: pid 1338: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 12206: I am 12206
> 2011-02-24 18:07:02 LOG:   pid 1313: failover done. shutdown host (5432)
> 2011-02-24 18:07:02 DEBUG: pid 1339: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1313: pool_ssl: SSL requested but SSL support
> is not available
> 2011-02-24 18:07:02 DEBUG: pid 1340: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1341: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: auth kind: 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: parameter status data
> received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: backend key data received
> 2011-02-24 18:07:02 DEBUG: pid 1313: s_do_auth: transaction state: I
> 2011-02-24 18:07:02 DEBUG: pid 1342: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1345: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1344: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1343: child received shutdown request signal
> 3
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: T
> 2011-02-24 18:07:02 DEBUG: pid 1313: num_fileds: 1
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: D
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: C
> 2011-02-24 18:07:02 DEBUG: pid 1313: do_query: kind: Z
> 2011-02-24 18:07:02 LOG:   pid 1313: find_primary_node: primary node id is 1
> 
> 
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1340 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1341 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1342 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1343 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1344 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: child 1345 exits with status 0
> 2011-02-24 18:07:02 DEBUG: pid 1313: reap_handler: normally exited
> 2011-02-24 18:07:12 DEBUG: pid 1313: starting health checking
> 2011-02-24 18:07:12 DEBUG: pid 1313: health_check: 0 th DB node status: 3
> 2011-02-24 18:07:12 DEBUG: pid 1313: health_check: 1 th DB node status: 2
> 2011-02-24 18:07:22 DEBUG: pid 1313: starting health checking
> 
> 
> what does it mean ?
> 
> in pgpoolAdmin, it says :
> 
> 5432 Down 0.5005433 Up. Connected. Running as standby server 0.500
> 
> Shouldn't I have the 5433 take the primary role ?
> 

Well, "running as standby server" says it all. It didn't take the
primary role. Reason is that the script couldn't create the trigger file
(/var/log/pgpool/trigger/trigger_file1). So you first need to fix this.


-- 
Guillaume
 http://www.postgresql.fr
 http://dalibo.com


More information about the Pgpool-general mailing list