View Issue Details

IDProjectCategoryView StatusLast Update
0000710Pgpool-IIGeneralpublic2021-06-08 14:11
Reporterjose Assigned To 
PriorityimmediateSeverityfeatureReproducibilityalways
Status closedResolutionopen 
Product Version4.2.2 
Summary0000710: No se realiza el failover del servidor PostgreSQL
DescriptionCuando el servidor de PostgreSQL maestro se apaga, el servidor en standby no se levante como servidor maestro. De forma que quedan todos como servidores standby.

 node_id | hostname | port | status | lb_weight | role | select_cnt | load_balance_node | replication_delay | replication_state | replication_sync_state | last_status_change
---------+----------+------+--------+-----------+---------+------------+-------------------+-------------------+-------------------+------------------------+---------------------
 0 | aba1 | 5432 | down | 0.333333 | standby | 11 | false | 0 | | | 2021-05-11 13:12:54
 1 | aba2 | 5432 | up | 0.333333 | standby | 16 | false | 0 | | | 2021-05-11 11:35:56
 2 | aba3 | 5432 | up | 0.333333 | standby | 3 | true | 0 | | | 2021-05-11 11:14:26
(3 rows)
Steps To ReproduceApagar el servidor PostgreSQL master y ver los nodos.
Additional InformationEn todos los servidores me muestra esta información:
postgres@aba3:~$ psql -h 192.168.1.155 -p 9999 -U pgpool postgres -c "show pool_nodes"
Password for user pgpool:
psql: error: server closed the connection unexpectedly
        This probably means the server terminated abnormally
        before or while processing the request.

2021-05-11 13:12:52: pid 39211: LOG: health check retrying on DB node: 0 (round:5)
2021-05-11 13:12:53: pid 39170: LOG: remote node "aba3:9999 Linux aba3" is asking to inform about quarantined backend nodes
2021-05-11 13:12:53: pid 39166: LOG: Pgpool-II parent process received inform quarantine nodes signal from watchdog
2021-05-11 13:12:53: pid 39211: LOG: failed to connect to PostgreSQL server on "aba1:5432", getsockopt() failed
2021-05-11 13:12:53: pid 39211: DETAIL: Operation now in progress
2021-05-11 13:12:53: pid 39211: ERROR: failed to make persistent db connection
2021-05-11 13:12:53: pid 39211: DETAIL: connection to host:"aba1:5432" failed
2021-05-11 13:12:53: pid 39211: LOG: health check failed on node 0 (timeout:0)
2021-05-11 13:12:53: pid 39211: LOG: received degenerate backend request for node_id: 0 from pid [39211]
2021-05-11 13:12:53: pid 39170: LOG: new IPC connection received
2021-05-11 13:12:53: pid 39170: LOG: failover request from local pgpool-II node received on IPC interface is forwarded to leader watchdog node "aba3:9999 Linux aba3"
2021-05-11 13:12:53: pid 39170: DETAIL: waiting for the reply...
2021-05-11 13:12:53: pid 39211: LOG: degenerate backend request for 1 node(s) from pid [39211], will be handled by watchdog
2021-05-11 13:12:54: pid 39166: LOG: Pgpool-II parent process received sync backend signal from watchdog
2021-05-11 13:12:54: pid 39170: LOG: new IPC connection received
2021-05-11 13:12:54: pid 39166: LOG: leader watchdog has performed failover
2021-05-11 13:12:54: pid 39166: DETAIL: syncing the backend states from the LEADER watchdog node
2021-05-11 13:12:54: pid 39170: LOG: new IPC connection received
2021-05-11 13:12:54: pid 39170: LOG: received the get data request from local pgpool-II on IPC interface
2021-05-11 13:12:54: pid 39170: LOG: get data request from local pgpool-II node received on IPC interface is forwarded to leader watchdog node "aba3:9999 Linux aba3"
2021-05-11 13:12:54: pid 39170: DETAIL: waiting for the reply...
2021-05-11 13:12:54: pid 39166: LOG: leader watchdog node "aba3:9999 Linux aba3" returned status for 3 backend nodes
2021-05-11 13:12:54: pid 39166: LOG: backend:0 is set to down status
2021-05-11 13:12:54: pid 39166: DETAIL: backend:0 is DOWN on cluster leader "aba3:9999 Linux aba3"
2021-05-11 13:12:54: pid 39166: LOG: primary node:-1 on leader watchdog node "aba3:9999 Linux aba3" is different from local primary node:0
2021-05-11 13:12:54: pid 39166: LOG: primary node was changed after the sync from "aba3:9999 Linux aba3"
2021-05-11 13:12:54: pid 39166: DETAIL: all children needs to be restarted
2021-05-11 13:12:54: pid 43821: ERROR: Failed to check replication time lag
2021-05-11 13:12:54: pid 43821: DETAIL: No persistent db connection for the node 0
2021-05-11 13:12:54: pid 43821: HINT: check sr_check_user and sr_check_password
2021-05-11 13:12:54: pid 43821: CONTEXT: while checking replication time lag
2021-05-11 13:12:54: pid 43821: LOG: worker process received restart request
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43789 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43790 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43791 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43792 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43793 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43794 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43795 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43796 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43797 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43798 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43799 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43800 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43801 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43802 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43803 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43804 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43805 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43806 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43807 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43808 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43809 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43810 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43811 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43812 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43813 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43814 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43815 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43816 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43817 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43818 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43819 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: child process with pid: 43820 exits with status 0
2021-05-11 13:12:54: pid 39166: LOG: worker child process with pid: 43821 exits with status 256
2021-05-11 13:12:54: pid 39166: LOG: fork a new worker child process with pid: 63968
2021-05-11 13:12:54: pid 63968: LOG: process started

 node_id | hostname | port | status | lb_weight | role | select_cnt | load_balance_node | replication_delay | replication_state | replication_sync_state | last_status_change
---------+----------+------+--------+-----------+---------+------------+-------------------+-------------------+-------------------+------------------------+---------------------
 0 | aba1 | 5432 | down | 0.333333 | standby | 11 | false | 0 | | | 2021-05-11 13:12:54
 1 | aba2 | 5432 | up | 0.333333 | standby | 16 | false | 0 | | | 2021-05-11 11:35:56
 2 | aba3 | 5432 | up | 0.333333 | standby | 3 | true | 0 | | | 2021-05-11 11:14:26
(3 rows)
TagsNo tags attached.

Activities

t-ishii

2021-05-12 17:05

developer   ~0003830

Please submit issues in English.

pengbo

2021-06-08 14:10

developer   ~0003872

No response for more than a month.
Close issue.

Issue History

Date Modified Username Field Change
2021-05-11 22:16 jose New Issue
2021-05-12 17:05 t-ishii Note Added: 0003830
2021-06-08 14:10 pengbo Note Added: 0003872
2021-06-08 14:11 pengbo Status new => closed