View Issue Details
| ID | Project | Category | View Status | Date Submitted | Last Update |
|---|---|---|---|---|---|
| 0000316 | Pgpool-II | Bug | public | 2017-06-23 19:43 | 2017-08-29 09:26 |
| Reporter | Seema | Assigned To | Muhammad Usama | ||
| Priority | urgent | Severity | major | Reproducibility | sometimes |
| Status | closed | Resolution | open | ||
| OS | freebsd | OS Version | FreeBSD 11.0-REL | ||
| Product Version | 3.5.3 | ||||
| Summary | 0000316: One postgresql backend node is marked down often | ||||
| Description | Hi, I am running one pgpool-II with two backend postgresql nodes, configured in 'replication mode' with 'load balancing'. PGPOOL VERSION: pgpool-II version 3.5.3 (ekieboshi), OS VERSION: FreeBSD 11.0-RELEASE-p1 FreeBSD 11.0-RELEASE-p1 #0 r306420 I am seeing that pgpool is marking the 'node 0' down frequently. I am seeing the following messages in the pgpool log. ============================================ Jun 23 14:33:46 apgp1 pgpool: 2017-06-23 14:33:46: pid 744: DEBUG: persistent DB connection to backend node 1 having status 2 is successful Jun 23 14:33:46 apgp1 pgpool: 2017-06-23 14:33:46: pid 744: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:46 apgp1 pgpool: 2017-06-23 14:33:46: pid 744: DEBUG: health check: clearing alarm Jun 23 14:33:46 apgp1 pgpool: 2017-06-23 14:33:46: pid 744: DEBUG: health check: clearing alarm Jun 23 14:33:49 apgp1 pgpool[753]: [7146-1] 2017-06-23 14:33:49: pid 753: DEBUG: pool_read: read -1 bytes from backend 0 Jun 23 14:33:49 apgp1 pgpool[746]: [3372-1] 2017-06-23 14:33:49: pid 746: DEBUG: pool_read: read -1 bytes from backend 0 Jun 23 14:33:49 apgp1 pgpool[746]: [3373-1] 2017-06-23 14:33:49: pid 746: LOG: received degenerate backend request for node_id: 0 from pid [746] Jun 23 14:33:49 apgp1 pgpool[753]: [7147-1] 2017-06-23 14:33:49: pid 753: LOG: received degenerate backend request for node_id: 0 from pid [753] Jun 23 14:33:49 apgp1 pgpool[746]: [3374-1] 2017-06-23 14:33:49: pid 746: DEBUG: sending 30 to the parent process with PID:744 Jun 23 14:33:49 apgp1 pgpool[744]: [21793-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler called Jun 23 14:33:49 apgp1 pgpool[746]: [3375-1] 2017-06-23 14:33:49: pid 746: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[744]: [21794-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[746]: [3376-1] 2017-06-23 14:33:49: pid 746: DEBUG: write on backend 0 failed with error :"Broken pipe" Jun 23 14:33:49 apgp1 pgpool[744]: [21794-2] 2017-06-23 14:33:49: pid 744: DETAIL: starting to select new master node Jun 23 14:33:49 apgp1 pgpool[744]: [21795-1] 2017-06-23 14:33:49: pid 744: LOG: starting degeneration. shutdown host adb1.labs.blr.cherrylabs.in(5432) Jun 23 14:33:49 apgp1 pgpool[747]: [3372-1] 2017-06-23 14:33:49: pid 747: DEBUG: pool_read: read -1 bytes from backend 0 Jun 23 14:33:49 apgp1 pgpool[753]: [7148-1] 2017-06-23 14:33:49: pid 753: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[746]: [3376-2] 2017-06-23 14:33:49: pid 746: DETAIL: while trying to write data from offset: 0 wlen: 5 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: DEBUG: pool_read: read -1 bytes from backend 0 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 753: DEBUG: pool_read: read -1 bytes from backend 0 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: LOG: received degenerate backend request for node_id: 0 from pid [746] Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 753: LOG: received degenerate backend request for node_id: 0 from pid [753] Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: DEBUG: sending 30 to the parent process with PID:744 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler called Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: starting to select new master node Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: LOG: starting degeneration. shutdown host adb1.labs.blr.cherrylabs.in(5432) Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 753: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[746]: [3377-1] 2017-06-23 14:33:49: pid 746: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[753]: [7149-1] 2017-06-23 14:33:49: pid 753: DEBUG: write on backend 0 failed with error :"Broken pipe" Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: DEBUG: write on backend 0 failed with error :"Broken pipe" Jun 23 14:33:49 apgp1 pgpool[753]: [7149-2] 2017-06-23 14:33:49: pid 753: DETAIL: while trying to write data from offset: 0 wlen: 5 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: DETAIL: while trying to write data from offset: 0 wlen: 5 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 753: DEBUG: write on backend 0 failed with error :"Broken pipe" Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 753: DETAIL: while trying to write data from offset: 0 wlen: 5 Jun 23 14:33:49 apgp1 pgpool[753]: [7150-1] 2017-06-23 14:33:49: pid 753: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[747]: [3373-1] 2017-06-23 14:33:49: pid 747: LOG: received degenerate backend request for node_id: 0 from pid [747] Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 753: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 746: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[747]: [3374-1] 2017-06-23 14:33:49: pid 747: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 747: DEBUG: pool_read: read -1 bytes from backend 0 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 747: LOG: received degenerate backend request for node_id: 0 from pid [747] Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 747: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[747]: [3375-1] 2017-06-23 14:33:49: pid 747: DEBUG: write on backend 0 failed with error :"Broken pipe" Jun 23 14:33:49 apgp1 pgpool[747]: [3375-2] 2017-06-23 14:33:49: pid 747: DETAIL: while trying to write data from offset: 0 wlen: 5 Jun 23 14:33:49 apgp1 pgpool[747]: [3376-1] 2017-06-23 14:33:49: pid 747: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 747: DEBUG: write on backend 0 failed with error :"Broken pipe" Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 747: DETAIL: while trying to write data from offset: 0 wlen: 5 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 747: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[744]: [21796-1] 2017-06-23 14:33:49: pid 744: LOG: Restart all children Jun 23 14:33:49 apgp1 pgpool[744]: [21797-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21797-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:746 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: LOG: Restart all children Jun 23 14:33:49 apgp1 pgpool[744]: [21798-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21798-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:747 Jun 23 14:33:49 apgp1 pgpool[744]: [21799-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21799-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:748 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:746 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:747 Jun 23 14:33:49 apgp1 pgpool[748]: [3372-1] 2017-06-23 14:33:49: pid 748: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:748 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 748: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[749]: [652-1] 2017-06-23 14:33:49: pid 749: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[748]: [3373-1] 2017-06-23 14:33:49: pid 748: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[744]: [21800-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[748]: [3374-1] 2017-06-23 14:33:49: pid 748: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[744]: [21800-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:749 Jun 23 14:33:49 apgp1 pgpool[744]: [21801-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[749]: [653-1] 2017-06-23 14:33:49: pid 749: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[749]: [654-1] 2017-06-23 14:33:49: pid 749: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[744]: [21801-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:750 Jun 23 14:33:49 apgp1 pgpool[744]: [21802-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 749: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 748: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 748: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:749 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 749: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:750 Jun 23 14:33:49 apgp1 pgpool[750]: [7501-1] 2017-06-23 14:33:49: pid 750: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[751]: [11-1] 2017-06-23 14:33:49: pid 751: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 749: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool[744]: [21802-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:751 Jun 23 14:33:49 apgp1 pgpool[750]: [7502-1] 2017-06-23 14:33:49: pid 750: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 750: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[750]: [7503-1] 2017-06-23 14:33:49: pid 750: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 751: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21803-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21803-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:752 Jun 23 14:33:49 apgp1 pgpool[744]: [21804-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21804-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:753 Jun 23 14:33:49 apgp1 pgpool[744]: [21805-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 750: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[754]: [3046-1] 2017-06-23 14:33:49: pid 754: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:751 Jun 23 14:33:49 apgp1 pgpool[752]: [11-1] 2017-06-23 14:33:49: pid 752: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21805-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:754 Jun 23 14:33:49 apgp1 pgpool[754]: [3047-1] 2017-06-23 14:33:49: pid 754: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool[754]: [3048-1] 2017-06-23 14:33:49: pid 754: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:752 Jun 23 14:33:49 apgp1 pgpool[755]: [11-1] 2017-06-23 14:33:49: pid 755: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21806-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21806-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:755 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:753 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 750: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 754: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 754: DEBUG: pool_write: to backend: 0 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 752: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:754 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 754: DEBUG: pool_write: to backend: 1 kind:X Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:755 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 755: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[756]: [13-1] 2017-06-23 14:33:49: pid 756: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21807-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21807-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:756 Jun 23 14:33:49 apgp1 pgpool[757]: [11-1] 2017-06-23 14:33:49: pid 757: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 756: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21808-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21808-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:757 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:756 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:757 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 757: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[758]: [11-1] 2017-06-23 14:33:49: pid 758: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21809-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21809-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:758 Jun 23 14:33:49 apgp1 pgpool[759]: [11-1] 2017-06-23 14:33:49: pid 759: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21810-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 758: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:758 Jun 23 14:33:49 apgp1 pgpool[744]: [21810-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:759 Jun 23 14:33:49 apgp1 pgpool[760]: [11-1] 2017-06-23 14:33:49: pid 760: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21811-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 759: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:759 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 760: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21811-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:760 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:760 Jun 23 14:33:49 apgp1 pgpool[761]: [11-1] 2017-06-23 14:33:49: pid 761: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 761: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21812-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21812-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:761 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:761 Jun 23 14:33:49 apgp1 pgpool[762]: [11-1] 2017-06-23 14:33:49: pid 762: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21813-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[744]: [21813-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:762 Jun 23 14:33:49 apgp1 pgpool[763]: [11-1] 2017-06-23 14:33:49: pid 763: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21814-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 762: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:762 Jun 23 14:33:49 apgp1 pgpool[744]: [21814-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:763 Jun 23 14:33:49 apgp1 pgpool[744]: [21815-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[764]: [11-1] 2017-06-23 14:33:49: pid 764: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21815-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:764 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 763: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:763 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 764: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:764 Jun 23 14:33:49 apgp1 pgpool[744]: [21816-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[765]: [11-1] 2017-06-23 14:33:49: pid 765: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21816-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:765 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:765 Jun 23 14:33:49 apgp1 pgpool[766]: [11-1] 2017-06-23 14:33:49: pid 766: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21817-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 765: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 766: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21817-2] 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:766 Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool: 2017-06-23 14:33:49: pid 744: DETAIL: kill process with PID:766 Jun 23 14:33:49 apgp1 pgpool[744]: [21818-1] 2017-06-23 14:33:49: pid 744: DEBUG: failover handler Jun 23 14:33:49 apgp1 pgpool[767]: [3474-1] 2017-06-23 14:33:49: pid 767: LOG: child process received shutdown request signal 3 Jun 23 14:33:49 apgp1 pgpool[744]: [21818-2] 2017-06-23 14:33:49: p ====================================================================================================== I have attached the logs and pgpool.conf file your your investigation. Thanks, Seema | ||||
| Tags | No tags attached. | ||||
|
|
|
|
|
Hi, I would like to add some more information here. The Postgresql DB nodes are behind the firewall. PGPOOL is outside the firewall. Thanks, Seema |
|
|
It seems PostgreSQL backend closed the socket to Pgpool-II. That's the trigger for the fail over. Maybe firewall issue? |
|
|
Hi t-ishii, But why is it always happening to node 0 only? I did think the Firewall may be cutting the connections, so I also tried setting the "connection_life_time = 3000" in pgpool.conf file. Firewall setting for connection timeout is set to 3600. But this did not help. Thanks, Seema |
|
|
Hi, I have the following observations. Today node 1 is marked down. I use the pop_attach_node command to attach the node back and it stays up for about 2 hours before being marked down again. Every time a postgreSql node is marked down by pgpool, I am seeing the following logs . On the postgresql server, log shows: Jun 27 16:31:04 adb2 postgres[81513]: [3-1] LOG: could not receive data from client: Operation timed out Jun 27 16:31:04 adb2 postgres[81513]: [4-1] DEBUG: unexpected EOF on client connection Jun 27 16:31:05 adb2 postgres[81515]: [3-1] LOG: could not receive data from client: Operation timed out Jun 27 16:31:05 adb2 postgres[81514]: [3-1] LOG: could not receive data from client: Operation timed out Jun 27 16:31:05 adb2 postgres[81514]: [4-1] DEBUG: unexpected EOF on client connection Jun 27 16:31:05 adb2 postgres[81515]: [4-1] DEBUG: unexpected EOF on client connection And on the PGPOOL Server, I am able to see the following logs: Jun 27 16:28:39 apgp1 pgpool[14359]: [3534-1] 2017-06-27 16:28:39: pid 14359: DEBUG: pool_read: read -1 bytes from backend 1 Jun 27 16:28:39 apgp1 pgpool[14349]: [3534-1] 2017-06-27 16:28:39: pid 14349: DEBUG: pool_read: read -1 bytes from backend 1 Jun 27 16:28:39 apgp1 pgpool[14349]: [3535-1] 2017-06-27 16:28:39: pid 14349: LOG: received degenerate backend request for node_id: 1 from pid [14349] Jun 27 16:28:39 apgp1 pgpool: 2017-06-27 16:28:39: pid 14349: DEBUG: pool_read: read -1 bytes from backend 1 Thanks, Seema |
|
|
URGENT. Please Help |
|
|
Hello Team, Request your help in resolving this issue. Thanks, Seema |
|
|
Hi Seema, I have analyzed the pgpool. conf and pgpool log files you shared and there doesn't seem anything wrong with Pgpool-II. The Pgpool-II logs and PostgreSQL log messages (pasted upthread) both suggest that the connection from Pgpool-II to PostgreSQL gets killed abruptly, which then because of your Pgpool-II setting ( fail_over_on_backend_error = on) causes the backend node failover. And I suspect this is the firewall which is killing that connection and causing all this. Since you mentioned that you have enabled connection timeout on your firewall, I think that could be causing this connection disconnections and eventually the failover. I think the we can also test this supposition by a simple test, Just by connecting a PSQL or some client with the Pgpool-II and leave it connected and unused for connection timeout ( firewall) amount of time and after that just fire any query. Since the firewall would already have killed that backend connection (after detecting no activity on it for connection timeout time) the Pgpool-II will get a broken pipe error and trigger the failover because of fail_over_on_backend_error = on. This scenario can also happen even if the client is issuing the queries on the connection but somehow all the queries end up landing on single backend and other backend connection remain un-used for connection timeout (firewall) time. So I suggest to disable the connection timeout on firewall side for Pgpool-II to backend connections and to see if it solves the issue. Thanks Best Regards Muhammad Usama |
|
|
Hello Muhammad, Thank you for replying. I will test the FW scenario suggested by you and revert back. I was also curious to know about the first scenario that you have mentioned - "client is issuing the queries on the connection but somehow all the queries end up landing on single backend and other backend connection remain un-used for connection timeout (firewall) time." How can we check whether this is happening? Reg the FW timeout causing PGPOOL-II disconnections. I had set the "connection_life_time = 3000" in pgpool.conf file. Since the FW connection timeout is set to 3600, I was hoping this configuration will force pgpool-II to disconnect all idle connections at 3000 sec and recestablish the connections before the FW will disconnect them. Is my thinking correct? But, it doesn't seem to be working though. Thanks & Regards, Seema |
|
|
You can analyse the Pgpool-II log after enabling "log_per_node_statement" config in pgpool.conf to see which query is routed to which backend node, secondly you can use the ' show pool_nodes' command which outputs the 'select_cnt' for each backend node. http://www.pgpool.net/docs/latest/en/html/sql-show-pool-nodes.html postgres=# show pool_nodes; node_id | hostname | port | status | lb_weight | role | select_cnt | load_balance_node | replication_delay ---------+-----------+------+--------+-----------+---------+------------+-------------------+------------------- 0 | localhost | 5432 | up | 0.500000 | primary | 2 | true | 0 1 | localhost | 5434 | up | 0.500000 | standby | 0 | false | 0 (2 rows) Regarding the connection_life_time in Pgpool-II, It only services the cashed backend connections, and does not affect the active backend connection. (The backend connection for which the frontend client is currently connected is considered as active even if does not have any activity on it). So for your scenario the connection_life_time config will not come into play even for the inactive connections for which the frontend client exists and their backend connections would still be killed by the firewall. |
|
|
Hello Muhammad, Currently we have moved both the DB Nodes and PGPOOL server to the same LAN and testing. As of now it seems to be working fine. We will be moving the DB Nodes across the FW again in couple of days time. I will let you know the status once we have tested it. Regards, Seema |
| Date Modified | Username | Field | Change |
|---|---|---|---|
| 2017-06-23 19:43 | Seema | New Issue | |
| 2017-06-23 19:43 | Seema | File Added: pgpool.conf | |
| 2017-06-23 19:43 | Seema | File Added: pgpool.log.errors-rep | |
| 2017-06-23 21:43 | Seema | Note Added: 0001543 | |
| 2017-06-26 16:11 | t-ishii | Note Added: 0001545 | |
| 2017-06-26 16:13 | t-ishii | Assigned To | => Muhammad Usama |
| 2017-06-26 16:13 | t-ishii | Status | new => assigned |
| 2017-06-26 17:17 | Seema | Note Added: 0001547 | |
| 2017-06-27 20:19 | Seema | Note Added: 0001552 | |
| 2017-06-28 14:45 | Seema | Note Added: 0001557 | |
| 2017-07-03 13:55 | Seema | Note Added: 0001563 | |
| 2017-07-04 00:10 | Muhammad Usama | Note Added: 0001565 | |
| 2017-07-04 14:30 | Seema | Note Added: 0001569 | |
| 2017-07-04 19:53 | Muhammad Usama | Note Added: 0001572 | |
| 2017-07-12 19:41 | Seema | Note Added: 0001581 | |
| 2017-08-29 09:26 | pengbo | Status | assigned => closed |