[pgpool-general: 4963] PGPool Child Processes constantly dying

Steven Livingstone steven at livz.org
Fri Sep 2 17:54:52 JST 2016


Hi. I have a scenario where i have a connection pooling client
(PGBouncer) making requests to PGPool and the child processes spun up
by PGPool create segmentation faults, increasing in number over time
until the client just gets errors.

I am new to PGPool debugging and gdb in general, so any help much
appreciated! I have turned off just about every advanced option in
PGPool.conf over the last week but still can't get to the bottom of
this.

I did managed to create a backtrace of *something* by attaching to one
of the child processes but in attaching to it, it doesn't die any more
:-S As soon as i detach, it dies and there is a segfault as shown
below (this is a small section ... i just get this constantly over
time). .... i suspect i am missing something in how to trace properly.

Sep  2 08:51:10 data-ha-node-0 pgpool[12573]: [7332-1] 2016-09-02
08:51:10: pid 12573: WARNING:  child process with pid: 31577 was
terminated by segmentation fault
Sep  2 08:51:10 data-ha-node-0 pgpool[12573]: [7333-1] 2016-09-02
08:51:10: pid 12573: LOG:  fork a new child process with pid: 31595
Sep  2 08:51:10 data-ha-node-0 pgpool[31589]: [7331-1] 2016-09-02
08:51:10: pid 31589: LOG:  new connection received
Sep  2 08:51:10 data-ha-node-0 pgpool[31589]: [7331-2] 2016-09-02
08:51:10: pid 31589: DETAIL:  connecting host=10.0.1.8 port=53526
Sep  2 08:51:11 data-ha-node-0 kernel: [69483.710126] pgpool[31589]:
segfault at ac ip 00000000004354a5 sp 00007fffffff91d0 error 4 in
pgpool[400000+132000]
Sep  2 08:51:11 data-ha-node-0 pgpool[12573]: [7334-1] 2016-09-02
08:51:11: pid 12573: WARNING:  child process with pid: 31589 was
terminated by segmentation fault
Sep  2 08:51:11 data-ha-node-0 pgpool[12573]: [7335-1] 2016-09-02
08:51:11: pid 12573: LOG:  fork a new child process with pid: 31601
Sep  2 08:51:11 data-ha-node-0 pgpool[31595]: [7333-1] 2016-09-02
08:51:11: pid 31595: LOG:  new connection received
Sep  2 08:51:11 data-ha-node-0 pgpool[31595]: [7333-2] 2016-09-02
08:51:11: pid 31595: DETAIL:  connecting host=10.0.1.8 port=53528
Sep  2 08:51:11 data-ha-node-0 kernel: [69484.288433] pgpool[31595]:
segfault at ac ip 00000000004354a5 sp 00007fffffff91d0 error 4 in
pgpool[400000+132000]
Sep  2 08:51:12 data-ha-node-0 pgpool[12573]: [7336-1] 2016-09-02
08:51:12: pid 12573: WARNING:  child process with pid: 31595 was
terminated by segmentation fault
Sep  2 08:51:12 data-ha-node-0 pgpool[12573]: [7337-1] 2016-09-02
08:51:12: pid 12573: LOG:  fork a new child process with pid: 31605
Sep  2 08:51:12 data-ha-node-0 pgpool[31586]: [7329-1] 2016-09-02
08:51:12: pid 31586: LOG:  new connection received





(gdb) bt
#0  0x00007ffff717ac33 in select () from /lib/x86_64-linux-gnu/libc.so.6
#1  0x0000000000426529 in wait_for_new_connections
(saddr=0x7fffffff95c0, timeout=0x7fffffff9440, fds=0x7686c0) at
protocol/child.c:1854
#2  do_child (fds=fds at entry=0x7686c0) at protocol/child.c:295
#3  0x00000000004076f5 in fork_a_child (fds=0x7686c0, id=1) at
main/pgpool_main.c:678
#4  0x00000000004080cc in reaper () at main/pgpool_main.c:2263
#5  0x000000000040c2a9 in PgpoolMain
(discard_status=discard_status at entry=0 '\000',
clear_memcache_oidmaps=clear_memcache_oidmaps at entry=0 '\000') at
main/pgpool_main.c:429
#6  0x0000000000405f13 in main (argc=<optimized out>, argv=<optimized
out>) at main/main.c:310

Any ideas?

/steven


More information about the pgpool-general mailing list