<div dir="rtl"><div dir="ltr">Hi Tatsuo.</div><div dir="ltr">It suddenly happened again during the weekend. This time I got errors in my log : </div><div dir="ltr"><div dir="ltr">-11 18:43:33 - [No Connection] [20902]LOG:  trying connecting to PostgreSQL server on "ptkpl-psgsqldb2:5432" by INET socket</div><div dir="ltr">[[No Connection]]([No Connection]) - 2018-05-11 18:43:33 - [No Connection] [20902]DETAIL:  timed out. retrying...</div><div dir="ltr"><div dir="ltr">11 18:44:03 - [No Connection] [18906]LOG:  failed to connect to PostgreSQL server on "ptkpl-psgsqldb2:5432", getsockopt() detected error "No route to host"</div><div dir="ltr">[[No Connection]]([No Connection]) - 2018-05-11 18:44:03 - [No Connection] [18906]LOG:  received degenerate backend request for node_id: 1 from pid [18906]</div><div dir="ltr"><br></div><div>and the pool keeped looking for the primary "find_primary_node: checking backend no 0/1/2" for  6 minutes. During all this time the primary was up and was working fine. What do you recommend to do ? Only after attaching the primary again everything worked. Why the pool didnt recognizer the primary ? I'm checking with my networking team If there was a network problem but I dont think that it is related.</div><div><br></div><div><br></div><div>Thanks , MARIEL.</div></div></div></div><div class="gmail_extra"><br><div class="gmail_quote"><div dir="ltr">2018-05-06 17:22 GMT+03:00 Tatsuo Ishii <span dir="ltr"><<a href="mailto:ishii@sraoss.co.jp" target="_blank">ishii@sraoss.co.jp</a>></span>:</div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Both "show pool_nodes" and pcp_node_info after all checks the status<br>

on the shared memory area. However the implementation is completely<br>

different; "show pool_nodes" is simpler and it's just a wrapper for<br>

showing the status as SQL. pcp_node_info is a client/server<br>

program. The status is retrieved by pcp server then is sent to pcp<br>

client (pcp_node_info) via pcp protocol.<br>

<br>

Also next time you'd better check the status file to very whether<br>

pcp_node_info tells the truth.<br>

<div class="HOEnZb"><div class="h5"><br>

Best regards,<br>

--<br>

Tatsuo Ishii<br>

SRA OSS, Inc. Japan<br>

English: <a href="http://www.sraoss.co.jp/index_en.php" rel="noreferrer" target="_blank">http://www.sraoss.co.jp/index_<wbr>en.php</a><br>

Japanese:<a href="http://www.sraoss.co.jp" rel="noreferrer" target="_blank">http://www.sraoss.co.<wbr>jp</a><br>

<br>

> No, I didnt check the status via "show pool_nodes". To be honest it<br>

isnt<br>

> the first time it happens. Does there a difference between show_pool_nodes<br>

> and pcp_node info on the deeper level ? I mean I know that show_pool_nodes<br>

> queries a view or a table, what about pcp_node_info ? I dont think that it<br>

> is related to repmgr..<br>

> <br>

> 2018-05-06 16:49 GMT+03:00 Tatsuo Ishii <<a href="mailto:ishii@sraoss.co.jp">ishii@sraoss.co.jp</a>>:<br>

> <br>

>> > Hi,<br>

>> > I have 3 postgres servers (one primary + 2 standbys) that have<br>

>> replciation<br>

>> > configured with repmgr:<br>

>> > pg1 - standby<br>

>> > pg2 - primary<br>

>> > pg3 - standby<br>

>> ><br>

>> > I have also 2 pgpool servers(v 3.7.2 and on each one there is one pool<br>

>> > instance. There isnt any watchdog, instead I have a vip address that<br>

>> > directs the requests to the available pgpool instance. I configured my<br>

>> own<br>

>> > metrics that check the status of the database nodes via the pcp<br>

>> interface.<br>

>> ><br>

>> > Today at 11:25 suddenly I got an alert that both my pgpools saw that the<br>

>> > primary node is down (via pcp). I connected and checked and indeed the<br>

>> > primary was down :<br>

>> > [postgres@pool2 log]$ pcp_node_info -h localhost -U postgres -p 9898 1<br>

>> -w<br>

>> > pg2 5432 2 0.333333 down standby<br>

>> ><br>

>> > I checked it in both pools and the same result. I immediatly attached<br>

>> them<br>

>> > and it worked. I wanted to understand why it happened but I dont see any<br>

>> > error in the logs. I attach the logs of both my pools. Can you help me<br>

>> > identify the problem ?<br>

>><br>

>> No idea. I have never seen PostgreSQL is detached without any trace in<br>

>> pgpool log. Have you seen the node status using "show pool_nodes"? If<br>

>> not, I suspect there's a bug with pcp_node_info. If you tried "show<br>

>> pool_nodes" and saw the same status as pcp_node_info, then I<br>

>> completely lose idea.<br>

>><br>

>> There may be a interaction with repmgr, but I am not familiar with<br>

>> repmgr and this is just a wild guess.<br>

>><br>

>> Best regards,<br>

>> --<br>

>> Tatsuo Ishii<br>

>> SRA OSS, Inc. Japan<br>

>> English: <a href="http://www.sraoss.co.jp/index_en.php" rel="noreferrer" target="_blank">http://www.sraoss.co.jp/index_<wbr>en.php</a><br>

>> Japanese:<a href="http://www.sraoss.co.jp" rel="noreferrer" target="_blank">http://www.sraoss.co.<wbr>jp</a><br>

>><br>

</div></div></blockquote></div><br></div>