[pgpool-general: 2619] Re: PGPool watchdog not releasing shared IP

Yugo Nagata nagata at sraoss.co.jp
Tue Mar 11 17:34:13 JST 2014


I'm sorry for forgetting to attach the patch.

On Tue, 11 Mar 2014 17:28:56 +0900
Yugo Nagata <nagata at sraoss.co.jp> wrote:

> Hi,
> 
> On Mon, 10 Mar 2014 10:26:02 -0400
> Chris Mann <chris at walkingthumbs.com> wrote:
> 
> > Hello,
> > 
> >  I’m running the latest stable build of PGPool, 3.3.2 (tokakiboshi).
> > 
> > I am seeing an error in the log about the interface. I’ve set the sticky bit for both ifconfig and arping:
> > 
> > 2014-03-07 15:33:29 LOG:   pid 21735: wd_escalation: escalating to master pgpool
> > WARNING: interface is ignored: Operation not permitted
> 
> This warning is not problem. This appears in my environment but pgpool works well.
> 
> I can't understand why "wd_IP_down: not delegate IP holder" appears.
> Could you try attached patch and send log messages in debug mode for analysis?
> 
> BTW, I found non-ascii characters in your pgpool.conf. For example;
> 
> > backend_hostname0 = ‘host1.our.domain'
>                             ^
> > delegate_IP = ‘192.168.1.158'
>                     ^
> I'm not sure that this relates the issue.
> 
> > 2014-03-07 15:33:31 LOG:   pid 21735: wd_escalation: escalated to master pgpool successfully
> > 2014-03-07 15:33:31 LOG:   pid 21735: wd_init: start watchdog
> > 2014-03-07 15:33:31 LOG:   pid 21735: pgpool-II successfully started. version 3.3.2 (tokakiboshi)
> > 
> > 
> > Here is my pgpool.conf:
> > 
> > # ----------------------------
> > # pgPool-II configuration file
> > # ----------------------------
> > #
> > # This file consists of lines of the form:
> > #
> > #   name = value
> > #
> > # Whitespace may be used.  Comments are introduced with "#" anywhere on a line.
> > # The complete list of parameter names and allowed values can be found in the
> > # pgPool-II documentation.
> > #
> > # This file is read on server startup and when the server receives a SIGHUP
> > # signal.  If you edit the file on a running system, you have to SIGHUP the
> > # server for the changes to take effect, or use "pgpool reload".  Some
> > # parameters, which are marked below, require a server shutdown and restart to
> > # take effect.
> > #
> > 
> > 
> > #------------------------------------------------------------------------------
> > # CONNECTIONS
> > #------------------------------------------------------------------------------
> > 
> > # - pgpool Connection Settings -
> > 
> > listen_addresses = '*'
> >                                    # Host name or IP address to listen on:
> >                                    # '*' for all, '' for no TCP/IP connections
> >                                    # (change requires restart)
> > port = 9999
> >                                    # Port number
> >                                    # (change requires restart)
> > socket_dir = '/tmp'
> >                                    # Unix domain socket path
> >                                    # The Debian package defaults to
> >                                    # /var/run/postgresql
> >                                    # (change requires restart)
> > listen_backlog_multiplier = 2
> >                                    # Set the backlog parameter of listen(2) to
> > 								   # num_init_children * listen_backlog_multiplier.
> >                                    # (change requires restart)
> > 
> > # - pgpool Communication Manager Connection Settings -
> > 
> > pcp_port = 9898
> >                                    # Port number for pcp
> >                                    # (change requires restart)
> > pcp_socket_dir = '/tmp'
> >                                    # Unix domain socket path for pcp
> >                                    # The Debian package defaults to
> >                                    # /var/run/postgresql
> >                                    # (change requires restart)
> > 
> > # - Backend Connection Settings -
> > 
> > backend_hostname0 = ‘host1.our.domain'
> >                                    # Host name or IP address to connect to for backend 0
> > backend_port0 = 5432
> >                                    # Port number for backend 0
> > backend_weight0 = 1
> >                                    # Weight for backend 0 (only in load balancing mode)
> > backend_data_directory0 = '/var/lib/postgresql/9.1/main'
> >                                    # Data directory for backend 0
> > backend_flag0 = 'ALLOW_TO_FAILOVER'
> >                                    # Controls various backend behavior
> >                                    # ALLOW_TO_FAILOVER or DISALLOW_TO_FAILOVER
> > backend_hostname1 = ‘host2.our.domain'
> > backend_port1 = 5432
> > backend_weight1 = 1
> > backend_data_directory1 = '/var/lib/postgresql/9.1./main'
> > backend_flag1 = 'ALLOW_TO_FAILOVER'
> > 
> > # - Authentication -
> > 
> > enable_pool_hba = on
> >                                    # Use pool_hba.conf for client authentication
> > pool_passwd = ''
> >                                    # File name of pool_passwd for md5 authentication.
> >                                    # "" disables pool_passwd.
> >                                    # (change requires restart)
> > authentication_timeout = 60
> >                                    # Delay in seconds to complete client authentication
> >                                    # 0 means no timeout.
> > 
> > # - SSL Connections -
> > 
> > ssl = off
> >                                    # Enable SSL support
> >                                    # (change requires restart)
> > #ssl_key = './server.key'
> >                                    # Path to the SSL private key file
> >                                    # (change requires restart)
> > #ssl_cert = './server.cert'
> >                                    # Path to the SSL public certificate file
> >                                    # (change requires restart)
> > #ssl_ca_cert = ''
> >                                    # Path to a single PEM format file
> >                                    # containing CA root certificate(s)
> >                                    # (change requires restart)
> > #ssl_ca_cert_dir = ''
> >                                    # Directory containing CA root certificate(s)
> >                                    # (change requires restart)
> > 
> > 
> > #------------------------------------------------------------------------------
> > # POOLS
> > #------------------------------------------------------------------------------
> > 
> > # - Pool size -
> > 
> > num_init_children = 32
> >                                    # Number of pools
> >                                    # (change requires restart)
> > max_pool = 4
> >                                    # Number of connections per pool
> >                                    # (change requires restart)
> > 
> > # - Life time -
> > 
> > child_life_time = 300
> >                                    # Pool exits after being idle for this many seconds
> > child_max_connections = 90
> >                                    # Pool exits after receiving that many connections
> >                                    # 0 means no exit
> > connection_life_time = 0
> >                                    # Connection to backend closes after being idle for this many seconds
> >                                    # 0 means no close
> > client_idle_limit = 0
> >                                    # Client is disconnected after being idle for that many seconds
> >                                    # (even inside an explicit transactions!)
> >                                    # 0 means no disconnection
> > 
> > 
> > #------------------------------------------------------------------------------
> > # LOGS
> > #------------------------------------------------------------------------------
> > 
> > # - Where to log -
> > 
> > log_destination = 'stderr'
> >                                    # Where to log
> >                                    # Valid values are combinations of stderr,
> >                                    # and syslog. Default to stderr.
> > 
> > # - What to log -
> > 
> > print_timestamp = on
> >                                    # Print timestamp on each line
> >                                    # (change requires restart)
> > print_user = on
> >                                    # Print user name on each line
> >                                    # (change requires restart)
> > log_connections = on
> >                                    # Log connections
> > log_hostname = on
> >                                    # Hostname will be shown in ps status
> >                                    # and in logs if connections are logged
> > log_statement = on
> >                                    # Log all statements
> > log_per_node_statement = on
> >                                    # Log all statements
> >                                    # with node and backend informations
> > log_standby_delay = 'none'
> >                                    # Log standby delay
> >                                    # Valid values are combinations of always,
> >                                    # if_over_threshold, none
> > 
> > # - Syslog specific -
> > 
> > syslog_facility = 'LOCAL0'
> >                                    # Syslog local facility. Default to LOCAL0
> > syslog_ident = 'pgpool'
> >                                    # Syslog program identification string
> >                                    # Default to 'pgpool'
> > 
> > # - Debug -
> > 
> > debug_level = 0
> >                                    # Debug message verbosity level
> >                                    # 0 means no message, 1 or more mean verbose
> > 
> > 
> > #------------------------------------------------------------------------------
> > # FILE LOCATIONS
> > #------------------------------------------------------------------------------
> > 
> > pid_file_name = '/var/run/pgpool/pgpool.pid'
> >                                    # PID file name
> >                                    # (change requires restart)
> > logdir = '/tmp'
> >                                    # Directory of pgPool status file
> >                                    # (change requires restart)
> > 
> > 
> > #------------------------------------------------------------------------------
> > # CONNECTION POOLING
> > #------------------------------------------------------------------------------
> > 
> > connection_cache = on
> >                                    # Activate connection pools
> >                                    # (change requires restart)
> > 
> >                                    # Semicolon separated list of queries
> >                                    # to be issued at the end of a session
> >                                    # The default is for 8.3 and later
> > reset_query_list = 'ABORT; DISCARD ALL'
> >                                    # The following one is for 8.2 and before
> > #reset_query_list = 'ABORT; RESET ALL; SET SESSION AUTHORIZATION DEFAULT'
> > 
> > 
> > #------------------------------------------------------------------------------
> > # REPLICATION MODE
> > #------------------------------------------------------------------------------
> > 
> > replication_mode = off
> >                                    # Activate replication mode
> >                                    # (change requires restart)
> > replicate_select = off
> >                                    # Replicate SELECT statements
> >                                    # when in replication or parallel mode
> >                                    # replicate_select is higher priority than
> >                                    # load_balance_mode.
> > 
> > insert_lock = on
> >                                    # Automatically locks a dummy row or a table
> >                                    # with INSERT statements to keep SERIAL data
> >                                    # consistency
> >                                    # Without SERIAL, no lock will be issued
> > lobj_lock_table = ''
> >                                    # When rewriting lo_creat command in
> >                                    # replication mode, specify table name to
> >                                    # lock
> > 
> > # - Degenerate handling -
> > 
> > replication_stop_on_mismatch = off
> >                                    # On disagreement with the packet kind
> >                                    # sent from backend, degenerate the node
> >                                    # which is most likely "minority"
> >                                    # If off, just force to exit this session
> > 
> > failover_if_affected_tuples_mismatch = off
> >                                    # On disagreement with the number of affected
> >                                    # tuples in UPDATE/DELETE queries, then
> >                                    # degenerate the node which is most likely
> >                                    # "minority".
> >                                    # If off, just abort the transaction to
> >                                    # keep the consistency
> > 
> > 
> > #------------------------------------------------------------------------------
> > # LOAD BALANCING MODE
> > #------------------------------------------------------------------------------
> > 
> > load_balance_mode = on
> >                                    # Activate load balancing mode
> >                                    # (change requires restart)
> > ignore_leading_white_space = on
> >                                    # Ignore leading white spaces of each query
> > white_function_list = ''
> >                                    # Comma separated list of function names
> >                                    # that don't write to database
> >                                    # Regexp are accepted
> > black_function_list = 'nextval,setval'
> >                                    # Comma separated list of function names
> >                                    # that write to database
> >                                    # Regexp are accepted
> > 
> > 
> > #------------------------------------------------------------------------------
> > # MASTER/SLAVE MODE
> > #------------------------------------------------------------------------------
> > 
> > master_slave_mode = on
> >                                    # Activate master/slave mode
> >                                    # (change requires restart)
> > master_slave_sub_mode = 'stream'
> >                                    # Master/slave sub mode
> >                                    # Valid values are combinations slony or
> >                                    # stream. Default is slony.
> >                                    # (change requires restart)
> > 
> > # - Streaming -
> > 
> > sr_check_period = 5
> >                                    # Streaming replication check period
> >                                    # Disabled (0) by default
> > sr_check_user = ‘rep'
> >                                    # Streaming replication check user
> >                                    # This is neccessary even if you disable streaming
> >                                    # replication delay check by sr_check_period = 0
> > sr_check_password = ‘password'
> >                                    # Password for streaming replication check user
> > delay_threshold = 0
> >                                    # Threshold before not dispatching query to standby node
> >                                    # Unit is in bytes
> >                                    # Disabled (0) by default
> > 
> > # - Special commands -
> > 
> > follow_master_command = ''
> >                                    # Executes this command after master failover
> >                                    # Special values:
> >                                    #   %d = node id
> >                                    #   %h = host name
> >                                    #   %p = port number
> >                                    #   %D = database cluster path
> >                                    #   %m = new master node id
> >                                    #   %H = hostname of the new master node
> >                                    #   %M = old master node id
> >                                    #   %P = old primary node id
> > 								   #   %r = new master port number
> > 								   #   %R = new master database cluster path
> >                                    #   %% = '%' character
> > 
> > 
> > #------------------------------------------------------------------------------
> > # PARALLEL MODE
> > #------------------------------------------------------------------------------
> > 
> > parallel_mode = off
> >                                    # Activates parallel query mode
> >                                    # (change requires restart)
> > pgpool2_hostname = 'bara.blabcake.com'
> >                                    # Set pgpool2 hostname
> >                                    # (change requires restart)
> > 
> > # - System DB info -
> > 
> > #system_db_hostname  = 'localhost'
> >                                    # (change requires restart)
> > #system_db_port = 5432
> >                                    # (change requires restart)
> > #system_db_dbname = 'pgpool'
> >                                    # (change requires restart)
> > #system_db_schema = 'pgpool_catalog'
> >                                    # (change requires restart)
> > #system_db_user = 'pgpool'
> >                                    # (change requires restart)
> > #system_db_password = ''
> >                                    # (change requires restart)
> > 
> > 
> > #------------------------------------------------------------------------------
> > # HEALTH CHECK
> > #------------------------------------------------------------------------------
> > 
> > health_check_period = 5
> >                                    # Health check period
> >                                    # Disabled (0) by default
> > health_check_timeout = 10
> >                                    # Health check timeout
> >                                    # 0 means no timeout
> > health_check_user = 'blagios'
> >                                    # Health check user
> > health_check_password = 'blagiosdb'
> >                                    # Password for health check user
> > health_check_max_retries = 3
> >                                    # Maximum number of times to retry a failed health check before giving up.
> > health_check_retry_delay = 2
> >                                    # Amount of time to wait (in seconds) between retries.
> > 
> > 
> > #------------------------------------------------------------------------------
> > # FAILOVER AND FAILBACK
> > #------------------------------------------------------------------------------
> > 
> > failover_command = 'ssh -i /home/user/sshkey blabcake@%H touch /tmp/postgresql.trigger.5432'
> > 
> > 
> > #'echo host:%h, new master id:%m, old master id:%M > /var/log/pgpool.log'
> >                                    # Executes this command at failover
> >                                    # Special values:
> >                                    #   %d = node id
> >                                    #   %h = host name
> >                                    #   %p = port number
> >                                    #   %D = database cluster path
> >                                    #   %m = new master node id
> >                                    #   %H = hostname of the new master node
> >                                    #   %M = old master node id
> >                                    #   %P = old primary node id
> > 								   #   %r = new master port number
> > 								   #   %R = new master database cluster path
> >                                    #   %% = '%' character
> > #failback_command = 'echo host:%h, new master id:%m, old master id:%M > /var/log/pgpool.log'
> >                                    # Executes this command at failback.
> >                                    # Special values:
> >                                    #   %d = node id
> >                                    #   %h = host name
> >                                    #   %p = port number
> >                                    #   %D = database cluster path
> >                                    #   %m = new master node id
> >                                    #   %H = hostname of the new master node
> >                                    #   %M = old master node id
> >                                    #   %P = old primary node id
> > 								   #   %r = new master port number
> > 								   #   %R = new master database cluster path
> >                                    #   %% = '%' character
> > 
> > fail_over_on_backend_error = on
> >                                    # Initiates failover when reading/writing to the
> >                                    # backend communication socket fails
> >                                    # If set to off, pgpool will report an
> >                                    # error and disconnect the session.
> > 
> > search_primary_node_timeout = 5
> >                                    # Timeout in seconds to search for the
> >                                    # primary node when a failover occurs.
> >                                    # 0 means no timeout, keep searching
> >                                    # for a primary node forever.
> > 
> > #------------------------------------------------------------------------------
> > # ONLINE RECOVERY
> > #------------------------------------------------------------------------------
> > 
> > #recovery_user = ’none'
> >                                    # Online recovery user
> > #recovery_password = ‘'none'
> >                                    # Online recovery password
> > #recovery_1st_stage_command = 'send_failover_trigger'
> >                                    # Executes a command in first stage
> > #recovery_2nd_stage_command = 'pgpool_recovery_pitr'
> >                                    # Executes a command in second stage
> > #recovery_timeout = 90
> >                                    # Timeout in seconds to wait for the
> >                                    # recovering node's postmaster to start up
> >                                    # 0 means no wait
> > #client_idle_limit_in_recovery = 0
> >                                    # Client is disconnected after being idle
> >                                    # for that many seconds in the second stage
> >                                    # of online recovery
> >                                    # 0 means no disconnection
> >                                    # -1 means immediate disconnection
> > 
> > 
> > #------------------------------------------------------------------------------
> > # WATCHDOG
> > #------------------------------------------------------------------------------
> > 
> > # - Enabling -
> > 
> > use_watchdog = on
> >                                     # Activates watchdog
> >                                     # (change requires restart)
> > 
> > # -Connection to up stream servers -
> > 
> > trusted_servers = ''
> >                                     # trusted server list which are used
> >                                     # to confirm network connection
> >                                     # (hostA,hostB,hostC,...)
> >                                     # (change requires restart)
> > ping_path = '/bin'
> >                                     # ping command path
> >                                     # (change requires restart)
> > 
> > # - Watchdog communication Settings -
> > 
> > wd_hostname = ‘pool1.our.domain'
> >                                     # Host name or IP address of this watchdog
> >                                     # (change requires restart)
> > wd_port = 9000
> >                                     # port number for watchdog service
> >                                     # (change requires restart)
> > wd_authkey = ''
> >                                     # Authentication key for watchdog communication
> >                                     # (change requires restart)
> > 
> > # - Virtual IP control Setting -
> > 
> > delegate_IP = ‘192.168.1.158'
> >                                     # delegate IP address
> >                                     # If this is empty, virtual IP never bring up.
> >                                     # (change requires restart)
> > ifconfig_path = '/sbin'
> >                                     # ifconfig command path
> >                                     # (change requires restart)
> > if_up_cmd = 'ifconfig eth0:0 inet $_IP_$ netmask 255.255.255.0'
> >                                     # startup delegate IP command
> >                                     # (change requires restart)
> > if_down_cmd = 'ifconfig eth0:0 down'
> >                                     # shutdown delegate IP command
> >                                     # (change requires restart)
> > 
> > arping_path = '/usr/bin'           # arping command path
> >                                     # (change requires restart)
> > 
> > arping_cmd = 'arping -U $_IP_$ -w 1'
> >                                     # arping command
> >                                     # (change requires restart)
> > 
> > # - Behaivor on escalation Setting -
> > 
> > clear_memqcache_on_escalation = on
> >                                     # Clear all the query cache on shared memory
> >                                     # when standby pgpool escalate to active pgpool
> >                                     # (= virtual IP holder).
> >                                     # This should be off if client connects to pgpool
> >                                     # not using virtual IP.
> >                                     # (change requires restart)
> > wd_escalation_command = ''
> >                                     # Executes this command at escalation on new active pgpool.
> >                                     # (change requires restart)
> > 
> > # - Lifecheck Setting -
> > 
> > # -- common --
> > 
> > wd_lifecheck_method = 'heartbeat'
> >                                     # Method of watchdog lifecheck ('heartbeat' or 'query')
> >                                     # (change requires restart)
> > wd_interval = 10
> >                                     # lifecheck interval (sec) > 0
> >                                     # (change requires restart)
> > 
> > # -- heartbeat mode --
> > 
> > wd_heartbeat_port = 9694
> >                                     # Port number for receiving heartbeat signal
> >                                     # (change requires restart)
> > wd_heartbeat_keepalive = 2
> >                                     # Interval time of sending heartbeat signal (sec)
> >                                     # (change requires restart)
> > wd_heartbeat_deadtime = 30
> >                                     # Deadtime interval for heartbeat signal (sec)
> >                                     # (change requires restart)
> > heartbeat_destination0 = 'rock.blabcake.com'
> >                                     # Host name or IP address of destination 0
> >                                     # for sending heartbeat signal.
> >                                     # (change requires restart)
> > heartbeat_destination_port0 = 9694
> >                                     # Port number of destination 0 for sending
> >                                     # heartbeat signal. Usually this is the
> >                                     # same as wd_heartbeat_port.
> >                                     # (change requires restart)
> > heartbeat_device0 = ''
> >                                     # Name of NIC device (such like 'eth0')
> >                                     # used for sending/receiving heartbeat
> >                                     # signal to/from destination 0.
> >                                     # This works only when this is not empty
> >                                     # and pgpool has root privilege.
> >                                     # (change requires restart)
> > 
> > #heartbeat_destination1 = 'host0_ip2'
> > #heartbeat_destination_port1 = 9694
> > #heartbeat_device1 = ''
> > 
> > # -- query mode --
> > 
> > wd_life_point = 3
> >                                     # lifecheck retry times
> >                                     # (change requires restart)
> > wd_lifecheck_query = 'SELECT 1'
> >                                     # lifecheck query to pgpool from watchdog
> >                                     # (change requires restart)
> > wd_lifecheck_dbname = 'template1'
> >                                     # Database name connected for lifecheck
> >                                     # (change requires restart)
> > wd_lifecheck_user = 'nobody'
> >                                     # watchdog user monitoring pgpools in lifecheck
> >                                     # (change requires restart)
> > wd_lifecheck_password = ''
> >                                     # Password for watchdog user in lifecheck
> >                                     # (change requires restart)
> > 
> > # - Other pgpool Connection Settings -
> > 
> > other_pgpool_hostname0 = ‘pool2.our.domain'
> >                                     # Host name or IP address to connect to for other pgpool 0
> >                                     # (change requires restart)
> > other_pgpool_port0 = 5432
> >                                     # Port number for othet pgpool 0
> >                                     # (change requires restart)
> > other_wd_port0 = 9000
> >                                     # Port number for othet watchdog 0
> >                                     # (change requires restart)
> > #other_pgpool_hostname1 = 'host1'
> > #other_pgpool_port1 = 5432
> > #other_wd_port1 = 9000
> > 
> > 
> > #------------------------------------------------------------------------------
> > # OTHERS
> > #------------------------------------------------------------------------------
> > relcache_expire = 0
> >                                    # Life time of relation cache in seconds.
> >                                    # 0 means no cache expiration(the default).
> >                                    # The relation cache is used for cache the
> >                                    # query result against PostgreSQL system
> >                                    # catalog to obtain various information
> >                                    # including table structures or if it's a
> >                                    # temporary table or not. The cache is
> >                                    # maintained in a pgpool child local memory
> >                                    # and being kept as long as it survives.
> >                                    # If someone modify the table by using
> >                                    # ALTER TABLE or some such, the relcache is
> >                                    # not consistent anymore.
> >                                    # For this purpose, cache_expiration
> >                                    # controls the life time of the cache.
> > relcache_size = 256
> >                                    # Number of relation cache
> >                                    # entry. If you see frequently:
> > 								   # "pool_search_relcache: cache replacement happend"
> > 								   # in the pgpool log, you might want to increate this number.
> > 
> > check_temp_table = on
> >                                    # If on, enable temporary table check in SELECT statements.
> >                                    # This initiates queries against system catalog of primary/master
> > 								   # thus increases load of master.
> > 								   # If you are absolutely sure that your system never uses temporary tables
> > 								   # and you want to save access to primary/master, you could turn this off.
> > 								   # Default is on.
> > 
> > check_unlogged_table = on
> >                                    # If on, enable unlogged table check in SELECT statements.
> >                                    # This initiates queries against system catalog of primary/master
> >                                    # thus increases load of master.
> >                                    # If you are absolutely sure that your system never uses unlogged tables
> >                                    # and you want to save access to primary/master, you could turn this off.
> >                                    # Default is on.
> > 
> > #------------------------------------------------------------------------------
> > # ON MEMORY QUERY MEMORY CACHE
> > #------------------------------------------------------------------------------
> > memory_cache_enabled = off
> > 								   # If on, use the memory cache functionality, off by default
> > memqcache_method = 'shmem'
> > 								   # Cache storage method. either 'shmem'(shared memory) or
> > 								   # 'memcached'. 'shmem' by default
> >                                    # (change requires restart)
> > memqcache_memcached_host = 'localhost'
> > 								   # Memcached host name or IP address. Mandatory if
> > 								   # memqcache_method = 'memcached'.
> > 								   # Defaults to localhost.
> >                                    # (change requires restart)
> > memqcache_memcached_port = 11211
> > 								   # Memcached port number. Mondatory if memqcache_method = 'memcached'.
> > 								   # Defaults to 11211.
> >                                    # (change requires restart)
> > memqcache_total_size = 67108864
> > 								   # Total memory size in bytes for storing memory cache.
> > 								   # Mandatory if memqcache_method = 'shmem'.
> > 								   # Defaults to 64MB.
> >                                    # (change requires restart)
> > memqcache_max_num_cache = 1000000
> > 								   # Total number of cache entries. Mandatory
> > 								   # if memqcache_method = 'shmem'.
> > 								   # Each cache entry consumes 48 bytes on shared memory.
> > 								   # Defaults to 1,000,000(45.8MB).
> >                                    # (change requires restart)
> > memqcache_expire = 0
> > 								   # Memory cache entry life time specified in seconds.
> > 								   # 0 means infinite life time. 0 by default.
> >                                    # (change requires restart)
> > memqcache_auto_cache_invalidation = on
> > 								   # If on, invalidation of query cache is triggered by corresponding
> > 								   # DDL/DML/DCL(and memqcache_expire).  If off, it is only triggered
> > 								   # by memqcache_expire.  on by default.
> >                                    # (change requires restart)
> > memqcache_maxcache = 409600
> > 								   # Maximum SELECT result size in bytes.
> > 								   # Must be smaller than memqcache_cache_block_size. Defaults to 400KB.
> >                                    # (change requires restart)
> > memqcache_cache_block_size = 1048576
> > 								   # Cache block size in bytes. Mandatory if memqcache_method = 'shmem'.
> > 								   # Defaults to 1MB.
> >                                    # (change requires restart)
> > memqcache_oiddir = '/var/log/pgpool/oiddir'
> > 				   				   # Temporary work directory to record table oids
> >                                    # (change requires restart)
> > white_memqcache_table_list = ''
> >                                    # Comma separated list of table names to memcache
> >                                    # that don't write to database
> >                                    # Regexp are accepted
> > black_memqcache_table_list = ''
> >                                    # Comma separated list of table names not to memcache
> >                                    # that don't write to database
> >                                    # Regexp are accepted
> > 
> > 
> > The script I’m using is very simple and it’s set on a cron job:
> > 
> > #!/bin/bash
> > 
> > #Set GATEWAYIPADDRESS to the gateway address of the delegate IP you're using.
> > 
> > ping -c5 GATEWAYIPADDRESS
> > 
> > which just pings the gateway IP address of the floating IP. With out this, PGPool would bring up the floating IP on failover, but no traffic would go over it. After talking with our VSP, they told me I needed to ping the gateway address to fix this. Clunky, yes. But it seems to have worked.
> > 
> > Here’s the cron I’m using with it:
> > 
> > * * * * * sleep 0;/home/user/pgpool2/scripts/gwping
> > * * * * * sleep 30;/home/user/pgpool2/scripts/gwping
> > 
> > 
> > On Mar 9, 2014, at 10:10 PM, Yugo Nagata <nagata at sraoss.co.jp> wrote:
> > 
> > > Hi,
> > > 
> > > On Fri, 7 Mar 2014 11:18:27 -0500
> > > Chris Mann <chris at walkingthumbs.com> wrote:
> > > 
> > >> Hello all,
> > >> 
> > >> I’ve two pgpools, one on it’s own server and one on a database server. When pgpool gets shutdown with a smart shutdown, it seems that pgpool can’t release the shared IP, and this results in hung state between the two, since the app servers are looking to that IP address. What I see in the logs is this:
> > >> 
> > >> 2014-03-06 20:04:31 DEBUG: pid 10686: wd_IP_down: not delegate IP holder
> > >> 
> > >> It’s almost as if both PGPools are trying to grab that delegate IP address.
> > > 
> > > Is there any kind of error message when pgpool starts up? Failure of 
> > > `ifconfig' or `arping' command might cause the wd_IP_down failure.
> > > 
> > > Could you please send pgpool's version, pgpool.conf and log messages?
> > > These would helpful to analyze this.
> > > 
> > >> 
> > >> I’ve worked with my VSP, and according to them, what I need to do is basically run a cron script to ping that IP address before the delegate IP address will float over to the other pgpool server. This is kind if clunky, is there some way to have pgpool perform this as part of it’s watchdogging?
> > > 
> > > I'm not sure what your script does exactly. However, if you want to execute some commands
> > > before ifconfig down VIP, you can specify your custom script to wp_down_cmd and ifconfig_path
> > > option, though I don't recommend this. I think proper configuration would resolve the problem.
> > > 
> > >> 
> > >> Thanks!
> > >> 
> > >> 
> > >> _______________________________________________
> > >> pgpool-general mailing list
> > >> pgpool-general at pgpool.net
> > >> http://www.pgpool.net/mailman/listinfo/pgpool-general
> > > 
> > > 
> > > -- 
> > > Yugo Nagata <nagata at sraoss.co.jp>
> > 
> 
> 
> -- 
> Yugo Nagata <nagata at sraoss.co.jp>


-- 
Yugo Nagata <nagata at sraoss.co.jp>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: escalation_error_all_for_analysis2.patch
Type: text/x-diff
Size: 4959 bytes
Desc: not available
URL: <http://www.sraoss.jp/pipermail/pgpool-general/attachments/20140311/06779990/attachment-0001.bin>


More information about the pgpool-general mailing list