[Pgpool-general] Several questions about pgpool

Tatsuo Ishii ishii at sraoss.co.jp
Sat Aug 12 12:35:40 UTC 2006


> The whole idea is to break the one-to-one relationship between the client,
> the pgpool child process, and the backend process. If a daemon starts working,
> its queries should be sent to an idle backend connection if one exists, rather
> than waiting for the assigned child to finish processing something time-consuming.
> If there is no idle backend connection, one should be created. This way you keep
> unneeded backend connections to a minimum while still having the advantages of
> pgpool - pooling, load balancing, etc.
> Again, as I imagine it, the client connections and backend connections within
> pgpool should be separated. Each of them should have a set of properties -
> username/database/protocol/whatever. Then at any moment when a client sends a
> query, an idle backend connection with a matching set of properties is used to
> serve it.
> I think this would add some overhead if implemented, but nothing significant,
> and the win on busy systems (from not having to keep hundreds of backends in
> memory) would be well worth it.
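
If I understand correctly, the matching you describe would look roughly
like the sketch below. The names are made up for illustration; this is
not pgpool code.

/* One shared pool of backend connections, handed out to whichever
 * client query arrives next, keyed by the properties that must match. */
#include <string.h>

typedef struct {
    char user[64];
    char database[64];
    int  protocol_version;
    int  in_use;       /* 0 = idle, 1 = currently serving a client */
    int  backend_fd;   /* socket to the PostgreSQL backend */
} BackendConn;

#define POOL_SIZE 128
static BackendConn pool[POOL_SIZE];
static int pool_len = 0;

/* Return an idle backend connection matching the client's properties,
 * or NULL if none exists (the caller would then open a new one). */
BackendConn *find_idle_backend(const char *user, const char *database, int proto)
{
    int i;

    for (i = 0; i < pool_len; i++) {
        BackendConn *c = &pool[i];

        if (!c->in_use &&
            strcmp(c->user, user) == 0 &&
            strcmp(c->database, database) == 0 &&
            c->protocol_version == proto) {
            c->in_use = 1;
            return c;
        }
    }
    return NULL;
}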

To be honest, I doubt your idea makes things any better. In the long
run, connection pools for a particular username/database pair will be
equally distributed among pgpool processes, so trying to find a
matching username/database pair in another process would be a waste
of time. Also, I'm not clear about your definition of "idle
connection". If you mean a connection waiting for the next transaction
from a client that has not yet disconnected, reusing such a connection
would break the concept of a "session" in the database application.

> In no way am I suggesting we abandon the pre-fork architecture. What I'm saying
> is that it would be nice if pgpool started to fork additional children once the
> pre-forked ones are exhausted, or better yet, if it could maintain a number of
> unused children at all times, much like Apache does.
>  
> > Moreover, others may want to prevent an unlimited number of client
> > connections from being made to the backend. To control that, you need a
> > parameter to limit the maximum number of connections made to the backend anyway.
> 
> Configuration parameters (something like "max_backend_connections" and
> "max_client_connections") might be the best choice for this, again much like
> one can do in Apache.
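
For what it's worth, the Apache-like behaviour would amount to a
maintenance loop in the parent process, roughly like the sketch below.
The parameter names and the helper functions here are made up; pgpool
today has only num_init_children.

#include <unistd.h>

#define MIN_SPARE_CHILDREN 5
#define MAX_CHILDREN       200   /* the proposed "max_client_connections" cap */

extern int  count_idle_children(void);   /* would read shared memory */
extern int  count_children(void);
extern void spawn_child(void);

void maintain_children(void)
{
    for (;;) {
        int idle  = count_idle_children();
        int total = count_children();

        /* Fork extra workers only while under the hard limit. */
        while (idle < MIN_SPARE_CHILDREN && total < MAX_CHILDREN) {
            spawn_child();
            idle++;
            total++;
        }
        sleep(1);   /* re-check periodically, as Apache's parent does */
    }
}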

Changing the number of processes in the pool on demand would be nice
(though for a different reason than the one you are suggesting).
Actually, I have wanted to implement that functionality since I started
implementing pgpool (that's why the parameter is named
num_init_children rather than num_children). Currently users have to
set it large enough to cover the maximum expected load, and that hurts
the performance of accepting incoming connections. This is due to the
famous "stampede syndrome".
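
The "stampede syndrome" is the thundering herd problem: with
num_init_children set very high, every idle child sleeps in accept() on
the same listening socket, and each new connection wakes all of them
even though only one can win it. A common mitigation in pre-forking
servers (shown only as a sketch here, not as what pgpool does) is to
serialize accept() with a cross-process lock:

#include <sys/file.h>
#include <sys/socket.h>
#include <stddef.h>

int serialized_accept(int listen_fd, int lock_fd)
{
    int client_fd;

    flock(lock_fd, LOCK_EX);            /* only one child may accept */
    client_fd = accept(listen_fd, NULL, NULL);
    flock(lock_fd, LOCK_UN);

    return client_fd;                   /* -1 on error, same as accept() */
}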

To implement this we need to know whether all child processes are busy,
which requires interprocess communication using shared memory. pgpool
does not have such functionality right now. However, pgpool-II, the
next generation of pgpool, already has it, so implementing on-demand
process forking there would be easy.
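
For illustration, that bookkeeping could look roughly like the sketch
below - a per-child "busy" flag in shared memory that the parent can
scan before deciding to fork more children. This is only an
illustration, not the actual pgpool-II code.

#include <sys/mman.h>

#define NUM_CHILDREN 32

static volatile int *child_busy;   /* one flag per child, in shared memory */

/* Called once in the parent, before forking the children. */
int init_scoreboard(void)
{
    child_busy = mmap(NULL, NUM_CHILDREN * sizeof(int),
                      PROT_READ | PROT_WRITE,
                      MAP_SHARED | MAP_ANONYMOUS, -1, 0);
    return child_busy == MAP_FAILED ? -1 : 0;
}

/* Each child flips its own slot around every client session. */
void mark_busy(int child_id, int busy)
{
    child_busy[child_id] = busy;
}

/* Parent: are all children currently serving a client? */
int all_children_busy(void)
{
    int i;

    for (i = 0; i < NUM_CHILDREN; i++)
        if (!child_busy[i])
            return 0;
    return 1;
}
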
--
Tatsuo Ishii
SRA OSS, Inc. Japan


