Fork–join queue

In queueing theory, a discipline within the mathematical theory of probability, a fork–join queue is a queue where incoming jobs are split on arrival for service by numerous servers and joined before departure.[1] The model is often used for parallel computations[2] or systems where products need to be obtained simultaneously from different suppliers (in a warehouse or manufacturing setting).[3]: 78–80 The key quantity of interest in this model is usually the time taken to service a complete job. The model has been described as a "key model for the performance analysis of parallel and distributed systems."[4] Few analytical results exist for fork–join queues, but various approximations are known.
The situation where jobs arrive according to a
Definition
On arrival at the fork point, a job is split into N sub-jobs which are served by each of the N servers. After service, sub-job wait until all other sub-jobs have also been processed. The sub-jobs are then rejoined and leave the system.[3]
For the fork–join queue to be stable the input rate must be strictly less than sum of the service rates at the service nodes.[8]
Applications
Fork–join queues have been used to model zoned RAID systems,[9] parallel computations[2] and for modelling order fulfilment in warehouses.[3]
Response time
The response time (or sojourn time[10]) is the total amount of time a job spends in the system.
Distribution
Ko and Serfozo give an approximation for the response time distribution when service times are exponentially distributed and jobs arrive either according to a
Average response time
An exact formula for the average response time is only known in the case of two servers (N=2) with exponentially distributed service times (where each server is an M/M/1 queue). In this situation, the response time (total time a job spends in the system) is[14]
where
- is the utilization.
- is the arrival rate of jobs to all the nodes.
- is the service rate across all the nodes.
In the situation where nodes are M/M/1 queues and N > 2, Varki's modification of mean value analysis can also be used to give an approximate value for the average response time.[15]
For general service times (where each node is an M/G/1 queue) Baccelli and Makowski give bounds for the average response time and higher moments of this quantity both in the transient and steady state situations.[16] Kemper and Mandjes show that for some parameters these bounds are not tight and show demonstrate an approximation technique.[10] For heterogeneous fork-join queues (fork-join queues with different service times), Alomari and Menasce propose an approximation based on harmonic numbers that can be extended to cover more general cases such as probabilistic fork, open and closed fork-join queues.[17]
Subtask dispersion
The subtask dispersion, defined to be the range of service times, can be numerically computed and optimal deterministic delays introduced to minimize the range.[18]
Stationary distribution
In general the
Heavy traffic/diffusion approximation
When the server is heavily loaded (service rate of the queue is only just larger than arrival rate) the queue length process can be approximated by a reflected Brownian motion which converges to the same stationary distribution as the original queueing process.[19][20] Under limiting conditions the state space of the synchronisation queues collapses and all queues behave identically.[21]
Join queue distribution
Once jobs are served, the parts are reassembled at the join queue. Nelson and Tantawi published the distribution of the join queue length in the situation where all servers have the same service rate.[14] Heterogeneous service rates and distribution asymptotic analysis are considered by Li and Zhao.[22]
Networks of fork–join queues
An approximate formula can be used to calculate the response time distribution for a network of fork–join queues joined in series (one after the other).[23]
Split–merge model
A related model is the split–merge model, for which analytical results exist.[2][24] Exact results for the split-merge queue are given by Fiorini and Lipsky.[25] Here on arrival a job is split into N sub-tasks which are serviced in parallel. Only when all the tasks finish servicing and have rejoined can the next job start. This leads to a slower response time on average.
Generalized (n,k) fork-join system
A generalization of the fork-join queueing system is the fork-join system where the job exits the system when any out of tasks are served. The traditional fork-join queueing system is a special case of the system when . Bounds on the mean response time of this generalized system were found by Joshi, Liu and Soljanin.[26][27]
References
- doi:10.1109/12.16501.
- ^ a b c Lebrecht, Abigail; Knottenbelt, William J. (June 2007). Response Time Approximations in Fork-Join Queues (PDF). 23rd Annual UK Performance Engineering Workshop (UKPEW). Archived from the original (PDF) on 29 October 2013. Retrieved 2 July 2009.
- ^ ISBN 978-3-540-89331-8.
- ^ Boxma, Onno; Koole, Ger; Liu, Zhen (1996). Queueing-theoretic Solution Methods for Models of Parallel and Distributed Systems (PDF) (Technical report). CWI. BS-R9425.
- ^ doi:10.1137/0144074.
- S2CID 124774848
- ^ .
- S2CID 120222029. Archived from the original(PDF) on 18 March 2012.
- ISBN 978-3-642-02923-3.
- ^ .
- ^ S2CID 122581916.
- S2CID 119551482.
- .
- ^ doi:10.1109/12.2213.
- ^ Varki, Elizabeth; Merchant, Arif; Chen, H. "M/M/1 Fork-join queue with variable sub-tasks" (PDF). Archived from the original (PDF) on 5 August 2010. Retrieved 29 March 2009.
- ^ Baccelli, François; Makowski, A. (1985), Simple computable bounds for the fork-join queue (PDF), National Institute for Research in Computer Science and Control Technical Report, retrieved 8 July 2011
- S2CID 422296.
- ISBN 978-3-642-40724-6.
- S2CID 206789463.
- ^ Varma, Subir (1990). "Heavy and Light Traffic Approximations for Queues with Synchronization Constraints (PhD thesis)" (PDF). University of Maryland. Retrieved 10 February 2013.
- S2CID 18115820.
- S2CID 124693767.
- ISBN 978-3-540-74468-9.
- ISBN 978-3-540-40814-7.
- S2CID 26219594.
- Bibcode:2012arXiv1210.3012J.
- Bibcode:2013arXiv1305.3945J.