M/M/1 queue

In

Poisson process and job service times have an exponential distribution. The model name is written in Kendall's notation. The model is the most elementary of queueing models^[1] and an attractive object of study as closed-form expressions can be obtained for many metrics of interest in this model. An extension of this model with more than one server is the M/M/c queue

.

Model definition

An M/M/1 queue is a stochastic process whose

state space

is the set {0,1,2,3,...} where the value corresponds to the number of customers in the system, including any currently in service.

Arrivals occur at rate λ according to a

Poisson process

and move the process from state i to i + 1.

Service times have an exponential distribution with rate parameter μ in the M/M/1 queue, where 1/μ is the mean service time.
All arrival times and services times are (usually) assumed to be independent of one another.^[2]
A single server serves customers one at a time from the front of the queue, according to a
first-come, first-served
discipline. When the service is complete the customer leaves the queue and the number of customers in the system reduces by one.
The buffer is of infinite size, so there is no limit on the number of customers it can contain.

The model can be described as a

transition rate matrix

Q={\begin{pmatrix}-\lambda &\lambda \\\mu &-(\mu +\lambda )&\lambda \\&\mu &-(\mu +\lambda )&\lambda \\&&\mu &-(\mu +\lambda )&\lambda &\\&&&&\ddots \end{pmatrix}}

on the state space {0,1,2,3,...}. This is the same continuous time Markov chain as in a

state space

diagram for this chain is as below.

Stationary analysis

The model is considered stable only if λ < μ. If, on average, arrivals happen faster than service completions the queue will grow indefinitely long and the system will not have a stationary distribution. The stationary distribution is the limiting distribution for large values of t.

Various performance measures can be computed explicitly for the M/M/1 queue. We write ρ = λ/μ for the utilization of the buffer and require ρ < 1 for the queue to be stable. ρ represents the average proportion of time which the server is occupied.

The probability that the stationary process is in state i (contains i customers, including those in service) is^[3]^{: 172–173}

\pi _{i}=(1-\rho )\rho ^{i}.\,

Average number of customers in the system

We see that the number of customers in the system is geometrically distributed with parameter 1 − ρ. Thus the average number of customers in the system is ρ/(1 − ρ) and the variance of number of customers in the system is ρ/(1 − ρ)². This result holds for any work conserving service regime, such as processor sharing.^[4]

Busy period of server

The busy period is the time period measured between the instant a customer arrives to an empty system until the instant a customer departs leaving behind an empty system. The busy period has probability density function^[5]^[6]^[7]^[8]

f(t)={\begin{cases}{\frac {1}{t{\sqrt {\rho }}}}e^{-(\lambda +\mu )t}I_{1}(2t{\sqrt {\lambda \mu }})&t>0\\0&{\text{otherwise}}\end{cases}}

where I₁ is a

modified Bessel function of the first kind,^[9] obtained by using Laplace transforms and inverting the solution.^[10]

The Laplace transform of the M/M/1 busy period is given by [11]^[12]^[13]^: 215

\mathbb {E} (e^{-sF})={\frac {1}{2\lambda }}(\lambda +\mu +s-{\sqrt {(\lambda +\mu +s)^{2}-4\lambda \mu }})

which gives the moments of the busy period, in particular the mean is 1/(μ − λ) and variance is given by

{\frac {1+{\frac {\lambda }{\mu }}}{\mu ^{2}(1-{\frac {\lambda }{\mu }})^{3}}}.

Response time

The average response time or sojourn time (total time a customer spends in the system) does not depend on scheduling discipline and can be computed using Little's law as 1/(μ − λ). The average time spent waiting is 1/(μ − λ) − 1/μ = ρ/(μ − λ). The distribution of response times experienced does depend on scheduling discipline.

First-come, first-served discipline

For customers who arrive and find the queue as a stationary process, the response time they experience (the sum of both waiting time and service time) has Laplace transform (μ − λ)/(s + μ − λ)^[14] and therefore probability density function^[15]

f(t)={\begin{cases}(\mu -\lambda )e^{-(\mu -\lambda )t}&t>0\\0&{\text{otherwise.}}\end{cases}}

Processor sharing discipline

In an M/M/1-PS queue there is no waiting line and all jobs receive an equal proportion of the service capacity.^[16] Suppose the single server serves at rate 16 and there are 4 jobs in the system, each job will experience service at rate 4. The rate at which jobs receive service changes each time a job arrives at or departs from the system.^[16]

For customers who arrive to find the queue as a stationary process, the Laplace transform of the distribution of response times experienced by customers was published in 1970,^[16] for which an integral representation is known.^[17] The waiting time distribution (response time less service time) for a customer requiring x amount of service has transform^[3]^: 356

W^{\ast }(s|x)={\frac {(1-\rho )(1-\rho r^{2})e^{-[\lambda (1-r)+s]x}}{(1-\rho r^{2})-\rho (1-r)^{2}e^{-(\mu /r-\lambda r)x}}}

where r is the smaller root of the equation

\lambda r^{2}-(\lambda +\mu +s)r+\mu =0.

The mean response time for a job arriving and requiring amount x of service can therefore be computed as x μ/(μ − λ). An alternative approach computes the same results using a spectral expansion method.^[4]

Transient solution

We can write a probability mass function dependent on t to describe the probability that the M/M/1 queue is in a particular state at a given time. We assume that the queue is initially in state i and write p_k(t) for the probability of being in state k at time t. Then^[2]^[18]

p_{k}(t)=e^{-(\lambda +\mu )t}\left[\rho ^{\frac {k-i}{2}}I_{k-i}(at)+\rho ^{\frac {k-i-1}{2}}I_{k+i+1}(at)+(1-\rho )\rho ^{k}\sum _{j=k+i+2}^{\infty }\rho ^{-j/2}I_{j}(at)\right]

where $i$ is the initial number of customers in the station at time $t=0$ , $\rho =\lambda /\mu$ , $a=2{\sqrt {\lambda \mu }}$ and $I_{k}$ is the

monotone functions.^[19]

Diffusion approximation

When the utilization ρ is close to 1 the process can be approximated by a reflected Brownian motion with drift parameter λ – μ and variance parameter λ + μ. This heavy traffic limit was first introduced by John Kingman.^[20]

References

ISBN 0-87335-181-9
.

^
ISBN 0471491101
.

^ ^a ^b Harrison, Peter; Patel, Naresh M. (1992). Performance Modelling of Communication Networks and Computer Architectures. Addison–Wesley.

^
doi:10.1023/A:1013913827667. Archived from the original
(PDF) on 2006-11-29.

doi:10.1007/BF01157854
.

JSTOR 2237497
.

MR 0097132
.

ISBN 1118211642
.

^ Adan, Ivo. "Course QUE: Queueing Theory, Fall 2003: The M/M/1 system" (PDF). Retrieved 2012-08-06.

ISBN 978-0-691-14062-9
.

ISBN 978-0-387-00211-8
.

doi:10.1007/BF01159399
.

ISBN 0471491101
.

ISBN 3-540-57297-X
.

ISBN 978-0-691-14062-9
.

^
doi:10.1145/321556.321568
.

JSTOR 2101088
.

ISBN 978-1-4612-7029-4
.

doi:10.1007/BF01182933
.

JSTOR 2984229
.

v
t
e
Queueing theory
Single queueing nodes

D/M/1 queue

M/D/1 queue

M/D/c queue

M/M/1 queue
Burke's theorem

M/M/c queue

M/M/∞ queue

M/G/1 queue
Pollaczek–Khinchine formula

Matrix analytic method

M/G/k queue

G/M/1 queue

G/G/1 queue
Kingman's formula

Lindley equation

Fork–join queue

Bulk queue

Arrival processes

Poisson point process

Markovian arrival process

Rational arrival process

Queueing networks

Jackson network
Traffic equations

Gordon–Newell theorem
Mean value analysis

Buzen's algorithm

Kelly network

G-network

BCMP network

Service policies

FIFO

LIFO

Processor sharing

Round-robin

Shortest job next

Shortest remaining time

Key concepts

Continuous-time Markov chain

Kendall's notation

Little's law

Product-form solution
Balance equation

Quasireversibility

Flow-equivalent server method

Arrival theorem

Decomposition method

Beneš method

Limit theorems

Fluid limit

Mean-field theory

Heavy traffic approximation
Reflected Brownian motion

Extensions

Fluid queue

Layered queueing network

Polling system

Adversarial queueing network

Loss network

Retrial queue

Information systems

Data buffer

Erlang (unit)

Erlang distribution

Flow control (data)

Message queue

Network congestion

Network scheduler

Pipeline (software)

Quality of service

Scheduling (computing)

Teletraffic engineering

Category

Discrete time

Bernoulli process

Branching process

Chinese restaurant process

Galton–Watson process

Independent and identically distributed random variables

Markov chain

Moran process

Random walk
Loop-erased

Self-avoiding

Biased

Maximal entropy

Continuous time

Additive process

Bessel process

Birth–death process
pure birth

Brownian motion
Bridge

Excursion

Fractional

Geometric

Meander

Cauchy process

Contact process

Continuous-time random walk

Cox process

Diffusion process

Dyson Brownian motion

Empirical process

Feller process

Fleming–Viot process

Gamma process

Geometric process

Hawkes process

Hunt process

Interacting particle systems

Itô diffusion

Itô process

Jump diffusion

Jump process

Lévy process

Local time

Markov additive process

McKean–Vlasov process

Ornstein–Uhlenbeck process

Poisson process
Compound

Non-homogeneous

Schramm–Loewner evolution

Semimartingale

Sigma-martingale

Stable process

Superprocess

Telegraph process

Variance gamma process

Wiener process

Wiener sausage

Both

Branching process

Galves–Löcherbach model

Gaussian process

Hidden Markov model (HMM)

Markov process

Martingale
Differences

Local

Sub-

Super-

Random dynamical system

Regenerative process

Renewal process

Stochastic chains with memory of variable length

White noise

Fields and other

Dirichlet process

Gaussian random field

Gibbs measure

Hopfield model

Ising model
Potts model

Boolean network

Markov random field

Percolation

Pitman–Yor process

Point process
Cox

Poisson

Random field

Random graph

Time series models

Autoregressive conditional heteroskedasticity (ARCH) model

Autoregressive integrated moving average (ARIMA) model

Autoregressive (AR) model

Autoregressive–moving-average (ARMA) model

Generalized autoregressive conditional heteroskedasticity (GARCH) model

Moving-average (MA) model

Financial models

Binomial options pricing model

Black–Derman–Toy

Black–Karasinski

Black–Scholes

Chan–Karolyi–Longstaff–Sanders (CKLS)

Chen

Constant elasticity of variance (CEV)

Cox–Ingersoll–Ross (CIR)

Garman–Kohlhagen

Heath–Jarrow–Morton (HJM)

Heston

Ho–Lee

Hull–White

Korn-Kreer-Lenssen

LIBOR market

Rendleman–Bartter

SABR volatility

Vašíček

Wilkie

Actuarial models

Bühlmann

Cramér–Lundberg

Risk process

Sparre–Anderson

Queueing models

Bulk

Fluid

Generalized queueing network

M/G/1

M/M/1

M/M/c

Properties

Càdlàg paths

Continuous

Continuous paths

Ergodic

Exchangeable

Feller-continuous

Gauss–Markov

Markov

Mixing

Piecewise-deterministic

Predictable

Progressively measurable

Self-similar

Stationary

Time-reversible

Limit theorems

Central limit theorem

Donsker's theorem

Doob's martingale convergence theorems

Ergodic theorem

Fisher–Tippett–Gnedenko theorem

Large deviation principle

Law of large numbers (weak/strong)

Law of the iterated logarithm

Maximal ergodic theorem

Sanov's theorem

Lévy
)

Inequalities

Burkholder–Davis–Gundy

Doob's martingale

Doob's upcrossing

Kunita–Watanabe

Marcinkiewicz–Zygmund

Tools

Cameron–Martin formula

Convergence of random variables

Doléans-Dade exponential

Doob decomposition theorem

Doob–Meyer decomposition theorem

Doob's optional stopping theorem

Dynkin's formula

Feynman–Kac formula

Filtration

Girsanov theorem

Infinitesimal generator

Itô integral

Itô's lemma

Karhunen–Loève theorem

Kolmogorov continuity theorem

Kolmogorov extension theorem

Lévy–Prokhorov metric

Malliavin calculus

Martingale representation theorem

Optional stopping theorem

Prokhorov's theorem

Quadratic variation

Reflection principle

Skorokhod integral

Skorokhod's representation theorem

Skorokhod space

Snell envelope

Stochastic differential equation
Tanaka

Stopping time

Stratonovich integral

Uniform integrability

Usual hypotheses

Wiener space

Classical

Abstract

Disciplines

Actuarial mathematics

Control theory

Econometrics

Ergodic theory

Extreme value theory (EVT)

Large deviations theory

Mathematical finance

Mathematical statistics

Probability theory

Queueing theory

Renewal theory

Ruin theory

Signal processing

Statistics

Stochastic analysis

Time series analysis

Machine learning

List of topics

Category

Retrieved from "https://en.wikipedia.org/w/index.php?title=M/M/1_queue&oldid=1190905585"

[1] ISBN 0-87335-181-9
.

[Kleinrock-2] 
ISBN 0471491101
.

[harrison-3] Harrison, Peter; Patel, Naresh M. (1992). Performance Modelling of Communication Networks and Computer Architectures. Addison–Wesley.

[guillemin-4] 
doi:10.1023/A:1013913827667. Archived from the original
(PDF) on 2006-11-29.

[5] :10.1007/BF01157854
.

[6] JSTOR 2237497
.

[7] MR 0097132
.

[8] ISBN 1118211642
.

[9] Adan, Ivo. "Course QUE: Queueing Theory, Fall 2003: The M/M/1 system" (PDF). Retrieved 2012-08-06.

[stewart-10] ISBN 978-0-691-14062-9
.

[11] ISBN 978-0-387-00211-8
.

[12] :10.1007/BF01159399
.

[13] ISBN 0471491101
.

[14] ISBN 3-540-57297-X
.

[15] ISBN 978-0-691-14062-9
.

[coffman-16] 
doi:10.1145/321556.321568
.

[17] JSTOR 2101088
.

[18] ISBN 978-1-4612-7029-4
.

[19] :10.1007/BF01182933
.

[20] JSTOR 2984229
.

[1]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[12]

[13]

[14]

[15]

[16]

[17]

[2]

[18]

[19]

[20]