Tracy–Widom distribution

The Tracy–Widom distribution is a

random Hermitian matrix. The distribution is defined as a Fredholm determinant

In practical terms, Tracy–Widom is the crossover function between the two phases of weakly versus strongly coupled components in a system.[1] It also appears in the distribution of the length of the

Kardar-Parisi-Zhang equation,^[3] in current fluctuations of the asymmetric simple exclusion process (ASEP) with step initial condition,^[4] and in simplified mathematical models of the behavior of the longest common subsequence problem on random inputs.^[5] See Takeuchi & Sano (2010) and Takeuchi et al. (2011)

for experimental testing (and verifying) that the interface fluctuations of a growing droplet (or substrate) are described by the TW distribution

F_{2}

(or

F_{1}

) as predicted by Prähofer & Spohn (2000).

The distribution $F_{1}$ is of particular interest in multivariate statistics.^[6] For a discussion of the universality of $F_{\beta }$ , $\beta =1,2,4$ , see Deift (2007). For an application of $F_{1}$ to inferring population structure from genetic data see Patterson, Price & Reich (2006). In 2017 it was proved that the distribution F is not infinitely divisible.^[7]

Definition as a law of large numbers

Let $F_{\beta }$ denote the cumulative distribution function of the Tracy–Widom distribution with given $\beta$ . It can be defined as a law of large numbers, similar to the central limit theorem.

There are typically three Tracy–Widom distributions, $F_{\beta }$ , with $\beta \in \{1,2,4\}$ . They correspond to the three gaussian ensembles: orthogonal ( $\beta =1$ ), unitary ( $\beta =2$ ), and symplectic ( $\beta =4$ ).

In general, consider a gaussian ensemble with beta value $\beta$ , with its diagonal entries having variance 1, and off-diagonal entries having variance $\sigma ^{2}$ , and let $F_{N,\beta }(s)$ be probability that an $N\times N$ matrix sampled from the ensemble have maximal eigenvalue $\leq s$ , then define^[8]

F_{\beta }(x)=\lim _{N\to \infty }F_{N,\beta }(\sigma (2N^{1/2}+N^{-1/6}x))=\lim _{N\to \infty }Pr(N^{1/6}(\lambda _{max}/\sigma -2N^{1/2})\leq x)

where

\lambda _{\max }

denotes the largest eigenvalue of the random matrix. The shift by

2\sigma N^{1/2}

centers the distribution, since at the limit, the eigenvalue distribution converges to the semicircular distribution with radius

2\sigma N^{1/2}

. The multiplication by

N^{1/6}

is used because the standard deviation of the distribution scales as

N^{-1/6}

(first derived in ^[9]).

For example:^[10]

F_{2}(x)=\lim _{N\to \infty }\operatorname {Prob} \left((\lambda _{\max }-{\sqrt {4N}})N^{1/6}\leq x\right),

where the matrix is sampled from the gaussian unitary ensemble with off-diagonal variance $1$ .

The definition of the Tracy–Widom distributions $F_{\beta }$ may be extended to all $\beta >0$ (Slide 56 in Edelman (2003), Ramírez, Rider & Virág (2006)).

One may naturally ask for the limit distribution of second-largest eigenvalues, third-largest eigenvalues, etc. They are known.^[11]^[8]

Functional forms

Fredholm determinant

$F_{2}$ can be given as the Fredholm determinant

F_{2}(s)=\det(I-A_{s})=1+\sum _{n=1}^{\infty }{\frac {(-1)^{n}}{n!}}\int _{(s,\infty )^{n}}\det _{i,j=1,...,n}[A_{s}(x_{i},x_{j})]dx_{1}\cdots dx_{n}

of the kernel $A_{s}$ ("Airy kernel") on square integrable functions on the half line $(s,\infty )$ , given in terms of Airy functions Ai by

A_{s}(x,y)={\begin{cases}{\frac {\mathrm {Ai} (x)\mathrm {Ai} '(y)-\mathrm {Ai} '(x)\mathrm {Ai} (y)}{x-y}}\quad {\text{if }}x\neq y\\Ai'(x)^{2}-x(Ai(x))^{2}\quad {\text{if }}x=y\end{cases}}

Painlevé transcendents

$F_{2}$ can also be given as an integral

F_{2}(s)=\exp \left(-\int _{s}^{\infty }(x-s)q^{2}(x)\,dx\right)

in terms of a solution

Painlevé equation

of type II

q^{\prime \prime }(s)=sq(s)+2q(s)^{3}\,

with boundary condition ${\textstyle \displaystyle q(s)\sim {\textrm {Ai}}(s),s\to \infty .}$ This function $q$ is a

Painlevé transcendent

.

Other distributions are also expressible in terms of the same $q$ :^[10]

{\begin{aligned}F_{1}(s)&=\exp \left(-{\frac {1}{2}}\int _{s}^{\infty }q(x)\,dx\right)\,\left(F_{2}(s)\right)^{1/2}\\F_{4}(s/{\sqrt {2}})&=\cosh \left({\frac {1}{2}}\int _{s}^{\infty }q(x)\,dx\right)\,\left(F_{2}(s)\right)^{1/2}.\end{aligned}}

Functional equations

Define

{\begin{aligned}F(x)&=\exp \left(-{\frac {1}{2}}\int _{x}^{\infty }(y-x)q(y)^{2}\,dy\right)\\E(x)&=\exp \left(-{\frac {1}{2}}\int _{x}^{\infty }q(y)\,dy\right)\end{aligned}}

then^[8]

F_{1}(x)=E(x)F(x),\quad F_{2}(x)=F(x)^{2},\quad \quad F_{4}\left({\frac {x}{\sqrt {2}}}\right)={\frac {1}{2}}\left(E(x)+{\frac {1}{E(x)}}\right)F(x)

Occurrences

Other than in random matrix theory, the Tracy–Widom distributions occur in many other probability problems.^[12]

Let $l_{n}$ be the length of the longest increasing subsequence in a random permutation sampled uniformly from $S_{n}$ , the permutation group on n elements. Then the cumulative distribution function of ${\frac {l_{n}-2N^{1/2}}{N^{1/6}}}$ converges to $F_{2}$ .^[13]

Asymptotics

Probability density function

Let $f_{\beta }(x)=F_{\beta }'(x)$ be the probability density function for the distribution, then^[12]

f_{\beta }(x)\sim {\begin{cases}e^{-{\frac {\beta }{24}}|x|^{3}},\quad x\to -\infty \\e^{-{\frac {2\beta }{3}}|x|^{3/2}},\quad x\to +\infty \end{cases}}

In particular, we see that it is severely skewed to the right: it is much more likely for

\lambda _{max}

to be much larger than

2\sigma {\sqrt {N}}

than to be much smaller. This could be intuited by seeing that the limit distribution is the semicircle law, so there is "repulsion" from the bulk of the distribution, forcing

\lambda _{max}

to be not much smaller than

2\sigma {\sqrt {N}}

.

At the $x\to -\infty$ limit, a more precise expression is (equation 49 ^[12])

f_{\beta }(x)\sim \tau _{\beta }|x|^{(\beta ^{2}+4-6\beta )/16\beta }\exp \left[-\beta {\frac {|x|^{3}}{24}}+{\sqrt {2}}{\frac {\beta -2}{6}}|x|^{3/2}\right]

for some positive number

\tau _{\beta }

that depends on

\beta

.

Cumulative distribution function

At the $x\to +\infty$ limit,^[14]

{\begin{aligned}F(x)&=1-{\frac {e^{-{\frac {4}{3}}x^{3/2}}}{32\pi x^{3/2}}}{\biggl (}1-{\frac {35}{24x^{3/2}}}+{\cal {O}}(x^{-3}){\biggr )},\\E(x)&=1-{\frac {e^{-{\frac {2}{3}}x^{3/2}}}{4{\sqrt {\pi }}x^{3/2}}}{\biggl (}1-{\frac {41}{48x^{3/2}}}+{\cal {O}}(x^{-3}){\biggr )}\end{aligned}}

and at the

x\to -\infty

limit,

{\begin{aligned}F(x)&=2^{1/48}e^{{\frac {1}{2}}\zeta ^{\prime }(-1)}{\frac {e^{-{\frac {1}{24}}|x|^{3}}}{|x|^{1/16}}}\left(1+{\frac {3}{2^{7}|x|^{3}}}+O(|x|^{-6})\right)\\E(x)&={\frac {1}{2^{1/4}}}e^{-{\frac {1}{3{\sqrt {2}}}}|x|^{3/2}}{\Biggl (}1-{\frac {1}{24{\sqrt {2}}|x|^{3/2}}}+{\cal {O}}(|x|^{-3}){\Biggr )}.\end{aligned}}

where

\zeta

is the Riemann zeta function, and

\zeta '(-1)=-0.1654211437

.

This allows derivation of $x\to \pm \infty$ behavior of $F_{\beta }$ . For example,

{\begin{aligned}1-F_{2}(x)&={\frac {1}{32\pi x^{3/2}}}e^{-4x^{3/2}/3}(1+O(x^{-3/2})),\\F_{2}(-x)&={\frac {2^{1/24}e^{\zeta ^{\prime }(-1)}}{x^{1/8}}}e^{-x^{3}/12}{\biggl (}1+{\frac {3}{2^{6}x^{3}}}+O(x^{-6}){\biggr )}.\end{aligned}}

Painlevé transcendent

The Painlevé transcendent has asymptotic expansion at $x\to -\infty$ (equation 4.1 of ^[15])

q(x)={\sqrt {-{\frac {x}{2}}}}\left(1+{\frac {1}{8}}x^{-3}-{\frac {73}{128}}x^{-6}+{\frac {10657}{1024}}x^{-9}+O(x^{-12})\right)

This is necessary for numerical computations, as the

q\sim {\sqrt {-x/2}}

solution is unstable: any deviation from it tends to drop it to the

q\sim -{\sqrt {-x/2}}

branch instead.^[16]

Numerics

Numerical techniques for obtaining numerical solutions to the Painlevé equations of the types II and V, and numerically evaluating eigenvalue distributions of random matrices in the beta-ensembles were first presented by Edelman & Persson (2005) using MATLAB. These approximation techniques were further analytically justified in Bejan (2005) and used to provide numerical evaluation of Painlevé II and Tracy–Widom distributions (for $\beta =1,2,4$ ) in S-PLUS. These distributions have been tabulated in Bejan (2005) to four significant digits for values of the argument in increments of 0.01; a statistical table for p-values was also given in this work. Bornemann (2010) gave accurate and fast algorithms for the numerical evaluation of $F_{\beta }$ and the density functions $f_{\beta }(s)=dF_{\beta }/ds$ for $\beta =1,2,4$ . These algorithms can be used to compute numerically the

excess kurtosis

of the distributions

F_{\beta }

.^[17]

$\beta$	Mean	Variance	Skewness	Excess kurtosis
1	−1.2065335745820	1.607781034581	0.29346452408	0.1652429384
2	−1.771086807411	0.8131947928329	0.224084203610	0.0934480876
4	−2.306884893241	0.5177237207726	0.16550949435	0.0491951565

Functions for working with the Tracy–Widom laws are also presented in the R package 'RMTstat' by Johnstone et al. (2009) and MATLAB package 'RMLab' by Dieng (2006).

For a simple approximation based on a shifted gamma distribution see Chiani (2014).

Shen & Serkh (2022) developed a spectral algorithm for the eigendecomposition of the integral operator $A_{s}$ , which can be used to rapidly evaluate Tracy–Widom distributions, or, more generally, the distributions of the $k$ th largest level at the soft edge scaling limit of Gaussian ensembles, to machine accuracy.

Tracy-Widom and KPZ universality

The Tracy-Widom distribution appears as a limit distribution in the universality class of the

KPZ equation

. For example it appears under

t^{1/3}

scaling of the one-dimensional

KPZ equation with fixed time.^[18]

Footnotes

^ Mysterious Statistical Law May Finally Have an Explanation, wired.com 2014-10-27
^ Baik, Deift & Johansson (1999).
^ Sasamoto & Spohn (2010)
^ Johansson (2000); Tracy & Widom (2009)).
^ Majumdar & Nechaev (2005).
^ Johnstone (2007, 2008, 2009).
^ Domínguez-Molina (2017).
^
ISBN 978-90-481-2810-5
.

ISSN 0550-3213
.

^ ^a ^b Tracy & Widom (1996).

ISSN 1687-0247
.

^
S2CID 119122520
.

^ Baik, Deift & Johansson 1999

S2CID 16324715
.

S2CID 13912236
.

ISBN 978-0-387-98931-0
.

S2CID 237903590
.

doi:10.1002/cpa.20347
.

^ called "Hastings–McLeod solution". Published by Hastings, S.P., McLeod, J.B.: A boundary value problem associated with the second Painlevé transcendent and the Korteweg-de Vries equation. Arch. Ration. Mech. Anal. 73, 31–51 (1980)

References

Baik, J.; Deift, P.; Johansson, K. (1999), "On the distribution of the length of the longest increasing subsequence of random permutations", MR 1682248
.

Bornemann, F. (2010), "On the numerical evaluation of distributions in random matrix theory: A review with an invitation to experimental mathematics", Markov Processes and Related Fields, 16 (4): 803–866,
Bibcode:2009arXiv0904.1581B
.

Chiani, M. (2014), "Distribution of the largest eigenvalue for real Wishart and Gaussian random matrices and a simple approximation for the Tracy–Widom distribution", S2CID 15889291
.

Sasamoto, Tomohiro; Spohn, Herbert (2010), "One-Dimensional Kardar-Parisi-Zhang Equation: An Exact Solution and its Universality", Physical Review Letters, 104 (23): 230602,
S2CID 34945972

Deift, P. (2007), "Universality for mathematical and physical systems" (PDF), S2CID 14133017
.

Dieng, Momar (2006), RMLab, a MATLAB package for computing Tracy-Widom distributions and simulating random matrices.

Domínguez-Molina, J.Armando (2017), "The Tracy-Widom distribution is not infinitely divisible", Statistics & Probability Letters, 213 (1): 56–60,
S2CID 119676736
.

Johansson, K. (2000), "Shape fluctuations and random matrices", S2CID 16291076
.

Johansson, K. (2002), "Toeplitz determinants, random growth and determinantal processes" (PDF), Proc.
MR 1957518
.

Johnstone, I. M. (2007), "High dimensional statistical inference and random matrices" (PDF), S2CID 88524958
.

Johnstone, I. M. (2008), "Multivariate analysis and Jacobi ensembles: largest eigenvalue, Tracy–Widom limits and rates of convergence", PMID 20157626
.

Johnstone, I. M. (2009), "Approximate null distribution of the largest root in multivariate analysis", PMID 20526465
.

Majumdar, Satya N.; Nechaev, Sergei (2005), "Exact asymptotic results for the Bernoulli matching model of sequence alignment", Physical Review E, 72 (2): 020901, 4,
S2CID 11390762
.

Patterson, N.; Price, A. L.;
PMID 17194218
.

Prähofer, M.; Spohn, H. (2000), "Universal distributions for growing processes in 1+1 dimensions and random matrices", S2CID 20814566
.

Shen, Z.; Serkh, K. (2022), "On the evaluation of the eigendecomposition of the Airy integral operator", S2CID 233407802
.

Takeuchi, K. A.; Sano, M. (2010), "Universal fluctuations of growing interfaces: Evidence in turbulent liquid crystals", S2CID 19315093

Takeuchi, K. A.; Sano, M.; Sasamoto, T.; Spohn, H. (2011), "Growing interfaces uncover universal fluctuations behind scale invariance", PMID 22355553

S2CID 119690132
.

S2CID 13912236
.

S2CID 17398688

MR 1989209
.

S2CID 14730756
.

Further reading

Bejan, Andrei Iu. (2005), Largest eigenvalues and sample covariance matrices. Tracy–Widom and Painleve II: Computational aspects and realization in S-Plus with applications (PDF), M.Sc. dissertation, Department of Statistics, The University of Warwick.

Edelman, A.; Persson, P.-O. (2005), Numerical Methods for Eigenvalue Distributions of Random Matrices,
Bibcode:2005math.ph...1068E
.

Edelman, A. (2003), Stochastic Differential Equations and Random Matrices, SIAM Applied Linear Algebra.

Ramírez, J. A.; Rider, B.; Virág, B. (2006), "Beta ensembles, stochastic Airy spectrum, and a diffusion", Journal of the American Mathematical Society, 24 (4): 919–944,
S2CID 10226881
.

External links

Kuijlaars, Universality of distribution functions in random matrix theory (PDF).

Tracy, C. A.; Widom, H., The distributions of random matrix theory and their applications (PDF).

Johnstone, Iain; Ma, Zongming; Perry, Patrick; Shahram, Morteza (2009), Package 'RMTstat' (PDF).

At the Far Ends of a New Universal Law, Quanta Magazine

v
t
e
Probability distributions (list)
Discrete
univariate
with finite
support

Benford

Bernoulli

beta-binomial

binomial

categorical

hypergeometric
negative

Poisson binomial

Rademacher

soliton

discrete uniform

Zipf

Zipf–Mandelbrot

with infinite
support

beta negative binomial

Borel

Conway–Maxwell–Poisson

discrete phase-type

Delaporte

extended negative binomial

Flory–Schulz

Gauss–Kuzmin

geometric

logarithmic

mixed Poisson

negative binomial

Panjer

parabolic fractal

Poisson

Skellam

Yule–Simon

zeta

Continuous
univariate
supported on a
bounded interval

arcsine

ARGUS

Balding–Nichols

Bates

beta

beta rectangular

continuous Bernoulli

Irwin–Hall

Kumaraswamy

logit-normal

noncentral beta

PERT

raised cosine

reciprocal

triangular

U-quadratic

uniform

Wigner semicircle

supported on a
semi-infinite
interval

Benini

Benktander 1st kind

Benktander 2nd kind

beta prime

Burr

chi

chi-squared
noncentral

inverse
scaled

Dagum

Davis

Erlang
hyper

exponential
hyperexponential

hypoexponential

logarithmic

F
noncentral

folded normal

Fréchet

gamma
generalized

inverse

gamma/Gompertz

Gompertz
shifted

half-logistic

half-normal

Hotelling's T-squared

inverse Gaussian
generalized

Kolmogorov

Lévy

log-Cauchy

log-Laplace

log-logistic

log-normal

log-t

Lomax

matrix-exponential

Maxwell–Boltzmann

Maxwell–Jüttner

Mittag-Leffler

Nakagami

Pareto

phase-type

Poly-Weibull

Rayleigh

relativistic Breit–Wigner

Rice

truncated normal

type-2 Gumbel

Weibull
discrete

Wilks's lambda

supported
on the whole
real line

Cauchy

exponential power

Fisher's z

Kaniadakis κ-Gaussian

Gaussian q

generalized normal

generalized hyperbolic

geometric stable

Gumbel

Holtsmark

hyperbolic secant

Johnson's S_U

Landau

Laplace
asymmetric

logistic

noncentral t

normal (Gaussian)

normal-inverse Gaussian

skew normal

slash

stable

Student's t

Tracy–Widom

variance-gamma

Voigt

with support
whose type varies

generalized chi-squared

generalized extreme value

generalized Pareto

Marchenko–Pastur

Kaniadakis κ-exponential

Kaniadakis κ-Gamma

Kaniadakis κ-Weibull

Kaniadakis κ-Logistic

Kaniadakis κ-Erlang

q-exponential

q-Gaussian

q-Weibull

shifted log-logistic

Tukey lambda

Mixed
univariate
continuous-
discrete

Rectified Gaussian

Multivariate
(joint)

Discrete:

Ewens

multinomial
Dirichlet

negative

Continuous:

Dirichlet
generalized

multivariate Laplace

multivariate normal

multivariate stable

multivariate t

normal-gamma
inverse

Matrix-valued:

LKJ

matrix normal

matrix t

matrix gamma
inverse

Wishart
normal

inverse

normal-inverse

complex

Directional

Univariate (circular) directional

Circular uniform

univariate von Mises

wrapped normal

wrapped Cauchy

wrapped exponential

wrapped asymmetric Laplace

wrapped Lévy

Bivariate (spherical)

Kent

Bivariate (toroidal)

bivariate von Mises

Multivariate

von Mises–Fisher

Bingham

Degenerate
and singular

Degenerate

Dirac delta function

Singular

Cantor

Families

Circular

compound Poisson

elliptical

exponential

natural exponential

location–scale

maximum entropy

mixture

Pearson

Tweedie

wrapped

Category

Commons

Retrieved from "https://en.wikipedia.org/w/index.php?title=Tracy–Widom_distribution&oldid=1195832418"

[1] Mysterious Statistical Law May Finally Have an Explanation, wired.com 2014-10-27

[FOOTNOTEBaikDeiftJohansson1999-2] Baik, Deift & Johansson (1999).

[3] Sasamoto & Spohn (2010)

[4] Johansson (2000); Tracy & Widom (2009)).

[FOOTNOTEMajumdarNechaev2005-5] Majumdar & Nechaev (2005).

[6] Johnstone (2007, 2008, 2009).

[FOOTNOTEDomínguez-Molina2017-7] Domínguez-Molina (2017).

[:0-8] 
ISBN 978-90-481-2810-5
.

[9] ISSN 0550-3213
.

[FOOTNOTETracyWidom1996-10] Tracy & Widom (1996).

[11] ISSN 1687-0247
.

[:1-13] 
S2CID 119122520
.

[14] Baik, Deift & Johansson 1999

[15] S2CID 16324715
.

[16] S2CID 13912236
.

[17] ISBN 978-0-387-98931-0
.

[18] S2CID 237903590
.

[19] :10.1002/cpa.20347
.

[12] "Hastings–McLeod solution". Published by Hastings, S.P., McLeod, J.B.: A boundary value problem associated with the second Painlevé transcendent and the Korteweg-de Vries equation. Arch. Ration. Mech. Anal. 73, 31–51 (1980)

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

Definition as a law of large numbers

Functional forms

Fredholm determinant

Painlevé transcendents

Functional equations

Occurrences

Asymptotics

Probability density function

Cumulative distribution function

Painlevé transcendent

Numerics

Tracy-Widom and KPZ universality

See also

Footnotes

References

Further reading

External links