Hurst exponent

The Hurst exponent is used as a measure of

Nile river's volatile rain and drought conditions that had been observed over a long period of time.^[1]^[2] The name "Hurst exponent", or "Hurst coefficient", derives from Harold Edwin Hurst

(1880–1978), who was the lead researcher in these studies; the use of the standard notation H for the coefficient also relates to his name.

In

Benoît Mandelbrot (1924–2010).^[3] H is directly related to fractal dimension, D, and is a measure of a data series' "mild" or "wild" randomness.^[4]

The Hurst exponent is referred to as the "index of dependence" or "index of long-range dependence". It quantifies the relative tendency of a time series either to regress strongly to the mean or to cluster in a direction.

power law; for the series it means that a high value tends to be followed by another high value and that future excursions to more high values do occur. A value in the range 0 – 0.5 indicates a time series with long-term switching between high and low values in adjacent pairs, meaning that a single high value will probably be followed by a low value and that the value after that will tend to be high, with this tendency to switch between high and low values lasting a long time into the future, also following a power law. A value of H=0.5 indicates short-memory

, with (absolute) autocorrelations decaying exponentially quickly to zero.

Definition

The Hurst exponent, H, is defined in terms of the asymptotic behaviour of the rescaled range as a function of the time span of a time series as follows;^[6]^[7]

\mathbb {E} \left[{\frac {R(n)}{S(n)}}\right]=Cn^{H}{\text{ as }}n\to \infty \,,

where

$R(n)$ is the range of the first $n$ cumulative deviations from the mean
$S(n)$ is the series (sum) of the first n
standard deviations
$\mathbb {E} \left[x\right]\,$ is the expected value
$n$ is the time span of the observation (number of data points in a time series)
$C$ is a constant.

Relation to Fractal Dimension

For self-similar time series, H is directly related to fractal dimension, D, where 1 < D < 2, such that D = 2 - H. The values of the Hurst exponent vary between 0 and 1, with higher values indicating a smoother trend, less volatility, and less roughness.^[8]

For more general time series or multi-dimensional process, the Hurst exponent and fractal dimension can be chosen independently, as the Hurst exponent represents structure over asymptotically longer periods, while fractal dimension represents structure over asymptotically shorter periods.^[9]

Estimating the exponent

A number of estimators of long-range dependence have been proposed in the literature. The oldest and best-known is the so-called rescaled range (R/S) analysis popularized by Mandelbrot and Wallis^[3]^[10] and based on previous hydrological findings of Hurst.^[1] Alternatives include DFA, Periodogram regression,^[11] aggregated variances,^[12] local Whittle's estimator,^[13] wavelet analysis,^[14]^[15] both in the time domain and frequency domain.

Rescaled range (R/S) analysis

To estimate the Hurst exponent, one must first estimate the dependence of the rescaled range on the time span n of observation.^[7] A time series of full length N is divided into a number of nonoverlapping shorter time series of length n, where n takes values N, N/2, N/4, ... (in the convenient case that N is a power of 2). The average rescaled range is then calculated for each value of n.

For each such time series of length $n$ , $X=X_{1},X_{2},\dots ,X_{n}\,$ , the rescaled range is calculated as follows:^[6]^[7]

Calculate the mean; $m={\frac {1}{n}}\sum _{i=1}^{n}X_{i}\,.$
Create a mean-adjusted series; $Y_{t}=X_{t}-m\quad {\text{ for }}t=1,2,\dots ,n\,.$
Calculate the cumulative deviate series $Z$ ; $Z_{t}=\sum _{i=1}^{t}Y_{i}\quad {\text{ for }}t=1,2,\dots ,n\,.$
Compute the range $R$ ; $R(n)=\operatorname {max} \left(Z_{1},Z_{2},\dots ,Z_{n}\right)-\operatorname {min} \left(Z_{1},Z_{2},\dots ,Z_{n}\right).$
Compute the standard deviation $S$ ; $S(n)={\sqrt {{\frac {1}{n}}\sum _{i=1}^{n}\left(X_{i}-m\right)^{2}}}.$
Calculate the rescaled range $R(n)/S(n)$ and average over all the partial time series of length $n.$

The Hurst exponent is estimated by fitting the power law $\mathbb {E} [R(n)/S(n)]=Cn^{H}$ to the data. This can be done by plotting $\log[R(n)/S(n)]$ as a function of $\log n$ , and fitting a straight line; the slope of the line gives $H$ . A more principled approach would be to fit the power law in a maximum-likelihood fashion.^[16] Such a graph is called a box plot. However, this approach is known to produce biased estimates of the power-law exponent.^{[clarification needed]} For small $n$ there is a significant deviation from the 0.5 slope.^{[clarification needed]} Anis and Lloyd^[17] estimated the theoretical (i.e., for white noise)^{[clarification needed]} values of the R/S statistic to be:

\mathbb {E} [R(n)/S(n)]={\begin{cases}{\frac {\Gamma ({\frac {n-1}{2}})}{{\sqrt {\pi }}\Gamma ({\frac {n}{2}})}}\sum \limits _{i=1}^{n-1}{\sqrt {\frac {n-i}{i}}},&{\text{for }}n\leq 340\\{\frac {1}{\sqrt {n{\frac {\pi }{2}}}}}\sum \limits _{i=1}^{n-1}{\sqrt {\frac {n-i}{i}}},&{\text{for }}n>340\end{cases}}

where $\Gamma$ is the

Euler gamma function.^{[clarification needed]} The Anis-Lloyd corrected R/S Hurst exponent^{[clarification needed}

] is calculated as 0.5 plus the slope of

R(n)/S(n)-\mathbb {E} [R(n)/S(n)]

.

Confidence intervals

No asymptotic distribution theory has been derived for most of the Hurst exponent estimators so far. However, Weron [18] used bootstrapping to obtain approximate functional forms for confidence intervals of the two most popular methods, i.e., for the Anis-Lloyd^[17] corrected R/S analysis:

Level	Lower bound	Upper bound
90%	0.5 − exp(−7.35 log(log M) + 4.06)	exp(−7.07 log(log M) + 3.75) + 0.5
95%	0.5 − exp(−7.33 log(log M) + 4.21)	exp(−7.20 log(log M) + 4.04) + 0.5
99%	0.5 − exp(−7.19 log(log M) + 4.34)	exp(−7.51 log(log M) + 4.58) + 0.5

and for DFA:

Level	Lower bound	Upper bound
90%	0.5 − exp(−2.99 log M + 4.45)	exp(−3.09 log M + 4.57) + 0.5
95%	0.5 − exp(−2.93 log M + 4.45)	exp(−3.10 log M + 4.77) + 0.5
99%	0.5 − exp(−2.67 log M + 4.06)	exp(−3.19 log M + 5.28) + 0.5

Here $M=\log _{2}N$ and $N$ is the series length. In both cases only subseries of length $n>50$ were considered for estimating the Hurst exponent; subseries of smaller length lead to a high variance of the R/S estimates.

Generalized exponent

The basic Hurst exponent can be related to the expected size of changes, as a function of the lag between observations, as measured by E(|X_t+τ−X_t|²). For the generalized form of the coefficient, the exponent here is replaced by a more general term, denoted by q.

There are a variety of techniques that exist for estimating H, however assessing the accuracy of the estimation can be a complicated issue. Mathematically, in one technique, the Hurst exponent can be estimated such that:^[19]^[20]

H_{q}=H(q),

for a time series

g(t),t=1,2,\dots

may be defined by the scaling properties of its structure functions

S_{q}

(

\tau

):

S_{q}=\left\langle \left|g(t+\tau )-g(t)\right|^{q}\right\rangle _{t}\sim \tau ^{qH(q)},

where

q>0

,

\tau

is the time lag and averaging is over the time window

t\gg \tau ,

usually the largest time scale of the system.

Practically, in nature, there is no limit to time, and thus H is non-deterministic as it may only be estimated based on the observed data; e.g., the most dramatic daily move upwards ever seen in a stock market index can always be exceeded during some subsequent day.^[21]

In the above mathematical estimation technique, the function $H (q)$ contains information about averaged generalized volatilities at scale $\tau$ (only $q = 1, 2$ are used to define the volatility). In particular, the $H 1$ exponent indicates persistent ( $H1 > .mw-parser-output .frac{white-space:nowrap}.mw-parser-output .frac .num,.mw-parser-output .frac .den{font-size:80%;line-height:0;vertical-align:super}.mw-parser-output .frac .den{vertical-align:sub}.mw-parser-output .sr-only{border:0;clip:rect(0,0,0,0);clip-path:polygon(0px 0px,0px 0px,0px 0px);height:1px;margin:-1px;overflow:hidden;padding:0;position:absolute;width:1px}1⁄2$ ) or antipersistent ( $H 1 < 1 ⁄ 2$ ) behavior of the trend.

For the BRW (

brown noise

,

1/f^{2}

) one gets

H_{q}={\frac {1}{2}},

and for pink noise (

1/f

)

H_{q}=0.

The Hurst exponent for white noise is dimension dependent,^[22] and for 1D and 2D it is

H_{q}^{1D}={\frac {1}{2}},\quad H_{q}^{2D}=-1.

For the popular Lévy stable processes and truncated Lévy processes with parameter α it has been found that

H_{q}=q/\alpha ,

for

q<\alpha

, and

H_{q}=1

for

q\geq \alpha

. Multifractal detrended fluctuation analysis^[23] is one method to estimate

H(q)

from non-stationary time series. When

H(q)

is a non-linear function of q the time series is a multifractal system.

Note

In the above definition two separate requirements are mixed together as if they would be one.

Markov processes (i.e., memory-free processes) and fractional Brownian motion scale at the level of 1-point densities (simple averages), but neither scales at the level of pair correlations or, correspondingly, the 2-point probability density.^{[clarification needed}

]

An efficient market requires a martingale condition, and unless the variance is linear in the time this produces nonstationary increments, $x (t + T) - x (t) \neq x (T) - x (0)$ . Martingales are Markovian at the level of pair correlations, meaning that pair correlations cannot be used to beat a martingale market. Stationary increments with nonlinear variance, on the other hand, induce the longtime pair memory of fractional Brownian motion that would make the market beatable at the level of pair correlations. Such a market would necessarily be far from "efficient".

An analysis of economic time series by means of the Hurst exponent using

Long-range dependency

and, thus of informational efficiency.

Hurst exponent has also been applied to the investigation of

long-range dependency in DNA,^[26] and photonic band gap materials.^[27]

Implementations

Matlab code for computing R/S, DFA, periodogram regression and wavelet estimates of the Hurst exponent and their corresponding confidence intervals is available from RePEc: https://ideas.repec.org/s/wuu/hscode.html
Implementation of R/S in Python: https://github.com/Mottl/hurst and of DFA and MFDFA in Python: https://github.com/LRydin/MFDFA
Matlab code for computing real Hurst and complex Hurst: https://www.mathworks.com/matlabcentral/fileexchange/49803-calculate-complex-hurst
Excel sheet can also be used to do so: https://www.researchgate.net/publication/272792633_Excel_Hurst_Calculator

References

^
doi:10.1061/TACEAT.0006518
.

^ Hurst, H.E.; Black, R.P.; Simaika, Y.M. (1965). Long-term storage: an experimental study. London: Constable.
^
doi:10.1029/wr004i005p00909
.

S2CID 119634845
.

^ Torsten Kleinow (2002)Testing Continuous Time Models in Financial Markets, Doctoral thesis, Berlin ^{[page needed]}

^
CiteSeerX 10.1.1.137.207
.

^
ISBN 978-0-306-42851-7
.

doi:10.1088/0031-8949/32/4/001
.

S2CID 15409721
.

ISSN 1944-7973
.

doi:10.1111/j.1467-9892.1983.tb00371.x
.

^ J. Beran. Statistics For Long-Memory Processes. Chapman and Hall, 1994.

doi:10.1214/aos/1176324317
.

S2CID 55110202
.

^ R. H. Riedi. Multifractal processes. In P. Doukhan, G. Oppenheim, and M. S. Taqqu, editors, The- ory And Applications Of Long-Range Dependence, pages 625–716. Birkh¨auser, 2003.

S2CID 9155618
.

^
ISSN 0006-3444
.

S2CID 3272761
.

doi:10.1088/1367-2630/11/9/093024
.

S2CID 16889851
.

Mandelbrot, Benoît B.
, The (Mis)Behavior of Markets, A Fractal View of Risk, Ruin and Reward (Basic Books, 2004), pp. 186-195
S2CID 13608683
.

S2CID 18417413
.

^ Joseph L McCauley, Kevin E Bassler, and Gemunu H. Gunaratne (2008) "Martingales, Detrending Data, and the Efficient Market Hypothesis", Physica, A37, 202, Open access preprint: arXiv:0710.2583

S2CID 120377241
.

S2CID 14067237
.

PMID 26373616
.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Hurst_exponent&oldid=1182198570"

[:0-1] 
doi:10.1061/TACEAT.0006518
.

[2] Hurst, H.E.; Black, R.P.; Simaika, Y.M. (1965). Long-term storage: an experimental study. London: Constable.

[:1-3] 
doi:10.1029/wr004i005p00909
.

[4] S2CID 119634845
.

[5] Torsten Kleinow (2002)Testing Continuous Time Models in Financial Markets, Doctoral thesis, Berlin ^{[page needed]}

[Qian-6] 
CiteSeerX 10.1.1.137.207
.

[Feder-7] 
ISBN 978-0-306-42851-7
.

[8] :10.1088/0031-8949/32/4/001
.

[9] S2CID 15409721
.

[10] ISSN 1944-7973
.

[11] :10.1111/j.1467-9892.1983.tb00371.x
.

[12] J. Beran. Statistics For Long-Memory Processes. Chapman and Hall, 1994.

[13] doi:10.1214/aos/1176324317
.

[14] S2CID 55110202
.

[15] R. H. Riedi. Multifractal processes. In P. Doukhan, G. Oppenheim, and M. S. Taqqu, editors, The- ory And Applications Of Long-Range Dependence, pages 625–716. Birkh¨auser, 2003.

[16] S2CID 9155618
.

[:2-17] 
ISSN 0006-3444
.

[18] S2CID 3272761
.

[19] :10.1088/1367-2630/11/9/093024
.

[20] S2CID 16889851
.

[21] Mandelbrot, Benoît B.
, The (Mis)Behavior of Markets, A Fractal View of Risk, Ruin and Reward (Basic Books, 2004), pp. 186-195

[22] S2CID 13608683
.

[23] S2CID 18417413
.

[24] Joseph L McCauley, Kevin E Bassler, and Gemunu H. Gunaratne (2008) "Martingales, Detrending Data, and the Efficient Market Hypothesis", Physica, A37, 202, Open access preprint: arXiv:0710.2583

[25] S2CID 120377241
.

[26] S2CID 14067237
.

[27] PMID 26373616
.

[1]

[2]

[3]

[4]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[19]

[20]

[21]

[22]

[23]

[26]

[27]