Review

Multifractal analysis of financial markets: a review

, , and

Published 19 November 2019 © 2019 IOP Publishing Ltd
, , Citation Zhi-Qiang Jiang et al 2019 Rep. Prog. Phys. 82 125901 DOI 10.1088/1361-6633/ab42fb

0034-4885/82/12/125901

Abstract

Multifractality is ubiquitously observed in complex natural and socioeconomic systems. Multifractal analysis provides powerful tools to understand the complex nonlinear nature of time series in diverse fields. Inspired by its striking analogy with hydrodynamic turbulence, from which the idea of multifractality originated, multifractal analysis of financial markets has bloomed, forming one of the main directions of econophysics. We review the multifractal analysis methods and multifractal models adopted in or invented for financial time series and their subtle properties, which are applicable to time series in other disciplines. We survey the cumulating evidence for the presence of multifractality in financial time series in different markets and at different time periods and discuss the sources of multifractality. The usefulness of multifractal analysis in quantifying market inefficiency, in supporting risk management and in developing other applications is presented. We finally discuss open problems and further directions of multifractal analysis.

Export citation and abstract BibTeX RIS

1. Introduction

1.1. A very brief history of econophysics

Physicists have long being interested in financial markets both practically and academically. On the investment side, Sir Isaac Newton, one of the greatest scientists in history, is reported to have invested in the South Sea Company stock in the 1720s and to have lost 20 000 pounds (about 3 million US dollars in today's money) as a result of the burst of the infamous South Sea bubble. His speculative endeavors made for the famous quote that 'I can calculate the motions of heavenly bodies, but not the madness of people'. In 1991, Doyne Farmer co-founded the Prediction Company together with Norman Packard and James McGill to design automatic statistical arbitrage strategies. Prediction Company was quite successful and was sold to UBS in 2006 and was then re-sold to Millenium Management in 2013. In 1994, Jean-Philippe Bouchaud and Didier Sornette founded together a research company called Science & Finance, which was merged with Capital Fund Management in 2000. In the mean time, Sornette left Science & Finance and Bouchaud became Chairman and Chief Scientist of Capital Fund Management. Many other examples abound of physicists flirting with Wall Street, Emmanuel Derman being a role model [1].

Intellectually, there are deep relationships (as well as crucial differences) between physics and finance [2, 3] that have inspired generations of physicists as well as economists. In general, physicists apprehend financial markets as complex systems and, as such, they conducted numerous scientific investigations. In the 1930s, Ettore Majorana wrote a paper entitled 'The value of statistical laws in physics and social sciences', which was published by Giovanni Gentile in 1942 after his disappearance [4], and its English translation by Mantegna was presented in the journal of Quantitative Finance in 2005 [5]. In the 1960s, Mandelbrot pioneered an empirical approach, which would be later viewed as a precursor of econophysics, in a series of seminal works of income distributions [610], price variation distributions of speculative assets [11, 12], and long-term correlations in financial and economic time series using the rescaled range (R/S) analysis [13, 14].

The current flourishing econophysics has multiple seeds that date from the 1990s. One can identify several main directions of research that are among the most active ones in the field of econophysics in the past three decades. We review briefly five of the research streams below.

The first stream of studies concentrated on the stylized facts of financial variables. We focus here on the heavy-tailed distribution of financial returns, as well as conceptual formulations, while other stylized facts will be presented in section 1.2. In the early 1960s, mathematician Benoit Mandelbrot (1963) pioneered the use of heavy-tailed distributions (stable Lévy laws) for asset price fluctuations, proposing to abandon the traditional Gaussian law [11]. In 1991, Rosario Mantegna studied the Lévy-walk-like superdiffusive behavior of indices of the Milan stock exchange [15], and with Gene Stanley the scaling behavior of the S&P 500 index in 1995 [16, 17]. In 1996, Ghashghaie et al studied the evolving return distributions at different timescales and the multifractal behavior in the structure function of the US dollar—German mark exchange rates [18]. In 1998, Laherrère and Sornette proposed stretched exponential distributions for time series in nature and economy as a complement to the often used power-law distributions [19], while Gopikrishnan et al observed the inverse cubic law [2022]. We note that, compared with power laws, stretched exponentials [2325] and lognormals [26] are competing alternatives for return distributions that share hard-to-distinguish tail properties. Note that, if there is a scaling behavior in the distributions at different time scales, there is no multifractality (see, e.g. section 2.2.4 on extended self-similarity and and section 6.4.1 on the multi-scaling behavior of recurrence intervals).

A second stream was devoted to the macroscopic modelling of financial markets. In 1900, mathematician Louis Bachelier developed the mathematical theory of diffusion to model the apparent random walk motion of bonds and stock options in the Paris stock market [27], five years before Albert Einstein established the theory of Brownian motion [28]. The geometric Brownian motion (exponential of a Brownian motion) was introduced empirically by Osborne in 1959 [29] and theoretically by Samuelson in 1965 [30], which has become the backbone of milestone financial economics theories including Markowitz's portfolio theory [31], the Black–Scholes–Merton option pricing formula [32, 33], and the capital asset pricing model [34] and its generalized factor models [35, 36]. In 1994, Bouchaud and Sornette generalized the Black–Scholes option pricing problem for a large class of stochastic processes [37] using the functional integration and derivative approach well-known in field theory. In 1996, Sornette et al proposed the log-periodic power-law singularity (LPPLS) model to study financial crashes modelled as critical events [38]. This approach was soon exploited further by Sornette and Johansen [39, 40], Feigenbaum and Freund [41, 42], Vandewalle et al [43, 44], Drożdż et al [45], Sornette and Zhou [4649] and many others. Identification of financial bubbles and forecasting of market turning points advanced successfully in diverse markets [5052].

The third stream is concerned with computational econophysics. In 1992, Takayasu et al [53] developed the first agent-based model in econophysics with the goal to provide an alternative to dynamical stochastic general equilibrium (DSGE) models by incorporating agents' heterogeneous characteristics and the role of extended networks. In 1994, Palmer et al designed an artificial stock market model composed of adaptive agents with bounded rationality [54]. In 1996, Bak et al introduced a multi-agent model based on diffusion and annihilation [55]. In 1997, Challet and Zhang introduced the minority game [56]. In 1999, Lux and Marchesi proposed an agent-based model with heterogenous chartists and fundamentalists [57]. In general, there are three basic ingredients for the price formation process in financial markets, including global news that influence more or less all market participants, mutual imitations among traders, and idiosyncratic preferences of traders [58, 59]. Overconfidence of traders in interpreting the predictive power of global news, which mis-attributes the success of news to predict returns to traders' own stock picking ability and leads to positive feedbacks and herding behavior, plays a crucial role in generating the main stylized facts [2, 58, 59], such as the fat-tailed distribution of returns, the absence of linear correlations in returns, the long memory in volatilities, and the multifractal nature of financial time series.

A fourth stream of investigations has been involved in the more recent use of the emerging field of network theory applied to finance. In 1999, Mantegna studied the hierarchical structure of the US stock market from the angle of minimal spanning trees [60]. Laloux et al and Plerou et al applied random matrix theory to stock return correlation matrices to identify the information contents hidden in eigenvalues and their corresponding eigenvectors [6163].

A fifth stream has been concerned with analysing and modelling asset dynamics at the transaction level [64, 65]. In this vein, the continuous-time random walk formalism is a quite pure physics native proposal for modeling tick-by-tick data [6671], and there has also been a strong growth of the use of self-excited Hawkes models to describe volatility clustering as well as trade dynamics [7276]. There is a derivation where the mid-price is conceptualized as the Brownian particle surrounding by 'solvent' particles (all the limit orders on the ask and on the bid sides), which is fundamental for understanding the microscopic origin of random walks in finance [77, 78].

1.2. Stylized facts

Financial markets are complex adaptive systems, in which universal and non-universal statistical laws emerge at the macroscopic level from the microscopic behavior of heterogeneous traders through reactions to external stimuli in a self-organized manner [50, 51, 7984]. These laws can be explored from the rich dataset of financial stock market price time series. As an illustration, the left panel of figure 1 shows the daily price trajectories $I(t)$ of two stock market indices, the Dow Jones Industrial Average (DJIA) index from 26 May 1896 to 29 December 2017 and the Shanghai Stock Exchange Composite (SSEC) index from 19 December 1990 to 29 December 2017. These two stock markets are representative of developed and emerging markets that are of great interest. The logarithmic return of $I(t)$ over a time interval $\Delta{t}$ is defined by

Equation (1)
Figure 1. Refer to the following caption and surrounding text.

Figure 1. Time series of daily closing prices (left), daily returns (middle) and daily volatilities (right) of two stock market indices. The upper panel is for the Dow Jones Industrial Average index and the lower panel is for the Shanghai Stock Exchange Composite index.

Standard image High-resolution image

The middle panel of figure 1 illustrates the daily return time series $r(t)$ with $\Delta{t}=1$ d and the right panel shows the corresponding volatility time series. Here, the volatility for a given day is defined as the square root of the sum of the squares of intraday returns of that day, where the intraday time scale needs to be sufficiently short (say 1 min) to provide a good convergence of the volatility estimator. There are other definitions of financial volatility [85]. The so-called 'realized volatility' (defined as the sum of squares of returns at smaller scale used in figure 1) is often said to be the most consistent by econometricians [86, 87], and optimal volatility estimators using OHLC (open high low close prices) have been proposed in [8890]. Clear bursty behaviors can be observed in the return time series, which are associated with volatility clustering. There are other variables than price such as inter-trade duration, bid-ask spread, trade size, trading volume and so on, which have been extensively studied in the literature [65, 9193].

Numerous statistical regularities or stylized facts have been reported in different financial markets [64, 9496]. We first recall three main stylized facts, which are universal properties of financial markets. The first stylized fact is the fat-tailed distributions of financial returns. The form of the distribution of asset price fluctuations plays crucial roles in asset pricing and risk management [25, 79, 80]. Early empirical and theoretical works argued that asset prices follow (geometric) Brownian motions [27, 29], which is the main assumption of the Black–Scholes option pricing model [32]. In the 1960s, Mandelbrot suggested that incomes and speculative price returns follow the Pareto–Lévy distribution, which has power-law tails

Equation (2)

whose exponents $1<\gamma<2$ claimed to be in the domain of attraction of stable Lévy laws [6, 7, 911]. Recall that Lévy laws generalise the Gaussian distribution, and their family exhausts all possible distributions that are stable under convolution, with no or only weak dependence between the random variables.

The possibility that financial returns might be distributed according to Lévy laws rather than being Gaussian immediately attracted the interest of notable financial economists, such as Fama [97], Fama and Roll [98], Samuelson [99], Sargent [100], and others. But there was also immediate resistance to abandon the statistical theory that exists for the normal case, which was non-existent for the other members of the Lévy laws, with the like of Cootner [101] and Granger and Orr [102] opposing strong arguments. Further in-depth studies concluded to the error of Mandelbrot, in the sense that, while the distributions of returns are fat-tailed, they are not so heavy as to be described by Lévy laws: more and more statistical tests brought increasing and unavoidable evidence that the variance of the return is not infinite (and thus the exponent $\gamma$ is larger than 2), thus irremediably excluding the heavy tail regime of Lévy laws with tail exponent less than 2 [103106]. The tools based on variance and covariance could still be used after all. This led to a lull in the interest of fat-tailed distributions, which was interrupted by the re-discovery by econophysicists in the early 1990s [1517].

More recent empirical investigations with more data converge to $\gamma\approx3$ (or more prudently that $2 < \gamma <4$ ) [2022]. Further empirical investigations have been conducted on financial returns at different time scales $\Delta{t}$ and reported that distributions vary from exponential to stretched exponential to power law [19, 107124]. These observations are in agreement with the fact that the fat-tailed nature of the distributions of returns at small scale with $\gamma>2$ implies that the domain of attraction at large scales is the Gaussian distribution. One should thus observe and we do observe a progressive transition from a power law like regime at small time scales to a Gaussian regime at large scales [18]. Note that the stretched exponential distribution serves as a convenient bridge between exponential and power-law distributions [19, 125]. In fact, the power law family of distributions can be shown to be asymptotically nested in the stretched exponential family, the later converging to the former in the limit of vanishing stretched exponential exponent [2325]. This apparently esoteric property is very useful practically as it provides the robust Wilks test to compare power laws and stretched exponentials for any data set of interest.

As illustrations of the above, the left column of figure 2 shows the empirical distributions of daily returns of the DJIA and SSEC indices. There are evident fat tails and outliers, compared with the inverted parabola shape that would be expected under the Gaussian hypothesis.

Figure 2. Refer to the following caption and surrounding text.

Figure 2. Three main stylized facts: Fat-tailed distribution of daily returns (left), absence of autocorrelation of daily returns (middle), and long-range correlations of daily volatilities (right). The upper panel is for DJIA and the lower panel is for SSEC.

Standard image High-resolution image

The second main stylized fact is the very short memory or absence of linear autocorrelations in return time series [94, 95, 126, 127], as shown in the middle panel of figure 2. The absence of significant linear correlations in returns over not-too-small timescales has been widely documented [128, 129]. The absence of linear correlations in returns is maintained by the fact that simple statistical arbitrages cannot make profits in financial markets. However, memory could exist in very short time scales (a few minutes) because the price dynamics deviates from equilibrium and price adjustment needs time to react to new information [79, 94]. Moreover, at the transaction level, linear correlations can survive because there are market frictions (such as transaction fees) that erase the profit based on short-term transactions that would accrue without it [130].

The third main stylized fact is the long-range linear correlation in financial volatility as shown in the right panel of figure 2, also known as volatility clustering in finance [131], which captures the tendency of large (small) changes in financial asset prices to be followed by large (small) price changes [11]. The phenomenon of volatility clustering can be modeled by the famous ARCH [132] and GARCH [133] models. Many empirical analyses has been carried out by finance researchers [134] and econophysicists [21, 135, 136].

1.3. Benchmark econometric models

In econometric finance, there are several cornerstone models, such as the autoregressive conditional heteroskedasticity (ARCH) model [132], the generalized autorregressive conditional heteroskedasticity (GARCH) model [133], the exponential GARCH (EGARCH) model [137], the integrated GARCH (IGARCH) model [138], the fractionally integrated GARCH (FIGARCH) model [139], and the autoregressive fractionally integrated moving average (ARFIMA) model [140, 141].

An autoregressive model of order p , AR(p ), is defined as

Equation (3)

where $a_1, \cdots, a_p$ are the parameters of the model, a0 is a constant, and $ \newcommand{\e}{{\rm e}} \epsilon_t$ is white noise. To model a financial time series using an ARCH process [132], the error term $ \newcommand{\e}{{\rm e}} \epsilon _{t}$ is split into a stochastic piece zt and a time-dependent standard deviation $\sigma_t$ such that

Equation (4)

The random variable zt is a strong white noise process. The series $\sigma_t^{2}$ is modelled by

Equation (5)

where $\alpha_0>0$ and $\alpha _{i}\geqslant 0$ for i  >  0.

Considering a regression with coefficient b between variables xt and yt, the GARCH$(\,p, q)$ model of the residuals is given by [133]

Equation (6)

where p  is the order of the GARCH terms $\sigma^2$ and q is the order of the ARCH terms $ \newcommand{\e}{{\rm e}} \epsilon ^{2}$ . In finance, one usually applies the GARCH model to the return time series formulated as (4) with $\sigma$ given by (6).

The EGARCH model is expressed as follows [137]

Equation (7)

which ensures the non-negativity of the conditional variance and the presence of the leverage effect. In section 4.5.3, we will introduce the self-excited multifractal (SEMF) model, which can be viewed as the simplest multifractal generalization of the GARCH process [133], and in fact it is strongly related to EGARCH and other variants [131, 137].

The GARCH$(\,p, q)$ model can be rewritten in an equivalent ARMA-type representation:

Equation (8)

where $\alpha(L) = \alpha_1L+\alpha_2L^2+\cdots+\alpha_qL^q$ and $\beta(L) = $ $ \beta_1L+\beta_2L^2+\cdots+\beta_pL^{\,p}$ are lag operators. This leads to a natural extension, the integrated GARCH$(\,p,q)$ process, which can be written as [138]

Equation (9)

and the FIGARCH class of models can be expressed as follows [139]

Equation (10)

where (1  −  L)d is the fractional differencing operator. The FIGARCH class of models exhibit long memory but do not have multifractal properties (though numerical analysis may produce spurious multifractality) and they are often used to compare with multifractal models.

In the ARCH, GARCH and EGARCH models, the autocorrelation of volatility decays at exponential rates. In contrast, the volatility of the fractionally integrated models has long memory. Empirical analyses show that GARCH-type models without long memory in volatility can hardly capture the multifractal nature in financial time series, while other models with long memory component can capture partly the apparent multifractality, which is however spurious by definition of the integrated models [142151]. Therefore, benchmark economic models fail to capture an important dimension of market complexity, namely the multifractal nature of financial markets [152, 153].

1.4. Introduction to multifractality

Multifractality has been observed in many diverse complex systems [154160], including financial markets [152]. The term 'multifractal' was coined by Frisch and Parisi in 1983 [161], as confirmed by Mandelbrot [162], who is the well respected Father of Fractals [163]. The concept of multifractality can be traced back to Novikov [164] and Mandelbrot [165, 166] in their study of turbulence in fluid mechanics. On the other hand, two groups led by Grassberger and Procaccia generalized the fractal dimension, the information dimension and the correlation dimension into a unified expression of generalized dimensions through the Rényi entropy [167169], with the initial aim of describing strange attractors in nonlinear dynamics [170]. Halsey et al also made seminal contributions in the early development of multifractal theories, with applications to fractal growth processes [171].

There are two equivalent descriptions of multifractals, i.e. via the function $\tau(q)$ or the function $f(\alpha)$ , where q is the order of certain moments, $\tau(q)$ is the mass exponent function, $\alpha$ is the singularity strength, and $f(\alpha)$ is the singularity spectrum. These two representations are related through the Legendre transform [161, 171] such that

Equation (11a)

and

Equation (11b)

Concerning the $\tau(q)$ representation, there are two additional equivalent functions Dq and $H(q)$ . The Dq function presents the generalized dimensions that are determined by

Equation (12)

while $H(q)$ is defined by

Equation (13)

and corresponds to the generalized Hurst exponents. The detailed meaning of these quantities and there relations will be made clear later.

In the study of turbulence, there are two classical methods for the analysis of multifractals, i.e. the structure function approach and the partition function approach [161, 164166, 172175]. Due to the striking similarities between turbulence and financial markets, though the analogy has its limitations [176], the multifractal nature of financial time series has attracted much interest [17, 18]. The structure function approach dominated in the first wave of multifractal analysis in econophysics [177]. Since the seminal work of Kantelhardt et al in 2002 [178], the multifractal detrended fluctuation analysis (MF-DFA) soon became the dominant method not only for financial time series but also for other time series. In 2008, Podobnik and Stanley introduced the detrended cross-correlation analysis (DCCA) for non-stationary time series [179], which was extended by one of us to analyze multifractal time series (MF-DCCA) [180]. These milestone works on DCCA and MF-DCCA approaches triggered the third wave of simultaneous multifractal analysis of two time series and the invention of variant types of multifractal analyses.

In financial markets, risk is a central concept in all financial activities. Large financial fluctuations, especially finance 'tsunamis' such as the one that broke out in 2008 [52], usually cause tremendous economic distress and are followed by substantial changes in risk perception and regulations [50, 51]. Identifying, measuring and forecasting financial risks are of great theoretical and practical significance in risk management. Econophysicists have shown that multifractal analysis provides an alternative way in studying market risks. Indeed, large singularity strengths qualify the behavior of small fluctuations, while small singularity strengths characterize large fluctuations. Therefore, quite a few efforts have been made to apply multifractal analysis to quantifying market inefficiency, measuring financial volatility, and so on. See section 8 for further elaboration.

1.5. Data sets

Here, we provide a brief description of available financial data. In general, one can consider three characteristics of financial data sets: the nature of the market, the considered variables, and the sampling frequency with which the data is recorded (see section 6). We consider them in turn.

There are different types of financial markets. In the financial sector, 'financial markets' often refer to the markets that are used to raise finance, including the capital markets for long term finance and the money markets for short term finance. More generally, we have capital markets (consisting of stock markets and bond markets), commodity markets, money markets, derivative markets, foreign exchange markets, cryptocurrency markets, spot markets and interbank lending markets. The most studied markets in econophysics are stock markets, including stock markets in developed countries and stock markets in emerging countries. Since the financial crisis of 2008 and the Great Recession that followed (also sometimes referred to as the 'Great Financial Tsunami') [181], housing markets are sometimes regarded as financial markets.

Concerning variables, it is not surprising that return, volatility and price are the most widely investigated financial quantities due to their relevance to characterise market states as well as their fluctuations. They are also largely accessible. Other macroscopic and microscopic financial variables include trading volume, bid-ask spread, price gap, illiquidity, order size, trade size, trade number, inter-trade duration, inter-order duration, inter-cancellation duration, recurrence interval (of return, volatility, trading volume, etc), market sentiment measure, and so on.

An important dimension of financial data sets is the time scales used to record them. The time scale can be monthly, weekly, daily, intraday high-frequency, and ultra-high-frequency at the transaction level and at the order level. Therefore, the data might be evenly sampled (without regard to non-trading time periods) and unevenly sampled (for ultra-high frequency data). There are also seasonal patterns, such as strong periodicities for electricity prices and intraday patterns in volatilities and trading volumes. These distinct features should be considered when performing a multifractal analysis or constructing multifractal models. Otherwise, spurious multifractality and artifacts will appear in the obtained results.

1.6. Organization of this review

This review provides an extensive survey of the multifractal analysis of financial time series. We start with different methods of multifractal analysis for univariate and multivariate time series in sections 2 and 3. We review also the methods that were invented in other fields, because they have potential application value in econophysics. We discuss important mathematical and econophysical models in section 4 that can deepen our understanding of the multifractal nature of financial markets. Important properties and subtle issues associated with the algorithms used in empirical multifractal studies are discussed in section 5, many of which are often overlooked by researchers leading to unreliable results. In section 6, we give an extensive survey of the literature on empirical multifractal analysis of financial time series, which overall confirms the presence of multifractality in financial markets. We raise in section 7 the important issue of apparent and effective multifractality and discuss different sources causing apparent multifractality. Different applications of the multifractal nature of financial time series are reviewed in section 8. We discuss open problems and provide perspectives for future research in section 9.

Although this review focuses on the multifractal analysis of financial markets, most of its contents are suitable for multifractal time series analysis in all other fields. Moreover, a large part of this review can be usefully read to inspire multifractal analyses of measures on surfaces and in higher dimensional spaces, because the multifractal methods for time series analysis have already been or can be extended to higher dimensions.

2. Multifractal analysis

2.1. Partition function approach (MF-PF)

2.1.1. Generalized dimensions and mass exponents.

Consider a measure m embedded in a geometric support ${\mathcal{F}}$ , whose density is $\dot{m}(t')$ at position $t'$ . By definition, $\int_{t'\in{\mathcal{F}}} \dot{m}(t'){\rm d}t'=1$ . The measure in the neighbourhood of $t'$ is $\dot{m}(t'){\rm d}t'$ . Using the idea of the classic box-counting method [163], we cover the geometric support ${\mathcal{F}}$ using boxes of size s. The integrated measure $m(s,t)$ in the tth box ${\mathcal{B}}(s,t)$ is

Equation (14)

where $\sum_t {\rm m}(s,t)=1$ for any s. The fractal dimension of ${\mathcal{F}}$ can be determined as follow

Equation (15)

where the term $\sum_t[m(s,t)]^0$ gives the number of non-empty boxes needed to cover the support ${\mathcal{F}}$ . Note that fractal dimension is also called similarity dimension or capacity dimension. It is obvious that, if there is no measure distributed in a box, the box should not be counted. In most cases in handling financial time series, we have D0  =  1. In the empirical determination of D0, we plot $\sum_t[m(s,t)]^0$ against s in log–log scales and the slope of the linear part in a proper scaling range is regarded as its estimate.

The information entropy or Shannon entropy of the measure can be defined as [182, 183]

Equation (16)

and one can define the information dimension as follows

Equation (17)

which was introduced by Balatoni and Rényi in [184, 185] and used to study strange attractors [186, 187].

A third index used to characterize the measure is the correlation dimension [188, 189]. The correlation dimension was originally introduced to study the scaling behavior of the correlation integral of time series of length N:

Equation (18)

where ${\mathcal{H}}(x)$ is the Heaviside function. The correlation dimension is defined as follows

Equation (19)

These three dimensions Df, $\sigma \ {\rm and} \ \nu$ can be unified into one framework of generalized dimensions [167169]. We define the q-order Rényi entropy as [190, 191]

Equation (20)

and the generalized dimension as [192]

Equation (21)

where

Equation (22)

is the q-order partition function of the measure. It is easy to verify that $D_0=D_f$ , $D_1=\sigma$ and $D_2=\nu$ . In practice, we can compute Dq according to equation (21). For a given q, we plot Iq(s) against $\ln{s}$ for various box sizes s and perform linear regression in a proper scaling range to obtain the slope Dq.

We can rewrite equation (21) as follows

Equation (23)

or

Equation (24)

where

Equation (25)

In practice, we can compute $\tau(q)$ according to equation (24). For a given q, we can compute $\chi(q,s)$ for various box sizes s and perform a linear regression of $\ln\chi(q,s)$ against $\ln{s}$ in a proper scaling range to obtain $\tau(q)$ . Alternatively, $\tau(q)$ can be transformed from Dq using equation (25).

Recently, Xiong and Shang proposed a variance-weighted partition function approach and established the relationship between the corresponding multifractal functions [193]. Numerical simulations show that the variance-weighted partition function approach performs comparatively as the standard partition function approach.

2.1.2. Scaling behavior of partition functions.

Based on the box-counting idea, the geometric support is partitioned into non-overlapping boxes of size s. The local singularity strength $\alpha$ in the tth box ${\mathcal{B}}(s,t)$ is defined according to the following relationships [171]:

Equation (26)

which may differ for different boxes. Let $N_s(\alpha)$ denote the number of boxes in which the singularity strengths are within $[\alpha,\alpha+d\alpha]$ . Hence, the fractal dimension of the set is determined according to

Equation (27)

where $\rho(\alpha)$ is the density of the singularity strength and $f(\alpha)$ is the fractal dimension of the boxes under consideration [171], which is usually called multifractal spectrum or singularity spectrum. One can directly obtain $\alpha$ from equation (26) and $f(\alpha)$ from equation (27) using local and pointwise singularity methods [194196].

Inserting equation (26) into the partition function (22) and rewriting the sum into an integral over $\alpha$ , we have

Equation (28)

Assume that $\rho(\alpha)$ is nonzero and non-singular. It therefore contributes a constant proportional factor to the leading behavior of the integral. Because s is small, according to the method of steepest descent, the integral will be dominated by the value of $\alpha(q)$ that makes the power $q\alpha'-f(\alpha')$ smallest. We thus replace $\alpha'$ by $\alpha(q)$ , which is defined by the extremal condition:

Equation (29a)

and

Equation (29b)

Then, equation (28) becomes

Equation (30)

It follows from equation (29) that

Equation (31a)

and

Equation (31b)

which means that the multifractal spectrum $f(\alpha)$ is a concave function and its slope at point $(\alpha(q),f(\alpha(q))$ is q.

Comparing equations (24) and (30), we have

Equation (32)

Taking the derivative of equation (32) with respect to q, we have

Equation (33)

Rewriting equations (32) and (33), we have

Equation (34a)

and

Equation (34b)

This shows that $ f(\alpha)$ is the Legendre transform of $\tau(q)$ [161, 171]. Therefore, after obtaining $\tau(q)$ , we can determine numerically $\alpha(q)$ using equation (34a) and $f(\alpha)$ using equation (34b). To qualify a multifractal measure, we can use either $(q,\tau)$ or $(\alpha,f)$ , which are equivalent.

2.1.3. Direct determination of a multifractal spectrum.

From the canonical perspective, we can obtain the $f(\alpha)$ function directly [197, 198]. We can define the canonical measures

Equation (35)

The singularity strength $\alpha(q)$ and its spectrum $f(q)$ can be computed by linear regressions in log–log scales using the following equations:

Equation (36a)

and

Equation (36b)

To compute $\alpha(q)$ , we can plot $\sum_t \mu(q,s,t) \ln{[m(s,t)]}$ against $\ln{s}$ , determine the scaling range and obtain the slope by linear regression. The $f(q)$ function can be determined similarly based on equation (36b). Combining equations (24) and (36), we can verify equation (32) easily. The mass exponent function $\tau{(q)}$ can be obtained by using equation (32).

2.1.4. Inverse partition function and inversion formula.

For financial volatilities, we can further define the inverse partition function [199]. We start with an illustration of the partition function approach using a high-frequency volatility time series.

Denote $\{I(t): t=1,\cdots, T\}$ the price time series of a financial asset. The logarithmic return $r(t)$ at the finest time resolution is calculated by

Equation (37)

where $t=2,\cdots,T$ . The absolute return is utilized as a proxy for volatility according to

Equation (38)

Figure 3(a) illustrates a segment of the 1 min volatility time series of the S&P 500 index, in which, according to the partition function method, the original volatility time series is divided into N boxes with identical size s  =  T/N. The box sizes are chosen such that the number of boxes of each size is an integer to cover the whole time series.

Figure 3. Refer to the following caption and surrounding text.

Figure 3. Multifractal analysis of high-frequency financial volatility based on the partition function approach and the inverse partition function approach. The time series contains about 1.7 million data points of the S&P 500's 1 min volatility from 1 January 1982 to 31 December 1999. (a) A segment of volatility time series $\{v(t): t= 1:1000\}$ . (b) Cumulative volatility measure $U(t)$ showing the determination of $\mu$ for a fixed scale s. (c) Cumulative volatility $U(t)$ showing the determination of sk for a fixed measure $\Delta{v}$ . (d) Dependence of $[\chi(s,q)]^{1/q-1}$ as a function of box size s for q  =  −3, q  =  −1, q  =  0, q  =  2, q  =  4, and q  =  6. The curves have been translated vertically by a factor of 0.001, 0.01, 0.1, 10, and 100 in turn for better visibility. The solid lines are power-law fits in the scaling range. (e) Dependence of $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}[\chi^\dagger(\mu,p)]^{1/p-1}$ on the thresholds $\mu$ . The curves have been translated vertically for clarity. The solid lines are the best fits to the data. (f) Testing the inversion formula in financial volatility.

Standard image High-resolution image

We define the continuous volatility measure $u(t')$ for any $t'\in (t-1, t]$ by

Equation (39)

where $t=1,\cdots,T$ . The cumulative function of the volatility measure is obtained as

Equation (40)

The function $U(t)$ of $v(t)$ in figure 3(a) is presented in figure 3(b). The measure in each box of size s is determined by

Equation (41)

which is also illustrated in figure 3(b). Usually, one computes the measure in a discrete way,

Equation (42)

in which T/s must be an integer to avoid edge effects (see section 5.1.2 for more information). When the continuous measure $u(t)$ is used, more choices for s values are possible as N can be any integer.

For a given order q, the direct partition function $\chi_q(s)$ can be estimated by using

Equation (43)

In figure 3(d), we plot $[\chi_q(s)]^{1/q-1}$ as a function of s for different q's. The scaling exponent function $\tau(q)$ can be obtained through power-law regressions to the data points in the scaling range, which spans about three orders of magnitude. We find from figure 3(f) that $\tau(q)$ is a nonlinear function of q, confirming the presence of multifractality in the volatility measure.

We now turn to investigate the inverse partition function. For each threshold $\mu$ , a sequence of exit times $\{s_j(\mu): j=1,\cdots,J\}$ can be determined successively by

Equation (44)

where $J = 1/\mu$ is an integer. Graphically, figure 3(c) shows how the exit times sj can be determined. The inverse measure is defined as the normalized exit time

Equation (45)

and the inverse partition function can be determined as follows:

Equation (46)

Figure 3(e) shows the dependence of $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}[\chi^\dagger(\mu,p)]^{1/(\,p-1)}$ as a function of the thresholds $\mu$ for different p  values. Power-law scaling can be observed over about three orders of magnitude:

Equation (47)

The scaling exponent function $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}\tau^\dagger(\,p)$ can be obtained through power-law regressions to the data points in the scaling range, as is shown in figure 3(f). The nonlinearity of the $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}\tau^\dagger(\,p)$ function confirms the presence of multifractality in the exit time measure.

An important issue arises concerning the relationship between $\tau(q)$ and $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}\tau^\dagger(\,p)$ . Mandelbrot and Riedi proved an elegant inversion formula mathematically for both discontinuous and continuous multifractal measures [200, 201]. Roux and Jensen independently proved the exact relation between the direct and inverse scaling exponents for the classical binomial measures [202]. Xu et al provided a simple 'proof' of the inversion formula for multinomial measures [203], which is presented below.

Let $\mu$ be a probability measure on $[0,1]$ with its integral function $M(t) = \mu([0,t])$ . Then its inverse measure can be defined by

Equation (48)

where $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}M^\dagger(s)$ is the inverse function of $M(t)$ . If $\mu$ is self-similar, then the relation $\mu = \sum_{i=1}^n p_i\mu(m_i^{-1}(\cdot))$ holds, where the mi's are similarity maps with scale contraction ratios $r_i\in(0,1)$ and $\sum_{i=1}^np_i=1$ with pi  >  0. The multifractal spectrum of measure $\mu$ is the Legendre transform $f(\alpha)$ of $\tau$ , which is defined by the generating function

Equation (49)

It can be shown [200, 201] that the inverse measure $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}\mu^\dagger$ is also self-similar with ratio $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}r_i^\dagger=p_i$ and $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}p_i^\dagger=r_i$ , whose multifractal spectrum $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}f^\dagger(\alpha^\dagger)$ is the Legendre transform of $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}\tau^\dagger$ , which is defined implicitly by

Equation (50)

It is easy to verify that the following inversion formula holds

Equation (51)

Two equivalent testable formulae follow immediately

Equation (52)

where the superscript ${[-1]}$ denotes the inverse function operator. Figure 3(f) verifies that the inversion formulae (52) hold for the high-frequency volatility time series of the S&P 500 index [199].

We can also relate the direct and inverse singularity strengths and the direct and inverse singularity spectra. From equation (51), we have

Equation (53)

Combining equations (51), (53) and the Legendre transform, we have

Equation (54)

Rigorous proofs of these results can also be found in [200, 201]. It is easy to obtain the relation between direct and inverse generalized dimensions [204]:

Equation (55)

where $ \newcommand{\re}{{\rm Re}} \renewcommand{\dag}{\dagger}D^\dagger(\,p)$ is the inverse generalized dimension function.

2.1.5. Ensemble averaging.

In the multifractal analysis of financial time series, almost all studies with very few exceptions focused on individual time series. One can further investigate an ensemble of financial time series of individual assets [205]. Ensemble averaging was developed to determine the multifractal dimensions of diffusion-limited aggregations [206210], and unearthed a subtle discrete hierarchy quantified by complex fractal dimensions [211].

One defines the quenched and annealed mass exponents $\tau_\mathrm{quen} (q)$ and $\tau_\mathrm{ann} (q)$ as follows,

Equation (56)

Equation (57)

where the angular brackets $\langle \cdot \rangle$ signify the ensemble average over all time series. The annealed exponents are more sensitive to rare samples of the ensemble with unusual values of $\chi_q(s)$ , while the quenched exponents are more characteristic of typical members of the ensemble [212].

The reasoning under the ensemble averaging lies in the consideration that one performs the measurement many times of the dynamics of an 'ensemble' of financial instruments. The ensemble averaging method provides an alternative way to measure market risks from individual equities other than the market index. It is also different from directly averaging the risk measures of many individual equities. In addition, although the method was initially developed for the partition function approach, its extension to other multifractal analysis methods is straightforward.

2.2. Structure function approach

2.2.1. Direct structure functions (MF-SF).

Another important statistical quantity of turbulent fields is the structure function of velocity increments [172, 173]. Classical experiments have been carried out to measure the structure function and its nonlinear scaling behaviors [174]. Such nonlinear scaling behavior is also termed as multifractality [161, 175]. The structure function approach has been also employed to investigate financial time series [18, 177, 213]. The qth-order structure function is also called the qth-order height-height correlation function in the literature [214, 215].

Consider a time series $\{X(i):i=1,\cdots, N\}$ . The increments or innovations over time scale s are defined as

Equation (58)

For financial assets, $X(i)$ is the logarithmic price and $\Delta{X}$ is the return over the time interval of duration s. The qth-order structure function is defined as the qth-order moment of the increment distribution:

Equation (59)

We stress that $q\geqslant0$ for time series in turbulence [174], finance [177, 216], and other fields, because $\Delta{X}(i,s)$ can be zero such that the moments of negative orders are not defined. When q  =  0, we have $ \newcommand{\e}{{\rm e}} K(0,s)\equiv 1$ , which is independent of the scale s. When q  =  2, $K(2,s)$ is proportional to the autocorrelation function $ \langle{X(i)X(i-s)}\rangle$ [177, 216].

For self-similar time series, we expect to have

Equation (60)

where $H(q)$ is the generalized Hurst exponent. The scaling function is obtained as [217]

Equation (61)

When q  =  0, we have

Equation (62)

In practice, for each fixed q, we calculate the $K(q,s)$ values with varying s values and perform a linear least-squares regression of $\ln{K(q,s)}$ against $\ln{s}$ in a properly chosen scaling range $[s_{\min},s_{\max}]$ to obtain $qH(q)$ and $\tau(q)$ . It follows that

Equation (63)

We can further determine the singularity function $\alpha(q)$ and the singularity spectrum $f(\alpha)$ through the Legendre transform. When $X(t)$ is monofractal, $H(q)=H$ is a constant. When $X(t)$ has a multifractal nature, $H(q)$ decreases with q.

According to the definition of $K(q,s)$ , we have

Equation (64)

where $\sum_t$ is the sum for $s+1\leqslant{t}\leqslant{T}$ with $v>0$ . Denote k the proportionality coefficient of the scaling relation (60). It is obvious that k  >  0. Combining equations (60) and (61), we have

Equation (65)

Let $g(x)=x\ln{x}$ . We have $g'(x)>0$ when $x\in[1/e,+\infty)$ and $g'(x)<0$ when $x\in(0,1/e]$ .

Figure 4 presents the results of the multifractal analysis of the daily time series of the logarithmic price $X(t)$ of the Dow Jones Industrial Average index (DJIA, 1896–2015) based on the structure function approach.

Figure 4. Refer to the following caption and surrounding text.

Figure 4. Multifractal analysis of the daily time series of the logarithmic price $X(t)$ of the Dow Jones Industrial Average index (DJIA, 1896–2015) based on the structure function approach. (a) The daily price time series of the DJIA index. (b) Power-law dependence of the structure function $K(q,s)$ on time lag s for different q values. (c) The generalized Hurst exponent $H(q)$ as a function of q. (d) The scaling exponent function $\tau(q)$ . (e) The singularity strength function $\alpha(q)$ . (f) The multifractal singularity spectrum $f(\alpha)$ .

Standard image High-resolution image

In a seminal study [213], Müller et al presented a statistical analysis of financial data documenting scaling laws in the form of equation (60) for q  =  1 and q  =  2, though they did not mention 'multifractal', 'multiscaling' or 'structure function'. Their analysis included intra-day prices over 3 years and daily prices over 15 years of four foreign exchange spot rates (DEM/USD, JPY/USD, CHF/USD, USD/GBP) and intra-day prices over 3 years of gold (XAU/USD). Similar analyses for the first- and second-order moments have been adopted by other economists [86].

2.2.2. Multifractal fluctuation analysis (MF-FA).

The method of fluctuation analysis is a classical approach to extract the Hurst exponent of time series [218], which has been applied extensively in econophysics [219223]. The fluctuation function can be calculated as follows,

Equation (66)

where H is the fluctuation analysis (FA) exponent, i.e. approximatively the Hurst index. A value of the Hurst exponent H larger than 0.5 means that the time series is correlated, For H  <  0.5, the time series is anti-correlated, and for H  =  0.5, it is uncorrelated.

One can extend the fluctuation analysis in equation (66) to higher orders as follows [224226],

Equation (67)

which enables us to understand the multifractal nature in the dynamics of markets. The relationship between the scaling exponent $\zeta(q)$ and the generalized Hurst exponent $H(q)$ can be described by equation (61). When q  =  2, $H = H(2)$ is the Hurst exponent presented in equation (67). Using the Legendre transform, we can obtain the singularity strength $\alpha$ and its spectrum $f(\alpha)$ .

2.2.3. Inverse statistics and inverse structure functions.

Analogous to the inverse measure in the partition function approach [201, 227], Jensen [228] investigated the moments of exit time or first passage time. For a given increment $\Delta{X}$ , using definition (58), the exit time scale s at time i is defined by

Equation (68)

which is the minimal time needed for the fluctuation to exceed the threshold $\Delta{X}\geqslant0$ . The first passage time has important usage in finance. For instance, Cho and Frees adopted it to construct an asymptotically unbiased volatility estimator [229]. They argued that the natural volatility estimator focuses on how much the price changes, whereas the 'temporal' estimator based on the first passage time focuses on how quickly the price changes [229]. The inverse statistics have also some connections to the persistence probability [230233]. The variable $s(i, \Delta{X})$ and derived quantities have natural interpretations in finance, such the 'time to recovery' after a drawdown and are part of metrics used in a standard way in evaluating performance of financial strategies and hedge-funds. This explains that inverse statistics have been extensively applied in finance, as well as in other fields like in the study of heart beat rate and cardiac diseases [234].

If $X(i)$ is a Brownian motion, the distribution of s is solved analytically as a Gamma distribution [235, 236]:

Equation (69)

where a is proportional to $\Delta{X}$ . Since asset prices are not Brownian motions, Simonsen et al [237] proposed a generalized shifted Gamma distribution

Equation (70)

which has a power-law tail $\Pr(s)\sim s^{-(\delta+1)}$ . The Gamma distribution in equation (69) is recovered if $\nu=1$ , $\delta=1/2$ , s0  =  0 and $\beta=a$ . They analyzed the wavelet filtered logarithmic daily closing prices of the DJIA (26 May 1896 to 5 June 2001) and found that the distribution (70) with $\delta\approx0.5$ fits the data well. This observation seems quite ubiquitous for uniformly spaced time series of indices and stocks in different markets [238240], although there are studies reporting that $\delta$ can deviate from 0.5 [241, 242]. For tick-by-tick data at the transaction level, the tail exponent can be larger than 0.5, as found for five highly liquid stocks (AstraZeneca, GlaxoSmithKline, Lloyds TSB Group, Shell, and Vodafone) in the electronic market (SETS) of the London Stock Exchange (LSE) during the year 2002 [239] and the foreign exchange rate of DEM against USD for the full year of 1998 ($\delta=1.4$ ) [243].

Simonsen et al [237] argued that the most probable first passage time s* is the optimal investment horizon and found that s* scales with $\Delta{X}$ as a power law,

Equation (71)

where $\gamma\approx1.8$ , deviating from $\gamma=2$ for Brownian motions. It is found that $\gamma <2$ holds for regularly spaced samples in most markets and the $\gamma$ value is larger in developed stock markets than emerging markets [238, 244]. For tick-by-tick data that are unevenly sampled, it was reported that $\gamma>2$ [245].

For asset prices, it is natural to also consider the minimal time needed for the price to depreciate by a certain return $\Delta{X}\leqslant 0$ . Then, the exit time s at time i is defined by [246]

Equation (72)

A lot of research has shown that the optimal investment horizon is longer for gains than for losses, which is termed the gain-loss asymmetry [244, 246254]. Several models have been proposed to reproduce the gain-loss asymmetry [255257]. Nevertheless, there are also assets that do not show significant gain-loss asymmetry [241, 258].

An analysis in the frequency space based on the discrete wavelet transform showed that the gain-loss asymmetry is introduced mainly by the low-frequency content of the price series and the asymmetry disappears if enough of the low-frequency content is removed [257]. Alternatively, the gain-loss asymmetry is found to be caused by the non-Pearson-type autocorrelations in the time series [259]. Furthermore, the gain-loss asymmetry vanishes if the temporal dependence structure is destroyed by shuffling the time series [260].

The gain-loss asymmetry can be related to the leverage effect as follows. Recall first that the leverage effect is the fact that, after a large negative return, the volatility increases significantly and then relaxes back, usually roughly exponential in time [261263]. In contrast, after a positive return, there is no significant change of volatility. And the causality is from (negative) returns to future volatility and not the reverse, in the sense that a variation of volatility does not produce any measurable change of the expected return in the future. The leverage effect can be interpreted as due to the impact of a loss on the risk perception of the firm, which increases as the equity over debt ratio has decreased as a result of the loss [261]. It also reflects a behavioral response of investors, who after a loss frantically reassess their risk exposure and readjust their portfolios, leading to large price moves. The leverage effect allows one to account for both (i) the asymmetry in the average time for a price drop of a fixed percentage versus a price gain of the same value (for instance 10 d for a loss of 5% and 20 d for a gain of 5% on the DJIA) and (ii) the fact that this asymmetry in waiting times for positive versus negative objectives is stronger for indices and weaker or absent for individual stocks [257, 260].

Consider a fixed drop target of $-5\%$ . Such a drop is achieved by a succession of daily returns, more negative than positive. When some negative loss occurs on one day, the leverage effect leads to an increase of the amplitude of the next daily return. Because we are calculating a waiting time conditional to a cumulative drop of $-5\%$ , the following returns will tend to be negative and with larger amplitude due to the leverage effect. In contrast, for the waiting time to reach a positive gain of $+5\%$ , a majority of the daily moves will be positive, and the leverage effect will be weaker or absent if there are no or weak negative returns along the price path. As a consequence, as the amplitude of the daily returns tend to be small for a sequence of daily returns involved in a positive level ($+5\%$ ), the average waiting time to reach a positive return ($+5\%$ ) is larger than for a negative return goal. The asymmetry of a single stock is small because the leverage effect has a rather short memory and the difference between the number of up and down daily moves for reaching the target threshold is not large. In contrast, for an index, or equivalently a portfolio of N stocks, one must account for the ubiquitous existence of correlations between the returns of the stocks constituting the portfolio. These correlations, which are in general positive (most stocks move approximately together on average), produce an amplification of the gain-loss asymmetry, because the leverage effect is stronger by the averaging over the idiosyncratic residuals of the constituting stocks. This also leads to predict that the gain-loss asymmetry for a portfolio is the stronger, the stronger is the average correlation coefficients of its constituting stocks.

The moments of the exit time, called the distance structure functions [228] or inverse structure functions [264, 265], are defined by

Equation (73)

Due to the duality between the structure function and the inverse structure function, one can intuitively expected that there is a power-law scaling stating that

Equation (74)

where $\phi(\,p)$ is a nonlinear concave function [228].

Synthetic data for the GOY shell model of turbulence showed perfect power-law dependence of the inverse structure functions on the velocity threshold [228]. For two-dimensional turbulence, the inverse structure functions exhibit well-defined multifractal properties [265]. In contrast, for three-dimensional turbulence, the inverse structure functions of an experimental time series at high Reynolds number do not exhibit clear power law scaling [264]. Nevertheless, different experiments show that the inverse structure functions of three-dimensional turbulence exhibit a more general scaling behavior called extended self-similarity (see next subsection) [266268]. To our knowledge, the behavior of inverse structure functions of financial asset prices has not been studied.

The inversion formula relating the direct scaling exponents $\zeta(q)$ of direct structure functions and the inverse scaling exponents $\phi(\,p)$ of inverse structure functions can also be derived [269, 270] and give

Equation (75)

These expressions are verified by the simulated velocity fluctuations from the shell model [202]. However, this prediction (75) cannot be confirmed by wind-tunnel turbulence experiments (Reynolds numbers ${{{\rm Re}}}=400{{\rm \mbox{--}}}1100$ ) [267], which is not surprising because the inverse structure function does not scale as a power law of the velocity increment thresholds [264, 267].

2.2.4. Extended self-similarity.

To investigate the scaling properties of the inverse structure functions, one can define a set of relative exponents using the framework of extended self-similarity (ESS) [271]:

Equation (76)

where q0 is the order taken as a reference value. If equation (60) holds, we have

Equation (77)

When the time series is monofractal, $H(q)$ is independent of q, that is, H(q)  =  H(q0). It follows for monofractal time series that

Equation (78)

In the case of velocity structure functions in fluid mechanics, q0  =  3 is a natural choice based on the exact Kolmogorov's four-fifth law [268, 272]. For financial time series, we suggest to choose q0  =  2 because $H(2)$ is the Hurst index. Generally, the ESS approach provides a wider scaling range for the extraction of scaling exponents.

Figure 5(a) presents the log–log plots of $K(q,s)$ against K2(s) for $q= 1, 2, \cdots, 6$ with $s \in [1,2^{11}]$ . The straight lines hold over more than two orders of magnitude and over at least 3.5 orders of magnitudes for $q\leqslant3$ , showing the existence of extended self-similarity in the structure functions. The scaling range for small q's seems to be broader than for large q's. The ESS scaling exponents $\xi(q,2)$ are shown in figure 5(b). There is a weak indication that $\xi(q,2)$ has a nonlinear dependence as a function of q, with a downward curvature making the curve depart from the linear dependence $\xi(q,2) = q/2$ observes for small q's.

Figure 5. Refer to the following caption and surrounding text.

Figure 5. Extended self-similarity analysis of the daily price time series of the DJIA index (1896–2015). (a) Power-law dependence of the qth-order structure function $K(q,s)$ on the second-order structure function K2(s) for different q values. (b) The ESS scaling exponent $\xi(q)$ as a function of q. (c) The distributions of normalized volatility $|\Delta{X}(t,s)|/\sigma(s)$ for s  =  1, s  =  2 and s  =  4.

Standard image High-resolution image

Whether a time series possesses monofractal or multifractal behaviors is related to the innovation distributions. For simplicity of notation, we denote $v=|\Delta{X}(t,s)|$ . By definition, we have

Equation (79)

where $\Pr(s;v)$ is the distribution of $v$ , $\sigma(s)$ is the standard deviation of $v$ , $x=v/\sigma$ is the normalized volatility, and $\Phi(s;x)$ is the distribution of x. Eliminating $\sigma(s)$ using q  =  2, it follows that

Equation (80)

If $\Phi(s;x)$ is universal and independent of s, then the last term in equation (80) is a number independent of s (and thus of K2(s)) and the monofractal behavior $\xi(q,2) = q/2$ follows. Figure 5(c) shows the distributions of the normalized volatility $|\Delta{X}(t,s)|/\sigma(s)$ at three different scales. Although the three curves well overlap in their tails, they show clear differences in the bulk around x  =  1. This suggests that the ESS cannot be nonfractal, which is consistent with the presence of nonlinear curvature in figure 5(b).

We note that the ESS analysis is not constrained to the structure functions. Rather, it can be used for the moments of other multifractal analysis approaches. In validating and qualifying the multiscaling behaviour of the recurrence intervals of financial volatility, the relative dependence of recurrence interval moments of different orders of empirical time series has been investigated [273], which is in essence an ESS analysis [274].

2.3. Wavelet transform approaches

2.3.1. Multifractal analysis based on wavelet transform (MF-WT).

The wavelet transform is widely used as a mathematical microscope to analyze time series [275, 276]. In particular, it can be used to analyze the singular structure of fractal and multifractal time series [277279].

The wavelet transform of function $X(t)$ is [275, 276]

Equation (81)

where $b\in{\mathbb{R}}$ is the position parameter, $s \in{\mathbb{R}}^+$ is the dilation parameter, $\psi(t)$ is the analyzing 'mother' wavelet. Gaussian wavelets are widely adopted [280] and are defined as derivatives of the Gaussian function:

Equation (82)

where cn is the normalization coefficient. The second order (n  =  2) Gaussian wavelet is known as the Mexican hat.

To perform wavelet transform on a time series $\{X(i)\}_{i=1}^N$ , one usually discretizes equation (81) in which b can take any value along the sampling spacing and s takes ns scales forming a geometric sequence:

Equation (83)

where $\lambda>1$ . The discretised wavelet transform of a given time series $Y(i)$ is then

Equation (84)

The wavelet transform is used to identify different types of singularities in the signals in the time-scale plane, depending on the order n of derivative of the mother wavelet (82). More generally, choosing a mother wavelet $\psi$ satisfying $\int x^{m+1} \psi(s) {{\rm d}} x = 0$ implies that the wavelet transform is blind to polynomial dependence up to order m and only higher order powers are detected. For instance, for m  =  0, the wavelet transform is independent of the level of the signal and informs essentially on the local slope. For m  =  1, the wavelet transform is independent of the level of the signal and of its local slope, and it informs on the local curvature. We should also mention the existence of orthogonal wavelet transforms [280282] and biorthogonal wavelet transform [283], which can be used directly to determine the Hurst index of a long-range correlated time series [284286].

To investigate the multifractal nature of a time series, one can calculate the qth order moments of the wavelet coefficients [287, 288]

Equation (85)

and we expect that

Equation (86)

We note that Mq(s) might diverge for $q\leqslant0$ because some values of $|W(i,k)|$ might be close or equal to 0.

One can obtain the scaling exponents through ordinary least-squares regression. Or, we can adopt the weighted least-squares regression by putting more weight on the points with smaller variance [286]. Denoting

Equation (87)

the corresponding weight wj is

Equation (88)

and the objective function is

Equation (89)

The estimate of $H(q)$ is determined as follows:

Equation (90)

2.3.2. Wavelet transform modulus maxima (WTMM).

A more elegant approach, which removes the redundancy contained in continuous wavelet transforms, is based on the wavelet transform modulus maxima (WTMM) [284, 288290]. At each scale sj, there exists several local maxima of |W(sj, i)|. These maxima form several maxima lines in the $(i,s)$ plane. The WTMM is capable of detecting all the singularities of a large class of time series [291]. The traditional method based on partition functions may face problems in dealing with the situation of $q\leqslant0$ . However, we will find that the multifractal analysis based on WTMM can overcome this difficulty because the wavelet coefficients on the WTMM line are local maxima that are non 0 [290].

Assume that there are Nj local maxima at scale sj and $W(i_{j,k},s_j)$ is the kth local maximum located at ij ,k. The qth order moments defined on the WTMM are

Equation (91)

or alternatively the partition functions read

Equation (92)

For multifractal time series, we have

Equation (93)

The generalized dimensions and singularity spectrum can then be calculated straightforwardly.

Analogous to the thermodynamic approximation in the direct determination of singularities and singularity spectrum in section 2.1.3, we can define the canonical measure [289],

Equation (94)

such that the singularity strength $\alpha$ is

Equation (95)

and the singularity spectrum $f(\alpha)$ is

Equation (96)

In applications, we use different scales sj and perform (weighted) linear regression on the log–log scales.

Using respectively the partition function approach and wavelet analysis to detect the multifractal nature of a single time series, it is found that $\tau^{{{\rm PF}}}(q)=\tau^{{{\rm WT}}}(q)+q$ and $\alpha^{{{\rm PF}}}(q)=\alpha^{{{\rm WT}}}(q)+1$ [292294].

2.3.3. Wavelet leader (MF-WL).

A more recent multifractal formalism is based on wavelet leaders [295300]. Wavelet leaders, introduced by Jaffard [295, 296], are defined based on discrete wavelet coefficients, which decompose the time series $X(i)$ on the bases $\{\psi_{j,k}\}_{j \in Z, k \in Z}$ composed of discrete wavelets $\psi_{j, k}$ . The integers $j \in Z$ and $k \in Z$ define the scale s  =  2j and location b  =  k2j. The discrete wavelets $\{\psi_{j,k}\}_{j \in Z, k \in Z}$ are space-shifted and scale-dilated templates of an analyzing wavelet $\psi(t)$ :

Equation (97)

which usually uses the Daubechies wavelet of order 1 for the mother wavelet. The discrete wavelet coefficients are defined as

Equation (98)

One defines a dyadic interval $\lambda(\,j, k)$ as

Equation (99)

and denotes the union of interval $\lambda$ and its two adjacent neighbors as $3\lambda$ ,

Equation (100)

The wavelet leader $ \newcommand{\e}{{\rm e}} \ell(\,j, k)$ is defined by [295, 296]

Equation (101)

which corresponds to the largest value of the absolute wavelet coefficients $|W(\,j',k')|$ calculated on intervals $(k-1)2^{\,j} \leqslant 2^{\,j'} k' < (k+2)2^{\,j}$ with $0 < j' \leqslant j$ . Note that, to compute the wavelet leaders, all the fine scales $2^{\,j'} \leqslant 2^{\,j}$ must be considered.

The multifractal analysis based on wavelet leaders can be described as follows [295300]. As usual, we can define the moments of order q by

Equation (102)

where nj is the number of wavelet leaders at scale s  =  2j. One can also expect the following scaling behavior if the underlying processes are jointly multifractal,

Equation (103)

where $\zeta(q)$ is related to $\tau(q)$ by

Equation (104)

We can also provide a canonical approach to determine the singularity strength function $\alpha(q)$ and the multifractal spectrum $f(\alpha(q))$ directly, as for the WTMM case in section 2.3.2.

An interesting application of wavelet leaders is the pointwise wavelet leaders entropy [301], which is defined over different scales (j ) for a given point (k) by

Equation (105)

where $\rho_{j,k}\log_2\rho_{j,k}=0$ if $\rho_{j,k}=0$ and

Equation (106)

When applied to the DJIA index (1928–2011), the resulting wavelet leader entropy is able to identify historical large market fluctuations [301].

Recently, wavelet p -leaders are introduced to characterize the p -exponents of negative pointwise regularity [302]. Based on the wavelet coefficients calculated in equation (98), the wavelet p -leaders are defined by

Equation (107)

The p -leader multifractal formalism relies on the sample moments of the p -leaders [303]:

Equation (108)

It is found that MF-DFA characterizes local regularity via the two-exponent, not the Hölder exponent, and the wavelet p -leader multifractal formalism can be viewed as an wavelet-based extension of MF-DFA [303]. The p -leaders suffer from finite-resolution effects, which can be resolved through an explicit and universal closed-form correction [304].

2.4. Detrended fluctuation approaches

2.4.1. General framework.

Consider a time series $\{X(i)|i =1,2,\cdots,N\}$ , whose data points are usually the cumulative sums of a time series of innovations. Let $\widetilde{X}(t)$ be the local trend function, which is usually determined locally around the data points using some method. The detrended residuals are defined by

Equation (109)

We divide $ \newcommand{\e}{{\rm e}} \{\epsilon(i)\}$ into $N_s=\mathrm{int}[N/s]$ non-overlapping segments with boxes of size s, where $\mathrm{int}[y]$ is the largest integer that is no larger than y . We denote the $v$ th segment as $ \newcommand{\e}{{\rm e}} S_v=\{\epsilon((v-1)s+j)| j=1,2, \cdots, s\}$ . The local detrended fluctuation function $F_v(s)$ in the $v$ th box is defined as the r.m.s. of the detrended residuals:

Equation (110)

The qth-order overall detrended fluctuation is

Equation (111)

where q can take any real value except q  =  0. When q  =  0, we have

Equation (112)

according to L'Hôspital's rule.

When the whole series $ \newcommand{\e}{{\rm e}} \{\epsilon(i)\}$ cannot be completely covered by Ns boxes, one can use 2Ns boxes to cover the series from both ends of the series, where $N_s=\mathrm{int}[N/s]$ and a short part at each end of the residuals may remain uncovered [178]. We denote the $v$ th segment partitioned from left to right as $ \newcommand{\e}{{\rm e}} S_v=\{\epsilon((v-1)s+j)| j=1,2, \cdots, s\}$ for $v=1,2, \cdots, N_s$ and the $v$ th segment partitioned from left to right as $ \newcommand{\e}{{\rm e}} S_v=\{\epsilon(N-(v-N_s)s+j)| j=1,2, \cdots, s\}$ for $v=N_s+1,N_s+2, \cdots, 2N_s$ . The local detrended fluctuation function $F_v(s)$ in the $v$ th box is defined as the r.m.s. of the detrended residuals,

Equation (113a)

for $v=1,2, \cdots, N_s$ and

Equation (113b)

for $v=N_s+1,N_s+2, \cdots, 2N_s$ . The qth-order overall detrended fluctuation is

Equation (114)

where q can take any real value except for q  =  0. When q  =  0, we have

Equation (115)

according to L'Hôspital's rule.

Varying the values of the segment size s, we can determine the power-law relation between the function Fq(s) and the size scale s,

Equation (116)

According to the standard multifractal formalism, the multifractal scaling exponent $\tau(q)$ can be used to characterize the multifractal properties, which reads

Equation (117)

where Df is the fractal dimension of the geometric support of the multifractal measure [178]. For time series analysis, we have Df  =  1. If the scaling exponent function $\tau(q)$ is a nonlinear function of q, the time series is regarded to have a multifractal nature.

Following the approach determining directly $f(\alpha)$ from the partition function [197, 198, 305], the singularity strength and its spectrum in the detrended fluctuation approaches can also be determined directly [306]. Defining the canonical measure $\mu(q,s,v)$

Equation (118)

we have

Equation (119a)

and

Equation (119b)

Numerical experiments based on the p -model and fractional Brownian motions showed that MF-DMA and its direct determination variant have comparative performance to unveil the fractal and multifractal properties [306].

2.4.2. Multifractal detrended fluctuation analysis (MF-DFA).

The detrended fluctuation analysis (DFA) was invented originally to investigate the long-range dependence in coding and noncoding DNA nucleotide sequences [307]. The multifractal extension of DFA, MF-DFA, was developed shortly thereafter [308, 309]. Since the work of Kantelhardt et al [178], MF-DFA has become one of the most important methods for multifractal analysis in diverse fields.

In the MF-DFA method, the trend function is a polynomial [178], and the linear trend is most generally adopted. For each point i, its trend value $\widetilde{X}(i)$ is determined together with other points in the same box $S_v$ , using a polynomial fitting of order l to the s data points in $i\in S_v$ :

Equation (120)

The detrending process for $ \newcommand{\e}{{\rm e}} \ell=1$ is shown in figure 6. The figure shows a case with Ns  <  N/s in which the data points at each end of the time series should not be included to form a box. Otherwise, one obtains incorrect results with $f_{\max}=f(q=0)>1$ , as will be elaborated in section 5.1.2. In the standard procedure implementing MF-DFA, researchers remove the constant shift $\langle{\Delta{X}}\rangle$ from the increments $\Delta{X}$ and work on the cumulative profile

Equation (121)

where $\langle{\Delta{X}}\rangle$ is the sample mean of the $\Delta{X}(i)$ series. However, when the trend function is polynomial, this manipulation does not have any impact on the results.

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Detrending process in the MF-DFA method. The time series has 500 data points and the timescale is s  =  150. The first row shows the detrending process from left to right in which the data points with $i=451, 452, \cdots, 500$ should not be used in calculating F, and the second row illustrates the detrending process from right to left in which the data points with $i=1, 2, \cdots, 150$ should not be used.

Standard image High-resolution image

A subtle issue of the MF-DFA method is the choice of polynomial order. Using classic mathematical models (fractional Brownian motion, Lévy process and binomial measure) and real-world time series (foreign exchange rate and literary texts), Oświęcimka et al found that the calculated singularity spectra could be very sensitive to the order of the detrending polynomial and this sensitivity depends on the analyzed time series [310]. Without a deep understanding of the underlying mechanism driving the dynamics, it is hard to determine which polynomial order should be used. Anyway, for financial time series such as returns, we can check if the value of the Hurst index $H(2)$ is close to 0.5, based on the fundamental stylized fact of the absence of long memory in financial returns [94], which however might not hold at the transaction level [130].

A possible solution is to use 'feasible trend functions' in the multifractal feasibly detrended fluctuation analysis (MFFDFA) [311]. One presets a priori a set of any trend functions, say, $\{ax^2+bx+c, a\sin(x^2)+bx+c, ax^3+bx+c\}$ , and fit each segment with all the preset trend functions. For each segment, only one detrended function is chosen which has the maximum coefficient of determination R2. Hence, for each time scale s, the trend functions for different segments are not necessary the same. Numerical experiments on fractional Brownian motions and binomial measures show that the MFFDFA method outperforms the MF-DFA method [311]. For binomial measures, the improvement is partially due to the fact that the fluctuation functions Fq(s) for MFFDFA have much smaller log-periodic oscillation amplitudes, which reduces the impact of the choice of scaling ranges [312].

2.4.3. Multifractal detrending moving average analysis (MF-DMA).

Based on the moving average (MA) technique for estimating the Hurst exponent of self-affinity signals [313], the detrending moving average (DMA) analysis considers the difference between the original signal and its moving average function [314, 315]. The DMA method can also be easily implemented to estimate the correlation properties of non-stationary series without any a priori assumptions, which is widely applied to the analysis of real-world time series [316323] and synthetic signals as well [324]. Extensive numerical experiments unveil that the performance of DMA is comparable to DFA with slightly different priorities under different situations and both DFA and DMA remain 'The Methods of Choice' in determining the Hurst index of time series [325327].

The DMA method was generalized to MF-DMA for the investigation of the multifractal nature of time series [328]. The main difference between the MF-DFA and MF-DMA algorithms is about the detrending procedure. In the MF-DMA approach, one calculates the moving average function $\widetilde{X}(i)$ in a moving window of size s [321],

Equation (122)

where $\lfloor{y}\rfloor$ is the largest integer not greater than y , $\lceil{y}\rceil$ is the smallest integer not smaller than y , and $\theta\in[0,1]$ is the position parameter. In other words, one considers $\lceil(s-1)(1-\theta)\rceil$ data points in the past and $\lfloor(s-1)\theta\rfloor$ points in the future to determine $\widetilde{X}(i)$ . When i is close to each of the endpoints, the size of the box might be less than s. One can either abandon those points or calculate the moving averages directly. Usually, we consider three special cases, corresponding to three representative versions of MF-DMA:

  • MF-BDMA. The first case $\theta=0$ refers to the backward moving average [325], in which the moving average function $\widetilde{X}(i)$ is calculated over all the past s  −  1 data points of the signal.
  • MF-CDMA. The second case $\theta=0.5$ corresponds to the centered moving average [325], where $\widetilde{X}(i)$ contains half past and half future information in each window. This version is also termed MF-CMA [329].
  • MF-FDMA. The third case $\theta=1$ is called the forward moving average, where $\widetilde{X}(i)$ considers the trend information of s  −  1 data points in the future.

We note that the size of the window used to calculate the moving averages must be identical to the segment size in calculating the detrended functions [328, 329]. The detrending process is illustrated in figure 7.

Figure 7. Refer to the following caption and surrounding text.

Figure 7. Detrending process in the MF-BDBA method ($\theta=0$ ). The time series has 500 data points and the timescale is s  =  150. The time series shown are obtained by truncating the points on both ends from a much longer time series. The first row shows the detrending process from left to right in which the data points with $i=451, 452, \cdots, 500$ should not be used in calculating F, and the second row from right to left in which the data points with $i=1, 2, \cdots, 150$ should not be used.

Standard image High-resolution image

2.4.4. Other detrending algorithms.

Although determining the intrinsic trend embedded in non-stationary time series is difficult, there are many different options [330]. Because the intrinsic trend of a time series is usually unknown, methods have different performances for different types of time series. Hence, quite a few variants of MF-DFA and MF-DMA have been proposed. Local trends can be further classified into discontinuous ones and continuous ones. MF-DFA and MF-DMA belong respectively to these two classes (see also figures 6 and 7).

Zhou and Leung proposed the multifractal temporally weighted detrended fluctuation analysis (MF-TWDFA) in moving windows [331]. For each i, they fit a linear function to the data points $(X(i-s), \cdots, X(i), \cdots, X(i+s))$ in the moving window of size 2s  +  1, using weighted least-squares regression. The objective function is

Equation (123)

where $|i-j|\leqslant{s}$ , wij's are weights that decrease with increasing $|i-j|$ . In other words, the deviation $\left[X-a_0-a_1\times(i+j)\right]^2$ has a larger weight when j  is far from i. The weights wij in MF-TWDFA are defined by [331]

Equation (124)

For the equally weighted case,

Equation (125)

we obtain the multifractal moving window detrended fluctuation analysis (MF-MWDFA) [331]. These two methods belong actually to the continuous class, since the trend changes mildly without jumps. Indeed, when l  =  0, we recover the centered MF-DMA method [331].

The empirical mode decomposition (EMD) is an innovative method of analysis for nonlinear and non-stationary time series, which decomposes the time series $\{X((v-1)s+i)\}_{i=1}^s$ into a finite number of intrinsic mode functions (IMFs) [332]. Let $ \newcommand{\tr}{{\rm tr}} \{x(i)\}_{i=1}^s\triangleq\{X((v-1)s+i)\}_{i=1}^s$ . The decomposition is a sifting process, which has six steps [330]: (1) Identify all extrema of $x(i)$ ; (2) Interpolate the local maxima to form an upper envelope $U(i)$ ; (3) Interpolate the local minima to form a lower envelope $L(i)$ ; (4) Calculate the mean envelope:

Equation (126)

(5) Extract the mean from the signal

Equation (127)

(6) Check whether Ck(i) satisfies the IMF conditions. If the IMF conditions are fulfilled, Ck(i) is an IMF, and the sifting stops. Otherwise, let x(i)  =  Ck(i) and keep sifting. Finally, we obtain the EMD-based local trend of the time series segment $\{X((v-1)s+i)\}_{i=1}^s$ :

Equation (128)

Qian et al used this local trend in the MF-DFA method and developed the EMD-based MF-DFA method [333]. Numerical experiments on multifractal binomial measures indicate that the EMD-based MF-DFA method has comparable performance with respect to the MF-DFA method except that the EMD-based method performs slightly better for small singularities [333]. The method has been applied to analyze stock market indices [333, 334], foreign exchange rates [335], and certified emission reduction (CER) and European Union Allowances (EUA) futures prices [336], all of which confirm the presence of multifractality in financial returns.

In order to handle time series with highly non-stationary sinusoidal and polynomial overlying trends, Horvatic et al proposed to detrend with polynomials of varying orders [337]. In this method, the polynomial order $ \newcommand{\e}{{\rm e}} \ell$ is not fixed but increases with the scale s and the detrended fluctuations at different scales should be normalized to secure an identical intercept. Although an objective criterion for the determination of the varying order is lacking, a subjective choice ($ \newcommand{\e}{{\rm e}} \ell=1$ for s  =  4, $ \newcommand{\e}{{\rm e}} \ell=2$ for s  =  6, $ \newcommand{\e}{{\rm e}} \ell=4$ for s  =  10, $ \newcommand{\e}{{\rm e}} \ell=8$ for s  =  18, and $ \newcommand{\e}{{\rm e}} \ell=16$ for s  >  18) seemingly works well [337]. The underlying logic is that we need high-order polynomials to detrend properly the oscillations.

Du et al constructed the dual-tree complex wavelet transform based multifractal detrended fluctuation analysis (DTCWT-MFDFA) for non-stationary time series [338]. Based on the anti-aliasing and nearly shift-invariance of the dual-tree complex wavelet transform [339, 340], the raw time series is decomposed through the pyramid algorithm to extract the scale-dependent trends and the fluctuations from the wavelet coefficients. The length of the non-overlapping segment on each time scale is computed through the Hilbert transform and each of the extracted fluctuations is partitioned into non-overlapping segments with identical size. The detrended fluctuation function for each segment can be calculated. The DTCWT-MFDFA method is found to outperform the MF-DFA method for multifractal binomial measures [338].

In order to perform the MF-DFA, Manimaran et al proposed to detrend using discrete wavelet transform [341]. The discrete wavelets belonging to the Daubechies family are adopted to analyze the raw time series up to level s. The resulting low-pass coefficients capture the polynomial trends in the data. One then reconstructs a time series using these low-pass coefficients to represent the local trends at different scales. The method has been tested for synthetical time series and applied to NASDAQ and BSE indices [342, 343]. Ghosh et al used Db4 and Db6 basis sets to isolate respectively the local linear and quadratic trends at different scales from the closing prices of fifteen large-capitalization companies listed on the New York Stock Exchange (NYSE) [344]. Multifractality is observed in the investigated stock indices and individual stocks.

We note that other detrending methods used to preprocess the raw time series in section 5.3.6 can also be used in the local detrending at different scales to construct variants of MF-DFA.

2.5. Other methods

In this section, we will review several other multifractal analysis methods. It should be keep in mind that different multifractal analysis methods have been proposed in diverse fields and it is hard to mention all of them. We focus on the methods that are relevant to those discussed in the previous sections and that are utilized for financial time series analysis. Besides the ones presented below, we would like to mention the multifractal signal fluctuation analysis based on the 0–1 test [345] and the focus-based multifractal formalism [346].

2.5.1. Multiplier method.

The concept of the multiplier was introduced by Novikov to describe the intermittency and scale self-similarity in turbulent flows [164]. In econophysics, one can apply this concept to study volatility multipliers. For high-frequency financial data, we first construct an additive measure in the time interval $[t_1,t_2]$ , which is the sum of absolute returns:

Equation (129)

The absolute return is actually a measure of the volatility on $[t_1,t_2]$ [85]. The volatility time series $|r(t)|$ is divided into boxes of identical size s. Each mother box is further partitioned into a daughter boxes, in which a is called a base. The multiplier ma,s is determined by the ratio of the measure on a daughter box to that on her mother box [347, 348].

If the multiplier is scale invariant, for any two bases a and b, the multiplier density functions $p_a(m_a)$ and $p_b(m_b)$ are related through Mellin transform $\mathbb{M}$ as follows [348]:

Equation (130)

which leads to

Equation (131)

The mass scaling exponent $\tau(q)$ of the moment of ma can be obtained as [347, 348]

Equation (132)

where D0 is the fractal dimension of the support of the measure. For time series, we have D0  =  1.

The local singularity exponent $\alpha$ and its spectrum $f(\alpha)$ are related to $\tau(q)$ through the Legendre transform. It follows that

Equation (133)

and

Equation (134)

Equations (131)–(134) imply that, for each q, each of these characteristic multifractal quantities converges to a constant in the scaling range and is independent of the base a.

Note that, for financial volatility, q  >  −1 [349], which is quite analogous to the situation in turbulence [267]. Because the returns can be zero, the multiplier m can be zero and the probability density $p_a(0)\neq0$ at ma  =  0. When m is smaller than a small number $\delta$ , $p(m)=p(0)$ is a constant. For a given small $\delta$ , $\int_{\delta}^1m^qp(m)dm = C$ is finite. However, the moment

Equation (135)

diverges when $q\leqslant-1$ . This property is verified empirically [349], based on the idea that the integrand $m_a^qp_a(m_a)$ diverges for a given q [268, 350].

The scale-invariant multiplier distribution is perceived to be more fundamental than the standard multifractal spectrum $f(\alpha)$ [347, 348]. Moreover, it needs exponentially less computational time and is more accurate than the partition function approach [347, 348].

Using 1 min high-frequency data of the S&P 500 index, Jiang and Zhou reported that the volatility multiplier is scale invariant and the volatility has the property of multifractality [349]. Niu and Wang confirmed the presence of scale invariance in the multiplier distributions in the Shanghai SSEC index and in the simulated data from the model of 'voter interacting dynamic system' [351].

2.5.2. Multifractal Hilbert–Huang spectral analysis.

The Hilbert–Huang transform combines the EMD and the Hilbert spectral analysis [332, 352], in which the Hilbert transform is applied to each component Ck(i) in equation (127). The overall trend can be expressed as

Equation (136)

where ${\mathbf{j}}^2=-1$ . For each mode, the Hilbert–Huang spectrum is defined as the squared amplitude

Equation (137)

The Hilbert–Huang marginal spectrum of the original time series is then written as

Equation (138)

which corresponds to an energy density at frequency $\omega$ . This method is known as EMD-based Hilbert spectral analysis (EMD-HSA), which we term the Hilbert–Huang spectral analysis (HHSA) for simplicity.

Huang, Schmitt, Lu and Liu proposed the multifractal Hilbert–Huang spectral analysis (MF-HHSA) by generalizing the Hilbert–Huang transform from the second order to arbitrary order [353, 354]. The Hilbert–Huang spectrum $H(\omega,i)$ represents the original signal at the local level and can be used to define the joint probability density function $p(\omega,\mathcal{A})$ of the frequency $\omega_k$ and amplitude $\mathcal{A}_k$ . The Hilbert–Huang marginal spectrum in equation (138) can be rewritten as

Equation (139)

which corresponds to the second-order moment. Huang et al generalize equation (139) into arbitrary-order moments

Equation (140)

where $q \geqslant 0$ and $h(\omega)=\mathcal{L}_2(\omega)$ . In the inertial range, we assume the following scaling relation,

Equation (141)

where $\tau(q)$ is the corresponding scaling exponent function in the amplitude-frequency space.

Li and Huang applied this MF-HHSA method to the CSI 300 index of the Chinese stock market and confirmed the presence of multifractality [355].

2.5.3. EMD-based multifractal methods.

Welter and Esquef proposed an EMD-based dominant amplitude multifractal formalism (EMD-DAMF), which is implemented as follows [356]:

  • Step 1:  
    Perform EMD on $X(i)$ to obtain K IMFs Ck(i).
  • Step 2:  
    Determine the amplitude ak(i) and characteristic timescale s for Ck(i). The characteristic timescale s could be estimated in each mode by the Hilbert spectral analysis (HSA) as the reciprocal of the mean frequency, or simply by twice the mean time interval among all adjacent zero crossings.
  • Step 3:  
    Determine the dominant amplitude coefficients
    Equation (142)
    where $k=1,2,\cdots,K$ , $j=1,2,\cdots, n_k$ , nk is the number of local maxima of ak(i), and Ik,j is a time support around the j th maxima of ak(i).
  • Step 4:  
    Calculate the qth order moment for different q values:
    Equation (143)
    where the characteristic time scale sk can be obtained by HSA in each mode as the reciprocal of the mean (linear) frequency or simply by the mean time interval among all adjacent zero crossings.
  • Step 5:  
    Estimate the scaling function
    Equation (144)

In real applications, Welter and Esquef suggested that the number of siftings is fixed to ten (that is, K  =  10) and signal envelopes are computed through cubic spline interpolation [356]. They validated the method with fractional Brownian motions [357], Lévy processes [358], Multifractal random wavelet cascades [359, 360], and multifractal multiplicative cascades from the p  model [361]. Note that the EMD-DAMF method has a small number nk of amplitude maxima at larger scale modes and a limited number of scales [356]. The latter drawback might impact the determination of the scaling range.

Alternatively, Zhou et al proposed the so-called EMD-DFA method [362]. The integrated time series $X(i)$ is decomposed into a set of intrinsic mode functions and a residual or trend component. All intrinsic time scales s are identified by computing the number of data points between each neighboring local minima throughout each intrinsic mode function (IMF). For a given s, all IMFs are resampled through replacing by 0's all data points corresponding to scales not equal to s and the fluctuation time series Xs(i) is obtained by summing all resampled IMFs. The overall fluctuation at scale s is the root mean square of Xs(i):

which produces better scaling than the DFA method [362]. The EMD-DFA method can be extended for multifractal time series by working on the q-order moments:

Equation (145)

Note that the EMD-DFA method does not have an explicit detrending form like equation (109).

2.5.4. Multiscale multifractal analysis.

Gieraltowski et al proposed the multiscale multifractal analysis (MMA) based on the MF-DFA method [363]. After calculating all values of the overall fluctuations Fq(s) by the MF-DFA method, rather than estimating $H(q)$ from equation (116) in the scaling range, they proceeded to estimate the local generalized Hurst exponents $H(q,s)$ in moving windows around different scales. In this method, two free parameters are introduced, the moving window size and the step. Although the original MMA method is designed for MF-DFA, it can obviously be applied to other methods of multifractal analysis.

The MMA method has been applied to investigate the multifractal nature of daily returns of the Dow Jones Industrial Average (DJIA), New York Stock Exchange Index (NYSE), S&P 500 Index, Heng Seng Index (HSI), Shanghai Stock Exchange Composite Index and Shenzhen Stock Exchange Component Index (SZCI) from 12 May 1992 to 8 May 2012 [364, 365] and eight stock market indices (CAC 40, DAX, FTSE 100, HSI, NIKKEI 225, SSEC, NASDAQ, S&P 500) from 3 January 2000 to 1 October 2014 [366].

More generally, this method is equivalent to calculate the local logarithmic slopes of the Fq(s) functions:

Equation (146)

which is widely utilized in the study of structure functions in turbulence [265, 268], as well as in DFA [367]. The idea to estimate local logarithmic slopes is especially informative when the scaling range is narrow and it can serve as a method for the determination of scaling ranges. Different numerical methods can be adopted to estimate numerically the local logarithmic slopes $H(q,s)$ and the linear fitting in a moving window suggested in the MMA method is one of them. Empirical analyses have been performed on financial time series, such as the WTI crude oil market [368] and stock market indices [369], without referring to the name of multiscale multifractal analysis.

2.5.5. Time-varying multifractal analysis.

To quantify the evolution of a multifractal spectrum, one can perform a multifractal analysis of time series in moving windows or rolling windows. We would like to stress that short window sizes will reduce the estimation accuracy of multifractal quantities and increase the uncertainty of the results. Time-varying multifractal quantities have practical implications, as discussed in section 8.2.4.

Alternatively, Xiong and Zhang proposed the time-varying multifractal spectrum distribution (TM-MFSD) [370]. The key idea is to perform multifractal analysis on the instantaneous autocorrelation function

Equation (147)

rather than the raw time series $X(i)$ , where the time-delayed conjugation X*(i) of the analyzed series $X(i)$ is selected as the window function. To obtain the time-varying multifractal spectrum distribution of a time series, one can adopt different multifractal analysis methods described in previous sections, such as WTMM [370], wavelet leaders [371, 372], MF-DMA [373], and MF-DFA [374]. Essentially, the multifractal analysis is performed with these methods on moving windows.

2.5.6. Phase space reconstruction.

A classic method in nonlinear dynamics is to embed properly the time series into a phase space and estimate the generalized dimensions Dq of the data points in the phase space [167, 188, 189, 375], which is however less applied in econophysics. Based on the delay embedding technique [376], the time series $X(i)$ can be converted into a sequence of vectors:

Equation (148)

where $j=1,\cdots,N_v$ , $N_v$ is the number of points (or vectors) in the phase space, $\tau$ is a suitably chosen time delay, and d is the embedding dimension of the phase space. The embedding dimension d and the delay $\tau$ can be determined in different ways [189, 377381].

The relative number of data points within a distance R to point i is calculated by

Equation (149)

where ${\mathcal{H}}$ is the Heaviside function. The generalized correlation integral Cq(s) is [382]

Equation (150)

which is the generalization of the correlation integral [167, 188, 189]. The generalized dimensions are given by [375]

Equation (151)

Lee investigated the multifractal characteristics of the 1 min volatility of the Korea composite stock price index KOSPI from 20 March 1992 to 28 February 2007 [383]. The volatility time series is first detrended with a wavelet filter and then a multifractal analysis in the phase space with $\tau=1$ and d  =  1 is performed to confirm the presence of multifractality.

2.5.7. Asymmetric multifractal analysis.

Inspired by the stylized fact that the correlation between two assets is usually asymmetric [384387], Alvarez-Ramirez et al developed the asymmetric multifractal detrended fluctuation analysis (A-MF-DFA) to detect asymmetric multifractal scaling in individual time series [388]. In this method, the trend function is linear such that ai  =  0 for all i  >  1 in equation (120). The main idea is to calculate the upwards (downwards) fluctuation functions in which the removed linear trends have non-negative (negative) slopes. Mathematically, the qth-order upwards overall detrended fluctuation is

Equation (152)

and the qth-order downwards overall detrended fluctuation is

Equation (153)

where $k_v=1$ if the slope of the linear trend (120) is nonnegative, that is $a_1\geqslant0$ , and $k_v=-1$ if the slope a1  <  0. When q  =  0, we apply L'Hôspital's rule as in equation (115). The respective generalized Hurst exponents can be determined by

Equation (154)

Note that the original A-MF-DFA method uses the mean absolute value of the residuals in each segment $v$ [388].

Empirical studies using the A-MF-DFA method investigated and confirmed the asymmetric multifractal behavior in the daily returns of the DJIA index from 23 May 1980 to 25 August 2008 [389], the WTI oil price from 2 January 1986 to 14 December 2010 [389], the SSEC index from 19 December 1990 to 27 April 2012 and the SZCI index from 2 April 1991 to 27 April 2012 [390], and the DJIA, NASDAQ, NYSE, and S&P500 indices from 1 January 1991 to 31 December 2015 [391].

The A-MF-DFA method can be easily extended to design the asymmetric multifractal detrending moving average analysis (A-MF-DMA), in which moving averages serve as the trend function and the direction (or 'sign') is determined exactly as for the A-MF-DFA method [392]. The asymmetric multiscale detrended fluctuation analysis is obtained by calculating the local exponents at different time scales, which has been applied to investigate the hourly returns of the California electricity spot prices during years 1999 and 2000 [393].

Cao et al developed the asymmetric multifractal detrended cross-correlation analysis (MF-ADCCA) method [394], as an asymmetry extension of the MF-X-DFA method (see section 3.4). The method has been applied to identify asymmetric cross-correlations between the daily returns of the SSEC index and six foreign exchange rates from 22 July 2005 to 13 January 2012 [394], the 5 min returns of the CSI 300 index spot and futures returns from 30 June 2011 to 7 June 2013 [395], the daily returns of the gold spot prices in the Shanghai Gold Exchange (Au 99.95) and the New York Gold Exchange from 2 January 2003 to 27 April 2012 [396], the EU ETS for carbon emission trading products versus energy futures—Brent crude oil (Brent), Richards bay coal (Coal), UK base electricity (Electricity) and UK natural gas (Gas)—from 1 January 2008 to 31 December 2012 [397], the daily returns of the WTI price and eight foreign exchange rates (CAD, MXN, NOK, GBP, JPY, AUD, EUR and KRW) against the USD from 4 January 2000 to 31 December 2014 [398], the Shanghai SSEC and four other indices (US S&P 500, Germany DAX, India BSESN, and Brazil BVSP) from 1 January 2002 to 26 September 2014 [399], the WTI crude oil prices and eight stock merket indices (Brazil IBOVESPA, Chile IPSA, Argentina MERVAL, Mexico IPC, Colombia COLCAP, Peru SPBVL, S&P500 and DJIA) from the 28 January 2005 to 29 December 2016 [400], and the daily data of Bitcoin till 25 August 2017 [401].

Chen and Zheng proposed the asymmetric joint multifractal analysis based on partition functions [402], as an extension of the MF-X-PF method (see section 3.1). This extension becomes feasible for financial volatility because one can use the associated returns to define the signs in the segments.

2.5.8. Multifractal diffusion entropy analysis.

The diffusion entropy analysis (DEA) for time series $\{X(i)\}_{i=1}^N$ is based on the Shannon entropy [403406], which can be determined by taking asset prices as an example. For a given diffusion time or time scale s, we calculate the return series

which denotes the total diffusion distance of s jump steps. For each s, the values of $\{\Delta{X}(i,s)\}_{i=1}^N$ are covered by Ns non-overlapping intervals of length $ \newcommand{\e}{{\rm e}} \epsilon(s)$ . The interval length $ \newcommand{\e}{{\rm e}} \epsilon(s)$ is chosen as a fraction of the square root of the variance of the fluctuations $\{\Delta{X}(i,1)\}$ , which is independent of the time scale s. We count the number Nj(s) of the returns $\Delta{X}(i,s)$ that fall in the j th interval and the probability (or frequency) by means of

The Shannon entropy of the diffusion process is determined as

The scaling behavior is characterized by

where $\delta$ is the scaling exponent.

For fractional Brownian motions, the DEA scaling exponent is equal to the Hurst index, that is, $\delta=H$ [405], which does not necessary hold for other processes such as Lévy processes [405, 406]. For financial assets, it seems that one should work on returns to obtain approximately $\delta=H$ ; otherwise, $\delta$ will be greater than 0.9 [407409]. There is also an evident finite-size effect that the entropy $I(s)$ deviates from the 'theoretical' value for small scales and converges to the theoretical value for large s [406]. It is possible to overcome this drawback using the idea of integrated moments [410] (see section 5.1.5 for details).

Using DEA, Palatella et al have provided very interesting discussions of multifractality, non-linear time correlation and fat tails for financial market activities [69, 70]. An alternative generalization of DEA for multifractal analysis is to replace the Shannon entropy by the Rényi entropy expressed in equation (20), leading to the multifractal diffusion entropy analysis (MF-DEA), whose main output is a spectrum of scaling exponents $\delta(q)$ [411]. Analogous to the multiscale multifractal analysis based on the MF-DFA method [363] (see section 2.5.4), the multiscale multifractal diffusion entropy analysis can be designed [412]. We notice that $\delta(q)$ is not related directly to the well-known generalized dimensions Dq. Hence, the MF-DEA method unveils multifractality in certain mappings of the original time series, but not in the original time series per se.

We should however mention that the MF-DEA method has some drawbacks [413], as all approaches have weaknesses accompanying their positive properties. First, the MF-DEA method for negative values of q needs to be re-defined. Second, in the DEA or MF-DEA method, the estimation of the probabilities pj(s) has a crucial impact on the estimation of the Rényi entropy and consequently the $\delta(q)$ spectrum [414]. As a result, an optimal choice of $ \newcommand{\e}{{\rm e}} \epsilon(s)$ is required [414]. Third, in general, the MF-DEA method does not allow one to investigate correlations between extremely small and extremely large fluctuations. Fourth, the Rényi entropy has a particular property of conserving the scaling term.

2.5.9. Symbolization presentation and Rényi dimensions.

Xu and Beck applied symbolic dynamics techniques to financial returns to determine the Rényi dimensions or the generalized dimensions Dq [415]. There are different symbolization techniques that can convert a financial time series $\{X(i)\}_{i=1}^N$ into a symbol sequence. Li and Wang proposed an approach by considering the speed of price changes, which is quantified by an angle [416, 417],

Equation (155)

which falls within the interval $[-90^{\circ},90^{\circ}]$ . To symbolize the raw time series, we partition $[-90^{\circ},90^{\circ}]$ into 2n subintervals, which are usually almost symmetric with respect to $\theta=0^{\circ}$ . Li and Wang considered four subintervals $(-90^{\circ},-45^{\circ})$ , $[-45^{\circ},0^{\circ})$ , $[0^{\circ},45^{\circ})$ , and $[45^{\circ},90^{\circ})$ and four corresponding symbols D, d, r and R, representing respectively fast falls, slow falls, slow rises and fast rises of prices.

The symbolization method of Xu and Beck is similar [415]. They suggested to partition the return domain $\Delta{X_i}\in(-1,1)$ into 2n subintervals, mapped into 2n symbols. When n  =  1, the two subintervals are $(-1,0)$ and $[0,\infty)$ and two corresponding symbols d and u, representing respectively falling down and rising of the prices. When n  =  2, the four subintervals are (−1, −  c1), [−c1, 0], [0, c2) and $[c_2,\infty]$ , where $c_1\in(0,1)$ and $c_2\in[0,\infty]$ are parameters. Note that Xu and Beck used open intervals, which may discard some data points. In addition, a return cannot be less than  −1. For stock markets with price limit rules, the subintervals should be adjusted accordingly.

For any manner of symbolization, if $\theta_i$ or $\Delta{X}_i$ is an element of a certain subinterval, the ith symbol Si is determined by the corresponding symbol. In this way, the raw time series is mapped to a symbol sequence {Si}. The symbol sequence is divided into Nk segments of equal length k. For any given k, we obtain $\omega(k)=(2n){}^k$ allowed symbol subsequences or configurations. The occurrence probability of each configuration Cj of length k can be determined, denoted as $p_j = p(C=C_j)$ . Xu and Beck recalled the well-known Rényi information [191]

Equation (156)

and the Rényi dimension

Equation (157)

where $\varepsilon=(2n){}^{-k}$ .

The method has been applied to the daily and 1 min price time series of several US stocks (Alcoa, Bank of America, General Electric, Intel, Johnson & Johnson, Coca Cola and WalMart) from January 1998 to May 2013 and the generalized dimensions were calculated and compared for different values of k [415].

3. Joint multifractal analysis

3.1. Joint multifractal analysis based on partition functions (MF-X-PF)

A complex system usually contains two or more distinct multifractal measures that are distributed simultaneously on the same geometric support. In turbulent flows, the velocity, temperature, and concentration fields are embedded in the same space as joint multifractal measures [305], in which the scaling behavior of the joint moments $\langle[m_x(s,i)]^{\,p/2}[m_y(s,i)]^{q/2}\rangle$ of two multifractal measures mx and my over box size s is investigated. We can also call this method the multifractal cross-correlation analysis based on the partition function approach (MF-X-PF) [418]. In the original framework, the orders are p  and q rather than p /2 and q/2. When considering the degenerate case of $m_x=m_y$ and p   =  q, the joint moment becomes $\langle[m_x(s,i)]^{q}\rangle$ , which recovers exactly the partition function approach presented in section 2.1. The case of p   =  q has been investigated in [419]. There are also mathematical treatments of the bivariate case [420, 421] and the general case of many measures [422, 423].

In natural science, the joint multifractal analysis has also been applied to study the joint multifractal nature between topographic indices and crop yield in agronomy [424, 425], nitrogen dioxide and ground-level ozone [426], wind patterns and land surface air temperature [427], and temperature and nitrogen dioxide [428].

3.1.1. General formalism for arbitrary $ {(\,p,q)} $ pairs.

For the integrated measures mx(s,i) and mx(s,i) in the ith box of size s, the local singularity strengths $\alpha_x$ and $\alpha_y$ are defined as [418]

Equation (158)

Denoting $N_s(\alpha_x,\alpha_y)$ as the number of boxes needed to cover the neighborhoods of points with singularities $\alpha_x$ and $\alpha_y$ , the fractal dimension of the set is determined according to

Equation (159)

in which $f(\alpha_x,\alpha_y)$ is the joint distribution of the two singularities [305], or the joint multifractal spectrum.

Through the joint partition function [418]

Equation (160)

we can obtain the joint mass exponent function $\tau_{xy}(\,p,q)$ as

Equation (161)

Inserting equation (158) into the joint partition function, rewriting the sum into a double integral over $\alpha_x$ and $\alpha_y$ , and then applying the steepest descent approach to estimate the integral at small s values, we have

Equation (162)

where

Equation (163)

Taking the partial derivative of equation (162) over p  or q, we have

Equation (164)

We can then obtain the double Legendre transforms:

Equation (165a)

and

Equation (165b)

From the canonical perspective, we can obtain the $f(\alpha_x,\alpha_y)$ function directly by defining two canonical measures

Equation (166)

The two singularity strengths $\alpha_x(\,p)$ and $\alpha_x(\,p)$ and the joint multifractal spectrum fxy(p ,q) can be computed by linear regressions in log–log scales using the following equations:

Equation (167a)

Equation (167b)

Equation (167c)

The joint mass exponent function can be obtained by using equation (162).

3.1.2. Analytical results for binomial measures.

Xie et al derived analytical results for binomial measures with multipliers px and py [418]. They found that the two integrated measures mx(s,i) and my(s,i) in boxes of size s  =  2l are related explicitly by

Equation (168)

where 2L is the length of the time series,

Equation (169)

and

Equation (170)

When $p_x+p_y=1$ , we have $\beta=-1$ . When $p_x=p_y$ , we have $\beta=1$ and $C(s)=1$ . When both px and py are greater than 0.5 or less than 0.5, that is, $(\,p_x-0.5)(\,p_y-0.5)>0$ , we have $\beta>0$ ; Otherwise, when $(\,p_x-0.5)(\,p_y-0.5)<0$ , we have $\beta<0$ .

The joint mass exponent function is expressed as follows [418]

Equation (171)

where

Equation (172)

and

Equation (173)

It follows that

Equation (174)

and

Equation (175)

We obtain immediately the relationship between $\alpha_x$ and $\alpha_y$ ,

Equation (176)

which shows that $f_{xy}(\alpha_x,\alpha_y)$ is a curve along this line rather than a surface and the line segment (176) is the projection of $f_{xy}(\alpha_x,\alpha_y)$ onto the $(\alpha_x,\alpha_y)$ plane.

Xie et al obtained that [418]

Equation (177)

such that the width of the singularity spectrum of $\alpha_y$ is

Equation (178)

When py  =  0.5, $\Delta \alpha_y=0$ . In this case, the measure is neither multifractal nor monofractal since it is uniformly distributed on the support. Similarly, we have

Equation (179)

Further, the multifractal spectrum $f_{xy}(\alpha_x,\alpha_y)$ can be expressed as [418]

Equation (180)

It follows immediately that

Equation (181)

where $ \newcommand{\tr}{{\rm tr}} f_{xy}(Q)\triangleq f_{xy}(\alpha_x,\alpha_y; Q)$ . It implies that $f_{xy}(\alpha_x,\alpha_y)$ is symmetric with respect to the line Q  =  0. In addition, it is found that

Equation (182)

When Q  <  0, ${{\rm d}}f_{xy}(Q)/{{\rm d}}Q>0$ so that fxy(Q) is a monotonically increasing function of Q. When Q  >  0, ${{\rm d}}f_{xy}(Q)/{{\rm d}}Q>0$ so that fxy(Q) is a monotonically decreasing function of Q. Therefore, the maximum of fxy(Q) is 1 and its minimum is 0.

3.1.3. The case p   =  q.

The multifractal cross-correlation analysis based on statistical moments (MFSMXA) proposed in [419] is actually a special case of MF-X-PF$(\,p,q)$ when p   =  q. We call it MF-X-PF$(q)$ for consistence [418]. In this case, we have

Equation (183)

where

Equation (184)

Taking the derivative of equation (183) over q and applying equation (184), we have

Equation (185)

Defining that

Equation (186)

we have

Equation (187a)

and

Equation (187b)

which is the Legendre transform.

It is derived that [418]

Equation (188)

and

Equation (189)

We define the generalized joint Hurst exponent Hxy(q) as

Equation (190)

It follows that

Equation (191)

where Hx(q) and Hy(q) are the generalized Hurst exponent of mx and my. These relations were observed numerically using the MF-X-DFA method [180], the MF-X-DMA method [144] and the MF-X-PF method with p   =  q [419].

If we can define the canonical measures

Equation (192)

the two singularity strengths $\alpha_x(\,p)$ and $\alpha_x(\,p)$ and the joint multifractal spectrum fxy(p ,q) can be computed directly:

Equation (193a)

where equations (167a) and (167b) are used in the second equality, and

Equation (193b)

The joint mass exponent function can be obtained by using equation (187b).

Xiong and Shang proposed the weighted MF-X-PF$(q)$ method (W-MFSMXA) [429]. Their numerical experiments on two-exponent ARFIMA processes, binomial measures and NBVP time series illustrate that W-MFSMXA slightly outperforms MF-X-PF$(q)$ for relatively short time series.

3.2. Joint structure function analysis (MF-X-SF)

3.2.1. General formalism for arbitrary $ {(\,p,q)} $ pair.

The joint multifractal analysis based on the structure function approach, termed MF-X-SF, also has its root in turbulence, in which the joint structure function of the velocity and temperature fluctuations was introduced to study the cross-correlation between the temperature and velocity dissipation fields in a heated turbulent jet [430, 431]. For two self-similar time series $X(i)$ and $Y(i)$ with $i=1,2,\cdots,N$ , the $(\,p,q)$ -order joint structure function is defined as

Equation (194)

where $\Delta{X(i,s)}=X(i)-X(i-s)$ and $\Delta{Y(i,s)} \,= $ $ Y(i)-Y(i-s)$ . The special case of p   =  1 is called by some researchers as the analogous multifractal cross-correlation analysis (AMF-XA) [432], in which the authors were seemingly unaware of previous works [430, 433]. Because the innovations $\Delta{X}$ and $\Delta{Y}$ could be negative, the structure functions Kxy(p ,q,s) are usually defined on positive integers of p  and q. Therefore, it is more convenient to use absolute innovations to define the joint structure function

Equation (195)

which is widely adopted in the structure function approach, as well as in the multifractal height cross-correlation analysis (MFHXA) [433]. If there is a scaling relation between the joint structure function Kxy(p ,q,s) and the scale s, we can define the scaling exponent function $\tau_{xy}(\,p,q)$ as

Equation (196)

which is an analog of the MF-X-PF$(\,p,q)$ approach.

We further define the singularity functions $\alpha_x(\,p,q)$ and $\alpha_y(\,p,q)$ as the partial derivatives of $\tau_{xy}(\,p,q)$ with respect to p  and q, respectively, as in equation (165a). The singularity spectrum function $ \newcommand{\tr}{{\rm tr}} f_{xy}(\,p,q)\triangleq f_{xy}(\alpha_x,\alpha_y)$ can be obtained as in equation (165b). We note that it is hard to define a generalized Hurst index Hxy(p , q) in a consistent way. The joint structure function approach has been considered for the multifractal random walks [434].

3.2.2. The case p   =  q.

A special case of the MF-X-SF$(\,p,q)$ approach when p   =  q was proposed by Kristoufek, which was termed the multifractal height cross-correlation analysis [433]. For consistency, we call it the MF-X-SF$(q)$ approach. In this case, the qth order joint structure function is defined by

Equation (197)

If there is a scaling relation between the joint structure function Kxy(p , q, s) and the scale s, we can define the scaling exponent function $\tau_{xy}(q)$ as

Equation (198)

where Hxy(q) are the generalized bivariate Hurst exponents or the generalized joint Hurst exponents. Under certain conditions, we have [433]

Equation (199)

We define the joint singularity function and its spectrum through the Legendre transform:

Equation (200a)

and

Equation (200b)

If equation (199) holds, we have

Equation (201)

according to the definitions that $\tau_x(q)=qH_x(q)-1$ , $\tau_y(q)=qH_y(q)-1$ and $\tau_{xy}(q)=qH_{xy}(q)-1$ , and

Equation (202)

according to the definitions that $\alpha_x(q)={{{\rm d}}}\tau_x(q)/{{{\rm d}}}q$ , $\alpha_y(q)={{{\rm d}}}\tau_y(q)/{{{\rm d}}}q$ and $\alpha_{xy}(q)={{{\rm d}}}\tau_{xy}(q)/{{{\rm d}}}q$ . It follows from equation (200b) that

Equation (203)

3.3. Multifractal wavelet coherence analysis (MF-WCA)

3.3.1. Cross wavelet transformation and wavelet coherence analysis.

The cross wavelet transform (XWT) of two time series $\{X_i\}_{i=1}^N$ and $\{Y_i\}_{i=1}^N$ was introduced by Hudgins et al to study the highly intermittent pattern in atmospheric turbulence [435], which is defined by

Equation (204)

The cross-wavelet power is

Equation (205)

The wavelet coherence $\rho_{xy}^2(s)$ is defined as

Equation (206)

The wavelet coherence is able to uncover the co-movement between two time series in the time-frequency domain, which is widely applied in finance and economics [436438].

3.3.2. MF-X-WT.

Similar to the MF-X-PF method, Jiang et al designed the multifractal cross wavelet analysis [439]. It generalizes the concept of wavelet coherence and constructs the following joint partition function:

Equation (207)

The definition of the partition function allows us to uncover the more intricate relationship between the coherency and the scale under different scopes, which corresponds to a cross-multifractal behavior. If the underlying processes are jointly multifractal, the result is a scaling behavior:

Equation (208)

where ${\mathcal{T}}_{xy}(\,p, q)$ is the joint mass exponent function, which can be estimated by regressing $\ln \chi_{xy} (\,p,q,s)$ against $\ln s$ in the scaling range for a given pair $(\,p,q)$ .

Analogous to the double Legendre transform in the joint multifractal analysis based on the partition function approach MF-X-PF$(\,p,q)$ [418], we define the joint singularity strength function hx and hy

Equation (209)

Equation (210)

and the multifractal spectrum $D_{xy}(h_x, h_y)$

Equation (211)

When p   =  0 and q  =  0, the joint partition function in equation (207) is equal to the number of wavelet coefficients, or equivalently the total number of data points of the original series. In other words, we have

Equation (212)

which results in

Equation (213)

In contrast, in the MF-X-PF$(\,p, q)$ method, $\tau_{xy} (0,0)=-1$ . Therefore,

Equation (214)

It follows immediately that hx(p , q), hy(p , q) and $D_{xy}(h_x, h_y)$ obtained from the MF-X-WT$(\,p,q)$ method are different from the joint singularity strengths $\alpha_x(\,p,q)$ , $\alpha_y(\,p,q)$ and the joint multifractal spectrum $f_{xy}(\alpha_x, \alpha_y)$ obtained from the MF-X-PF$(\,p,q)$ method.

Analogous to the partition function approach [197, 198], we can directly estimate the joint singularity strength hx and hy and the joint multifractal spectrum Dxy(p , q) using

Equation (215)

Equation (216)

Equation (217)

where

Thus we can directly determine the joint singularity strength functions hx(p , q) and hy(p , q) and the joint multifractal function Dxy(p , q) from equations (215)–(217) through linear regressions.

Extensive numerical experiments on two multifractal binomial measures unveil connections between the theoretical joint multifractal formulas of MF-X-PF$(\,p,q)$ derived in [418] and the empirical joint multifractal features of MF-X-WT$(\,p,q)$ [439]:

Equation (218)

Equation (219)

Equation (220)

Equation (221)

where $\tau_{xy}(\,p,q)$ , $\alpha_x(\,p,q)$ , $\alpha_y(\,p,q)$ and $f_{xy}(\alpha_x, \alpha_y)$ are given in equations (171), (174), (175) and (180).

Although the MF-X-WT$(\,p,q)$ method is able to successfully identify the monofractal or multifractal cross correlations in time series, the results show non-negligible deviations from the theoretical results [439]. Nevertheless, empirical applications to financial returns and financial volatilities uncover evident cross-multifractality [439].

3.3.3. MF-X-WTMM.

The implementation of a joint multifractal analysis based on WTMM is not straightforward. We can conveniently determine the two sets of modulus maxima lines ${\mathcal{L}}_x$ and ${\mathcal{L}}_y$ for $X(i)$ and $Y(i)$ . However, there is no one-to-one correspondence between ${\mathcal{L}}_x$ and ${\mathcal{L}}_y$ because there are different numbers of modulus maxima for the two time series at a given scale. Lin and Sharif found a nice solution to overcome this difficulty by properly pairing up the modulus maxima lines in ${\mathcal{L}}_x$ and ${\mathcal{L}}_y$ [440, 441]. Two modulus maxima lines in ${\mathcal{L}}_x$ and ${\mathcal{L}}_y$ are paired up if they are close to each other in the time coordinate. The joint partition function can thus be defined as [440]

Equation (222)

which scales as a power law with respect to the scale s:

Equation (223)

Note that we use p  and q instead of p /2 and q/2 following [440]. In this case, the Legendre transform is expressed as [440]

Equation (224a)

and

Equation (224b)

where $p=\partial{f}/\partial{\alpha_x}$ and $q=\partial{f}/\partial{\alpha_y}$ .

The validity of the MF-X-WTMM method is confirmed by numerical experiments on coupled random binomial cascades [440]. Coupled random binomial cascades are characterized by a coupling strength parameter $g\in[0,1]$ , which g  =  0 and g  =  1 stand respectively for uncoupled and fully coupled time series [305]. Their numerical result that the $f(\alpha_x, \alpha_y)$ spectrum collapses into a one-dimensional curve in the fully coupled case is in excellent agreement with the analytical result of equation (176) [418].

3.3.4. MF-X-WL.

Due to the inherent drawbacks associated with the multifractal wavelet transform analysis, the MF-X-WT$(\,p,q)$ is not recommended for multifractal cross-correlation analysis. In addition to the MF-X-WTMM method, a suitable alternative candidate is the wavelet leaders [295300]. Jiang et al generalized the MF-WL method to the joint multifractal analysis based on wavelet leaders with two moment orders p  and q, termed MF-X-WL$(\,p, q)$ [442].

One first obtains the wavelet leaders $ \newcommand{\e}{{\rm e}} \ell_x(\,j,k)$ and $ \newcommand{\e}{{\rm e}} \ell_y(\,j,k)$ of $\{X_i\}_1^N$ and $\{Y_i\}_1^N$ at different scales s  =  2j, and then defines the joint moment with orders p  and q as [442]

Equation (225)

where nj is the number of wavelet leaders at scale s  =  2j. When X  =  Y and p   =  q, we recover the traditional multifractal formalism based on wavelet leaders presented in section 2.3.3. Further scaling analyses follow the conventional way as other similar methods.

Through numerical experiments, it is found that the MF-X-WL$(\,p,q)$ method can successfully detect joint multifractality in binomial measures and monofractality in bivariate fractional Brownian motions [442]. The method has also been used to investigate the joint multifractality between the daily returns of the DJIA and NASDAQ indices, and between their daily volatilities as well. The MF-X-WL$(\,p,q)$ method works for both positive and negative $(\,p,q)$ values. However, the method seemingly cannot provide very accurate results.

3.4. Multifractal detrended cross-correlation analysis (MF-DCCA)

Inspired by the seminal work of Podobnik and Stanley on the detrended cross-correlation analysis [179], Zhou proposed the multifractal detrended cross-correlation analysis, which is a combination of MF-DFA and DCCA [180]. The method was initially termed MF-DXA [180] and then MF-X-DFA so that one can distinguish it from the MF-X-DMA method which is a combination of MF-DMA and DCCA [144].

Consider two time series $\{X(i)\}$ and $\{Y(i)\}$ of the same length N, where $i=1, 2, \cdots, N$ . Each time series is covered with $N_s={{\rm int}}[N/s]$ non-overlapping boxes of size s. The two segments of time series within the $v$ th box $[l_v+1,l_v+s]$ are denoted by $X_v(k)$ and $Y_v(k)$ with $k=1,\cdots,s$ , where $l_v=(v-1)s$ . Assume that the local trend functions of $\{X_v(k)\}_{k=1}^s$ and $\{Y_v(k)\}_{k=1}^s$ are $\{\widetilde{X}_v(k)\}$ and $\{\widetilde{Y}_v(k)\}$ , respectively. The cross-correlation for each box is calculated as

Equation (226)

The qth order cross-correlation is calculated as

Equation (227)

when $q\neq0$ and

Equation (228)

We then expect the following scaling relation

Equation (229)

There are many different methods for the determination of $\widetilde{X}_v$ and $\widetilde{Y}_v$ . The local trend functions could be polynomials [307, 443], which recovers the MF-X-DFA method [180]. The local trend functions could also be moving averages [313, 314], in which case the algorithm is called MF-X-DMA [144]. We can certainly use other detrending approaches as described in section 2.4.4. When X  =  Y, MF-X-DFA reduces to MF-DFA and MF-X-DMA reduces to MF-DMA. The DCCA method based on DMA was proposed by He and Chen [444], which is a special case of the MF-X-DMA method with $ \newcommand{\e}{{\rm e}} q\equiv 2$ .

For dependent multifractal binomial measure pairs [361] and dependent multifractal random walk pairs [445], it is found numerically that [180]

Equation (230)

Further, He and Chen gave an approximate derivation of the following inequality [446]:

Equation (231)

However, both relations (230) and (231) deviate from many empirical results [180, 446448].

For the monofractal case with q  =  2 such as two coupled autoregressive fractional moving average (ARFIMA) signals with identical error terms, it is shown with certain approximations that [179, 449, 450]

Equation (232)

where Hx(2) and Hy(2) are the Hurst exponents of time series {Xi} and {Yi}. Kristoufek derived equation (232) for two ARFIMA processes regardless of the correlation between error terms, as long as it remains non-zero and for an ARFIMA process and an MA process [451]. Presenting the memory effects as power spectrum in the frequency domain, Kristoufek showed that the bivariate Hurst exponent cannot be larger than the average of the separate Hurst exponents [452]:

Equation (233)

which holds for several stochastic processes [451458]. The difference can be qualified by the DCCA coefficient differentiation [459].

There are also variants of MF-X-DFA. The time-delay MF-X-DFA method includes a time delay in pairing two time series such that the detrended fluctuation functions in segments read [460]

Equation (234)

When $\delta=0$ , we recover the original MF-X-DFA method. Shi et al generalized the multiscale multifractal analysis [363] to multiscale multifractal detrended cross-correlation analysis, called MSMF-DXA [461]. This method is applied to the pairs of daily returns of six stock market indices (DJIA, NASDAQ, S&P500, SSCI, SZCI and HSI) [461], return pairs (DJIA versus NYSE and DJIA versus HSI) in different time periods [462], and pairs of opening price, closing price, highest price, lowest price and transaction volume of S&P 500, HSI, SSCI and ASX [463]. Cao and Xu suggested to determine the local trends using the maximum overlap wavelet transform (MODWT) [464]. Cao et al proposed the volatility-constrained multifractal detrended cross-correlation analysis [465]. Numerical investigations using mathematical models on the performance of these methods have not been conducted yet.

The MF-X-DFA method has been generalized to investigate the multifractal cross-correlation behavior among multiple time series, which leads to the coupling detrended fluctuation analysis (CDFA) [466]. For multiple time series Xj(i) with $j=1, 2, \cdots, n$ , the local detrended fluctuation function becomes

Equation (235)

Other methods for joint multifractal analysis can also be extended to multiple time series in a similar way. A similar treatment is to sum the local fluctuations of multiple time series [467]:

Equation (236)

This method is reported to work well for two-component ARFIMA processes and binomial measures.

The CDFA method has been applied to hourly concentrations of four air pollutants (NO2, NOx, total hydrocarbon content THC and O3) at Tehran from 2 February 2006 to 31 July 2007 [466], weekly foreign exchange rates of three currencies (EUR/USD, GBP/USD and JPY/USD) from 14 December 1998 to 31 January 2011 [466], daily warehouse-out quantity records of steel products from three warehouses (YZ, JS and BY) from 1 January 2009 to 1 October 2013 [468], and daily closing price data of four Asian stock markets' indices (SSEC, NIKKEI 225, KOSPI and Indian SENSEX) from 1 July 2003 to 1 March 2015 [469].

3.5. Multifractal cross-correlation analysis (MF-CCA)

Oświęcimka et al argued that most existing methods for cross-correlation analysis often present the serious limitation of reporting spurious multifractal cross-correlations when there are none [148]. To remedy this drawback, they proposed the multifractal cross-correlation analysis (MF-CCA), which incorporates the sign of fluctuations $F_{v}(s)$ into the overall detrended fluctuation Fxy(q, s):

Equation (237)

where $\sum_{v=1}^{N_s} {\mathrm{sign}}\left(F_{v}(s)\right)\neq{N_s}$ in many cases. When Fxy(q, s) is negative for every s, one represents  −Fxy(q, s) with respect to s in the log–log plot [148, 179]. The MF-CCA method is a direct, natural multifractal generalization of the detrended cross-correlation analysis, which is recovered when q  =  2 [179]. Numerical experiments on coupled ARFIMA processes and coupled Markov-switching multifractal (MSM) processes validate the suitability of MF-CCA in extracting joint monofractal or multifractal properties in two time series.

Combining the multifractal temporally weighted detrended fluctuation analysis (MF-TWDFA) [331] and the multifractal cross-correlation analysis (MF-CCA) [148], Wei et al designed the multifractal temporally weighted detrended cross-correlation analysis (MFTWXDFA) to quantify power-law cross-correlations in two time series and confirmed the validity of the MFTWXDFA method with numerical experiments on bivariate fractional Brownian motions, coupled ARFIMA processes, and multifractal binomial measures [470]. The method is applied to confirm the presence of joint multifractality in three pairs of daily stock index returns (S&P 500 and FTSE 100, S&P 500 and NIKKEI 225, and S&P 500 and HSI) from 4 January 2001 to 30 September 2016 [470].

The idea of using signed fluctuation applies to other methods, such as MF-X-DMA and MF-X-SF. Indeed, Wang et al proposed a variant of the MF-X-SF$(q)$ method by taking into account of the information on the innovation signs in the definition of structure functions [149]:

Equation (238)

This definition should be treated with caution because $K'_{xy}(q,s)$ are not defined on all $q\in{\mathbb{R}}$ values and their values could be negative so that the scaling relation (198) does not hold. If this happens, one may argue that there is no cross-correlation scaling between the two time series [149].

3.6. Multifractal detrended partial cross-correlation analysis (MF-DPXA)

The observed long-range power-law cross-correlations between two time series may not be caused by their intrinsic relationship but by a common third driving force or by common external factors [471473]. If the influence of the common external factors on the two time series are additive, we can use partial correlation to measure their intrinsic relationship [474]. To extract the intrinsic long-range power-law cross-correlations between two time series affected by common driving forces, Liu developed the detrended partial cross-correlation analysis (DPXA), which combines the ideas of DCCA and partial correlation, and studied the DPXA exponents of variable cases [475]. The DPXA method has been proposed independently by Yuan et al [476]. Qian et al generalized DPXA to analyze multifractal time series, resulting in the multifractal detrended partial cross-correlation analysis (MF-DPXA) [477].

Assume that two stationary time series $\{x(t):t=1,\cdots, N\}$ and $\{y(t):t=1, \cdots, N\}$ , such as returns, depend on n common external driving factors characterized by a sequence of time series $\{z_i(t):t=1, 2, \cdots, N\}$ with $i=1, \cdots, n$ . Each time series is partitioned into $N_s={{\rm int}}[N/s]$ non-overlapping windows of size s. For each window, say the $v$ th window $[l_v+1,l_v+s]$ where $l_v=(v-1)s$ , we calibrate respectively two linear regression models for ${\mathbf{x}}_v$ and ${\mathbf{y}}_v$ ,

Equation (239)

where ${\mathbf{x}}_v=[x_{l_v+1},\ldots,x_{l_v+s}]^{\mathrm{T}}$ , ${\mathbf{y}}_v=[y_{l_v+1},\ldots,y_{l_v+s}]^{\mathrm{T}}$ , ${\mathbf{r}}_{x,v}$ and ${\mathbf{r}}_{y,v}$ are the vectors of the residuals, and

Equation (240)

is the matrix of the n external forces in the $v$ th window, where $\mathbf{x}^{{\rm T}}$ is the transform of $\mathbf{x}$ . We obtain the estimates $\mathbf{\hat{\beta}}_{x,v}$ and $\mathbf{\hat{\beta}}_{y,v}$ of the n-dimensional parameter vectors $\mathbf{\beta}_{x,v}$ and $\mathbf{\beta}_{y,v}$ and the sequence of residuals

Equation (241)

We further obtain the residual profiles, that is,

Equation (242)

where $k=1,\cdots,s$ .

Assuming that $\widetilde{R}_{x,v}$ and $\widetilde{R}_{y,v}$ are respectively the local trend functions of $R_{x,v}$ and $R_{x,v}$ , the detrended partial cross-correlation in each window can be computed as

Equation (243)

and the qth order detrended partial cross-correlation is

Equation (244)

when $q\neq0$ and

Equation (245)

when q  =  0. We then expect the scaling relation

Equation (246)

According to the standard multifractal formalism, the multifractal mass exponents $\tau(q)$ can be used to characterize the multifractal properties as

Equation (247)

where Df  =  1 is the fractal dimension of the geometric support of the time series [178]. If $\tau(q)$ is a nonlinear function of q, the partial cross-correlations of the time series are multifractal. According to the Legendre transform, we obtain the singularity strength function $\alpha(q)$ and the multifractal spectrum $f(\alpha)$ as [171]

Equation (248)

The MF-DPXA method is an extension to MF-DCCA methods. Different choices of the local trends result in different variants. When MF-DPXA is implemented with DFA or DMA, we obtain MF-PX-DFA or MF-PX-DMA. Using two binomial measures contaminated by Gaussian noise with very low signal-to-noise ratio, numerical experiments show that MF-DPXA is significantly advantageous over MF-DCCA [477]. It is also easy to extend the DPXA idea to other methods of joint multifractal analysis. In this vein, the performance of different MF-DPXA methods should be investigated and compared.

4. Multifractal models

4.1. Multiplicative cascade models

4.1.1. Deterministic multinomial models.

Multiplicative cascade models generate multinomial measures that are constructed in a recursive manner [162]. Multiplicative cascades exist in financial markets since there is evidence of a causal information cascade from large scales to fine scales [478, 479], which is quite similar to turbulence. It is from this insight that the MRW and many other models emerged. The multiplicative cascade process starts with an interval $[0,1]$ on with the measure is uniformly distributed. Without loss of generality, assume that the total measure on the interval is 1. At the first iteration, the interval is divided into b segments whose lengths are $\{s_i:i=1,\cdots,b\}$ and the measure is redistributed on each segment such that the measure on the ith segment is mi. At the second iteration, each segment is divided into b smaller segments and the measure on it is redistributed in the same way. This procedure goes to infinity, resulting in a multiscale multinomial measure. The mass exponent function can be solved from the following equation [171]:

Equation (249)

In this iterative process, the daughter segments are not necessary to cover the mother segment. Therefore, we have

Equation (250)

It can be proven that

Equation (251)

where the equality '=' holds if and only if all $\ln{m_i}/\ln{s_i}$ terms are identical. In this situation, the measure is uniformly distributed on the support and is monofractal such that $\tau(q)$ is a linear function of q. According to equation (25), we have $ \newcommand{\e}{{\rm e}} D_q \equiv D_0 = D_f$ , indicating that Dq is independent of q. Since $\alpha(q)=\tau'(q)$ , we have $\alpha'(q)= \tau''(q)= 0$ , suggesting that the singularity width $\alpha_{\max}-\alpha_{\min}=0$ .

It is easy to verify that, when q  =  0 and q  =  1,

Equation (252)

are respectively solutions to equation (249). Because $\tau(q)$ is a monotonically increasing function of q for multifractals, these solutions are the unique solutions. For the Dq function, we have

Equation (253)

where the equality holds if and only if all $\ln{m_i}/\ln{s_i}$ terms are identical. In addition, the limits $D_{+\infty}$ and $D_{- \infty}$ exist when $q\to\pm\infty$ , which read

Equation (254a)

and

Equation (254b)

For the $\alpha(q)$ function, we have

Equation (255)

where the equality holds if and only if all $\ln{m_i}/\ln{s_i}$ terms are identical. In addition, the limits $\alpha(+\infty)$ and $\alpha(- \infty)$ exist when $q\to\pm\infty$ , which read [162]

Equation (256a)

and

Equation (256b)

Note that the $\alpha(q)$ function has a very similar shape to the Dq function. For the $f(\alpha(q))$ function, the relationship ${{\rm d}}f(\alpha)/{{\rm d}}\alpha=q$ still holds and $f_{\max} = f(q=0) =1$ . Moreover, we have

Equation (257)

where the equality in the second formula holds if and only if all $\ln{m_i}/\ln{s_i}$ terms are identical. The proofs of these properties can be found in [82] and references therein. The multifractal measures discussed here are conservative. Non-conservative multifractal measures exhibit different properties [480].

When si  =  1/b for all i's, we have

Equation (258)

The generalized dimension function Dq and the generalized Hurst exponent function $H(q)$ can be explicitly expressed due to equations (12) and (13). According to equation (11a), we have

Equation (259)

The multifractal spectrum $f(\alpha)$ can also be explicitly expressed from the parameter q using equation (11b).

4.1.2. The p -model.

The most widely used multifractal model in econophysics is the p -model that generates multifractal binomial measures in a recursive way [361], It is also frequently used for testing the performance of different multifractal methods. The p -model is a special case of the deterministic multiplicative cascade model with b  =  2 and $s_1=s_2=1/2$ . To construct the binomial measure, one starts with the zeroth iteration k  =  0, where the data set $z(i)$ consists of one value, z(0)(1)  =  1. In the kth iteration, the data set $\{z^{(k)}(i): i = 1, 2, \cdots, 2^k\}$ is obtained from

Equation (260)

for $i = 1, 2, \cdots, 2^{k-1}$ . An early construction of binomial measures can be attributed to de Wijs in 1951 [163, 481, 482].

When $k\to\infty$ , z(k)(i) approaches a binomial measure. According to equation (249), we have

Equation (261)

The analytical expressions of Dq and $H(q)$ can be obtained using equations (12) and (13) respectively. According to the Legendre transform expressed in equations (11a) and (11b), we have

Equation (262)

and

Equation (263)

Equation (263) was derived by Calvet et al [483].

We now perform a multifractal analysis of a binomial measure with m1  =  0.3 and length N  =  220. Figure 8(a) presents the first 1000 data points of the measure. The partition function approach and the direct determination approach are utilized and compared. We first work on the partition function approach. Figure 8(b) illustrates the power-law dependence of $\chi(q,s)$ on scale s for different q values from  −10 to 10. All the curves for different q values have nice power-law scaling in a broad scaling range. The slopes obtained by linear regressions of $\ln\chi(q,s)$ against $\ln{s}$ are estimates of $\tau(q)$ , which are shown in figure 8(e). We obtain Dq using equation (25), $\alpha(q)$ , $f(q)$ and $f(\alpha)$ using the Legendre transform, which are presented in figures 8(f)(i). Based on the direct determination approach, figure 8(c) plots the dependence of $\sum_i \mu(q,s,i) \ln{[m(s,i)]}$ against s and figure 8(d) the dependence of $\sum_i\mu(q,s,i)\ln[\mu(q,s,i)]$ against s in linear-log coordinates, where $m(s,i)$ is the total measure in the ith box of size s and $\mu(q,s,i)$ is the corresponding canonical measure. We also observe nice linearity. The slopes of the linear fits in figures 8(c) and (d) are estimates of $\alpha(q)$ and $f(q)$ respectively, which are also shown in figures 8(g) and (h). We obtain $\tau(q)$ by the Legendre transform and Dq according to equation (25), which are depicted in figures 8(e) and (f), respectively. In figure 8(e), we also mark $\tau(0)=-1$ and $\tau(1)=0$ , while in figure 8(i), we show that $f'(\alpha)|_{q=0}=0$ and $f'(\alpha)|_{q=1}=1$ . In figures 8(e)(i), we also show the corresponding analytical functions. It is evident that both the classical partition function approach and the direct determination approach unveil perfectly the multifractal nature of the binomial measure with very high accuracy. Note that the choice of s values crucially impacts the results, because of the log-periodic oscillations in deterministic binomial measures [312], which will be discussed in section 5.

Figure 8. Refer to the following caption and surrounding text.

Figure 8. Multifractal analysis of a deterministic binomial measure with m1  =  0.30. (a) First 1000 data points of the binomial measure. (b) Power-law dependence of $\chi(q,s)$ on box size s for different q values. (c) Linear dependence of $\sum_i \mu(q,s,i) \ln{[m(s,i)]}$ against $\ln{s}$ . (d) Linear dependence of $\sum_i\mu(q,s,i)\ln[\mu(q,s,i)]$ against $\ln{s}$ . (e) The mass exponent function $\tau(q)$ . (f) The generalized dimensions Dq. (g) The singularity strength function $\alpha(q)$ . (h) The $f(q)$ function. (i) The multifractal singularity spectrum $f(\alpha)$ . In plots ((e)–(i)), the results presented with two types of markers are obtained from the classical partition function approach and the direct determination approach, in which we also show the corresponding analytic functions expressed in equations (261)–(263).

Standard image High-resolution image

4.1.3. Stochastic multinomial models: discrete multiplier distribution.

We can introduce randomness into multiplicative processes to construct stochastic multinomial models [162, 165, 166, 484]. We focus on the uni-scale case that a segment is divided into b sub-segments of identical length. At the first step, a unity segment of measure $\mu_0=1$ is divided into b non-overlapping subsegments. With probability $p'_i$ , measures $m'_{i,1},\cdots,m'_{i,b}$ are assigned to the subsegments. There are k probabilities such that the construction matrix is expressed as

Equation (264)

At the second step, each of the b sub-segments is divided into b segments of length 1/b2. For the ith sub-segment, with probability $p_j'$ , the measures are $m'_{i,1}m'_{j,1},\cdots,m'_{i,b}m'_{j,b}$ . This process goes to infinity, resulting in stochastic multinomial measures.

Let $m_{(i-1)b+j}=m'_{ij}$ , $p_{(i-1)b+j}=p'_i$ and n  =  kb, we rewrite equation (264) as

Equation (265)

Note that $\sum_{i=1}^{n}p_i=b$ . The mass exponent function $\tau(q)$ is the solution of the following equation:

Equation (266)

It follows immediately that

Equation (267)

The generalized dimension function Dq and the generalized Hurst exponent function $H(q)$ can be explicitly expressed from equations (12) and (13). According to equation (11a), we have

Equation (268)

Hence, all the multifractal quantities have analytical expressions.

In figure 9, we illustrate an example of a stochastic binomial measure, together with its multifractal properties. An interesting feature is the presence of negative dimensions $f(\alpha)<0$ . We will come back to discuss this issue later in section 5.2.3.

Figure 9. Refer to the following caption and surrounding text.

Figure 9. Multifractal properties of time series generated from a stochastic binomial model in which $m'_{1,1}=0.3$ with $p'_1=0.4$ and $m'_{2,1}=0.8$ with $p'_2=0.6$ in equation (264). (a) A realization of the stochastic binomial measure. (b) The mass exponent function $\tau(q)$ . (c) The multifractal singularity spectrum $f(\alpha)$ .

Standard image High-resolution image

4.1.4. Stochastic multinomial models: continuous multiplier distribution.

When the multipliers have continuous distributions, we obtain a large class of stochastic multiplicative cascade models [162]. The well-known lognormal model originated in fluid mechanics [172, 485]. For lognormal binomial measures [143, 483, 486], the multipliers follow the lognormal distribution in which the logarithmic multipliers have mean $\lambda$ and variance $\sigma^2$ . The mass exponent function is

Equation (269)

and the singularity spectrum is

Equation (270)

For log-Poisson binomial measures [143, 483, 486], the multipliers follows the Poisson distribution

Equation (271)

and the singularity spectrum is

Equation (272)

For log-Gamma binomial measures [143, 483, 486], the multipliers follows the Gamma distribution

Equation (273)

where $\beta>0$ and $\gamma>0$ , and the singularity spectrum is

Equation (274)

Similar random cascading processes such as the $\mathcal{W}$ -cascades on the dyadic trees of their orthogonal wavelet coefficients can also generate stochastic multifractal time series with explicit expressions of $\tau(q)$ and $f(\alpha)$ [360]. For lognormal $\mathcal{W}$ -cascades [360], we have

Equation (275)

and

Equation (276)

where $\mu$ and $\sigma^2$ are respectively the mean and the variance of $\ln|W|$ . The minimum and maximum singularities are obtained when $f(\alpha)=0$ :

Equation (277)

For log-Poisson $\mathcal{W}$ -cascades [360], we have

Equation (278)

and

Equation (279)

where the law of $\ln|W|$ is the same as $P\ln\delta+\gamma$ , and $\lambda$ is the mean and the variance of the Poisson variable P. The differences of the multifractal expressions between the $\mathcal{W}$ -cascades and the conventional cascade models stem from the different expressions of the multiplier distributions.

The discussions about stochastic continuous multifractals with some other multiplier distributions can be found in [82, 487, 488], in which anomalous multifractal behavior emerges.

4.2. Multifractal model of asset returns (MMAR)

4.2.1. Basic MMAR model.

Mandelbrot et al proposed a seminal multifractal model of asset returns (MMAR) [217], which is also termed Brownian motions in multifractal time (BMMT) [489]. In the MMAR model, the logarithmic price is assumed to be a subordinate process of a Brownian motion and a multifractal process of trading time. The MMAR model contains three assumptions:

Assumption 1. $X (t)$ is a subordinate process

Equation (280)

where t is the clock time, $B(t)$ is a fractional Brownian motion with self-affine index H and $\theta(t)$ is a stochastic non-decreasing trading time.

Assumption 2. The trading time (or 'subordinator') $\theta(t)$ is the cumulative distribution function of a multifractal measure $\mu$ defined on $[0, T]$ such that it is a multifractal process with continuous, non-decreasing paths, and stationary increments.

Assumption 3. The processes $B(t)$ and $\theta(t)$ are independent.

The trading time $\theta(t)$ plays a crucial role in the MMAR model, which presumably causes the price $X(t)$ to be multifractal. It is shown that, under these three assumptions, the process $X(t)$ is multifractal with stationary increments and scaling function [217].

Equation (281)

When $\theta(t)$ is a canonical multifractal, the increments of $X(t)$ have fat tails [217, 490]. When H  =  1/2, the price process displays both uncorrelated increments and long memory in volatility [217]. Therefore, the MMAR model with H  =  1/2 is able to reproduce the main stylized facts of financial returns [94]. A natural choice of the canonical multifractal measure $\mu$ in assumption 2 is from the random multiplicative cascade models [143, 162, 217, 483, 486].

Recent empirical results show that the intertrade durations do have multifractal nature [491495], which provides strong supports for assumption 2. Calvet et al developed in detail the theory of multiplicative multifractals that is closely relevant to the $\theta(t)$ process [483]. Fisher et al unveiled the presence of multifractal scaling laws in the DEM/USD (bid/ask quotes from 31 October 1992 to 1 September 1993 and daily data from June 1973 to December 1996) and JPY/USD exchange rates data [496]. They found that the simulated multifractal data from the MMAR generating process is consistent with the DEM/USD data, but do not appear completely consistent with the JPY/USD data [496]. Calvet and Fisher confirmed the presence of multifractality and the consistency with the MMAR model in the DEM/USD data, and in the daily data from 1962 to 1998 of the value-weighted NYSE-AMEX-NASDAQ index ('CRSP Index'), Archer Daniels Midland (ADM), General Motors (GM), Lockheed-Martin, Motorola, and United Airlines (UAL) as well [143].

Günay investigated the comparative performance of the lognormal MMAR model with respect to four benchmark econometric models (GARCH, EGARCH, FIGARCH and MRS-GARCH) using daily stock index returns of four emerging markets (Croatia, Greece, Poland and Turkey) from 4 January 2000 to 3 July 2014 [497]. He estimated the parameters of the five models and simulated each model with 1000 runs. He found that the deviation of the average mass scaling exponents of the lognormal MMAR model from those of the empirical results is the smallest.

Using the daily returns of 12 foreign exchange rates against the USD from January 1993 to February 2012 as an example, Goddard and Onali proposed a joint test for long memory and multifractality in the framework of the lognormal MMAR model [498].

Batten et al modified the MMAR model by representing the trading time in assumption 2 with the series of volume ticks [499], that is, the trading time $\theta(t)$ is given by the normalized volume ticks $\overline{V}_t$ :

Equation (282)

where $V_t\geqslant 0$ denotes the number of volume ticks at time t. The rational of this model is based on the fact that trading volumes possess multifractal nature that is universal in different markets [222, 500504]. They studied the 5 min EUR/USD spot quotes and trading ticks from 5 January 2006 to 31 December 2007. They found that the model can produce admissible VaR forecasts at the 12 h forecast horizon and outperform the historical simulation approach and the benchmark GARCH(1,1) location-scale VaR model at the daily horizon.

A caveat about the construction of some versions (including the initial one [217]) of the MMAR model should be stressed. Assumption 2 for the subordinator $\theta(t)$ defines it in general without paying attention to the condition of causality. In general, a multifractal measure is a geometrical construction that exists over all times, without a specific definition or dependence on the arrow of time. In other words, the multifractal measure pre-exists time. The fact that the subordinator $\theta(t)$ can mimic a time flow results from the construction consisting in taking the cumulative distribution function of this multifractal measure. However, the measure at a given instant already 'knows' its future values in the constructions that are for instance purely geometric without attention to causality. While these classes of MMAR models may appear good candidates to describe financial price time series, they should be only considered as toys, with no serious relevance to real markets. Indeed, the arguably most important of all properties that models of financial markets should obey is that of causality. Otherwise, spurious conclusions will derive in portfolio allocation and optimisation, in derivative pricing, investment strategies and so on.

4.2.2. Poisson MMAR model.

In assumption 2, when the trading time $\theta(t)$ is the c.d.f. of a Poisson multifractal measure $\mu$ defined on $[0, T]$ , we obtain the Poisson MMAR model or PMM for short [505]. The PMM improves the MMAR in the sense that the resulting measure is grid-free since its partitioned sub-intervals do not have identical length bk and it can be viewed as a stochastic volatility model with a Markov latent vector [505]. The PPM leads to the Markov-switching multifractal models for volatility.

4.3. Markov-switching multifractal (MSM) models

4.3.1. The framework of MSM models.

The Markov-switching multifractal (MSM) model designed by Calvet and Fisher is a discrete-time Markov process with multi-frequency stochastic volatility [505, 506]. The return process is specified as

Equation (283)

where the random variables $ \newcommand{\e}{{\rm e}} \epsilon_i$ are i.i.d. standard Gaussians ${\mathcal{N}}(0,1)$ and $\sigma_i$ is a stochastic volatility with $\bar{k}$ volatility components $M_{1,i}, M_{2,i}, \cdots, M_{\bar{k},i}$ decaying at heterogeneous frequencies $\gamma_1, \gamma_2, \cdots, \gamma_{\bar{k}}$ such that

Equation (284)

where $\sigma$ is the unconditional standard deviation of the return ri under the assumption that the multipliers M1,i, M2,i, ⋯, $M_{\bar{k},i}$ are statistically independent. The random volatility components Mk,i are random multipliers that are persistent, non-negative and canonical such that

Equation (285)

for all i's. It is further assumed for simplicity that the multipliers $M_{1,i}, M_{2,i}, \cdots, M_{\bar{k},i}$ at a given time i are statistically independent.

In the MSM model, one stacks the volatility components in period i into a $1\times\bar{k}$ vector

Equation (286)

The vector ${\overrightarrow{M}}_i$ is assumed to be a first-order Markov process and thus defines the volatility states. When the state vector ${\overrightarrow{M}}_{i-1}$ is determined in period i  −  1, the i period multiplier Mk,i for each $k\in\{1,2,\cdots,\bar{k}\}$ is either drawn from a fixes distribution $P(M)$ with probability $\gamma_k$ or remains unchanged at its current state:

Equation (287)

where mk,i is drawn from $P(M)$ . The MSM model allows for a wide variety of specifications for the distribution $P(M)$ and any discrete or continuous distribution with ${\mathbb{E}}(M)=1$ defined on $M\in(0,\infty)$ could be utilized.

Introducing two parameters $\gamma_{\bar{k}}\in(0,1)$ and $b\in(1,\infty)$ , the transition probabilities are specified as

Equation (288)

or equivalently

Equation (289)

Obviously, $\gamma_k$ is a monotonically increasing function of k so that Mk,i with small k values correspond to low-frequency components and Mk,i with large k values correspond to high-frequency components. For small values of $\gamma_1$ and k, a Taylor polynomial approximation of equation (289) gives

Equation (290)

Hence, the transition probabilities of low-frequency components grow approximately at geometric rate b and the growth rate of the transition probabilities of high-frequency components slows down.

The MSM model can produce the main stylized facts of financial asset returns. It contains multifrequency volatility and the price process is a multifractal jump-diffusion process [507]. The changes of low-frequency multipliers induce discontinuous volatility and long memory in volatility, high-frequency multipliers produce substantial outliers and hence fat tails in returns, and the hierarchical cascade of the multiplicative process ensures the emergence of multifractality in returns. The multifractal spectrum of the price process is fully determined by the probability distribution $P(M)$ of the multipliers [143].

4.3.2. Binomial MSM model.

Calvet and Fisher proposed a parsimonious setup in which the probability distribution $P(M)$ is a binomial distribution with the random variables taking values $m_0\in[1,2]$ or $2-m_0\in[0,1]$ with equal probability, which is a binomial MSM model [506]. For the binomial MSM model and other MSM models with discrete multiplier distributions, the model parameters ($\sigma$ , b, $\gamma_{\bar{k}}$ and parameters in $P(M)$ ) can be estimated via the maximum likelihood estimation (MLE) method, because the close-form likelihood can be expressed and the number of frequencies $\bar{k}$ can be determined using the Akaike information criterion (AIC) or the Bayesian information criterion (BIC) by comparing the likelihoods of different MSM($\bar{k}$ ) models that have different numbers of free parameters [506].

Using the daily exchange rates of the Deutsche Mark (DEM), Japanese Yen (JPY), British Pound (GBP), and Canadian dollar (CAD) against the US Dollar for about 30 years, Calvet and Fisher estimated binomial MSM models with four parameters and more than a thousand states. They found that the MSM model outperforms benchmark econometric models (GARCH, Markov-switching GARCH and FIGARCH) both in-sample and out-of-sample and considerable gains in forecasting accuracy are obtained at horizons of 10 to 50 d [506]. Using daily data of two stock indices (the Dow Jones Composite 65 Average Index and NIKKEI 225 Average Index) from January 1969 to October 2004, two foreign exchange rates (GBP and AUD against USD) from March 1973 to February 2004, and two US treasury bond rates (TB-1y and TB-2y) from June 1976 to October 2004, it is found that the estimated model captures the temporal dependence of the data, through estimating and comparing the generalized Hurst exponents $H(q=1)$ and $H(q=2)$ for both empirical data and simulated data of the binomial MSM model [508].

4.3.3. Lognormal MSM model.

Lux studied the lognormal MSM model [509], in which multipliers are determined by random draws from a lognormal distribution with parameters $\lambda$ and S:

Equation (291)

According to the canonical condition (285), we have

Equation (292)

Hence, the lognormal SMS model has four parameters. Lux proposed a generalized method of moments (GMM) estimator together with linear forecasts for model calibration, which can be applied to any continuous distribution with any number of volatility components. His numerical experiments show that the GMM estimator performs well for the binomial and lognormal models.

The mass exponent function $\tau(q)$ defined through structure functions of the lognormal MSM model can be explicitly expressed as [263]

Equation (293)

The parameter $\lambda$ can be determined by fitting equation (293) to the empirically obtained $\tau(q)$ curve. Based on the DAX data, Eisler and Kertész showed that the MF-DFA method provides a nice estimate of $\lambda$ , which is only slightly worse than the GMM estimate [263]. It follows from the Legendre transform that (see [510])

Equation (294)

and

Equation (295)

We see that $\alpha(q)$ is a linear function of q without limits. When $q\geqslant\lambda/(\lambda-1)$ , we have $\alpha\leqslant 0$ .

Empirical analysis of five exchange rates (DEM, GBP, CND, JPY and CHF against USD) from 1 January 1979 to 31 December 2004 shows that the lognormal MSM model performs comparably to the binomial MSM model [509]. Using 1 min high frequency data of the Polish WIG20 index sampled over the period 1999–2004, Oświęcimka et al found that the lognormal MSM model is able to reproduce some main stylized facts including the fat-tailed distribution, the absence of correlations and multifractality in the return time series [510]. Based on the same data sets as in [508], it is found that the lognormal MSM model performs almost identically to the binomial MSM model in reproducing the generalized Hurst exponents $H(1)$ and $H(2)$ of logarithmic prices, indicating that the parsimonious discrete specification of the binomial MSM model is flexible enough for empirical analysis [511]. Similar results are obtained for absolute returns and squared returns for small q values, but not for large q values [512]. The determination of the generalized Hurst exponents $H(q)$ in these works is based on the structure function approach. The situation might be improved if one uses other multifractal analyses such as MF-DFA [263].

4.3.4. Other extensions.

The time series generated from the lognormal MSM model has a vanishing leverage autocorrelation [263], which is inconsistent with the well-known leverage effect in financial time series [94]. Using the idea in [513], Eisler and Kertész extended the MSM model by introducing leverage autocorrelations [263]:

Equation (296)

where $K(i)$ is a kernel function, which is qualitatively the leverage autocorrelation function in an appropriate limit [263, 513]. Obviously, this extension can be applied to other stochastic volatility models. Alternatively, incorporating the MSM volatility into the mean-reversion term in the square-root volatility process of the Heston model [514], the leverage effect emerges naturally due to the correlation between the Gaussian innovations of the volatility and the Gaussian innovations of the stock price in the Heston model [515]. Note that a large class of stochastic volatility processes can produce the leverage effect [262, 516].

The univariate MSM model can be extended to bivariate and multivariate MSM models to investigate volatility comovement between two or many financial assets [517]. For multivariate MSM models with discrete multiplier distributions, it is possible to derive the closed-form likelihoods, and parameter estimation can be conducted by maximum likelihood for state spaces of moderate size and by simulated likelihood through a particle filter in high-dimensional cases [517]. The GMM estimation approach proposed in [509] can also be designed for multivariate MSM models with discrete or continuous multiplier distributions [518]. This model is homogeneous, which can be improved by allowing correlations between volatility components to form a non-homogeneous bivariate MSM model to gain better risk prediction power [519].

Other extensions include the Copula-MSM model for characterizing spot and weekly futures price dynamics and more generally other pairs of financial assets [520], the MSM-t model in which the innovations $ \newcommand{\e}{{\rm e}} \epsilon_i$ follow the Student distribution [521], the MSM-skewed t model in which the innovations follow a skewed Student distribution [522], the level-MSM model for interest rates which incorporates the well-known level effect observed in interest rates [523], the modified Hull–White model of interest rates in which the volatility of the short term rate is driven by an MSM model [524], the cyclic MSM model for insurance claims arising from climatic events in which the term $ \newcommand{\e}{{\rm e}} \epsilon_i$ is set to the smoothed average of claims observed during the corresponding month of the year [525], the realized volatility lognormal MSM (RV-LMSM) model in which the non-RV $\sigma_i$ in equation (284) is replaced by the realized volatility [526], and the MSM duration (MSMD) model for the analysis of intertrade durations in financial markets [527529].

Overall, different specifications of MSM-type models have comparative performance and on average outperform benchmark econometric models in capturing the multifractal behavior of financial variables, in forecasting financial volatility, and in measuring market risks.

4.4. Multifractal random walk (MRW)

4.4.1. The original model.

Bacry et al introduced the multifractal random walk (MRW) to model stochastic volatility in which the volatility can be reduced as an exponential of a long memory process [445]. The MRW model was inspired by the seminal idea of causal cascades in financial markets, in which a multivariate multifractal description of a portfolio of assets was also introduced and studied [478, 479]. The MRW model possesses solvable multifractal properties [434, 445, 530], which belongs to the class of lognormal continuous cascade models of asset returns [531533]. The MRW can be extended to the multivariate MRW [434]. Bacry and Muzy further proved the equivalence between the two definitions of MRW and MMAR [531].

Consider a multifractal process $X(t)$ , whose discrete process are sampled at discrete times $k\Delta{t}$ is $X_{\Delta{t}}(t)=X_{\Delta{t}}(i\Delta{t})$ , where $i=1,\cdots,N$ . The increment $\delta_{\Delta{t}}X_{\Delta{t}}(i\Delta{t})$ of the MRW is constructed as

Equation (297)

and the corresponding MRW is

Equation (298)

where $X(0)=0$ and $N=T/\Delta{t}$ . Here, $\varepsilon_{\Delta{t}}$ and $\omega_{\Delta{t}}$ are independent, $\varepsilon_{\Delta{t}}$ is a Gaussian white noise with variance $\sigma^2{\Delta{t}}$ , while $\omega_{\Delta{t}}$ is Gaussian and its covariance is

Equation (299)

where

Equation (300)

Thus the variance is ${{\rm Var}}({\omega_{\Delta{t}}(i)})=\lambda^2\ln(T/\Delta{t})$ . To ensure that the variance of $X_{\Delta{t}}(t)$ converges when $\Delta{t}\to0$ , we have

Equation (301)

It follows that the variance of $X_{\Delta{t}}(t)$ is $\sigma^2t$ , which is independent of $\Delta{t}$ . Because $\omega_{\Delta{t}}$ is Gaussian, this model is the lognormal MRW model and can be extended to other distributions [531, 534].

It has been proved that the scaling exponents of the structure functions of $X_{\Delta{t}}(t)$ is [445]

Equation (302)

when the scale s is less than N. When s  >  N, we have

Equation (303)

Speaking differently, in the latter case, $\omega_{\Delta{t}}$ is a Gaussian white noise so that $\delta_{\Delta{t}}X_{\Delta{t}}(i\Delta{t})$ is also a Gaussian white noise. Hence, $X_{\Delta{t}}(t)$ is a Brownian motion. Ludeña provides approximate confidence intervals of $\zeta(q)$ for empirical data [535]. One can estimate the MRW model using the generalized method of moments [532, 533] or an approximated maximum likelihood method [536].

Bacry et al [445] showed that this construction above has a well-defined and unique time-continuous limit, making the MRW the single quadratic multifractal generalisation of the fractional Brownian process with a bona-fide continuous limit (similar to the continuous limit of a discrete random walk that defines the Wiener process). Sornette et al [537] presented a convenient alternative representation of the MRW in terms of an exponential of a long memory process with a specific asymptotic $1/\sqrt{t}$ decaying kernel, which has been generalised to arbitrary power law kernel [538, 539] and to endogenous self-excited processes [540].

The MRW model is able to capture the multifractal nature of financial returns, such as the daily DJIA returns from 1900 to 2003 [541] and intraday S&P 500 future index over the period of 1988–1999 [532]. It also predicts the power-law tailed distribution of financial returns [532, 542].

4.4.2. Fractional MRW.

If $\varepsilon_{\Delta{t}}$ is a fractional Gaussian noise, the MRW becomes the multifractal fractional random walk (MFRW) and the scaling exponents are [445]

Equation (304)

Mathematical treatments for the MFRW can be found in [543546]. The application of the MFRW model is limited in econophysics since financial returns are usually uncorrelated such that $H\approx 0.5$ . Rypdal and Løvsletten proposed two mean-reverting MRW models in which the anti-correlations are modeled either through a drift term as in an Ornstein–Uhlenbeck process or by using fractional Gaussian noise with H  <  0.5 for the innovations [547]. They applied the models to estimate the multifractal behavior in the 1 h electricity spot prices in Norwegian Kroner market from 4 May 1992 to 27 August 2011.

4.4.3. Skewed MRW.

The MRW is a useful model in finance which is validated with respect to certain properties on the memory of volatility but is not validated for a fully faithful description of the stock market returns [548].

To capture the leverage effect that past price returns and future volatilities are negatively correlated, Pochart and Bouchaud proposed the skewed multifractal random walk (SMRW) [513]:

Equation (305)

where the kernel $K(i,j)$ decays as a power law

Equation (306)

They derived that

Equation (307)

The SMRW model has been applied to understanding the volatility smiles in option pricing [513]. Further mathematical treatments and empirical validation using the daily data of five stock market indices (CAC 40, DAX, FTSE 100, S&P 500 and DJIA) from 3 December 1990 to 15 February 2010 can be found in [549].

4.5. Exponentials of long memory processes

4.5.1. Alternative representation of MRW.

Sornette et al provided an alternative representation of the MRW [537],

Equation (308)

where $\delta_{\Delta{t}}X(t)=X(t)-X(t-\Delta{t})$ is the increment, W1(t) is a Wiener process with unit diffusion coefficient, and $\omega_{\Delta{t}}(t)$ is the log-volatility which obeys an autoregressive equation

Equation (309)

where $\omega^0_{\Delta{t}}$ is a constant, $ \newcommand{\e}{{\rm e}} \eta(t)$ denotes the process of information flow in the form of a standardized Gaussian white noise and the memory kernel $h_{\Delta{t}}(\cdot)$ is a causal function ensuring that the system is not anticipative. Thus $\omega(t)$ represents the response of the market to incoming information $ \newcommand{\e}{{\rm e}} \eta(t)$ up to the time t. At time t, the distribution of $\omega_{\Delta{t}}(t)$ is Gaussian with mean $\omega^0_{\Delta{t}}$ and variance $V_{\Delta{t}} = \int_0 ^\infty{h}^2_{\Delta{t}}(t'){{{\rm d}}}t' = \lambda^2 \ln \left( \frac{Te^{3/2}}{\Delta t} \right)$ . The covariance of $\omega_{\Delta{t}}(t)$ ,

Equation (310)

specifies entirely the random process. When $\Delta{t}$ is sufficiently small, we have

Equation (311)

The long-range dependence and multifractality of the stochastic volatility are caused by the slow power-law decay of the memory kernel in equation (311).

Different from the techniques and evidence of multifractality in the domain of $f(\alpha)$ reported in this review, it can be derived from the above representation of the MRW that there is also a nice temporal signature of multifractality, in the sense that there is a full spectrum of exponent $p(A)$ describing the 1/tp (A) relaxation of the volatility from a peak of amplitude A [537]. Such multifractality has also been found in the multifractal stress activation model of earthquakes [550, 551], which has been confirmed on many catalogs [552, 553], and exogenous shocks and on endogenous multifractal responses in financial markets [58, 59].

4.5.2. Quasi-multifractal model.

Saichev and Sornette generalized the MRW model to a large class of continuous stochastic processes [538]. Similar to equation (308), the increment process is

Equation (312)

where $\omega(t)$ is expressed as

Equation (313)

through a Wiener process $W(t)$ and a power-law kernel

Equation (314)

where $\varphi\geqslant0$ and the microscopic characteristic scale $ \newcommand{\e}{{\rm e}} \ell$ regularizes the singularity of the power law in the propagator $h(t)$ at t  =  0 and ${\mathcal{H}}(t)$ is the Heaviside function ensuring the condition of causality. The special case $\varphi=0$ recovers the MRW model with a truncation at an integral scale. The spectrum of the quasi-multifractal process can be computed and it is found that multifractality holds over a large range of dimensionless scales for $\varphi>0$ [538, 554].

Saichev and Filimonov considered the discrete version of the quasi-multifractal model and proposed a new method for the determination of the quasi-multifractal spectrum [539].

4.5.3. Self-excited multifractal model.

Filimonov and Sornette introduced the self-excited multifractal (SEMF) model, in which the absolute increments of the process are expressed as exponentials of a long memory of past increments [540]. The SEMF model has a self-excitation mechanism combined with exponential nonlinearity such that future values of the process explicitly depends on past ones. The self-excitation captures the microscopic origin of the emergent endogenous self-organization properties of financial markets, such as the cascade in information flows, the triggering of aftershocks by previous large fluctuations and the 'reflexive' interactions.

Without loss of generality, we denote the time increment $\Delta t \to 1$ . In discrete time $t=0, 1, 2, \cdots, n$ , the SEMF process reads $X_n=\sum_{i=0}^n \Delta{X}_i$ , where its increments are given by the following relation

Equation (315)

where

Equation (316)

in which the kernel has a power-law form

Equation (317)

The random variables $ \newcommand{\e}{{\rm e}} \eta_n$ is an i.i.d. Gaussian with zero-mean and unit variance representing a flow of external news, the parameter $\sigma$ stands for the impact amplitude of the external news and sets the dimension and scale of $\Delta{X}_n$ and Xn. The sum in equation (316) ensures that the dynamics is self-excited.

The discrete SEMF process can be viewed as the simplest multifractal generalization of the GARCH process [133], and in fact it is strongly related to EGARCH and other variants [131, 137] defined in section 1.3. The external flow of news $ \newcommand{\e}{{\rm e}} \eta_n$ can be either positive or negative, controlling the signs of the returns. The absolute returns (or volatilities) are determined by all past returns with a decaying weight as a function of their distance to the present. The SEMF process can reproduce all the standard stylized facts found in financial time series, such as heavy tails of the return distribution, absence of autocorrelation of the returns, long-range correlation of the squared increments, multifractality and the leverage effect, which are robust to the specification of the parameters and the shape of the memory kernel [540].

The continuous-time SEMF process can be formally defined by the stochastic integro-differential equation in the Ito sense [540]:

Equation (318)

where $dW(t)$ is the increment of the regular Wiener process and $h(t)$ is a memory kernel function. The expression (318) is justified as the formal notation of the discrete formulation (315) in the limit when $\Delta{t}\to0$ . When there is no memory in the kernel such that $h(t)=0$ , we retrieve the standard random walk.

4.6. Multifractal continuous-time random walk

Continuous-time random walks (CTRWs), initially proposed by Montroll and Weiss [555], has been studied extensively and applied to diverse fields [556560]. In particular, the CTRW formalism has been applied to model asset price dynamics [6671], where the price fluctuations and the associated waiting times between successive price fluctuations are jointly considered.

Motivated by Scher and Montroll's valley model [556], Perelló et al and Kasprzak et al modeled the pausing-time (also known as intertrade durations or waiting times between successive transactions) density at the transaction level and developed the multifractal continuous-time random walk (MF-CTRW) model [561, 562]. We present briefly this model below. We note that the model is quite general and can be applied to other socioeconomic systems.

Note that the Scher–Montroll model using CTRW can be mapped onto the ETAS (or Hawkes) model of triggering between events (earthquakes and other systems) [563]. And there is a strong use of self-excited Hawkes models to describe volatility clustering as well as trade dynamics [76].

4.6.1. Interevent duration distributions.

Following Scher and Montroll's valley model [556], the unconditional interevent duration density $\psi(t)$ is expressed based on the conditional density of the priority $ \newcommand{\e}{{\rm e}} \epsilon$ (or volume for transactions):

Equation (319)

where

Equation (320)

is the conditional density with $s_0, \beta>0$ and $ \newcommand{\e}{{\rm e}} \rho(\epsilon)$ is the density of $ \newcommand{\e}{{\rm e}} \epsilon$ [561, 562]. This framework is in essence a mixture of distributions hypothesis from subordinated processes [564].

When the priority $ \newcommand{\e}{{\rm e}} \epsilon$ has different density $ \newcommand{\e}{{\rm e}} \rho(\epsilon)$ , $\psi(t)$ will have different forms [561]. When $ \newcommand{\e}{{\rm e}} \epsilon$ is a constant, $\psi(t)$ is an exponential. When $ \newcommand{\e}{{\rm e}} \epsilon$ is distributed uniformly, $\psi(t)\sim t^{-1}$ . When $ \newcommand{\e}{{\rm e}} \epsilon$ follows the Laplace distribution, $\psi(t)$ has a power-law tail with arbitrary exponent for $t\gg s_0$ .

4.6.2. Multifractal CTRW.

Within the MF-CTRW formalism [561], the q-moment is defined as follows

Equation (321)

where

Equation (322)

It is found that none of the three $ \newcommand{\e}{{\rm e}} \rho(\epsilon)$ densities discussed above could produce a multifractal structure [561]. Rather, a proper selection of weight $ \newcommand{\e}{{\rm e}} \rho(\epsilon)$ to produce multifractality is in the form of the exponential or Weibull family:

Equation (323)

where $\alpha>0$ and $\sigma>0$ .

The moment $\langle{t^q}\rangle$ exists if $\alpha>1$ and q  >  −1. When $\alpha=2$ , one has

Equation (324)

where $ \newcommand{\e}{{\rm e}} L=\exp(\alpha^2\beta^2/4)$ . When $\alpha\neq2$ , one has

Equation (325)

where $ \newcommand{\e}{{\rm e}} L=\exp[(\alpha-1)(\beta\sigma/\alpha){}^{\alpha/(\alpha-1)}]$ . Kasprzak et al further derived the generalized partition function in a closed form within the MF-CTRW model and unveiled higher-order phase transitions in financial markets [562].

4.7. Agent based models

A promising direction is to understand the microscopic origins of apparent multifractality in financial time series through computational experiments using agent-based models (ABMs) [2]. We provide here a parsimonious survey of such efforts without going into the details of the models.

4.7.1. Percolation models.

Percolation models for financial markets are based on stochastic formation of opinion clusters with many variants of the rules of cluster merging and splitting [565, 566]. The multifractal analysis of returns using the structure function approach illustrates that the scaling exponent function $\xi(q)$ varies with model parameters and that properly determined parameters are able to produce results comparable to those of the DJIA index [567]. Several percolation models have been studied aiming at exploring the emergence of multifractal behavior in artificial financial time series, such as the stochastic cellular automata model [568], the particle-cluster aggregation model on scale-free networks [569, 570], the percolation model on the Sierpinski carpet lattice [571], and other variants of the percolation model [572577].

4.7.2. Spin models.

Spin models, especially Ising models, have also been adopted to model the price formation process of financial assets [578]. In a sophisticated extension of the Ising model, Sornette and Zhou incorporated global news for all agents, imitation among neighboring agents and idiosyncratic preference of individual agents with an irrational interpretation of the predictive power of global news [58, 59]. The model can generate multifractal financial time series in a self-organized way. There are also Ising models for stock markets with market makers [579581], on Sierpinski carpet lattice [582], and on small-world networks [583, 584]. These models also succeeded in generating multifractal returns.

4.7.3. Empirical order-driven models.

Mike and Farmer introduced an empirical behavioral model for order-driven markets [585], known as the Mike-Farmer model [586], in which the basic rules (the long memory in order directions and a Student distribution in relative order prices) are based on the statistical regularities extracted from the order flow data of real stocks. Gu and Zhou improved the Mike-Farmer model by considering further the long memory in relative order prices [587]. The Gu-Zhou model is able to produce multifractality in the generated return series [587] and in the recurrence intervals of returns [588].

4.7.4. Other models.

Thompson and Wilson found that the zero-intelligence model of Farmer et al [589] is not able to produce multifractality in the generated returns, but the positive-intelligence model can [590]. Crepaldi et al showed that, although the usual minority games do not produce multifractal returns, the non-synchronous minority games in the non-ergodic phase can generate multifractal returns [591]. Multifractal behavior of financial time series is also observed in other types of models, such as an artificial dealer market model [592], a double auction model [593], the stochastic voter model [594, 595], the stochastic interacting contact model [596, 597], the epidemic model [598], and strategic models with fundamentalist agents [599, 600].

5. Properties and important issues

5.1. Determination of multifractal quantities

5.1.1. Computational precision.

In most multifractal analyses, we need to calculate high-order moments. The problem of computational precision may arise. Consider the partition function approach as an example. When $m(s,t)\ll1$ and $q\gg1$ , the corresponding values of the partition function $\chi$ will be too small such that the computer is out of memory. To overcome this problem, we can compute the logarithm of the partition function ln$\chi_{q}(s)$ rather than the partition function itself. With a simple manipulation, we obtain [205]:

Equation (326)

where $m_{\max}=\max \limits_{t}\{m(s,t)\}$ is the maximum of $m(s,t)$ in the boxes.

5.1.2. What if N/s is not integer.

Multifractal analysis deals with the scaling behavior of time series at different scales. The most common way is to partition the time series of length N into nonoverlapping windows of scale s. Hence, it is not uncommon that N/s is not an integer. If one uses a window with size less than s, one may obtain incorrect results such as $f(\alpha)>1$ , which happened sometimes in empirical studies. A usual strategy is to cover the time series from both sides and keep only the $\mathrm{int}[N/s]$ boxes of size s for each covering [178].

Alternatively, we can adopt the idea of the sand box method [601, 602], in which, instead of covering the times series, one uses Ms boxes of size s with the centers of the boxes randomly distributed on the time series [603]. Taking the family of MF-DFA as an example, the qth overall fluctuation is

Equation (327)

This approach has an additional advantage to reduce fluctuations around the power-law scaling, especially when there are intrinsic log-periodic oscillations such as in binomial measures [312].

5.1.3. Range of q and convergence of moments.

A preliminary condition for multifractal analysis is the estimation accuracy of the moments. When $|q|$ is large, the estimated moment is usually unreliable since the empirical integrand $x^q\Pr(x)$ for the moment diverges, where x is the measure and $\Pr(x)$ is its probability distribution in the partition function approach [350]. The divergence phenomenon has been empirically shown in turbulence data [268, 350] and financial data [349]. Similarly, when $|q|$ is larger, the estimated derivatives ${\rm d}F_q(s)/{\rm d}s$ deviate significantly from the theoretical values at small scales and fluctuate a lot at large scales in MF-DMA [604]. We believe that the divergence phenomenon is ubiquitous in multifractal analysis.

The estimation accuracy of moments is worse when $|q|$ is larger and the time series length N is shorter. Unfortunately, there are no theoretical results of the dependence on the length N of the maximum q for which the convergence of the integrand is ensured. With turbulence data, Zhou et al estimated that $N(q=8) \approx 6 \cdot 10^{6}$ and $N(q=9) \approx 5 \cdot 10^{7}$ [268]. For low-frequency financial time series, such as daily returns, we suggest that the range of q should be $|q|\leqslant6$ , or even smaller.

5.1.4. Determination of scaling range.

If a time series is fractal or multifractal, there are power-law scaling relationships between moment-like quantities and scales. In practice, taking $F_q(s) \sim {s}^{h(q)}$ in equation (116) as an example, the scaling exponents are usually estimated using linear least-squares regressions of $\ln{F_q(s)}$ against $\ln{s}$ in a scaling range $[s_{\min},s_{\max}]$ . In other words, the scaling laws hold only on the scaling range. The scaling range should be wide enough to ensure the presence of scale-invariant characteristics [605608]. In many cases, the change of scaling range causes dramatic variations of the estimates and sometimes very strange singularity spectrum shapes [609, 610], some of which are called 'inverted multifractality' [329]. Hence, as the scaling range plays a crucial role in the estimation of scaling exponents, many efforts have been made for its determination. Nevertheless, it remains a difficult task that has not been fully solved [327]. The mostly adopted method is actually eye balling.

Wang et al argued that scaling laws hold in the scaling range but not in the two ranges outside the scaling range such that one can fit the field with three power laws by minimizing the RMS of residuals, called the three-segment method by some researchers [611]. Because the assumption of two extra power laws is merely empirical, this method is not widely adopted. However, when Fq(s) exhibits a crossover phenomenon as in many financial time series [135, 205, 496, 612615], the idea can be used to simultaneously determine the crossover scale $s_{\times}$ and the two scaling exponents H1 and H2 [610, 616]. We rewrite the bi-power law for q  =  2 as follows

Equation (328)

To estimate the three parameters $s_{\times}$ , H1 and H2, we minimize the following function

Equation (329)

where c1 is not a free parameter due to the constraint that the two straight lines should intersect at $s_{\times}$ , that is,

Equation (330)

This method does not allow one to determine the left and right cutoffs of the two scaling ranges. The left cutoff can be determined if we combine other methods [617, 618]. Alternatively, Fang et al suggested to use a polynomial function for the second range and take the first range as the scaling range, which however fixed $s_{\min}=1$ [619]. Ge and Leung studied a method to identify crossover scales based on statistical inference [620].

Xiong et al argued that the optimal right cutoff $s_{\max}$ can be determined based on the intersection of the $\ln{F_{q_1}(s)}$ and $\ln{F_{q_2}(s)}$ curves, where q1 and q2 are respectively the minimum and maximum positive values of q under investigation [621]. The candidate right cutoff is determined as follows

Equation (331)

In order to avoid being trapped within local optima, it is required that the variation around $s_{\max}$ should be large enough:

Equation (332)

where j   =  1 and 2. This method works well for traffic flow time series [621]. Extensive numerical experiments are necessary to check if the method is suitable for other time series.

Berntson and Stoll proposed a range erosion method based on the test of self-similarity [622]. Starting from the largest range, we fit the data and perform a residual test of self-similarity, which fits second-order polynomials to the residual plots and took significant second-order polynomials as a rejection signal of self-similarity. If the self-similarity is rejected, we erode the endpoints to shorten the data and repeat the fit-and-test procedure until either too few points remained to perform a regression or the data showed self-similarity over the remaining range. This erosion is suggested to be performed in three ways: (1) removing the left endpoint, (2) removing the right endpoint, and (3) removing both endpoints. The largest remaining logarithmic range is taken as the scaling range. Interestingly, Xia et al proposed a range expansion method based on Theil's inequality coefficient [623]. It starts from the two points in the center of the scale sequence and Theil's inequality coefficient $\delta(s_i,s_j)$ is calculated. If $\delta(s_i,s_j)$ is less than a prefixed threshold $\delta_0$ , we proceed to the nest step. We include the nearest left scale si  −  1 to the scaling range if $\delta(s_i-1,s_j)<\delta(s_i,s_j+1)$ and $\delta(s_i-1,s_j)<\delta_0$ or the nearest right scale sj  +  1 to the scaling range if $\delta(s_i-1,s_j)>\delta(s_i,s_j+1)$ and $\delta(s_i,s_j+1)<\delta_0$ . The iteration stops when $\delta(s_i,s_j)>\delta_0$ and the final scaling range is regarded as optimal.

Fei et al proposed another automatic method to identify the optimal scaling range [624], which is implemented through the method of exhaustion or brute force to obtain the RMS of the residuals of the fit to the data within all potential scaling ranges $[s_i,s_j]$ with $s_j/s_i$ greater than a prefixed constant and choose the scaling range with the minimal RMS. Similarly, Gulich and Zunino proposed a criterion based on a brute-force algorithm to systematically determine optimal fitting regions in DFA and MF-DFA [625]. Although the study of the algorithm mainly focuses on DFA and MF-DFA, it can obviously be applied to other multifractal analysis. One first determines a set of scales $\{s_i\}_{i=1}^{n}$ that are evenly spaced in the logarithmic ordinate. An empirical suggestion is to use s1  =  4 and sn  =  [N/4] which embraces the scaling range of Grech and Mazur [626628]. Next, one performs linear regression for each pair $[s_i,s_j]$ to obtain the R-squared statistic (or coefficient of determination) $R^2_{ij}$ . To ensure that the scaling law is not spurious, the scaling range should be wide enough [605]. Hence, the scaling range width $\ln{s_j}-\ln{s_i}$ should be greater than a prefixed constant, say one order of magnitude. The pair with the largest $R^2_{ij}$ is regarded as the optimal scaling range $[s_i,s_j]$ . The criteria of these two methods are essentially the same. For multifractal analysis, one obtains the average $R^2_{ij}$ over different q values [625]. In addition, Leonarduzzi et al designed a nonparametric bootstrap based procedure to achieve automated selection of scaling range, which also falls in the class of exhaustion methods [629].

Wu suggested that a qualified fit in the estimation of fractal dimensions should satisfy three criteria [630]: (1) the linearity between $\ln{F_q(s)}$ and $\ln{s}$ is significant; (2) the maximum deviation of each point does not exceed twice the standard deviation of the fit residuals; and (3) the standard deviation of the estimate is less than a preset value. After exhaustive calculations, the widest range among those for all the qualified fits (largest $\ln{s_j}-\ln{s_i}$ ) is used as the optimal scaling range [630]. Because this method does not ask for a best goodness-of-fit, one does not need to pose a constraint on $\ln{s_j}-\ln{s_i}$ . For multifractal analysis, we suggest to modify the third criterion by requiring that the relative deviation of the estimate is less than a preset percentage, say 5%. The optimal scaling range for q  =  2 is often used for other q's. However, it is not uncommon that the crossovers become evident for large $|q|$ 's but not for q  =  2 when there is a crossover phenomenon or the scaling range becomes narrower for large $|q|$ 's. Hence, it is better to determine the optimal scaling range based on the largest q value. Alternatively, one may determine the optimal scaling range for each q, in which an extra constraint should be posed ensuring that the scaling ranges for different q's should overlap at least partly. We can further combine the multiple linear regression (328) for crossover phenomena.

Grech and Mazur argued that the scaling range is related to the length N of the analyzed time series, the level of long-term memory described by the Hurst exponent H of the time series, and the goodness-of-fit quantified by the coefficient of determination R2 or equivalently u  =  1  −  R2 [626628]. They provided an excellent analysis on the behavior of scaling ranges in DFA and DMA and obtained their expressions for fixed confidence levels, which is the percentage of time series matching the assumed criterion for R2. In their analysis, they fixed $s_{\min}=8$ . For DFA, it is found that

Equation (333)

where a1  =  3.40, a2  =  4.16, a3  =  0.0097 and a4  =  −96 for the confidence level 97.5% and a1  =  3.95, a2  =  5.03, a3  =  0.0070 and a4  =  −106 for confidence level 95% [626]. For DMA, it is found that

Equation (334)

where D0  =  0.558, D1  =  0.640, $ \newcommand{\e}{{\rm e}} \eta_0=1.130$ , $ \newcommand{\e}{{\rm e}} \eta_1=0.135$ , $\xi_0=1.049$ and $\xi_1= 0.657$ for the confidence level 97.5% and D0  =  0.400, D1  =  1.125, $ \newcommand{\e}{{\rm e}} \eta_0=1.146$ , $ \newcommand{\e}{{\rm e}} \eta_1=0.163$ , $\xi_0=0.975$ and $\xi_1= 0.593$ for confidence level 95% [627]. In most studies, researchers adopt $s_{\max}=N/4$ recommended by Kantelhardt et al [178], which is too large for most time series and one should use a much smaller $s_{\max}$ to ensure an more accurate estimation of H [628].

When we are estimating the Hurst exponent of a given time series, we first choose the goodness of fit R2 and the confidence level (for instance 97.5% or 95%). In this way, the expression of $s_{\max}(u,N,H)$ is determined, but not its value, because H is unknown. We next scan different scales (denoted as $s'$ ) and obtain the estimate of H by fitting the power law in the scaling range $[8,s']$ , which is a function of $s'$ , i.e. $\hat{H}(s')$ . The 'optimal' right cutoff of the scaling range is determined as follows

Equation (335)

and the estimate of H is

Equation (336)

When we are performing MF-DFA or MF-DMA, we determine $s_{\max}$ first and utilize the same scaling range for all q values. Certainly, this automatic procedure requires further validation because it is possible to have multiple $s_{\max}$ values as well as multiple $\hat{H}$ values. Moreover, the choice of $s_{\min}=8$ is rather arbitrary [327].

The methods mentioned above are based on direct fitting of the data. Another class of methods are based on identifying the flat region in the local derivatives. In empirical analysis, visual inspection is frequently used [327, 604, 631, 632]. One of the earliest method, named the self-similar ratio algorithm, was proposed by Hong and Hong in [633]. Although the algorithm was designed to determine the scaling grange in fractal dimension estimation, it can be easily adopted for multifractal analysis. The self-similar ratio Ri is defined as follows

Equation (337)

If the logarithmic scales $\{\ln{s_i}\}$ are evenly spaced, $s_{i-1}/{s_i}$ is a constant and Ri is also a constant for each q. For each range, if $R_i=\langle{R_i}\rangle$ (or $R_i-R_{i-1}=0$ ) is statistically significant, the range is a qualified scaling range. The widest qualified scaling range is then the optimal scaling range. We notice that Ri is in essence equivalent to the local derivative $\ln{R_i}/\ln(s_{i-1}/{s_i})$ . Li et al determined the scaling range by removing points whose local derivatives are far from the average derivative [634]. Du et al proposed an exhaustion method on local derivatives, in which qualified scaling ranges should ensure that the correlation coefficient between the derivatives and the logarithmic scales is greater than 0.99 [635]. Among all the qualified scaling ranges, the one with the minimal variance of the derivatives is selected as the optimal scaling range. Ji et al developed an algorithm by combining the K-means algorithm and a point-slope-error algorithm [636]. Moreover, we can also perform tests of curvilinearity in the local derivative time series, as in [622].

Zhou et al proposed to consider the second-order derivatives since they should be zero or nearly zero within the scaling range [637], which is reminiscent of testing $R_i-R_{i-1}=0$ in the self-similar ratio method [633]. The method uses the simulated annealing genetic fuzzy C-means clustering algorithm to divide the data into three groups: positive fluctuations, zero fluctuations, and negative fluctuations around the horizontal line of zero second-order derivatives. Again, other identification methods developed for $\ln{F(s)}$ and its first-order local derivatives can also be adopted to develop new scaling range identification methods based on the second-order derivatives.

A relevant but less considered issue is the accuracy of the estimated exponent [638], which should also be considered during the attempt to determine the scaling range.

5.1.5. Deviation of moments from the asymptotic behavior.

A severe problem might occur for many time series, namely that the scaling dependence of the moments on the time scales deviates from the asymptotic behavior. This happens usually together with the finite-size effect when the time series is not long enough [639]. In addition, the discretization of a continuous process also induces deviations from the asymptotic behavior due to the effect of auto-correlations and non-asymptotic convergence described by the central limit theory [410]. To overcome these difficulties in measuring the generalized Hurst exponents of financial time series, Buonocore, Aste and Di Matteo introduced an integrated structure function based estimation method [410].

Theoretically, for monofractal or multifractal processes, the scaling of the structure functions $K(q,s)$ against the time scale s expressed in equation (60) can be rewritten as

Equation (338)

where $\ln{K(q,s)}$ is the dependent variable, $\ln{s}$ is the independent variable, and ${\zeta(q)}$ and A are model parameters to be estimated. When applied to real data, equation (338) holds only asymptotically and a correction function $g(\ln{s})$ should be introduce, that is,

Equation (339)

Assuming that

Equation (340)

and integrating equation (339), we obtain

Equation (341)

where ${\hat{K}}(q,s)$ are empirically calculated from the real time series at different time scales.

The estimation proceeds as follows [410]. For each q, we determine the autocorrelation length $s_{\max}$ , integrate the empirical $\ln{\hat{K}}(q,s)$ for $s\in[1,s_{\max}]$ , and determine $s_{\min}$ by the behavior of $c(s)$ defined in equation (340). The interval $[s_{\min}, s_{\max}]$ is taken as the scaling range and we can determine the scaling exponents in the scaling range. One then checks if the correction function $g(\ln{s})$ converges to 0 and if its integral converges to a constant. With the scaling exponents $\zeta(q)$ estimated, we perform parabolic and quartic fits to equations (342) and (343) and choose the best one to obtain a function form of $\zeta(q)$ . This process is repeated for different criteria for determining $s_{\max}$ and one then chooses the one with the best goodness-of-fit.

Alternatively, Qiu et al proposed an unbiased factorial-moment-based estimation for moments [640]. Numerical analysis of the p -model (they called it the probability redistribution model) verifies that the new method can extract exactly the multifractal behavior from several hundred recordings and outperforms the basic partition function approach, the MF-DFA method and the WTMM method. They applied the method to the daily volatility of the Shanghai stock market SSEC index from 1995 to 2015 in 3y rolling windows and unveiled intriguing dynamics of the multifractal behavior. It is also found that the partition functions in the two windows 1995–1997 and 2007–2009 containing the Asian crisis and the latest Great Crash obey comparatively better power law scaling in wider scaling ranges. Although the estimation method through integrated moments is based on the structure function approach, while the factorial-moment-based estimation is based on the partition function approach, they are both very promising, because their extensions to other multifractal analysis methods are straightforward. What is left is to check their performance.

5.1.6. Assumed forms for $ {\tau(q)} $ (or $ {\zeta(q)} $ ) and $ {f(\alpha)} $ .

In some studies, researchers fitted the empirical curves of $\tau(q)$ , $\zeta(q)$ or $f(\alpha)$ . If the time series can be modelled in a multiplicative cascades, we can use the mathematical models with known multifractal properties as described in section 5.5.1. There are also other assumptions that do not relate directed to any underlying multifractal mechanisms.

In order to give a smooth presentation of the scaling exponents and a quantitative assessment of the degree of multifractality, Buonocore et al suggested to use polynomial expressions for $\zeta(q)$ [410, 641]. The quadratic (or parabolic) expression is

Equation (342)

with only one parameter B  <  0, and the quartic polynomial expression is

Equation (343)

with only two parameters C and D  <  0. The coefficients for the terms satisfy the following constraints:

Equation (344)

Equation (345)

Equation (346)

The fits to the polynomials are conducted over the range $q\geqslant-1$ , because real return time series usually have fat tails with the tail exponent $\gamma>2$ [20, 642, 643] such that moments of returns with q  <  −1 do not exist. Similarly, a cumulant expansion of $\zeta(q)$ suggests an infinite polynomial form of $\zeta(q)$ under the multifractal framework of multiplicative cascades [479] or wavelet leaders [303, 644].

For the quadratic expression, based on the Legendre transform presented in equation (11), we have

and

It follows immediately that

Equation (347)

We can see that $\alpha$ can be negative when $q>(0.5-2B)/2B$ , $\alpha_{\min}$ and $\alpha_{\max}$ (when $q\to\pm\infty$ ) does not exist, the most probable singularity is $\alpha_0=0.5-2B$ , and $f(\alpha)$ could be negative.

If the singularity spectrum is bounded within $[\alpha_{\min},\alpha_{\max}]$ and $f(\alpha_{\min}) = f (\alpha_{\max}) = 0$ when $q\to\pm\infty$ , such as the p -model, the form of the $f(\alpha)$ can be assumed to be [645]

Equation (348)

where $\gamma_1$ , $\gamma_2$ , $\alpha_{\min}$ , $\alpha_{\max}$ and $\alpha_1=\alpha(1)$ are a set of parameters characterising a particular $f(\alpha)$ curve. Due to the geometric properties of $f(\alpha)$ , $0<\gamma_1,\gamma_2 <1$ . Combining $\frac{\rm d}{{\rm d}\alpha}f(\alpha)|_{q=1}=1$ and $\alpha_1 = f(\alpha_1)$ , we have

Equation (349)

Hence, the $f(\alpha)$ curve has four free parameters. There are also quadratic and quartic functional forms assumed for $f(\alpha)$ [646, 647]. If we consider $\frac{\rm d}{{\rm d}\alpha}f(\alpha)|_{q=0}=0$ , the linear term must vanish!

5.2. Some important properties

5.2.1. Discrete scale invariance and log-periodicity.

Fractals and multifractal measures are scale invariant [163]. Scale invariance characterizes the relationship between the values of an observable $\mathcal{O}$ at two distinct scales s and $\lambda{s}$ :

Equation (350)

where $\lambda$ is the magnification factor or scaling ratio and $\kappa$ is the corresponding scaling factor of the observable. A continuous ratio $\lambda$ results in continuous scale invariance (CSI), in which equation (350) holds for arbitrary positive real values of $\lambda$ . In many natural, technologic and social systems, a characteristic scale exists so that the general solution to equation (350) is not just a power simple law implied by CSI but take the form (see [648], references therein, and citing papers)

Equation (351)

where $D=\ln \kappa / \ln \lambda $ is the fractal dimension and $\psi(\lambda{s}) = \psi(s)$ is a log-periodic function (i.e. a periodic function of $\ln s$ ) with period $\ln\lambda$ . In these cases, ${\mathcal{O}}(s)$ obeys the symmetry of discrete scale invarance (DSI), in which the system is scale invariant only under integer powers of specific values of the magnification factor $\lambda$ with the signature of log-periodic oscillations decorating power law scaling relationships.

Log-periodic power-law singularity (LPPLS) signals have been observed in financial time series, and has been used for the identification of bubbles and antibubbles and the forecast of turning points [50]. This has been applied to the NIKKEI 225 antibubble [40], the US housing bubble [49], the Chinese stock market antibubble [649] and bubble [52], and the crude oil bubble of 2008 [650]. In addition to the ubiquitous presence of LPPLS in fractal time series [648], Zhou and Sornette unveiled the presence of DSI in certain multifractal measures and joint multifractal measures via the methods of MF-PF and MF-X-PF [312], such as the detrended fluctuation functions of multifractal binomial measures although the authors may not notice this phenomenon [178, 345, 467]. In an interesting work [651], Xiao et al converted the detrended cross-correlation analysis to a new form with a log-periodic oscillation modulated power law scaling and illustrated the idea using five stock market indices (DJIA, FTSE, DAX, SSEC and NIKKEI), five sectors in Shanghai stock market, and ten sectors in DJIA from 22 December 1998 to 30 April 2010. If LPPLS exists in DCCA, one can rationally expect its presence in MF-DCCA presentations.

The presence of LPPLS in the moments calculated in different multifractal analysis approaches has an important implication in the determination of the scaling exponents. This issue is closely related to the choice of the scales si and hence the scaling range $[s_1, s_m] = [s_{\min},s_{\max}]$ . A natural strategy is to ensure that the scaling range contain several complete oscillations such that $(\ln{s_{\max}}{{\rm \mbox{--}}}\ln{s_{\min}})/\ln\lambda$ is an integer and each log-periodic oscillation contains the same number of scales with the same set of phases.

5.2.2. Translation invariance.

In the detrending methods, the innovations $\Delta{X}$ is usually adjusted to have zero mean before the cumulative summation procedure. When the trend function is imposed to have a constant term, such as polynomial trends, the corresponding method (such as MF-DFA) is translation invariant, that is, $\Delta{X}$ and $\Delta{X}+c$ have the same multifractal properties, where c is an arbitrary constant.

However, the situation can be very different for other detrending methods such as MF-DMA [652]. The cumulative sum (or profile) of $\Delta{X}+c$ is

Equation (352)

and the moving average at t obtained from t  −  s1 to t  +  s2 for window size s is

Equation (353)

where $\theta$ is defined in equation (122). Hence, the residual of the constant shift c after removing the moving average is

Equation (354)

which is a constant for a given window size s. One finds that the residual $ \newcommand{\e}{{\rm e}} \epsilon _{c}(t)=0$ only when $\theta=0.5$ . Otherwise, we will observe a crossover in the F versus s scaling plots [652].

Therefore, the centered MF-DMA is translation invariant, while other MF-DMA versions are not. This analysis raises the serious issue of whether the mean should be removed from the innovations in equation (121). For time series with nonzero means, this analysis explains at least partly the discrepancy of MF-DMA results for different $\theta$ values.

5.2.3. Negative dimensions $ {f(\alpha)<0} $ .

An intriguing feature of multifractals is the possible existence of negative dimensions in the multifractal spectrum, that is, $f(\alpha)<0$ for large or small singularities $\alpha$ [653655]. Negative dimensions have also been observed in the singularity spectra of financial time series, such as financial volatility [349], intertrade durations [492], WIG index returns [609], returns of the spot and futures prices of WTI crude oil [656], foreign exchange rates [657], CSI 300 index returns [658], MASI index returns and MADEX index returns [659], and S&P 500 index returns [151], and so on, for which different multifractal analysis methods have been applied.

The meaning of negative $f(\alpha)$ was firstly discussed by Cates and Witten [206], who noted that negative dimensions are induced by the ensemble averaging of 'supersamples'. Negative dimensions also appear if the multiplier distribution is continuous [348, 487, 488, 660], such as financial volatility multipliers [349]. A simple illustrative analytical example shows that negative dimensions arise from either the existence of intrinsic randomness or from a random treatment of a deterministic process [347]. This coincides with Mandelbrot's insights based on his works on random multifractals [654, 655].

5.2.4. Skewness of multifractal spectra.

In diffusion-limited aggregation (DLA) processes, the multifractal spectrum shows anomalies, in which there is a critical value $q_{{\rm c}}$ such that the partition functions diverge faster than power laws and the $\tau(q)$ function does not exist when $q<q_{{\rm c}}$ [661663]. These anomalies can be explained within the framework of left-sided multifractals based on infinite multinomial measures, in which the generating function reads [664, 665]

Equation (355)

A mathematical treatment can be found in [227]. When $p_i=2^{-i}$ and $r_i=i^{-\lambda}-(i+1){}^{-\lambda}$ (such that $\sum_{i=1}^\infty{p_i}=\sum_{i=1}^\infty{r_i}=1$ ) where $\lambda>0$ is a parameter, the critical q value $q_{{\rm c}}=0$ and the multifractal spectrum is one-sided:

Equation (356)

where c and $c'$ are positive constants depending on $\lambda$ and $\kappa=\max\{2,\lambda/(\lambda-1)\}$ [664, 665]. Hence, $f(\alpha)$ is a monotonically increasing function of $\alpha$ with $f_{\max}=1$ . Numerically, we have to adopt finite approximations of the measures and the resulting multifractal spectra are left-sided and converge gradually to the exact one-sided expression. Obviously, the inverse multifractal spectra are right-sided (see section 2.1.4 for inverse measures).

For empirical multifractal spectra, the skewness can be defined analogously to probability distributions. A negatively skewed spectrum has a longer or fatter left tail, which is also said to be left skewed or skewed to the left. A positively skewed spectrum has a longer or fatter right tail, which is also said to be right skewed or skewed to the right. A simple measure of spectrum skewness is the ratio of the slope of the left part of $f(\alpha)$ to the absolute slope of the right part of $f(\alpha)$ [646]. The whole spectrum is left skewed if the ratio is larger than 1. A more natural measure of skewness is [647]

Equation (357)

where ${\rm skew} = 1$ for un-skewed spectra, ${\rm skew} > 1$ for right-skewed spectra, and ${\rm skew} < 1$ for left-skewed spectra. An un-skewed spectrum is not necessarily symmetric. This measure has also been adopted in empirical analyses [666, 667]. In empirical applications, we suggest to impose $q_{\max}=q_{\min}$ for the estimation of skew in expression (357). It is argued that left-skewed multifractals have more fine-structures in large fluctuations, while right-skewed multifractals have more fine-structures in small fluctuations. Skewed (or asymmetric) multifractal spectra are observed in many empirical financial systems, in which the spectra of financial returns are more likely to be left-skewed while intertrade durations develop quite systematically a right-skewed asymmetry in multifractality [668, 669]. It is not unexpected that there are always exceptions in social sciences [491, 493].

5.3. Superposed components and crossover phenomena

Crossover phenomena in the scaling behavior of financial time series are ubiquitous [135, 205, 496, 610, 612615]. In turbulent flows, there are well-defined inertial range and viscous range [670]. Financial markets are probably analogous to turbulence in certain sense [17, 18, 176]. However, financial frictions are hardly the causes of such crossover phenomena because the crossover scales are usually much larger than the 'friction scale'. Instead, there are mechanical interpretations that superposed time series can induce crossovers. The effects of trends have been studied 20 years ago [671]. Kantelhardt et al performed an extensive numerical analysis on the effects of polynomial and periodic trends on the scaling behavior of DFA [672].

5.3.1. Superposition rule.

Assume that an additive trend $u(t)$ is added to a long-range correlated time series $X(t)$ to form a new time series

Equation (358)

where $X(t)$ exhibits mono-scaling behavior without crossovers

Equation (359)

If the detrended residuals of $X(t)$ and $u(t)$ are uncorrelated, there is a superposition law in the fluctuation behavior for DFA [443] and DMA [652],

Equation (360)

which holds for MF-DFA and MF-DMA. The crossover scale $s_\times$ can be determined by the following equation:

Equation (361)

In many cases, two power laws appear respectively for $s<s_\times$ and $s>s_\times$ . More generally, the appearance of crossover phenomena can be viewed as the results of the competition between multiple mechanisms, be they endogenous or exogenous, which produce power-law correlated properties [673].

5.3.2. Polynomial trends.

In a seminal work, Hu et al studied analytically and numerically the effects of superposed polynomial trends [443]. Consider the following polynomial trends added to the increments series:

Equation (362)

For DFA-$ \newcommand{\e}{{\rm e}} \ell$ , where $ \newcommand{\e}{{\rm e}} \ell$ is the order of the polynomial detrending function, if $ \newcommand{\e}{{\rm e}} \ell\geqslant{p+1}$ , the superposed polynomial trend $u(t)$ is 'absorbed' by the detrending function $\tilde{X}(t)$ . Hence, $u(t)$ has no effects on the scaling behavior of $X(t)$ and there is no crossover phenomenon.

For linear superposed trend u(t)  =  a1t, its detrended fluctuation function is

Equation (363)

where $k_0=\sqrt{5}/60$ . According to equation (361), we have

Equation (364)

showing that the crossover scale $s_\times$ depends only on a1 in a power-law form with a power-law exponent $1/(H-2)$ . When $s\ll{s}_\times$ , Fx(s) dominates and the scaling exponent of $z(t)$ is H. When $s\gg{s}_\times$ , Fu(s) dominates and the scaling exponent of $z(t)$ is 2.

For quadratic trends $u(t) = a_2 t^2$ , the detrended fluctuation function is

Equation (365)

where $k_0=\sqrt{15}/90$ . The crossover scale is

Equation (366)

The intrinsic scaling behavior is observed for $s\ll{s}_\times$ . It is found that $s_\times$ depends not only on a1, but also on the length of the time series N. Since H  <  1, $s_\times$ is smaller for longer time series. In this case, the scaling range becomes narrower and it is more difficult to estimate H.

The effects of polynomial trends on DMA are different [652]. For a constant trend with p   =  0, when $\theta=0.5$ , Fu(s)  =  0 and there is no crossover. Recall that $\theta$ is defined in equation (122). When $\theta=0$ and $\theta=1$ , we have

Equation (367)

There is a crossover at

Equation (368)

which shows that the crossover scale $s_{\times}$ is a power-law function of a0 with exponent $-1/(1-H)$ .

For linear trends with p   =  1, when $1\ll s \ll N$ , F2(s) behaves as

Equation (369)

It follows that

Equation (370)

When a crossover appears, the left part of the fluctuation function with $s<s_{\times}$ reflects the behavior of $X(t)$ , while the right part with $s>s_{\times}$ is dominated by the polynomial trends.

5.3.3. Periodic trends.

Montanari et al performed numerical investigations showing that the presence of periodicity in a time series may have an influence on the estimation of the Hurst exponent H and some estimators falsely detect the presence of long-range dependence [674]. For DFA, Hu et al studied the effects of sinusoidal trends and found that the Fz(s) function has three crossovers and exhibits a hump in the model that reflects the effects of the sinusoidal trend [443].

For a sinusoidal trend

Equation (371)

Fu possesses two power laws with a crossover scale $s_{2\times}$ .

When $s\ll{s}_{2\times}$ , we have

Equation (372a)

When $s\gg{s}_{2\times}$ , we have

Equation (372b)

It follows that

Equation (373)

When $s<s_{2\times}$ , the competition of Fu1(s) and Fx(s) produces a crossover at

Equation (374)

When $s>s_{2\times}$ , the competition of Fu2(s) and Fx(s) introduces a third crossover at

Equation (375)

Therefore, there are four power-law scaling regions: (1) when $s<s_{1\times}$ , Fx dominates and the scaling exponent is H; (2) when $s_{1\times}<s<s_{2\times}$ , Fu1 dominates and the scaling exponent is $\alpha_S=2$ ; (3) When $s_{2\times}<s<s_{3\times}$ , Fu2 dominates as a constant and the scaling exponent is 0; and (4) when $s>s_{3\times}$ , Fx dominates and the scaling exponent is H.

Note that all the three crossover scales are independent of N. When the time series is short, the fourth scaling region might disappear. The first scaling region is usually short and is not reliable for estimating H. If we want to estimate H based on the first and fourth regions, we need to have very long time series. These properties also applies for multifractal time series. The difficulty to have a suitable scaling range often leads to improperly selected scaling ranges, which makes fail the detection of the intrinsic multifractal spectrum [675].

5.3.4. Power-law trends.

Hu et al also investigated the effect of power-law trends [443]

Equation (376)

whose detrended fluctuation function reads

Equation (377)

The function $\alpha(\lambda)$ has the following form

Equation (378)

where $ \newcommand{\e}{{\rm e}} \ell$ is the order of the detrending polynomial function (120), and the function $\gamma(\lambda)$ is

Equation (379)

It follows that the crossover scale is approximately

Equation (380)

Obviously, the scaling regions depend strongly on $ \newcommand{\e}{{\rm e}} \ell$ , as well as on aP, N and $\lambda$ .

5.3.5. Additive noises.

When $u(t)$ is a fractional Gaussian noise with Hurst index Hu, the detrended fluctuation function is

Equation (381)

Combining with equation (359), we obtain

Equation (382)

Without loss of generality, we assume that H  >  Hu. Hence, when $s<s_{\times}$ , Fu(s) dominates. To correctly identify the intrinsic correlation property of $X(t)$ , the scaling range should be selected to the right of $s_{\times}$ . Numerical studies on this case with $X(t)$ being a monofractal or a multifractal time series show the important of correctly identifying the crossover scales and the scaling range as well [675, 676]. When a0 is sufficiently small, $u(t)$ has no effect on the scaling behavior of $X(t)$ . Consider the specific case where H  −  Hu  =  0.3. If the magnitude of noise $u(t)$ increases 10 times, the crossover $s_\times$ will increase about 2000 times! This implies that the crossover scales are not easy to observe [675, 676].

5.3.6. Preprocessing time series with trends.

Many high-frequency financial time series have daily cycles known as intraday patterns that occur in returns [677679], trading volumes [502, 680682], volatility [136, 677, 683], bid-ask spread [684687], intertrade duration [492, 493, 688690], and so on. Hence, one can preprocess the raw time series by removing the intraday patterns or cycles and then perform multifractal analysis on the cycle-adjusted time series.

The widely used preprocessing method in econophysics is to remove the intraday patterns (or more generally seasonal patterns). Suppose that $\{Y(i): i=1, 2, \cdots, N_DK\}$ is a high-frequency time series evenly sampled on ND trading days, where K is the number of points in each day. The intraday pattern is determined by averaging the values at each moment over different trading days:

Equation (383)

where $k=1, 2, \cdots, K$ . The adjusted data are obtained by removing the intraday pattern from the raw data [136]:

Equation (384)

where $n=1, 2, \cdots, N_D$ . For bid-ask spread [686], intertrade durations [492] and trading volume [502], no much differences are observed between the scaling results of raw time series and adjusted time series. If some $\overline{Y}(k)$ is very small, we may adopt the following procedure:

Equation (385)

for all the values of n and k.

Hu et al proposed to adopt an adaptive detrending on the original time series and then perform MF-DFA on the detrended time series [691]. The adaptive trend is determined as follows. The raw time series is partitioned into segments of length 2n  +  1, where any two successive segments overlap by n  +  1 points and the length of the last segment may be smaller than 2n  +  1. For each segment $S_v$ , we fit a polynomial of order $ \newcommand{\e}{{\rm e}} \ell$ , denoted $y_v(\,j)$ with $j=1, 2, \cdots, 2n+1$ . The adaptive trend for the overlapped region can be determined as

Equation (386)

for $j=1, 2, \cdots, n+1$ . The whole adaptive trend is

Equation (387)

where $V={\mathrm{int}}[N/n]$ . The resulting time series for further multifractal analysis is

Equation (388)

The adaptive trend does not have any jumps or discontinuities. However, this preprocessing does not eliminate the jumps in the trend function $\widetilde{Y}(t)$ in MF-DFA. We stress that n is irrelevant to the box size s and $ \newcommand{\e}{{\rm e}} \ell$ is irrelevant to l in equation (120). Both parameters n and $ \newcommand{\e}{{\rm e}} \ell$ are suggested to be determined by requiring that the variance of the detrended data $y(i)$ decays slowly when $ \newcommand{\e}{{\rm e}} \ell$ is further increased and/or 2n  +  1 is further decreased [691]. Ferreira et al used adaptive multifractal approaches (A-MF-DFA and A-MF-X-DFA) to investigate the daily indices time series of 48 stock markets from January 1995 to February 2014 [692]. In their results, the estimated Hurst indices hxx, hyy and hxy are all larger than 1, implying that they cumulatively sum the index before removing the local trends.

There are other detrending methods designed for different types of trends, such as wavelet transform [693], singular-value decomposition [694, 695], chaotic singular-value decomposition [696], Fourier transform [697700], Laplace transform [701, 702], Orthogonal V-system smoothing [703], EMD detrending [704706], data-driven detrending with echo state networks [707], and so on. Essentially, any method that is designed for trend detection can be used to detrend the raw time series. A careful investigation should compare the multifractal analysis results between raw and detrended time series. It is reported that the method of removing seasonal patterns performs better than the Fourier filtering method and the adaptive detrending method in detrending a time series with regular periods or trends [708, 709].

5.4. Effects of nonlinearity and filters

5.4.1. Effects of nonlinearity.

Chen et al studied the effects of non-stationarity in time series present in three ways on the outcomes of detrended fluctuation analysis [710]. The first type of non-stationarity is introduced by a 'cutting' procedure which removes segments of a given length from a time series and stitches together the remaining parts. The cutting procedure does not have any influences on the scaling behavior of persistent time series (H  >  0.5), but can warp the $F(s)$ function upwards.

The second type of non-stationarity is caused by segments with different properties, such as different standard deviations or different correlation exponents. For non-stationary persistent time series comprised of segments with two different values of the standard deviation, its scaling behavior remains the same as stationary persistent time series with constant standard deviation. For non-stationary antipersistent time series, a crossover emerges in the scaling behavior of $F(s)$ . For non-stationary time series with different local correlations, the scaling behavior also changes.

The third type of non-stationarity is realized by adding to a time series a tunable concentration of random outliers or spikes with different amplitudes asp. The detrended fluctuation function of correlated spikes behaves as

Equation (389)

where k0 is a constant, p  is the concentration of spikes, and $\alpha$ is the scaling exponent to estimate. It follows that

Equation (390)

The intrinsic scaling behavior dominates for $s\gg{s_{\times}}$ .

It is worth mentioning an interesting work that investigates how extreme data loss affects the scaling behavior of long-range power-law correlated and anticorrelated signals [711]. It is found that the global scaling of persistent time series is robust to extreme data loss and the scaling exponent remains practically unchanged even for extreme data loss of up to 90%. In contrast, the global scaling of antipersistent time series is vulnerable to extreme data loss and changes to uncorrelated behavior even with a very small fraction of extreme data loss.

5.4.2. Effects of filters.

Chen et al studied the effects of filters on DFA [712]. Investigating the effects of filters is to compare the scaling behaviors of the raw time series $X(t)$ and its transformed time series $g(t)=g(X(t))$ . As expected, it is found that linear filters $g(t) = a_1X(t)+a_0$ do not change the correlation properties. In contrast, the effects of nonlinear power law g(t)  =  aXk(t) and logarithmic $ \newcommand{\e}{{\rm e}} g(t) = \ln[X(t)+\epsilon]$ filters strongly depend on H, k and $ \newcommand{\e}{{\rm e}} \epsilon$ . No obvious crossovers can be identified.

Wang et al studied the effects of linear and nonlinear filters on multifractal analysis of binomial multifractal measures [713]. For linear filter, $g(t) = 2X(t)+1$ , where a is a constant, has the same multifractal properties as $X(t)$ . When quadratic and cubic filters are applied, MF-PF and MF-DFA show that the filtered time series g(t)  =  X2(t) and g(t)  =  X3(t) exhibit multifractality, in which $\alpha_{\min}$ becomes smaller and $\alpha_{\max}$ becomes larger with the increase of the filter order. For logarithmic filters $ \newcommand{\e}{{\rm e}} g(t) = \ln[X(t)+\epsilon]$ , with the increase of $ \newcommand{\e}{{\rm e}} \epsilon$ , $\alpha_{\max}$ remains almost the same and $\alpha_{\min}$ decreases.

Song and Shang investigated numerically the effects of linear and nonlinear filters on MF-X-DFA [714]. They found that the linear filter $g(t) = a_1X(t) + a_0$ has no effect on the multifractal cross-correlation properties. In contrast, nonlinear filters including nonlinear polynomial filter g(t)  =  aXk(t), logarithmic filter $ \newcommand{\e}{{\rm e}} g(t) = \ln[X(t) + \epsilon]$ , exponential filter $ \newcommand{\e}{{\rm e}} g(t) = \exp[a_1X(t) + a_0]$ and power-law filter g(t)  =  [X(t)  +  a]k have significant influence on the multifractal cross-correlation behavior.

Note that, although linear filters has no effect on DFA, MF-DFA and MF-X-DFA, they should affect the scaling behavior of DMA, MF-DMA and MF-X-DMA with $\theta\neq0.5$ due to the breaking of translation invariance in DMA.

In connection with section 4.5, applying an exponential filter onto a long-memory process (for instance a Hurst process) leads to multifractality. In other words, the mechanism of section 4.5 can be re-interpreted in the context of the study of the impact of filters are creating multifractility from an initial time series that is simply mono-fractal. This phenomenon was noticed early in [171, 715].

5.5. Performance of different methods

5.5.1. Mathematical models as reference.

In order to compare the performance of different multifractal analysis methods, one usually applies them to mathematical models with known multifractal nature. The first class of reference models are monofractal, such as fractional Brownian motions [357], autoregressive fractionally integrated moving average (ARFIMA) processes [140, 141]. The second class of reference models are bifractal, such as ordinary Lévy processes [716], exponentially truncated Lévy processes [717], and power-law tailed time series [718] (see section 7.1.1). The third class of reference models have multifractal nature, including multiplicative cascade models, MMAR models, MSM models, and MRW models (see section 4).

For joint multifractal analysis, one can use bivariate fractional Brownian motions (BFBMs) [453, 454, 457], two-component ARFIMA processes [179, 719], and two multiplicative measures [144, 180]. These models can be simply extended for multivariate time series.

5.5.2. Performance measures.

Let $H(q)$ , $\tau(q)$ and $f(\alpha)$ denote the true functions representing the multifractal properties of a given system. For each mathematical model, if it is stochastic, we generate K realizations. Using a certain multifractal analysis method, we obtain Hk(q), $\tau_k(q)$ and $f_k(\alpha)$ for the kth realization. The mean estimates are

Equation (391a)

Equation (391b)

and

Equation (391c)

where the averages are performed at fixed q. Their corresponding variances are

Equation (392a)

Equation (392b)

and

Equation (392c)

which can be used for measuring the estimation precision.

The performance of a multifractal analysis method is mainly measured by the absolute deviations of the estimates to the true values

Equation (393a)

Equation (393b)

and

Equation (393c)

or the relative errors

Equation (394a)

Equation (394b)

and

Equation (394c)

The absolute deviations or the relative errors allow us to identify which method performs relatively better for different q values. These measures can be adopted for estimation accuracy. Certainly, the simplest one is to measure the differences between the means of estimates and the theoretical values.

To qualify the total performance of a method, we suggest to use the average errors:

Equation (395a)

Equation (395b)

and

Equation (395c)

where $\langle{}\rangle_q$ denotes the average over all q values. We argue that the averages of the absolute deviations are not suitable measures of method performance. There are certainly other performance measures. For instance, computational complexity is often used as a measure of algorithm performance. However, this is not a crucial issue nowadays. The width of the scaling range is also an important measure, which however should be used only when there is sufficient estimation accuracy.

5.5.3. Length of time series ('finite size effects').

It can be expected that very short time series may produce 'wrong' estimates, because the estimated multifractal properties usually deviate more for shorter time series, as confirmed with the log-Poisson binomial, log-Gamma binomial and log-normal binomial measures [604]. There is also an evident finite-size effect for the partition function approach [639].

The inpacts of time series length can be quantified at least numerically. Weron studied numerically the confidence interval of the estimated DFA exponent of white Gaussian noise [720]. He found that the confidence interval depends on the length N  =  2n of time series and on the scaling range. The confidence interval is wider for shorter time series. At the confidence level of 95%, the confidence interval is

Equation (396)

Similar expressions have been obtained numerically for 10 estimators [721].

5.5.4. Incomplete comparison of performance.

When inventing a novel variant or novel method of multifractal analysis, we stress the importance of first performing intensive tests of the method on known multifractal time series, as emphasised in a slightly different context by Filimonov and Sornette [722] (see also the commentary at www.er.ethz.ch/media/essays/PNAS.html). Indeed, applying a novel (and therefore 'unknown') method on an unknown data set is likely to lead to incorrect conclusions, since the novel phenomena that arise often result from all types of biases and distortions introduced by the analysing method. Instead, when using a known multifractal time series, the deviation from the theoretically known multifractal properties inform on the properties and biases introduced by the novel analysing method. In turn, this allows the researcher to separate the distortions introduced by her analysis technique from the novel effects that she is hoping to discover. Good scientific practice should require that this procedure be performed without exception, preventing potentially many spurious claims to develop in the literature.

Following on this, there are efforts to compare different estimators for long-range dependence in time series. In several important studies, about 10 estimators including R/S analysis and DFA are compared and the (local) Whittle estimator is found to have the best performance in accuracy and precision [674, 721, 723]. When the R/S analysis and DFA are compared, numerical evidence shows that DFA performs better in accuracy and precision for different lengths of time series [720, 724]. It is also found that centered DMA and DFA are able to obtain accurate estimates [326, 327]. However, if the form of the trend in the time series is not a priori known or the long-memory scaling exponent is greater than 1 for colored noises, DFA is recommended [325, 326].

There are many variants of the classical multifractal analysis methods. However, the performance of many of them has not been validated with numerical experiments on synthetic tests and are thus open to the Filimonov–Sornette critic expressed above [722]. Most comparisons have been conducted on MF-DFA, MF-DMA, WTMM and MF-WL.

Kantelhardt et al studied the multifractal properties of river discharge and precipitation time series using MF-DFA and WTMM and found that the results obtained are in good agreement within the error margins [725]. Oświęcimka et al performed multifractal analysis on financial time series using MF-DFA and WTMM and argued that MF-DFA is better for a global detection of multifractal behavior, while WTMM is the optimal tool for the local characterization of the scaling properties of signals [726]. Figliola et al compared the results of multifractal analysis of EEG time series from an epileptic seizure subject using MF-DFA and MF-WL and concluded that both methods are equivalent for estimating multifractal properties [727]. However, these comparisons are not conclusive because they are not obtained from extensive numerical experiments on time series with a priori known multifractal properties. Again, the importance of performing extensive tests on synthetic time series cannot be over-emphasised.

Kantelhardt et al found that, for deterministic multifractal binomial measures with additive linear and quadratic trends, both WTMM and MF-DFA are able to obtain nice detrending and MF-DFA have slight advantages in precision for negative q values and short series [178]. Oświęcimka et al compared the performance of MF-DFA and WTMM using fractional Brownian motions, bifractal Lévy flights, and four sorts of multifractal binomial cascades [728]. For fractional Brownian motions, MF-DFA can correctly identify its monofractality, while WTMM often produces spurious multifractality. For bifractal Lévy flights, both methods produce spurious multifractality and MF-DFA performs relatively better. For deterministic binomial measures, MF-DFA can well capture the right part of the multifractal spectrum, but widens the left part. In contrast, WTMM can well captures the left part when a proper wavelet is adopted, but not the right part. In addition, the multifractal spectra obtained by MF-DFA with different detrending polynomials are almost the same, while those obtained by WTMM with different wavelets could be very different. Hence, MF-DFA performs relatively better than WTMM for deterministic binomial measures. MF-DFA outperforms WTMM remarkably for log-Gamma and log-Poisson binomial measures. For log-normal binomial measures, the two methods perform comparably well. Murguía and Rosu confirmed that MF-DFA outperforms WTMM for deterministic binomial measures [729]. Salat et al found that MF-DFA outperforms MF-PF and WTMM for deterministic binomial measures [196].

Based on log-normal multiplicative random wavelet cascades [360], Jaffard et al found that both WTMM and MF-WL produced the same multifractal spectrum [730]. Turiel et al tested the performance of MF-PF, WTMM, gradient modulus wavelet projection (GMWP) and gradient histogram (GH) and found that the GMWP method is the one attaining the best performance [294]. Serrano and Figliola compared the performance of MF-DFA and MF-WL based on Cantor sets and deterministic binomial measures [300]. They found that MF-WL provides more accurate estimations for these two time series.

Gu and Zhou investigated the performance of MF-DFA and MF-DMA based on deterministic binomial measures [328]. They found that the estimated multifractal scaling exponent $\tau(q)$ and the singularity spectrum $f(\alpha)$ are in good agreement with the theoretical values. In addition, the backward and forward MF-DMA methods have the best performance, MF-DFA is slightly worse especially for q  >  0, while the centered MF-DMA method has the worse performance. This conclusion is confirmed by Xi et al [731], who also found that the backward and forward MF-DMA methods are less sensitive to the length of deterministic binomial measures than MF-DFA and the centered MF-DMA is the most sensitive. Schumann and Kantelhardt found that both centered MF-DMA and MF-DFA can unveil nicely the multifractal nature of deterministic binomial measures [329].

Huang et al compared the performance of MF-HHSA, MF-SF, MF-DFA and MF-WL by analyzing fractional Brownian motions, stochastic multinomial measures and multifractal random walks [354]. For fractional Brownian motions, MF-SF and MF-WL provide a better estimation with narrower singularity width. For multifractal time series under investigation, it seems that the MF-HHSA and MF-DFA methods provide better singularity spectra than MF-SF and MF-WL.

Welter and Esquef compared the performance of MF-DFA, MF-DMA and EMD-DAMF in studying fractional Brownian motions, Lévy flights, multifractal random walks, and deterministic binomial measures [356]. For fBm processes, all the methods perform well and the estimate by EMD-DAMF is slightly more accurate than MF-DFA and MF-DMA, but with lower precision. For Lévy processes, the EMD-DAMF outperforms the MF-DFA and MF-DMA methods, both in accuracy and precision. For multifractal random walks, MF-DFA performs best both in accuracy and precision. MF-DMA and EMD-DAMF have similar accuracy and MF-DMA is more precise. For deterministic binomial measures, MF-DFA and MF-DMA outperform EMD-DAMF, both in accuracy and precision, and MF-DMA is more accurate than MF-DFA with similar precision.

Comparative studies on the performance of methods for joint multifractal analysis are relatively rare. Jiang and Zhou compared MF-X-DFA and MF-X-DMA [144]. They found that, for bivariate fractional Brownian motions and two-component ARFIMA processes, the MF-X-DFA and centered MF-X-DMA algorithms outperform the forward and backward MF-X-DMA algorithms. For binomial measures, the forward MF-X-DMA algorithm exhibits the best performance, the centered MF-X-DMA algorithms performs worst. Xi et al obtained similar (but not the same) results for binomial measures [732]. Cao and Shi found that, for two-component and mix-correlated ARFIMA processes, the backward and forward MF-X-DMA methods outperform centered MF-X-DMA, MF-X-DFA and MFDCCA-MODWT [733]. Kristoufek performed an extensive Monte Carlo simulation study to compare the performance of detrended cross-correlation analysis (DCCA), height cross-correlation analysis (HXA) and detrending moving-average cross-correlation analysis (DMCA) [734]. He found that DMCA strongly outperforms DCCA and HXA in the standard power-law cross-correlation case and the power-law coherency case. However, DCCA outperforms in the sense that it is less sensitive to the short-term memory.

5.5.5. Which method should be used?

It seems that there is no consensus on which method has the best performance for multifractal analysis. More numerical comparisons are required to gain a deeper understanding on the performance of different methods. First, more mathematical models with a priori known multifractal properties should be investigated. Second, different performance criteria should be considered, such as estimation accuracy which quantifies the deviation of estimated to the true values, estimation precision which qualifies the dispersion of estimates to the mean values, size effect of time series, width of scaling ranges, and sensitivity to the choice of scaling ranges. Unfortunately, even if one is quite convinced that a certain method performs the best for some tests of multifractal time series, we cannot extrapolate this conclusion directly to other time series.

In applications to real data, without an understanding of the underlying mechanisms that drive the dynamics of the complex system, it is hard to determine which method would be more suitable. If there is a trend that can be identified, MF-DFA with high-order detrending polynomials is recommended. In general, we suggest that different methods should be adopted such that we can compare their results qualitatively and quantitatively. For financial time series such time series of returns, we can check if the value of the Hurst index $H(2)$ is close to 0.5, based on the fundamental stylized fact of the absence of long memory in financial returns [94], which has also a strong theoretical underpinning in the concept of no-arbitrage opportunity. This can be used as a useful criterion to assess the results obtained from different methods. Indeed, there are works reporting incorrect results in the sense that the Hurst index $H(2)$ deviates significantly from 0.5.

6. Empirical evidence of multifractal markets

In the past three decades, hundreds of empirical studies have been carried out to investigate the presence of multifractality in financial markets. In the 1990s, there were only a few seminal papers, most of them applied the structure function approach [18, 142, 152, 213, 735, 736] or the box-counting method [737]. With the development of new multifractal analysis methods, empirical studies boomed in the 21st century.

After a careful and thorough survey of the literature, we observe three trends. First, the structure function approach is widely applied in the early works and the multifractal detrended fluctuation analysis dominates later. Second, not surprisingly, the return time series of stock market indices attract most attention, while foreign exchange rates and commodities are also investigated by many researchers. Third, most financial time series studies have daily sampling frequency. The second and third trends are simply due to the availabilities of different data sets. We also find no clear evidence for the presence of universality in the multifractal properties found empirically.

There are also very few studies on economic variables other than financial variables, such as the monthly aggregate price indices of the US consumer price index (CPI) and producer price index (PPI) from 1975 to 2011 [738], the daily spot rates of VLGCs on the benchmark Persian Gulf (Ras Tanura) to Japan (Chiba) route from 3 January 1992 to 24 June 2009 [739], the daily spot rates of ships in tanker markets from 27 January 1998 (or 1 July 2004 or 15 July 2005) to 5 August 2013 [740], and the daily Baltic Panamax index (BPI) and Baltic Capesize index (BCI) from 1 March 1999 to 26 February 2015 [741]. It is not surprising that multifractal analysis is mainly carried out on financial time series because economic time series are usually recorded monthly or quarterly, which are not long enough for multifractal analysis. Therefore, there is only one study applying the MF-SF method on the monthly prices of NASDAQ for the period from 1984 to 2000 [742].

Based on these observations, we argue that it is of less significance to try to confirm again and again the presence of multifractality in financial index returns in future studies. The evidence is clear. More attention should be paid to other financial variables such as liquidity measures and to the comparison of multifractality between different assets. More importantly, one should try to unveil the financial mechanisms and implications of multifractality.

6.1. Asset returns and prices

While the section is titled 'asset returns', this is a slight misnomer as the investigated multifractal properties refer mainly to those of the absolute values of the returns and, in general, not to the signed returns. In this sense, since the absolute value of a return is a proxy for the volatility, the investigated multifractal properties are mainly those of the financial volatility of the various asset classes enumerated below. The subsection below dedicated specifically to volatility refers to studies that have used the volatility estimated in different ways as their direct inputs. Indeed, there is no multifractality when the signs are included. Either explicitly or implicitly, the return signs are removed in most multifractal analyses.

6.1.1. Daily stock market indices.

The following researches deal with many stock market indices in single studies.

  • Selçuk investigated daily stock market indices from Argentina, Brazil, Hong Kong, Indonesia, Korea, Mexico, Philippines, Singapore, Taiwan and Turkey for 8-28 years ending at 29 December 2000 using MF-SF [743],
  • Di Matteo et al investigated 32 major indices of both developed and emerging markets from 1990 or 1993 to 2001 using MF-SF [177, 216, 744],
  • Bianchi and Pianese investigated eight major stock indexes (DJIA from 1 October 1928, Bovespa from 27 April 1993, HSI from 31 December 1986, NIKKEI 225 from 4 January 1984, CAC40 from 1 March 1990, Footsie 100 from 2 April 1984, and MibTel from 19 July 1993) till October of 2004 using MF-SF [745],
  • Zunino et al investigated Latin-American stock markets (Argentina, Brazil, Chile, Colombia, Mexico, Peru, Venezuela) and the US from January 1995 to February 2007 using MF-DFA [746],
  • Gao et al investigated seven indices (DJIA, S&P500 and NASDAQ from 4 January 1986, DAX from 11 November 1990, FTSE 100 from 6 January 1986, SSEC from 4 January 2000, and HSI from 31 December 1986) till 6 September 2011 using MF-SF [747],
  • Trincado and Vindel investigated FTSE 100, CAC 40, DAX 30, IBEX 35, NASDAQ 100 NIKKEI 225, and DJIA from 1 January 1992 to 31 August 2009 using MF-SF [748],
  • Sensoy investigated 15 Middle East and North African (MENA) stock market indices from January 2007 to December 2012 using MF-SF [749],
  • Lin et al investigated DJIA from 22 January 1953 to 28 June 2012, S&P 500 from 3 January 1950 to 13 June 2012, HSI from 31 December 1986 to 20 June 2012, FTSE from 2 April 1984 to 20 June 2012, CAC from 1 March 1990 to 3 July 2012, DAX from 26 November 1990 to 3 July 2012, SSEC from 19 December 1990 to 28 June 2012, and SZCI from 3 April 1991 to 28 June 2012 using MF-SF [750],
  • Horta et al investigated the stock market indices of Belgium, France, Greece, Japan, the Netherlands, Portugal, the UK and the US from 4 January 1999 to 19 March 2013 using MF-DMA [751],
  • Rizvi et al investigated 11 developed markets (Australia, France, Germany, Hong Kong, Japan, Netherlands, Spain, Sweden, Switzerland, United Kingdom, United States) and 11 islamic markets (Bahrain, Bangladesh, Egypt, Indonesia,, Jordan, Kuwait, Malaysia, Oman, Pakistan, Saudi Arabia, Turkey) from 1 January 2001 till 31 December 2013 using MF-DFA [752],
  • Stosic et al investigated 13 stock market indices (AEX, BFX, BSESN, BVSP, FCHI, FTSE, GSPC, HSI, KS11, MXX, N225, SSMI, TWII) from 30 August 2006 to 1 March 2014 using MF-DFA [753],
  • Arshad et al investigated the MSCI benchmark index of each individual market (Malaysia, Singapore, Indonesia and South Korea) from 1 January 1990 until 31 July 2013 using MF-DFA [754],
  • Ferreira et al investigated the indices of 48 stock markets from January 1995 to February 2014 using A-MF-DFA [692], and
  • Wang and Wang investigated the SSCI from 24 February 2003 to 17 February 2017 using MF-DFA [755].

Other studies focus on one or a few stock market indices over different periods. For the US market, a few indices have been studied, including DJIA (from February 1991 to May 1997 using MF-SF [736], from 1 January 1897 to 3 February 1999 using MF-SF [756], from 1900 to 2000 using WTMM [757], from 3 January 1928 to 18 October 2000 using MF-SF [758], from 1900 to 2005 using MF-SF [759], and from 1928 to 2012 using MF-DFA [632]), S&P 500 (from 2 January 1980 to 31 December 1992 using MF-SF [760], from July 1997 to December 2007 using MF-DFA [761], from 30 December 1927 to 3 September 2008 using MF-DFA [609], from 20 September 2004 to 10 October 2008 using MF-WT [762], and from 20 December 1990 to 7 January 2016 using MF-DMA [151]), NYSE index (from January 1966 to December 2002 using MF-SF and MF-DFA [233] and from 30 December 1927 to 3 September 2008 using MF-DFA [609]), and NASDAQ (from 30 December 1927 to 3 September 2008 using MF-DFA [609]).

Quite a few indices of Chinese stock markets have been investigated, such as SSEC (from December 20, 1990 to April 30, 2008 using MF-DFA [763], from December 20, 1990 to December 30, 2010 using MF-DFA [764], from 5 January 1999 to 24 February 2012 using MF-DMA [577], from 20 December 1990 to 7 January 2016 using MF-DMA [151]), SZCI (from 3 April 1991 to 30 December 2010 using MF-DFA [764] and from 5 January 1999 to 24 February 2012 using MF-DMA [577]), SHSE 50 (from 2 February 1998 to 27 February 2008 using MF-DFA [765]), SZSE 100 (from February 2, 1998 to February 27, 2008 using MF-DFA [765]), CSI 300 (from 5 April 2006 to 9 May 2014 using MF-DFA [766]), and CSI 800 (from 4 January 2005 to 23 March 2016 [767]).

Other investigations have been conducted on the Japanese TOPIX (from 6 January 1986 to 23 August 1999 [768] using MF-SF), the German DAX (from 1 October 1959 to 30 December 1998 using MF-SF [769] and from 29 September 1959 to 4 September 2001 using MF-SF [770]), the Korean KOSPI (from April 1981 to 2003 using MF-SF [771, 772]), the Iranian TEPIX (from 20 May 1995 to 18 March 2004 using MF-SF [773]), Taiwan's TAIEX (from January 1983 to May 2006 using MF-SF and MF-DFA [774]) and OTC index (from January 1995 to May 2006 using MF-SF and MF-DFA [774], Hong Kong's HSI (from 5 January 1999 to 24 February 2012 using MF-DMA [577], from January 1995 to May 2001 using MF-SF [775]) and HSCEI (from May 2 2012 to January 27 2016 using MF-DFA [776]), India's SENSEX (from July 1997 to December 2007 using MF-DFA [761] and from 1 July 1997 to 31 December 2010 using MF-DFA [777]) and NSE index (from August 2002 to December 2007 using MF-DFA [761] and from 1 June 1990 to 31 December 2010 [777], using MF-DFA), the Polish WIG (from 16 April 1991 to 10 October 2008 using MF-DFA [609]), Morocco's MASI and MADEX using MF-DFA [659], Turkey's ISE-100 (from 4 January 1988 to 16 July 2001 using MF-WT [778]), and the Spanish IBEX 35 from 1990 to 2000 using MF-WT [779].

Although there is overwhelming empirical evidence indicating the presence of multifractality in the daily returns of stock market indices, an exception was reported on the daily Bombay stock exchange index (BSE) data of year 2000, in which $\zeta(q)$ is found to be linear against q [780]. This is most likely due to the fact that the sample size (245 data points) is too short so that it is difficult to determine the scaling range. In addition, there are studies using prices as the variable $\Delta{X}$ in the MF-DFA analysis such that the detrending is conducted on the 'cumulative sum' of prices [501, 780782], which either results in incorrect estimates of $\zeta(q)$ or $H(q)$ or misinterpretations of the results on reshuffling. Indeed, the measure derived from the time series under study should be stationary in order to qualify as a genuine measure. Prices and log-prices are not stationary and using them is well-known to lead to spurious conclusions. In regression analysis, this is known as the spurious regression problem [783, 784].

6.1.2. High-frequency stock market indices.

High-frequency indices have been studied for different stock markets, such as the US market (15 min S&P 500 index from June 24 to September 30 in 1999 using MF-SF [785], 5 min S&P500 index from 1 February 2001 to 1 February 2002 using WTMM [786], and 5 min DJIA from 19 September 1994 to 16 October 2002 using MF-SF [787]), the French market (30s CAC 40 from 3 January 1993 until 31 December 1996 using MF-SF [788]), the German market (1 min DAX data from 28 November 1997 to 30 December 1999 using MF-SF [770]), the Russian market (30 min RTS-70 from 9 December 1996 to 22 February 2002 using MF-WT [778]), the Korean market (1 min KOSPI from 30 March 1992 to 30 November 1999 using MF-SF [789793]), the Polish market (1 min WIG20 from 1 April 1999 to 31 October 2005 using MF-DFA [794]), the Chinese market (1 min SSEC from 4 May 1998 to 1 June 2005 using MF-DFA [795], 5 min SSEC from January 2003 to April 2008 using MF-DMA [328], and 5 min CSI 300 from 4 April 2005 to 19 January 2012 using MF-DMA [658]), the Taiwan market (1 min TAIEX from February 2000 to December 2003 using MF-SF and MF-DFA [774] and from 2 January 1999 to 28 August 2007 using MF-DFA [796]), the Spanish market (13s IBEX 35 from January 2 2009 to December 31 2010 using MF-DFA [797]), the 1 min Euro Stoxx 50 index from May 2008 to April 2009 using MF-DFA [632], and the daily CDS of 11 sector indices in the US stock market from 14 December 2007 to 31 December 2014 using MF-DFA [798].

6.1.3. Stock market sector indices.

Some studies considered stock market sector indices. Sector indices, as well as stock market indices, can be viewed as portfolios of stocks. Using the MD-DFA method, Wang, Xiang and Pandey investigated the daily computer sector index and electronics sector index of China's growth enterprise market from September 2010 to August 2011 [799], Kim et al analyzed the daily prices of 144 equity funds from 1 January 2002 to 31 December 2010 using MF-DFA [800], Zhuang, Wei and Ma worked on 10 sector indices of China stock market from 1 January 2005 to 3 April 2013 [801], Yang, Zhu and Wang studied the daily CSI energy sub-industry index from 4 January 2005 to 15 June 2015 [802], and Shahzad et al researched 11 sector indices in the US stock market from 14 December 2007 to 31 December 2014 [798].

6.1.4. Individual stocks.

The US stock market contributes the largest number of individual stocks that have been studied, containing 5 min prices of a particular stock traded on the New York Stock Exchange (NYSE) from 4 January 1993 to 30 June 1999 using MF-SF [803], daily prices of 2249 US stocks over 10–30 years using MF-DFA [224], 5 min prices of Yahoo!, Microsoft and IBM from 1 February 2001 to 1 February 2002 using WTMM [786], daily prices of stock Intel between January 1990 and December 2002 using MF-SF and MF-DFA [233], 1 min prices of the Johnson and Johnson stock during the year of 2000 using MF-SF and MF-DFA [233], 5 min prices of stock Intel during the year of 2000 using MF-SF and MF-DFA [233], tick-by-tick prices of 30 DJIA stocks from 1 December 1997 to 31 December 1999 using MF-DFA and WTMM [726], daily prices of 30 DJIA stocks using MF-SF [495], high-frequency prices of 30 DJIA stocks from the 1 July to 31 December 2004 using MF-DFA [804], trade-by-trade returns of stock GE for calendar years 2000–2003 [150], 1 min–10 min prices of 31 DJIA stocks for the years 2008–2011 using MF-DFA [805].

Studied time series of individual stocks in other markets include tick-by-tick data of 30 DAX companies from 1 December 1997 to 31 December 1999 using MF-DFA and WTMM [491, 726], daily prices of 202 constituent stocks of NIKKEI 225 from 4 January 2000 to 9 February 2006 using MF-DFA [806], 1 min 150 stocks in Taiwan Stock Exchange from 2 January 1999 to 28 August 2007 using MF-DFA [796], daily prices of 172 KOSPI stocks from 1980 to 2003 using MF-DFA [807], trade-by-trade prices of 27 KOSPI stocks in 2008 using MF-DFA [494], daily prices of stocks in electronics and computer sectors of China's growth enterprise market from 2010 to 2011 using MF-DFA [799], daily prices of 13 Moroccan family business stocks using MF-DFA [659], daily prices of 17 most liquid Spanish stocks from 1 January 1990 to 24 May 2001 using MF-WT [808, 809], 1 min prices of 4 most important Spanish stocks in 1999 using MF-WT [808], daily prices of 35 most liquid Spanish stocks from June 1996 to June 2006 using MF-WT [810], daily prices of 19 Spanish stocks from 1990 to 2000 using MF-WT [779], and 5 min prices of Russian Joint Stock Company and Gazprom joint stock company from 1 February 2001 to 1 February 2002 using MF-WT [786].

6.1.5. Equity futures.

The multifractal nature of equity futures has been confirmed on 10 s prices of the DAX futures during 60 trading days using WTMM [757], 5 min prices of the CSI 300 futures from 16 April 2010 to 12 April 2012 using MF-DFA [811], 10 min prices of the CSI 300 futures IF1009 from 16 April 2010 to 17 September 2010 using MF-DFA [812], and 5 min prices of the CSI 300 futures from 30 June 2011 to 7 June 2013 using A-DFA [395].

6.1.6. Foreign exchange rates.

The global foreign exchange market is the largest financial market by transaction volumes. The multifractal nature of foreign exchange rates has been confirmed by numerous empirical studies. Some excellent works researched dozens of foreign exchange rates. For instance, Vandewalle and Ausloos [813] and Di Matteo et al [177, 216] investigated respectively 23 and 29 daily foreign exchange rates using the MF-SF method. Drożdż et al performed an interesting study using MF-DFA on the 1 min rates from 21:00 on 2 January 2004 to 21:00 on 30 March 2008 of two foreign exchange triangles (USD/EUR, EUR/GBP and GBP/USD) and (JPY/GBP, GBP/CHF and CHF/JPY) [668]. Pont et al obtained the ensemble singularity spectrum of the daily returns of 9 currency exchange rates (pairs of AUD, USD, EUR, JPY and CHF except for AUD/CHF) from 1992 to 1997 using MF-WT [779].

The Japanese Yen (JPY) with respect to the USD is among the most studied. In finance, this currency pair is particularly interesting due to its enormous transaction volume and for the carry trade that have been implemented for decades as a major funding source. In the standard logic, one borrows in Yens at the low interest rates that have been prevalent in Japan for decades, then sells the Yens to buy US dollars, which are invested in higher yielding instruments such as US treasury bonds. This carry trade, which contributes to pushing the Yen value lower, has been part of the strategy of the Japanese society, which is so dependent on exports for its economic health and on the safe haven status of its currency. Time series are recorded at different frequencies, such as daily data (from 1 June 1973 to 1 June 1988 using MF-SF [213], from 1 January 1980 to 31 December 1996 using MF-SF [735], from January 1971 to June 2003 using MF-SF [814], from 1 January 1990 to 2 November 2003 using MF-SF [815], from January 1971 to June 2003 using MF-SF [771], from 1 January 1990 to 2 November 2003 using MF-SF [816], from 15 December 1998 to 9 November 2008 using WTMM [817], from 2 January 1975 to 12 November 2010 using MF-DMA [657], and from 1991 to 2005 using MF-DFA [818]), tick-by-tick data (from 1 March 1986 to 1 March 1989 using MF-SF [213]), 1 min data (in 2005 [819] and 2008 [820] using MF-DFA), and 5 min data (from 1 February 2001 to 1 February 2002 using WTMM [786]).

For the Deutsche Mark (DEM) with respect to the USD, the MF-SF method has been applied to daily data during different periods (from 1 June 1973 to 1 June 1988 [213], from 1 January 1980 to 31 December 1996 [735], from 1 January 1990 to 2 November 2003 [816], and from 4 June 1973 to 31 December 2001 [815]), tick-by-tick data (from 1 March 1986 to 1 March 1989 [213] and from 1 October 1992 until 30 September 1993 [821]), 15s data (from 1 October 1992 until 30 September 1993 [788]), and 5 min data (from 4 January 1987 to 31 December 1998 [822]). The MF-DFA method has also been applied to the 20s FX rates from 1 October 1992 to 30 September 1993 [823].

For the Euro exchange rates, studies have been carried out for 1 min EUR/USD in 2005 using MF-DFA [819], 5 min EUR/USD from 5 January 2006 to 31 December 31 2007 using MF-SF [499], daily EUR/USD from 15 December 1998 to 9 November 2008 using WTMM [817], daily EUR/USD from 1 January 1990 to 2 November 2003 using MF-SF [816], daily EUR/USD from January 2007 to October 2009 using MF-DFA [824], daily USD/EUR from 4 January 1999 to 12 November 2010 using MF-DMA [657], 1 min EUR/USD in 2008 using MF-DFA [820], and daily data of Croatian Kuna (HRK), Czech Koruna (CZK), Hungarian Forint (HUF), Polish Zlot (PLN), Romanian Leu (RON), Slovak Koruna (SKK) and Slovenian Tolar (SIT) against UER from 4 January 1999 to 31 August 2010 using MF-DFA [335].

For the UK GBP exchange rates, multifractal analyses have been performed on tick-by-tick USD/GBP from 1 March 1986 to 1 March 1989 using MF-SF [213], daily USD/GBP from 1 June 1973 to 1 June 1988 using MF-SF [213], daily GBP/USD from 1 January 1990 to 2 November 2003 using MF-SF [816], daily GBP/USD from 2 January 1975 to 12 November 2010 using MF-DMA [657], and daily GBP/USD from 15 December 1998 to 9 November 2008 using WTMM [817].

For the Chinese Yuan, multifractal investigations have been conducted on daily CNY/USD from 25 July 2005 to 12 November 2010 using MF-DMA [657], daily CNY/USD from 19 November 1997 to April 2008 and CNY/EUR from 1 January 1999 to April 2008 using MF-SF [825], daily CNY/USD from 19 December 1990 to 2 August 2012 and CNY/HKD from 13 January 1994 to 2 August 2012 using MF-DFA [826], daily prices of the onshore and offshore RMB exchange rates (CNY and CNH) from 2 May 2012 to 27 January 2016 using MF-DFA [776], and daily CIB-CNY composite index (CCI) from 21 July 2005 to 18 June 2010 using MF-DMA [827].

For the Korean Won (KRW) exchange rates, the studied time series include daily KRW/USD from April 1981 to December 2002 using MF-SF [771, 772], daily KRW/USD from 1991 to 2005 using MF-DFA [818], 1 min USD-KRW in year 2008 using MF-DFA [820], and the managed and independent float exchange rates using MF-DFA [666].

For the Australian Dollar (AUD), investigations have been conducted on 1 min AUD/USD in 2005 using MF-DFA [819], daily AUD/USD from 15 December 1998 to 9 November 2008 using WTMM [817], daily AUD/USD from 2 January 1975 to 12 November 2010 using MF-DMA [657], and the managed and independent float exchange rates using MF-DFA [666].

The multifractal nature in quite a few other currencies in different economies has also been researched, such as daily Brazilian real (BRL/USD from 3 January 2000 to 12 November 2010 using MF-DMA [657] and the managed and independent float exchange rates using MF-DFA [666]), daily Bulgarian Lev (BGN/USD from February 1991 to May 1997 using MF-SF [736] and from 23 October 1995 to 11 February 2003 using MF-SF [815]), daily Canada Dollar (CAD/USD from 1 January 1990 to 2 November 2003 using MF-SF [816], from 2 January 1975 to 12 November 2010 using MF-DMA [657], and from 15 December 1998 to 9 November 2008 using WTMM [817]), daily Denmark Krone (DKK) from 2 January 1975 to 12 November 2010 using MF-DMA [657], daily French Francs (USD/FRF from 1 January 1979 to 30 November 1993 using MF-SF [828] and CHF/FRF, DEM/FRF, GBP/FRF and JPY/FRF from 1 January 1979 to 30 November 1993 using MF-SF [142]), daily Hong-Kong Dollar (HKD/USD from 1991 to 2005 using MF-DFA [818]), daily Indian Rupee (INR/USD from 4 January 1999 to 12 November 2010 using MF-DMA [657] and from January 1995 to December 2012 using MF-DFA [782]), daily Iranian Rial (IRR/USD from 24 September 1989 to 15 November 2003 using MF-DFA [829]), daily Malaysian Ringgit (USD/RM, YEN/RM and SGD/RM from 2 April 1985 to 30 April 2001 using WTMM [830] and managed and independent float exchange rates using MF-DFA [666]), daily Mexican Peso (MXP/USD from 20 December 1994 to about 2001 using MF-SF [831] and from 8 November 1993 to 12 November 2010 using MF-DMA [657]), daily New Zealand Dollar (NZD/USD from 15 December 1998 to 9 November 2008 using WTMM [817] and the managed and independent float exchange rates using MF-DFA [666]), daily Norway Krone (NOK/USD) from 2 January 1975 to 12 November 2010 using MF-DMA [657], daily Thailand Thai Baht (THB/USD from 4 January 1999 to 12 November 2010 using MF-DMA [657], from 1991 to 2005 using MF-DFA [818], and the managed and independent float exchange rates using MF-DFA [666]), Swiss Francs (tick-by-tick CHF/USD from 1 March 1986 to 1 March 1989 using MF-SF [213], daily CHF/USD from 1 June 1973 to 1 June 1988 using MF-SF [213], and daily CHF/USD from 1 January 1990 to 2 November 2003 using MF-SF [816]), daily Sweden Krona (managed and independent float SEK/USD exchange rates using MF-DFA [666]), daily Taiwan New Dollar (managed and independent float TWD/USD exchange rates using MF-DFA [666]), daily data of Romanian Leu (ROL), Slovak Koruna (SKK) and Slovenian Tolar (SIT) from 22 October 1995 to 11 February 2003 using MF-SF [816], and daily Russian Ruble (RUR), Ukraine Hryvnia (UAH), Kazakhstani Tenge (KZT), Estonian Kroon (EEK), Lithuanian litas (LTL), Latvian lats (LVL), Croatian Kuna (HRK), Czech Koruna (CSK), Hungarian Forint (HUF), and Polish Zloty (PLZ) from 23 October 1995 to 11 February 2003 using MF-SF [815].

In the past few years, cryptocurrencies have attracted significant attention. With Mf-DFA, evidence of the presence of multifractality in the return time series of Bitcoin has been reported over different time periods at different sampling frequencies, such as daily data from 2010 to 2017 [832835], 12-h data from 14 September 2011 to 20 November 2017 [836], and from 1 January 2012 to 31 March 2018 [837] and from 1 January 2014 to 31 December 2016 [838].

6.1.7. Commodities.

In the commodity markets, the multifractal properties of gold prices and of returns have been studied, such as tick-by-tick prices of gold (XAU/USD) from 1 March 1986 to 1 March 1989 using MF-SF [213], daily gold prices from February 1991 to May 1997 using MF-SF [736], daily prices of gold traded in COMEX over the period of 13 July 1990 to 15 September 2009 using MF-DFA [839], daily LME gold prices from 1968 to 2010 using MF-DFA [840], daily gold prices from January 1973 to October 2011 using MF-DFA [841], daily AU99.99 closing prices of the Shanghai Gold Exchange from 30 October 2002 and 30 November 2012 using MF-DFA [842], daily gold prices in the Indian market and daily global consumer price index from January 1985 to June 2013 using MF-DFA [843], daily gold consumer price index (CPI) and the gold market in China, India and Turkey from 1993 to July 2013 using MF-DFA [844], and daily gold prices of China and India from 1985 to 2013 using MF-DFA [667].

In the oil and gas market, studies have been conducted on daily spot prices of WTI crude oil (from November 1981 to about 2001 using MF-SF [845], from 2 January 1990 to 28 February 2001 using MF-SF [846], from 2 January 1986 to 28 May 2004 using MF-SF [847], from 20 May 1987 to 30 September 2008 using MF-DFA [848], from 1986 to 2008 using MF-SF [849], from 20 May 1987 to 14 October 2009 using MF-DFA [850], from 2 January 1991 to 31 December 2013 [851], from 2 January 1986 to 28 September 2012 using MF-DFA [852],), Brent crude oil (from November 1981 to about 2001 using MF-SF [845], from 20 May 1987 to 30 September 2008 using MF-DFA [848], from 20 May 1987 to 14 October 2009 using MF-DFA [850], Dubai crude oil from November 1981 to about 2001 using MF-SF [845], conventional gasoline (New York Harbor) from 2 January 1991 to 31 December 2013 [851], heating oil (New York Harbor) from 2 January 1991 to 31 December 2013 [851], jet fuel (US Gulf Coast) from 2 January 1991 to 31 December 2013 [851], and natural gas (Henry Hub, LA) from 24 January 1991 to 28 February 2001 [846]. Other related studies include daily prices of the WTI crude oil futures traded in NYMEX from 2 January 1985 to 10 May 2011 using MF-DMA [146], daily data of DJ Brent crude index from January 1999 to January 2015 and DJ energy index and DJ petroleum index from January 2006 to January 2015 using MF-DFA [853], daily crack spread of gasoline which is the difference between the spot prices of gasoline (New York Harbor) and WTI crude oil from 2 January 1986 to 26 July 2011 using MF-DMA [854], monthly price indexes of World Bank non-fuel and World Bank energy from 1960 until 2013 using MF-DFA [853], and a steam coal FOB price in Qinhuangdao Port from 2006 to 2013 [855].

For the electricity market, researchers have studied hourly spot prices in the Spain electricity exchange from 1 January 1998 to 31 May 2006 using MF-DFA [856], 30 min electricity prices and demand data in Australia's national electricity market from December 1998 to June 2008 using MF-DFA [857], 1 h electricity prices and demand data from 1 January 2000 to 20 June 2008 for Alberta and from 1 May 2002 to 30 June 2008 for Ontario in Canadian electricity markets using MF-DFA [858], electricity prices of California 1999-2000 and PJM 2001-2002 using MF-DFA [859], 15 min day-ahead monthly bid prices of 5 bid areas from 1 April 2012 to 31 March 2014 using MF-DFA [860], and daily electric load records of a certain power system from 1 January 1999 to 31 December 2011 using MF-DFA [861].

Concerning multifractal detrended fluctuation analysis of agricultural commodities, Matia et al considered the daily prices of 9 commodities over 10–30 years [224], He and Chen studied the daily closing prices of hard winter wheat futures on the Zhengzhou Commodity Exchange from 28 December 1993 to 18 September 2009, and soy meal futures from 17 July 2000 to 18 September 2009, No. 1 soybean futures from 15 March 2002 to 18 September 2009 and corn futures from 22 September 2004 to 18 September 2009 on the Dalian Commodity Exchange [862], Kim et al researched the daily prices of leek, radish, onion, and Korean cabbage from 3 January 2001 to 23 September 2009 in the public wholesale market of Seoul Agricultural & Marine Products Corporation [863], Wang and Hu investigated the daily prices of four agricultural commodity futures (corn, soybean, wheat and rice) on the Chicago Board of Trade (CBOT) from 1 January 2000 to 31 September 2014 [864], and Delbianco et al worked on the monthly data of World Bank's soybean price index and agricultural price index from 1968 until 2013 [853].

6.1.8. Interest rates.

Multifractal analyses on interest rates and their futures have mainly used the MF-SF method and have been conducted on daily data of daily Korean treasury bond futures from December 2001 to June 2002 for KTB206 and from March 2002 to September 2002 for KTB209 [865], US treasury rates with different maturities from 1990 to 2001 [216], daily Eurodollar interbank interest rates with different maturity dates from 1990 to 1996 [216], daily Pound sterling, US dollar, Japanese Yen, Australian dollar and the EURO, LIBOR and Indonesian Rupiah for different maturities (3, 6, 9 and 12 months) from January 2000 to December 2005 using MF-SF and MF-DFA [866], daily Eurodollar rates from 1990 to 1996 [177], and US treasury bond rates from 1997 to 2001 [177].

The MF-DFA method has also been applied to the interest rates of daily Pound sterling, US dollar, Japanese Yen, Australian dollar, the EURO, LIBOR and Indonesian Rupiah for different maturities (3, 6, 9 and 12 months) from January 2000 to December 2005 [866], daily US treasury bonds from 2 January 1970 to 30 November 2006 [867], 1 min Korean Treasury Bond (KTB412) futures from 1 July 2004 to 31 December 2004 [867], daily Shanghai interbank offered rate (SHIBOR) with eight term values from 8 October 2006 to 31 December 2012 [868], and daily US Effective Federal Funds from 1 January 2000 to 31 September 2014 [864].

6.1.9. Illusionary multifractality.

Some studies applied the MF-PF method to stock market indices or asset prices such that the finest measure defined at each time t is the ratio of the index price at t to the sum of all index prices. Representative works are obtain from the Hong Kong market (1 min HSI from 3 January 1994 to 28 May 1997 [869, 870] and 30 continuous trading days from 3 January 1994 [871]), the Shanghai stock market (1 min SSEC from 19 January 1999 to 30 December 2005 [872], 5 min SSEC from January 1999 to July 2001 [873], from 4 May 1998 to 1 June 2005 [795], from 4 January 2005 to 30 December 2005 [874], and from 2 January 2004 to 29 June 2012 [875], and daily SSEC from 4 January 2005 to 30 December 2005 [874]), the Taiwan stock market (daily TSPI from 1987 to 2002 [876] and 1 min TAIEX from 3 May 1999 to 30 November 2007 [877]), the US market (5 min S&P 500 from 2 January 2004 to 29 June 2012 [875]), and the oil price from 1973 to 2007 [878].

All these works produce illusionary multifractality simply because these indices are not singular and a characteristic is that the singularity is close to 1 such that the singularity width $\Delta\alpha$ is close to 0 [879, 880]. However, this illusionary multifractality seems able to capture to some extent the fluctuation characteristics of indices and has the potential power of volatility forecasting (see section 8.3).

An exception is reported for the hourly day-ahead and real-time electricity prices in the Pennsylvania-New Jersey-Maryland (PJM) electricity market from 1 January 2008 to 31 December 2012, in which the singularity width is significantly larger than 0 [881]. This is due to the fact that the electricity prices fluctuate widely and the two investigated price time series look like stock volatility time series (see figure 2 of [881]).

6.2. Volatilities

6.2.1. Stock markets.

For the US stock market, multifractality has been reported to be present in 1 min S&P 500 index volatility from 1 January 1982 to 31 December 1999 using MF-PF [199] and the multiplier method [349], high-frequency volatility of the 30 DJIA stocks from 1 July 2004 to 31 December 2004 using MF-DFA [804], 1 min volatility of the 31 DJIA stocks from 2008 to 2011 using MF-CCA [805], daily DJIA volatility from 18 May 1995 to 18 May 2015 using MF-PF [882], and daily NASDAQ from 18 May 1995 to 18 May 2015 using MF-PF [882].

For the Chinese stock market, researchers studied 1 min volatility of 1139 Chinese stocks and 2 indexes (SSCI and SZCI) from January 2004 to June 2006 using MF-PF [205], daily volatility of 36 SSE 180 stocks from 9 July 2002 to 9 July 2007 using MF-SF [883], 1 min SSEC volatility from 20 September 2007 to 10 May 2010 using MF-DFA [884], 5 min SSEC and SZCI volatility from 4 January 2002 to 31 December 2008 using MF-DFA [885], daily volatilities of the CSI energy index from 4 January 2005 to 15 June 2015 using MF-DFA [802], daily SSEC and SZCI volatility from 18 May 1995 to 18 May 2015 using MF-PF [882], and 5 min realized volatility of the SSEC index from 1 January 2000 to 31 May 2014 and the S&P 500 index from 2 January 1996 to 24 June 2013 [886].

Other studies considered daily DAX volatility from 1 October 1959 to 30 December 1998 using MF-PF [769], 1 min upward and downward volatility (or positive and negative returns) of the DAX index from 28 November 1997 to 30 December 1999 and from 1 May 2002 to 1 May 2004 using MF-DFA [887], daily volatility of the Hang Seng China Enterprises Index (HSCEI) from 2 May 2012 to 27 January 2016 using MF-DFA and MF-CCA [776], daily volatility of 35 most liquid stocks in the Spanish stock market from June 1996 to June 2006 using MF-WT [810], trade-by-trade volatility of 27 highly capitalized individual companies listed on the KOSPI stock market in 2008 and 2009 using MF-DFA [494], daily volatility of CAC 40, DAX, FTSE, NASDAQ, S&P500 and Canada TSE indices from 2 January 2007 to 31 December 2009 using MF-SF [888], and daily volatility of the Casablanca market index and 111 Moroccan family business stocks from 25 March 2013 to 22 March 2016 on the Casablanca Stock Exchange (Morocco) using MF-DFA [889].

There are also other works dealing with cross-correlations between volatility and other financial variables (see section 6.5 below). A distinct study performed MF-DFA on the logarithmic difference of daily implied volatility of CAC 40, DAX and S&P 500 index options from January 2002 to January 2009 [890].

6.2.2. Foreign exchange rates.

On the foreign exchange markets, researchers investigated 20-sec DEM/USD volatility from 1 October 1992 to 30 September 1993 using MF-DFA [823], daily USD/JPY volatility from 10 August 2003 to 31 May 2006 using MF-WT [762], daily CHY/USD volatility from 1 January 2005 to 1 November 2014 using MF-MF [891], and daily volatility of the onshore and offshore RMB exchange rates (CNY and CNH) from 2 May 2012 to 27 January 2016 using MF-DFA and MF-CCA [776].

6.2.3. Commodities.

The multifractal properties of the daily volatility time series of several commodities have been analyzed, such as agriculture futures volatility of Hard Winter wheat from 28 December 1993 to 11 May 2009 and Strong Gluten wheat from 28 March 2003 to 11 May 2009 on the Zhengzhou Commodity Exchange, and Soy Meal from 17 July 2000 to 11 May 2009 and Soy Bean 1 futures from 15 March 2002 to 11 May 2009 on the Dalian Commodity Exchange using MF-PF [892], WTI crude oil from 13 July 1990 to 6 March 2009 using MF-DFA [893], from 2 January 1990 to 9 March 2010 using MF-DFA [656] using MF-DFA and from 1 January 2005 to 1 November 2014 using MF-PF [891], Brent and WTI crude oil from 20 May 1987 to 14 October 2009 using MF-PF [850], and gold from 1 January 2005 to 1 November 2014 using MF-PF [891].

6.3. Liquidity measures

6.3.1. Trading volumes.

Trading volume is an important measure of market liquidity and the main force driving price movements [894897]. For the US stock market, Bolgorian and Raei applied MF-DFA to daily S&P 500 trading volume from 1980 to 2009 [898], Rak et al used MF-CCA to study 1 min transaction number and trading volume of 31 DJIA stocks from 2008 to 2011 [805], and Moyano et al used MF-DFA to analyze 1 min trading volume of 30 DJIA stocks from 1 July 2004 to 31 December 2004 [500].

For the Chinese stock market, Ureche-Rangau and de Rorthays applied MF-SF to daily trading volume of 36 SSE180 stocks from 9 July 2002 to 9 July 2007 [883], Mu et al investigated 1 min trading volume of 22 Chinese stocks in 2003 using MF-DFA [502], Yuan, Zhuang and Liu analyzed daily trading volume of SSEC from 20 December 1990 to 30 December 2010 and SZCI from 3 April 1991 to 30 December 2010 using MF-DFA [764], and Wang et al studied 5 min trading volume of the CSI300 index futures from 16 April 2010 to 12 April 2012 using MF-DFA [811].

On the Korean stock market, Lee and Lee applied MF-DFA to 1 min trading volume of KOSPI from 30 March 1992 to 30 November 1999 [501], Lee used MF-PF to investigate daily trading of KOSPI from November 1997 to September 2010 [503], Yim, Oh and Kim performed MF-DFA on trade-by-trade trading volume of 27 highly capitalized individual companies in 2008 and 2009 [494], and Oh adopted MF-DMA to investigate the aggregated trading volumes of buying, selling, and normalized net investor trading [899].

In addition, using MF-DFA, Stošić et al studied daily trading volume of 13 stock market indices (AEX, BFX, BSESN, BVSP, FCHI, FTSE, GSPC, HSI, KS11, MXX, N225, SSMI, TWII) from 30 August 2006 to 1 March 2014 [753], while Bolgorian analysed daily trading volume of individual and institutional traders on the stocks listed in the Tehran Stock Exchange from 1998 to 2009 [900].

In the multifractal analysis of trading volume, the majority of studies in econophysics work on the normal volume or dollar volume. However, the values of trading volume are affected by share split. Therefore, it is recommended to investigate turnover rates instead of trading volumes [901].

6.3.2. Bid-ask spread.

The bid-ask spread is another important measure of market liquidity. Gu, Chen and Zhou applied MF-PF to 1 min spread of a liquid Chinese stock in 2003 and found no evidence of multifractality [686]. In contrast, Qiu et al performed MF-DFA on the 1 min bid-ask spread of four liquid Chinese stocks (000001, 000539, 600198, 600663) from 2004 to 2006 and unveiled evidence supporting the presence of multifractality [902]. Obviously, more research should be conducted on bid-ask spread using different multifractal analysis methods.

6.3.3. Price gaps on order books.

Price gap is defined as the logarithmic price difference between the first two occupied price levels on the same side of a limit order book (LOB), which is a key determinant of market depth and hence market liquidity [903, 904]. Gu et al studied 26 A-share stocks traded on the Shenzhen Stock Exchange using MF-DFA and found that the gap time series are long-range correlated and possess multifractal properties [905]. They also compared the singularity spectrum width $\Delta\alpha_{{\rm b}}$ of buy LOBs with the singularity spectrum width $\Delta\alpha_{{\rm s}}$ of sell LOBs for all the stocks and reported a linear relationship between them:

Equation (397)

This suggests that the multifractal properties change across different stocks.

6.4. Interevent durations

6.4.1. Recurrence intervals.

Recurrence intervals are waiting times between successive events whose values exceed certain threshold. Recurrence interval analysis has been performed on financial time series of volatility [273, 906909], return [910913] and trading volume [914916]. Recurrence interval is often called return interval [906], which is however not suitable for econophysics since the term 'return' is widely used for logarithmic price changes.

The multifractal nature of recurrence intervals of stock returns in an artificial market has been unveiled via the MF-DFA method [588], in which the return series are generated by an empirical behavioral order-driven model [587]. It is found that the singularity spectrum width $\Delta\alpha$ increases with the Hurst index of relative prices of the submitted orders. However, multifractal analysis on real financial time series is rare. One usually investigates the multiscaling behavior of recurrence interval distributions from the nonlinear behavior in the interval moments [273, 274].

6.4.2. Intertrade durations.

Intertrade duration is the waiting time between successive transactions. Oświęcimka et al applied MF-DFA to the intertrade durations of 30 DAX stocks from 1 December 1997 to 31 December 1999 [491], Jiang, Chen and Zhou used MF-DFA to analyze the intertrade durations of 23 liquid Chinese stocks in 2003 [492], Ruan and Zhou investigated 1 min intertrade durations of a Chinese stock and its warrant from 22 August 2005 to 23 August 2006 using MF-DMA and MF-DFA [493], and Yim, Oh and Kim performed MF-DFA on the intertrade durations of 27 highly capitalized individual companies listed on the KOSPI stock market in 2008 and 2009 [494].

Heyde numerically constructed the activity time (transaction time) of the S&P 500 index and 30 DJIA consistent stocks from Ito's formula and used MF-SF to confirm the multifractal nature [495].

Perelló et al and Kasprzak et al developed the multifractal continuous-time random walk model for intertrade durations and found that the model predictions fit well the empirical results obtained from the tick-by-tick data of the futures contracts on the German index DAX, the American DJIA, the Polish WIG20 index, and the US-DM and US-EUR foreign exchange rates, as well as a single stock Telefonica [561, 562].

Intertrade duration plays an important role in autoregressive conditional duration model [688], multifractal models for asset returns [217, 505], and Markov-switching multifractal models [143]. These empirical results confirm the presence of multifractality in the intertrade durations, providing solid backup for the main assumption in MMAR models and MSM models.

6.4.3. Other microstructure variables.

Ni et al performed MF-DFA on intercancelation durations calculated from the limit order book data of 22 liquid Chinese stocks in 2003 [917], while Yue et al applied MF-DMA to the time series of the order aggressiveness of 43 Chinese stocks (see [610] for the definition of 'aggressiveness'). These studies confirmed the presence of multifractality in certain microstructure variables.

6.5. Two time series: MF-X

In the last decade, some joint multifractal analyses have been performed on distinct pairs of financial variables. Most of these studies adopt the MF-X-DFA method [180]. Without further justification, the method used in the empirical analyses reviewed below is MF-X-DFA.

6.5.1. Return-return pairs.

The multifractal nature in the cross-correlations in pairs of asset returns has been widely studied. For stock markets, researchers have investigated the daily returns of A-share and B-share market indices in Shanghai and Shenzhen from 2 January 1996 to 1 April 2010 [918], the daily returns of SSEC and SZCI indices from 4 April 1991 to 13 April 2009 [460], the daily returns of SSEC and SZCI indices from 4 January 2002 to 31 December 2008 [147], the daily returns of SSEC and surrounding markets (HSI, NIKKEI 225 and KOSPI) from 1 January 1997 to 31 December 2011 [448], the six daily return pairs of Amman AMX, Egyptian EGX, Casablanca MASI and Tunis TUNINDEX indices in 2837 trading days from 1 January 2000 [919], the daily returns of IBM and MSFT from 1987 to 2012 [920], the 1 min returns of two pairs of German (CBK-DBK and DBK-EON) using MF-X-DFA and MF-CCA [148], the daily returns of DJIA and DAX from 12 January 1990 to 12 October 2013 using MF-X-DFA and MF-CCA [148], the daily CSI 300 spot and futures returns from 16 April 2010 to 17 February 2012 using MF-X-DMA [921], the daily returns between market return and 6 portfolio excess returns constructed from all stocks in NYSE, AMEX and NASDAQ based on the companies' sizes and book-to-market (BM) ratios from 2 January 1990 to 29 January 2016 [922], the daily instantaneous and average instantaneous cross-correlations between the US and the Chinese stock markets [923], the daily returns of the CDS and 11 sector indices in the US stock market from 14 December 2007 to 31 December 2014 [798].

For oil and emission markets, scholars have worked on the daily returns of WTI crude oil spot and futures prices from 2 January 1990 to 9 March 2010 [656], the daily WTI crude oil and Brent crude oil from 3 January 1995 to 13 August 2013 [924], four pairs (CER-WTI, CER-Brent, EUA-WTI and EUA-Brent) between daily carbon prices (Certification Emission Reduction CER from 14 March 2008 to 15 December 2012 and EU emission Allowance EUA from 22 April 2005 to 15 December 2012) and daily spot prices of WTI and Brent crude oil [925], three pairs among daily time series of oil (Brent Crude Oil Futures), gas (New York Mercantile Exchange Henry Hub Natural Gas Futures), and CO2 (European Climate Exchange EU Allowance EUA Futures) from 22 April 2005 to 30 April 2013 [926], daily returns between two crude oil prices (Brent and WTI) and the Baltic Dry Index BDI from 19 October 1988 to 3 February 2015 [927], and daily returns of WTI oil and Baltic Exchange Dirty Tanker index (BDTI) from 4 January 2000 to 4 November 2015 [928].

The relationship between daily returns of crude oils and stock markets has also attracted much interest, such as WTI and three major US stock market indices (DJIA, NASDAQ and S&P 500) from 2 January 2002 to 29 June 2012 [929], WTI and BRIC stock market indices (Brazil IBOVESPA, Russian RTS, Indian SENSEX and Chinese SSEC) from 1 January 2002 to 30 September 2012 [930], WTI and ten CSI 100 sector indices of China's stock market from 1 January 2005 to 3 April 2013 [801], WTI and ten CSI 300 sector indices from 4 January 2005 to 2 November 2015 [931], and WTI and six GCC stock market indices (Saudi Arabian Tadawul All-Share Index, Qatar Doha Securities Market, Dubai General Index, Abu Dhabi General Index, Kuwait Stock Exchange Index, Dubai General Index, and Oman MSM30 Index) as well as S&P 500 index from 5 January 2004 to 31 August 2013 [932]. A related energy-stock pair is the electricity prices in the Indian Energy Exchange and stock index SENSEX from 1 April 2009 to 31 March 2014, in which prices are used as the input $\Delta{X}$ [933].

The joint multifractal nature between crude oil prices and other financial assets has also been studied and confirmed, such as two pairs of daily returns of the crude oil (WTI and Brent) and USD/INR from 3 January 1995 to 13 August 2013 [924], daily returns of WTI and five exchange rates (AUD, CAD, MXN, RUB and ZAR) from 1 January 1996 to 31 December 2014 [934], daily returns of WTI crude oil and US agricultural commodities (corn, soybean, oat and wheat) from 3 January 1994 to 31 December 2012 [935], two pairs of daily returns of the crude oil (WTI and Brent) and gold from 3 January 1995 to 13 August 2013 [924], daily returns of WTI versus the 3 month Treasury bill rate and the US dollar index from 4 February 1994 to 29 February 2016 [936],

Other pairs are daily data of SSEC and CNY/USD from 1 August 2005 to 20 October 2011 [937], daily returns of Hang Seng HSCEI index and RMB exchange rates (CNY and CNH) from 2 May 2012 to 27 January 2016 (MF-X-DFA and MF-CCA) [776], daily data of market fear gauge VIX and two Japanese Yen exchange rates (USD/JPY and AUD/JPY) from 5 January 1998 to 18 April 2016 [938], daily prices of India SENSEX and USD/INR from January 1995 to December 2012 [782], daily returns of same futures in the Chinese and US commodity futures markets (wheat from 1 November 1999 to 13 December 2010, soybean from 4 January 1999 to 13 December 2010, soy meal from 17 July 2000 to 13 December 2010, and corn from 22 September 2004 to 13 December 2010) in the US and China [939], daily returns of gold and USD/INR from 3 January 1995 to 13 August 2013 [924], daily returns of spot and futures markets of nonferrous metals (aluminum, copper, nickel and zinc) in the London Mercantile Exchange [940], daily returns between London and Shanghai gold prices from 2 January 2003 to 27 April 2012 (MF-X-DFA and MF-X-DMA) [941], daily returns of RMB nominal effective exchange rate index (NEER) and four Euronext Rogers international commodity indices (whole, energy, metals, agriculture) from 21 July 2005 to 15 March 2016 [942], 5 min returns of Chinese treasury futures contracts and treasury ETF from 6 September 2013 to 26 September 2014 [943], daily returns of the 3 month Treasury bill rate and the US dollar index from 4 February 1994 to 29 February 2016 [936], daily CNY/USD and CNH/USD from 1 May 2012 to 29 February 2016 [944], daily data of US gold price and Bombay Stock Exchange SENSEX from January 1990 to May 2013 [945], daily returns of the US interest rate (effective federal funds rate) and four CBOT agricultural commodity futures (corn, soybean, wheat and rice) from 1 January 2000 to 31 September 2014 [864], and daily returns of Argentina stock market Merval index and the international prices of three commodities (soybeans, corn and wheat) from 1 August 2000 to 20 April 2012 [946].

6.5.2. Volatility-volatility pairs.

The joint multifractal nature between two volatility time series has also been investigated. For stock market indices, some studies have been done on the pair of daily DJIA and NASDAQ from July 1993 to November 2003 [180], the pair of daily A-share and B-share market indices in Shanghai and Shenzhen from 2 January 1996 to 1 April 2010 [918], the pair of daily SSEC and SZCI from 4 April 1991 to 13 April 2009 [460] and from 4 January 2002 to 31 December 2008 [147], the six pairs of daily SSEC, SZCI, DJIA and NASDAQ from 18 May 1995 to 18 May 2015 using MF-X-PF [882], and the pair of daily DJIA and NASDAQ from 2 January 1997 to 29 December 2014 using MF-X-PF [429].

Other studies consider the daily volatility pair of the corresponding agricultural futures in the Chinese and US commodity futures markets (wheat from 28 December 1993 to 3 June 2010, soy meal from 17 July 2000 to 3 June 2010, soybean from 15 March 2002 to 3 June 2010, and corn from 22 September 2004 to 3 June 2010) [447] and (wheat from 1 November 1999 to 13 December 2010, soybean from 4 January 1999 to 13 December 2010, soy meal from 17 July 2000 to 13 December 2010, and corn from 22 September 2004 to 13 December 2010) in the US and China [939], the pair of positive series and negative series of Leu-EUR and Leu-USD from June 1999 to November 2012 [947], the three pairs of daily volatility of crude oil, gold and CHY/USD exchange rate from 1 January 2005 to 1 November 2014 using MF-X-PF [891], the daily WTI crude oil and US agricultural commodities (corn, soybean, oat, wheat) from 3 January 1994 to 31 December 2012 [935], the daily US interest rate (effective federal funds rate) and four CBOT agricultural commodities (corn, soybean, wheat and rice) from 1 January 2000 to 31 September 2014 [864], the daily Hang Seng China Enterprises Index (HSCEI) versus the daily onshore and offshore RMB exchange rates (CNY and CNH) from 2 May 2012 to 27 January 2016 (MF-X-DFA and MF-CCA) [776], and the daily spot and futures prices of nonferrous metals (aluminum, copper, nickel and zinc) traded on the London Mercantile Exchange [940].

6.5.3. Price-volume pairs.

The joint multifractal analysis of asset price and trading volume investigates the temporal features of the price-volume relationship. In finance, the price-volume relationship considers either return and volume or volatility and volume. Analogously, Wang et al studied the pair of 5 min return and volume time series of the CSI 300 index futures from 16 April 2010 to 12 April 2012 [811], while Lin researched the pair of daily volatility and volume of NASDAQ and S&P 500 using MF-X-PF [421]. We note that the trade-by-trade price returns and waiting times of E.ON (EOA) and Deutsche Bank (DBK) from 28 November 1997 to 31 December 1999 have also been studied using MF-X-DFA and MF-CCA [148].

In the econophysics community, however, the overwhelmingly largest number of investigations have been conducted on pairs of return and volume variation, stimulated by the seminal work by Podobnik et al on the S&P 500 index [914]. Subsequent studies cover hourly returns of day-ahead spot price of electricity and log-difference of trading volume in the Czech Republic from 1 January 2009 to 30 November 2012 using MF-CCA [948], daily Shanghai SSEC index from 20 December 1990 to 30 December 2010 and Shenzhen SZCI index from 3 April 1991 to 30 December 2010 [764], daily SSEC from 20 December 1990 to 30 December 2010 and SZCI from 3 April 1991 to 30 December 2010 [764], daily data of four agricultural commodity futures traded on China's Zhengzhou Commodity Exchange (hard winter wheat from 28 December 1993 to 12 March 2010) and Dalian Commodity Exchange (soy meal from 17 July 2000 to 12 March 2010, No. 1 soybeans from 15 March 2002 to 12 March 2010, corn from 22 September 2004 to 12 March 2010) and four agricultural commodity futures traded on the Chicago Board of Trade (wheat from 1 July 1978 to 12 March 2010, soy meal from 1 July 1978 to 12 March 2010, soybeans from 1 July 1978 to 16 March 2010, and corn from 1 July 1978 to 12 March 2010) [446], daily data of the Moroccan stock market index MASI from 1 January 2000 to 20 January 2017 [949], daily data of 13 stock market indices (AEX, BFX, BSESN, BVSP, FCHI, FTSE, GSPC, HSI, KS11, MXX, N225, SSMI, TWII) from 30 August 2006 to 1 March 2014 [753], daily data of the 50 constituent stocks of the Nifty index of National Stock Exchange (Mumbai) from 01 January 2002 to 31 December 2014 [950], daily data of Au99.99 on the Shanghai Gold Exchange (SGE) from 30 October 2002 to 30 January 2015 and the gold futures on the Shanghai Futures Exchange (SHFE) from 9 January 2008 to 30 January 2015 [951], daily data of Bitcoin markets [833].

There are also studies on price and volume that are added up before the detrending procedure, such as daily data of copper futures and aluminum futures traded on the Shanghai Futures Exchange (SHFE) from 15 January 2004 to 4 July 2011 and copper and aluminum futures on the London Metal Exchange (LME) [952], daily copper futures series of Shanghai Futures Exchange (SHFE) from 15 September 1993 to 4 July 2011 and aluminum from 5 October 1998 to 4 July 2011 [953], daily electricity prices and loads in the California electricity market 1999–2000 using adaptive MF-X-DFA and MF-X-SF [149, 432], and 1 h and daily prices and loads data of California power market (1999–2000) and PJM power market (2001–2002) [954].

6.5.4. Open-source information and returns.

There are a number of innovative applications of the MF-X-DFA method, in the form of the study of pairs made of asset returns and open-source information, including (i) China's SSE 50 Index returns and Mass and New Media News [955], (ii) Chinese stock market index returns and the individual investor sentiments [956], (iii) the pair made of two commonly-employed online sentiment proxies (the financial and economic attitudes revealed by search from Google Trends and daily happiness sentiment from Twitter) [834], (iv) pairs made of the price returns or the logarithmic volume changes of Bitcoin and the logarithmic changes of Google Trends [904], and (v) A-share and H-share price returns and media coverage of A-H pair stocks in China [957]. All these studies confirmed the presence of multifractal cross-correlations in the investigated pairs.

7. Sources of apparent multifractality

Overwhelming empirical evidence shows that various financial time series of assets and their derivatives exhibit multifractality. However, the underlying physical mechanisms are not clear. This raises an important and subtle issue on the sources of the observed multifractality. It is usually argued that the sources of multifractality in financial time series are fat tails and/or long-range temporal correlations [178]. However, possessing linear correlations or long memory in time series is not sufficient for the emergence of multifractality and a nonlinear process is required to have intrinsic multifractality [538]. Further investigations consider the impacts of linear and nonlinear correlations on the observed multifractality [639, 958].

An even more critical question is to ask whether the empirical extracted multifractality is intrinsic or apparent. Indeed, there are many studies showing that empirical multifractal analysis of time series generated from monofractal financial models and mathematical models can produce spurious multifractality [959, 960]. Moreover, the estimated generalized Hurst exponents may deviate significantly from the expected values, especially for short time series [773, 799]. Therefore, statistical tests are absolutely necessary in empirical multifractal analysis.

It is necessary to propose a standardized terminology here for a consistent picture. Throughout this review, the empirical multifractal properties ($\tau(q)$ or $f(\alpha)$ ) extracted from time series using multifractal analysis methods is called apparent multifractality, although it is used to represent spurious multifractality by some researchers. Then, we refer to the apparent multifractality obtained from monofractal time series as spurious multifractality. Time series generated from nonlinear, multiplicative processes possess intrinsic or true multifractality, whose nonlinear component in the apparent multifractal is the effective multifractality [639].

7.1. Spurious multifractality and finite-size effect

7.1.1. Bifractality of fat-tailed time series.

Mantegna and Stanley defined the truncated Lévy flight (TLF) [961], and Kopoken introduced the exponentially truncated Lévy flight (ETLF) [962]. Nakao studied the asymptotical scaling behavior of the structure functions of uncorrelated time series whose amplitudes follow the exponentially truncated Lévy distribution and derived analytically the mass scaling exponents [717]:

Equation (398)

where $0<\gamma\leqslant 2$ is the stable index, and the singularity strength and singularity spectrum are given by

Equation (399)

and

Equation (400)

The bifractality also holds for ordinary Lévy processes [716] and time series for which the innovations have a power-law distribution with tail index $0<\gamma<2$ [718].

In addition, consider the class of stochastic process $X(t)$ whose increments form a strictly stationary sequence having heavy-tailed marginal distribution with index $\gamma$ , and which satisfies the strong mixing property with an exponentially decaying rate and zero mean. For $\gamma > 1$ , the scaling exponents of the structure functions can be expressed as [495, 963965]

Equation (401)

For fractional Lévy flights with tail index $\gamma$ and Hurst exponent H, the scaling exponents of the structure functions are [966]

Equation (402)

which depend on both the fat-tailedness and the linear correlations.

7.1.2. Spurious multifractality.

Bouchaud et al built an asymptotically monofractal model and found spurious multifractality through the structure function approach [959]. LeBron proposed a simple three-factor stochastic volatility (TFSV) model, which is asymptotically monofractal, and observed numerically spurious multifractality [756]. Synthetical time series generated from the model after calibrating with daily DJIA returns and the DJIA data share very similar structure functions. Bianchi calibrated multifractional Brownian motions with the daily S&P 500 index from 20 October 1982 to 27 May 2004 and the daily NIKKEI 225 index from 04 January 1984 to 27 May 2004 and found that the raw time series and their multifractional Brownian motion surrogates have very similar multiscaling behavior in the partition functions [967]. Similar results were obtained for other six major stock indices [745]. These results show that we cannot distinguish monofractality and multifractality in the time series generated from stochastic volatility models and multifractional Brownian motion models, and further tests for nonlinearity are called for.

Spurious multifractality has also been observed in the numerical multifractal analysis of time series generated from monofractal or bifractal mathematical models. For instance, for Lévy flights, the estimated $\tau(q)$ function is close to the bifractal expression (398), but is more or less smoothed around $q=\gamma$ , which results in a spectrum of singularities. There are well-conducted examples. For uncorrelated time series with a non-Gaussian exponential amplitude distribution, spurious multifractality is numerically observed [960]. The linear absolute increments series of Devil's staircases show weak spurious multifractality under the local singularity method and partition function approach but exhibit nice monofractality under the punctual singularity method [195]. The linear absolute increments series of fractional Brownian motions show strong spurious multifractality when using the local singularity method and prudent singularity method and relatively weak spurious multifractality when using the partition function approach [195].

Furthermore, numerical experiments show that the singularity width of the spurious multifractality depends on the power law tail exponent $\gamma$ and on the Hurst exponent H. For uncorrelated time series with q-Gaussian distributions that are bifractal, the singularity width $\Delta\alpha$ increases with decreasing tail exponent and the width values are quite large in the Lévy regime under MF-DFA and WTMM [968]. The fact that $\Delta\alpha$ decreases with increasing N has also been observed in the MF-DFA analysis of monofractal time series [609]. For Fourier transform surrogates of stochastic binomial measures, the singularity width $\Delta\alpha$ obtained from MF-DFA and centered MF-DMA increases with the Hurst index H [329].

7.1.3. Finite-size effect.

Zhou generated random numbers of different sizes (N) from the empirical distribution of DJIA volatilities, and different linear correlations with different Hurst exponents (H) were added by manipulating the generated time series using the IAAFT algorithm [639]. The surrogate time series thus do not have multifractal nature by construction. Numerical experiments unveil that the singularity width depends on N and H as follows

Equation (403)

This means that $\Delta\alpha(H,N)\to 0$ for long time series and the decay is faster for small H. Therefore, the linear correlation component $\Delta\alpha_{{\rm LM}}$ is the outcome of the finite-size effect and the linear correlation component of the apparent multifractal is

Equation (404)

Similarly, for autocorrelated time series with H  >  0.5 generated based on the Fourier filtering method [969], Grech and Pamuła determined numerically the dependence of $\Delta{h}$ with respect to the Hurst exponent H and the sample size N [970]:

Equation (405)

or, almost equivalently [971],

Equation (406)

both of which also suggest that the singularity width decreases with increasing sample size and decreasing Hurst exponent. There are corrections to these formulae when q is not sufficiently large [972].

Overall, time series display apparent multiscaling behaviour as a consequence of a slow transition phenomenon on finite time scales.

7.2. Empirical investigation of different sources

7.2.1. Impacts of linear and nonlinear correlations (Shuffle).

To investigate and distinguish different sources of apparent multifractality, the simplest and most adopted method is to shuffle or reshuffle the raw time series and compare the singularity widths of the raw time series ($\Delta\alpha$ ) and the shuffled time series ($\Delta\alpha_{{\rm shuf}}$ ). Since the shuffling procedure destroys any linear or nonlinear long-range correlations while preserving the distribution of the values of the time series, $\Delta\alpha_{{\rm shuf}}$ qualifies the contribution of the broad probability distribution and probably the finite-size effect as well [178]

Equation (407)

The impact of correlations on the apparent multifractality is qualified by [178]

Equation (408)

which can be abbreviated into a single value [746, 973]

Equation (409)

where $\Delta{H}=H_{\max}-H_{\min}$ . Equivalently, we have

Equation (410)

where $\Delta\alpha=\alpha_{\max}-\alpha_{\min}$ . A comparison of the values of $\Delta\alpha_{{\rm shuf}}$ and $\Delta\alpha_{{\rm corr}}$ can determine which source dominates.

Most empirical analyses reported a significant decrease of the apparent multifractality after shuffling, indicating the nontrivial impact of long-range correlations. Empirical examples contain the daily returns of 29 commodities and 2445 stocks [224], the trade-by-trade returns and the intertrade durations of 30 DJIA stocks and 30 DAX stocks from 1 December 1997 to 31 December 1999 [726], the Iranian rial-US dollar exchange rates from 24 September 1989 to 15 November 2003 ($\Delta\alpha = 3.54$ and $\Delta\alpha_{{\rm shuf}} = 0.60$ ) [829], the 1 min WIG20 index returns from 4 January 1999 to 31 October 2005 ($\Delta\alpha \approx 0.14$ and $\Delta\alpha_{{\rm shuf}} \approx 0.04$ ) [794], the 1 min index returns of S&P 500, DAX and WIG 20 from May 2004 to May 2006 [113], the 20s returns of the DEM/USD exchange rates from 1 October 1992 to 30 September 1993 ($\Delta\alpha \approx 0.69$ and $\Delta\alpha_{{\rm shuf}} \approx 0.34$ ) [823], the 20s volatilities of the DEM/USD exchange rates from 1 October 1992 to 30 September 1993 ($\Delta\alpha \approx 2.5 $ and $\Delta\alpha_{{\rm shuf}} \approx 0.37$ ) [823], the daily DJIA returns from 26 May 1896 to 27 April 2007 ($\Delta\alpha = 0.22$ and $\Delta\alpha_{{\rm shuf}} = 0.18$ ) [958], the daily returns of the Indian BSE index from July 1997 to December 2007 ($\Delta\alpha = 0.442$ and $\Delta\alpha_{{\rm shuf}} = 0.399$ ), the Indian NSE index from August 2002 to December 2007 ($\Delta\alpha = 0.558$ and $\Delta\alpha_{{\rm shuf}} = 0.473$ ), and the S&P 500 index from July 1997 to December 2007 ($\Delta\alpha = 0.548$ and $\Delta\alpha_{{\rm shuf}} = 0.328$ ) [761], the daily returns of the SENSEX index from January 2003 to December 2009 [781], the daily volatility of spot WTI and Brent oil prices from 20 May 1987 to 14 October 2009 ($\Delta\alpha = 0.439$ and $\Delta\alpha_{{\rm shuf}} = 0.142$ for WTI and $\Delta\alpha = 0.371$ and $\Delta\alpha_{{\rm shuf}} = 0.159$ for Brent) [850], the daily returns of four foreign exchange rates (JPY/USD, HKD/USD, KRW/USD and THD/USD) from 1991 to 2005 [818], the daily returns of the Czech PX index from April 1994 to December 2010 ($\Delta\alpha = 0.69$ and $\Delta\alpha_{{\rm shuf}} = 0.44$ ) [334], the daily gold price returns from 1973 to 2011 [841], the daily returns of the Athens Stock Exchange index, the Irish Stock Exchange index and the Portuguese Stock Exchange index [974], the daily index returns for the US and seven Asian markets from 1 July 2002 to 31 December 2012 [975], the daily returns of the CSI energy sub-industry index (ESI) from 4 January 2005 to 15 June 2015 ($\Delta\alpha = 0.505$ and $\Delta\alpha_{{\rm shuf}} = 0.427$ ) [802], and so on.

There are also examples for which the reshuffled time series has a broader singularity width than the raw time series. For instance, it is reported that $\Delta\alpha = 0.773$ and $\Delta\alpha_{{\rm shuf}} = 1.029$ for the daily volatilities of the CSI energy sub-industry index (ESI) from 4 January 2005 to 15 June 2015 [802], and $\Delta\alpha = 0.44$ and $\Delta\alpha_{{\rm shuf}} = 0.50$ for the daily returns of the Polish WIG index from June 1993 and December 2010 [334]. The observation that $\Delta\alpha_{{\rm shuf}}$ is approximately equal to or even greater than $\Delta\alpha$ should be interpreted with caution. The basic explanation is that the apparent multifractality detected in the time series under investigation is independent of any correlation structures in the data and is thus fully determined by the usually non-Gaussian distribution. However, due to the ubiquitous presence of finite-size effects in multifractal analysis, even multifractal time series generated from Markov-switching multifractal models may lead to the same result [976]. Hence, the observation $\Delta\alpha_{{\rm shuf}} \geqslant \Delta\alpha$ per se does not deny the presence of intrinsic multifractality. One should either perform statistical tests based on the IAAFT algorithm or choose a proper scaling range. Indeed, such an anomaly disappears when the scaling range contains larger time scales [641]. The shuffling test does show that $H(1)=H(2)$ is rejected for time series generated from the multifractal model of asset returns [977].

The adopted multifractal analysis method could also have impacts on the results. Caraiani found that, for the daily returns of the Hungarian BUX index from June 1993 to December 2010, $\Delta\alpha = 0.54$ and $\Delta\alpha_{{\rm shuf}} = 0.34$ for the MF-DFA method and $\Delta\alpha = 0.49$ and $\Delta\alpha_{{\rm shuf}} = 0.65$ for the EMD-MF-DFA method [334]. Although the results of the shuffling test could be different, there is a general result showing that $\alpha_{{\rm shuf}}(q=0)\approx 0.5$ for shuffled time series.

7.2.2. Impact of nonlinear correlations (Surrogate).

A time series has only linear correlations if all the correlation structures of the time series are contained in the autocorrelation function, or equivalently, in the Fourier power spectrum. In this case, its dynamics can be captured by AR or ARMA models. If the correlation structures of a time series cannot be captured by the power spectrum, it contains nonlinearity. There are several methods to generate linear surrogates without nonlinearity. The most commonly used methods for generating surrogates include the Fourier transform algorithm [978], the amplitude adjusted Fourier transform (AAFT) algorithm [978], and the iterative amplitude adjusted Fourier transform (IAAFT) algorithm [979].

The Fourier transform surrogate time series are constructed to have the same linear correlations as the raw data, while any nonlinear correlations are eliminated and the amplitude distribution becomes Gaussian [980]. One performs discrete Fourier transform of the raw time series, randomizes the phases of the amplitude through multiplying the discrete Fourier transform of the data by random phases, and performs an inverse Fourier transform to create phase-randomized surrogates [978]. The phase randomization approach, adopted in the multifractal analysis of financial time series, was introduced for the multifractal analysis of heartbeat intervals [981]. Because the phase randomization procedure weakens non-Gaussianity in time series, the difference $ H_{{\rm PDF}}(q) = H(q)- H_{{\rm FT}}(q)$ between $H(q)$ and $H_{{\rm FT}}(q)$ is usually argued to reflect the impact of broad distributions [856, 982]. This can also be represented as [746, 973]

Equation (411)

or equivalently

Equation (412)

However, by construction, both the impacts of PDF and nonlinear correlations are removed from the Fourier transform surrogates. The difference $\Delta\alpha- \Delta\alpha_{{\rm FT}}$ contains both PDF and nonlinearity effects.

Numerous studies have implemented the shuffling test and the Fourier transform surrogate test to understand the sources of the empirically estimated apparent multifractality in financial time series. The majority of them found that

Equation (413)

An almost complete list includes the 5 min stock returns for the 100 largest US companies traded on NYSE or NASDAQ from 1 December 1997 to 31 December 1999 [983], the daily returns of the Iranian rial-US dollar exchange rates from 24 September 1989 to 15 November 2003 [829], the daily TEPIX returns from 20 May 1995 to 19 January 2005 [973], 1 min returns of the TAIEX index and 150 stocks of the TSEC Taiwan 50 and TSEC Taiwan Mid-Cap 100 indices from January 2001 to August 2007 [796], the daily return and volatility time series of market indices for Argentina, Brazil, Chile, Colombia, Mexico, Peru, Venezuela and the US from January 1995 to February 2007 [746], the daily returns of 172 individual stocks included in the Korean KOSPI index from 1980 to 2003 [807], the daily averaging prices of four Korean agricultural commodities (leek, radish, onion, and Korean cabbage) from 3 January 2001 to 23 September 2009 [863], the daily logarithmic variations of trading volume of the S&P 500 index from 1980 to 2009 [900], the daily returns of spot and futures prices of WTI crude oil from 2 January 1990 to 9 March 2010 [656], the daily exchange rate returns of USD/AUD, USD/EUR and CNY/USD [657], the daily and weekly spot rates of VLGCs on the benchmark Persian Gulf (Ras Tanura) to Japan (Chiba) route from 3 January 1992 to 24 June 2009 [739], the daily returns of four commodity futures (hard winter wheat from 28 December 1993 to 18 September 2009, soy meal from 17 July 2000 to 18 September 2009, No. 1 soybean from 15 March 2002 to 18 September 2009, and corn from 22 September 2004 to 18 September 2009) in China (it is more complicated for the USA commodity futures) [862], the 1 min realized volatility of SSEC from 4 January 2000 to 31 December 2007 [984], the daily returns of CIB-CNY Composite Index (CCI) compiled by the China Industrial Bank Research from 21 July 2005 to 30 June 2008 [827], the daily returns of 144 Korean equity and balanced funds from 1 January 2002 to 31 December 2010 [800], the daily returns of the CSI energy sub-industry index from 4 January 2005 to 15 June 2015 [802], and the daily volume changes of the Moroccan MASI index from 1 January 2000 to 20 January 2017 [949].

There are also examples showing that

Equation (414)

such as the 5 min returns of the SSEC and SZCI from 4 January 2002 to 31 December 2008 [885], the daily returns of the world gold prices from 1968 to 2010 [840], the daily logarithmic variations of individual and institutional traders' trading volume in Tehran Stock Exchange from 1998 to 2009 [900], the daily exchange rates of CAD/USD, JPY/USD, USD/GBP, DKK/USD, NOK/USD, MXN/USD, BRL/USD, INR/USD and THB/USD [657], the daily returns of CIB-CNY Composite Index from 30 June 2008 to 18 June 2010 [827], the 10 min returns of the CSI 300 futures IF1009 from 16 April 2010 to 17 September 2010 [812], the daily spot rates of ships in tanker markets from 27 January 1998 (or 1 July 2004 or 15 July 2005) to 5 August 2013 [740], the daily returns of WTI oil and Baltic Exchange Dirty Tanker index from 4 January 2000 to 4 November 2015 [928], and the daily returns of the Moroccan MASI index from 1 January 2000 to 20 January 2017 [949].

The impacts of different sources of apparent multifractality could change for different sample periods. Zhou et al investigated the 5 min high-frequency returns of China's CSI 300 index from 4 April 2005 to 19 January 2012 and found that $\Delta\alpha > \Delta\alpha_{{\rm FT}} > \Delta\alpha_{{\rm shuf}}$ [658]. They further found that $\Delta\alpha > \Delta\alpha_{{\rm FT}} > \Delta\alpha_{{\rm shuf}}$ for the period from 8 April 2005 to 16 April 2010 and $\Delta\alpha > \Delta\alpha_{{\rm shuf}} > \Delta\alpha_{{\rm FT}}$ for the period from 19 April 2010 to 19 January 2012, where 16 April 2010 is the date when the CSI 300 stock index futures were first listed on the China Financial Future Exchange. Even more complicated situations are observed for the daily gold price returns in the Indian market and the daily global consumer price index returns [843, 844].

In contrast, the AAFT algorithm aimed at generating surrogate time series having the same Fourier spectrum and distribution as the raw data [978]. The values of the raw data are first replaced by random numbers taken from the normal distribution based on the rank-ordering technique. The rank-ordered time series has the same rank ordering as the raw series such that their ith elements have the same rank in both time series. The Fourier transform surrogate of the rank-ordered time series is obtained and transformed back to have the same amplitude distribution as the raw time series through rank ordering. However, the AAFT algorithm can produce the same power spectrum only for time series with infinite length in the limit $N\to\infty$ and does not usually result in the same sample power spectra [979]. To overcome this shortcoming, the IAAFT algorithm is introduced, which improves the AAFT algorithm [979]. In the IAAFT algorithm, the values of the raw time series $\{x(i)| i=1,2,\cdots,N\}$ are sorted, resulting in a new sequence {yN}. We obtain the squared amplitudes $\{Y_k^2\}$ of the Fourier transform of {yN}. The initial sequence $\{y_N^{(0)}\}$ of the iteration is a random shuffle of {yN}. In the j th iteration, the squared amplitudes $\{Y_k^{2,(\,j)}\}$ of the Fourier transform of $\{y_N^{(\,j)}\}$ are obtained and replaced by $\{Y_k^2\}$ , which are transformed back, and then the resulting series are replaced by {xN} with rank ordering. The iteration stops until a given accuracy is reached. For both AAFT and IAAFT, the surrogates still contain Fourier phase correlations [985]. Unfortunately, the IAAFT surrogate test has been applied only in very few studies [639, 741, 958, 986]. We note that there are also wavelet-based methods to generate multifractal surrogates preserving the multifractal nature of the raw time series [987, 988].

Similar studies have been conducted on the multifractal cross correlations of pairs of time series, such as the daily returns of copper futures and aluminum futures traded on the Shanghai Futures Exchange (SHFE) from 15 January 2004 to 4 July 2011 and copper and aluminum futures on the London Metal Exchange (LME) [952], the daily returns of the BDI-Brent pair and the BDI-WTI pair [951], the daily exchange rates CNY/USD and CNH/USD from 1 May 2012 to 29 February 2016 [944], the daily returns and volume changes of the Moroccan MASI index from 1 January 2000 to 20 January 2017 [949], and the daily returns of WTI oil and Baltic Exchange Dirty Tanker index from 4 January 2000 to 4 November 2015 [928], in which the singularity widths reduce for the shuffled and surrogate data. There are also cases in which an even larger width is observed after shuffling or surrogate, such as the daily data of market fear gauge VIX and two Japanese Yen exchange rates (USD/JPY and AUD/JPY) from 5 January 1998 to 18 April 2016 [938], the daily returns of RMB nominal effective exchange rate index (NEER) and four Euronext Rogers international commodity indices (whole, energy, metals, agriculture) from 21 July 2005 to 15 March 2016 [942], and the pairs between the steel warehouse-out quantity of three warehouses (YZ, JS and BY) in China [468]. There anomalous results should be interpreted with caution and further studies are needed.

7.2.3. Impact of broad distributions.

For log-normal $\mathcal{W}$ -cascades [360], according to equation (277), the singularity width is expressed as

Equation (415)

where $\sigma$ is the standard deviation of the $\ln(w)$ random variable which is normally distributed. It follows immediately that $\Delta\alpha$ increases with $\sigma$ . Oh et al performed extensive numerical analysis to validate this relation between singularity width and extreme values [818]. Specifically, they generated multifractal time series from log-normal $\mathcal{W}$ -cascades with different means $\mu$ and different standard deviations $\sigma$ . The tail exponents $\gamma$ of the multifractal data were obtained using the maximum likelihood estimator of Clauset et al [989]. The tail exponent is smaller if the tail is fatter and there are more extreme values. It is found that $\Delta\alpha$ is independent of $\mu$ and increases when $\gamma$ decreases.

To investigate the impact of the distribution in empirical studies, Zhou adopted the rank-ordered remapping technique to generate surrogate time series by replacing the raw data with random numbers drawn from a prescribed distribution while keeping the original orders [958]. The algorithm is described as follows. For a given distribution, we generate a sequence of random numbers $\{z_0(i)| i=1,2,\cdots,N\}$ [990], which are rearranged such that the rearranged series $\{z(i)| i=1,2,\cdots,N\}$ has the same rank ordering as the raw series $\{x(i)| i=1,2,\cdots,N\}$ . In other words, $z(i)$ should be the nth largest number in sequence $\{z(i)| i=1,2,\cdots,N\}$ if and only if $x(i)$ is the nth largest number in the $\{x(t)| i=1,2,\cdots,N\}$ sequence [910, 958]. The series $\{z(i)\}$ is rescaled to have the same standard deviation $\sigma$ of the returns $\{x(i)\}$ :

Equation (416)

where $\sigma_z$ is the standard deviation of $\{z(i)\}$ and $\mu$ is the sample mean of $\{x(i)\}$ .

Zhou investigated two families of distributions with fat tails in the MF-DFA analysis of daily DJIA returns [958]. The first one is a family of 'double' Weibull distributions

Equation (417a)

where the shape parameter $\beta$ describes the heaviness of the tails and we require that $\beta<1$ . The second one is a family of Student's t distributions

Equation (417b)

which have power-law tails with tail exponent $\gamma$ . Numerical simulations showed that $\Delta\alpha$ decreases with increasing $\beta$ for the Weibull family and increasing $\gamma$ for the Student family [958]. Speaking differently, time series with heavier tails (or smaller $\beta$ or $\gamma$ ) exhibit stronger multifractality. Qualitatively similar results are obtained for the MF-PF method [639].

He and Wang investigated the impact of the distribution on the multifractality of SSEC and S&P 500 returns from 20 December 1990 through 7 January 2016 using MF-DFA [151]. The singularity width of the apparent multifractality is $\Delta\alpha = 0.593$ for SSEC and $\Delta\alpha = 0.597$ for S&P 500. When Gaussian surrogates are used, the singularity width decreases significantly to $\Delta\alpha_{{\rm surr}} = 0.294$ for SSEC and $\Delta\alpha_{{\rm surr}} = 0.241$ for S&P 500. In contrast, shuffling the raw time series has less impact and $\Delta\alpha_{{\rm shuf}} = 0.645$ for SSEC and $\Delta\alpha_{{\rm shuf}} = 0.400$ for S&P 500.

7.2.4. Impact of extreme values.

The multifractal behavior of time series is influenced by large fluctuations, especially extreme events such as market crashes, because the small singularities characterize large fluctuations unveiled by large q values. Different types of investigation have been designed and applied.

Studies show that removing a single market crash has a large influence on the singularity width. Using the structure function approach, LeBaron compared the multifractal properties of the daily DJIA index time series sampled from 1 January 1897 to 3 February 1999 and the same sample with the crash on 19 October 1987 removed [756]. He found that the removal of the Black Monday crash had significant influence mainly on multifractal properties for positive q's. The minimum singularity $\alpha_{\min}$ became smaller and the singularity width $\Delta\alpha$ narrower. Based on value-weighted NYSE-AMEX-NASDAQ index ('CRSP Index') and five individual stocks from July 1962 to December 1998, Calvet and Fisher showed that high-order structure functions of the whole samples vary considerably, with striking linearity after simply removing the crash on 19 October 1987 from the samples [143].

One can also compare the multifractal behavior of crisis periods and calm periods. Using MF-DFA, Siokis compared the multifractal behavior of daily DJIA returns around two crashes (from August 1928 to January 1931 and August 1986 to December 1988 respectively) and a 'non-crisis' calm period in 1964. The two crashes happened on 28 October 1929 with one-day loss of 14.5% and on 19 October 1987 with a one-day loss of 25.6% [991]. It is found that, with $q\in[-5,5]$ , $\Delta\alpha = 1.04$ for the 1929 crash, $\Delta\alpha=0.97$ for 1987, and $\Delta\alpha=0.19$ for the calm period. Redelico and Proto investigated the NASDAQ 100 index and reported that the singularity spectrum widened from the calm period before 1 April 1997 to the bubble period before 03/27/2000 [992].

The multifractal behavior of the whole sample and of the time series around crises also differs. Caraiani studied the multifractal behavior of the daily return time series of three emerging European stock market indices (Czech PX index from April 1994 to December 2010, Hungarian BUX index from June 1993 to December 2010, and the Polish WIG index between June 1993 and December 2010) [334]. The results obtained from MF-DFA and EMD-based MF-DFA show that the singularity width $\Delta\alpha$ is larger during the crisis period (2008–2009) than during the whole sample, with one exception. Although the crisis does influence the overall shape of the multifractal spectrum, no significant influence of the crisis can be found on the multifractality of the series according to the $\chi^2$ -test.

There are also significant differences in the multifractal behavior before and after crashes. Yalamova investigated the multifractal behavior of seven stock market indices (DJIA, S&P500, AUS, TSX, NIKKEI, NASDAQ and FTSE) before and after the 1987 crash [993]. The most probable singularity exponent $\alpha_0$ decreases for DJIA, S&P500, TSX, NASDAQ and FTSE, increases for AUS, and remains stable for NIKKEI. The intermittency $\lambda$ estimated from the log-normal MMAR model increases, and $\alpha_{\min}$ decreases (most pronounced for AUS and NIKKEI). Siokis compared the multifractal behavior of daily DJIA returns 300 trading days before and after the two crashes (1929, 1987) and found that the singularity width increases dramatically after the crashes [991]. Oh et al investigated how the 1997 Asian crisis affected the multifractality of four Asian foreign exchange rates (JPY/USD, HKD/USD, KRW/USD and THB/USD) from 1991 to 2005 [818]. They found that, for the two sub-periods from 1991 to 1996 (before the crisis) and from 1998 to 2005 (after the crisis), the singularity width became larger after the crisis. Yim, Oh and Kim performed multifractal analysis of the return time series of 27 highly capitalized TAQ stock data sets in the KOSPI market and obtained the singularity widths in three periods, that is, 2 January 2008 to 16 September 2008 (before crisis), 17 September 2008 to 26 November 2008 (during crisis) and 27 November 2008 to December 2009 (after crisis). They found counter-intuitively that the singularity width in the crisis period is smaller than those before and after crisis periods [494]. The multifractal strength of the daily returns of sector indices increased after the great financial crisis in 2008, as shown with nine Dow Jones sector ETF indices from 12 June 2000 to 23 July 2015 [994] and ten DJIM sector indices of Islamic stock markets from 9 November 1998 to 5 March 2015 [995].

Replacing extreme values with non-extreme values can also be used to illustrate the impacts of extreme values on the degree of multifractality. Oh et al created new time series versions by replacing values above a certain threshold M in units of the standard deviation $\sigma$ of the time series by linear interpolation [818]. Taking the daily return time series of four foreign exchange markets JPY/USD, HKD/USD, KRW/USD and THD/USD from 1991 to 2005 as examples, they showed that $\Delta\alpha$ increases with M. Zhou adopted a slightly different replacement strategy by using returns sampled randomly from the return series with $|r(t)| <M\sigma$ to analyse daily DJIA returns from 26 May 1896 to 27 April 2007 and obtained the same qualitative conclusion [958], which was confirmed by Siokis [991]. Alternatively, Green et al proposed to replace any returns with $|r(t)|\leqslant{M}\sigma$ by ${{\rm sign}}(r)M\sigma$ for the daily returns of the DJIA from 1928 to 2012 and 1 min returns of the Dow Jones Euro Stoxx 50 from May 2008 to April 2009 and found the similar effect [632]. Similar results are observed for the daily index returns for the US and seven Asian markets from 1 July 2002 to 31 December 2012 [975] and for the S&P 500, NASDAQ, FTSE 100 and NIKKEI 225 during highly anxious time periods around three major crises (the Black Monday, Dot-Com and the latest Great Recession) [996].

He and Wang investigated the effects of extreme events on the apparent multifractality of the daily return time series from 20 December 1990 through 7 January 2016 for China's SSEC index and the S&P 500 index [151]. They replaced the 2.5% largest (positive) returns and 2.5% smallest (negative) returns with returns randomly from the remaining 95% returns. They found that the removal of the 5% extreme returns mainly impacts the behavior for positive q's, which reduces the minimum singularity $\alpha_{\min}$ and narrows the singularity width $\Delta\alpha$ . Zhou et al observed similar narrowing effect after replacing the 5% extreme returns of the 5 min CSI 300 index for the periods from 8 April 2005 to 16 April 2010 and from 4 April 2005 to 19 January 2012, but a widening effect with $\Delta{\alpha}$ from 0.29 to 0.33 for the period from 19 April 2010 to 19 January 2012 [658].

When a high-frequency intraday price or index time series is analyzed with the partition function approach, the overnight jumps may influence the apparent multifractality. Lee et al considered the 1 min time series of Korean KOSPI from 30 March 1992 through 30 November 1999 and found that the removal of overnight jumps changed the $H(q)$ curve [791].

Indeed, the local Hölder exponents localized at the extreme events are also outliers when compared with the local Hölder exponents of other non-extreme events [997]. Therefore, removing the extreme events is equivalent to removing singularity outliers and thus narrows the width of the singularity spectrum. Identifying local Hölder exponent outliers also provides a possible method for the detection of outliers in time series [997].

7.3. Components of apparent multifractality

The sources of apparent multifractality in empirical time series are mainly due to the linear and nonlinear correlations and the fat-tailedness of probability distribution [500, 639, 958]. It is important to further discriminate, decompose and quantify the degree of influence of different sources. However, it is very difficult to discriminate and decompose the quantitative influence of different sources because they are nonlinearly entangled. A feasible strategy is to assume that the impacts of these sources are independent.

Using the MF-DFA method, Moyano et al quantified the multifractal components with the generalized Hurst exponents [500]. The correlation component $\Delta{h}_{{\rm corr}}$ is obtained through shuffling the raw time series to eliminate linear and nonlinear correlations such that [178, 500, 983]

Equation (418)

The influence of a non-Gaussian PDF is determined by comparing the multifractal behavior of shuffled time series and their Fourier transform surrogates. The shuffled time series differ from their Fourier transform surrogates only in the PDFs, as the Fourier transform surrogates have Gaussian distributions. Hence, the PDF component of multifractality is (see also [804])

Equation (419)

They further suggested that the nonlinearity component is the multifractality embedded in Fourier transform surrogates of shuffled time series:

Equation (420)

Moyano et al studied the 1 min trading volume time series of the 30 DJIA stocks from the 1st of July until the 31 December of 2004 and found that $\Delta{H}=0.675$ , $\Delta{H}_{{\rm corr}}= 0.027$ , $\Delta{H}_{{\rm PDF}}=0.445$ , and $\Delta{H}_{{\rm nlin}}=0.203$ [500].

For the partition function method of multifractal analysis, Zhou assumed that the apparent multifractality can be decomposed into three components caused by the nonlinear correlation ($\Delta\alpha_{{\rm NL}}$ ), the linear correlation ($\Delta\alpha_{{\rm LM}}$ ) and the fat-tailed PDF ($\Delta\alpha_{{\rm PDF}}$ ) [639]:

Equation (421)

The intrinsic multifractal nature is characterized by the effective multifractality $\Delta\alpha_{{\rm ef\,\!f}}$ composing of the nonlinearity component $\Delta\alpha_{{\rm NL}}$ and the PDF component $\Delta\alpha_{{\rm PDF}}$ :

Equation (422)

where the linear correlation component $\Delta\alpha_{{\rm LM}}$ can be determined by IAAFT surrogate time series of the raw time series. Note that the concepts of efficient multifractality $\Delta\alpha_{{\rm ef\,\!f}}$ and linear correlation component $\Delta\alpha_{{\rm LM}}$ are essentially equivalent to observed multifractality and multifractal bias studied by Grech and Pamuła [998].

It is found that the shuffled time series have vanishing singularity width, namely, $\Delta\alpha_{\mathrm{shuf}} \approx 0$ , which means that the fat-tailedness of PDF alone cannot produce any apparent multifractality [639]. To determine $\Delta\alpha_{{\rm NL}}$ and $\Delta\alpha_{{\rm PDF}}$ , Zhou assumed that the PDF component of Gaussian surrogates is negligible, that is, $\Delta\alpha_{{\rm norm,PDF}}=0$ , where Gaussian surrogates are normally distributed and have the same rank ordering as the raw data [639]. Hence, the nonlinearity component does not change when the PDF is replaced by a Gaussian distribution:

Equation (423)

According to equation (422), the effective singularity width of the Gaussian surrogates is

Equation (424)

It follows immediately that

Equation (425)

Zhou determined the multifractal components of the daily DJIA volatility time series [639]. Chen et al used the framework of equation (421) to determine the multifractal components of the daily return time series of Baltic Panamax Index and Baltic Capesize Index from 1 March 1999 to 26 February 2015 [741], in which they proposed an alternative, self-consistent calculation method based on $\Delta\alpha_{{\rm PDF}}=\Delta\alpha_{{\rm shuf}}$ obtained from the shuffled time series and $\Delta\alpha_{{\rm PDF}}+\Delta\alpha_{{\rm LM}}$ is the apparent multifractality of the IAAFT surrogates. Chen et al also obtained the three components of the multifractal cross correlations between the daily returns of WTI oil and Baltic Exchange Dirty Tanker index from 4 January 2000 to 4 November 2015 [928]. Cao and Xu studied the sources of multifractality in the daily returns of carbon emission rights in futures markets of Certified Emission Reduction and European Union Allowances from 14 March 2008 to 31 December 2012 [336]. For surrogates generated with shuffling and rank ordering with Gaussian noise, they found that $\Delta\alpha>\Delta\alpha_{{\rm shuf}}>\Delta\alpha_{{\rm surr,G}}$ , suggesting that the fat-tailed PDF has a stronger impact than the correlation component.

As a general remark, we stress that the assumption of independent components in the above decomposition methods is only a first-order assumption. In addition, the decomposition methods may not apply to other multifractal analysis methods [639].

7.4. Statistical tests

7.4.1. Test based on parameterized multifractal models.

Lux proposed a parametric reshuffle test [999]. Assume that the dynamics of the financial time series under investigation follow certain multifractal cascades with the mass exponent function $\tau(q; \lambda)$ and the multifractal spectrum $f(\alpha; \lambda)$ having explicit expressions, where $\lambda$ is the parameter of the multifractal model and there exist a critical value $\lambda_c$ such that the time series becomes monofractal when $\lambda=\lambda_c$ . Suitable candidate multifractal models include the p -model [361], the log-normal MMAR model [483], and so on. Because the reshuffled time series do not contain any true multifractality, the corresponding value of $\lambda$ is $\lambda_c$ . The null hypothesis is thus

Equation (426)

and the null model is the reshuffled time series with any temporal correlations eliminated. The test can be conducted based on the mass exponent function $\tau(q; \lambda)$ or the multifractal spectrum $f(\alpha; \lambda)$ .

For the raw time series and a number of reshuffled time series, the empirical expressions of $\tau(q)$ (or $f(\alpha)$ ) are obtained and fitted to $\tau(q; \lambda)$ (or $f(\alpha; \lambda)$ ) to obtain the estimate of $\lambda$ and the distribution of $\lambda_c$ . One-sided or two-sided test can be designed following the standard procedure. The p -value of the one-sided test is the probability that $\lambda>\lambda_c$ . If the p -value falls below the prefixed significance level, the null hypothesis is rejected (meaning that the deviation of the measured value of $\lambda$ from $\lambda_c$ is unlikely to be just a statistical fluctuation of the non-multifractal null model).

Lux performed empirical analyses on the daily time series of the New York Stock Exchange Composite index (01/1966–12/1998) and the German DAX (10/1959–12/1998), the DEM/USD exchange rates (01/1974–12/1998), and the gold prices of the London Precious Metal Exchange (01/1978–12/1998) [999]. Although the estimated $\lambda$ values from $\tau(q; \lambda)$ and $f(\alpha; \lambda)$ differ to some extent, the corresponding p -values are very close to each other. Surprisingly, the results show that the null model cannot be rejected. Lux argued that the overall conclusion is not necessarily a demonstration of the absence of multifractality, but could also be an implication of the poor power of the adopted multifractal analysis [999]. One should note that the p  model and the MSM log-normal model have parabolic shapes and are symmetric, while empirical multifractal spectra are often skewed [669]. Hence, a third possibility is that the dynamics of the time series cannot be properly described by the mathematical model adopted in the test. In other words, the null hypothesis is not rejected because the multifractal model is misspecified.

7.4.2. Tests based on $ {f(\alpha)} $ .

Multifractality of a time series is quantified by measures of multifractal strength with the multifractal strength defines such that it vanishes for monofractal time series. Hence, the natural idea is to use a multifractal strength as the test statistic and check if it is significantly different from zero. The null hypothesis is that the time series is monofractal and the alternative hypothesis is that the raw time series possesses multifractal properties.

Intuitively, the test consists in asking if the empirical multifractal spectrum is undistinguishable from a spectrum shrinking to the neighborhood of $(\alpha,f)=(0,1)$ . The most common measure of multifractal strength is thus the singularity width $\Delta\alpha$ . The associated null hypothesis is

Equation (427)

where $\Delta\alpha_{{\rm null}}$ is the singularity width of monofractal time series generated from the null model. The p -value is the associated probability of false alarm for multifractality, which is determined as [205, 879, 880]

Equation (428)

where n is the number of realizations generated from the null model and ${\bf{{\rm I}}}$ is the indicator function. Similarly, defining $F = [\,f(\alpha_{\min}) + f(\alpha_{\max})]/2$ , an analogous null hypothesis is [205, 879, 880]

Equation (429)

where the p -value is

Equation (430)

We can also use other measures of multifractal strength (see section 8.1) such as $\Delta{D}=D_{\max}-D_{\min}$ , $\Delta{H}=H_{\max}-H_{\min}$ or $\Delta{H}(1,2)$ as the statistic and perform the same statistical test. Because all these measures are positive, we adopt the one-side test.

Because the analytical expressions of the asymptotic distributions of the multifractal strength measures are unknown, numerical determination of the distributions is needed. To proceed with the statistical tests, we need to determine the null model. An early null model is Gaussian white noises for the test of long-range correlations [720, 1000], which is however not suitable for multifractality tests. The widely used null model is the shuffled time series [205, 879, 880]. A somewhat similar idea is to use bootstrap and block bootstrap [10011003], which is conducted on the residuals of the AR model [1004]. These methods apply well for uncorrelated time series such as financial returns [1005, 1006]. However, for long-range correlated time series, these methods destroy both linear and nonlinear correlations and their null hypothesis implies that there is not any temporal correlations in the tested time series.

The proper null model for testing the presence of multifractality should generate surrogate time series that have the same distribution and the same linear correlations as the raw time series, but do not possess any nonlinear correlations. Indeed von Hardenberg et al have already discussed this issue and suggested to use the AAFT algorithm or the further refined IAAFT algorithm [960]. Although they did not provide any empirical examples, we argue that the IAAFT null model is the one that should be adopted.

7.4.3. Tests based on $ {\tau(q)} $ .

The multifractal strength based on the $\tau(q)$ curve can also be defined. The main idea is to test if $\tau(q)$ is a linear function of q. If the Hurst exponent of the raw time series is known, a more specific statistic is [797]:

Equation (431)

Inspired by the test of self-similarity [622], we can assume a parabolic form for $\tau(q)$ :

Equation (432)

and test if the second-order coefficient C is significantly different from 0. Since $\tau(q)$ of multifractals are concave, that is, C  <  0, the null hypothesis reads

Equation (433)

Other candidate statistics can also be adopted, such as the area enclosed by $\tau(q)$ and the diagonal line through $(q_{\min},\tau(q_{\min}))$ and $(q_{\min},\tau(q_{\min}))$ , the average distance between $\tau(q)$ and the diagonal line, the maximum distance between $\tau(q)$ and the diagonal line, and so on. Again, the IAAFT surrogate algorithm is used as the null model.

Alternatively, the bootstrapped wavelet leader test has been thoroughly studied, which uses jointly wavelet leaders, log-cumulants and bootstrap procedures [297, 298]. Under the leader-based multifractal formalism, the function $\zeta(q)$ is expanded as a polynomial of q

Equation (434)

where cp are the log-cumulants. At each scale, n bootstrap samples are generated from the original sample of wavelet leaders, drawn blockwise and with replacement. This approach is used together with the MF-WL method for the multifractal analysis.

One can also design non-parametric statistical tests by considering $\tau''(q)={{\rm d}}^2\tau(q)/{{\rm d}}q^2$ or equivalently $H'(q)={{\rm d}}H(q)/{{\rm d}}q$ [1007], which is related to the concept of intermittency (see section 8.1.3). If a time series is monofractal instead of multifractal, the scaling function $\tau(q)$ is linear with respect to q and $H(q)$ is a constant. Hence, the null hypothesis that the time series is monofractal corresponds to the statistics $\tau''(q)$ and $H'(q)$ being equal to 0:

Equation (435)

Statistics using this idea that can be numerically implemented have been developed [1007]. Application of the method to intraday 5 min return time series of the EUR and JPY exchange rates against the USD and of the S&P 500 and of the NASDAQ-100 equity indices from 3 January 2007 until 31 December 2010 shows that the null hypothesis of monofractality is strongly rejected in all cases; however, local monofractality cannot be rejected for some shorter sub-periods [1007]. We think that this intriguing method and its applications deserve further studies. For instance, one can consider also finite size effects and improve the numerical calculation of $\tau''(q)$ and $H'(q)$ by replacing the central difference with the analytic derivative of fitted polynomials [990, 1008, 1009].

8. Applications

In addition to the multifractal analysis methods presented in section 6 to investigate the presence of multifractality in financial time series, we now survey miscellaneous applications of multifractality. Multifractal strengths can be used as measures to quantify market inefficiency and market risks. The rational is that a perfectly efficient market should be characterised by the absence of any pattern, so that relative price changes (returns) are pure white noise. In contrast, the presence of multifractality reveals remarkable features and thus the strength of multifractality can be interpreted as a measure of the deviation from perfect efficiency.

We will show below that the most convincing empirical result is that multifractality-based methods usually perform better in forecasting volatility and Value-at-Risk than conventional econometric models.

8.1. Measures of multifractality strength

8.1.1. Measures based on multifractal spectrum.

The most widely adopted measure of multifractal strength is the difference between the maximum singularity $\alpha_{\max}$ and the minimum singularity $\alpha_{\min}$ ,

Equation (436)

which is the width of the singularity spectrum or the singularity width for short. In empirical analysis, the value of $\Delta\alpha$ depends on q and it is not realistic to hope to obtain $\alpha_{\min}$ and $\alpha_{\max}$ with $q\to\pm\infty$ due to the finite size of time series. Two methods can then be adopted. The first one is to calculate $\Delta\alpha$ for a given interval $[q_{\min},q_{\max}]$ ,

Equation (437)

which is used in the majority of the literature. Usually, a symmetric interval of q with $q_{\min}+q_{\max}=0$ is adopted. This method is reasonable because one usually compares the multifractal strengths for different time series on the same interval of q, be they from real systems or surrogates. The second method assumes an explicit expression of the singularity spectrum $f(\alpha)$ , fits the empirical data to it, and determines $\alpha_{\min}$ or $\alpha_{\max}$ as solutions of

Equation (438)

This method does not work when the assumed expressions of $f(\alpha)$ for different time series do not match the empirical multifractal spectra. Moreover, it suffers the shortcoming that the equation $f(\alpha)=0$ may have no solution or only one solution such as for multinomial measures [171].

An alternative measure is to quantify how $f(\alpha)$ deviates from f   =  1 [205, 879, 880], by estimating

Equation (439)

This measure is not recommended because it performs worse than $\Delta\alpha$ . Different binomial measures could have different $\Delta\alpha$ values but with the same value of F  =  1 [171].

8.1.2. Measures based on generalized Hurst exponents.

A similar measure of multifractal strength based on the generalized Hurst exponents is also utilized in some studies [763, 764, 1010], which reads

Equation (440)

This measure is qualified since $H(-\infty)=\alpha(-\infty)$ and $H(\infty)=\alpha(\infty)$ for many multifractal models. Gu et al argued that this measure is not appropriate for some financial time series when there are $q\in[q_{\min},q_{\max}]$ such that $ H(q)>H(q_{\min})$ or $H(q)<H(q_{\max})$ [1011]. This anomalous phenomenon corresponds to the situation that $H(q)$ is not a strictly decreasing function of q, which has been observed for some financial time series [609, 1011] or synthetic time series contaminated by trends [675, 676], and often leads to twisted multifractal spectrum [609, 675]. Hence, Gu and Zhang proposed to calculate $\Delta{H}$ as follows [852]

Equation (441)

In empirical analysis, however, such results should be treated with caution. It is possible that the anomalous phenomenon stems from a improper determination of the multifractal properties, such as a bad choice of the scaling range or an unsuitable preprocessing of the data.

Wang et al defined the following measure by taking monofractals as the reference [1012]

Equation (442)

which is the average of $|H(q)-0.5|$ over all q values used in the multifractal analysis. Grech proposed another measure considering the average distance between $H(q)$ and $H(2)$ for the respective multifractal and monofractal cases [1013]

Equation (443)

which can be modified when multifractal biases are taken into account [1013]. These two measures are essentially similar. However, using $H(2)$ instead of 0.5 allows the measure to investigate long-range correlated time series such as volatility in a more reasonable way.

Morales et al used the following measure [1014, 1015]

Equation (444)

which is very useful when the moments for negative q are not defined. Some researchers use $H(1)$ to study market efficiency, while some others use

Equation (445)

to measure multifratal strength and hence market inefficiency, where q  =  1 and q  =  2 are often chosen [1016]. Wang et al defined the following measure by taking monofractals as the reference [1012]

Equation (446)

which is a generalized form suggested in [1017] with $q_{\max}=-q_{\min}=10$ . Rizvi et al used $q_{\max}=-q_{\min}=4$ [752, 1018]. When $H(2)\approx0.5$ , these quantities reduce to $\frac{1}{2}\Delta{H}$ with $-q_{\min}=q_{\max}=10$ or 4 in equation (440). Note that D is a simplified form of $\overline{\Delta{H}(q,2)}$ in equation (442).

8.1.3. Intermittency.

The intermittency of a time series can also be used to quantify the strength of multifractality [10191021]. The expression of intermittency can be derived through the partition function approach [769]. Let us define a measure $\mu_{i,1}$ as

Equation (447)

which is the normalized volatility at the finest scale if $X(i)$ is the logarithmic price. The average measure in interval $[i,i+s-1]$ of length s is

Equation (448)

where $\mu_{i,s}$ is the total measure in the ith interval as defined in equation (42). The scaling properties are obtained through

Equation (449)

which defines a scaling exponent function $\kappa(q)$ . Combining with the scaling relation (24), we have

Equation (450)

which leads to the following equality

Equation (451)

The intermittency is then defined as [736, 769, 10191021]

Equation (452)

The intermittency is also known as the sparseness or the information codimension because D1 is the information dimension. There are applications to foreign exchange rates [736, 769, 813, 1022], gold prices [736], and stock market indices [736, 1023].

8.1.4. Relationship between different measures.

The multifractal strength measures discussed above are used to qualify market inefficiency. However, there is no consensus on which measure performs best in qualifying market inefficiency. As we will see below, different market inefficiency indexes result in different ranks of markets. Fortunately, some common features exist. For instance, developed markets are more efficient than emerging markets on average, over the set of different measures.

The relationship between different measures is an important problem, which is of theoretical and practical significance. An empirical analysis of 34 stock market indices from 1 January 1995 to 20 September 2010 shows that there is no evident correlation between $\delta{H}(2)=H(2)-0.5$ and $\Delta{H}$ [1011]. In contrast, a further analysis of the daily closing WTI crude oil spot prices from 2 January 1986 to 28 September 2012 unveils a weak correlation between $\delta{H}(2)$ and $\Delta{H}$ with a Pearson correlation coefficient of 0.373 when the window size is 1 year and a strong correlation with a Pearson correlation coefficient of 0.807 when the window size is 4 years [852]. The latter results highlight the importance of choosing proper estimation methods since smaller windows lead to noisier estimates.

8.2. Measuring market inefficiency

The study of market efficiency has a long history. The efficient markets hypothesis has attracted numerous theoretical and empirical interest [128, 129]. Traditional procedures in the weak-form EMH literature check linear serial correlations with the variance ratio tests, unit root, nonlinear serial dependence, long memory mainly quantified by the Hurst exponent, martingale difference sequence tests, time reversibility test, and so on [1024]. A market may also have different efficiency at different periods and more generally time-varying efficiency [1024], which is reminiscent of Lo's 'adaptive markets hypothesis' [1025] and Sornette's 'Emerging Market Intelligence Hypothesis' [2].

8.2.1. Market inefficiency index based on fractality measures.

There are many measures used to qualify the degree of long memory in time series, such as the Hurst exponent H, the fractal dimensions D0 and the first order autocorrelation $\rho(1)$ estimated from different methods. Denoting Mi as the ith measure estimated from the ith method, which is defined on the interval $[M_{i,\min},M_{i,\max}]$ , and $M_i^*$ the corresponding theoretical value for uncorrelated time series, the distance $M_i-M_i^*$ is a measure of market inefficiency. Kristoufek and Vosvrda proposed a combined measure for market inefficiency [1026]

Equation (453)

They call this combined measure the 'efficiency index' with notation EI. However, intuitively, we recommend the name of 'inefficiency index' together with the notation IE. In this way, the market with larger IE is more inefficient.

The market inefficiency index has been used to analyze the efficiency of 41 stock market indices for the period from 2000 to August of 2011 [1026, 1027], 38 stock market indices between January 2000 and August 2011 with different methods [1027], 25 commodities in the period between 1 January 2000 to 22 July 2013 [1028], and 142 worldwide currencies from 1 January 2011 to 30 November 2014 [1029]. It correctly identifies that developed markets are more efficient while emerging markets are inefficient.

There is also cumulating empirical evidence suggesting that multifractal analysis can be used to quantify the degree of inefficiency of markets in the sense that more developed stock markets have weaker multifractality [746, 1010] and an emerging market evolves with time to be more efficient with a narrowing singularity width [1017]. Therefore, the measures of multifractal strength can be used as qualifiers of market inefficiency. It is also common in the literature that the multifractal strength is used as the measure of market efficiency. Similarly, we suggest to treat the multifractal strength as a measure of the market inefficiency rather than of the market efficiency [746, 1010].

When there are two scaling ranges delimited by a crossover scale, we focus on the behavior at large scales since the scaling behavior at small scales is often dominated by the finite-size effects discussed in section 7.1.3. In addition, we compare the multifractal strength of the apparent multifractality because the true multifractality is not always available in the literature. For inefficiency indexes defined from fractal analysis, it is better to perform statistical tests to ensure that the estimated measure is significantly different from that of the uncorrelated counterpart [1005, 1030].

8.2.2. Ranking market efficiency.

Di Matteo et al calculated the generalized Hurst exponents $H(1)$ and $H(2)$ with the structure function approach and used these exponents to differentiate stock markets in their development stage [216]. They ranked 29 foreign exchange rates, 32 Stock market indices, 12 Treasury bond and bill rates with different maturity dates and 16 Eurodollar rates with the maturity dates ranging from 3 months to 4 years. A strong relation between the Hurst exponents and the development stage of the market is observed such that less developed markets have in general higher Hurst exponents $H(1)$ and $H(2)$ . Sensoy and Tabak ranked 27 stock markets in the European Union using $H(1)$ calculated from the daily data from 2 January 1999 to 25 February 2013 [1031] and 17 stock markets from the daily data from 02 January 2003 to 31 January 2013 [1032].

Todea and Plesoianu used the inefficiency measure $\delta{H}(1)=|H(1)-0.5|$ as the dependent argument in a regression model to investigate the influence of foreign portfolio investment on informational efficiency in Central and Eastern European stock markets [1033]. Sensoy obtained the time-varying inefficiency measure $\delta{H}(1)$ of 19 members of the Federation of Euro-Asian Stock Exchanges (FEAS) using daily closing prices from January 2007 to December 2012 and used the average IE to rank these markets [1016]. Sensoy obtained the time-varying inefficiency measure $\delta{H}(1)$ of the 19 FEAS member stock markets using daily closing prices from January 2007 to December 2012 and used the average IE to rank these markets [1016].

Zunino et al performed multifractal analysis on the daily return time series of 32 stock market indices from 2 January 1995 to 23 July 2007 [1010]. They found that on average the group of developed markets have lager values of $\Delta{H}$ and $\Delta\alpha$ than the group of emerging markets. Zunino et al ranked 8 Latin-American market (Argentina, Brazil, Chile, Colombia, Mexico, Peru, Venezuela and the US) by $\Delta{H}$ of the market index returns from January 1995 to February 2007 and identified the US market as the least inefficient [746]. Wang and Wu studied the daily returns of four crude oil futures contracts with the maturities of 1 to 4 months traded on the NYMEX from 2 January 1985 to 10 May 2011 and found that $\Delta{H}$ and $\Delta\alpha$ decrease for longer maturity [146]. Rizvi et al used D with $q_{\max}=-q_{\min}=4$ to rank 11 developed markets and 11 Islamic markets from 1 January 2001 till 31 December 2013 and found that developed markets are on average relatively more efficient than Islamic markets [752].

There are also studies devoted to rank the efficiency of industrial sectors. Zhuang et al compared the multifractal behavior of 10 sector indices of the China stock market from 1 January 2005 to 3 April 2013 and found that the industrial sectors of Information Technology, Health Care and Telecommunication Services are more efficient, while those of energy, materials and consumer discretionary are less efficient [801]. Tiwari et al analyzed the daily closing spot price data for nine Dow Jones sector ETF indices from 12 June 2000 to 23 July 2015 and found that the utilities sector is the more efficient, while the financial sector and the telecommunications sector are the most inefficient [994]. Mensi et al studied the daily returns of 10 DJIM sector indices of Islamic stock markets from 9 November 1998 to 5 March 2015 and found that the consumer goods sector is the most efficient while the financial sector is the most inefficient [995].

8.2.3. Changing efficiency at different periods.

As discussed in section 7.2.4, extreme events have a strong influence on the apparent multifractality of time series, especially for market crashes. In most cases, the market became more inefficient with a wider singularity width after a crash. We survey below the efficiency of markets in different periods. Special attention is paid to political or external events.

The degree of market efficiency changes with different market states. Oświęcimka et al observed different multifractal behavior in the positive and negative returns of 1 min DAX index in two market states, bull (28 November 1997 to 30 December 1999) and bear (1 May 2002 to 1 May 2004), and found that the bear market was more inefficient than the bull market [887]. Arshad and Rizvi compared four East Asian stock market (Malaysia, Singapore, Indonesia and South Korea) from 1 January 1990 until 31 July 2013 and found that South Korea was the most efficient and the four markets were more efficient in booms than in recessions [754]. Rizvi and Arshad divided the NIKKEI index from 1990 to 2013 into 14 successive booms and recessions and found that the market inefficiency index D fluctuated from time to time [1018].

The Chinese currency RMB experienced two major reforms. Before July of 2005, China's RMB was pegged to the USD with very small fluctuations. On 21 July 2005, the People's Bank of China (PBC) launched a reform to take a managed float regime. On 19 June 2010, the PBC resumed its foreign exchange reform to enhance the RMB exchange rate flexibility after halting the RMB appreciation during the global economic recession. Qin et al investigated the evolution of RMB exchange efficiency during the three periods using daily RMB/USD rates from 19 December 1990 to 2 August 2012 and RMB/HKD rates from 13 January 1994 to 2 August 2012 [826]. They found that the values of $\Delta{H}$ in the three periods are 4.918, 0.415 and 0.248 for the RMB/USD and 2.952, 1.232 and 0.569 for the RMB/HKD. It is evident that the RMB markets became more efficient. Stošić investigated the change of market efficiency of eight foreign exchange rates (Australia, Brazil, Malaysia, New Zealand, South Korea, Sweden, Taiwan, and Thailand), which transited from managed exchanged rates to independent float exchange rates [666]. They found that the transition from managed to independent float regime caused a decrease of the singularity width $\Delta\alpha$ , indicating an increase in market efficiency, except for Malaysia ($\Delta\alpha$ increased) and South Korea (comparable $\Delta\alpha$ ).

Some researchers investigated the effect of the price-limit rule on market efficiency. Wang, Liu and Gu investigated the impact of the price-limit reform taking effect on 16 December 1996 on the efficiency of the Shenzhen stock market [1017]. The stock prices could fluctuate freely before the reform. Based on the daily closing price returns of the SZCI index from 3 April 1991 to 15 December 2008, they found that the market inefficiency index D, as defined in equation (446), decreased from 0.89 to 0.36 in the long term, indicating an improved market efficiency after the price-limit reform. Similar results were observed for the Shanghai stock market [1012]. However, the improvement of market efficiency is more likely attributed to the very high inefficiency in the initial years after the establishment of the markets, so that the price-limit reform played a less significant role [1012].

Other events also influence market efficiency. Zhou et al studied the 5 min high-frequency returns of CSI 300 from 4 April 2005 to 19 January 2012 and found that the introduction of the CSI 300 futures (CSI300IF) on 16 April 2010 improved the efficiency of the Chinese stock market, in which $\Delta{H}$ decreased from 0.406 to 0.180 and $\Delta\alpha$ decreased from 0.559 to 0.291 [658]. He et al worked on the daily returns of CSI 300 from 5 April 2006 to 9 May 2014 and found that the width of the multifractal spectrum became narrower from $\Delta\alpha=0.395$ to $\Delta\alpha=0.158$ after the introduction of CSI300IF implying that the market became more efficient [766]. Gu, Chen and Wang studied the daily returns of the WTI and Brent crude oil prices from 20 May 1987 to 30 September 2008 and divided the time series into three periods separated by the two Gulf Wars that broke out on 24 February 1991 and 30 March 2003 [848]. They found that the two markets became more efficient along the three periods, that is, $\Delta{H}_{{\rm I}} > \Delta{H}_{{\rm II}} \gtrapprox \Delta{H}_{{\rm III}}$ . Wang et al investigated the daily closing data of gold prices traded on the COMEX from 13 July 1990 to 15 September 2009 and found that the singularity width $\Delta\alpha$ decreased from 0.558 in the early period before July 1999 to 0.411 in the late period [839].

Recently, Ruan et al conducted an interesting study by considering the effect of the Shanghai-Hong Kong Stock Connect (implemented on 17 November 2014) on the efficiency of both markets using the daily closing prices of SSEC and HSI from 1 January 2007 to 31 December 2016 [1034]. They found that, after the Connect, the multifractality degree quantified by $\Delta{H}$ and $\Delta\alpha$ decreased for the SSEC index and increased for the HSI index, indicating that the Shanghai market became more efficient while the Hong Kong market became less efficient. Their quantitative results further suggest that the Hong Kong market became less efficient than the Shanghai market after the implementation of the Connect, which is counter-intuitive. It suggests that the apparent multifractality may be a biased measure of market inefficiency.

8.2.4. Time-varying efficiency.

Wang et al studied the evolution of the inefficiency D of the daily returns of gold traded on the COMEX from 13 July 1990 to 15 September 2009 [839]. The inefficiency became smaller along time. However, external shocks increased market inefficiency. Ghosh et al calculated the singularity width $\Delta\alpha$ in successive five-year windows of the daily gold price returns from January 1973 to October 2011 and observed an overall decreasing trend [841], which signals the improvement of the gold market efficiency.

Wang et al studied the daily return and volatility time series of the Shanghai SSEC index from 19 December 1990 to 15 December 2008 and determined the values of inefficiency indexes $\overline{\Delta{H}(q,2)}$ and D in moving windows of size 1008 trading days (about four years) [1012]. The two market inefficiency indexes evolved very similarly and exhibited an overall decreasing trend. The inefficiency indexes were very high at the early stage of the Shanghai stock market.

Sensoy and Tabak studied the time-varying efficiency of 27 European stock markets in the European Union using $H(1)$ calculated from the daily data from 2 January 1999 to 25 February 2013 [1031]. They found that the 2008 global financial crisis caused more inefficiency on almost all stock markets while the Eurozone sovereign debt crisis decreased only the efficiency of the markets in France, Spain and Greece. In addition, joining EU for the late members did not have a uniform effect on their markets' efficiency. Sensoy and Tabak also studied 17 global stock markets from 2 January 2003 to 31 January 2013 and confirmed the impact of the 2008 crisis on market efficiency [1032].

Wang and Wu calculated the evolving singularity width $\Delta\alpha$ of the return and volatility time series of gasoline crack spread, which is the difference series between daily spot price data of WTI crude oil and gasoline (New York Harbor) from 2 January 1986 to 26 July 2011, and found that the second Gulf War that broke out on 20 March 2003 caused a sharp increase in market inefficiency while the first Gulf War did not [854].

There are also studies of the time-varying multifractal strength of cross correlations, such as pairs of WTI crude oil and BRIC stock markets [930] and pairs of US interest rates and agricultural commodity markets [864]. However, there is no simple interpretation of the joint multifractality concerning the efficiency of markets.

8.3. Multifractal volatility

8.3.1. Definitions of multifractal volatility.

Let $r(t)$ be the daily return and $RV(t)$ be the realized variance constructed by taking the sum of squared intraday returns. The scaled realized variance is a recommended proxy for volatility [1035]:

Equation (454)

where the sum is carried over the n days of the sample. Inspired by this measure, Wei and Wang introduced the multifractal volatility (MFV) [872]

Equation (455)

where $\Delta\alpha(t)$ is the singularity width obtained from the intraday data on the tth day. To have a full analog to the definition (454), the multifractal volatility could be

Equation (456)

Alternatively, Chen and Wu used realized volatilities $RV(t)$ to replace squared returns $r(t)$ in the definition of multifractal volatility [885]

Equation (457)

The logic behind these definitions of multifractal volatility is that $\Delta\alpha(t)$ can be viewed as a measure of asset risk or market risk.

The singularity width $\Delta\alpha$ is estimated empirically by analyzing asset price time series using the partition function approach [872] or MF-DFA [885]. Although the former method provides illusionary multifractality, the multifractal volatility is found to be strongly correlated with the realized volatility and the multifractal volatility models still show promising predictive power [872].

8.3.2. Volatility forecasting.

The multifractal volatility defined in section 8.3.1 can be incorporated into existing econometric volatility models for volatility forecasting. Consider the ARFIMA$(1, d, 1)$ model as an example, which is expressed as

Equation (458)

where $\mu$ is the mean of the realized volatility, L is the lag operator, coefficients d, $\phi$ and $\theta$ are fixed and unknown, and $ \newcommand{\e}{{\rm e}} \epsilon(t)$ is Gaussian white noise with zero mean and variance $ \newcommand{\e}{{\rm e}} \sigma^2_{\epsilon}$ . Based on the ARFIMA$(1, d, 1)$ model, Wei and Wang proposed the multifractal volatility model as [872]

Equation (459)

which shares the same form as equation (458). Extensions to other financial econometric volatility models are similar, but less studied. Using high-frequency stock market index data, it was shown that the multifractal volatility model shows better performances in one-day ahead volatility forecasting than the realized volatility model, stochastic volatility model and GARCH-type models [872, 875, 885, 10361038]. In contrast, the heterogeneous autoregressive model of realized volatility (HAR-log(RV)) performs better than the multifractal volatility model in predicting future volatility [1039]. The multifractal volatility model can also be integrated into copula models to construct the so-called copula-multifractal volatility model, where the multifractal volatility model is used to describe the marginal distributions of returns and the dynamic copulas model the time-varying dependence structure [875, 1036]. Empirical analysis on high-frequency financial time series shows that in general the copula-multifractal volatility model has better hedging effectiveness than the GARCH-type copula models [875, 1036]. The model can also be used to describe the contagion effect between financial markets [875, 1040].

Another important direction concerning multifractal volatility forecasting is based on the multifractal model of asset returns (MMAR) and in particular the Markov-switching multifractal (MSM) model (see an excellent composition by Calvet and Fisher [1041], and also sections 4.2 and 4.3). In their seminal work, Calvet and Fisher formulated a powerful framework of volatility forecasting [505]. Lux and Morales Arias performed extensive numerical experiments to show that the MSM model outperforms GARCH, FIGARCH, the stochastic volatility model (SV) and the long memory stochastic volatility model (LMSV) in volatility forecasting [1042]. Lux and Kaizoji compared the performance of GARCH, FIGARCH, ARFIMA and lognormal MSM models in volatility forecasting using daily data of two samples of stocks in the Tokyo Market from 1975 to 2001 [1043]. Based on the relative mean squared error (MSE) and mean absolute error (MAE), they found that the long memory models ARFIMA and FIGARCH outperform the short memory GARCH and ARMA models, while the log-normal MSM model performs the best. The performance differences of these models are more significant for longer forecasting horizon. The conclusion is the same for sub-periods (1986–1990, 1991–1995, and 1996–2001), as well as for volume forecasting. Ben Nasr et al investigated the daily data of the Global Dow Jones Islamic Market World Index from 1 January 1996 to 2 September 2013 and found that the MSM model outperforms long memory GARCH-type models (FIGARCH and FITVGARCH), which in turn performs better than the short-memory GARCH model [1044]. Lux and Morales-Arias studied the daily data of 25 all-share stock indices, 11 10-year government bond indices and 12 real estate security indices at the country level from January 1990 to January 2008 [521]. They found that the Binomial MSM and lognormal MSM with either normal or Student innovations have comparable performance in volatility forecasting and perform better than GARCH and FIGARCH. They also found that the combined predictors perform even better than single models and the FIGARCH-MSM model seems to perform the best. Chuang et al found the MSM model performed better than implied, GARCH and historical volatilities in realized volatility forecasting of the daily data of the S&P 100 index option and equity option from 3 January 2000 to 31 October 2009 [1045]. Wang et al considered the daily spot data of WTI and Brent crude oil from 4 January 1993 to 9 September 2013 and found that MSM models provide better volatility forecasts than GARCH-class models and the historical volatility model [1046]. Lux et al confirmed this conclusion on daily data of WTI oil prices from 2 January 1875 to 31 December 1895 and from 2 January 1985 to 24 March 2014 [1047]. Segnon et al confirmed the better performance of the MSM model over GARCH and FIGARCH on the daily carbon dioxide emission allowance prices from 16 January 2009 to 20 January 2015 [1048]. Liu et al compared Binomial MSM models with normal and Student-t innovations and GARCH-type models (GARCHN, GARCH-t and GARCH-Skewed t) using daily closing prices of SSEC from 15 July 1991 to 8 June 2015 and found that the Binomial MSM model with Skewed-t innovations dominates [522].

There are also other models that are reported to be superior to the MSM models. Wang et al proposed a support vector machine (SVM) based MSM approach to forecast volatility of the SSEC index from 2 January 1991 and ending on 31 December 2010 and the SZCI from 2 May 1991 and ending on 31 December 2010 and found that the SVM-MSM model improves the MSM model and both models are superior to the GARCH(1,1) model [1049]. Lux et al found that the RV-ARFIMA model performs the best over the RV-MSM model and other volatility models, especially in tranquil periods, while the RV-MSM model performs much better during turmoil periods [526]. Cordis and Kirby developed a flexible class of discrete stochastic autoregressive volatility (DSARV) models [1050]. They performed empirical analysis of the daily data of five individual US stocks (Alcoa, General Electric, Coca Cola, JP Morgan Chase and Exxon Mobil) and the S&P 500 index from 1 February 1976 to 31 January 2001 and found that the simple first- and second-order DSARV models outperform MSM models in forecasting volatility. Using daily WTI and Brent crude oil price data from 6 January 1992 to 31 December 2014, Charles and Darné found that asymmetric models estimated on jump filtered returns outperform the GARCH, GAS, GJR-GARCH, EGARCH and MSM models on raw returns [1051].

The Markov-switching multifractal duration (MSMD) model [527] has also been applied to volatility forecasting. Aldrich et al studied the transaction-level data of the Chicago Mercantile Exchange near-month E-mini S&P 500 Futures contract from 1 January 2014 to 31 December 2014 and found that the compound MSMD model outperforms the compound exponential model, the compound ACD model and the ACD-GARCH(1,1) model in out-of-sample forecasting of realized volatility [529]. Žikeš et al studied the transaction-level data of three major foreign exchange futures contracts (CHF, EUR and JPY) traded on the Chicago Mercantile Exchange from 9 November 2009 to 29 January 2010 and found that the MSMD models exhibit higher volatility forecasting ability than the long-memory stochastic duration model and the short-memory ACD model [528].

MRW models also have good performance in volatility forecasting. Bacry et al compared the performance of the MRW model and normal and t-Student GARCH(1,1) models in volatility forecasting using the daily prices of four FX rates (CAD, JPY, CHF and GBP) against the USD [532]. For each time series, they calibrated the models using in-sample data from 1 July 1977 to 28 December 1989 and performed out-of-sample prediction from 29 December 1989 to 20 March 2006 at four different prediction scales (1, 5, 20 and 50 d). They found that the MRW model provides smallest mean absolute prediction errors for all time series and at all prediction scales. The advantage of the MRW model over the GARCH models in volatility forecasting has been confirmed for the daily returns of 29 French CAC40 constituent stocks from 1990 to 2005 [533]. Duchon et al obtained precise prediction formulas for volatility forecasting and option pricing by avoiding the problem of the estimation of the so-called 'integral length' (a key parameter of the MRW model) [1052], as highlighted by Saichev and Sornette [538].

There are also efforts to predict higher order moments such as the kurtosis, based on the scaling behavior of financial returns, which are known as return density forecasts [1053, 1054]. The multifractal approach has been shown to provide substantial improvements in predictive ability of the return density over the ARCH and GARCH(1,1) models for the EUR/USD exchange rate, and comparable results for forecasting the daily return density of the S&P500 and NASDAQ-100 equity index [1054].

8.3.3. Value-at-risk forecasting.

Wei et al proposed a method to measure daily Value-at-Risk (VaR) through the combination of a multifractal volatility based econometric model and extreme value theory [1055]. Based on high-frequency SSEC data, VaR backtesting techniques show that the proposed VaR measures outperform many linear and nonlinear GARCH-type models at high-risk levels [1055].

Lee et al used the MMAR model for VaR forecasting [1056]. Empirical analysis of the daily data of KOSPI, S&P500, and USD/KRW foreign exchange from 1990 to 2012 shows that MMAR generates more stable and accurate VaR forecasting than Gaussian VaR, t-distribution VaR, GARCH VaR, and historical VaR [1056]. Batten et al studied the 5 min returns of EUR/USD spot quotes and trading ticks from 5 January 2006 to 31 December 2007 and found that a modified MMAR model performs better than the historical simulation approach and the GARCH(1,1) location-scale VaR model [499].

Lux et al worked on the daily data of WTI oil prices from 2 January 1875 to 31 December 1895 and from 2 January 1985 to 24 March 2014 and found that MSM gives better VaR forecasting than GARCH models [1047]. Ben Nasr et al studied the daily data of the Global Dow Jones Islamic Market World Index from 1 January 1996 to 2 September 2013 and found that the MSM model performs better than long memory GARCH-type models (FIGARCH and FITVGARCH), which in turn outperform the short-memory GARCH model [1044].

Malo investigated the Nord Pool electricity spot prices from March 1998 to January 2006 and found that the Copula-MSM model has better VaR prediction ability than GARCH models [520]. Liu and Lux considered the daily data of two stock market indices (the Dow Jones composite 65 average index and the NIKKEI 225 average index) from 6 January 1970 to 30 December 2008, two foreign exchange rates (USD/BPG and DEM/GBP) from 1 March 1973 to 31 December 2008, and a bond portfolio of US 1 and 2 year treasury constant maturity bond rates from 1 June 1976 to 31 December 2008 [519]. They found that the bivariate MSM model with heterogeneous volatility correlations provides better forecasting results than the homogeneous benchmark and the bivariate DCC-GARCH model [519]. Herrera et al proposed a MSM Peaks-Over-Threshold (MSM-POT) model and compared the performance of the MSM-POT model with self-exciting models and a GARCH-EVT approach on the extreme returns of six daily commodity futures prices (Brent and WTI crude oil, cocoa, cotton, copper, and gold) [1057]. They found that the MSM-POT model provides the most accurate VaR forecasts.

Bacry et al compared the performances of MRW and normal and t-Student GARCH(1,1) models in 1 d conditional VaR forecasting using the daily prices of four foreign exchange rates (CAD, JPY, CHF and GBP) against the USD [532]. Compared to GARCH-type models, the MRW model provides comparative or better results at different VaR levels from 0.5% to 10%. The advantage of the MRW model over GARCH-type models in VaR forecasting has been confirmed for the daily returns of five stock market indices (CAC40, FUTSEE, DAX, DJIA and NIKKEI) from 1990 and 2005 [533].

8.4. Complex networks

8.4.1. Multiscale cross-correlation coefficients.

The superposition law $F^2_{x+y}(s) = F^2_{x}(s) + F^2_{x}(s)$ presented in equation (360) is derived based on the assumption that the two time series are independent. Inspired by the superposition law, Yang proposed a new, maybe the first, multiscale correlation coefficient [1058]

Equation (460)

The correlation $\rho_{xy}(s)=1$ at all scales if x  =  y .

Zebende defined the multiscale detrended cross-correlation coefficient (known as DCCA coefficient for short) as [1059]

Equation (461)

where Fxy(s) and Fxx(s) are respectively the overall fluctuation functions from the DFA-based DCCA method and the DFA method. The coefficient $\rho_{{\rm DCCA}}(s)$ can be easily extended to define other multiscale cross-correlation coefficient based on different cross-correlation analysis, such as the DCCA with general local trends besides the polynomial trends [455], the detrending moving-average cross-correlation ($\rho_{{\rm DMCA}}$ ) [1060], the cross-correlation analysis ($\rho_{{\rm CCA}}$ ) with signs [1061], the joint structure function approach ($\rho_{{\rm HXA}}$ ) [1062]. There are also wavelet-based cross-correlation coefficients or wavelet coherence as discussed in section 3.3.2. To assess the significance level of the empirically estimated $\rho(s)$ values, one should perform statistical tests with proper null models [455].

The properties of the DCCA coefficient and its advantage over conventional Pearson coefficient have been extensively studied through numerical experiments with mathematical models [455, 10591065]. It has been proved for $\rho_{{\rm DCCA}}$ [455], $\rho_{{\rm DMCA}}$ [455, 1060], and $\rho_{{\rm CCA}}$ [1061] that

Equation (462)

where $\rho(s)=-1$ if X  =  −Y, $\rho(s)=1$ if X  =  Y, and $\rho(s)=0$ if X and Y are uncorrelated. It is reported that the CCA coefficient $\rho_{{\rm CCA}}$ performs better than the DCCA coefficient $\rho_{{\rm DCCA}}$ as $\rho_{{\rm CCA}}$ can better capture the intrinsic cross correlation between time series [1061] and the DMCA coefficient $\rho_{{\rm DMCA}}$ outperforms $\rho_{{\rm DCCA}}$ and $\rho_{{\rm HXA}}$ since the variance of $\rho_{{\rm DMCA}}$ decays the fastest [1062].

Kwapień et al extended the multiscale DCCA coefficient to the q-dependent detrended cross-correlation coefficient (or multiscale qDCCA coefficient) based on MF-DFA and MF-CCA [1061]:

Equation (463)

When q  =  2, equation (463) reduces to equation (461). They proved that

Equation (464)

when q  >  0, which does not hold when q  <  0. The performance of the qDCCA coefficient has been investigated with cross-correlated, partially cross-correlated and uncorrelated time series [1061].

Similarly, the DPXA or DPCCA coefficients are defined as [476, 477]

Equation (465)

which considers common external driving forces on time series X and Y. Considering the USD index as a common external factor of the prices of crude oil and gold, it is found that the intrinsic multiscale cross-correlation coefficients between gold and oil are lower than their crude counterparts $\rho_{{{\rm g}},{{\rm o}}:{{\rm d}}}(s) <\rho_{{{\rm g}},{{\rm o}}}(s)$ [477]. Lin et al studied the net cross-correlations and influence among five major world gold markets (London, New York, Shanghai, Tokyo and Mumbai) at different time scales and identified the London gold market as the most influential [1066]. There are other similar measures like partial wavelet coherence [1067, 1068].

8.4.2. Multiscale DCCA networks.

In contrast to the Pearson correlation coefficients, the scale-dependent DCCA coefficients are less sensitive to contaminated noises and the amplitude ratio between slow and fast components [1069]. Hence, the DCCA coefficients are used to construct multiscale DCCA matrices for a bundle of financial time series, where the element for scale s is $\rho_{ij}(s)$ [10701072]. Here, the term 'DCCA' refers to the general meaning that is not constrained to detrended cross-correlation analysis [179] and different types of DCCA coefficients presented in section 8.4.1 can be adopted. Such a construction results in multiscale DCCA matrices.

The multiscale DCCA correlation or distance matrices can be studied under the framework of random matrix theory, mainly to identify the information contents in eigenvalues deviating from their distribution predicted from the null hypothesis and their corresponding eigenvectors [6163]. Wang et al compared the results of DCCA matrices and Pearson correlation matrices using the daily closing prices of 462 constituent stocks of S&P 500 index from 3 January 2005 to 31 August 2012 [1070]. They found that the eigenvalue distributions of random DCCA matrices are different from those of random Pearson correlation matrices. The minimum and maximum eigenvalues and the ensemble eigenvalue distributions can be empirically obtained for a given sample of financial time series. Similar to the case of the correlation matrix [1073], the largest eigenvalue of the DCCA matrix reflects the movement mode of the whole market. Conlon et al investigated the dynamics and the Epps effect [91, 1074, 1075] with time-varying multiscale MODWT cross-correlation matrices of daily returns of 49 Euro Stoxx 50 stock from May 1999 to August 2007 and the 1 min returns of these stocks from May 2008 to April 2009 [1076]. They found that the optimal portfolio depends not only on scale but also on the moving window size. Sun and Liu used the DCCA matrix to study the optimal portfolio strategy, in which the matrix element is the average DCCA coefficient over different scales [1077].

The clustering features and hierarchical structures of multiscale DCCA distance matrices are often investigated through their minimum spanning trees (MSTs) [60] or planar maximally filtered graphs (PMFGs) [1078]. Wang et al investigated the daily FX rates of 44 major currencies in the period of 2007–2012 and found that the exchange rates exhibit geographic features in the MSTs. In addition, the normalized tree length, the average shortest path length and the maximum degree decrease with s [1071]. Wang et al studied the multiscale DCCA MSTs and PMFGs of the daily closing prices of 457 constituent stocks of the S&P 500 Index during the period from 3 January 2005 to 31 December 2012, where the DCCA coefficients are obtained through the maximal overlap discrete wavelet transform (MODWT) [1072]. They found that the network structure fluctuates at different timescales and there is an evident sector clustering effect for stocks.

Multifractal extension of DCCA coefficients results in q-dependent DCCA coefficients $\rho_{ij}(q,s)$ or MF-DCCA coefficients [1061], which can be used to form MF-DCCA correlation or distance matrices [1079]. Empirical analysis of the 1 min returns of the 100 largest stocks traded on the New York Stock Exchange over the period of 1998-1999 shows that the structures of the q-MST graphs vary across different q and s values, such that the q-MSTs complement the MF-DCCA coefficients $\rho_{ij}(q,s)$ in disentangling 'hidden' correlations that cannot be uncovered from the DCCA MST graphs [1079]. In addition, more diverse cross-correlations among return time series lead to more variable tree topology for different q's.

The basic DCCA and MF-DCCA matrices are fully connected, which can be manipulated to construct other networks. Unweighed networks can be deduced by posing thresholds on the DCCA coefficients. Two nodes are connected if and only if the absolute DCCA coefficient exceeds the threshold. Alternatively, a more natural threshold is determined by the critical value that the DCCA coefficient is significantly different from zero. Moreover, we can also consider the signs of the DCCA coefficients, which should have influences on the clustering of nodes.

8.4.3. Multifractality-based networks.

Due to the bivariate nature of the generalized Hurst exponent function Hij(q), of the joint mass scaling exponent function $\tau_{ij}(q)$ and of the joint multifractal spectrum $f_{ij}(\alpha)$ , different networks can be constructed based on these functions. The main concern lies in their performance compared with other methods.

Catalano and Figliola constructed the matrix whose element is the bivariate Hurst exponent Hij between the return time series of two assets i and j  [1080]. An unweighted, undirected network is then introduced whose element Mij  =  1 if |Hij  −  0.5|  >  u and 0 otherwise. They chose the threshold u to be u  =  0.04 and studied the daily price data of 6 commodity indices (industrial metal, precious metal, grain, energy, softs, and livestock) constructed from 24 commodities by Dow Jones, which cover the period from 2 January 1991 to 23 April 2012. They found that the two networks in the last seven years are more densely connected. While this idea of network construction is suggestive, one should remember that, for financial returns, the Hij exponents are usually close to 0.5. Therefore, athreshold of u  =  0.04 does not guarantee a statistically significant deviation from 0.5.

Lin et al defined the average Hurst surface distance as [462]

Equation (466)

where the average is taken over q and s [462]. The distance matrix is able to cluster stock market indices in different geographic regions, but not for individual stocks [462]. Natural extensions of the average Hurst surface distance matrix are to vary the order q and the scale s.

Xu and Beck introduced the Rényi difference matrix Rij [415] (see also section 2.5.9),

Equation (467)

where $q_{\max}$ and $q_{\min}$ are the maximum and minimum values of q under consideration, $\kappa$ is a parameter, and $K(q)$ is the Rényi entropy

Equation (468)

Although this method is initially introduced from the symbol sequences mapped from financial time series, it is generally suitable for other multifractal analysis methods. The Rényi difference matrix has the potential to identify stocks with unusual behaviors [415].

Similarly, we can define the multifractality difference matrix based on $H(q)$ , $\tau(q)$ and $f(\alpha)$ . Taking $H(q)$ as an example, we have

Equation (469)

which represents the difference of multifractality in time series i and j . This could be an interesting topic for future research.

8.4.4. Time series of network variables.

The network nodes are characterized by many local variables, such as degree and cluster coefficient. If the growth process of the network (or the time stamps of nodes) is known, we can obtain the time series of local variables. These time series can be investigated with multifractal analysis and the results might contain useful information about the networks.

One example concerns the networks mapped from time series such that the nodes have time stamps corresponding to the raw time series. Indeed, there are many methods that map a time series to a complex network, in which a data point or a segment of data points converts to a node and an edge is formed due to certain relationship between nodes [416, 10811084]. The statistical properties of the time series of node degree or clustering coefficient can be investigated. Using the nearest neighbour approach [1082], Caraiani constructed three complex networks mapped from the daily returns of the Czech PX from April 1994 to December 2010, the Hungarian BUX from June 1993 to December 2010, and Polish WIG from June 1993 to December 2010 and found that the time series of node clustering coefficient possess multifractality [1085].

Another example is to label time stamps for the nodes following the movements of a random walker abiding to specific rules [10861088]. Multifractal cross-correlation analysis (MF-CCA) of the time series of degree, clustering coefficient, and closeness centrality can distinguish between Erdös–Rényi, Barabási–Albert, and Watts–Strogatz networks [1089]. Also, the degree of right-sided asymmetry of multifractal spectra of the time series of these local node variables is linked with the degree of small-worldness present in networks [1090]. This method can be applied to financial networks.

8.4.5. Eigenportfolios.

Denote $\lambda_k$ and ${\mathbf{u}}_k=[u_{k1}, \cdots, u_{kN}]^{{\rm T}}$ as the kth largest eigenvalue of the correlation matrix and its associated eigenvector. For each eigenvector, we can construct its eigenportfolio, whose returns are calculated as

Equation (470)

where ${\mathbf{r}}(t)=[r_1(t),\cdots,r_i(t),\cdots, r_N(t)]^{{\rm T}}$ , and ${\mathbf{u}}_k^{{\rm T}}{\mathbf{r}}$ denotes the projection of the return time series on the eigenvector ${\mathbf{u}}_{k}$ . The use of the normalization term is to ensure that the constructed return Rk is the return of a portfolio. It is of interest to investigate the statistical properties of the eigenportfolios. An analysis of the 5 min return time series of 100 stocks in the American market (NYSE and NASDAQ) shows that the eigenportfolio return series for different eigenvectors have different multifractal spectra and the one for the largest eigenvalue $\lambda_1$ is to the right of the average spectrum of all other eigenportfolios [10911093].

8.4.6. Correlation matrix structure and multifractality.

Correlation matrices are constructed from a bundle of time series and the structure of correlation matrices is expected to be related to the properties of the time series. In an interesting work, Morales et al unveiled the relationship between correlation matrix structure and multifractal strength [1094]. They constructed a hierarchical tree using clustering algorithms. For each node (or time series), its hierarchical order n can be determined as the number of nodes on the unique path from the root node to the node under consideration. Based on the daily returns of the 342 most capitalised stocks continuously traded on the NYSE from 2 January 1997 to 31 December 2012, a significant correlation between n and $\Delta{H(1,2)}$ is observed up to a certain level of hierarchy.

8.5. Other applications

8.5.1. Calibration of agent-based models.

Agent-based modelling provides a computational view for the understanding of complex human behavior in socioeconomic systems [2], which might be 'a better way to help guide financial policies' [1095]. Agent-based models are constructed with a set of microscopic rules. These microscopic rules could be incomplete or misspecified. Hence, it is of crucial importance to perform model calibrations by comparing the macroscopic properties between the financial time series generated by computational models and by real markets. Besides the main stylized facts, the multifractal nature of real markets is also considered in model calibration by some researchers.

The Cont-Bouchaud model is a percolation model built on a d-dimensional lattice Ld, in which the probability a that a cluster of traders want to buy is a key model parameter [565]. The multifractal analysis of the returns using the structure function approach illustrates that the scaling exponent function $\xi(q)$ varies with model parameters and properly determined parameters are able to produce results comparable to the DJIA index [567]. An incomplete list of other studies include the multifractal analysis of the return recurrence intervals [588] in the modified Mike-Farmer model [585, 587], the multiscaling behavior of cumulative absolute returns in a double auction model [593], and multifractality in the time series in a continuum percolation model [577].

8.5.2. Detection of outliers.

There are efforts to use multifractal models for the detection of outliers in time series. Based on the bivariate MSM model of Calvet, Fisher and Thompson [517], Idier derived indicators for the detection of crises and extreme volatilities and applied the method to pairs of stock market indices (NYSE, FTSE, DAX and CAC) [1096].

There are also studies attempting to provide early warning signals of outliers using multifractal strength. Cristescu et al explored the possibility of using intermittency to predict the occurrence of financial crises [1097]. da Fonseca et al defined the area variation rate based on the analogous specific heat $-\tau''(q)$ and suggested that it has potential power in identifying historical crashes [1098]. Morales et al calculated the dynamical generalized Hurst exponents through the weighted structure function approach and found hints that the multifractal strength $\Delta{H(1,2)}=H(1)-H(2)$ can be a tool to monitor unstable periods in financial time series [1014, 1015]. Fernández-Martínez et al suggested to use $H(1)$ as an indicator for the identification of herding behavior and bubble onset [1099]

8.5.3. Trading strategy.

There are quite a few applications of the partition function approach to asset prices or indices directly, showing illusionary multifractality (see section 6.1.9). A characteristic feature of the illusionary multifractality is that the singularity spectrum shrinks to the vicinity of $(\alpha,f)=(1,1)$ such that the singularity width $\Delta\alpha$ is close to 0. However, there seems to be information content in illusionary multifractality. Maillet and Michel found that the singularity width positively correlates to the market shock index quantifying the probability of exceeding the current volatility values, in which the Spearman correlation coefficient is equal to 0.87 [1100]. Some researchers report that the width of the estimated multifractal spectrum is correlated to the price fluctuation in the future and thus can be used to predict price fluctuations [869, 873, 877]. Analogous to $\Delta\alpha$ , some researchers also used another quantity [869, 870, 873, 877, 952]

Equation (471)

which is reported to be able to uncover some information about price fluctuations.

Alternatively, Dewandaru et al used the inefficiency index D, together with momentum factors to construct factor models for the forecasting of future returns [1101]. The analysis was conducted on daily returns in moving windows of three years of a sample of Islamic stocks listed in the Dow Jones Islamic market from 1996 to 2011. They found that the strategy based on multifractal strength produced monthly excess returns of 6.12%, outperforming other momentum strategies with higher Sharpe ratio.

9. Perspectives

Multifractal analysis for univariate and multivariate financial time series has attracted a strong interest in the econophysics community, caused mainly by the invention of new methods and the easier accessibility to huge amount of financial data. There are still many new multifractal analysis methods being proposed, most of which are variants of existing classical methods. In this direction, we argue that one must assess the performance of the new methods with extensive numerical experiments on mathematical models and compare their performance with other competing methods. It is not easy to invent new methods with better performance. However, such methods are still valuable in deepening our understanding. A pivotal task in multifractal analysis of empirical time series is to choose a method. However, there is no consensus on which method is the best. Although it is possible to rank the performance of different methods for a specific mathematical model, the results often change for different mathematical models. Summarising the existing literature, overall, we can recommend methods based on DFA, DMA and wavelet leaders.

Another pivotal task in empirical multifractal analysis is to determine the suitable scaling range, over which the scaling exponents are estimated. Usually, a small change of the scaling range will result in significant variations in the estimated exponents. It is not uncommon that the mass scaling exponents and the multifractal spectrum extracted from empirical time series are problematic, especially when there are linear or nonlinear trends in the time series. Consequently, a wrong estimate of the degree and nature of multifractal is obtained, which has harmful effects on understanding the behavior of financial markets. The choice of the scaling range is more important and difficult for short time series, which is common for low-frequency financial time series. Although many subjective (by eye-balling) and objective criteria have been proposed, further studies on the determination of scaling range are required. The main reason is that the existing criteria are specific to different degrees to the cases under study and thus lack universality.

Theoretically and practically, it is of great significance to dissect the sources of apparent multifractality for empirical time series. Macroscopically, one usually considers the linear correlations, the nonlinear correlations and the non-Gaussian distribution of the time series. Although it is not clear if the intrinsic multifractality is introduced by the nonlinearity only, it is clear that a time series with only linear correlations and fat-tailed distribution cannot produce any multifractality. In empirical studies, measures of multifractality strength are calculated based on the apparent multifractality. We argue that further scrutiny is necessary on this issue. Otherwise, for example, one may obtain an unreliable ranking of the efficiency of markets. On the other hand, researchers have also proposed different classes of agent-based models (ABMs) for financial markets to generate time series that possesses multifractality. Indeed, agent-based modelling provides us a nice viewpoint to understand the sources of multifractality in financial time series from the microscopic perspective, which is unfortunately less explored. Studies can be conducted to identify microscopic ABM rules that cause the emergence of multifractality in macroscopic time series. Furthermore, it helps uncover the relationship between human behavior and multifractality in financial markets.

So far, there is a large consensus on the presence of multifractality in most financial time series, such as returns, volatilities, trading volumes, recurrence intervals and intertrade durations. Hence, it is no longer the priority to continue confirming and adding new evidence on the presence of multifractality in these financial quantities. Rather, one should work on other financial quantities. Furthermore, in the study of multifractal cross correlations, common driving forces can be considered, in order to extract the intrinsic multifractal cross-correlations between time series. Other applications of multifractality, which are more demanding, illustrate the usefulness of multifractal analyses in financial markets, such as asset pricing and risk management.

Another very promising direction is to relate multifractal analysis to complex financial networks. On the one hand, we can construct equity networks based on the multifractal nature of financial time series. Some efforts have already made. There are however much work to do. On the other hand, we can study the multifractal nature of complex financial networks and it implications as well. We expect to witness this research direction to flourish in the coming years.

Acknowledgments

We thank Rui-Qi Han, Jun-Chao Ma, Wen-Bo Tang, Yu-Lei Wan, Yan-Hong Yang, Peng Yue, and Xiao Zhang, especially Xing-Lu Gao and Peng Wang for assistance. This work was partially supported by the National Natural Science Foundation of China (Grant Nos. U1811462, 71532009) and the Fundamental Research Funds for the Central Universities.

Please wait… references are loading.

Biographies

Zhi-Qiang Jiang

Zhi-Qiang Jiang is a Professor of Finance and a Professor of Management Science and Engineering at East China University of Science and Technology. He received his PhD in applied mathematics in 2011 from East China University of Science and Technology. His scientific interests focus mostly on econophysics and sociophysics, especially multifractality in finance, human cooperation and altruistic behaviors.

Wen-Jie Xie

Wen-Jie Xie is an Assistant Professor of Finance at East China University of Science and Technology. He received his PhD in applied mathematics in 2014 from East China University of Science and Technology. His scientific interests focus mostly on complex networks and machine learning, especially financial networks, trading networks and systemic financial risks.

Wei-Xing Zhou

Wei-Xing Zhou is a Professor of Finance, a Professor of Applied Mathematics, and a Professor of Management Science and Engineering at East China University of Science and Technology. He received his PhD in chemical engineering in 2001 from East China University of Science and Technology and did his post-doc at the University of California, Los Angeles in the group of Didier Sornette. His scientific interests focus mostly on econophysics and sociophysics, especially multifractal analysis, financial bubbles and antibubbles, complex socio-economic networks, and systemic financial risks.

Didier Sornette

Didier Sornette is a Professor on the Chair of Entrepreneurial Risks, associated with the department of Physics, and the department of Earth Sciences at ETH Zurich. He is also special professor at the Institute of Innovative Research of Tokyo Tech, Tokyo and Chair Professor at SUSTech (Southern University of Science and Technology), Shenzhen, China and dean of the Institute of Risk Analysis, Prediction and Management, Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China. He obtained his PhD and Habilitation in Physical Sciences at the University of Nice in 1985 and did his post-doc at College de France in the Condensed Matter Laboratory of Prof. P.G. de Gennes. He is Director of the Financial Crisis Observatory, co-founder of the ETH Risk Center, Member of the Swiss Finance Institute, and AAAS Fellow. His scientific interests focus on the prediction of crises, dragon-kings and extreme events in all scientific fields.