Comparing High Dimensional Conditional Covariance Matrices: Implications For Portfolio Selection
Comparing High Dimensional Conditional Covariance Matrices: Implications For Portfolio Selection
Comparing High Dimensional Conditional Covariance Matrices: Implications For Portfolio Selection
b
Big Data Institute
Universidad Carlos III de Madrid
and
Department of Economics
Universidade Federal de Santa Catarina
c
Department of Statistics
Universidad Carlos III de Madrid
Abstract
Portfolio selection based on high dimensional covariance matrices is a key challenge in data-rich
environments with the curse of dimensionality severely affecting most of the available covariance
models. We challenge several multivariate Dynamic Conditional Correlation (DCC)-type and
Stochastic Volatility (SV)-type models to obtain minimum variance and mean-variance portfo-
lios with up to 1000 assets. We conclude that, in a realistic context in which transaction costs are
taken into account, although DCC-type models lead to portfolios with lower variance, modeling the
covariance matrices as latent Wishart processes with a shrinkage towards the diagonal covariance
matrix delivers more stable optimal portfolios with lower turnover and higher information ratios.
Our results reconcile previous findings in the portfolio selection literature as those claiming for
equicorrelations, a smooth dynamic evolution of correlations or correlations close to zero.
Keywords: GARCH, Minimum variance portfolio, Mean-variance portfolio, Risk-adjusted re-
turns, Stochastic volatility, Turnover-constrained portfolios.
JEL classification: C53; G17.
∗
Corresponding author. e-mail: [email protected]
1
The Wishart specification has been recently used by Hautsch and Voigt (2019) to model large realized covariance
matrices.
1/2
rt = Ht εt (1)
where εt is an N × 1 Gaussian white noise vector with covariance IN , the N × N identity matrix,
and Ht is the N × N positive definite conditional covariance matrix of rt at time t. In this section,
we briefly describe the alternative specifications to forecast covariance matrices of large systems of
returns considered in this paper.
If covariance matrices were constant over time, Ht = H is the unconditional covariance matrix
of asset returns, which can be estimated by the sample covariance matrix of returns. Alternative
estimators of the unconditional covariance matrix are considered in the Supplementary Material.
One of the most popular specifications of the conditional covariance matrix, Ht , in equation
(1), prominent in the industry and among market participants, is based on the RiskMetrics 1994
(hereafter RM-1994) methodology; see J.P.Morgan/Reuters (1996), Mina and Xiao (2001) and
Alexander (2008). According to RM-1994, one-step-ahead conditional covariance matrices are
obtained as an exponential weighted moving average (EWMA) of quadratic forms of past returns
as follows
t−1
X
0
Ht = (1 − λ) λi−1 rt−i rt−i . (2)
i=1
where λ = 0.94 for daily data. Note that, given that λ = 0.94, the weight placed in older observa-
tions is decreasing exponentially.2 Zumbach (2007a,b) extend RM-1994 to the RM-2006 approach
by proposing a (pseudo) long-memory model for the covariance matrices in which the weights of
past quadratic forms of returns decay hyperbolically rather than exponentially. According to RM-
2006, the conditional covariance matrices are obtained as a weighted sum of EWMAs rather than
a single EWMA, as follows
14
X
Ht = ωi Σit−1 , (3)
i=1
h i
where the weights are given by ωi = 1c 1 − ln(τ i)
7.35
with c being a normalization constant that
√ i−1
ensures that 14
P
i=1 ωi = 1 and τi = 4 × 2 . The EWMAs in equation (3) are given by
where λi = exp −1
τi
.3
It is important to note that, if there are more observations than assets, the RiskMetrics covari-
2
The EWMA filter is a particular case of the filter obtained if the Kalman filter were implemented when the
parametric model for conditional covariance matrices is the multivariate stochastic volatility model of Harvey, Ruiz
and Shephard (1994) with all variances and covariances restricted to have the same variances of the transition noise
and such that the smoothing parameter is 0.94. The only difference is that in the model proposed by Harvey,
Ruiz and Shephard (1994), the specification is for log-variances while in the RiskMetrics methodology variances are
modelled directly.
3
We use the Matlab routine riskmetrics2006 available in the MFE Toolbox provided by Kevin Sheppard; see
Sheppard (2012) for details.
We consider the DCC specification proposed by Engle, Ledoit and Wolf (2019) that merges
the original DCC model of Engle (2002) with the shrinkage principle, which is largely applied to
portfolio optimization problems in order to obtain covariance matrices less prone to estimation
error, specially in high dimensional problems; see, for instance, Ledoit and Wolf (2004a, 2017a).
In the DCC model, Ht is decomposed as follows
Ht = Dt Ψt Dt , (5)
where Dt is an N × N diagonal matrix with its i-th diagonal element, hi,t , being the conditional
standard deviation of the i-th asset. We assume that each h2i,t follows a univariate GARCH(1,1)
process although a variety of univariate conditional variance specifications could be used for this
purpose. Finally, Ψt is the conditional correlation matrix of the returns, which is governed by the
following correlation targeting dynamics
where st = (r1,t /h1,t , . . . , rN,t /hN,t )0 and α and β are scalar parameters guiding the dynamics of all
correlations and C is the unconditional covariance matrix of st .
Estimation of the DCC model is carried out in three steps. In the first step, QML estimates of
the parameters of the univariate GARCH(1,1) models for each asset are obtained. The estimated
volatilities are used to devolatize the return series.
In the second step, the unconditional covariance matrix, C, is estimated. Engle (2002) proposes
estimating C by the sample covariance matrix of the devolatized returns, st . It is known, however,
that the standard sample covariance estimator is prone to estimation error. To circumvent this
problem, Engle, Ledoit and Wolf (2019) propose estimating C by using the non-linear shrinkage
Given that, in equation (1), the conditional means of returns are assumed to zero, we follow
Windle and Carvalho (2014) and adopt only the WSV part of the model proposed by Uhlig (1994)
and Uhlig (1997). The WSV model specifies a multiplicative law of motion for the stochastic
precision matrix, Ht−1 , which is driven by a singular multivariate Beta distribution shock as follows
d+1
Ht−1 = −1 0
U(Ht−1 −1
) Θt U(Ht−1 ), (7)
d
where U(Ht−1 ) is the upper triangular matrix obtained from the Cholesky decomposition of Ht−1 and
Θt are random iid draws from an N -dimensional singular multivariate beta distribution, BN ( d2 , 12 ),
4
Alternative estimators of the matrix C are considered in the Supplementary Material.
5
In order to estimate the DCC-NLS model, we use and adapt some of the Matlab codes of the MFE Toolbox
developed by Professor Kevin Sheppard from Oxford University and available in his web page.
−1
p(Ht+1 |rt ) ∼ WN (d, [dSt+1 ]−1 ), (9)
d 1
St+1 = St + rt r 0 . (10)
d+1 d+1 t
It is worth noting that the dynamics of the precision matrix is governed by a unique parameter,
d, that can be estimated by Maximum Likelihood (ML); see Kim (2014). We refer the reader to
Section A of the Supplementary Material that brings additional details regarding filtering and ML
estimation of the WSV model considered in this paper.
Substituting backwards in equation (10), it is possible to obtain the following expression for St
t−1
X
0
St = λt S0 + (1 − λ) λi−1 rt−i rt−i , (11)
i=1
d
where λ = d+1
< 1. In expression (11), the evolution of St is the same as that implied by the RM-
1994 covariance matrices in (2). The main difference between (2) and (11) is that in the former,
d
the discount parameter is fixed at λ = 0.94 while in the latter, λ is estimated. Given that λ = d+1
and d > N + 1, if N is large, λ ≈ 1. Consequently, (11) implies that the estimated conditional
6
See the Supplementary material in the online appendix for details.
Very recently, De Nard, Ledoit and Wolf (2020) propose estimating conditional covariance
matrices of large dimensions by blending an approximate factor model (AFM) with time-varying
conditional heteroscedastic idiosyncratic noises and show empirically that this combination yields
portfolios with minimum variance among those considered by them. Assuming that there is just
one unique common factor, De Nard, Ledoit and Wolf (2020) propose the following AFM-DCC-NLS
7
In the empirical application following later in this paper, T = 1250.
10
rt = A + Bft + ut , (12)
where A = (α1 , ..., αN ) and B = (β1 , ..., βN ) are N × 1 vectors of constants and factor loadings,
respectively, ft is the market factor assumed to have zero mean and variance σf2 and ut is an N × 1
conditionally Gaussian white noise vector with covariance matrix Σut 8 . The conditional covariance
matrix of rt is given by:
Ht = σf2 BB 0 + Σut . (13)
Estimation of the AFM-DCC-NLS model is performed in two steps. In the first step, after
estimating by OLS the parameters in A and B in (12), the residuals, ût , are obtained as usual. In
the second step, ût are used to estimate Σut , the time-varying conditional covariance matrix of the
residuals, using the DCC-NLS described above.
Define the return of a portfolio at time t as Rt = wt0 rt with wt = (w1,t , ..., wN,t )0 being the
portfolio weights at time t obtained at time t − 1. In this section, we describe the portfolio
selection policies considered in this paper and the criteria for evaluating their performance.
We start by considering two very simple portfolio strategies. First, we consider equal-weigted
1
(EW) portfolios with the weight of each asset being equal to wit = N
, ∀i, t.
The second portfolio considered, proposed by Kirby and Ostdiek (2012), is the volatility timing
(VT) (or inverse variance) portfolio. In this portfolio, the weight of each asset is proportional to
8
We consider the AFM model with only one factor (the market factor), since the results in De Nard, Ledoit
and Wolf (2020) suggest that additional factors do not improve the performance of portfolios. The market factor is
defined as the return of the market portfolio in excess of the risk-free rate and is obtained from Kenneth French’s
data library.
11
(1/σi2 )
wit = PN 2
, ∀t, (14)
i=1 (1/σi )
where σi2 is the variance of the i-th asset. The VT portfolio is equivalent to the solution of the
minimum variance portfolio described below obtained when all off-diagonal covariance elements of
Ht are equal to zero. Two interesting aspects of the VT portfolio are that i) it does not require
any covariance matrix inversion, and ii) it does not generate negative weights. In practice σi2 is
estimated by the sample variance of the i-th asset.
The third portfolio selection policy considered is based on an investor who adopts the minimum
variance criterion in order to decide her portfolio allocations. A very large body of literature in
portfolio optimization considers this particular policy. For instance, Clarke, De Silva and Thorley
(2006, 2011) are extensive practitioner-oriented studies on the composition and performance of
minimum variance portfolios; see also Engle and Kelly (2012) who evaluate whether equicorrelation
is better than different correlations using minimum variance portfolios and Kastner (2019) who
compares alternative covariance matrices when N = 300 in terms of minimum variance portfolios.
The minimum variance portfolio problem is defined as follows
min wt0 Ht wt
wt
(15)
subject to wt0 ι = 1,
ι0 Ht −1
wt = 0 −1 . (16)
ι Ht ι
In practice, feasible portfolio weights, ŵt , are obtained by replacing in equation (16), the unknown
covariance matrix, Ht by an estimate, Ĥt , which is obtained at time t − 1 using each of the
specifications described above.
12
We also consider the mean-variance with a momentum signal portfolio proposed by Engle,
Ledoit and Wolf (2019), which is based on an investor who wishes to minimize portfolio risk
subjected to a target portfolio return. This portfolio optimization problem is defined as follows
min wt0 Ht wt
wt
w0 m = b
t
(17)
subject to
wt0 ι = 1,
where m is the signal variable and b is the target return. The solution to (17) is given by
m (Cb − D) + ι (E − Db)
wt = Ht −1 (18)
EC − D2
13
PN
where ||a||1 = i=1 |a| is the 1-norm of the N -dimensional vector a and wt∗ is the portfolio obtained
at time t − 1 and after taking into account the changes in asset prices between periods t − 1 and t.
The constant κ controls for the degree to which the portfolio turnover is penalized when solving
(19). We set κ = 1 × 10−3 .
One important aspect of the turnover-constrained portfolios is that they require numerical
solutions. For that purpose, we use the CVX software (CVX Research, 2012).
The evaluation of the portfolios’ performance is based on the one-step-ahead portfolio returns,
Rt . We first compute their over-time mean and standard deviation, R̄P and σ P , respectively. A
portfolio is prefered when its variance is as small as possible.
Following, Gasbarro, Wong and Zumwalt (2007), DeMiguel and Nogales (2009), Zakamouline
and Koekebakker (2009), Behr, Guetter and Miebs (2013) and Hautsch and Voigt (2019), among
many others, we also evaluate the portfolios by the risk-adjusted returns measured by the infor-
mation ratio (IR), which is defined as follows9
R̄P
IR = P . (20)
σ
A superior covariance forecasting model should provide portfolios with low variance and/or large
IRs.
9
Note that in some of these works, they use the Sharpe ratio instead of the IR; see, for example, Goodwin (1998)
and Israelsen (2005) for the definition of the Sharpe ratio and IR.
14
N
∗
P
where turnovert = wj,t − wj,t is the portfolio turnover at time t, defined as the fraction of
j=1
wealth traded between periods t − 1 and t with wt∗ being the allocation vector at period t − 1
after taking into account the changes in asset prices between periods t − 1 and t. Finally, c is
the fee that must be paid for each transaction that is measured in terms of basis points (b.p.).
Note that, in practice, transaction costs depend on (possible time-varying) institutional rules and
the liquidity supply in the market. It is unavoidable that transaction costs are underestimated
or overestimated in individual assets. French (2008) estimates the trading cost in 2006, based on
stocks traded on NYSE, AMEX, and NASDAQ, including “total commissions, bid-ask spreads, and
other costs investors pay for trading services”, and finds that this cost has dropped significantly
over time going “from 146 basis points in 1980 to a tiny 11 basis points in 2006.” Hautsch and
Voigt (2019) mention that scenarios with c < 100 can be associated with rather small transaction
costs; see also Kirby and Ostdiek (2012) who consider c = 50 b.p.. To be conservative, in order
to take into account the impact of proportional transaction costs, we consider the cases in which
c ∈ {0, 5, 10} b.p..
Finally, we test for the statistical significance of the pairwise differences between the variances
and IRs of two portfolios derived by using the one-sided p-value of the prewhitened HACP W test
described by Ledoit and Wolf (2011) and Ledoit and Wolf (2008) for the variance and the IR,
respectively.
15
We fit the conditional covariance models described in Section 2 to a large system with up to
1000 assets traded in the US stock market. The data set consists of returns (including dividends)
of all NYSE, AMEX and NASDAQ stocks observed daily from 01/01/1970 to 12/31/2016. It
is important to note that, although the empirical results may depend on the particular assets
analyzed as well as on their observation frequency, given the very large number of individual
returns observed daily considered, we can expect the results to be rather general and of interest to
portfolio managers; see also Engle, Ledoit and Wolf (2019) and De Nard, Ledoit and Wolf (2020)
who analyze the same data set observed over different spans of time.
The covariance models (unconditional, RM-2006, DCC-NLS, AFM-DCC-NLS, WSV and SWSV)
are recursively estimated every month (we adopt the common convention that 21 consecutive days
constitute a month), at investment dates h = 1, ..., 505, using a rolling window scheme based on in-
vestment universes with N ∈ {100, 500, 1000} assets starting using data observed from 01/01/1970
to 12/11/1974 with T = 1250 observations. Following Engle, Ledoit and Wolf (2019) and De Nard,
Ledoit and Wolf (2020), the investment universes are obtained as follows. We find the set of stocks
that have a complete return history over the most recent T = 1250 days as well as a complete
return “future” over the next 21 days. We then look for possible pairs of highly correlated stocks,
that is, pairs of stocks with returns with a sample correlation exceeding 0.95 over the past 1250
days. With such pairs, if they should exist, we remove the stock with the lower volume on in-
vestment date h. Of the remaining set of stocks, we then pick the largest N stocks (as measured
by their market capitalization on investment date h) as our investment universe. In this way, the
investment universe changes slowly from one investment date to the next. In line with Brandt,
Santa-Clara and Valkanov (2009), we do not include the risk-free asset in the investment oppor-
tunity set as including this asset boils down to a change in the scale of the stock portfolio weights
and is not interesting per se. Therefore, for each model and investment universe, N , we perform
a total of 505 rolling window estimations. Using these estimates, at each day from 12/12/1974 to
12/31/2016, we obtain the corresponding one-step-ahead predictions of the covariance matrices,
with a total of 10,605 predictions.
16
10
The results obtained with N = 100 and N = 1000 are qualitatively similar and are available in the Supplemen-
tary Material.
17
18
In this section, we perform a large scale portfolio selection exercise. Our approach to portfolio
construction is largely inspired by Engle, Ledoit and Wolf (2019) and De Nard, Ledoit and Wolf
(2020) with portfolio weights updated on a monthly basis in order to avoid excessive turnover
levels associated to daily re-balancing.11 For each investment universe with N ∈ {100, 500, 1000}
assets, EW and VT portfolios are constructed. Also, for each N , one-step-ahead forecasts of the
conditional covariance matrices are obtained by each of the covariance models considered in Section
4. Then, minimum variance and mean-variance with a momentum signal portfolios are constructed
both with and without turnover constrains.
Table 1 reports the average turnover and the annualized average, standard deviation and IR
of returns net of transaction costs of the EW, VT and minimum variance portfolios. The returns
have been computed over the out-of-sample period under the three scenarios of proportional trans-
action costs, namely, c = 0, 5 and 10 b.p.. Panel A brings results for portfolios with N = 100
assets, whereas Panels B and C report results for portfolios with N = 500 and N = 1000 assets,
respectively. Consider first the results for the average turnover. Obviously, regardless of N , the
average turnover is very similar for EW and VT portfolios, around 0.10, and clearly smaller than
for the minimum variance portfolios constructed with any of the estimators of the covariance ma-
trices considered. One remarkable aspect of the results reported in Table 1 is that the turnover of
the minimum variance portfolios are substantially different among alternative covariance specifica-
tions. We find that the SWSV specification consistently outperforms all competitors as it delivers
minimum variance portfolios with a much lower turnover. On the other hand, the maximum aver-
age turnover is always achieved in portfolios based on the RM-2006 covariances. With respect to
portfolios based on unconditional covariances, we can observe that they have larger turnovers as N
11
De Nard, Ledoit and Wolf (2020) also propose reducing the turnover by using “averaged forecasting”. When
the frequency of the observed returns is daily but the portfolio is held for a month, the covariance matrices are
averaged over one-step-ahead forecasts over the 21 periods. However, this procedure is not trivial to implement and
we do not pursue it further.
19
20
21
22
The performance results for the mean-variance portfolios with momentum signal are reported in
Table 3. We can observe that both the turnovers and the standard deviations of the mean-variance
portfolios are larger for all specifications of the conditional covariance matrices considered. The
larger turnovers of the mean-variance portfolios could be due to the fact that the mean-variance
problem is known to be very sensitive to estimation of the mean returns; see Jagannathan and
Ma (2003). Very often, the estimation error in the mean returns degrades the overall portfolio
performance and introduces an undesirable level of portfolio turnover. In fact, existing evidence
suggests that the performance of optimal portfolios that do not rely on estimated mean returns
is better; see DeMiguel, Garlappi and Uppal (2009). As expected, the results reported in Table
3 reveal that the risk-adjusted performance of mean-variance portfolios is, in fact, substantially
affected by the presence of transaction costs. However, the main conclusions for the minimum
variance portfolios are corroborated. The DCC-NLS and AFM-DCC-NLS specifications deliver
less risky mean-variance portfolios when N is large, therefore corroborating the previous results
for the minimum variance portfolios reported in Table 1 as well as the results reported in De Nard,
Ledoit and Wolf (2020). However, the best performance in terms of IR when transaction costs
are considered is obtained with the SWSV model (1.73 when N = 1000 and transaction cost is 10
b.p.).
The results of the turnover-constrained mean-variance portfolios are reported in Table 4. We
can observe that, with the level of penalization κ = 1 × 10−3 , the turnover-constrained mean-
variance portfolios display, in general, the same numbers as those reported in Table 3 for the
corresponding unconstrained portfolios.
23
In this subsection, we carry out several robustness checks to validate the main conclusions.
First, we consider alternative estimators of the covariance matrices. In particular, on top of
estimating the unconditional covariance matrices using the sample covariance, we also estimate
it by the linear shrinkage (Unconditional-LS) method of Ledoit and Wolf (2004b) and by the
analytical non-linear shrinkage (Unconditional-NLS) method of Ledoit and Wolf (2019). We also
consider two variants for the estimation of the DCC model, namely i) the original DCC proposal
of Engle (2002) in which C is estimated by the sample covariance matrix of devolatized residuals,
denoted by DCC-Sample, and ii) the estimator of C considered in Engle, Ledoit and Wolf (2019)
in which C is estimated by the linear shrinkage (LS) approach of Ledoit and Wolf (2004b), denoted
as DCC-LS. Finally, we also consider the factor specification of De Nard, Ledoit and Wolf (2020)
with Σut modelled by the Whishart model described in section 2.4. We denote this specification
as AFM-WSV when the initial covariance matrix, S0 , is a equicorrelation matrix and AFM-SWSV
when S0 is diagonal. The results for these additional specifications of the covariance matrices are
reported in Section C of the Supplementary Material.
In Section C of the Supplementary Material, we also discuss the implementation of alternative
portfolio policies. In particular, we consider the value-weighted policies, and an alternative version
of the volatility timing policy proposed in Kirby and Ostdiek (2012).
Looking at the robustness checks reported in the Supplementary Material accompanying this
paper, related to the implementation of alternative covariance specifications and additional port-
folio policies, we show that the results are reassuring. When comparing among alternative uncon-
ditional estimators, we observe that the unconditional-LS outperforms the unconditional, and the
unconditional-NLS outperforms both. This result suggests that the non-linear shrinkage developed
in Ledoit and Wolf (2012, 2019) is in fact an improvement with respect to the linear shrinkage as
well as with respect to the traditional sample covariance estimator. A similar finding is observed
when comparing among alternative DCC specifications. We observe that the DCC-LS outperforms
the DCC-Sample and the DCC-NLS outperforms both.
Introducing a common factor and modelling the covariance of the idiosyncratic errors using
24
6. Concluding remarks
Modelling and forecasting large dimensional conditional covariance matrices in a data-rich en-
vironment is challenging. Most models for dynamic covariance matrices suffer from the curse of
dimensionality, which creates difficulties in the estimation process when considering applications
involving hundreds or thousands of time series. We compare the one-step-ahead correlations ob-
tained from the unconditional, RM-2006, DCC, AFM-DCC, WSV and SWSV covariance models
in an empirical application based on daily returns of NYSE, NASDAQ and AMEX stocks, with up
to 1000 assets. We show that the pairwise correlations obtained using the SWSV model are close
to zero, more stable over time and have less cross-sectional dispersion than those obtained by any
of the other specifications considered. We evaluate the performance of the correlations using them
to select minimum variance and mean-variance portfolios, as well as turnover-constrained mini-
mum variance and mean-variance portfolios. The SWSV correlations deliver more stable optimal
portfolios weights, implying a lower turnover in comparison to the alternative conditional covari-
ance specifications considered. We find that the risk-adjusted performance of the SWSV model is
consistently superior to that of alternative specifications, specially when considering trading costs.
Furthermore, one further attractive of SWSV is that it is computationally very simple even in the
presence of very large portfolios.
25
26
The Table reports performance statistics for equally weighted (EW), variance targeting (VT) and minimum variance portfolios with N ∈ {100, 500, 1000} assets
obtained with several covariance models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard
deviation, and IR are reported in annual terms whereas turnovers are in monthly terms. All figures are based on out-of-sample observations. The out-of-sample
period goes from 12/12/1974 to 12/31/2016 (10,605 daily observations) resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One,
two, and three asterisks denote that the standard deviation (IR) is statistically larger (smaller) with respect the smallest standard deviation (largest IR) at the 10%,
5%, and 1% levels, respectively. The smallest (largest) standard deviation (IR) and those which are not significantly larger (smaller) are highlighted in bold.
Electronic copy available at: https://ssrn.com/abstract=3222808
The Table reports performance statistics for turnover-constrained minimum variance portfolios with N ∈ {100, 500, 1000} assets obtained with several covariance
models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and IR are reported
in annual terms whereas turnovers are in monthly terms. All figures are based on out-of-sample observations. The out-of-sample period goes from 12/12/1974 to
12/31/2016 (10,605 daily observations) resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One, two, and three asterisks denote
Electronic copy available at: https://ssrn.com/abstract=3222808
that the standard deviation (IR) is statistically larger (smaller) with respect the smallest standard deviation (largest IR) at the 10%, 5%, and 1% levels, respectively.
The smallest (largest) standard deviation (IR) and those which are not significantly larger (smaller) are highlighted in bold.
SWSV 0.23 14.63 11.77∗∗∗ 1.24 14.49 11.77∗∗∗ 1.23 14.35 11.77∗∗∗ 1.22
The Table reports performance statistics for mean-variance portfolios with momentum signal with N ∈ {100, 500, 1000} assets obtained with a set of covariance
models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and IR are reported
in annual terms whereas turnovers are in monthly terms. All figures are based on out-of-sample observations. The out-of-sample period goes from 12/12/1974 to
12/31/2016 (10,605 daily observations) resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One, two, and three asterisks denote
Electronic copy available at: https://ssrn.com/abstract=3222808
that the standard deviation (IR) is statistically larger (smaller) with respect the smallest standard deviation (largest IR) at the 10%, 5%, and 1% levels, respectively.
The smallest (largest) standard deviation (IR) and those which are not significantly larger (smaller) are highlighted in bold.
SWSV 0.83 16.61 14.51∗∗∗ 1.14 16.12 14.51∗∗∗ 1.11 15.62 14.51∗∗∗ 1.08
The Table reports performance statistics for turnover-constrained mean-variance portfolios with momentum signal with N ∈ {100, 500, 1000} assets obtained with
a set of covariance models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and
IR are reported in annual terms whereas turnovers are in monthly terms. All figures are based on out-of-sample observations. The out-of-sample period goes from
12/12/1974 to 12/31/2016 (10,605 daily observations) resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One, two, and three
Electronic copy available at: https://ssrn.com/abstract=3222808
asterisks denote that the standard deviation (IR) is statistically larger (smaller) with respect the smallest standard deviation (largest IR) at the 10%, 5%, and 1%
levels, respectively. The smallest (largest) standard deviation (IR) and those which are not significantly larger (smaller) are highlighted in bold.
SWSV 0.81 16.62 14.52∗∗∗ 1.15 16.14 14.52∗∗∗ 1.11 15.65 14.52∗∗∗ 1.08
0.9 T=1000
T=1250
0.8 T=1500
0.7
Coefficient of S0
0.6
0.5
0.4
0.3
0.2
0.1
0
0.995 0.99625 0.99750 0.99875 1.00000
Values of
The Figure plots the weight placed in the initial covariance matrix, S0 , as a function of the decay parameter, λ, for
sample sizes of T ∈ {1000, 1250, 1500} in the computation of the one-step-ahead conditional covariance matrix, HT +1 .
31
0 0 0
0 0 0
The Figure plots the evolution of the out-of-sample one-step-ahead median pairwise correlations (solid blue line) along with the 25th and 75th percentiles (dashed
lines) when N = 500 obtained with different specifications of the conditional covariance matrices.
Relative weight of S0 in E[H t+1 |rt]
1
N=100
N=500
N=1000
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
Nov79 Dec83 Jan88 Mar92 Apr96 Jun00 Jul04 Sep08 Oct12 Dec16
The Figure plots the evolution of the relative weight of S0 in the one-step-ahead forecast of the SWSV model.
6 MSV
DCC-NLS
0
Dec74 Jun85 Dec95 Jun06 Dec16
6 MSV
DCC-NLS
0
Dec74 Jun85 Dec95 Jun06 Dec16
The Figure plots out-of-sample monthly turnover of the minimum variance (top panel) and mean-variance (bottom
panel) portfolios with N = 1000 assets obtained with the WMS2 V (blue lines) and the DCC-NLS (red lines) estimated
covariance matrices.
33
Alexander, C., 2008. Market Risk Analysis: Practical Financial Econometrics. volume 2. John Wiley & Sons.
Behr, P., Guetter, A., Miebs, F., 2013. On portfolio optimization: Imposing the right constraints. Journal of Banking & Finance 37, 1232–1242.
Brandt, M.W., Santa-Clara, P., Valkanov, R., 2009. Parametric portfolio policies: Exploiting characteristics in the cross-section of equity returns. The
Review of Financial Studies 22, 3411–3447.
Carnero, M.A., Peña, D., Ruiz, E., 2004. Persistence and kurtosis in GARCH and stochastic volatility models. Journal of Financial Econometrics 2,
319–342.
Clarke, R.G., De Silva, H., Thorley, S., 2006. Minimum-variance portfolios in the us equity market. The Journal of Portfolio Management 33, 10–24.
Clarke, R.G., De Silva, H., Thorley, S., 2011. Minimum-variance portfolio composition. Journal of Portfolio Management 37, 31.
CVX Research, I., 2012. CVX: Matlab software for disciplined convex programming, version 2.0. http://cvxr.com/cvx.
De Nard, G., Ledoit, O., Wolf, M., 2020. Factor models for portfolio selection in large dimensions: The good, the better and the ugly. Forthcoming,
Journal of Financial Econometrics .
DeGroot, M.H., 2004. Optimal statistical decisions. John Wiley & Sons.
Della Corte, P., Sarno, L., Thornton, D.L., 2008. The expectation hypothesis of the term structure of very short-term rates: Statistical tests and
economic value. Journal of Financial Economics 89, 158–174.
DeMiguel, V., Garlappi, L., Uppal, R., 2009. Optimal versus naive diversification: How inefficient is the 1/n portfolio strategy? Review of Financial
Studies 22, 1915–1953.
DeMiguel, V., Martin-Utrera, A., Nogales, F.J., Uppal, R., 2020. A portfolio perspective on the multitude of firm characteristics. Forthcoming, The
Review of Financial Studies .
DeMiguel, V., Nogales, J., 2009. Portfolio selection with robust estimation. Operations Research 57, 560–577.
Engle, R., 2002. Dynamic conditional correlation: A simple class of multivariate generalized autoregressive conditional heteroskedasticity models.
Journal of Business & Economic Statistics 20, 339–350.
Engle, R., Kelly, B., 2012. Dynamic equicorrelation. Journal of Business & Economic Statistics 30, 212–228.
Engle, R.F., Ledoit, O., Wolf, M., 2019. Large dynamic covariance matrices. Journal of Business & Economic Statistics 37, 363–375.
French, K.R., 2008. Presidential address: The cost of active investing. The Journal of Finance 63, 1537–1573.
Gasbarro, D., Wong, W., Zumwalt, J., 2007. Stochastic dominance of ishares. European Journal of Finance 13, 89–101.
Goodwin, T., 1998. The information ratio. Financial Analysts Journal 54, 34–43.
Han, Y., 2006. Asset allocation with a high dimensional latent factor stochastic volatility model. Review of Financial Studies 19, 237–271.
Harvey, A., Ruiz, E., Shephard, N., 1994. Multivariate stochastic variance models. The Review of Economic Studies 61, 247–264.
Hautsch, N., Voigt, S., 2019. Large-scale portfolio allocation under transaction costs and model uncertainty. Journal of Econometrics 212, 221–240.
Israelsen, C., 2005. A refinemen of the sharpe ratio and information ratio. Journal of Asset Management 5, 423–427.
Jagannathan, R., Ma, T., 2003. Risk reduction in large portfolios: Why imposing the wrong constraints helps. The Journal of Finance 58, 1651–1684.
Jegadeesh, N., Titman, S., 1993. Returns to buying winners and selling losers: Implications for stock market efficiency. The Journal of finance 48,
65–91.
Kastner, G., 2019. Sparse Bayesian time-varying covariance estimation in many dimensions. Journal of Econometrics 210, 98–115.
Kim, D., 2014. Maximum likelihood estimation for vector autoregressions with multivariate stochastic volatility. Economics Letters 123, 282–286.
Kirby, C., Ostdiek, B., 2012. It’s all in the timing: simple active portfolio strategies that outperform naive diversification. Journal of Financial and
Quantitative Analysis 47, 437–467.
Ledoit, O., Wolf, M., 2004a. Honey, I shrunk the sample covariance matrix. Journal of Portfolio Management 30, 110–119.
34
Electronic copy available at: https://ssrn.com/abstract=3222808
Ledoit, O., Wolf, M., 2004b. A well-conditioned estimator for large-dimensional covariance matrices. Journal of Multivariate Analysis 88, 365–411.
Ledoit, O., Wolf, M., 2008. Robust performance hypothesis testing with the Sharpe ratio. Journal of Empirical Finance 15, 850–859.
Ledoit, O., Wolf, M., 2011. Robust performance hypothesis testing with the variance. Wilmott Magazine 55, 86–89.
Ledoit, O., Wolf, M., 2012. Nonlinear shrinkage estimation of large-dimensional covariance matrices. The Annals of Statistics 40, 1024–1060.
Ledoit, O., Wolf, M., 2017a. Nonlinear shrinkage of the covariance matrix for portfolio selection: Markowitz meets goldilocks. The Review of Financial
Studies 30, 4349–4388.
Ledoit, O., Wolf, M., 2017b. Numerical implementation of the QuEST function. Computational Statistics & Data Analysis 115, 199–223.
Ledoit, O., Wolf, M., 2019. Analytical nonlinear shrinkage of large-dimensional covariance matrices. Forthcoming, Annals of Statistics .
Mina, J., Xiao, J.Y., 2001. Return to RiskMetrics: the evolution of a standard. RiskMetrics Group 1, 1–11.
Moura, G.V., Noriller, M.R., 2019. Maximum likelihood estimation of a TVP-VAR. Economics Letters 174, 78–83.
Pakel, C., Shephard, N., Sheppard, K., Engle, R., 2020. Fitting vast dimensional time-varying covariance models. Journal of Business & Economic
Statistics doi: 1080/07350015.2020.1713795.
Santos, A.A., 2019. Disentangling the role of variance and covariance information in portfolio selection problems. Quantitative Finance 19, 57–76.
Sheppard, K., 2012. Forecasting high dimensional covariance matrices. John Wiley & Sons.
Stivers, C., Sun, L., 2016. Mitigating estimation risk in asset allocation: Diagonal models versus 1/N diversification. Financial Review 51, 403–433.
Thornton, D.L., Valente, G., 2012. Out-of-sample predictions of bond excess returns and forward rates: An asset allocation perspective. Review of
Financial Studies 25, 3141–3168.
Uhlig, H., 1994. On singular Wishart and singular multivariate Beta distributions. The Annals of Statistics 22, 395–405.
Uhlig, H., 1997. Bayesian vector autoregressions with stochastic volatility. Econometrica 65, 59–74.
Windle, J., Carvalho, C., 2014. A tractable state-space model for symmetric positive-definite matrices. Bayesian Analysis 9, 759–792.
Zakamouline, V., Koekebakker, S., 2009. Portfolio performance evaluation with generalized sharpe ratios: Beyond the mean and variance. Journal of
Banking & Finance 33, 1241–1254.
Zumbach, G., 2007a. A gentle introduction to the RM2006 methodology. RiskMetrics Group .
Zumbach, G., 2007b. The RiskMetrics 2006 methodology. Technical report, RiskMetrics Group .
35
Electronic copy available at: https://ssrn.com/abstract=3222808
Supplementary Material
d
E [Θt ] = IN . (22)
d+1
d+1 d
E Ht−1 |Ht−1
−1 −1 0 −1 −1
= U(Ht−1 ) IN U(Ht−1 ) = Ht−1 , (23)
d d+1
which is a very flexible stochastic process that is able to mimic several persistent processes in small samples.
The prior distribution in (8) implies that the initial precision matrix, H1−1 , follows a Wishart distribution
with expected value given by E[H1−1 ] = S0−1 , which makes the selection of the hyperparameter S0 more intuitive
since it is related to a covariance matrix. Now suppose a researcher observes a single draw r1 , knowledge
about H1−1 can be updated analytically via Bayes rule, and the conjugacy between the multivariate Normal
in (1) and the Wishart prior in (8):
yielding a posterior for H1−1 |r1 of the form WN (d + 1, [(d + 1)S1 ]−1 ), where S1 = (d + 1)−1 [dS0 + r1 r10 ], and
such that E[H1−1 |r1 ] = S1−1 .
A predictive density for H2−1 |r1 follows from Theorem 1 in Uhlig (1994), which shows that combining the
multiplicative transition in (8) with the Wishart posterior for H1−1 |r1 yields a prior density for H2−1 given
by WN (d, [dS1 ]−1 ). Since the predictive density for the conditional precision H2−1 |r1 is WN (d, [dS1 ]−1 ), a
one-step-ahead prediction for the precision matrix H2−1 |r1 is given by E[H2−1 |r1 ] = S1−1 .
Additionally, as H2−1 |r1 ∼ WN (d, [dS1 ]−1 ) and r2 |H2−1 ∼ NN (0, H2 ), the joint density of r2 , H2−1 |r1 is
proportional to:
Suppose now that d is treated as a fixed parameter, the period-2 contribution to the likelihood function,
36
Electronic copy available at: https://ssrn.com/abstract=3222808
p(r2 |r1 ; d), can be obtained by marginalizing the joint density in (25) with respect to the N (N + 1)/2 distinct
variables in the symmetric matrix H2 over the set of all values such that H2 is positive definite (see, for
example, DeGroot, 2004, Section 9.11). Note that, similarly to (24), the right side of (25) is proportional to a
Wishart density for H2 |r2 , r1 , and such a density must integrate to unit over the above mentioned set. Hence,
integrating the function on the right side of (25) yields the relation:
−(d+1)
p(r2 |r1 ) ∝ |[dS1 + r2 r20 ]| 2 ,
−(d+1)
∝ |dS1 | 1 + d−1 r20 S1−1 r2 2 ,
(26)
which, combined with the remaining constants of p(r2 |H2−1 ) and p(H2−1 |r1 ), yields a multivariate t-distribution.
d
Thus, p(r2 |r1 ; d) ∼ tN (0, Σ2 , d + 1 − N ), where Σ2 = d+1−N S1 , and tN is the N -dimensional multivariate t
distribution. These recursions follow analytically until period T , and we collect them here for convenience:
where
d
Σt = St−1 , (30)
d+1−N
d 1
St = St−1 + rt r0 . (31)
d+1 d+1 t
One important aspect of the original WSV model proposed by Uhlig (1997) is that its estimation is based on
Bayesian methods. In this paper, we consider a maximum likelihood estimation procedure developed in Kim
(2014). Although Uhlig (1997) does not present results for the likelihood function of his model, an analytical
solution to it is a direct by-product of his results (see, for example, Kim 2014). Notice that p(rt |rt−1 ; d) in eq.
(27) gives the prediction error decomposition, thus the log-likelihood function could be written as:
TN d+1 d+1−N
log f (r1:T ) = − log[(d + 1 − N )π] + T log Γ − T log Γ
2 2 2
T T
1X d+1X 1
− log |Σt | − log 1 + rt Σ−1 0
t rt , (32)
2 t=1 2 t=1 d+1−N
where Γ(·) denotes the gamma function. Conditional on S0 , we estimate the unique parameter d in (32) by
maximizing the log-likelihood function in eq. (32).
37
Electronic copy available at: https://ssrn.com/abstract=3222808
regardless of the cross-sectional dimension, N , the level of the correlations obtained using SWSV is lower
than those obtained when any of the other alternative models is implemented to estimate the conditional
covariance matrices. The second conclusion is that, in general, at each moment of time, the dispersion of the
pairwise conditional correlations among returns decreases slightly with the cross-sectional dimension. In any
case, regardless of N , the dispersion of the pairwise correlations estimated using the WSV models is smallest
and close to zero, implying equi-correlations, i.e. the correlations of all pairs of returns are approximately the
same. Finally, we can observe that, regardless of N , the variability over time of the pairwise correlations is
much larger when looking at the correlations estimated using RM-2006. The variability of the correlations
estimated using the unconditional sample covariances or any of the two WSV models considered is much
similar and much smoother than that of the pairwise correlations estimated using the DCC-based models.
The former correlations are approximately constant over time, only jumping at crisis times, in particular, in
October 1987 and in October 2008. The pairwinse correlations estimated by the SWSV model when N = 1000
are approximately constant and equal to zero.
DCC-Sample: Ĉ = T1 SS 0 . This corresponds to the original DCC formulation in which C is estimated by the
sample covariance matrix of devolatized residuals.
PN 0
DCC-LS: C is estimated by C̄ = i=1 ρδ̄i + (1 − ρ)δi ui ui , where δi for i = 1, . . . , N denote the i-th
eigenvalue of the sample covariance matrix of devolatized residuals with corresponding eigenvector ui for
i = 1, . . . , N and δ̄ = N1 N
P
i=1 δi . This specification corresponds to the linear shrinkage (LS) approach
of Ledoit and Wolf (2004b) with ρ being the shrinkage intensity. The authors provide closed form
expression for this parameter.
38
Electronic copy available at: https://ssrn.com/abstract=3222808
Finally, we also consider the factor approach of De Nard, Ledoit and Wolf (2020) in which the covariance
matrices of the idiosyncratic noises are estimated with the WSV and SWSV models.
Tables 5 and 6 report the performance of minimum variance portfolios without and with turnover restric-
tions, respectively, when these additional estimators of the conditional covariance matrices are implemented.
We can observe, that none of these specifications improve on the optimal portfolios reported in the main text.
Finally, Tables 7 and 8 report the corresponding results for the mean-variance portfolios without and with
turnover restrictions, respectively, with the same conclusions as before.
39
Electronic copy available at: https://ssrn.com/abstract=3222808
Table 5: Performance of minimum variance portfolios
The Table reports performance statistics for the VW, EW-TQ and VT-GARCH portfolios and minimum variance portfolios with N ∈ {100, 500, 1000} assets obtained with a
several additional covariance models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and IR
are reported in annual terms whereas turnovers are in monthly terms. All figures are based on out-of-sample observations. The out-of-sample period goes from 12/12/1974 to
12/31/2016 (10,605 daily observations) resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One, two, and three asterisks denote that the the
standard deviation (IR) is statistically larger (smaller) than the smallest standard deviation (largest IR) at the 10%, 5%, and 1% levels, respectively.
The Table reports performance statistics for turnover-constrained minimum variance portfolios with N ∈ {100, 500, 1000} assets obtained with several additional covariance
models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and IR are reported in annual terms
whereas turnovers are in monthly terms. All figures are based on one-step-ahead returns from 12/12/1974 to 12/31/2016 (10,605 daily observations) resulting in a total of
505 months. Portfolio weights are updated on a monthly basis. One, two, and three asterisks denote that the standard deviation (IR) is statistically larger (smaller) than the
smallest standard deviation (largest IR) at the 10%, 5%, and 1% levels, respectively.
Unconditional-NLS 0.50 13.18 11.71∗∗ 1.13∗∗ 12.87 11.71∗∗ 1.10∗∗∗ 12.57 11.71∗∗ 1.07∗∗∗
DCC-Sample 2.19 12.29 11.48∗∗∗ 1.07∗∗ 10.97 11.49∗∗∗ 0.95∗∗∗ 9.66 11.51∗∗∗ 0.84∗∗∗
DCC-LS 2.16 12.32 11.46∗∗∗ 1.08∗∗ 11.03 11.47∗∗∗ 0.96∗∗∗ 9.73 11.49∗∗∗ 0.85∗∗∗
AFM-WSV 0.58 13.17 11.64∗∗ 1.13∗∗ 12.82 11.64∗∗ 1.10∗∗ 12.47 11.64∗∗ 1.07∗∗∗
AFM-SWSV 0.58 13.19 11.64∗∗ 1.13∗∗ 12.84 11.64∗∗ 1.10∗∗ 12.49 11.64∗∗ 1.07∗∗∗
AFM-WSV 0.38 11.97 8.49∗∗∗ 1.41∗∗∗ 11.74 8.49∗∗∗ 1.38∗∗∗ 11.52 8.49∗∗∗ 1.36∗∗∗
AFM-SWSV 0.37 12.78 8.47∗∗∗ 1.51∗∗∗ 12.56 8.47∗∗∗ 1.48∗∗∗ 12.33 8.47∗∗∗ 1.46∗∗∗
The Table reports performance statistics for mean-variance portfolios with momentum signal with N ∈ {100, 500, 1000} assets obtained with several additional covariance
models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and IR are reported in annual terms
whereas turnovers are in monthly terms. All figures are based on out-of-sample observations. The out-of-sample period goes from 12/12/1974 to 12/31/2016 (10,605 daily
observations) resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One, two, and three asterisks denote that the standard deviation (IR) is
statistically larger (smaller) than the smallest standard deviation (largest IR) at the 10%, 5%, and 1% levels, respectively.
Unconditional-NLS 1.10 15.00 14.06∗∗ 1.07∗ 14.34 14.06∗∗ 1.02∗∗ 13.68 14.06∗∗ 0.97∗∗
DCC-Sample 2.68 14.89 13.88∗∗∗ 1.07 13.28 13.88∗∗∗ 0.96∗∗ 11.68 13.91∗∗∗ 0.84∗∗∗
DCC-LS 2.64 14.93 13.86∗∗∗ 1.08 13.35 13.86∗∗∗ 0.96∗∗ 11.76 13.89∗∗∗ 0.85∗∗∗
AFM-WSV 1.21 15.11 13.98∗∗ 1.08 14.38 13.98∗∗ 1.03∗ 13.65 13.98∗∗ 0.98∗∗
AFM-SWSV 1.21 15.13 13.98∗∗ 1.08 14.40 13.98∗∗ 1.03∗ 13.68 13.98∗∗ 0.98∗∗
AFM-WSV 1.01 14.17 9.93∗∗∗ 1.43∗∗∗ 13.56 9.93∗∗∗ 1.36∗∗∗ 12.95 9.94∗∗∗ 1.30∗∗∗
AFM-SWSV 1.00 14.96 9.94∗∗∗ 1.50∗∗∗ 14.36 9.95∗∗∗ 1.44∗∗∗ 13.76 9.95∗∗∗ 1.38∗∗∗
The Table reports performance statistics for turnover-constrained mean-variance portfolios with momentum signal with N ∈ {100, 500, 1000} assets obtained with several
additional covariance models. Information ratios (IR) are computed using returns net of transaction costs of 0, 5, and 10 b.p. Mean returns, standard deviation, and IR are
reported in annual terms whereas turnovers are in monthly terms. All figures are based on one-step-ahead returns from 12/12/1974 to 12/31/2016 (10,605 daily observations)
resulting in a total of 505 months. Portfolio weights are updated on a monthly basis. One, two, and three asterisks denote that the standard deviation (IR) is statistically larger
(smaller) than the smallest standard deviation (largest IR) at the 10%, 5%, and 1% levels, respectively.
Unconditional-NLS 1.06 15.01 14.06∗∗ 1.07∗ 14.38 14.06∗∗ 1.02∗∗ 13.74 14.06∗∗ 0.98∗∗
DCC-Sample 2.62 14.88 13.87∗∗∗ 1.07 13.31 13.87∗∗∗ 0.96∗∗ 11.74 13.90∗∗∗ 0.84∗∗∗
DCC-LS 2.58 14.92 13.85∗∗∗ 1.08 13.37 13.85∗∗∗ 0.97∗∗ 11.83 13.88∗∗∗ 0.85∗∗∗
AFM-WSV 1.17 15.12 13.98∗∗ 1.08 14.42 13.98∗∗ 1.03∗ 13.71 13.99∗∗ 0.98∗∗
AFM-SWSV 1.17 15.15 13.98∗∗ 1.08 14.44 13.98∗∗ 1.03∗ 13.74 13.99∗∗ 0.98∗∗
AFM-WSV 0.92 14.22 9.95∗∗∗ 1.43∗∗∗ 13.67 9.95∗∗∗ 1.37∗∗∗ 13.12 9.95∗∗∗ 1.32∗∗∗
AFM-SWSV 0.91 15.02 9.96∗∗∗ 1.51∗∗∗ 14.48 9.96∗∗∗ 1.45∗∗∗ 13.93 9.97∗∗∗ 1.40∗∗∗
0 0 0
0 0 0
The Figure plots the evolution of the out-of-sample one-step-ahead median pairwise correlations (solid blue line) along with the 25th and 75th percentiles (dashed lines) when
N = 100 obtained with different specifications of the conditional covariance matrices.
Unconditional DCC-NLS WSV
0 0 0
0 0 0
The Figure plots the evolution of the out-of-sample one-step-ahead median pairwise correlations (solid blue line) along with the 25th and 75th percentiles (dashed lines) when
N = 1000 obtained with different specifications of the conditional covariance matrices.