but also to present the
different subtleties in evaluation of models in the context of risk management.
For better visualization of performance, the above measures are plotted for
each model as a function of increasingly extreme classes of realizations. To
construct the classes of extreme realizations we use two different
parameters such that - as they increase - they define a class of more and more
extreme observations. These are:
From the latter, it should be clear that - when comparing plots for different models against the volatility percentile - a given point on the X-axis will always refer to the same data. On the other hand, for plots against the confidence level - a point on the X-axis will refer to differing data sets since this discrimination depends on the model itself.
The performance measures presented in this paper are:
As already mentioned in section
, we have one further
variation in the presentation of performance measures - the univariate
context based on 10000 prediction-realization pairs and
the multivariate context based on 1000 prediction-realization
pairs.
The two contexts allow us to distinguish between the performance of
a model for forecasting a risk factor with the performance of
the model for forecasting the portfolio.
Knowledge of the relative performance of the model in the univariate
and multivariate context highlights an important distinction.
It is usually the case that the methodology used for
the multivariate computation pt(P) as prescribed by a given model
can be refined without affecting the univariate computation pt(k).
(For all the models presented in this paper - barring historical simulation -
this means that it is possible to modify the methodology for updating the off
diagonal elements of the covariance matrix
without changing the
prescription for the diagonal elements.) The relative performance of a given model
in the univariate and multivariate context can indicate if the
point of weakness for the model lies in the computation of pt(P) and
can therefore motivate further research in rectifying the point of weakness.