3-Basic Stats
3-Basic Stats
3-Basic Stats
Learning Objectives :
1. Probability Density function
2. Normal Distribution
3. Correlation and Covariance
4. Various methods of computing Volatility
5. Vasicek Model
Probability Density Function- Basic Concepts
❑ Let X be a continuous random variable . Then a probability distribution or probability density
function (pdf) of X is a function f(x) such that for any two numbers a and b with b >= a,
P ( a<= X <= b) is ∫f(x) dx , with integration from a to b
❑That is, the probability that X takes on a value in the interval [a, b] is the area above this interval
and under the graph of the density function. The graph of f(x) is often referred to as the density
curve
❑For f (x) to be a legitimate pdf, it must satisfy the
following two conditions: f (x)
CDF
1.20
1.00
0.80
0.60
0.40
0.20
-
3
Normal Distribution
❑ Of all the distribution functions, Normal Distribution is amongst the most important
❑Numerous economic and physical measures and indicators are normally distributed
❑A continuous rv X is said to have a normal distribution with parameters μ and σ, if the pdf of X is
❑Each density curve is symmetric about μ and bell-shaped, so the center of the bell (point of
symmetry) is both the mean of the distribution and the median.
❑ The Standard Normal Distribution is a reference distribution from which information on other
distributions can be obtained using z = (x -μ )/ σ. Z is negative for values to left of the mean
❑For any z value, area to the left of the curve can be found from empirical tables 4
Important Z values and Percentiles
5
Normal Probability Distributions
Let p= 0.99
Mean, µ 0
Std Dev, σ 1
Then z Value= F-(p) 2.33 =NORM.INV(p,mean, SD)
X pdf CDF
And P value for a given z , P(z) 0.99 =NORM.DIST(p,mean, SD)
-4.00 0.00 0.00
-3.50 0.00 0.00
-3.00 0.00 0.00
-2.50 0.02 0.01 PDF and CDF
-2.00 0.05 0.02
1.05
-1.50 0.13 0.07
-1.00 0.24 0.16 0.85
-0.50 0.35 0.31
0.65
- 0.40 0.50
0.50 0.35 0.69 0.45
1.00 0.24 0.84
0.25
1.50 0.13 0.93
2.00 0.05 0.98 0.05
2.50 0.02 0.99
(0.15)
3.00 0.00 1.00
3.50 0.00 1.00 pdf CDF
4.00 0.00 1.00 6
Inverse Normal Function
CDF
1.20 ✓ Enter a p value ( say 0.8)
1.00
0.80
✓ Use the NORM.INV function to
0.60 compute the z value
0.40
0.20
✓ This gives the Z value corresponding to
- the area of 89%
✓ In other words we are finding z using
the inverse function F- (p)
pdf ✓ We are thus inverting the CDF function
0.45
0.40
0.35
0.30
0.25
0.20
0.15
0.10
0.05
-
7
Typical problem
Z Value-LB NA
Z Value-LB (0.20)
Z Value-UB 0.20 Z Value-UB (0.60)
✓A bank estimates that its profit next year is normally distributed with a mean of 0.8% of assets and
the standard deviation of 2% of assets. How much equity (as a percentage of assets) does the
company need to be (a) 99% sure that it will have a positive equity at the end of the year and (b)
99.9% sure that it will have positive equity at the end of the year? Ignore taxes.
11
Covariance and Correlation
❑ When two random variables X and Y are not independent, it is frequently of interest to assess how
strongly they are related to one another.
❑For discrete random variables it is given by the equation Cov(X,Y) =Average of ∑ (X-μx) (Y-μy)
❑For a strong positive relationship, product will be positive , for a strongly negative relationship
product will be negative
❑If they are not corelated , the negatives and positives will mostly cancel out , giving a product closer
to Zero
Cov(X, Y)/σxσy
12
Illustrating COVAR and CORREL
Sigma a 2.87
13
Volatility
❑ Volatility is defined as the Standard Deviation of the return provided per unit of time
✓Suppose that Si is the value of a variable on day i. The volatility per day is the standard
deviation of returns ln(Si /Si-1)
❑Normally days when markets are closed are ignored in volatility calculations
❑Of the variables needed to price an option the one that cannot be observed directly is volatility
❑We can therefore imply volatilities from market prices and vice versa
14
Computing volatility- Daily Standard Deviation
❑We need only remember the current estimate of the variance rate and the most recent observation
on the market variable
❑l = 0.94 has been found to be a good choice across a wide range of market variables
✓Risk Metrics, a database created by JPM , uses λ=0.94 for updating volatility estimates across a
range of market variables
18
Estimating Volatilities - GARCH
❑ In GARCH (1,1) we assign some weight to the long-run average variance rate
2n = g V L + a u n2 − 1 + b 2n − 1
❑Maximum weightage , 80% at least is assigned to β
❑Balance weight is distributed between ά and Ύ
❑Since weights must sum to 1
g + a + b =1
19
Computing volatility- GARCH
Relatives daily µi
Weighted
returns
Simplified formulae
Day Prices daily µ2i Weights squared
𝑺𝒊 𝒖𝒊 𝒖𝟐𝒊 𝜶𝜷𝒊−𝟏
0 20.00
ൗ𝑺
𝒊−𝟏 𝝈𝟐𝒏 = 𝝎 + 𝜶𝒖𝟐𝒏−𝟏 + 𝜷𝝈𝟐𝒏−𝟏
1 19.80 1.01010 1.01% 0.00010 10% 0.000010
2 20.13 1.01646 1.63% 0.00027 8.00% 0.000021
3 20.15 1.00119 0.12% 0.00000 6.40% 0.000000
Where 𝝎= ΎVL where VL is the long
4 20.18 1.00149 0.15% 0.00000 5.12% 0.000000 Assumed Parameters run average variance rate
5 20.17 0.99961 -0.04% 0.00000 4.10% 0.000000
53 20.00 0.99732 -0.27% 0.00001 0.00% 0.000000 α 10%
54 20.02 1.00090 0.09% 0.00000 0.00% 0.000000 β 80% And 𝜶+𝜷+ Ύ =1
55 20.00 0.99915 -0.09% 0.00000 0.00% 0.000000 γ 10%
56 20.00 0.99988 -0.01% 0.00000 0.00% 0.000000 σ2(LR) 0.00010
57 20.04 1.00181 0.18% 0.00000 0.00% 0.000000 ω 0.000010
Key difference: We are assuming a
58 20.04 1.00002 0.00% 0.00000 0.00% 0.000000 Long Run Volatility / SD and
59 20.04 0.99983 -0.02% 0.00000 0.00% 0.000000
60 19.98 0.99743 -0.26% 0.00001 0.00% 0.000000
assigning it a certain weight
0.000032
σ2(MA) 0.000825% σ2n 0.008210%
σ(MA) 0.2873% σn 0.9061%
20
Problem : HULL 10.19
Suppose that the price of an asset at close of trading yesterday was $300 and its volatility was
estimated as 1.3% per day. The price at the close of trading today is $298. Update the volatility
estimate using (a) The EWMA model with λ = 0.94 and (b) The GARCH(1,1) model with w =
0.000002, alpha = 0.04, and beta = 0.94.
21
Comparing the three approaches
Method
𝒎
Volatility / Standard Deviation 𝟏 Assigns equal weight to each
𝝈𝟐𝒏 = 𝒖𝟐𝒏−𝒊
( Conventional Way) 𝒎 day
𝒊=𝟏
22
Forecasting Volatility using GARCH
𝝈𝟐𝒏 = 𝝎 + 𝜶𝒖𝟐𝒏−𝟏 + 𝜷𝝈𝟐𝒏−𝟏
Key formulae we have learnt
𝝈𝟐𝒏 = 𝜸𝑽𝑳 + 𝜶𝒖𝟐𝒏−𝟏 +𝜷𝝈𝟐𝒏−𝟏
❑ This equation forecasts the volatility on t day forward using the information available
at end of day n-1
❑ The Variance Rate exhibits mean reversion with reversion level of 𝑽𝑳 and a reversion
rate of 1- 𝜶-𝜷
❑ 𝜸 is effectively the rate at which the Volatility mean reverts 23
Problem : HULL 10.21
Suppose that the parameters in a GARCH(1,1) model are a = 0.03, b= 0.95 and w = 0.000002.
(a) What is the long-run average volatility?
(b) If the current volatility is 1.5% per day, what is your estimate of the volatility in 20, 40, and 60 days?
(c) Suppose that there is an event that increases the volatility from 1.5% per day to 2% per day. Estimate the effect on
the volatility in 20, 40, and 60 days.
𝑬 𝝈𝟐𝒏+𝒕 = 𝑽𝑳 + 𝜶 + 𝜷 𝒕 𝝈𝟐𝒏 − 𝑽𝑳
Therefore , Volatility in 20 days = 0.0001+ ( 0.98) ^20 * ( 1.5%^2 – 0.00001) = 0.000183
Volatility = SQRT(0.000183) = 1.35%
24
Vasicek Model
❑Very important for determining the credit risk capital for a portfolio of loans
❑For a large portfolio of loans, each of which has a probability of PD of defaulting by time T the
Worst Case Default Rate that will not be exceeded at the X% confidence level is
N PD + r N ( X )
−1 −1
WCDR = N
1 − r
❑Credit VAR at X % confidence interval is WCDR X EAD X LGD less Expected Loss
25
Vasicek Model
❑ The result from the Vasicek model is used to determine the Credit VAR
Expected Loss
Probability
N PD + r N ( X )
−1 −1
WCDR = N
1− r
26
Problem : HULL 11.19
Suppose that a bank has made a large number loans of a certain type. The one-year
probability of default on each loan is 1.2%. The bank uses a Gaussian copula for time to
default. It is interested in estimating a “99.97% worst case” for the percent of loan that
default on the portfolio. Show how this varies with the copula correlation.
PD 1.2%
Confidence Level 99.97%
VALUE as given
Correlation N-1(PD) N-1(X) by Formula N(Value)
0 -2.257 3.432 -2.257 1.2%
0.2 -2.257 3.432 -0.808 21.0%
0.4 -2.257 3.432 -0.112 45.5%
0.6 -2.257 3.432 0.634 73.7%
0.9 -2.257 3.432 3.157 99.9%
27