SlideShare a Scribd company logo
1 of 25
Dummy Variables
Introduction
• Discuss the use of dummy variables in
Financial Econometrics.
• Examine the issue of normality and the
use of dummy variables to correct any
problem
• Show how dummy variables affect the
regression
• Assess the use of intercept and slope
dummy variables
The Normality Assumption
• In general we assume the error term is
normally distributed.
• Financial data often fails this assumption
due to the volatile nature of the data and
the numbers of outliers.
• The normality of the error term can be
tested using the Bera-Jarque test, which
tests for the presence of skewness (non-
symmetry) and kurtosis (fat tails)
Bera-Jarque Test
• This test for normality in effect tests for the
coefficients of skewness and excess kurtosis
being jointly equal to 0
nsobservatioofnumber
kurtosisexcessoftcoefficien
skewnessoftcoefficien
]
24
)3(
6
[
2
1
2
2
2
1
−
−
−
−
+=
T
b
b
bb
TW
Bera-Jarque Test
• The statistic follows the chi-squared distribution
with 2 degrees of freedom.
• The null hypothesis is that the distribution is
normal.
• i.e. if we get a Bera-Jarque statistic of 4.78, the
critical value is 5.99 (5%), then as 4.78<5.99 we
would accept the null hypothesis that the error
term is normally distributed.
• Most computer programmes report this statistic.
Remedies for non-normality
• The non-normality is often caused by a couple of
observations in the tails of the distribution, these
observations are often termed outliers.
• The simplest way to solve the problem is to use
a dummy variable, often called an impulse
dummy variable, which takes the value of 0,
except the one outlier observation which takes
the value of 1.
• This has the effect of forcing the residual for this
observation to 0.
• To determine where the outlier is, we could
simply plot the residuals against time.
Non-normality
• The use of this type of dummy variable is
controversial, as some argue it is an
artificial method of improving the
regression, by in effect removing the
influence of this particular observation.
• However an outlier can have an
excessively strong effect on a model,
giving an unrealistic result, so needs to be
taken into account.
Dummy Variable for Single Outlier
• In a regression of stock prices against income for
the UK, an outlier was noticed for 1992 month 9,
when the UK left the ERM. A dummy variable was
added to account for this. This produced the
following result:
.91992var1
.87.1,78.0R
(0.20)(0.23)(0.43)
80.087.067.0ˆ
2
mforiabledummyD
DW
Dys ttt
−
==
++=
Dummy Variables
• The previous set of results can be interpreted in
the usual way, in this case the dummy variable
has a significant t-statistic (4), so the outlier has
a significant effect on the regression, or put
another way the UK leaving the ERM had a
significant effect on UK stock prices.
• In many cases however the outlier will be more
difficult to interpret and may not correspond to a
particular event.
Dummy Variables
• Dummy variables are discrete variables taking a
value of ‘0’ or ‘1’. They are often called ‘on’ ‘off’
variables, being ‘on’ when they are 1.
• Dummy variables can be used either as
explanatory variables or as the dependent
variable.
• When they act as the dependent variable there
are specific problems with how the regression is
interpreted, however when they act as
explanatory variables they can be interpreted in
the same way as other variables.
Types of Explanatory Dummy
Variable
• Qualitative dummy variables: i.e. age, sex, race,
health.
• Seasonal dummy variables: depends on the
nature of the data, so quarterly data requires
three dummy variables etc.
• Dummy variables that represent a change in
policy:
– Intercept dummy variables, that pick up a change in
the intercept of the regression
– Slope dummy variables, that pick up a change in the
slope of the regression
Dummy Variables
• If y is a teachers salary and
Di = 1 if a non-smoker
Di = 0 if a smoker
We can model this in the following way:
tii uDy ++= βα
Dummy Variables
• This produces an average salary for a
smoker of E(y/Di =0) =α.
• The average salary of a non-smoker will
be E(y/Di = 1) = α + β.
• This suggests that non-smokers receive a
higher salary than smokers.
Dummy Variables
• Equally we could have used the dummy
variable in a model with other explanatory
variables. In addition to the dummy variable
we could also add years of experience (x),
to give:
tiii uxDy +++= δβα
Dummy Variables
α
α+β
Non-smoker
Smoker
y
x
Seasonal Dummy Variables
• The use of seasonal dummy variables is widespread in
finance due to the ‘day of the week’ effect on asset
prices.
• They take the same format as other dummy variables,
i.e. a January dummy variable would consist of 0, except
every observation in January which has the value of 1.
• For monthly data, we include 11 dummy variables,
quarterly data 3 etc. i.e. we have as many dummies as
months, quarters etc minus 1.
• The excluded month acts as the reference category, i.e.
all the other dummies refer to differences between
themselves and this reference month.
Seasonal Dummy variables
• If we have the following model of share prices for a
gas and electricity firm, where the share price is
regressed against 3 dummy variables. (Using
quarterly data)
tt
tt
tt
t
tt
yysQ
yysQ
yysQ
ysQ
yDDDs
80.040.580.020.060.5ˆ:4
80.090.480.070.060.5ˆ:3
80.040.480.020.160.5ˆ:2
80.060.5ˆ:1
80.020.070.020.160.5ˆ 432
+=+−=
+=+−=
+=+−=
+=
+−−−=
Seasonal Dummy variables
• The regression can not be carried out if all
the seasonal dummies are added (i.e. 4
for quarterly data), as there is perfect
multicollinearity
• Although we can use the t-test to
determine if the seasonal dummy is
significant, we usually use an F-test to
determine if they are jointly significant.
Slope Dummy Variables
• The type of dummy variable considered so far is
the intercept dummy variable, we could also use
dummy variables to model changes in the slope
of the regression line, these are known as slope
or interaction dummy variables.
• We can include either types of dummy variable
or more commonly both types in a regression, to
account for changes in the intercept and slope of
the regression line.
Slope Dummy Variables
• The slope dummy variable consists of a term
which is the product of an explanatory variable
and dummy variable (Dx):
ttt
t
ttt
t
tttttt
uxy
DWhen
uxy
DWhen
uxDxDy
++++=
=
++=
=
++++=
)()(
1
0
2110
10
2110
ββαα
βα
ββαα
Slope Dummy Variable
• Given the following results from a demand for bank
loans (bl) model, with house prices (hp) as the
explanatory variable. The dummy variable takes the
value of 0 before 1979 and 1 afterwards. The slope
dummy is going to determine the change in lending
as a result of changes to the credit laws, i.e. it is
easier to borrow based on the value of a persons
house.
18.056.012.078.0ˆ
ttttt DhphpDlb +++=
Slope Dummy variables
• We then get two separate regression lines,
before and after 1979, with different
intercepts and slope coefficients:
tt
tt
hplb
Post
hplb
e
74.090.0ˆ
:1979
56.078.0ˆ
:1979Pr
+=
−
+=
−
Test for Structural Stability
• Although the Chow test is usually used to test for
a structural break, an alternative test involving
the dummy variables can also be used.
• It involves running two regressions, one with the
dummy variables (unrestricted model) and
collecting the RSS.
• The other regression excludes the dummy
variables (restricted model) and collect this RSS.
• Use the F-test formula to produce the F-statistic
and compare with the critical values, the null
hypothesis being that the regression is
structurally stable.
The Dummy Variable Approach to
Testing for a Structural Break
• Instead of two separate regressions on each
sub-sample, as in the Chow test, we just need
the single regression with the dummy variables
(as well as without the dummy variables)
• The dummy variable approach allows us to test
a variety of hypotheses about any structural
break
• The dummy variable approach allows us to
determine if it is the intercept or slope that is
different
• Using the Chow test requires testing of sub-
samples, which reduces the degrees of freedom
Conclusion
• When running a regression, we assume the
error term is normally distributed
• The Bera-Jarque test is used to determine if the
error term is normally distributed.
• To overcome non-normality, we can use an
impulse dummy variable to account for any
outliers.
• Dummy variables have a variety of uses, mostly
being used to model qualitative effects
• Dummy variables can be in either intercept or
slope form.

More Related Content

Similar to Dummy ppt

FE3.ppt
FE3.pptFE3.ppt
FE3.pptasde13
 
Diagnostic Tests.ppt
Diagnostic Tests.pptDiagnostic Tests.ppt
Diagnostic Tests.pptNavyaPS2
 
Common mistakes in measurement uncertainty calculations
Common mistakes in measurement uncertainty calculationsCommon mistakes in measurement uncertainty calculations
Common mistakes in measurement uncertainty calculationsGH Yeoh
 
2. Module II (1) FRM.pdf
2. Module II (1) FRM.pdf2. Module II (1) FRM.pdf
2. Module II (1) FRM.pdfItzGA
 
Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015Chris Ohk
 
MSc Finance_EF_0853352_Kartik Malla
MSc Finance_EF_0853352_Kartik MallaMSc Finance_EF_0853352_Kartik Malla
MSc Finance_EF_0853352_Kartik MallaKartik Malla
 
Time Series Analysis.pptx
Time Series Analysis.pptxTime Series Analysis.pptx
Time Series Analysis.pptxSunny429247
 
1_--_sci_method.ppt
1_--_sci_method.ppt1_--_sci_method.ppt
1_--_sci_method.pptMervatMarji2
 
Analytical chemistry lecture 3
Analytical chemistry lecture 3Analytical chemistry lecture 3
Analytical chemistry lecture 3Sunita Jobli
 
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...Maninda Edirisooriya
 
need to realize in r studio (regression).pptx
need to realize in r studio (regression).pptxneed to realize in r studio (regression).pptx
need to realize in r studio (regression).pptxSmarajitPaulChoudhur
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)Ryan Herzog
 
Multivariate Linear Regression.ppt
Multivariate Linear Regression.pptMultivariate Linear Regression.ppt
Multivariate Linear Regression.pptTanyaWadhwani4
 

Similar to Dummy ppt (20)

FE3.ppt
FE3.pptFE3.ppt
FE3.ppt
 
Diagnostic Tests.ppt
Diagnostic Tests.pptDiagnostic Tests.ppt
Diagnostic Tests.ppt
 
Common mistakes in measurement uncertainty calculations
Common mistakes in measurement uncertainty calculationsCommon mistakes in measurement uncertainty calculations
Common mistakes in measurement uncertainty calculations
 
2. Module II (1) FRM.pdf
2. Module II (1) FRM.pdf2. Module II (1) FRM.pdf
2. Module II (1) FRM.pdf
 
Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015
 
MSc Finance_EF_0853352_Kartik Malla
MSc Finance_EF_0853352_Kartik MallaMSc Finance_EF_0853352_Kartik Malla
MSc Finance_EF_0853352_Kartik Malla
 
Validity andreliability
Validity andreliabilityValidity andreliability
Validity andreliability
 
Time Series Analysis.pptx
Time Series Analysis.pptxTime Series Analysis.pptx
Time Series Analysis.pptx
 
Ch4 slides
Ch4 slidesCh4 slides
Ch4 slides
 
1_--_sci_method.ppt
1_--_sci_method.ppt1_--_sci_method.ppt
1_--_sci_method.ppt
 
Logistical Regression.pptx
Logistical Regression.pptxLogistical Regression.pptx
Logistical Regression.pptx
 
Forcasting methods
Forcasting methodsForcasting methods
Forcasting methods
 
Analytical chemistry lecture 3
Analytical chemistry lecture 3Analytical chemistry lecture 3
Analytical chemistry lecture 3
 
Ch13 slides
Ch13 slidesCh13 slides
Ch13 slides
 
Forecasting Examples
Forecasting ExamplesForecasting Examples
Forecasting Examples
 
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
Lecture 4 - Linear Regression, a lecture in subject module Statistical & Mach...
 
need to realize in r studio (regression).pptx
need to realize in r studio (regression).pptxneed to realize in r studio (regression).pptx
need to realize in r studio (regression).pptx
 
Time series analysis
Time series analysisTime series analysis
Time series analysis
 
Topic 5 (multiple regression)
Topic 5 (multiple regression)Topic 5 (multiple regression)
Topic 5 (multiple regression)
 
Multivariate Linear Regression.ppt
Multivariate Linear Regression.pptMultivariate Linear Regression.ppt
Multivariate Linear Regression.ppt
 

Recently uploaded

Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Tina Ji
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLSeo
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 
Unlocking the Secrets of Affiliate Marketing.pdf
Unlocking the Secrets of Affiliate Marketing.pdfUnlocking the Secrets of Affiliate Marketing.pdf
Unlocking the Secrets of Affiliate Marketing.pdfOnline Income Engine
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Roomdivyansh0kumar0
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Neil Kimberley
 
Best Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaBest Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaShree Krishna Exports
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Servicediscovermytutordmt
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsMichael W. Hawkins
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Event mailer assignment progress report .pdf
Event mailer assignment progress report .pdfEvent mailer assignment progress report .pdf
Event mailer assignment progress report .pdftbatkhuu1
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
BEST ✨ Call Girls In Indirapuram Ghaziabad ✔️ 9871031762 ✔️ Escorts Service...
BEST ✨ Call Girls In  Indirapuram Ghaziabad  ✔️ 9871031762 ✔️ Escorts Service...BEST ✨ Call Girls In  Indirapuram Ghaziabad  ✔️ 9871031762 ✔️ Escorts Service...
BEST ✨ Call Girls In Indirapuram Ghaziabad ✔️ 9871031762 ✔️ Escorts Service...noida100girls
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 

Recently uploaded (20)

Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
Russian Faridabad Call Girls(Badarpur) : ☎ 8168257667, @4999
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 
Unlocking the Secrets of Affiliate Marketing.pdf
Unlocking the Secrets of Affiliate Marketing.pdfUnlocking the Secrets of Affiliate Marketing.pdf
Unlocking the Secrets of Affiliate Marketing.pdf
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
 
Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023Mondelez State of Snacking and Future Trends 2023
Mondelez State of Snacking and Future Trends 2023
 
Best Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in IndiaBest Basmati Rice Manufacturers in India
Best Basmati Rice Manufacturers in India
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Service
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
HONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael HawkinsHONOR Veterans Event Keynote by Michael Hawkins
HONOR Veterans Event Keynote by Michael Hawkins
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Event mailer assignment progress report .pdf
Event mailer assignment progress report .pdfEvent mailer assignment progress report .pdf
Event mailer assignment progress report .pdf
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
BEST ✨ Call Girls In Indirapuram Ghaziabad ✔️ 9871031762 ✔️ Escorts Service...
BEST ✨ Call Girls In  Indirapuram Ghaziabad  ✔️ 9871031762 ✔️ Escorts Service...BEST ✨ Call Girls In  Indirapuram Ghaziabad  ✔️ 9871031762 ✔️ Escorts Service...
BEST ✨ Call Girls In Indirapuram Ghaziabad ✔️ 9871031762 ✔️ Escorts Service...
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 

Dummy ppt

  • 2. Introduction • Discuss the use of dummy variables in Financial Econometrics. • Examine the issue of normality and the use of dummy variables to correct any problem • Show how dummy variables affect the regression • Assess the use of intercept and slope dummy variables
  • 3. The Normality Assumption • In general we assume the error term is normally distributed. • Financial data often fails this assumption due to the volatile nature of the data and the numbers of outliers. • The normality of the error term can be tested using the Bera-Jarque test, which tests for the presence of skewness (non- symmetry) and kurtosis (fat tails)
  • 4. Bera-Jarque Test • This test for normality in effect tests for the coefficients of skewness and excess kurtosis being jointly equal to 0 nsobservatioofnumber kurtosisexcessoftcoefficien skewnessoftcoefficien ] 24 )3( 6 [ 2 1 2 2 2 1 − − − − += T b b bb TW
  • 5. Bera-Jarque Test • The statistic follows the chi-squared distribution with 2 degrees of freedom. • The null hypothesis is that the distribution is normal. • i.e. if we get a Bera-Jarque statistic of 4.78, the critical value is 5.99 (5%), then as 4.78<5.99 we would accept the null hypothesis that the error term is normally distributed. • Most computer programmes report this statistic.
  • 6. Remedies for non-normality • The non-normality is often caused by a couple of observations in the tails of the distribution, these observations are often termed outliers. • The simplest way to solve the problem is to use a dummy variable, often called an impulse dummy variable, which takes the value of 0, except the one outlier observation which takes the value of 1. • This has the effect of forcing the residual for this observation to 0. • To determine where the outlier is, we could simply plot the residuals against time.
  • 7. Non-normality • The use of this type of dummy variable is controversial, as some argue it is an artificial method of improving the regression, by in effect removing the influence of this particular observation. • However an outlier can have an excessively strong effect on a model, giving an unrealistic result, so needs to be taken into account.
  • 8. Dummy Variable for Single Outlier • In a regression of stock prices against income for the UK, an outlier was noticed for 1992 month 9, when the UK left the ERM. A dummy variable was added to account for this. This produced the following result: .91992var1 .87.1,78.0R (0.20)(0.23)(0.43) 80.087.067.0ˆ 2 mforiabledummyD DW Dys ttt − == ++=
  • 9. Dummy Variables • The previous set of results can be interpreted in the usual way, in this case the dummy variable has a significant t-statistic (4), so the outlier has a significant effect on the regression, or put another way the UK leaving the ERM had a significant effect on UK stock prices. • In many cases however the outlier will be more difficult to interpret and may not correspond to a particular event.
  • 10. Dummy Variables • Dummy variables are discrete variables taking a value of ‘0’ or ‘1’. They are often called ‘on’ ‘off’ variables, being ‘on’ when they are 1. • Dummy variables can be used either as explanatory variables or as the dependent variable. • When they act as the dependent variable there are specific problems with how the regression is interpreted, however when they act as explanatory variables they can be interpreted in the same way as other variables.
  • 11. Types of Explanatory Dummy Variable • Qualitative dummy variables: i.e. age, sex, race, health. • Seasonal dummy variables: depends on the nature of the data, so quarterly data requires three dummy variables etc. • Dummy variables that represent a change in policy: – Intercept dummy variables, that pick up a change in the intercept of the regression – Slope dummy variables, that pick up a change in the slope of the regression
  • 12. Dummy Variables • If y is a teachers salary and Di = 1 if a non-smoker Di = 0 if a smoker We can model this in the following way: tii uDy ++= βα
  • 13. Dummy Variables • This produces an average salary for a smoker of E(y/Di =0) =α. • The average salary of a non-smoker will be E(y/Di = 1) = α + β. • This suggests that non-smokers receive a higher salary than smokers.
  • 14. Dummy Variables • Equally we could have used the dummy variable in a model with other explanatory variables. In addition to the dummy variable we could also add years of experience (x), to give: tiii uxDy +++= δβα
  • 16. Seasonal Dummy Variables • The use of seasonal dummy variables is widespread in finance due to the ‘day of the week’ effect on asset prices. • They take the same format as other dummy variables, i.e. a January dummy variable would consist of 0, except every observation in January which has the value of 1. • For monthly data, we include 11 dummy variables, quarterly data 3 etc. i.e. we have as many dummies as months, quarters etc minus 1. • The excluded month acts as the reference category, i.e. all the other dummies refer to differences between themselves and this reference month.
  • 17. Seasonal Dummy variables • If we have the following model of share prices for a gas and electricity firm, where the share price is regressed against 3 dummy variables. (Using quarterly data) tt tt tt t tt yysQ yysQ yysQ ysQ yDDDs 80.040.580.020.060.5ˆ:4 80.090.480.070.060.5ˆ:3 80.040.480.020.160.5ˆ:2 80.060.5ˆ:1 80.020.070.020.160.5ˆ 432 +=+−= +=+−= +=+−= += +−−−=
  • 18. Seasonal Dummy variables • The regression can not be carried out if all the seasonal dummies are added (i.e. 4 for quarterly data), as there is perfect multicollinearity • Although we can use the t-test to determine if the seasonal dummy is significant, we usually use an F-test to determine if they are jointly significant.
  • 19. Slope Dummy Variables • The type of dummy variable considered so far is the intercept dummy variable, we could also use dummy variables to model changes in the slope of the regression line, these are known as slope or interaction dummy variables. • We can include either types of dummy variable or more commonly both types in a regression, to account for changes in the intercept and slope of the regression line.
  • 20. Slope Dummy Variables • The slope dummy variable consists of a term which is the product of an explanatory variable and dummy variable (Dx): ttt t ttt t tttttt uxy DWhen uxy DWhen uxDxDy ++++= = ++= = ++++= )()( 1 0 2110 10 2110 ββαα βα ββαα
  • 21. Slope Dummy Variable • Given the following results from a demand for bank loans (bl) model, with house prices (hp) as the explanatory variable. The dummy variable takes the value of 0 before 1979 and 1 afterwards. The slope dummy is going to determine the change in lending as a result of changes to the credit laws, i.e. it is easier to borrow based on the value of a persons house. 18.056.012.078.0ˆ ttttt DhphpDlb +++=
  • 22. Slope Dummy variables • We then get two separate regression lines, before and after 1979, with different intercepts and slope coefficients: tt tt hplb Post hplb e 74.090.0ˆ :1979 56.078.0ˆ :1979Pr += − += −
  • 23. Test for Structural Stability • Although the Chow test is usually used to test for a structural break, an alternative test involving the dummy variables can also be used. • It involves running two regressions, one with the dummy variables (unrestricted model) and collecting the RSS. • The other regression excludes the dummy variables (restricted model) and collect this RSS. • Use the F-test formula to produce the F-statistic and compare with the critical values, the null hypothesis being that the regression is structurally stable.
  • 24. The Dummy Variable Approach to Testing for a Structural Break • Instead of two separate regressions on each sub-sample, as in the Chow test, we just need the single regression with the dummy variables (as well as without the dummy variables) • The dummy variable approach allows us to test a variety of hypotheses about any structural break • The dummy variable approach allows us to determine if it is the intercept or slope that is different • Using the Chow test requires testing of sub- samples, which reduces the degrees of freedom
  • 25. Conclusion • When running a regression, we assume the error term is normally distributed • The Bera-Jarque test is used to determine if the error term is normally distributed. • To overcome non-normality, we can use an impulse dummy variable to account for any outliers. • Dummy variables have a variety of uses, mostly being used to model qualitative effects • Dummy variables can be in either intercept or slope form.