SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
Topic:
Topic:
Topic:
Topic: Introduction to Panel Data Analysis
Introduction to Panel Data Analysis
Introduction to Panel Data Analysis
Introduction to Panel Data Analysis
2
Introduction to Panel Data Analysis
OUTLINE
OUTLINE
OUTLINE
OUTLINE
1.0
1.0
1.0
1.0Preamble
Preamble
Preamble
Preamble
 Time
Time
Time
Time series data
series data
series data
series data
 Cross
Cross
Cross
Cross-
-
-
-sectional data
sectional data
sectional data
sectional data
 Panel data
Panel data
Panel data
Panel data
 Practical illustrations
Practical illustrations
Practical illustrations
Practical illustrations
 Objectives of the course
Objectives of the course
Objectives of the course
Objectives of the course
2.0
2.0
2.0
2.0 Why panel data?
Why panel data?
Why panel data?
Why panel data?
3.0
3.0
3.0
3.0 Structure of panel data
Structure of panel data
Structure of panel data
Structure of panel data
 Balanced panel vs. Unbalanced panel
Balanced panel vs. Unbalanced panel
Balanced panel vs. Unbalanced panel
Balanced panel vs. Unbalanced panel
 Macro panel vs. Micro panel
Macro panel vs. Micro panel
Macro panel vs. Micro panel
Macro panel vs. Micro panel
 Practical illustrations
Practical illustrations
Practical illustrations
Practical illustrations
 Activity 1
Activity 1
Activity 1
Activity 1
 Activ
Activ
Activ
Activity 2
ity 2
ity 2
ity 2
4.0
4.0
4.0
4.0 Panel data models
Panel data models
Panel data models
Panel data models
 Static panel vs. Dynamic panel
Static panel vs. Dynamic panel
Static panel vs. Dynamic panel
Static panel vs. Dynamic panel
 Estimation of static panel data models
Estimation of static panel data models
Estimation of static panel data models
Estimation of static panel data models
 Pooled regression
Pooled regression
Pooled regression
Pooled regression
 Practical illustrations
Practical illustrations
Practical illustrations
Practical illustrations
 Activity 3
Activity 3
Activity 3
Activity 3
 Fixed effects
Fixed effects
Fixed effects
Fixed effects
 Empirical illustrations
Empirical illustrations
Empirical illustrations
Empirical illustrations
 Activity 4
Activity 4
Activity 4
Activity 4
 Random effects
Random effects
Random effects
Random effects
 Empirical illustrations
Empirical illustrations
Empirical illustrations
Empirical illustrations
 Activity 5
Activity 5
Activity 5
Activity 5
3
Introduction to Panel Data Analysis
1.0 Preamble
In applied research, three categories of data sets have remained prominent. They are time
series data, cross-sectional data and panel data. A time series data set is a collection of
observations on several variables over a period of time e.g. Exchange rate and Consumer Price
Index (CPI) in Nigeria over the period 1980 - 2008. A cross-sectional data set refers to data
collected on several units such as households, firms, countries, states, regions etc. at a
particular point in time. Examples of these data in Nigeria are the 2009 Labour Force Survey
(LFS) and 2006 Core Welfare Indicators Questionnaire (CWIQ). However, a panel data set refers
to data covering several units over a period of time. Thus, a panel data set is a combination of
time series data and cross-sectional data. Examples of panel data are Gross Domestic Product
(GDP) per capita for Sub-saharan African (SSA) countries over the period 1960 – 2008 and stock
prices of listed companies in Nigeria over the period 1990 - 2010.
In a typical symbolic representation, time series variables are usually denoted by subscript ‫ݐ‬
while cross-sectional variables are denoted by subscript ݅. Since panel data have both time
series and cross-sectional dimensions, their variables are represented by subscript ݅‫ݐ‬. Using the
absolute income hypothesis as an example, we can specify the following single-equation
models in respect of these data sets below:
‫ܥ‬௧ = ߙ + ߚܻ௧ + ߝ௧; ‫ݐ‬ = 1, … , ܶ (1)
‫ܥ‬௜ = ߙ + ߚܻ௜ + ߝ௜; ݅ = 1, … , ܰ (2)
‫ܥ‬௜௧ = ߙ + ߚܻ௜௧ + ߝ௜௧; (3)
Where ‫ܥ‬ denotes consumption and ܻ represents disposable income. ‫ݐ‬ is the time series
dimension and ݅ is the cross-sectional dimension. Equation (1) above follows a time series
framework, equation (2) follows a cross-sectional framework while equation (3) follows a panel
data framework. If we are testing this hypothesis in a time series framework using Nigerian
data for example over the period 1980 - 2010, it implies that ‫ݐ‬ = 1980, 1981, … ,2010. If it
follows a cross-sectional framework using data covering SSA for year 2010, then, ݅= list of SSA.
If we are dealing with a panel data framework also using data covering SSA however over a
period of 1980 - 2010, ݅= list of SSA and ‫ݐ‬ = 1980, 1981, … ,2010.
Note the following on the three modelling frameworks:
1. On the time series case, we are dealing with only one unit – ܰ = 1 (i.e. one individual,
one firm, one country, state or any other single type of unit) over a period of time -
‫ݐ‬ = 1, … , ܶ. Thus, we can draw one or more variables over a period of time for the
single unit. The total number of observations is ܶ.
4
2. On the cross-sectional case, we are dealing with several units - ݅ = 1, … , J (individuals,
firms, states or countries) at a specific point in time – H B 1. That is, each unit has only
one data point on each variable. Therefore, the total number of observations is J.
3. As regards the panel data framework, several units are considered over a period of time.
That is, one or more variables can be drawn on each unit over a period of time.
Therefore, the total number of observations is JH.
We consider practical illustrations of these data sets below.
Figure 1: Time series data set covering selected macroeconomic variables in Nigeria over the
period 1980 – 2007.
Time Series
Dimension
5
Figure 2: Cross-sectional data set covering selected SSA for year 2005
Figure 3: Panel Data covering selected SSA over the period 1996-2005
Cross-sectional
Dimension
6
For the purpose of this exercise, our subsequent discussions shall focus on panel data and,
therefore, we provide a more detailed explanation on the latter in the next sections. At the end
of the lecture, students should be able to:
 Structure panel data for the purpose of estimation
 Deal with pooled regression models, fixed effect models and random effect models
 Compute relevant diagnostic tests to validate panel data regressions
 Interpret panel data regressions
2.0 Why Panel Data?
The use of panel data in applied research is increasingly gaining relevance both in developed
and developing countries partly because of the need to harmonize regional policies and more
generally due to the inherent benefits for empirical analyses. Hsiao (2003) documents the
advantages of using panel data. These are represented in Baltagi (2008) and summarized as
follows:
1. Panel data provide sufficient observations and, consequently, more sample variability,
less collinearity, more degrees of freedom, and more accurate inference of model
parameters. For example, when dealing with cross-sectional data, ܶ = 1 and similarly,
for time series data, ܰ = 1. However, in the case of panel data, ‫ݐ‬ = 1, … , ܶ like wise
݅ = 1, … , ܰ providing more observations and more sample variability than either cross-
sectional data or time series data.
2. In connection with (1), panel data models better capture the complexity of human
behaviour than a single cross-section or time series data. For example, consider a cross-
sectional sample of university students with an average grade of 50% in all the courses
for the same period in time. This suggests that every student has the chances of having
50% grade based on information obtained from their performance at a particular
year/level of academic study. Thus, current information about a student’s academic
performance is a perfect predictor of his/her future performance. However, sequential
observations for students contain information about their academic performance in
different years/levels of academic study that are captured in the cross-sectional
framework. With panel data models, performance of each student can be observed over
time and more informed judgments can be made.
Similarly, consider a time series sample of a student’s academic performance.
Generalizing with the information obtained from the student will lead to unbiased and
inefficient estimates. Therefore, by pooling a cross-sectional sample of students over
time, variations in each unit over time are captured.
3. In connection with (2), panel data models are better able to capture the heterogeneity
inherent in each individual unit. The structure of panel data suggests that the cross-
sectional units whether individuals, firms, states or countries are heterogeneous. In
empirical modelling, ignoring these heterogeneous effects when in fact they exist leads
to biased and inefficient results. We can illustrate this with an empirical example using
Baltagi and Levin (1992) paper. They consider cigarette demand across 46 American
7
states for the years 1963-88. Consumption is modelled as a function of lagged
consumption, price and income. They however note that there are a lot of variables that
may be state-invariant or time-invariant that may affect consumption. Examples of
these state invariant variables are advertisement on nationwide television and radio and
national policies while time-invariant variables are religion and education.
4. In connection (2), behaviour of economic agents is intrinsically dynamic and this can be
accounted for using panel data models. Cross-sectional models can only ascertain the
behavioural pattern at a particular point and therefore cannot be used to capture
behavioural dynamics. Similarly, time series data that provides information over a
period of time only limits itself to one unit and, therefore, we are unable to compare
changes in the behaviour of different economic agents. For example, we may be
interested in quantifying the consumption pattern of a number of cross-sections over
time. These dynamics can neither be captured by cross-sectional framework nor time
series models. However, since panel data sets provide time series on each cross-
sectional unit in a group, it becomes relatively easy to evaluate the changes in the
behavioural pattern using the data.
5. The use of micro-data in panel regressions gives more accurate prediction than the
prediction of aggregate data often used in the time series case. More notably, if the
micro units are heterogeneous, policy prescriptions drawn from time series aggregate
data may be invalid. Panel data containing time series observations for a number of
individuals is ideal for investigating the “homogeneity” versus “heterogeneity” issue
(Hsiao, 2006).
Although not limited to those highlighted above, these benefits have provided some reasonable
justification for the consideration of panel data in empirical and policy based studies.
3.0 Structure of Panel Data
Essentially, panel data sets are broadly categorized into two namely: Balanced Panel data and
Unbalanced Panel data. The balanced panels refer to a case where all the cross-sectional units
have equal time series dimension. That is, all the cross-sectional units are observed over the
same period of time. Thus, balanced panels are also referred to complete panels. The
unbalanced panels on the other hand refer to situations where the cross-section units under
scrutiny have unequal time periods. While some cross-sectional units may have complete
information over a specified period (say ܶ), others may have unequal time periods less than ܶ
which may be due to missing observations. This is practically possible particularly in longitudinal
data sets of ܶ dimension where the death of one or more individuals in a survey or exit of one
or more firms in a market being surveyed or withdrawal of one or more countries from an
economic union/cooperation being studied creates some missing observations for the affected
cross-sectional units.
Whether balanced or unbalanced, panel data sets can either be micro-based or macro-based.
When large cross-sectional units (usually households or individuals) are surveyed over a short
time period (not exceeding a maximum of ܶ = 20), such panel data are referred to as micro
8
panels. However, when large cross-sectional units (usually countries) are surveyed over a
considerable period of time (exceeding 20 time periods), such are regarded as macro panels.
Thus, for micro panels, ܰ is large and ܶ is small (less than 20) while for macro panels, we are
dealing with large ܰ and ܶ. As a result of the nature of time series dimension of both micro
and macro panels, they attract different treatments in empirical research. Since we are dealing
with large ܶ in macro panels, the problem of unit root (i.e. non-stationarity) must be addressed.
Also, because macro panels usually involve cross-section of countries, the problem of cross-
sectional dependence may also be evident and therefore, has to be dealt otherwise the results
will be inefficient. We provide below a schema showing the structure of panel data.
Figure 4: Panel Data Structure
Equal T for all i’s Unequal T for the i’s
If ܶ  20 If ܶ  20
Panel Data
Balanced/Complete
Panels
Unbalanced/Incomplete
Panels
Micro Panels Macro Panels
Stationarity and Cross-
sectional Independence
Unit Root and Cross-
sectional Dependence
9
3.1 Some Practical Illustrations
How can we properly structure panel data for estimation purpose? Our instructional software
for the purpose of this exercise is Eviews 7 version. We provide a step-by-step procedure on
how to structure panel data.
Step 1:It is convenient to start with an empty Excel Sheet. We provide below a figure showing a
fresh excel sheet with appropriate labels.
Figure 5: Panel Data Structure in Excel Sheet
Individuals/Households/
Firms/States/Countries
Code for representing
each CS for ease of
identification
Day/Week/Month/Year
List of variables
of interest
10
For example, if CS=countries= Ghana, Nigeria and Time period = years= 2004 – 2009 with the
same set of variables as in figure 5, then we have:
Figure 6: Panel Data for CS=countries= Ghana, Nigeria and Time period = years= 2004 – 2009
Activity 1:
Arrange the following time series data in tables 1- 10 for selected SSA in panel structure in excel
worksheet. Note that llr=Liquid Liabilities as a percentage of GDP; pcr = Private Credit by
Deposit Money Banks as a percentage of GDP; bdr=Bank Deposits as a percentage of GDP; and
rgdp=GDP per capita (constant 2000 US$). Also, all the countries have the same units of
measurement. This is very important in panel data to make meaningful comparative analyses.
Table 1: Time Series for Togo
Year llr Pcr bdr Rgdp
1996 23.0315 16.0748 15.0802 250.4119
1997 20.0072 15.3494 13.713 276.0873
1998 21.3712 17.1686 14.4285 259.7281
1999 21.1455 15.905 13.4929 256.5784
2000 24.5442 15.6822 14.9995 245.9931
11
Table 2: Time Series for Tunisia
Year Llr pcr bdr rgdp
1996 46.0225 48.8067 36.0321 1743.74
1997 48.1615 47.5828 38.2282 1813.536
1998 49.4405 48.7962 39.3322 1876.205
1999 50.9677 48.8294 40.3478 1963.991
2000 54.605 53.1492 43.7798 2033.071
2001 56.1331 57.1981 45.5335 2108.913
2002 58.4117 59.5272 47.2955 2120.055
2003 57.1218 58.3255 46.1847 2224.77
2004 56.8293 58.2998 46.4354 2337.122
2005 58.9073 60.0528 48.4716 2412.397
Table 3: Time Series for Tanzania
Year llr pcr bdr Rgdp
1996 21.9841 4.4241 14.678 253.0263
1997 19.5539 3.0814 13.1927 255.4859
1998 18.3521 3.7249 12.5247 258.7052
1999 18.0459 4.2732 12.2517 261.5302
2000 18.5975 4.4316 12.8029 268.2303
2001 18.9531 4.5071 13.6432 277.8862
2001 26.1392 15.3362 16.2111 237.9157
2002 23.95 13.3844 16.2703 240.5463
2003 23.862 14.2132 18.3394 240.2094
2004 26.4359 15.8993 20.6205 240.7338
2005 28.478 16.9002 22.0857 237.1547
12
2002 19.9563 5.1765 14.7567 290.4531
2003 21.5444 6.6206 16.1864 299.0575
2004 21.0916 7.4539 15.9761 311.0443
2005 24.6267 8.9208 18.9296 323.8417
Table 4: Time Series for Uganda
Year llr pcr bdr Rgdp
1996 10.9502 4.4198 7.6369 214.9474
1997 11.7424 4.5096 8.5124 219.2205
1998 12.5713 4.5914 9.3281 223.2856
1999 13.6531 5.208 10.1499 234.2139
2000 14.2259 5.2279 10.6857 240.0347
2001 15.31 5.0121 11.6458 244.2106
2002 16.7173 4.9609 12.8386 251.7746
2003 17.658 5.319 13.6737 255.3492
2004 17.6943 5.5014 13.7752 260.6697
2005 17.8651 5.3874 13.8932 269.2797
Table 5: Time Series for South Africa
Year llr pcr bdr Rgdp
1996 47.5964 57.0558 45.0357 3019.966
1997 50.2232 60.0383 47.6385 3029.774
1998 53.2823 63.5497 50.7814 2974.682
1999 54.4287 65.1732 51.8763 2972.204
2000 52.7026 64.996 50.1436 3019.947
2001 48.3147 69.8244 50.8579 3046.309
2002 42.4572 63.777 50.0018 3127.809
13
2003 41.9225 59.6852 51.3644 3186.26
2004 40.384 62.1936 51.3092 3301.071
2005 41.4266 65.3924 53.2743 3428.973
Table 6: Time Series for Niger
Year llr pcr bdr Rgdp
1996 12.8354 4.2251 6.9918 167.9679
1997 10.5011 3.7052 5.8092 166.455
1998 7.4821 3.5422 4.6794 177.246
1999 6.914 3.8822 4.5292 169.9728
2000 8.425 4.6702 5.5463 161.666
2001 9.3728 4.9679 6.182 167.1001
2002 9.7845 5.084 6.5798 166.144
2003 10.827 5.2062 6.7173 167.4735
2004 13.8676 5.9907 7.9314 160.7414
2005 14.0486 6.4895 8.0553 166.3869
Table 7: Time Series for Nigeria
Year llr pcr bdr Rgdp
1996 12.8536 8.4201 8.1133 367.982
1997 13.8668 9.7126 9.097 367.7034
1998 16.7043 11.8227 11.1183 364.6282
1999 18.7888 12.4598 13.157 358.9527
2000 17.5213 10.416 12.6448 368.5392
2001 25.1358 14.8838 18.0648 370.2718
2002 26.7725 16.09 19.3215 366.5663
2003 24.7666 14.5882 17.2996 395.7558
14
2004 24.407 15.3438 17.1914 409.3429
2005 17.3979 12.1546 13.121 428.4006
Table 8: Time Series for Kenya
Year llr pcr bdr Rgdp
1996 37.9709 19.5837 27.572 422.6506
1997 39.4461 21.9409 30.3111 413.4464
1998 39.0445 23.3882 30.5958 416.0453
1999 38.4162 24.9332 29.9474 414.6931
2000 37.4519 25.5872 29.2896 406.5445
2001 36.3671 24.0622 28.9476 411.1291
2002 37.7525 23.503 30.2175 402.8055
2003 39.4971 23.0859 31.6685 403.9767
2004 39.0288 23.2237 31.7099 413.5642
2005 38.448 23.8282 31.6453 425.8653
Table 9: Time Series for Morocco
Year llr pcr bdr Rgdp
1996 70.75 27.2908 46.3059 1280.803
1997 72.3646 37.7556 52.9304 1233.059
1998 69.4816 46.5368 55.5144 1308.17
1999 74.1824 50.5895 57.9418 1296.227
2000 79.5742 54.1697 62.6492 1301.872
2001 81.045 52.7253 64.4227 1382.973
2002 86.9772 53.3611 69.1398 1411.385
2003 88.3486 53.2476 70.6133 1480.456
15
2004 90.6856 54.2893 72.8972 1540.83
2005 97.1707 57.7634 78.3015 1561.895
Table 10: Time Series for Botswana
Year llr pcr bdr Rgdp
1996 20.882 11.3541 18.8731 2725.958
1997 20.7326 9.7002 18.9318 2938.997
1998 24.4001 10.5208 22.6142 3185.879
1999 27.409 12.6244 25.3974 3354.679
2000 26.0662 14.0234 23.9447 3572.956
2001 24.8806 13.9019 23.2918 3706.873
2002 27.9743 16.6799 26.5515 3867.929
2003 27.6 17.9923 26.1646 4060.682
2004 29.4949 19.6398 27.353 4264.321
2005 28.1966 19.3561 26.1213 4382.497
Step 2: Save your panel data generated from tables 1 – 10 above in a directory you can easily
remember (say Desktop) and ensure the excel worksheet is in 2003 version. Kindly note the file
name for your worksheet. This is important for loading data into Eviews for estimation purpose.
Step 3: Load data into Eviews. This requires the following sub-steps:
a. View your data in the excel sheet to ensure they are structured in panel data form.
b. Click the Eviews 7 icon. You will find the Eviews window as shown below:
16
c. Define the structure of the Panel Data in Eviews. The following pictures show relevant
steps on how to do this.
(i) Opening the Eviews Workfile
17
(ii) Choose Balanced Panel as the workfile structure type
(iii) By choosing balanced panel, it allows you to indicate the time and cross-section dimensions
for your panel data as shown below:
18
(iv) By clicking OK in (iii) above, you will find the Eviews work area in panel data form
Time Series
Dimension
Cross-sectional
Dimension
Work Area
Total Observations
19
d. Import panel data from the excel file: Although, there are several methods of loading data into
Eviews, for the purpose of this exercise however, we will consider the import method. Students
may explore other methods later. This also involves a number of sub-steps as follows:
(i) Choose import method as shown below
20
(ii) By clicking Read Text-Lotus-Excel in (di) above, you will find a box prompting you to indicate
the excel file name containing the panel data. Ensure the excel file is not opened before you
import the data.
(iii) Following the information provided in (dii) above and clicking ok, you will find the import
box below:
File Directory
File name
File type
Column  row identifiers for the
first cell of actual data in excel
No. of columns
for actual data
Sample period
21
(iv) By clicking ok after providing necessary information in (diii) above, the Eviews work area is updated
to reflect the data imported. This is shown below:
When you compare (div) with (civ), you will find that the former contains data on variables intended
to be modelled in panel data form while the latter is an empty Eviews workfile.
(e) At this juncture, save your Eviews workfile to secure the data
22
Also note the file name for your Eviews workfile and its directory.
Activity 2
Structure tables 1 -5 in exercise 1 in panel data form in excel sheet and create an Eviews
workfile for the generated excel sheet using the import method.
4.0 Panel Data Models
Panel data models are broadly divided into two namely Static Panel data models and Dynamic
Panel Models. The most notable difference between the two models is the inclusion of the
lagged dependent variable as a regressor in the latter. This exercise focuses on the static panel
data models. A typical static panel data regression can be expressed as:
ܻ௜௧ = ߙ + ෍ ߚ௞ܺ௞௜௧
௃
௞ୀଵ
+ ‫ݑ‬௜௧ ; ݇ = 1, … , ‫ܬ‬ (4)
where ܻ is the dependent variable and ܺ௞ are the explanatory variables. The subscripts ݅ and ‫ݐ‬
as earlier defined refer to cross-sectional dimension and time series dimension respectively. ‫ݑ‬௜௧
is the composite error term which can be decomposed further into specific effects and
remainder disturbance term. Henceforth, we will refer to ݅ as individual observations which of
course may imply either households, firms, countries or any other group of units. There are two
sets of specific effects namely the individual specific effects and time specific effects. If only one
set of specific effects is included in the regression, such is referred to as one-way error
components model. However, if both sets of specific effects are included, we refer to the model
as two-way error components model. Equations (5) and (6) show decomposition of ‫ݑ‬௜௧ into
one-way and two-way error components.
‫ݑ‬௜௧ = ߮௜ + ߝ௜௧ (5ܽ)
‫ݑ‬௜௧ = ߛ௧ + ߝ௜௧ (5ܾ)
‫ݑ‬௜௧ = ߮௜ + ߛ௧ + ߝ௜௧ (6)
where ߮௜ and ߛ௧ denote the unobserved individual and time specific effects respectively. We
shall limit our empirical applications to the one-way error components.
4.1 Estimation of Panel Data Models
Basically, the static panel data models can be estimated using Ordinary Least Square (OLS),
Fixed Effects (FE) and Random Effects (RE). Each of these methods has its underlying
assumptions which must necessarily be satisfied to obtain unbiased and efficient estimates. We
consider in turn each of these methods with the underlying assumptions.
23
Figure 5: Panel Data Models
4.1.1 Pooled Regression
The pooled regression involves estimating equation (4) with OLS. The Pooled regression is
employed based on the assumption that the ܺ௞ are able to capture all the relevant
characteristics of the individual units and, therefore, the unobserved specific effects as
expressed in equations (5) and (6) are redundant. Thus, ߮௜ in the case of equation (5a), ߛ௧ in
equation (5b) and both effects in the case of (6) will be dropped and and a pooled OLS
regression may be used to fit the model, treating all the observations for all of the time periods
as a single sample. However, ignoring these effects when in fact they are significant yields
inefficient estimates and biased standard errors.
Empirical Application
We are using the panel data in the Eviews workfile created from tables 1-10 to illustrate the
pooled regression. The Panel data regression model is given as:
‫݌݀݃ݎ‬௜௧ = ߙ + ߚଵ݈݈‫ݎ‬௜௧ + ߚଶ‫ݎܿ݌‬௜௧ + ߚଷܾ݀‫ݎ‬௜௧ + ‫ݑ‬௜௧
We have carefully structured the procedure for estimating the model in Eviews in the following pictures
below:
Panel Data Models
Static Panel Data
Models
Dynamic Panel Data
Models
Pooled
Regression
Fixed
Effects
Arellano
and Bover
Random
Effects
Arellano 
Bond
Blundell and
Bond
24
Model specification and estimation
(a) Equation Editor
(b) Pooled Regression
25
(c) Pooled Regression Results
Interpretation of Regression Results
Based on the probability values, llr and bdr are statistically significant while pcr is not. Since the
regressors in the model are in percentages, therefore, 1% increase in llr leads to a significant
reduction in rgdp by 122.64USD on the average. Conversely, a 1% increase in bdr leads to a
significant rise in bdr by 177.15USD on the average. Although, the effect of pcr is not
statistically significant, its increase by 1% has the potential effect of reducing rgdp by 3.37USD.
Note that llr is a typical measure of financial depth in economy and thus of the overall size of
the financial sector; pcr is a measure of the activity of financial intermediaries (i.e. ability to
channell savings to investors) and bdr is a measure of size and efficiency of the financial
intermediaries as it captures the ability of the banks to mobilize funds from the public (the
surplus unit). Based on these definitions, the economic interpretations can also be offered.
The two dominant approaches used when the unobserved specific effects are significant in a
panel data model are the fixed effects regressions and the random effects regression models.
We consider these two in the next sections.
Activity 3
Using the Eviews workfile created in exercise 2, estimate the panel data model below assuming
there is no specific effects.
Z[]IA B C * DR^^ZIA * D_]`ZIA * DaYZIA * bI * FIA
Prob. Value (Pv) is used to
determine the level of
significance of each regressor in
the model. If the Pv is  0.05 for
example, it implies that the
regressor in question is
statistically significant at 5%
level; otherwise, it is not
significant at that level.
26
4.1.2 Fixed Effects Regressions
As earlier emphasized, one of the approaches used to capture specific effects in panel data
models is the fixed effects (FE) regression. Thus, equation (4) is modified using any of the error
components equations (5a, 5b or 6) depending on the specific effects being considered. The FE
approach is based on the assumption that the effects are fixed parameters that can be
estimated. Consequently, a number of econometric problems may be encountered: (i) the
unobserved specific fixed effects may be correlated with the regressors employed; (ii) some of
the regressors may be correlated with shocks (remainder disturbance term) that affect the
dependent variable; and (iii) there is possibility of simultaneity biases resulting from the
endogeneity of some regressors. The FE approach can overcome these problems by using any
of the three estimation methods namely: Within Group (WG) estimator; Least Square Dummy
Variable (LSDV) estimator and First Difference (FD) estimator. The WG estimator involves the
deviation approach which eliminates the unobserved effects. Similarly, the FD estimator
involves taking the first difference of the modified panel data model to eliminate the
unobserved effects. Thus, under the WG estimator and FD estimator, the model is transformed
in such a way that the unobserved effect is eliminated. The LSDV estimator however involves
the inclusion of dummy variables as regressors in the panel data model (equation 4) to capture
the specific effects. In this case, either the intercept ߙ or one of the dummy variables must be
dropped to avoid perfect collinearity or dummy trap problem.
We provide below the specifications in respect of these estimation techniques for fixed effects.
Within Regression: ܻ௜௧ − ܻ
ത௜. = ෍ ߚ௞(ܺ௞௜௧ −
௃
௞ୀଵ
ܺ
ത௞௜.) + ߝ௜௧ − ߝ௜. ; ݇ = 1, … , ‫ܬ‬ (7)
First Difference Regression: ܻ௜௧ − ܻ௜௧ିଵ = ෍ ߚ௞(ܺ௞௜௧ −
௃
௞ୀଵ
ܺ௜௧ିଵ) + ߝ௜௧ − ߝ௜௧ିଵ (8)
Or
First Difference Regression: ∆ܻ௜௧ = ෍ ߚ௞∆ܺ௞௜௧
௃
௞ୀଵ
+ ߝ௜௧ − ߝ௜௧ିଵ (8)
LSDV Regression: ܻ௜௧ = ߙ + ෍ ߚ௞ܺ௞௜௧
௃
௞ୀଵ
+ ෍ ߛ௜Γ௜ +
ே
௜ୀଶ
ߝ௜௧ ; (9)
Or
LSDV Regression: ܻ௜௧ = ෍ ߚ௞ܺ௞௜௧
௃
௞ୀଵ
+ ෍ ߛ௜Γ௜ +
ே
௜ୀଵ
ߝ௜௧ ; (9)
To confirm whether the specific effects estimated are actually fixed effects, the F-test is used in
this regard. The null hypothesis for this test is given as:
‫ܪ‬଴: ߤଵ = ߤଶ = ⋯ = ߤଶ = 0
27
We reject ‫ܪ‬଴ when the F-statistic is statistically significant suggesting that the fixed effects are
important in the model, otherwise, we do not reject it.
Empirical Application
We are still using the panel data in the Eviews workfile created from tables 1-10 to illustrate the
FE regressions. The Panel data regression model with fixed effects is given as:
Z[]IA B C * DR^^ZIA * D_]`ZIA * DaYZIA * bI * FIA
(a) Equation editor
(b) Model Specification
Click panel options to include
the fixed effects
28
(c) Specifying the fixed effects
(d) Fixed Effects Regression
Click ok after specifying
the fixed effects
Fixed effects
29
The interpretation of regression results here is not different from the style used under the
pooled regression. However, we need to further test whether the inclusion of the effects is
significant or not.
(e) Fixed Effects Testing
Prob. Value (Pv) is used to
test the significance of the
fixed effects in the model.
If the Pv is  0.05 for
example, it implies that the
effects are statistically
significant at 5% level;
otherwise, they are not
significant at that level.
1
2
3
30
Recall that the null hypothesis for the fixed effect test is that the fixed effects are not important
in the model. Based on the Pv, we can conclude that the fixed effects are significant in the
model and, therefore, estimating without these effects which is the case in pooled regression
will yield biased standard errors and policy prescriptions drawn from the analysis will be invalid.
Activity 4
Using the Eviews workfile created in exercise 2, estimate the panel data model below assuming
the cross-sectional units are fixed parameters that can be estimated. Also, conduct relevant
diagnostic tests to validate your results.
‫݌݀݃ݎ‬௜௧ = ߙ + ߚଵ݈݈‫ݎ‬௜௧ + ߚଶ‫ݎܿ݌‬௜௧ + ߚଷܾ݀‫ݎ‬௜௧ + ߤ௜ + ߝ௜௧
4.1.3 Random Effects Regressions
One of the fundamental limitations of fixed effects regressions is that there is usually a high loss
of degrees of freedom when there are too many parameters to estimate due to large cross-
sectional units. This high loss of degrees of freedom can be avoided if the cross-sectional units
can be assumed random. In addition, the specific effects (߮௜  ߛ௧) are assumed independent of
ߝ௜௧ and also ܺ௞ are independent of both the specific effects (߮௜  ߛ௧) and ߝ௜௧. Thus, random
effects model assumes exogeneity of all regressors. Practically, this may be the case if the
individual observations constitute a random sample from a given population. However, where
the unobserved specific effects are not distributed independently of the explanatory variables,
the fixed effects should be used instead to avoid the problem of unobserved heterogeneity
bias. The random effects regression is given as:
ܻ௜௧ − ߶ܻ
ത௜. = ෍ ߚ௞(ܺ௞௜௧ −
௃
௞ୀଵ
߶ܺ
ത௞௜.) + ߝ௜௧ − ߶ߝҧ௜. (10)
Where ߶ = 1 −
ఙഄ
మ
்ఙഋ
మାఙഄ
మ . If there are no random effects, then, ߪఓ
ଶ
= 0 and therefore ߶ = 0. By
substituting this into equation (10) gives back equation (4). In such a case, we can perform
either the pooled regression or the fixed effects regression. Thus, we may need to test further
using the F-test earlier explained to ascertain whether there are fixed effects or not.
To confirm whether the specific effects estimated are actually random effects and are
uncorrelated with the explanatory variables, we carry out the Hausman test. The rejection of
the null hypothesis (when the statistic is statistically significant) implies an adoption of the fixed
effects model and non-rejection is considered as an adoption of the random effects model. The
rejection of the null hypothesis also implies correlated specific effects are better captured with
31
fixed effects model. Figure 6 below provides some useful hints on how to choose the most
appropriate panel data regression model.
Empirical Application
We are still using the panel data in the Eviews workfile created from tables 1-10. The Panel data
regression model with random effects is given as:
Z[]IA B C * DR^^ZIA * D_]`ZIA * DaYZIA * bI * FIA
(a) Equation editor
(b) Model Specification
Click panel options to include
the random effects
32
(c) Specifying the random effects
(d) Random Effects Regression
33
The interpretation of regression results here is not different from the style used under the
pooled regression. However, we need to further test whether the inclusion of the random
effects is significant and also whether these effects are correlated or not. As earlier explained,
the Hausman test is used in this regard.
(e) Random Effects Testing
Prob. Value (Pv) is used to
test whether the random
effects are correlated in the
model. If the Pv is  0.05
for example, it implies that
the effects are correlated at
5% level; otherwise, they
are not at that level.
34
It should be noted that one of the assumptions underlying the use of random effects model is
that specific effects are independently and identically distributed. To confirm whether the
specific effects estimated are actually random effects and are uncorrelated, the Hausman test is
often used. As earlier emphasized, the rejection of the null hypothesis (when the statistic is
statistically significant) implies an adoption of the fixed effects model and non-rejection is
considered as an adoption of the random effects model. The rejection of the null hypothesis
also implies correlated specific effects are better captured with fixed effects model. Based on
the pv of the test summary as shown above, the effects are correlated and therefore the fixed
effects regression may give a better fit than the random effects model.
Figure 6: Static Panel Data Models
Yes No
No
Yes
Yes No
Are the cross-sectional units
randomly sampled?
Consider Pooled, Fixed Effects
 Random Effects regressions
Consider Fixed Effects
Regression and Pooled
regressions
Perform Hausman Test to
choose the appropriate Model
Does the Hausaman Test
indicate non-rejection of H0?
Perform Random Effects
Regression
Perform F-test to choose btw
the two regressions
Does the F-test indicate non-
rejection of H0?
Perform Pooled Reg. Perform Fixed Effects Reg.
35
The table below will guide you as regards the interpretation of linear regression results.
Model Interpretation
Linear – Linear:
‫ݕ‬௜௧ = ߙ଴ + ߙଵ‫ݔ‬௜௧ + ‫ݒ‬௜௧
a 1 unit change in ‫ݔ‬ will lead to a ߙଵ unit change in the mean of ‫ݕ‬. For
example, if the estimated coefficient is 0.05 that implies a one unit
increase in ‫ݔ‬ will generate a 0.05 unit increase in ‫ݕ‬.
Linear – Log (Semi-log):
‫ݕ‬௜௧ = ߙ଴ + ߙଵln‫ݔ‬௜௧ + ‫ݒ‬௜௧
a 1 per cent change in ‫ݔ‬ will lead to a (ߙଵ/100) unit change in the
mean of ‫ݕ‬. For example, if the estimated coefficient is 72.07, it implies
that a 1 per cent increase in ‫ݔ‬, will lead to a 0.72 unit increase in ‫ݕ‬.
Log – Linear (Semi-log):
ln‫ݕ‬௜௧ = ߙ଴ + ߙଵ‫ݔ‬௜௧ + ‫ݒ‬௜௧
a 1 unit change in ‫ݔ‬ will lead to a (100ߙଵ) per cent change in the mean
of ‫ݕ‬. For example, if the estimated coefficient is -0.20 that means a one
unit increase in ‫ݔ‬ will lead to a 20 per cent decrease in ‫ݕ‬.
Double log:
ln‫ݕ‬௜௧ = ߙ଴ + ߙଵln‫ݔ‬௜௧ + ‫ݒ‬௜௧
a 1 per cent change in ‫ݔ‬ will lead to a ߙଵ per cent change in the mean
of ‫ݕ‬. For example, if the estimated coefficient is 0.05 that implies a one
per cent increase in ‫ݔ‬ will yield a 0.05 per cent increase in ‫ݕ‬.
Activity 5
Using the Eviews workfile created in exercise 2, estimate the panel data model below assuming
the cross-sectional units are randomly drawn. Also, conduct relevant diagnostic tests to validate
your results.
‫݌݀݃ݎ‬௜௧ = ߙ + ߚଵ݈݈‫ݎ‬௜௧ + ߚଶ‫ݎܿ݌‬௜௧ + ߚଷܾ݀‫ݎ‬௜௧ + ߤ௜ + ߝ௜௧
References
Baltagi, B.H. (2008). Econometric Analysis of Panel Data. Fourth Edition. John Wiley  Sons Ltd.,
UK.
Hsiao, C. (2003). Analysis of Panel Data. Cambridge University Press, Cambridge.
36

Más contenido relacionado

La actualidad más candente

Eco Basic 1 8
Eco Basic 1 8Eco Basic 1 8
Eco Basic 1 8
kit11229
 
Dummy variable
Dummy variableDummy variable
Dummy variable
Akram Ali
 
Trade creation & diversion notes
Trade creation & diversion notesTrade creation & diversion notes
Trade creation & diversion notes
mattbentley34
 

La actualidad más candente (20)

Panel data analysis
Panel data analysisPanel data analysis
Panel data analysis
 
Types of data by kamran khan
Types of data by kamran khanTypes of data by kamran khan
Types of data by kamran khan
 
PanelDataAnalysis.pdf
PanelDataAnalysis.pdfPanelDataAnalysis.pdf
PanelDataAnalysis.pdf
 
Dummy variables
Dummy variablesDummy variables
Dummy variables
 
MACROECONOMICS-CH4
MACROECONOMICS-CH4MACROECONOMICS-CH4
MACROECONOMICS-CH4
 
ECONOMETRICS
ECONOMETRICSECONOMETRICS
ECONOMETRICS
 
Granger causality test
Granger causality testGranger causality test
Granger causality test
 
Functional Forms of Regression Models | Eonomics
Functional Forms of Regression Models | EonomicsFunctional Forms of Regression Models | Eonomics
Functional Forms of Regression Models | Eonomics
 
Ch8 slides
Ch8 slidesCh8 slides
Ch8 slides
 
Basic concepts of_econometrics
Basic concepts of_econometricsBasic concepts of_econometrics
Basic concepts of_econometrics
 
Eco Basic 1 8
Eco Basic 1 8Eco Basic 1 8
Eco Basic 1 8
 
Econometrics chapter 8
Econometrics chapter 8Econometrics chapter 8
Econometrics chapter 8
 
Basic theory of IS-LM model
Basic theory of IS-LM modelBasic theory of IS-LM model
Basic theory of IS-LM model
 
Natural rate of unemploymenty
Natural rate of unemploymentyNatural rate of unemploymenty
Natural rate of unemploymenty
 
Econometrics
EconometricsEconometrics
Econometrics
 
Dummy variable
Dummy variableDummy variable
Dummy variable
 
panel data.ppt
panel data.pptpanel data.ppt
panel data.ppt
 
Trade creation & diversion notes
Trade creation & diversion notesTrade creation & diversion notes
Trade creation & diversion notes
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Introduction to multilevel modelling | Ian Brunton-Smith
Introduction to multilevel modelling | Ian Brunton-SmithIntroduction to multilevel modelling | Ian Brunton-Smith
Introduction to multilevel modelling | Ian Brunton-Smith
 

Similar a Panel data content

Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...
Alexander Decker
 
Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...
Alexander Decker
 
Introduction Q Ti A
Introduction  Q Ti AIntroduction  Q Ti A
Introduction Q Ti A
Zoha Qureshi
 
Data Matrix Of Cpi Data Distribution After Transformation...
Data Matrix Of Cpi Data Distribution After Transformation...Data Matrix Of Cpi Data Distribution After Transformation...
Data Matrix Of Cpi Data Distribution After Transformation...
Kimberly Jones
 
Statics for the management
Statics for the managementStatics for the management
Statics for the management
Rohit Mishra
 
Social Science Statistics STA2122.501 ● ONLINE Project 3
Social Science Statistics STA2122.501 ● ONLINE Project 3Social Science Statistics STA2122.501 ● ONLINE Project 3
Social Science Statistics STA2122.501 ● ONLINE Project 3
ChereCheek752
 

Similar a Panel data content (20)

Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...
 
Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...
 
Time Series Analysis.pptx
Time Series Analysis.pptxTime Series Analysis.pptx
Time Series Analysis.pptx
 
econometric modeling.pptx
econometric modeling.pptxeconometric modeling.pptx
econometric modeling.pptx
 
Introduction Q Ti A
Introduction  Q Ti AIntroduction  Q Ti A
Introduction Q Ti A
 
Crowdfunding
CrowdfundingCrowdfunding
Crowdfunding
 
Data Matrix Of Cpi Data Distribution After Transformation...
Data Matrix Of Cpi Data Distribution After Transformation...Data Matrix Of Cpi Data Distribution After Transformation...
Data Matrix Of Cpi Data Distribution After Transformation...
 
currency wars
currency warscurrency wars
currency wars
 
Article1
Article1Article1
Article1
 
Unit 1 - Statistics (Part 1).pptx
Unit 1 - Statistics (Part 1).pptxUnit 1 - Statistics (Part 1).pptx
Unit 1 - Statistics (Part 1).pptx
 
F0351036039
F0351036039F0351036039
F0351036039
 
Forecasting Gasonline Price in Vietnam Based on Fuzzy Time Series and Automat...
Forecasting Gasonline Price in Vietnam Based on Fuzzy Time Series and Automat...Forecasting Gasonline Price in Vietnam Based on Fuzzy Time Series and Automat...
Forecasting Gasonline Price in Vietnam Based on Fuzzy Time Series and Automat...
 
Oriskany' TCS Multi-Factor Timing - Factor Investing White Paper
Oriskany' TCS Multi-Factor Timing - Factor Investing White PaperOriskany' TCS Multi-Factor Timing - Factor Investing White Paper
Oriskany' TCS Multi-Factor Timing - Factor Investing White Paper
 
Statics for the management
Statics for the managementStatics for the management
Statics for the management
 
Statics for the management
Statics for the managementStatics for the management
Statics for the management
 
Machine Learning in Banking
Machine Learning in Banking Machine Learning in Banking
Machine Learning in Banking
 
Statistics
StatisticsStatistics
Statistics
 
Dynamic Causal Relationships among the Greater China Stock markets
Dynamic Causal Relationships among the Greater China Stock marketsDynamic Causal Relationships among the Greater China Stock markets
Dynamic Causal Relationships among the Greater China Stock markets
 
Social Science Statistics STA2122.501 ● ONLINE Project 3
Social Science Statistics STA2122.501 ● ONLINE Project 3Social Science Statistics STA2122.501 ● ONLINE Project 3
Social Science Statistics STA2122.501 ● ONLINE Project 3
 
Notes1
Notes1Notes1
Notes1
 

Último

Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)
amitlee9823
 
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
tufbav
 
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcRCALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
dollysharma2066
 
怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证
怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证
怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证
ehyxf
 
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
amitlee9823
 
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
motiram463
 

Último (20)

Deira Dubai Escorts +0561951007 Escort Service in Dubai by Dubai Escort Girls
Deira Dubai Escorts +0561951007 Escort Service in Dubai by Dubai Escort GirlsDeira Dubai Escorts +0561951007 Escort Service in Dubai by Dubai Escort Girls
Deira Dubai Escorts +0561951007 Escort Service in Dubai by Dubai Escort Girls
 
Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)
Call Girls Chickpet ☎ 7737669865☎ Book Your One night Stand (Bangalore)
 
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
Book Paid Lohegaon Call Girls Pune 8250192130Low Budget Full Independent High...
 
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night StandCall Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In RT Nagar ☎ 7737669865 🥵 Book Your One night Stand
 
9004554577, Get Adorable Call Girls service. Book call girls & escort service...
9004554577, Get Adorable Call Girls service. Book call girls & escort service...9004554577, Get Adorable Call Girls service. Book call girls & escort service...
9004554577, Get Adorable Call Girls service. Book call girls & escort service...
 
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
怎样办理维多利亚大学毕业证(UVic毕业证书)成绩单留信认证
 
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...Top Rated  Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
Top Rated Pune Call Girls Shirwal ⟟ 6297143586 ⟟ Call Me For Genuine Sex Ser...
 
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcRCALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
CALL GIRLS IN Saket 83778-77756 | Escort Service In DELHI NcR
 
怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证
怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证
怎样办理圣芭芭拉分校毕业证(UCSB毕业证书)成绩单留信认证
 
Shikrapur Call Girls Most Awaited Fun 6297143586 High Profiles young Beautie...
Shikrapur Call Girls Most Awaited Fun  6297143586 High Profiles young Beautie...Shikrapur Call Girls Most Awaited Fun  6297143586 High Profiles young Beautie...
Shikrapur Call Girls Most Awaited Fun 6297143586 High Profiles young Beautie...
 
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
Vip Mumbai Call Girls Kalyan Call On 9920725232 With Body to body massage wit...
 
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
VVIP Pune Call Girls Warje (7001035870) Pune Escorts Nearby with Complete Sat...
 
(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...
(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...
(ISHITA) Call Girls Service Aurangabad Call Now 8617697112 Aurangabad Escorts...
 
HLH PPT.ppt very important topic to discuss
HLH PPT.ppt very important topic to discussHLH PPT.ppt very important topic to discuss
HLH PPT.ppt very important topic to discuss
 
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
(=Towel) Dubai Call Girls O525547819 Call Girls In Dubai (Fav0r)
 
Introduction-to-4x4-SRAM-Memory-Block.pptx
Introduction-to-4x4-SRAM-Memory-Block.pptxIntroduction-to-4x4-SRAM-Memory-Block.pptx
Introduction-to-4x4-SRAM-Memory-Block.pptx
 
Develop Keyboard Skill.pptx er power point
Develop Keyboard Skill.pptx er power pointDevelop Keyboard Skill.pptx er power point
Develop Keyboard Skill.pptx er power point
 
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Th...
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Th...Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Th...
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Th...
 
VVIP Pune Call Girls Balaji Nagar (7001035870) Pune Escorts Nearby with Compl...
VVIP Pune Call Girls Balaji Nagar (7001035870) Pune Escorts Nearby with Compl...VVIP Pune Call Girls Balaji Nagar (7001035870) Pune Escorts Nearby with Compl...
VVIP Pune Call Girls Balaji Nagar (7001035870) Pune Escorts Nearby with Compl...
 
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
(👉Ridhima)👉VIP Model Call Girls Mulund ( Mumbai) Call ON 9967824496 Starting ...
 

Panel data content

  • 1. Topic: Topic: Topic: Topic: Introduction to Panel Data Analysis Introduction to Panel Data Analysis Introduction to Panel Data Analysis Introduction to Panel Data Analysis
  • 2. 2 Introduction to Panel Data Analysis OUTLINE OUTLINE OUTLINE OUTLINE 1.0 1.0 1.0 1.0Preamble Preamble Preamble Preamble Time Time Time Time series data series data series data series data Cross Cross Cross Cross- - - -sectional data sectional data sectional data sectional data Panel data Panel data Panel data Panel data Practical illustrations Practical illustrations Practical illustrations Practical illustrations Objectives of the course Objectives of the course Objectives of the course Objectives of the course 2.0 2.0 2.0 2.0 Why panel data? Why panel data? Why panel data? Why panel data? 3.0 3.0 3.0 3.0 Structure of panel data Structure of panel data Structure of panel data Structure of panel data Balanced panel vs. Unbalanced panel Balanced panel vs. Unbalanced panel Balanced panel vs. Unbalanced panel Balanced panel vs. Unbalanced panel Macro panel vs. Micro panel Macro panel vs. Micro panel Macro panel vs. Micro panel Macro panel vs. Micro panel Practical illustrations Practical illustrations Practical illustrations Practical illustrations Activity 1 Activity 1 Activity 1 Activity 1 Activ Activ Activ Activity 2 ity 2 ity 2 ity 2 4.0 4.0 4.0 4.0 Panel data models Panel data models Panel data models Panel data models Static panel vs. Dynamic panel Static panel vs. Dynamic panel Static panel vs. Dynamic panel Static panel vs. Dynamic panel Estimation of static panel data models Estimation of static panel data models Estimation of static panel data models Estimation of static panel data models Pooled regression Pooled regression Pooled regression Pooled regression Practical illustrations Practical illustrations Practical illustrations Practical illustrations Activity 3 Activity 3 Activity 3 Activity 3 Fixed effects Fixed effects Fixed effects Fixed effects Empirical illustrations Empirical illustrations Empirical illustrations Empirical illustrations Activity 4 Activity 4 Activity 4 Activity 4 Random effects Random effects Random effects Random effects Empirical illustrations Empirical illustrations Empirical illustrations Empirical illustrations Activity 5 Activity 5 Activity 5 Activity 5
  • 3. 3 Introduction to Panel Data Analysis 1.0 Preamble In applied research, three categories of data sets have remained prominent. They are time series data, cross-sectional data and panel data. A time series data set is a collection of observations on several variables over a period of time e.g. Exchange rate and Consumer Price Index (CPI) in Nigeria over the period 1980 - 2008. A cross-sectional data set refers to data collected on several units such as households, firms, countries, states, regions etc. at a particular point in time. Examples of these data in Nigeria are the 2009 Labour Force Survey (LFS) and 2006 Core Welfare Indicators Questionnaire (CWIQ). However, a panel data set refers to data covering several units over a period of time. Thus, a panel data set is a combination of time series data and cross-sectional data. Examples of panel data are Gross Domestic Product (GDP) per capita for Sub-saharan African (SSA) countries over the period 1960 – 2008 and stock prices of listed companies in Nigeria over the period 1990 - 2010. In a typical symbolic representation, time series variables are usually denoted by subscript ‫ݐ‬ while cross-sectional variables are denoted by subscript ݅. Since panel data have both time series and cross-sectional dimensions, their variables are represented by subscript ݅‫ݐ‬. Using the absolute income hypothesis as an example, we can specify the following single-equation models in respect of these data sets below: ‫ܥ‬௧ = ߙ + ߚܻ௧ + ߝ௧; ‫ݐ‬ = 1, … , ܶ (1) ‫ܥ‬௜ = ߙ + ߚܻ௜ + ߝ௜; ݅ = 1, … , ܰ (2) ‫ܥ‬௜௧ = ߙ + ߚܻ௜௧ + ߝ௜௧; (3) Where ‫ܥ‬ denotes consumption and ܻ represents disposable income. ‫ݐ‬ is the time series dimension and ݅ is the cross-sectional dimension. Equation (1) above follows a time series framework, equation (2) follows a cross-sectional framework while equation (3) follows a panel data framework. If we are testing this hypothesis in a time series framework using Nigerian data for example over the period 1980 - 2010, it implies that ‫ݐ‬ = 1980, 1981, … ,2010. If it follows a cross-sectional framework using data covering SSA for year 2010, then, ݅= list of SSA. If we are dealing with a panel data framework also using data covering SSA however over a period of 1980 - 2010, ݅= list of SSA and ‫ݐ‬ = 1980, 1981, … ,2010. Note the following on the three modelling frameworks: 1. On the time series case, we are dealing with only one unit – ܰ = 1 (i.e. one individual, one firm, one country, state or any other single type of unit) over a period of time - ‫ݐ‬ = 1, … , ܶ. Thus, we can draw one or more variables over a period of time for the single unit. The total number of observations is ܶ.
  • 4. 4 2. On the cross-sectional case, we are dealing with several units - ݅ = 1, … , J (individuals, firms, states or countries) at a specific point in time – H B 1. That is, each unit has only one data point on each variable. Therefore, the total number of observations is J. 3. As regards the panel data framework, several units are considered over a period of time. That is, one or more variables can be drawn on each unit over a period of time. Therefore, the total number of observations is JH. We consider practical illustrations of these data sets below. Figure 1: Time series data set covering selected macroeconomic variables in Nigeria over the period 1980 – 2007. Time Series Dimension
  • 5. 5 Figure 2: Cross-sectional data set covering selected SSA for year 2005 Figure 3: Panel Data covering selected SSA over the period 1996-2005 Cross-sectional Dimension
  • 6. 6 For the purpose of this exercise, our subsequent discussions shall focus on panel data and, therefore, we provide a more detailed explanation on the latter in the next sections. At the end of the lecture, students should be able to: Structure panel data for the purpose of estimation Deal with pooled regression models, fixed effect models and random effect models Compute relevant diagnostic tests to validate panel data regressions Interpret panel data regressions 2.0 Why Panel Data? The use of panel data in applied research is increasingly gaining relevance both in developed and developing countries partly because of the need to harmonize regional policies and more generally due to the inherent benefits for empirical analyses. Hsiao (2003) documents the advantages of using panel data. These are represented in Baltagi (2008) and summarized as follows: 1. Panel data provide sufficient observations and, consequently, more sample variability, less collinearity, more degrees of freedom, and more accurate inference of model parameters. For example, when dealing with cross-sectional data, ܶ = 1 and similarly, for time series data, ܰ = 1. However, in the case of panel data, ‫ݐ‬ = 1, … , ܶ like wise ݅ = 1, … , ܰ providing more observations and more sample variability than either cross- sectional data or time series data. 2. In connection with (1), panel data models better capture the complexity of human behaviour than a single cross-section or time series data. For example, consider a cross- sectional sample of university students with an average grade of 50% in all the courses for the same period in time. This suggests that every student has the chances of having 50% grade based on information obtained from their performance at a particular year/level of academic study. Thus, current information about a student’s academic performance is a perfect predictor of his/her future performance. However, sequential observations for students contain information about their academic performance in different years/levels of academic study that are captured in the cross-sectional framework. With panel data models, performance of each student can be observed over time and more informed judgments can be made. Similarly, consider a time series sample of a student’s academic performance. Generalizing with the information obtained from the student will lead to unbiased and inefficient estimates. Therefore, by pooling a cross-sectional sample of students over time, variations in each unit over time are captured. 3. In connection with (2), panel data models are better able to capture the heterogeneity inherent in each individual unit. The structure of panel data suggests that the cross- sectional units whether individuals, firms, states or countries are heterogeneous. In empirical modelling, ignoring these heterogeneous effects when in fact they exist leads to biased and inefficient results. We can illustrate this with an empirical example using Baltagi and Levin (1992) paper. They consider cigarette demand across 46 American
  • 7. 7 states for the years 1963-88. Consumption is modelled as a function of lagged consumption, price and income. They however note that there are a lot of variables that may be state-invariant or time-invariant that may affect consumption. Examples of these state invariant variables are advertisement on nationwide television and radio and national policies while time-invariant variables are religion and education. 4. In connection (2), behaviour of economic agents is intrinsically dynamic and this can be accounted for using panel data models. Cross-sectional models can only ascertain the behavioural pattern at a particular point and therefore cannot be used to capture behavioural dynamics. Similarly, time series data that provides information over a period of time only limits itself to one unit and, therefore, we are unable to compare changes in the behaviour of different economic agents. For example, we may be interested in quantifying the consumption pattern of a number of cross-sections over time. These dynamics can neither be captured by cross-sectional framework nor time series models. However, since panel data sets provide time series on each cross- sectional unit in a group, it becomes relatively easy to evaluate the changes in the behavioural pattern using the data. 5. The use of micro-data in panel regressions gives more accurate prediction than the prediction of aggregate data often used in the time series case. More notably, if the micro units are heterogeneous, policy prescriptions drawn from time series aggregate data may be invalid. Panel data containing time series observations for a number of individuals is ideal for investigating the “homogeneity” versus “heterogeneity” issue (Hsiao, 2006). Although not limited to those highlighted above, these benefits have provided some reasonable justification for the consideration of panel data in empirical and policy based studies. 3.0 Structure of Panel Data Essentially, panel data sets are broadly categorized into two namely: Balanced Panel data and Unbalanced Panel data. The balanced panels refer to a case where all the cross-sectional units have equal time series dimension. That is, all the cross-sectional units are observed over the same period of time. Thus, balanced panels are also referred to complete panels. The unbalanced panels on the other hand refer to situations where the cross-section units under scrutiny have unequal time periods. While some cross-sectional units may have complete information over a specified period (say ܶ), others may have unequal time periods less than ܶ which may be due to missing observations. This is practically possible particularly in longitudinal data sets of ܶ dimension where the death of one or more individuals in a survey or exit of one or more firms in a market being surveyed or withdrawal of one or more countries from an economic union/cooperation being studied creates some missing observations for the affected cross-sectional units. Whether balanced or unbalanced, panel data sets can either be micro-based or macro-based. When large cross-sectional units (usually households or individuals) are surveyed over a short time period (not exceeding a maximum of ܶ = 20), such panel data are referred to as micro
  • 8. 8 panels. However, when large cross-sectional units (usually countries) are surveyed over a considerable period of time (exceeding 20 time periods), such are regarded as macro panels. Thus, for micro panels, ܰ is large and ܶ is small (less than 20) while for macro panels, we are dealing with large ܰ and ܶ. As a result of the nature of time series dimension of both micro and macro panels, they attract different treatments in empirical research. Since we are dealing with large ܶ in macro panels, the problem of unit root (i.e. non-stationarity) must be addressed. Also, because macro panels usually involve cross-section of countries, the problem of cross- sectional dependence may also be evident and therefore, has to be dealt otherwise the results will be inefficient. We provide below a schema showing the structure of panel data. Figure 4: Panel Data Structure Equal T for all i’s Unequal T for the i’s If ܶ 20 If ܶ 20 Panel Data Balanced/Complete Panels Unbalanced/Incomplete Panels Micro Panels Macro Panels Stationarity and Cross- sectional Independence Unit Root and Cross- sectional Dependence
  • 9. 9 3.1 Some Practical Illustrations How can we properly structure panel data for estimation purpose? Our instructional software for the purpose of this exercise is Eviews 7 version. We provide a step-by-step procedure on how to structure panel data. Step 1:It is convenient to start with an empty Excel Sheet. We provide below a figure showing a fresh excel sheet with appropriate labels. Figure 5: Panel Data Structure in Excel Sheet Individuals/Households/ Firms/States/Countries Code for representing each CS for ease of identification Day/Week/Month/Year List of variables of interest
  • 10. 10 For example, if CS=countries= Ghana, Nigeria and Time period = years= 2004 – 2009 with the same set of variables as in figure 5, then we have: Figure 6: Panel Data for CS=countries= Ghana, Nigeria and Time period = years= 2004 – 2009 Activity 1: Arrange the following time series data in tables 1- 10 for selected SSA in panel structure in excel worksheet. Note that llr=Liquid Liabilities as a percentage of GDP; pcr = Private Credit by Deposit Money Banks as a percentage of GDP; bdr=Bank Deposits as a percentage of GDP; and rgdp=GDP per capita (constant 2000 US$). Also, all the countries have the same units of measurement. This is very important in panel data to make meaningful comparative analyses. Table 1: Time Series for Togo Year llr Pcr bdr Rgdp 1996 23.0315 16.0748 15.0802 250.4119 1997 20.0072 15.3494 13.713 276.0873 1998 21.3712 17.1686 14.4285 259.7281 1999 21.1455 15.905 13.4929 256.5784 2000 24.5442 15.6822 14.9995 245.9931
  • 11. 11 Table 2: Time Series for Tunisia Year Llr pcr bdr rgdp 1996 46.0225 48.8067 36.0321 1743.74 1997 48.1615 47.5828 38.2282 1813.536 1998 49.4405 48.7962 39.3322 1876.205 1999 50.9677 48.8294 40.3478 1963.991 2000 54.605 53.1492 43.7798 2033.071 2001 56.1331 57.1981 45.5335 2108.913 2002 58.4117 59.5272 47.2955 2120.055 2003 57.1218 58.3255 46.1847 2224.77 2004 56.8293 58.2998 46.4354 2337.122 2005 58.9073 60.0528 48.4716 2412.397 Table 3: Time Series for Tanzania Year llr pcr bdr Rgdp 1996 21.9841 4.4241 14.678 253.0263 1997 19.5539 3.0814 13.1927 255.4859 1998 18.3521 3.7249 12.5247 258.7052 1999 18.0459 4.2732 12.2517 261.5302 2000 18.5975 4.4316 12.8029 268.2303 2001 18.9531 4.5071 13.6432 277.8862 2001 26.1392 15.3362 16.2111 237.9157 2002 23.95 13.3844 16.2703 240.5463 2003 23.862 14.2132 18.3394 240.2094 2004 26.4359 15.8993 20.6205 240.7338 2005 28.478 16.9002 22.0857 237.1547
  • 12. 12 2002 19.9563 5.1765 14.7567 290.4531 2003 21.5444 6.6206 16.1864 299.0575 2004 21.0916 7.4539 15.9761 311.0443 2005 24.6267 8.9208 18.9296 323.8417 Table 4: Time Series for Uganda Year llr pcr bdr Rgdp 1996 10.9502 4.4198 7.6369 214.9474 1997 11.7424 4.5096 8.5124 219.2205 1998 12.5713 4.5914 9.3281 223.2856 1999 13.6531 5.208 10.1499 234.2139 2000 14.2259 5.2279 10.6857 240.0347 2001 15.31 5.0121 11.6458 244.2106 2002 16.7173 4.9609 12.8386 251.7746 2003 17.658 5.319 13.6737 255.3492 2004 17.6943 5.5014 13.7752 260.6697 2005 17.8651 5.3874 13.8932 269.2797 Table 5: Time Series for South Africa Year llr pcr bdr Rgdp 1996 47.5964 57.0558 45.0357 3019.966 1997 50.2232 60.0383 47.6385 3029.774 1998 53.2823 63.5497 50.7814 2974.682 1999 54.4287 65.1732 51.8763 2972.204 2000 52.7026 64.996 50.1436 3019.947 2001 48.3147 69.8244 50.8579 3046.309 2002 42.4572 63.777 50.0018 3127.809
  • 13. 13 2003 41.9225 59.6852 51.3644 3186.26 2004 40.384 62.1936 51.3092 3301.071 2005 41.4266 65.3924 53.2743 3428.973 Table 6: Time Series for Niger Year llr pcr bdr Rgdp 1996 12.8354 4.2251 6.9918 167.9679 1997 10.5011 3.7052 5.8092 166.455 1998 7.4821 3.5422 4.6794 177.246 1999 6.914 3.8822 4.5292 169.9728 2000 8.425 4.6702 5.5463 161.666 2001 9.3728 4.9679 6.182 167.1001 2002 9.7845 5.084 6.5798 166.144 2003 10.827 5.2062 6.7173 167.4735 2004 13.8676 5.9907 7.9314 160.7414 2005 14.0486 6.4895 8.0553 166.3869 Table 7: Time Series for Nigeria Year llr pcr bdr Rgdp 1996 12.8536 8.4201 8.1133 367.982 1997 13.8668 9.7126 9.097 367.7034 1998 16.7043 11.8227 11.1183 364.6282 1999 18.7888 12.4598 13.157 358.9527 2000 17.5213 10.416 12.6448 368.5392 2001 25.1358 14.8838 18.0648 370.2718 2002 26.7725 16.09 19.3215 366.5663 2003 24.7666 14.5882 17.2996 395.7558
  • 14. 14 2004 24.407 15.3438 17.1914 409.3429 2005 17.3979 12.1546 13.121 428.4006 Table 8: Time Series for Kenya Year llr pcr bdr Rgdp 1996 37.9709 19.5837 27.572 422.6506 1997 39.4461 21.9409 30.3111 413.4464 1998 39.0445 23.3882 30.5958 416.0453 1999 38.4162 24.9332 29.9474 414.6931 2000 37.4519 25.5872 29.2896 406.5445 2001 36.3671 24.0622 28.9476 411.1291 2002 37.7525 23.503 30.2175 402.8055 2003 39.4971 23.0859 31.6685 403.9767 2004 39.0288 23.2237 31.7099 413.5642 2005 38.448 23.8282 31.6453 425.8653 Table 9: Time Series for Morocco Year llr pcr bdr Rgdp 1996 70.75 27.2908 46.3059 1280.803 1997 72.3646 37.7556 52.9304 1233.059 1998 69.4816 46.5368 55.5144 1308.17 1999 74.1824 50.5895 57.9418 1296.227 2000 79.5742 54.1697 62.6492 1301.872 2001 81.045 52.7253 64.4227 1382.973 2002 86.9772 53.3611 69.1398 1411.385 2003 88.3486 53.2476 70.6133 1480.456
  • 15. 15 2004 90.6856 54.2893 72.8972 1540.83 2005 97.1707 57.7634 78.3015 1561.895 Table 10: Time Series for Botswana Year llr pcr bdr Rgdp 1996 20.882 11.3541 18.8731 2725.958 1997 20.7326 9.7002 18.9318 2938.997 1998 24.4001 10.5208 22.6142 3185.879 1999 27.409 12.6244 25.3974 3354.679 2000 26.0662 14.0234 23.9447 3572.956 2001 24.8806 13.9019 23.2918 3706.873 2002 27.9743 16.6799 26.5515 3867.929 2003 27.6 17.9923 26.1646 4060.682 2004 29.4949 19.6398 27.353 4264.321 2005 28.1966 19.3561 26.1213 4382.497 Step 2: Save your panel data generated from tables 1 – 10 above in a directory you can easily remember (say Desktop) and ensure the excel worksheet is in 2003 version. Kindly note the file name for your worksheet. This is important for loading data into Eviews for estimation purpose. Step 3: Load data into Eviews. This requires the following sub-steps: a. View your data in the excel sheet to ensure they are structured in panel data form. b. Click the Eviews 7 icon. You will find the Eviews window as shown below:
  • 16. 16 c. Define the structure of the Panel Data in Eviews. The following pictures show relevant steps on how to do this. (i) Opening the Eviews Workfile
  • 17. 17 (ii) Choose Balanced Panel as the workfile structure type (iii) By choosing balanced panel, it allows you to indicate the time and cross-section dimensions for your panel data as shown below:
  • 18. 18 (iv) By clicking OK in (iii) above, you will find the Eviews work area in panel data form Time Series Dimension Cross-sectional Dimension Work Area Total Observations
  • 19. 19 d. Import panel data from the excel file: Although, there are several methods of loading data into Eviews, for the purpose of this exercise however, we will consider the import method. Students may explore other methods later. This also involves a number of sub-steps as follows: (i) Choose import method as shown below
  • 20. 20 (ii) By clicking Read Text-Lotus-Excel in (di) above, you will find a box prompting you to indicate the excel file name containing the panel data. Ensure the excel file is not opened before you import the data. (iii) Following the information provided in (dii) above and clicking ok, you will find the import box below: File Directory File name File type Column row identifiers for the first cell of actual data in excel No. of columns for actual data Sample period
  • 21. 21 (iv) By clicking ok after providing necessary information in (diii) above, the Eviews work area is updated to reflect the data imported. This is shown below: When you compare (div) with (civ), you will find that the former contains data on variables intended to be modelled in panel data form while the latter is an empty Eviews workfile. (e) At this juncture, save your Eviews workfile to secure the data
  • 22. 22 Also note the file name for your Eviews workfile and its directory. Activity 2 Structure tables 1 -5 in exercise 1 in panel data form in excel sheet and create an Eviews workfile for the generated excel sheet using the import method. 4.0 Panel Data Models Panel data models are broadly divided into two namely Static Panel data models and Dynamic Panel Models. The most notable difference between the two models is the inclusion of the lagged dependent variable as a regressor in the latter. This exercise focuses on the static panel data models. A typical static panel data regression can be expressed as: ܻ௜௧ = ߙ + ෍ ߚ௞ܺ௞௜௧ ௃ ௞ୀଵ + ‫ݑ‬௜௧ ; ݇ = 1, … , ‫ܬ‬ (4) where ܻ is the dependent variable and ܺ௞ are the explanatory variables. The subscripts ݅ and ‫ݐ‬ as earlier defined refer to cross-sectional dimension and time series dimension respectively. ‫ݑ‬௜௧ is the composite error term which can be decomposed further into specific effects and remainder disturbance term. Henceforth, we will refer to ݅ as individual observations which of course may imply either households, firms, countries or any other group of units. There are two sets of specific effects namely the individual specific effects and time specific effects. If only one set of specific effects is included in the regression, such is referred to as one-way error components model. However, if both sets of specific effects are included, we refer to the model as two-way error components model. Equations (5) and (6) show decomposition of ‫ݑ‬௜௧ into one-way and two-way error components. ‫ݑ‬௜௧ = ߮௜ + ߝ௜௧ (5ܽ) ‫ݑ‬௜௧ = ߛ௧ + ߝ௜௧ (5ܾ) ‫ݑ‬௜௧ = ߮௜ + ߛ௧ + ߝ௜௧ (6) where ߮௜ and ߛ௧ denote the unobserved individual and time specific effects respectively. We shall limit our empirical applications to the one-way error components. 4.1 Estimation of Panel Data Models Basically, the static panel data models can be estimated using Ordinary Least Square (OLS), Fixed Effects (FE) and Random Effects (RE). Each of these methods has its underlying assumptions which must necessarily be satisfied to obtain unbiased and efficient estimates. We consider in turn each of these methods with the underlying assumptions.
  • 23. 23 Figure 5: Panel Data Models 4.1.1 Pooled Regression The pooled regression involves estimating equation (4) with OLS. The Pooled regression is employed based on the assumption that the ܺ௞ are able to capture all the relevant characteristics of the individual units and, therefore, the unobserved specific effects as expressed in equations (5) and (6) are redundant. Thus, ߮௜ in the case of equation (5a), ߛ௧ in equation (5b) and both effects in the case of (6) will be dropped and and a pooled OLS regression may be used to fit the model, treating all the observations for all of the time periods as a single sample. However, ignoring these effects when in fact they are significant yields inefficient estimates and biased standard errors. Empirical Application We are using the panel data in the Eviews workfile created from tables 1-10 to illustrate the pooled regression. The Panel data regression model is given as: ‫݌݀݃ݎ‬௜௧ = ߙ + ߚଵ݈݈‫ݎ‬௜௧ + ߚଶ‫ݎܿ݌‬௜௧ + ߚଷܾ݀‫ݎ‬௜௧ + ‫ݑ‬௜௧ We have carefully structured the procedure for estimating the model in Eviews in the following pictures below: Panel Data Models Static Panel Data Models Dynamic Panel Data Models Pooled Regression Fixed Effects Arellano and Bover Random Effects Arellano Bond Blundell and Bond
  • 24. 24 Model specification and estimation (a) Equation Editor (b) Pooled Regression
  • 25. 25 (c) Pooled Regression Results Interpretation of Regression Results Based on the probability values, llr and bdr are statistically significant while pcr is not. Since the regressors in the model are in percentages, therefore, 1% increase in llr leads to a significant reduction in rgdp by 122.64USD on the average. Conversely, a 1% increase in bdr leads to a significant rise in bdr by 177.15USD on the average. Although, the effect of pcr is not statistically significant, its increase by 1% has the potential effect of reducing rgdp by 3.37USD. Note that llr is a typical measure of financial depth in economy and thus of the overall size of the financial sector; pcr is a measure of the activity of financial intermediaries (i.e. ability to channell savings to investors) and bdr is a measure of size and efficiency of the financial intermediaries as it captures the ability of the banks to mobilize funds from the public (the surplus unit). Based on these definitions, the economic interpretations can also be offered. The two dominant approaches used when the unobserved specific effects are significant in a panel data model are the fixed effects regressions and the random effects regression models. We consider these two in the next sections. Activity 3 Using the Eviews workfile created in exercise 2, estimate the panel data model below assuming there is no specific effects. Z[]IA B C * DR^^ZIA * D_]`ZIA * DaYZIA * bI * FIA Prob. Value (Pv) is used to determine the level of significance of each regressor in the model. If the Pv is 0.05 for example, it implies that the regressor in question is statistically significant at 5% level; otherwise, it is not significant at that level.
  • 26. 26 4.1.2 Fixed Effects Regressions As earlier emphasized, one of the approaches used to capture specific effects in panel data models is the fixed effects (FE) regression. Thus, equation (4) is modified using any of the error components equations (5a, 5b or 6) depending on the specific effects being considered. The FE approach is based on the assumption that the effects are fixed parameters that can be estimated. Consequently, a number of econometric problems may be encountered: (i) the unobserved specific fixed effects may be correlated with the regressors employed; (ii) some of the regressors may be correlated with shocks (remainder disturbance term) that affect the dependent variable; and (iii) there is possibility of simultaneity biases resulting from the endogeneity of some regressors. The FE approach can overcome these problems by using any of the three estimation methods namely: Within Group (WG) estimator; Least Square Dummy Variable (LSDV) estimator and First Difference (FD) estimator. The WG estimator involves the deviation approach which eliminates the unobserved effects. Similarly, the FD estimator involves taking the first difference of the modified panel data model to eliminate the unobserved effects. Thus, under the WG estimator and FD estimator, the model is transformed in such a way that the unobserved effect is eliminated. The LSDV estimator however involves the inclusion of dummy variables as regressors in the panel data model (equation 4) to capture the specific effects. In this case, either the intercept ߙ or one of the dummy variables must be dropped to avoid perfect collinearity or dummy trap problem. We provide below the specifications in respect of these estimation techniques for fixed effects. Within Regression: ܻ௜௧ − ܻ ത௜. = ෍ ߚ௞(ܺ௞௜௧ − ௃ ௞ୀଵ ܺ ത௞௜.) + ߝ௜௧ − ߝ௜. ; ݇ = 1, … , ‫ܬ‬ (7) First Difference Regression: ܻ௜௧ − ܻ௜௧ିଵ = ෍ ߚ௞(ܺ௞௜௧ − ௃ ௞ୀଵ ܺ௜௧ିଵ) + ߝ௜௧ − ߝ௜௧ିଵ (8) Or First Difference Regression: ∆ܻ௜௧ = ෍ ߚ௞∆ܺ௞௜௧ ௃ ௞ୀଵ + ߝ௜௧ − ߝ௜௧ିଵ (8) LSDV Regression: ܻ௜௧ = ߙ + ෍ ߚ௞ܺ௞௜௧ ௃ ௞ୀଵ + ෍ ߛ௜Γ௜ + ே ௜ୀଶ ߝ௜௧ ; (9) Or LSDV Regression: ܻ௜௧ = ෍ ߚ௞ܺ௞௜௧ ௃ ௞ୀଵ + ෍ ߛ௜Γ௜ + ே ௜ୀଵ ߝ௜௧ ; (9) To confirm whether the specific effects estimated are actually fixed effects, the F-test is used in this regard. The null hypothesis for this test is given as: ‫ܪ‬଴: ߤଵ = ߤଶ = ⋯ = ߤଶ = 0
  • 27. 27 We reject ‫ܪ‬଴ when the F-statistic is statistically significant suggesting that the fixed effects are important in the model, otherwise, we do not reject it. Empirical Application We are still using the panel data in the Eviews workfile created from tables 1-10 to illustrate the FE regressions. The Panel data regression model with fixed effects is given as: Z[]IA B C * DR^^ZIA * D_]`ZIA * DaYZIA * bI * FIA (a) Equation editor (b) Model Specification Click panel options to include the fixed effects
  • 28. 28 (c) Specifying the fixed effects (d) Fixed Effects Regression Click ok after specifying the fixed effects Fixed effects
  • 29. 29 The interpretation of regression results here is not different from the style used under the pooled regression. However, we need to further test whether the inclusion of the effects is significant or not. (e) Fixed Effects Testing Prob. Value (Pv) is used to test the significance of the fixed effects in the model. If the Pv is 0.05 for example, it implies that the effects are statistically significant at 5% level; otherwise, they are not significant at that level. 1 2 3
  • 30. 30 Recall that the null hypothesis for the fixed effect test is that the fixed effects are not important in the model. Based on the Pv, we can conclude that the fixed effects are significant in the model and, therefore, estimating without these effects which is the case in pooled regression will yield biased standard errors and policy prescriptions drawn from the analysis will be invalid. Activity 4 Using the Eviews workfile created in exercise 2, estimate the panel data model below assuming the cross-sectional units are fixed parameters that can be estimated. Also, conduct relevant diagnostic tests to validate your results. ‫݌݀݃ݎ‬௜௧ = ߙ + ߚଵ݈݈‫ݎ‬௜௧ + ߚଶ‫ݎܿ݌‬௜௧ + ߚଷܾ݀‫ݎ‬௜௧ + ߤ௜ + ߝ௜௧ 4.1.3 Random Effects Regressions One of the fundamental limitations of fixed effects regressions is that there is usually a high loss of degrees of freedom when there are too many parameters to estimate due to large cross- sectional units. This high loss of degrees of freedom can be avoided if the cross-sectional units can be assumed random. In addition, the specific effects (߮௜ ߛ௧) are assumed independent of ߝ௜௧ and also ܺ௞ are independent of both the specific effects (߮௜ ߛ௧) and ߝ௜௧. Thus, random effects model assumes exogeneity of all regressors. Practically, this may be the case if the individual observations constitute a random sample from a given population. However, where the unobserved specific effects are not distributed independently of the explanatory variables, the fixed effects should be used instead to avoid the problem of unobserved heterogeneity bias. The random effects regression is given as: ܻ௜௧ − ߶ܻ ത௜. = ෍ ߚ௞(ܺ௞௜௧ − ௃ ௞ୀଵ ߶ܺ ത௞௜.) + ߝ௜௧ − ߶ߝҧ௜. (10) Where ߶ = 1 − ఙഄ మ ்ఙഋ మାఙഄ మ . If there are no random effects, then, ߪఓ ଶ = 0 and therefore ߶ = 0. By substituting this into equation (10) gives back equation (4). In such a case, we can perform either the pooled regression or the fixed effects regression. Thus, we may need to test further using the F-test earlier explained to ascertain whether there are fixed effects or not. To confirm whether the specific effects estimated are actually random effects and are uncorrelated with the explanatory variables, we carry out the Hausman test. The rejection of the null hypothesis (when the statistic is statistically significant) implies an adoption of the fixed effects model and non-rejection is considered as an adoption of the random effects model. The rejection of the null hypothesis also implies correlated specific effects are better captured with
  • 31. 31 fixed effects model. Figure 6 below provides some useful hints on how to choose the most appropriate panel data regression model. Empirical Application We are still using the panel data in the Eviews workfile created from tables 1-10. The Panel data regression model with random effects is given as: Z[]IA B C * DR^^ZIA * D_]`ZIA * DaYZIA * bI * FIA (a) Equation editor (b) Model Specification Click panel options to include the random effects
  • 32. 32 (c) Specifying the random effects (d) Random Effects Regression
  • 33. 33 The interpretation of regression results here is not different from the style used under the pooled regression. However, we need to further test whether the inclusion of the random effects is significant and also whether these effects are correlated or not. As earlier explained, the Hausman test is used in this regard. (e) Random Effects Testing Prob. Value (Pv) is used to test whether the random effects are correlated in the model. If the Pv is 0.05 for example, it implies that the effects are correlated at 5% level; otherwise, they are not at that level.
  • 34. 34 It should be noted that one of the assumptions underlying the use of random effects model is that specific effects are independently and identically distributed. To confirm whether the specific effects estimated are actually random effects and are uncorrelated, the Hausman test is often used. As earlier emphasized, the rejection of the null hypothesis (when the statistic is statistically significant) implies an adoption of the fixed effects model and non-rejection is considered as an adoption of the random effects model. The rejection of the null hypothesis also implies correlated specific effects are better captured with fixed effects model. Based on the pv of the test summary as shown above, the effects are correlated and therefore the fixed effects regression may give a better fit than the random effects model. Figure 6: Static Panel Data Models Yes No No Yes Yes No Are the cross-sectional units randomly sampled? Consider Pooled, Fixed Effects Random Effects regressions Consider Fixed Effects Regression and Pooled regressions Perform Hausman Test to choose the appropriate Model Does the Hausaman Test indicate non-rejection of H0? Perform Random Effects Regression Perform F-test to choose btw the two regressions Does the F-test indicate non- rejection of H0? Perform Pooled Reg. Perform Fixed Effects Reg.
  • 35. 35 The table below will guide you as regards the interpretation of linear regression results. Model Interpretation Linear – Linear: ‫ݕ‬௜௧ = ߙ଴ + ߙଵ‫ݔ‬௜௧ + ‫ݒ‬௜௧ a 1 unit change in ‫ݔ‬ will lead to a ߙଵ unit change in the mean of ‫ݕ‬. For example, if the estimated coefficient is 0.05 that implies a one unit increase in ‫ݔ‬ will generate a 0.05 unit increase in ‫ݕ‬. Linear – Log (Semi-log): ‫ݕ‬௜௧ = ߙ଴ + ߙଵln‫ݔ‬௜௧ + ‫ݒ‬௜௧ a 1 per cent change in ‫ݔ‬ will lead to a (ߙଵ/100) unit change in the mean of ‫ݕ‬. For example, if the estimated coefficient is 72.07, it implies that a 1 per cent increase in ‫ݔ‬, will lead to a 0.72 unit increase in ‫ݕ‬. Log – Linear (Semi-log): ln‫ݕ‬௜௧ = ߙ଴ + ߙଵ‫ݔ‬௜௧ + ‫ݒ‬௜௧ a 1 unit change in ‫ݔ‬ will lead to a (100ߙଵ) per cent change in the mean of ‫ݕ‬. For example, if the estimated coefficient is -0.20 that means a one unit increase in ‫ݔ‬ will lead to a 20 per cent decrease in ‫ݕ‬. Double log: ln‫ݕ‬௜௧ = ߙ଴ + ߙଵln‫ݔ‬௜௧ + ‫ݒ‬௜௧ a 1 per cent change in ‫ݔ‬ will lead to a ߙଵ per cent change in the mean of ‫ݕ‬. For example, if the estimated coefficient is 0.05 that implies a one per cent increase in ‫ݔ‬ will yield a 0.05 per cent increase in ‫ݕ‬. Activity 5 Using the Eviews workfile created in exercise 2, estimate the panel data model below assuming the cross-sectional units are randomly drawn. Also, conduct relevant diagnostic tests to validate your results. ‫݌݀݃ݎ‬௜௧ = ߙ + ߚଵ݈݈‫ݎ‬௜௧ + ߚଶ‫ݎܿ݌‬௜௧ + ߚଷܾ݀‫ݎ‬௜௧ + ߤ௜ + ߝ௜௧ References Baltagi, B.H. (2008). Econometric Analysis of Panel Data. Fourth Edition. John Wiley Sons Ltd., UK. Hsiao, C. (2003). Analysis of Panel Data. Cambridge University Press, Cambridge.
  • 36. 36