CH 01 Wooldridge 6e PPT Updated
CH 01 Wooldridge 6e PPT Updated
1
Learning resources
Prescribed Wooldridge, J.M. (2013). Introductory
Textbook Econometrics: A Modern Approach, 5th or 6th
Edition, South-Western Cengage Learning.
2
3
Assessment
ASSESSMENT VALUE
ASSESSMENT ITEM AND DUE DATE
NUMBER (/100)
4
The Nature of
Chapter 1
Econometrics
and Economic
Data
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license distributed with a
certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. © kentoh/Shutterstock.
What is Econometrics?
6
Why do we study econometrics?
• Rare in economics (and many other areas without
labs!) to have experimental data
• Need to use non-experimental, or observational data to
make inferences
• Important to be able to apply economic theory to real
world data
7
The Nature of Econometrics
and Economic Data
● What is econometrics?
• Econometrics = use of statistical methods to analyze economic data
• Econometricians typically analyze nonexperimental data
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 8
The Nature of Econometrics
and Economic Data
● Steps in econometric analysis
• 1) Economic model (this step is often skipped)
• 2) Econometric model
● Economic models
• Maybe micro- or macromodels
• Often use optimizing behaviour, equilibrium modeling, …
• Establish relationships between economic variables
• Examples: demand equations, pricing equations, …
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 9
Why is it so important?
• An empirical analysis uses data to test a theory or to
estimate a relationship
• A formal economic model can be tested
• Theory may be ambiguous as to the effect of some
policy change – can use econometrics to evaluate the
program
10
RESEARCH PROCESS
11
The Nature of Econometrics
and Economic Data
● Economic model of crime (Becker (1968))
• Derives equation for criminal activity based on utility maximization
Hours spent in
criminal activities
Age
“Wage” of cri-
minal activities Probability of Expected
Wage for legal
Other Probability of conviction if sentence
employment
income getting caught caught
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 12
The Nature of Econometrics
and Economic Data
● Model of job training and worker productivity
• What is effect of additional training on worker productivity?
• Formal economic theory not really needed to derive equation:
Hourly wage
Years of formal
education Weeks spent
Years of work- in job training
force experience
• Other factors may be relevant, but these are the most important (?)
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 13
The Nature of Econometrics
and Economic Data
● Econometric model of criminal activity
• The functional form has to be specified
• Variables may have to be approximated by other quantities
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 14
The Nature of Econometrics
and Economic Data
● Econometric model of job training and worker productivity
Unobserved deter-
minants of the wage
16
The Nature of Econometrics
and Economic Data
● Econometric analysis requires data
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 17
The Nature of Econometrics
and Economic Data
● Cross-sectional data sets
• Sample of individuals, households, firms, cities, states, countries,
or other units of interest at a given point of time/in a given period
• Cross-sectional observations are more or less independent
• For example, pure random sampling from a population
• Sometimes pure random sampling is violated, e.g. units refuse to
respond in surveys, or if sampling is characterized by clustering
• Cross-sectional data typically encountered in applied
microeconomics
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 18
The Nature of Econometrics
and Economic Data
● Cross-sectional data set on wages and other characteristics
Indicator variables
(1 = yes, 0 = no)
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 20
The Nature of Econometrics
and Economic Data
● Time series data
• Observations of a variable or several variables over time
• For example, stock prices, money supply, consumer price index,
gross domestic product, annual homicide rates, automobile sales, …
• Time series observations are typically serially correlated
• Ordering of observations conveys important information
• Data frequency: daily, weekly, monthly, quarterly, annually, …
• Typical features of time series: trends and seasonality
• Typical applications: applied macroeconomics and finance
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 21
The Nature of Econometrics
and Economic Data
● Time series data on minimum wages and related variables
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 22
The Nature of Econometrics
and Economic Data
● Pooled cross sections
• Two or more cross sections are combined in one data set
• Cross sections are drawn independently of each other
• Pooled cross sections often used to evaluate policy changes
• Example:
• Evaluate effect of change in property taxes on house prices
• Random sample of house prices for the year 1993
• A new random sample of house prices for the year 1995
• Compare before/after (1993: before reform, 1995: after reform)
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 23
The Nature of Econometrics
and Economic Data
● Pooled cross sections on housing prices
Property tax
Size of house
in square feet
Number of bedrooms
Number of bathrooms
Before reform
After reform
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 24
The Nature of Econometrics
and Economic Data
● Panel or longitudinal data
• The same cross-sectional units are followed over time
• Panel data have a cross-sectional and a time series dimension
• Panel data can be used to account for time-invariant unobservables
• Panel data can be used to model lagged responses
• Example:
• City crime statistics; each city is observed in two years
• Time-invariant unobserved city characteristics may be modeled
• Effect of police on crime rates may exhibit time lag
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 25
The Nature of Econometrics
and Economic Data
● Two-year panel data on city crime statistics
Number of
police in 1986
Number of
police in 1990
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 26
The Nature of Econometrics
and Economic Data
● Causality and the notion of ceteris paribus
Definition of causal effect of on :
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 27
The Nature of Econometrics
and Economic Data
● Causal effect of fertilizer on crop yield
• “By how much will the production of soybeans increase if one
increases the amount of fertilizer applied to the ground”
• Implicit assumption: all other factors that influence crop yield such
as quality of land, rainfall, presence of parasites etc. are held fixed
● Experiment:
• Choose several one-acre plots of land; randomly assign different
amounts of fertilizer to the different plots; compare yields
• Experiment works because amount of fertilizer applied is unrelated
to other factors influencing crop yields
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 28
The Nature of Econometrics
and Economic Data
● Measuring the return to education
• “If a person is chosen from the population and given another
year of education, by how much will his or her wage increase?”
• Implicit assumption: all other factors that influence wages such as
experience, family background, intelligence etc. are held fixed
● Experiment:
• Choose a group of people; randomly assign different amounts of
education to them (infeasable!); compare wage outcomes
• Problem without random assignment: amount of education is
related to other factors that influence wages (e.g. intelligence)
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 29
The Nature of Econometrics
and Economic Data
● Effect of law enforcement on city crime level
• “If a city is randomly chosen and given ten additional police officers,
by how much would its crime rate fall?”
• Alternatively: “If two cities are the same in all respects, except that
city A has ten more police officers than city B, by how much would
the two cities‘ crime rates differ?”
● Experiment:
• Randomly assign number of police officers to a large number of
cities
• In reality, number of police officers will be determined by crime rate
(simultaneous determination of crime and number of police)
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 30
The Nature of Econometrics
and Economic Data
● Effect of the minimum wage on unemployment
• “By how much (if at all) will unemployment increase if the minimum
wage is increased by a certain amount (holding other things fixed)?”
● Experiment:
• Government randomly chooses minimum wage each year and
observes unemployment outcomes
• Experiment will work because level of minimum wage is unrelated
to other factors determining unemployment
• In reality, the level of the minimum wage will depend on political
and economic factors that also influence unemployment
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 31
The Nature of Econometrics
and Economic Data
● Testing predictions of economic theories
• Economic theories are not always stated in terms of causal effects
• For example, the expectations hypothesis states that long term
interest rates equal compounded expected short term interest rates
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use a s permitted in a license
distributed with a certain product or service or otherwise on a password -protected website or school-approved learning management system for classroom use. 32
Statistics Review
– Mean – Standard Deviation
– Median – Histograms
– Mode – Skewness
– Range – Kurtosis
– Interquartile Range – Covariance
– Variance – Correlation Coefficient
33
Measures of central tendency – Review
34
Mean – Review
x
data set
i
x
• (2) Divide the sum by the number
of values in the data set i
3x 1x 1x 1x 1x 2x 2x 1x 1x 1x
• (3) Find the value that has the most occurrences
mode = 11 (°F)
– Interquartile range
– Variance
– Standard deviation
43
An Example Data Set
• Daily low temperatures recorded in Chapel Hill
(01/18-01/31, 2005, °F)
Jan. 18 – 11 Jan. 25 – 25
Jan. 19 – 11 Jan. 26 – 33
Jan. 20 – 25 Jan. 27 – 22
Jan. 21 – 29 Jan. 28 – 18
Jan. 22 – 27 Jan. 29 – 19
Jan. 23 – 14 Jan. 30 – 30
Jan. 24 – 11 Jan. 31 – 27
• For these 14 values, we will calculate all measures
of dispersion
44
Range – Review
• Range – The difference between the largest and
the smallest values
• (1) Sort the data in ascending order
• (2) Find the largest value
max
• (3) Find the smallest value
min
• (4) Calculate the range
range = max - min
• Vulnerable to the influence of outliers 45
Range – Review
• Range – The difference between the largest and
the smallest values
• (1) Sort the data in ascending order
11, 11, 11, 14, 18, 19, 22, 25, 25, 27, 27, 29, 30, 33
• (2) Find the largest value
max = 33
• (3) Find the smallest value
min = 11
• (4) Calculate the range
range = 33 – 11 = 22 46
Interquartile Range – Review
(x x)
i
2
s
2 i 1
n 1
49
Variance – Review
• (1) Calculate the mean
x
• (2) Calculate the deviation for each value
xi x
• (3) Square each of the deviations
( xi x ) 2
52
Variance – Review
53
Standard Deviation – Review
• Standard deviation is equal to the square root
of the variance
(x x)
i
2
s i 1
n 1
• Compared with variance, standard deviation
has a scale closer to that used for the mean and
the original data
54
Standard Deviation – Review
• (1) Calculate the mean
x
• (2) Calculate the deviation for each value
xi x
• (3) Square each of the deviations
( xi x ) 2
• (1) – (5)
s2 = 57.8
16-20 2
21-25 3
26-30 4
31-35 1
59
Building a Histogram – Review
• 3. Plot the frequencies of each class
60
Skewness – Review
• Skewness measures the degree of asymmetry
exhibited by the data
• Positive skewness – More observations below
the mean than above it
• Negative skewness – A small number of low
observations and a large number of high ones
n
(x x)
i
3
For the example data set:
skewness i 1
3
ns Skewness = -0.2069
61
Skewness – Review
Source: http://library.thinkquest.org/10030/3smodsas.htm
62
Skewness – Review
n
Mode
Median
i
( x x ) 3
skewness i 1
3
Mean
ns
i
( x x ) 3
Mode
skewness i 1
ns 3 Median
Mean
A B
i
( x x ) 3
skewness i 1
3
ns
Source: http://mathworld.wolfram.com/NormalDistribution.html
(x x)
i
4
For the example data set:
kurtosis i
4
3
ns Kurtosis = 1.6891 > 0
67
Kurtosis – Review
n
(x x)
i
4
kurtosis i
4
3
ns
68
Kurtosis – Review
Source: http://espse.ed.psu.edu/Statistics/Chapters/Chapter3/Chap3.html 69
Kurtosis – Review
• Platykurtic– When the kurtosis < 0, the
frequencies throughout the curve are closer to be
equal (i.e., the curve is more flat and wide)
• Thus, negative kurtosis indicates a relatively flat
distribution
• Leptokurtic– When the kurtosis > 0, there are
high frequencies in only a small part of the curve
(i.e, the curve is more peaked)
• Thus, positive kurtosis indicates a relatively
peaked distribution
70
Kurtosis – Review
72
An Example - Covariance
• The covariance between Test Score and STR is negative:
so is the correlation…
73
Correlation Coefficient – Review
• The correlation coefficient is defined in terms of the
covariance:
cov( X ,Y ) XY
corr X ,Y r XY
var( X )var(Y ) X Y
74
The correlation
coefficient
measures
linear
association
75
Exercise
Obs 1 2 3 4 5 6 7 8 9 10 11 12 13
wage 3.1 3.2 3 6 5.3 8.8 11 5 3.6 18 6.3 8.1 8.8
educ 11 12 11 8 12 16 18 12 12 17 16 13 12
• Data transformation
77