Paper 1: First Draft
Use the results from the first two homework assignments, what you have learned in class and from reading the papers and textbook for the course to answer the following questions. Did assigning a person to get an encouraging phone call increase their probability of voting? Can we get an estimate of the causal effect of getting a call encouraging you to vote on your probability of voting with non experimental data by using regression to adjust for differences between people who got a call and those that didn’t (why or why not)? Be clear on what is going wrong.
Structure of Paper
You can use the papers on the reading list as a guide for what a paper should look like. You can include your tables in the body of the paper or put them at the end of the paper. Your paper should be 10 pages long wihout counting tables and you may want to include the following sections:
Abstract: One paragraphs that sums the paper up. Write this first.
Intro: One page that sums the paper up with more detail than the abstract. Should tell the reader why the questions the paper answers are important and cover data, econometric methods, results and conclusion.
Data: Describe the data you use in the analysis and how it was generated. You may need to do some research online.
Methods: Describe the statistical methods used. Include the equations for the regressions you will run.
Results: Describe and interpret your statistical findings.
Conclusion: Interpret your findings.
*1
eststo control: estpost sum newreg contact busy age female vote98 vote00 vote02 if treat_real==0
eststo treatment: estpost sum newreg contact busy age female vote98 vote00 vote02 if treat_real==1
eststo difference: estpost ttest newreg contact busy age female vote98 vote00 vote02, by (treat_real)unequal
esttab control treatment difference using “HW1.csv”, cells(“mean(pattern(1 1 0) fmt(3))sd(pattern(1 1 0) fmt(3)) b(star pattern(0 0 1) fmt (3))p(pattern(0 0 1) fmt(4))”) title(“HW1.csv”)label replace plain
*2Yes, it is correctly implemented, the treated and untreated groups are nearly the same, shows they are randomly enough.
*3
ttest vote02, by(treat_real)
*t-stat (3.5212) is higher than the t-critical,and the p-value 0.0002 is small.it is statistically significant, yet not large in a practical sense.
*4
reg vote02 treat_real
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
tab treat_real, summarize(vote02)
reg vote02 treat_real newreg
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact age
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact age female
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact age female vote98
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact age female vote98 vote00
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
reg vote02 treat_real newreg contact age female vote98 vote00 state
outreg2 using “reg_1.txt”, replace nonote se label bdec(3)
*5
*more covariates leads to more precision and less bias.
*6
*we won’t get unbiased estimate of the causal effect because of the selection bais. not all the people will answer the phone due to all sorts of different reasons.
*1
eststo control: estpost sum newreg contact busy age female vote98 vote00 vote02 if contact==0
eststo treatment: estpost sum newreg contact busy age female vote98 vote00 vote02 if contact==1
eststo difference: estpost ttest newreg contact busy age female vote98 vote00 vote02, by (contact)unequal
esttab control treatment difference using “HW2.csv”, cells(“mean(pattern(1 1 0) fmt(3))sd(pattern(1 1 0) fmt(3)) b(star pattern(0 0 1) fmt (3))p(pattern(0 0 1) fmt(4))”) title(“HW2.csv”)label replace plain
*2
* no, Table 1 does not suggest that the control group will provide a good counter factual for the treatment group’s voting potential outcomes. almost every part are different except female.
*3
*the different group in homework1 are more similar, hm2 does not. as we know, old people are much more likely to answer the phone.
*4
reg vote02 contact
outreg2 using “reg_2.numbers”, replace nonote se label bdec(3)
tab contact, summarize(vote02)
reg vote02 contact newreg
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg age
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg age female
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg age female vote98
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg age female vote98 vote00
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
reg vote02 contact newreg age female vote98 vote00 state
outreg2 using “reg_2.numbers”, append nonote se label bdec(3)
*5
*adding covariates reduces bias and increase precision, the relationship between the covariates and the outcome are mostly positive.
*6
*hw1 has not very significant effect unlike the hm2. The tables are different for the treatment and control groups are different. in hm1, standard errors remain the same as adding the covariates. in hm2 when the covariates were added, the standard errors for contact decreased.
*7
*yes, adding covariates to the regression as in question 4 reduce the bias, the upward bias of contact was cancel when the variables were added. one of the most is the age for it has most different means.
*8
*no, the regression does not eliminate all the bias. he treatment and control groups have a very different means.
1
Changed wording in several areas to more accurately describe analyses process
Changed regressions to have robust standard errors
Added additional death subcategories table without birthday variable for comparison
Removed Contractions
Removed unneeded titles from some graphs
Corrected conclusion
2
Brandon Lang
Econ 104
Prof. Dobkin
3/6/18
The Minimum Legal Drinking Age and Young Adult Mortality
Abstract
This paper attempts to show how the presence of a minimum legal drinking age (MLDA) affects the
likelihood of a young adult drinking and the mortality rates of young adults. Through the combined use of
NHIS data with mortality data, a link between drinking and mortality rates is able to be drawn, and it can
be observed as to whether or not the MLDA has an effect on these rates. If there are differences found in
the likelihood of drinking and mortality rates between the group that is below the MLDA and the group
above the MLDA while randomizing the groups so as to attempt to eliminate bias, the effect of the MLDA
on whether or not a person drinks and mortality rates can be found. In analyzing the effect of turning 21
on the drinking rate and mortality rates, a regression discontinuity design is used.
Introduction
For many years, a concern of citizens and policy makers has been at what age should the U.S. MLDA be
set at. For most of the country historically, there was no legal minimum, however this changed after the
end of prohibition, with many states setting the minimum age at which people could consume alcohol at
21 years old. This was changed in 1971 with the adoption of the 26th Amendment, which prohibited states
and the federal government from using age as a reason for denying the right to vote to citizens who were
at least 18 years old, and many states reduced their MLDA to the same. This did not last, however, as the
Drinking Age Act of 1984 required states to start introducing laws to raise the legal purchasing age of
3
alcohol by citizens to 21. A major point of concern for determining at what age people should be given
the ability to consume alcohol legally is its effects on public health. We want to if the presence of the
MLDA reduces the proportion of the population that drinks and by how much does it affect the mortality
rate of young adults
To address these questions, this paper analyzes data from the National Health Interview Sample and
mortality data for 19 to 23 year olds. These data set are used in conjunction with each other to find the
probability of people drinking and how mortality rates are affected by the presence of a MLDA and effects
of becoming 21 years old. Two groups are created for this, with one being a treatment group consisting
of subjects younger than the MLDA and a control group consisting of subjects that are older who are not
affected by the MLDA.
Data
on the subjects’ characteristics from the NHIS is used, with the sample means
of both groups being found to not be statistically different. With the groups being shown to be adequately
similar, we can look at what differences the MLDA causes between the groups, if it does at all. If it is found
that the trends differ notably between the groups and that the MLDA is the cause of this difference, it can
be estimated as to how many more people are kept from drinking or dying due to the MLDA’s existence.
Through analyses using regression discontinuity design, it is found that there is a very notable increase in
the likelihood of alcohol consumption and mortality rates for subjects after the MLDA treatment becomes
active on them. In looking closely at the causes of death, we see that several of the causes of death for
the subjects which increase in rate are related to the consumption of alcohol. This analysis supports the
hypothesis that the MLDA does in fact decrease the probability of an individual to drink and decreases the
mortality rate for subjects who’s ages are below the MLDA, restricting their ability, even though not
entirely, to consuming alcohol.
Data
4
This paper uses data from the National Health Interview Sample Adult Files 1997-2007 (NHIS), compiled
by the Centers for Disease Control (CDC) and from the mortality data of 19 to 23 year olds using official
death certificates. The NHIS selects from an average sample of 36,600 households from the civilian
population and using questionnaires they call the Family Core, Sample Adult Core, and Sample Child Core,
with data from the Family Core being used. To obtain data for the Family Core, family members who were
at their household at the time of visit were questioned about sociodemographic characteristics, the health
of the family, and any injuries of family members. The NHIS sample contains data for over 61,000 people,
showing whether or not 18 to 29 year olds consume alcohol and the rates at which those who do. It
displays variables such as if they don’t have insurance, if they have a high school diploma, they’re Hispanic,
white, black, have employment, are married, working a low wage job, are attending school, they’re male,
the number of days it’s been since they turned 21, the percentage of days out of the year that they drink
alcohol, their age, and whether or not they drink alcohol. The mortality data shows the causes of death
for specific age profiles, these include death by alcohol consumption, homicide, suicide, motor vehicle
accidents, drug use, and other external causes, with all causes being categorized as either internal causes
of death or external causes of death.
Given that we are trying to analyze the effect of the MLDA on a group which is subject to it while
comparing them against a control group that isn’t with the two groups being similar in every way except
for whether they are under the MLDA, American subjects who are under the MLDA cannot be compared
to subjects in other countries who are not under the affect of an MLDA because there are too many
different factors that can be affected by the difference in geography and culture to provide unbiased data.
Because of this limitation, we need to be able to compare subjects who are just under the MLDA to
subjects that are just older than the MLDA and can drink legally.
Methods
5
Due to the sheer volume of subjects being observed, attempting to look at the trends that may be present
between the likelihood of individuals drinking and ages becomes very difficult as figures displaying this
are far too cluttered. To provide clear, readable, and ultimately meaningful results, the regressions had
to be designed in such a way that they analyzed the average ages instead of individual ages, reducing the
clutter of the produced figures substantially. Binwidth is used to determine the distribution of the age
profile of whether or not people drink alcohol. After trial and error, the bin width that I decided to use is
50 days because it has 21 as the midpoint, so the bin spreads from 21 but don’t cut across 21, separating
people by the number of months until or after their 21st birthday. I chose this bin width because it does
not average across the threshold so that people who are almost 21 are not in the same bin with people
that just turned 21 and the graph’s results are very clearly readable. Bandwidth is used to determine what
range of age should be used to spread out the points. After I determined that the range pf ages that should
be used is between 19 and 23, I chose to use a bandwidth of 2 years. I chose this bandwidth because
people younger than 19 and older than 23 are too far away from the MLDA so they likely aren’t similar
enough to be comparable, such as people under 19 still attending high school and people over 23 likely
being safer more experienced drinkers. The results also don’t average across the threshold, creating a
clear, centered jump with easily readable results. I then chose to use a range (Y) of likelihood of alcohol
consumption of 0.4-0.7. I chose this range because it presents the results very clearly and centers the very
noticeable jump at 21. The results of this are shown in Figure 1.
Finally, to help make the estimates as accurate as they can, quadratic regressions are introduced. In order
to increase the estimate’s accuracy as much as possible, new information has to be brought in to reduce
bias. To do this, new variables are created in Stata:
-agec: The difference of a subject’s age from the MLDA
-post: Set to 1 if a subject is older than the MLDA and 0 if younger; also called over21
-agec_post: Interaction variable for agec and post; also called over21_post
6
-age_50days: Width used to determine the bin subjects are placed in
-agec_sq: Quadratic form of agec
-agec_sq_post: Interaction variable for agec_sq and post; also called agec_sq_over21
-agec_50days_sq: Quadratic form for agec_50days
-agec_50days_cu: allows for third degree regression
To find the drinking rate age profile, the first stage relationship between the MLDA and whether or not
people drink alcohol must be found; this asks subjects “do you drink yes or no; how many drinks per week
do you have on average”. This is the proportion of people that began drinking at 21. To do this, a scatter
plot showing a regression discontinuity design will be created using the regressions:
??????_????ℎ?? = ? + ? ???_50??? + ? ???_50???_?? + ? if age >= 19 & age <= 21
??????_????ℎ?? = ? + ? ???_50??? + ? ???_50???_?? + ? if age >= 21 & age <= 23
We then need to find the effect that the MLDA has on the probability of subjects to consume alcohol.
Before it is possible to analyze the effect that the MLDA has on the mortality rate, it is important to
establish that it actually has an effect on the drinking rate which potentially affects the mortality rate in
many cases. This is found through a table made with the regressions:
??????_????ℎ?? = ? + ? ????21 + ? ???? + ? ????_????21 + ? ????ℎ??? + ? if age >= 21 &
age <= 23
??????_????ℎ?? = ? + ? ????21 + ? ???? + ? ????_????21 + ? ????ℎ??? + ? ????_?? +
? ????_??_????21 + ? if age >= 21 & age <= 23
??????_????ℎ?? = ? + ? ????21 + ? ???? + ? ????_????21 + ? ????ℎ??? + ? ????_?? +
? ????_?? + ? ????_??_????21 + ? ????_??_????21 + ? if age >= 21 & age <= 23
We will then need to check our balances and see if the MLDA has any statistically significant effect on
any of the sample characteristics from the NHIS data. To do this, a regression table will be created. The
regressions used for this include:
7
????????? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
??_??????? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
???????? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
?ℎ??? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
????? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
???????? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
??????? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
???????_?? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
?????_??ℎ??? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
???? = ? + ? ???? + ? ???? + ? ????_???? + ? if age >= 19 & age <= 23
Now we can look at how all causes of death affect different age groups. This is done by creating a scatter
plot diagram in which each individual point represents a group of subjects that are of similar ages in so
that they are the same number of months away from turning or having turned 21, making a much more
readable chart. The regressions used to give the results for this are:
??? = ? + ? ????_?? + ? if age >= 19 & age < 21
??? = ? + ? ????_?? + ? if age >= 21 & age < 23
Then once we are able to see what the trends are for mortality rates and age groups, it is necessary to
look deeper by analyzing specifically how fatalities by alcohol and motor vehicle accident are affected.
To do this, we produce a scatter plot and use these regressions:
??? = ? + ? ???? + ? ????_?? + ? if age >= 19 & age < 21
??? = ? + ? ???? + ? ????_?? + ? if age >= 21 & age < 23
????ℎ?? = ? + ? ???? + ? ????_?? + ? if age >= 19 & age < 21
????ℎ?? = ? + ? ???? + ? ????_?? + ? if age >= 21 & age < 23
8
Finally, we can find the effects of age and the MLDA on the different mortality rate categories and
groups. To do this we use multiple regressions:
??? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_???? + ? ????_??_????
+ ? ????_??_???? + ? ????ℎ??? + ?
???????? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
???????? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
???????? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
??????? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
????? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
??? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_???? + ? ????_??_????
+ ? ????_??_???? + ? ????ℎ??? + ?
????ℎ?? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
??????????ℎ?? = ? + ? ???? + ? ???? + ? ????_?? + ? ????_?? + ? ????_????
+ ? ????_??_???? + ? ????_??_???? + ? ????ℎ??? + ?
We can then attempt to find how the MLDA affects the mortality rate specifically through drinking as a an
instrumental variable (IV). To do this, the reduced form must be found for the mortality categories. The
results of the reduced form and the first stage are then divided by each other to produce the instrumental
variable (IV) estimate. To confirm the finding of the IV estimate that is found through this method, an IV
regression is used.
I used robust standard errors for all of my regressions.
Results
9
Listed first are some of the figures that resulted from the trial and error process to determine the right
parameters. Using the age range of 19-23 for subjects and grouping them by the number of months they
have until or after their 21st birthday provides clear and easily understandable results.
Figure 1: Scatter Plot Tests
It appears that a bin width of 50 days allows for a good balance of minimizing clutter for ease of reading
and showing enough results so that trends in the data can be identified. Using these parameters, we can
move on to look at the effect of regressing the probability of subjects consuming alcohol with age.
10
Figure 2: Drinking Age First Stage Relationship
This provides a first stage relationship between the MLDA and whether or not people drink alcohol. It
shows that a large jump in subjects’ probability of drinking occurs at the age of 21 when they are no longer
below the MLDA.
11
Table 1: Changes in Drinking with Birthday Variable
(1) (2) (3)
VARIABLES drinks_alcohol drinks_alcohol drinks_alcohol
over21 0.088 0.111 0.115
(0.083) (0.083) (0.085)
agec 0.044*** -0.024 -0.051
(0.009) (0.037) (0.093)
o.agec_over21 – – –
o.birthday – – –
agec_sq -0.034* -0.068
(0.018) (0.108)
agec_cu -0.011
(0.035)
o.agec_sq_over21 – –
o.agec_cu_over21 –
Constant 0.559*** 0.536*** 0.532***
(0.010) (0.016) (0.021)
Observations 8,933 8,933 8,933
R-squared 0.003 0.003 0.003
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
This tables shows us that when subjects are aged above the MLDA, the probability that they will consume
alcohol increases by 11.5 percentage points. This supports the hypothesis that the MLDA has a negative
effect on the probability of people below the age of 21 consuming alcohol. With these results providing
assurance that the MLDA does in fact have a negative effect on the likelihood of minors drinking, the
effect of the MLDA on mortality rates can be observed.
12
Table 2: Sample Characteristics Test
(1) (2) (3) (4) (5) (6) (7) (8) (9) (10)
VARIABLES uninsured hs_diploma hispanic white black employed married working_lw going_school male
post -0.018 0.011 -0.010 0.016 -0.014 0.008 -0.030*** 0.010 0.007 0.015
(0.013) (0.011) (0.012) (0.014) (0.010) (0.014) (0.010) (0.014) (0.011) (0.014)
agec 0.026*** 0.023*** 0.000 -0.007 0.009 0.059*** 0.051*** 0.058*** -0.058*** -0.023**
(0.008) (0.007) (0.008) (0.009) (0.007) (0.009) (0.006) (0.009) (0.007) (0.009)
agec_post -0.017 -0.020** -0.000 0.006 -0.005 -0.026** 0.002 -0.024** 0.019* 0.029**
(0.012) (0.010) (0.011) (0.013) (0.009) (0.012) (0.009) (0.012) (0.010) (0.013)
Constant 0.318*** 0.821*** 0.241*** 0.554*** 0.157*** 0.642*** 0.152*** 0.642*** 0.166*** 0.428***
(0.010) (0.008) (0.009) (0.010) (0.007) (0.010) (0.007) (0.010) (0.008) (0.010)
Observations 18,824 18,824 18,824 18,824 18,824 18,824 18,824 18,824 18,824 18,824
R-squared 0.001 0.003 0.000 0.000 0.000 0.014 0.018 0.014 0.019 0.000
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
This table shows us that the post variable—indicating if a subject is over the age of 21 and is in the
treatment group—is not statistically significant for any of the subject characteristics with the only
exception being the marriage variable. Given that the average American adult marries at about 28 years
of age, the effect of being older than 21 being statistically significant with regards to marriage is consistent
as post-21-year-olds are closer to the average marriage age than pre-21-year-olds. This should not be a
cause for bias in this analyses, so the characteristics are shown to be similar enough.
13
Figure 3
This graph shows the mortality rate by all causes per 100,000 people for subjects within the age range of
19 and 23, with each individual point representing a group of subjects that are of similar ages in so that
they are the same number of months away from turning or having turned 21. This chart indicates that the
number of fatalities faced by subjects rises very notably once the age of 21 is reached, with the trend in
mortality diminishing as subjects become older. This means that there must be something happening at
and after the age of 21 to cause this jump.
14
Figure 4
This chart breaks down the mortality categories to isolate the observations for fatalities related to motor
vehicle accidents (MVA) and alcohol. It shows that there is a sharp increase in both mortality rates, with
both trends also diminishing as the age of the groups increases. Given that death by alcohol consumption
and MVA are directly or at least typically related to drinking and that the jump is occurring at the age of
21, this indicates that the MLDA certainly is having some kid of effect on the treated group.
15
Table 3: Effects of drinking on subcategories of deaths
(1) (2) (3) (4) (5) (6) (7) (8) (9)
VARIABLES all internal external homicide suicide drugs mva alcohol externalother
agec -5.634 -3.706 -1.929 1.158 1.336 0.895 -5.500 0.679 -0.020
(8.154) (3.482) (7.120) (2.127) (1.745) (1.267) (4.247) (0.592) (1.835)
post 8.720*** 1.962 6.757*** -0.878 2.834** -0.084 3.905** 0.470** 0.746
(2.904) (1.421) (2.360) (0.667) (1.336) (0.345) (1.485) (0.230) (0.488)
agec_sq -6.922 -6.650 -0.272 0.060 1.572 0.669 -3.435 1.005 0.468
(9.933) (4.032) (9.303) (2.532) (2.218) (1.610) (5.445) (0.631) (2.602)
agec_cu -2.055 -2.227 0.172 -0.070 0.513 0.216 -1.098 0.430** 0.409
(3.290) (1.345) (3.207) (0.835) (0.747) (0.561) (1.862) (0.197) (0.924)
agec_post 2.378 1.983 0.395 2.232 -6.254 0.950 4.557 -1.779 -0.078
(12.430) (5.727) (12.097) (3.206) (5.129) (1.746) (6.997) (1.192) (2.738)
agec_sq_post 6.995 9.836 -2.841 -4.543 2.792 -2.751 1.666 0.407 -0.796
(13.796) (6.333) (13.892) (3.776) (5.410) (2.142) (8.034) (1.428) (3.388)
agec_cu_post 2.305 1.405 0.900 1.580 -1.753 0.540 1.405 -0.924* -0.221
(4.376) (2.034) (4.442) (1.225) (1.670) (0.732) (2.573) (0.474) (1.143)
o.agec_post – – – – – – – –
birthday 4.403** 0.832 3.571** 1.508*** -2.133* 0.042 3.068*** 0.865*** 0.583*
(2.159) (1.029) (1.652) (0.374) (1.121) (0.214) (1.060) (0.136) (0.342)
Constant 92.279*** 19.208*** 73.071*** 17.571*** 11.896*** 4.001*** 29.385*** 1.421*** 9.373***
(1.495) (0.778) (1.247) (0.483) (0.315) (0.229) (0.795) (0.151) (0.258)
Observations 48 48 48 48 48 48 48 48 48
R-squared 0.703 0.823 0.526 0.447 0.534 0.740 0.747 0.727 0.408
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
These results indicate that we can expect alcohol to increase the mortality rate from alcohol, motor
vehicle accidents, and suicide. Ease of access to alcohol increasing deaths from alcohol consumption when
not being under the age of the MLDA is pretty direct, but for MVA and suicide, the correlation isn’t quite
as direct but can still be made. With nearly one third of MVA’s being a result of alcohol-impaired driving
16
crashes, it makes sense that the mortality rate from MVA’s should increase once people have access to
alcohol. In the case of suicide, alcohol is considered to be a depressant which can limit a person’s ability
to think rationally, and when a person is considering suicide this can affect their decision making process
in the worst way.
Table 4: Effects of drinking on subcategories of deaths without birthday
(1) (2) (3) (4) (5) (6) (7) (8)
VARIABLES all internal external homicide suicide drugs mva alcohol
agec -5.634 -3.706 -1.929 1.158 1.336 0.895 -5.500 0.679
(8.052) (3.438) (7.030) (2.101) (1.723) (1.251) (4.194) (0.585)
post 11.263*** 2.443** 8.820*** -0.007 1.602 -0.060 5.678*** 0.970***
(2.529) (1.029) (2.067) (0.739) (1.082) (0.262) (1.565) (0.353)
agec_sq -6.922 -6.650 -0.272 0.060 1.572 0.669 -3.435 1.005
(9.808) (3.982) (9.186) (2.500) (2.190) (1.590) (5.376) (0.623)
agec_cu -2.055 -2.227 0.172 -0.070 0.513 0.216 -1.098 0.430**
(3.249) (1.328) (3.167) (0.824) (0.738) (0.554) (1.839) (0.194)
agec_post -6.790 0.250 -7.040 -0.908 -1.814 0.863 -1.832 -3.580**
(11.262) (4.559) (10.692) (3.267) (4.394) (1.493) (6.952) (1.453)
agec_sq_post 16.047 11.547** 4.500 -1.443 -1.593 -2.665 7.974 2.185
(12.869) (5.380) (12.645) (3.787) (4.894) (1.920) (7.963) (1.630)
agec_cu_post -0.331 0.907 -1.238 0.678 -0.476 0.515 -0.431 -1.441***
(4.158) (1.800) (4.127) (1.227) (1.575) (0.673) (2.566) (0.529)
o.agec_post – – – – – – –
Constant 92.279*** 19.208*** 73.071*** 17.571*** 11.896*** 4.001*** 29.385*** 1.421***
(1.476) (0.769) (1.231) (0.477) (0.312) (0.226) (0.785) (0.149)
Observations 48 48 48 48 48 48 48 48
R-squared 0.689 0.821 0.511 0.402 0.491 0.740 0.729 0.668
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
Looking at Table 3 and Table 4 we see that being over 21 is a statistically significant factor for mortality
due to all, external, suicide, MVA, alcohol and external other causes. When we adjust for the inclusion
subjects’ birthdays the coefficients on all significant categories decrease, excluding suicides. This means
that the information supplied by Table 3 is an over estimate that doesn’t consider the fact that when
people turn 21 they are likely to overly take advantage of their newly legal abilities were illegal for them
until their 21st birthday.
17
The IV regression resulted in an estimate of 79.875512 with a standard error of 19.246303
Conclusion
The presence of the MLDA does reduce the proportion of people who drink, having an effect of
approximately 11.5 percentage points on the drinking rate. The MLDA also appears to reduce the death
rate. Categories such as internal cause of death and drugs are outside of the effects of the MLDA as the
MLDA does not stop 19-year olds from catching diseases which makes them sick or obtaining drugs
illegally off the streets. Categories such as Alcohol, MVA, and Suicide are or can be related to alcohol
consumption, so the enforcement of a MLDA helps to prevent the deaths of young adults under the age
of 21 by restricting access to alcohol. As soon as it’s legal for an individual to drink, we see immediate
spikes in the overall death rate.
Given the IV estimate that was found of approximately 80, this means that once the MLDA is no longer in
effect on subjects, the consumption of alcohol causes the death rate to increase by nearly 80 per 100,000.
This supports the claim that the MLDA does reduce the mortality rate of individuals who are aged under
the MLDA. However, when determining if a variable can be considered instrumental, there are three
assumptions that must be met. Those assumptions are the exclusion restriction, relevance condition, and
the exogeneity assumption. The MLDA affects our outcome on deaths through restricting access to alcohol
for people under the age of 21, so it has a direct effect on the outcome due to it restricting one group
from accessing alcohol, failing to meet the exclusion restriction. The MLDA meets the relevance condition
given that, despite the fact it legally restricts people from obtaining alcohol, it cannot stop them from
consuming alcohol through illegal means. Finally, the MLDA does meet the exogeneity assumption given
that the groups are shown to be randomized, even if the MLDA only affects people under the age of 21
18
while having no effect on those older, meaning that people in one age group can’t be randomly assigned
to be in the other. This leads to the conclusion that MLDA is an inadequate instrument.
1
Brandon Lang
Econ 104
Prof. Dobkin
1/31/18
Paper One
Abstract
This paper atte
m
pts to analyze the effect of sending eligible voters in Iowa a pre-recorded phone
call message telling them to vote in an upcoming election in 2002 to find if their likelihood of
voting is changed. To study the effects of the call, two approaches are used: one in which the
treatment group is assigned by subjects simply being sent the message with the control group
not be sent it, and the second in which the treatment group is assigned by subjects listening to
the message to completion and the control group not listening to the message. All kinds of data
are pooled together to help isolate the effect of the treatment when producing regression
estimates. Although the listening to completion treatment shows more of an effect on subjects’
likelihood of voting, both methods produce murky results as they are both subject to bias.
Introduction
Data
Methods
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
2
vote02 = B0+B1treat_real+u
Results
Using the treatment variable:
Table 1
untreated treated difference
mean sd mean sd b p
vote02 0.59 0.49 0.61 0.49 -0.01** (0.01)
contact 0.00 0.00 0.46 0.50 -0.46*** (0.00)
newreg 0.05 0.21 0.05 0.22 -0.00 (0.57)
busy 0.00 0.00 0.03 0.17 -0.03*** (0.00)
age 55.80 18.95 55.78 18.82 0.01 (0.94)
female 0.56 0.50 0.56 0.50 0.00 (0.35)
main voter:
turnout in 01
0.73 0.44 0.73 0.44 0.00 (0.67)
vote98 0.57 0.49 0.57 0.49 -0.00 (0.67)
county 59.70 30.64 59.55 30.70 0.16 (0.56)
Observations 85931 15000 100931
Standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
This table shows the comparison of the means, standard deviations, and p-values of the
differences between the treatment and control groups. Results were successfully randomized, as
is indicated by the means for almost all of the sample characteristics being nearly or completely
identical between the treated and untreated groups, with the only exceptions being “contact”
and “busy” due to the untreated group not having received calls.
The difference in voting rates for the treatment and control groups is .0120234, or an increase
1.2 percentage points in the likelihood of voting. It is statistically significant as the t-stat (2.77) is
higher than the t-critical (1.960) and the p-value is very small (0.006). Even though it is statistically
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
3
significant, the actual practical effect is very small.
Table 2
(1) (2) (3) (4) (5) (6) (7)
VARIABLES vote02 vote02 vote02 vote02 vote02 vote02 vote02
treat_real 0.0120**
*
0.012**
*
0.013**
*
0.012**
*
0.012**
*
0.012**
*
0.012**
*
(0.00434
)
(0.004) (0.004) (0.004) (0.004) (0.004) (0.004)
newreg
–
0.315**
*
0.105**
*
0.160**
*
0.173**
*
0.160**
*
0.160**
*
(0.007) (0.007) (0.006) (0.006) (0.007) (0.007)
main voter: turnout in
01
0.582**
*
0.443**
*
0.442**
*
0.400**
*
0.400**
*
(0.003) (0.003) (0.003) (0.004) (0.004)
vote98
0.274**
*
0.257**
*
0.274**
*
0.275**
*
(0.003) (0.003) (0.003) (0.003)
age
0.001**
*
0.002**
*
0.002**
*
(0.000) (0.000) (0.000)
female
–
0.023**
*
–
0.023**
*
(0.003) (0.003)
busy
-0.016
(0.020)
Constant 0.594*** 0.609**
*
0.162**
*
0.104**
*
0.039**
*
0.065**
*
0.065**
*
(0.00167
)
(0.002) (0.003) (0.003) (0.005) (0.005) (0.005)
Observations 100,931 100,931 100,931 100,931 100,931 98,327 98,327
R-squared 0.000 0.019 0.260 0.318 0.321 0.304 0.304
Standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
This table shows the effects of regressing the effect of the treatment on the likelihood of a subject
voting in 2002. Covariates are added progressively to reduce bias and make more accurate
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
4
estimates as to the effect of the treatment variable. Added covariates include whether or not a
subject is a newly registered voter (newreg), whether or not they voted in the 2000 election
(vote00), whether or not they voted in the 1998 election (vote98), the average age of the subjects
(age), whether or not the subject is female (female), and if the subject receiving the message was
busy when called (busy).
The addition of more covariates reduces bias and increases the precision of a model’s prediction,
as the inclusion of more variables that are strong predictors of the outcome reduces dilution in
the regression. This indicates that the treatment effect on the likelihood of voting is understated
without the inclusion of the other covariates.
Using the contact variable:
Table 3
Standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1
Table 4
NoContact Contact Difference
mean sd mean sd b p
vote02 0.59 0.49 0.67 0.47 -0.08*** (0.00)
age 55.59 18.93 58.57 18.74 -2.98*** (0.00)
female 0.56 0.50 0.58 0.49 -0.02* (0.01)
newreg 0.05 0.22 0.04 0.21 0.00 (0.10)
Vote98 0.57 0.50 0.61 0.49 -0.04*** (0.00)
main voter:
turno~01
0.73 0.44 0.77 0.42 -0.04*** (0.00)
Observations 94021 6910 100931
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
5
(1) (2) (3) (4) (5) (6)
VARIABLES vote02 vote02 vote02 vote02 vote02 vote02
contact 0.0768*** 0.060*** 0.052*** 0.052*** 0.046*** 0.040***
(0.00611) (0.006) (0.006) (0.006) (0.005) (0.005)
age
0.006*** 0.006*** 0.006*** 0.002*** 0.002***
(0.000) (0.000) (0.000) (0.000) (0.000)
female
-0.029*** -0.029*** -0.023*** -0.023***
(0.003) (0.003) (0.003) (0.003)
newreg
-0.221*** -0.047*** 0.159***
(0.007) (0.007) (0.007)
vote98
0.421*** 0.274***
(0.003) (0.003)
main voter:
turnout in 01
0.399***
(0.004)
Constant 0.591*** 0.272*** 0.279*** 0.316*** 0.271*** 0.065***
(0.00160) (0.005) (0.005) (0.005) (0.005) (0.005)
Observations 100,931 100,931 98,327 98,327 98,327 98,327
R-squared 0.002 0.050 0.058 0.067 0.219 0.304
Standard errors in parentheses
***p<0.01, **p<0.05, *p<0.1
Conclusion
Although It is apparent that the control group in Table 3 will not provide a good counter factual
for the treatment group earnings outcomes as the results in every category are very different in
most categories except for female. This indicates that the observed groups aren’t similar enough
and that bias is likely present. The fact that this is no longer completely randomly assigned but
also has an opt-in element is not accounted for and likely contributes to the different outcomes.
When focusing on the effects of subjects simply receiving the treatment, the means of the
treated and control groups were nearly identical, but this table shows greater differences
between the groups which are unlikely to simply be the result of random chance. A potentially
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
6
major contributing difference may be that the kinds of people who would pick up the phone to
listen to the message to completion are different from those who wouldn’t, with one potential
difference being that those who opt-in could be unemployed. This could mean that subjects who
listen to the message are likely to be either school-aged young people or elderly retired people.
The addition of covariates reduced the bias given that as more variables were added, the upward
bias of contact was diminished. The covariate which reduced the bias the most was age,
decreasing the effect of contact on the outcome by 1.8%. A probable reason for this could be
that, as speculated previously, a significant portion of the treatment group could be retirees. The
mean age of the treatment group being about 58 years old indicates that this is probable. As this
population is likely able to spend more time at home on account of being unemployed, it’s
reasonable to say that they have more free time to be at home to pick up the phone when called
and listen to the message to completion, potentially increasing their likelihood of voting.
Table 1 shows that the treatment and control groups have different means which indicates that
they aren’t similar enough to draw accurate conclusions about the treatment’s effects without
bias. Due to the contact variable being opt-in, the randomness of the experiment was diminished.
Due to the it being indicated that age has a large effect on contact, it is possible that older people
are more likely to vote generally, so even though they may listen to the message to completion,
they may have intended to go out to vote before even receiving it, making the treatment
pointless and giving it an upward bias in its effect on the outcome. The absence of other variables
which may effect subjects’ likelihood of voting in the election very likely causes further upward
bias for the treatment.
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
7
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
Th
is
stu
dy
re
so
ur
ce
w
as
sh
ar
ed
v
ia
C
ou
rs
eH
er
o.
co
m
Powered by TCPDF (www.tcpdf.org)
https://www.coursehero.com/file/36871964/Paper-1-pdfpdf/
http://www.tcpdf.org
We provide professional writing services to help you score straight A’s by submitting custom written assignments that mirror your guidelines.
Get result-oriented writing and never worry about grades anymore. We follow the highest quality standards to make sure that you get perfect assignments.
Our writers have experience in dealing with papers of every educational level. You can surely rely on the expertise of our qualified professionals.
Your deadline is our threshold for success and we take it very seriously. We make sure you receive your papers before your predefined time.
Someone from our customer support team is always here to respond to your questions. So, hit us up if you have got any ambiguity or concern.
Sit back and relax while we help you out with writing your papers. We have an ultimate policy for keeping your personal and order-related details a secret.
We assure you that your document will be thoroughly checked for plagiarism and grammatical errors as we use highly authentic and licit sources.
Still reluctant about placing an order? Our 100% Moneyback Guarantee backs you up on rare occasions where you aren’t satisfied with the writing.
You don’t have to wait for an update for hours; you can track the progress of your order any time you want. We share the status after each step.
Although you can leverage our expertise for any writing task, we have a knack for creating flawless papers for the following document types.
Although you can leverage our expertise for any writing task, we have a knack for creating flawless papers for the following document types.
From brainstorming your paper's outline to perfecting its grammar, we perform every step carefully to make your paper worthy of A grade.
Hire your preferred writer anytime. Simply specify if you want your preferred expert to write your paper and we’ll make that happen.
Get an elaborate and authentic grammar check report with your work to have the grammar goodness sealed in your document.
You can purchase this feature if you want our writers to sum up your paper in the form of a concise and well-articulated summary.
You don’t have to worry about plagiarism anymore. Get a plagiarism report to certify the uniqueness of your work.
Join us for the best experience while seeking writing assistance in your college life. A good grade is all you need to boost up your academic excellence and we are all about it.
We create perfect papers according to the guidelines.
We seamlessly edit out errors from your papers.
We thoroughly read your final draft to identify errors.
Work with ultimate peace of mind because we ensure that your academic work is our responsibility and your grades are a top concern for us!
Dedication. Quality. Commitment. Punctuality
Here is what we have achieved so far. These numbers are evidence that we go the extra mile to make your college journey successful.
We have the most intuitive and minimalistic process so that you can easily place an order. Just follow a few steps to unlock success.
We understand your guidelines first before delivering any writing service. You can discuss your writing needs and we will have them evaluated by our dedicated team.
We write your papers in a standardized way. We complete your work in such a way that it turns out to be a perfect description of your guidelines.
We promise you excellent grades and academic excellence that you always longed for. Our writers stay in touch with you via email.