Selection Bias: Will You Make More Going to a Private University? - YouTube

Channel: Marginal Revolution University

[3]
♪ [music] ♪
[9]
- [Josh] Welcome back.
[10]
Today we continue our pursuit of causal knowledge.
[14]
Recall that private university alumni earn wages 14% higher
[18]
on average than the wages earned by public university grads.
[22]
Does that mean private university education
[25]
causes your wages to go up?
[28]
As with most of the questions we ponder,
[30]
the facts are not in dispute,
[33]
rather it's the interpretation that's contentious.
[36]
- [Narrator] Let's compare private school graduates
[39]
with those who attended public schools.
[42]
Private university grads differ in a number of ways.
[46]
For example, they have higher SAT scores.
[50]
Attendees of private universities score 120 points higher on average.
[55]
These SAT stars sport orange sweatshirts.
[60]
Private university grads also come from wealthier families --
[64]
13% higher than public university students.
[68]
The rich kids have green pants.
[71]
It would seem that public/private comparisons
[74]
are not apples to apples.
[76]
Perhaps their 14% wage gain is caused
[79]
by pre-existing differences in earnings potential
[82]
rather than by private university attendance.
[86]
- Like many who have walked before us
[87]
in search of causal knowledge,
[89]
we are waylaid here by selection bias.
[93]
- [Narrator] Selection bias misleads us
[95]
into interpreting naive comparisons as causal effects.
[100]
- [whisper] Come with me.
[101]
- [Narrator] Here we see selection bias tricking us
[104]
by directing traffic.
[106]
Those with higher test scores go to the left
[109]
towards private universities.
[111]
Those with lower family incomes go to the right
[115]
towards public universities.
[118]
Public/private comparisons have causal force
[120]
only when the groups compared are otherwise identical on average.
[126]
For then, we can happily say ceteris paribus.
[130]
But private schools are typically more selective and more expensive
[134]
than their public university counterparts.
[137]
So those going left are not comparable to those going right.
[142]
This is how selection bias bewitches us.
[146]
- Although colleges really do select their applicants,
[149]
the term selection bias refers to any comparison plagued
[154]
by systematic differences between groups
[156]
other than the difference we're focused on.
[160]
When the groups being compared differ in many ways,
[162]
we've lost ceteris paribus.
[165]
Selection bias is the principal enemy facing metrics students
[168]
and metrics masters alike.
[171]
Our five most important weapons in the fight against it
[174]
are the Furious Five of Econometrics.
[177]
- [whisper] The Furious Five.
[178]
- [Josh] Selection bias is insidious and pervasive,
[181]
but our weapons are powerful,
[183]
for we do not need to ensure that the individuals
[186]
to be compared are identical.
[188]
We don't need virtual clones.
[191]
Rather we need only ensure that the groups compared
[194]
be the same on average.
[197]
Our most powerful weapon, strong and dependable,
[201]
is random assignment of group membership.
[204]
Imagine a secret experiment in which applicants
[207]
to public and private colleges
[209]
are randomly assigned to attend one or the other.
[212]
Seems only fair.
[214]
And maybe we'll learn something from this too.
[217]
In the interest of science, I have proposed such an experiment
[220]
at MIT where I teach econometrics.
[223]
I'd like to replace our skilled but well-paid admissions officers
[226]
with a coin toss.
[228]
Random assignment of college admission ensures
[231]
that when it comes to cross school comparisons,
[233]
ceteris is paribus on average.
[236]
Unfortunately, for science, I have not yet convinced MIT admissions
[241]
to replace its staff with a stack of pennies.
[244]
- As we'll discuss later, random assignment is often impossible
[248]
or impractical so we must look for practical
[251]
and inexpensive strategies that have the same
[254]
ceteris paribus inducing power as random assignment.
[259]
Kamal, where should we look?
[261]
- [Kamal] I don't know.
[262]
If we could somehow control for...
[264]
- Correct.
[265]
- [whisper] What?
[266]
- Metrics masters are control freaks.
[268]
We implement statistical strategies
[271]
that make the groups choosing different paths as similar as possible.
[275]
Rather than simply comparing wages of public and private alums,
[278]
we look within sets of alumni
[280]
that have similar ability and background.
[283]
Within these sets, we make public/private comparisons
[286]
but not across them.
[288]
This strategy moves us one giant step closer to ceteris paribus
[293]
and apples to apples comparisons.
[295]
Let's look again at the Furious Five.
[297]
- [whisper] Furious Five.
[298]
- [Josh] Our principal tool in the struggle
[300]
for control is regression.
[303]
Regression is a neat way to compare two groups
[306]
while simultaneously holding many differences
[308]
between those in the groups fixed.
[311]
- Do regression estimates show that private university education
[313]
is worth paying for?
[315]
- [Man] Good question.
[316]
- [Narrator] Using regression to adjust for applicant ability
[318]
and family background and a few demographic characteristics
[322]
like race and sex reduces the private college premium
[325]
from 14% to 9%.
[328]
- Nine percent still seems pretty good.
[330]
- [Man] Nine is legit.
[331]
- [Narrator] But do we have true ceteris paribus?
[334]
Camilla?
[335]
- [Camilla] Um... I'm not sure we controlled for everything,
[338]
Maybe private school students are more ambitious or smarter
[342]
in ways not fully captured by test scores.
[344]
If so, comparisons are not apples to apples
[347]
even after the adjustments you speak of.
[349]
- Worrying indeed.
[352]
The possibility that the variables we've adjusted for
[355]
using regression do not fully account for group differences
[358]
is called omitted variables bias or OVB.
[362]
- [man] That's bad.
[363]
- [Narrator] OVB is selection bias in a regression context.
[367]
- We suffer the effects of OVB when the regression we've got
[371]
is not the one we want.
[372]
The regression we want, the regression of our dreams,
[376]
has more and better controls than the one we've got.
[379]
How can we control for something like ambition?
[383]
Is there an ambition index?
[386]
It's not easy to produce ceteris paribus comparisons.
[390]
Regression is a tool.
[391]
It's not magic.
[393]
Yet sometimes the results unearthed by this tool are striking.
[397]
Masters of Econometrics professor Stacy Dale
[400]
and Alan Krueger faced the challenge of selection bias and OVB.
[405]
In a famous academic study, Dale and Krueger controlled
[408]
for the many possible differences between students who've attended
[411]
different types of schools.
[413]
They had the insight that selection bias in this context originates
[417]
in two forces: student ambition and college opportunity.
[422]
Most students have a pretty good sense of their own aptitude,
[425]
inclination, and motivation for school work.
[428]
These forces are summarized by the type of schools they apply to.
[433]
At the same time, college admissions offices invest
[436]
massive hours and energy into ascertaining
[439]
who will succeed on campus.
[441]
They evaluate and select for academic ability and college commitment.
[447]
What if we compare the outcomes
[448]
of those who had the exact same acceptances and rejections?
[452]
Compare two high school students, Maya and Mariana.
[456]
Both admitted to UNC and Duke but not to Yale.
[460]
Similarly ambitious and judged similarly capable
[463]
by these three schools' admissions offices,
[466]
Maya opts for Duke because a friend is going there,
[469]
while Mariana heads to UNC in Chapel Hill.
[472]
- [woman] Come on, Heels!
[473]
- [Josh] Maya and Mariana are not clones of course,
[476]
and they've chosen different types of schools for personal reasons.
[479]
But otherwise, they have a lot in common.
[482]
The personal factors that drive them to choose between the schools
[485]
on their menu might not be closely related
[488]
to their future earning power.
[490]
Pooling many such comparisons moves us one giant step closer
[494]
to ceteris paribus.
[496]
Remarkably, a regression model that controls for the sets of schools
[500]
to which applicants have applied and been admitted
[503]
shows almost no difference in earnings between public
[506]
and private graduates.
[508]
In other words, averaging over a large number
[511]
of Maya to Mariana type comparisons,
[514]
the private school premium falls to nothing.
[518]
Maya might have enjoyed her expensive Duke education,
[522]
but on average at least students like her will do no better
[526]
in the labor market than comparable public school peers.
[530]
That's quite a change from our initial 14% earnings gap
[534]
favoring elite school alumni.
[537]
Regression has the power to turn a clouded statistical night
[540]
into a clear causal day.
[543]
But you'll need to know a little more before you can regress
[546]
with skill and confidence.
[550]
- [Narrator] You're on your way to mastering econometrics.
[554]
Make sure this video sticks by taking a few quick practice questions.
[558]
Or, if you're ready, click for the next video.
[562]
You can also check out MRU's website
[564]
for more courses, teacher resources, and more.
[568]
♪ [music] ♪