Surveys and Samples - YouTube

Channel: Khan Academy

[0]
What we have here is six examples of possible surveys
[5]
that might be conducted.
[6]
What we want to do is critique them, see whether they're
[9]
actually doing things that will give meaningful results,
[12]
or results that you can actually make some
[14]
judgment based on.
[15]
So let's read the directions.
[16]
It says, comment on the way the following samples have
[19]
been chosen.
[20]
For the unsatisfactory cases, suggest a way to improve the
[25]
sample choice.
[25]
All right.
[26]
So A, you want to find whether wealthier people have more
[33]
nutritious diets by interviewing people coming out
[36]
of a five-star restaurant.
[38]
So this is skewed on two directions.
[42]
One, if you're only interviewing people out of a
[46]
five-star restaurant, you're going to have a
[48]
disproportionate number of wealthy people.
[52]
And even the non-wealthy people who are eating at the
[55]
five-star restaurant, they feel like they have enough
[57]
money to eat there, so they're probably not that not-wealthy.
[61]
They're probably pretty close to the threshold of wealthy.
[65]
The other skew is that you're interviewing people at a
[67]
restaurant, so you're disproportionately probably
[70]
going to get people who eat out, as opposed to people who
[73]
cook for themselves.
[75]
And depending on whether this is a really healthy
[77]
restaurant, maybe some type of organic, vegan, who knows
[80]
what, restaurant, where you're going to get a
[81]
disproportionate number of healthy people.
[83]
Or maybe it's a really unhealthy restaurant.
[85]
Maybe it's they put a lot of butter and sugar in the food.
[89]
Who knows what it's all about?
[91]
But this seems skewed on two dimensions.
[94]
The best way to do it is, survey some proportion of
[98]
wealthy people in an environment where it isn't
[101]
being skewed by the type of food they eat.
[103]
So you, once again, you wouldn't want to survey people
[106]
at the organic grocery, because once again, you'd be
[108]
skewing for people who maybe eat a little healthier.
[111]
You'd want to do it maybe at some type of wealthy type of
[116]
event, or something that's not food-oriented.
[119]
I'd have to think more about what that might be.
[122]
And you'd also want to survey people who are not wealthy, so
[127]
at some other type of event where most of the population
[130]
doesn't fall under the wealthy category.
[131]
Or probably the best thing would be sampling people at an
[135]
event that doesn't discriminate on wealth, and
[138]
just surveying people and just ask them, in a broad bucket,
[140]
do you make more than x?
[142]
Whatever your definition of wealthy is.
[144]
Or is your wealth, or is your income less than x?
[147]
And then ask people what do they eat on a daily basis,
[150]
something like that might work.
[151]
So in general, I'm not a fan, not a fan.
[154]
I don't think this passes mustards.
[157]
It's biased on really two dimensions.
[161]
B, you want to find out if there is a pedestrian crossing
[165]
needed at a certain intersection by interviewing
[169]
people walking by that intersection.
[173]
So this is better than the first survey, but it still
[176]
seems a little biased.
[178]
And it depends how you define needed.
[182]
Is needed if the people walking around the pedestrian
[185]
crossing think it's needed, then this is OK.
[188]
But if you want to objectively find out whether people in
[191]
your community think it's needed, you want to have a
[193]
broader sample.
[195]
So you might want to also include drivers.
[199]
Because obviously, the pedestrian crossing is going
[202]
to affect them.
[203]
You want to interview taxpayers in general.
[206]
And obviously a lot of people might be all of the above.
[209]
And you also might be missing pedestrians who would be
[212]
walking by that intersection if there was a pedestrian
[215]
crossing, but they're avoiding it, and so those are the
[217]
people who would want it the most, but they're not even
[220]
going to be there because they don't think that it's worth
[222]
walking around there because you can't cross the street.
[224]
So you're missing a whole sample of people.
[226]
The best way I would do this is I would look within a,
[230]
maybe a three block radius, or maybe a six block radius, of
[234]
all of the homes and offices, and I would randomly survey a
[237]
sample of those people.
[239]
And I think that would be a better indication of whether
[242]
the pedestrian crossing is needed.
[244]
And actually, even a better way would be do
[246]
it without a survey.
[247]
You'd actually want to look at patterns of people walking and
[249]
all of that, and compared to other things
[251]
that might be needed.
[252]
But this one definitely doesn't work for me.
[258]
Now C, you want to find out if women talk more than men by
[262]
interviewing an equal number of men and women.
[266]
That seems reasonable.
[268]
I guess if your survey, if you do it in a non-biased
[271]
environment-- if you don't do it at, like, a business school
[274]
or a law school where people might talk disproportionately,
[277]
or it might be skewed towards women or men more.
[281]
I don't know which way it would be skewed.
[283]
But as long as you do it in a neutral place, and I guess
[285]
their survey is, maybe they'll ask some arbitrary question.
[288]
And they'll just time, or they'll count how many words
[291]
the women talk versus the number of words the men talk.
[296]
That seems like a reasonable way to see if women
[298]
talk more than men.
[298]
And once again, as long as you're not talking to some
[301]
skewed sample, or biased sample, this
[303]
seems pretty good.
[304]
And it's good if it's an equal number.
[305]
There's an equal number of men and women in the world.
[307]
This seems to be-- I'll give it a smiley face right here.
[315]
D, you want to find whether students in your school get
[320]
too much homework by interviewing a stratified
[323]
sample of students from each grade level.
[326]
Well the first thing I want to do is, what is
[328]
a stratified sample?
[332]
And all that means is, you're going to sample people in the
[334]
same proportion that they exist in the population.
[337]
So let's say that your population-- your school--
[339]
had, say they had 100 ninth graders.
[344]
It has 200 tenth graders.
[348]
300 eleventh graders.
[352]
And 400 twelfth graders.
[355]
Actually, the school I attended actually, it went in
[357]
the reverse order.
[358]
I think we had like 800 ninth graders and 400 twelfth
[362]
graders since so many people dropped out.
[364]
But anyway, we don't have to be realistic here.
[366]
So you had a total school population
[368]
of about 1,000 students.
[374]
So if you want to do a stratified sample, let's say
[376]
you wanted to have a sample size of,
[378]
let's say, 50 students.
[380]
So you wanted to sample 50 students, and you wanted it to
[383]
be a stratified sample.
[385]
What that means is, you had 100 ninth graders out of 1,000
[388]
in your school.
[389]
So 10% of your school's population is ninth graders.
[393]
So in your stratified sample, 10% also
[397]
need to be ninth graders.
[397]
So 10% of 5-- sorry-- 10% of 50 is 5.
[401]
So you'd survey 5 ninth graders.
[406]
In both situations, this is 10% of the population, it's
[415]
going to be 10% of your sample.
[417]
Likewise, you had 20% were tenth graders in the
[420]
population, so 20% of your sample.
[422]
Or 20% times 50 is 10 tenth graders should be surveyed.
[430]
And then 30% of your school is eleventh graders, so 30% of
[435]
your survey.
[436]
So that should be 15 eleventh graders.
[441]
And then finally 40% were twelfth graders.
[443]
So in this situation, 40% of your survey, which is 20,
[448]
should be twelfth graders.
[449]
So that's all-- that's what a stratified sample means.
[454]
Now let's talk about whether this is
[455]
actually a good survey.
[456]
You want to find out whether students in your school get
[458]
too much homework, and you're interviewing
[459]
students in your school.
[461]
The one thing you definitely will find out is you will find
[463]
out whether students in your school think-- think-- they
[469]
think they get too much homework by interviewing a
[472]
stratified sample.
[472]
That would be true.
[473]
That's what you would find out.
[475]
Now, whether you could say that in a broader sense
[478]
they're getting too much homework, you might want to
[480]
include a larger audience, especially the other
[484]
stakeholders in the whole homework doing process,
[487]
especially parents and teachers.
[493]
And actually a better way to do it would be, probably to
[496]
compare the students in your school to a very similar
[499]
school that has a different amount of homework, and maybe
[503]
see the results on standardized tests.
[504]
And you want to control.
[506]
You want to make sure that there isn't a huge difference
[508]
between the income levels, or the education levels of the
[511]
kids' parents, and all that.
[512]
I mean, it becomes much more involved.
[513]
But this one still seems a little bit narrow, and all
[515]
you're going to find out is whether students think they
[517]
get too much homework.
[518]
And I can almost guarantee you, they will think that they
[521]
get too much homework, and so this survey might be a little
[523]
bit of a waste of time.
[525]
So I will put a frowny face.
[527]
E, you want to find out whether there should be more
[530]
public buses running during rush hour by interviewing
[534]
people getting off the bus.
[536]
So once again, what you're going to find out is whether
[539]
people who ride, people getting off the bus think
[542]
there should be more buses.
[544]
So I won't write it here, but you're not going to find out
[547]
objectively whether there should be more buses.
[549]
Frankly, the best way to evaluate that, I think, is to
[552]
just see how full, on average, the buses are.
[555]
I would actually poll the actual buses and see how full
[558]
they are during rush hour.
[560]
If 90% of them are 100% full, or people can't get on a bus
[566]
because they're full and that's happening frequently,
[569]
and their total wait time is just getting ridiculous,
[571]
that's probably a better assessment than
[573]
just surveying people.
[574]
But even beyond that, this is skewed on several levels.
[577]
One, you are surveying only the people riding the bus.
[581]
You're not surveying the other people in the community who
[583]
might have an opinion on this: drivers, taxpayers who would
[588]
have to pay for more buses.
[589]
And then even within the bus-riding population, you
[593]
have to think about when they're getting off the bus.
[596]
If you're getting people off the bus during rush hour, I
[599]
can almost guarantee you that they're going to say, yeah,
[601]
we'd like more buses because then they would have less wait
[603]
time, and the buses would be less crowded, and they
[605]
wouldn't have to sit next to anybody.
[607]
So even if it's not needed, they're going to say it is.
[609]
But if you interview people maybe getting off the bus
[612]
during lunchtime, who don't use the bus during rush hour,
[614]
they might say no, we don't need more buses at rush hour.
[617]
We need more at lunchtime.
[618]
So you're going to get different answers depending on
[620]
when the people are getting off the bus, and obviously,
[623]
you're going to get very different answers from people
[625]
not riding the bus.
[626]
So if you want a broader audience, if you're kind of
[628]
the city councilman trying to figure out what to do, I, once
[631]
again, like the crosswalk, I would survey a random sample
[636]
of people within three miles of the bus route, or
[641]
something like that.
[641]
I think that might be a better result.
[644]
F, F-- I'll switch colors.
[647]
You want to find out whether children should be allowed to
[649]
listen to music while doing their homework by interviewing
[652]
a stratified sample of male and female
[653]
students in your school.
[655]
Once again, stratified sample just means that, OK, if my
[658]
school is 60% male, then 60% of my sample will be male.
[662]
And that doesn't really seem relevant here.
[664]
F seems very similar to the, do students think they get too
[668]
much homework?
[669]
Just as almost every student is going to say they get too
[671]
much homework, almost every student should say they should
[673]
be allowed to listen to music if they want to.
[676]
And that's not going to be-- that's not really going to
[678]
answer the question of whether children should be allowed.
[682]
If you want a broader sample, you might-- just like the
[685]
situation with the homework-- include
[689]
parents and include teachers.
[693]
But really, if you wanted to answer this question, the best
[696]
way would actually be to look at, to survey students who
[700]
listen to music while doing their homework, and students
[703]
who don't listen to music while doing their homework,
[706]
and see their relative grade performance, and that might
[710]
give you a better answer.
[712]
So once again, I think this one's a little weak.
[715]
And the bus one was also a little weak.
[718]
So the only good one here, if it's done right, is C.
[721]
The other ones, they all needed a little bit of work to
[723]
actually be surveys that would give meaningful information.