馃攳
Statistical questions | Data and statistics | 6th grade | Khan Academy - YouTube
Channel: Khan Academy
[0]
- [Voiceover] As we start
exploring the world of statistics,
[4]
it's worth asking ourself,
[6]
what is the word statistics even mean?
[9]
Statistics is really a
broad category of things
[12]
that you might do with data.
[14]
So it generally deals with
data, collecting data.
[18]
So actually let me write these down.
[19]
It's involving collecting data,
[22]
collecting data.
[23]
You could present data
in tables or charts,
[26]
or just as lists of numbers,
[28]
or however you might do it.
[30]
It is analyzing the data,
[32]
analyzing, analyzing,
[36]
presenting and analyzing data.
[37]
So this whole class of just all this stuff
[40]
that you might do with
data to answer a question
[43]
or try to figure out what's going on,
[45]
or just to learn about the world,
[47]
the whole class of things
is called statistics.
[50]
Now an idea that will
come up very frequently
[53]
in statistics is the
notion of variability.
[57]
Variability.
[59]
In everyday language, variability,
[61]
it's how much something is ...
[64]
How much does it vary?
[65]
How much does it change?
[67]
It's the same notion in statistics.
[69]
In statistics, variability
is the degree to which
[72]
data points are different from each other,
[74]
the degree to which they vary.
[76]
Just as an example of that
[78]
to just make it a little
bit more concrete,
[80]
let's say you were to go to five people,
[82]
and you were to ask them,
[84]
how many bricks did you eat yesterday?
[88]
Each of the people say, well I ...
[90]
Person one says, "I
don't eat bricks at all.
[92]
"I don't even know how to do that.
[93]
"I ate zero bricks."
[94]
Then next person says zero,
[96]
the next person says zero,
[97]
fourth person says zero,
[98]
and the fifth person says zero.
[100]
Fair enough, so that was our data point
[102]
on the different data points on ...
[103]
And I'm already doing statistics
[105]
just by going out there and asking them
[106]
how many bricks they ate.
[108]
Then I ask them how many grapes
[111]
did you eat yesterday?
[113]
The first person says "I ate zero grapes."
[116]
But the next person says
"I survive on grapes.
[118]
"I ate 235 grapes."
[120]
The next person says, "Yeah I like grapes.
[122]
"I ate 17 grapes."
[124]
Then the person after that
says that they five grapes.
[127]
Then next person also survives on grapes,
[130]
even to a larger degree.
[131]
They ate 318 grapes.
[134]
So if you look at these two data sets,
[136]
one is the number of bricks
someone ate yesterday,
[138]
the other one is how many
grapes they ate yesterday,
[140]
you immediately see that
there's more variability here.
[142]
All of these data points are zero,
[144]
while these, they change a good bit
[146]
from data point to data point.
[148]
So we have a sense that
there is more variability
[151]
in this data set.
[153]
Now one of the things we will start doing
[155]
a lot in statistics is try to measure
[158]
how much more, how much variability is.
[160]
How can we can quantify that?
[161]
How can we put a number on it?
[162]
How can we measure variability?
[164]
This is a big aspect of statistics,
[167]
but we won't do that in this video.
[168]
There are future videos for doing that.
[170]
But just as we go into
the world of statistics,
[173]
we should think about
when should our brain
[177]
even start getting into statistics mode,
[179]
thinking about the tools
that we have at our disposal,
[182]
about collecting data and
measuring variability,
[185]
and measuring and finding numbers
[187]
that somehow represent a pool
of data that has variability.
[191]
So the question we should ask ourselves is
[194]
what questions in the world
are statistical questions?
[197]
So statistical,
[200]
statistical questions.
[204]
So let's come up with a
[205]
definition for statistical questions,
[206]
the type of question
where we would want to
[208]
start bringing out our
statistical toolkit.
[213]
One possible way to think about
[214]
when you need to bring out
your statistical toolkit
[216]
is these are questions
that to answer them,
[219]
to answer,
[222]
you need to collect data with variability.
[226]
To answer, you need
[228]
to collect
[231]
data with variability.
[235]
I apologize for my handwriting.
[238]
Data with variability.
[240]
That's W-I-T-H.
[242]
Data with
[246]
variability.
[248]
So you're saying, okay
that kinda makes sense,
[250]
but I need to see some tangible questions
[252]
or tangible examples of things
[254]
that are statistical questions
[256]
and things that are not
statistical questions.
[258]
I would say fair enough.
[260]
Let's look at some examples.
[263]
So here I have six questions,
[265]
and I encourage you to
pause this video right now.
[268]
Before I work through it, think about it.
[270]
Based on this definition
of a statistical question,
[272]
which of these questions are statistical,
[274]
would require your statistical toolkit,
[277]
and which of these are not statistical?
[281]
So assuming you had a go at it,
[282]
let's go through these one by one.
[284]
So the first question,
[285]
how much does my pet grapefruit weigh?
[288]
You know, it's bizarre to begin with
[289]
to have a pet grapefruit,
[290]
but is this a statistical question?
[294]
What do I need to do to answer it?
[296]
I have to take my pet grapefruit out.
[297]
I have to weigh it.
[299]
Then I have to just write that down.
[300]
Just doing that I am collecting data,
[303]
so you could argue that
maybe I'm kinda starting
[306]
to mess with statistics a little bit,
[307]
but I'm just getting one data point.
[309]
So I might weigh it and I might see
[311]
my grapefruit weighs one pound,
[314]
but that's not data with variability.
[317]
That's just one data point.
[318]
In order to have
variability you have to have
[319]
multiple data points
[321]
and should be at least
possible that they could vary.
[324]
So, for example, all of
these folks ate zero bricks
[327]
but maybe it was possible that
someone actually ate a brick.
[330]
But here I have just one data point.
[332]
With one data point, you
can't have variability,
[334]
so this is not a statistical question.
[337]
I just collect a data point.
[338]
Next question, what is the average number
[341]
of cars in a parking
lot on Monday mornings?
[344]
To think about whether it
is a statistical question,
[346]
we just have to think about
[347]
what do I have to do to
answer that question?
[349]
I would have to go out to the parking lot
[351]
on multiple Monday mornings,
[353]
and measure the number of cars.
[355]
So on the first Monday morning
[356]
I might see there are 50 cars.
[358]
The next Monday morning
I might go out there
[360]
and count there's 49 cars.
[362]
The next Monday morning I
might see 50 cars again.
[364]
The next Monday morning
I might see 63 cars.
[368]
So I'm collecting multiple data points
[370]
to answer this question.
[371]
Then I'm going to take
the average of all these,
[373]
but I'm collecting multiple data points
[375]
to answer this question.
[379]
It's definitely possible that
there could be variation here,
[381]
that there could be variability,
[383]
so this is a statistical question.
[386]
Next question, am I hungry?
[389]
It's an important question.
[390]
We ask it to ourselves multiple times.
[392]
In fact, sometimes our
bodies just tell it to us.
[396]
But I am definitely not collecting ...
[397]
I guess you could say I'm collecting
[398]
some type of feelings from my stomach
[401]
or how weak I feel or not,
[403]
but it's definitely not
data with variability.
[405]
I'm either hungry or not
hungry on a given day.
[408]
I mean if you said broader,
[410]
how does my hunger change from day to day
[412]
and you came up some type of a
scale for rating your hunger,
[415]
all right maybe that's more statistical.
[417]
But just am I hungry, a yes-no question.
[420]
This is not ...
[422]
To answer this I do not have to
[423]
collect data with variability,
[425]
so this is not a statistical question.
[428]
How many teeth does my mother have?
[430]
To do this I would have
to go find my mother,
[432]
and then I would have to
ask her to open her mouth,
[435]
and count the teeth in her mouth.
[436]
Maybe I'd get a number like 30.
[438]
So it's kind of like how much
does my pet grapefruit weigh.
[441]
I do have to collect one data point,
[445]
but one data point is not
going to have variability,
[448]
so I am not collecting
data with variability,
[450]
so this is not a statistical question.
[452]
If I said how many teeth
do all of the mothers
[455]
that I know have on average,
[457]
or what's the range of number of teeth
[459]
of the mothers I know have,
[460]
that would be statistical.
[463]
But this is just one data
point, so not statistical.
[465]
How much time do the members of my family
[468]
spend eating per year?
[472]
Once again, what do I need to
do to answer this question?
[475]
I would have to go either observe
[478]
or survey my family members,
[480]
maybe my mom, my wife, my children,
[483]
and my uncles, aunts, whoever else,
[485]
and I would say how much
do you eat each day?
[487]
I would add them all up to figure out
[488]
how much they eat in a year.
[490]
Maybe family member A
[492]
eats 813 hours in a year.
[495]
Family member B ate, I don't know,
[498]
732 hours in the year.
[502]
So you see the general notion
that I will be collecting
[505]
multiple data points from
the different family members.
[508]
There very well, and in fact,
[510]
there's very likely to
be variation in that.
[512]
In fact, I might even see
variation from year to year.
[515]
Person A is probably going to eat
[516]
a different number of
hours in the next year.
[518]
So I'm definitely going to
collect data with variability
[521]
in order to answer this question.
[523]
So that is a statistical question.
[525]
Then finally, I have the question,
[527]
how many times have I watched Star Wars?
[530]
This is very similar to how
many teeth does my mother have,
[533]
or how much does my pet grapefruit weigh.
[535]
I just have to count the number of times
[537]
that I watched Star Wars.
[538]
Maybe I watched it seven times.
[541]
Just one data point.
[542]
No variability here.
[543]
If I said on average how many times
[546]
have my co-workers watched Star Wars,
[547]
then I'm gonna have to
collect data with variability.
[549]
I'm gonna collect multiple data points,
[551]
and it's definitely
possible that my co-workers
[553]
have watched it different
numbers of times.
[555]
But for this question in particular,
[557]
where it's just one
data point to answer it,
[558]
how many times have I watched Star Wars?
[560]
My answer in this case
actually I think is seven.
[563]
Then not a statistical question.
[566]
So hopefully that gives you a sense
[567]
of statistics variability
[569]
and what a statistical question even is.
Most Recent Videos:
You can go back to the homepage right here: Homepage





