馃攳
The Central Limit Theorem, Clearly Explained!!! - YouTube
Channel: StatQuest with Josh Starmer
[0]
even if you're not normal
[3]
the average
[5]
is normal
[8]
hello I'm Josh starmer and welcome to
[11]
stat Quest today we're going to talk
[13]
about the central limit theorem and it's
[15]
going to be clearly explained
[18]
note for this stat quest to make any
[21]
sense at all you should be familiar with
[23]
the normal distribution if not check out
[26]
the normal distribution clearly
[28]
explained
[29]
it would also be helpful if you were
[31]
familiar with the concept of sampling
[33]
from a statistical distribution if not
[36]
check out sampling from a statistical
[39]
distribution clearly explained
[42]
the central limit theorem is the basis
[45]
for a lot of statistics and the good
[47]
news is that it's a pretty simple
[49]
concept
[51]
in this stat Quest I'll explain what the
[54]
central limit theorem is and why it's
[56]
important
[58]
like most things in statistics I think
[61]
the central limit theorem is easiest to
[63]
understand if we look at some examples
[66]
so let's start with a uniform
[68]
distribution
[70]
this one goes from zero to one
[74]
it's called The Uniform distribution
[76]
because there is an equal probability of
[79]
selecting values between 0 and 1.
[82]
the probabilities are all equal and thus
[85]
are uniform
[88]
we can collect 20 random samples from
[90]
this uniform distribution
[93]
and then calculate the mean of the
[95]
samples
[97]
and on the right we can draw a histogram
[100]
of the mean value
[103]
since we only have one mean value the
[105]
histogram isn't very interesting
[108]
but after we collect 10 more samples and
[112]
collect 10 more means
[115]
the histogram starts to look a little
[117]
more interesting
[119]
here's the histogram after collecting 20
[122]
samples and calculating 20 means
[126]
30 mains
[128]
40 means
[130]
50 means
[132]
60 means
[133]
70 means 80 means 90 means and 100 means
[141]
after adding 100 means to the histogram
[144]
it's pretty easy to see that these means
[147]
are normally distributed
[150]
however to make it easy to see that the
[153]
means are normally distributed we can
[156]
overlay a normal distribution
[159]
you might have noticed that in the last
[161]
two slides I put means are normally
[164]
distributed in bold I did this because
[167]
this is what the central limit theorem
[169]
is all about
[172]
even though these means were calculated
[174]
using data from a uniform distribution
[178]
the means themselves are not uniformly
[181]
distributed instead the means are
[184]
normally distributed
[186]
bam
[188]
here's another example
[191]
this time we'll start with an
[193]
exponential distribution
[195]
just like before we can collect 20
[198]
random samples from this exponential
[200]
distribution
[202]
and just like before we can calculate
[205]
the mean of the 20 samples
[208]
and lastly we can draw a histogram of
[211]
that mean over here on the right
[214]
after we collect 10 samples and
[217]
Calculate 10 means the histogram starts
[220]
to look a little more interesting
[223]
here's the histogram after 20 means
[226]
30 means 40 means 50 means 60 means 70
[232]
means 80 means 90 means and 100 means
[239]
after adding 100 means to the histogram
[242]
we can see that they are normally
[244]
distributed
[246]
even though these means were calculated
[249]
using data from an exponential
[251]
distribution
[252]
the means themselves are not
[254]
exponentially distributed instead the
[258]
means are normally distributed
[261]
[Music]
[263]
so far we have seen that the means
[266]
calculated from samples taken from a
[268]
uniform distribution
[270]
are normally distributed
[273]
and means calculated from samples taken
[276]
from an exponential distribution
[279]
are also normally distributed
[282]
well it turns out that it doesn't matter
[285]
what distribution you start with
[288]
if you collect samples from those
[290]
distributions
[292]
the means will be normally distributed
[296]
yes there's a little asterisk here that
[299]
means there's some fine print that will
[301]
come later for now just know it's really
[304]
fine print and not worth spending too
[306]
much time worrying about
[309]
double bam
[312]
cool but what are the practical
[314]
implications of knowing that the means
[316]
are normally distributed
[319]
when we do an experiment we don't always
[321]
know what distribution our data comes
[324]
from
[326]
to this the central limit theorem says
[329]
who cares
[332]
the sample means will be normally
[334]
distributed
[336]
because we know that the sample means
[338]
are normally distributed
[341]
we don't need to worry too much about
[343]
the distribution that the samples came
[345]
from
[347]
we can use the means normal distribution
[349]
to make confidence intervals
[352]
do t-tests where we ask if there's a
[355]
difference between the means from two
[357]
samples
[358]
and Anova where we ask if there is a
[361]
difference among the means from three or
[363]
more samples
[364]
in pretty much any statistical test that
[367]
uses the sample mean
[370]
triple bam
[372]
note out there in the wild some folks
[376]
say that in order for the central limit
[378]
theorem to be true the sample size must
[381]
be at least 30.
[383]
this is just a rule of thumb and
[385]
generally considered safe however as you
[389]
can see in the examples here where I use
[391]
a sample size of 20 the rule was meant
[394]
to be broken
[396]
here's the fine print
[398]
in order for the central limit theorem
[400]
to work at all you have to be able to
[403]
calculate a mean from your sample
[406]
off the top of my head I can think of
[409]
only one distribution the Koshi
[411]
distribution that doesn't have a sample
[413]
mean and after doing biostatistics for
[416]
20 years I've never come across it in
[419]
practice
[420]
that said if you know of distributions
[423]
that don't have means put them in the
[425]
comments below and tell us what they're
[427]
used for I'm curious about how common
[429]
this occurs
[432]
hooray we've made it to the end of
[434]
another exciting stat Quest if you like
[437]
this stat Quest and want to see more of
[438]
them please subscribe and if you want to
[441]
support stack Quest well consider buying
[444]
one or two of my original songs alright
[446]
until next time Quest on
Most Recent Videos:
You can go back to the homepage right here: Homepage





