馃攳
Standard Deviation - YouTube
Channel: Bozeman Science
[4]
hi it's mr. Andersen and in this video
[6]
I'm going to talk about standard
[8]
deviation when you're collecting data in
[10]
a science lab the amount of data you
[12]
collect is important so is the average
[14]
but another important statistic is going
[16]
to be the standard deviation of your
[18]
sample and so in this video I'm going to
[21]
show you what it is conceptually I'm
[23]
then going to show you how to calculate
[24]
standard deviation by hand and then
[26]
finally I'm going to show you how to
[27]
calculate it using a spreadsheet and so
[30]
first of all what is it well to
[31]
understand standard deviation you have
[33]
to understand the normal distribution
[35]
and so what does that mean well your
[37]
it's a bell-shaped curve you might think
[39]
of it like that
[39]
and so in the United States most men are
[42]
about 5 foot 9 in other words that's the
[44]
average right here that's the mean or in
[48]
statistics that's the x-bar but there's
[51]
going to be a lot of men who obviously
[52]
are taller than that and a lot who are
[54]
shorter than that and so the standard
[56]
deviation is going to measure the spread
[59]
or the variation in this bell-shaped
[62]
curve and so basically if we were to go
[64]
right over here this dark area is going
[67]
to be one standard deviation above and
[69]
one standard deviation below the mean or
[72]
it's going to be below the average and
[74]
there's something cool about that about
[76]
68% of the individuals are going to be
[78]
in this area so one standard deviation
[81]
above and below but for you to look at
[83]
this for example down here is two
[85]
standard deviations and so 95% of
[89]
individuals are going to be within two
[91]
standard deviations from that mean and
[93]
then finally if we go way down here 99%
[97]
of individuals are going to be within
[99]
three standard deviations of the mean
[101]
but the standard deviation is going to
[103]
vary depending on the data that you
[105]
collect and so if we had two curves like
[107]
this so if this is one curve and then we
[111]
had another curve that looked like this
[116]
that data plotted up plotted on the same
[119]
curve this one is going to have a
[120]
smaller standard deviation than this one
[123]
they're both going to have standard
[125]
deviations obviously they're going to
[126]
have amounts where it's 68 95 and 90
[129]
percent of the people but this one down
[131]
here since it's more spread out is going
[133]
to have a higher standard deviation and
[135]
so how do we calculate that well the
[137]
equation is a little scary
[140]
the scary part it ends up being right
[142]
here so students are a little scared by
[144]
that the summation symbol but it's
[146]
actually pretty straightforward it's not
[148]
that hard to calculate the standard
[149]
deviation and so let me show you how to
[151]
do that and so first thing you want to
[153]
do is you want to have a data set so
[155]
here's going to be our data set right
[156]
here and to make this easy let's say we
[158]
just have four pieces of data one two
[161]
three four and five so you're collecting
[163]
data and this is the data in your data
[164]
table and you want to figure out the
[166]
standard deviation to that well to set
[168]
that up we're basically going to take
[170]
the square root of the summation of this
[172]
divided by the degrees of freedom so
[174]
that sounds a little bit scary and so
[176]
let's go to the scariest part to begin
[178]
with so let's look at what's going on
[179]
right here underneath that square root
[182]
and so what this is so if we go like
[184]
this the summation of x minus x-bar
[187]
squared basically means for each of
[192]
these data points that I have we're
[193]
going to have to figure out what's right
[197]
here so X minus X bar and so the first
[200]
thing we have to do is figure out what
[201]
the average is we have to figure out
[203]
what X bar is well basically if I add
[205]
one two three four five together I get
[208]
15 and if I divide but that by n which
[212]
is the total number of data points so in
[214]
this case N equals five so we have five
[217]
data points over here so if I divide 15
[220]
by five hopefully you can figure out an
[222]
average the average is going to be three
[224]
and so the mean is three or the average
[227]
is three so what we have to do is we
[229]
have to calculate this value for all
[232]
five of these data points what does that
[235]
mean well right here we're going to use
[237]
X and X for the first case is going to
[240]
be one so that's going to be one minus
[242]
three and then we're going to square
[244]
that so what is that one minus three and
[247]
we square that is going to be negative
[249]
two and if we square that so that's
[251]
negative two squared and if we square
[252]
that that's four let's go to the next
[255]
one well this is two minus three so that
[257]
stays the same so that's negative one
[259]
squared
[261]
and so that's going to be negative 1
[262]
squared or that's going to equal 1 if we
[265]
go to the next one that's easy that's 3
[267]
minus 3 squared equals 0 and if we
[270]
square 0 that's going to be 0 go to the
[274]
next one that's going to be 4 minus 3
[276]
and so that's going to be 1 squared or
[280]
equal to 1 and then finally if we go 5
[282]
minus 3 square it that's going to be 2
[285]
squared and that's going to equal 4 and
[288]
so if you ever see the summation sign
[290]
don't be scared by that it's not scary
[292]
at all it just means you got to do a lot
[294]
of work so for each of these data points
[296]
1 through 5 I had to calculate what was
[299]
in there and then I have to add it all
[301]
up so I have to add 4 plus 1 plus 1 plus
[304]
4 and if I add all those up I get 10 and
[307]
so what's going to be inside there it's
[309]
simply going to be 10 so let's figure
[311]
out the rest of my standard deviation
[312]
standard deviation is going to be the
[314]
square root in this case we solve this
[317]
as equal to 10 and then we're going to
[319]
divide that by n minus 1
[321]
so what's n that's our sample size in
[324]
this case it's 5 and so we take n minus
[326]
1 and that's going to equal 4 and so
[328]
what is our standard deviation it's the
[330]
square root of 10 divided by 4 which is
[333]
2.5 or if we take the standard deviation
[335]
of the square root of 2.5 that's going
[339]
to be something like 1.5 8 and so you're
[342]
going to have to use a calculator to
[344]
figure that out well what does that mean
[345]
if we were to plot this data as a
[347]
histogram for example this would be our
[350]
standard deviation one point five eight
[353]
and so it takes awhile to figure that
[356]
out based on doing it by hand and so if
[361]
you want to give it a try and so here's
[364]
data set over here and so try to
[366]
calculate the standard deviation using
[368]
this data set over here and try to do it
[370]
by a hand I'll put the answer down in
[373]
the description below the video but I
[375]
would give it a try so it's worth doing
[377]
once on your own and again this is going
[379]
to be our formula standard deviation and
[381]
so try to do that try to do that by hand
[384]
and so I'll wait no I won't wait for you
[387]
to do that pause the video try to do
[389]
this one and I'm going to show you how
[390]
to calculate this really really quickly
[392]
so I'm going to show you the spread
[393]
she shortcut and so how did how do you
[396]
do that in a spreadsheet it's pretty
[398]
simple so what I'm going to do is going
[399]
to take this data and I'm going to
[400]
switch over here to excel so here's the
[402]
data right here zero two four five and
[404]
seven and so I've entered my data in in
[407]
two different cells and now I'm going to
[409]
figure out the mean just to show you how
[411]
easy this is to figure out the mean I'm
[413]
going to hit it equals here and then I'm
[414]
just going to start typing so I'm going
[416]
to type in average because the
[417]
spreadsheet is not going to use the word
[418]
mean so I type in average and then I
[421]
select my data I hit a close parenthesis
[424]
I hit and and it's going to give me my
[425]
average which is going to be three point
[427]
six so if I wanted to know the average
[429]
there it is if I wanted to know the
[431]
median for example I could just type
[433]
median and I could go down like that
[435]
so spreadsheets are super simple and so
[438]
what are we looking for we're looking
[440]
for the standard deviation so how do I
[443]
do that I just hit equals I then start
[445]
typing stdev can you see how it pops up
[448]
right here standard deviation
[449]
parentheses then I'm going to select
[451]
that and then I'm going to go like that
[454]
so what's the standard deviation it's
[455]
2.7 what does that mean we had a bigger
[458]
spread in the second data set that we
[460]
did in the first set a higher standard
[461]
deviation and if you did it by hand it
[463]
should look something like that
[464]
so that's standard deviation and I hope
[467]
that's helpful
Most Recent Videos:
You can go back to the homepage right here: Homepage





