Average or Central Tendency: Arithmetic Mean, Median, and Mode - YouTube

Channel: Khan Academy

[0]
Let's say I've got a set of numbers.
[2]
2, say I've got three 3's, I've got a couple of 4's, and
[8]
I've got a 10 there.
[10]
And what we want to do is find the middle of these numbers.
[14]
We want to represent these numbers with the center of the
[17]
numbers, or the middle of the numbers, just so we have a
[19]
sense of where these numbers roughly are.
[24]
And this central tendency that we're going to try to get out
[27]
of these numbers, we're going to call the average.
[31]
The average of this set of numbers.
[34]
And you've, I'm sure, heard the word average before, but
[38]
we're going to get a little bit more detailed on the
[41]
different types of averages in this video.
[44]
The one you're probably most familiar with, although you
[46]
might have not seen it referred to in this way, is
[49]
the arithmetic mean, which literally says, look, I, the
[58]
arithmetic mean of this set of numbers, is literally the sum
[61]
of all of these numbers divided by the number of
[63]
numbers there are.
[64]
So the arithmetic mean for this set right here is going
[67]
to be 2 plus 3 plus 3 plus 3 plus 4 plus 4 plus 10, all of
[77]
that over, how many numbers do I have?
[79]
1, 2, 3, 4, 5, 6, 7.
[82]
All of that over 7.
[84]
And what is this equal to?
[85]
This is 2 plus 9, which is 11, plus 8, which is 19, plus 10,
[94]
which is 29.
[95]
So this is going to be equal to 29/7, or you could say it's
[101]
equal to 4 and 1/7.
[103]
If I got my calculator out, we could figure out
[105]
the decimal of this.
[106]
But this is a representation of the central tendency, or
[110]
the middle of these numbers.
[112]
And it kind of makes sense.
[113]
4 and 1/7, it's a little bit higher than 4.
[117]
We're kind of close to the middle of our number range
[121]
right there.
[121]
And you might say, well, it's a little skewed to the right
[123]
and what caused that?
[124]
And well, gee, 10 is a little bit larger than all of the
[126]
other numbers.
[127]
It's kind of an outlier.
[129]
Maybe that skewed this average up, the arithmetic mean.
[133]
So there are other types of averages, although this is the
[136]
one that, if people just say, hey, let's take the average of
[139]
these numbers, and they don't really tell you more, they're
[141]
probably talking about the arithmetic mean.
[143]
The other forms of average, though, are the median, and
[150]
this literally is the middle number.
[155]
If there are two middle numbers, you actually take the
[158]
arithmetic mean of those two middle numbers.
[160]
You actually find the number halfway in between those two
[163]
middle numbers.
[164]
So the median of this set right here-- let me just
[167]
rewrite them.
[168]
So I have a 2, a 3, 3, 3, 4, 4, 10.
[177]
So, let's see, we have seven numbers right here.
[179]
The middle number, if I go 1, 2, 3, to the
[182]
right, we're there.
[184]
If we go 1, 2, 3 to the left, we're there.
[186]
The middle number is that 3 right there.
[189]
I just listed them in order, and I said, well, look, 3, you
[193]
could think of it as the fourth number from the right,
[196]
and it's also the fourth number from the left.
[199]
3 is the middle number.
[201]
And this case, it is the median.
[203]
So in this case, 3, if you use the median, is our average.
[209]
And that also makes sense.
[210]
I mean, it's literally the middle number, and if you look
[212]
at this set of numbers, it kind of does represent the
[215]
central tendency of this set.
[217]
Now just to be clear, it was very clear what the middle
[220]
number was, because I had an odd number of numbers.
[225]
I had three on each side of the three, so it was very easy
[228]
to figure out the median, the middle number.
[230]
But if I had a situation-- let's say I have the situation
[233]
where I have 2, 3, 4, and 5.
[237]
Let's say that's my set of numbers.
[241]
Well, here, there is no one middle number.
[244]
The 3 is closer to the left than it is to the right.
[247]
The 4 is closer to the right than it is to the left.
[250]
There's actually two middle numbers here.
[252]
The two middle numbers here are the 3 and the 4.
[257]
And here, when you have two middle numbers, which occurs
[260]
when you have an even number in your data set, there the
[264]
median is halfway in between these two numbers.
[267]
So in this situation, the median is going to be 3 plus 4
[271]
over 2, which is equal to 3.5.
[275]
And if you look at this data set, that's not what our
[277]
original problem was, but if you look at this data set
[284]
right there, you're actually going to find that the
[287]
arithmetic mean and the median here is the exact same thing.
[292]
Let's calculate it.
[293]
What 's the arithmetic mean over here?
[295]
It's going to be 2 plus 3 plus 4 plus 5, which is what?
[300]
5 plus 9, which is equal to 14, over 4.
[309]
And what's this equal to?
[310]
14/4 is 3 and 2/4, or 3 and 1/2, the exact same thing.
[316]
So for this data set, they were the same thing.
[319]
For this data set, our median is a little bit lower.
[324]
It's 3, while our arithmetic mean is 4 and 1/7.
[328]
And I really want you to think about why that is.
[330]
And it has a lot to do with this 10 that sits out there.
[332]
All of these other numbers are pretty close to whichever
[337]
average you want to pick, whether it's the arithmetic
[339]
mean or it's the median.
[342]
But this 10 is kind of an outlier, or it
[347]
skews the data set.
[350]
Maybe it's so much larger than the other numbers, that it
[353]
makes the arithmetic mean seem larger than maybe is
[357]
representative of this data set.
[359]
And that's something important to think about.
[361]
When you're finding the average for something, most
[367]
people will immediately go to the arithmetic mean.
[369]
But in a lot of cases, median will make a lot more sense, if
[373]
you have these really large or really small numbers that
[376]
could skew the data set.
[378]
I mean, you can imagine, if this wasn't a 10-- or let's
[381]
imagine adding another number here.
[382]
If I added the number 1 million, if I added 1 million
[387]
to this data set, if that was the eighth number, the
[390]
arithmetic mean is going to be this huge number.
[392]
It's going to be much larger than what is representative of
[396]
most of the numbers in this data set.
[398]
But the median is still going to work.
[400]
The median is still going to be about 3 and a half, right?
[403]
If you had 1 million here, it would be 1, 2, 3, 4.
[407]
The middle two numbers would be that.
[409]
It would be 3 and 1/2.
[410]
So the median is less sensitive to one or two
[413]
numbers at the extremes that otherwise would skew the mean.
[418]
Now, the last form of average I want to talk
[421]
about is the mode.
[425]
It has nothing to do with ice cream.
[428]
The mode is literally the most frequent number.
[436]
And in this data set, it's pretty clear what the most
[439]
frequent number is.
[441]
I only have one 2, I have three 3's, I have two 4's, I
[445]
have one 10, and even if want to include the million, I only
[448]
have one million there.
[449]
So here, the number that occurs most
[451]
frequently is the 3.
[455]
So, once again, the mode seems like a pretty good measure of
[458]
central tendency or a pretty good average
[461]
for this data set.
[463]
Now the mode, it's a little tricky to deal with, and you
[465]
won't see it used that often, because it becomes a little
[469]
ambiguous when-- you know, look at this data set:
[472]
2, 3, 4, and 5.
[475]
What is the mode there?
[476]
All of these numbers are equally frequent.
[479]
So if you have a situation like this, then you might
[481]
just-- the mode really loses its meaning.
[483]
It might force you anyway to take the median or the
[486]
mean in some form.
[487]
But if you really do have numbers that one shows up a
[490]
lot more than the other, then the mode starts to make sense.
[494]
So, hopefully, this has given you a pretty good overview of
[498]
how to represent the central tendency of a data set.
[505]
Very fancy word.
[506]
But it's just saying, look, we're trying to represent with
[508]
one number all of this data.
[510]
And you might say, hey, why do we even worry about that?
[511]
It only has seven numbers here or eight numbers here.
[514]
But you can imagine if you had 7 million numbers or 7 billion
[516]
numbers, and you don't want to show someone all of that data.
[519]
You just want to give someone a sense of what those numbers
[522]
are on average.
[525]
And as we said, the arithmetic mean is what I see being used
[529]
the most. But in situations where you might have numbers
[533]
that would skew the arithmetic mean, because they're so large
[535]
or they're so small, the median might
[538]
make a lot of sense.