馃攳
Adjusted R Squared - YouTube
Channel: Prof. Essa
[0]
This is Stephanie from StatisticsHowTo.com
and in this video I'll be talking about Adjusted
[6]
R Squared.
In the previous video on the coefficient of
[10]
determination that was r squared.
I talked about how variables can be related
[14]
to each other.
And how r squared can give us an idea how
[18]
well data points fit our line.
Let say, I鈥檓 looking at the price of pizzas.
[22]
So I have my price on the Y-axis, and on the X.
[28]
Let's say, I think the price is mostly due to
the price of dough and toppings.
[38]
Now as the price of dough and toppings increase,
the price in pizza is also likely to increase.
[44]
And I might get an r squared.
That's pretty high, maybe 89%.
[50]
89% of the price of pizza is due to the price
of dough and toppings.
[57]
Now let's say I take this one step further and I think that as well as the price of dough
[64]
and toppings.
Let's say I think that whether preparers are
[68]
male or female.
Now I鈥檓 going to add another axis here,
[71]
I鈥檓 going to say this is my Z-axis.
So if could think of this as a 3D model.
[78]
And on the Z-axis, I鈥檓 going to suggest
that male or female pizza tossers.
[86]
That also has something to do with the price
of pizza.
[88]
As I add more and more of these independent variables,
r squared will always go up.
[94]
So when I add male or female pizza tossers
r squared might go up to say 90%.
[102]
Our r squared never decreases.
So you might be looking at this and thinking,
[107]
oh this is obviously the better model because
r squared has gone up to 90%.
[112]
But this is complete rubbish our male or female
pizza tossers have nothing to do with the
[117]
price of pizza.
But the more of these independent variables
[120]
I add, the higher r squared.
So this becomes a problem.
[125]
We can fix this problem by using adjusted
r squared.
[129]
If you run regression analysis your r squared
might be 89% for the price of dough and toppings.
[135]
Adjusted r squared is always going to be lower.
And let say my adjusted r squared was 80%.
[143]
Now when we run the adjusted r squared for
the extra variable which is the male or the
[147]
female pizza tossers which has nothing to
do with the price of pizza.
[152]
Your adjusted r squared,
We'll take into account that the extra useless
[156]
variable and it will reduce the figure a little
bit.
[159]
Let's say, it reduces it to 72%.
So based on the adjusted r squared, you would
[165]
choose the first model.
And you would really discount male or female
[169]
pizza tossers.
This becomes important because you can keep
[172]
on adding these independent variables.
You can add the price of a dough mixer,
[177]
the age of the customer that comes to the
door.
[179]
You can keep on adding variables and r squared
will always increase.
[183]
The adjusted r squared will take into account
what kinds of variables you are adding to the
[188]
model.
The adjusted r squared formula looks a little
[191]
complicated.
But if you already have r squared the coefficient
[194]
of determination, then the formula is pretty
simple.
[197]
n is the number of items in your data set.
And this k those are those independent variables
[204]
we were talking about.
For the price of pizza, being a result of
[208]
the price of dough and toppings, and the sex
of the worker, these are your independent
[213]
variables and this is k.
And hopefully you can see that as k increases,
[219]
as k goes up,
adjusted r squared will always fall.
[224]
Unless, there's a significant increase in
the r squared value.
[229]
So basically what this formula does is the
more decent independent variables you add
[234]
that is independent variables that actually
fit the model and cause r squared to increase.
[239]
As long as you keep on adding good data,
adjusted r squared will increase.
[243]
But if you add a complete rubbish to k and
k increases but r squared does not, the adjusted
[250]
r squared will fall.
Most of the time you don't really need to
[254]
calculate adjusted r squared by hand,
Microsoft Excel and most other statistical
[259]
packages will do the work for you.
That鈥檚 the general overview of adjusted
[263]
r squared.
Check us out at StatisticsHowTo.com for more
[266]
videos and articles on everything Elementary
Statistics.
Most Recent Videos:
You can go back to the homepage right here: Homepage





