Adjusted R Squared - YouTube

Channel: Prof. Essa

[0]
This is Stephanie from StatisticsHowTo.com and in this video I'll be talking about Adjusted
[6]
R Squared. In the previous video on the coefficient of
[10]
determination that was r squared. I talked about how variables can be related
[14]
to each other. And how r squared can give us an idea how
[18]
well data points fit our line. Let say, I鈥檓 looking at the price of pizzas.
[22]
So I have my price on the Y-axis, and on the X.
[28]
Let's say, I think the price is mostly due to the price of dough and toppings.
[38]
Now as the price of dough and toppings increase, the price in pizza is also likely to increase.
[44]
And I might get an r squared. That's pretty high, maybe 89%.
[50]
89% of the price of pizza is due to the price of dough and toppings.
[57]
Now let's say I take this one step further and I think that as well as the price of dough
[64]
and toppings. Let's say I think that whether preparers are
[68]
male or female. Now I鈥檓 going to add another axis here,
[71]
I鈥檓 going to say this is my Z-axis. So if could think of this as a 3D model.
[78]
And on the Z-axis, I鈥檓 going to suggest that male or female pizza tossers.
[86]
That also has something to do with the price of pizza.
[88]
As I add more and more of these independent variables, r squared will always go up.
[94]
So when I add male or female pizza tossers r squared might go up to say 90%.
[102]
Our r squared never decreases. So you might be looking at this and thinking,
[107]
oh this is obviously the better model because r squared has gone up to 90%.
[112]
But this is complete rubbish our male or female pizza tossers have nothing to do with the
[117]
price of pizza. But the more of these independent variables
[120]
I add, the higher r squared. So this becomes a problem.
[125]
We can fix this problem by using adjusted r squared.
[129]
If you run regression analysis your r squared might be 89% for the price of dough and toppings.
[135]
Adjusted r squared is always going to be lower. And let say my adjusted r squared was 80%.
[143]
Now when we run the adjusted r squared for the extra variable which is the male or the
[147]
female pizza tossers which has nothing to do with the price of pizza.
[152]
Your adjusted r squared, We'll take into account that the extra useless
[156]
variable and it will reduce the figure a little bit.
[159]
Let's say, it reduces it to 72%. So based on the adjusted r squared, you would
[165]
choose the first model. And you would really discount male or female
[169]
pizza tossers. This becomes important because you can keep
[172]
on adding these independent variables. You can add the price of a dough mixer,
[177]
the age of the customer that comes to the door.
[179]
You can keep on adding variables and r squared will always increase.
[183]
The adjusted r squared will take into account what kinds of variables you are adding to the
[188]
model. The adjusted r squared formula looks a little
[191]
complicated. But if you already have r squared the coefficient
[194]
of determination, then the formula is pretty simple.
[197]
n is the number of items in your data set. And this k those are those independent variables
[204]
we were talking about. For the price of pizza, being a result of
[208]
the price of dough and toppings, and the sex of the worker, these are your independent
[213]
variables and this is k. And hopefully you can see that as k increases,
[219]
as k goes up, adjusted r squared will always fall.
[224]
Unless, there's a significant increase in the r squared value.
[229]
So basically what this formula does is the more decent independent variables you add
[234]
that is independent variables that actually fit the model and cause r squared to increase.
[239]
As long as you keep on adding good data, adjusted r squared will increase.
[243]
But if you add a complete rubbish to k and k increases but r squared does not, the adjusted
[250]
r squared will fall. Most of the time you don't really need to
[254]
calculate adjusted r squared by hand, Microsoft Excel and most other statistical
[259]
packages will do the work for you. That鈥檚 the general overview of adjusted
[263]
r squared. Check us out at StatisticsHowTo.com for more
[266]
videos and articles on everything Elementary Statistics.