What does P-Value mean in Regression? - YouTube

Channel: Bhavesh Bhatt

[1]
Ever had difficulty in understanding what exactly is p-value.
[5]
this tutorial gives you
[6]
a brief idea on how you can use p-values in your hypothesis testing.
[12]
One fine day when
[13]
I was working on a regression problem, I came across such a screen and got a bit confused
[18]
as to which features a significant based on the p-values which are denoted on the rightmost
[23]
column, so I decided to make this video so that none of you are ever confused because
[28]
of p-value.
[31]
Let's understand first what a NULL hypothesis is.
[37]
NULL hypothesis is basically
[38]
states that nothing new is happening and the old theory is true for regression it simply
[43]
means that there is no relation between the dependent and independent variable and the
[48]
coefficients of independent variable denoting the dependent variable are zero.
[55]
Now what is
[56]
p value mean p value is basically used in hypothesis testing to help you support or
[61]
reject the null hypothesis. p-value is an evidence against your null hypothesis smaller the value
[68]
stronger the evidence to reject the null hypothesis.
[73]
A simple example to give you an idea of how
[78]
null hypothesis and p value work is say, for example a pizza place claims that their
[84]
delivery times are 30 minutes or less on an average but I think it's more than that so
[89]
what do I do?I conduct a hypothesis test.
[92]
My null hypothesis is that the mean delivery
[95]
time is 30 minutes and my alternate hypothesis which I want it to be true is that the mean
[101]
time is greater than 30 minutes. I randomly sample some delivery times and run the data
[107]
through the hypothesis test and my p value turns out to be 0.001.
[115]
In simple terms that
[117]
there is a probability of 0.001 that you will mistakenly reject the pizza places claim
[123]
that they're delivery time is less than or equal to 30 minutes since typically we are
[128]
willing to reject the null hypothesis when this probability is less than 0.05 you conclude
[134]
that the pizza placed is actually wrong.
[138]
So if your p value is less than 0.05 reject the
[142]
null hypothesis if it's greater than 0.05 do not reject the null hypothesis.
[149]
Now coming
[149]
back to our problem. I have five variables age, time, last contact age, total call count
[157]
success call counter and all of this and I have a p value associated with each variable which
[162]
are my significant variables in which are not my significant variables is what my p
[167]
value is going to signify.The p value for each term test for the null hypothesis that
[174]
the coefficient is equal to 0 no effect so a low p value which is less than 0.05 indicates that you
[182]
can reject the null hypothesis in other words a predator that has a low p value is likely
[187]
to be a meaningful addition to your model because the changes in the predictor's value are related to the changes in response
[194]
variable. Conversely a larger p value suggest that changes in the predictor are not associated
[202]
with the changes in the response.
[206]
So in our case the p value for the first three variables
[210]
are less than 0.05 so I reject the null hypothesis meaning that the weights or the coefficients
[217]
are not 0 and except the alternate hypothesis, which is their standard coefficients which
[224]
are given after the regression has been computed and in simple terms the first three terms
[229]
having p values less than 0.05 contribute to the predictor that is my final output variable.
[239]
Thank you for watching this video. I hope that you found this video informative. Do subscribe
[245]
to the channel on the link given below. Thank you.