馃攳
How to Calculate a Correlation (and P-Value) in Microsoft Excel - YouTube
Channel: unknown
[0]
In our last video, we went ahead and
calculated the correlation on these
[3]
values,
[4]
but we didn't get a significance value
for the correlation to see whether it
[9]
was statistically significant or not.
[12]
And unfortunately in Microsoft Excel, when
we run the correlation procedure through
[17]
the data analysis toolpak it does not
give us a p-value,
[21]
so we can't assess whether a given
correlation is statistically significant
[26]
or not. But thankfully there is a work-
around here and what we need to do is if
[31]
we go to regression, and select that, we
can obtain a p-value this way.
[38]
Now what we'll do is - notice there's two
boxes here - input Y range and input X
[43]
range;
[44]
for input Y range we'll go ahead and
select the exam grade values and I'm
[49]
also going to select the label or the
variable name in the first row here,
[52]
exam grade.
[54]
Next click in the input X range box here
and then select hours studied and all the
[60]
values there and then be sure if you do
select the variable names as I have here
[66]
be sure to select labels. Okay that looks
good,
[71]
let's go ahead and click OK. And then
here we get our output; I'm going to
[75]
expand this a little bit just to the
values we need there's a lot of
[78]
information here but I really only need
a few values and let's go ahead and
[84]
highlight those here.
[85]
First of all let's make this font a
little bit bigger, so it's easier to read.
[91]
All right and then we'll expand what we
need here. Now first of all if you watched
[99]
the last video on obtaining a
correlation under where it says multiple
[104]
R in regression
[106]
if we have just two variables, like we do
in this correlation example where we
[111]
have hours studied and exam grade then
with two variables the multiple R is
[117]
just identical to the Pearson r. So
notice how this is also once again
[124]
.86.
[126]
So we could
[127]
really just run regression to get our
correlation if you look under multiple
[131]
R here. Now for the p-value what we
want to do is we can go to the ANOVA
[135]
table and under significance here this
is our p-value and it is very small
[142]
there. Now just in case you're interested
there's also the exact same p-value when
[148]
we have just two variables also is
located under hours studied where it says p
[153]
value.
[154]
So notice how these two are the exact
same here significance F
[159]
and p-value; they're identical. Now what
we're going to do here is we'll use alpha
[165]
of .05
[167]
and the decision rule is as follows: if our
p-value, as given by the significance F,
[172]
or p-value right here
[174]
if it's less than .05, then
the correlation is statistically
[180]
significant
[181]
and since .0001 is definitely less than
.05,
[187]
that indicates the correlation is in
fact significant.
[192]
Since
[193]
this correlation coefficient of .86 is statistically
significant,
[196]
that means that there is a significant
positive relationship between hours
[202]
studied and the grade on the exam. And we
can go ahead and write these results as
[209]
follows:
[210]
There is a significant positive
relationship between the number of hours
[214]
spent studying and the grade on the exam
and then I have r, and that's for
[219]
pearson's r, 11, and that's equal to the
degrees of freedom, where df is equal to N
[225]
minus 2 and N is equal to 13, so 13-2 is
11.
[239]
So
[241]
r(11) equals .86 and that was the value of Pearson's
r if you remember. And I put p is less
[246]
than .001. Now,
alternatively, you could put p is less
[250]
than .05 if you wanted to.
[253]
Now this p-value gives us more
information it is more informative than
[256]
the p-value of less than .05.
But since we used an alpha of point zero
[262]
five in this test, it would be acceptable
to put p is less than .05 if
[267]
we wanted to, but this does provide more
information. And the reason why I said p
[272]
is less than .001, is because
if you look back at our window in Excel
[278]
in our output
[279]
this value is less than .001
as it goes to four decimal places
[287]
before the one appears, but it's not less
than point .0001.
[292]
In any event, in most cases when you see
a p-value reported like this
[297]
it usually doesn't get smaller than less
than .001; that's about
[301]
the limit where we report it in our
written results.
[305]
This concludes the video on obtaining
the p-value for the correlation
[309]
coefficient using the Data Analysis
Toolpak in Microsoft Excel.
Most Recent Videos:
You can go back to the homepage right here: Homepage





