SPSS for Beginners 5: Correlations - YouTube

Channel: Research By Design

[1]
In this video, I'm gonna show you how to calculate correlations in SPSS. I'll focus on Pearson's
[6]
r, although the process will be pretty similar for other types of correlations correlation
[10]
coefficients. So just a quick review on on Pearson's r, it describes the relationship
[14]
between two continuous variables, it ranges between negative one and positive one, where
[19]
negative one means a perfect inverse relationship, zero indicates no relationship at all, and
[25]
positive one indicates a perfect direct relationship. Anything between zero and one, or zero and
[32]
negative one, indicates varying strengths of relationship. So let's pretend we're interested
[37]
in the relationship between IQ and GPA. So first let's get some data in here, which I'm
[44]
just gonna paste in from some numbers I made up earlier. You can pause the video now and
[50]
enter them in. So as you can probably guess, this first column are IQ scores, they range
[56]
from about the 80s to around 120. The second column indicates different GPA scores, which
[63]
have the potential to range between zero, which means you're failing everything, and
[67]
4.0 means you're acing everything. And it looks like in this dataset the lowest score
[72]
is about 0.8 and the highest GPA is about 3.7. So before we do anything else, let's
[80]
make this easy on ourselves and rename these variables. So I'm gonna go over to variable
[85]
view, and just rename this first one IQ scores, and the second one GPA. And I think all the
[95]
other default settings can stay the same. So these data are set up in a very specific
[103]
way. The order of the scores really doesn't matter as long as each pair stays together.
[109]
So for example, this first person has an IQ of 112 and a GPA of 3.3. And this pair of
[117]
scores can go anywhere, it can go at the top, it can go somewhere towards the bottom or
[121]
it can go in even the middle. The point is, they need to stay together. And indeed these
[129]
scores I have in here aren't ordered in any logical manner, the only thing that's important
[133]
is that each IQ score stays with each GPA, cause it reflects scores obtained from one
[138]
individual person. This is one person's IQ and GPA, this is another person's IQ and GPA,
[144]
some third person's IQ and GPA. If we change the order of one set of scores, but leave
[149]
the others the same, our correlation coefficient won't reflect reality, because it'll be mismatching
[155]
scores from different people. So just remember that data from each person must be paired
[160]
together. Also you can see there are fifteen pairs of scores. We don't talk about there
[167]
being thirty different data points, because when we're doing correlations, it's all about
[171]
how many different pairs of scores you have. So now that we have the data in here, it'll
[176]
be relatively easy to start on the correlation. All we do is we go up to analyze, and under
[183]
the correlate option, there are different types of correlations we can calculate but
[189]
Pearson's r is a type of bivariate, so we go there. And once we're there we see this
[195]
window that we've dealt with before. All the variables we have are on the left, everything
[201]
we wanna analyze we move over to the right. So in this case we need to move over at least
[205]
two variables, if I move over just one it won't let me run the analysis, OK is not available.
[213]
Because if we only had one variable, what would we compare to? So we need to have at
[216]
least two things. We have some different options here too, there are different types of correlation
[226]
coefficients you can calculate, Pearson's r is one of the most widely used ones, so
[229]
it's already checked right here. And it's also the most appropriate for these data,
[236]
there are also ones like Kendall's Tau or Spearman's Rho. We could use those for different
[241]
types of data, for now let's just stick with Pearson's. And below, SSPS also wants to know
[246]
if we are going to run a two-tailed or a one-tailed significance test, and whether to flag significant
[253]
correlations. I haven't talked about significance tests very much yet, but these options deal
[258]
with detecting whether are correlation coefficients are significantly different from zero. In
[263]
other words, would we expect to see similar results in the real world? And I'll talk about
[269]
that more in just a minute. For now, you can just leave those options where they are. So
[274]
once you're ready, just click OK and your output window will pop up. Now what it's showing
[280]
us is in a correlation matrix, that's this box right here. This shows the correlation
[286]
coefficient of every combination of variables. So we have these two rows, one for IQ, one
[293]
for GPA, and we have two columns, one for IQ, one for GPA. So where each row and column
[302]
meet, we see the correlation coefficient between these two variables. We can also see--the
[307]
correlation coefficient's the first thing--then we see the significance level, or P value,
[312]
and then the N, which is the number of pairs of scores each correlation coefficient is
[316]
based on. And remember from earlier we have fifteen pairs of scores. So in this first
[321]
square in this matrix, we see the correlation coefficient between IQ and itself. No big
[327]
surprise, it's one, it's a perfect correlation. We see another perfect correlation down here
[333]
in the lower right, which is the correlation between GPA and itself. You'll see these ones
[341]
in every correlation matrix you run, because like I said earlier, SPSS will compare every
[348]
combination of variables, including each variable and itself. So just be aware that all the
[354]
comparisons will be the same on both sides of this diagonal of ones. The comparison that
[360]
we're probably most interested in is the correlation between IQ and GPA. And notice down here,
[366]
this is the same thing as the correlation between GPA and IQ. It doesn't matter which
[371]
one you do against the other, the order doesn't matter. So first it's showing us the correlation
[377]
is very high, it's .986, or almost, almost one. Below that we also have the significance
[384]
level or p-value, which is very small. It's definitely less than .05 or .01, which are
[391]
typical cutoffs. I'll explain p-values more in a later video, but for now just know that
[396]
anything smaller than .05 means the correlation is significant, so we'd probably see it in
[401]
the real world. And this correlation was so small that we can't even see it with three
[406]
decimal places. It's probably something like .00001. And in this case the correlation has
[413]
been flagged with two asterisks, which means it's significant to the level of less than
[419]
.01, which is why it says that down here. And one last thing I wanna mention is that
[424]
we can compare more than two variables in our matrix. We can actually do a lot more,
[429]
we just need to toss in more variables in the mix. So getting back to the spreadsheet,
[436]
I've magically included some data on people's shoe sizes. So this first person, with an
[442]
IQ of 112, and a GPA of 3.30, also happens to have a shoe size of eight. So what we can
[452]
do is, if we go back to analyze, and under correlate bivariate, we can toss in these
[460]
data on shoe sizes into the mix, into our correlation matrix. So what we get when we
[465]
hit OK is, right below, we get an even bigger correlation matrix, one with three rows and
[473]
three columns. So what it's telling us is that the correlation between shoe size and
[482]
IQ is is .04. The correlation between shoe size and GPA is .06. In other words they're
[490]
both very very small. In other words, there's no relationship between shoe size and these
[495]
other things. And just for the record to you can also see these same coefficients up here,
[500]
between IQ and shoe size, there's the correlation coefficient between GPA and shoe size, there's
[505]
the correlation coefficient right there. So those are the basics of correlations in SPSS,
[512]
just remember that you can make your correlation matrix as big as you want, but they're easier
[517]
to read if you keep them small, so try and limit your comparisons of things that you
[521]
think would be theoretically interesting. Also, don't confuse correlation coefficients
[526]
with p-values, or the significance level. Remember that you want the correlation coefficient
[530]
to be big, close to one, and you want the p-value to be small, hopefully less than .05.