馃攳
Two-sample t test for difference of means | AP Statistics | Khan Academy - YouTube
Channel: Khan Academy
[0]
kaito grows tomatoes in two separate
[2]
fields when the tomatoes are ready to be
[5]
picked he is curious as to whether the
[7]
sizes of his tomato plants differ
[10]
between the two fields he takes a random
[13]
sample of plants from each field and
[15]
measure the and measures the heights of
[17]
the plants
[19]
here is a summary of the results
[21]
so what i want you to do is pause this
[23]
video and conduct a two sample t test
[27]
here and let's assume that all of the
[29]
conditions for
[31]
inference are met the random condition
[33]
the normal condition and the
[35]
independence condition and let's assume
[37]
that we are working with a significance
[39]
level of 0.05
[42]
so can pause the video and conduct the
[44]
two sample t test here to see whether
[46]
there's evidence that the sizes of
[49]
tomato plants differ between the fields
[52]
all right now let's work through this
[53]
together so like always let's first
[56]
construct our null hypothesis and that's
[59]
going to be the situation where there is
[61]
no difference between the mean sizes so
[64]
that would be that the mean size in
[66]
field a is equal to the mean size in
[69]
field b
[70]
now what about our alternative
[72]
hypothesis
[74]
well he wants to see whether the sizes
[76]
of his tomato plants differ between the
[78]
two fields he's not saying whether a is
[80]
bigger than b or whether b is bigger
[82]
than a and so his alternative hypothesis
[85]
would be around his suspicion that the
[88]
mean of a is not equal to the mean of b
[92]
that they differ
[94]
and to do this two sample t test now we
[98]
assume the null hypothesis
[102]
we assume
[103]
our null hypothesis and remember we're
[105]
assuming that all of our conditions for
[106]
inference are met and then we want to
[108]
calculate a t statistic based on this
[111]
sample data that we have and our t
[114]
statistic
[116]
is going to be equal to the differences
[118]
between the sample means
[121]
all of that over our estimate of the
[123]
standard deviation of the sampling
[125]
distribution of the difference of the
[127]
sample means
[128]
this will be the sample standard
[130]
deviation from sample a squared
[133]
over the sample size from a
[136]
plus the sample standard deviation from
[139]
the b sample squared over the sample
[142]
size from b
[144]
and let's see we have all the numbers
[145]
here to calculate it this numerator is
[148]
going to be equal to
[150]
1.3 minus 1.6 1.3 minus 1.6
[155]
all of that over
[156]
the square root of let's see the
[159]
standard deviation the sample standard
[161]
deviation from the sample from field a
[163]
is 0.5 if you square that you're going
[166]
to get 0.25
[169]
and then that's going to be over the
[171]
sample size from field a
[173]
over 22
[176]
plus 0.3 squared so that is 0.3 squared
[181]
is 0.
[187]
all of that over the sample size from
[189]
field b all that over 24.
[193]
the numerator is just going to be
[194]
negative 0.3
[196]
negative 0.3
[198]
divided by
[200]
the square root of .25
[205]
divided by 22
[207]
plus
[209]
.09
[211]
divided by 24
[213]
and that gets us
[216]
negative 2.44
[219]
approximately negative 2.44
[223]
and so if you think about a t
[224]
distribution
[226]
and we'll use our calculator to figure
[228]
out this probability so this is a
[229]
t-distribution right over here
[232]
this would be the assumed mean of our
[235]
t-distribution and so we got a result
[238]
that is negative we get a t statistic of
[240]
negative 2.44 so we're right over here
[243]
so this is
[245]
negative 2.44
[247]
and so we want to say what is the
[248]
probability from this t distribution of
[251]
getting something at least this extreme
[253]
so it would be
[254]
this area and it would also be and it
[257]
would also be this area if we got 2.44
[260]
above the mean it would also be this
[262]
area
[262]
and so what i could do is i'm going to
[264]
use my calculator to figure out this
[266]
probability right over here and then i'm
[268]
just going to multiply that by 2 to get
[270]
this one as well
[272]
so the probability
[274]
of getting
[276]
a
[277]
t value i guess i could say where it's
[279]
absolute value is greater than or equal
[283]
to 2.44
[286]
is going to be approximately equal to
[289]
i'm going to go to second
[291]
distribution i'm going to go to the
[294]
cumulative distribution function for our
[296]
t distribution
[298]
click that
[299]
and since i want to think about
[301]
this
[302]
tail probability here and then i'm just
[304]
going to multiply by 2. the lower bound
[305]
is a very very very negative number you
[308]
could view that as functionally negative
[310]
infinity
[311]
the upper bound is negative 2.44
[314]
negative 2.44
[318]
and now what's our degrees of freedom
[320]
well if we take the conservative
[322]
approach it'll be the smaller of the two
[324]
samples minus one
[327]
well the smaller of the two samples is
[329]
22 and so 22 minus 1 is 21.
[333]
so put 21 in there
[336]
2
[337]
21
[339]
and now
[341]
i can paste
[342]
and i get
[345]
that number right over there and if i
[346]
multiply that by 2 because this just
[348]
gives me the probability of getting
[349]
something lower than that but i also
[351]
want to think about the probability of
[352]
getting something 2.44 or more above the
[356]
mean of our t distribution so times 2
[359]
is going to be equal to approximately
[363]
0.024
[367]
so approximately
[369]
0.024
[372]
and what i want to do then is compare
[374]
this to my significance level and you
[376]
can see very clearly this right over
[378]
here this is equal to our p-value
[381]
our p-value in this situation
[384]
our p-value in this situation is clearly
[387]
less than our significance level
[389]
and because of that we said hey assuming
[391]
the null hypothesis is true we got
[393]
something that's a pretty low
[394]
probability below our threshold so we
[396]
are going to reject
[398]
reject our null hypothesis which tells
[401]
us that there is so this suggests
[404]
this suggests
[407]
the alternative hypothesis that there is
[409]
indeed a difference between the sizes of
[412]
the tomato plants in the two fields
Most Recent Videos:
You can go back to the homepage right here: Homepage





