00:01
Here we have a question that involves college students, gender, and drinking habits.
00:05
A study of college students examined the link between gender and drinking habits.
00:10
So students were asked the number of times over the previous two weeks in which they'd had five or more drinks in one sitting.
00:16
And the results are shown in the table below for both males and females.
00:20
So we are to test for an association between drinking habits and gender.
00:25
And you can see here that the questions involved categorizing the number of drinks as 0, 1 to 2, 3 to 4, and 5 or more.
00:36
And again, we have two groups, males, and females.
00:41
So first thing to do is write down our hypotheses.
00:45
So as we're stating our hypotheses, the null hypothesis states that there is no association between gender and drinking habits.
00:55
And then the alternative hypothesis states that there is an association between gender and drinking habits.
01:03
Now, our table shows us in the black typeface here, the observed counts.
01:10
We need to calculate the expected counts ourselves.
01:13
And remember how to find the expected counts.
01:16
What we'll do is take the row total times the column total and divide by the overall total.
01:25
So, for example, in the first cell, the row total is 18 ,712, the column total 8 ,956, the overall total 27 ,268.
01:39
So you can see that those three numbers are used in the calculation right here to get 6 ,146 as the expected count.
01:48
So i did that for all eight cells in the table, and then in parentheses in green wrote the expected counts for each of those cells.
01:58
The next step is to use the expected counts along with the observed counts for each cell to calculate the kai squared statistic.
02:09
So what do we do here? for each cell, we'll take our observed count, and starting with the first cell, the observed count, was 5 .3 .3 .5 .3.
02:20
402, will subtract the expected count.
02:23
We'll square that difference and divide by the expected count.
02:27
We're going to have eight terms, one for each cell, that do the same thing, squared difference of observed minus expected divided by expected.
02:37
So i didn't write all eight of them out here, but you can see that pattern to calculate the kai square statistic...