00:01
So we're going to be assuming that for each of the smoking versus non -smoking, that the proportion in age category 1 for smokers are the same.
00:15
So for category 1, the proportion of smokers is equal to the non -smokers.
00:22
For category 2, that the proportion of smokers versus non -smokers is the same, et cetera, for all four.
00:31
So this is true for all four age categories.
00:42
And alternately, they're not all proportions for each of them are equal.
00:52
And the kye squared statistic we get in our number of degrees of freedom will end up being the number of rows is two, so less one.
01:00
The number of columns is four less one, so we have three degrees of freedom and that statistic ironically comes out to be six and if we're using a 5 % significance level and we draw a little picture of a distribution we put all 0 .5 here for three degrees of freedom that cutoff point or that critical value is 7 .81 and we have a value of six so we would fail to reject null to the the null.
01:35
So it would appear as though these proportions are approximately equal for each of the age categories.
01:41
Part b we wanted to find what the p value is.
01:44
And that p value is this area.
01:47
And we know that area is going to be greater than 0 .05.
01:50
Since we did not, we failed to reject the null.
01:53
And that comes out to be 0 .1116.
01:57
That's our p value.
01:59
Now if we wanted to graph a conditional distribution, and i'm going to let green be a smoker and i'll let red stand for a non -smoker and if i quick look at my my conditional distributions it looks like the highest one is about about 28 percent so if i have this be there's 10 .1 .2 .0 .3 there's 0 .4...