An Introduction to Statistical Methods and Data Analysis

R. Lyman Ott, Michael Longnecker

Chapter 6 Inferences Comparing 2 Population Central Values - all with Video Answers

Educators

Chapter Questions

Problem 1

Refer to the oil-spill case study.
a. What are the populations of interest?
b. What are some factors other than flora density that may indicate that the oil spill has affected the marsh?
c. Describe a method for randomly selecting the tracts where flora density measurements were to be taken.
d. State several hypotheses that may be of interest to the researchers.

Priya Manhas

Numerade Educator

03:30

Problem 2

For each of the situations, set up the rejection region:
a. $H_0: \mu_1=\mu_2$ versus $H_a: \mu_1 \neq \mu_2$ with $n_1=12, n_2=15$, and $\alpha=.05$
b. $H_0: \mu_1 \leq \mu_2+3$ versus $H_a: \mu_1>\mu_2+3$ with $n_1=n_2=25$ and $\alpha=.01$
c. $H_0: \mu_1 \geq \mu_2-9$ versus $H_a: \mu_1<\mu_2-9$ with $n_1=13, n_2=15$, and $\alpha=.025$

Jameson Kuper

Numerade Educator

08:08

Problem 3

Conduct a test of $H_0: \mu_1 \geq \mu_2-2.3$ versus $H_a: \mu_1<\mu_2-2.3$ for the sample data summarized here. Use $\alpha=.01$ in reaching your conclusions.
(TABLE CAN'T COPY)

Mohan Jain

Numerade Educator

01:32

Problem 4

Refer to Exercise 6.3.
a. What is the level of significance for your test?
b. Place a $99 \%$ confidence interval on $\mu_1-\mu_2$.

Akhil Choudhary

Numerade Educator

05:36

Problem 5

In an effort to link cold environments with hypertension in humans, a preliminary experiment was conducted to investigate the effect of cold on hypertension in rats. Two random samples of 6 rats each were exposed to different environments. One sample of rats was held in a normal environment at $26^{\circ} \mathrm{C}$. The other sample was held in a cold $5^{\circ} \mathrm{C}$ environment. Blood pressures and heart rates were measured for rats for both groups. The blood pressures for the 12 rats are shown in the accompanying table.
a. Do the data provide sufficient evidence that rats exposed to a $5^{\circ} \mathrm{C}$ environment have a higher mean blood pressure than rats exposed to a $26^{\circ} \mathrm{C}$ environment? Use $\alpha=.05$.
b. Evaluate the three conditions required for the test used in part (a).
c. Provide a $95 \%$ confidence interval on the difference in the two population means.
(TABLE CAN'T COPY)

Amany Waheeb

Numerade Educator

View

Problem 6

The Department of Natural Resources (DNR) received a complaint from recreational fishermen that a community was releasing sewage into the river where they fished. These types of releases lower the level of dissolved oxygen in the river and hence cause damage to the fish residing in the river. An inspector from the DNR designs a study to investigate the fishermen's claim. Fifteen water samples are selected at locations on the river upstream from the community and fifteen samples are selected downstream from the community. The dissolved oxygen readings in parts per million (ppm) are given in the following table.
$$
\begin{array}{llllllllllllllll}
\hline \text { Upstream } & 5.2 & 4.8 & 5.1 & 5.0 & 4.9 & 4.8 & 5.0 & 4.7 & 4.7 & 5.0 & 4.6 & 5.2 & 5.0 & 4.9 & 4.7 \\
\text { Downstream } & 3.2 & 3.4 & 3.7 & 3.9 & 3.6 & 3.8 & 3.9 & 3.6 & 4.1 & 3.3 & 4.5 & 3.7 & 3.9 & 3.8 & 3.7 \\
\hline
\end{array}
$$
a. In order for the discharge to have an impact on fish health, there needs to be at least an .5 ppm reduction in the dissolved oxygen. Do the data provide sufficient evidence that there is a large enough reduction in the mean dissolved oxygen between the upstream and downstream water in the river to impact the health of the fish? Use $\alpha=.01$.
b. Do the required conditions to use the test in part (a) appear to be valid?
c. What is the level of significance of the test in part (a)?
d. Estimate the size of the difference in the mean dissolved oxygen readings for the two locations on the river using a $99 \%$ confidence interval.

Victor Salazar

Numerade Educator

Problem 7

An industrial engineer conjectures that a major difference between successful and unsuccessful companies is the percentage of their manufactured products returned because of defectives. In a study to evaluate this conjecture, the engineer surveyed the quality control departments of 50 successful companies (identified by the annual profit statement) and 50 unsuccessful companies. The companies in the study all produced products of a similar nature and cost. The percentages of the total output returned by customers in the previous year are provided in following table.
a. Do the data provide sufficient evidence that successful businesses have a lower percentage of their products returned by customers? Use $\alpha=.05$.
$$
\begin{array}{lrrrrrrrrrr}
\hline \text { Unsuccessful } & 11.35 & 9.19 & 10.30 & 8.59 & 4.98 & 6.82 & 6.03 & 11.15 & 9.38 & 8.32 \\
\text { Businesses } & 8.34 & 7.69 & 13.58 & 10.49 & 11.07 & 6.98 & 9.77 & 9.36 & 8.39 & 7.98 \\
& 6.56 & 6.85 & 8.06 & 7.71 & 11.04 & 11.69 & 9.40 & 10.00 & 5.45 & 9.67 \\
& 8.93 & 7.32 & 13.70 & 8.67 & 10.08 & 8.53 & 9.14 & 9.02 & 6.70 & 5.66 \\
& 8.26 & 7.07 & 12.23 & 11.93 & 4.76 & 13.81 & 11.41 & 6.44 & 9.50 & 8.99 \\
\text { Successful } & 10.24 & 6.16 & 5.06 & 10.64 & 6.77 & 10.13 & 4.59 & 1.38 & 8.81 & 1.97 \\
\text { Businesses } & 5.43 & 6.32 & 0.43 & 7.30 & 0.47 & 10.82 & 9.34 & 2.39 & 11.06 & 4.19 \\
& 5.09 & 8.20 & 10.51 & 1.94 & 9.82 & 6.69 & 0.91 & 6.17 & 0.17 & 7.47 \\
& 3.62 & 2.23 & 1.08 & 9.16 & 6.07 & 7.51 & 4.46 & 2.13 & 2.41 & 7.24 \\
& 4.06 & 7.70 & 8.32 & 6.33 & 3.83 & 4.96 & 9.05 & 6.41 & 0.27 & 8.48 \\
\hline
\end{array}
$$
b. Do the required conditions for applying your test in part (a) appear to be valid?
c. In order for the difference in percentage returns to have an economical impact, the difference must be at least $5 \%$. Is there significant evidence that the percentage for successful businesses is at least $5 \%$ less that the percentage for unsuccessful businesses?
d. Estimate the difference in the percentages of returns for successful and unsuccessful businesses using a $95 \%$ confidence interval.

Check back soon!

03:34

Problem 8

The number of households currently receiving a daily newspaper has decreased over the last 10 years, and many people state they obtain information about current events through television news and the Internet. To test whether people who receive a daily newspaper have a greater knowledge of current events than people who don't, a sociologist gave a current events test to 25 randomly selected people who subscribe to a daily newspaper and to 30 randomly selected persons who do not receive a daily newspaper. The following stem-and-leaf graphs give the scores (maximum score is 70 ) for the two groups. Does it appear that people who receive a daily newspaper have a greater knowledge of current events? Be sure to evaluate all necessary conditions for your procedures to be valid.

Jameson Kuper

Numerade Educator

03:42

Problem 9

The study of concentrations of atmospheric trace metals in isolated areas of the world has received considerable attention because of the concern that humans might somehow alter the climate of the earth by changing the amount and distribution of trace metals in the atmosphere. Consider a study at the South Pole, where, over a 2 -month period, seventy air samples were obtained. In thirty-five of the samples, the amount of magnesium was determined. In the remaining thirty-five samples, the amount of europium was determined.
$$
\begin{array}{lccc}
\hline & \text { Sample Size } & \text { Sample Mean } & \text { Sample Standard Deviation } \\
\hline \text { Magnesium } & 35 & 1.0 & 2.21 \\
\text { Europium } & 35 & 17.0 & 12.65 \\
\hline
\end{array}
$$
a. What are the populations of interest in this study?
b. Is there significant evidence of a difference in the mean magnesium and Europium levels? Use $\alpha=.05$.
c. What is the level of significance of your test?
d. Estimate the mean levels of magnesium and Europium using a 95\% confidence interval.

Sheryl Ezze

Numerade Educator

06:35

Problem 10

Refer to Exercise 6.9.
a. Based on the values of the sample mean and sample standard deviation for magnesium, provide a reason why the distribution of magnesium does not have a normal distribution.
b. Are the inferences given in Exercise 6.9 valid based on your answer in part (a)?

Sonam Khatri

Numerade Educator

02:49

Problem 11

PCBs have been in use since 1929 , mainly in the electrical industry, but it was not until the 1960 s that they were found to be a major environmental contaminant. In the paper "The Ratio of DDE to PCB Concentrations in Great Lakes Herring Gull Eggs and Its Use in Interpreting Contaminants Data" [Journal of Great Lakes Research (1998) 24(1):12-31], researchers report on the following study. Thirteen study sites from the five Great Lakes were selected. At each site, 9 to 13 herring gull eggs were collected randomly each year for several years. Following collection, the PCB content was determined. The mean PCB content at each site is reported in the following table for the years 1982 and 1996.
(TABLE CAN'T COPY)
a. Legislation was passed in the 1970 s restricting the production and use of PCBs. Thus, the active input of PCBs from current local sources has been severely curtailed. Do the data provide evidence that there has been a significant decrease in the mean PCB content of herring gull eggs?
b. Estimate the size of the decrease in mean PCB content from 1982 to 1996, using a $95 \%$ confidence interval.
c. Evaluate the conditions necessary to validly test the hypotheses and construct the confidence intervals using the collected data.
d. Does the independence condition appear to be violated?

Sheryl Ezze

Numerade Educator

Problem 12

Refer to Exercise 6.11. There appears to be a large variation in the mean PCB content across the 13 sites. How could we reduce the effect of variation in PCB content due to site differences on the evaluation of the difference in the PCB content means between the 2 years?

Check back soon!

Problem 13

A firm has a generous but rather complicated policy concerning end-of-year bonuses for its lower-level managerial personnel. The policy's key factor is a subjective judgment of "contribution to corporate goals." A personnel officer took samples of 24 female and 36 male managers to see whether there was any difference in bonuses, expressed as a percentage of yearly salary. The data are listed here:
(TABLE CAN'T COPY)
a. What are the populations of interest in this study?
b. Is there significant evidence that the mean bonus percentage for males is more than five units larger than the mean bonus percentage for females? Use $\alpha=.05$.
c. What is the level of significance of your test?
d. Estimate the difference in the mean bonus percentages for males and females using a $95 \%$ confidence interval.

Check back soon!

01:35

Problem 14

Provide the rejection region for the Wilcoxon rank sum test for each of the following sets of hypotheses:
a. $H_0: \Delta=0$ versus $H_a: \Delta \neq 0$ with $n_1=8, n_2=9$, and $\alpha=.10$
b. $H_0: \Delta=0$ versus $H_a: \Delta<0$ with $n_1=6, n_2=7$, and $\alpha=.05$
c. $H_0: \Delta=0$ versus $H_a: \Delta>0$ with $n_1=5, n_2=9$, and $\alpha=.025$

Lucas Finney

Numerade Educator

01:41

Problem 15

Random samples of size $n_1=8$ and $n_2=8$ were selected from populations A and B, respectively. The data are given in the following table.
$$
\begin{array}{lllllllll}
\hline \text { Population A } & 4.3 & 4.6 & 4.7 & 5.1 & 5.3 & 5.3 & 5.8 & 5.4 \\
\text { Population B } & 3.5 & 3.8 & 3.7 & 3.9 & 4.4 & 4.7 & 5.2 & 4.4 \\
\hline
\end{array}
$$
a. Test for a difference in the medians of the two populations using an $\alpha=.05$ Wilcoxon rank sum test.
b. Place a $95 \%$ confidence interval on the difference in the medians of the two populations.

James Kiss

Numerade Educator

Problem 16

Refer to Exercise 6.15.
a. Test for a difference in the means in the two populations using an $\alpha=.05 \mathrm{t}$-test.
b. Place a $95 \%$ confidence interval on the difference in the means of the two populations.
c. Compare the inferences obtained from the results from the Wilcoxon rank sum test and the $t$-test.
d. Which inferences appear to be more valid, inferences on the means or the medians?

Check back soon!

05:35

Problem 17

A cable TV company was interested in making its operation more efficient by cutting down on the distance between service calls while still maintaining at least the same level of service quality. A treatment group of 18 repairpersons was assigned to a dispatcher who monitored all the incoming requests for cable repairs and then provided a service strategy for that day's work orders. A control group of 18 repairpersons was to perform their work in a normal fashion-that is, by providing service in roughly a sequential order as requests for repairs were received. The average daily mileages for the 36 repairpersons are recorded here:
$$
\begin{array}{lrrrrrr}
\hline \text { Treatment Group } & 62.2 & 79.3 & 83.2 & 82.2 & 84.1 & 89.3 \\
& 95.8 & 97.9 & 91.5 & 96.6 & 90.1 & 98.6 \\
& 85.2 & 87.9 & 86.7 & 99.7 & 101.1 & 88.6 \\
\text { Control Group } & 97.1 & 70.2 & 94.6 & 182.9 & 85.6 & 89.5 \\
& 109.5 & 101.7 & 99.7 & 193.2 & 105.3 & 92.9 \\
& 63.9 & 88.2 & 99.1 & 95.1 & 92.4 & 87.3 \\
\hline
\end{array}
$$
a. What are the populations of interest in this study?
b. Is there significant evidence that the treatment group had a smaller average daily mileage than the control group? Use $\alpha=.05$.
c. What is the level of significance of your test?
d. Estimate the difference in the average daily mileage for the treatment and control groups using a $95 \%$ confidence interval.
e. There are three possible procedures that could be applied to answer the questions in parts (b), (c), and (d). Which of these procedures appears to be the most valid?

Foster Wisusik

Numerade Educator

10:00

Problem 18

The paper "Serum Beta-2-Microglobulin (SB2M) in Patients with Multiple Myeloma Treated with Alpha Interferon" [Journal of Medicine (1997) 28:311-318] reports on the influence of alpha interferon administration in the treatment of patients with multiple myeloma (MM). Twenty newly diagnosed patients with MM were entered into the study. The researchers randomly assigned the 20 patients to the two groups. Ten patients were treated with both intermittent melphalan and sumiferon (treatment group), whereas the remaining 10 patients were treated only with intermittent melphalan (control group). The SB2M levels were measured before and at days 3,8 , and 15 and months 1,3 , and 6 from the start of therapy. The measurement of SB2M was performed using a radioimmunoassay method. The measurements before treatment are given here.
$$
\begin{array}{lllllllllll}
\hline \text { Treatment Group } & 2.9 & 2.7 & 3.9 & 2.7 & 2.1 & 2.6 & 2.2 & 4.2 & 5.0 & 0.7 \\
\text { Control Group } & 3.5 & 2.5 & 3.8 & 8.1 & 3.6 & 2.2 & 5.0 & 2.9 & 2.3 & 2.9 \\
\hline
\end{array}
$$
a. Plot the sample data for both groups using boxplots or normal probability plots.
b. Based on your findings in part (a), which procedure appears more appropriate for comparing the distributions of SB2M?
C. Is there significant evidence that there is a difference in the distribution of SB2M for the two groups?
d. Discuss the implications of your findings in part (c) for the evaluation of the influence of alpha interferon.

Gus Steppen

Numerade Educator

00:46

Problem 19

The simulation study described in Section 6.3 evaluated the effect of heavy-tailed and skewed distributions on the level of significance and power of the $t$ test and Wilcoxon rank sum test. Examine the results displayed in Table 6.13, and then answer the following questions.
a. What has a greater effect, if any, on the level of significance of the $t$ test, skewness or heavy-tailness?
b. What has a greater effect, if any, on the level of significance of the Wilcoxon rank sum test, skewness or heavy-tailness?

Maxime Rossetti

Numerade Educator

02:28

Problem 20

Refer to Exercise 6.19.
a. What has a greater effect, if any, on the power of the $t$ test, skewness or heavy tailedness?
b. What has a greater effect, if any, on the power of the Wilcoxon rank sum test, skewness or heavy tailedness?

Nick Johnson

Numerade Educator

00:47

Problem 21

Refer to Exercises 6.19 and 6.20.
a. For what type of population distributions would you recommend using the $t$ test? Justify your answer.
b. For what type of population distributions would you recommend using the Wilcoxon rank sum test? Justify your answer.

Maxime Rossetti

Numerade Educator

04:39

Problem 22

Provide the rejection region for the paired $t$ test for each of the following sets of hypotheses:
a. $H_0: \mu_d=0$ versus $H_d: \mu_d \neq 0$ with $n=19$, and $\alpha=.05$
b. $H_0: \mu_d \leq 0$ versus $H_a: \mu_d>0$ with $n=8$, and $\alpha=.025$
c. $H_0: \mu_d \geq 0$ versus $H_a: \mu_d<0$ with $n=14$, and $\alpha=.01$

Karen Song

Numerade Educator

03:24

Problem 23

A random sample of eight pairs of twins was randomly assigned to treatment A or treatment B. The data are given in the following table.
$$
\begin{array}{lcccccccc}
\hline \text { Twins } & \mathbf{1} & \mathbf{2} & \mathbf{3} & \mathbf{4} & \mathbf{5} & \mathbf{6} & \mathbf{7} & \mathbf{8} \\
\hline \text { Treatment A } & 48.3 & 44.6 & 49.7 & 40.5 & 54.3 & 55.6 & 45.8 & 35.4 \\
\text { Treatment B } & 43.5 & 43.8 & 53.7 & 43.9 & 54.4 & 54.7 & 45.2 & 34.4 \\
\hline
\end{array}
$$
a. Is there significant evidence that the two treatments differ using an $\alpha=.05$ paired $t$ test.
b. Is there significant evidence that the two treatments differ using an $\alpha=.05$ sign test.
c. Do your conclusions in parts (a) and (b) agree?
d. How do your inferences about the two treatments based on the paired $t$ test and based on the sign test differ?

Sheryl Ezze

Numerade Educator

03:55

Problem 24

Refer to Exercise 6.23.
a. What is the level of significance of the paired $t$ test?
b. What is the level of significance of the sign test?
c. Place a $95 \%$ confidence interval on the mean difference between the responses from the two treatments.
d. Which of the two procedures, the paired $t$ test or the sign test, appears to be more valid in this study?

Jameson Kuper

Numerade Educator

07:36

Problem 25

Refer to the data of Exercise 6.11. A potential criticism of analyzing these data as if they were two independent samples is that the measurements taken in 1996 were taken at the same sites as the measurements taken in 1982. Thus, there is the possibility that there will be a strong positive correlation between the pair of observations at each site.
a. Plot the pairs of observations in a scatterplot with the 1982 values on the horizontal axis and the 1996 values on the vertical axis. Does there appear to be a positive correlation between the pairs of measurements? Estimate the correlation between the pairs of observations?
b. Compute the correlation coefficient between the pairs of observations. Does this value confirm your observations from the scatterplot? Explain your answer.
c. Answer the questions posed in parts (a) and (b) of Exercise 6.11 using a paired data analysis. Are your conclusions different from the conclusions you reached treating the data as two independent samples?

Trent Speier

Numerade Educator

Problem 26

Researchers are studying two existing coatings used to prevent corrosion in pipes that transport natural gas. The study involves examining sections of pipe that had been in the ground at least 5 years. The effectiveness of the coating depends on the pH of the soil, so the researchers recorded the pH of the soil at all 20 sites at which the pipe was buried prior to measuring the amount of corrosion on the pipes. The pH readings are given here. Describe how the researchers could conduct the study to reduce the effect of the differences in the pH readings on the evaluation of the difference in the two coatings' corrosion protection.
(TABLE CAN'T COPY)

Check back soon!

00:39

Problem 27

Suppose you are a participant in a project to study the effectiveness of a new treatment for high cholesterol. The new treatment will be compared to a current treatment by recording the change in cholesterol readings over a 10-week treatment period. The effectiveness of the treatment may depend on each participant's age, body fat percentage, diet, and general health. The study will involve at most 30 participants because of cost considerations.
a. Describe how you would conduct the study using independent samples.
b. Describe how you would conduct the study using paired samples.
c. How would you decide which method, paired or independent samples, would be more efficient in evaluating the change in cholesterol readings?

James Kiss

Numerade Educator

02:30

Problem 28

The paper "Effect of Long-Term Blood Pressure Control on Salt Sensitivity" []ournal of Medicine (1997) 28:147-156] describes a study evaluating salt sensitivity (SENS) after a period of antihypertensive treatment. Ten hypertensive patients (diastolic blood pressure between 90 and 115 mmHg ) were studied after at least 18 months on antihypertensive treatment. SENS readings, which were obtained before and after the patients were placed on an antihypertensive treatment, are given here.
(TABLE CAN'T COPY)
a. Is there significant evidence that the mean SENS value decreased after the patient received antihypertensive treatment?
b. Estimate the size of the change in the mean SENS value.
c. Do the conditions required for using the $t$ procedures appear to be valid for these data? Justify your answer.

Sheryl Ezze

Numerade Educator

View

Problem 29

A study was designed to measure the effect of home environment on academic achievement of 12 -year-old students. Because genetic differences may also contribute to academic achievement, the researcher wanted to control for this factor. Thirty sets of identical twins were identified who had been adopted prior to their first birthday, with one twin placed in a home in which academics were emphasized (Academic) and the other twin placed in a home in which academics were not emphasized (Nonacademic). The final grades (based on 100 points) for the 60 students are given here.
(TABLE CAN'T COPY)
a. Is there a difference in the mean final grades between the students in an academically oriented home environment and those in a nonacademically oriented home environment. Use $\alpha=.05$.
b. Estimate the size of the difference in the mean final grades of the students in academic and nonacademic home environments using a $95 \%$ confidence interval.
c. Do the conditions for using the $t$ procedures appear to be satisfied for these data?
d. Does it appear that using twins in this study to control for variation in final scores was effective as compared to taking a random sample of 30 students in both types of home environments? Justify your answer.

Victor Salazar

Numerade Educator

03:38

Problem 30

Provide the rejection region for the Wilcoxon signed-rank test for each of the following sets of hypotheses::
a. $H_0: M=0$ versus $H_a: M \neq 0$ with $n=19$, and $\alpha=.05$
b. $H_0: M \leq 0$ versus $H_a: M>0$ with $n=8$, and $\alpha=.025$
c. $H_0: M \geq 0$ versus $H_a: M<0$ with $n=14$, and $\alpha=.01$

Kayla Laughman

Numerade Educator

View

Problem 31

A random sample of eight pairs of twins were randomly assigned to treatment A or treatment B. The data are given in the following table.
$$
\begin{array}{lcccccccc}
\hline \text { Twins } & \mathbf{1} & \mathbf{2} & \mathbf{3} & \mathbf{4} & \mathbf{5} & \mathbf{6} & \mathbf{7} & \mathbf{8} \\
\hline \text { Treatment A } & 48.3 & 44.6 & 49.7 & 40.5 & 54.3 & 55.6 & 45.8 & 35.4 \\
\text { Treatment B } & 43.5 & 43.8 & 53.7 & 43.9 & 54.4 & 54.7 & 45.2 & 34.4 \\
\hline
\end{array}
$$
a. Is there significant evidence that the two treatments differ using an $\alpha=.05$ Wilcoxon signed-rank test.
b. Compare your conclusion with the conclusions obtained using the paired $t$ test and sign test in Exercise 6.23.

Victor Salazar

Numerade Educator

06:39

Problem 32

Refer to Exercise 6.31.
a. What is the level of significance of the Wilcoxon signed-rank test?
b. Compare the levels of significance of the Wilcoxon signed-rank test, paired $t$ test, and sign test for the data set in Exercise 6.31?
c. Place a $95 \%$ confidence interval on the mean difference between the responses from the two treatments.
d. Which of the three procedures, the Wilcoxon signed-rank test, paired $t$ test or sign test, appears to be most valid test for this study?

Sheryl Ezze

Numerade Educator

Problem 33

Use the level and power values for the paired $t$ test and Wilcoxon signed-rank test given in Table 6.18 to answer the following questions.
a. For small sample sizes, $n \leq 20$, does the actual level of the $t$ test appear to deviate from the nominal level of $\alpha=.05$ ?
b. Which type of deviations from a normal distribution, skewness or heavytailedness, appears to have the greater affect on the $t$ test?
c. For small sample sizes, $n \leq 20$, does the actual level of the Wilcoxon signed-rank test appear to deviate from the nominal level of $\alpha=.05$ ?
d. Which type of deviations from a normal distribution, skewness or heavytailedness, appears to have the greater effect on the Wilcoxon signed-rank test?

Check back soon!

01:09

Problem 34

Use the level and power values for the paired $t$ test and Wilcoxon signed-rank test given in Table 6.18 to answer the following questions:
a. Suppose a level .05 test is to be applied to a paired data set that has differences that are highly skewed to the right. Will the Wilcoxon signed-rank test's "actual" level or the paired $t$ test's actual level be closer to .05 ? Justify your answer.
b. Suppose a boxplot of the differences in the pairs from a paired data set has many outliers, with an equal number above and below the median. If a level $\alpha=.05$ test is applied to the differences, will the Wilcoxon signed-rank test's "actual" level or the paired $t$ test's actual level be closer to .05 ? Justify your answer.

Maxime Rossetti

Numerade Educator

04:36

Problem 35

A study was conducted to determine whether automobile repair charges are higher for female customers than for male customers. Twenty auto repair shops were randomly selected from the telephone book. Two cars of the same age, brand, and engine problem were used in the study. For each repair shop, the two cars were randomly assigned to a man and woman participant and then taken to the shop for an estimate of repair cost. The repair costs (in dollars) are given here.
(TABLE CAN'T COPY)
a. Which procedure, $t$ or Wilcoxon, is more appropriate in this situation? Why?
b. Are repair costs generally higher for female customers than for male customers? Use $\alpha=.05$.

James Kiss

Numerade Educator

View

Problem 36

The effect of Benzedrine on the heart rate of dogs (in beats per minute) was examined in an experiment on 14 dogs chosen for the study. Each dog was to serve as its own control, with half of the dogs assigned to receive Benzedrine during the first study period and the other half assigned to receive a placebo (saline solution). All dogs were examined to determine the heart rates after 2 hours on the medication. After 2 weeks in which no medication was given, the regimens for the dogs were switched for the second study period. The dogs previously on Benzedrine were given the placebo, and the others received Benzedrine. Again, heart rates were measured after 2 hours.

The following sample data are not arranged in the order in which they were taken but have been summarized by regimen. Use these data to test the research hypothesis that the distribution of heart rates for the dogs when receiving Benzedrine is shifted to the right of that for the same animals when on the placebo. Use a one-tailed Wilcoxon signed-rank test with $\alpha=.05$.
(TABLE CAN'T COPY)

Victor Salazar

Numerade Educator

01:14

Problem 37

A study is being planned to evaluate the possible side effects of an anti-inflammatory drug. It is suspected that the drug may lead to an elevation in the blood pressure of users of the drug. A preliminary study of two groups of patients, one receiving the drug and the other receiving a placebo, provides the following information on the systolic blood pressure (in mm Hg ) of the two groups:
$$
\begin{array}{lcc}
\hline \text { Group } & \text { Mean } & \text { Standard Deviation } \\
\hline \text { Placebo } & 129.9 & 18.5 \\
\text { Anti-inflammatory drug } & 135.5 & 18.7 \\
\hline
\end{array}
$$
Assume that both groups have systolic blood pressures that have a normal distribution with standard deviations relatively close to the values obtained in the pilot study. Suppose the study plan provides for the same number of patients in the placebo group as in the treatment group. Determine the sample size necessary for an $\alpha=.05 t$ test to have a power of .80 to detect an increase of 5 mm Hg in the blood pressure of the treatment group relative to that of the placebo group.

Manik Pulyani

Numerade Educator

01:14

Problem 38

Refer to Exercise 6.37. Suppose that the agency sponsoring the study specifies that the group receiving the drug should have twice as many patients as the placebo group. Determine the sample sizes necessary for an $\alpha=.05 t$ test to have a power of .80 to detect an increase of 5 mm Hg in the blood pressure of the treatment group relative to that of the placebo group.

Manik Pulyani

Numerade Educator

01:35

Problem 39

Refer to Exercise 6.37. The researchers also need to obtain precise estimates of the mean difference in systolic blood pressures for people who use the anti-inflammatory drug versus those who do not.
a. Suppose the sample sizes are the same for both groups. What sample size is needed to obtain a $95 \%$ confidence interval for the mean difference in systolic blood pressure between the users and nonusers having a width of at most 5 mm Hg .
b. Suppose the user group will have twice as many patients as the placebo group. What sample size is needed to obtain a $95 \%$ confidence interval for the mean difference in systolic blood pressures between the users and nonusers having a width of at most 5 mm Hg .

Adriano Chikande

Numerade Educator

03:56

Problem 40

An environmental impact study was performed in a small state to determine the effectiveness of scrubbers on the amount of pollution coming from the cooling towers of a chemical plant. The amounts of pollution (in ppm ) detected from the cooling towers before and after the scrubbers were installed are given below for 23 cooling towers.
$$
\begin{array}{lcc}
\hline & \text { Mean } & \text { Standard Deviation } \\
\hline \text { Before scrubber } & 71 & 26 \\
\text { After scrubber } & 63 & 25 \\
\text { Difference }=\text { before }- \text { after } & 8 & 20 \\
\hline
\end{array}
$$
Suppose a larger study is planned for a state with a more extreme pollution problem.
a. How many chemical plant cooling towers need to be measured if we want a probability of .90 of detecting a mean reduction in pollution of 10 ppm due to installing the scrubbers using an $\alpha=.01$ test?
b. What assumptions did you make in part (a) in order to compute the sample size?

Maxime Rossetti

Numerade Educator

02:18

Problem 41

Refer to Exercise 6.40. The state regulators also need to obtain a precise estimate of the mean reduction in the pollution level after installing the scrubbers. What sample size is needed to obtain a $99 \%$ confidence interval having width of 8.5 ppm ?

Ana Carolina Da Cruz

Numerade Educator

01:59

Problem 42

Long-distance runners have contended that moderate exposure to ozone increases lung capacity. To investigate this possibility, a researcher exposed 12 rats to ozone at the rate of two parts per million for a period of 30 days. The lung capacity of the rats was determined at the beginning of the study and again after the 30 days of ozone exposure. The lung capacities (in mL ) are given here.
$$
\begin{array}{|c|c|c|c|c|c|c|c|c|c|c|c|c|}
\hline \text { Rat } & \mathbf{1} & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & 10 & 11 & 12 \\
\hline \text { Before exposure } & 8.7 & 7.9 & 8.3 & 8.4 & 9.2 & 9.1 & 8.2 & 8.1 & 8.9 & 8.2 & 8.9 & 7.5 \\
\hline \text { After exposure } & 9.4 & 9.8 & 9.9 & 10.3 & 8.9 & 8.8 & 9.8 & 8.2 & 9.4 & 9.9 & 12.2 & 9.3 \\
\hline
\end{array}
$$
a. Is there sufficient evidence to support the conjecture that ozone exposure increases lung capacity? Use $\alpha=.05$. Report the $p$-value of your test.
b. Estimate the size of the increase in lung capacity after exposure to ozone using a $95 \%$ confidence interval.
C. After completion of the study, the researcher claimed that ozone causes increased lung capacity. Is this statement supported by this experiment?

Sriparna Bhattacharjee

Numerade Educator

Problem 43

In an environmental impact study for a new airport, the noise levels of various jets were measured just seconds after their wheels left the ground. The jets were either wide-bodied or narrow-bodied. The noise levels in decibels ( dB ) are recorded here for 15 wide-bodied jets and 12 narrow-bodied jets.
(TABLE CAN'T COPY)
a. Do the two types of jets have different mean noise levels? Report the level of significance of the test.
b. Estimate the size of the difference in mean noise levels between the two types of jets using a $95 \%$ confidence interval.
c. How would you select the jets for inclusion in this study?

Check back soon!

View

Problem 44

An entomologist is investigating which of two fumigants, $F_1$ or $F_2$, is more effective in controlling parasities in tobacco plants. To compare the fumigants, nine fields of differing soil characteristics, drainage, and amount of wind shield were planted with tobacco. Each field was then divided into two plots of equal area. Fumigant $F_1$ was randomly assigned to one plot in each field and $F_2$ to the other plot. Fifty plants were randomly selected from each field, 25 from each plot, and the numbers of parasites were counted. The data are in the following table.
$$
\begin{array}{lccccccccc}
\hline \text { Field } & \mathbf{1} & \mathbf{2} & \mathbf{3} & \mathbf{4} & \mathbf{5} & \mathbf{6} & \mathbf{7} & \mathbf{8} & \mathbf{9} \\
\hline \text { Fumigant } F_1 & 77 & 40 & 11 & 31 & 28 & 50 & 53 & 26 & 33 \\
\text { Fumigant } F_2 & 76 & 38 & 10 & 29 & 27 & 48 & 51 & 24 & 32 \\
\hline
\end{array}
$$
a. What are the populations of interest?
b. Do the data provide sufficient evidence to indicate a difference in the mean levels of parasites for the two fumigants? Use $\alpha=.10$. Report the $p$-value for the experimental data.
c. Estimate the size of the difference in the mean numbers of parasites between the two fumigants using a $90 \%$ confidence interval.

Victor Salazar

Numerade Educator

Problem 45

Refer to Exercise 6.44. An alternative design of the experiment would involve randomly assigning fumigant $F_1$ to nine of the plots and $F_2$ to the other nine plots, ignoring which fields the plots were from. What are some of the problems that may occur in using the alternative design?

Check back soon!

Problem 46

Following the March 24, 1989, grounding of the tanker Exxon Valdez in Alaska, approximately 35,500 tons of crude oil were released into Prince William Sound. The paper "The Deep Benthos of Prince William Sound, Alaska, 16 Months After the Exxon Valdez Oil Spill" (Feder and Blanchard, 1998) reports on an evaluation of deep benthic infauna after the spill. Thirteen sites were selected for study. Seven of the sites were within the oil trajectory, and six were outside the oil trajectory. Collection of environmental and biological data at two depths, 40 m and 100 m , occurred in the period July $1-23,1990$. One of the variables measured was population abundance (individuals per square meter). The values are given in the following table.
(TABLE CAN'T COPY)
a. After combining the data from the two depths, does there appear to be a difference in population mean abundances between the sites within and outside the oil trajectory? Use $\alpha=.05$.
b. Estimate the size of the difference in the mean population abundances at the two types of sites using a $95 \%$ confidence interval.
c. What are the required conditions for the techniques used in parts (a) and (b)?
d. Check to see whether the required conditions are satisfied.

Check back soon!

View

Problem 47

Refer to Exercise 6.46. Answer the following questions using the combined data for both depths.
a. Use the Wilcoxon rank sum test to assess whether there is a difference in population abundances between the sites within and outside the oil trajectory. Use $\alpha=.05$.
b. What are the required conditions for the techniques used in part (a)?
c. Are the required conditions satisfied?
d. Discuss any differences in the conclusions obtained using the $t$ procedures and the Wilcoxon rank sum test.

Victor Salazar

Numerade Educator

05:55

Problem 48

Refer to Exercise 6.46. The researchers also examined the effect of depth on population abundance.
a. Plot the four data sets using side-by-side boxplots to demonstrate the effect of depth on population abundance.
b. Separately for each depth, evaluate differences between the sites within and outside the oil trajectory. Use $\alpha=.05$.
C. Are your conclusions at 40 m consistent with your conclusions at 100 m ?

Michaela Flitsch

Numerade Educator

Problem 49

Refer to Exercises 6.46-6.48.
a. Discuss the veracity of the following statement: "The oil spill did not adversely affect the population abundance; in fact, it appears to have increased the population abundance."
b. A possible criticism of the study is that the six sites outside the oil trajectory were not comparable in many aspects to the seven sites within the oil trajectory. Suppose that the researchers had data on population abundance at the seven within-trajectory sites prior to the oil spill. What type of analysis could be used on these data to evaluate the effect of the oil spill on population abundance? What are some advantages to using these data rather than the data in Exercise 6.46?
c. What are some possible problems with using the before and after oil spill data in assessing the effect of the spill on population abundance?

Check back soon!

Problem 50

A study was conducted to evaluate the effectiveness of an antihypertensive product. Three groups of 20 rats each were randomly selected from a strain of hypertensive rats. The 20 rats in the first group were treated with a low dose of an antihypertensive product, the second group with a higher dose of the same product, and the third group with an inert control. The amounts of decrease in systolic blood pressure 30 minutes after the rats receive an injection are given in the following table. Note that negative values represent increases in blood pressure.
a. Compare the mean drops in blood pressure for the high-dose group and the control group. Use $\alpha=.05$ and report the level of significance.
b. Estimate the size of the difference in the mean drops for the high-dose and control groups using a $95 \%$ confidence interval.
c. Do the conditions required for the statistical techniques used in parts (a) and (b) appear to be satisfied? Justify your answer.

Check back soon!

Problem 51

Refer to Exercise 6.50.
a. Compare the mean drops in blood pressure for the low-dose group and the control group. Use $\alpha=.05$ and report the level of significance.
b. Estimate the size of the difference in the mean drops for the low-dose and control groups using a $95 \%$ confidence interval.
c. Do the conditions required for the statistical techniques used in parts (a) and (b) appear to be satisfied? Justify your answer.

Check back soon!

Problem 52

Refer to Exercise 6.50.
a. Compare the mean drops in blood pressure for the low-dose group and the highdose group. Use $\alpha=.05$ and report the level of significance.
b. Estimate the size of the difference in the mean drops for the low-dose and highdose groups using a $95 \%$ confidence interval.
c. Do the conditions required for the statistical techniques used in parts (a) and (b) appear to be satisfied? Justify your answer.

Check back soon!

01:14

Problem 53

Refer to Exercise 6.50 .
a. Describe the populations to which the inferences provided in Exercises 6.50-6.52 are relevant.
b. A much larger study is to be designed to further examine the effectiveness of the high-dose level of the drug. How many rats would be needed in the new study to be $90 \%$ confident that an $\alpha=.05$ test would detect a reduction of 10 mm Hg by the high-dose level relative to the mean blood pressure readings of the control group? Hint: Assume that the decreases in blood pressure for the high-dose and control groups have normal distributions with standard deviations of 30 mm Hg .
c. The company producing the drug wants a precise estimate of the mean reduction in the systolic blood pressure after injection with a high dose of the drug. What sample size is needed to obtain a $99 \%$ confidence interval having width of 5 mm Hg ?

Manik Pulyani

Numerade Educator

03:34

Problem 54

To assess whether degreed nurses received a more comprehensive training than registered nurses, a study was designed to compare the two groups. The state nursing licensing board randomly selected 50 nurses from each group for evaluation. They were given the state licensing board examination, and their scores are given in the following table.
(TABLE CAN'T COPY)
a. Can the licensing board conclude that the mean score of nurses who receive a BS in nursing is higher than the mean score of registered nurses? Use $\alpha=.05$.
b. Report the $p$-value for your test.
c. Estimate the size of the difference in the mean scores of the two groups of nurses using a $95 \%$ confidence interval.
d. The mean test scores are considered to have a meaningful difference only if they differ by more than 40 points. Is the observed difference in the mean scores a meaningful one?

Robin Corrigan

Numerade Educator

05:28

Problem 55

All persons running for public office must report the amounts of money spent during their campaigns. Political scientists have contended that female candidates generally find it difficult to raise money and therefore spend less in their campaigns than do male candidates. Suppose the accompanying data represent the campaign expenditures of a randomly selected group of 20 male and 20 female candidates for the state legislature. Do the data support the claim that female candidates generally spend less in their campaigns for public office than do male candidates?
a. Estimate the size of the difference in the mean campaign expenditures between female and male candidates using a $95 \%$ confidence interval.
b. Is there a significant difference at the .05 level in the mean campaign expenditures between female and male candidates?
C. Is there a practical difference in the mean campaign expenditures between female and male candidates?
d. Are the conditions necessary to analyze the data using the $t$ test to satisfied?

Sheryl Ezze

Numerade Educator

03:19

Problem 56

Refer to Exercise 6.55.
a. To what populations are the conclusions obtained in Exercise 6.55 relevant?
b. A more precise estimate of the mean expenditure for female candidates is requested. How many female candidates would need to be included in the new study to estimate the mean expenditure using a $95 \%$ confidence interval having a width of at most $$\$ 10$$ ?

Jameson Kuper

Numerade Educator

Problem 57

After strip-mining for coal, the state land office requires the mining company to restore the land to its condition prior to mining. One of many factors that is considered is the pH of the soil, which is an important factor in determining what types of plants will survive in a given location. The area to be mined was divided into grids before the mining took place. Fifteen grids were randomly selected, and the soil pH was measured before mining. When the mining was completed, the land was restored, and another set of pH readings was taken on the same 15 grids; see the accompanying table.
$$
\begin{array}{|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|c|}
\hline \text { Location } & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & 10 & 11 & 12 & 13 & 14 & 15 \\
\hline \text { efore } & 10.02 & 0.16 & 9.96 & 10.01 & 9.87 & 10.05 & 10.07 & 10.08 & 10.05 & 10.04 & 10.09 & 10.09 & 9.92 & 10.05 & 10.13 \\
\hline \text { After } & 10.21 & 10.16 & 10.11 & 10.10 & 10.07 & 10.13 & 10.08 & 10.30 & 10.17 & 10.10 & 10.06 & 10.37 & 10.24 & 10.19 & 10.13 \\
\hline
\end{array}
$$
a. What is the level of significance of the test for a change in mean pH after reclamation of the land?
b. What is the research hypothesis that the land office was testing?
c. Estimate the change in mean soil pH after strip-mining using a $99 \%$ confidence interval.
d. The land office assessed a fine on the mining company because the $t$ test indicated a significant difference in mean pH after the reclamation of the land. Is the assessment of the fine supported by the data? Justify your answer using the results from parts (a) and (c).

Check back soon!

01:38

Problem 58

Refer to Exercise 6.57. Based on the land office's decision in the test of hypotheses, could it have made (select one of the following)
a. A Type I error?
b. A Type II error?
c. Both a Type I and a Type II error?
d. Neither a Type I nor a Type II error?

Hailey Tomashek

Numerade Educator

06:15

Problem 59

Company officials are concerned about the length of time a particular drug retains its potency. A random sample (sample 1) of 10 bottles of the product is drawn from current production and analyzed for potency. A second sample (sample 2) is obtained, stored for 1 year, and then analyzed. The readings obtained are as follows:
$$
\begin{array}{lrrrrrrrrrr}
\hline \text { Sample 1 } & 10.2 & 10.5 & 10.3 & 10.8 & 9.8 & 10.6 & 10.7 & 10.2 & 10.0 & 10.6 \\
\text { Sample 2 } & 9.8 & 9.6 & 10.1 & 10.2 & 10.1 & 9.7 & 9.5 & 9.6 & 9.8 & 9.9 \\
\hline
\end{array}
$$
a. What is the research hypothesis?
b. Compute the values of the $t$ and $t^{\prime}$ statistics? Why are they equal for this data set?
c. What are the $p$-values for the $t$ and $t^{\prime}$ statistics? Why are they different?
d. Are the conclusions concerning the research hypothesis the same for the two tests if we use $\alpha=.05$ ?
e. Which test, $t$ or $t^{\prime}$, is more appropriate for this data set?

Robin Corrigan

Numerade Educator

03:00

Problem 60

An industrial concern has experimented with several different mixtures of the four components-magnesium, sodium nitrate, strontium nitrate, and a binder-that comprise a rocket propellant. The company has found that two mixtures in particular give higher flareillumination values than the others. Mixture 1 consists of a blend composed of the proportions $.40, .10, .42$, and .08 , respectively, for the four components of the mixture; mixture 2 consists of a blend using the proportions $.60, .25, .10$, and .05 . Twenty different blends ( 10 of each mixture) are prepared and tested to obtain the flare-illumination values. These data appear here (in units of 1,000 candles).
$$
\begin{array}{lllllllllll}
\text { Mixture 1 } & 185 & 192 & 201 & 215 & 170 & 190 & 175 & 172 & 198 & 202 \\
\text { Mixture 2 } & 221 & 210 & 215 & 202 & 204 & 196 & 225 & 230 & 214 & 217 \\
\hline
\end{array}
$$
a. Plot the sample data. Which test(s) could be used to compare the mean illumination values for the two mixtures?
b. Give the level of significance of the test and interpret your findings.

Khoobchandra Agrawal

Numerade Educator

Problem 61

Refer to Exercise 6.60. Instead of conducting a statistical test, use the sample data to answer the question, What is the difference in mean flare illuminations for the two mixtures?

Check back soon!

02:00

Problem 62

Refer to Exercise 6.60. Suppose we wish to test the research hypothesis that $\mu_1<\mu_2$ for the two mixtures. Assume that the population distributions are normally distributed with a common $\sigma=12$. Determine the sample size required to obtain a test having $\alpha=.05$ and $\beta\left(\mu_d\right)<.10$ when $\mu_2-\mu_1 \geq 15$.

Victor Salazar

Numerade Educator

Problem 63

Refer to the epilepsy study data in Table 3.19. Use the data for the number of seizures after 8 weeks for the placebo patients and for the patients treated with the drug progabide to answer the following questions.
a. Do the data support the conjecture that progabide reduces the mean number of seizures for epileptics? Use both a $t$ test and the Wilcoxon test with $\alpha=.05$.
b. Which test appears to be more appropriate for this study? Why?
c. Estimate the size of the difference in the mean numbers of seizures between the two groups.

Check back soon!

View

Problem 64

Many people purchase sport utility vehicles (SUVs) because they think they are sturdier and hence safer than regular cars. However, preliminary data have indicated that the costs for repairs of SUVs are higher than for midsize cars when both vehicles are in an accident. A random sample of 8 new SUVs and 8 midsize cars is tested for front-impact resistance. The amounts of damage (in hundreds of dollars) to the vehicles when crashed at 20 mph head on into a stationary barrier are recorded in the following table.
$$
\begin{array}{lcccccccc}
\hline \text { Car } & \mathbf{1} & \mathbf{2} & \mathbf{3} & \mathbf{4} & \mathbf{5} & \mathbf{6} & \mathbf{7} & \mathbf{8} \\
\hline \text { SUV } & 14.23 & 12.47 & 14.00 & 13.17 & 27.48 & 12.42 & 32.59 & 12.98 \\
\text { Midsize } & 11.97 & 11.42 & 13.27 & 9.87 & 10.12 & 10.36 & 12.65 & 25.23 \\
\hline
\end{array}
$$
a. Plot the data to determine whether the conditions required for the $t$ procedures are valid.
b. Do the data support the conjecture that the mean damage is greater for SUVs than for midsize vehicles? Use $\alpha=.05$ with both the $t$ test and the Wilcoxon test.
c. Which test appears to be the more appropriate procedure for this data set?
d. Do you reach the same conclusions from both procedures? Why or why not?

Shu Naito

Numerade Educator

01:39

Problem 65

Refer to Exercise 6.64. The small number of vehicles in the study has led to criticism of the results. A new study is to be conducted with a larger sample size. Assume that both populations of damages are normally distributed with a common $$\sigma=\$ 700$$.
a. Determine the sample size that allows us to be $95 \%$ confident that the estimate of the difference in mean repair costs is within $$\$ 500$$ of the true difference.
b. For the research hypothesis $H_a: \mu \mathrm{suV}>\mu_{\mathrm{MID}}$, determine the sample size required to obtain a test having $\alpha=.05$ and $\beta\left(\mu_d\right)<.05$ when $$\mu_{\text {SUV }}-\mu_{\mathrm{MID}} \geq \$ 500$$.

Adriano Chikande

Numerade Educator

01:06

Problem 66

The following memorandum opinion on statistical significance was issued by the judge in a trial involving many scientific issues. The opinion has been stripped of some legal jargon and has been taken out of context. Still, it can give us an understanding of how others deal with the problem of ascertaining the meaning of statistical significance. Read this memorandum and comment on the issues raised regarding statistical significance.

Tyler Moulton

Numerade Educator

Problem 67

Defining the Problem (1). Lead is an environmental pollutant especially worthy of attention because of its damaging effects on the neurological and intellectual development of children. Morton et al. (1982) collected data on lead absorption by children whose parents worked at a factory in Oklahoma where lead was used in the manufacture of batteries. The concern was that children might be exposed to lead inadvertently brought home on the bodies or clothing of their parents. Levels of lead (in micrograms per deciliter) were measured in blood samples taken from 33 children who might have been exposed in this way. They constitute the exposed group.
Collecting the Data (2). The researchers formed a control group by making matched pairs. For each of the 33 children in the exposed group they selected a matching child of the same age, living in the same neighborhood, and with parents employed at a place where lead is not used.

The data set LEADKIDS contains three variables, each with 33 cases. All involve measurements of lead in micrograms per deciliter of blood.

Check back soon!

02:28

Problem 68

Analyzing Data, the Interpreting the Analyses, and Communicating the Results (4). A paired $t$ test for the difference data in Exercise 6.67 is shown here.
The $p$-value in the output reads .000 , which means that it is smaller than .0005 ( 1 chance in 2,000 ). Thus, it is extremely unlikely that we would see data as extreme as those actually collected unless workers at the battery factory were contaminating their children. We reject the null hypothesis and conclude that the difference between the lead levels of children in the exposed and control groups is large enough to be statistically significant.

The next question is whether the difference between the two groups is large enough to be of practical importance. This is a judgment for people who know about lead poisoning to make, not for statisticians. The best estimate of the true (population) mean difference is 15.97 , or about 16. On average, children of workers in the battery plant have about $16 \mu \mathrm{g} / \mathrm{dl}$ more lead than their peers whose parents do not work in a lead-related industry. Almost any toxicologist would deem this increase to be dangerous and unacceptable. (The mean of the control group is also about 16. On average, the effect of having a parent who works in the battery factory is to double the lead level. Doubling the lead level brings the average value for exposed children to about 32 , which is getting close to the level where medical treatment is required. Also remember that some toxicologists believe that any amount of lead is harmful to the neurological development of children.)
a. Should the $t$ test we did have been one-sided? In practice, we must make the decision to do a one-sided test before the data are collected. We might argue that having a parent working at the battery factory could not decrease a child's exposure to lead.
1) Write the null hypothesis and its one-sided alternative in both words and symbols. Perform the test. How is its p-value related to the p-value for the two-sided test?
2) It might be tempting to argue that children of workers at a lead-using factory could not have generally lower levels of lead than children in the rest of the population. But can you imagine a scenario in which the mean levels would really be lower for exposed children?
b. We used a $t$ test to confirm our impression that exposed children have more lead in their blood than their control counterparts. Although there is no clear reason to prefer nonparametric tests for these data, verify that they yield the same conclusion as the $t$ test does.

Nick Johnson

Numerade Educator

01:21

Problem 69

The article "Increased Risk for Vitamin A Toxicity in Severe Hypertriglyceridemia" [Annals of Internal Medicine (1987) 105:877-879 (CC) American College of Physicians)] illustrates the importance of checking whether the appropriate conditions have been met prior to applying a statistical procedure. The data consist of the retinyl ester concentrations $(\mathrm{mg} / \mathrm{dl})$ of nine normal individuals and nine Type V hyperlipoproteinemic subjects.
$$
\begin{array}{lrrrrrrrrr}
\hline \text { Type V Subjects } & 1.4 & 2.5 & 4.6 & 0.0 & 0.0 & 2.9 & 1.9 & 4.0 & 2.0 \\
\text { Normal Subjects } & 30.9 & 134.6 & 13.6 & 28.9 & 434.1 & 101.7 & 85.1 & 26.5 & 44.8 \\
\hline
\end{array}
$$
a. Assess whether the data sets support the condition that both population distributions have normal distributions with equal variances.
b. Test for a difference in the mean retinyl ester concentrations of the two groups using the pooled $t$ test, separate-variance $t$ test, and Wilcoxon rank sum test. Use $\alpha=.01$.
C. Based on your conclusions in part (a), which test statistic would you recommend to test for a difference in the mean retinyl ester concentrations of the two groups?

Sheryl Ezze

Numerade Educator