• Home
  • Textbooks
  • STATS Modeling the World
  • Displaving and Describing Categorical Data

STATS Modeling the World

David E. Bock, Paul F. Velleman, Richard D. De Veaux

Chapter 3

Displaving and Describing Categorical Data - all with Video Answers

Educators


Chapter Questions

03:11

Problem 1

Find a bar graph of categorical data from a newspaper, a magazine, or the Internet.
a) Is the graph clearly labeled?
b) Does it violate the area principle?
c) Does the accompanying article tell the W's of the variable?
d) Do you think the article correctly interprets the data? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
03:09

Problem 2

Find a pie chart of categorical data from a newspaper, a magazine, or the Internet.
a) Is the graph clearly labeled?
b) Does it violate the area principle?
c) Does the accompanying article tell the W's of the variable?
d) Do you think the article correctly interprets the data? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
02:20

Problem 3

Find a frequency table of categorical data from a newspaper, a magazine, or the Internet.
a) Is it clearly labeled?
b) Does it display percentages or counts?
c) Does the accompanying article tell the W's of the variable?
d) Do you think the article correctly interprets the data? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
03:21

Problem 4

Find a contingency table of categorical data from a newspaper, a magazine, or the Internet.
a) Is it clearly labeled?
b) Does it display percentages or counts?
c) Does the accompanying article tell the W's of the variables?
d) Do you think the article correctly interprets the data? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:03

Problem 5

The pie chart summarizes the genres of 120 first-run movies released in 2005 .
a) Is this an appropriate display for the genres? Why/why not?
b) Which genre was least common?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:31

Problem 6

The pie chart shows the ratings assigned to 120 first-run movies released in 2005 .
a) Is this an appropriate display for these data? Explain.
b) Which was the most common rating?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:30

Problem 7

Here is a bar chart summarizing the 2005 movie genres, as seen in the pie chart in Exercise $5 .$
a) Which genre was most common?
b) Is it easier to see that in the pie chart or the bar chart? Explain.

Matthew Wagner
Matthew Wagner
Numerade Educator
01:14

Problem 8

Here is a bar chart summarizing the 2005 movie ratings, as seen in the pie chart in Exercise 6 .
a) Which was the least common rating?
b) An editorial claimed that there's been a growth in $\mathrm{PG}$ 13 rated films that, according to the writer, "have too much sex and violence," at the expense of G-rated films that offer "good, clean fun." The writer offered the bar chart below as evidence to support his claim. Does the bar chart support his claim? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:45

Problem 9

An article in the Winter 2003 issue of Chance magazine reported on the Houston Independent School District's magnet schools programs. Of the 1755 qualified applicants, 931 were accepted, 298 were waitlisted, and 526 were turned away for lack of space. Find the relative frequency distribution of the decisions made, and write a sentence describing it.

Kari Hasz
Kari Hasz
Numerade Educator
02:06

Problem 10

The Chance article about the Houston magnet schools program described in Exercise 9 also indicated that 517 applicants were black or Hispanic, 292 Asian, and 946 white. Summarize the relative frequency distribution of ethnicity with a sentence or two (in the proper context, of course).

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
06:28

Problem 11

The Centers for Disease Control and Prevention (www.cdc.gov) lists causes of death in the United States during 2004:
$$
\begin{array}{l|c}
\text { Cause of Death } & \text { Percent } \\
\hline \text { Heart disease } & 27.2 \\
\text { Cancer } & 23.1 \\
\text { Circulatory diseases and stroke } & 6.3 \\
\text { Respiratory diseases } & 5.1 \\
\text { Accidents } & 4.7
\end{array}
$$
a) Is it reasonable to conclude that heart or respiratory diseases were the cause of approximately $33 \%$ of U.S. deaths in $2003 ?$
b) What percent of deaths were from causes not listed here?
c) Create an appropriate display for these data.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:31

Problem 12

An investigation compiled information about recent nonmilitary plane crashes (www. planecrashinfo.com). The causes, to the extent that they could be determined, are summarized in the table.
$$
\begin{array}{l|c}
\text { Cause } & \text { Percent } \\
\hline \text { Pilot error } & 40 \\
\text { Other human error } & 5 \\
\text { Weather } & 6 \\
\text { Mechanical failure } & 14 \\
\text { Sabotage } & 6
\end{array}
$$
a) Is it reasonable to conclude that the weather or mechanical failures caused only about $20 \%$ of recent plane crashes?
b) In what percent of crashes were the causes not determined?
c) Create an appropriate display for these data.

Lynn Larson
Lynn Larson
Numerade Educator
02:53

Problem 13

Data from the International Tanker Owners Pollution Federation Limited (www.itopf.com) give the cause of spillage for 312 large oil tanker accidents from 1974-2006. Here are displays.
a) Write a brief report interpreting what the displays show.
b) Is a pie chart an appropriate display for these data? Why or why not?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
02:06

Problem 14

Twenty-six countries won medals in the 2006 Winter Olympics. The table lists them, along with the total number of medals each won:
$$
\begin{array}{l|c|l|c}
\text { Country } & \text { Medals } & \text { Country } & \text { Medals } \\
\hline \text { Germany } & 29 & \text { Finland } & 9 \\
\text { United States } & 25 & \text { Czech Republic } & 4 \\
\text { Canada } & 24 & \text { Estonia } & 3 \\
\text { Austria } & 23 & \text { Croatia } & 3 \\
\text { Russia } & 22 & \text { Australia } & 2 \\
\text { Norway } & 19 & \text { Poland } & 2 \\
\text { Sweden } & 14 & \text { Ukraine } & 2 \\
\text { Switzerland } & 14 & \text { Japan } & 1 \\
\text { South Korea } & 11 & \text { Belarus } & 1 \\
\text { Italy } & 11 & \text { Bulgaria } & 1 \\
\text { China } & 11 & \text { Great Britain } & 1 \\
\text { France } & 9 & \text { Slovakia } & 1 \\
\text { Netherlands } & 9 & \text { Latvia } & 1
\end{array}
$$
a) Try to make a display of these data. What problems do you encounter?
b) Can you find a way to organize the data so that the graph is more successful?

Kari Hasz
Kari Hasz
Numerade Educator
01:48

Problem 15

The Pew Research Center for the People and the Press (http:// people-press.org) has asked a representative sample of U.S. adults about global warming, repeating the question over time. In January 2007 , the responses reflected an increased belief that global warming is real and due to human activity. Here's a display of the percentages of respondents choosing each of the major alternatives offered:
List the errors in this display.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:37

Problem 16

A survey of athletic trainers (Scott $\mathrm{F}$. Nadler, Michael Prybicien, Gerard A. Malanga, and Dan Sicher. "Complications from Therapeutic Modalities: Results of a National Survey of Athletic Trainers." Archives of Physical Medical Rehabilitation 84 [June 2003]) asked what modalities (treatment methods such as ice, whirlpool, ultrasound, or exercise) they commonly use to treat injuries. Respondents were each asked to list three modalities. The article included the following figure reporting the modalities used:
a) What problems do you see with the graph?
b) Consider the percentages for the named modalities. Do you see anything odd about them?

Kari Hasz
Kari Hasz
Numerade Educator
02:21

Problem 17

The organization Monitoring the Future (www.monitoringthefuture.org) asked 2048 eighth graders who said they smoked cigarettes what brands they preferred. The table below shows brand preferences for two regions of the country. Write a few sentences describing the similarities and differences in brand preferences among eighth graders in the two regions listed.
$$
\begin{array}{l|c|c}
\text { Brand preference } & \text { South } & \text { West } \\
\hline \text { Marlboro } & 58.4 \% & 58.0 \% \\
\text { Newport } & 22.5 \% & 10.1 \% \\
\text { Camel } & 3.3 \% & 9.5 \% \\
\text { Other (over 20 brands) } & 9.1 \% & 9.5 \% \\
\text { No usual brand } & 6.7 \% & 12.9 \%
\end{array}
$$

Kari Hasz
Kari Hasz
Numerade Educator
View

Problem 18

In an effort to reduce the number of gunrelated homicides, some cities have run buyback programs in which the police offer cash (often $$\$ 50$$ ) to anyone who turns in an operating handgun. Chance magazine looked at results from a four-year period in Milwaukee. The table on the next page shows what types of guns were turned in and what types were used in homicides during a four-year period. Write a few sentences comparing the two distributions.
$$
\begin{array}{l|c|c}
\text { Caliber of gun } & \text { Buyback } & \text { Homicide } \\
\hline \text { Small }(.22, .25, .32) & 76.4 \% & 20.3 \% \\
\text { Medium }(.357, .38,9 \mathrm{~mm}) & 19.3 \% & 54.7 \% \\
\text { Large }(.40, .44, .45) & 2.1 \% & 10.8 \% \\
\text { Other } & 2.2 \% & 14.2 \%
\end{array}
$$

Victor Salazar
Victor Salazar
Numerade Educator
02:54

Problem 19

Here's a table that classifies movies released in 2005 by genre and MPAA rating:
a) The table gives column percents. How could you tell that from the table itself?
b) What percentage of these movies were comedies?
c) What percentage of the PG-rated movies were comedies?
d) Which of the following can you learn from this table? Give the answer if you can find it from the table.
i) The percentage of $\mathrm{PG}-13$ movies that were comedies
ii) The percentage of dramas that were R-rated
iii) The percentage of dramas that were G-rated
iv) The percentage of 2005 movies that were $\mathrm{PG}$ -rated comedies

Jerelyn Nevil
Jerelyn Nevil
Numerade Educator
02:43

Problem 20

Here's another table showing information about 120 movies released in 2005 . This table gives percentages of the table total:
a) How can you tell that this table holds table percentages (rather than row or column percentages)?
b) What was the most common genre/rating combination in 2005 movies?
c) How many of these movies were PG-rated comedies?
d) How many were G-rated?
e) An editorial about the movies noted, "More than three-quarters of the movies made today can be seen only by patrons 13 years old or older." Does this table support that assertion? Explain.

Kari Hasz
Kari Hasz
Numerade Educator
02:27

Problem 21

Prior to graduation, a high school class was surveyed about its plans. The following table displays the results for white and minority students (the "Minority" group included African-American, Asian, Hispanic, and Native American students):
a) What percent of the seniors are white?
b) What percent of the seniors are planning to attend a 2-year college?
c) What percent of the seniors are white and planning to attend a 2-year college?
d) What percent of the white seniors are planning to attend a 2-year college?
e) What percent of the seniors planning to attend a 2-year college are white?

Kari Hasz
Kari Hasz
Numerade Educator
02:59

Problem 22

SStudents in an Intro Stats course were asked to describe their politics as "Liberal," "Moderate," or "Conservative." Here are the results:
a) What percent of the class is male?
b) What percent of the class considers themselves to be "Conservative"?
c) What percent of the males in the class consider themselves to be "Conservative"?
d) What percent of all students in the class are males who consider themselves to be "Conservative"?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
08:05

Problem 23

Look again at the table of postgraduation plans for the senior class in Exercise $21 .$
a) Find the conditional distributions (percentages) of plans for the white students.
b) Find the conditional distributions (percentages) of plans for the minority students.
c) Create a graph comparing the plans of white and minority students.
d) Do you see any important differences in the postgraduation plans of white and minority students? Write a brief summary of what these data show, including comparisons of conditional distributions.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
06:34

Problem 24

Look again at the table of political views for the Intro Stats students in Exercise $22 .$
a) Find the conditional distributions (percentages) of political views for the females.
b) Find the conditional distributions (percentages) of political views for the males.
c) Make a graphical display that compares the two distributions.
d) Do the variables Politics and Sex appear to be independent? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
03:12

Problem 25

The Chance magazine article described in Exercise 9 further examined the impact of an applicant's ethnicity on the likelihood of admission to the Houston Independent School District's magnet schools programs. Those data are summarized in the table below:
a) What percent of all applicants were Asian?
b) What percent of the students accepted were Asian?
c) What percent of Asians were accepted?
d) What percent of all students were accepted?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
02:38

Problem 26

Look once more at the table summarizing the political views of Intro Stats students in Exercise 22 .
a) Produce a graphical display comparing the conditional distributions of males and females among the three categories of politics.
b) Comment briefly on what you see from the display in a.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
02:35

Problem 27

Examine the table about ethnicity and acceptance for the Houston Independent School District's magnet schools program, shown in Exercise $25 .$ Does it appear that the admissions decisions are made independent of the applicant's ethnicity? Explain.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
04:15

Problem 28

A survey of autos parked in student and staff lots at a large university classified the brands by country of origin, as seen in the table.
a) What percent of all the cars surveyed were foreign?
b) What percent of the American cars were owned by students?
c) What percent of the students owned American cars?
d) What is the marginal distribution of origin?
e) What are the conditional distributions of origin by driver classification?
f) Do you think that the origin of the car is independent of the type of driver? Explain.

Carson Merrill
Carson Merrill
Numerade Educator
06:27

Problem 29

Just how accurate are the weather forecasts we hear every day? The following table compares the daily forecast with a city's actual weather for a year:
a) On what percent of days did it actually rain?
b) On what percent of days was rain predicted?
c) What percent of the time was the forecast correct?
d) Do you see evidence of an association between the type of weather and the ability of forecasters to make an accurate prediction? Write a brief explanation, including an appropriate graph.

Ronald Prasad
Ronald Prasad
Numerade Educator
06:00

Problem 30

In 2000 , the Journal of the American Medical Association (JAMA) published a study that examined pregnancies that resulted in the birth of twins. Births were classified as preterm with intervention (induced labor or cesarean), preterm without procedures, or term/post-term. Researchers also classified the pregnancies by the level of prenatal medical care the mother received (inadequate, adequate, or intensive). The data, from the years $1995-1997$, are summarized in the table below. Figures are in thousands of births. (JAMA $284[2000]: 335-341)$
a) What percent of these mothers received inadequate medical care during their pregnancies?
b) What percent of all twin births were preterm?
c) Among the mothers who received inadequate medical care, what percent of the twin births were preterm?
d) Create an appropriate graph comparing the outcomes of these pregnancies by the level of medical care the mother received.
e) Write a few sentences describing the association between these two variables.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
17:47

Problem 31

Acompany held a blood pressure screening clinic for its employees. The results are summarized in the table below by age group and blood pressure level:
a) Find the marginal distribution of blood pressure level.
b) Find the conditional distribution of blood pressure level within each age group.
c) Compare these distributions with a segmented bar graph.
d) Write a brief description of the association between age and blood pressure among these employees.
e) Does this prove that people's blood pressure increases as they age? Explain.

Khalida Dawar
Khalida Dawar
Numerade Educator
01:15

Problem 32

The Centers for Disease Control and Prevention (CDC) has estimated that $19.8 \%$ of Americans over 15 years old are obese. The CDC conducts a survey on obesity and various behaviors. Here is a table on self-reported exercise classified by body mass index (BMI):
a) Are these percentages column percentages, row percentages, or table percentages?
b) Use graphical displays to show different percentages of physical activities for the three BMI groups.
c) Do these data prove that lack of exercise causes obesity? Explain.

Dominador Tan
Dominador Tan
Numerade Educator
01:44

Problem 33

Hearing anecdotal reports that some patients undergoing treatment for the eating disorder anorexia seemed to be responding positively to the antidepressant Prozac, medical researchers conducted an experiment to investigate. They found 93 women being treated for anorexia who volunteered to participate. For one year, 49 randomly selected patients were treated with Prozac and the other 44 were given an inert substance called a placebo. At the end of the year, patients were diagnosed as healthy or relapsed, as summarized in the table:
Do these results provide evidence that Prozac might be helpful in treating anorexia? Explain.

Dominador Tan
Dominador Tan
Numerade Educator
04:10

Problem 34

For a period of five years, physicians at McGill University Health Center followed more than 5000 adults over the age of 50 . The researchers were investigating whether people taking a certain class of antidepressants (SSRIs) might be at greater risk of bone fractures. Their observations are summarized in the table:
Do these results suggest there's an association between taking SSRI antidepressants and experiencing bone fractures? Explain.

Dominador Tan
Dominador Tan
Numerade Educator
03:52

Problem 35

The following table shows the number of licensed U.S. drivers by age and by sex (www.dot.gov):
$$
\begin{array}{l|rr|r}
\text { Age } & \begin{array}{c}
\text { Male Drivers } \\
\text { (number) }
\end{array} & \begin{array}{c}
\text { Female Drivers } \\
\text { (number) }
\end{array} & \text { Total } \\
\hline 19 \text { and under } & 4,777,694 & 4,553,946 & \mathbf{9 , 3 3 1 , 6 4 0} \\
20-24 & 8,611,161 & 8,398,879 & \mathbf{1 7 , 0 1 0 , 0 4 0} \\
25-29 & 8,879,476 & 8,666,701 & \mathbf{1 7 , 5 4 6 , 1 7 7} \\
30-34 & 9,262,713 & 8,997,662 & \mathbf{1 8 , 2 6 0 , 3 7 5} \\
35-39 & 9,848,050 & 9,576,301 & \mathbf{1 9 , 4 2 4 , 3 5 1} \\
40-44 & 10,617,456 & 10,484,149 & \mathbf{2 1 , 1 0 1 , 6 0 5} \\
45-49 & 10,492,876 & 10,482,479 & \mathbf{2 0 , 9 7 5 , 3 5 5} \\
50-54 & 9,420,619 & 9,475,882 & \mathbf{1 8 , 8 9 6 , 5 0 1} \\
55-59 & 8,218,264 & 8,265,775 & \mathbf{1 6 , 4 8 4 , 0 3 9} \\
60-64 & 6,103,732 & 6,147,569 & \mathbf{1 2 , 2 5 1 , 3 6 1} \\
65-69 & 4,571,157 & 4,643,913 & \mathbf{9 , 2 1 5 , 0 7 0} \\
70-74 & 3,617,908 & 3,761,039 & \mathbf{7 , 3 7 8 , 9 4 7} \\
75-79 & 2,890,155 & 3,192,408 & \mathbf{6 , 0 8 2 , 5 6 3} \\
80-84 & 1,907,743 & 2,222,412 & \mathbf{4 , 1 3 0 , 1 5 5} \\
85 \text { and over } & 1,170,817 & 1,406,271 & \mathbf{2 , 5 7 7 , 0 8 8} \\
\hline \text { Total } & \mathbf{1 0 0 , 3 8 9 , 8 8 1} & \mathbf{1 0 0 , 2 7 5 , 3 8 6} & \mathbf{2 0 0 , 6 6 5 , 2 6 7}
\end{array}
$$
a) What percent of total drivers are under 20 ?
b) What percent of total drivers are male?
c) Write a few sentences comparing the number of male and female licensed drivers in each age group.
d) Do a driver's age and sex appear to be independent? Explain?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
01:53

Problem 36

A study by the University of Texas Southwestern Medical Center examined 626 people to see if an increased risk of contracting hepatitis $C$ was associated with having a tattoo. If the subject had a tattoo, researchers asked whether it had been done in a commercial tattoo parlor or elsewhere. Write a brief description of the association between tattooing and hepatitis C, including an appropriate graphical display.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
08:05

Problem 37

Most patients who undergo surgery make routine recoveries and are discharged as planned. Others suffer excessive bleeding, infection, or other postsurgical complications and have their discharges from the hospital delayed. Suppose your city has a large hospital and a small hospital, each performing major and minor surgeries. You collect data to see how many surgical patients have their discharges delayed by postsurgical complications, and you find the results shown in the following table.
a) Overall, for what percent of patients was discharge delayed?
b) Were the percentages different for major and minor surgery?
c) Overall, what were the discharge delay rates at each hospital?
d) What were the delay rates at each hospital for each kind of surgery?
e) The small hospital advertises that it has a lower rate of postsurgical complications. Do you agree?
f) Explain, in your own words, why this confusion occurs.

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
04:29

Problem 38

A company must decide which of two delivery services it will contract with. During a recent trial period, the company shipped numerous packages with each service and kept track of how often deliveries did not arrive on time. Here are the data:
a) Compare the two services' overall percentage of late deliveries.
b) On the basis of the results in part a, the company has decided to hire Pack Rats. Do you agree that Pack Rats delivers on time more often? Explain.
c) The results here are an instance of what phenomenon?

Dominador Tan
Dominador Tan
Numerade Educator
07:37

Problem 39

A 1975 article in the magazine Science examined the graduate admissions process at Berkeley for evidence of sex discrimination. The table below shows the number of applicants accepted to each of four graduate programs:
a) What percent of total applicants were admitted?
b) Overall, was a higher percentage of males or females admitted?
c) Compare the percentage of males and females admitted in each program.
d) Which of the comparisons you made do you consider to be the most valid? Why?

Jeremiah Mbaria
Jeremiah Mbaria
Numerade Educator
02:31

Problem 40

Can you design a Simpson's paradox? Two companies are vying for a city's "Best Local Employer" award, to be given to the company most committed to hiring local residents. Although both employers hired 300 new people in the past year, Company A brags that it deserves the award because $70 \%$ of its new jobs went to local residents, compared to only $60 \%$ for Company B. Company B concedes that those percentages are correct, but points out that most of its new jobs were full-time, while most of Company A's were part-time. Not only that, says Company $B$, but a higher percentage of its full-time jobs went to local residents than did Company A's, and the same was true for part-time jobs. Thus, Company B argues, it's a better local employer than Company $\mathrm{A}$.
Show how it's possible for Company B to fill a higher percentage of both full-time and part-time jobs with local residents, even though Company A hired more local residents overall.

Dominador Tan
Dominador Tan
Numerade Educator