• Home
  • Textbooks
  • Understandable Statistics, Concepts and Methods
  • The Binomial Probability Distribution and Related Topics

Understandable Statistics, Concepts and Methods

Charles Henry Brase, Corrinne Pellillo Brase

Chapter 5

The Binomial Probability Distribution and Related Topics - all with Video Answers

Educators


Section 1

Introduction to Random Variables and Probability Distributions

02:01

Problem 1

Which of the following are continuous variables, and which are discrete?
(a) Number of traffic fatalities per year in the state of Florida
(b) Distance a golf ball travels after being hit with a driver
(c) Time required to drive from home to college on any given day
(d) Number of ships in Pearl Harbor on any given day
(e) Your weight before breakfast each morning

Hubert Agamasu
Hubert Agamasu
Numerade Educator
00:39

Problem 2

Which of the following are continuous variables, and which are discrete?
(a) Speed of an airplane
(b) Age of a college professor chosen at random
(c) Number of books in the college bookstore
(d) Weight of a football player chosen at random
(e) Number of lightning strikes in Rocky Mountain National Park on a given day

Hossam Mohamed
Hossam Mohamed
Numerade Educator
00:39

Problem 3

Consider each distribution. Determine if it is a valid probability distribution or not, and explain your answer.
(a)$$\begin{array}{c|ccc} \hline x & 0 & 1 & 2 \\ \hline P(x) & 0.25 & 0.60 & 0.15 \\ \hline \end{array}$$
(b)$$\begin{array}{c|ccc} \hline x & 0 & 1 & 2 \\ \hline P(x) & 0.25 & 0.60 & 0.20 \\ \hline \end{array}$$

Hossam Mohamed
Hossam Mohamed
Numerade Educator
01:06

Problem 4

At State College all classes start on the hour, with the earliest start time at 7 A.M. and the latest at 8 p.M. A random sample of freshmen showed the percentages preferring the listed start times. $$\begin{array}{l|ccccc} \text { Start Time } & 7 \text { or } 8 \text { A.M. } & 9,10, \text { or } 11 \text { A.M. } & 12 \text { or } 1 \text { P.M. } & \text { 1 } \text { P.M. or later } & \text { After } 5 \text { P.M. } \\ \hline \% \text { preferring } & 10 \% & 35 \% & 28 \% & 25 \% & 15 \% \end{array}$$ Can this information be used to make a discrete probability distribution? Explain.

Nick Johnson
Nick Johnson
Numerade Educator
01:56

Problem 5

Consider two discrete probability distribution with the same sample space and the same expected value. Are the standard deviations of the two distributions necessarily equal? Explain.

Hossam Mohamed
Hossam Mohamed
Numerade Educator
00:49

Problem 6

Consider the probability distribution of a random variable $x$. Is the expected value of the distribution necessarily one of the possible values of $x ?$ Explain or give an example.

Hossam Mohamed
Hossam Mohamed
Numerade Educator
00:44

Problem 7

Consider the probability distribution shown in Problem 3 (a). Compute the expected value and the standard deviation of the distribution.

Hossam Mohamed
Hossam Mohamed
Numerade Educator
00:54

Problem 8

For a fundraiser, 1000 raffle tickets are sold and the winner is chosen at random. There is only one prize, 500 dollar in cash. You buy one ticket.
(a) What is the probability you will win the prize of 500 dollar?
(b) Your expected earnings can be found by multiplying the value of the prize by the probability you will win the prize. What are your expected earnings?
(c) Interpretation If a ticket costs 2 dollar, what is the difference between your "costs" and "expected earnings"? How much are you effectively contributing to the fundraiser?

Hossam Mohamed
Hossam Mohamed
Numerade Educator
02:41

Problem 9

We can use the random-number table to simulate outcomes from a given discrete probability distribution. Jose plays basketball and has probability 0.7 of making a free-throw shot. Let $x$ be the random variable that counts the number of successful shots out of 10 attempts. Consider the digits 0 through 9 of the random-number table. since Jose has a $70 \%$ chance of making a shot, assign the digits 0 through 6 to "making a basket from the free throw line" and the digits 7 through 9 to "missing the shot."
(a) Do $70 \%$ of the possible digits 0 through 9 represent "making a basket"?
(b) Start at line $2,$ column 1 of the random-number table. Going across the row, determine the results of 10 "trials." How many free-throw shots are successful in this simulation?
(c) Your friend decides to assign the digits 0 through 2 to "missing the shot" and the digits 3 through 9 to "making the basket." Is this assignment valid? Explain. Using this assignment, repeat part (b).

Hossam Mohamed
Hossam Mohamed
Numerade Educator
02:23

Problem 10

What is the age distribution of promotion-sensitive shoppers? A supermarket super shopper is defined as a shopper for whom at least $70 \%$ of the items purchased were on sale or purchased with a coupon. The following table is based on information taken from Trends in the United States (Food Marketing Institute, Washington, D.C.). $$\begin{array}{|l|ccccc|} \hline \text { Age range, years } & 18-28 & 29-39 & 40-50 & 51-61 & 62 \text { and over } \\ \hline \text { Midpoint } x & 23 & 34 & 45 & 56 & 67 \\ \hline \begin{array}{l} \text { Percent of } \\ \text { super shoppers } \end{array} & 7 \% & 44 \% & 24 \% & 14 \% & 11 \% \\ \hline \end{array}$$ For the 62 -and-over group, use the midpoint 67 years.
(a) Using the age midpoints $x$ and the percentage of super shoppers, do we have a valid probability distribution? Explain.
(b) Use a histogram to graph the probability distribution of part (a).
(c) Compute the expected age $\mu$ of a super shopper.
(d) Compute the standard deviation $\sigma$ for ages of super shoppers.

Hossam Mohamed
Hossam Mohamed
Numerade Educator
01:25

Problem 11

What is the income distribution of super shoppers (see Problem 10). In the following table, income units are in thousands of dollars, and each interval goes up to but does not include the given high value. The midpoints are given to the nearest thousand dollars. $$\begin{array}{|l|cccccc|} \hline \text { Income range } & 5-15 & 15-25 & 25-35 & 35-45 & 45-55 & 55 \text { or more } \\ \hline \text { Midpoint } x & 10 & 20 & 30 & 40 & 50 & 60 \\ \hline \begin{array}{l} \text { Percent of } \\ \text { super shoppers } \end{array} & 21 \% & 14 \% & 22 \% & 15 \% & 20 \% & 8 \% \\ \hline \end{array}$$
(a) Using the income midpoints $x$ and the percent of super shoppers, do we have a valid probability distribution? Explain.
(b) Use a histogram to graph the probability distribution of part (a).
(c) Compute the expected income $\mu$ of a super shopper.
(d) Compute the standard deviation $\sigma$ for the income of super shoppers.

Nick Johnson
Nick Johnson
Numerade Educator
16:18

Problem 12

What was the age distribution of nurses in Great Britain at the time of Florence Nightingale? Thanks to Florence Nightingale and the British census of $1851,$ we have the following information (based on data from the classic text Notes on Nursing, by Florence Nightingale). Note: In 1851 there were 25,466 nurses in Great Britain. Furthermore, Nightingale made a strict distinction between nurses and domestic servants. $$\begin{array}{|l|lllllll|}
\hline \text { Age range (yr) } & 20-29 & 30-39 & 40-49 & 50-59 & 60-69 & 70-79 & 80+ \\ \hline \text { Midpoint } x & 24.5 & 34.5 & 44.5 & 54.5 & 64.5 & 74.5 & 84.5 \\ \hline \begin{array}{l} \text { Percent of } \\ \text { nurses } \end{array} & 5.7 \% & 9.7 \% & 19.5 \% & 29.2 \% & 25.0 \% & 9.1 \% & 1.8 \% \\ \hline \end{array}$$
(a) Using the age midpoints $x$ and the percent of nurses, do we have a valid probability distribution? Explain.
(b) Use a histogram to graph the probability distribution of part (a).
(c) Find the probability that a British nurse selected at random in 1851 was 60 years of age or older.
(d) Compute the expected age $\mu$ of a British nurse contemporary to Florence Nightingale.
(e) Compute the standard deviation $\sigma$ for ages of nurses shown in the distribution.

Evelyn Cunningham
Evelyn Cunningham
Numerade Educator
12:31

Problem 13

The following data are based on information taken from Daily Creel Summary, published by the Paiute Indian Nation, Pyramid Lake, Nevada. Movie stars and U.S. presidents have fished Pyramid Lake. It is one of the best places in the lower 48 states to catch trophy cutthroat trout. In this table,
$x=$ number of fish caught in a 6 -hour period. The percentage data are the percentages of fishermen who catch $x$ fish in a 6 -hour period while fishing from shore. $$\begin{array}{|l|ccccc|} \hline x & 0 & 1 & 2 & 3 & 4 \text { or more } \\ \hline \% & 44 \% & 36 \% & 15 \% & 4 \% & 1 \% \\ \hline \end{array}$$
(a) Convert the percentages to probabilities and make a histogram of the probability distribution.
(b) Find the probability that a fisherman selected at random fishing from shore catches one or more fish in a 6 -hour period.
(c) Find the probability that a fisherman selected at random fishing from shore catches two or more fish in a 6 -hour period.
(d) Compute $\mu,$ the expected value of the number of fish caught per fisherman in a 6 -hour period (round 4 or more to 4 ).
(e) Compute $\sigma,$ the standard deviation of the number of fish caught per fisherman in a 6 -hour period (round 4 or more to 4 ).

Evelyn Cunningham
Evelyn Cunningham
Numerade Educator
03:44

Problem 14

USA Today reported that approximately $25 \%$ of all state prison inmates released on parole become repeat offenders while on parole. Suppose the parole board is examining five prisoners up for parole. Let $x=$ number of prisoners out of five on parole who become repeat offenders. The methods of Section 5.2 can be used to compute the probability assignments for the $x$ distribution.
$$\begin{array}{c|cccccc} \hline x & 0 & 1 & 2 & 3 & 4 & 5 \\ \hline P(x) & 0.237 & 0.396 & 0.264 & 0.088 & 0.015 & 0.001 \\ \hline \end{array}$$
(a) Find the probability that one or more of the five parolees will be repeat offenders. How does this number relate to the probability that none of the parolees will be repeat offenders?
(b) Find the probability that two or more of the five parolees will be repeat offenders.
(c) Find the probability that four or more of the five parolees will be repeat offenders.
(d) Compute $\mu,$ the expected number of repeat offenders out of five.
(e) Compute $\sigma,$ the standard deviation of the number of repeat offenders out of five.

Carson Merrill
Carson Merrill
Numerade Educator
01:22

Problem 15

The college hiking club is having a fundraiser to buy new equipment for fall and winter outings. The club is selling Chinese fortune cookies at a price of 1 dollar per cookie. Each cookie contains a piece of paper with a different number written on it. A random drawing will determine which number is the winner of a dinner for two at a local Chinese restaurant. The dinner is valued at 35 dollar. since the fortune cookies were donated to the club, we can ignore the cost of the cookies. The club sold 719 cookies before the drawing.
(a) Lisa bought 15 cookies. What is the probability she will win the dinner for two? What is the probability she will not win?
(b) Lisa's expected earnings can be found by multiplying the value of the dinner by the probability that she will win. What are Lisa's expected earnings? How much did she effectively contribute to the hiking club?

Hossam Mohamed
Hossam Mohamed
Numerade Educator
01:29

Problem 16

Spring Break: Caribbean Cruise The college student senate is sponsoring a spring break Caribbean cruise raffle. The proceeds are to be donated to the Samaritan Center for the Homeless. A local travel agency donated the cruise, valued at 2000 dollar. The students sold 2852 raffle tickets at 5 dollar per ticket.
(a) Kevin bought six tickets. What is the probability that Kevin will win the spring break cruise to the Caribbean? What is the probability that Kevin will not win the cruise?
(b) Expected earnings can be found by multiplying the value of the cruise by the probability that Kevin will win. What are Kevin's expected earnings? Is this more or less than the amount Kevin paid for the six tickets? How much did Kevin effectively contribute to the Samaritan Center for the Homeless?

Hossam Mohamed
Hossam Mohamed
Numerade Educator
08:10

Problem 17

Jim is a 60 -year-old Anglo male in reasonably good health. He wants to take out a 50,000 dollar term (i.e., straight death benefit) life insurance policy until he is $65 .$ The policy will expire on his 65 th birthday. The probability of death in a given year is provided by the Vital Statistics Section of the Statistical Abstract of the United States (116th edition). $$\begin{array}{|l|ccccc|} \hline x=\text { age } & 60 & 61 & 62 & 63 & 64 \\ \hline P( \text { death at this age) } & 0.01191 & 0.01292 & 0.01396 & 0.01503 & 0.01613 \\ \hline \end{array}$$ Jim is applying to Big Rock Insurance Company for his term insurance policy.
(a) What is the probability that Jim will die in his 60 th year? Using this probability and the 50,000 dollar death benefit, what is the expected cost to Big Rock Insurance?
(b) Repeat part (a) for years $61,62,63,$ and $64 .$ What would be the total expected cost to Big Rock Insurance over the years 60 through $64 ?$
(c) If Big Rock Insurance wants to make a profit of 700 dollar above the expected total cost paid out for Jim's death, how much should it charge for the policy?
(d) If Big Rock Insurance Company charges 5000 dollar for the policy, how much profit does the company expect to make?

Evelyn Cunningham
Evelyn Cunningham
Numerade Educator
08:14

Problem 18

Sara is a 60 -year-old Anglo female in reasonably good health. She wants to take out a 50,000 dollar term (i.e., straight death benefit) life insurance policy until she is $65 .$ The policy will expire on her 65 th birthday. The probability of death in a given year is provided by the Vital Statistics Section of the Statistical Abstract of the United States (116th edition). $$\begin{array}{|l|lcccc|} \hline x=\text { age } & 60 & 61 & 62 & 63 & 64 \\ \hline P( \text { death at this age) } & 0.00756 & 0.00825 & 0.00896 & 0.00965 & 0.01035 \\ \hline \end{array}$$ Sara is applying to Big Rock Insurance Company for her term insurance policy.
(a) What is the probability that Sara will die in her 60 th year? Using this probability and the 50,000 dollar death benefit, what is the expected cost to Big Rock Insurance?
(b) Repeat part (a) for years $61,62,63,$ and $64 .$ What would be the total expected cost to Big Rock Insurance over the years 60 through $64 ?$
(c) If Big Rock Insurance wants to make a profit of 700 dollar above the expected total cost paid out for Sara's death, how much should it charge for the policy?
(d) If Big Rock Insurance Company charges 5000 dollar for the policy, how much profit does the company expect to make?

Evelyn Cunningham
Evelyn Cunningham
Numerade Educator
03:35

Problem 19

Golf Norb and Gary are entered in a local golf tournament. Both have played the local course many times. Their scores are random variables with the following means and standard deviations. $$\text { Norb, } x_{1}: \mu_{1}=115 ; \sigma_{1}=12 \quad \text { Gary, } x_{2}: \mu_{2}=100 ; \sigma_{2}=8$$
In the tournament, Norb and Gary are not playing together, and we will assume their scores vary independently of each other.
(a) The difference between their scores is $W=x_{1}-x_{2} .$ Compute the mean, variance, and standard deviation for the random variable $W.$
(b) The average of their scores is $W=0.5 x_{1}+0.5 x_{2}$. Compute the mean, variance, and standard deviation for the random variable $W.$
(c) The tournament rules have a special handicap system for each player. For Norb, the handicap formula is $L=0.8 x_{1}-2 .$ Compute the mean, variance, and standard deviation for the random variable $L.$
(d) For Gary, the handicap formula is $L=0.95 x_{2}-5 .$ Compute the mean, variance, and standard deviation for the random variable $L.$

Nick Johnson
Nick Johnson
Numerade Educator
02:04

Problem 20

Repair Service A computer repair shop has two work centers. The first center examines the computer to see what is wrong, and the second center repairs the computer. Let $x_{1}$ and $x_{2}$ be random variables representing the lengths of time in minutes to examine a computer $\left(x_{1}\right)$ and to repair a computer $\left(x_{2}\right) .$ Assume $x_{1}$ and $x_{2}$ are independent random variables. Long-term history has shown the following times: Examine computer, $x_{1}: \mu_{1}=28.1$ minutes; $\sigma_{1}=8.2$ minutes Repair computer, $x_{2}: \mu_{2}=90.5$ minutes; $\sigma_{2}=15.2$ minutes
(a) Let $W=x_{1}+x_{2}$ be a random variable representing the total time to examine and repair the computer. Compute the mean, variance, and standard deviation of $W$.
(b) Suppose it costs 1.50 dollar per minute to examine the computer and 2.75 dollar per minute to repair the computer. Then $W=1.50 x_{1}+2.75 x_{2}$ is a random variable representing the service charges (without parts). Compute the mean, variance, and standard deviation of $W .$
(c) The shop charges a flat rate of 1.50 dollar per minute to examine the computer, and if no repairs are ordered, there is also an additional 50 dollar service charge. Let $L=1.5 x_{1}+50 .$ Compute the mean, variance, and standard deviation of $L.$

Hossam Mohamed
Hossam Mohamed
Numerade Educator
01:48

Problem 21

Insurance Risk Insurance companies know the risk of insurance is greatly reduced if the company insures not just one person, but many people. How does this work? Let $x$ be a random variable representing the expectation of life in years for a 25 -year-old male (i.e., number of years until death). Then the mean and standard deviation of $x$ are $\mu=50.2$ years and $\sigma=11.5$ years (Vital Statistics Section of the Statistical Abstract of the United States, 116th edition). Suppose Big Rock Insurance Company has sold life insurance policies to Joel and David. Both are 25 years old, unrelated, live in different states, and have about the same health record. Let $x_{1}$ and $x_{2}$ be random variables representing Joel's and David's life expectancies. It is reasonable to assume $x_{1}$ and $x_{2}$ are independent. $$\begin{aligned} &\text { Joel, } x_{1}: \mu_{1}=50.2 ; \sigma_{1}=11.5\\ &\text { David, } x_{2}: \mu_{2}=50.2 ; \sigma_{2}=11.5 \end{aligned}$$ If life expectancy can be predicted with more accuracy, Big Rock will have less risk in its insurance business. Risk in this case is measured by $\sigma$ (larger $\sigma$ means more risk).
(a) The average life expectancy for Joel and David is $W=0.5 x_{1}+0.5 x_{2}$ Compute the mean, variance, and standard deviation of $W$.
(b) Compare the mean life expectancy for a single policy $\left(x_{1}\right)$ with that for two policies $(W).$
(c) Compare the standard deviation of the life expectancy for a single policy $\left(x_{1}\right)$ with that for two policies $(W).$
(d) The mean life expectancy is the same for a single policy $\left(x_{1}\right)$ as it is for two policies $(W),$ but the standard deviation is smaller for two policies. What happens to the mean life expectancy and the standard deviation when we include more policies issued to people whose life expectancies have the same mean and standard deviation (i.e., 25 -year-old males)? For instance, for three policies, $W=(\mu+\mu+\mu) / 3=\mu$ and $\sigma_{W}^{2}=(1 / 3)^{2} \sigma^{2}+(1 / 3)^{2} \sigma^{2}+(1 / 3)^{2} \sigma^{2}=$ $(1 / 3)^{2}\left(3 \sigma^{2}\right)=(1 / 3) \sigma^{2}$ and $\sigma_{W}=\frac{1}{\sqrt{3}} \sigma .$ Likewise, for $n$ such policies, $W=\mu$ and $\sigma_{W}^{2}=(1 / n) \sigma^{2}$ and $\sigma_{W}=\frac{1}{\sqrt{n}} \sigma .$ Looking at the general result, is it appropriate to say that when we increase the number of policies to $n$, the risk decreases by a factor of $\sigma_{W}=\frac{1}{\sqrt{n}} ?$

Hossam Mohamed
Hossam Mohamed
Numerade Educator
02:46

Problem 22

This problem shows you how to make a better blend of almost anything. Let $x_{1}, x_{2}, \ldots x_{n}$ be independent random variables with respective variances $\sigma_{1}^{2}$ $\sigma_{2}^{2}, \ldots, \sigma_{n}^{2}.$ Let $c_{1}, c_{2}, \ldots, c_{n}$ be constant weights such that $0 \leq c_{i} \leq 1$ and $c_{1}+c_{2}+\ldots+$ $c_{n}=1 .$ The linear combination $w=c_{1} x_{1}+c_{2} x_{2}+\ldots+c_{n} x_{n}$ is a random variable with variance $$\sigma_{w}^{2}=c_{1}^{2} \sigma_{1}^{2}+c_{2}^{2} \sigma_{2}^{2}+\cdots+c_{n}^{2} \sigma_{n}^{2}$$
(a) In your own words write a brief explanation regarding the following statement: The variance of $w$ is a measure of the consistency or variability of performance or outcomes of the random variable $w$. To get a more consistent performance out of the blend $w$, choose weights $c_{i}$ that make $\sigma_{w}^{2}$ as small as possible. Now the question is how do we choose weights $c_{i}$ to make $\sigma_{w}^{2}$ as small as possible? Glad you asked! A lot of mathematics can be used to show the following choice of weights will minimize $\sigma_{w}^{2}.$ $$c_{i}=\frac{\frac{1}{\sigma_{i}^{2}}}{\left[\frac{1}{\sigma_{i}^{2}}+\frac{1}{\sigma_{2}^{\frac{1}{2}}}+\cdots+\frac{1}{\sigma_{n}^{2}}\right]} \quad \text { for } i=1,2, \cdots, n$$ (Reference: Introduction to Mathematical Statistics, 4th edition, by Paul Hoel.)
(b)Two types of epoxy resin are used to make a new blend of superglue. Both resins have about the same mean breaking strength and act independently. The question is how to blend the resins (with the hardener) to get the most consistent breaking strength. Why is this important, and why would this require minimal $\sigma_{w}^{2} ?$ Hint: We don't want some bonds to be really strong while others are very weak, resulting in inconsistent bonding. Let $x_{1}$ and $x_{2}$ be random variables representing breaking strength (lb) of each resin under uniform testing conditions. If $\sigma_{1}=8$ lb $\sigma_{2}=12$ lb, show why a blend of about $69 \%$ resin 1 and $31 \%$ resin 2 will result in a superglue with smallest $\sigma_{w}^{2}$ and most consistent bond strength.
(c) Use $c_{1}=0.69$ and $c_{2}=0.31$ to compute $\sigma_{w}$ and show that $\sigma_{w}$ is less than both $\sigma_{1}$ and $\sigma_{2}$ The dictionary meaning of the word synergetic is "working together or cooperating for a better overall effect." Write a brief explanation of how the blend $w=c_{1} x_{1}+c_{2} x_{2}$ has a synergetic effect for the purpose of reducing variance.

Hossam Mohamed
Hossam Mohamed
Numerade Educator
02:09

Problem 23

Three grades of jet fuel are blended to be used on long commercial flights. Each grade cost about the same and has about the same ignition or burn rate (gal/min) in a jet engine. The question is how to make the blend for the most consistent ignition rate. Assume the ignition rates of the three grades of fuel are statistically independent.
(a) Write a brief explanation in which you discuss why consistent ignition rate is important. Hint: You sure do not want to run out of fuel up in the air! Also you don't want to land with a lot of extra fuel instead of payload cargo.
(b) Let $x_{1}, x_{2}, x_{3}$ be random variables representing ignition rate (gal/min) for the three grades of fuel. After extensive testing (flying under different conditions) the standard deviations are found to be $\sigma_{1}=7, \sigma_{2}=12$ and $\sigma_{3}=15 .$ Explain why a blend of approximately $64 \%$ grade 1 fuel with $22 \%$ grade 2 fuel and $14 \%$ grade 3 fuel would give the most consistent ignition rate. Hint: See Problem 22.
(c) Use $c_{1}=0.64, c_{2}=0.22,$ and $c_{3}=0.14$ to compute $\sigma_{w}$ and show that $\sigma_{w}$ is less than each of $\sigma_{1,} \sigma_{2}$ and $\sigma_{3} .$ Would you say the blended jet fuel has a more consistent ignition rate than each separate fuel? Explain.

Hossam Mohamed
Hossam Mohamed
Numerade Educator