Math 105, Topics in Mathematics |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Lesson 8: Estimation
Introduction
For this chapter you need an advanced calculator. We discuss the concepts and theory and then use a calculator to solve the problems. The name of this game of Statistics is to try to understand the POPULATION on the basis of the information available in the SAMPLE. Among what we mean by "understand" is to estimate the values of the population PARAMETERS. The game here is to use suitable sample STATISTICS to estimate population parameters. For example, we may like to use the sample mean x as an estimate for the population mean μ. We consider two types of estimation of parameters.
|
z-Values : P(Z > z α ) = α | |
α | z-Value |
---|---|
z.005 | 2.58 |
z.01 | 2.33 |
z.02 | 2.05 |
z.025 | 1.96 |
z.05 | 1.65 |
z.1 | 1.28 |
Z-values : the Flash program |
A (1- α )100 percent confidence interval for the mean μ:
Suppose X is a random variable with mean μ and variance σ2. We want to construct a confidence interval for μ.
We assume that σ is known. Let X1,X2, ... , Xn be a sample from X. Note that from CLT we have, approximately,
P(-z α/2 < Z < z α/2
) = 1 - α
where Z=(X-μ)/(σ/n1/2).
If we simplify, we get
P(X-E < μ < X+E)=1-
α
where E=(z α/2σ)/n1/2.
We have the following theorem.
Theorem. Assume that σ is known. Then a (1- α)100 percent confidence interval for μ is given by
The Z-Interval |
X-E
< μ < X+E
where E=(z α/2σ)/n1/2 |
l = X - E is
called the left-end-point. r = X + E is called the right-end-point. |
Remarks.
P(L < μ < infinity) = 1 - α.
Then (L, infinity) will be a (1- α) 100 percent one-sided (upper) confidence interval for μ.Definitions and Formulas
l = 2(z α/2σ)/n1/2.
E =(z α/2σ)/n1/2.
Use of Calculators, (if you have a TI-83): |
---|
|
|
Problems on 8.1: Point and Interval Estimation
Use your calculator to solve the following problems. You can have a look at the flash-animated solutions for conceptual reasons.
Exercise 8.1.1. Assume that you have a normal population with mean μ and standard deviation σ = 15. Suppose you have collected a sample of size 25, and the sample mean x was found to be 81.
Exercise 8.1.2. Assume that you have a normal population with mean μ and standard deviation σ = 9.8. Suppose you have collected a sample of size 14, and the sample mean x was found to be 151.1.
Exercise 8.1.3. The time taken by an athlete to run an event is normally distributed with mean μ, and known standard deviation σ = 3.5 seconds. To estimate the mean μ he ran 16 times, and sample mean was found to be x = 33 seconds.
Exercise 8.1.4. A population has normal distribution with standard deviation σ = 17. Suppose you collect a sample size 211 and your sample mean in x = 18.
Exercise 8.1.5. The tuition X paid by a student per semester in a university has a distribution with mean μ and σ = $416. Suppose you collect a sample of size 313 students and your sample mean is x = $1240.
Exercise 8.1.6. It is suspected that an industrial plant is polluting the water stream. To determine the extent of damage, water sample of size n = 13 was collected and the dissolved oxygen concentration was measured. The mean concentration was found to be x = 2.3. It is known from past experience that σ = 0.45.
Let X be a normal random variable with mean μ and variance σ2. Unlike in the last section, in this section we assume that σ is not known, and we try to compute a confidence interval of μ. In the last section, the main tool (or fact) that we used was
Z=(X-μ)/( σ/n1/2)
has N(0,1) distribution. In this section, we use the distribution of
T= (X-μ)/(S /n1/2).
The distribution of T is known as t-distribution with degrees of freedom n-1, which we have not discussed. As we did for the N(0,1) random variable, we now give the properties of t-distribution.
About t-distribution
Given a positive integer ν, there is a random variable T = tν that is said to have t-distribution with degrees of freedom ν. The useful properties of t-distribution is listed below:
![]() |
Flash animation of the pdf of t-Distribution. |
Theorem. Let X be a normal random variable with mean μ and standard deviation σ. Let X1,X2, ... , Xn be a sample of size n from the X population. Then
T= (X-μ)/(S /n1/2)
has t-distribution with degrees of freedom n-1.
So
P(-tn-1,α/2 < T= (X-µ)/(S /n1/2) < tn-1,α/2 ) = 1- α.
If we simplify, then we get
P(X-E < µ < X+E)=1-
α
where E = (s/n½)tn-1,α/2.
A (1- α) 100 percent Confidence Interval for µ
Under the terms of the theorem, a (1- α) 100 percent confidence interval for µ is given by
The T-Interval |
X-E <
µ < X+E where E = (s/n½)tn-1,α/2 . |
l = X - E will be called the left-end-point
r = X + E will be called the right-end-point |
E is also called the margin of error. |
Use of Calculators (if you have a TI-83): |
---|
|
The margin of error = E = (width of the interval)/2 |
Problems on 8.2: When σ is Unknown
Use your calculator to solve the following problems.
You can have a look at the flash-animated solutions for conceptual
reasons.
Exercise 8.2.1. Assume that we have normal populations with mean μ and standard deviation σ. We have a sample of size n = 18 that has sample mean x = 170.5, and standard deviation s = 13.3. Compute a 99 percent confidence interval for μ. Solution
Exercise 8.2.2. Suppose that the time taken to complete a problem in the Math 105 test is normally distributed with mean µ and standard deviation σ. A sample of size 23 was taken, and sample mean and standard deviation were found to be x = 4.7 and s = .47. Estimate the mean time μ taken to complete a problem using a 98 percent confidence interval. Solution
Exercise 8.2.3. It is assumed that the
lifetime (in hours) of light bulbs produced in a factory is normally
distributed with mean μ and standard deviation σ. To estimate
μ the following data was collected on the lifetime of bulbs:
5110 | 4671 | 6441 | 3331 | 5055 | 5270 | 5335 | 4973 | 1837 |
7783 | 4560 | 6074 | 4777 | 4707 | 5263 | 4978 | 5418 | 5123 |
Exercise 8.2.4. To estimate the mean time
taken to complete an event by an athlete, the athlete ran several
times, and the following sample of time taken to complete the event was
collected:
24.7 | 23.8 | 28.2 | 25.3 | 21.8 |
35.3 | 33.1 | 31.3 | 22.5 | 22.3 |
21.8 | 31.5 | 34.5 | 24.2 | 21.3 |
22.6 | 29.5 | 23.1 | 33.3 |
Compute a 95 percent confidence interval for the mean μ. Solution
Exercise 8.2.5. Suppose we collect a sample from a normal population of size n = 40 with sample mean X = 18.6 and standard deviation s = 9.486. Constuct a 95 percent confidence interval for mean μ. Solution
Exercise 8.2.6. The time taken by an athlete to run an event is normally distributed with mean μ and unknown standard deviation σ. To estimate the mean μ he ran 16 times, and the sample mean was found to be x = 33 seconds and sample standard deviation s = 3.5 seconds.
Exercise 8.2.7. Suppose that a sample of size n = 40 pumpkins collected from a farm had a mean weight x = 18.6 pounds and standard deviation s = 7.3 pounds. Give a 99 percent, approximate, confidence interval for the mean weight μ of the pumpkins in the farm. Solution
Exercise 8.2.8. A factory pays the workers depending on the number of units they produce. A sample of 72 workers produced a mean x = 13.4 units and standard deviation s = 2.1 units. Compute a 95 percent confidence interval for mean number μ of units produced by the workers.
Exercise 8.2.9. The mean μ daily number of classified ads published in a newspaper needs to be estimated. A sample over 84 days produced a mean x = 9910 ads and standard deviation s = 1105 ads. Give a 90 percent confidence interval for µ .
Exercise 8.2.10. To estimate the mean weight
μ (in pounds) of salmon in a river the following sample was collected:
34.7 | 33.8 | 38.2 | 20.3 | 27.8 | 45.3 | 43.1 | 37.3 | 32.5 | 32.3 |
31.8 | 41.5 | 44.5 | 29.2 | 25.3 | 29.6 | 39.5 | 29.1 | 37.3 |
Compute a 99 percent confidence interval for the sample mean μ.
Solution
Exercise 8.2.11. To estimate the mean μ
birth weight of the babies, the following data on birth weight was
collected.
8.8 | 8.1 | 6.3 | 9.7 | 6.3 |
7.1 | 5.3 | 7.7 | 9.1 | 8.1 |
8.2 | 7.9 | 8.3 | 8.9 | 9.0 |
10.1 | 9.9 | 8.8 | 7.8 | 5.2 |
7.2 | 7.4 | 9.1 | 8.6 | 6.2 |
6.3 | 5.2 | 8.3 | 5.9 | 5.5 |
7.1 | 8.1 | 7.9 | 6.3 | 6.9 |
9.1 | 8.1 | 7.0 | 4.9 | 5.3 |
6.3 | 7.1 | 6.3 | 6.1 | 5.8 |
5.7 | 6.8 | 8.3 | 7.7 |
Let p be the population proportion of a certain attribute. Typical examples are:
Y = 1 | if success |
Y = 0 | if failure |
Y is a Bernoulli(p) random variable. We draw a sample X1,X2, ... , Xn from the Y population, let
X = X1+ ... +Xn
be the total number of success and
X=X/n
be the sample proportion of success. It follows from CLT that, approximately, the sample proportion X has
N(µ X,
σX)-distribution
where µ X = p and σ
X = [(p(1-p))/n]1/2.
Therefore,
P(-z α/2 < (X-p)/σ X < z α/2 ) = 1- α.
In an attempt to compute a confidence interval for p, we simplify and get
P(X-z α/2 σX < p < X+z α/2 σ X) ) = 1- α.
In an attempt to compute a confidence interval for p, we simplify and get
P(X-z α/2 σX < p < X+z α/2 σ X) ) = 1- α.
Because p is unknown, this will not produce a confidence interval for p. But the sample proportion x of success is a point estimate of p. We have, an approximate, (1- α) 100 percent confidence interval for p given by
The 1-Proportion-Z--Interval |
x-e < p < x+e |
where
|
l = x - e will be called the left-end-point
r = x + e will be called the right-end-point |
e is also called the margin of error. |
A conservative
margin of error E is defined as E = z α/2 (1/4n)1/2. |
It can be checked that the margin of error e
is always less than or equal to the conservative
margin of error E.
Remark. During President Clinton's term in the White House we often
heard TV news commentators read something like the following:
President Clinton has 64 percent approval rating. The poll has a margin of error plus or minus 3.1 percentage points. The poll surveyed 972 people.
They mean that the sample proportion x of people who "approve" President Clinton is 0.64. Normally they don't tell us the level of confidence they are using. Assuming that they use 95 percent confidence interval, they mean that E = z α/2 (1/4n)1/2
= 1.96(1/ 4*972)1/2 = 0.031.
Use of Calculators ( TI-83) |
---|
|
Problems on 8.3: About the Population Proportion
Use your calculator to solve the following problems.
You can have a look at the flash-animated solutions for conceptual
reasons.
Exercise 8.3.1. In a sample of 197 apples from a lot, 19 were found to be sour. Set a 99 percent confidence interval for the proportion p of sour apples in the lot. Solution
Exercise 8.3.2. A new vaccine was tried on 147 randomly selected individuals, and it was determined that 97 of them developed immunity. Find a 95 percent confidence interval for the proportion p of individuals in the population for whom the vaccine would help. Solution
Exercise 8.3.3. For the coming congressional election, a poll was conducted: Out of 887 randomly selected voters interviewed, 389 said that they would vote for Candidate A, 359 said that they would vote for Candidate B.
Exercise 8.3.4. A pollster wants to estimate the proportion p of Americans who thought that President Clinton should not have been impeached in 1998. The pollster interviewed 711 individuals, and 420 agreed. Compute a 95 percent confidence interval for p.
Exercise 8.3.5. The proportion p of defective light bulbs produced by a machine needs to be estimated. A sample of 812 bulbs were tested, and 162 were found to be defective. Compute a 98 percent confidence interval for p.
Exercise 8.3.6. In a poll read on October 28, 1998, it was revealed that 60 percent of Americans wanted President Clinton rebuked but not impeached. It was also given that poll was conducted among 1,013 adults, and it had a margin of error of 3 percentage points.