Math 365, Elementary Statistics |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Lesson 8 : Comparing Two PopulationsSatya Mandal Due Date: Visit the homework site. IntroductionIn this lesson, two populations will be compared by interval estimation. The following will be considered:
8.1 Confidence Interval of μ1- μ2Suppose X, Y are two similar random variables. Let mean and standard deviation of X be, respectively, μ1 and σ1. Let mean and standard deviation of Y be, respectively, μ2 and σ2. We want to compute a confidence interval for the difference μ1- μ2. We proceed as follows.
Problem Solving: As in sections 7.1 - 7.3, the TI-84 has a method that essentially computes the 2-Sample Z-interval and the other two confidence intervals in these section. In any case, we will use above the formulas along with the help of invNormal function of TI-84 (solve by a "Long Hand Method"). Problems on 8.1: Confidence Interval of μ1 - μ2 Exercise 8.1.1. Suppose we have two normal populations with means μ1, μ2 and standard deviation σ1, σ2 respectively. It is known that σ1 = 8.1 and σ2 = 11.3. A sample of size m = 64 was collected from the first population, and the sample mean was found to be x = 3.7. A sample of size n = 99 was collected from the second population, and the sample mean was found to be y = 4.1. Compute a 99 percent confidence interval for the difference of mean μ1- μ2. Solution:
Level of confidence = 99 percent. So 1 - α = .99, α = .01 and α/2 =.005. Therefore, zα/2 = z.005 = invNormal(.995)= 2.5758 MOE = E = zα/2
σ
= zα/2√( σ12/m +
σ
22/n)
= 2.5758
√
((8.1)2/64 +
(11.3)2/99)
=
3.9191
Exercise 8.1.2. The birth weight of babies in developed and developing countries are normally distributed with mean μ1, μ2 and standard deviation σ1, σ2, respectively. (My data is not real.) Given σ1 = 2.3 pounds and σ2 = 2.9 pounds. A sample of size m = 35 babies from the developed nations were collected and the sample mean birth weight was found to be x = 8.9 pounds. A sample of size n = 48 babies from the developing nations was collected and the sample mean birth weight was found to be y = 7.1 pounds. Determine the margin of error of the difference μ1- μ2 and a confidence interval at the 95 percent level of confidence. Solution:
Level of confidence = 95 percent. So 1 - α = .95, α = .025 and α/2 =.025. Therefore, zα/2 = z.05 = invNormal(1-.025)= 1.9600 MOE = E = zα/2
σ
= zα/2√( σ12/m +
σ
22/n)
= 1.9600
√
((2.3)2/35 +
(2.9)2/48)
=
1.1197
Exercise 8.1.3. African elephants and Indian elephants are different in height, weight, and length of ear and tusk. It is natural to assume that all these are normally distributed. The mean height and standard deviation of African elephants are μ1, σ1 = 1.2 feet, respectively. The mean height and standard deviation of Indian elephants are μ2, σ2 = 1.1 feet, respectively. A sample of size 25 African elephants were collected and the sample mean height was found to be x = 10.9 feet. A sample of size 28 Indian elephants was collected and the sample mean height was found to be y = 9.1 feet. Determine the margin of error and a confidence interval of the difference μ1- μ2 at the 98 percent level of confidence.
Solution:
Level of confidence = 98 percent. So 1 - α = .98, α = .02 and α/2 =.01. Therefore, zα/2 = z.01 = invNormal(1-.01)= 2.3263 MOE = E = zα/2
σ
= zα/2√( σ12/m +
σ
22/n)
= 2.3263
√
((1.2)2/25 +
(1.1)2/28)
=
.7386
Exercise 8.1.4.The mean weight of King salmon in Kenai and Anchor River would have to be compared. The mean weight of King in Kenai is μ1 and the standard deviation σ1 = 7.7 pounds. The mean weight of King in Anchor is μ2 and the standard deviation σ2 = 9.1 pounds. A sample of 51 King from Kenai had a mean X = 31 pounds. A sample of 63 King from Anchor had a mean Y = 33 pounds. Determine the margin of error and a confidence interval of the difference μ1 - μ2 at the 97 percent level of confidence. Solution:
Level of confidence = 97 percent. So 1 - α = .97, α = .03 and α/2 =.015. Therefore, zα/2 = z.015 = invNormal(1-.015)= 2.1701 MOE = E = zα/2
σ
= zα/2√( σ12/m +
σ
22/n)
= 2.1701
√
((7.7)2/51 +
(9.1)2/63)
=
3.4154
Exercise 8.1.5. There is a difference between fall semester grades and spring semester grades. The mean percentage score in fall is μ1 and the standard deviation σ1 = 27 percent. The mean percentage score in spring is μ2 and the standard deviation σ2 = 23 percent. A sample of 87 students in fall had a sample mean score X = 73 percent. A sample of 77 students in spring had a sample mean score Y = 69 percent. Determine the margin of error and a confidence interval of the difference μ1 - μ2 at the 96 percent level of confidence. Solution:
Level of confidence = 96 percent. So 1 - α = .96, α = .04 and α/2 =.02. Therefore, zα/2 = z.02 = invNormal(1-.02) = 2.0537 MOE = E = zα/2
σ
= zα/2√( σ12/m +
σ
22/n)
= 2.0537
√
((27)2/87 +
(23)2/77)
=
8.0198
Exercise 8.1.6. The difference in mean annual salary of the professors in two state universities have to be estimated. The mean annual salary in the University -I is μ1 and the standard deviation σ1 = $16,000. The mean annual salary in the University -II is μ2 and the standard deviation σ2 = $11,500. A sample of 47 professors in University-I had a mean salary X = $79,000. A sample of 58 professors in University-II had a mean salary Y = $71,500 Determine the margin of error and a confidence interval of the difference μ1 - μ2 at the 94 percent level of confidence. Solution:
Level of confidence = 94 percent. So 1 - α = .94, α = .06 and α/2 =.03. Therefore, zα/2 = z.03 = invNormal(.97) = 1.8808 MOE = E = zα/2
σ
= zα/2√( σ12/m +
σ
22/n)
= 1.8808
√
((16000)2/47 +
(11500)2/58)
=
5228.1439
8.2 When σ1 and σ2 are UnknownAs in the last section, X, Y represent two similar populations. In this section also, we will construct confidence intervals for the difference of means μ1 - μ2, when the standard deviations σ1, σ2 are unknown. As a price, we would have to assume that X has N(μ1, σ1) distribution and Y has N(μ2, σ2) distribution. Take a sample X1, X2, …, Xm of size m from the X population, and take another sample Y1,Y2, …, Yn of size n from the Y population. We proceed as follows.
Problems on 8.2: σ1 and σ2 are Unknown Exercise 8.2.1. Suppose that two "similar" normal populations have means μ1, μ2 respectively and same standard deviations σ. A sample of size m = 11 from the first population the sample mean was found to be x = 13.2 and the sample standard deviation s1 = 2.33. A sample of size n = 13 was collected from the second population that had a sample mean y = 11.5 and sample standard deviation s2 = 2.73. Compute the pooled estimate sp of σ and a confidence interval for μ 1- μ2 at the 96 percent level of significance. Solution:
The pooled estimate of σ is given by Sp
=
√
([(m-1)SX2+(n-1)SY2 ]/
[m+n-2])
MOE = E =
tm+n -2, α/2Sp
√
(1/m + 1/n)
=
2.1829*2.5560
√(1/11 + 1/13)
= 2.2858 Exercise 8.2.2. Suppose we have two normal populations with means μ1, μ2 and equal standard deviation σ. A sample of size m = 64 was collected from the first population and the sample mean and standard deviation were found to be x = 3.7, s1 = 9.2 . A sample of size n = 99 was collected from the second population and the sample mean and standard deviation were y = 4.1, s2 = 8.7. Compute the pooled estimate sp of σ and a confidence interval for μ 1- μ2 at the 95 percent level of significance. Solution:
The pooled estimate of σ is given by Sp
=
√
([(m-1)SX2+(n-1)SY2 ]/
[m+n-2])
MOE = E =
tm+n -2, α/2Sp
√
(1/m + 1/n)
=
1.9748*8.8990*
√(1/64 + 1/99)
= 2.8187 Exercise 8.2.3. The difference in mean monthly water consumption in two adjacent towns has to be estimated estimated. A sample 37 household in the Town-I had a sample mean 6300 gallons and standard deviation 450 gallons. A sample 49 household in the Town-II had a sample mean 6800 gallons and standard deviation 650 gallons. Compute a 94 percent confidence interval for the difference μ1 - μ2. Solution:
The pooled estimate of σ is given by Sp
=
√
([(m-1)SX2+(n-1)SY2 ]/
[m+n-2])
MOE = E =
tm+n -2, α/2Sp
√
(1/m + 1/n)
=
1.9065*572.8990*
√(1/37 + 1/49)
= 237.8840 Determine the margin of error and a confidence interval of the difference μ1 - μ2 at the 94 percent level of confidence. Exercise 8.2.4. The birth weight of the babies
in developed and developing countries are normally distributed with
mean μ1,
μ2 and equal standard deviation
σ. (My data is not real.) Suppose the following data about
the birth weight from developed and developing nations were collected.
Compute the pooled estimate sp of σ and a confidence interval for μ 1- μ2 at the 97 percent level of significance. Solution:
The pooled estimate of σ is given by Sp
=
√
([(m-1)SX2+(n-1)SY2 ]/
[m+n-2])
MOE = E =
tm+n -2, α/2Sp
√
(1/m + 1/n)
=
2.2445*1.2560*
√(1/21 + 1/24)
= .8424 Exercise 8.2.5. African elephants and Indian
elephants are different in height, weight, and length of ear and tusk.
It is natural to assume that all these are normally distributed. Assume
that the height of African and Indian elephants have an equal mean σ.
The mean heights of African elephants and Indian elephants are μ1,
μ2, respectively. Suppose
the following data were collected on the height of elephants from the
two continents (these are not real data).
Compute the pooled estimate sp of σ and a confidence interval for μ 1- μ2 at the 99 percent level of significance. Solution:
The pooled estimate of σ is given by Sp
=
√
([(m-1)SX2+(n-1)SY2 ]/
[m+n-2])
MOE = E =
tm+n -2, α/2Sp
√
(1/m + 1/n)
=
2.7079*1.1451*
√(1/20 + 1/21)
= .9688 8.3 Comparing Two Population ProportionsIn this section, estimation of the difference p1-p2 of the proportions of an attibute in two populations by confidence interval will be considered. Examples of such differences of proportions would include (1) the difference between the proportions p1 - p2 of the male and female populations who earn than fifty thousand dollars annually; (2) the difference p1 - p2 of the proportions of defective items produced by the old machine and the new machine in a factory. Connsider proportions of an attribute A in two populations. Let p1 and p2 represent the proportions of the attribute A, in Population I and Population II, respectively. A confidence interval of p1-p2 will be constructed. A sample of size m from Population I and a sample of size n Population II are collected. Let X be the number of sample members that have the attribute A and X=X/m be the sample proportion that has the attribute A. We take a sample from Population 2 of size n. Let Y be the number of sample members that has attribute A and Y=Y/n be the sample proportion that has the attribute A. ( In other words, X is the number of "success" and X=X/m is the proporrtion of success in Population I sample. Similarly, Y and Y=X/m for the Population II-sample.) Imortanly, these two samples are collected independently. (An Example: Compare proportion of the male and female members in a community who earn more thatn $50,000 annually. A sample of m male members are interviewed and X would be the number of those who make more than fifty thousand annually and X=X/m would be the sample proportion of those who make more than fifty thousand annually. Similarly, interview n female members and Y=Y/n would be the sample proportion of female members who make more than fifty thousand.) We develop a confidence interval for p1-p2 as follows.
Problems on 8.3: Comparing Two Population Proportions. Exercise 8.3.1. Two independent samples were collected from two populations to compare the proportions p1, p2 of an attribute A present, respectively, in these two populations. Use 95 percent confidence interval to estimate p1-p2. It is given that x = 55 had the attribute A in a sample of size m = 117 from the first population and y = 37 had the attribute A in a sample of size n = 79 from the second sample. Solution Solution:
Level of confidence = 95 percent. So 1 - α = .95, α = .05 and α/2 =.025. Therefore, zα/2 = z.05 = invNormal(1-.025)= 1.9600 E= Zα/2
√[
X(1-X)/m + Y(1-Y)/n
] LEP = X - Y - E = .4701 - .4684 - .142435 = -.140735 REP = X - Y + E = .4701 + .4684 + .142435 = .114135 Exercise 8.3.2. To compare the proportions
p1, p2 of defective lamps produced by new production center and old the production center,
respectively, samples were collected. In a sample of
157 lamps
from the new center, 26 were found to be defective;
and in a sample
of 141 lamps from the old center,
32 were defective. Compute a 99 percent confidence
interval for p1-p2 Solution:
Level of confidence = 99 percent. So 1 - α = .99, α = .01 and α/2 =.005. Therefore, zα/2 = z.005 = invNormal(.995)= 2.5758 E= Zα/2
√[
X(1-X)/m + Y(1-Y)/n
] LEP = X - Y - E = .1656 - .2270 - .118281 = -.179681 REP = X - Y + E = .1656 - .2270 + .118281 = .056881 Exercise 8.3.3. To compare the proportions
p1,p2 of men and women, respectively, who watch
football, data was collected. In a sample of 199 men, 83 said that they
watch football; and in a sample of 161 women, 51 said they watch football.
(These are not real data.) Construct a 99 percent confidence interval
for p1-p2.
Solution:
Level of confidence = 99 percent. So 1 - α = .99, α = .01 and α/2 =.005. Therefore, zα/2 = z.005 = invNormal(.995)= 2.5758 E= Zα/2
√[
X(1-X)/m + Y(1-Y)/n
] LEP = X - Y - E = .4171 - .3168 - .130804 = -.030504 REP = X - Y + E = .4171 - .3168 + .130804 = .231104 Exercise 8.3.4. Two varieties of grapes are compared. To compare the proportions p1, p2 of acceptable grapes in these two varieties, respectively, samples were drawn. In a sample of 131 grapes from the variety I, 107 were acceptable. In a sample of 143 grapes from the variety II, 113 were acceptable. Construct a 97 percent confidence interval for the difference p1-p2. Solution:
Level of confidence = 97 percent. So 1 - α = .97, α = .03 and α/2 =.015. Therefore, zα/2 = z.015 = invNormal(1-.015)= 2.1701 E= Zα/2
√[
X(1-X)/m + Y(1-Y)/n
] LEP = X - Y - E = .8168 - .7902 - .104111 = -.077511 REP = X - Y + E = .8168 - .7902 + .104111 = .130711 Exercise 8.3.5. To compare the proportions p1, p2 of students, respectively, in two state universities who pay more than $15 K tuition per year, samples were collected. In a sample of 217 students in the university I, 129 paid more than $15 K. In a sample of 313 students in the university II, 158 paid more than $15 K. Construct a 98 percent confidence interval for the difference p1-p2. Solution:
Level of confidence = 98 percent. So 1 - α = .98, α = .02 and α/2 =.01. Therefore, zα/2 = z.01 = invNormal(1-.01)= 2.3263 E= Zα/2
√[
X(1-X)/m + Y(1-Y)/n
] LEP = X - Y - E = .5945 - .5048 - .101656 = -.011956 REP = X - Y + E = .5945 - .5048 + .101656 = .191356 Exercise 8.3.6. To compare the proportions p1, p2 of college graduates who earn more than 50 K, in two states, data was collected. In a sample of 444 college graduates in the state I, 334 earn more than 50 K. In a sample of 546 college graduates in the state II, 414 earn more than 50 K. Construct a 96 percent confidence interval for the difference p1-p2. Solution:
Level of confidence = 96 percent. So 1 - α = .96, α = .04 and α/2 =.02. Therefore, zα/2 = z.02 = invNormal(1-.02) = 2.0537 E= Zα/2
√[
X(1-X)/m + Y(1-Y)/n
] LEP = X - Y - E = .7523 - .7582 - .056448 = -.062343 REP = X - Y + E = .7523 - .7582 + .056448 = .050548 Exercise 8.3.7. It is believed that women are safer drivers than men. Let p1, p2 denote the proportions of women and men drivers, respectively, who were involved in an auto accident in a year period. In a sample of a size 739 women drivers 39 were involved in auto accident during this period. During the same period, in a sample of size 1215 men 79 were involved in auto accident in a year. Construct a 95 percent confidence interval for the difference p1-p2. |