MATH 365, Elementary Statistics

Lesson 8 : Comparing Two Populations

Satya Mandal

Introduction

8.1 Confidence Interval of μ₁- μ₂

8.2 When σ₁ and σ₂ are Unknown

8.3 Comparing Two Population Proportions

Homework 26 - 28

Due Date: Visit the homework site.

Introduction

In this lesson, two populations will be compared by interval estimation. The following will be considered:

Compute confidence intervals of the difference μ₁- μ₂ of the means of two populations. For example, difference μ₁ - μ₂ between the mean annual income of the male population μ₁ and the mean annual income of the female population μ₂ could of some interest.
Compute a confidence interval of the difference p₁-p₂ of the proportions of an attribute present (or proportions of "success") in two populations. For example, there may be some interest in the difference p₁-p₂ between of the proportion p₁ of the defective items produced by the new machine and the proportion p₂ of the defective items produced by the old machine.

8.1 Confidence Interval of μ₁- μ₂

Suppose X, Y are two similar random variables. Let mean and standard deviation of X be, respectively, μ₁ and σ₁. Let mean and standard deviation of Y be, respectively, μ₂ and σ₂. We want to compute a confidence interval for the difference μ₁- μ₂. We proceed as follows.

A sample X₁, X₂, …, X_m, of size m, is drawn from the X population and a sample Y₁, Y₂, …, Y_n, of size n, is drawn from the Y population. Let
X = (X₁+X₂+ … +X_m)/m

Y = (Y₁+Y₂+ … +Y_n)/n

be the corresponding sample means.
BY CLT, we have that X has
N(μ₁, σ₁/√m )

distribution and Y has

N(μ₂, σ₂/√n )

distribution.
The statistic X-Y will be used as an estimator of μ₁- μ₂.
Assume that the X samples and Y samples are mutually independent. In that case, it follows that X-Y has
N(μ₁ - μ₂, σ) - distribution, where σ = √( σ₁²/m + σ₂²/n ).
It follows that
P(-z_α/2 ≤ ((X-Y) - (μ₁ - μ₂)) /σ ≤ z_α/2 ) = 1 - α.

where σ is as above in (4).
If we simplify, we get
P(X-Y -z_α/2 σ ≤ μ₁ - μ₂ ≤ X-Y +z_α/2 σ ) = 1 - α.

where σ is as above in (4).
Theorem. A (1-α)100 percent confidence interval for μ₁- μ₂ is given by

x-y -z_α/2 σ ≤ μ₁ - μ₂ ≤ x-y +z_α/2 σ . Which is rewritten as x-y - E ≤ μ₁ - μ₂ ≤ x- y + E

where E = z_α/2 σ and σ is as above in (4).

This formula is usable if we know the values σ₁ and σ₂. Informally, we call this the 2-sample Z-interval.
The margin of error (MOE) is defined as

E = z_α/2 σ = z_α/2 √( σ₁²/m + σ₂²/n)
As in lesson 7, we will use the terminologies LEP and REP.
All the above would be "approximate", which we take the liberty to not mention. If X and Y are normal, then all the above are exact.
When the samples sizes m and n are both large, then we can use the sample standard deviations s₁ ≅ σ₁ and s₂ ≅ σ₂, which can be used in the formula for MOE E.

Problem Solving: As in sections 7.1 - 7.3, the TI-84 has a method that essentially computes the 2-Sample Z-interval and the other two confidence intervals in these section. In any case, we will use above the formulas along with the help of invNormal function of TI-84 (solve by a "Long Hand Method").

Problems on 8.1: Confidence Interval of μ₁ - μ₂

Exercise 8.1.1. Suppose we have two normal populations with means μ₁, μ₂ and standard deviation σ₁, σ₂ respectively. It is known that σ₁ = 8.1 and σ₂ = 11.3. A sample of size m = 64 was collected from the first population, and the sample mean was found to be x = 3.7. A sample of size n = 99 was collected from the second population, and the sample mean was found to be y = 4.1. Compute a 99 percent confidence interval for the difference of mean μ₁- μ₂.