MATH 251, Probability and Statistics I, Fall 2005, Wed. Nov. 2, Day 29After classcorrected

Read 6.1; I'll go through "Choosing the sample size" (391-2) next time, but read Cautions, p. 393-4 carefully. I won't lecture on it or outline it on the webpage.
Hand in: 
p. 396, using formula 
6.5, 6.7 biomarker, C
6.13, 6.14 study habits
6.3, 6.4 changes of m with n and C
6.9  schoolchildren/third grade
Hand in: (moved from Read column)
6.20, 21 Use Applet:  Confidence Interval

Postpone 6.15 and 6.12
6.15abc calories--CI and nearness.
6.12 skewness

Cautions (pp. 393-4) 
6.30 kindergarten, margin of error
6.29ab four intervals (Notice, these questions are about a binomial distribution, B(4, .95))

Read, discuss 
Hand in:
6.20, 21 Use applet.

6.31 Gallup, margin of error

Optional 
(more practice) 
6.6, 6.8 OC
Take 4 Numbers (random sample) from the Birkenstock box:  Find mean xbar.  Find xbar + .841.  This is your interval estimate of the unknown mean of the box's population.  ("margin of error" is .841) (Return your numbers afterward.)
Add your values to the list, and graph your interval on the transparency circulating.
     If xbar = 8.0       7.159|_____________8.0_____________|8.841

Quiz Monday: Knowing and using: Binomial distribution formula (p. 349 bottom)
mean and st. dev. for Binomial:  X (count), p-hat (proportion)   (Summary p.350)
mean and st. dev. for X-bar from SRS of size n   (Summary p. 368)
Normal?  Central limit theorem: says yes for all of the above distributions, approximately, for n large.
If population(s) normal to start with, linear combinations stay normal (including X-bar), mean and s.d. follow algebra rules (as last quiz.)
Mean and variance quizzes back:  Main problems:
     Mean and s.d. of a distribution (population) are actually completely parallel to our computations for data, viewed correctly:
          X    1   2   4
          p   .2  .2  .6
# out of 10    2   2   6 Consider a population of 10, with these proportions.
 Mean:  (1+1+2+2+4+4+4+4+4+4)/10 = (1·2 + 2·2 + 4·6)/10 = 1·.2 + 2·.2 + 4·.6 = 3
 Population variance (divide by n)     [Sample variance (divide by n-1)--the one we learned in Chapter 1]
 = [(1-3)2+(1-3)2+ (2-3)2+(2-3)2 +(4-3)2+(4-3)2+(4-3)2+(4-3)2+(4-3)2+(4-3)2+(4-3)2]/10
= [(1-3)2·2 + (2-3)2·2 + (4-3)2·6]/10 = [(1-3)2·.2 + (2-3)2·.2 + (4-3)2·.6]

Variance of X - Y = Variance of X +(-1)Y = Variance of X + Y, because (-1)2 = 1.
Pesky added constant.  Another way to remember what  the variance of a +bX is.
    Let Y = a, with probability 1 (no matter what happens, a is the result).
  y |  a          Mean of Y = a.   Variance of Y = 0
  p |  1          Then sigma2(a + bX) = sigma2(Y + bX) = sigma2(Y) + sigma2(bX) = 0 +  sigma2(bX) = 0+ b2sigma2(X)


Homework questions? Day 28

Introduction to Statistical Inference:
  Requires: Random sample or Randomized experiment.  (Our theory: Simple Random Sample usually)
First example:  Use sample mean xbar  to "estimate" (unknown) population mean µ

 Mean of 4 grades (HW#3.75, sec. 3.4) estimates population mean of all 10 ("known"= 69.4)
       E.g. 69.75,  64.25,  73.5    (Each is a "point estimate")

Confidence intervals (sec. 6.1) This is one of the two big ideas of inference that we will study.  Chapter 7 will extend this simple idealized situation, so this needs to be firmly in place.

Interval estimate:  xbar + margin of error (fudge factor)  estimates population mean µ (69.4)

    69.75 + 1:   "µ is between 65.75 and 73.75"  True
    69.75 + 4:   "µ is between 65.75 and 73.75"  True
     73.5 + 4:    "µ is between 69.5 and 77.5"  False
     73.5 + 5:    "µ is between 68.5 and 78.5"  True
      64.25 + 4:   "µ is between 60.25 and 68.25"  False
      64.25 + 5:   "µ is between 59.25 and 69.25"  False

A level C  Confidence interval estimate of a(n unknown) population parameter:

Confidence Interval of the form  estimate + margin-of-error  for the mean µ with Confidence level C: (p.388) (Table A, or Table D (back flyleaf), t dist. bottom row)
Example:  Sample of size 9 from a Normal population with unknown mean and pop. s.d. sigma = 6,  xbar = 12.
  Find a 90% CI estimate for the unknown mean µ:  z* = 1.645,  (sigma)/ sqrt(n) = 6/3=2, so m = 3.290;
                       CI is 12 + 3.290, or  8.710 to 15.290.

The Birkenstock box contains numbers from a normally distributed population, with population standard deviation 2.
You each constructed a 60% confidence interval for the unknown mean:
    n = 4.
    Standard deviation of sample mean = 2/sqrt(4) = 2/2 = 1
    z* for C = 60% is .841, so margin of error m is .841 times 1= .841.
To get the z* for C = 60% from the normal table, note that this is the middle 60%, which leaves 40% to be split between the 2 tails.  So 20% above z*,  and 80% below.  Go into the body of table A, find 80%= .8000 is between values .7995 and .8023, closer to .7995.  The z value with .7995 below it is .84.  Table D gives it more precisely as  .841.
Start here Friday:  How many people captured the true mean?
( previous classes,11/20 = 55% ,  22/29= 76%.   9/18 = 50% , 11/20 = 55%,  15/22= 68%,  16/24 = 67%
16/18 = 88%  7/13 = 54%.  Combined, 107/164 = 65%
Quite variable for small samples, but settling down?)

Applet:  Confidence Interval shows it's not the individual interval that C describes, but the Method.

Why does the formula work? (pp. 387-8, briefer there)
1. A particular xbar is within m of the population mean, if and only if the interval xbar + m contains the population mean.
     P(µ - m < Xbar < µ + m) =  P(Xbar -m < µ  < Xbar + m ) = Prob. that CI contains µ.

2. We choose z* (and from it m) so that the probability that Xbar is within m of the population mean   is C.

Extension:  If n is  large, we can use the formula even if population is not normal.
     (Because only the distribution of Xbar is used, and Xbar is normal!  Central Limit Theorem)
Cautions read pp. 393-4

Sievers home  Math251-Fall05/Dayps29.htm  11am   11/2/05
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.