Math 151 , Day 34 Monday, April 21, 2008  .. hit reload...

HW Day34 .  P. 355, choosing n for a desired C and m.   Check,  sample size 14.17.  Start Reading Please! Chapter 15, Significance testing (the "other" big topic in inference)
Moore Ch. 14, Day 35  Hand in

Sample size for C.I. (& review of CI computations)You can check using the bottom section of the Confidence Interval Excel sheet.
p. 356, 14.10 Estimating mean IQ
p. 358 14.24 Hotel managers
p. 360, 14.33 calibrating a scale

A.  New Shoeboxes: On a Separate sheet:  (2 shoeboxes. )The shoeboxes are outside my door if you missed doing them in class. For each sample of size  4 from a shoebox, write down the values, find the mean, (know which box you got them from: White #s, green box. Yellow, red top.) and tell whether you believe the population mean for that box is 20, or something bigger. (Your gut feeling.) Does it help to know that the standard deviations for the shoeboxes are both 4?  Bring your sample numbers and xbars  to class  to pool, and Keep for further computations.)   (This is related to Chapter 15, where we'll learn the formal methods.)
= = = = = = = = = Postpone the rest
Beginning Ch. 15
p. 364, 15.1  Anemia
p. 364, 15.2  Older students (done in class)

Read, 
to discuss
Optional
A few  problems good to review for the exam (repeated)
p. 419, 17.7 Day care, parameter or statistic
p. 422, 17.27 and 28 means vs. individuals.  In #27, they're taking the "about what range" to be the interval containing the middle 99.7%--almost all. (Answer to last question of #28 is
"no"--histogram of individual values in sample will be distributed (roughly) like the population.)
p. 421, 17.26 WAIS, n = 1, n = 60 (Answers: a) about .3707, b) 100, 1.936, c).0049, d) a could be quite different; b still correct, c approx. right bcs of Central Lim. Th.))
Exam 4 this Friday .  Covers  Ch. 10 , Ch. 11 up to p. 286 only, Ch. 14 all, Ch. 16 thru p. 391. (i.e. thru Day 34 HW).    Sample Exam  (Handed out Fri. Outside my door..)  Solutions .
   Sign up Today or Wed. for early (>9:30) start:  Confirm with me any other time to take it.

     Buffer against one low hour exam:
The final % exam grade minus 10 points will be substituted for the lowest hour exam grade, if it is higher.

Examples:
Ex1 Ex2 Ex3
Ex4 final % final -10
Student 1 Original 85 80 85
60 85 75, replaces lower 60
Treated 85 80 85
75 85 <-- These will be used.
Student 2 Original 85 80 80
70 75 65, lower than 70, don't replace.
Treated 85 80 80
70 75
Student 3 Original 85 50 75
55 85 75, replaces lower 50
Treated 85 75 75
55 85 <--These will be used
This is to encourage any who are nervous about Exam 4, and to encourage all to try to put it together for the final.

New Shoeboxes:
On a Separate sheet: 
(2 shoeboxes. )The shoeboxes are outside my door if you missed doing them in class.  From each shoebox, take a sample of size 4, and write down the values.  (Know which box you got them from: White #s, green box. Yellow, red top.)   Do A above for HW.

Confidence intervals,
Day 32 and  Day 33
RECAP:  Confidence Interval of the form  estimate + margin-of-error  for the mean µ with Confidence level C: (p.349-50) (Table A, or Table C, t dist. bottom row)
 
            Check your calculations with the ConfidenceInterval.xls Excel spreadsheet
 Tradeoffs: for sharper (narrower) margin of error, must  accept lower confidence level, OR take larger sample.

In practice: pp. 388-391
SRS--other random samples get other formulas. 
   Nonrandom or biased  samples simply can't do C.I.

    Sometimes we can plausibly think of data as SRS from large population (rolling dice, repeated weighings on scale)
     -- For experiments, randomizing into groups allows us to use the methods; but be careful about generalizing far beyond our "volunteers" type.
     Ask how reasonably "like" a SRS the sample is.

Xbars are  normal!  OK IF 1) population is normal, or 2) n big enough for Central Limit theorem.
    Outliers?  Trouble (xbar is sensitive).   Slight outliers ok (see next)
    Skewness?  "Moderate" sample size allows CLTh to overcome all but strong skewness. (Numbers for "moderate" in Ch. 18)
Sigma for population is known.  Rarely true in practice. 
          Large n? Could substitute s calculated from sample as "good" estimate of sigma.
          Small n--Ch. 18, a slight modification of these methods takes care of unknown sigma.

- - - - - - - - - - - - - - - - - - - - - -
Homework questions?    Day 33
Homework 14.34 and 14.35 , p. 160.     
     14.35:    T = true value of parameter,  * = value of this statistic  * = results of other surveys
                           ** *
                        * ******
               * * ** ******T**** *** ** * *   

Clustering will be around True value, not around the one we got this time.

New today: Planning ahead:  Choose sample size big enough to satisfy desired: margin of error, confidence level. p. 355
Given C and m = margin of error,  (and sigma), find n.
   Method:  Use C to find z*.  Plug in to formula for m, and solve for n.  Or memorize formula for n and plug in to it.
,    n = (z* sigma / m)2
     Note:  z* sigma still on top.  m and n change places, and whole thing is squared!
           Round up!  If you get n = 5.06, you need a sample of size 6 to get your margin of error at least as short as you want.
                      ConfidenceInterval.xls  Excel spreadsheet will check your calculations.  Show your work on HW!


Why does the CI formula work? (optional)
Not done Mon.

Algebra gives P(Xbar - m < µ <  Xbar + m) = C

= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = 

"Statistics means never having to say you're certain."
Confidence interval Estimation made our best guess at an unknown population mean.
Testing will investigate a claim made that the unknown mean is actually a particular value.
~~~~~~~~~~~~~~~~
 
Ch. 15: "Significance tests use an elaborate vocabulary, but the basic idea is simple: an outcome that would "rarely" happen if a claim were true--is good evidence that the claim is NOT true." (p.363 top)
Suppose someone claims that the average height of Wells women over the years is 70" (5'10").  I take samples (151 classes) every year.  Last term my sample has mean 65.67" (n = 20ish). Standard deviation for heights of women in population is supposed to be about 2.5" , so s.d. for means from samples of 20 is about 2.5/4.48= 0.56.
IF
the real mean is 70", my sample is astonishingly unusual:  z =(65.67-70)/0.56= -4.33 /0.56 = -7.73, 7.73 s.d's below the mean.  Conclude the claim is Not true. Did this Mon.

- - - - - - - - - - - - - - - - - - - - - - - -
Extended Standard Normal Table--"Normal Tails" (also from Weblinks page, )
  z         P(Z < z)                         P(Z > z)   = same in scientific notation: E-03 = 10-3
3.00   .9986501019683700    .0013498980316301    1.35E-03
4.00   .9999683287581670    .0000316712418331    3.17E-05
5.00   .9999997133484280    .0000002866515718    2.87E-07
6.00   .9999999990134120    .0000000009865877    9.87E-10
7.00   .9999999999987200    .0000000000012799    1.28E-12
8.00   .9999999999999990    .0000000000000007    6.66E-16  Below this, machine can't compute. If your assumptions lead you to a(n almost) impossible z value, question your assumptions!
(The basis of significance/hypothesis testing)

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Need machinery
to analyze less "obvious" results--build in effect of standard deviation (if s.d. were 10" would my sample still be inconsistent with the claim?) and sample size (if n were only 4 would that change my result?) .
Do 15.2 p. 365 Are older students like traditionals, or higher (on average) on this measure?:  Normal, s.d. = 30.  Claim:  pop. mean = 115. (same as traditional.) 
   n = 25 older students.   IF mean is really 115, Xbars are N(115, 6).   Sketch!  Hypothetical xbars:
      xbar = 118.6 .  118.6-115=3.6.  This is 3.6/6 = 0.6 s.d.'s above the mean, a pretty typical kind of value.
      xbar = 125.8    125.8-115=10.8.   This is 10.8/6 = 1.8 s.d's above the mean, high enough to be pretty unusual (how unusual? Table A) if the mean is really 115.
      xbar =  139     139-115=24. This is  24/6 =  4 s.d.'s above the mean, unreasonably high if the mean is really 115.
   So 125.8 or (more so!) 139 would be evidence that the mean for this group (older students) is NOT 115, is in fact higher.

- - - - - - - - - - - - - - - - - - - - - - - -
General exam questions, if time; continue next time.  Continue with Ch. 15


Sievers home  Math151-Sp08/Days34.htm  2pm 4/21/08
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.