Math 151 , Day 38, Monday, Dec. 1, 2008 .After class, links fixed. Hit reload .

HW Day38   Continue  Ch. 15, to p. 376. Then 377-79. (Optional: Two-sided Tests from Confidence intervals pp. 379-80) Check: p. 381  Hypotheses: 15.26, 27.  Test statistic15.28.   P-value (one-sided) 15.31, 32.  P-value (two-sided) 15.29, 30, 33.  Significance 15.34.  Test<--CI (0ptional) 15.35  Which of the answers to 35 is self-contradictory?  Which one makes logical sense?
 Next: Read rest of Ch. 16, to p. 396. Optional: Lightly, for the words and concepts of power, effect size, type I and II errors  pp. 396 to 402.   Check p. 405, 16.20, 16.24, 16.25 (c you should be able to take for granted.) 16.26 (they say b--but you can't really do probabilities on the nonresponse and other errors, so I don't think this is a well posed answer.  A better answer would talk about how much you can trust the interval) 16.27.  Ch 17 outlines this section of the book.
Start reading Ch. 18!  We'll repeat the CI and test work, only with s instead of sigma, and t instead of z. 

Hand in  Wed. 

If you didn't, do A from Day 37 (shoebox simulation)
Complete and repair Day 37 HW.
Be sure to do 18,19, 37, 42, 43, 44,on a separate sheet, leaving a little space for using the "bracketing" numbers from table C.

Postpone the rest:

Setups and Calculations.  (Started last time) Use the Applet:  P-value of a test of significance to check your work.  For 18, 19, 37, 38, 42, 43,44, be sure to:,Write down your H's, your xbar, your z,  your P. Make a rough sketch of the normal dist. when H0 is true and the direction(s) of evidence for Ha . Mark your z on it. 
Use Table A (normal table) to find P-value.
Now: Use table C with your z to find "bracketing"  numbers for P:   ___< P < ___.  Check that your P calculated from Table A is between the bracketing numbers from Table C
p. 376, 15.18 Water quality
p. 376 15.19 SAT  Check the mean you calculate with the back of the book.
p. 382, 15.37 IQ test scores (The mean from the data is 105.84)
p. 383, 15.38 hotel managers
p. 383  15.42 Supreme Court
p. 383, 15.43  wrong alternative
p. 383, 15.44 the wrong p

"Significance"and table C:
p.379 15.21&22  significance, Table C, 1 and 2 sided
p. 379 15.23 23 significance, Table C, 2 sided

Cautions about significance tests (and CI's). These problems mostly cover old ideas, carried forward. You should be able to do them, after reading the assigned parts of Ch. 16.
p. 393, 16.5 Is it significant?
p. 394-5 16.6&7  Acid rain.  Do them by hand, and on the Applet.  You should, of course, get (close to) the same answers both ways.
p. 395, 16.8 Acid rain, Confidence intervals.  (there are only 3, not 6, since #6 and #7 are the "same" problem)
p. 396 16.9 rich parents and education

p. 407. 16.32 evidence, pacemakers
p. 407, 16.34 a, b. larger samples
p. 407, 16.35 significance is good for...
p. 408 16.36 sensitive questions (CI)
p. 408 16.37 college degrees (CI)
p. 408 16.39 supermarket shoppers (The data are in order, so a stemplot is easy)

p. 409 16.43 comparing package designs (What did they not tell us that we would want to know?)
p. 409, 16.44 island life (correlation coefficient)
p. 409, 16.45 helping welfare mothers  (The Clinton "welfare reform" depended, probably too much, on studies of this sort)

Read, 
to discuss
For 15.35, p. 382: Ignoring the actual question:  Which (a or b) of the answers to 35 is self-contradictory?  Which one makes logical sense (whether or not it's true)?  Sketch a normal curve and mark out the (two-sided) areas for alpha = .10 and alpha = .05.
Optional 
(more practice) 
 
Your Real shoebox results: If you didn't last time: Write your xbars , z's, P-values, and <.10 (Y/N) (one on each paper--yellow or white).  , make a dot for each on the circulating dotplot transparency.  Put Y's on the papers when you've plotted your dots.
Your Shoebox simulations:  Hand in
the sheet with your simulations and dotplots.  On the pad circulating, write how many out of your 10 simulations had P-value
<.10. (were "significant at alpha = .10)  (For Mean = 20, and Mean = 24)

Exams back. Discussion done today.  Solutions.
Wed. Dec. 17, 9-12 a.m.  If this is a problem for you, please email me very soon.  
   Alternative--Monday afternoon (The favorite so far.), Tuesday morning/afternoon?? (Email your possibilities; I'll pick one!)
  Full exam schedule is at   http://www.wells.edu/pdfs/finals.pdf
     Registrar's page with link to this and other good stuff: http://www.wells.edu/academic/regist.htm
Ch. 15: "Significance tests use an elaborate vocabulary, but the basic idea is simple: an outcome that would "rarely" happen if a claim were true--is good evidence that the claim is NOT true." (p.363 top)
Day 35 for other details.  Summary, comments:

We went through the machinery of testing and used the Applet:  P-value of a test of significance. Will finish the work on this page next time.

The game:
Before taking data, define
H0: "Null hypothesis" A claim or statement about the population we would like to show is NOT true.
   Stated usually as:  A parameter = a particular value.  H0: µ =1000 hrs.  ("Average lightbulb life".)
Ha: "Alternative hypothesis" A claim or statement about the population we are trying to find evidence FOR.
      Stated usually as: The parameter  is >, or <, (one-tail tests) --
                       or NOT = the particular value. (two-tail)
    Ha:   µ  > 1000 hrs. (Suppose we have a New process that makes them burn longer. We hope.)
    Other possible alternatives: Ha:   µ  < 1000 hrs.  (Want evidence that Mfr.'s claim is inflated)
             (two-sided=two-tail) Ha:   µ  Not = 1000 hrs.  (Want evidence that Assembly line process is"off")

   Some authorities say you should always do two-sided tests.  Others say:  If you have a hope or suspicion; are only interested in one direction, then do it that way.  What's NOT OK is to look at your data and then decide your alternative hypothesis.
HW questions?  Day 37: 15.3, 4, 6, 7. Also p. 381, #45, #43a

Take data.  Calculate test statistic. For µ, test statistic is the z-score of xbar. (Start with xbar, standardize using mean of H0)
    Is it an unlikely result if  H0 is true?  Then that is evidence against H0.
HW questions?  Day 37: 15.8, 9, 10

Measuring the strength of the evidence against H0 (a common measuring stick for all distributions and parameters)--how weird is my observation if H0 is true?:
P-value of a test:  The probability, computed assuming that H0 is true, that the observed outcome would take a value as extreme or more extreme than that actually observed (if we could repeat taking-data again).  p. 368. Table A
    The smaller the P-value, the stronger the data's evidence against H0 ( for Ha).

For a test of µ  , using xbar (sigma known), the P-value is
--the area of the tail beyond the observed xbar, in the direction of Ha (one-sided)
(--or twice that area (two-sided).)
<>Applet:  P-value of a test of significance automates this.  (Uses "raw" scale of xbars, rather than z-scores).  Table A
HW questions?  Day 37: 15.12, 13, 14 (one-sided).  11 (two-sided.)
Other HW questions on machinery? 18, 19, 37(2sided), 38(?2sided), 42, 43(2sided), 44(wrongP)

As n increases, the P-value corresponding to a particular x-bar (if it's in the correct tail) gets smaller. Applet:  15.40
    Makes sense: more data; more evidence against H0 (if it's actually not true)
      Same issue as: Effect of sample size on distribution of x-bars: NormalandXbar.xls

A "Significance level" alpha is a probability level we decide on  in advance as being the "rarely" amount that will push us over into believing (well, sort of) that the H0 claim  is not true. (Historically older language than P-value.  Appropriate levels vary by discipline.)
We tend to use simple benchmark numbers for it, like .10 (1 in 10), .05 (1 in 20), .01 (1 in 100).
When the P-value is less  than (or equal to) a particular significance level alpha (say .05), we say,
    "The results are significant at the alpha = .05 level," or "The results are significant (P< .05)" .  Giving actual P is better, if you can.
 Lightbulbs:  One-sided:  .0228 = P-value.  More than  2% and less than 3% chance of getting a result this far out (in this direction) if we did it again.
          "Significant at the alpha =.03 level.  Also at the alpha = .05 level"  (P-value says,  rarer than these levels)
          "Not significant at the alpha = .02 level.  Also not significant at the alpha = .01 level"  (P-value says, more common than these levels)
   Two-sided:  .0456 = P-value.  (Barely) less than 5% chance of getting a result this far out if we did it again.
            "Significant at the alpha = .05 level. (Also at alpha = .10).   Not significant at the alpha = .04 level.  Nor .01 level.
Applet:  Statistical Significance
You can pick the alpha you desire, and see if your x-bar lies outside the "alpha" barrier(s). (approach of p. 376-79) But P-value is more informative.
HW 15, 16, 17

"Real" shoebox data.  Last term
This term Updated
(I can see that several yellow samples with mean 25 or above haden't been plotted yet.)  Are now (Wed.)

- - - - - - - - - - - - - - - -
What if you don't have the Z-table but only have the t-table (Table C)?
What if you have a demanded level of significance, alpha?
    Table C: a limited list of probabilities  across the bottom rows:
            = Tail values for the bell curve distribution.   (one sided = one tail, two sided = two symmetrical tails)
        The value in the z* row above  P is the corresponding standard normal value ("critical value"). 
                 Check z* = 1.960,  prob. .025 above it (or below -1.960).  .05 farther out than it.  Corresponds to Table A.
      
  Do this: Find your z from the data. Make a sketch of the standard normal curve and mark your z on it.  Mark the direction(s) of Ha.
    (If your z is in the direction(s) of Ha, continue.  Otherwise the results are hopelessly not significant: you can quit.)
Find the two z*'s in Table C that bracket your z (ignore minus sign).  Find the corresponding P's.
    e.g. z =2.111
                                                 z = 2.111
      z*         2.054 \/ 2.326
One-sided P  ...  .02     .01
Two-sided P  ...  .04     .02 


      So the P-value for your z is: between .02 and .01 (If it's a one sided test)
         &  between double those 2 p's--between .04 and .02 (If it's a two sided test)

    Test is significant at the bigger bracketing probability; not sig. at the smaller.
One sided: P-value is less than .02 and greater than .01
        Significant at the .02 level,not at the .01 level
Two sided: P-value is less than .04 and greater than .02
        Significant at the .04 level,not at the .02 level
If you have a specific demanded significance level, compare it with these levels.
            If  a test is significant at level b, then it is significant at every level bigger than b.
            If a test is Not significant at level d, then it is Not significant at every level smaller than d.
    "Significant at a":  probability of getting my results (again) by chance (if H0 is true) is less than (or =) a. My result is less common than a.
Results   Significant at    Not significant at
p bigger  .10      .05      .01      .005     .001 smaller
                        /\
                        P-value (one-sided)
                        z-value 2.054
z* smaller 1.282   1.645  | 2.326    2.576    3.091 bigger
  You can compare z directly to z* for your desired alpha.  z >z*?  Significant at that alpha.  
    The 2-sided is a bit tricky. 
Don't halve or double z's, ever!--it doesn't work!)

Ch. 15: "Significance tests use an elaborate vocabulary, but the basic idea is simple: an outcome that would "rarely" happen if a claim were true--is good evidence that the claim is NOT true." (p.363 top)
I'm not making it up that this idea is important:  Financial Times (influential and high-end British newspaper) this winter:
   (with formatting and pictures)  (without)   Statistical Significance: #10 of "The Ten Things Everyone Should Know About Science"

- - - - - - - More NEW STUFF- -...  -  - -
back to
Ch. 16, cautions:  (Same old: SRS, normal pop. or Xbar, sigma known)
>>How small a P is convincing evidence against H0?  (What alpha, to "Reject H0?)
  
--Is Ha surprising?  (Entrenched opinion is "for" H0 .  )  Need strong evidence (small P).
   --Is rejecting  H0 expensive?  Need strong evidence for Ha
        [May need to repeat experiment for doubters]
No sharp border between "significant" and "not"--though decisions may need to be made.

>>Statistical significance is not the same as practical significance ("clinical significance") 
      Tiny difference (from null value) can be statistically significant if sample size is large.
       Big difference may not be statistically significant if sample size is too small.
Do confidence intervals:  Estimate the size of the effect, not just yes/no of test..

More cautions next.


Sievers home  Math151-Fall08/Dayf38.htm  11pm 12/2/08
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.