MATH 251, Probability and Statistics I, Fall 2005, Fri. Nov. 4, Day 30After Class, corrected

  Reading:  Finish 6.1.  Start 6.2.  I think it's very well presented but there are a lot of ideas, and it will take several times through.  First time through: Read to p. 406,  skim "statistical significance" (406-top of 409).  Read "Tests for a population mean" (I'll be doing this early)(409-12), skim the rest.  I'll do the "tests for a population mean"  and then discuss the more general patterns a bit, then the Significance point of view.    Ahead, 6.3, a brief glance at 6.4, then 7.1
Hand in: 
p. 396, CI 
6.15abc calories--CI and nearness.
6.12 skewness

Sample size p. 396
6.22, 6.23, salaries
6.25, 6.26 bone study, + with dropouts 
6.28 lab scale
- - - - - - - - - - - 
Beginning 6.2, p. 416ff.
Do A, details below

Postpone the rest:
6.54 SSHA--1-sided test of mean 
6.60 Applet--P-value: Also add to your list xbar = -.2, -.4.  Do as instructed; then change the null hypothesis to 2-sided and re-make the list, confirming that your p-values are double the p-values for |x-bar|.  (BTW, the second alternative, generating a sample from the given distribution, has been mis-programmed!  It takes a mean from a sample from the distribution  of X-bars, not the population, so gives wrong answers.)
6.38, 6.39 shading & finding P-values

6.35 stating hypotheses.   State in terms of parameters.
6.34 stating hypotheses.  (For a you have to think of a baseline or control.  State in terms of parameters.
6.33ab wrong statements
6.32 wrong statements

Read, discuss 
6.28 lab scale This is one of the 
few situations where the idea 
of known sigma and unknown 
mu makes sense; the mean on
 a scale tends to "slip". 

- - - - - - - - - - 
Postpone
6.36 stating hypotheses

Optional 
 

- - - - - - - - - - 
Postpone
More practice: 
6.54 onesided.

A.  For each of your samples of size 4  from the two shoeboxes *(keep track of color of paper/ which box they came from!):      test H0:  µ=20 vs.  Ha:   µ  > 20.  Do it like this:
--Find xbar.
--Find z (assuming the population mean is 20, and the population s.d. is 4, so the s.d. of xbar is 2). z = (20-xbar)/2
--Use the standard normal table to find the probability to the right of your z.  (this is the P-value)
--Is your P-value smaller (less likely) than alpha = .10?  If so, your result is "significant at the alpha = .10 level"
--Do you think the box really has mean 20?
Be ready with these answers to pool and compare next time.
*I'll leave the boxes outside my door, so If you didn't get your samples in class for any reason, you can come and get them.
= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
New Shoeboxes:  Take 4 from each, write them down (White from green box, yellow from red-top box) For HW, find means (for each box separately), and the rest of the computations (above).)  Know  which box!  Does that box have population mean 20, or some number >20?

Exam 2: Open book takehome.  Available probably Friday Nov. 11 (Day 33).  Due Before I leave for Thanksgiving break (Day 37)

Quiz Monday: Knowing and using: Binomial distribution formula (p. 349 bottom)
mean and st. dev. for Binomial:  X (count), p-hat (proportion)   (Summary p.350)
mean and st. dev. for X-bar from SRS of size n   (Summary p. 368)
Normal?  Central limit theorem: says yes for all of the above distributions, approximately, for n large.
If population(s) normal to start with, linear combinations stay normal (including X-bar), mean and s.d. follow algebra rules (as last quiz.)
Questions on quiz?

Homework questions? Day 29
Continuing 6.1
Common general pattern for CI:   estimate + z* sigmaestimate (e.g. for proportion p:  p-hat + z* sigmap-hat )
Level C confidence interval for the mean, with margin of error m,
where P(-z* < Z < z*) = C
Day 29: why does it work?

Relation of m (margin of error, half width), C (confidence level), and n (sample size), (and sigma)
    C and z* get bigger and smaller together (standard normal sketch)
Want bigger C?  Must accept bigger m.  Trade off confidence (Big C)  vs. precision (small m).
    OR: bigger n will make smaller m. This makes sense: bigger sample size, more info-->more accurate estimate.
            (square root makes it Expensive: have to quadruple n to halve m)
    So smaller m can be achieved only by   » accepting lower confidence level (smaller C),
        » or by increasing sample size (bigger n)               » or by decreasing sigma.

Visual example:  Applet,  Confidence Intervals.You can change the C, with the same xbars, see the m change.  (n and sigma are fixed.) (50 intervals display. More will overwrite the drawings of old ones.  But they accumulate numerically  in the right panel till you reset.)

    Sigma:  We can't usually change it, it comes with the population.  But bigger sigma (more population variability) will give bigger m (wider CI), i.e. less accuracy in prediction (for the same C and n).  Ways to affect sigma: 1) measuring--be very careful. Better instrument?  2) Science  projects directed by Prof. Wahl:  Experiments on chickens bred to be "identical"--very low variability from one to the other.  Therefore very small samples suffice.

Planning ahead:  Choose sample size big enough to satisfy desired: margin of error, confidence level.
Given C and m (and sigma), find n.
   Method:  Use C to find z*.  Plug in to formula for m, and solve for n.  Or memorize formula for n and plug in to it.
     Note:  z* sigma still on top.  m and n change places, and whole thing is squared!
           Round up!  If you get n = 5.06, you need a sample of size 6 to get your margin of error at least as short as you want.
   (What about sigma?  Estimate from a pilot study or similar situation.)
Start here Monday
Significance tests, Sec 6.2
Significance tests use an elaborate vocabulary, but the basic idea is simple: a result that would "rarely" happen if a claim were true--is good evidence that the claim is NOT true."
- - - - - - - - - - - - - - - - - - - - - - - -
Extended Standard Normal Table
  z         P(Z < z)                         P(Z > z)   = same in scientific notation: E-03 = 10-3
3.00   .9986501019683700    .0013498980316301    1.35E-03
4.00   .9999683287581670    .0000316712418331    3.17E-05
5.00   .9999997133484280    .0000002866515718    2.87E-07
6.00   .9999999990134120    .0000000009865877    9.87E-10
7.00   .9999999999987200    .0000000000012799    1.28E-12
8.00   .9999999999999990    .0000000000000007    6.66E-16
Below this, machine can't compute. If your assumptions lead you to an (almost) impossible z value, question your assumptions!
(The basis of significance/hypothesis testing)
- - - - - - - - - - - - - - - - - - - - - -
Shoeboxes 1 and 2:  I claim the mean value  for each shoebox is µ = 20.  Am I telling you the truth?  I can't remember for sure.  I do know that the distribution in the box is normal,  standard deviation is 4.
I do remember that if  µ is not 20, then it is greater than 20. µ > 20.
Take a sample of size 4, find xbar.   Once for each shoebox!
How far from 20 is it?  Measure that in standard deviations of Xbar. (That is, find z for xbar.  Note s.d. for sampling dist of xbar is 2 (why?) ).  Did you get a far-out value of z?

The game:
Before taking data, define
H0: "Null hypothesis" A claim or statement about the population we would like to show is NOT true.
       Stated usually as:  A parameter = a particular value.  H0: µ =1000 hrs.  (Average lightbulb life.)
Ha: "Alternative hypothesis" A claim or statement about the population we are trying to find evidence FOR.
          Stated usually as: The parameter  is >, or <, (one-tail tests) --or NOT = the particular value. (two-tail)
            Ha:   µ  > 1000 hrs. (Suppose we have a New process that makes them burn longer. We hope.)
     Other possible alternatives: Ha:   µ  < 1000 hrs.  (Want evidence that Mfr.'s claim is inflated)
             (two-sided=two-tail) Ha:   µ  Not = 1000 hrs.  (Want evidence that Assembly line process is"off")

Take data.  Calculate a Test statistic (result).  Is it an unlikely result if  H0 is true?  Then that is evidence against H0 (and for Ha.)

Measuring the strength of the evidence against H0  (a common measuring stick for all distributions and parameters):
P-value of a test:  The probability, computed assuming that H0 is true, that the observed test statistic would take a value as extreme or more extreme than that actually observed (if we could repeat taking-data again).  p. 405.
    The smaller the P-value, the stronger the data's evidence against H0 ( for Ha).

For a test of µ, using xbar (sigma known), the P-value is
--the area of the tail beyond the observed xbar, in the direction of Ha(one tail)
--or twice that area (two-tail).
We usually calculate it by standardizing the observed xbar (assuming H0 true) and looking in the normal table. (pp. 409-12)

Example:  H0: µ =1000 hrs.  (Average lightbulb life.)  Design a competing bulb:   Show it's better.
                 Ha:   µ  > 1000 hrs.
        Sample of size n = 25.  Population sigma = 150 hrs. Get xbar = 1060 hrs.  Are these bulbs better?
            Draw a picture of  the distribution of X-bars, assuming H0  is true;  shade the region of P-value P(Xbar>1060)
                 z = (1060-1000) ÷ (150/5) = 2.
         P(Z > 2) =  .0228  More than  2% and less than 3% chance of getting a result this high if we did it again.
   Different question:  quality control of "old" assembly line.   Ha:   µ  Not = 1000 hrs.
      Same picture, but P-value is symmetrical tails, P(Xbar < -1060) + P(Xbar>1060).  As "far out" in either  direction.
         P-value = 2P(Z>2) = .0456.  More than 4% and less than 5% chance of getting a result this far out if we did it again.

Shoebox:  H0: µ =20       Ha:   µ  > 20    (if it's not 20, I know it's greater.)
How far from 20 is your xbar? Find z for your xbar:  (xbar - 20)/ 2, since s.d. of xbar is 2, and µ =20 under the null H0.
Is this a far-out value of z? What is the probability of being farther out, i.e. being in the right hand  tail beyond this z?  That's the P-value.

Null and Alternative hypotheses:    Both are statements about the population.  But Not a symmetrical situation.
    Want evidence FOR the Alternative "New is better than old (maybe just different); change has taken place; a relationship does exist," etc.
     The Null is a "straw man" to be knocked down; the position of the skeptic who has to won over.  "New is no better/the same as the old.  No change has taken place.  No relationship exists."
Null Also needs to be something that, if true, gives a specific distribution of the test statistic; so we can tell if the test statistic is "unusual" compared to what's expected under the Null hypothesis.
So Null is Usually a specific parameter value that the test statistic is an estimator for.
             H0: µ =1000 hrs;   test statistic Xbar estimates µ; so Xbar should be close to 1000 if H0  is true.

In doing research, much effort should (& does) go into creating appropriate and testable hypotheses, and clarifying the data and test statistic that will be used.

Next:  "Statistical significance."


Sievers home  Math251-Fall05/Dayps30.htm  11am   11/4/05
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.