Math 151 , Fall 2005, Day 34 Mon. Nov. 14  Hit reload ...after class, ver. 2

Exam 3, Friday Nov. 18 (Day 36)  Covers work Chapter 14  thru today's HW.
  "sample exam" problems were given out Friday (link below)
Day 34: (Re)Reading: Chapter 20+21 thru p. 392 (Activstats is good here too.) Then continue (Alpha levels) through 395.  Lightly through Error types and Power . Read What can go wrong p. 401 and the rest. (SPSS won't do proportion computations, but some other programs do; it's good to have an idea what you might see, p. 402.)
Hand in (All D&V) 
Ch. 20, p.387 #7, Find the mistakes The first mistake is that both hypotheses are written with incorrect notation.  The second is that the alternative hypothesis is chosen wrongly.  I would write the company's goal as "more than 90% succeed"--I think that makes it clearer what structure to use.
Optional but good: MCS-353-71 (adapted):  Political candidates
To get their names on the ballot of a local election, political candidates often must obtain petitions bearing the signatures of a minimum number of registered voters.  In Pinellas County, Florida, a certain political candidate obtained petitions with 18,200 signatures (St. Petersburg Times, Apr. 7, 1992). To verify that the names on the petitions were signed by actual registered voters, election officials randomly sampled 100 of the names and checked each for authenticity.  Only two were invalid signatures. 
a) Is 98 out of 100 verified signatures sufficient to believe that more than 17,000 of the total 18,200 signatures are valid? 
b) Repeat part (a) if only 16,000 valid signatures are required. 
Based on Statistics, McClave and Sincich, pg. 353
c) Construct a 95% CI for the proportion of valid signatures.   Turn the endpoints of the CI into numbers of valid signatures by multiplying by 18,200.

Postpone the rest:
Two-sided:  For some reason, D&V don't model or assign any 2-sided problems (except #8).  We need to be used to them for later, so here are a few.
b) Use your green shoebox result to do a Two sided test against the null hypothesis p = .5.
Ch. 20, p.387  #8 Find the mistakes
From ActivStats, copied here:
 MRA-304-2:  Kerrich Coin Toss  While he was a prisoner of the Germans during World War II, the British statistician John Kerrich tossed a coin 10,000 times.  He got 5067 heads.  Take Kerrich's tosses to be an SRS from the population of all possible tosses of his coin.  If the coin is perfectly balanced, p = 0.5.  Is there reason to think that Kerrich's coin was not balanced? 

 TRE-396-9:  Store Checkout-Scanner Accuracy (adapted from Activstats HW):
In a study of store checkout-scanners, 1234 items were checked and 20 of them were found to be overcharges (based on data from "UPC Scanner Pricing Systems: Are They Accurate?" by Goodstein, Journal of Marketing, Vol. 58).  Before scanners were used, the overcharge rate was estimated to be about 1% . Based on these results, do scanners appear to give a different rate of overcharges than the old method of keying in the price?  (All items had to have individual price tags; scanning is much less labor-intensive.)  Do the steps, finding the P-value and stating a conclusion. 
= = = = = = = = = = 
"Significance" Ch. 21, p. 404 
1 P-value
3, 4 Alpha 
5, 6 Significant?

Read,
  to 
discuss 
Optional 
Exam 3 on Friday.Handout: Sample problems.
 I will give you, on the test, the formulas for SD(p-hat), SD(x-bar), n for given C and ME.  The rest you need to memorize.
I will give you the Z and the T table; but be ready to find the z* for a C not in the T table!
Hypothesis test questions on the exam: I will only ask one-sided hypothesis test questions.  "Moderately strong" evidence will be taken to mean a P-value of .05 or less.
Check the sample problems Handout to see what's not appropriate now.

Homework questions: Day 33
Continuing with Hypothesis testing (often called Significance testing)
Correction to Day 34 lecture:  on the board I wrote SE(p-hat) =  square root of (poqo/n) when calculating z's for a test.
   I should have written SD(p-hat) =  square root of (poqo/n).  We are assuming the real proportion in the population is po , so this is actually the standard deviation.  The Standard Error SE is when you are estimating the unknown standard deviation by using p-hat and q-hat, from the data, in place of the p and q you don't know.  Standard Error

Class spent working HW, mechanics and meaning of P-value.  Pick up here next.
Use CI to estimate true value.  Two-sided tests.    Notes:  Day 32

"Statistically significant" result (p.256):  An observed difference is too large for us to believe that it is likely to have occurred naturally (i.e. because of random variation.)  add: If H (no population difference) is truly the case.

Especially if we must make a decision to Reject Ho  (or retain it)---
  Set "benchmark" or "cutoff" level  "alpha"  "significance level":  (p. 393-4)
        If  P-value is less than alpha, we say the test is "significant at level alpha"
                      (Seeing the result (again) would be rarer than alpha, if the null hypothesis is true)

Different fields/ questions use different alphas.  If rejecting  Ho is startling, or costly, want a smaller alpha.
  Usually round numbers:  .10 (1 in 10), .05 (1 in 20), .01 (1 in 100), .001 (1 in a thousand) etc.
    Smoke detector : P = 0.4% = .004.  IS Significant at .10, .05, .01 levels.  NOT significant at .001 level.

Cautions:  (p. 394)
   P = .0499 "Is significant" but P = .0501 "is NOT significant" at .05 level.  Best to report P-values, whether or not you use an alpha to base a decision on.
  "Statistically significant" difference isn't necessarily either  *large*  or *of any importance*.  (With enough data, you can detect even  small deviations from the null hypothesis.)


Sievers home  Math151-Fall05/Dayf34.htm  3:30pm 11/14/05
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.