Math 151 , Spring 06, Day 30 Friday, April 15 Hit reload  after class (changes made Mon. day 31)

Day 30 (Friday, April 15): Reading: Ch. 19, Confidence Intervals for Proportions.  Quite well written, packed with stuff.  Ch. Activstats Ch. 19 does a good job with confidence intervals for proportions.  Next, Ch. 20
Hand in (All D&V)   Hand in Day 30's hw on Wed. (due to # of questions Mon.)
From Day 29's page: finish if you didn't, hand in Mon.:
C) SD vs. SE: (p. 347, SE) a)  You know that in the simulation of flipping a fair coin 30 times, p=.5.  Calculate  the SD(p-hat). (Check with the bottom of Day 28's covered material. This is the same no matter what actual proportion you got, since it is based on the population proportion p=.5.
Now suppose you  actually got 21 heads, p-hat = 0.7.  And suppose you DON'T know p.  Your best estimate of the SD(p-hat) is found by putting 0.7 in to the formula for SD.  Do it.  This is SE(p-hat), the "standard error of the sample proportion" (It's different depending on your sample).  Compare with SD(p-hat), the "standard deviation of the sample proportion."
b)  Suppose you measure the heights of (a sample of n =) 25 daffodils and find they have a mean of 30 cm and a standard deviation of 2 cm.  The seller of the bulbs says that the population of this variety of daffodils has standard deviation 1.7 cm. 
Find  SD(sample mean) and SE(sample mean).

A)Having gotten your 30 slips of paper from the green shoebox and counted how many 1's you have--Calculate the proportion of 1's,  p-hat, for your sample (you did this already).  Also plug in p-hat (instead of p) to the formula for SD(p-hat): so if you got 12/30, p-hat = .4.  "q-hat" = 1 - p-hat = 1-.4 = .6.   SD formula: square root of (p ·q/n)= square root of (.4 ·.6/30) = square root of .008 = .089 (This is SE(p-hat), p. 347).  Bring these estimates of p and SD(p-hat) to class next time.  (Shoebox is outside my door if you still need to do it.)

New: 
B) If you haven't finished it in class: Take your p-hat for the 30 slips from the shoebox:
      add and subtract your estimate of the SD SE(p-hat) = , getting (two numbers)
      Multiply 1.96 times , and add and subtract the result to p-hat, getting   The thing you're adding and subtracting is the margin of error, ME, for a confidence interval for p.   You've found: # of 1's, p-hat. SE(p-hat). p-hat + SE. ME for 95% = 1.96SE.  p-hat + 1.96SE
      Bring these to class to add to the pooled class results.
<>Ch. 19p. 366
5, 6 Conclusions. Do these with the "Don't misstate..." section, pp. 361-2.
9 Cars
D).  Redesigned:  Use the Normal table to find z* for a 99% CI (This is the z* for which 99% of the area is (symmetrical) between -z* and +z*).  Find z* for a 90% CIAs you do problems 13, 11, 21, keep track of the z*'s you've found, and fill in this table:
  Confidence level:       90%      95%        98%       99%
   critical value  z*:        ?             ?              ?            ?
13 Teenage drivers 
11 Ghosts 
21 Rickets (98% CI)

Postpone the rest:  (but look at them!)
3, 4 Conditions

16 Local news--new prison?

On a separate page, save to be part of Day 31's assignment:
--
For the Ch. 19 problems below, do the parts you can and leave space for calculating n.

ME, C, n pp. 356-7, 361-3.  Problems p. 366ff.
7, 8  Relationships, ME, C, n
23 Deer ticks CI + find n.
25 Graduation--find n The answers in the back use the 25% as the p to plug in.  Redo part a (only) using 50% as the p (what you would do if you had no idea what p would be.). How many subjects do you "save" by using the 25%?
26 Hiring--find n
28 Hiring again--find n
29  Pilot study--find n Use the pilot study to guesstimate p.  Now: if you want to be  conservative, you'll use a p closer to .5 than this.  What sample size would you recommend if you (conservatively) use p = .25?  How many more cars will it "cost" to be conservative?

Read,
  to 
discuss
Optional 
Exam 2 "earn-back-points" work deadline delayed to Wednesday at class time.
Review sampling distribution of p-hat and y-bar. Day 29

Homework questions? Day 29
  Take your p-hat for the 30 slips from the shoebox:
      add and subtract your estimate of the SD, getting (two numbers)
      Multiply 1.96 times , and add and subtract the result to p-hat, getting 
    Add your results to the list circulating.
      Also draw your on the graph circulating    |------o------|

Next job: (Ch. 19) We usually DON'T KNOW the population parameter; use the statistic from our sample to ESTIMATE it.
YOU don't know the real proportion of 1's in the green shoebox.  Each of you has an estimate.  In "real life" you won't have a bunch of classmates with other samples; you'll only have your own. (Also, in this case, I know the real proportion. Not so in "real life")  How "good" is your estimate of the real p?

You know how the sampling distributions of sample proportions (and sample means) behave; we'll use that.  But we want to know how much they are spread, and for that we need the parameter p (and q) for proportions, (and the parameter sigma for means)
And we don't know those!  So we use the sample statistics p-hat and s in place of them.

Standard Error (p. 347):  When we estimate the standard deviation of a sampling distribution of a statistic, using the data from our sample, we call that the Standard Error  of the statistic.

Confidence Interval Estimate of p: (Chapter 19)
p-hatis your best guess at p, but it's bound to be wrong, almost always.  (see p. 356)
Make an interval estimate of p, by adding and subtracting a Margin of Error (ME)
   For instance, 39% + 2%.
Say "This interval contains (captures) the true proportion p."  Wrong.  It may or may not, and you have no way of knowing.

What we can do  is use a rule to construct the ME so that intervals made using the rule will contain p a known proportion of the time.  The "known proportion" is our confidence level.  If our rule makes ME's that capture p 95% of the time, we've made 95% confidence intervals.  "I have 95% confidence that this interval captures the true proportion p."

A level C confidence interval for a parameter  is an interval, usually of the form estimate + margin of error,
  found from data, in such a way that
C% of all random samples will yield intervals that capture the true parameter value.

Rule for ME:   ME = z* SE(p-hat), where z* is the "critical value" from the Standard Normal table that has C% of the area in the symmetric central interval between -z* and +z*.
Level C confidence interval for population proportion p:  "One -proportion  z-interval"

(Why it works:  later.)
Example:  You drew a sample of size n =30. p is the (unknown) proportion of 1's in the shoebox. You found the sample  proportion, and you calculated the SE for the sample proportion. Use z* =1.  Then C is about 68%.
Calculate:  if I got 12/30, p-hat = .400.  "q-hat" = 1 - p-hat = 1-.4 = .6.   SD formula: square root of (p ·q/n)= square root of (.4 ·.6/30) = square root of .008 = .089 = SE(p-hat).
68% Confidence Interval:  .400 + .089, or  (.311, .489).
Whose intervals captured the real proportion?  (Expect roughly 68% of you to do so.)

Usually, want higher Confidence Level:  90%, 95%, 99%....
     For 95%:  z* = (approximately 2) = 1.96
           (How?  95% in the middle.  2.5% in each tail.  .0250 is to the left of what?? -1.96.)
           && Shortcut: Table T, p. A-53, bottom two rows.  ("infinity" row is the Standard Normal values)
      z*·SE(p-hat) = 1.96·.089 = .174  95% Confidence Interval:  .400 + .174, or  (.226, .574).

Start here Monday:
Note Trade-off
:  Higher Confidence ---Wider interval (bigger ME. Less "precision")

Assumptions/conditions:  Assumes Central Limit Theorem for proportions is appropriate.
  Independence:¿¿Data values shouldn't affect each other.   ¿¿ Randomization helps!   ¿¿n < 10% of population.
  Sample Size:  Expect at least 10 successes and 10 failures (rephrase of  np, nq > 10)

  Bias?  Here's why we studied bias in sampling.  Biases or other bad sampling methods can make our computations worthless! p. 363.
Reprise:  Level C confidence interval for population proportion p: 
 "One -proportion  z-interval"   Chapter 19

Note Trade-off:  Higher Confidence ---Wider interval (bigger ME. Less "precision")
Desire:  Small Margin of Error ME + High confidence C.  p. 361-2
But they grow and shrink together: High confidence--Low precision ; High precision (small ME)--low confidence.

Way out:  increase n, the sample size.  (Shrinks SE)  How big a sample size for desired ME and C?   Next time:


 
Sievers home  Math151-Sp06/Daysp30.htm  4pm 4/12/06
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.