| Hand in: p. 396, CI 6.15abc calories--CI and nearness. 6.12 skewness Sample size p. 396 Postpone the rest (yes), but read
to see what will happen: 6.35 stating hypotheses.
State
in terms of parameters. |
Read,
discuss 6.28 lab scale This is one of the few situations where the idea of known sigma and unknown mu makes sense in practice; the mean on a scale tends to "slip". - - - - - - - - - - |
Optional - - - - - - - - - - |
Exam 2: Open book takehome. Available probably Friday Nov. 9 (Day 33). Due Before I leave for Thanksgiving break (Mon. Day 37)
Quiz Monday: Knowing and using: Binomial
distribution formula (p. 349 bottom)
mean and st. dev. for Binomial: X (count),
p-hat (proportion) (Summary p.350)
mean and st. dev. for X-bar from SRS of size
n (Summary p. 368)
Normal? Central limit theorem: says
yes for all of the above distributions, approximately, for n large.
If population(s) normal to start with, linear
combinations stay normal (including X-bar), mean and s.d. follow
algebra rules (as last quiz.)
Questions on quiz? I won't
require you to find Normal probabilities, just to find the means,
s.d.'s and shapes; probabilities only for binomial, using the formula.
You may leave answers in form ready for a simple calculator
(+,-,*,/,sqrt).
Sample problems for quiz (from done HW)
Binomial: 5.12, using the formula to do (b). Don't need to find the
prob. in (d).
5.25 Finding mean and s.d. of sample proportion, only.
Means: 5.35a, plus what is the *shape* of the distribution of the
xbars (hint: first 5 words of (b))
5.44, b: not calculating the probability, but finding the distribution
(shape, mean, s.d.) for Hole - Shaft.
- - - - -
If you want the solutions for the last quiz, email me; I'll send them
as a Word attachment to you.
- - - - - - - - - - - - - -
Homework questions? Day
29
Continuing 6.1
Common general pattern for CI: estimate + z*
sigmaestimate (e.g. for proportion p: p-hat +
z* sigmap-hat )
Level C confidence interval for the mean,
with margin of error m,
where
P(-z* < Z < z*) = C
Day 29: Review, and why does it
work? Your shoebox intervals:
5 of 10 contain the true mean (7)
Relation of
m
(margin
of error, half width),
C (confidence level), and n (sample
size), (and sigma)
C and z* get bigger and smaller together
(standard normal sketch)
Want
bigger
C? Must accept bigger m. Trade off confidence
(Big
C) vs. precision (small m).
OR: bigger n will
make smaller m. This makes sense: bigger sample size, more
info-->more
accurate estimate.
(square root makes it Expensive: have to quadruple n to halve m)
So smaller m can be achieved only
by
» accepting lower confidence level (smaller C),
» or by increasing
sample size (bigger n)
» or by decreasing sigma.
Visual example: Applet, Confidence Intervals.You can change the C, with the same xbars, see the m change. (n and sigma are fixed.) (50 intervals display. More will overwrite the drawings of old ones. But they accumulate numerically in the right panel till you reset.)
Sigma: We can't usually change it, it comes with the population. But bigger sigma (more population variability) will give bigger m (wider CI), i.e. less accuracy in prediction (for the same C and n). Ways to affect sigma: 1) measuring--be very careful. Better instrument? 2) Science projects directed by Prof. Wahl: Experiments on chickens bred to be "identical"--very low variability from one to the other. Therefore very small samples suffice. (Potential problem: can you generalize from these chickens to ordinary chickens?)
Planning ahead: Choose sample
size big enough to satisfy desired: margin of error, confidence
level.
Given C and m (and sigma), find n.
Method: Use C to find z*. Plug in to formula
for m, and solve for
n. Or memorize formula for n and plug in to it.
Note: z* sigma still on top. m and n change places, and
whole
thing is squared!
Round
up! If you get n = 5.06, you need a sample of size 6 to get
your
margin of error at least as short as you want.
(What about sigma? Estimate from a pilot study or
similar situation.)
..
Significance tests, Sec 6.2
Significance tests use an elaborate vocabulary, but the basic
idea is simple: a result that would "rarely" happen if a claim were
true--is good evidence that the claim is NOT true."
- - - - - - - - - - - - - - - - - - - - - - - -
Extended Standard Normal Table
(also linked off main 251 page>Weblinks. Normal Tails
z P(Z <
z)
P(Z > z) = same in scientific notation: E-03 = 10-3
3.00
.9986501019683700 .0013498980316301
1.35E-03
4.00
.9999683287581670 .0000316712418331
3.17E-05
5.00
.9999997133484280 .0000002866515718
2.87E-07
6.00
.9999999990134120 .0000000009865877
9.87E-10
7.00
.9999999999987200 .0000000000012799
1.28E-12
8.00
.9999999999999990 .0000000000000007
6.66E-16
Below this, machine can't compute. If your
assumptions lead you to an (almost) impossible z value, question your
assumptions!
(The basis of significance/hypothesis testing)
- - - - - - - - - - - - - - - - - - - - - -
Shoeboxes 1 and 2: I claim
the mean value for each shoebox is µ = 20. Am I telling you the
truth? I can't remember for sure. I do know that the
distribution in the box is normal, standard deviation is 4.
I do remember that if µ is not 20, then it is greater than
20. µ > 20.
Take a sample of size 4, find
xbar. Once for each shoebox!
How far from 20 is it?
Measure that in standard deviations of Xbar. (That is, find z
for xbar. Note s.d. for sampling dist of xbar is 2 (why?)
). Did you get a far-out value of z?
Got to here Friday
The game:
Before taking data, define
H0: "Null hypothesis" A claim or statement about the
population we would like to show is NOT true.
Stated usually as: A
parameter
= a particular value. H0: µ =1000
hrs. (Average lightbulb life.)
Ha: "Alternative hypothesis" A claim or statement about
the population we are trying to find evidence FOR.
Stated usually
as: The parameter is >, or <, (one-tail tests) --or NOT =
the
particular value. (two-tail)
Ha: µ > 1000 hrs. (Suppose
we
have a New process that makes them burn longer. We hope.)
Other possible alternatives: Ha:
µ < 1000 hrs. (Want evidence that Mfr.'s
claim
is inflated)
(two-sided=two-tail) Ha: µ
Not
= 1000 hrs. (Want evidence that Assembly line process is"off")
Take data. Calculate a Test statistic (result). Is it an unlikely result if H0 is true? Then that is evidence against H0 (and for Ha.)
Measuring the strength of the evidence against H0
(a common measuring stick for all distributions and parameters):
P-value of a test: The
probability,
computed assuming that H0 is true, that the observed test
statistic
would
take a value as extreme or more extreme than that actually observed
(if we could repeat taking-data again). p. 405.
The smaller the P-value, the stronger
the
data's evidence against H0 ( for Ha).
For a test of µ, using xbar (sigma known), the
P-value
is
--the area of the tail beyond the observed xbar, in the
direction of Ha(one tail)
--or twice that area (two-tail).
We usually calculate it by standardizing the observed xbar (assuming
H0 true) and looking in the normal table. (pp. 409-12)
Example: H0:
µ =1000 hrs. (Average
lightbulb life.) Design a competing bulb: Show
it's better.
Ha: µ > 1000 hrs.
Sample
of size n = 25. Population sigma = 150 hrs. Get
xbar = 1060 hrs. Are these bulbs better?
Draw a picture of the distribution of X-bars, assuming H0
is true; shade the region of P-value P(Xbar>1060)
z = (1060-1000) ÷ (150/5) = 2.
P(Z > 2) = .0228 More than 2% and less than 3%
chance of getting a result this high if we did it again.
Applet: P-value of a test of
significance" will animate/calculate this.
Different question: quality
control of "old" assembly line. Ha:
µ Not = 1000 hrs.
Same
picture, but P-value is symmetrical tails, P(Xbar < -1060) +
P(Xbar>1060). As "far out" in either direction.
P-value = 2P(Z>2) = .0456. More than 4% and less than 5%
chance of getting a result this far out if we did it again.
Shoebox: H0:
µ =20 Ha:
µ > 20 (if it's not 20, I know
it's greater.)
How far from 20 is your xbar?
Find
z for your xbar: (xbar - 20)/ 2, since s.d. of xbar is 2, and µ
=20 under the null H0.
Is this a far-out value of z?
What
is the probability of being farther out, i.e. being in the right
hand
tail beyond this z? That's the P-value.
Null and Alternative hypotheses:
Both
are statements about the population. But Not a symmetrical
situation.
Want evidence FOR the Alternative.
"New is better than old (maybe just different); change has
taken place; a relationship does exist," etc.
The Null is a "straw man" to
be knocked down; the position of the skeptic who has to won
over.
"New is no better/the same as the old. No change
has
taken place. No relationship exists."
Null Also needs to be something that, if true, gives
a specific distribution of the test statistic; so we can tell if
the
test statistic is "unusual" compared to what's expected under the Null
hypothesis.
So Null is Usually a specific parameter value that the test
statistic is an estimator for.
H0: µ =1000 hrs; test statistic Xbar
estimates
µ; so Xbar should be close to 1000 if H0 is
true.
In doing research, much effort should (& does) go into creating appropriate and testable hypotheses, and clarifying the data and test statistic that will be used.
Next: "Statistical significance."
| Sievers home | Math251-Fall07/Day2s30.htm | 7pm | 11/6/07 |