HW Day41: Reading Ch.
18: for flavor, big ideas. You're not required to be able to read
the t-table or do any "new" types of computation. You should
read to see that we're repeating the CI and test work, only with s
instead of sigma, and "t" instead of "z". First to p. 441, then the
rest. READ about Matched pairs (p 444-7)and
Robustness (p.447-9)! Check p. 451
18.15, 16, first, then 23, 24. Optional(hand
computation): 17, 18,19, 20, 21, 22
Read the Front page of the SPSS handout,
compare the output shown there with the results in the book with the
same example. Learn to read the output!
Review Ch.9, p. 219 and around
(Completely randomized experiment, especially with 2 treatments only),
and p. 224 (Matched pairs experimental design) .
Read Ch. 19, pp. 460-61 only! (Comparing 3 or more
independent groups requires Analysis of Variance, Ch. 25)
|
Hand in Wednesday . For the following problems, get the SPSS output here (Word file) or from the white
folder outside my door(yes!),
and use it to find the answers.) p. 453, 18.27 Sharks The P-value is
better than the .05 level mentioned. What is it (rounded to 3
decimal places)? In-class Final Exam problem:
Link active now! Do
this, with whatever help you need, and BRING your result to the
IN-CLASS Final. It is Problem #1 of the IN-CLASS exam. Paper copy
outside my door Wed 9:30. |
Read, to discuss |
Optional Homework marked postpone on Day 40 (t-procedures by hand) |
Homework
questions? Day 40
IF you use a particular alpha as a "cutoff" between
"reject H0
" and "failing to reject H0"--we can talk about probability
of rejecting H0 when it's true--and alpha is that
probability!
And we can talk about the "power" of the test to "detect" an
alternative of (say) 24: (probability of rejecting H0 correctly)
Applet:
"Power" (of a test to detect a difference). For
the shoebox situation, the power is .761.
There can be a lot of overlap between two populations,
but a small difference in means can be "statistically
significant", if the sample size is only big enough to detect it!
Ch.
18: Inference for population mean (realistic), a quick look.
What we actually
did: See Day 40 for more
detail (optional)
The most unrealistic of our "simple conditions" for
inference (p. 344) was that we knew the population standard deviation
sigma. We remove that condition here.
If we substitute s, the sample standard
deviation, for sigma, the population standard deviation, in our
Normal distribution formulas:
If n is quite big, the value of the sample standard
deviation will be close to the same as the value from the
population, and our work's approximately right.
But if n is smaller, estimating sigma by s will add
in extra variability! Problem solved by modifying the
Z-distribution!
Standard error of the (sample) mean =
Standard deviation of xbar, estimated from the data.
"Standard
error of the mean": s/sqrt(n) SEM, SEXbar,
etc.
When you estimate the standard
deviation of a statistic, the resulting
estimate is called the "standard error" of the statistic.
t-distribution
family: like standard normal only slightly fatter in the tails,
slightly more spread. Mean = 0. Symmetrical around 0.
t(k) is the
t distribution with k degrees of freedom.
Comparison with normal (Excel
graph)
Lower d.f.--fatter tails. Higher d.f.--more like standard normal.
Standardizing xbar with s instead of sigma results in
the one-sample t statistic,
t-distribution with n-1degrees of
freedom.
Conditions for inference about a mean: (p. 434)
++ SRS (or reasonable facsimile)
++ Population is Normal. (Can relax
to symmetric, single-peaked unless n "very small")
"One-sample"
t- procedures: SRS of size n. Use Xbar
to estimate µ.
Confidence intervals:
where
t* is a little larger than the corresponding z*.
(By hand, we'd get t* from n-1 row of Table C, instead of z* from
bottom row. But not going to do "by hand" this term.)
Significance tests: State hypotheses as in Ch. 15, find t from data, by:
Calculating the one-sample
t-statistic, using the null hypothesis value of µ (call
it µ0)
Then
proceed as if it were a "z", except we need a table for "t" instead of
Z (Table C; not going to do this term)
Mostly with real data, you'll let computer packages do these
computations.
Get Handout for
SPSS Ch. 18 (white folder outside my door)
Look at Front page output, decipher it. Note
Std.ErrorMean (standard error of the mean), t.
What SPSS calls "Sig. (2-tailed)" = "2-sided P-value"
If you have a one sided alternative, and your xbar is in the
correct direction, divide the SPSS Sig. by 2 to get P.
READ the rest:
MATCHED PAIRS t
procedures-- "Paired samples"(SPSS), "Paired
comparisons" Review:
Ch.9 p. 224
before--after, left
hand--right hand, Drug A vs. Drug B on the same person or on a
matched pair.
For each pair, find the difference
in the observed values. Then treat these differences as if
they are "the" data set, from a normal population, and do One-sample
t procedures.
Usually (always?) the null hypothesis
will be " µ = 0", there is "no
difference" between the treatments.
The cola loss-of-sweetness example (SPSS handout, example 18.3, p. 440)
was actually matched pairs: each "loss" number was a before-after
difference; they just didn't tell us the before numbers or the after
numbers.
ROBUST procedures: a confidence
interval or significance test is called robust if the
confidence level or P-value doesn't change very much when the
assumptions of the procedure are violated. pp. 447-450.
Assumption: Population is Normal.
t-procedures are quite robust against
nonnormality. But sensitive to outliers, bad skewness. Look at
data. Need SRS!!
Details: n <15
t ok if data roughly symmetric, single peak, no outliers. Don't
use if skewed or outliers. (How out is an outlier?)
n > 15 t ok unless there is strong skewness, or
outliers.
n > 40 or so: t ok even if there is skewness.
(Outliers? I suggest trying with and without them, see what
changes).
Matched-pairs data (differences) are often more normal in shape than the separate variables ("oddness" is often the same for both items in a pair, and disappears in subtraction. Another reason why this is a nice experimental design. )
If you can't do t-procedures, there are procedures involving
medians, or other approaches (Ch. 26)
Another situation which uses
t-statistics is the one in
Chapter 19
"Two-sample problems".
Two random samples, independent of each other, from distinct
populations. (Populations are normally distributed)
Often--comparing means from an experiment with two treatments (usually "control" and "treatment"). Review Ch.9, p. 219 and around.
/--- Group 1, n1---- Treatment 1---\
/
\
Random
asst.(?)
Compare results --"means"
\
/
\--- Group 2, n2---- Treatment 2---/
To examine the difference of the two means, µ1
- µ2, we look at the difference of the xbars.
We need the Standard Error of the difference xbar1
- xbar2
,
and then we can proceed as before, more or less (with some
adjustments.)
But we've run out of time....
| Sievers home | Math151-Fall07/Dayf41.htm | 1:30pm | 12/4/07 |