HW Day41: Finish
Reading Ch. 18: First to p. 441, then the rest. READ about
Matched pairs (p 444-7)and Robustness (p.447-9)! Check
(new) p. 451, 23, 24.
Read the Front page of the SPSS handout,
compare the output shown there with the results in the book with the
same example. Learn to read the output!
Review Ch.9, p. 219 and around (Completely randomized
experiment, especially with 2 treatments only), and p. 224 (Matched
pairs experimental design) .
Read Ch. 19, pp. 460-61 only! (Comparing 3 or more
independent groups requires Analysis of Variance, Ch. 25)
|
Hand in Wednesday .
p. 455 18.36 b. Calcium and blood pressure CI . Conditions. See Robustness below, text pp. 447-450for details For the following problems, use the SPSS output handed out or here
(Word file) or from the white folder outside my door, and use it
to find the answers.) p. 453, 18.27 Sharks The P-value is
better than the .05 level mentioned. What is it (rounded to 3
decimal places)? In-class Final Exam
problems 1 and 2: Do these
pages, with whatever help you need, and BRING your result to the
IN-CLASS Final. It is Problems #1 and #2 of the IN-CLASS exam.
Document who you work with. Paper copy in class, then available
outside my door.. |
Read, to discuss |
Optional SPSS: Do Homework Day 40 (t-procedures by hand) that had
actual data: |
Final Exam Wed. Dec. 17, 9-12
a.m. Closed
book, Bring two (2) sheets of notes. (I'll provide Tables A and C)
If this time is a problem for you, please email me very very soon.
Alternatives--Monday
afternoon, Tuesday afternoon also. Preferably starting early, 12:30
or after. (But don't make yourself uncomfortable!) I hope to finish
by 4, not drive home in the dark.
Full exam schedule is at http://www.wells.edu/pdfs/finals.pdf
Registrar's page with link to this and other good stuff:
http://www.wells.edu/academic/regist.htm
Signup on attendance clipboard Today! (or Wednesday). Further difficulties?
Get in touch with me ASAP!
Exam 1 1/2 to 2 times as long as hourlies. Comprehensive but with special attention to the material covered
since Exam 4. (Ch. 16, 18, 17 review.) Reading but not creating
SPSS. Most problems will be similar to the types on hour exams
and HW. Handout: TAKEHOME PORTION: Do
this, with whatever help you need, and BRING your result to the IN-CLASS Final.
It is Problems #1 and #2 of the IN-CLASS exam. Paper copy in class, then
outside my door..
Late HW accepted up to when you begin your Final Exam.
Homework questions? Day 40
Ch. 18: Inference for population mean (realistic)
What we did: See
Day 40 for more detail (optional)
The most unrealistic of our "simple conditions" for
inference (p. 344) was that we knew the population standard deviation
sigma. We remove that condition here.
If we substitute s, the sample standard
deviation, for sigma, the population standard deviation, in our
Normal distribution formulas:
If n is quite big, the value of the sample standard
deviation will be close to the same as the value from the
population, and our work's approximately right.
But if n is smaller, estimating sigma by s will add
in extra variability! Problem solved by modifying the
Z-distribution!
Standard error of the (sample) mean =
Standard deviation of xbar, estimated from the data.
"Standard
error of the mean": s/sqrt(n) SEM, SEXbar,
etc.
When you estimate the standard
deviation of a statistic, the resulting
estimate is called the "standard error" of the statistic.
t-distribution
family: like standard normal Z only slightly
fatter in the tails, slightly more spread. Mean = 0. Symmetrical
around 0.
t(k) is the
t distribution with k degrees of freedom.
Comparison with normal (Excel
graph)
Lower d.f.--fatter tails. Higher d.f.--more like standard normal.
Standardizing xbar with s instead of sigma results in
the one-sample t statistic,
t-distribution with n-1degrees of
freedom.
Conditions for inference about a mean: (p. 434)
++ SRS (or reasonable facsimile)
++ Population is Normal. (Can relax
to symmetric, single-peaked unless n "very small")
"One-sample"
t- procedures: SRS of size n. Use Xbar
to estimate µ.
Confidence intervals:
where
t* is a little larger than the corresponding z*.
(By hand, we get t* from n-1 row of Table C, instead of z* from
bottom row. )
Significance tests: State hypotheses as in Ch. 15, find t from data, by:
Calculating the one-sample
t-statistic, using the null hypothesis value of µ (call
it µ0)
Then
proceed as if it were a "z", except we need a table for "t" instead of
Z (Table C)
New today:
Mostly with real data, you can let computer packages do these
computations. Excel
t-procedures
Get SPSS
Handout for Ch. 18 ( white folder
outside my door) (version handed out
last time has correct output, instructions to do it are a little
outdated.)
Look at Front page output, decipher it. Note
Std.ErrorMean (standard error of the mean), t.
What SPSS calls "Sig. (2-tailed)" = "2-sided P-value"
If you have a one sided alternative, and your
xbar is in the correct direction, divide the SPSS Sig. by 2
to get P.
MATCHED PAIRS t
procedures-- "Paired samples"(SPSS), "Paired
comparisons" Review:
Ch.9 p. 224
before--after, left
hand--right hand, Drug A vs. Drug B on the same person or on a
matched pair.
For each pair, find the difference
in the observed values. Then treat these differences as if
they are "the" data set, from a normal population, and do One-sample
t procedures.
Usually (always?) the null hypothesis
will be " µ
= 0", there is "no difference"
between the treatments.
The cola loss-of-sweetness example (SPSS handout, example 18.3, p. 440)
was actually matched pairs: each "loss" number was a before-after
difference; they just didn't tell us the before numbers or the after
numbers.
By hand, See Day
40, bottom
ROBUST
procedures: a confidence interval or significance test is
called robust if the confidence level or P-value doesn't change
very much when the assumptions of the procedure are violated. pp.
447-449. Assumption: Population is
Normal.
(..)
t-procedures are quite robust against
nonnormality. But sensitive to outliers, bad skewness. Look at
data. Need SRS!!
Details: n <15
t ok if data roughly symmetric, single peak, no outliers. Don't
use if skewed or outliers. (How out is an outlier?)
n > 15 t ok unless there is strong skewness, or
outliers.
n > 40 or so: t ok even if there is skewness.
(Outliers? I suggest trying with and without them, see what
changes).
Matched-pairs data (differences) are often more normal in shape than the separate variables ("oddness" is often the same for both items in a pair, and disappears in subtraction. Another reason why this is a nice experimental design. )
If you can't do t-procedures, there are procedures involving
medians, or other approaches (Ch. 26)
Another situation which uses
t-statistics is the one in
Chapter 19
"Two-sample problems".
Two random samples, independent of each other, from distinct
populations. (Populations are normally distributed)
Often--comparing means from an experiment with two treatments (usually "control" and "treatment"). Review Ch.9, p. 219 and around.
/--- Group 1, n1---- Treatment 1---\
/
\
Random
asst.(?)
Compare results --"means"
\
/
\--- Group 2, n2---- Treatment 2---/
To examine the difference of the two means, µ1
- µ2, we look at the difference of the xbars.
We need the Standard Error of the difference xbar1
- xbar2
,
and then we can proceed as before, more or less (with some
adjustments.)
But enough already....
| Sievers home | Math151-Fall08/Dayf41.htm | 11am | 12/10/08 |