Math 151 , Day 41, Wednesday, May 7, 2008  .After class, corrections..  Hit reload .

HW Day41:  Finish Reading Ch. 18: First to p. 441, then the rest.  READ about Matched pairs (p 444-7)and Robustness (p.447-9)! Check (new) p. 451, 23, 24. 
Read the Front page of the SPSS handout, compare the output shown there with the results in the book with the same example.  Learn to read the output!

Review Ch.9, p. 219 and around (Completely randomized experiment, especially with 2 treatments only), and p. 224 (Matched pairs experimental design) .
Read Ch. 19, pp. 460-61 only!  (Comparing 3 or more independent groups requires Analysis of Variance, Ch. 25)

Hand in  Friday . 
t-procedures:

p. 455 18.36 b. Calcium and blood pressure CI . Conditions. See Robustness below, text pp. 447-450for details

For the following problems, use the SPSS output handed out or here (Word file) or from the white folder outside my door, and use it to find the answers.)

p. 432 18.10 Ancient air test Do these calculations to check the SPSS work: Show that the StdErrorMean on the output is the Std.deviation/sqrt(n), and that
Mean Difference = Mean -(Null hypothesis)TestValue, and
t = (Mean - (Null hypothesis)TestValue)/StdErrorMean.

p. 453, 18.27  Sharks  The P-value is better than the .05 level mentioned.  What is it (rounded to 3 decimal places)?   
p. 457, 18.41 Auto crankshafts. ALSO, what is the difference (in mm) between the mean for these 16 crankshafts, and the value it's supposed to be (224mm)?

In-class Final Exam problem: Link active, document corrected! Correction: for p.1, A-c, "Sketch and label the normal curve representing the distribution of xbars from all possible SRS’s of size 6 from Joe, if H0 is true.To label your graph, approximate sigma (the unknow population st.dev.) by s (the standard deviation calculated from the sample.)" Do this page, with whatever help you need, and BRING your result to the IN-CLASS Final. It is Problem #1 of the IN-CLASS exam. Paper copy in class, then outside my door..
= = = = = = = = = = = = = = = = = = = = = =
+ + + + + + + + + + +
Chapter 19 problems, with echos of Chapters 8 and 9.   Note that we can use the same analysis method whether data is from a sample (Example 19.1b)  or an experiment (Example 19.1 a, c) or an observational study.
p. 461, 19.1, 2, 3, 4.  For each, after deciding which design it is, tell if  the data comes from a sample, an observational study, or an experiment

Read, 
to discuss

Optional

SPSS: Do Homework Day 40 (t-procedures by hand) that had actual data:
Using SPSS for one-sample procedures (with SPSS Handout) A.  Redo the example on the front page of the handout, getting the result of example 18.3
B. The inclass example of Milk bacteria: (Day 40)do it on SPSS:
Dataset as SPSS file, Dataset as text (.dat) file  (If you import from the text file, remember to check that the Measure is Scale)
C. Matched pairs: Redo the results on the back page of the handout, getting the results of example 18.4.

Datasets in SPSS for HW problems, if you want to do them yourself:
AncientAir18_7n10.sav

Sharks18-27.sav
Crankshafts18-41.sav

Get  Review Exercise (FinalReviewSp08.doc).    . This is optional, but if you hand it in by the time you start the inclass final, it will count 50%; "in class" the other 50% of the Final Exam grade.  Get all the help you can find on the Review Exercise but make sure you understand and write the final result yourself.  Show your work, document your help!

Final exam: Tues. May 13, 7-10 pm (evening!)  Two sheets of notes. I'll give you the usual tables.
   Alternatives-- Tuesday afternoon, starting any time after 1:00; finishing by 5:30.  Wednesday morning, 9-12.
        Choose a time (clipboard) so I know when you're coming.
Difficulties? Get in touch with me ASAP!
             Full exam schedule is at   http://www.wells.edu/pdfs/finals.pdf
Exam 1 1/2 to 2 times as long as hourlies. Comprehensive but with special attention to the material covered since Exam 4. Reading but not creating SPSS.  Will certainly be broader in range than the Review Exercise; but most problems will be similar to the types on hour exams and HW.

Homework questions?  Day 40


Ch. 18:  Inference for population mean (realistic)
What we did: See Day 40 for more detail (optional) 

The most unrealistic of our "simple conditions" for inference (p. 344) was that we knew the population standard deviation sigma.  We remove that condition here.
If we substitute s, the sample standard deviation, for sigma, the population standard deviation, in our Normal distribution formulas:
    If n is quite big, the value of the sample standard deviation  will be close to the same as the value from the population, and our work's approximately right.
    But if n is smaller, estimating sigma by s will add in extra variability!   Problem solved by modifying the Z-distribution!
Standard error of the (sample) mean =    Standard deviation of xbar, estimated from the data.
  "Standard error of the mean":  s/sqrt(n) SEM, SEXbar, etc.
       When you estimate the standard deviation of a statistic, the resulting estimate is called the "standard error" of the statistic.

t-distribution family:  like standard normal Z only slightly fatter in the tails, slightly more spread.  Mean = 0. Symmetrical around 0.
          t(k) is the t distribution with k degrees of freedom.
 Comparison with normal (Excel graph)
Lower d.f.--fatter tails.  Higher d.f.--more like standard normal.

Standardizing xbar with s instead of sigma results in
   the one-sample t statistic, t-distribution with n-1degrees of freedom.

Conditions for inference about a mean: 
(p. 434)
    ++ SRS
(or reasonable facsimile)
    ++ Population is Normal. 
(Can relax to symmetric, single-peaked unless n "very small")

"One-sample" t- procedures: SRS of size n.  Use Xbar to estimate µ.
Confidence intervals:     where t* is a little larger than the corresponding z*.
(By hand, we get t* from n-1 row of Table C, instead of z* from bottom row. )

Significance tests:  State hypotheses as in Ch. 15, find t from data, by:
 Calculating the one-sample t-statistic, using the null hypothesis value of µ (call it µ0)
Then proceed as if it were a "z", except we need a table for "t" instead of Z (Table C)

New today:
Mostly with real data, you can let computer packages do these computations. Excel t-procedures
Get  SPSS Handout for Ch. 18 ( white folder outside my door)
  
Look at Front page output, decipher it. 
Note Std.ErrorMean (standard error of the mean), t.
What SPSS calls "Sig. (2-tailed)" = "2-sided P-value"
If you have a one sided alternative, and your xbar is in the correct direction, divide the SPSS Sig. by 2 to get P.


MATCHED PAIRS t procedures-- "Paired samples"(SPSS), "Paired comparisons"
Review: Ch.9 p. 224
   before--after, left hand--right hand, Drug A vs. Drug B on the same person or on a matched pair.
For each pair, find the difference in the observed values.  Then treat these differences as if they are "the" data set, from a normal population, and do One-sample t procedures.
Usually (always?) the null hypothesis will be " µ = 0", there is "no difference" between the treatments.
The cola loss-of-sweetness example (SPSS handout, example 18.3, p. 440) was actually matched pairs: each "loss" number was a before-after difference; they just didn't tell us the before numbers or the after numbers.
    By hand, See Day 40, bottom

 

ROBUST procedures:  a confidence interval or significance test is called robust if the confidence level or P-value doesn't change very much when the assumptions of the procedure are violated.  pp. 447-449.   Assumption:  Population is Normal.
(Didn't quite finish talking about this Wed. Read the details yourself)
t-procedures are quite robust against nonnormality. But sensitive to outliers, bad skewness. Look at data.  Need SRS!!
 Details:  n <15   t ok if data roughly symmetric, single peak, no outliers.  Don't use if skewed or outliers.  (How out is an outlier?)
              n > 15  t ok unless there is strong skewness, or outliers.
              n > 40 or so:  t ok even if there is skewness.  (Outliers?  I suggest trying with and without them, see what changes).  

Matched-pairs data (differences) are often more normal in shape than the separate variables ("oddness" is often the same for both items in a pair, and disappears in subtraction.  Another reason why this is a nice experimental design. )

If you can't do t-procedures, there are procedures involving medians, or other approaches (Ch. 26)


 Another situation which uses t-statistics is the one in Chapter 19
"Two-sample problems".  Two random samples,  independent of each other, from distinct  populations. (Populations are normally distributed)
Often--comparing means from an experiment with two treatments (usually  "control" and "treatment"). Review Ch.9, p. 219 and around.
                 /--- Group 1, n1---- Treatment 1---\
               /                                    \
 Random asst.(?)                                       Compare results --"means"
               \                                    /
                \--- Group 2, n2---- Treatment 2---/
To examine  the difference of the  two means, µ1 - µ2, we look at the difference of the xbars. 
We need the Standard Error of the difference  xbar1 - xbar2 , and then we can proceed as before, more or less (with some adjustments.)
But enough already....


Sievers home  Math151-Sp08/Days41.htm  1pm 5/7/08
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.