Math 151 , Day 41, Monday, Dec.3, 2007  .After class,HW&Text Finally complete..  Hit reload .

HW Day41:  Reading Ch. 18: for flavor, big ideas. You're not required to be able to read the t-table or do any "new" types of computation.   You should read to see that we're repeating the CI and test work, only with s instead of sigma, and "t" instead of "z". First to p. 441, then the rest.  READ about Matched pairs (p 444-7)and Robustness (p.447-9)! Check p. 451 18.15, 16, first, then 23, 24. Optional(hand computation):  17, 18,19, 20, 21, 22
Read the Front page of the SPSS handout, compare the output shown there with the results in the book with the same example.  Learn to read the output!

Review Ch.9, p. 219 and around (Completely randomized experiment, especially with 2 treatments only), and p. 224 (Matched pairs experimental design) .
Read Ch. 19, pp. 460-61 only!  (Comparing 3 or more independent groups requires Analysis of Variance, Ch. 25)

Hand in  Wednesday . 
t-procedures:

p. 434, 18.1 and 2,  s<-->standard error
p. 432 18.25 ONLY compute the correct t-statistic, NOT compare with Table C or P-value. read carefully.  (one or more t-values was incorrectly computed.)

For the following problems, get the SPSS output here (Word file) or from the white folder outside my door(yes!), and use it to find the answers.)
p. 437 18.7 Ancient air  CI Make a dotplot or stemplot to examine the data.  It will look somewhat skewed, but with so little data this kind of scatteredness can happen easily from a normal distribution.  We should report that the skewness may make our CI only approximately accurate. 

p. 432 18.10 Ancient air test ALSO do these calculations to check the SPSS work: Show that the StdErrorMean is the Std.deviation/sqrt(n), and that
Mean Difference = Mean -(Null hypothesis)TestValue, and
t = (Mean - (Null hypothesis)TestValue)/StdErrorMean.

p. 453, 18.27  Sharks  The P-value is better than the .05 level mentioned.  What is it (rounded to 3 decimal places)?   
p. 457, 18.41 Auto crankshafts. ALSO, what is the difference (in mm) between the mean for these 16 crankshafts, and the value it's supposed to be (224mm)?

In-class Final Exam problem: Link active now!  Do this, with whatever help you need, and BRING your result to the IN-CLASS Final. It is Problem #1 of the IN-CLASS exam. Paper copy outside my door Wed 9:30.
= = = = = = = = = = = = = = = = = = = = = =
+ + + + + + + + + + + YES
Chapter 19 problems, with echos of Chapters 8 and 9.   Note that we can use the same analysis method whether data is from a sample (Example 19.1b)  or an experiment (Example 19.1 a, c) or an observational study.
p. 461, 19.1, 2, 3, 4.  For each, after deciding which design it is, tell if  the data comes from a sample, an observational study, or an experiment

Read, 
to discuss

Optional

Homework marked postpone on Day 40 (t-procedures by hand)
Using SPSS for one-sample procedures (with SPSS Handout) A.  Redo the example on the front page of the handout, getting the result of example 18.3
B. The inclass example of Milk bacteria: (Day 40)do it on SPSS:
Dataset as SPSS file, Dataset as text (.dat) file  (If you import from the text file, remember to check that the Measure is Scale)
C. Matched pairs: Redo the results on the back page of the handout, getting the results of example 18.4.

Datasets in SPSS for HW problems, if you want to do them yourself:
AncientAir18_7n10.sav

Sharks18-27.sav
Crankshafts18-41.sav

Get  Review Exercise (YouthstatsReview.doc). YouthstatsReview (SPSS) file     . This is optional, but if you hand it in by the time you start the inclass final, it will count 50%; "in class" the other 50% of the final Exam grade.  Get all the help you can find on the Review Exercise but make sure you understand and write the final result yourself.  Show your work!

Final exam: Thurs. Dec. 13, 9-12am. Comprehensive, closed book, 2 sheets of notes.
Alternative time--Tueday Dec. 11 morning/afternoon (any time from 9 on, but must finish before 5 sharp!)  Signup on attendance clipboard.  Further difficulties?  Get in touch with me ASAP!Other arrangements, please confirm with me!
Full exam schedule is at   http://www.wells.edu/academic/dates.htm#exams  Exam 1 1/2 to 2 times as long as hourlies. Comprehensive but with special attention to the material covered since Exam 4. Reading but not creating SPSS.  Will certainly be broader in range than the Review Exercise; but most problems will be similar to the types on hour exams and HW.

Homework questions?  Day 40
IF you use a particular alpha as a "cutoff" between "reject H0 " and "failing to reject H0"--we can talk about probability of  rejecting H0 when it's true--and alpha is that probability
And we can talk about the "power" of the test to "detect" an alternative of (say) 24:  (probability of rejecting H0 correctly)
       Applet: "Power" (of a test to detect a difference).   For  the shoebox situation, the power is .761.
There can be a lot of overlap between two populations, but a small difference in means can be "statistically significant", if the sample size is only big enough to detect it!


Ch. 18:  Inference for population mean (realistic), a quick look.
What we actually did: See Day 40 for more detail (optional) 

The most unrealistic of our "simple conditions" for inference (p. 344) was that we knew the population standard deviation sigma.  We remove that condition here.
If we substitute s, the sample standard deviation, for sigma, the population standard deviation, in our Normal distribution formulas:
    If n is quite big, the value of the sample standard deviation  will be close to the same as the value from the population, and our work's approximately right.
    But if n is smaller, estimating sigma by s will add in extra variability!   Problem solved by modifying the Z-distribution!
Standard error of the (sample) mean =    Standard deviation of xbar, estimated from the data.
  "Standard error of the mean":  s/sqrt(n) SEM, SEXbar, etc.
       When you estimate the standard deviation of a statistic, the resulting estimate is called the "standard error" of the statistic.

t-distribution family:  like standard normal only slightly fatter in the tails, slightly more spread.  Mean = 0. Symmetrical around 0.
          t(k) is the t distribution with k degrees of freedom.
 Comparison with normal (Excel graph)
Lower d.f.--fatter tails.  Higher d.f.--more like standard normal.

Standardizing xbar with s instead of sigma results in
   the one-sample t statistic, t-distribution with n-1degrees of freedom.

Conditions for inference about a mean: 
(p. 434)
    ++ SRS
(or reasonable facsimile)
    ++ Population is Normal. 
(Can relax to symmetric, single-peaked unless n "very small")

"One-sample" t- procedures: SRS of size n.  Use Xbar to estimate µ.
Confidence intervals:     where t* is a little larger than the corresponding z*.
(By hand, we'd get t* from n-1 row of Table C, instead of z* from bottom row. But not going to do "by hand" this term.)

Significance tests:  State hypotheses as in Ch. 15, find t from data, by:
 Calculating the one-sample t-statistic, using the null hypothesis value of µ (call it µ0)
Then proceed as if it were a "z", except we need a table for "t" instead of Z (Table C; not going to do this term)

Mostly with real data, you'll let computer packages do these computations.
Get Handout for SPSS Ch. 18 (white folder outside my door)
  
Look at Front page output, decipher it. 
Note Std.ErrorMean (standard error of the mean), t.
What SPSS calls "Sig. (2-tailed)" = "2-sided P-value"
If you have a one sided alternative, and your xbar is in the correct direction, divide the SPSS Sig. by 2 to get P.

READ the rest:
MATCHED PAIRS t procedures-- "Paired samples"(SPSS), "Paired comparisons"
Review: Ch.9 p. 224
   before--after, left hand--right hand, Drug A vs. Drug B on the same person or on a matched pair.
For each pair, find the difference in the observed values.  Then treat these differences as if they are "the" data set, from a normal population, and do One-sample t procedures.
Usually (always?) the null hypothesis will be " µ = 0", there is "no difference" between the treatments.
The cola loss-of-sweetness example (SPSS handout, example 18.3, p. 440) was actually matched pairs: each "loss" number was a before-after difference; they just didn't tell us the before numbers or the after numbers.

ROBUST procedures:  a confidence interval or significance test is called robust if the confidence level or P-value doesn't change very much when the assumptions of the procedure are violated.  pp. 447-450.   Assumption:  Population is Normal.
t-procedures are quite robust against nonnormality. But sensitive to outliers, bad skewness. Look at data.  Need SRS!!
 Details:  n <15   t ok if data roughly symmetric, single peak, no outliers.  Don't use if skewed or outliers.  (How out is an outlier?)
              n > 15  t ok unless there is strong skewness, or outliers.
              n > 40 or so:  t ok even if there is skewness.  (Outliers?  I suggest trying with and without them, see what changes).  

Matched-pairs data (differences) are often more normal in shape than the separate variables ("oddness" is often the same for both items in a pair, and disappears in subtraction.  Another reason why this is a nice experimental design. )

If you can't do t-procedures, there are procedures involving medians, or other approaches (Ch. 26)


 Another situation which uses t-statistics is the one in Chapter 19
"Two-sample problems".  Two random samples,  independent of each other, from distinct  populations. (Populations are normally distributed)
Often--comparing means from an experiment with two treatments (usually  "control" and "treatment"). Review Ch.9, p. 219 and around.
                 /--- Group 1, n1---- Treatment 1---\
               /                                    \
 Random asst.(?)                                       Compare results --"means"
               \                                    /
                \--- Group 2, n2---- Treatment 2---/
To examine  the difference of the  two means, µ1 - µ2, we look at the difference of the xbars. 
We need the Standard Error of the difference  xbar1 - xbar2 , and then we can proceed as before, more or less (with some adjustments.)
But we've run out of time....


Sievers home  Math151-Fall07/Dayf41.htm  1:30pm 12/4/07
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.