| Hand in Wednesday:
A. Go to Text website, http://www.whfreeman.com/scc, (or http://bcs.whfreeman.com/bps3e/ ): and play with the Correlation/Regression Applet. Create a data set of around 10-15 points with r = -.65 (close to it). Add the meanX&meanY lines, and make a sketch of your result on your paper to hand in. (Or you can print it out like this: Hit the Printscreen while holding down the Alt button. This puts the image of the active window on the Clipboard. Open Word, do Edit>Paste. Then you can print the Word document.) Using SPSS to find correl.
coeff.
(Back page of Scatterplot handout:
Analyze>Correlate>Bivariate)
Sec. 2.2 Correlation (no SPSS ).
|
Read, to discuss
Moore p. 99 Use data of 2.17.You graphed this by hand for Sec. 2.1. Guess what r is; look in the back of the book to see how close you got. p. 106 2.29 blunders C. Many communities find a strong positive correlation between the amount of ice cream sold in a given month and the number of drownings that occur in that month. Does this mean that ice cream causes drowning? If not, can you think of an alternative explanation for the strong association? D. Explain why one would expect to find a positive correlation between the number of fire engines that respond to a fire and the amount of damage done in the fire. Does this mean that the damage would be less extensive if only fewer fire engines were dispatched? Explain. |
Optional
|
| Regression prep. Review of straight lines: p. 124, 2.39, 2.40. Optional, but be able to do. Most people did fine on lines on the pretest. If these are a problem, ask someone NOW! Any MathClinic assistant can help with these. Also Just the Basics on reserve covers it. A. Open the Excel file RegressionSlope (or in the folder RegressionDemos in ClassMaterial\Math151). Change x-y values in the yellow boxes and watch the line change. Change x-values in col. F and watch the "run" (red line) change. Notice the slope = the coefficient of x = the rise/run = increase in y per unit increase in x. Fix it so the increase in x (the "run") is exactly 1. Print the page to hand in. B. Practice fitting lines: Use the text website ("Do this" below) and try to fit at least 4 different data sets. Write down on your paper what you discovered (were your judgment errors consistent in any ways--did you have any surprises?) Moore p. 111, 2.31 acid rain No data, therefore no SPSS (draw the line by hand) |
Read,
to discuss |
Op
tion al
|
| Hand in
Later!regression
with SPSS (you can get your printouts now, answer
other questions later if you like.).
C. Use the SPSS Scatterplot handout and graph the regression line for govsal on avgpay (as shown, back page), also the lines for the 4 separate groups (either on one graph or on panels.) Print them out and keep them. Start answering questions 6-11, on p. 3 of the handout. Keep till you can answer all questions. Moore p. 111 2.32 (Manatees) all parts. Import the
dataset into
SPSS (Class Materials\Math151) In
D. For the data of Moore, p103, 2.22 (metabolism),
(SPSS) Print
out a graph with the regression line
|
Read,
to discuss |
Op
tion al
|
Correlation: Day
11
--You won't have to calculate a correlation coefficient by hand. This
formula is a bad one for hand computation (roundoff error); if you must
do one by hand, find the computational formula in an old textbook.
--Eyeballing: sketch xbar and ybar lines, see how much data is
in + quadrants, how much in - quadrants.
--Strength of correlation says NOTHING about causality! Strong
correlation could be:
A causes B/
B causes A/ C causes both A and B/ just chance that they go together in
this data set.
Graphing Straight lines? p. 124, 2.39, 2.40
Regression
line: Section 2.3, Predicts or estimates a y
(vertical)
value for a given x (horizontal) value: Straight line!
Formula yhat = a + b x.
To predict
a y-value for a given x-value, plug the x value into the formula and
calculate.
To do it graphically, use the Up-and-Over method (Fig. 2.10, p.107):
Find the x, go straight up to the line, then go over to the y-axis;
that
y-value is the predicted y.
a is
y-intercept.
b
is slope (b multiplies x, the horizontal
value): If x increases one unit, yhat increases b
units.
RegressionSlope.xls
or
in ClassMaterial\Math151\RegressionDemos
We all get the same line from a batch of data because we use the "least-squares best fit" criterion (pp. 107-8): we'll investigate this more closely later.
Do this: Practice fitting "least
squares
best fit" lines: Author's website,
http://www.whfreeman.com/scc, (ClickNetscape toolbars to
minimize
them, if needed. If line drawing doesn't work, try the newer
version
at http://bcs.whfreeman.com/bps3e/
)
Choose "Statistical Applets",
Correlation/Regression.
Check in the "Show least-squares line" box and put in some data
points.
Check in the "Show Mean X &Mean Y lines" box; see if #3 below
holds.
Repeat for a few data sets.
--Try fitting the line yourself: (Uncheck the "Show ..." boxes.)
Put in some data points. Now click Draw Line. Click and
drag
in the picture and you'll get a line with 3 blobs. Drag the center and
it will go up and down, Drag an end and the slope will change. Put the
line in the best place for predicting y's from x's. If you do
well
by the "least squares" criterion, the green bar up top will shrink
close
to 0 (but in the newer version you have to be really
good.
Dumb.) Check in the "Show Mean X &Mean Y lines" box;
adjust
your line. Check in the "Show least-squares line" box and see how
you did.
| Sievers home | Math151-Fall04/Dayf14.htm | 11:30pm | 9/26/04 |