| Hand in
Wednesday
pp. 143-4, 5.35, 37 (SPSS) Drilling into
the past, silicon
(one
clear outlier) To graph the lines with and without the
outlier on the same graph, make a new variable and put 1's in every
case but the
outlier--give the outlier 0. Then use this variable as your
legend or panel variable. You'll also get a "nuisance" horizontal
line at the outlier; ignore it. p. 129, 5.7 (SPSS) does fast driving waste fuel? residuals There is a data file for problem 5.7, and its third column is the residuals. Do all the parts, and Also with 5.7, In SPSS, Make a variable containing the residuals (Handout, bottom p. 4. Also bottom of this page.) The values should match the ones in the book/SPSS file. SPSS Handout p. 3 (Governors' salaries): You can now finish#12, the last question. Hand it all in Wednesday!. B. Use Residuals.xls from the website or the lab to graph these data sets, along with a graph of the residuals. Print the results, and describe the shape of the residuals (it may help to connect the dots with pencil, to see the pattern.)a) x 1 2 8 4 6 9 y 1 3 6 6 7 5 b) x 1 2 7 4 6 9 y 7 6 2 4 2 1 p.133, 5.9 Farm population (SPSS) Do a, b, c (read p. 132 for a good word to use in part c). Also, make a variable containing the residuals, and plot it against the x (year) values. Draw (in pencil) a horizontal line at height 0. What pattern do you see in the residuals? p 179 7.28, 29, 30 (SPSS) Soap
in the shower. Also, look
carefully at the graph and guess why there is no data after day
21. (Read p. 132 for the word to describe using the line for day
30, and a discussion of the issue) |
Read, to discuss If you haven't, Look at this, especially with reference to the r standard deviations in y for every 1 standard deviation in x: A. Open the Excel file RegressionSlope (or in the folder RegressionDemosExcel for D&V in ClassMaterial\Math151 D&V). Change x-y values in the yellow boxes and watch the line change. Change x-values in col. F and watch the "run" (red line) change, in the rightmost 2 graphs. Notice the slope = the coefficient of x = the rise/run = increase in y per unit increase in x. Fix it so the increase in x (the "run") is exactly 1. Also, look at the leftmost graph, where the length of the standard deviations are shown, and note that in standard-deviation units, the rise is r s.d.'s in y for each s.d. run in x. <>C. Use Applet http://www.whfreeman.com/BPS4e Correlation/regression. Make a cloud of data (about 15 points), put in the regression line. Play with an outlier: drag a point to the far left (or right) and drag it up and down. Try it if it's in the middle range of x's. (Drag it up and down.) Answer: Where is it most influential? Now add a bunch more points (50 is max.) Play with an outlier again. Does the outlier have more or less influence with a larger data set? Postpone: p. 136, 5.12 lurking variables |
Optional Postpone: p. 136, 5.11, lurking variables
|
Extrapolation--
extra (outside) polation (putting a point): Using the line to predict
outside
the range of x's you have data for. Linear relationships don't go
on forever; straight line is often a first approximation to a
more complicated relationship.
Start here Wednesday:
"Lurking" variable:
has an important effect, but not one of the variables studied.
The trouble with lurking
variables is that by definition you don't know they're there.
Association does not
imply
causation. Establishing
that x "causes" y:
difficult: Ch. 9
| Sievers home | Math151-Sp07/Daysp19.htm | 10:45am | 3/14/07 |