| Hand
in: A. You are driving on the thruway from Syracuse to Rochester and keep track of how many vehicles you pass and how many pass you. You find that these 2 numbers are the same. Your speed on the thruway is: (a) the Mean speed of the cars, (b) the Median speed of the cars, (c) the Modal speed of the cars. Choose one, and justify your choice. Standard deviation B. Find the mean and standard deviation of 2, 2, 4, 8 by hand. p. 50, 2.9 Blood phosphate Do a and b by hand. Use SPSS or some other tool** to do c. Write your answers from screen to paper. Also (re)make a dotplot of the data, mark the mean with a wedge, and indicate the standard deviation s with <----> lines from the mean to both sides, s long. (like the sketch below) p. 51, 2.10 xbar=7.50, s = 2.03
the same for both dist's. Don't do the calculations--just make
stemplots & compare their shapes! |
Read, to discuss
|
Optional p. 62, 2.40, 2.43 Play with summary numbers. Use the Applet, One variable statistical calculator; type data in at the Data tab. |
Summaries
of Middle & Spread continued--"Systems:"
-- (Midrange, Range Very
sensitive to outliers--they use only the max and min!)
-- Median, IQR (+
Quartiles Q1, Q3, 5-number summary), based on percentiles (j'th
percentile is > j% of the data)
-- Mean, StandardDeviation "y-bar"
(or "x-bar"), "s" (good for symmetric unimodal, no outliers)
Standard
deviation
(measure of Spread that goes with mean)
Variance s2:
(almost) average
of squared deviations from the mean.
(Divide by (n-1)
"degrees of freedom")
s :
Standard deviation is the square
root of
the
variance.
Computation: I will require you to know how to do it by hand for
4 or 5 observations
(see BPS4e p. 48-9 for formula & computation example.
)
Demo: 1,1,2,4, mean = 2, sum of squared deviations
= 6, variance = 2, s = 1.41
1,1,2,4,12, mean = 4, sum of squared deviations = 86, variance =
21.5, s = 4.64.
(Midcomputation check: Sum of deviations from the mean (before
squaring
each) always = 0 )
--s is Always > 0 (0 only if all observations are =)
--s units the same as those of the
observations
(squared and squarerooted).
Physics: angular momemtum (spinning ice skater)
Not so weird: High school geometry?
Remember
Pythagorean theorem: c2
= a2 + b2:
hypotenuse of right triangle is also square root of a sum of squares.
Very
sensitive
to outliers (the outliers contribute much more than their
share to the Sum of
Squared Deviations from the Mean)
SPSS, for simple computation: Handout
Organizing a statistical problem:
Four-step process (pp. 53-5, & inside front
cover)
State: the issue to be explored, question to be addressed
(real-world) (In hw problems, often already stated.)
Formulate: What statistical tools, measures,
analyses should we use to answer the question?
Solve: Carry out the process. (May need to
back up & try again. Decide on mean, s.d., but stemplot shows
badly skewed? go back and decide on 5#summary instead.)
Conclude: Give the conclusion as it addresses the
real-world question/issue.
NO: Begin p. 55, 2.12 in class in pairs
(or 3's). Decide what analyses to do; start doing them (make a
copy for each)
Ch. 3, Density curves, BPS4e pp.64-69
GET handout HW sheet:
"Tables for Simple Models (Densities)"
Link!((Density
Handout )
(When values can take on any of a continuous interval
of numbers)
Example: Spinner: Label edge with continuous values from
0 to 1. Spinning should produce 1/10 of all spins in each colored
sector.
Simulations of 500, 3000 spins show roughly true. More spins would get
closer.
(Histograms of simulations)
Abstraction, idealized histogram ("Mathematical model") = Density curve. Describes a theoretical distribution of data.
Any density curve: is a curveMany, many density curves are possible, modeling many phenomena.
--always on or above the horizontal axis
--has area exactly 1 underneath it.This allows area to represent proportion of "histogram" between specified values.
(We will assume the proportion of observations precisely equal to a value is 0. "So proportion less than 2" is the same number as "proportion less than or equal to 2.")
Median, mean, percentiles, standard deviation are defined for a density curve in analogy to those for a histogram.For the spinner, the density curve is "Uniform on 0 to 1". If you have two spinners like this, spin both at once and add the results--the corresponding density curve is "triangular, symmetric, on 0 to 2" A more complicated mechanism will produce data corresponding to the density curve I have called "trapezoid, -1 to 2" A very important one is the "normal" distribution family.
Many densities have tables to describe them. Especially tables showing area to the left of (below) a given value ("Cumulative Proportion").
You will make and use "Cumulative Proportion" tables for the simple distributions on the handout. These are similar to the table we will use to describe the Normal distribution.| Sievers home | Math151-Fall07/Dayf5.htm | 2pm | 9/8/07 |