| Hand in Friday
p.
136 5.13 hospitals: big = bad? p. 198, 8.4, 5, 6 population/sample Postpone the rest: p.209-10, 8.38 b only Traffic lights |
Read, to discuss
p.
136, 5.12 lurking variables
|
Optional p. 136, 5.11, lurking variables Postpone the rest: p. 209, 8.35 Use table B (more practice) p. 209, 8.34 seat belt use |
Association does not
imply
causation
Strong association/correlation between A and B could be:
A causes B/ B causes A/ C
causes both
A and B (lurking C)/ just Chance that they go together in this
data
set.
Direction? Rooster causes sun to rise by
crowing?
Both variables "caused" by a lurking variable?
Lurking variable can be part of the cause
--Women with a history of heavy antibiotic use have higher rates of
breast cancer.
--Baby rats whose mothers licked and groomed
them more grew up to be more exploratory, social, less
timid.
Cause? Effect? How to tell?
Establishing that x "causes" y:
difficult:
Best: Do an experiment
in which we change x, keep lurking variables under control. (E.g.
Rats.
Ch.9)
Otherwise: Strong
association. Consistent over many studies. Higher x-->stronger
y.
X precedes y in time. A plausible mechanism exists (parallel
studies?)
Generalize rat grooming to humans?
Chapters 1 through 5 have covered analyzing
data
that was given to us--what it said about itself.
Informally, develop
guesses,
suspicions, hypotheses about the world the data came from.
From Exploration to Inference p. 186
Ch.
8&9: Producing Data:
Aim:
create data sets that will allow us to make inferences to a
larger
world than just the data we have.
Observational
Study: Observes individuals, measures variables, does not
influence the responses. (ch.8)
Sometimes observe individuals who are (more or less) conveniently at
hand, or, better,
Take Sample from a population, examine it....
(ch.8)
Experiment:
Imposes
treatment
on individuals, to see how the treatment
influences the response.
(ch.9)
(SAMPLING) BIAS: The design of a study is biased if
it systematically favors certain outcomes.
Check our "sample" of digits
Sample survey: (attempt to) choose a representative
sample from a large, varied population. Not Easy!
Some issues: What population do we want
to understand? What exactly do we want to measure?
Non-probability samples (sampling badly):
Why the same number of digits in each label? Each individual 3-digit chunk is as likely as any other 3-digit chunk. But a 1- or 2-digit chunk is more likely than any 3-digit chunk. So 2 will come up more often than 12, but 02 will come up just as often as 12.
Why across? For consistency
on
HW, go the way they say (so you get the answer in the book).
In practice, you can read up, down, backwards, as long as you decide
beforehand, and don't change in the middle of choosing the sample.
+ + + + + + + + + + + + + + + + + +
| Sievers home | Math151-Fall06/Daym20.htm | 10:45am | 10/11/06 |