| Hand in
(All D&V
p 238ff. unless otherwise noted) 1, 4, 6, 7, 8, 9 Do parts a,b,c,d, and e, and f, and hand them all in. ) 23 Sampling methods 11 Parent opinion I 15, 16 Phone and cell phone surveys p. 267 #41 Security |
Read,
to discuss p. 265 #29 Home- Postpone: |
Optional |
Homework questions? Day 20
Recap: Sample
Chosen from a Population
(varies)
(fixed, but usually unknown)
Calculate
Numerical summary: Statistic
(Latin)
Parameter(Greek
letter) (D&Vp227)
Examples:
Sample mean xbar Population
mean mu (µ)
Sample st. dev. s Pop.
standard dev. sigma
The actual value of the Statistic will vary,
depending on the particular sample.
"Sampling variability" = "sampling
error"
The Statistic "estimates" the Parameter.
We hope it is close to the parameter. If we choose simple
random
samples, we can understand the pattern of values the statistic can
take.
Want sample to be representative of population, statistic to estimate
paramater well, but variability happens...
Now: Simple Random Sample (SRS)
of size
n: See
Day 20 for details:
Sampling frame.
Using SPSS to Sample. Get
Handout.
Using Random Number Table to sample, see below.
Other sample designs: Stratified random, Cluster, Multistage,
Systematic
- - - - - - - - - - - - -
Sources of Bias in
sampling: any systematic failure of a sample (or its method) to
represent its population. (E.g. sampling
frame excludes a "different" part of population.)
Bad sampling designs:Not using randomness:
Using Random Number Table
to sample
(p. A-49) Example: Ch. 11 pp. 216-7 The
Step-by-step
simulation effectively takes a random sample of size 3 from the 57
students.
Every digit, every sequence of digits, is equally
likely to be "next" in any direction. (Divisions
into
5 is just for legibilty)
To use: label everyone in the population
with a number.
Important: Every labeling number needs the
same
number of digits.
To label 9 people, use the labels 1,2,3,....9
(1-digit
chunks)
To label 15 people, use the labels 01, 02, ...10,
11, ...15 (2-digit chunks)
To label 125 people, use the labels 001, 002, ...
124, 125 (3-digit chunks)
Pick a place (at random) in the table, start reading
across in that size chunk. Get n eligible
numbers (discard repeats)
For example : 07511
88915
41267 16853 84569 79367 ..
From 9 people, a sample n = 5: 0,7,
5,
1,
1, 8, 8, 9,
1, 5, 4, (sample is individuals 7, 5, 1, 8, 9)
From 15 people, a sample 07,
51, 18, 89, 15,
41, 26, 71, 68, 53, 84, 56, 97, 93, 67.... keep reading,
go to next line (or back to top line) if you need
more. Individuals 7, 15,...are chosen using this line.
From 125 people, a sample 075,
118,
891, 541, 267, 168, 538, 456, 979, 367...keep reading.
Individuals
75, 118, ...
Why the same number of digits in each
label?
Each individual 3-digit chunk is as likely as any other 3-digit
chunk.
But a 1- or 2-digit chunk is more likely than any 3-digit chunk. So
2 will come up more often than 12, but 02 will come
up
just as often as 12.
Why across? For consistency
on HW, Start where I say and go across (so everyone who does it
right
gets the same answer.). In practice, you can read up, down,
backwards,
as long as you decide beforehand, and don't change in the middle
of choosing the sample.
= = =
= = = = = = = = = = = = = = = = = = = = = = =
Start here Mon. after break:
D&V Ch13 Goal:
show cause-and-effect. Predictor-->Response
Observational Study: Observe
individuals; don't do anything to them;
do not influence the responses.
Can indicate
strength of relationship, differences, but not cause and
effect.
(Often not with samples, but with selected group(s).) Lurking
variables?!?
(Fisher: Smokers smoke to soothe irritabilities that may
cause
cancer.)
Retrospective:
gather data after the fact (observe that x% of men hospitalized
with heart disease were/are smokers)
Prospective:
choose individuals in advance. Measure them; or follow them, as
events
happen. (Framingham Heart Study: 5,209
(2,873
women and 2,336 men) healthy residents between 30 and 60 years of
age.
Followed from 1948 to now. A second-generation cohort recruited 1971,
Minority
group 1995 http://www.framingham.com/heart/)
Experiment: Impose
treatments
on individuals, to see how the treatment
influences the response.
Compare treatments' effects.
Do something to: "Experimental Units" =
"Subjects"
Treatment: A Specific experimental condition.
Factor: = Explanatory (Predictor) Variable we
manipulate.
Levels: Specific
values of a factor that we set.
Response variable(s)
E.g. 2 headache medications, in combination?
A two-factor experiment, each with 3 levels. 9 possible
treatments.
Factor A: Aspirin: levels None, 500 mg,
1000 mg
Factor B: Caffeine: levels None, 50 mg, 100 mg
Response variable: reported pain relief
| Aspirin | ||||
| None | 500 mg | 1000 mg | ||
| None | Treatment 1 | Treatment 2 | Treatment 3 | |
| Caffeine | 50 mg | Treatment 4 | Treatment 5 | Treatment 6 |
| 100 mg | Treatment 7 | Treatment 8 | Treatment 9 |
E.g. (Day 13, MRA-95-13 )Corn yield= response variable. One Factor = Planting rate. 5 Levels=the rates.
| Sievers home | Math151-Fall06/Daysp21.htm | 2:15pm | 3/17/06 |