MATH 251, P&S I, Fall 2007, Wed. Sept. 26, Day 15.After class.

Reading  "Simpson's paradox", pp. 588-590
Please read: Chapter 3: Sec. 3.1 (Intro) and 3.2 through Ex. 3.11 p. 207. Finish 3.2.  Ahead,  3.3, 3.4
Hand in:
9.15 applicants (Simpson's paradox) (SPSS) See how much you can get SPSS to do.  (Hint. For c, use school as your "layer" variable) 
==Postpone the rest!==  
Sec. 3.1 p. 196ff. 
3.3, cell phones
3.4 tv violence
3.7 beer/wine
- - - - - - - 
Sec. 3.2 p. 210ff.
3.10 adolescents
3.12 aspirin design, significance
3.19  fabric finishing
3.21  Random allocation with Applet
3.28   Randomness doesn't guarantee alikeness (Applet)
3.18  x% off?   Use the Applet to choose the subjects (everyone's will be different?)
3.22  x% off Display of 2-factor results.
Read, discuss 


======
Sec. 3.1

3.1 cola
3.5animation
- - - - - - 
Sec. 3.2
3.9, 3.11

 

Optional 

 

Stratton Dedication http://www.wells.edu/CelebratingConnections/schedule.htm
  
Especially Ira Flatow 3:00 Saturday, + ribbon cutting. 
     Ushers needed:
contact Prof. Collmer (ccollmer) or Ms. Sweet (esweet)

HW Questions?
  Day 13Day 14
Data analysis project, in pairs.   * Handout: *  Preliminary report due <9:30 am Oct. 5, Day 19.  Final paper 9:30 am Oct.12, Day 21.  
Pairs:


 Summary:  Categorical data with SPSS: (p. 6, Intro handout)
Pre-tallied?  Data> Weight Cases>Count to Frequency box.
 Analyze>Descriptive Statistics>Crosstabs.  Cells button.  (3-way? Third to Layer box)
Graph>Interactive> Bar:   100% box for stacked percents,  one variable to horiz. axis, other to legend box, stacked or clustered .  Third to panel.

Simpson's paradox:  An association or comparison that holds for all or several subgroups can reverse direction when the data are combined into a single group.  Day 14
- - - - - -Quiz.
Start here Friday
- - - - - - - - - - - - - - - - - -

Chapters 1 and 2 have covered analyzing data that was given to us--what it said about itself. 
    "Exploratory Data Analysis"
Informally, use to develop guesses, suspicions, hypotheses about the world the data came from.
"Anecdotal evidence"--haphazard information, often noticed because striking.  Often unrepresentative of anything.

Ch. 3:  Producing Data:  Aim:  create data sets that will allow us to make inferences to a larger world than just the data we have.    "Statistical Inference" = "Confirmatory Analysis"   Exploratory  vs. Confirmatory
Design how to get data...
       Observational Study:  Observes individuals, measures variables, does not influence the responses. (3.3)
                    Take Sample from a population, examine it,
                           hope it's representative so we can infer that population is like sample.
                            (Not very useful for cause-and-effect--see sec. 2.5)
                    (Census--whole population)
        Experiment: Imposes treatment  on individuals, to see how the treatment influences  the response. (3.2)
                            Best for cause-and-effect.
Confounding:  Two variables (explanatory or lurking) are confounded when you can't sort out their effects on a response variable.
--Used to be: coffee drinking and smoking--most people did both, or neither...
--4 years ago:: women who ate at least one serving/day of whole grain (cereal, bread) much less likely to have heart attack.
   (Who eats whole grains?  Were those variables taken into account? ?)

Design of experiments
Sec 3.2    Special jargon!
Do something to:

    "Experimental Units" = "Subjects"   (cases)
Treatment:  Specific experimental condition.
Factor: Explanatory Variable which we manipulate.
    Levels: Specific values of a factor that we set.
Response variable(s)

E.g. 2 headache medications, in combination?
A two-factor experiment, each with 3 levels. 9 possible treatments.
    Factor A: Aspirin:  levels None, 500 mg, 1000 mg
    Factor B: Caffeine: levels None, 50 mg, 100 mg
Response variable: reported pain relief



Aspirin


None 500 mg 1000 mg

None Treatment 1 Treatment 2 Treatment 3
Caffeine 50 mg Treatment 4 Treatment 5 Treatment 6

100 mg Treatment 7 Treatment 8 Treatment 9

Lurking and confounding variables:  Control--how?  Nothing except experimental treatment should differentially affect response.   Biased design: Systematically favors certain outcomes
    Compare responses under several treatments, look at differences.
Placebo effect: a positive response to a "sham" medical treatment--if you believe it will work, it very likely will. (Tinkerbell?)
    A medical treatment must be shown to be better than a placebo (at least) to be approved by the FDA.
               placebo="I shall please" (Latin)
To control for the placebo effect, All treatments should "look alike".  Treatment 1 above should be a pill with no medicine--a "placebo".  (Some experiments even try to duplicate side effects of actual medication.)

"Control group" Group that gets the "baseline"--"null"-- "none" or "placebo" level of the factor.  Should be "just like" the group(s) that get the "treatment" ("real" levels of the factor).  So Treatment 1 above will go to the "control group", the other 8 will go to "experimental" or "treatment groups."
Murky language here:  "Experimental vs. control" or "Treatment vs. control" is different usage from "Treatments", one of which is the "control"="none"/"placebo".
     *Sometimes the Control  is the current "best practice" treatment, rather than none.
Note:  sometimes in "hard sciences", no baseline; sometimes even no comparison.  Just Treatment-->Observe response.
Start here Monday:
Comparative experiment:  How to get groups of experimental units  "just like" one another?  or at least not biased.
  Biased design: Systematically favors certain outcomes.
Randomize who goes into which group.  (Usually our batch of  experimental units is not a random sample from the population of all individuals-- usually volunteers, etc.)
Randomized comparative experiment : Diagrams of design, IPS pp. 202, 205
Completely randomized: all exp. units allocated at random among the treatments.

E.g. does acupuncture work for Backache(1)  (2)  ?  Response: report of symptoms.
    One factor, 3 Levels:  None (music?, "Usual care?"), Acupuncture (wrong), Acupuncture (right ). 3 treatments.
            (control(s)?)
      30 subjects with Backache:  Randomize, 10 each treatment.  Administer treatments.  Compare symptoms. (Do diagram)

Picking groups with Simple Random Sample applet www.whfreeman.com/ips5e :  Label your subjects with numbers 1 thru 30. Set "population" equal to 30.  Pick "sample" of size 10 from the 30; they get the first treatment. (Write down their numbers.)  From the remainder, pick another "sample" of size 10 for 2nd treatment.  (Keep track where it starts in the sample list.)   The 10 remaining get the 3rd treatment.  (Next time: using a paper Random Digits Table--pp. 203-4; sampling with SPSS)

Why 10 each, not just 1 each?  Repeat the experimental treatments on many units allows "averaging out" chance variation in the units.  (Don't confuse the repetition  needed within one experiment, with "replication" of the whole experiment in a different time and place to confirm its results.)

Principles of designing an experiment (p. 203) See above...

Statistical Significance (first def.):  An observed effect (difference) so large that it would rarely occur by chance is called statistically significant.  (We prefer to think it's a "real" effect of our explanatory variable rather than a "lucky" chance effect.)

Designed experiments are our best shot at cause-->effect  but!
Need to "replicate" in different contexts, with variations.

Placebo and biasing effects can result from expectations of medical staff.  "Blind" means subject doesn't know who's getting "real" treatment.  "Double blind" means neither subject, nor staff administering treatment, nor people recording response variables know who's getting which treatment. (That should really be triple blind?) Most convincing for cause/effect.

Usually an experiment treats the placebo effect as a confounding variable, and is designed so placebo effect will work equally on all groups.  There is no attempt to measure the placebo effect.  ("All" drug studies.)
        PMS/acupuncture:  Acupuncture (wrong) vs. Acupuncture (right).
Sometimes an experiment deliberately tries to measure the placebo effect (as in the article).
        Acupuncture (wrong) vs. Music/usual care.

Lack of realism Do sociology, psychology, (medical?) experiments generalize to "real life?"  Time, money, subjects.  Ethical questions...


Sievers home  Math251-Fall07/Day2s15.htm   10pm    9/28/07
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.