| Hand in:
(Review) Income depends on height?!
Read the article and answer this. (Review) p. 128, 2.29 dates'
heights. Added after class 9.9 a, b, c, d nonresponse Two-way tables with SPSS. pp.
612ff. IPS give no raw data sets to practice crosstabs on; all the
data are
pre-tallied. 9.26 Web ref's (SPSS) Do everything they ask
for except
for the "significance test." Postpone this problem! 9.15 applicants (Simpson's
paradox) (SPSS) See how much you can get SPSS to do. (Hint.
For c, use school as your "layer" variable) |
Read, discuss
|
Optional |
- - - - - - - - - - - - - - - - - - - - -
Relationships: We know how to analyze/summarize
quantitative
vs. quantitative (scatterplot), and categorical vs. quantitative
(side-by -side histograms, stemplots, boxplots) . Now
Categorical vs.
Categorical
Sec. 9.1 "Two way tables"
"Two way table"
"Contingency
table" "Crosstab(ulation)" Hair color vs. Class
year.
A thousand people are interviewed by the census bureau, and the results
tabulated in this two way table.
Working Status vs. Sex.
| Women | Men | Total | |
| In Labor Force | 350 | 450 | 800 |
| Not in Labor Force | 150 | 50 | 200 |
| Total | 500 | 500 | 1000 |
What is the "Percent of women in the labor force" ?
Write your
answer down on a scrap of paper.
answer
When you write or see percents, be clear what
is on the bottom of the fraction (even if it takes longer to
say)!!.
From the New Yorker magazine,
traditionally
the most literary and error-free of all, Feb.14/21, '05:
CORRECTION: The Mail of January 3rd contained the incorrect statistic that four-fifths of Bush voters identified moral values as the most important factor in their decision. In fact, four-fifths of those identifying moral values as the most important factor of their decision were Bush voters.Marginal distribution: Distribution of one variable, ignoring/summingover the other.
|
|
Conditional distribution: Distribution of one variable,
with the individuals being only those which satisfy a condition in the
other variable.
For women, their conditional distribution
as to working status; For
men, their distribution as to working status.
"Column %s"--columns add to 100%: "conditional
distributions
of working status by sex ".
| Women | Men | Total | |
| In Labor Force | 350/500 = 70% | 450/500 = 90% | 80% |
| Not in Labor Force | 150/500 = 30% | 50/500 = 10% | 20% |
| Total | 500/500=100% | 500/500=100% | 100% |
For those in the labor force,
conditional
distribution as to sex.
For those not in the labor
force, conditional distribution as to sex.
"Row
%s"--rows add to 100%: "conditional distributions of sex by
working
status."
| Women | Men | Total | |
| In Labor Force | 350/800 = 43.8% | 450/800 = 56.2% | 800/800=100% |
| Not in Labor Force | 150/200 = 75% | 50/200 = 25% | 200/200=100% |
| Total | 50% | 50% | 100% |
Graphs to compare proportions: parallel sets of
bar graphs, see text, p. 603,.
Segmented (stacked) bar charts, of % (so
total length the same) (Redundant if there are only 2
segments)
% Women
O
% Men X
OOOOOOOOOOOOOOXXXXXXXXXXXXXXXXXX In Labor Force
OOOOOOOOOOOOOOOOOOOOOOOOXXXXXXXX Not in Labor Force
Can do segmented bars of raw numbers, conveys different info:
25 Women
O
25 Men X
OOOOOOOOOOOOOOXXXXXXXXXXXXXXXXXX In Labor Force
OOOOOOOOXX
Not in Labor Force
Categorical data with SPSS: (p. 6,
Intro
handout)
Pre-tallied? Data> Weight Cases>Count to Frequency
box.
Analyze>Descriptive Statistics>Crosstabs.
Cells
button.
(3-way? Third to Layer box)
Graph>Interactive> Bar: 100% box for stacked
percents,
one variable to horiz. axis, other to legend box, stacked or clustered
. Third to panel.
.Start here Wed..
Simpson's paradox: An association or
comparison that holds
for all or several subgroups can reverse direction when
the
data are combined into a single group.
Example from text. p. 588
example 9.10
SPSS output
Parallel Continuous situation:
Cars.sav , like econ graduates problem (Ch.2). (X=weight,
Y=time to accelerate to 60. Heavier car should be slower? Oops.
Panel
with #of cylinders, or color with horsepower.)
| Sievers home | Math251-Fall07/Day2s14.htm | 6pm | 9/25/07 |
| Women | Men | Total | |
| In Labor Force | 350 | 450 | 800 |
| Not in Labor Force | 150 | 50 | 200 |
| Total | 500 | 500 | 1000 |