MATH 251, Probability and Statistics I, Fall 2001, Oct. 10, Day 17

Another potential data file for your project.  C:\Program Files\SPSS\  Employee Data has information on 474 employees hired by a midwestern bank between 1969 and 1971.  The bank was subsequently involved in employment discrimination litigation.  This data set may contribute evidence for or against. (We've already looked at World95, a rich file with many interesting aspects.  Other data sets here may be self-explanatory enough to use.)

In-class exam Friday, closed book, thru 3.3  Knowing and using formulas, definitions, proofs--no questions on how to use SPSS, but all other aspects of homework are possible... Day 4 #A or Day 10 #B for sure (n =3 is sufficient).

Questions about exam?
HW questions?

Sec. 3.4:
  We know that a sample from a population will not exactly represent the population.  If we take a random sample, the behavior of samples will not be individually predictable, but there will be predictable pattern in many random samples from the same population.  Knowing the pattern will be  as good as we can do.

                 Population  Choose from it a Sample (varies)
Calculate
Numerical summary: Parameter(Greek letter)    Statistic (Latin)
    Examples:               Population mean mu                Sample mean xbar
                            Pop. standard dev. sigma         Sample st. dev. s
                         Pop. median                   Sample median
                         Pop. proportion p             Sample proportion p-hat
                         etcetera.
The actual value of the Statistic will vary, depending on the particular sample. "Sampling variability"
The Statistic "estimates" the Parameter.  (is an "estimator" of the parameter) We hope it is close to the parameter.  If we choose simple random samples, we can understand the pattern of values the statistic can take.

Sampling distribution of statistic:  Distribution of values of the statistic from all possible samples (of size n).
    Variability of statistic: spread of its distribution.
        Dependent ONLY on sample size, not pop. size (as long as population is at least 10 times sample size)
            Variance times (N-n)/(N-1), where N is pop. size
        As sample size increases, variability decreases.
    Bias of statistic:  An estimator is unbiased if the mean of its sampling distribution = the parameter we're estimating.
 
 Read 3.4 (toward inference)
Hand in (Monday) : 3.51, 52, 53, 54 parameter/statistic
3.55 bias and variability
3.56, 57, 58  sample size & variability

3.59 a and 3.60 a--repeat each one a total of 3 times
(3 values for phat, 3 values for xbar).  Make a dotplot for 
your 3 values for each problem.  We will pool the results Monday.

Read, discuss 
 
Optional


Sievers home  Math251-Fall01/DayP17.htm      10/10/01
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.