|
Hand in Friday: |
Read, to discuss |
Optional More practice on Normal: p. 261, 10.17, ACT scores - - - - p. 272,11.3 Bearings (Param./Stat.) |
Exams not finished. Sorry again!
Hand in now, your sheet of results from 10.55 and 10.56 (Probability applet
results) if you didn't last time. If you did 10.3 since
Monday, add your results to the
sheet circulating.
Add to the sheet circulating: your 3 means from 11.6, p. 277.
--
HW questions?Day 28
Discrete, Continuous Random Variables. Normal Random
Variables. Notes Continuous Day 26
Ch. 11:
Sample/Population, Statistic/Parameter
We know that a sample from a population will
not exactly represent the population. If we take a random
sample, the behavior of samples will not be individually
predictable, but there will be predictable pattern in many random
samples from the same population. Knowing the pattern will
be as good as we can do.
Next:..
How does sample mean behave? ( pp.275-86)
Sample Chosen from a Population
(varies)
(fixed, but usually unknown)
Calculate Numerical
summary: Statistic
estimating Parameter
xbar
µ
We take a simple random sample of size n, find the sample mean xbar.
It will be different depending on the sample, so we have a random
phenomenon. We measure the outcome as a number--the sample
mean--so we have a
random variable X bar.
Law of Large Numbers (p. 273-4,
"LLN") Take observations at random from a population with
population mean µ. Then as
the number of observations n increases, the sample mean xbar gets
closer and closer to µ.
(Even if the population is infinitely large! Note--we don't say how
big n needs to be for how close here.)
OR Let the sample size n get bigger. Then the xbars
will eventually get very close to the population mean µ.
OR As the sample size increases, the sample mean gets closer to
the population mean µ.
OR For a very large sample, the sample mean will (almost
certainly) be very close to the population mean.
e.g. the bigger my statistics class, the closer their mean height
should be to the U.S. mean height for women.
(Statistics means never having to say you're
certain...)
Applet:
http://www.whfreeman.com/bps4e
"Law of Large Numbers" Roll a single die. X = number.
µ=
3.5. (Think X is number of spaces you can move in a board game.
Average per roll is 3.5.)
My result at home --1st roll: x
= 5 n=1, Xbar = 5/1 = 5.
Roll again, x = 2. n=2, Xbar = (5+2)/2 = 3.5
Again, x = 1 n=3, Xbar =
(5+2+1)/3 = 2.67 Again...
... Xbar for large
n; close to 3.5.
Got to
here Wed.
Now: keep a fixed
sample size n:
What probability distribution describes the random
phenomenon of finding xbar from a SRS?
That is, what is the distribution of the random variable Xbar,
when the experiment is to take a simple random sample of size n?
We'll call it the "sampling
distribution of the (sample) mean" (p. 275-7,
then details 278-86)
This is the distribution of means of all
possible SRS's of size n.
10.3b "penny"
p = .1 Last term's results, Previous classes, some of the possible sample
proportions. Simulation of sampling distribution
of proportion.
If you have this R.V.: X = 1 if
Heads, 0 if Tails, then P(X=1) = .1, P(X=0) = .9. And Xbar = (sum
of all observations)/n (# of Heads)/n = sample proportion! So
it's also sampling distribution of mean! Theoretical distribution
This term (startlingly
different!)
What do we see?
--Shape: Looks normal-ish
--Center: Mean of xbars ~ mean of dist. of X. (.1)
--Spread: SD of xbars is smaller than that of population
X. Applet:
http://www.whfreeman.com/bps4e Normal approx to binomial,
p = .1, n = 1, then n = 100 (our data is for n = 200, even
narrower.) Horizontal scale is in n, number of possible heads,
and we worked in p, proportion of heads. Just think of
space between 0 and the given number as being from 0 to 1, black
line (mean) is at .1, always.
..
HW due today: #11.6 p. 277 (modified):
each get 3 SRS's of size 4, find 3 means: will pool to get
histogram of Sampling distribution of mean
Quincunx board: Result for one ball is "average" of going
+ or going - at each level
(entry + pin 1+pin2+ ...+ pin 6).Continue,
toward behavior of sample means:
Recap: How do sample means behave?
Sample (varies) Chosen from a Population(fixed, but usually unknown)
Numerical summary:
Statistic
xbar
Parameter µ
We take a simple random sample of size n, find the sample
mean xbar. It will be different depending on the sample,
so we have a random phenomenon. We measure the outcome as a
number, the sample mean, so we have a
Law of Large Numbers
(p.273-4, "LLN") Take observations
at random from a population with population mean µ. Then as the number of
observations n increases, the sample mean xbar gets closer and
closer to µ. (Even if the
population is infinite!
Note--we don't say how big n needs to be for
how close here.)
Now: keep a fixed sample size n:
What is the distribution of the random variable Xbar, when the
experiment is to take a simple random sample of size n? This
is the distribution of means of all
possible SRS's of size n.
We'll call it the "sampling
distribution of the (sample) mean" (p. 275-7,
then details 278-86)
Whatever
the population
distribution of
X,
that we draw the sample from, (see p. 278)
(as long as the population is large compared to the sample
(at least 10 to 20 times sample)
Example: "Normal"
body
temperature 98.6 deg. on average. (Assume this is true.)
(Normal table A)
Assume normal distribution, & s.d.among many
people is 0.6. (0.7 is a better
assumption, I'm told, but .6 is easier to think with.)
...
Probability that one
individual's normal temperature is below 98.0 degrees? P(X < 98.0) = P(Z < (98.0-98.6)/.6) = P(Z<-1)
~~.16.
Take SRS of
9 people. Sampling distribution of the mean? mu = 98.6, s.d. of Xbars = .6/sqrt(9) = .2
Probability that the mean is below 98.0? P(Xbar
< 98.0) = P(Z < (98.0-98.6)/.2) = P(Z<-.6/.2) =
P(Z<-3)=.0013
Probability that one
(random) healthy individual's normal temperature is above 98.8?
Probability that the mean of a sample
of 4 is above 98.8?
Probability that the mean of a sample
of 36 is above 98.8?
Probability that the mean of a sample
of 100 is above 98.8?
All of these are P(Xbar>98.8) for
different sample sizes n. (Normal
table A)
| Sample size n |
s.d. of Xbars = (pop.s.d.)/sqrt(n) |
z = (raw-mean)/s.d. |
P(Xbar>98.8)= P(Z>z) |
| 1 |
.6/1 = .6 |
((98.8-98.6)/.6 = .2/.6 =
.33 |
P(Z>.33) = .3707 |
| 4 |
.6/2 = .3 |
(98.8-98.6)/.3 = .2/.3 = .67 | P(Z>.67) = .2514 |
| 36 |
|||
| 100 |
Got about here Friday.
Next: Population is NOT necessarily normal:
= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
= = = = = = = = = = = =
SPSS
simulation: average of spinners
which can land on any number
between 0 and 1.
Population--one spinner. distribution flat between
0 and 1, mean .49 s.d. = .29
n = 2, Average of 2 spinners is Xbar. Distribution
triangular between 0 and 1, mean .50, s.d. .21.
.29/sqrt(2)
=.205
n = 4, Average of 4 spinners is Xbar. Distribution
normalish between 0 and 1, mean .50, s.d. .15. .29/sqrt(4)
=.145
n = 15, Average of 15 spinners is Xbar. Distribution
normal between 0 and 1, mean .50, s.d. .09. .29/sqrt(15)
=.076
Xbars from SRS:
Mean of Xbars is mean of
population.
Standard deviation of Xbars is
s.d. of population divided by square root of n.
As sample size increases,
sampling
distribution of Xbars gets more and more normal-shaped.
(Central Limit Theorem)
Central Limit Theorem...
How large is "large"? How approximate is
"approximate"?
If the population was close
to normal, n doesn't need to be very large.
If the population is not badly
skewed or bimodal, n=25 already gives a pretty good
approximation to normal.
Author's website
applet, Central
Limit theorem for a highly skewed dist.
--Sum of 3 dice (divide bottom scale by 3
to get average of 3 dice)
--Pictures, n=1, 2, 4, 25 CLTpics.pdf or on overhead.
--Rice
U. Applet (kind of creaky) where you can change/create the
population dist. Sometimes it fails in pieces (sd=0) or
crashes if you try to use all the options, but is pretty good if you
stick to the Mean, and use only the top 3 displays, and don't go for
huge numbers of reps..
| Sievers home | Math151-F08/Dayf29.htm | 11pm | 11/9/08 |