Math 151 , Fall 2006, Monday Day 28, Oct 30  Hit reload .After class

Exam not graded yet
HW Day 28 Read Chapter 10,the rest (p. 256 on)  (Note Normal distribution is back) . Check 10.21, 28.  Begin Ch. 11 (pp. 286-291 optional).First pp. 271-77. Then to 286.   Check p. 294: 11.17, 18 (parameter/statistic, sampling dist.).  11.19, 20 (behavior of xbars, mean & s.d.)  11.22, 23, 24 (behavior of xbars, more) 
Hand in:
p. 268, 10.44 Benford One more discrete probability

Continuous sample spaces:
 For A and B, Use Densities Handout, from Day 7.Copies of the HW handout are in class/ outside my door if you can't find yours.)
Change language from "description of a population of data" to "pick an individual from the population, call the value X"
A. ("Uniform")   X = number the spinner points to.
a) (example)  The probability that the spinner points to a number less than .6 = P( X < .6) = .6 .
b) P (.2 < X < .6) = ?   Say it in words: ?
c) For what x is there probability .4 of being greater than x ?      (In notation: P(X > x) = .4.  Find x)
B.  Y = (number you get from) the sum of two spinners. ("Triangular")
  This is the same random variable as Y in 10.14, p. 259!
a) The probability that the sum is a number less than .6  =  P(       ?        ) =.18     P(Y > .6) =  ?  
b) P(Y < 1.6)  =          P (Y < 1) =   ?             P( 1 < Y < 1.6) =  ?
c)  P(Y > x) = .92  Find x:  ?   (Hint:  P(Y<x) = .08)

p. 259, 10.13 uniform, 0-1 (Note, this is distribution A on the handout)
p. 259, 10.14  sum of two uniform (Note, this is distribution B, "Triangular", on the handout)
pp. 236-7 10.48 and 10.50 uniform on 0-2 (This corresponds to a single spinner with its edge labeled with values going from 0 to 2, rather than the 0 to 1 we used in our homework up to now. Use the area of rectangles formula to find the probabilities.)

Normal distribution:  Restate each problem from "The probability that X is..." to "The proportion of the population of x's that ...." and use your old techniques.
p. 259, 10.15  Iowa Test Scores
p. 269, 10.49 Did you vote?
p. 269,  10.51 NAEP scores
p. 269,  10.53 Friends

READ Personal Probability, pp. 261-2.  All our theory will be developed using the "frequentist" point of view (probability = proportion in the long run).  But there is another theory based on Personal Probability, sometimes called "Bayesian".
p. 262, 10.18

On a separate sheet using http://www.whfreeman.com/bps4e  "Probability " applet: :  If you do it jointly, one sheet for both people (I'll aggregate the results)
p. 270, 10.55 runs of free throws
p. 270, 10.56 a.  For b, do 20 people 10 times, but do 320 people only twice.  Record not only the proportion (.63 or whatever) but the fraction (like 201/320.  208/320 = .65 exactly)

- - - - - - - - - Ch. 11- - -
p. 272 11.1 caffeine (Param./Stat.)
p. 272 11.2 voters(Param./Stat.)
Postpone these two: p. 275, 11.4 means in action (LLN)
                                 
p. 275, 11.5 insurance (LLN)

p. 277, 11.6 sampling distribution of exam scores Do a and a modified version of b; Do b this way.  Close your eyes and put your finger down somewhere on table B (Don't use row 116!! unless you land there.).  Start reading the table where your fingertip lands.  Record your sampleof 4,  and find xbar for your sample.
Now Repeat part b, to get a total of  3 values of xbar. (You can just keep reading the table where you left off, or you can put your finger in a different spot).  Record your 3 xbars,  Make a dotplot of your 3 xbars and bring the values to class to be compiled with everyone else's..
Read, to discuss 


 

Optional 
  More practice:
p. 261, 10.17, ACT scores

- - - -
p. 272,11.3 Bearings (Param./Stat.)
 
<>Exam not graded yet
If you didn't Wednesday: Add your results from the 200 "coin flips" with p = .10 to the circulating transparency (#10.3b).

Jenn O'Neill's new hours: "Monday 6-8pm Tuesday 3:30-5:30pm and Wednesday 6:30-8:00pm. I am also available for
individual appointments if these times aren't convenient."

~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~
"Probability" of particular something happening: proportion of times it would happen in a very long series of independent repetitions of the phenomenon.
    (independence:  outcome of one trial (repetition) must not influence the outcome of any other.)
http://www.whfreeman.com/bps4e  "Probability " applet: 1 toss at a time--settles down slowly.  Look at results so far, 200 tosses, p = .1
<>   
Recap:  Probability Models
Random phenomenon,
    Sample space S:  set of all possible outcomes (no overlap of descriptions)
    Event:  any outcome or set of outcomes
    Probability model: S, and a way of assigning a probability to each event.
Sample space depends on what you want to know:
Phenomenon: Flip coin twice.
    S1 = {HH, HT, TH, TT}     S2 = {0, 1, 2} number of heads   S3 = {Y, N} both are heads?

Probability rules:  A an event in sample space S, P(A) is "the probability that  A occurs"
    These rules are all true for proportions in long run (Probabilities), prop.of counts, proportions of areas.
    1.  0 < P(A) < 1
    2. P(S) = 1
    3. For any event A, P(A does not occur) = 1 - P(A)
    4.  A and B are  disjoint if they have no outcomes in common (can't happen simultaneously.)
        If A and B are disjoint, their probabilities add:  P(A or B) = P(A) + P(B)

Finite sample spaces: 
Assign a probability to each outcome (>0) so they add to 1.  
    Prob. of an event is sum of prob's of its outcomes.
Two principles for assigning probabilities:
--Sometimes, a properly chosen sample space will have equally likely outcomes. You can use this to find other probabilities.

Phenomenon: Flip coin twice.
    S1 = {HH, HT, TH, TT}     S2 = {0, 1, 2} number of heads   S3 = {Y, N} both are heads?
Sample space  | HH | HT | TH | TT |   EQUALLY LIKELY
       Prob's | .25| .25| .25| .25|  P(tail followed by head)=?
Sample space  | 2  |    1    |  0 P(at least 1 tail)=?   P(1 of each) = ?
       Prob's | .25|   .50   | .25|  P(at least 1 Head)= ?  P(2 Heads) = ?
Sample space  | Y  |       N      |
       Prob's | .25|     .75      |
--If you pick an individual at random from a population, the probability that one individual will be XYZ is the same as the proportion of XYZ's in the population.
Pick one person from U.S. Pop. (Age 25 +)

Sample space:
No HS degree
       HS only     .
1-3 yrs College
 4 + yrs College
Proportion in pop.
18.3%
33.9%
24.8%
23.0%
Probability 
.183
.339
.248
.230

Often the sample space is naturally expressed in numbers, thus

Random Variable:  (X, Y, Z...) Variable whose value is a numerical outcome of a random phenomenon.
 Probability distribution of X tells us what values X can take and how to assign probabilities to them.
    If X has a finite number of possible values (Discrete distributions), nothing new except notation.
     P(X < 2) is "Prob. that X is less than 2."
Flip coin twice. R.V. X = number of heads:  Distribution given by table.
             x| 2  |    1    |  0 |
       P(X=x) | .25|   .50   | .25|  P(X > 1) = ?   Words:  Prob that # heads is > 1
                                     P(X = 2) = ?         Prob that # heads is 2

Looking ahead (back)
Random variables with intervals of outcomes ("continuous") Ch.10 (p. 256 on) 
If the sample space is an interval of values (or the whole line), the way we assign probabilities to events is with a density curve (Ch. 3, cf. Day 7 on) (remember density curves were idealizations of histograms--of repeating the "experiment" many many times)
  P(a <  X < b) = the probability that X is between a and is the area under the density curve, between a and b.
We declare P (X = a) = 0 , so P(a <  X < b) = P(a < X < b)
Notation: Use capital letter for the random variable, the "label" of the phenomenon.  Use small letters for particular values it can have.  But this rule is often broken--Moore uses x-bar where many (I) would use X-bar.

Review "Density curves" HW day 7, restating these parts as probability questions, and adding a bit:
    (Copies of the HW handout are in class/ outside my door if you can't find yours.)
Change language from "description of a population of data" to "pick an individual from the population, call the value X"
A. ("Uniform")   X = number the spinner points to.
a) (example)  The probability that the spinner points to a number less than .6 = P( X < .6) = .6 .
b) P (.2 < X < .6) = ?   Say it in words: ?
c) For what x is there probability .4 of being greater than x ?      (In notation: P(X > x) = .4.  Find x)
B.  Y = (number you get from) the sum of two spinners. ("Triangular")
  This is the same random variable as Y in 10.14, p. 259!
a) The probability that the sum is a number less than .6  =  P(       ?        ) =.18     P(Y > .6) =  ?  
b) P(Y < 1.6)  =          P (Y < 1) =   ?             P( 1 < Y < 1.6) =  ?
c)  P(Y > x) = .92  Find x:  ?   (Hint:  P(Y<x) = .08)
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Our most important probability model: NORMAL DISTRIBUTION family.  Same techniques as before, only we ask "probability that one chosen at random..." instead of "proportion of all..."  Review Normal techniques: Day 7 Day 8, Day 9
 Take a random sample of size 1 from a population which is N(110, 25). 
(Give an individual, chosen at random, the "Classic IQ test", which has a normal distribution, mean 110, s.d. 25.   X is the score on the test.)
Find P(100 < X < 140), prob. that individual gets between 100 and 140.   Work is on Day 9, what proportion.

<>~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~
We know that a sample from a population will not exactly represent the population.  If we take a random sample, the behavior of samples will not be individually predictable, but there will be predictable pattern in many random samples from the same population.  Knowing the pattern will be  as good as we can do.
Ch. 11:
        Sample Chosen from a  Population
          (varies)             (fixed, but usually unknown)
Calculate
Numerical summary: Statistic (Latin) Parameter(Greek letter)
    Examples:           Sample mean xbar    Population mean mu (µ)
                       Sample st. dev. s    Pop. standard dev. sigma
                        Sample median     Pop. median
                Sample proportion p-hat  Pop. proportion p
                Sample line height y-hat  Pop. regression line height y
The actual value of the Statistic will vary, depending on the particular sample. "Sampling variability"
Got to here Monday
The Statistic "estimates" the Parameter.  We hope it is close to the parameter.  If we choose simple random samples, we can understand the pattern of values the statistic can take.
Some examples of  statistics:
    Height:   U.S. young women: pop. mean= 64.5", pop. s.d. 2.5"  (ed. 2 p.66.  BPS4e p. 76, 87 says women 20-29 is N(64, 2.7))
                                               Math 151, Spring '01,  xbar = 64.2,     s = 3.75.
                                                                        Fall '01,   xbar = 65.01,    s = 3.22.
                                                                     Spring '02,  xbar = 64.53,    s = 2.91.
                                                                       Fall '02,    xbar = 63.89,     s = 2.48.
                                                                   Spring '03,  xbar = 64.98,    s = 3.29
                                                                     Spring '04,  xbar = 65.33,    s = 2.25
                                                                       Fall '04,  xbar = 64.68,     s = 3.54

                                                                    Spring '05,  xbar =64.31 ,    s =2.93
                                                                         Fall '05  xbar =63.92 ,    s =2.80
                                                                    Spring '06  xbar =62.93 ,    s =2.78
                                                                        Fall '06  xbar =62.81 ,   s =  2.65 
<>   Coin flip: Proportion of heads  p = 1/2 (?)       p-hat =  256/520 = .492  (combined data from many past classes)
    Thumbtack:  Proportion of point-up p =  (??)       p-hat =  441/691 = .6382  (one past class, Math 251)

Next: How does sample mean behave? ( pp.275-86)
                 Sample Chosen from a  Population
                  (varies)            (fixed, but usually unknown)
Calculate Numerical summary: Statistic estimatingParameter
                                    xbar                   µ
We take a simple random sample of size n, find the sample mean xbar.  It will be different depending on the sample, so we have a random phenomenon.  We measure the outcome as a number, the sample mean, so we have a
                       random variable X bar.

Law of Large Numbers (p. 273-4, "LLN")  Take observations at random from a population with population mean µ. Then as the number of observations n increases, the sample mean xbar gets closer and closer to µ.
(Even if the population is infinitely large! Note--we don't say how big n needs to be for how close here.)
  OR Let the sample size n get bigger.  Then  the xbars will eventually get very close to the population mean µ.
  OR As the sample size increases, the sample mean gets closer to the population mean µ.
  OR For a very large sample, the sample mean will (almost certainly) be very close to the population mean.
e.g. the bigger my statistics class, the closer their mean height should be to the U.S. mean height for women.
Applet: http://www.whfreeman.com/bps4e  "Law of Large Numbers"  Roll a single die.  X = number.  µ= 3.5. (Think X is number of spaces you can move in a board game.  Average per roll is 3.5.)
My result Oct.27: x = 5    n=1 Xbar = 5/1 = 5
          Roll again, x = 2.  n=2, Xbar = (5+2)/2 = 3.5
            Again,     x = 1   n=3.  Xbar = (5+2+1)/3 = 2.67
  Again... ...   Xbar for large n; close to 3.5.

Now:  keep a fixed sample size n:
What probability distribution describes the random phenomenon of finding xbar from a SRS?
That is, what is the distribution of the random variable Xbar, when the experiment is to take a simple random sample of size n?
We'll call it the "sampling distribution of the (sample) mean" (p. 275-7, then details 278-86)
This is the distribution of means of all possible SRS's of size n. 
10.3b  "penny" p = .1 
Transparency of some of the possible sample proportions.  Simulation of sampling distribution of proportion.
If you have this R.V.:  X = 1 if Heads, 0 if Tails, then P(X=1) = .1, P(X=0) = .9.  And Xbar = sample proportion!  So it's also sampling distribution of mean!
What do we see?
--Shape: Looks normal-ish
--Center:  Mean of xbars ~ mean of dist. of X.  (.1)
--Spread:  SD of xbars  is smaller than that of population X. 
Applet: http://www.whfreeman.com/bps4e  Normal approx to binomial, p = .1, n = 1, then n = 100 (our data is for n = 200, even narrower.)  Horizontal scale is in n, number of possible heads, and we worked in p, proportion of heads.  Just think of  space between 0 and the given number as being from  0 to 1, black line (mean) is at .1, always.

HW tonite #11.6 (modified): each get 3 SRS's  of size 4, find 3 means: will pool to get histogram of Sampling distribution of mean
Quincunx board:  Result for one ball is "average" of going + or going - at each level

      (entry + pin 1+pin2+ ...+ pin 6).

Sievers home  Math151-Fall06/Daym28.htm  11am 10/30/06
This page belongs to Sally Sievers who is solely responsible for its content. Please see our statement of responsibility.