Meaning of r²

The number of irrelevant answers on a test is graphed versus the age of the child being tested (n = 10).
How "good" is age at predicting/estimating/accounting for the number of irrelevant answers? (Assuming the straight "regression" line is doing the predicting.)
Don't know age?  Measure the variability of the irrelevant answers with the variance:
Find the distance of each child's irrelevant answers from the mean of all the irrelevant answers,
square the distances, sum them, divide by n-1 (= variance)
(Take the square root = standard deviation.)
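
The variance recipe above can be sketched in a few lines of Python. The ten scores below are made up for illustration (the page's actual data are not reproduced here), and the variable names are only placeholders.

```python
import math

# Hypothetical data (not from the original page):
# number of irrelevant answers for n = 10 children.
irrelevant = [12, 11, 9, 10, 7, 8, 5, 6, 3, 4]

n = len(irrelevant)
mean_y = sum(irrelevant) / n

# Distance of each child's score from the mean of all scores.
distances = [y - mean_y for y in irrelevant]

# Square the distances and sum them.
sum_sq = sum(d * d for d in distances)

# Divide by n - 1 to get the variance; its square root is the standard deviation.
variance = sum_sq / (n - 1)
std_dev = math.sqrt(variance)

print("mean:", mean_y)
print("sum of squares:", sum_sq)
print("variance:", variance)
print("standard deviation:", std_dev)
```

Dividing by n - 1 rather than n matches the recipe in the text (the usual sample variance).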

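If you use age to predict, get the predicted number of irrelevant answers for each age-point (the height of the regression line at that age), and find the variance of those predicted values.
Take the ratio of the variances (or of the sums of squares): variance of the predicted values divided by variance of the observed values. That's r².
See also Rsquared Excel file.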
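
Here is a rough sketch of that ratio in Python, assuming the predicting is done by the least-squares line. The ages and scores are invented for illustration, and `sample_variance` is just a helper name chosen here.

```python
# Hypothetical data (not from the original page), n = 10 children.
ages = [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
irrelevant = [12, 11, 9, 10, 7, 8, 5, 6, 3, 4]

def sample_variance(values):
    """Sum of squared distances from the mean, divided by n - 1."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)

n = len(ages)
mean_x = sum(ages) / n
mean_y = sum(irrelevant) / n

# Least-squares slope and intercept.
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(ages, irrelevant))
sxx = sum((x - mean_x) ** 2 for x in ages)
syy = sum((y - mean_y) ** 2 for y in irrelevant)
slope = sxy / sxx
intercept = mean_y - slope * mean_x

# Predicted number of irrelevant answers at each age-point.
predicted = [intercept + slope * x for x in ages]

# r^2 as the ratio: variance of predicted / variance of observed.
r_squared = sample_variance(predicted) / sample_variance(irrelevant)

# Cross-check against the square of the correlation coefficient r.
r = sxy / (sxx * syy) ** 0.5
print("ratio of variances:", r_squared)
print("r squared:", r ** 2)
```

The two printed numbers should agree up to rounding; the n - 1 cancels in the ratio, which is why sums of squares work just as well.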

How do these ideas relate to the RESIDUALS?
For each point,
Residual = Observed - Predicted
Predicted + Residual = Observed
These relationships hold whether we measure the Predicted and Observed from "0" or from the mean line for Y, as shown in the pictures. Also (true only if we use the least-squares line):
Var(Predicted) + Var(Residual) = Var(Observed)
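
A quick numerical check of these relationships, as a sketch on invented data. The helper names (`least_squares_line`, `variance_gap`) and the comparison line y = 17 - x are assumptions made up for this illustration.

```python
# Hypothetical data (not from the original page), n = 10 children.
ages = [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
irrelevant = [12, 11, 9, 10, 7, 8, 5, 6, 3, 4]

def sample_variance(values):
    """Sum of squared distances from the mean, divided by n - 1."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)

def least_squares_line(xs, ys):
    """Return (intercept, slope) of the least-squares line."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return my - slope * mx, slope

def variance_gap(intercept, slope):
    """Var(Predicted) + Var(Residual) - Var(Observed) for a given line."""
    predicted = [intercept + slope * x for x in ages]
    residuals = [y - p for y, p in zip(irrelevant, predicted)]
    # Per point, Predicted + Residual = Observed holds for any line.
    assert all(abs(p + e - y) < 1e-9
               for p, e, y in zip(predicted, residuals, irrelevant))
    return (sample_variance(predicted) + sample_variance(residuals)
            - sample_variance(irrelevant))

b0, b1 = least_squares_line(ages, irrelevant)
gap_ls = variance_gap(b0, b1)          # least-squares line
gap_other = variance_gap(17.0, -1.0)   # some other line, chosen arbitrarily

print("gap for least-squares line:", gap_ls)
print("gap for the other line:", gap_other)
```

The gap should be zero (up to rounding) for the least-squares line and nonzero for the arbitrary line, which is the "only true if we use the least squares line" caveat in action.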

1 - r² is the fraction of the original variance left in the residuals (unaccounted for).
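
As a sketch, the leftover fraction can be computed directly from the residuals of the least-squares line and compared with 1 - r². The data are again invented for illustration.

```python
# Hypothetical data (not from the original page), n = 10 children.
ages = [5, 6, 7, 8, 9, 10, 11, 12, 13, 14]
irrelevant = [12, 11, 9, 10, 7, 8, 5, 6, 3, 4]

n = len(ages)
mean_x = sum(ages) / n
mean_y = sum(irrelevant) / n

sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(ages, irrelevant))
sxx = sum((x - mean_x) ** 2 for x in ages)
syy = sum((y - mean_y) ** 2 for y in irrelevant)

# Least-squares line and its residuals (Observed - Predicted).
slope = sxy / sxx
intercept = mean_y - slope * mean_x
residuals = [y - (intercept + slope * x) for x, y in zip(ages, irrelevant)]

# Least-squares residuals average to zero, so their sum of squares
# divided by n - 1 is their variance; the n - 1 cancels in the ratio.
ss_residual = sum(e * e for e in residuals)
fraction_left = ss_residual / syy

r_squared = sxy ** 2 / (sxx * syy)

print("fraction of variance left in residuals:", fraction_left)
print("1 - r squared:", 1 - r_squared)
```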


3pm 9/26/07
Math151-F07/Rsquaredmeaning.htm
 

This page belongs to Sally Sievers who is solely responsible for its content.