The two most common measures of variability, namely the Variance
and its close descendant the Standard Deviation, owe their popularity
to the importance of the Normal Distribution, which we shall study
later. Normal distributions, which play an important role in both
descriptive and inferential statistics, are completely determined by
two "parameters": their mean and their variance.
The variance describes the heterogeneity of a distribution and is
calculated from a formula that involves every score in the distribution.
It is typically symbolized by the letter s with a superscript "2".
The formula is
Variance = s2 =
Sum (Scores - Mean)2/(n - 1)
The square root (the positive one) of the variance is known as
the "standard deviation." It
is symbolized by s with no superscript.
Sq. Root of Variance = Standard Deviation, denoted by s
Use the formula for the variance and standard deviation to calculate both
for these scores: 8, 10, 12, 14, 16. Note that n = 5. Make the
calculations on a piece of paper. You should get a variance of 10 and a standard
deviation equal to the square root of 10 which is 3.16.
Here's
an online demonstration of how the standard deviaiton is calculated,
from John Behrens's stats course.
Properties of the Standard Deviation
As the scores in a distribution become more heterogeneous, more
"spread out" and different, the value of the standard deviation grows
larger.
If I told you that the standard deviation of 6th
grade students Reading grade equaivalent scores was 1.68 yrs (in Grade
Equivalent units) and the standard deviation of their Math scores was
0.94 yrs, you would know that the students are more varied in Reading
performance than in Math performance. Can you come up with an
educationally sound explanation of why this might be so?
In normal distributions, roughly two-thirds of the scores lie within a
distance of one standard deviation of the mean; 95% lie within two
standard deviations of the mean; and 99.7% lie within 3 standard
deviations.