There is considerable variation in the
salinity of the vials. The salinity level
in the sample of 20 varies from 43.91
to 62.58, a range of nearly 20.
Suppose you did not realize that and
assumed that every vial would have exactly
the same salinity value. You would take
a vial, assume that the machine was incorrectly
adjusted and try to correct the setting.
If you subsequently checked another vial
you would assume you had either not corrected
sufficiently or over-corrected depending
on the result and readjust the machine
accordingly.
The quality guru W. Edwards Deming used
a 'funnel' experiment to illustrate this:
We have taken a sample of 20 vials to
avoid relying on a single result. The
idea is that this 'averages out' the vial
to vial variation and gives a more reliable
estimate.
This is an example of 'inferential
statistics'. We have used the
sample mean to estimate
the process mean (also
known as the population mean).
- The sample mean is called a 'statistic'
and is represented by
.
A statistic is a hard and fast value
calculated from the sample data
- The process mean is called a parameter
and is represented by μ
(by convention parameters are represented
by Greek letters).
The sample mean is not a perfect estimate
of the process mean. If we take a number
of samples we will find that the mean
varies from sample to sample, although
the variation in the sample means is less
than the individual vial to vial variation.
The larger the sample, the more reliable
the estimate. We can never know the exact
value of the process mean, although we
can estimate it to any required accuracy
by taking a sufficiently large sample.
Later in the course we will discover
how to quantify the likely error in the
accuracy of a sample.