Sampling & Sampling distribution
by
Asif Hanif
Statistical Inference
• The purpose of statistical inference is to obtain
information about a population from information
contained in a sample.
• A population is the set of all the elements of
interest.
• A sample is a subset of the population.
• The sample results provide only estimates of
the values of the population characteristics.
• A parameter is a numerical characteristic of a
population.
• With proper sampling methods, the sample
results will provide “good” estimates of the
population characteristics.
Advantages of Sampling
• Sampling saves money, time and energy; as
it is much cheaper to collect the desired
information from a small sample than from
the whole population.
• Sampling is essential to obtaining the data
when the measurement process physically
damages or destroys the sampling unit
under investigation.
Advantages of Sampling
• Sampling may be the only means available
for obtaining the needed information when
the population appear to be infinite or is
inaccessible such as the population of
mountains or thickly forested areas.
• Sampling has much smaller “non-
response”, following up of which is much
easier.
Representative Sample
A sample which accurately reflects the
distribution of relevant variables in the
target population
Biased sample
Means that those selected are not typical or
representative of the larger population from
which they have been selected
Sampling and Non-Sampling error
A sample being only a part of a population cannot
perfectly represent the population, no matter how
carefully the sample is selected, so the difference
between the statistic and parameter occurs.
Sampling error = X bar - µ
Non-sampling errors includes all human error,
Faulty sampling frame, non-response to
questionnaire, misclassification of data etc.
Probability vs. Non-probability
Sampling
• Probability – Sample is selected based on a
random process (which means it is based on
chance).
• Non-probability– Sample is selected using
some nonrandom process (not based on
chance).
Types of Probability Samples
• Simple Random Sample
• Systematic Sample
• Stratified Random Sample
• Cluster Sample
Simple Random Sampling
Each element in the population has an
equal probability (chance) of being
selected for the sample
Simple Random Sampling
Sampling With Replacement:
Sampling without replacement
It is the procedure used most often.
In large sampling projects, computer-generated
random numbers are often used to automate the
sample selection process.
Stratified Random Sample
• Strata
– is a group of people who share a
common characteristic.
Examples of strata– race, gender, marital
status.
Stratified Random Sampling
• The population is first divided into groups of
elements called strata.
• Each element in the population belongs to one
and only one stratum.
• Best results are obtained when the elements
within each stratum are as much alike as possible
(i.e. homogeneous group).
• A simple random sample is taken from each
stratum.
• Formulas are available for combining the stratum
sample results into one population parameter
estimate.
Cluster Sampling
• The population is first divided into separate
groups of elements called clusters.
• Ideally, each cluster is a representative small-
scale version of the population (i.e.
heterogeneous group).
• A simple random sample of the clusters is then
taken.
• All elements within each sampled (chosen)
cluster form the sample.
Systematic Sampling
• If a sample size of n is desired from a population
containing N elements, we might sample one
element for every N/n elements in the population.
• We randomly select one of the first N/n elements
from the population list.
• We then select every N/nth element that follows in
the population list.
• This method has the properties of a simple
random sample, especially if the list of the
population elements is a random ordering.
Sampling Distribution of the x
Sample Mean
Population A simple random sample
with mean of n elements is selected
=? from the population.
The value of x is used to The sample data
make inferences about provide a value for
the value of . the sample mean x .
Sampling Distribution of the
Sample Mean
The sampling distribution of the
sample mean is the probability
distribution of all possible values of
the sample mean.
Sampling Distribution of the
Sample Mean
For random samples of size n selected
from a population with mean and
standard deviation the sampling
distribution of the sample mean
• has a mean of x , the population
mean;
• has a standard deviation of x .
n
Example
Assume that a population consists of “five”
students and the marks obtained by them in a
certain class are: 20, 15, 12, 16, 18.
Draw all possible random samples of two
students when sampling is performed
i) With replacement ii) Without replacement
Calculate the mean marks for each sample
Non-probability Sampling
Obtain subjects in some nonrandom
way.
Typically volunteers
Quota Sampling
• Divide the population into various categories
• Determine the number of people to be selected for
each category
• For each category, you obtain an accidental
sample until you fill the quota of people you need
for that category
Purposive or Judgmental Sample
• Select your sample on the basis of your
knowledge of the population and nature of
your research aims.
Snowball Sample
• Find someone who fits the criteria for the
study (accidental/convenience sample)
• Interview person and at end of interview,
you ask if he/she knows of other people
who meet the study criteria and may be
willing to participate