Sampling and Estimation

Uploaded by

PRIYADARSHI GOURAV

Statistical Analysis in Finance

Session 3:
Sampling and Estimation

Dr. Nemanja Radić

www.cranfield.ac.uk/som

Content :
Sessions 1 & 2: Probability and Probability Distributions
SESSION 3: SAMPLING AND ESTIMATIONS
Session 4: Hypothesis Testing
Session 5: Problem Solving
Sessions 6 & 7: Regression Analysis
Session 8: Regression Models with Dummy Variables
Sessions 9 &10: Problem Solving and Exam Revision

Reading:
Statistical Techniques in Business and Economics (17th edition) by Douglas A. Lind, William G. Marchal and Samuel A. Wathen, 2017, McGraw-Hill. Chapters 8 and 9.

Intended Learning Outcomes

• Understand simple random sampling, sampling distribution and sampling error.

• Understand the Central Limit Theorem and its importance.

• Be familiar with techniques of point estimation.

• Be able to estimate confidence intervals for a variety of data.
Sampling

• Population: all members of a specified group.

• A population parameter describes the population; its value is usually unknown.

• Sample: a subset of the population.

• A sample statistic is calculated from the sample and used to make inferences about the population.

Most Commonly Used Probability Sampling Methods

• Simple Random Sample:

• A sample selected so that each item or person in the population has the same chance of being included.

• Systematic Random Sampling:

• The items or individuals of the population are arranged in some order. A random starting point is selected and then every kth member of the population is selected for the sample.

Most Commonly Used Probability Sampling
Methods (cont’d)

• Stratified Random Sampling

• A population is first divided into subgroups, called strata, and a
sample is selected from each stratum. Useful when a population can
be clearly divided in groups based on some characteristics.

• Cluster Sampling

• A population is divided into clusters using naturally occurring geographic or other boundaries. Then clusters are randomly selected, and a sample is collected by randomly selecting from each selected cluster.
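
The four probability sampling methods described so far can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the population of IDs 1 to 100, the stratum names (`large_cap`, `small_cap`), the cluster names (`region_0` to `region_4`) and all function names are hypothetical and do not come from the slides.

```python
import random

def simple_random_sample(population, n, rng):
    """Every member has the same chance of being included."""
    return rng.sample(population, n)

def systematic_sample(population, k, rng):
    """Random start among the first k members, then every kth member."""
    start = rng.randrange(k)
    return population[start::k]

def stratified_sample(strata, n_per_stratum, rng):
    """Draw a random sample from EVERY stratum."""
    return {name: rng.sample(members, n_per_stratum)
            for name, members in strata.items()}

def cluster_sample(clusters, n_clusters, n_per_cluster, rng):
    """Randomly pick some clusters, then sample only within those."""
    chosen = rng.sample(sorted(clusters), n_clusters)
    return {name: rng.sample(clusters[name], n_per_cluster)
            for name in chosen}

rng = random.Random(42)           # fixed seed so the sketch is reproducible
population = list(range(1, 101))  # hypothetical member IDs 1..100

srs = simple_random_sample(population, 10, rng)
sys_sample = systematic_sample(population, 10, rng)

# Hypothetical strata and clusters built from the same IDs
strata = {"large_cap": population[:40], "small_cap": population[40:]}
clusters = {f"region_{i}": population[i * 20:(i + 1) * 20] for i in range(5)}

strat = stratified_sample(strata, 5, rng)
clust = cluster_sample(clusters, 2, 5, rng)

print("simple random:", sorted(srs))
print("systematic   :", sys_sample)
print("stratified   :", strat)
print("cluster      :", clust)
```

Note that the stratified result always contains every stratum, while the cluster result contains only the randomly selected clusters.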

Stratified versus Cluster Sampling

• Stratified Sampling
• Sample consists of elements from each group.
• Preferred when the objective is to increase precision.

• Cluster Sampling
• Sample consists of elements from the selected groups.
• Preferred when the objective is to reduce costs.

Selecting Samples in Finance

• Investment analysts commonly work with both time-series and cross-sectional data.

• There is no economic basis for deciding how long a time series should be.
• We may need to combine data from two different periods, such as fixed and floating exchange rate regimes.
• As a consequence, we would not be sampling from a population described by a single set of parameters.

• Whenever we sample cross-sectionally, certain assumptions must be met if we wish to summarize the data in a meaningful way.

• For example, we might choose to summarize company-level data by industry.

Parameter versus Statistics

• Population is described by parameters.

• A parameter is a constant, whose value may be
unknown.
• Only one population.
• Sample is described by statistics.
• A statistic is a random variable whose value depends
on the chosen random sample.
• Statistics are used to make inferences about the
population parameters.
• Can draw multiple random samples of size n.

Sampling Error

The sampling error is the difference between a sample statistic and its corresponding population parameter.
Examples:
$\bar{X} - \mu$ (mean)
$s - \sigma$ (standard deviation)
$s^2 - \sigma^2$ (variance)
$p - \pi$ (proportion)

Sampling Distribution of the Sample Mean

• The sampling distribution of the sample mean is a probability distribution consisting of all possible sample means of a given sample size selected from a population.

• It is not to be confused with the sample distribution, i.e. the distribution of values in a sample (notice the "-ing" ending).

• To get the sampling distribution of a sample mean, we first select all possible samples of the same size from the population, calculate the mean of each sample, and then construct the distribution of all the means we calculated.

Sampling Distribution of the Sample
Means – Example 1

A firm has seven production employees (considered the population). The hourly earnings of each employee are given in the table below.

1. What is the population mean?
2. What is the sampling distribution of the sample mean for samples of size 2?
3. What is the mean of the sampling distribution?
4. What observations can be made about the population and the sampling distribution?
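
The four questions can be worked through mechanically by enumerating every possible sample. The sketch below does this in Python; the seven hourly earnings are made-up values chosen for illustration (they are NOT the figures from the slide's table), and samples are assumed to be drawn without replacement.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

# Hypothetical hourly earnings for seven employees (NOT the slide's table)
population = [10, 12, 14, 16, 18, 20, 22]

# 1. Population mean
population_mean = Fraction(sum(population), len(population))

# 2. All possible samples of size 2 (without replacement) and their means
samples = list(combinations(population, 2))
sample_means = [Fraction(a + b, 2) for a, b in samples]

# Sampling distribution: each distinct sample mean with its probability
counts = Counter(sample_means)
sampling_distribution = {
    m: Fraction(c, len(samples)) for m, c in sorted(counts.items())
}

# 3. Mean of the sampling distribution
mean_of_sample_means = sum(sample_means) / len(sample_means)

print("population mean:", float(population_mean))
print("number of samples:", len(samples))
for m, p in sampling_distribution.items():
    print(f"  sample mean {float(m):5.1f}  probability {float(p):.4f}")
print("mean of sampling distribution:", float(mean_of_sample_means))

# 4. The sample means are less spread out than the population values
print("population range  :", min(population), "to", max(population))
print("sample-mean range :", float(min(sample_means)),
      "to", float(max(sample_means)))
```

With any population, the mean of all possible sample means equals the population mean, and the sample means are less dispersed than the individual values.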

Central Limit Theorem

If all samples of a particular size are selected from any population (with a finite variance), the sampling distribution of the sample mean is approximately a normal distribution. This approximation improves with larger samples.

• If the population follows a normal probability distribution, then for any sample size the sampling distribution of the sample mean will also be normal.

• If the population distribution is symmetrical (but not normal), the normal shape of the distribution of the sample mean emerges with samples as small as 10.

• If the population distribution is skewed or has thick tails, it may require samples of 30 or more to observe the normality feature.

• The mean of the sampling distribution, $\mu_{\bar{X}}$, is equal to $\mu$ and its variance is equal to $\sigma^2/n$.
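
A quick simulation can make the last bullet concrete. The sketch below is an illustration only: it assumes a skewed exponential population with rate 1 (so μ = 1 and σ² = 1), a sample size of n = 30, and 20,000 repeated samples. None of these choices come from the slides.

```python
import random
import statistics

rng = random.Random(0)  # fixed seed for reproducibility

n = 30             # sample size
n_samples = 20000  # number of repeated samples

# Skewed population: exponential with rate 1, so mu = 1 and sigma^2 = 1
sample_means = [
    statistics.fmean(rng.expovariate(1.0) for _ in range(n))
    for _ in range(n_samples)
]

mean_of_means = statistics.fmean(sample_means)
var_of_means = statistics.pvariance(sample_means)

print(f"mean of sample means     : {mean_of_means:.4f} (theory: 1.0000)")
print(f"variance of sample means : {var_of_means:.4f} (theory: {1 / n:.4f})")
```

The simulated mean and variance of the sample means should land close to μ and σ²/n, even though the underlying population is far from normal.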
Sampling Methods and the Central Limit
Theorem

Point Estimate

• A point estimate is a single value (point) derived from a sample and used to estimate a population value.

$\bar{X} \to \mu$
$s \to \sigma$
$s^2 \to \sigma^2$
$p \to \pi$
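
As a small illustration, the four point estimates can be computed with Python's standard `statistics` module. The sample values below are hypothetical (think of them as monthly returns in percent), and the "success" definition used for the proportion (a return above 4%) is an assumption made for the example.

```python
import statistics

# Hypothetical sample of monthly returns in percent (illustrative only)
sample = [2, 4, 4, 4, 5, 5, 7, 9]

x_bar = statistics.mean(sample)   # point estimate of mu
s2 = statistics.variance(sample)  # point estimate of sigma^2 (divides by n - 1)
s = statistics.stdev(sample)      # point estimate of sigma

# Sample proportion of returns above 4%, a point estimate of pi
p = sum(1 for r in sample if r > 4) / len(sample)

print(f"x_bar = {x_bar}, s^2 = {s2:.4f}, s = {s:.4f}, p = {p}")
```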
Confidence Interval (C.I.)

• A C.I. estimate is a range of values constructed from sample data so that the population parameter is likely to occur within that range at a specified probability.
• The specified probability is called the degree of confidence, symbolised as 1 – α.
• α denotes the probability of error, also known as the level of significance. This is the allowed probability that the estimation procedure will generate an interval that does not contain the true parameter.

• If we let α = 5%, we are 100(1 – α)% = 95% confident that a single 95% C.I. contains the population mean.
• We are justified in making this statement because we know that 95% of all possible C.I.s constructed in the same manner will contain the population mean.
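
The "95% of all possible C.I.s" interpretation can be checked by simulation. The sketch below is a rough illustration, assuming a normal population with hypothetical values μ = 10 and σ = 2, samples of size 25, and intervals of the form sample mean ± 1.96 × σ/√n (the known-σ case).

```python
import random
import statistics

rng = random.Random(1)  # fixed seed for reproducibility

mu, sigma = 10.0, 2.0   # hypothetical "true" population values
n = 25                  # sample size
z = 1.96                # reliability factor for 95% confidence
n_intervals = 10000     # number of intervals to construct

covered = 0
for _ in range(n_intervals):
    sample = [rng.gauss(mu, sigma) for _ in range(n)]
    x_bar = statistics.fmean(sample)
    margin = z * sigma / n ** 0.5
    # Count the interval if it contains the true mean
    if x_bar - margin <= mu <= x_bar + margin:
        covered += 1

coverage = covered / n_intervals
print(f"share of intervals containing mu: {coverage:.3f}")
```

The share should come out close to 0.95: any single interval either contains μ or it does not, but the procedure captures μ about 95% of the time.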

Construction of C.I.

• A 100(1 – α)% confidence interval for a parameter has the following structure:

• Point estimate ± Reliability factor × Standard error

• Point estimate = the value of the sample statistic.

• Reliability factor = a number based on the assumed distribution of the point estimate and the degree of confidence (1 – α) for the C.I.

• Standard error = the standard error of the sample statistic providing the point estimate (for the sample mean, the standard deviation of the sample means).
Factors affecting confidence interval
estimates

The width of a confidence interval is determined by:

1. The sample size, n.

2. The variability in the population, usually σ estimated by s.

3. The desired level of confidence.

Confidence Intervals for a Mean – σ Known

• A 100(1 – α)% confidence interval for the population mean μ when we are sampling from a normal distribution with known variance σ² is given by

$\bar{X} \pm z \frac{\sigma}{\sqrt{n}}$

• We use the following reliability factors when we construct C.I.s based on the standard normal distribution:
• 90% C.I.: α = 0.10, z = 1.65 (more precisely 1.645).
• 95% C.I.: α = 0.05, z = 1.96.
• 99% C.I.: α = 0.01, z = 2.58 (more precisely 2.576).
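
A minimal sketch of the known-σ interval in Python follows. The function name `z_interval` and the inputs (sample mean 100, σ = 15, n = 36) are hypothetical; the reliability factor is taken from the standard library's `statistics.NormalDist` rather than from a table.

```python
from statistics import NormalDist

def z_interval(x_bar, sigma, n, confidence=0.95):
    """Return (lower, upper) for the mean when sigma is known."""
    alpha = 1 - confidence
    z = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided reliability factor
    margin = z * sigma / n ** 0.5
    return x_bar - margin, x_bar + margin

# Hypothetical inputs: sample mean 100, known sigma 15, n = 36
for conf in (0.90, 0.95, 0.99):
    lo, hi = z_interval(100, 15, 36, conf)
    print(f"{conf:.0%} C.I.: ({lo:.2f}, {hi:.2f})")
```

Running the loop shows the interval widening as the confidence level rises, which matches the third factor listed among the determinants of interval width.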
C.I. for a Mean – σ Unknown

• If we are sampling from a population with unknown variance, then a 100(1 – α)% C.I. for the population mean μ is given by:

$\bar{X} \pm t \frac{s}{\sqrt{n}}$

where the number of df for t is n – 1 and n is the sample size.

The t-distribution

• It is, like the z distribution, a continuous distribution, defined by a single parameter known as the degrees of freedom, df.
• It is, like the z distribution, bell-shaped and symmetrical.
• There is not one t distribution, but rather a family of t distributions. All t distributions have a mean of 0, but their standard deviations differ according to the sample size, n.
• The t distribution is more spread out and flatter at the center than the standard normal distribution. As the sample size increases, however, the t distribution approaches the standard normal distribution.

Comparing the z and t Distributions
when n is small, 95% Confidence Level

The t distribution is flatter and has a greater spread than the z distribution, so the value of t for a given level of confidence is larger in magnitude.

Confidence Interval for the Mean – Example 3

A tyre manufacturer wishes to investigate the tread life of its tyres. A sample of 10 tyres driven 50,000 miles revealed a sample mean of 0.32 inch of tread remaining with a standard deviation of 0.09 inch. Construct a 95 percent confidence interval for the population mean. Would it be reasonable for the manufacturer to conclude that after 50,000 miles the population mean amount of tread remaining is 0.30 inches?

Given in the problem:
n = 10
$\bar{x}$ = 0.32
s = 0.09

Compute the C.I. using the t-distribution (since σ is unknown):

$\bar{X} \pm t_{\alpha,\,n-1} \frac{s}{\sqrt{n}}$
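
The arithmetic for the tyre example can be sketched as follows. The critical value 2.262 is the two-tailed 95% value of t with 9 degrees of freedom as read from a standard t table; it is hard-coded here as an assumption because Python's standard library has no t quantile function.

```python
import math

n = 10        # sample size
x_bar = 0.32  # sample mean (inches of tread remaining)
s = 0.09      # sample standard deviation (inches)

# Two-tailed 95% critical value of t with n - 1 = 9 df, read from a t table
t_crit = 2.262

standard_error = s / math.sqrt(n)
margin = t_crit * standard_error
lower, upper = x_bar - margin, x_bar + margin

print(f"95% C.I.: ({lower:.3f}, {upper:.3f})")
print("0.30 inside the interval:", lower <= 0.30 <= upper)
```

The interval runs from roughly 0.256 to 0.384 inch. Because 0.30 lies inside it, it would be reasonable for the manufacturer to conclude that the population mean tread remaining could be 0.30 inch.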
C.I. for a Proportion (π)

To develop a confidence interval for a proportion, we need to meet the following assumptions.
1. The binomial conditions, discussed last week, have been met. Briefly, these conditions are:
a. The sample data is the result of counts.
b. There are only two possible outcomes.
c. The probability of a success remains the same from one trial to the next.
d. The trials are independent. This means the outcome on one trial does not affect the outcome on another.
2. The values np and n(1 – p) should both be greater than or equal to 5. This condition allows us to invoke the central limit theorem and employ the standard normal distribution, that is, z, to complete a confidence interval.

C.I. for a Proportion – σ Known

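• A 100(1 – α)% confidence interval for the population proportion is given by

$p \pm z \sqrt{\frac{p(1-p)}{n}}$

where $p = \frac{X}{n}$ is the sample proportion, X is the number of successes and n is the sample size.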
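
A short Python sketch of the proportion interval is given below. The function name `proportion_interval` and the survey figures (120 successes out of 400) are hypothetical, and the check on np and n(1 – p) simply mirrors the rule of thumb stated in the assumptions.

```python
from statistics import NormalDist

def proportion_interval(x, n, confidence=0.95):
    """Return (p, lower, upper) for a population proportion."""
    p = x / n
    # Rule-of-thumb check from the assumptions: np and n(1 - p) at least 5
    if n * p < 5 or n * (1 - p) < 5:
        raise ValueError("np and n(1 - p) should both be at least 5")
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * (p * (1 - p) / n) ** 0.5
    return p, p - margin, p + margin

# Hypothetical survey: 120 "successes" out of 400 respondents
p, lower, upper = proportion_interval(120, 400)
print(f"p = {p:.3f}, 95% C.I.: ({lower:.4f}, {upper:.4f})")
```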

Selecting an appropriate sample size

There are 3 factors that determine the size of a sample, none of which has any direct relationship to the size of the population.

• The level of confidence desired.

• The margin of error the researcher will tolerate.

• The variation in the population being studied.

Sample size for estimating the population mean

$E = z \frac{\sigma}{\sqrt{n}}$

$n = \left( \frac{z \cdot \sigma}{E} \right)^2$

Where:
n is the size of the sample.
z is the standard normal value corresponding to the desired level of confidence.
σ is the population standard deviation.
E is the maximum allowable error.

Sample size for estimating a population
proportion

$E = z \sqrt{\frac{\pi(1-\pi)}{n}}$

$n = \pi(1-\pi) \left( \frac{z}{E} \right)^2$

where:
n is the size of the sample
z is the standard normal value corresponding to the desired level of confidence
π is the population proportion
E is the maximum allowable error
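
Both sample-size formulas translate directly into code. In the sketch below the function names and the inputs (σ = 20, E = 5 for the mean; π = 0.5, E = 0.05 for the proportion; z = 1.96 for 95% confidence) are hypothetical, and the result is rounded up because a fractional observation cannot be sampled.

```python
import math

def sample_size_for_mean(z, sigma, e):
    """n = (z * sigma / E)^2, rounded up to the next whole observation."""
    return math.ceil((z * sigma / e) ** 2)

def sample_size_for_proportion(z, pi, e):
    """n = pi * (1 - pi) * (z / E)^2, rounded up."""
    return math.ceil(pi * (1 - pi) * (z / e) ** 2)

# Hypothetical inputs at 95% confidence (z = 1.96)
n_mean = sample_size_for_mean(z=1.96, sigma=20, e=5)
n_prop = sample_size_for_proportion(z=1.96, pi=0.5, e=0.05)

print("sample size for the mean      :", n_mean)
print("sample size for the proportion:", n_prop)
```

When no prior estimate of π is available, π = 0.5 is the most conservative choice, since π(1 – π) is largest at that value.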