KEMBAR78

Is Bigger Better?: An Introduction To Sample Size Calculations | PDF | Type I And Type Ii Errors | Confidence Interval

Open navigation menu

Scribd

0% found this document useful (0 votes)

53 views52 pages

Is Bigger Better?: An Introduction To Sample Size Calculations

We wish to compare the mean weight loss after 6 months for two diets. We expect the high protein diet to result in 5kg greater weight loss than the standard diet. From previous studies, the SD of weight loss is 3kg. We wish to detect this difference with 80% power at the 5% significance level. Required sample size per group? Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology &

Uploaded by

Fakhrul Firdaus

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views52 pages

Is Bigger Better?: An Introduction To Sample Size Calculations

We wish to compare the mean weight loss after 6 months for two diets. We expect the high protein diet to result in 5kg greater weight loss than the standard diet. From previous studies, the SD of weight loss is 3kg. We wish to detect this difference with 80% power at the 5% significance level. Required sample size per group? Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology & Flinders Centre for Epidemiology &

Uploaded by

Fakhrul Firdaus

Copyright

© © All Rights Reserved

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 52

Is bigger better?

An introduction to sample size calculations

Presented by:
Dr Adrian Esterman

Flinders Centre for Epidemiology &

Scenario 1 All studies Scenario 2
Precision Power

Descriptive Hypothesis testing

Sample surveys Simple - 2 groups

Quality control
Complex studies

Flinders Centre for Epidemiology &

Scenario 1
Suppose we want to estimate the proportion
of people in our target population with a
given characteristic:
• The proportion with depression
• The proportion with an artficial leg
• The proportion receiving incorrect medication

Flinders Centre for Epidemiology &

Scenario 1
Example

• My target population is all South

Australians aged 17 and over
• I want to find out what proportion have
an undergraduate degree
• Please raise your hand if you have an
undergraduate degree

Flinders Centre for Epidemiology &

Scenario 1

Target Random
Sample
Population

Infer
Measure
Characteristic

Flinders Centre for Epidemiology &

Scenario 1
True proportion in target population = P
Estimated proportion from sample = p

How likely is it that p is exactly equal to P?

Flinders Centre for Epidemiology &

Scenario 1

We would like 95 times out of 100,

P to fall in this range

0 p 1
Sample

Flinders Centre for Epidemiology &

Scenario 1
The range of plausible values of our sample
proportion p in which the true population
proportion P is likely to fall 95 times out of
100 is called the 95% Confidence Interval
for P

Flinders Centre for Epidemiology &

Scenario 1
95% CI
for P

0 p 1
Sample

Flinders Centre for Epidemiology &

Scenario 1
The 95% CI for p is a measure of how
accurate your sample estimate is of the true
population proportion

95% Confidence
Interval

Sample size

Flinders Centre for Epidemiology &

Scenario 1
Example
We want to estimate the proportion of the
South Australian population with COPD.
We think it will be about 12%.

We would like a 95% CI of p ± 2%.

Flinders Centre for Epidemiology &

Scenario 1

Flinders Centre for Epidemiology &

Flinders Centre for Epidemiology &
Flinders Centre for Epidemiology &
Flinders Centre for Epidemiology &
Flinders Centre for Epidemiology &
Flinders Centre for Epidemiology &
Flinders Centre for Epidemiology &
p=50% with 95% CI 50% +/- 5%

400
Required sample size

300
200
100
0

Size of target population

Flinders Centre for Epidemiology &

Statcalc
Statcalc is included as part of the Epiinfo
suite of programs. This is available free of
charge from:

http://www.cdc.gov/epiinfo/

Flinders Centre for Epidemiology &

Scenario 2

We wish to formally test the difference

between two means or two proportions

Flinders Centre for Epidemiology &

Scenario 2
Three bits of information required to determine
the sample size

Type I & II Variation

errors Clinical
effect

Flinders Centre for Epidemiology &

Process of hypothesis testing Type I &
II errors
1. State a Null hypothesis (H0)
2. State an Alternative hypothesis (HA)
3. Decide on a suitable statistical test based on
the Null hypothesis
4. Calculate the test statistic
5. Check the associated probability (p-value)
6. If p  0.05 reject the Null hypothesis

Flinders Centre for Epidemiology &

Process of hypothesis testing Type I &
II errors
Note
If the Alternative hypothesis is:
parameter 1  parameter 2
we calculate the p-value for a two-sided test

If the Alternative hypothesis is:

parameter 1 > parameter 2
we calculate the p-value for a one-sided test

Flinders Centre for Epidemiology &

Type I &
II errors
What is a p-value?

1. It is a probability, and hence lies between 0 and 1.

2. It is a measure of surprise. In fact how surprised we
are to get a test statistics that large, if the Null
hypothesis were true.

Flinders Centre for Epidemiology &

Type I &
II errors
Type I and II errors
Statistical True state of null hypothesis
decision
Hypothesis true Hypothesis false

Reject Null Type I error Correct (Power)

hypothesis

Accept Null Correct Type II error

hypothesis

Flinders Centre for Epidemiology &

Type I &
What causes a Type I error II errors

• Bias
• Confounding
• Effect modification
• Misclassification

Flinders Centre for Epidemiology &

Type I &
What causes a Type II error II errors

• Sample size too small

• Confounding
• Effect modification
• Misclassification

Flinders Centre for Epidemiology &

Example of setting error levels Type I &
II errors
New drug for lowering cholesterol
• Slightly better efficacy than existing drugs
• Much more expensive than existing drugs

What are the consequences of making a Type I error?

What are the consequences of making a Type II error?

Flinders Centre for Epidemiology &

Example 1 Type I &
II errors
New drug for lowering cholesterol
Slightly better efficacy than existing drugs
• Much more expensive than existing drugs

Conclusion
• Requires stringent Type I error (say 0.01)
• Can managed with relaxed Type II error (say 0.20)

Flinders Centre for Epidemiology &

Example 2 Type I &
II errors

Trial of new brochure to help people quit smoking

• Successful in 20% of smokers
• Negligible cost

What are the consequences of making a Type I error?

What are the consequences of making a Type II error?

Flinders Centre for Epidemiology &

Example 2 Type I &
II errors

Trial of new brochure to help people quit smoking

• Successful in 20% of smokers
• Negligible cost

Conclusion
• Can relax Type I error (say 0.10)
• Requires stringent Type II error (say 0.05)

Flinders Centre for Epidemiology &

Scenario 2
Three bits of information required to determine
the sample size

Type I & II Variation

errors Clinical
effect

Flinders Centre for Epidemiology &

Clinical
Your Alternative hypothesis states effect
that you expect one group to have a
different mean or proportion to the
other group, but how much by?

• From the literature •  15% change

• From a pilot study • Change of  1 SD
• Clinically judgement • Interim analysis

Flinders Centre for Epidemiology &

Scenario 2
Three bits of information required to determine
the sample size

Type I & II Variation

errors Clinical
effect

Flinders Centre for Epidemiology &

Variation

Is there a difference between the two means?

Mean 1 Mean 2

Systolic Blood Pressure

Flinders Centre for Epidemiology &

Variation

It depends upon the range of the distributions

Systolic Blood Pressure

Flinders Centre for Epidemiology &

Variation

To judge whether the difference between

two means is large or small, we compare it
with some measure of the variability of the
distributions

Flinders Centre for Epidemiology &

Variation

Variability

All statistical tests are based on the following ratio:

Difference between parameters

Test Statistic =
v / n

As n  v/n  Test statistic 

Flinders Centre for Epidemiology &

Variation

2
v x Test statistic
n =
Difference

Flinders Centre for Epidemiology &

Variation

The test-statistic is usually:

• Chi-squared for comparing two proportions
• Student’s t for comparing two means
• F-statistic for comparing two variances
• Z-statistic for comparing two correlation coefficients

but may be more complicated

Flinders Centre for Epidemiology &

Scenario 2
Example for two means
We wish to undertake an RCT of an intervention to
improve quality of life. At the end of the study, the
mean PCS of the SF-36 for the control group is
expected to be 35. We expect that in the
intervention group, the mean PCS will be 45. The
standard deviation of the PCS is 10.

Flinders Centre for Epidemiology &

Flinders Centre for Epidemiology &
1 – Type I
Error

1 – Type II
Error

Flinders Centre for Epidemiology &

Scenario 2
Example for two proportions
In a prospective study of hip protectors, we expect
that in the untreated group 10% of elderly people
will suffer a hip fracture. In the treated group we
expect this to reduce to 5%.

Flinders Centre for Epidemiology &

Flinders Centre for Epidemiology &
Winepiscope
Winepiscope is available free of charge
from:

http://www.clive.ed.ac.uk/winepiscope/

Flinders Centre for Epidemiology &

Allowing for dropouts
dropouts
Nearly all studies have at least some subjects who
withdraw, are lost to follow up, or who die

If n is the sample size computed by the program,

and we expect lose d% of subjects, then the
requires sample size is N is given by:

N = (100 x n) / (100 – d)

Flinders Centre for Epidemiology &

Allowing for dropouts
dropouts
Example
The sample size program tells us that we need 120
in each group and we are expecting a 15%
drop out.

N = (100 x 120) / (100 – 15)

= 141

Flinders Centre for Epidemiology &

Is bigger better?

For both descriptive and hypothesis testing

studies, the answer is yes.

1. Increasing the sample size will have no effect

on Type I errors which are largely due to bias
and/or confounding.
2. There is no point in having a larger sample size
than that required for precision or power.

Flinders Centre for Epidemiology &

Is bigger better?

For both descriptive and hypothesis testing

situations, the answer is yes. However:

1. Increasing the sample size will have no effect

on Type I errors which are largely due to bias
and/or confounding.
2. There is no point in having a larger sample size
than that required for precision or power.

Flinders Centre for Epidemiology &

For copies of this presentation

Please email Kylie Thomas at:

kylie.thomas@flinders.edu.au

Flinders Centre for Epidemiology &

You might also like

Sample Size
No ratings yet
Sample Size
6 pages
Measuring The Occurrence of Disease: Dr. Elijah Kakande MBCHB, MPH Department of Public Health
No ratings yet
Measuring The Occurrence of Disease: Dr. Elijah Kakande MBCHB, MPH Department of Public Health
25 pages
Sample - Size - Calculation - 2024-Esnat Chirwa
No ratings yet
Sample - Size - Calculation - 2024-Esnat Chirwa
38 pages
Statatistical Inferences
No ratings yet
Statatistical Inferences
22 pages
Topic 1
100% (1)
Topic 1
37 pages
Internatiional Financial Management: Unit I
No ratings yet
Internatiional Financial Management: Unit I
51 pages
4.3. Parametric & Nonparametric Tests
No ratings yet
4.3. Parametric & Nonparametric Tests
26 pages
Sampling Distributions of Sample Means and Proportions PDF
No ratings yet
Sampling Distributions of Sample Means and Proportions PDF
14 pages
Partial Correlation
No ratings yet
Partial Correlation
28 pages
Basic Statistical Concepts and Methods
100% (1)
Basic Statistical Concepts and Methods
122 pages
Evaluation of Evidence
No ratings yet
Evaluation of Evidence
51 pages
13 Practical Statistics Using SPSS Revision 2009
100% (1)
13 Practical Statistics Using SPSS Revision 2009
60 pages
Albright DADM 6e - PPT - Ch07
No ratings yet
Albright DADM 6e - PPT - Ch07
29 pages
Advanced Educational Stats Guide
No ratings yet
Advanced Educational Stats Guide
25 pages
Sampling Techniques Explained
No ratings yet
Sampling Techniques Explained
53 pages
Choosing The Correct Statistical Test
No ratings yet
Choosing The Correct Statistical Test
26 pages
Albright DADM 6e - PPT - Ch05
No ratings yet
Albright DADM 6e - PPT - Ch05
48 pages
Chapter 9. Test of Hypotheses For A Single Sample
No ratings yet
Chapter 9. Test of Hypotheses For A Single Sample
98 pages
Biostatistics for Nursing Students
100% (1)
Biostatistics for Nursing Students
40 pages
SPSS2 Workshop Handout 20200917
No ratings yet
SPSS2 Workshop Handout 20200917
17 pages
Sample Size Determination: BY DR Zubair K.O
100% (1)
Sample Size Determination: BY DR Zubair K.O
43 pages
Chapter 5 - The Standard Trade Model
No ratings yet
Chapter 5 - The Standard Trade Model
57 pages
The Three MS: Analysis Data
No ratings yet
The Three MS: Analysis Data
5 pages
Understanding Forest Plots in Meta-Analysis
100% (1)
Understanding Forest Plots in Meta-Analysis
16 pages
Chapter 1 Statistics
No ratings yet
Chapter 1 Statistics
41 pages
Statistics for Educators & Analysts
100% (1)
Statistics for Educators & Analysts
5 pages
1MATH - MW - Unit 4.1 (Introductory Topics in Statistics)
100% (1)
1MATH - MW - Unit 4.1 (Introductory Topics in Statistics)
30 pages
2.2 Hyphothesis Testing (Continuous)
No ratings yet
2.2 Hyphothesis Testing (Continuous)
35 pages
Data Validation & Research
No ratings yet
Data Validation & Research
41 pages
What Is Hypothesis Testing
100% (1)
What Is Hypothesis Testing
32 pages
Time Series Characteristic
No ratings yet
Time Series Characteristic
72 pages
Data Types
No ratings yet
Data Types
8 pages
Master of Statistics Program Guide
100% (1)
Master of Statistics Program Guide
24 pages
Stats Annova Two Way
No ratings yet
Stats Annova Two Way
4 pages
2.1 Descriptive Statistics Contd
No ratings yet
2.1 Descriptive Statistics Contd
20 pages
Multivariate Analysis IBS
No ratings yet
Multivariate Analysis IBS
20 pages
Types of Data Analysis Explained
100% (1)
Types of Data Analysis Explained
28 pages
Two-Variable Regression Analysis
100% (1)
Two-Variable Regression Analysis
46 pages
Biostatistics Introduction and Variables
No ratings yet
Biostatistics Introduction and Variables
3 pages
Statistics For Health Research: Non-Parametric Methods
100% (1)
Statistics For Health Research: Non-Parametric Methods
56 pages
Sample MT 1 Mckey
No ratings yet
Sample MT 1 Mckey
6 pages
Epidemiology: Understanding Selection Bias
No ratings yet
Epidemiology: Understanding Selection Bias
4 pages
Types of Sampling
100% (1)
Types of Sampling
3 pages
T Test
No ratings yet
T Test
21 pages
Bayes' Law and Probability Concepts
No ratings yet
Bayes' Law and Probability Concepts
7 pages
A Lesson 1 Introduction To Statistics & SPSS
100% (1)
A Lesson 1 Introduction To Statistics & SPSS
8 pages
Anova Notes
No ratings yet
Anova Notes
7 pages
Cohort Study - 09 - 12 - 24
No ratings yet
Cohort Study - 09 - 12 - 24
44 pages
Hotelling T-Square
No ratings yet
Hotelling T-Square
16 pages
Cross Sectional Studies 1
No ratings yet
Cross Sectional Studies 1
49 pages
Sampling Techniques
No ratings yet
Sampling Techniques
6 pages
Multiple Regression in SPSS
No ratings yet
Multiple Regression in SPSS
17 pages
Types of Statistical Analysis
No ratings yet
Types of Statistical Analysis
2 pages
Axiomatic Probability in Engineering
No ratings yet
Axiomatic Probability in Engineering
6 pages
Advanced Statistical Distributions
No ratings yet
Advanced Statistical Distributions
13 pages
Statistical Significance & Association
No ratings yet
Statistical Significance & Association
21 pages
Statistical Analysis of Blood Glucose
No ratings yet
Statistical Analysis of Blood Glucose
253 pages
Hypothesis Testing & P-Value Guide
No ratings yet
Hypothesis Testing & P-Value Guide
16 pages
Epidemiology
No ratings yet
Epidemiology
43 pages
Module 3b - Random Sampling and Sampling Error
No ratings yet
Module 3b - Random Sampling and Sampling Error
32 pages
Hypothesis Tests About The Mean and Proportion: Prem Mann, Introductory Statistics, 9/E
No ratings yet
Hypothesis Tests About The Mean and Proportion: Prem Mann, Introductory Statistics, 9/E
126 pages
Kruskal Wallis Test PDF
100% (2)
Kruskal Wallis Test PDF
2 pages
GROUP G RESEARCH
No ratings yet
GROUP G RESEARCH
31 pages
Module 5
No ratings yet
Module 5
53 pages
Final Exam
No ratings yet
Final Exam
13 pages
All Table PDF
100% (1)
All Table PDF
101 pages
ANOVA Analysis: Tests and Results
No ratings yet
ANOVA Analysis: Tests and Results
40 pages
Embry 7.2 Assignment
No ratings yet
Embry 7.2 Assignment
2 pages
Introductory Statistics Exploring The World Through Data 1st Edition Gould Test Bank Download
100% (6)
Introductory Statistics Exploring The World Through Data 1st Edition Gould Test Bank Download
44 pages
Du Lieu Thuc Hanh Anova
No ratings yet
Du Lieu Thuc Hanh Anova
28 pages
Research Methods Student Activity 5
No ratings yet
Research Methods Student Activity 5
8 pages
Hypothesis Testing: Erma M. Orada
No ratings yet
Hypothesis Testing: Erma M. Orada
20 pages
Inferential Statistics: Positive Correlation
No ratings yet
Inferential Statistics: Positive Correlation
9 pages
The Statistics Tutor's Quick Guide To Commonly Used Statistical Tests
No ratings yet
The Statistics Tutor's Quick Guide To Commonly Used Statistical Tests
53 pages
Sample Size Formula For A Comparative Study PDF
88% (8)
Sample Size Formula For A Comparative Study PDF
2 pages
Lampiran Frequency Table: Pengetahuan
No ratings yet
Lampiran Frequency Table: Pengetahuan
8 pages
SPSS t-Test Analysis Guide
No ratings yet
SPSS t-Test Analysis Guide
4 pages
Chi Square, Lambda
No ratings yet
Chi Square, Lambda
20 pages
7) M A1 Hypothesis Testing Notes
No ratings yet
7) M A1 Hypothesis Testing Notes
54 pages
Statistics for Business Students
No ratings yet
Statistics for Business Students
5 pages
Math 102 PT
No ratings yet
Math 102 PT
4 pages
Levine Smume7 Bonus Ch12
No ratings yet
Levine Smume7 Bonus Ch12
12 pages
Insufficient Evidence To Reject H .: Sections 9.3, 9.4, 9.5, Problems 42, 43, 46, 49, 52, 53, 54, 56, 66
No ratings yet
Insufficient Evidence To Reject H .: Sections 9.3, 9.4, 9.5, Problems 42, 43, 46, 49, 52, 53, 54, 56, 66
4 pages
SLNotes 09
No ratings yet
SLNotes 09
33 pages
Q4 Statistics and Probability 11 - Module 1
No ratings yet
Q4 Statistics and Probability 11 - Module 1
18 pages
Chapter 10
No ratings yet
Chapter 10
35 pages
Hypothesis Testing: 10.1 Testing The Mean of A Normal Population
No ratings yet
Hypothesis Testing: 10.1 Testing The Mean of A Normal Population
13 pages
Hypothesis Testing - by DR - Giridhar K.V.
No ratings yet
Hypothesis Testing - by DR - Giridhar K.V.
43 pages
STAT 166 Hypothesis Testing Guide
No ratings yet
STAT 166 Hypothesis Testing Guide
6 pages
Ego's Fflags
No ratings yet
Ego's Fflags
11 pages