0% found this document useful (0 votes)

94 views24 pages

BIOM4025 - Statistical Modelling - QA Session 2

This document discusses a Q&A session on statistical modeling. It addresses questions about using R vs RStudio, fixing a broken URL, what to include in scientific papers, explaining variance and degrees of freedom, and clarifying the differences between standard deviation, standard error, and confidence intervals. The document also provides examples of the normal and t-distributions and discusses when to standardize data and how critical values are determined.

Uploaded by

Lauren Joslyn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

94 views24 pages

BIOM4025 - Statistical Modelling - QA Session 2

Uploaded by

Lauren Joslyn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

BIOM4025 - Statistical Modelling - Q&A session 2

Data and distributions

Erik Postma / Centre for Ecology and Conservation / University of Exeter
Today
Data and distributions
Different types of data

Mean, median and mode

Variance, standard deviation and standard error

The normal distribution

Probability density functions
Probabilistic statements about data and estimates derived from these
data

Standard normal (or 𝑧 ) distribution

𝑡 -distribution
3/24
Questions about the lecture
Questions about the lecture
" Should we use R or Rstudio?

Both!
‘R’ does the calculations
‘RStudio’ makes ‘R’ easier to use

5/24
Questions about the lecture
'https://shiny01.cles.ex.ac.uk/biom4025/app_02_1_1/
I can’t seem to get the links for the

(https://shiny01.cles.ex.ac.uk/biom4025/app_02_1_1/) to work, I’ve tried

safari and google chrome but no luck unfortunately. Not sure if its just me?

Sorry, I made a typo in the URL. Should be fixed now!

Next time, post questions like this in the Questions about the module
channel, where I will see them earlier.

6/24
Questions about the lecture
'analysis
If we were to write a scientific paper, would we do stuff like this in the
or is it just for us to understand the principles of stats?

See Practicals 2-5 for examples of what to write in a paper.

Means, variances, standard deviations, standard errors and confidence
intervals are all commonly reported.
Degrees of freedom, 𝑧 and 𝑡 -values are central to most statistical tests
as they will provide you with the p-value. More in Lecture 3!

7/24
Questions about the lecture
'‘ThisWhile explaining the n - 1 part of the equation for variation you say
is because we have first estimated the mean from our data’. You make
reference to it again saying ‘We lose one degree of freedom because we
have estimated the mean from the data’. I didn’t quite understand what that
meant.

'degrees
could you please explain the concept of variance and in particular the
of freedom again

'in variation.
Please could you further explain why we subtract 1 from the sample size

8/24
Variance
𝑛 ⎯⎯⎯ 2
2
∑𝑖=1 (𝑥𝑖 − 𝑥)
𝜎𝑥 =
𝑛−1

The mean squared deviation from the estimated mean is always larger
than the mean squared deviation from the true mean
Our estimate of the true mean will explain some of the variance around
the true mean
By dividing by 𝑛 − 1 we account for the fact that we estimate the mean
from our data and we don’t use the (unknown) true mean

9/24
Degrees of freedom

10/24
Degrees of freedom
The number of independent values that can vary freely
For example:
5 values: 6 , 4 , 5 , 2 , 3
6+4+5+2+3 20
Mean = 5
= 5
=4

If you know four out of five values and the mean, you know the fifth
value
Every parameter we estimate from our data constrains the value of an
observation
Degrees of freedom (d.f.) is sample size minus number of parameters
estimated from the data
11/24
Questions about the lecture
' Could you explain the variance histogram?
' When to use standard error?
'errorcanshould
you talk more about how and when standard deviation or standard
be applied to data and on graphs? can you go over confidence
interval of the mean again?

12/24
Questions about the lecture
'deviation
I am having a hard time understanding the difference between standard
and standard error - do you mind going over them again?

'dataCanandyouwhen
go over when you would use standard error when reporting
you would use standard deviation?

'thanWhydegrees
do we use sample size as the denominator for standard error rather
of freedom?

13/24
Estimating the mean
Sample size:
1 30 100

1 11 21 31 41 51 61 71 81 91 100

Number of repetitions:
1 500 1,000

1 101 201 401 601 801 1,000

Add samples one at a

time

Draw new sample

Error bars:

None

14/24
Standard deviation
‾∑
‾‾‾‾‾‾‾‾‾‾‾‾
𝑛 ⎯⎯⎯ 2‾

√
𝑖=1 (𝑥𝑖 − 𝑥)
𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 (𝑜𝑟 𝜎) =
𝑛−1

= √‾𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
‾‾‾‾‾‾‾‾‾‾‾‾‾
(𝑜𝑟 𝜎 2‾)

Standard deviation: Measure of the amount of variation among

individuals in a sample

15/24
Standard error
𝑆𝐷
SE𝑥¯ =
√𝑛
Standard error: Measure of the uncertainty around an estimate

IF we were to repeat our experiment many times, the standard error

would be the standard deviation of our estimates

In practice we have just a single estimate, so we infer the standard error

from the variation in our data (in the case of the mean, from the
standard deviation)

16/24
Standard deviation vs. standard error
Only report standard deviation if you want to quantify the amount of
variation among observations (e.g. individuals)
Report standard errors whenever you are presenting estimats, e.g. of
the mean, the regression coefficient, or the difference between two
means.

17/24
Questions about the lecture
'andCanwhether
you please explain the bit about confidence interval of the mean
a 0 is included again?

95% confidence interval gives us the range that, with a probability of

95%, contains the true mean

There is 5% probability that the true mean lies outside of this range

If the 95% confidence interval excludes zero, the probability that the
true mean is zero, is less than 5%

Testing a mean against zero is usually not very interesting, but the
same logic applies to all estimates (e.g. of a slope or a difference
between two means) 18/24
The normal distribution
True mean: The mean of 𝑥 :
-100 0 100

[1] -0.2988209
-100 -60 -40 -20 0 20 40 60 80 100

True variance:
1 10 100 The variance of 𝑥 :

1 11 21 31 41 51 61 71 81 91 100
[1] 8.101418
Sample size
100 1,000
The standard deviation of 𝑥 :
10 109 208 406 604 802 1,000

Add fitted normal [1] 2.846299

distribution to plot
The standard error of the mean of 𝑥 :

[1] 0.2860638

95% confidence interval of mean of

𝑥:

[1] -0.8595059 0.2618642

19/24
Questions about the lecture
' Do we always standardise data?

No, but you can and it can be useful sometimes

We standardise parameters estimated from our data (e.g. slope,
difference among groups) all the time

Express slope or difference in standard errors units

Allows to obtain p-value using standard normal or 𝑡 distribution
What is the probability of finding a difference equal to or larger than 𝑥
standard errors if the true difference is 0?

20/24
Questions about the lecture
'decided?
What is the definition of a critical value? How is the critical value

The value of 𝑧 (and −𝑧 ) or 𝑡 (and −𝑡 ) for which you would like the area
under the curve
You decide on the critical value you want to use
For significance testing and a significance threshold of 5%, it is the
value for which the area under the curve between −𝑧 and 𝑧 (or −𝑡 and 𝑡
) is 0.95

21/24
t-distribution
𝑒𝑠𝑡𝑖𝑚𝑎𝑡𝑒
𝑡=
𝑠. 𝑒.
Critical value:
0 1.96 4

0 0.4 0.8 1.2 1.6 2 2.4 2.8 3.2 3.6 4

Sample size:
3 200

3 23 43 63 83 103 123 143 163 183 200

Invert?

Size of shaded area (i.e. probability):

[1] 0.949

22/24
Questions about the lecture
'1.96xS.E
In the normal distribution the confidence interval is mean(x) +/-
as 95% of data falls between +/- 1.96 S.E, but as each t-
distribution is different and there is no set value for where 95% of the data
fall, how do you work out the 95% confidence interval if the data is instead
from a t distribution?

Quick and dirty: Mean ± 2 × standard error

Exact confidence interval depends on sample size
In R: use the confint() function

23/24
Questions about the lecture
'butI Iunderstood the math behind the confidence interval and t-distributions
didn’t quite understand what it was useful for in real-life. Can we see
an example?

See Lecture 3.

24/24

Che 411 L2
No ratings yet
Che 411 L2
22 pages
2006 Geog090 Week06 Lecture01 CentralLimitTheorem
No ratings yet
2006 Geog090 Week06 Lecture01 CentralLimitTheorem
37 pages
Statistics II - Confidence Intervals, Estimation
No ratings yet
Statistics II - Confidence Intervals, Estimation
80 pages
Che 411 L2
No ratings yet
Che 411 L2
31 pages
Statistical Methods in Social Sciences
No ratings yet
Statistical Methods in Social Sciences
69 pages
Statistics ESCP
No ratings yet
Statistics ESCP
383 pages
Error and Uncertainty: General Statistical Principles
No ratings yet
Error and Uncertainty: General Statistical Principles
8 pages
Lecture 4
No ratings yet
Lecture 4
38 pages
Chapter 5 - RM
No ratings yet
Chapter 5 - RM
22 pages
Unit 8 Textbook
0% (2)
Unit 8 Textbook
47 pages
Week 4 Bioscience
No ratings yet
Week 4 Bioscience
37 pages
Statistics Practise Questions
No ratings yet
Statistics Practise Questions
19 pages
Lectures Up To Final Assignments
No ratings yet
Lectures Up To Final Assignments
33 pages
Lecture 6 Estimation
No ratings yet
Lecture 6 Estimation
8 pages
Precision & Accuracy in Experiments
No ratings yet
Precision & Accuracy in Experiments
42 pages
Statistics 1 AQA Revision Notes
No ratings yet
Statistics 1 AQA Revision Notes
7 pages
03 Estimation IITB PDF
No ratings yet
03 Estimation IITB PDF
58 pages
Confidence Interval Estimation Guide
No ratings yet
Confidence Interval Estimation Guide
61 pages
Lecture 3
No ratings yet
Lecture 3
14 pages
Basic Statistical Concepts Review
No ratings yet
Basic Statistical Concepts Review
8 pages
Introduction To Uncertainty: Asma Khalid and Muhammad Sabieh Anwar
No ratings yet
Introduction To Uncertainty: Asma Khalid and Muhammad Sabieh Anwar
36 pages
Bio Statistics
No ratings yet
Bio Statistics
97 pages
Inbound 588667172330667162
No ratings yet
Inbound 588667172330667162
30 pages
APP601S Chapter 4 - Data Handling in Anal Chem
No ratings yet
APP601S Chapter 4 - Data Handling in Anal Chem
42 pages
Chapter1 Statistic
No ratings yet
Chapter1 Statistic
33 pages
RP Notes Unit 4 - Distribution Fucntions
No ratings yet
RP Notes Unit 4 - Distribution Fucntions
5 pages
Basic Probability Reference Sheet: February 27, 2001
No ratings yet
Basic Probability Reference Sheet: February 27, 2001
8 pages
Measures of Variability Lec 7: DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
No ratings yet
Measures of Variability Lec 7: DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
48 pages
Error Analysis - Statistics: - Accuracy and Precision - Individual Measurement Uncertainty
No ratings yet
Error Analysis - Statistics: - Accuracy and Precision - Individual Measurement Uncertainty
33 pages
Alistair Benson Quantitative Analytical Methods 2
No ratings yet
Alistair Benson Quantitative Analytical Methods 2
75 pages
Data Types:: Basic Statistics
No ratings yet
Data Types:: Basic Statistics
23 pages
Ci 1
No ratings yet
Ci 1
47 pages
Random Variables & Sampling
100% (1)
Random Variables & Sampling
5 pages
Review of Chapters 1-5
No ratings yet
Review of Chapters 1-5
21 pages
A (Very) Brief Review of Statistical Inference: 1 Some Preliminaries
No ratings yet
A (Very) Brief Review of Statistical Inference: 1 Some Preliminaries
9 pages
2NUBIONormalCurve2T24 25
No ratings yet
2NUBIONormalCurve2T24 25
50 pages
Stats
No ratings yet
Stats
3 pages
Theory Term2
No ratings yet
Theory Term2
9 pages
Confidence Intervals PDF
No ratings yet
Confidence Intervals PDF
5 pages
PSYCH 240: Statistics For Psychologists: Interval Estimation: Understanding The T Distribution
No ratings yet
PSYCH 240: Statistics For Psychologists: Interval Estimation: Understanding The T Distribution
44 pages
Liv-Stats 2
No ratings yet
Liv-Stats 2
15 pages
A Session 18 2021
No ratings yet
A Session 18 2021
36 pages
اسايمنت
No ratings yet
اسايمنت
28 pages
Basic Statistics
No ratings yet
Basic Statistics
23 pages
5-6.sampling Error and Confidence Interval
No ratings yet
5-6.sampling Error and Confidence Interval
74 pages
CHAPTERS
No ratings yet
CHAPTERS
17 pages
Lecture 5 - 6-260
No ratings yet
Lecture 5 - 6-260
10 pages
Biostatistics Revision DR - NJ
No ratings yet
Biostatistics Revision DR - NJ
67 pages
Introduction To Biostatistics - 20250506
No ratings yet
Introduction To Biostatistics - 20250506
40 pages
Chapter 8 & (Part) Chapter 12: Distribution of Sample Means: Chapters 8 & 12: Page 1
No ratings yet
Chapter 8 & (Part) Chapter 12: Distribution of Sample Means: Chapters 8 & 12: Page 1
14 pages
Confidence Intervals for Estimation
No ratings yet
Confidence Intervals for Estimation
0 pages
Fstats ch2 PDF
No ratings yet
Fstats ch2 PDF
16 pages
Estimation, Standard Errors and Confidence Limits: 3.1 Sampling Variation
No ratings yet
Estimation, Standard Errors and Confidence Limits: 3.1 Sampling Variation
7 pages
Confidence Intervals Concept
No ratings yet
Confidence Intervals Concept
10 pages
Normal Probability Distribution
No ratings yet
Normal Probability Distribution
32 pages
Understanding Frequency Distributions
No ratings yet
Understanding Frequency Distributions
9 pages
Question: How Do We Estimate Precision Error?
No ratings yet
Question: How Do We Estimate Precision Error?
38 pages
Internal Paper
No ratings yet
Internal Paper
20 pages
Reliance JIO
No ratings yet
Reliance JIO
69 pages
STATISTICS
No ratings yet
STATISTICS
28 pages
Developing Pragmatic Competence (Tugas Semantics)
100% (1)
Developing Pragmatic Competence (Tugas Semantics)
31 pages
Basic Statistics For Lms
0% (1)
Basic Statistics For Lms
23 pages
USP General Chapter 41: Balance Requirements
100% (1)
USP General Chapter 41: Balance Requirements
81 pages
The Managerial Determinants of Accounting Conservatism During COVID-19 Era: Evidence From Saudi Arabia
No ratings yet
The Managerial Determinants of Accounting Conservatism During COVID-19 Era: Evidence From Saudi Arabia
8 pages
AI Tools For Learning and Development
No ratings yet
AI Tools For Learning and Development
34 pages
Quantitative Methods 1: Key Concepts
No ratings yet
Quantitative Methods 1: Key Concepts
35 pages
Measures of Reliability in Sports Medicine and Science: Will G. Hopkins
No ratings yet
Measures of Reliability in Sports Medicine and Science: Will G. Hopkins
15 pages
Temp File MRCGP Revision Guide FreeBook2
No ratings yet
Temp File MRCGP Revision Guide FreeBook2
73 pages
Nifty Analysis & Religare Strategy
98% (53)
Nifty Analysis & Religare Strategy
38 pages
Glass Recycling in Cement Production An
No ratings yet
Glass Recycling in Cement Production An
7 pages
Balance Accuracy and Repeatability Standards
No ratings yet
Balance Accuracy and Repeatability Standards
1 page
Strategies For Business Education in An Era of Economic Uncertainties in Nigeria
No ratings yet
Strategies For Business Education in An Era of Economic Uncertainties in Nigeria
21 pages
Assignment 5040
No ratings yet
Assignment 5040
15 pages
Statistical Analysis
No ratings yet
Statistical Analysis
50 pages
FX Exposure for Finance Students
No ratings yet
FX Exposure for Finance Students
12 pages
Risk Factors For Dysmenorrhea Among Young Adult Female University Students
No ratings yet
Risk Factors For Dysmenorrhea Among Young Adult Female University Students
6 pages
Specimen Paper CS1
No ratings yet
Specimen Paper CS1
7 pages
Basic Statistics For The Health Sciences 3rd Edition by Jan Kuzma 0874845874 9780874845877 - Download The Ebook Today and Own The Complete Version
No ratings yet
Basic Statistics For The Health Sciences 3rd Edition by Jan Kuzma 0874845874 9780874845877 - Download The Ebook Today and Own The Complete Version
55 pages
Fin320 Simulation 2020 July
100% (1)
Fin320 Simulation 2020 July
40 pages
Blindfold Chess
100% (1)
Blindfold Chess
39 pages
Grease Analysis - Monitoring Grease Serviceability
No ratings yet
Grease Analysis - Monitoring Grease Serviceability
6 pages
CS ELEC 4 Midterm Module
No ratings yet
CS ELEC 4 Midterm Module
59 pages
H2 Mathematics - Use of Graphing Calculator TI84 (Binomial and Normal Distribution)
No ratings yet
H2 Mathematics - Use of Graphing Calculator TI84 (Binomial and Normal Distribution)
3 pages
BMS 7 TH Model
No ratings yet
BMS 7 TH Model
3 pages
Crunchit! 2.0 Quick Start Guide: Texas A&M University
No ratings yet
Crunchit! 2.0 Quick Start Guide: Texas A&M University
26 pages
Center, Spread and Shape of Distribution
No ratings yet
Center, Spread and Shape of Distribution
11 pages
Lean Six Sigma Green Belt Project
No ratings yet
Lean Six Sigma Green Belt Project
25 pages
GRR Study MSA Template
No ratings yet
GRR Study MSA Template
21 pages
Topic 8 - Risk & Return - Slides
No ratings yet
Topic 8 - Risk & Return - Slides
13 pages

BIOM4025 - Statistical Modelling - QA Session 2

Uploaded by

BIOM4025 - Statistical Modelling - QA Session 2

Uploaded by

BIOM4025 - Statistical Modelling - Q&A session 2

Data and distributions

Mean, median and mode

The normal distribution

Standard normal (or 𝑧 ) distribution

(https://shiny01.cles.ex.ac.uk/biom4025/app_02_1_1/) to work, I’ve tried

Sorry, I made a typo in the URL. Should be fixed now!

See Practicals 2-5 for examples of what to write in a paper.

1 101 201 401 601 801 1,000

Add samples one at a

Draw new sample

Standard deviation: Measure of the amount of variation among

IF we were to repeat our experiment many times, the standard error

In practice we have just a single estimate, so we infer the standard error

95% confidence interval gives us the range that, with a probability of

Add fitted normal [1] 2.846299

95% confidence interval of mean of

[1] -0.8595059 0.2618642

No, but you can and it can be useful sometimes

Express slope or difference in standard errors units

0 0.4 0.8 1.2 1.6 2 2.4 2.8 3.2 3.6 4

3 23 43 63 83 103 123 143 163 183 200

Size of shaded area (i.e. probability):

Quick and dirty: Mean ± 2 × standard error

You might also like