A Marginalisation Paradox Example
Dennis Prangle
28th October 2009
Overview
Bayesian inference recap
Example of error due to a marginalisation paradox
(Very) rough overview of general issues
Part I
Bayesian Inference
Bayesian Inference
Prior distribution on parameters θ: p(θ)
Model for the data X : f (X |θ)
Posterior distribution is (using Bayes’ theorem):
\[
  f(\theta \mid X) = \frac{p(\theta)\, f(X \mid \theta)}{\int p(\theta)\, f(X \mid \theta)\, d\theta}
\]
n.b. p(θ) only needed up to proportionality
Bayesian inference is performed using computational Monte Carlo methods (e.g. MCMC)
Typically the normalising constant of p(θ) is also not needed, since these methods use only ratios (see the sketch below)
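As a minimal sketch of why only proportionality matters, the following toy example (my own; a hypothetical model with a single observation x, likelihood X | θ ~ N(θ, 1) and prior θ ~ N(0, 10²)) computes the posterior on a grid: any constant multiplying the prior cancels in the normalisation.

```python
import numpy as np

# Toy illustration (hypothetical model): X | theta ~ N(theta, 1),
# prior theta ~ N(0, 10^2), a single observation x.
x = 1.3
theta = np.linspace(-40, 40, 4001)            # grid of parameter values

def posterior_on_grid(prior_const):
    # prior known only up to the constant 'prior_const' -- it cancels on normalising
    prior = prior_const * np.exp(-theta**2 / (2 * 10**2))
    lik = np.exp(-(x - theta)**2 / 2)
    post = prior * lik
    return post / np.trapz(post, theta)       # numerical normalisation

for c in (1.0, 37.0):
    post = posterior_on_grid(c)
    print(c, np.trapz(theta * post, theta))   # same posterior mean whatever c is
```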
Improper Prior
A probability density p(θ) (roughly speaking!) satisfies:
1. p(θ) ≥ 0
2. ∫ p(θ) dθ = 1
An improper prior doesn't require condition 2
Instead can have ∫ p(θ) dθ = ∞
Example: p(θ) = 1 “improper uniform”
Sometimes used to represent prior ignorance
Resulting posterior often a proper distribution
⇒ meaningful conclusions (. . . or are they?!)
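A standard textbook illustration (my own addition, assuming a single observation from a normal model with known variance): the improper uniform prior still yields a proper posterior.

\[
  p(\theta) \propto 1, \quad X \mid \theta \sim N(\theta, 1)
  \;\;\Rightarrow\;\;
  f(\theta \mid X = x) \propto \exp\!\left(-\tfrac{1}{2}(x - \theta)^2\right)
  \;\;\Rightarrow\;\;
  \theta \mid X = x \sim N(x, 1),
\]

a proper distribution, even though \(\int p(\theta)\, d\theta = \infty\).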
Part II
Example: Tuberculosis in San Francisco
Background: Tuberculosis
Tuberculosis is an infectious disease spread by bacteria
Epidemiological interest lies in estimating rates of
transmission and recovery
It is conjectured that data on bacterial mutations provides extra information → more accurate inference
Background: Paper
Tanaka et al (2006) investigated a Tuberculosis outbreak in
San Francisco in 1991/2
473 samples of Tuberculosis bacteria taken at a particular date
Genotyped according to a particular genetic marker
Samples split into clusters which share the same genotype
Cluster size 1 2 3 4 5 8 10 15 23 30
Number of clusters 282 20 13 4 2 1 1 1 1 1
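For concreteness, the cluster-size table above can be encoded as below (a minimal sketch, my own code); the totals that follow, 473 sampled cases in 326 distinct genotypes, come directly from the table.

```python
# Cluster-size table from the San Francisco data (Tanaka et al 2006)
cluster_sizes   = [1, 2, 3, 4, 5, 8, 10, 15, 23, 30]
number_clusters = [282, 20, 13, 4, 2, 1, 1, 1, 1, 1]

n_cases     = sum(s * n for s, n in zip(cluster_sizes, number_clusters))
n_genotypes = sum(number_clusters)
print(n_cases, n_genotypes)   # 473 sampled cases falling into 326 distinct genotypes
```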
Model: Underlying disease process
Assume initially there is one case
3 event types: birth, death, mutation (→ new genotype)
Suppose there are N cases at some time
Rate of births: αN
Rate of deaths: δN
Rate of mutations: θN
Defines a continuous time Markov process model
We have no data on event times, so can reduce to a discrete-time Markov chain (the embedded jump chain)
Model: Producing data
Run the disease process until there are 10,000 cases
(If the disease dies out, rerun)
Take a simple random sample of 473 cases
Convert to data on genotype frequencies
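A rough sketch of the data-generating process just described (my own stripped-down implementation, not the authors' code; the rate values in the commented example call are placeholders). It simulates the embedded discrete-time chain until 10,000 cases, rerunning if the disease dies out, then takes a simple random sample of 473 cases and returns genotype frequencies.

```python
import random

def simulate_data(alpha, delta, theta, N_stop=10_000, n_sample=473, seed=0):
    """Simulate the birth-death-mutation process; return sampled genotype counts."""
    rng = random.Random(seed)
    while True:                                   # rerun if the outbreak dies out
        cases = [0]                               # genotype label of each current case
        next_type = 1
        while 0 < len(cases) < N_stop:
            u = rng.random() * (alpha + delta + theta)
            i = rng.randrange(len(cases))         # event happens to a uniformly chosen case
            if u < alpha:                         # birth: offspring inherits the genotype
                cases.append(cases[i])
            elif u < alpha + delta:               # death: remove the chosen case
                cases.pop(i)
            else:                                 # mutation: chosen case gets a new genotype
                cases[i] = next_type
                next_type += 1
        if len(cases) >= N_stop:
            break
    sample = rng.sample(cases, n_sample)          # simple random sample of 473 cases
    counts = {}
    for g in sample:                              # convert to genotype frequencies
        counts[g] = counts.get(g, 0) + 1
    return sorted(counts.values(), reverse=True)

# Example with placeholder rates: print(simulate_data(1.0, 0.5, 0.2))
```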
Prior
Some information on θ from previous studies
Prior distribution N(0.198, 0.06735²) chosen
Corresponding density denoted p(θ)
Ignorance for other parameters
Proposed (improper) overall prior:
\[
  p(\alpha, \delta, \theta) =
  \begin{cases}
    p(\theta) & \text{if } 0 < \delta < \alpha \\
    0 & \text{otherwise}
  \end{cases}
\]
Motivation:
Marginal for θ is p(θ)
Marginal for (α, δ) is improper uniform:
\[
  \begin{cases}
    1 & \text{if } 0 < \delta < \alpha \\
    0 & \text{otherwise}
  \end{cases}
\]
Restriction α > δ ⇒ zero prior probability on parameters
where epidemic usually dies out
Results
See Tanaka et al paper
Note change from prior
Parameter Redundancy
All parameters are rates
Multiplying them all by a constant only changes how quickly events occur
But event times are irrelevant to our model (no time data)
Model is over-parameterised:
(α, δ, θ) and (kα, kδ, kθ) give same likelihood
Reparameterisation
Reparameterise to:
a = α/(α + δ + θ)
d = δ/(α + δ + θ)
θ_new = θ
Motivation: keep θ since we have prior info for it
a and d tell us everything about relative rates
Only θ carries info on absolute rates. . .
. . . and θ carries info only on absolute rates
Parameter constraints:
α, δ, θ > 0 ⇒ a, d, θ_new > 0
and also a + d < 1
Requirement α > δ in prior ⇒ a > d
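A one-line justification (my own spelling-out of the slide's argument) of why the likelihood depends on (a, d) only: given that an event occurs, its type probabilities in the embedded discrete-time chain are exactly a, d and 1 − a − d.

\[
  P(\text{birth}) = \frac{\alpha N}{(\alpha + \delta + \theta)N} = a, \qquad
  P(\text{death}) = \frac{\delta N}{(\alpha + \delta + \theta)N} = d, \qquad
  P(\text{mutation}) = 1 - a - d,
\]

so (kα, kδ, kθ) gives the same event-type probabilities and hence the same likelihood f(a, d).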
Paradox (intuitive)
In the new parameterisation, θ alone carries the absolute-rate information
But the data contain no information on absolute rates
So the (marginal) posterior for θ should equal the prior?????
Analytic Results 1: Jacobian
Recall:
a = α/(α + δ + θ)
d = δ/(α + δ + θ)
θ_new = θ
Solve to give:
α = a θ_new/(1 − a − d)
δ = d θ_new/(1 − a − d)
θ = θ_new
Differentiate for the Jacobian:
\[
  J = \begin{pmatrix}
    \dfrac{\theta_{\mathrm{new}}(1-d)}{(1-a-d)^2} & \dfrac{a\,\theta_{\mathrm{new}}}{(1-a-d)^2} & \dfrac{a}{1-a-d} \\[6pt]
    \dfrac{d\,\theta_{\mathrm{new}}}{(1-a-d)^2} & \dfrac{\theta_{\mathrm{new}}(1-a)}{(1-a-d)^2} & \dfrac{d}{1-a-d} \\[6pt]
    0 & 0 & 1
  \end{pmatrix},
  \qquad
  |J| = \theta_{\mathrm{new}}^2\, (1-a-d)^{-3}
\]
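The determinant can be checked symbolically; a quick sketch using sympy (my own addition, not part of the original talk):

```python
import sympy as sp

a, d, t = sp.symbols('a d theta_new', positive=True)

# Inverse transformation (a, d, theta_new) -> (alpha, delta, theta)
alpha = a * t / (1 - a - d)
delta = d * t / (1 - a - d)
theta = t

J = sp.Matrix([alpha, delta, theta]).jacobian([a, d, t])
print(sp.simplify(J.det()))   # theta_new**2/(1 - a - d)**3, up to an equivalent form
```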
Analytic Results 2: Reparameterised prior
Recall p(α, δ, θ) = p(θ)I [0 < δ < α]
(where p(θ) is a normal pdf)
Then:
\[
  p(a, d, \theta_{\mathrm{new}}) = p(\theta)\, I[0 < \delta < \alpha]\, |J|
  = \theta_{\mathrm{new}}^2\, p(\theta_{\mathrm{new}})\, I[0 < d < a]\, (1 - a - d)^{-3}
\]
Analytic Results 3: Posterior
Recall the likelihood depends on (a, d) only
i.e. f(X | α, δ, θ) can be written as f(a, d)
So posterior is:
\[
  \pi(a, d, \theta_{\mathrm{new}}) \propto \theta_{\mathrm{new}}^2\, p(\theta_{\mathrm{new}})\, I[0 < d < a]\, (1 - a - d)^{-3}\, f(a, d)
\]
This factorises into a function of θ_new times a function of (a, d), so if it is proper, the posterior marginal for θ_new is:
\[
  \pi(\theta_{\mathrm{new}}) \propto \theta_{\mathrm{new}}^2\, p(\theta_{\mathrm{new}})
\]
Matches the results graph (see the numerical sketch below)
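To see the size of the effect, one can compare the stated prior N(0.198, 0.06735²) with a density proportional to θ² p(θ) on θ > 0; a quick numerical sketch (my own, not from the paper):

```python
import numpy as np

mu, sigma = 0.198, 0.06735                       # prior for theta (from previous studies)
theta = np.linspace(1e-6, 0.6, 2000)

prior = np.exp(-(theta - mu)**2 / (2 * sigma**2))
prior /= np.trapz(prior, theta)

tilted = theta**2 * prior                        # what the posterior marginal behaves like
tilted /= np.trapz(tilted, theta)

print("prior mean:      ", np.trapz(theta * prior, theta))
print("'posterior' mean:", np.trapz(theta * tilted, theta))   # noticeably larger, despite no data on theta
```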
Paradox and explanation
The prior was constructed to have marginal p(θ)
The data contain no information on θ
But we have shown that the posterior marginal behaves like ∝ θ² p(θ)
(easy to falsely conclude that the change is due to the data)
PARADOX
The problem is that marginal distributions are not well defined for improper priors
i.e. ∫ p(α, δ, θ) dα dδ is not a pdf (it does not integrate to 1)
Attempting to normalise gives ∞/∞ problems
The prior didn't really have the claimed marginal
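To spell the point out (my own addition): for every θ the would-be marginal is infinite, so any attempt to normalise it is an indeterminate ∞/∞ operation.

\[
  \int\!\!\int p(\alpha, \delta, \theta)\, d\alpha\, d\delta
  = p(\theta) \int\!\!\int I[\,0 < \delta < \alpha\,]\, d\alpha\, d\delta
  = p(\theta) \cdot \infty
  \quad \text{for every } \theta
\]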
Practical resolution
Prior aimed to combine ignorance on α, δ with prior
knowledge on θ
In (a, d, θ) reparameterisation, range of (a, d) is finite
Combine p(θ) with a uniform distribution on (a, d), assuming independence
In this parameterisation this does give a proper prior
So the prior and its marginals are well defined (see the concrete version below)
(side issue: is uniform best representation of ignorance?)
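One concrete version of this resolution (my own spelling-out, assuming independence between (a, d) and θ, and a uniform density on the admissible triangle, which has area 1/4):

\[
  p(a, d, \theta) \;=\; 4\, I[\,0 < d < a,\; a + d < 1\,]\; p(\theta),
\]

a proper joint prior: the (a, d) marginal is uniform on the triangle \(\{0 < d < a,\ a + d < 1\}\) (area 1/4, hence the constant 4), and the θ marginal is the stated normal prior p(θ).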
Part III
Marginalisation Paradoxes: theory
Subjective Bayes viewpoint
Priors should represent prior beliefs
Only a proper probability distribution represents beliefs coherently
Therefore don’t use improper priors
(this is the resolution used earlier)
Objective Bayes viewpoint
Conclusions shouldn’t depend on subjective beliefs
(c.f. frequentist analysis)
Instead use objective reference priors
Lots of theory for choosing these
Will often be improper (e.g. the Jeffreys prior)
So marginalisation paradoxes are a real issue
The marginalisation paradox
Well-known Bayesian inference paradox
From Dawid, Stone and Zidek (JRSS B, 1973; a read paper with discussion)
For models with a particular structure. . .
. . . there are two marginalisation approaches to Bayesian
inference
For improper priors, these typically do not agree
Large literature; various resolutions have been claimed but none is universally accepted
Is my example a special case of this?
Part IV
Conclusion
Conclusion
Be wary of marginalisation issues for improper priors!
Bibliography
A. P. Dawid, M. Stone, and J. V. Zidek. Marginalization Paradoxes in Bayesian and Structural Inference. JRSS(B), 35:189-233, 1973.
Mark M. Tanaka, Andrew R. Francis, Fabio Luciani, and S. A.
Sisson. Using Approximate Bayesian Computation to Estimate
Tuberculosis Transmission Parameters from Genotype Data.
Genetics, 173:1511–1520, 2006.