Lesson 4: Multiple Testing
Key Learning Goals for this Lesson:
Understand the multiple testing problem
Interpret a histogram of p-values from independent tests
Be familiar with the Bonferroni method
Understand false discovery and non-discovery rates
Be familiar with some FDR estimation methods
An event that is rare when we have only one opportunity to observe it can become quite common when we observe thousands of events. For example, when you roll 2 fair dice, getting double sixes happens only about 1 out of 36 times. But if you roll 3600 times, you expect about 100 rolls with double sixes.
The p-value is the probability of obtaining a result at least as extreme as the observed result if the null
hypothesis is true. Suppose we accept p < 0.05 as "extreme". If we do 10,000 (independent) tests, and
all the null hypotheses are true, we expect about 5% of the tests (i.e. about 500) to have p < 0.05.
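A minimal simulation sketch makes this arithmetic concrete. It relies on the fact (discussed in Section 4.4) that under a true null hypothesis with a continuous test statistic, the p-value is uniform on (0,1), so we can draw the p-values directly; the seed and settings are illustrative.

```python
import numpy as np

rng = np.random.default_rng(555)    # arbitrary seed for reproducibility
m = 10_000                          # independent tests, all null

# Under a true null with a continuous test statistic, the p-value is
# Uniform(0, 1), so we can simulate the p-values directly.
p = rng.uniform(size=m)

print((p < 0.05).sum())             # about 500 "significant" tests by chance alone
```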
This is a huge problem in high throughput analysis, because we are usually doing thousands of tests. We do not want to waste our time following up false positive hypotheses. But if we use conventional p-value cut-offs, this will be inevitable.
This chapter discusses some approaches to correcting our inference methods when we are doing
multiple tests.
4.1 - Mistakes in Statistical Testing
The table below shows the possible correct and incorrect outcomes for a single test:
              Not Significant     Significant
H0 True       correct             Type I error
H0 False      Type II error       correct
A type I error is called a false discovery. A type II error is called a false non-discovery. Generally, false discoveries are considered to be more serious than false non-discoveries, although this is not always the case. Investigators usually follow up on discoveries, so false discoveries can lead to misleading and expensive follow-up studies. But non-discoveries are usually abandoned, so false non-discoveries can lead to missing potentially important results.
In high throughput studies we typically test each of our m features individually, leading to the following
table:
              Not Significant     Significant     Total
H0 True       U                   V               m0
H0 False      T                   S               m − m0
Total         W                   R               m
The total number of errors is T + V.
The false discovery proportion FDP is V / R.
The false discovery rate FDR is the expected value of V / R, given that R ≠ 0.
Similarly, the false non-discovery proportion FNP is T / W.
The false non-discovery rate is the expected value of T / W, given that W ≠ 0.
π0 = m0 / m is the proportion of null tests.
Before 1995, the objective of multiple testing correction was to control Pr(V > 0), the so-called Family-Wise Error Rate (FWER). The problem is that as m0 grows, so does Pr(V > 0) for any fixed cut-off used to declare statistical significance.
4.2 - Controlling Family-wise Error Rate
Pr(V > 0) is called the family-wise error rate or FWER. It is easy to show that if you declare tests
significant for p < α then FWER ≤ min(m0 α, 1).
The most commonly used method which controls FWER at level α is called Bonferroni's method. It rejects the null hypothesis when p < α/m. (It would be better to use m0, but we don't know what it is - more on that later.)
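A quick simulation sketch of this guarantee, with illustrative settings: under the global null, almost every experiment produces at least one p < 0.05, while the Bonferroni cut-off keeps the chance of any false discovery near α.

```python
import numpy as np

rng = np.random.default_rng(555)
m, alpha, n_sim = 1000, 0.05, 2000

# n_sim experiments, each consisting of m independent null tests.
p = rng.uniform(size=(n_sim, m))

# FWER = Pr(at least one false discovery in an experiment).
print(np.mean((p < alpha).any(axis=1)))      # near 1.0 with no correction
print(np.mean((p < alpha / m).any(axis=1)))  # near alpha with Bonferroni
```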
The Bonferroni method is guaranteed to control FWER, but it has a big problem. It greatly reduces your
power to detect real differences. For example, suppose the effect size is 2 and you are doing a t-test,
rejecting for p < 0.05. With 10 observations per group, the power is 99%. Now suppose you have 1000
tests, and use the Bonferroni method. That means that to reject, we need p < 0.00005. The power is
now only 29%. If you have 10 thousand tests (which is small for genomics studies) the power is only
10%.
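The power numbers above can be reproduced approximately from the noncentral t distribution. This is a sketch assuming "effect size 2" means a 2-SD difference in means between two equal-sized groups; it is not the exact calculation used for the text's figures.

```python
import numpy as np
from scipy import stats

def two_sample_t_power(d, n, alpha):
    """Power of a two-sided two-sample t-test when the true difference
    in means is d standard deviations and each group has n observations."""
    df = 2 * n - 2
    ncp = d * np.sqrt(n / 2)                    # noncentrality parameter
    t_crit = stats.t.ppf(1 - alpha / 2, df)
    # P(|T'| > t_crit) under the noncentral t distribution
    return stats.nct.sf(t_crit, df, ncp) + stats.nct.cdf(-t_crit, df, ncp)

for m in (1, 1000, 10_000):
    print(m, round(two_sample_t_power(2, 10, 0.05 / m), 2))
# prints roughly 0.99, 0.29, and 0.10, in line with the text
```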
Sometimes the "Bonferroni-adjusted p-values are reported". They are just:
pb = min(mp, 1) .
Another simple method, more powerful but less popular, uses the sorted p-values:
p(1) ≤ p(2) ≤ ⋯ ≤ p(m)
Holm showed that the FWER is controlled with the following algorithm: compare p(i) with α/(m − i + 1), starting from i = 1, and reject until the first i for which p(i) is greater.
The most significant test must therefore pass the Bonferroni criterion. However, if it is significant, the next most significant is tested at a less stringent level. Heuristically, after rejecting the most significant test, we conclude that m0 ≤ m − 1 and use m − 1 for the next correction, and so on sequentially.
The Holm method is more powerful than the Bonferroni method, but it is still not very powerful. We can also compute "Holm-adjusted p-values" ph(i) = min((m − i + 1)p(i), 1).
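A sketch of the step-down rule and the adjusted p-values as defined here. (Note that standard software such as R's p.adjust additionally enforces monotonicity of the adjusted values with a running maximum; this sketch uses the simpler formula from the text.)

```python
import numpy as np

def holm(p, alpha=0.05):
    """Holm step-down: walk through the sorted p-values, comparing
    p_(i) with alpha / (m - i + 1), and stop at the first failure.
    Also returns the Holm-adjusted p-values from the text."""
    p = np.asarray(p)
    m = len(p)
    order = np.argsort(p)
    reject = np.zeros(m, dtype=bool)
    for rank, idx in enumerate(order):          # rank 0 corresponds to i = 1
        if p[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break
    ranks = np.argsort(order)                   # 0-based rank of each p-value
    adjusted = np.minimum((m - ranks) * p, 1)   # min((m - i + 1) * p_(i), 1)
    return reject, adjusted

print(holm([0.001, 0.01, 0.03, 0.5]))
# rejects the first two: 0.001 <= 0.05/4 and 0.01 <= 0.05/3, but 0.03 > 0.05/2
```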
4.3 - 1995: Two Huge Steps for Biological Inference
In 1995, the first microarray was "spotted" and hybridized, starting the "omics" revolution in biology.
Also in 1995, independently, Benjamini and Hochberg conceived of the idea of False Discovery Rate
or FDR. Their idea was that for large m, we do not expect all of the null hypotheses to be true, and so
we do not want to stringently control Pr(V > 0). Instead, we want to control the expected proportion of our discoveries that are false, given that we make at least one discovery; that is, FDR = E(V/R | R > 0).
Benjamini and Hochberg proved that if q is the target FDR, then rejecting while p(i) ≤ qi/m controls the FDR at level q. [1]
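A minimal sketch of this step-up rule: find the largest i with p(i) ≤ qi/m and reject the i smallest p-values.

```python
import numpy as np

def bh_reject(p, q=0.05):
    """Benjamini-Hochberg step-up: find the largest i with
    p_(i) <= q * i / m and reject the i smallest p-values."""
    p = np.asarray(p)
    m = len(p)
    order = np.argsort(p)
    passes = p[order] <= q * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if passes.any():
        k = np.nonzero(passes)[0].max()   # 0-based index of the largest passing i
        reject[order[:k + 1]] = True
    return reject
```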
The Benjamini and Hochberg method is used extensively in bioinformatics and other "big data"
disciplines. It requires the tests to be independent. This is seldom true in "omics" data, for which our features may be gene expressions or proteins that occur in pathways, inducing correlated behavior. However, in a follow-up paper, Benjamini and Yekutieli showed that the procedure also controls FDR under certain types of positive correlation.
The BH procedure may not work so well for highly correlated data such as SNP frequencies for SNPs
that are densely located. Considerable work has gone into developing FDR controlling procedures for
highly correlated data such as dense SNPs and neuroimaging data. The Benjamini and Yekutieli (BY)
method [2] controls FDR for any correlation structure, but is much less powerful than the BH method.
Although the BH procedure is meant to control FDR, not the FWER, "BH-adjusted p-values" computed
as pBH(i) = min(mp(i)/i, 1) are often used as adjusted p-values.
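A sketch of these adjusted p-values. The running minimum from the largest p-value down is not in the formula above, but it is what standard implementations (for example, R's p.adjust with method "BH") apply so that the adjusted values stay monotone in the p-values.

```python
import numpy as np

def bh_adjust(p):
    """BH-adjusted p-values: m * p_(i) / i capped at 1, then a running
    minimum from the largest p-value down to keep the values monotone."""
    p = np.asarray(p)
    m = len(p)
    order = np.argsort(p)
    raw = np.minimum(m * p[order] / np.arange(1, m + 1), 1)
    monotone = np.minimum.accumulate(raw[::-1])[::-1]
    adjusted = np.empty(m)
    adjusted[order] = monotone        # return to the original feature order
    return adjusted
```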
The BH procedure is more powerful than the Holm procedure.
All of the procedures could be made more powerful because we really only need to adjust for the null
tests. If we only knew m0 we could adjust for it instead of m, giving us larger cut-off values. Fortunately,
it turns out that when we have done many tests, it is fairly easy to estimate m0 . There are many
estimation methods.
FDR controlling or estimation methods that estimate m0 and use it in place of m, are called adaptive
FDR methods.
[1] Benjamini, Yoav, and Yosef Hochberg. "Controlling the false discovery rate: a practical and powerful approach to multiple testing." Journal of the Royal Statistical Society: Series B 57.1 (1995): 289-300. http://www.jstor.org/stable/2346101 [1]
[2] Benjamini, Yoav, and Daniel Yekutieli. "The control of the false discovery rate in multiple testing under dependency." Annals of Statistics 29.4 (2001): 1165-1188. http://projecteuclid.org/download/pdf_1/euclid.aos/1013699998 [2]
4.4 - Estimating m0 (or π0)
The proportion of hypotheses that are truly null is an important parameter in many situations. For example, when comparing normal and diseased tissues we might hypothesize that a greater number of genes change response in the normal tissue than in the diseased tissue when challenged by a toxin.
For multiple testing adjustments, the proportion of null hypotheses among the m tests is important, because we need only adjust for the m0 = π0 m tests which are actually null. The other m − m0 tests cannot contribute any false discoveries.
Estimation of π0 starts with the observation that when the test statistic is continuous, the distribution of p-values for the null tests is uniform on the interval (0,1). This is because for any significance level α, the proportion of tests with p-value less than α is α. By subtraction, we can see that the proportion of tests in any interval inside (0,1) is just the length of the interval.
Below we see a histogram of p-values from 10,000 simulated two-sample t-tests. For each test, both samples have the same mean and SD, so the null hypothesis is true. We can see the distribution is not exactly uniform, but it is pretty close: these are "observed" values and so deviate from the ideal histogram. Below this histogram is a histogram in which the two samples have different means, with the same power to detect the difference in every test. As expected, the p-values are skewed toward small values. The percentage of p-values in this histogram less than α is the power of the test when significance is declared at level α.
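A sketch that regenerates p-values like the ones in these two histograms; the seed, sample sizes, and 2-SD mean shift are illustrative, not the exact settings used for the figures.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(555)
m, n = 10_000, 10

# Null tests: both samples from N(0, 1), so the p-values are roughly uniform.
a = rng.normal(size=(m, n))
b = rng.normal(size=(m, n))
p_null = stats.ttest_ind(a, b, axis=1).pvalue

# Non-null tests: shift one sample's mean by 2 SDs, so every test has the
# same power and the p-values pile up near 0.
b_shift = rng.normal(loc=2, size=(m, n))
p_alt = stats.ttest_ind(a, b_shift, axis=1).pvalue

print(np.histogram(p_null, bins=10, range=(0, 1))[0])  # roughly equal bin counts
print(np.histogram(p_alt, bins=10, range=(0, 1))[0])   # most counts in the first bin
```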
Notice that in the histogram of non-null p-values, there are very few p-values bigger than 0.2 and almost
none bigger than 0.5.
Of course when we actually test many features, some of which have the same means in both samples
and some of which do not, we get a mix of histograms of this type. We should get a histogram
something like the one below. In each bin of the histogram, we do not know which tests come from the
truly null hypotheses, and which from the truly non-null. But we do know that the large p-values come
mostly from the null distribution, and that the histogram should be fairly flat where most of the p-values
come from the null. The red line shows an estimate of what the part of the histogram coming from the
null should look like (although it might be a bit lower than it should be, comparing with the top histogram
above).
There are several estimators of π0 based on this picture. Some try to estimate where the histogram
flattens out. The two simplest are Storey's method and the Pounds and Cheng method.
The Pounds and Cheng [1] method is based on noting that the expected average of the null p-values is 0.5. They then assume that all of the non-null p-values are exactly 0. Then π̂0 = 2 × average(p-value), because we expect the average p-value to be about π0 × 0.5 + (1 − π0) × 0 = π0/2.
Storey's method [2] uses the histogram directly, assuming that all p-values bigger than some cut-off λ come from the null distribution. Then we expect m0(1 − λ) of the tests to have p-value greater than λ. We then count and get some number mλ of p-values in this region, giving us m̂0 = mλ/(1 − λ) or π̂0 = mλ/(m(1 − λ)). Sophisticated implementations of the method estimate a value for λ, but λ = 0.5 works quite well in most cases.
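Both estimators are only a few lines of code; a minimal sketch (capping each estimate at 1, since π0 is a proportion):

```python
import numpy as np

def pi0_pounds_cheng(p):
    """Pounds-Cheng: null p-values average 0.5 and non-null p-values are
    treated as 0, so E(mean p) ~ pi0 / 2 and pi0_hat = 2 * mean(p)."""
    return min(2 * np.mean(p), 1.0)

def pi0_storey(p, lam=0.5):
    """Storey: assume all p-values above lambda are null; a fraction
    (1 - lambda) of null p-values lands there, so
    pi0_hat = #{p > lambda} / (m * (1 - lambda))."""
    p = np.asarray(p)
    return min(np.mean(p > lam) / (1 - lam), 1.0)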
[1] Pounds, Stan, and Cheng Cheng. "Improving false discovery rate estimation." Bioinformatics 20.11 (2004): 1737-1745. http://bioinformatics.oxfordjournals.org/content/20/11/1737.full.pdf [4]
[2] Storey, John D. "A direct approach to false discovery rates." Journal of the Royal Statistical Society: Series B (Statistical Methodology) 64.3 (2002): 479-498. http://www.genomine.org/papers/directfdr.pdf [3]
4.5 - q-Values
Storey's method also leads to a direct estimate of FDP. If we reject at level α, we expect the number of false discoveries to be αm0. So the estimate of FDP is αm̂0/R.
This leads directly to the Storey q-value [1], which is often interpreted as either an FDR-adjusted p-value or FDP(p), where p is any observed p-value in the experiment.
We start by sorting the p-values as we do for the BH or Holm procedures.
Note that if we reject for p ≤ p(i) then the total number of rejections will be at least i (with equality unless two or more of the p-values are equal to p(i)). Let R(α) be the number of rejections when we reject for all p ≤ α. Then define the q-values, working from the largest p-value down:
q(m) = min( p(m) m̂0 / R(p(m)), 1 )
q(i) = min( q(i+1), p(i) m̂0 / R(p(i)) ) for i = m − 1, …, 1
The running minimum keeps the q-values monotone in the p-values, so at any q-value cut-off we reject the most significant features first.
This associates a q-value with each feature, which estimates the FDP if you reject the null hypothesis
for this feature and all features which are this significant or more. Often we pick a cut-off for the q-value
and reject the null hypothesis for all features with q-value less than or equal to our cut-off.
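A sketch of this computation; pi0 would come from one of the estimators in Section 4.4, and setting pi0 = 1 reduces the q-values to BH-adjusted p-values.

```python
import numpy as np

def q_values(p, pi0=1.0):
    """q-values as above: m0_hat * p_(i) / R(p_(i)) for the sorted
    p-values, then a running minimum from the largest p-value down."""
    p = np.asarray(p)
    m = len(p)
    order = np.argsort(p)
    raw = pi0 * m * p[order] / np.arange(1, m + 1)   # m0_hat * p_(i) / i
    q = np.minimum(np.minimum.accumulate(raw[::-1])[::-1], 1)
    out = np.empty(m)
    out[order] = q                                   # back to feature order
    return out
```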
[1] Storey, John D. "The positive false discovery rate: a Bayesian interpretation and the q-value." Annals of Statistics 31.6 (2003): 2013-2035. http://projecteuclid.org/download/pdf_1/euclid.aos/1074290335 [5]
4.6 - Using the Histogram of p-values
We have seen that the histogram of all the p-values of our features plays an important role in estimating
FDR. This suggests that we should plot our p-values.
Below is a set of plots from a microarray experiment with multiple treatments. A test was done for each pair of treatments, leading to a large number of pairs, each giving a p-value for each feature. The p-values for each pair of treatments are shown below. Notice that they all have the expected shape, and so the use of q-values is appropriate.
Sometimes our tests do NOT have the expected distribution. Below is the histogram of p-values for a
set of p-values from an experiment using an antigen microarray. The histogram is very informative - it
tells me that the test statistic does not have the assumed null distribution, and so the computed p-values
are not valid. After looking at the data with the microarray provider, we concluded that some of the
statistical analysis steps were not appropriate. Another statistical analysis pipeline will need to be
developed.
I always look at the histogram of p-values before interpreting the p-values, computing q-values, or estimating π0. There are many reasons that it might not have the ideal shape. If the data are counts (like sequencing data), the histogram has a different characteristic shape, which we will cover when we discuss sequencing data. If the data are intensities, a hump at low p (rather than a peak near p = 0) might indicate correlation among the tests, due to strong association of the features. Another possibility is that the test statistic does not have the assumed null distribution. For example, if a block design was used but the blocks were not accounted for in the statistical analysis, the degrees of freedom of the test statistic will be incorrect. Another possibility is that the data are highly skewed and the sample size is too small for the t-statistic to have a t-distribution.
Source URL: https://onlinecourses.science.psu.edu/stat555/node/56
Links:
[1] http://www.jstor.org/stable/2346101
[2] http://projecteuclid.org/download/pdf_1/euclid.aos/1013699998
[3] http://www.genomine.org/papers/directfdr.pdf
[4] http://bioinformatics.oxfordjournals.org/content/20/11/1737.full.pdf
[5] http://projecteuclid.org/download/pdf_1/euclid.aos/1074290335