0% found this document useful (0 votes)

107 views6 pages

Statistical Test Methods For Hypothesis Testing

Uploaded by

Vasant bhoknal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

107 views6 pages

Statistical Test Methods For Hypothesis Testing

Uploaded by

Vasant bhoknal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 709

ISSN 2229-5518

Comparison of methods for detecting outliers

Manoj K, Senthamarai Kannan K

Abstract - An outlier is an observations which deviates or far away from the rest of data. There are two kinds of outlier methods, tests
discordance and labeling methods. In this paper, we have considered the medical diagnosis data set finding outlier with discordancy test
and comparing the performance of outlier detection. Most of the outlier detection methods considered as extreme value is an outlier. In
some cases of outlier detection methods no need to use statistical table. The suggested outlier detection methods using the context of
detection sensitivity and difficulties of analyzing performance for outlier detections are compared.

Index Terms — Discordance test, Dixon, Generalized ESD, Grubbs, Hampel, Outlier Detection

——————————  ——————————

1 INTRODUCTION The fig-1 represents the three points 81.5, 79.5, and 78.8
are far away from the data set. In the three values are

O utlier is an interesting field of data mining. The

identification of outliers can lead to the unexpected
knowledge discovery in the areas such as credit card
considered as outlier.
Anscombe (1960), have attempted to categorize the
different ways in which outliers may arise. It is relevant to
fraud detection, criminal behaviors detection, computer
consider them in rather more detail. In taking observations,
intrusion detection, calling card fraud detection etc.
different sources of variability can be encountered. We can
Applications such as outlier detection customized distinguish three of these.

IJSER
marketing, network intrusion detection, weather
prediction, pharmaceutical research and exploration in
Inherent variability:
science databases require the detection of outliers.
This is the expression of the way in which observations
intrinsically vary over the population; such variation is a
Barnett and Lewis (1978) defined as in a sample of
natural feature of the population and uncontrollable. Thus,
moderate size taken from a certain population it appears
for example, measurements of heights of men will reflect
that one or two values are surprisingly far away from the
the amount of variability indigenous to that population.
main group. D.M. Hawkins (1980) gives definition to
outlier as: An outlier is an observation, which so much
Measurement error:
deviates from other observations as to arouse suspicions
Often we must take measurements on members of a
that it was generated by a different mechanism. Example,
population under study. Inadequacies in the measuring
dataset from Laurie Davies (1993)
instrument superimpose a further degree of variability on
9.1, 79.5, 26.8, 81.5, 19.1, 15.2, 22.6, 28.8, 24.1, 23.6,
the inherent factor. The rounding of obtaining values, or
18.6, 17.3, 25.8, 78.8, 23.1, 11.9, 20.1, 20.3, 14.1, 26.5
mistakes in recording, compound the measurement error:
outlier detection they are part of it. Some control of this type of variability is
possible.
80

Execution error:
60

A further source of variability arises in the imperfect

collection of our data. We may inadvertently choose a
x

biased sample or include individuals who are not truly

representative of the population we aimed to sample.

Again, sensible precautions may reduce such variability.
20

5 10 15 20
Treatment of Outliers
Index The various outlier methods are using to test and
compared in this paper. Recently, most of people affected
Fig - 1. Scatter plot for outlier detection
by the blood pressure. They have to resort to the hospital to
check their health conditions. The treatments cannot cure in
———————————————— single day. They need every time after consumption of
• K. Manoj, Research Scholar, Department of Statistics drugs, blood pressure is checking by physician. Sometimes
Manonmaniam Sundaranar University, Tamil Nadu, India,
E-mail: manojstatms@gmail.com
measuring the blood pressure referred to false
• K. Senthamarai Kannan, Professor, Department of Statistics measurements. It may be negligence of the physician or the
Manonmaniam Sundaranar University, Tamil Nadu, India, measuring error instrumented. It is not a valid measure of
E-mail: senkannan2002@gmail.com treatment. In this situation using outlier detection method
IJSER © 2013
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 710
ISSN 2229-5518

is very useful to find the right treatment. location of outlying observations that there might be
several ways of approaching the problem, which depended
to a large extent on the object in view. One might, for in
2 RELATED WORK
stance, be primarily interested in pruning the
The previous studies using outliers methods to find the observations in order to secure a more accurate analysis
different methodologies and results. Armin of what was left, example to obtain the most reliable
Bohrer (2008) proposed method for using Dixon’s outlier estimate of a mean. Or one might be particularly
test has been calculated using Monte Carlo simulation one interested in identifying the genuinely exceptional
sided two-sided case critical values are determined. observations, in order to a new insight into the phenomena
Barbato G. et.al (2011) discussed about a several statistical under study. In the first case the criterion of what was best
methods that are currently in use for outlier identification might be the effect on the standard error of estimation, in
and their performance are compared theoretically for the second case the risk of wrongly deciding whether an
typical statistical distributions of experimental data and observation was exceptional or not. The procedures
considering values derived from the distribution of extreme discussed in the following paper start from the basis of
order statistics as reference terms. risks of misclassification rather than of estimation errors.

Grubbs (1969) describes the procedures are given for McMillan (1971) describes performances of three
determining statistically whether the highest observation, procedures for treatments of outliers in normal samples are
the highest and lowest observations, the two highest evaluated. The first procedure is the continuous application
observations, the lowest observations, or more of the of the usual maximum residual test. The largest value is an
observation in the sample are statistical outliers. outlier if the largest studentized residual exceeds a already
Khrominski (2010) using various methods of outlier determined value. If one outlier is detected, the test is
detection in medical diagnoses. They discussed repeated on the remaining observations, in the process to

IJSER
investigated the usefulness of selected outlier detection continue until no further outliers are detected. The second
methods in the context of detection speed and performance procedure is two largest observations are declared to
analysis and the difficulty of automating the performance outliers if the sum of two largest studentized residuals
analysis by using the test methods for outlier detection. exceeds a predetermined value. In the third procedure of
the two largest values are considered outliers if the ratio of
Thomas et. al., (1988) describes the outlier test procedure the corrected sum of squares omitting these values to the
was found to influence the interlabaratory standard total corrected sum of squares is less than a critical ratio.
deviations (SDs), but not the averages. It was shown that The procedure performances are evaluated for samples in
even small number of differences in the numbers of outliers which two of the values have means different from the
detected can change the SD severely. Comparing the common mean of the remainder of the sample.
outliers test procedures for Hampel, Grubbs and Graf-
Henning, it was found that Hampel test detected the most Tietjen and Moore (1972) are described problems of
outliers. Tietjen (1973) proposed a procedure of repeated application and "masking". They suggested as
studentizing or standardizing the residuals by dividing appropriate to over-come these problems are two new
them by their estimated standard deviations is proposed statistics: L k which is based on the k largest (observed)
for testing for outliers in simple linear regression. values and E k which is based on the k largest (in absolute
value) residuals. Jacqueline and Hawkins (1981) proposed
Paul and Fung (1991) are concerned with describes the method for accurate bounds a represented for the fractiles
procedures for detecting multiple y outliers in the linear of the maximum normed residual (which is often used to
regression. The generalized extreme studentized residual test for a single outlier) for two way and three way layouts
(GESR) procedure, controls which type I error rate, is and its shown that the second Bonferroni bound of the
developed and approximate formula to calculate the critical value is an excellent approximation of the critical
percentile is given for large samples and more accurate value being much more accurate the first B on ferroni upper
percentiles for n ≤ 25 are tabulated. The procedure bound. The third Bonferroni (upper) bound is expensive to
performance is compared with others by Monte Carlo compute and agrees with the second bound to at least four
techniques and found to be superior. However, the decimal places for all factor combinations considered.
procedure fails in detecting y outliers that are on high-
leverage cases. They suggest a two-phase procedure. The Laurie Davies and Ursula Gather (1993) approach to
phase- 1 a set of suspect observations is identified by GESR identifying outliers is to assume that the outliers have a
and one of the diagnostics applied sequentially and different distribution from the remaining observations.
phase- 2 a backward testing is conducted using the GESR They define outliers in terms of their position relative to the
procedure to see which of the suspect cases are outliers. model for the good observations. The identification outlier
They analyzed several examples in this paper. problem is then the problem of identifying those
observations that lie in a so-called outlier region. A more
Quesenberry (1961) discussed on the rejection and detailed analysis shows that methods based on robust
IJSER © 2013
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 711
ISSN 2229-5518

statistics perform better with respect to worst-case 3.2 Quartile Method

behavior. They given a concrete outlier identifier based on a Quartile method is no need to use in statistical tables. To
suggestion of Hampel. find the outlier using the quartile method it is necessary to
carry out the following steps:
Step: 1
Calculate the upper quartile: Q3 – 75% of the data in
Rosner (1975) proposed with "many outlier" procedures the data set are lower than this.
that can detect more than one outlier in a sample. various Step: 2
many outlier procedures are proposed via power, Calculate the lower quartile: Q1 – 25% of the data in
comparisons in Section 3 are found to be much superior to the data set are higher than this.
one-outlier procedures in detecting many outliers. They Step: 3
compare several different many outlier procedures find Calculate the gap between the quartiles: H=Q3 – Q1
that the procedure based on the extreme studentized A value lower than Q1 – 1.5.H and higher than Q3+1. 5. H
deviate (ESD) is slightly the best. Finally, 5%, 1% and .5% is considered to be a mild outlier. A value lower than Q1-3.H
points are given for the ESD procedure for various sample and higher than Q3+3.H is considered to be an extreme
sizes. outlier.

3 METHODOLOGY 3.3 Dixon’s Test

In this section discussed about some formal tests using The test developed by Dixon (1950) and used to the test
outlier detection. The described method consists of the is appropriate for small sample size. The test has some
information about way of counting the outlier values for limitations to n ≤ 30, were later on extended to n ≤ 40
the tests. The method testing with a formula necessary to (UNI 9225: 1988). The test first step for organizing the data
in an ascending order, and then the next step is to count

IJSER
find the outliers in the data set. In these methods final test
description discussed some conditions under which a parameter R.
decision whether checking data is an outlier or not is made. The test has various test statistics. Suppose for testing
large set of element to be an outlier, the sample arranged in
There are two kinds of outlier methods, Formal Method ascending order X1 ≤ X2 ≤. … ≤ Xn Implying that the large
and Informal Method. It is usually called, ‘Tests of sample element is given by Xn. Dixon proposed the
Discordance’ and ‘Labeling Methods’ respectively. A following test statistics defined as
detection test procedure must need to a statistical test, 𝑥𝑛 − 𝑥𝑛−1
termed here a test of discordance. They are usually based 𝑅10 = , 𝑓𝑓𝑓 3 ≤ 𝑛 ≤ 7
on assuming some well-behaving distribution, and test if 𝑥𝑛 − 𝑥1
𝑥𝑛 − 𝑥𝑛−1
the target of extreme value point is an outlier in the 𝑅11 = , 𝑓𝑓𝑓 8 ≤ 𝑛 ≤ 10
distribution. 𝑥𝑛 − 𝑥2
𝑥𝑛 − 𝑥𝑛−2
𝑅21 = , 𝑓𝑓𝑓 11 ≤ 𝑛 ≤ 13
3.1 Grubbs Test 𝑥𝑛 − 𝑥2
𝑥𝑛 − 𝑥𝑛−2
Grubbs (1969) used to detect a single outlier in a 𝑅22 = , 𝑓𝑓𝑓 14 ≤ 𝑛 ≤ 30
𝑥𝑛 − 𝑥3
univariate data set. The data set that follows an
For testing the smallest sample element to be an outlier,
approximately normal distribution. Grubbs' test is defined as
the sample is ordered in descending order implying that
the following two hypotheses:
the smallest sample element is labeled 𝑋𝑛 . All the selection
H0: There is no outlier in the data set
of the test statistics depends on the Dixon’s criteria.
H1: There is at least single outlier in the data set
The general formula for Grubbs' test statistic is defined as:
The variable 𝑋𝑛 is marked as an outlier, when the
max Yi − Y corresponding statistic 𝑅(𝑛) exceeds a critical value, which
G= depends on the selected significance level 𝛼.
s
Where 𝑦𝑖 is the element of the data set, Y and s
denoting the sample mean and standard deviation and the The calculated value of the parameter R is compared
test statistic is the largest absolute deviation from the with the Dixon’s test critical value for choosing statistical
sample mean in units of the sample standard deviation. The significance. When the calculated value of parameter R is
calculated value of parameter G is compared with the bigger than the critical value then it is possible to accept
critical value for Grubb’s test. When the calculated value data from the data set as an outlier.
higher or lower than the critical value of choosing statistical
3.4 Hampel Method
significance, then the calculated value can be accepted as
and outlier. The statistical significance (𝛼) describes the To calculate Hampel’s test statistical tables are not
maximum mistake level which a person searching for necessary. Theoretically, this method is resistant, which
outlier can accept. means that it is not sensitive to outliers, it also has no
restrictions as to the abundance of the data set.
IJSER © 2013
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 712
ISSN 2229-5518

𝛼
Hampel’s test performs the steps for data sets are as 𝑝 =1−
2(𝑛−𝑖+1)
follows:
i. Compute the median (Me) for the total data set. The
Number of outliers is determined by finding the largest I
median is described as the numeric value and
such that I > λi. Simulation studies by Rosner (1983) indicate
separating the higher half of a data set from the lower
that this critical value approximation is very accurate
half.
for 𝑛 ≥ 25. It is used to test with higher number of outliers
ii. Compute the value of the deviation 𝑟𝑖 from the
than expected when testing for outliers among data coming
median value; this calculation should be done for all
from a normal distribution.
elements from the data set:
𝒓𝒊 = (𝒙𝒊 − 𝑴𝑴)
where, 𝑥 − simple data from the data set,
𝑖 − belongs to the set for 1 to n. 4 Results and Discussion
𝑛 − number of all element of the set Normal Probability Plot of Blood Pr

𝑀𝑀 − median
iii. Calculate the median for deviation 𝑀𝑀|𝑟𝑖|

80
iv. Check the conditions: |𝑟𝑖 | ≥ 4.5𝑀𝑀|𝑟𝑖 |
If the condition is executed, then the value from the data

Sample Quantiles
set can be accepted as an outlier.

60
3.5 Generalized ESD Test for Outliers

40
Rosner (1983) used in the generalized (extreme
Studentized deviate) ESD test to detect one or more outliers
in a univariate data set that follows an approximately normal

IJSER
20
distribution.

The generalized ESD test (Rosner 1983) only requires that -2 -1 0 1 2

an upper bound for the suspected number of outliers be Theoretical Quantiles

specified.
Fig - 2. Normal probability plot for outlier detection
Given the upper bound, r, the generalized ESD test
In this experiment, we use blood pressure reduction in
essentially performs r separate tests: a test for single outlier, a
after taking the drug reading data. The data were collected
test for two outliers, and so on up to r outliers. The
from Tirunelveli Government health center. For the test
generalized ESD test is defined for the hypothesis:
purpose we take only 30 samples from the data set.
H 0 : There is no outlier found in the data set
The normal probability plot fig. 1 representing the data
H a : There are up to r outliers in the data set
with outlier value deviates from the original data. The plot
Test Statistic: Compute
indicates the outliers point far away from samples. The fig. 2
max𝑖 |𝑥𝑖 −𝑥̅ | shows that the outlier values removed by using outlier
𝑅𝑖 = detection methods and it follow as a normally distributed.
𝑠
Normal Probability Plot of Blood Pr

Remove the observation that maximizes |𝑥𝑖 − 𝑥̅ | and then

compute the above statistic with 𝑛 − 1 observations. Repeat
and continues the process until r observations have been
20

removed. Then the results in r test statistics R 1 , R 2 , ..., R r .

Sample Quantiles

Significance Level: 𝛼
15

Critical Region: Corresponding test statistics r to calculate

the following r critical values
10

(𝑛−𝑖)𝑡𝑝,𝑛−𝑖−1
𝜆𝑖 =
2 )(𝑛−𝑖+1)
�(𝑛−𝑖−1+𝑡𝑝,𝑛−𝑖−1
5

where 𝑖 = 1, 2, … . , 𝑟, 𝑡𝑝,𝑣 is the 100 p percentage point from -2 -1 0 1 2

the t distribution with ν degrees of freedom and

Theoretical Quantiles

Fig - 3. Outlier removing after Normal probability plot

The various discordancy methods are used in the
IJSER © 2013
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 713
ISSN 2229-5518

experiment to detect the outliers. Table-1 represents the total work.

number of outliers detected by the experiments. In these
experiments Grubb’s and Dixon tests have given the same References
results in repeated experiments. Three other methods such as 1. Anscombe F. J and Irwin Guttman (1960). Rejection
Hampel, Quartile and Generalized ESD test are same results of outliers. Technometrics, Vol. 2, No. 2, pp. 123-
in the experiments. The first two methods are detected 3 147.
outliers for each. But in case the two numbers of outliers only 2. Armin Bohrer (2008). One-sided and Two sided
strongly detected. Remaining one outlier is the small number Critical Values for Dixon’s Outlier Test for Sample
of the observation. The two tests only needed for repeated Sizes up to n=30, Economic Quality Control, Vol. 23,
experiments after detecting outliers. No. 1, 5-13.
3. Barbato, G., Barini, E. M., Genta, G., & Levi, R.
Table - 1 Total number of outlier detection in the blood (2011). Features and performance of some outlier
pressure after taking drug-reading data detection methods, Journal of Applied Statistics,
38:10, 2133-2149.
Sample Size N=30 4. Barnett V. and Lewis T. (1978). Outliers in statistical
Number of Outlier data. John Wiley & Sons.
Sig-α Detected 5. Chrominski Kornel, Magdalena TKACZ (2010).
Outlier Tests Outliers Comparison of outlier detection methods in
(Two-tailed test) Test
With removing Total biomedical data, Journal of Medical Informatics &
outlie after test Technologies Vol. 16, ISSN 1642-6037.
rs 1st 2nd 6. Dixon, W.J. (1950). Analysis of extreme values. Ann.
Math. Stat. 21, 4, 488-506.
Grubbs Test 1 1 1 3 7. Grubbs F. E. (1969), Procedures for detecting

IJSER
Dixon Test Critical 1 1 1 3 outlying Observations in Samples. American
value Statistical Association and American Society for
Hampel 2 0 0 2
0.5% Quality. Technometrics, Vol. 11. No. pp. 1-21.
Quartile Method 2 0 0 2 8. Hawkins D. M. (1980), Identification of Outliers,
Generalized ESD 2 0 0 2 Chapman & Hall, London.
9. Jacqueline S. Galpin and Douglas M. Hawkins
The other three outlier methods strongly detect outliers in (1981). Rejection of a Single Outlier in Two- or
a single experiment. The major outlier is finding easy and Three-Way Layouts, Technometrics, Vol. 23, No. 1,
quick in the experiments. In these experiments no need pp. 65-70.
critical value for Hampel and Quartile methods and other 10. Laurie Davies and Ursula Gather (1993).The
tests must needed for critical value to detect the outliers. Identification of Multiple Outliers. Journal of the
American Statistical Association, Vol. 88, No. 423,
The R software tested the experimental purpose of the pp. 782- 792.
tested methods used for R scripts. Lukasz Komsta (2006) is 11. Lukasz Komsta (2006). Processing data for outlier: R
used for example for the R codes for Dixon, Generalized ESD News, Vol 6/2.
Test and Grubb’s tests. 12. McMillan R. G. (1971). Tests for One or Two
Outliers in Normal Samples with Unknown
5 Conclusions Variance, Technometrics, Vol. 13, No. 1, pp. 87-100.
The table-1 describes that outlier values detected by the 13. Paul S. R, and Karen Y. Fung(1991). A Generalized
five-outlier detection methods. Grubbs and Dixon test had Extreme Studentized Residual Multiple-Outlier-
low sensitivity for outlier detection in the experiment (every Detection Procedure in Linear Regression,
test detected single outlier and find only minimum or Technometrics, Vol. 33, No. 3, pp. 339-348.
maximum value). The other three methods can find single 14. Quesenberry C. P. and David H. A. (1961). Some
experiment to identify the maximum outliers. The methods Tests for Outliers, Biometrika, Vol. 48, No. 3/4, pp.
Hampel, Quartile and Generalized ESD test can find easy 379-390.
and average detection levels are equal to find the maximum 15. Rorabacher, D.B. (1991). Statistical Treatment for
outliers. The result reveals that the three methods (Hampel, Rejection of Deviant Values: Critical Values of
Quartile and Generalized ESD) are much better than Grubbs Dixon Q Parameter and Related Subrange Ratios at
and Dixon test. the 95 percent Confidence Level. Anal. Chem. 83, 2,
139-146.
Acknowledgement 16. Rosner Bernard(1975), On the Detection of many
The first author acknowledges the UGC for outliers, Technometrics, Vol. 17, No. 2 (May, 1975),
awarding the Scheme of Rajiv Gandhi National Fellowship pp. 221-227.
(RGNF) for providing financial support to carry out this 17. Rosner, Bernard (1983), Percentage Points for a
Generalized ESD Many-Outlier Procedure,
IJSER © 2013
http://www.ijser.org
International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 714
ISSN 2229-5518

Technometrics, 25(2), pp. 165-172.

18. Tietjen G. L., Moore R. H., Beckman R. J. (1973).
Testing for a Single Outlier in Simple Linear
Regression. Technometrics, Vol. 15, No. 4, pp. 717-
721.
19. Tietjen Gary L. and Moore Roger H. (1972). Some
Grubbs-Type Statistics for the Detection of Several
Outliers, Technometrics, Vol. 14, No. 3, pp. 583-597.
20. UNI 9225(1988), Precision of Test Methods:
Determination of Repeatability and Reproductivity
by Inter-labaratory Tests, Ente Nazionable Italiano
di Unificazione, Milano.

IJSER

Lesson 7. Standard Deviation
No ratings yet
Lesson 7. Standard Deviation
31 pages
Data Cleaning for Survey Analysts
No ratings yet
Data Cleaning for Survey Analysts
66 pages
Unit 2 Part 1
100% (1)
Unit 2 Part 1
44 pages
X-Bar and R Charts: NCSS Statistical Software
No ratings yet
X-Bar and R Charts: NCSS Statistical Software
26 pages
Control Chart Presentation
100% (2)
Control Chart Presentation
17 pages
Quality Presentation
No ratings yet
Quality Presentation
56 pages
Basic Statistical Process Control
No ratings yet
Basic Statistical Process Control
30 pages
MST326 3 Problem
100% (1)
MST326 3 Problem
37 pages
Seven QC Tools Tool #5: Part 1-Run Chart
No ratings yet
Seven QC Tools Tool #5: Part 1-Run Chart
6 pages
Tests For One Poisson Mean
No ratings yet
Tests For One Poisson Mean
9 pages
Chapter 11 The Seven Quality Control Tools and Intoduction To Statistics
100% (1)
Chapter 11 The Seven Quality Control Tools and Intoduction To Statistics
14 pages
Lesson - 5.1 - Design of Experiments - Improve - Phase
100% (1)
Lesson - 5.1 - Design of Experiments - Improve - Phase
39 pages
Process Capability Insights
No ratings yet
Process Capability Insights
18 pages
Body of Quality Knowledge PDF
100% (1)
Body of Quality Knowledge PDF
5 pages
Lasting Improvements in Manufacturing Performance, in Search of A New Theory
100% (1)
Lasting Improvements in Manufacturing Performance, in Search of A New Theory
40 pages
STA4C04 - Statistical Inference and Quality Control
No ratings yet
STA4C04 - Statistical Inference and Quality Control
170 pages
Lot Number Explanation
100% (1)
Lot Number Explanation
2 pages
Six Sigma Problem-Solving Models
100% (1)
Six Sigma Problem-Solving Models
36 pages
Seven QC Tools Tool #7: Stratification: Lesson Structure
100% (1)
Seven QC Tools Tool #7: Stratification: Lesson Structure
2 pages
Process Capability (Asi Rev5)
No ratings yet
Process Capability (Asi Rev5)
84 pages
Audit Results Summary SQI Rev 0
100% (1)
Audit Results Summary SQI Rev 0
3 pages
Measurement System
No ratings yet
Measurement System
35 pages
8 Control Plan - 23966531 - P03
No ratings yet
8 Control Plan - 23966531 - P03
5 pages
18MEO113T - DOE - Unit 1 - AY2023-24 ODD
No ratings yet
18MEO113T - DOE - Unit 1 - AY2023-24 ODD
120 pages
Mahasneh JK D 2016 PDF
100% (1)
Mahasneh JK D 2016 PDF
212 pages
Cause-And-Effect Diagram: Why Implement Cost of Quality (COQ) ?
No ratings yet
Cause-And-Effect Diagram: Why Implement Cost of Quality (COQ) ?
3 pages
Quality at Source
No ratings yet
Quality at Source
41 pages
Quality Concepts: What Is Quality, Difference Between Qualitycontrol and Quality Assurance, Role of TQM/BE Deptt in NTPC, Cost
No ratings yet
Quality Concepts: What Is Quality, Difference Between Qualitycontrol and Quality Assurance, Role of TQM/BE Deptt in NTPC, Cost
5 pages
Statistical Process Control: Purpose
No ratings yet
Statistical Process Control: Purpose
42 pages
Module 4
100% (1)
Module 4
13 pages
QMS 6 Sigma Benchmarking
No ratings yet
QMS 6 Sigma Benchmarking
56 pages
Define Phase: Six Sigma
No ratings yet
Define Phase: Six Sigma
52 pages
Deming Juran Crosby
No ratings yet
Deming Juran Crosby
15 pages
Article - Mitigate The Risk
No ratings yet
Article - Mitigate The Risk
1 page
Six Sigma (Green Belt
0% (1)
Six Sigma (Green Belt
18 pages
Process Capability
No ratings yet
Process Capability
4 pages
TQM - 601 Module 12 - Quality Method PDCA and PDSA
100% (2)
TQM - 601 Module 12 - Quality Method PDCA and PDSA
17 pages
Control Charts: A Guide to SPC and Quality Monitoring
100% (2)
Control Charts: A Guide to SPC and Quality Monitoring
17 pages
7-QC Tools: in Search of Excellence
No ratings yet
7-QC Tools: in Search of Excellence
106 pages
Statistical Process Control (SPC) : Praful Mehta
No ratings yet
Statistical Process Control (SPC) : Praful Mehta
66 pages
Quality Management Systems
No ratings yet
Quality Management Systems
26 pages
Six Sigma Control PDF
100% (1)
Six Sigma Control PDF
74 pages
Joseph: The Juran Trilogy
100% (1)
Joseph: The Juran Trilogy
2 pages
Quality Management & Six Sigma Guide
No ratings yet
Quality Management & Six Sigma Guide
62 pages
Systems and Quality Day2
100% (1)
Systems and Quality Day2
86 pages
Variable and Types of Statistical Variables
100% (1)
Variable and Types of Statistical Variables
9 pages
QRQC Problem Solving Techniques
No ratings yet
QRQC Problem Solving Techniques
1 page
Quality Function Deployment
No ratings yet
Quality Function Deployment
8 pages
Problem Solutions Methods PPT 8D 5W 1H QC Story
No ratings yet
Problem Solutions Methods PPT 8D 5W 1H QC Story
15 pages
ASQ Presentation On Process Capability Final 02 Feb 2016
No ratings yet
ASQ Presentation On Process Capability Final 02 Feb 2016
80 pages
Meet Mini Tab 14
No ratings yet
Meet Mini Tab 14
138 pages
TQM Syllabus.
No ratings yet
TQM Syllabus.
3 pages
CH 02
No ratings yet
CH 02
54 pages
Cost of Quality in Manufacturing
No ratings yet
Cost of Quality in Manufacturing
58 pages
SOP - Root Cause Analysis Draft
100% (1)
SOP - Root Cause Analysis Draft
1 page
ISAT 600 Progress Report 3
No ratings yet
ISAT 600 Progress Report 3
4 pages
On Normalization and Algorithm Selection For Unsupervised Outlier Detection
No ratings yet
On Normalization and Algorithm Selection For Unsupervised Outlier Detection
34 pages
Outlier Detection Techniques
No ratings yet
Outlier Detection Techniques
28 pages
Handling Outliers
No ratings yet
Handling Outliers
6 pages
Anomaly Detection and Outlier Analysis
No ratings yet
Anomaly Detection and Outlier Analysis
25 pages
Types of Causes in SPC
No ratings yet
Types of Causes in SPC
23 pages
Quality Control Defect Analysis
No ratings yet
Quality Control Defect Analysis
3 pages
Zero Defect Strategy - Case Study
No ratings yet
Zero Defect Strategy - Case Study
5 pages
QC Planning
No ratings yet
QC Planning
34 pages
Zero Defect Method-1
No ratings yet
Zero Defect Method-1
9 pages
Shainin Study
100% (1)
Shainin Study
15 pages
SPC Tutorial SIA
No ratings yet
SPC Tutorial SIA
88 pages
Understanding Control Plans
No ratings yet
Understanding Control Plans
5 pages
SPCcalculator
No ratings yet
SPCcalculator
16 pages
Just In Time (JIT) Inventory Guide
No ratings yet
Just In Time (JIT) Inventory Guide
22 pages
ANSI B92!1!1996 Involute Splines and Inspection
75% (4)
ANSI B92!1!1996 Involute Splines and Inspection
163 pages
Process For CIP
No ratings yet
Process For CIP
1 page
Process For Alternative Process Control
No ratings yet
Process For Alternative Process Control
1 page
The Role of Variation, Error, and Complexity in Manufacturing Defects"
No ratings yet
The Role of Variation, Error, and Complexity in Manufacturing Defects"
28 pages
Fulltext01 1
No ratings yet
Fulltext01 1
124 pages
Din 50902 1994 en
100% (2)
Din 50902 1994 en
8 pages
Simplifying Six Sigma Methodology Using Shainin D.O.E
No ratings yet
Simplifying Six Sigma Methodology Using Shainin D.O.E
6 pages
CP, CPK, PP, and PPK - A Guide To Process Capability and Capability Indices
No ratings yet
CP, CPK, PP, and PPK - A Guide To Process Capability and Capability Indices
13 pages
Lua Programming Quick Reference
No ratings yet
Lua Programming Quick Reference
6 pages
Routers Claves
No ratings yet
Routers Claves
2 pages
Get Response
No ratings yet
Get Response
15 pages
Project Cost Management Template
100% (3)
Project Cost Management Template
8 pages
Roxtec Transit Designer™: Online Tool For Easy Design of Cable and Pipe Transits
No ratings yet
Roxtec Transit Designer™: Online Tool For Easy Design of Cable and Pipe Transits
2 pages
MBA Syllabus 2019-21 PDF
100% (1)
MBA Syllabus 2019-21 PDF
266 pages
67067bos54070 cp12
No ratings yet
67067bos54070 cp12
21 pages
ScrumMaster Training Book
100% (14)
ScrumMaster Training Book
125 pages
NRC 2018 Rules & Regulations Football
No ratings yet
NRC 2018 Rules & Regulations Football
14 pages
Cbok 2006
No ratings yet
Cbok 2006
20 pages
Iso 26866
No ratings yet
Iso 26866
20 pages
Account 421
No ratings yet
Account 421
537 pages
Serial Number Robot
No ratings yet
Serial Number Robot
3 pages
Television Téléviseur Televisor Digital A Color Con Pantalla de Cristal Líquido
No ratings yet
Television Téléviseur Televisor Digital A Color Con Pantalla de Cristal Líquido
48 pages
Project Design Brief (G2)
No ratings yet
Project Design Brief (G2)
1 page
Ball - Animation Slides
No ratings yet
Ball - Animation Slides
36 pages
Indian Mobile Brands & Ambassadors
No ratings yet
Indian Mobile Brands & Ambassadors
9 pages
Updated Synopsis Format
No ratings yet
Updated Synopsis Format
6 pages
Daftar Harga: MPPI-62537 MNYW-50151 MNBL-63504 MNBL-63506
No ratings yet
Daftar Harga: MPPI-62537 MNYW-50151 MNBL-63504 MNBL-63506
4 pages
Duration:: Internship Report From Wolkite University Ict Center Olkite
No ratings yet
Duration:: Internship Report From Wolkite University Ict Center Olkite
50 pages
Manual SIMOTION Web Accumulator V3.0.0
No ratings yet
Manual SIMOTION Web Accumulator V3.0.0
59 pages
Report On Martingale Theory
No ratings yet
Report On Martingale Theory
13 pages
20 Multiple Choice Questions and Answers For Personal Communication Systems - RAMON (Revised)
No ratings yet
20 Multiple Choice Questions and Answers For Personal Communication Systems - RAMON (Revised)
5 pages
A Step-By-step HTML Tutorial (Basic
No ratings yet
A Step-By-step HTML Tutorial (Basic
77 pages
Kasun CV
No ratings yet
Kasun CV
3 pages
Icet Inst English
No ratings yet
Icet Inst English
8 pages
Client - Centric Consistency Models
No ratings yet
Client - Centric Consistency Models
9 pages
2019 - Artigo - Static Structural Analysis of Pratt, Flink and Howe Steel Truss Using Ansys Software
No ratings yet
2019 - Artigo - Static Structural Analysis of Pratt, Flink and Howe Steel Truss Using Ansys Software
8 pages
Daniel K. Schneider
No ratings yet
Daniel K. Schneider
363 pages
Global CPA/CPI Offers Overview
No ratings yet
Global CPA/CPI Offers Overview
12 pages

Statistical Test Methods For Hypothesis Testing

Uploaded by

Statistical Test Methods For Hypothesis Testing

Uploaded by

International Journal of Scientific & Engineering Research, Volume 4, Issue 9, September-2013 709

Comparison of methods for detecting outliers

O utlier is an interesting field of data mining. The

A further source of variability arises in the imperfect

biased sample or include individuals who are not truly

representative of the population we aimed to sample.

statistics perform better with respect to worst-case 3.2 Quartile Method

3 METHODOLOGY 3.3 Dixon’s Test

The generalized ESD test (Rosner 1983) only requires that -2 -1 0 1 2

Remove the observation that maximizes |𝑥𝑖 − 𝑥̅ | and then

removed. Then the results in r test statistics R 1 , R 2 , ..., R r .

Critical Region: Corresponding test statistics r to calculate

where 𝑖 = 1, 2, … . , 𝑟, 𝑡𝑝,𝑣 is the 100 p percentage point from -2 -1 0 1 2

the t distribution with ν degrees of freedom and

Fig - 3. Outlier removing after Normal probability plot

experiment to detect the outliers. Table-1 represents the total work.

Technometrics, 25(2), pp. 165-172.

You might also like