Design of Experiments
Design of Experiments
Design of Experiments
refers to the random order in which the runs of the experiment are to be performed.
In this way, the conditions in one run neither depend on the conditions of the previous
run nor predict the conditions in the subsequent runs. Blocking aims at isolating a
known systematic bias effect and prevent it from obscuring the main effects [5]. This
is achieved by arranging the experiments in groups that are similar to one another.
In this way, the sources of variability are reduced and the precision is improved.
Attention to the statistical issue is generally unnecessary when using numerical
simulations in place of experiments, unless it is intended as a way of assessing the
influence the noise factors will have in operation, as it is done in MORDO analysis.
Due to the close link between statistics and DOE, it is quite common to find in
literature terms like statistical experimental design, or statistical DOE. However,
since the aim of this chapter is to present some DOE techniques as a mean for
collecting data to be used in RSM, we will not enter too deeply in the statistics
which lies underneath the topic, since this would require a huge amount of work to
be discussed.
Statistical experimental design, together with the basic ideas underlying DOE,
was born in the 1920s from the work of Sir Ronald Aylmer Fisher [6]. Fisher was the
statistician who created the foundations for modern statistical science. The second era
for statistical experimental design began in 1951 with the work of Box and Wilson [7]
who applied the idea to industrial experiments and developed the RSM. The work
of Genichi Taguchi in the 1980s [8], despite having been very controversial, had a
significant impact in making statistical experimental design popular and stressed the
importance it can have in terms of quality improvement.
In order to perform a DOE it is necessary to define the problem and choose the
variables, which are called factors or parameters by the experimental designer.
A design space, or region of interest, must be defined, that is, a range of variability
must be set for each variable. The number of values the variables can assume in
DOE is restricted and generally small. Therefore, we can deal either with qualitative
discrete variables, or quantitative discrete variables. Quantitative continuous vari-
ables are discretized within their range. At first there is no knowledge on the solution
space, and it may happen that the region of interest excludes the optimum design. If
this is compatible with design requirements, the region of interest can be adjusted
later on, as soon as the wrongness of the choice is perceived. The DOE technique
and the number of levels are to be selected according to the number of experiments
which can be afforded. By the term levels we mean the number of different values a
variable can assume according to its discretization. The number of levels usually is
the same for all variables, however some DOE techniques allow the differentiation
of the number of levels for each variable. In experimental design, the objective func-
tion and the set of the experiments to be performed are called response variable and
sample space respectively.
2.3 DOE Techniques 15
In this section some DOE techniques are presented and discussed. The list of the
techniques considered is far from being complete since the aim of the section is just
to introduce the reader into the topic showing the main techniques which are used in
practice.
Using a RCBD, the sample size grows very quickly with the number of factors.
Latin square experimental design is based on the same idea as the RCBD but it
aims at reducing the number of samples required without confounding too much the
importance of the primary factor. The basic idea is not to perform a RCBD but rather
a single experiment in each block.
Latin square design requires some conditions to be respected by the problem for
being applicable, namely: k = 3, X 1 and X 2 nuisance factors, X 3 primary factor,
L 1 = L 2 = L 3 = L. The sample size of the method is N = L 2 .
16 2 Design of Experiments
(a) (b)
Fig. 2.1 Examples of RCBD experimental design
For representing the samples in a schematic way, the two nuisance factors are
divided into a tabular grid with L rows and L columns. In each cell, a capital latin
letter is written so that each row and each column receive the first L letters of the
alphabet once. The row number and the column number indicate the level of the
nuisance factors, the capital letters the level of the primary factor.
Actually, the idea of Latin square design is applicable for any k > 3, however the
technique is known with different names, in particular:
if k = 3: Latin square,
if k = 4: Graeco-Latin square,
if k = 5: Hyper-Graeco-Latin square.
Although the technique is still applicable, it is not given a particular name for
k > 5. In the Graeco-Latin square or the Hyper-Graeco-Latin square designs, the
2.3 DOE Techniques 17
additional nuisance factors are added as greek letters and other symbols (small letters,
numbers or whatever) to the cells in the table. This is done in respect of the rule that in
each row and in each column the levels of the factors must not be repeated, and to the
additional rule that each factor must follow a different letters/numbers pattern in the
table. The additional rule allows the influence of two variables not to be onfounded
completely with each other. To fulfil this rule, it is not possible a Hyper-Graeco-Latin
square design with L = 3 since there are only two possible letter pattern in a 3 3
table; if k = 5, L must be 4.
The advantage of the Latin square is that the design is able to keep separated
several nuisance factors in a relatively cheap way in terms of sample size. On the
other hand, since the factors are never changed one at a time from sample to sample,
their effect is partially confounded.
For a better understanding of the way this experimental design works, some exam-
ples are given. Let us consider a Latin square design (k = 3) with L = 3, with X 3
primary factor. Actually, for the way this experimental design is built, the choice of
the primary factor does not matter. A possible table pattern and its translation into a
list of samples are shown in Table 2.2. The same design is exemplified graphically
in Fig. 2.2.
Two more examples are given in Table 2.3, which shows a Graeco-Latin square
design with k = 4, L = 5, N = 25, and a Hyper-Graeco-Latin square design with k = 5,
L = 4, N = 16. Designs with k > 5 are formally possible, although they are usually
not discussed in the literature. More design tables are given by Box et al. in [9].
Full factorial is probably the most common and intuitive strategy of experimental
design. In the most simple form, the two-levels full factorial, there are k factors and
L = 2 levels per factor. The samples are given by every possible combination of
the factors values. Therefore, the sample size is N = 2k . Unlike the previous DOE
18 2 Design of Experiments
Table 2.3 Example of Graeco-Latin square and Hyper-Graeco-Latin square experimental design
methods, this method and the following ones do not distinguish anymore between
nuisance and primary factors a priori. The two levels are called high (h) and low
(l) or, +1 and 1. Starting from any sample within the full factorial scheme,
the samples in which the factors are changed one at a time are still part of the sample
space. This property allows for the effect of each factor over the response variable
not to be confounded with the other factors. Sometimes, in literature, it happens to
encounter full factorial designs in which also the central point of the design space
is added to the samples. The central point is the sample in which all the parameters
have a value which is the average between their low and high level and in 2k full
factorial tables can be individuated with m (mean value) or 0.
Let us consider a full factorial design with three factors and two levels per factor
(Table 2.4). The full factorial is an orthogonal experimental design method. The
term orthogonal derives from the fact that the scalar product of the columns of any
two-factors is zero.
We define the main interaction M of a variable X the difference between the
average response variable at the high level samples and the average response at the
2.3 DOE Techniques 19
The idea of the 2k full factorial experimental designs can be easily extended to
the general case where there are more than two factors and each of them have a
different number of levels. The sample size of the adjustablefull factorial design
k
with k factors X 1 , . . . , X k , having L 1 , . . . , L k levels, is N = i=1 Li .
At this point, the careful reader has probably noted that the sample space of the
adjustable full factorial design is equivalent to the one of the RCBD. Therefore, we
could argue that the RCBD is essentially the more general case of a full factorial
design. It is true, however, that in the RCBD the focus is generally on a single variable
(the primary factor), and a particular stress is put on blocking and randomization
techniques. It is not just a problem of sampling somehow a design space since,
in fact, the order of the experiments and the way in which they are performed matter.
20 2 Design of Experiments
L1
L2
L3
L4
yi, j,l,m
i=1 j=1 l=1 m=1
y = . (2.3)
N
In order to compute the main effect of X 1 , we must evaluate the L 1 averages of
the response variables for all the samples where X 1 is fixed to a certain level
L2
L3
L4
L2
L3
L4
y1, j,l,m y L 1 , j,l,m
j=1 l=1 m=1 j=1 l=1 m=1
y X 1 =1 = ... y X 1 =L 1 = . (2.4)
L2 L3 L4 L2 L3 L4
L1
2
MX1 = y X 1 =i y . (2.5)
i=1
L3
L4
L3
L4
y1,1,l,m y L 1 ,L 2 ,l,m
l=1 m=1 l=1 m=1
y X 1 =1,X 2 =1 = ... y X 1 =L 1 ,X 2 =L 2 = .
L3 L4 L3 L4
(2.6)
The X 1 , X 2 interaction effect is
L1
L2
2
M X 1 ,X 2 = y X 1 =i,X 2 = j y MX1 MX2 . (2.7)
i=1 j=1
The advantage of full factorial designs is that they make a very efficient use of
the data and do not confound the effects of the parameters, so that it is possible to
evaluate the main and the interaction effects clearly. On the other hand, the sample
size grows exponentially with the number of parameters and the number of levels.
The family of the L k designs, that is, the full factorial designs where the number
of levels is the same for each factor, is particularly suitable for interpolation by
polynomial response surfaces, since a 2k design can be interpolated with a complete
bilinear form, a 3k design with a complete biquadratic form, a 4k with a complete
2.3 DOE Techniques 21
bicubic, and so on. However, bilinear and biquadratic interpolations are generally
poor for a good response surface to be generated. We refer to the terms bilinear,
biquadratic, and bicubic broadly speaking, since the number of factors is k, not two,
and we should better speak of k-linear, k-quadratic, and k-cubic interpolations.
Figure 2.3 shows graphical representations for the 22 , the 23 and the 33 full fac-
torial designs.
As the number of parameters increases, a full factorial design may become very
onerous to be completed. The idea of the fractional factorial design is to run only a
subset of the full factorial experiments. Doing so, it is still possible to provide quite
good information on the main effects and some information about interaction effects.
The sample size of the fractional factorial can be one-half , or one-quarter, and so on,
of the full factorial one. The fractional factorial samples must be properly chosen, in
particular they have to be balanced and orthogonal. By balanced we mean that the
sample space is made in such a manner so that each factor has the same number of
samples for each of its levels.
Let us consider a one-half fractional factorial of a 2k full factorial design. The
one-half is referred to as 2k1 fractional factorial. Let us assume k = 3. In order to
build the list of the samples, we start with a regular full factorial 2k1 (Table 2.5),
the levels for the additional parameter are chosen as an interaction of some of the
other parameters. In our case, we could add the product X 1 X 2 or X 1 X 2 .
The fractional factorial design in Table 2.5 is said to have generator or word
+ABC because the element-by-element multiplication of the first ( A), the second
(B), and the third (C) column is equal to the identity column I . The main and the
interaction effects are computed as in the previous paragraph. However, the price to
pay, in such an experimental design, is that it is not possible to distinguish between
the main effect of X 3 (C) and the X 1 X 2 (AB) interaction effect. In technical terms
we say that X 3 has been confounded, or aliased with X 1 X 2 . However, this is not the
22 2 Design of Experiments
only confounded term: multiplying the columns suitably, we realize that, if C = AB,
we have AC = A AB = B and BC = B AB = A, that is, every main effect is
confounded with a two-factors interaction effect.
The 231 design with generator I = +ABC (or I = ABC) is a resolution III
31
design. For denoting the design resolution a roman numeral subscript is used (2III ).
A design is said to be of resolution R if no q-factors effect is aliased with another
effect with less than R q factors. This means that:
in a resolution III design the main effects are aliased with at least 2-factors effects,
in a resolution IV design the main effects are aliased with at least 3-factors effects
and the 2-factors effects are aliased with each other,
in a resolution V design the main effects are aliased with at least 4-factors effects
and the 2-factors effects are aliased with at least 3-factors effects.
In general, the definition of a 2k p design requires p words to be given. Considering
all the possible aliases these become 2 p 1 words. The resolution is equal to the
smallest number of letters in any of the 2 p 1 defining words. The 2 p 1 words are
found multiplying the p original words with each other in every possible combination.
The resolution tells how badly the design is confounded. The higher is the resolution
of the method, the better the results are expected to be. It must be considered that
resolution depends on the choice of the defining words, therefore the words must be
chosen accurately in order to reach the highest possible resolution.
Table 2.6 shows an example of a 262 design with the evaluation of its resolution
and the list of the main effect and the two-factors interaction aliases.
The same idea for building fractional factorial designs can be generalized to a
L k p design, or to factorial designs with a different number of levels for each factor.
We start writing down the set of samples for a L k p full factorial design, then the
levels for the remaining p columns are obtained from particular combinations of
the other k p columns. In the same way shown above, it is possible to compute the
aliases and the resolution of the design. Although the concept is the same, things are
a bit more complicated since the formulas giving the last p columns are not defined
on a sort of binary numeral system anymore, but need to be defined according to
different systems with different number of levels.
Figure 2.4 show a few graphical examples of fractional factorial designs. A wide
list of tables for the most common designs can be found in literature [4, 5] .
2.3 DOE Techniques 23
Table 2.6 Example of 262 fractional factorial experimental design and evaluation of the design
resolution
Design 262 Main effect aliases Two-factors interaction aliases
A = BC E = ABC D F = D E F AB = C E = AC D F = B D E F
Defining Words B = AC E = C D F = AB D E F AC = B E = AB D F = C D E F
I = ABC E C = AB E = B D F = AC D E F AD = E F = BC D E = ABC F
I = BC D F D = ABC D E = BC F = AE F AE = BC = D F = ABC D E F
I = AD E F E = ABC = BC D E F = AD F AF = D E = B D E F = ABC D
Resolution F = ABC E F = BC D = AD E B D = C F = AC D E = AB E F
IV B F = C D = AC E F = AB D E
Experiment Factor level
number X 1 (A) X 2 (B) X 3 (C) X 4 (D) X 5 (E) X 6 (F)
1 1 1 1 1 1 1
2 1 1 1 +1 1 +1
3 1 1 +1 1 +1 +1
4 1 1 +1 +1 +1 1
5 1 +1 1 1 +1 +1
6 1 +1 1 +1 +1 1
7 1 +1 +1 1 1 1
8 1 +1 +1 +1 1 +1
9 +1 1 1 1 +1 1
10 +1 1 1 +1 +1 +1
11 +1 1 +1 1 1 +1
12 +1 1 +1 +1 1 1
13 +1 +1 1 1 1 +1
14 +1 +1 1 +1 1 1
15 +1 +1 +1 1 +1 1
16 +1 +1 +1 +1 +1 +1
It must be noted that Latin square designs are equivalent to specific fractional
factorial designs. For instance, a Latin square with L levels per factor is the same as
a L 31 fractional factorial design.
24 2 Design of Experiments
A central composite design is a 2k full factorial to which the central point and the star
points are added. The star points are the sample points in which all the parameters
but one are set at the mean level m. The value of the remaining parameter is given
in terms of distance from the central point. If the distance between the central point
and each full factorial sample is normalized to 1, the distance of the star points from
the central point can be chosen in different ways:
if it is set to 1, all the samples are placed on a hypersphere centered in the central
point (central composite circumscribed, or CCC). The method requires five levels
for each factor,
namely ll, l, m, h, hh,
if it is set to k , the value of the parameter remains on the same levels of the 2k
k
full factorial (central composite faced, or CCF). The method requires three levels
for each factor, namely l, m, h,
if a sampling like the central composite circumscribed is desired, but the limits
specified for the levels cannot be violated, the CCC design can be scaled
down
so that all the samples have distance from the central point equal to kk (central
composite inscribed, or CCI). The method requires five levels for each factor,
namely l, lm, m, mh, h,
if the distance is set to any other value, whether it is < kk (star points inside the
design space), <1 (star points inside the hypersphere), or >1 (star points outside the
hypersphere), we talk of central composite scaled, or CCS. The method requires
five levels for each factor.
For k parameters, 2k star points and one central point are added to the 2k full
factorial, bringing the sample size for the central composite design to 2k +2k +1. The
fact of having more samples than those strictly necessary for a bilinear interpolation
(which are 2k ), allows the curvature of the design space to be estimated.
Figure 2.5 shows a few graphical examples of central composite experimental
designs.
2.3 DOE Techniques 25
2.3.6 Box-Behnken
Box-Behnken [11] are incomplete three-levels factorial designs. They are built com-
bining two-levels factorial designs with incomplete block designs in a particular
manner. Box-Behnken designs were introduced in order to limit the sample size as
the number of parameters grows. The sample size is kept to a value which is sufficient
for the estimation of the coefficients in a second degree least squares approximating
polynomial. In Box-Behnken designs, a block of samples corresponding to a two-
levels factorial design is repeated over different sets of parameters. The parameters
which are not included in the factorial design remain at their mean level through-
out the block. The type (full or fractional), the size of the factorial, and the number
of blocks which are evaluated, depend on the number of parameters and it is cho-
sen so that the design meets, exactly or approximately, the criterion of rotatability.
An experimental design is said to be rotatable if the variance of the predicted response
at any point is a function of the distance from the central point alone.
Since there is not a general rule for defining the samples of the Box-Behnken
designs, tables are given by the authors for the range from three to seven, from nine
to twelve and for sixteen parameters. For better understandability of this experimental
design technique, Table 2.7 shows a few examples. In the table, each line stands for
a factorial design block, the symbol individuates the parameters on which the
26 2 Design of Experiments
factorial design is made, 0 stands for the variables which are blocked at the mean
level.
Let us consider the Box-Behnken design with three parameters (Table 2.7a),
in this case a 22 full factorial is repeated three times:
i. on the first and the second parameters keeping the third parameter at the mean
level (samples: llm, lhm, hlm, hhm),
ii. on the first and the third parameters keeping the second parameter at the mean
level (samples: lml, lmh, hml, hmh),
iii. on the second and the third parameters keeping the first parameter at the mean
level (samples: mll, mlh, mhl, mhh),
then the central point (mmm) is added. Graphically, the samples are at the mid-
points of the edges of the design space and in the centre (Fig. 2.6). An hypothetical
graphical interpretation for the k = 4 case is that the samples are placed at each
midpoint of the twenty-four two-dimensional faces of the four-dimensional design
space and in the centre.
As for the CCC and the CCI, all the samples have the same distance from the
central point. The vertices of the design space lie relatively far from the samples and
on the outside of their convex hull, for this reason a response surface based on a
Box-Behnken experimental design may be inaccurate near the vertices of the design
space. The same happens for CCI designs.
2.3.7 Plackett-Burman
Plackett-Burman are very economical, two-levels, resolution III designs [12]. The
sample size must be a multiple of four up to thirty-six, and a design with N samples
can be used to study up to k = N 1 parameters. Of course, as the method requires
2.3 DOE Techniques 27
a very small number of experiments, the main effects are heavily confounded with
two-factors interactions and Plackett-Burman designs are useful just for screening
the design space to detect large main effects. As in the case of Box-Behnken, Plackett-
Burman designs do not have a clear defining relation and tables for a different number
of factors are given by the authors. For N which is a power of two, the designs are
k p
equivalent to 2III fractional factorial designs, where 2k p = N . In Plackett-Burman
designs, a main effect column X i is either orthogonal to any X i X j two-factors
interaction or identical to plus or minus X i X j .
The cases N = 4, N = 8, N = 16, N = 32 are equivalent to 231 , 274 , 21511 ,
23126 fractional factorial designs. For the cases N = 12, N = 20, N = 24, N = 36
a row of 11, 19, 23, and 35 plus (high level) and minus signs (low level) is given
(Table 2.8). The Plackett-Burman designs are obtained writing the appropriate row as
the first row of the design table. The second row is generated by shifting the elements
of the first row one place right, and so on for the other rows. In the end, a row of
minus signs is added. Table 2.8 shows the Plackett-Burman patterns for N = 12,
N = 20, N = 24, N = 36, and the sample space for the case N = 12. The designs
for the N = 28 case are built in a different way: three patterns of 9 9 plus and
minus signs are given, and these patterns are assembled in a 27 27 table, then a
row of minus signs is added in the end as usual. In Plackett-Burman designs, if the
parameters are less than N 1, the first k columns are taken and the N 1 k last
columns of the design table are discarded.
2.3.8 Taguchi
The Taguchi method was developed by Genichi Taguchi [8] in Japan to improve
the implementation of off-line total quality control. The method is related to finding
the best values of the controllable factors to make the problem less sensitive to the
variations in uncontrollable factors. This kind of problem was called by Taguchi
robust parameter design problem.
Taguchi method is based on mixed levels, highly fractional factorial designs, and
other orthogonal designs. It distinguishes between control variables, which are the
factors that can be controlled, and noise variables, which are the factors that cannot
be controlled except during experiments in the lab. Two different orthogonal designs
are chosen for the two sets of parameters. We call inner array the design chosen for
the controllable variables, and outer array the design chosen for the noise variables.
The combination of the inner and the outer arrays give the crossed array which is the
list of all the samples scheduled by the Taguchi method. By combination we mean
that for each sample in the inner array the full set of experiments of the outer array is
performed. An important point about the crossed array Taguchi design is that, in this
way, it provides information about the interaction between the controllable variables
and the noise variables. These interactions are crucial for a robust solution.
Let us consider a problem with five parameters (k = 5), three of which are con-
trollable (kin = 3) and two uncontrollable (kout = 2), and let us consider two-levels
28 2 Design of Experiments
Table 2.8 Plackett-Burman patterns for N = 12, N = 20, N = 24, N = 36, and example of
Plackett-Burman experimental design for k = 11
k N Plackett-Burman pattern
11 12 ++++++
19 20 ++++++++++
23 24 ++++++++++++
35 36 + + + + + + + + + + + + + + + +
++
Experiment Parameter
number X1 X2 X3 X4 X5 X6 X7 X8 X9 X 10 X 11
1 +1 +1 1 +1 +1 +1 1 1 1 +1 1
2 1 +1 +1 1 +1 +1 +1 1 1 1 +1
3 +1 1 +1 +1 1 +1 +1 +1 1 1 1
4 1 +1 1 +1 +1 1 +1 +1 +1 1 1
5 1 1 +1 1 +1 +1 1 +1 +1 +1 1
6 1 1 1 +1 1 +1 +1 1 +1 +1 +1
7 +1 1 1 1 +1 1 +1 +1 1 +1 +1
8 +1 +1 1 1 1 +1 1 +1 +1 1 +1
9 +1 +1 +1 1 1 1 +1 1 +1 +1 1
10 1 +1 +1 +1 1 1 1 +1 1 +1 +1
11 +1 1 +1 +1 +1 1 1 1 +1 1 +1
12 1 1 1 1 1 1 1 1 1 1 1
full factorial experimental designs for the inner and the outer arrays. We assume full
factorial designs for simplicity, even though they are never taken into consideration
by the Taguchi method. Therefore, we must perform a full 22 factorial design (outer
array) for each sample of the 23 inner array. We can graphically represent the situation
as in Fig. 2.7.
2.3 DOE Techniques 29
Table 2.9 Example of Taguchi DOE for kin = 3, kout = 2, 23 full factorial inner array, 22 full
factorial outer array
Inner aray Outer array Output
Exp. num Parameter Exp.num 1 2 3 4 Mean Std. deviation
X in,1 X in,2 X in,3 Par. X out,1 1 1 +1 +1
X out,2 1 +1 1 +1
1 1 1 1 y1,1 y1,2 y1,3 y1,4 E [y1 ] E[(y1 E [y1 ])2 ]
2 1 1 +1 y2,1 y2,2 y2,3 y2,4 E [y2 ] E[(y2 E [y2 ])2 ]
3 1 +1 1 y3,1 y3,2 y3,3 y3,4 E [y3 ] E[(y3 E [y3 ])2 ]
4 1 +1 +1 y4,1 y4,2 y4,3 y4,4 E [y4 ] E[(y4 E [y4 ])2 ]
5 +1 1 1 y5,1 y5,2 y5,3 y5,4 E [y5 ] E[(y5 E [y5 ])2 ]
6 +1 1 +1 y6,1 y6,2 y6,3 y6,4 E [y6 ] E[(y6 E [y6 ])2 ]
7 +1 +1 1 y7,1 y7,2 y7,3 y7,4 E [y7 ] E[(y7 E [y7 ])2 ]
8 +1 +1 +1 y8,1 y8,2 y8,3 y8,4 E [y8 ] E[(y8 E [y8 ])2 ]
Using L kin and L kout full factorial designs the Taguchi method is equivalent to a
generic L kin +kout full factorial, and using fractional factorial designs or other orthog-
onal designs, the outcome in terms of number and distribution of the samples would
not be too different from some fractional factorial over the whole number of parame-
ters kin +kout . However, the stress is on the distinction between controllable variables
and noise variables. Looking at the design as a way of performing a set of samples
(outer array) for each sample in the inner array allows us to estimate the mean value
and the standard deviation, or other statistical values for each design point as noise
enters the system. The aim then is to improve the average performance of the prob-
lem while keeping the standard deviation low. This idea is shown in Table 2.9 for the
example given above and summarized in Fig. 2.7. Actually, Taguchi did not consider
the mean response variable and its standard deviation as performance measures.
He introduced more than sixty different performance measures to be maximized,
which he called signal-to-noise ratios (SN). Depending on the nature of the inves-
tigated problem, an appropriate ratio can be chosen. These performance measures,
however, have not met much success in that their responses are not always meaningful
for the problem. The most well-known signal-to-noise ratios are [13]:
smaller-the-better: to be used when the response variable is to be minimized.
SNstb = 10 log10 E yi2 (2.8)
nominal-the-best: to be used when a target value is sought for the response variable.
E2 [yi ]
SNntb = 10 log10
(2.10)
E (yi E [yi ])2
E stands for the expected value. According to the Taguchi method, the inner and
the outer arrays are to be chosen from a list of published orthogonal arrays. The
Taguchi orthogonal arrays, are individuated in the literature with the letter L, or LP
for the four-levels ones, followed by their sample size. Suggestions on which array
to use depending on the number of parameters and on the numbers of levels are
provided in [14] and are summarized in Table 2.10. L8 and L9 Taguchi arrays are
reported as an example in Table 2.11. Whenever the number of variables is lower
than the number of columns in the table the last columns are discarded.
2.3.9 Random
The DOE techniques discussed so far are experimental design methods which origi-
nated in the field of statistics. Another family of methods is given by the space filling
DOE techniques. These rely on different methods for filling uniformly the design
space. For this reason, they are not based on the concept of levels, do not require
discretized parameters, and the sample size is chosen by the experimenter indepen-
dently from the number of parameters of the problem. Space filling techniques are
generally a good choice for creating response surfaces. This is due to the fact that,
for a given N , empty areas, which are far from any sample and in which the interpo-
lation may be inaccurate, are unlikely to occur. However, as space filling techniques
2.3 DOE Techniques 31
are not level-based it is not possible to evaluate the parameters main effects and the
interaction effects as easily as in the case of factorial experimental designs.
The most obvious space filling technique is the random one, by which the design
space is filled with uniformly distibuted, randomly created samples. Nevertheless,
the random DOE is not particularly efficient, in that the randomness of the method
does not guarantee that some samples will not be clustered near to each other, so that
they will fail in the aim of uniformly filling the design space.
Several efficient space filling techniques are based on pseudo-random numbers gen-
erators. The quality of random numbers is checked by special tests. Pseudo-random
numbers generators are mathematical series generating sets of numbers which are
able to pass the randomness tests. A pseudo-random number generator is essentially
a function : [0, 1) [0, 1) which is applied iteratively in order to find a serie
of k values
k = (k1 ) , for k = 1, 2, . . . (2.11)
T
n= a j b j1 (2.12)
j=1
b : N0 [0, 1)
T
aj (2.13)
b (n) =
bj
j=1
Halton sequence [17] uses base-two Van der Corput sequence for the first
dimension, base-three sequence in the second dimension, base-five in the third dimen-
sion, and so on, using the prime numbers for base. The main challenge is to avoid
multi-dimensional clustering. In fact, the Halton sequence shows strong correlations
between the dimensions in high-dimensional spaces. Other sequences try to avoid
this problem.
Faure [18, 19] and Sobol sequences [20] use only one base for all dimensions and
a different permutation of the vector elements for each dimension.
The base of a Faure sequence is the smallest prime number 2 that is larger or
equal to the number of dimensions of the problem. For reordering the sequence, a
recursive equation is applied to the a j coefficients. Passing from dimension d 1 to
dimension d the reordering equation is
(d)
T
( j 1)! (d1)
ai (n) = a mod b. (2.14)
(i 1)! ( j i)! j
j=i
Sobol sequence uses base two for all dimensions and the reordering task is much
more complex than the one adopted by Faure sequence, and is not reported here.
Sobol sequence is the more resistant to the high-dimensional degradation.
In latin hypercube DOE the design space is subdivided into an orthogonal grid with
N elements of the same length per parameter. Within the multi-dimensional grid,
N sub-volumes are invididuated so that along each row and column of the grid only
one sub-volume is chosen. In Fig. 2.8, by painting the chosen sub volumes black
gives, in two dimensions, the typical crosswords-like graphical representation of
latin hypercube designs. Inside each sub-volume a sample is randomly chosen.
It is important to choose the sub-volumes in order to have no spurious correlations
between the dimensions or, which is almost equivalent, in order to spread the samples
all over the design space. For instance, a set of samples along the design space
diagonal would satisfy the requirements of a latin hypercube DOE, although it would
show a strong correlation between the dimensions and would leave most of the design
space unexplored. There are techniques which are used to reduce the correlations in
latin hypercube designs.
Let us assume the case of k parameters and N samples. In order to compute a set of
Latin hypercube samples [21] two matrices Q N k and R N k are built. The columns
of Q are random permutations of the integer values from 1 to N . The elements of
R are random values uniformly distributed in [0, 1]. Assuming each parameter has
range [0, 1], the sampling map S is given by
34 2 Design of Experiments
(a) (b)
Fig. 2.8 Example of latin hypercube designs
1
S= (Q R) . (2.15)
N
with mean value and standard deviation. X is the matrix whose rows are the
samples of the latin hypercube DOE. In case of uniformly distributed parameters on
the interval [0, 1], X = S is taken. The correlation reduction operation is essentially
an operation on Q. We map the elements of Q divided by N + 1 over a matrix Y
through the normal Gaussian cumulative distribution function Dnor m
1 qi, j
yi, j = Dnor m . (2.18)
N +1
1
N
ci, j = yl,i i yl, j j (2.20)
N
l=1
where i is the average of the values in the ith column of Y. The Choleski decom-
position requires C to be positive definite. For the way the matrix is built this is
guaranteed if N > k. A new matrix Y is computed so that
T
Y = Y L1 (2.21)
and the ranks of the elements of the columns of Y become the elements in the
columns of the matrix Q which is used in place of Q in order to compute the
samples.
A Matlab/Octave script implementing the method is reported in Appendix A.1
and a numerical example in Table 2.12. Figure 2.9 shows the effect of the correlation
reduction procedure for a case with two parameters and ten samples. The correlation
reduction was obtained using the above-mentioned script. Figure 2.10 shows a com-
parison between random, Sobol, and latin hypercube space filling DOE techniques
on a case with two parameters and a thousand samples. It is clear that the random
method is not able to completely avoid samples clustering. Using latin hypercubes
the samples are more uniformly spread in the design space. The Sobol sequence
gives the most uniformly distributed samples.
36 2 Design of Experiments
Fig. 2.10 A comparison between different space filling DOE techniques for k = 2, N = 1,000
Optimal design [22, 23] is a good DOE method whenever the classical orthogo-
nal methods may fail due to the presence of constraints on the design space. It is
a response-surface-oriented method whose output depends on the RSM technique
which is intended to be used later. A set of candidate samples is needed at the begin-
ning. This is usually given by an adjustable full factorial experimental design with
many levels for each parameter. Optimal design tests different sets of samples look-
ing for the one minimizing a certain function. It is an iterative method which involves
an onerous computation and could require a lot of time to be completed. For instance,
consider that for k parameters, with L levels each, the number of possible combi-
kN
nations of N samples in the set are LN ! : for the very simple case of k = 3, L = 4,
N = 10 this would mean 3.2 1011 sets to be tested. For this reason, optimization
algorithms are usually applied to the search procedure. The procedure is stopped after
a certain number of iterations, and the best solution found is taken as the optimal.
The output of the method is a set of samples spread through the whole design space.
As the number of samples grows, optimal designs often include repeated samples.
2.3 DOE Techniques 37
23 Full factorial
Experiment Parameters [mm] Results
number L Din Dout M [g] max [MPa]
1 80 13 17 59.19 189.04
2 80 13 19 94.70 114.11
3 80 16 17 16.28 577.68
4 80 16 19 51.79 179.24
5 100 13 17 73.98 236.30
6 100 13 19 118.4 142.64
7 100 16 17 20.35 722.10
8 100 16 19 64.74 224.05
31
2III , I = ABC Fractional factorial
Experiment Parameters [mm] Results
number L Din Dout M [g] max [MPa]
1 80 13 19 94.70 114.11
2 80 16 17 16.28 577.68
3 100 13 17 73.98 236.30
4 100 16 19 64.74 224.05
Central composite circumscribed
Experiment Parameters [mm] Results
number L Din Dout M [g] max [MPa]
18 as the 23 full factorial
9 90 14.5 18 63.12 203.65
10 90 14.5 16.27 30.22 432.45
11 90 14.5 19.73 99.34 126.39
12 90 17.10 18 17.53 635.56
13 90 11.90 18 101.2 145.73
14 72.68 14.5 18 50.97 164.46
15 107.3 14.5 18 75.26 242.84
Box-Behnken
Experiment Parameters [mm] Results
number L Din Dout M [g] max [MPa]
1 80 13 18 76.45 143.96
2 80 16 18 33.54 278.92
3 100 13 18 95.56 179.95
4 100 16 18 41.92 346.09
5 80 14.50 17 38.84 264.26
6 80 14.50 19 74.35 134.84
7 100 14.50 17 48.55 330.33
8 100 14.50 19 92.94 168.55
9 90 13 17 66.59 212.67
10 90 13 19 106.5 128.37
11 90 16 17 18.31 649.89
12 90 16 19 58.26 201.64
13 90 14.50 18 63.12 203.65
38 2 Design of Experiments
Latin hypercube
Experiment Parameters [mm] Results
number L Din Dout M [g] max [MPa]
1 81.59 14.04 18.76 77.88 137.56
2 83.25 14.33 18.54 71.03 155.18
3 84.24 15.39 17.05 27.97 386.23
4 86.93 13.76 17.54 63.41 198.10
5 88.88 14.59 17.84 57.76 216.38
6 91.58 13.48 17.21 64.63 220.09
7 92.89 15.86 17.61 33.54 379.86
8 95.35 15.61 18.85 65.64 205.31
9 97.07 13.29 18.20 92.53 171.88
10 98.81 14.81 18.15 67.06 226.79
Different optimal design methods involve different optimality criteria. The most
popular is the I-optimal which aims at the minimization of the normalized average,
or integrated prediction variance. In I-optimal designs of multivariate functions, the
variance of the predicted response variable
is integrated over the design space. Equation 2.22 comes from the delta method
for deriving an approximate probability distribution for a function of a statistical
estimator.
x = [x1 , .. . , xk ] is a point in the design space in the neighbourhood of
x0 = x0,1 , . . . , x0,k , and var (x) is the covariance matrix
var (x1 ) cov (x1 , x2 ) . . . cov (x1 , xk )
cov (x2 , x1 ) var (x2 ) . . . cov (x2 , xk )
.. .. .. .. (2.23)
. . . .
cov (xk , x1 ) cov (xk , x2 ) . . . var (xk )
where xi , i = 1, . . . , k are the parameters. The variance of the ith parameter and the
covariance of the ith and the jth parameters are defined as
N
2
xl,i i
l=1
var (xi ) = E (xi i )2 = (2.24)
N
N
xl,i i xl, j j
l=1
cov xi , x j = E (xi i ) x j j = (2.25)
N
2.3 DOE Techniques 39
N
x
where E is the expected value of the quantity in brackets and i = E [xi ] = i=1 N
i
k
k
k1
k
y (x) = 0 + i xi + i,i xi2 + i, j xi x j + (2.26)
i=1 i=1 i=1 j=i+1
where y (x) is the response variable, x1 , . . . , xk are the parameters, are the errors of
the quadratic model which are independent, with zero mean value, and 2 variance.
are the p = (k+1)(k+2)
2 unknown coefficients. Assuming that the design consists
of N p samples
x j = x j,1 , . . . , x j,k , j = 1, . . . N (2.27)
1 T
MX = X X. (2.29)
N
The prediction variance at an arbitrary point x and the integrated prediction variance,
which is the objective to be minimized in a I-optimal design, are
2
var y (x) = f (x) MX 1 f (x)T (2.30)
N
n
I = vary (x) dr (x) = trace MMX 1 (2.31)
2 R
Optimal designs and their objectives are summarized in Table 2.13 for the case of
a polynomial response surface. A Maxima script for computing the matrix M and a
Matlab/Octave script implementing the above equations for finding the I-optimal set
of samples are presented in Appendix A.2 for either full quadratic or cubic polynomial
response with two parameters. Figure 2.11 shows three I-optimal designs obtained
using the script for the cases k = 2, L = 21 with N = 6, and with N = 10 for a full
40 2 Design of Experiments
Fig. 2.11 Example of I-optimal designs for k = 2, L = 21, polynomial response surface
quadratic polynomial response surface, and with N = 20 for a full cubic polynomial
response surface.
2.4 Conclusions
technique is the best choice, because a cheap technique means imprecise results
and insufficient design space exploration. Unless the number of experiments which
can be afforded is high, it is important to limit the number of parameters as much as
possible in order to reduce the size of the problem and the effort required to solve
it. Of course the choice of the parameters to be discarded can be a particularly
delicate issue. This could done by applying a cheap technique (like Plackett-
Burman) as a preliminary study for estimating the main effects.
the number of levels L for each parameter.
42 2 Design of Experiments
The number of experiments also grows very quickly with the number of levels
admitted for each factor. However, a small number of levels does not allow a good
interpolation to be performed on the design space. For this reason, the number of
levels must be chosen carefully: it must be limited when possible, and it has to be
kept higher if an irregular behaviour of the response variable is expected. If the
DOE is carried out for RSM purpose, it must be kept in mind that a two-levels
method allows approximately a linear or bilinear response surface to be built,
a three-levels method allows a quadratic or biquadratic response surface, and so
on. This is just a rough hint on how to choose the number of levels depending on
the expected regularity of the response variable.
the aim of the DOE.
The choice of a suitable DOE technique depends also on the aim of the experi-
mentation. If a rough estimate of the main effects is sufficient, a Plackett-Burman
method would be preferable. If a more precise computation of the main and some
interaction effects must be accounted for, a fractional or a full factorial method is
better. If the aim is to focus on a primary factor a latin square or a randomized
complete block design would be suitable. If noise variables could influence sig-
nificantly the problem a Taguchi method is suggested, even if a relatively cheap
method also brings drawbacks. For RSM purposes, a Box-Behnken, a full facto-
rial, a central composite, or a space filling technique has to be chosen. Table 2.14
summarizes the various methods, their cost in term of number of experiments, and
their aims. The suitability column is not to be intended in a restrictive way. It is just
an hint on how to use DOE techniques since, as reminded above, much depends on
the complexity of the problem, the availability of resources and the experimenter
sensitivity. To the authors experience, for a given number of experiments and for
RSM purpose, space filling Sobol and Latin hypercube DOE always over-perform
the other techniques. It is also to be reminded that when dealing with response
surfaces it is not just a matter of choosing the appropriate DOE technique, also the
RSM technique which is coupled to the DOE data can influence significantly the
overall result. This issue takes us to the next chapter.
http://www.springer.com/978-3-642-31186-4