Genetic Algorithm Toolbox For SCILABUSERGUIDE
Genetic Algorithm Toolbox For SCILABUSERGUIDE
GUIDE
Li Zhong
zhli@liama.ia.ac.cn
May 19, 2004
Contents
1 Introduction 2
1.1 GATS features . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 Task file and running . . . . . . . . . . . . . . . . . . . . . . . . . 4
2 User interface 5
2.1 Population and coding . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2 Hard and soft constraint on variable . . . . . . . . . . . . . . . . 7
2.3 Objectives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.4 Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.4.1 Select, Crossover and Mutate method . . . . . . . . . . . 10
2.4.2 Self-adjust algorithm . . . . . . . . . . . . . . . . . . . . . 11
2.4.3 Fitness scaling and niche . . . . . . . . . . . . . . . . . . 12
2.4.4 Best reserve strategy . . . . . . . . . . . . . . . . . . . . . 13
2.5 Subtask . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3 Multi-objective optimization 14
4 Parallel algorithm 16
1
Begin
Scaling Constraint Niche
Popu(t)
Best Reserve No
yes
end?
Select Multi-objtive
evaluate
parallel
evaluate
Crossover Self-adjust evaluate
Subtask parallel
Subtask
Mutate
Subtask
Popu(t+1)
1 Introduction
Genetic Algorithm (GA) is one of main methods in artificial intelligence field.
It has some attractive character, such as general propose, self-organized, stable,
and can deal some difficult problem where other methods fail. Its principle is
based on Darwin’s Evolution Theory, simple but work well. As a result, this
algorithm got full-development and are used widely in search, optimization,
and machine learning, etc. In past decades, a large number of variations and
enhancements of this algorithm have emerged since Holland first proposed it
(we call it simple GA) in 1960s. They are different from each other here and
there, but share the same principle and have similar progress.
The running process of simple genetic algorithm can be viewed as a cycling,
in each cycle population is evaluated, selected, crossovered and mutated.
Following is genetic algorithm’s flow chart:
For more information about genetic algorithm, please turn to books and
papers on this thesis.
SCILAB is a clone of MATLAB, the most popular scientific computing lan-
guage. It was developed by INRIA and ENPC. SCILAB is free, powerful and
open source, can be used to replace MATLAB in most of case. Normally, you
can port your MATLAB programs to SCILAB with a minmum modification.
In education field, SCILAB is a better choice compared with MATLAB, despite
some shortcomings.
SCILAB can run on Unix/Linux and windows. It support parallel processing
on Unix/Linux with PVM interface. It’s a very useful feature because sometimes
the problem is too difficult to be resolved in feasible time on single machine.
One of advantages of genetic algorithm is that it can easily run in parallel.
To help people use and research genetic algorithm, I develop this toolbox
for SCILAB. Any suggestion and bug reporting is welcome, and will help me to
improve this toolbox. I hope my toolbox can be useful for person interested in
genetic algorithm and SCILAB. Hope you enjoy it.
2
1.1 GATS features
Because of genetic algorithm’s character, it is very easy to enhance it by adding
new coding, method, and strategy. In past 40 years, there are many algorithms
have been developed based on or originated from the simple genetic algorithm.
They all belong to genetic algorithm cluster. Provided with different features
or enhancements, they deal with problems in broader ranges, improve the com-
puting efficiency, and get better solution.
One of this toolbox’s intentions is to make it full-function and flexible. To
do so, it has a careful-designed general framework to include as many genetic
algorithms as possible. The whole structure of the toolbox can be seen as a
backbone and many optional plug-ins. The backbone realizes the flow of genetic
algorithm and integrates selected plug-ins to construct the whole system.
The interface between backbone and plug-ins is standard and one plug-in can
replace another of same type without modification of backbone. The same parts
of GAs are realized in backbone, while the different parts of GAs are realized
in different plug-ins. With this structure, the toolbox is easy to extended.
Till now, this toolbox has already supported following features:
In these features, three features are most novel and useful. First, it induces
a file named “task file” to record configuration of backbone and plug-ins. To
make task file more easily be used, toolbox provides a user interface to edit task
file conveniently. Second, toolbox supports multi-objectives optimization and
Pareto algorithm. Last, toolbox support parallel processing, With this feature
added, GATS greatly enhances its ability to deal very difficult problems. Those
are my contribution.
Because I don’t want to reinvent wheel, part of my toolbox’s code is from
GAOT, it include most of select, crossover, mutate methods with binary and real
coding. Those files have their own copyright announcement in the file content.
My toolbox is released under the GPL 2.0, please read the COPYING file in the
directory. Others are all my own work. My work focuses on Pareto algorithm,
self-adjust algorithm, hard or soft constraints, subtask, parallel algorithm etc.
3
User
User Interface
Task
File Parser Program
Running
The other way is to run a task in a program script, you can edit the task
file manually and call genetic algorithm with specified task file in your own
program. For example, after editing a task file named ’YOUR.TASK’, you can
create a new SCILAB command file named ’start.sci’ in your working directory,
and add the following sentence in the script.
1. UG SYS DIR=’C:\TOOLBOX\INSTALL\DIRECTORY\’;
2. UG USER DIR=’C:\YOUR\WORKING\DIRECTORY\’;
3. ParseTask(’DIR/YOUR.TASK’,[]);
The first sentence is used to set the toolbox’s path, you must replace it with
your own path setting. You can also include this sentence in your SCILAB
startup scripts to load it automatically.
4
Figure 3: Main window of GATS user interface
The advantage of user interface mode is that user can modify the feature of
genetic algorithm and see its effect immediately, and user can stop a running
task at anytime. So it’s convenient for user to change and test features or
parameters just to have a try. Script mode is most helpful when user need to
run a genetic algorithm without intervening with user. And genetic algorithm
provided by this toolbox can be easily integrated into other algorithm or system
in this way.
2 User interface
User interface is designed to help people use this toolbox. It is written in tcl/tk.
This user interface is platform independent, and can run on Unix, Windows,
and Mac. It is used to load, modify and run a task file.
The main purpose of user interface is to help user to specify all kinds of
features in task file easily. Because there are so many available features in
toolbox, it’s difficult to memory and decide proper features and parameters
in editing task file. And also, if user write task file manually, it’s very likely
that user make mistakes in format and content when task file become lengthy
and complex. User interface can help user to avoid such errors. The genetic
algorithm features are organized into several dialogs according to their internal
logic relationship. When any change occurs in a dialog and is confirmed by user,
the task file will reflect this change correspondingly. By providing several dialogs
to help user to input or change GA features and parameters, user interface help
user to construct or modify a task file rapidly and conveniently. The user
interface can also load and run a task file, show its result alive and stop the
process at any time. It provides basic file management, such as load, save, save
as task file. And it runs a task file in interactive mode. This means that user
can see the result on the fly and can stop the process at his pleasure.
5
Figure 4: Population and coding set dialog
6
Figure 5: Constraints set dialog
Bounds: [1,2,0.0001;0,2.0,0.01]
Coding: Binary
7
Figure 6: Objective set dialog
2.3 Objectives
In GATS, the Objective function is the only function that must be written
by users. Objective function will be called with every individual to return a
value that is consequently converted to fitness. Fitness reflects the ability of
the individual to be fitful to environment. In this toolbox, the higher fitness,
the more fitful of the individual. Objective functions can have parameters. An
example of objective setting is as following:
On the left, the entry labeled “objectives number” is the place where user
can input or change the number of objectives. If the value in this entry is
changed, the right column will change accordingly. Column labeled “objective
function” and Column labeled “objective function” are places where use input
functions and parameters. The button between two columns can invoke a dialog
to facilitate your selecting file.
At the left-bottom corner, there is a check button labeled “Evaluation Paral-
lel”. It is used to set whether using parallel processing when evaluating objective
functions. The below entry sets the maximum number of processes the toolbox
can spawn. This feature can be supported only on Unix/Linux platform.
8
Figure 7: Algorithm set dialog
At the left-up corner, there is also a check button labeled “collect running
info”, which decide whether toolbox collect running information of process.
Running information includes the statistic of fitness data of every generation.
Because collecting running info is time consuming. It will save much time when
closing this feature.
The last two component to be introduced are a combobox and an entry to set
termination condition when cycling. They are at the left-middle of the dialog,
where user can select termination method and its parameter.
The corresponding sentences of this example are as following:
EvalFun: gaDemo1Eval1 []
CycleBegin: maxGenTerm 100
InformCollect: on
2.4 Algorithm
There are many important variations of genetic algorithm that already be inter-
grated into GATS. Some of them are different with each other in select, crossover
and mutate; some of them have various enhancement respectively; some can be
processed in parallel, etc. So the algorithm setting dialog is the most complex
one. Following is an example:
The left-up part is used to set select, crossover and mutate methods. The
right up corner is used to set self adaptive algorithm. The left down part is used
to set fitness scaling and niche features. The right bottom part is used to set
best reserve strategy.
The corresponding sentences of this example are as following:
Select: RouletteSelect []
Crossover: SimpleXover [0.2]
9
Mutation: BinaryMutation [0.05]
FitScaling: LinearFitScaling [1.1]
• Real: RDiscreteXover
• Real: RLinearXover
• Real: RHeuristicXover
• Real: RHeuristic2Xover
• Order: OUniformXover
• Order: OSimplePointXover
• Order: OPartMatchXover
10
• Order: OSlideXover
• Order: OCycleXover
User can specify crossover method in a combobox labeled “crossover”. If he
has his own select method, he could also push the button next the combobox
and open a dialog to specify it. There is an entry at the right of button, where
user can input crossover method parameters by hand.
Mutate is to change part of gene of an individual with a certain probability
(usually, this probability is very low). As a result, a new individual is born,
which is similar with the original one but with a little change. When two
individual is very similar, with most of their gene is same, it’s difficult to induce
new gene through crossover. Consequently, the search will nearly stop. So a
new method is needed to continue the search, that is mutation. Normally, only
a small mutate rate is set to avoid search become random search.
Different gene coding has different mutate method.
Toolbox support following mutate method:
• Binary and Gray : BinaryMutation
• Real: RNonUniformMutation
• Real: RBoundaryMutation
• Order: OInverseMutation
• Order: OSinglePointMutation
User can specify mutate method in a combobox labeled “mutate”. If he has
his own mutate method, he could also push the button next the combobox and
open a dialog to specify it. There is an entry at the right of button, where user
can input mutate method parameters by hand.
11
f'
c*favg
favg
f
fmin favg fmax
there are three comboboxes to select methods and three entry to input proce-
dure parameters. There are also three buttons to select user-defined procedure
as self adjust procedure.
• LinearFitScaling
• PowerFitScaling
• ExponentailFitScaling
In Linear Fitness Scaling method, suppose the original fitness is f, and the
result fitness is f 0 , the transformation can be expressed as following:
f0 = a ∗ f + b (1)
Assuming that the average fitness before and after transformation is same,
and after transformation, the maximum fitness is c times to average fitness, we
can get:
(c − 1) ∗ favg (fmax − c ∗ favg ) ∗ favg
a= , b= (2)
fmax − favg fmax − favg
The result is showed in following graph.
In the case that after transformation, the minimum fitness is less than zero,
to satisfy that all fitness must large than zero, the transformation formula must
changed to:
favg −favg ∗ fmin
a= , b= (3)
favg − fmin favg − fmin
This situation is showed as following:
12
f' f'
c*favg c*favg
favg
favg
fmin
f f
favg fmax fmin favg fmax
User can specify fitness scaling method in a combobox labeled “fit scaling”.
If he has his own fitness scaling method, he could also push the button next
the combobox and open a dialog to specify it. There is an entry at the right of
button, where user can input fitness scaling method parameters by hand.
In nature, individual of species are competing in a local area, with limited
food and other resources. To simulate such situation, researchers proposed niche
concept in GA. There are two ways to realize niche, one is cluster method, the
other is share method. In first way, the new individual can only replace the
similar parent individual to keep the diversity of the population. In second
way, the fitness is shared between individuals whom are similar enough. The
similarity between two individual is measured by the distance between their
gene or variables.
Toolbox support two niche method both:
• ClusterSelNiche :cluster niche method
• ShareEvaNiche :share niche method
User can specify niche method in a combobox labeled “fit scaling”. If he has
his own niche method, he could also push the button next the combobox and
open a dialog to specify it. There is an entry at the right of button, where user
can input niche method parameters by hand.
• Best reserve
• Extended elitist reserve
• Cross generation Elitism reserve
“Best reserve” strategy reserves the best individual from last generation to
current generation. By select this feature, the algorithm can make the max
fitness of population increase monotonously.
13
Subtask 1
The
“Extended elitist reserve” is much like the “Best reserve”, but it reserves
not only the best one, but the best N percent individuals of last generation.
The percent N is the parameter of this strategy, and can be set by user. This
strategy also has a option to set whether allowing homogeneous individual when
reserve parent generation individuals.
“Cross generation Elitism reserve” unite the last and new-born individual
as a whole, and select the best half as the new generation. This strategy can
make the mean fitness of population increase monotonously.
There three check buttons on the right down position of algorithm setting
dialog. The three choices are mutual excluding. User can select only one of
three strategies. When selecting the extended best reserve strategy, you must
also give how much percent best individuals are reserved from last generation,
and you can decide whether these individuals can include uniform ones.
2.5 Subtask
To extend the capability of toolbox, the toolbox induce subtask concept. When
setting a task, user can include other task files and setting them as subtasks.
When running this task (main task), population is divided into several parts
among those tasks, and those subtasks are called with respective population.
The subtask can also have its subtask too. This situation can be showed as
following:
The subtask setting dialog is very simple:
Left is an entry labeled “number of subtasks”, which is used to set the
number of subtasks. The right two columns set the subtask’s task file names
and the percent of population they have respectively. There is a check button
on the left-down corner. This check button set whether using parallel processing
when call subtasks. This feature can be supported only on Unix/Linux platform.
3 Multi-objective optimization
In optimization, genetic algorithm always need to optimizes many objectives si-
multaneously. Generally, it’s called multi-objective optimization. When dealing
with multi-objective optimization, the scalar concept of “optimality” does not
apply directly in such case. The usual way is to convert multi-objectives’ fitness
to a single fitness with a set of weight factors, but sometimes it’s a matter to
determind proper weights of objectives. In such case, the notion of “Pareto
optimality” is always used. Informally speaking, we can define it as following:
14
Figure 11: Subtask set dialog
15
f1
..
.
1
.
.
. . .
1
. . . 1
. .
.. 1
1
. 1
0 f2
4 Parallel algorithm
One of genetic algorithm’s major advantages is that it supports parallel process-
ing inherently. This character enable GA to solve very difficult and computing
resource consuming problems. New SCILAB version (from version 2.6) increases
a new feature that provides PVM interface to support parallel processing. PVM
is the most popular and accepted standard of parallel processing interface. This
feature makes developing parallel program on SCILAB much easier. With the
help of toolbox, user can take the advantage of parallel processing even without
16
f1
. ..
The line from a point to
its nearest Pareto optimal
..
. .
. .
1th Pareto border
0 f2
disturbing of PVM interface. What the user needs to do is just turn on the
parallel option and set necessary parameters to make it run. Toolbox supports
two levels of parallel processing: subtask level and evaluation level.
On subtask level parallel processing, each subtask is executed in a single
process, which can be parallel with each other. The number of spawned process
is equal to the number of subtask.
On evaluation level parallel processing, toolbox set the maximum number of
parallel processes, every evaluation is distributed to these processes to be dealt
of in parallel way. The maximum number of parallel processes is set by the user.
17
Reference
In developing of toolbox, I refer some book, paper and existing source code.
There are already some genetic algorithm toolboxes for MATLAB. But many of
them are commercial and without source code. Commercial software is maybe
more powerful, more convenient or better technical support. But for a person
who want to study or research on genetic algorithm, the open source package is
more favorable. Following are some of good genetic algorithm toolbox and its
website:
“Genetic Algorithm Toolbox for MATLAB”
http://www.shef.ac.uk/ gaipp/ga-toolbox/
“GEATbx: Genetic and Evolutionary Algorithm Toolbox for use with MAT-
LAB”
http://www.systemtechnik.tu-ilmenau.de/ pohlheim/GA Toolbox/
“The Genetic Algorithm Optimization Toolbox (GAOT) for MATLAB 5”
http://www.ie.ncsu.edu/mirage/GAToolBox/gaot/
Thanks to my director professor Hu Baogang for his support, and Wu Ling
for his good suggestions.
18