KEMBAR78
Denn Optimization by Variational Methods | PDF | Calculus Of Variations | Mathematical Optimization
0% found this document useful (0 votes)
1K views431 pages

Denn Optimization by Variational Methods

This book is an attempt to present a logical development of optimization theory at an elementary mathematical level. The book follows rather closely a course in optimization which I have taught at the University of Delaware since 1965. In two sections of Chapter 3 and in Chapter 11 a familiarity with partial differential equations is helpful but not essential.

Uploaded by

ksch123
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
1K views431 pages

Denn Optimization by Variational Methods

This book is an attempt to present a logical development of optimization theory at an elementary mathematical level. The book follows rather closely a course in optimization which I have taught at the University of Delaware since 1965. In two sections of Chapter 3 and in Chapter 11 a familiarity with partial differential equations is helpful but not essential.

Uploaded by

ksch123
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 431

OPTIMIZATION BY

VARIATIONAL METHODS
MORMON M. DENN
OPTIMIZATION BY
VARIATIONAL METHODS

MORTON M. DENN
Associate Professor of Chemical Engineering
University of Delaware
Preface

The development of a systematic theory of optimization since the mid-


1950s has not been followed by widespread application to the design and
control problems of the process industries. This surprising and disap-
pointing fact is largely a consequence of the absence of effective communi-
cation between theorists and process engineers, for the latter typically do
not have ,sufficient mathematical training to look past the sophistication
with which optimization theory is usually presented and recognize the
practical significance of the results. This book is an attempt to present a
logical development of optimization theory at an elementary mathematical
level, with applications to simple but typical process design and control
situations.
The book follows rather closely a course in optimization which I
have taught at the University of Delaware since 1965 to classes made up
of graduate students and extension students from local industry, together
with some seniors. Approximately half of the students each year have
been chemical engineers, with the remainder made up of other types of
engineers, statisticians, and computer scientists. The only formal
vii
rill PREFACE

mathematical prerequisites are a familiarity with calculus through


Taylor series and a first course in ordinary differential equations, together
with the maturity and problem awareness expected of students at this
level. In two sections of Chapter 3 and in Chapter 11 a familiarity with
partial differential equations is helpful but not essential. With this back-
ground it is possible to proceed in one semester from the basic elements of
optimization to an introduction to that current research which is of direct
process significance. The book deviates from the course in containing
more material of an advanced nature than can be reasonably covered in
one semester, and in that sense it may be looked upon as comprising in
part a research monograph.
Chapter 1 presents the essential features of the variational approach
within the limited context of problems in which only one or a finite num-
ber of discrete variables is required for optimization. Some of this
material is familiar, and a number of interesting control and design
problems can be so formulated. Chapter 2 is concerned with the parallel
variational'development of methods of numerical computation for such
problems. The approach of Chapter 1 is then resumed in Chapters 3
to 5, where the scope of physical systems analyzed is gradually expanded
to include processes described by several differential equations with
magnitude limits on the decisions which can be made for optimization.
The optimal control of a flow reactor is a typical situation.
In Chapters 6 and 7 much of the preceding work is reexamined and
unified in the context of the construction of Green's functions for linear
systems. It is only here that the Pontryagin minimum principle, which
dominates modern optimization literature, is first introduced in its com-
plete form and carefully related to the more elementary and classical
material which is sufficient for most applications.
Chapter 8 relates the results of optimization theory to problems of
design of practical feedback control systems for lumped multivariable
processes.
Chapter 9 is an extension of the variational development of prin-
ciples of numerical computation first considered in Chapter 2 to the more
complex situations now being studied.
Chapters 10 and 11 are concerned with both the theoretical develop-
ment and numerical computation for two extensions of process signifi-
cance. The former deals with complex structures involving recycle and
bypass streams and with periodic operation, while the latter treats
distributed-parameter systems. Optimization in periodically operated
and distributed-parameter systems represents major pertinent efforts of
current research.
Chapter 12 is an introduction to dynamic programming and
Hamilton-Jacobi theory, with particular attention to the essential
PREFACE ix

equivalence in most situations between this alternate approach and the


variational approach to optimization taken throughout the remainder
of the book. Chapter 12 can be studied at any time after the first half
of Chapters 6 and 7, as can any of the last five chapters, except that
Chapter 9 must precede Chapters 10 and 11.
Problems appear at the end of each chapter. Some supplement the
theoretical developments, while others present further applications.
In part for reasons of space and in part to maintain a gonsistent
mathematical level I have omitted any discussion of such advanced
topics as the existence of optimal solutions, the Kuhn-Tucker theorem,
the control-theory topics of observability and controllability, and
optimization under uncertainty. I have deliberately refrained from
using matrix notation in the developments as a result of my experience
in teaching this material; for I have found that the very conciseness
afforded by matrix notation masks the significance of the manipulations
being performed for many students, even those with an adequate back-
ground in linear algebra. For this reason the analysis in several chapters
is limited to two-variable processes, where every term can be conveniently
written out.
In preparing this book I have incurred many debts to colleagues and
students. None will be discharged by simple acknowledgement, but
some must be explicitly mentioned. My teacher, Rutherford Aris, first
introduced me to problems in optimization and collaborated in the
development of Green's functions as the unifying approach to variational
problems. J. R. Ferron, R. D. Gray,_Jr., G. E. O'Connor, A. K. Wagle,
and particularly J. M. Douglas have been most helpful in furthering my
understanding and have permitted me to use the results of our joint
efforts. The calculations in Chapter 2 and many of those in Chapter 9
were carried out by D. H. McCoy, those in Section 10.8 by G. E. O'Con-
nor, and Figures 6.1 to 6.4 were kindly supplied by A. W. Pollock. My
handwritten manuscript was expertly typed by Mrs. Frances Phillips.
For permission to use copyrighted material I am grateful to my several
coauthors and to the following authors and publishers:

The American Chemical Society for Figures 4.2, 5.7, 5.8, 9.1, 9.2, 9.7 to
9.13, 10.5 to 10.15, 11.8 to 11.12, which appeared in Industrial and
Engineering Chemistry Monthly and Fundamentals Quarterly.
Taylor and Francis, Ltd., for Figures 11.1 to 11.7 and Sections 11.2 to
11.7, a paraphrase of material which appeared- in International
Journal of Control.
R. Aris, J. M. Douglas, E. S. Lee, and Pergamon Press for Figures 5.9,
5.15, 9.3, and Table 9.8, which appeared in Chemical Engineering
Science.
x PREFACE

D. D. Perlmutter and the American Institute of Chemical Engineers for


Figures 8.1 and 8.2, which appeared in AIChE Journal.

Several of my colleagues at the University of Delaware have shaped


my thinking over the years about both optimization and pedagogy, and
it is my hope that their contributions to this book will be obvious at least
to them. J. M. Douglas and D. D. Perlmutter have kindly read the
entire manuscript and made numerous helpful suggestions for improve-
ment. For the decision not to follow many other suggestions from
students and colleagues and for the overall style, accuracy, and selection
of material, I must, of course, bear the final responsibility.

MORTON M, DENN
Contents

Preface vii

Introduction
OPTIMIZATION AND ENGINEERING PRACTICE 1
BIBLIOGRAPHICAL NOTES 2

Chapter 1 OPTIMIZATION WITH DIFFERENTIAL CALCULUS

1.1 Introduction 4
1.2 The Simplest Problem 4
1.3 A Variational Derivation 7
1.4 An Optimal-control Problem: Formulation 10
1.5 Optimal Proportional Control 12
1.6 Discrete Optimal Proportional Control 13
1.7 Discrete Optimal Control 15
1.8 Lagrange Multipliers 18
1.9 A Geometrical Example 21
A
X11 CONTENTS

1.10 Discrete Proportional Control with Lagrange Multipliers 23


1.11 Optimal Design of Multistage Systems 24
1.12 Optimal Temperatures for Consecutive Reactions 27
1.1'3 One-dimensional Processes 29
1.14 An Inverse Problem 30
1.15 Meaning of the Lagrange Multipliers 32
1.16 Penalty Functions 34
APPENDIX 1.1 Linear Difference Equations 36
BIBLIOGRAPHICAL NOTES 38
PROBLEMS 40

Chapter 2 OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 44

2.1 Introduction 44
2.2 Solution of Algebraic Equations 45
2.3 An Application of the Newton-Raphson Method 46
2.4 Fibonacci Search 49
2.5 Steep Descent 52
2.6 A Geometric Interpretation 53
2.7 An Application of Steep Descent 55
2.8 The Weighting Matrix 57
2.9 Approximation to Steep Descent 59
APPENDIX 2.1 Optimality of Fibonacci Search 62
APPENDIX 2.2 Linear Programming 65
BIBLIOGRAPHICAL NOTES 68
PROBLEMS 70

Chapter 3 CALCULUS OF VARIATIONS 73

3.1 Introduction 73
3.2 Euler Equation -73
3.3 Brachistochrone 77
3.4 Optimal Linear Control 79
3.5 A Disjoint Policy 91
3.6 Integral Constraints 84
3.7 Maximum Area 85
3.8 An Inverse Problem 86
3.9 The Ritz-Galerkin Method 88
3.10 An Eigenvalue Problem 90
3.11 A Distributed System 91
3.12 Control of a Distributed Plant 93
BIBLIOGRAPHICAL NOTES 96
PROBLEMS 97
CONTENTS xUI

Chapter 4 CONTINUOUS SYSTEMS-I 100

4.1 Introduction 100


4.2 Variational Equations 101
4.3 First Necessary Conditions 105
4.4 Euler Equation 108
4.5 Relation to Classical Mechanics 109
4.6 Some Physical Equations and Useful Transformations 110
4.7 Linear Feedback Control 114
4.8 An Approximate Solution 117
4.9 Control with Continuous Disturbances 121
4.10 Proportional Plus Reset Control 124
4.11 Optimal-yield Problems 125
4.12 Optimal Temperatures for Consecutive Reactions 128
4.13 Optimal Conversion in a Pressure-controlled Reaction 130
BIBLIOGRAPHICAL NOTES 130
PROBLEMS 132

Chapter 5 CONTINUOUS SYSTEMS-II 135

5.1 Introduction 135


5.2 Necessary Conditions 136
5.3 A Bang-bang Control Problem 138
5.4 A Problem of Nonuniqueness 142
5.5 Time-optimal Control of a Stirred-tank Reactor 144
5.6 Nonlinear Time Optimal Control 150
5.7 . Time-optimal Control of Underdamped Systems 152
5.8 A Time-and-fuel Optimal Problem 155
5.9 A Minimum-integral-square-error Criterion and Singular Solutions 159
5.10 Nonlinear Minimum-integral-square-error Control 163
5.11 Optimal Cooling Rate in Batch and Tubular Reactors 165
5.12 Some Concluding Comments 169
BIBLIOGRAPHICAL NOTES 169
PROBLEMS 171

Chapter 6 THE MINIMUM PRINCIPLE 175

6.1 Introduction 175


6.2 Integrating Factors and Green's Functions 176
6.3 First-order Variational Equations 180
6.4 The Minimization Problem and First Variation of the Objective 181
6.5 The Weak Minimum Principle 184
6.6 Equivalent Formulations 187
6.7 An Application with Transversality Conditions 189
xtr CONTENTS

6.8 The Strong Minimum Principle 191


6.9 The Strong Minimum Principle: A Second Derivation 194
6.10 Optimal Temperatures for Consecutive Reactions 197
6.11 Optimality of the Steady State 199
6.12 Optimal Operation of a Catalytic Reformer 202
6.13 The Weierstrass Condition 207
6.14 Necessary Condition for Singular Solutions 207
6.15 Mixed Constraints 209
6.16 State-variable Constraints 211
6.17 Control with Inertia 212
6.18 Discontinuous Multipliers 214
6.19 Bottleneck Problems 217
6.20 Sufficiency 220
APPENDIX 6.1 Continuous Dependence of Solutions 221
BIBLIOGRAPHICAL NOTES 222
PROBLEMS 226

Chapter 7 STAGED SYSTEMS 228

7.1 Introduction 228


7.2 Green's Functions 229
7.3 The First Variation 231
7.4 The Weak Minimum Principle 232
7.5 Lagrange Multipliers 233
7.6 Optimal Temperatures for Consecutive Reactions 234
7.7 The Strong Minimum Principle: A Counterexample 237
7.8 Second-order Variational Equations 238
7.9 Mixed and State-variable Constraints 242
BIBLIOGRAPHICAL NOTES 243
PROBLEMS 245

Chapter 8 OPTIMAL AND FEEDBACK CONTROL 247

8.1 Introduction 247


8.2 Linear Servomechanism Problem 248
8.3 Three-mode Control 250
8.4 Instantaneously Optimal Relay Control 254
8.5 An Inverse Problem 257
8.6 Discrete Linear Regulator 262
APPENDIX 8.1 Liapunov Stability 265
BIBLIOGRAPHICAL NOTES 266
PROBLEMS 269
CONTENTS xv

Chapter 9 NUMERICAL COMPUTATION 271

9.1 Introduction 271


9.2 Newton-Raphson Boundary Iteration 271
9.3 Optimal Temperature Profile by Newton-Raphson Boundary Iteration 274
9.4 Steep-descent Boundary Iteration 278
9.5 Newton-Raphson Function Iteration: A Special Case 283
9.6 Newton-Raphson Function Iteration: General Algorithm 288
9.7 Optimal Pressure Profile by Newton-Raphson Function Iteration 290
9.8 General Comments on Indirect Methods 293
9.9 Steep Descent 295
9.10 Steep Descent: Optimal Pressure Profile 299
9.11 Steep Descent: Optimal Temperature Profile 301
9.12 Steep Descent: Optimal Staged Temperatures 304
9.13 Gradient Projection for Constrained End Points 308
9.14 Min H 311
9.15 Second-order Effects 314
9.16 Second Variation 315
9.17 General Remarks 321
BIBLIOGRAPHICAL NOTES 321
PROBLEMS 325

Chapter 10 NONSERIAL PROCESSES 326

10.1 Introduction 326


10.2 Recycle Processes 327
10.3 Chemical Reaction with Recycle 329
10.4 An Equivalent Formulation 331
10.5 Lagrange Multipliers 332
10.6 The General Analysis 333
10.7 Reaction, Extraction, and Recycle 337
10.8 Periodic Processes 348
10.9 Decomposition 355
BIBLIOGRAPHICAL NOTES 357
PROBLEMS 358

Chapter 11 DISTRIBUTED-PARAMETER SYSTEMS 359

11.1 Introduction 359


11.2 A Diffusion Process 359
11.3 Variational Equations 360
11.4 The Minimum Principle 362
11.5 Linear Heat Conduction 364
11.6 Steep Descent 365
xvi CONTENTS

11.7 Computation for Linear Heat Conduction 366


11.8 Chemical Reaction with Radial Diffusion 371
11.9 Linear Feedforward-Feedback Control 377
11.10 Optimal Feed Distribution in Parametric Pumping 381
11.11 Concluding Remarks 387
BIBLIOGRAPHICAL NOTES 387
PROBLEMS: 389

Chapter 12 DYNAMIC PROGRAMMING AND HAMILTON-JACOBI THEORY 392

12.1 I ntroduction 392


12.2 The Principle of Optimality and Comnutation 393
12.3 Optimal Temperature Sequences 394
12.4 The Hamilton-Jacobi-Bellman Equation 398
12.5 A Solution of the Hamilton-Jacobi-Bellman Equation 400
12.6 The Continuous Hamilton-Jacobi-Bellman Equation 402
12.7 The Linear Regulator Problem 405
BIBLIOGRAPHICAL NOTES 407
PROBLEMS 407

Name Index 411

Subject Index 415


Introduction

OPTIMIZATION AND ENGINEERING PRACTICE


The optimal design and control of systems and industrial processes has
long been of concern to the applied scientist and engineer, and, indeed,
might be taken as a definition of the function and goal of engineering.
The practical attainment of an optimum design is generally a conse-
quence of a combination of mathematical analysis, empirical information,
and the subjective experience of the scientist and engineer. In the chap-
ters to follow we shall examine in detail the principles which underlie.the
formulation and resolution of the practical problems of the analysis and
specification of optimal process units and control systems. Some of these
results lend themselves to immediate application, while others provide
helpful insight into the considerations which must enter into the specifi-
cation and operation of a working system.
The formulation of a process or control system design is a trial-
and-error procedure, in which estimates are made first and then infor-
mation is sought from the system to determine improvements. When a
2 OPTIMIZATION BY VARIATIONAL METHQDS

sufficient mathematical characterization of the system is available, the


effect of changes about a preliminary design may be obtained analyti-
cally, for the perturbation techniques so common in modern applied
mathematics and engineering analysis have their foundation in linear
analysis, despite the nonlinearity of the system being analyzed. Whether
mathematical or experimental or a judicious combination of both, pertur-
bation analysis lies at the heart of modern engineering practice.
Mathematical optimization techniques have as their goal the devel-
opment of rigorous procedures for the attainment of an optimum in a
system which can be characterized mathematically. The mathematical
characterization may be partial or complete, approximate or exact,
empirical or theoretical. Similarly, the resulting optimum may be a
final implementable design or a guide to practical design and a criterion
by which practical designs are to be judged. In either case, the optimi-
zation techniques should serve as an important part of the total effort in
the design of the units, structure, and control of a practical system.
Several approaches can be taken to the development of mathemati-
cal methods of optimization, all of which lead to essentially equivalent
results. We shall adopt here the variational method, for since it is
grounded in the analysis of small perturbations, it is the procedure which
lies closest to the usual engineering experience. The general approach is
one of assuming that a preliminary specification has been made and then
enquiring about the effect of small changes. If the specification is in fact
the optimum, any change must result in poorer performance and the pre-
cise mathematical statement of this fact leads to necessary conditions, or
equations which define the optimum. Similarly, the analysis of the effect
of small perturbations about a nonoptimal specification leads to compu-
tational procedures which produce a better specification. Thus, unlike
most approaches to optimization, the variational method leads rather
simply to both necessary conditions and computational algorithms by
an identical approach and, furthermore, provides a logical framework for
studying optimization in new classes of systems.

BIBLIOGRAPHICAL NOTES
An outstanding treatment of the logic of engineering design may be found in
D. F. Rudd and C. C. Watson: "Strategy of Process Engineering," John Wiley &
Sons, Inc., New York, 1968
Mathematical simulation and the formulation of system models is discussed in
A. E. Rogers and T. W. Connolly: "Analog Computation in Engineering Design,"
McGraw-Hill Book Company, New York, 1960
R. G. E. Franks: "Mathematical Modeling in Chemical Engineering," John Wiley &
Sons, Inc., New York, 1967
INTRODUCTION 3

Perturbation methods for nonlinear systems are treated in such books as


W. F. Ames: "Nonlinear Ordinary Differential Equations in Transport Processes,"
Academic Press, Inc., New York, 1968
"Nonlinear Partial Differential Equations in Engineering," Academic Press,
Inc., New York, 1965
R. E. Bellman: "Perturbation Techniques in Mathematics, Physics, and Engineering,"
Holt, Rinehart and Winston, Inc., New York, 1964
W. J. Cunningham: "Introduction to Nonlinear Analysis," McGraw-Hill Book
Company, New York, 1958
N. Minorsky: "Nonlinear Oscillations," I). Van Nostrand Company, Inc., Princeton,
N.J., 1962
Perhaps the most pertinent perturbation method from a system analysis viewpoint, the
Newton-Raphson method, which we shall consider in detail in Chaps. 2 and 9, is
discussed in
R. E. Bellman and R. E. Kalaba: "Quasilinearization and Nonlinear Boundary
Value Problems," American Elsevier Publishing Company, New York, 1965
1

Optimization with Differential


Calculus

1.1 INTRODUCTION

A large number of interesting optimization problems can be formulated


in such a way that they can be solved by application of differential calcu-
lus, and for this reason alone it is well to begin a book on optimization
with an examination of the usefulness of this familiar tool. We have a
further motivation, however, in that all variational methods may be con-
sidered to be straightforward extensions of the methods of differential
calculus. Thus, this first chapter provides the groundwork and basic
principles for the entire book.

1.2 THE SIMPLEST PROBLEM


The simplest optimization problem which can be treated by calculus is the
following: S(Xl,x2, . . . is a function of the n variables xl, x2, . . . ,
x,,. Find the particular values x,, xz, . . . , x which cause the function
& to take on its minimum value.
4
'IMIZATION WITH DIFFERENTIAL CALCULUS S

We shall solve this problem in several ways. Let us note first that
the minimum has the property that
g(xl,x2j . . . ,xn) - g(xl,x2. . . . ,xn) 0 (1)

Suppose that we let xl = x, + 8x1, where Sxl is a small number in abso-


lute value, while x2 = x2, x3 = z3, . . , X. = L. If we divide Eq. (1)
by 8x1, we obtain, depending upon the algebraic sign of 8x1,
&(xl + 8x1, x2, . . . , xn) - 6(x1,x2, . . . xn) >0 8x1 > 0 (2a)
8x1
or
g\xl + Ext, 22, xn) - g(xl,x2, . . .

ax,
n))<0 Sxl < 0 (2b)

The limit of the left-hand side as Sxi - 0 is simply ag/axi, evaluated at


21, x2, . . . , xn. From the inequality (2a) this partial derivative is
nonnegative, while from (2b) it is nonpositive, and both inequalities are
satisfied only if a&/ax, vanishes. In an identical way we find for all xk,
k=1,2,...,n,
ag
=0 (3)
axk

at the minimizing values ti, x2, . , 2n We thus have n algebraic


equations to solve for the is unknowns, xl, x2, . . . , xn-
It is instructive to examine the problem somewhat more carefully
to search for potential difficulties. We have, for example, made a rather
strong assumption in passing from inequalities (2a) and (2b) to Eq. (3),.
namely, that the partial derivative in Eq. (3) exists at the values x1,
x2; . . . , xn. Consider, for example, the function

g(xt,x2, ,x,,) = Ix1I +x21 + ... + IX-1 (4)

which has. a minimum at xl = x2 = = x,,,= 0. Inequalities (1)


and (2) are satisfied, but the partial derivatives in Eq. (3) are not defined
at xk = 0. If we assume that one-sided derivatives of the. function &
exist everywhere, we must modify condition (3) to -
ag
lim <0 (5a)
x,,-,pk- axk
ag
lim >0 (5b)
Zk-*ik* axk

with Eq. (3) implied if the derivative is continuous.


In problems which describe real situations, we shall often find that
physical or economic interpretations restrict the range of variables we
6 OPTIMIZATION BY VARIATIONAL METHODS

may consider. The function


8 = Y2 (x + 1) 2 (6)
for example, has a minimum at x = -1, where Eq. (3) is satisfied, but if
the variable x is restricted to nonnegative values (an absolute tempera-
ture, or a cost, for example), the minimum occurs at the boundary x = 0.
In writing the inequalities (2a) and (2b) we have assumed that we were
free to make Sxa positive or negative as we wished. But if xa lies at a
lower bound of its allowable region, we cannot decrease xa any more, so
that inequality (2b) is inadmissable, and similarly, if za lies at an upper
bound, inequality (2a) is inadmissable. We conclude, then, that zk either
satisfies Eq. (3) for inequalities (5a) and (5b)] or lies at a lower bound
where a8/axk >- 0, or at an upper bound where a8/axk < 0; that is,
<0 zk at upper bound
a8
=0 xk between bounds [or (5a) and (5b)] (7)
axk > 0
xk at lower bound
Thus, to minimize Eq. (6) subject to 0 < x < ao we find a8/ax - 1 > 0
at z = 0.
Several more restrictions need to be noted. First, if we were seek-
ing to maximize 8, we could equally well minimize -8. But from Eq. (3),
a(-8) =- a8 a8
axk axk axk

which is identical to the condition for a minimum, so that if fk is an


interior point where the derivative is continuous, we have not yet found
a way to distinguish between a maximizing and minimizing point.
Indeed, a point at which Eq. (3) is satisfied need be neither a maxi-
mum nor a minimum; consider 6 = (XI)2 - (x2)2 Furthermore, while
inequalities (1) and (2) are true for all xa, x2, ... , x, from the mean-
ing of a minimum, and hence define a global minimum, when we allow
ax, to become small and obtain Eqs. (3), (5), and (7), we obtain con-
ditions which are true for any local minimum and which may have many
solutions, only one of which leads to the true minimum value of 6.
These last points may be seen by considering the function
6(x) = Y3x' - x + 3 (g).

shown in Fig. 1.1. If we calculate d8/dx = 0, we obtain

dx=x2-1=0 (9)
or
x= ±1 (10)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 7

Fig. 1.1 A function with a local mini-


mum and maximum: E(x) = ?sx= -
x+3's.

and we see by inspection of the figure that & is a minimum at x = +1


and a maximum at x = -1, with 8(1) = 0. But for x < -2 we have
8(x) < 0, so that unless we restrict x to x > -2, the point x = 1 is only
a local minimum. If, for example, we allow -3 < x < oo, then at the
boundary x = -3 we have
d&
dx z--3 = (-3)2 - 1 = 8 > 0 (11)

and so the point x = -3 also satisfies Eq. (7) and is in fact the global
minimum, since
g(-3) = -1s3 < a(+1) = 0 (12)

1.3 A VARIATIONAL DERIVATION


In order to motivate the development in later chapters, as well as to
obtain a means of distinguishing between maxima and minima, we shall
consider an alternative treatment of the problem of the previous section.
For simplicity we restrict attention to the case n = 2, but it will be obvi-
ous that the method is general. We seek, then, the minimum of the
function 3(x1,x2), and for further simplicity we shall assume that the
function has continuous first partial derivatives and that the minimum
occurs at interior values of x1, x2.
If g has bounded second partial derivatives, we may use a two-
dimensional Taylor series to express &(x1 + &x1, x2 + &x2) in terms of
S(x1,x2), where bx3, bx2 are any small changes (variations) in x1, x2 which
are consistant with the problem formulation:
a&i as ax2
6(x1 + bx1, x2 + bx2) - 3(x1,x2) = ax1 6x1 T+ ax2
+ o(max I6xl1,I6x2) (1)

where the partial derivatives are evaluated at x1 = x1, x2 = x2 and the


! OPTIMIZATION BY VARIATIONAL METHODS

notation o(e) refers to a term which goes to zero faster than e; that is,

lim o(E) =0 (2)


r_o E

In Eq. (1),
zaxz
Ex, 6x2
o(max I6x,I,1Sx21) =
2 [ax? (axl)e + 2 ax az

+ 2
(Sxz)' (3)

where the second partial derivatives are evaluated at some point

[Pxi + (1 - P) (xa + Sxi), P12 + (1 - P)42 + Sxz) ] 0<P<1


If za, x2 are the minimizing values then, from Eq. (1) and Eq; (1) of the
preceding section,
ar,
Sx, + ax2 Sx' + o(max ISxiI,f Ox2I) >_ 0 (4)
ax
Now ax, and 5X2 are arbitrary, and Eq. (4) must hold for any vari-
ations we choose. A particular pair

Sxa = -E ax, 6x2 = -E (5)


aX2

where a is a small positive constant and the partial derivatives are again
evaluated at z,, x2. If these partial derivatives both vanish,. inequality
(4) is satisfied trivially; if not, we may write

-E Ka&), + + o(e) > 0 (6)

or, dividing by e and taking the limit as a with Eq. (2),


(,3g)2 + ()2 < 0 (7)

But a sum of squares can be nonpositive if and only if each term in the
sum vanishes identically; thus, if 21'and x2 are the minimizing values,
it is necessary that
a3 = 38 =0 (8)
ax, ax2

We may now obtain a further condition which will allow us to dis-


tinguish between maxima and minima by assuming that C is three times
OPTIMIZATION WITH DIFFERENTIAL CALCULUS E

differentiable and carrying out the Taylor series an additional term:

8(x1 + ax,, 22 + axe) - 8(!1,x2) _ 1 ax, + r 6x2

+ 2 ax (ax,) 2 + 2
ax49

, ax2
z
ax, 5x2 + a
z
22 (0x2) 2

+ o(max I6x,{2,10x212) >- 0 (9)

The terms with arrows through them vanish by virtue of Eq. (8). If we
now set
ax, = Ea, axz = Eat (10)
where e is an infinitesimal positive number and a,, az are arbitrary finite
numbers of any algebraic sign, Eq. (9) may be written, after dividing
by E2,
2 2
1 828 o(E2)
2 V a`a'
ax; az + E2
>0 (11)
s-1;1 ,

and letting c ---' 0, we find that for arbitrary a;, a; it is necessary that
2 2
828
a;a; > 0 (12)
; 1f
ax; ax;
I

The collection of terms


[826 a28
ax, ax, ax, ax2
492&
828
ax, ax2 ax2 ax2

is called the hessian matrix of 8, and a matrix satisfying inequality (12)


for arbitrary a,, az is said to be positive semidefinite. The matrix is said
to be positive definite if equality holds only for a,, a2 both equal to zero.
The positive semidefiniteness of the hessian may be expressed in a
more convenient form by choosing particular values of a, and a2. For
example, if at = 1, a2 = 0, Eq. (12) becomes
a2s
>0 (13)
ax, ax,
and similarly for 828/(ax2 ax2). On the other hand, if a, = - 828/((jx, ax2),
a2 = 192s/(49x, ax,), then Eq. (12) becomes the more familiar result
a2s a2s --( a2s 12 > o
0x,2 0x22 a2: (14)
10 OPTIMIZATION BY VARIATIONAL METHODS

Thus we have proved that, subject to the differentiability assumptions


which we have made, if the minimum of S(x1,x2) occurs at the interior
point z,, x2, it is necessary that the first and second partial derivatives of &
satisfy conditions (8), (13), and (14). ' It is inequality (13) which dis-
tinguishes a minimum from a maximum and (14) which distinguishes
both from a saddle, at which neither a minimum nor maximum occurs
despite the vanishing of the first partial derivatives.
Let us note finally that if Eq. (8) is satisfied and the hessian is
positive definite (not semidefinite) at x1, 22, then for sufficiently small
I6x1J, 16x2I the inequality in Eq. (9) will always be satisfied. Thus, the
vanishing of the first partial derivatives and the positive definiteness of
the hessian are sufficient to ensure a (local) minimum at x1, x2.

1.4 AN OPTIMAL-CONTROL PROBLEM: FORMULATION


The conditions derived in the previous two sections may be applied to
an elementary problem in optimal control. Let us suppose that we have
a dynamic system which is described by the differential equation

dy = F(y,u) (1)

where u is a control variable and y is the state of the system (y might be


the temperature in a mixing tank, for example, and u the flow rate of
coolant in a heat exchanger). The system is designed to operate at the
steady-state value y,,, and the steady-state control setting is,, is found
by solving the algebraic equation
0 (2)

If the system is disturbed slightly from the steady state at time


t = 0, we wish to guide it back in an optimal fashion. There are many
criteria which we might use to define the optimal fashion, but let us sup-
pose that it is imperative to keep the system "close" to y,,. If we deter-
mine the deviation from steady state y(t) - y at each time, then a
quantity which will be useful for defining both positive and negative
deviations is the square, (y - y,,)2, and a reasonable measure of the total
foe

deviation will be (y - y,.)2 dt, where 0 is the total control time. On


the other hand, let us suppose that we wish to hold back the control
effort, which we may measure by foe (u - u..)' dt. If p2 represents the
relative values of control effort and deviation from steady state, then our
control problem is as follows.
Given an initial deviation yo and a system defined by Eq. (1), find
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 11

the control settings u(t) which minimize

!o [(y - y..)2 + a2(u - u..)2] dt (3)

For purposes of implementation it is usually helpful to have the control


function u not in terms oft but of the state y. This is called feedback
control.
In later chapters we shall develop the necessary mathematics for
the solution of the control problem just stated. At this point we must
make a further, but quite common, simplification. We assume that the
deviations in y and u are sufficiently small to permit expansion of the
function F(y,u) in a Taylor series about y,,, u retaining only linear terms..
Thus we write
IT
F(y..,u..) + (y - y..) + Bu v-v.. (u - u..) (4)
usu.. u-u..
Letting

x -y - y w= -(u-u..)
and noting that i = y and 0, we have, finally,
Ax + w x(0) = xo (5)
min & = 2 to (x2 + c2w2) dt (6)

It is sometimes convenient to consider control policies which are


constant for some interval; that is,
w(t) = w = const (n - 1)A < t < n0 (7)

Equation (5) then has the solution


eAA)
x(nA) = x[(n - 1),&]eA° - (8)
A
and Eq. (6) may be approximated by
N
min 8 = 7220 [x2(nA) + c2wn2] (9)

Letting

x = x(nQ)
w ua = A
eAA)

(10)
c2A 2
C2 (1 _. eA.A)2
a=eAo
12 OPTIMIZATION BY VARIATIONAL METHODS

we obtain the equivalent control problem


x= u xo given (11)
N
min S = 2
(x,,2
+
C2un2)
(12)
n-1

where, since 0 is a constant, we have not included it in the summation


for 6.

1.5 OPTIMAL PROPORTIONAL CONTROL


The most elementary type of control action possible for the system
defined by Eqs. (5) and (6) of the preceding section is proportional con-
trol, in which the amount of control effort is proportional to the devi-
ation, a large deviation leading to large control, a small deviation to little.
In that case we set
w(t) = Mx(t) (1)
z = (A + M)x x(O) = xo (2)
and we seek the optimal control setting M:
min 6(M) = 12(1 + c2M2) fo x2 dt (3)

The solution to Eq. (2) is


x(t) = xoe(A+M)t (4)

and Eq. (3) becomes

min 6(M) = x02(1 + c2M2) foe e2(A+.N)t dt (5)


or
x02 (1 + MMC2) (e2(A+M)e _ 1)
min 6(M) = (6)
4 A+M
Since x cannot be allowed to grow significantly larger because of the con=
trol action, it is clear that we require for stability that
A+M<0 (7)

This is the concept of negative feedback. The optimal control setting is


then seen from Eq. (6) to depend on A, c, and 0 but not on xo, which is
simply a multiplicative constant. Since we shall usually be concerned
with controlling for long times compared to the system time constant
IA + MI-1, we may let 0 approach infinity and consider only the problem

min 6(M)
IA+ M 2
(8)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 13

We obtain the minimum by setting the derivative equal to zero


dg _ 2Mc2 1 + M2c2
A+M+(A+M)2=0
(9)
dM
or
1 +A 2C2
M = -A ± c2
(10)

Condition (7) requires the negative sign, but it will also follow from con-
sidering the second derivative.
For a minimum we require d2g/dM2 > 0. This reduces to
1 + A2c2
-2 (A+M)$-
.
>0 (11)

which yields a/minimum for the stable case A + M < 0, or

w=- CA + 1 cA2c2) x (12)

1.6 DISCRETE OPTIMAL PROPORTIONAL CONTROL

We shall now consider the same problem for the discrete analog described
by Eqs. (11) and (12) of Sec. 1.4,
xn = ax.-1 + u. (1)
N
min F. = I (xn2 + C2un2) (2)
n-1
We again seek the best control setting which is proportional to the state
at the beginning of the control interval
u = mxn-1 (3)
so that Eq. (1) becomes
xn = (a + (4)
which has a solution
xn = xo(a + m)n (5)
It is clear then that a requirement for stability is
ja+ml < 1 (6)
For simplicity in the later mathematics it is convenient to substi-
tute Eq. (4) into (3) and write
un =
a
+ m xn = Mxn (7)
14 OPTIMIZATION BY VARIATIONAL METHODS

and

x = 1 a M xn-1 (1 a M )nxo
= (8)

/ of M which will minimize


From Eq. (2) we then seek the value
N
a )
3(M)
nil 1 - M1
(9)
x02(1 + M2C2)

or, using the equation for the sum of a geometric series,

&(M) 2 xo2 (1
2!1 Ma
2C2 2
1- 1 a
M)21]
(10)
M)2 -
As in the previous section, we shall assume that the operation is suf-
ficiently long to allow N --p -, which reduces the problem to
1 1 + M2C2
min &(M) = (11)
2(1-M)2-a2
Setting d3/dM to zero leads to the equation

M2-I-(a2-1-} 12)M-C2=0 (12)

or

M 2C2 [C2(1 - a2) - 1 ± {[C2(1 - a2) - 1]2


+ 4C21] (13)

The positivity of the second derivative implies

M< C2(1 2C2


- a2) - 1 (14)

so that the negative sign should be used in Eq. (13) and M is negative.
The stability condition, Eq. (6), is equivalent to
1 - M > dal (15)

From Eq. (14),

1-M>
2
+1a2>-y2(1+a2)> al (16)
2C
where the last inequality follows from
a2 - 21a1 + 1'= (Ial - 1)2 > 0 (17)

Thus, as in the previous section, the minimizing control is stable.


OPTIMIZATION WITH DIFFERENTIAL CALCULUS 1s

In Sec. 1.4 we obtained the discrete equations by setting


-wnI 6AA.
a=eAA u A
(18)
c2A2
C2 = - eAA
If we let A become infinitesimal, we may write

a --- 1 + A4 u- C2
Q2
C2
(19)

Equation (12) then becomes


2
(20)

M o (21)

1 CA2c2 ) xn (22)

which is identical to Eq. (12) of Sec. 1.5.

1.7 DISCRETE OPTIMAL CONTROL


We now consider the same discrete system
xA = u (1)
but we shall not assume that the form of the controller is specified.
Thus, rather than seeking a single proportional control setting we wish
to find the sequence of settings u1, u2, . . . , ui that will minimize

(2)
n-1
If we know all the values of x,,, n = 1, 2, . . . , N, we can calculate u,;
from Eq. (1), and so we may equivalently seek

min 8(xl,x2, . . . ,x1V) + C2(x - axw-1)2) (3)


nil Ex~2
We shall carry out the minimization by setting the partial derivatives of
C with respect to the x equal to zero.
1$ OPTIMIZATION BY VARIATIONAL METHODS

Differentiating first with respect to xN, we obtain


ac
= xN ± C'(xN - axN-I) = xN + C 'uN = U (4)
axN
or
1
UN = - C,= xN (5)

Next,
a&
xN-1 + C2(XN-1 - axN-Y) - aC2(XN axN-1)
axN-1
= xN-1 + C'u,v-1 - aC2 UN = 0 (6)

and combining Eqs. (1), (5), and (6),

= -TN-1 + C2UN-1 + 1 + C2 XN_1 = 0 (7)


8xN_I
or
1 1 + C2 + a2C2
UN-1 XN_I
'V-1
C' 1 + C2
In general,
X. + C'(x. - ax,.-1) - aC'(x.+1 - ax.) 0
n 1, 2, .. . , N-1 (9)

and the procedure of Eqs. (4) to (8) may be applied ip order from n
to n = 1 to find the optimal control,settings ul, u2, . . . , UN.
We note from Eq. (9) that the optimal control will always be a
proportional controller of the form
-1aM1
xa-i (10)

with, from Eq. (5),


1
MN= -
If we substitute into Eq. (9), we obtain

x.+C'M.x. -a'C'1 M"M*M,+1 x. =0 (12)

or, if x.00,
M.M.+1+1 C2 M.+1-M.-I = 0 (13)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 17

with Eq. (11) equivalent to the condition


MN+1 = 0 (14)

Equation (13) is a Riccati difference equation, and the unique solu-


tion can be obtained by applying Eq. (13) recursively from n = N to
n = 1. An analytical solution can be found by the standard technique
of introducing a new variable
1

M. - (1/2C2)[1 + a2C2 - C2 ± _01 + a2C2 - C2)2 + 4C2)


where either the positive or negative sign may be used, and solving the
resulting linear difference equation (see Appendix 1.1 at the end of the
chapter). The solution satisfying Eq. (14) may be put in the form
[1 + C2 + a2C2 - (1 + a2C2 - C2)2 + 4C2] -i
M = 1 - 2a2C2
n [1 + C2 + a2C2 - 'V(1 + a2C2 - C2)2 + 4C 2]n
+ a[l + C2 + a2C2 + (1 + a2C2 - C2)2 + 4C21n-'
+ all + C2 + a2C2 + \/(I + a2C2 - C2)2 + 4C2]n
where

a x C 1 + C2 + a2C2 -V/(1 + a2C2 - C2)2+4 C2 1N


L 1 + C2 + a2C2 + /(1 + a2C2 - C2)2 + 4C2
1 + C2 - a2C2 - (1 + a2C2 - C2)2 + 4C2
1 + C2- aC2 + V (1+ a2C2 - C2)2 +4C2
If we take the control time to be infinite by letting N --> oo, we
shall obtain lim xN -' 0, so that Eq. (4) will be automatically satisfied
without the need of the boundary condition (11). We find that
lim a = 0
N_.
and Mn becomes a constant.

+4
(a2+C2/z

Mn= a2-1+C2+ V (17)

which is the optimal proportional control found,in/Sec. 1.6.


The result of this section, that the optimal control for the linear sys-
tem described by Eq. (1) and quadratic objective criterion by Eq. (2) is ,a
proportional feedback control with the control setting the solution of a Riccati
equation and a constant for an infinite control time, is one of the impor-
tant results of modern control theory, and it generalizes to larger sys-
tems. We shall return to this and related control problems in Sec. 1.10
and in later chapters.
111 OPTIMIZATION BY VARIATIONAL METHODS

1.8 LAGRANGE MULTIPLIERS


In the previous section we were able to substitute the equation describ-
ing the process, Eq. (1), directly into the objective function, Eq. (2), and
thus put the problem in the framework of Sec. 1.1, where we needed only
to set partial derivatives of the objective to zero. It will not always be
possible or desirable to solve the system equations for certain of the varia-
bles and then substitute, and an alternative method is needed. One such
method is the introduction of Lagrange multipliers.
For notational convenience we shall again restrict our attention to
a system with two variables, x, and x2. We seek to minimize 8(xl,x2),
but we assume that xl and x2 are related by an equation

g(xl,x2) = 0 (1)

It is convenient to consider the problem geometrically. We can plot the


curve described by Eq. (1) in the x1x2 plane, and for every constant we
can plot the curve defined by

g(x,,x2) = const (2)

A typical result is shown in Fig. 1.2.


The solution is clearly the point at which the line g = 0 intersects
the curve of constant 8 with the least value, and if both c%rves are con-
tinuously differentiable at the point, then they will be tangent to each
other. Thus, they will possess a common tangent and a common normal.
Since the direction cosines for the normal to the curve g = 0 are propor-
tional to ag/ax; and those of g = const to a6/ax; and for a common nor-
mal the direction cosines must coincide, it follows then that at the opti-
mum it is necessary that 8g/ax; be proportional to 8g/8x;; that is, a

Fig. 1.2 Contours and a constraint


K, curve.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS U

necessary condition for optimality is

(3a)
ax + ax
o (3b)
T1 +Xax
We shall now obtain this result somewhat more carefully and more
generally. As before, we suppose that we somehow know the optimal
values x,, 22, and we expand £ in a Taylor series to obtain

3(91 + 6x1, xs + bx2) - &(x,,x2) = axl


ax + 8x2 axe
+ o(max Iaxil,Iax2I) ? 0 (4)

Now, however, we cannot choose axe and axe in any way that we wish,
for having chosen one we see that the constraint, Eq. (1), fixes the other.
We can, therefore, freely choose one variation, ax, or ax2, but not both.
Despite any variations in x, and x2, we must satisfy the constraint.
Thus, using a Taylor series expansion,

9(x1 + ax,, x: + axe) - 9(x,,xs) = ax ax, + a y axe


+ o(max IaxiI,lax2l) = 0 (5)
If we multiply Eq. (5) by an arbitrary constant X (the Lagrange multi-
plier) and add the result to Eq. (4), we obtain

(+X)Oxt+(+X)Oxt
a& cig a&

+ o(max I3x,l,I6x21) >_ 0 (6)


We ate now free to choose any X we please, and it is convenient to define
X so that the variation we are not free to specify vanishes. Thus, we
choose X such that at x,, 22

ax, + Xa x = 0 (7)

which is Eq. (3b). We now have

(az + a a g) axl + o (max l axl 1, l ax:I) ? 0 (8)

and by choosing the special variation

ax,= -e -+Xc3x) (9)


20 OPTIMIZATION BY VARIATIONAL METHODS

it follows, as in Sec. 1.3, that at the minimum of C it is necessary that

ax +X a1g=0 (10)

which is Eq. (3a). Equations (1), (7), and (10) provide three equations
for determining the three unknowns x1, f2, and X. We note that by includ-
ing the constraint in this manner we have introduced an additional varia-
ble, the multiplier X, and an additional equation.
It is convenient to introduce the lagrangian 2, defined
£(x1,x2,X) = 3(xl,x2) + X9(x1,x2) (11)
Equations (7) and (10) may then be written as
a.e aye
8x1 = axe
=0

while Eq. (1) is simply


aye=0
(13)
TX

Thus, we reformulate the necessary condition in the Lagrange multiplier


rule, as follows:
The function & takes on its minimum subject to the constraint equation
(1) at a stationary point of the lagrangian C.
(A stationary point of a function is one at which all first partial deiava-
tives vanish.) For the general case of a function of n variables
S(x1,x2, . . . and m constraints
g;(x1,x2, . . . 0 i = 1, 2, . . . , m < n (14)

the lagrangiant is written


2(21,x2, . . . ,x,,,X1iX2, . . . ,X.,) = 6(x1,x2, . . . ,x,)
+ X g,(x1,x2, . . . ,x,,) (15)
:-1

t We have not treated the most general situation, in which the lagrangian must be
written

Act + A,&
.1
So-called irregular cases do exist in which No - 0, but for all regular situations No may
be taken as unity without loss of generality.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 21

and the necessary condition for a minimum is

ax~0 i=1,2,...,n (16a)


ac
a,=0 i=1,2,...,m (16b)

It is tempting to try to improve upon the multiplier rule by retain-


ing second-order terms in the Taylor series expansions of & and g and
attempting to show that the minimum of 3 occurs at the minimum of £.
This is often the case, as when g is linear and 8 is convex (the hessian of g
is positive definite), but it is not true in generat, for the independent
variation cannot be so easily isolated in the second-order terms of the
expansion for 2. It is easily verified, for example, that the function
3 = x1(1 + x2) (17)
has both a local minimum and a local maximum at the constraint
x1 + (x2)2 = 0 (18)
while the stationary points of the lagrangian
.C = x1(1 + x:) + Xxi + X42), (19)
are neither maxima nor minima but only saddle points.
The methods of this section can be extended to include inequality
constraints of the form
g(x1,x2) >_ 0 (20)

but that would take us into the area of nonlinear programming and away
from our goal of developing variational methods of optimization, and we
must simply refer the reader at this point to the specialized texts.

1.9 A GEOMETRICAL EXAMPLE


As a first example in the use of the Lagrange multiplier let us consider
a problem in geometry which we shall find useful in a later discussion of
computational techniques. We consider the linear function
g = ax1 + bx2 (1)
and the quadratic constraint
g,= a(xl)e + 2$x1xi + y(x2)s - 0= = 0 (2)
where
a>0 ay-$2>0 (3)
22 OPTIMIZATION BY VARIATIONAL METHODS

Fig. 1.3 A linear objective with elliptic


constraint.

The curve g = 0 forms an ellipse, while for each value of 3 Eq. (1) defines
a straight line. We seek to minimize & subject to constraint (2); i.e., as
shown in Fig. 1.3, we seek the intersection of the straight line and ellipse
leading to the minimum intercept on the x2 axis.
As indicated by the multiplier rule, we form the lagrangian
E = ax, + bx: + Xa(x1)' + 2a$x1x: + Xy(x2)2 - XQ2 (4)
We then find the stationary points
at =
J7 a + 21%axi + 27$x2 = 0 (5a)
21 .

az
ax,
= b + 2\8x1 + 2a7xs - 0 (5b)

while 8.c/ax = 0 simply yields Eq. (2). Equations (5) are easily ^olved
for x1 and x2 as follows:
ay - bf
x1 = - 2X(ay-0') (6a)
ba -a#
X2= - 2X(ay - $2) (6b)

X can then be obtained by substitution into Eq. (2), and the final result is
ay - b$
X1 = f (7a)
(ory - $')(ya2 + ab2 - 2ab$B)
ba - a{3 (7b)
zs =
(a'y - $') (ya2 + ab2 - 2ab$)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 23

The ambiguity (±) sign in Eqs. (7) results from taking a square root, the
negative sign corresponding to the minimum intercept (point A in Fig.
1.3) and the positive sign to the maximum (point B).

1.10 DISCRETE PROPORTIONAL CONTROL WITH LAGRANGE MULTIPLIERS


We shall now return to the problem of Sec. 1.7, where we wish to
minimize
N
S (xn2 + C2un2) (1)
2
n-1
and we write the system equation
x - axn-1 - un - 9n(xn,xn-1,un) = 0 (2)

Since we wish to find the minimizing 2N variables x1, x2, . . . , xN, U1,
U2, . . . , UN subject to the N constraints (2), we form the lagrangian
N N

n-1n-1(xn2 + C2un2) + I Xn(xn - axn-1 - un)

Taking partial derivatives of the lagrangian with respect to x1,


(3)

x2, xN, u1, U2i . . . , UN, we find


cle
ccun xC2un
-A =0 n=1 , 2 ,..., N (4a)

a-
axN
= xN + aN = 0 (4b)

aL
axn
=xn+Xn-aXn+1=0 n=1,2, ...,N-1 (4c)

Equation (4b) may be included with (4c) by defining

XN+1 = 0 (5)

We thus have simultaneous difference equations to solve

xn - axn-1 - C2 = 0 xo given (6a)


xn+Xn-axn+1=0 XN+1=0 (6b)

with the optimal controls un then obtained from Eq. (4a).


Equations (6a) and (6b) represent a structure which we shall see
repeated constantly in optimization problems. Equation (6a) is a differ-
ence equation for xn, and (6b) is a difference equation for X,,; but they are,
coupled. Furthermore, the boundary condition xo for Eq. (6a) is given
for n = 0, while for Eq. (6b) the condition is given at n = N + 1. Thus,
24 OPTIMIZATION BY VARIATIONAL METHODS

our problem requires us to solve coupled difference equations with split


boundary conditions.
For this simple problem we can obtain a solution quite easily. If,
for example, we assume a value for Xo, Eqs. (6) can be solved simultane-
The calculated value for XN+l will proba-
ously for X1, xi, then X2, x2, etc.
bly differ from zero, and we shall then have to vary the assumed value Xo
until agreement is reached. Alternatively, we might assume a value for
xN and work backward, seeking a value of xN which yields the required x0.
Note that in this latter case we satisfy all the' conditions for optimality
for whatever xo results, and as we vary xN, we produce a whole family of
optimal solutions for a variety of initial values.
We have already noted, however, that the particular problem we
are considering has a closed-form solution, and our past experience, as
well as the structure of Eq. (6), suggests that we seek a solution of the
form
x. = C2Mnxn (7a)4
or
U. = M.X. (7b)

We then obtain
xn+l - axn - Mn+lxn+l = 0 (8a)
x. + C2Mnxn - aC2Mn+lxn+1 = 0 (8b)

and eliminating xn and x.+l, we again get the Riccati equation

M.M.+l+1 Ca2CM.+1-M.-C2=0 (9)

with the boundary condition


MN+1 = 0 (10)

for a finite control period. Note that as N - w, we cannot satisfy Eq.


(10), since the solution of the Riccati equation becomes a constant, but
in that case XN - 0 and the boundary condition AN+1 = 0 is still satisfied.

1.11 OPTIMAL DESIGN OF MULTISTAGE SYSTEMS


A large number of processes may be modeled, either as a true represen-
tation of the physical situation or simply for mathematical convenience,
by the sequential black-box structure shown in Fig. 1.4. At each stage
the system undergoes some transformation, brought about in part by the
decision u. to be made at that stage. The transformation is described
by a set of algebraic equations at each stage, with the number of equa-
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 26

Fig. 1.4 Schematic of a multistage system.

tions equal to the minimum number of variables needed in addition to


the decision variable to describe the state of the process.
For example, let us suppose that two variables, z and y, describe
the state. The variable x might represent an amount of some material,
in which case one of the equations at each stage would be an expression
of the conservation of mass. Should y be a temperature, say, the other
equation would be an expression of the law of conservation of energy.
We shall use-the subscript n to denote the value of x and y in the process
stream leaving stage n. Thus, our system is described by the 2N alge-
braic equations
4'a(xn,xn-17yn,yn-17un) = 0 n = 1, 2, . . . , N (la)
>rn(xn,xn-1,ynlyn-17un) = 0 n = 1, 2, . . . , N (1b)

where presumably xo and yo are given. We shall suppose that the state
of the stream leaving the last stage is of interest, and we wish to choose
u1, u2, ... , uN in order to minimize some function S(xN,yN) of that
stream.
This problem can be formulated with the use of 2N Lagrange multi-
pliers, which we shall denote by Xi, X2, . . . , AN, A1, A2, . . . , AN. The
lagrangian is then
N
' - &(xN,yN) +
n-1
N
+ I An#n(xn,xn-l,yn,ya-l,un) (2)
n-l
and the optimum is found by setting the partial derivatives of £ with
respect to xn, yn, un, Xn, and An to zero, n = 1, 2, . . . , N. At stage N
we have

a +XNxN+AN-=o a
aN (3a)

+ XX ON + AN =0 (3b)
yN a yN ayN
XN
auN
+ AN =0 (3c)
aUN
28 OPTIMIZATION BY VARIATIONAL METHODS

or, eliminating XN and SAN from (3c) with (3a) and (3b),
as a#N a*N 4 N aON C18 aON 491PN 49ikN 4N =0 (4)
ayN {auN azN auN azN axN auN ayN UN ayN>
For all other n, n = 1, 2, . . . , N - 1, we obtain
an aon+1 + A. + 1 An+l a_n+l -0
n axn + Xn+1
aX n
a0ft
axn axn
(5a)

an
w ay. + Xn+1
aW
ayn
+ A. aOn
ay. + An+1 04'n+1
ayn
-o (5b)

Xn
ayn -1- An au" = 0 (5c)

Again we find that we have a set of coupled difference equations,


Eqs. (la), (lb) and (5a), (5b), with initial conditions xo, yo, and final con-
ditions AN, AN given by Eqs. (3a) and (3b). These must be solved simul-
taneously with Eqs. (3c) and (5c). Any solution method which assumes
values of Ao, Ao or xN, yN and then varies these values until agreement
with the two neglected boundary conditions is obtained now requires a
search in two dimensions. Thus, if v values of xN and of yN must be con-
sidered, the four difference equations must be solved v' times. For this
particular problem the dimensionality of this search may be reduced, as
shown in the following paragraph, but in general we shall be faced with
the difficulty of developing efficient methods of solving such boundary-
value problems, and a large part of our effort in later chapters will be
devoted to this question.
Rewriting Eq. (5c) for stage n + 1,

au+l
n
+ °n+1 a,"unn+l = o
Xn+1 (5d)

we see that Eqs. (5a) to (5d) make up four linear and homogeneous
equations for the four variables X., X.+1, An, An+1, and by a well-known
result of linear algebra we shall have a nontrivial solution if and only if
the determinant of coefficients in Eqs. (5a) to (5d) vanishes; that is,
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 27

or

a# a4.. aor a#,+1 a-0.+1 a-0.+1 a41-+1


ax* au ~ axn LA
au ayw sun+1 ay,,
a4* a4* a,kn+1
_0 (7)
c10-+1'94"+1
ay au ay,. au.) k ax. ax. au.+1
[Equation (7) has been called a generalized Euler equation.]
Equations (la), (1b), (4), and (7) provide 3N design equations for
the optimal x1, yn, and decision variables. One particular method of
solution, which requires little effort, is to assume u1 and obtain x1 and y1
from Eqs. (1a) and (1b). Equations (1a), (ib), and (7) may then be
solved simultaneously for x2, y2, u2 and sequentially in this manner until
obtaining x,v, yNv, and uNv. A check is then made to see whether Eq.
(4) is satisfied, and if not, a new value of u1 is chosen and the process
repeated. A plot of the error in Eq. (4) versus choice of u1 will allow
rapid convergence.

1.12 OPTIMAL TEMPERATURES FOR CONSECUTIVE REACTIONS


As a specific example of the general development of the preceding sec-
tion, and in order to illustrate the type of simplification which can often
be expected in practice, we shall consider the following problem.
A chemical reaction
X -+ Y --> products
is to be carried out in a series of well-stirred vessels which operate in the
steady state. The concentrations of materials X and Y will be denoted
by lowercase letters, so that x, and y., represent the concentrations in
the flow leaving the nth reaction vessel, and because the tank is well
stirred, the concentration at all points in the vessel is identical to the
concentration in the exit stream.
If the volumetric flow rate is q and the volume of the nth vessel V,,,
then the rate at which material X enters the vessel is gxA_l, the rate at
which it leaves is qx,, and the rate at which it decomposes is
where r1 is an experimentally or theoretically determined rate of reaction
and u is the temperature. The expression of conservation of mass is then
qxn + (1)

In a similar way, for y*,


qy,,_1 = qyn + V%r2(x,,y.) (2)

where r2 is the net rate of decomposition of y and includes the rate of


formation, which is proportional to the rate of decomposition of x,,. In
28 OPTIMIZATION BY VARIATIONAL METHODS

particular, we may write


ki(u.)F(x,,) (3a)
r2(x,,,y,,,u,,) = (3b)
where k, and k2 will generally have the Arrhenius form

k;o exp (-!) ` (4)

Defining the residence time 9 by

9= (5)
4n

we may write the transformation equations corresponding to Eqs. (1a)


and (1b) of Sec. 1.11 as
On = x-B (6a)
,yn = y+ 9 ks(un)G(y,.) (6b)

We shall assume that Y is the desired product, while X is a valu-


able raw material, whose relative value measured in units of Y is p. ' The
value of the product stream is then pxN + yN, which we wish to maximize
by choice of the temperatures u in each stage. Since our formulation
is in terms of minima, we wish to minimize the negative of the values, or
& (XN,YN) = - PXN - yN (7)

If Eqs. (6a) and (6b) are substituted into the generalized Euler
equation of the preceding section and k, and k2 are assumed to be of
Arrhenius form [Eq. (4)], then after some slight grouping of terms we
obtain
1+ v9 ks(u,.)G'(y,.)
1 + 9«ks(u,)G'(y,) E'ki(u»)F(x*) + 1 +
Ezks(u.+,)G(y+.+,)
(8)
ER i

where the prime on a function denotes differentiation. Equation (8)


can be solved for in terms of y,.+,, xp, y,,, and u.:

ex
1)^ Jkio [ks(u,.) 1 +
kso ki(u,1) 1 +
El ks(uw)G'(yw)F(x*+,) llcsi-a
19
+ E2'11 + O ks(u+.)G'(y,)]G(y,.+,)]
[B(xn,x 1,y (g)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 21

and upon substitution Eqs. (6a) and (6b) become


xn-1 - xn - gnk10[B(xn-I)xn,yn-l,yn,un-1)]S,'1($,'-8,')F(xn)
=0
(10a)
yn-1 - yn + v(xn-1 - xn)
- Bnk30[B(xn-I,xn,yn-I,yn,un-1)]$"/(&; -B,')G'(yn) = 0 (10b)
The boundary condition, Eq. (4) of Sec. 1.11, becomes

k1(uN)F(xN)(p - ') + E, k2(ulv)G(yiv)[1 + BNk1(uN)F'(ZN)]


1

+ p8Nk1(uN)ka(uN)F(xN)G'(yN) = 0 (11)
The computational procedure to obtain an optimal design is then
to assume u1 and solve Eqs. (6a) and (6b) for x1 and yl, after which
Eqs. (10a) and (10b) may be solved successively for n = 2, 3, . .. , N.
Equation (11) is then checked at n = N and the process repeated for a
new value of u1. Thus only 2N equations need be solved for the opti-
mal temperatures and concentrations.
This is perhaps an opportune point at which to interject a note of
caution. We have assumed in the derivation that any value of U. which
is calculated is available to us. In practice temperatures will be bounded,
and the procedure outlined above may lead to unfeasible design specifi-
cations. We shall have to put such considerations off until later chap-
ters, but it suffices to note here that some of the simplicity of the above
approach is lost.

1.13 ONE-DIMENSIONAL PROCESSES


Simplified models of several industrial processes may be described by a
single equation at each stage
xn - ,fn(xn_1,un) = 0 n= 1, 2, . . . , N (1)
with the total return for the process the sum of individual stage returns

8= 1 6tn(xn-1,u,,) (2)
ft-1
The choice of optimal conditions u1, u2, . . . , uN for such a process is
easily determined using the Lagrange multiplier rule, but we shall obtain
the design equations as a special case of the development of Sec. 1.11.
We define a new variable yn by
yn - yn-1 - 61n(xn_1)un) = 0 yo = 0 (3)
It follows then that
8= YN (4)
10 OPTIMIZATION BY VARIATIONAL METHODS

and substitution of Eqs. (1) and (3) into the generalized Euler equation
[Eq. (7), Sec. 1.111 yields
air*-1au,.1 a6i,.
of*_1/sun-1 + ax.-1- 0f/au,1
n= 1,2, . . . ,N (5)
with the boundary condition
acRrv
- 0 (6)
atN
The difference equations (1) and (5) are then solved by varying ul until
Eq. (6) is satisfied.
Fan and Wangt have collected a number of examples of processes
which can be modeled by Eqs. (1) and (2), including optimal distribution
of solvent in cross-current extraction, temperatures and holding times in
continuous-flow stirred-tank reactors with a single reaction, hot-air allo-
cation in a moving-bed grain dryer, heat-exchange and refrigeration sys-
tems, and multistage gas compressors.
We leave it as a problem to show that Eq. (5) can also be used
when a fixed value xN is required.

1.14 AN INVERSE PROOLEr


It is sometimes found in a multistage system that the optimal decision
policy is identical in each stage. Because of the simplicity of such a
policy it would be desirable to establish the class of systems for which
such a policy will be optimal.' A problem of this type, in which the
policy rather than the system is given, is called an inverse problem. Such
problems are generally difficult, and in this case we shall restrict our
attention to the one-dimensional processes described in the previous sec-
tion and, in fact, to-those systems in which the state enters linearly and
all units are the same.
We shall consider, then, processes for which Eqs. (1) and (2) of
Sec. 1.13 reduce to
x = ff(x,._,,u,.) = A(u.)xR-I + B(un) (1)
w(xn_l,uw) Ci(u,.)xn-1 + D(un) (2)
Since we are assuming that all decisions are identical, we shall simply
write u in place of u,,. The generalized Euler equation then becomes
C'x,.2 + D' D')
A'xn-2 + B' + C - A'xn-1 + B' = 0 (3)

where the prime denotes differentiation with respect toAfter u. substi-


t See the bibliographical notes at the end of the chapter.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS >Rt

tution of Eq. (1) and some simplification this becomes


(AA'C' + AA'2C - AMA'C')x.-22

+ (A'BC' +'B'C' + A'2BC + AA'B'C


+ A'B'C - AA'BC' - A2B'C')x._2
+ (A'D'B + B'D' + A'BB'C + B'2 - ABB'C' - AB'D') = 0 (4)

Our problem, then, is to find the functions A(u), B(u), C(u), and
D(u) such that Eq. (4) holds as an identity, and so we must require that
the coefficient of each power of x._2 vanish, which leads to three coupled
differential equations
AABC' + A'C - AC') = 0 (5a)
A'BC' + B'C' + A'2BC + AA'B'C + A'B'C
- AA'BC' - A2B'C' - 0 (5b)
A'D'B + B'D' + A'BB'C + B'2 - ABB'C' - AB'D' = Or (Sc)
If we assume that A (u) is not a constant, Eq. (5a) has the solution
C(u) = a(A - 1) (6)

where a is a constant of integration. Equation (5b) is then satisfied


identically, while Eq. (5c) may be solved by the substitution
D(u) = aB(u) + M(u) (7)
to give
M'(A'B + B' - AB') = 0 (8)
or, if M is not a constant,
B(u) = B(A - 1) (9)
where {4 is a constant of integration. 11M(u) and A(u) are thus arbi-
trary, but not constant, functions of u, and the system must satisfy the
equations
x. = A(u.)z.-1 + 11 (10)
61(x.-i,u,.) = a[A(u.) - 11x._1 + aB(u.) + M(u.)
= a(x. - xn-1) ± (11)
Thus, the most general linear one-dimensional system whose opti-
mal policy is identical at each stage is described by Eq. (10), with the
objective the minimization of
N N
S _ I n = a(XN - x0) + I (12)
n-1 n-1
N
This objective function includes the special cases of minimizing I M(u.)
R-1
a OPTIMIZATION BY VARIATIONAL METHODS

for fixed conversion xN - xo, in which case a is a Lagrange multiplier, t


and maximizing conversion for fixed total resources, in which case
N
M(un) u, - U) (13)
nil
with X a Lagrange multiplier and U the total available resource. Multi-
stage isentropic compression of a gas, the choice of reactor volumes for
isothermal first-order reaction, and multistage cross-current extraction
with a linear phase-equilibrium relationship are among the processes
which are described by Eqs. (10) and (11).

1.15 MEANING OF THE LAGRANGE MULTIPLIERS


The Lagrange multiplier was introduced in Sec. 1.8 in a rather artificial
manner, but it has a meaningful interpretation in many situations, two
of which we shall examine in this section. For the first, let us restrict
attention to the problem of minimizing a function of two variables,
g(xi,x2), subject to a constraint which we shall write
q(xl,xs) - b = 0 (1)
where b is a constant. The lagrangian is then
2= Xg(xl,x2) - Xb (2)
and the necessary conditions are
a& +a a9
=0 (3a)
ax, ax,

axe +a ax =
0 (3b)

Let us denote the optimum by fi* and the optimal values of xl and
xs by x*, x2*. If we change the value of the constant b, we shall cer-
tainly change the value of the optimum, and so we may write S* as a
function of b
&* = E (b) (4a)

and
_xs
xi = xi (b) = xi (b) (4b)

T h us
d8' _ as dxi + as dx2
(5)
WY - \axl/=, __,. db ax2J 1... 1 db

t Recall that the lagrangian is minimized for a minimum of a convex objective func-
tion if the constraint is linear.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 33

and, differentiating Eq. (1) as an identity in b,


499 dxi + ag dxq
1=0 (6)
(al)=,__, A 49x2)=,-=, db
xe x z3 - ze

or
1 - (499/49x2)1;_;; dx2 /db
dx*1 _ (7)
db - (499/ax,)=,-_,.

Combining Eqs. (3a), (5), and (7), we obtain


dS* _ - x + dx2 496)
+
db A L \ axe =__,. 49x2 ,_=,
z,_zt zi-Z,()]
and from Eq. (3b),

(9)

That is, the Lagrange multiplier represents the rate of change of the
optimal value of the objective with respect to the value of the constraint.
If E has units of return and b of production, then X represents the rate of
change of optimal return with production. Because of this economic
interpretation the multipliers are often referred to as shadow prices or
imputed values, and sometimes as sensitivity coefficients, since they repre-
sent the sensitivity of the objective to changes in constraint levels.
A second related. interpretation may be developed in the context of
the one-dimensional processes considered in Sec. 1.13. We seek the
minimum of
N
S= n(xn-1,un) (10)
n-1
by choice of ul, u2, . . . , UN, where

xn = f*(xn_1,un) (11)
Now we can find the optimum by differentiation, so that
aS
=0 n = 1, 2, . . . , N (12)
au,.
or, since x,. depends on all u;, i < n, by the chain rule,
aS _ afn of-+1 afe
aun aun + 49x,,
496tH+1

aun
+ 0x,+1 axn aun +
+ MIN afn am. +/ a'?-+I + of-+1 +
axN-1
. . .
au,. aun ( 49x axn+1 ax,.
+ MIN . at-+1 afn =0 (13)
T axN-1 49x49 aun
34 OPTIMIZATION BY VARIATIONAL METHODS

and similarly,
as a6t a6tn+1 af.
aun_1 = ax._, + axn ax._,

+ aGIN ... __an = 0 (14)


OXN_1 ax.-1 au.-1
a8 a6tN
=o (15)
OUN 49UN

If we define a variable X. satisfying


an afn
+ ax.-1 (16a)
axn_1
N=0 (16b)

we see that Eqs. (13) to (15) may be written


618 a6tn
aua aun
+ Xnau
afn = 0 (17)

which, together with Eq. (16), is the result obtainable from the Lagrange
multiplier rule. The multiplier is then seen to be a consequence of the
chain rule and, in fact, may be interpreted in Eq. (17) a-
X, = (18)

the partial az
derivative of the objective with respect to the state at any
stage in the process. This interpretation is consistant with the notion
of a sensitivity coefficient.

1.16 PENALTY FUNCTIONS


An alternative approach to the use of Lagrange multipliers for con-
strained minimization which is approximate but frequently useful is the
method of penalty functions. Let us again consider the problem of mini-
mizing 8(x1,x2) subject to
9(x1,x2) = 0 (1)
We recognize that in practice it might be sufficient to obtain a small but
nonzero value of g in return for reduced computational effort, and we are
led to consider the minimization of a new function
9(x1,x2) = 8(x1,x:) + 3. Kl9(x1,x2))Z (2)

Clearly, if K is a very large positive number, then a minimum is obtain-


able only if the product Kg2 is small, and hence minimizing 9 will be
equivalent to minimizing a function not very different from F. while ensur-
ing that the constraint is nearly satisfied.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS n
Before discussing the method further it will be useful to reconsider
the geometric example of Sec. 1.9
s=axi+bx2 (3)
g = a(xi)2 + 2$xix2 + 7(x2)2 - 02 = 0 (4)
a>0 ay-,B2>0 (5)

Equation (2) then becomes


= axi + bx2 + 4K[a(xl)2R + 20x1x2 + y(x2)2 - a,2jt (6)
with the minimum satisfying
ai;
= a + 2K(axi + Nx2)[a(xi)2 + 2j6xlx2 + 7(x2)2 - 421 = 0 (7a)
axi
Q
b + 2K($xa + 7x2)[a(xi)2 + 2$xix2 + 7(x2)2 - A21 = Q (7b)
axe
Equations (7) give
x, a$ - baxi
(8)
b,8 - ay
and we obtain, upon substitution into Eq. (7a),

a + 2Kxi a + # # bP--a-y) I(X [a + 2# bft -- ay


\ + 7 (bo ay)2J A j 0 (9) - -
Equation (9) may be solved for xi in terms of K,, with x2 then
obtained from Eq. (8). Since we are interested only in large K, how-
ever, our purposes are satisfied by considering the limit of Eq. (9) as
K - oo. In order for the second term to remain finite it follows that
either xi =rr0, in which case the constraint equation (4) is not satisfied, or

(xi)2[a+2#b/3-ay+7(bfl 0 (10)
JJJJ

This last equation has the solution

xi = ±A
ay -M (Ila)
qq q
(a7 - N2) (ya2 + ab2 - 2ab#)
and, from Eq. (8),
p ba - as
x2 = ±A (11b)
(a7 - 02) (ya2 + ab2 - 2abfi)
which are the results obtained from the Lagrange multiplier rule.
In practice we would consider a sequence of problems of the form of
31 OPTIMIZATION BY VARIATIONAL METHODS

Eq. (2) for K(l), K(2), Krz,


9(n) = 8(x1,x2) + 12Kcn>[9(x1,x2)]2 (12)
where K(n+1) > K(.,) and lim
n_0
-a -. It might be hoped that as
becomes arbitrarily large, the sequence [1(n) } will approach a finite
limit, with g vanishing, and that this limit will be the constrained mini-
mum of E(x1,x2). It can in fact be shown that if the sequence converges,
it does indeed converge to the solution of the constrained problem. In
general the sequence will be terminated when some prespecified toler-
ance on the constraint is reached.
Finally, we note that the particular form 12Kg2 is only for con-
venience of demonstration and that any nonnegative function which
vanishes only when the constraint is satisfied would suffice, the particu-
lar situation governing the choice of function. Consider, for example,
an inequality constraint of the form
1x11 < X1 (13)

The function [x1/(X1 + t)]2N will be vanishingly small for small e and
large N when the constraint is satisfied and exceedingly large when it is
violated. It is thus an excellent penalty function for this type of con-
straint. Other functions may be constructed to "smooth" hard con-
straints as needed.

APPENDIX 1.1 LINEAR DIFFERENCE EQUATIONS


We have assumed that the reader is familiar with the method of solution
of linear difference equations, but this brief introduction should suffice
by demonstrating the analogy to linear differential equations. The nth-
order homogeneous linear difference equation with constant coefficients
is written
an-lxk+n-1 + . .
. + alxk+l + aoxk = 0 (1)

with x specified at n values of the (discrete) independent variable k.


If, by analogy to the usual procedure for differential equations, we seek
a solution of the form
xk = e"" (2)

Eq. (1) becomes


n

e'nk 1 ae"'P = 0 (3)


n-o
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 37

or, letting y = em and noting that e,nk 5 0, we obtain the characteristic


equation
n

1, a,,yP = 0 (4)
pe0

This algebraic equation will have n roots, y1, Y2, . , yn, which we
-

shall assume to be distinct. The general solution to Eq. (1) is then


xk = Ciylk + C2y2k + . . - + Cnynk (5)
where C1, C2, . . . , Cn are evaluated from the n specified values of x.

Consider, for example, the second-order difference equation


xn+2 + 2axn+l + $xn = 0 (6)
The characteristic equation is
y2+2ay+$=0 (7)

or
y= -a±1/a2_# (8)

The general solution is then


xk = C1(-a + v a2 - $)k + C2(-a - 2 - Q)k (9)
If initial conditions xo and x1 are given, the constants CI and C2 are
evaluated from the equations
xo = Cl + C2 (10a)
X1 = (-a + 1"a2 - $)C1 - (a + a2 - Q)C2 (10b)
The modification for repeated roots is the same as for differential
equations. If the right-hand side of Eq. (1) is nonzero, the ge eral solu-
tion is the sum of a particular and homogeneous solution, the standard
methods of finding particular solutions, such as undetermined coefficients
and variation of parameters, carrying over from the theory of ordinary
differential equations. For instance, if our example were of the form
Xn+2 + 2axn+1 + $xn = n (11)

the solution would be of the form


xk = C1(-a + a2 - Y)k + C2(-a - Va%_-0) k + X,, (p) (12)

The particular solution xk(r) can be found from the method of undeter-
mined coefficients by the choice
xk(v) = A + Bk (13)
38 OPTIMIZATION BY VARIATIONAL METHODS

Substituting into Eq. (11) gives


A + B(n + 2) + 2a[A + B(n + 1)] +,6(A + Bn) = n (14)
or, equating coefficients of powers of n on both sides,
n°: (1 + 2a + 19)A + 2(1 + a)B = 0 (15a)
n': (1 + 2a + $)B = 1 (15b)
Thus,
B = 1
(16a)
1+2a+,6
A
2(1 +a)
(1+2a+Y)2 (16b)

and the solution to Eq. (11) is


xk=C,(-a+ a2-j)k
2(1 + a) k
(17)
(1 +2a+6)2+1 +2a+$
The constants C, and C2 are again evaluated from the boundary con-
ditions. If, for example, x° and x, are given, Ci and C2 are found from

xo = C, + C2 - 2(1 +.a) (18a)


(1+2a+Q)2
x,=C,(-a+ a2

(18b)
(12+2+$)2+ 1 +2a+$

BIBLIOGRAPHICAL NOTES
Sections 1.8 and 1.3: The elementary theory of maxima and minima is treated in all
books on advanced calculus. The fundamental reference on the subject is
H. Hancock: "Theory of Maxima and Minima," Dover Publications, Inc., New
York, 1960
Useful discussions in the context of modern optimization problems may be found in
T. N. Edelbaum: in G. Leitmann (ed.), "Optimization Techniques-with Applications
to Aerospace Systems," Academic Press, Inc., New York, 1962
G. Hadley: "Nonlinear and Dynamic Programming," Addison-Wesley Publishing
Company, Inc., Reading, Mass., 1964
D. J. Wilde and C. S. Beightler: "Foundations of Optimization," Prentice-Hall, Inc.,
Englewood Cliffs, N.J., 1967
Sections 1.4 to 1.7: We shall frequently use problems in control as examples of applica-
tions of the optimization theory, and complete references are given in later chapters.
A useful introduction to the elements of process dynamics and control is
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 39

D. R. Coughanowr and L. B. Koppel: "Process Systems Analysis and Control,"


McGraw-Hill Book Company, New York, 1965
The demonstration that optimal feedback gains are computable from the solution of a
Riccati equation is an elementary special case of results obtained by Kalman.
References to this work will be given after the development of the required mathematical
groundwork.
Section 1.8: The references on the theory of maxima and minima in Secs 1.2 and 1.8 are
also pertinent for Lagrange multipliers and eaastrained minimization. The gen-
eralization of the Lagrange multiplier rule to include inequality constraints is based
on a theorem of Kuhn and Tucker, which is discussed in the books by Hadley and
by Wilde and Beightler. There is a particularly enlightening development of the
Kuhn-Tucker theorem in an appendix of
R. E. Bellman and S. E. Dreyfus: "Applied. Dynamic Programming," Princeton
University Press, Princeton, N.J., 1962
See also
H. P. Kunzi and W. Krelle: "Nonlinear Programming," Blaisdell Publishing Company,
Waltham, Mass., 1966
An interesting application of Lagrange multiplier-Kuhn-Tucker theory with process
applications, known as geometric programming; -i(di.cussed in the text by Wilde
and Beightler (cited above) and in
R. J. Duffin, E. L. Peterson, and C. Zener: "Geometric Programming," John Wiley &
Sons, Inc., New York, 1967
C. D. Eben and J. R. Ferron: AIChE J., 14:32 (1968)
An alternative approach, taken by some of these authors, is by means of the theory of
inequalities.

Sections 1.11 to 1.14: The generalized Euler equations were derived in


M. M. Denn and R. Aris: Z. Angew. Math. Phys., 16:290 (1965)
Applications to several elementary one-dimensional design problems are contained in
L. T. Fan and C. S. Wang: "The Discrete Maximum Principle," John Wiley & Sons,
Inc., New York, 1964

Section 1.15: The interpretation of Lagrange multipliers as sqp'sitwity coefficients follows


the books by Bellman and Dreyfus and Hadley. The chain-rule development for
one-dimensional staged processes is due'to
F. Horn and R. Jackson: Ind. Eng. Chem. Fundamentals, 4:487 (1965)

Section 1.16: The use of penalty functions appears to be due to Courant:


R. Courant: Bull. Am. Math. Soc., 49:1 (1943)
The theoretical basis is contained in supplements by H. Rubin and M. Kruskal (1950)
and J. Moser (1957) to
R. Courant: "The Calculus of Variations," New York University Press, New York,
1945-1946
OPTIMIZATION BY VARIATIONAL METHODS

See also

A. V. Fiacco and G. P. McCormick: Management Sci., 10:601 (1964)


H. J. Kelley: in G. Leitmann (ed.), "Optimization Techniques with Applications
to Aerospace Systems," Academic Press, Inc., New York, 1962

Appendix 1.1: Good introductions to the calculus of finite differences and difference
equations may be found in
T. Fort: "Finite Differences and Difference Equations in the Real Domain," Oxford
University Press, Fair Lawn, N.J., 1948
V. G. Jenson and G. V. Jeffreys: "Mathematical Methods in Chemical Engineering,"
Academic Press, Inc., New York, 1963
W. R. Marshall, Jr., and R. L. Pigford: "The Application of Differential Equations
to Chemical Engineering Problems," University of Delaware Press, Newark,
Del., 1947
H. S. Mickley, T. K. Sherwood, and C. E. Reed: "Applied Mathematics in Chemical
Engineering," 2d ed., McGraw-Hill Book Company, New York, 1957

PROBLEMS
1.1. The chemical reaction X --+ Y - Z, carried out in an isothermal batch reactor,
is described by the equations

U-kix - k,y
z + y + z - coast
If the initial concentrations of X, Y, and Z are ze, 0, and 0, respectively, and the values
per unit mole of the species are cx, cyy, and cz, find the operating time 8 which maidmizes
the value of the mixture in the reactor
IP - Cx(x(8) - xe] + CYy(8) + czz(8)
12. For the system in Prob. 1.1 suppose that ki and k, depend upon the temperature
u in Arrhenius form
B,)
ki .N kie exp (

For fixed total operating time 8 find the optimal constant temperature. Note the
difference in results for the two cases E, < E, (exothermic) and E, < E, (endothermic).
U. The feed to a single.etage extractor contains a mass fraction xe of dissolved solute,
and the extracting solvent contains mass fraction ye. The mass fraction of solute in
the effluent is x and in the exit solvent stream is y. Performance is approximately
described by the equations
x + oy - xe + aye
y - Kx
where K is a constant (the distribution coefficient) and a in the solvent-to-feed ratio.
The cost of solvent purification may be taken approximately as
C, -dy - ye)
Ye
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 41

and the net return for the process is the value of material extracted, P(xo - x), less
the cost of solvent purification. Find the degree of solvent purification yo which
maximizes the net return.
1.4. The reversible exothermic reaction X Y in a continuous-flow stirred-tank
reactor is described by the equations
0 - cf - c - er(c,cf,T)
0 = Tf - T + OJr(c,cf,T) - eQ
where c denotes the concentration of X, T the temperature, and the subscript f refers
to the feed stream. r is the reaction rate, a function of c, cf, and 7', a the residence
time, J a constant, and Q the normalized rate of heat removal through a cooling coil.
For fixed feed conditions find the design equations defining the heat-removal rate Q
which maximizes the conversion, cf - c. Do not use Lagrange multipliers. (Hint:
First consider c a function of T and find the optimal temperature by implicit differentia
tion of the first equation. Then find Q from the second equation.) Obtain an
explicit equation for Q for the first-order reaction

r a k,o exp 1 7x`1 (Cl - c) - k2o exp (T°c


1.5. A set of experimental measurements y,, y2, ... , yn is made at points x,, x2,
... x., respectively. The data are to be approximated by the equation
y = af(x) + R9(x)
where f(x) and g(x) are specified functions and the coefficients a and 0 are to be chosen
to minimize the sum of squares of deviations between predicted and measured values
of y

min a - [af(xi) + 59(xi): - yi]2

Obtain explicit equations for a and d in terms of the experimental data. Generalise
to relations of the form
N
y= akfk(x)

Find the best values of a and d in the equation


y -a+fx
for the following data:

x 0 1 2 3 4 5 6

y 0 4.5 11.0 15.5 17.0 26.5 30.5

1.6. A sequence of functions 0,(x), ¢2(x), . . . , is called orthogonal with weight-


ing p(x) over an interval (a,b] if it satisfies the relation
b

f. dx = 0 i0j
42 OPTIMIZATION BY VARIATIONAL METHODS

A given function y(x) is to be approximated by a sum of orthogonal functions


N
11(x) = I
n-l

Find the coefficients c,, c2, ... , cN which are best in the sense of minimizing the
weighted integral of the square of the deviations
N
mine = f ab P(x)[y(x) - Z C.O.(x)dx
n-l

Show that the sequence sin x, sin 2x, sin 3x, ... is orthogonal over the interval
0 < z < r with weighting unity. Find the coefficients of the first four terms for
approximating the functions
(a) y=1 0<x<r
(b) y =x 0 <x <T
r
x 0 < x <
(c) y = -2
T-x 2<x<
Compare the approximate and exact functions graphically.
1.7 The cost in dollars per year of a horizontal vapor condenser may be approxi-
m4ated by
C- 0,N-36D-1L-ss + #:N_6.2D0.'L-' + S,NDL + j94N-1."D-4 L

where N is the number of tubes, D the average tube diameter in inches, and L the
tube length in feet. 01, Bt, 03, and 04 are coefficients that vary with fluids and con-
struttion costs. The first two terms represent coat of thermal energy; the third, fixed
charges on the heat exchanger; and the.fourth, pumping costs: Show that for all
values of the coefficients the optimal cost distribution is 43.3 percent thermal energy,
53.3 percent fixed charges, and 3.33 percent pumping cost.
Show that the optimal value of the cost can be written
r, \f,\f,
C=C
\f=) 0)
where f,, f:, f,, f4 are respectively the fractions of the total cost associated with the
first, second, third, and fourth terms in the cost. [Hint: If A - aC, B - OC, and
a + A - 1, then C - (A/a)-(B/p)s.) Thus obtain explicit results for N, D, and L in
terms of the 0;. Solve for 01 - 1.724 X 106, 02 - 9.779 X 104, P = 1.57, Y4
3.82 X 10-1, corresponding to a desalinatign plant using low-pressure steam. (These
results are equivalent to the formalism of geometric programming, but in this case
they require only the application of the vanishing of partial derivatives at a minimum.
The problem is due to Avriel and Wilde.)
1.8. For the minimization of a function of one variable, &(x), extend the analysis of
Sec. 1a3 to obtain necessary and sufficient conditions for a minimum when both the
first and second derivatives vanish. Prove that a point is a minimum if and only if
the lowest-order nonvanishing derivative is positive and of even order.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS 43

13.. Prove the converse of Eqs. (13) and (14) of Sec. 1.3, namely, that a quadratic
form
ax' + 20xy + 'yy'
is positive definite if a > 0, a8 > y'.
1.10. For the system described in Prob. 1.4 suppose that the cost of cooling is equal to
pQ. Find the design equation for the rate of heat removal which maximizes conversion
less cost of cooling. Lagrange multipliers may be used.
UL Prove that when a is convex (the hessian is positive definite) the minimum of E
subject to the linear constraints

ai1xi - bi i - 1,2, .. . ,m <n


occurs at the minimum of the lagrangian with respect to x,, x=, . . . , x,,.
1.12. Obtain the results of Sec. 1.13 by direct application of the Lagrange multiplier
rule rather than by specialization of the results of Sec. 1.13. Extend the analysis to
include the following two cases:
(a) xN specified.
(b) Effluent is recycled, so that x, and xN are related by an equation xo - g(xN).
1.13. The reversible reaction A B is to be carried out in a sequence of adiabatic
beds with cooling between beds. Conversion in the nth bed follows the relation

8w-
f _. dE
r(T,E)
where 8,, is the holding time, xw the conversion in the stream leaving the nth bed, and
r( T, the reaction rate. In an adiabatic bed the temperature is a linear function of
inlet temperature and of conversion. Thus the conversion can be expressed as
8w - _,,dE F(xw-1
xwf T,,)
1=. i R(T,,E)
where T. is the temperature of the stream entering the nth bed. Obtain design
equations and a computational procedure for choosing 8w and T. in order to maximize
conversion in N beds while maintaining a fixed total residence time
N
9- 8w
n-1
This problem has been considered by Horn and Huchler and Aria.)
2
Optimization with Differential
Calculus: Computation

2.1 INTRODUCTION
The previous chapter was concerned with developing algebraic con-
ditions which must be satisfied by the optimal variables in minimizing
an objective. The examples considered for detailed study were some-
what extraordinary in that the solutions presented could be obtained
without recourse to extensive numerical calculation, but clearly this will
rarely be the case in practice. In this chapter we shall consider several
methods for obtaining numerical solutions to optimization problems of
the type introduced in Chap. 1. An entire book could easily be devoted
to this subject, and we shall simply examine representative techniques,
both for the purpose of introducing the several possible viewpoints and
of laying the necessary foundation for our later study of more complex
situations. Two of the techniques which we wish to include for com-
pleteness are not conveniently derived from a variational point of view,
so that in order to maintain continuity of the development the details
are included as appendixes.
µ
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 45

2.2 SOLUTION OF ALGEBRAIC EQUATIONS


The condition that the first partial derivatives of a function vanish at
the minimum leads to algebraic equations for the optimal variables, and
these equations will generally be highly nonlinear, requiring some iter-
ative method of solution. One useful method is the Newton-Raphson
method, which we shall derive for the solution of an equation in one
unknown.
We seek the solution of the equation
f (x) = 0 (1)

If our kth approximation to the solution is x(h) and we suppose that the
(k + 1)st trial will be the exact solution, we can write the Taylor series
expansion of f (x(k+I)) about f(x(k)) as
f(x(kFI)) = 0 = f(x(k)) -+' f(x(k))(x(k+i) - xu)) + (2)

Neglecting the higher-order terms and solving for x(k+i), we then obtain
the recursive equation
x(k+I) = X(k) ._. f (x(j (3)

As an example of the use of Eq. (3) let us find the square root of 2
by solving
f(x) = x2 - 2 = 0 '(4)

Equation (3) then becomes


x(k+1) = x(k)
x(02 -2 x(k)2+2
2x(k) 2z(k) . (5)

If we take our initial approximation as x('),= 1, we obtain x(2) = 1.5,


x(') = 1.4167, etc., which converges rapidly to the value 1.4142. On the
other hand, a negative initial, approximation.veill converge to - 1.4142,
while an initial value of zero will diverge immediately.
When it converges, the Newton-Raphson method does so rapidly.
In fact, convergence is quadratic, which means, that the error Ix(k+l)
xj is roughly proportional to the square of the previous error, rxk) -
xi. Convergence will generally not occur, however, without a good
first approximation. The difficulties which to- be anticipated can be
visualized from Fig. -2.1. The Newton-Raphson procedure is one of esti-
mating the function by its tangent at the point x(k). Thus, as shown,
the next estimate, x(k+I), is closer to the root, x(k+r) closer still, etc. Note,
however, that convergence can be obtained only when the slope at xtk)
has the same algebraic sign as the slope at the root. The starting point
4 OPTIMIZATION BY VARIATIONAL METHODS

'(4

_ i a II

'(K) ,r(k),(R+2),,(k+3) (k+) F),. 21 Successive- iterations of the


,, Newton-Raphson method.

x(m, where f (x) is different in sign from f (f), will result in divergence
from the solution.
For an optimization problem the function f (x) in Eq. (1) is the
derivative 6'(x) of the function 8(x) -which is being minimized. Equa-
-tion (3) then has the form
(k)
z(k+l) = X (k) - e(z(k) (6)

At the minimum, 6" > 0, so that convergence is possible (but not guar-
anteed!) only if 8" is positive for each approximation. For the minimi-
sation. of a function of several variables, 3(x1,x2, x.), the iteration
formula analogous to Eq. (6) can be demonstrated by an equivalent
...
development to be

-Ft(k+1) = z1(k) - V tnei (7)

where wq is the inverse of the hessian matrix of F, defined as the solution


of the n linear algebraic equations
w

wi;826(x1(k),x!(k),
- . . . ,xn(k)) _ 1 i=p
J-
ax, x, a'' - { o i p (8)

Z3 AN APPLICATION OF THE NEWTON-RAPHSON METHOD


As a somewhat practical example of the application of the. Newton-
Raphson method to an optimization problem we shall consider the con-
secutive-reaction sequence described in Sec; 1.12 and seek the optimal
temperature in a single reactor. The reaction sequence is
X - Y -- products
and taking the functions F and G as linear, the outlet concentrations of
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 47

species X and Y are defined by the equations


xo - x - 6kloe-B1'I"x = 0 (la)
yo - y + 6kloe-Bi I"x - 6k2oe-Bi I"y = 0 (2a)
or, equivalently,
x0
x= 1 + 6k10e-B,'I"
_ 0k10e-B''"Lx0
Y OkzOe-E= lu + (1 + elope E; l")(1 + Ok20e E''1") (2b)
1+
The object is to choose u to maximize y + px or to minimize
-y -px= - 1 + 6k20e`R' "" Yo

Okloe-E''I"xo pxo
(1 + Bkloe-81`)°)(1 + Bk2oe81'J") - 1 + 0kioe-E,'1u (3)

For manipulations it is convenient to define a new variable


v = e-81'1" (4a)
u= In v (4b)
so that the function to be minimized may be written
S(v) _ _ Yo - Okloyzo _ pxo (5)
1 + 6kzovB (1 + Ok10v)(1 + 60200°) 1 + 6k10v
where fl is the ratio of activation energies

(6)
E1?

The iteration procedure, Eq. (6) of the previous section, is then


v(k+1) = v(k) _. V (y(k)) (7)
&"(v(k) )

We shall not write down here the lengthy explicit relations for 6' and V.
For purposes of computation the following numerical values were
used:
x0= 1 yo=0
k10 =5.4X 1010 k20=4.6X 1017
Ei = 9,000 Eq = 15,000
p=0.3 6= 10
The first estimate of u was taken as 300, with the first estimate of v then
calculatectfrom Eq. (4a). Table 2.1 contains the results of the iteration
4a OPTIMIZATION BY VARIATIONAL METHODS

Table 2.1 Successive approximations by the Newton-Raphson


method to the optimal temperature for consecutive reactions

Iteration V X 1012 u -6 -6' 6" X 10-44

Initial 9.3576224 X 10-4 300.00000 0.33362777 3.4131804 X 1011 37.249074


1 1.0098890 325.83690 0.53126091 1.2333063 X 1011 14.815753
2 1.8423181 333.08664 0.59275666 3.4447925 X 10" 7.4848782
3 2.3025517 335.85845 0.60156599 5.5884200 X 10' 5.1916627
4 2:4101939 336.43207 0.60187525 2.3492310 X 10' 4.7608371
5 2.4151284 336.45780 0.60187583 4.6704000 X 104 4.7418880
6 2.4151382 336.45785 0.60187583 8.6400000 X 104 4.7418504
7 2.4151382 336.45785 0.60187583 8.6400000 X 104 4.7418504

based on Eq. (7), where an unrealistically large number of significant


figures has been retained to demonstrate the convergence. Starting
rather far from the optimum, convergence is effectively obtained by the
fourth correction and convergence to eight significant figures by the sixth.
It is found for this example that convergence cannot be obtained
for initial estimates of u smaller distances to the right of the optimum.
The reason may be seen in Figs. 2.2 and 2.3, plots of g and S' versus u,
respectively. There is an inflection point in & at u 347, indicating a
change in sign of C", which shows up as a maximum in S'. Thus, care
must be taken even in this elementary case to be sure that the initial
estimate is one which will lead to convergence.

-0.1

-0.4

- 0.5

-0.6

300 320 340 360 380 400 Fig. 2.2 Objective function versus
u temperature for consecutive reactions.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 4!

Fig. 2.3 Derivative of objective versus I I 1 1

300 320 340 _360 380 400


1 1 1 1

temperature for consecutive reactions. U

2.4 FIBONACCI SEARCH


The use of the Newton-Raphson method assumes that the function and
its derivatives are continuous and, most importantly, that derivatives
are easily obtained. Convergence is not assured, and it may be incon-
venient to evaluate derivatives, as would be the case if the function .8
were not available in analytical form but only as the outcome of a physi-
cal or numerical experiment. Under certain assumptions an appealing
alternative is available for functions of a single variable, although we
must change our point of view somewhat.
We shall restrict our attention to functions 8(x) which are unimodal;
i.e., they must possess a single minimum and no maximum in the region
of interest, but they need not be continuous. An example is shown in
Fig. 2.4, where the region of interest is shown as extending from x = 0
to x = L. An important feature of such functions is the fact that given
two observations 8(x1) and 8(x2) at points x, and x2, respectively, we may
say unequivocally that if 8(x1) > 8(x2), the minimum lies somewhere in
the interval x1 < x < L, while if 8(x2) > 8(x1), the minimum lies in the
interval 0 < z < X2. Note that this is true even if x1 and x2 both lie on
the same side of the minimum. The Fibonacci search procedure exploits
so OPTIMIZATION BY VARIATIONAL METHODS

this property of unimodal functions to eliminate in a systematic manner


regions of the independent variable in which the minimum cannot occur.
After N such eliminations there remains an interval of uncertainty, which
must contain the minimum, and the procedure we shall describe here is
the one requiring the minimum number of measurements (evaluations)
of the function in order to reach a given uncertainty interval.
The algorithm requires that the function be measured at two
symmetric points in the interval of interest, 0.382L and 0.618L. If
8(0.382L) > 8(0.618L), the region to the left of 0.382L is excluded,
while if 8(0.618L) > &(0.382L), the region to the right of O.&18L is
excluded. The process is then repeated for the new interval. Part of
the efficiency results from the fact that one of the two previous measure-
ments always lies inside the new interval at either the 38.2 or 61.8 pkr-
cent location, so that only one new measurement need be made-at the
point an equal distance on the other side of the midpoint from the point
already in the interval. The proof that this is the best such algorithm in
the sense defined above is straightforward, but of a very different nature
from the variational analysis which we wish to emphasize, and so we
bypass it here and refer the interested reader to Appendix 2.1.
To demonstrate the Fibonacci search algorithm we shall again con-
sider the reactor example of the previous section, the minimization of
S(u)
_ yo _ Okioe-$ "uxo
1 + 6k2oe-E, i° (1 + ekloe-$''!")(1 + ek2oe`R' /u)
-azo
1 + ekloe-Srru (1)

for the values of parameters used previously. The initial interval of


interest is 300 _< u < 400, and we have already seen that Newton-
Raphson will converge from starting values in no more than half the
region.
In this case L = 100, and the points at 0.382L and 0.618L are
u = 338.2 and u = 361.8, respectively. Here,
8(338.2) = -0.599 8(361.8) = -0.193

6(x)
6(x2)

X Fig. 2.4 A unimodal function.


OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 51

300 320 340 360 380 400


u

Fig. 2.5 Successive reductions of the interval of uncertainty


for the optimal temperature by Fibonacci search.

so that the region 361.8 < u < 400 is excluded. The remaining point is
u = 338.2, which is at 61.8 percent of the new interval 300 < u < 361.8,
and the point which is symmetric at 38.2 percent is u = 323.6076. The
process is then repeated, leading this time to the elimination of the region
on the far left. The first several eliminations are shown graphically in
Fig. 2.5, and the full sequence of calculations is shown in Table 2.2,
where many extra significant figures have again been retained. The

Table 2.2 . Successive Iterations of the Fibonacci search method to the final
Interval of uncertainty of the optimal temperature for consecutive reactions

No. of
Computa-
tlona Int .1Val 140.888E g(u0.088L) u0.818L &(u0.818L)

1,2 300.000-400.000 338.20000 0.59912986 361.80000 0.19305488


3 300.000-361.800 323.60760 0.50729013 338.20000 0.59912986
4 323.608-361.800 338.20000 0.59912986 347.21050 0.49250240
5 323.608-347.211 332.62391 0.59023933 338.20000 0.59912986
6 332.624-347.211 338.20000 0.59912986 341:63842 0.57647503
7 332.624-341.638 336.06745 0.60174375 338.20000' 0.59912986
8 332.624-338.200 334.75398 0.59949853 336.06745 0.60174375
9 334.754-338.200 336.06745 0.60174375 336.88362 0.60171592
10 334.754336.884 335.56760 0.60119983 336.06745 0.60174375
11 335.568-336.884 336.06745 0.60174375 336.38086 0.60187065
12 336.067-336.884 336.38086 0.60187065 336.57184 0.60186444
13 336.067-336.572 336.26013 0.60184180 336.38086 0.60187065
14 336.260-336.572 336.38086 0.60187065 336.45277 0.60187581
15 336.381-336.572 336.45277 0.60187581 336.49889 0.60187435
16 336.381-336.499 336.42595 0.60187494 336.45277 0.60187581
S2 OPTIMIZATION BY VARIATIONAL METHODS

rapid convergence can be observed in that 7 evaluations of C are required


to reduce the interval of uncertainty to less than 10 percent of the origi-
nal, 12 to less than 1 percent, and 16 to nearly 0.1 percent.

2.5 STEEP DESCENT


The Fibonacci search technique does not generalize easily to more than
one dimension, and we must look further to find useful alternatives to
the solution of the set of nonlinear equations arising from the necessary
conditions. - The method of steep descent, originated by Cauchy, is one
such alternative, and it, too, is a technique for obtaining the solution
without recourse to necessary conditions.
Our starting point is again the variational ;equation

M = C(x1 + 6x1, X2 + Ox:) - 8(x1,x2) = Oxl + Ox: + °(E) (1)


8x1 8
where the partial derivatives are evaluated at 91, x2, but we now suppose
that xi and 22 are not the values which cause 8(xi,x2) to take on its mini-
mum. Our problem, then, is to find values Ox1, ax: which will bring g
closer to its minimum or, in other words, values Ox1, ax: which will. ensure
that
at < 0 (2)
A choice which clearly meets this requirement is

Ox1

Ox2 -W2I aS
87X2)11.12
(3b)

where w1 and w2 are sufficiently small to allow o(e) to be neglected in


Eq. (1). We then have

08 _ -w1 (az) - W2 <0


u-)
(4)

which satisfies Eq. (2), so that if w1 and w2 are small enough, the new
value x1 + Sxl, x2 + u2 will be a better approximation-of the minimum
than the old.
An obvious generalization is to choose
e& as (5a)
axl - wl2 ax2
Ox1 = - w11

ax2 = - w21 az - w22 a = (5b)


OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 53

where the matrix array [w1 W12]


is positive definite and may depend
W21 W22
on the position xl, z2, guaranteeing that
2 2
M aM
(6)
i - 1 i-1

for small enough w;,. All we have done, of course, is to define a set of
directions
s

6x1 = - L WI4 ax;


ag (7)
i-1

which will ensure a decrease in the function 6 for sufficiently small dis-
tances. We have at this point neither a means for choosing the proper
weighting matrix of components w;, nor one for determining how far to
travel in the chosen direction. We simply know that if we guess values
x1i x2, then by moving a small distance in the direction indicated by Eq.
(7) for any positive definite matrix, we shall get an improved value.

2.6 A GEOMETRIC INTERPRETATION

It is helpful to approach the method of steep descent from a somewhat


different point of view. Let us suppose that we are committed to mov-
ing a fixed distance 4 in the xlx, plane and we wish to make that move in
such a way that & will be made as small as possible. That is, neglecting
second-order terms, minimize the linear form

bS = [
>h.3,
a
ax2 2..i.
ax2 (1)

by choice of x1i x2, subject to the constraint


(axl)e + (ax2)2 - 42 = 0 (2)

This is precisely the problem we solved in Sec. 1.9 using Lagrange.multi-


pliers and in Sec. 1.16 using penalty functions, and we may identify terms
and write the solution as
as/ax;
ax: 4 i = 1, 2 (3)
((a&/ax1) + (as/ax2) 2]
That is, in Eqs. (3) of See. 2.5 w, is defined as

i= 1,2 (4)
54 OPTIMIZATION BY VARIATIONAL METHODS

where A is a step size and w; is the same for each variable but changes in
value at each position.
The ratios
as/ax;
[(as/ax,), + (as/ax,),)i
will be recognized as the set of direction cosines for the gradient at the
point and Eq. (3) is simply a statement that the most rapid
change in 6 will be obtained by moving in the direction of the negative
gradient vector. A potential computational scheme would then be as
follows:

1. Choose a pair of points (21,22) and compute g and a6/ax,, as/axe.


2. Find the approximate minimum of 3 in the negative gradient direc-
tion; i.e., solve the single-variable problem

mXnE,ax, ! - aa,
Call the minimizing point the new (f,,,) An approximate value
of a might be obtained by evaluating 8 at two or three points and
interpolating or fitting a cubic in X. A Fibonacci search could be
used, though for an approximate calculation of this nature the num-
ber of function evaluations would normally be excessive.
3. Repeat until no further improvement is possible. In place of step 2
it might sometimes be preferable to use some fixed value w; = w
and then recompute the gradient.. If the new value of S is not less
than s(x,,fi,), the linearity assumption has been violated and w is
too large and must be decreased.

It must be noted that this gradient method will find only a single
minimum, usually the one nearest the surfing point, despite the possible
existence of more than one. Thus, the process must be repeated several
times from different starting locations in order to attempt to find all the
places where a local minimum exists.
A far more serious reservation exists about the method derived
above, which might have been anticipated from the results of the previ-
ous section. We have, as is customary, defined distance by the usual
euclidean measure
A= = 0x,), + (ox,)' (5)
Since we shall frequently be dealing with variables such as temperatures,
'concentrations, valve settings, flow rates, etc., we must introduce nor-
malizing factors, which are, to, a certain extent, -arbitrary. The proper
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 55

distance constraint will then be


A2 = a(axl)2 + y(ax2)2 (6)

and the direction of steep descent, and hence the rate of convergence, will
depend on the scale factors a and y. Moreover, it is quite presumptu-
ous to assume that the natural geometry of, say, a concentration-valve-
setting space is even euclidean. A more general definition of distance is
A2 = a(axi)' + 2$ axl 6x2 + 'y(ax2)2 (7)

Here, the matrix array 10 $] is known as the covariant metric tensor


and must be positive definite, or a > 0, ay - $2 > 0. It follows, then,
from the results of Sec. 1.9 that the coefficients w;j in Eq. (7) of Sec. 2.4
aret
wll= Ay
D (8a)

W12 = w21 = - D (8b)

(8c)

where
(ay aFi ' a&)2
p aFi aFi
D
={ - #) [y
2
(.axl) + a (WX2-
2S
axl
ax2] c
(Sd)

There is no general way of determining a suitable geometry for a


given problem a priori, and we have thus returned to essentially the same
difficulty we faced at the end of the previous section. After an example
we shall explore this question further.

2.7 AN APPLICATION OF STEEP DESCENT


As an example of the use of steep descent we again consider the consecu-
tive-reaction sequence with linear kinetics but now with two reactors
and, therefore, two temperatures to be chosen optimally. The general
relations are
xn-1 - x - Ok1(u,)x,. = 0 n = 1, 2, . . . , N (1)
y.-i - y,. + Bkl(u,.)xn - Ok2(un)yn = 0 n = 1, 2, ... , N (2)
t The matrix w,, consists of a constant multiplied by the inverse of the covariant
metric tensor. The inverse of an array a;1 is defined as the array b;, such that

I aubt,
11
l0
-j
iOj
k
Si OPTIMIZATION BY VARIATIONAL METHODS

where
k,oe a,-ru, i = 1, 2 (3)

and the objective to be minimized by choice of ul and u2 is.


yN - PxN (4)

For N = 2 this can be solved explicitly in terms of ul and u2 as


1 Yo Okl(u,)xo
= 1 + 9k2(u2) 1+ 9k2(ul) + [1 + ekl(ul)][1 + 9k2(u1)]
_ 9ki(u2)xo
[1 + 9k1(u2)][1 + 9k2(u2)][1 + 9k1(ul)]
Pxo
(5)
[1 + 9kl(u2)] 1'+ 9k1(ul]
or, for computational simplicity,
1 Yo eklovixo
1 + Bk2ovza 1 + 9k2ov1D + (1 + 9klovl) (1 + 9k2ovio)
_ 9k1oy2xo
(1 + 9k1ov2)(1 + 9k2ov?)(1 + 9klovl)
Pxo
(6)
(1 + Bk1ov2)(1 + 9klov1)
where
v = eZ111U. (7)

and

E2
0 (8)
1

Though they are easily computed, we shall not write down the cumber-
some expressions for a6/avl and a8/av2.
The values of the parameters were the same as those used previ-
ously in this chapter, except that 0 was set equal to 5 in order to main-
tain comparable total residence times for the one- and two-reactor
problems. The simplest form of steep descent was used, in which the
correction is based on the relation [Eq. (3) of Sec. 2.5]
vlmw = vlold - W1 a&i
(9a)
av1

V2n.w = v2c1d - W2 a : I (9b)

Since the relative effects of vi and v2 should be the same, no scale factor is
needed and wl and w2 were further taken to be the same value w. Based
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 57

Table 2.3 Successive approximations to the optimal temperatures


In two reactors for consecutive reactions by steep descent

Iteration V1 X 107, u, ei X 101, YZ -G -dG/Bo, -8E/8v,

Initial 9.358 X 10-0 300.0 9.358 X 10'' 300.0 .3340 1.749 X 1011 1.749 X 1011
1 1.843 333.1 1.842 333.1 0.6360 2.877 X 1010 2.427 X 1010
2 2.131 334.6 2.085 334.6 0.6472 1.771 X 1010 1.354 X 1010
3 2.308 335.9 2.220 335.4 0.6512 1.192 X 1010 8.158 X 10,
4 2.427 336.5 2.302 335.9 0.6530 8.434 X 100 5.067 X 100
6 2.511 337.0 2.353 336.1 0.6538 6.159 X 100 3.168 X 100
6 2.573 337.3 2.384 336.3 0.6542 4.602 X 100 1.959 X 100
7 2.619 337.5 2.404 336.4 0.6544 3.500 X 100 1.175 X 100
8 2,654 337.6 2.416 336.5 0.6546 2.701 X 100 6.623 X 100
9 2.681 337.8 2.422 336.5 0.8546 2.111 X 100 3.286 X 100
10 2.702 337.9 2.426 336.5 0.6547 1 867 X 100 1.139 X 100
11 2.719 338.0 2.427 336.5 0.6547 1.329 X 100 -2.092 X 101

upon the values of derivatives computed in the example in Sec. 2.3, this
weighting w was initially taken as 10-27, and no adjustment was required
during the course of these particular calculations. The initial estimates
of u1 and u2 were both taken as 300, corresponding to vi and V2 of 9.358 X
10-14. The full sequence of calculations is shown in Table 2.3.
Examination of the value of the objective on successive iterations
shows that the approach to values of I; near the optimum is quite rapid,
while ultimate convergence to the optimizing values of v1 and v2 is rela-
tively slower. Such behavior is characteristic of steep descent. Some
trial and error might have been required to find an acceptable starting
value for w had a good a priori estimate not been available, and some
improvement in convergence might have been obtained by estimating
the optimal value of w at each iteration, but at the expense of more
calculation per iteration.
This is an appropriate point at which to interject a comment on
the usefulness of the consecutive-chemical-reaction problem as a compu-
tational example, for we shall use it frequently in that capacity. The
examples done thus far indicate that the objective is relatively insensi-
tive to temperature over a reasonably wide range about the optimum,
a fortunate result from the point of view of practical operation. This
insensitivity is also helpful in examining computational algorithms, for it
means that the optimum lies on a plateau of relatively small values of
derivatives, and computational schemes basing corrections upon calcu-
lated derivatives will tend to move slowly and have difficulty finding the
true optimum. Thus, codlpetitive algorithms may be compared under
difficult circumstances. '

2.8 THE WEIGHTING MATRIX


We can gain some useful information about a form to choose for the
weighting matrix to,, in steep descent by considering the behavior of the
sa OPTIMIZATION BY VARIATIONAL METHODS

function 8 near its minimum, where a quadratic approximation may suf-


fice. Since there is no conceptual advantage in restricting 8 to depend
on only two variables, we shall let 8 be a (unction of n variables xi,
z:, . . . , xn and write
ft
p (xl,x4, 88
'Fi ,xn) = 6(422, . . . ,. tn) + iaxi bxi
Z

+ cc
8x.axiax;+ ... (2)
i-i,-1 8x
or, for compactness of notation, we may denote the components of the
gradient 88/8x by Gi and the hessian 8'8/(8x; 8x;) by H1,, so that

8(xl,x2, ,xn) _ 6(z 1,x2, ,fin) + Gi bxi

+2i-iIf--1I Hi; ax 6x;+ (2)

If we now minimize 8 by setting partial derivatives with respect to


each bxi bo zero, we obtain
n
Gi + Hi;bx; = 0 (3)
i-1
Since the hessian is presumed positive definite at (and hence near) the
minimum, its determinant does not vanish and Eq. (3) can be solved by
Cramer's rule to give
n
bxi = - wiG; (4)
j-1

where the weighting matrix satisfies the equations


n 1
WikHk, = ail = i 0 i=j (5)
i j
k-1
That is, the proper weighting matrix is the inverse of the hessian. This
is equivalent to the Newton-Raphson method described in Sec. 2.2. Note
that it will not converge if the hessian fails to be positive definite at the
point where the calculation is being made. Thus it will be of use only
"near" the solution, but even here its use will require the calculation of
second derivatives of the function 8, which may often be inconvenient or
even difficult. It will, however, yield the minimum in a single step for
a truly quadratic function and give quadratic convergence (when it con-
verges) for all others.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION SI

Several methods have been devised which combine some of the


simplicity of the most elementary steep-descent procedure with the rapid
ultimate convergence of the Newton-Raphson method. This is done by
computing a new weighting matrix w,, at each iteration from formulas
requiring only a knowledge of the first derivatives of S, starting at the
first iteration with the simple weighting w,, = wS;;. As the optimum is
approached, the weighting matrix w;; approaches that which would be
obtained using the Newton-Raphson method but without the necessity
of computing second derivatives. In many practical situations such a
procedure is needed to obtain satisfactory convergence. A discussion of
the basis of the computation of new w;; would be out of place here, how-
ever, and we refer the reader to the pertinent literature for details.

2.9 APPROXIMATION TO STEEP DESCENT


In situations where it is inconvenient or impossible to obtain analytical
expressions for the derivatives of the function 8 required in steep descent
some form of numerical approximation must be used. The amount of
computation or experimentation required to obtain accurate estimates
of the gradient at each iteration is generally excessive in terms of the
resulting improvement in the optimum, so that most techniques use crude
estimates. A number of such procedures have been developed and tested,
and the bibliographical notes will provide a guide for those interested in
a comprehensive study. The one technique we shall discuss here is con-
ceptually the simplest yet, surprisingly, one of the most effective.
The procedure can be motivated by reference to Fig. 2.6, where
contours of constant S are drawn for a two-variable problem. The tri-
.

angle ABC provides data for crudely estimating the gradient, and if A is

Fig. 2.6 Approximation to steep descent


by reflection of triangles. I[,
60 OPTIMIZATION BY VARIATIONAL METHODS

the worst of the three points, the line with an arrow from A through the
centroid of the triangle provides a reasonable first approximation. Thus,
we can simply reflect the triangle about the line BC to obtain a new tri-
angle B1C in a region of lower average value of &.With only a single new
calculation the worst point can again be found and the triangle reflected,
leading to the triangle B21. The process is continually repeated, as
shown.
There are several obvious difficulties with this overly simple pro-
cedure. First, continuous reflection is not adequate, for it will result in
too slow convergence far from, the optimum and too much correction and
oscillation near the optimum. Thus, the point reflected through the cen-
troid should be moved a fractional distance a > 1 on the other side, and
if the new point is also the worst, the distance moved should then be
reduced by a fractional factor of 1/r < 1. Hence, the triangle will be
distorted in shape on successive iterations. In some cases the distortion
will cause the triangle to degenerate to 'a line, so that more than three
starting points will usually be needed. For n variables the number of
points would then be greater than n + 1. (The coordinate of the cen-
troid of N points is simply the sum of the individual coordinates divided
by N.)

380 P_

370

360

350
U2

340

330

320

310

I I _ I I I 1

300 310 320 330 340 350 360 370 380


u,

Fig. 2.7 Successive approximations to optimal temperatures in


two reactors by the complex method.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION M

Table 2.4 Successive approximations to the optimal temperatures


in two reactors for consecutive reactions by the complex method

Itera-
tion It, [t2 It t U2
t2
-c
Initial 35:3.0 0.0657
:330.0 322.0 358.0 0.384-1 878.0 370.0 0.0222
1 383.0 330.0 0.0657 322.0 358.0 0.3844 338.9 330.1 0.6431
2 302.4 3-51.6 0.5018 322.0 358.0 0.3844 3:38.9 1330.1 0.6431
3 302.4 351.6 10.5018 319.9 331.7 0.5620 :3:35.9 33(1. 11 0.6431
4 343.5 319.9 110.6055 319.9 331.7 0.5620 338.9:330.1 1 0.6431
5 343.8 319.9 10.6055 336.4 :326.6 0.6229 335.9 3:30.1 1 0.6431
6 :334.3 3:3'2.9 0.6402 336.4 326.6 0.6229 33,`5.3) 330.1 0.6431
7 334.3 332.9 336.7 334.1 0.65(18 33,5.9 330.1 0.6431
S 339.7 331.7 0.6450 336.7 334.1 0.11508 338.9 330.1 10.6431
9 339.7 331.7 0.6480 336.7 334.1 0.6508 337.9 3:34.4 0.6529
10 336.0 i
335.6 0.6518 336.7 334.1 0.6608 :337.9 334.4 0.6529
11 336.0 335.6 0.6518 337.0 1 335.5 0.6534 :337.9. 334.4 0.6529
12 338.2 I :3:34.6 0.6534 337.0 335.5 0.6534 337.9 334.4 0.6529
13 1338. 2 3:34.6 0.6534 337.0 335.5 10.6534 337.5 335.4 10.6538
14 338.2 334.6 10.6534 338.3 334.7 0.6537 3:37.5 { 335.4 1 0.6.538

The geometrical figure made up by joining N + 1 points in an


N-dimensional space is called a simplex, and the basic procedure is often
called the simplex method, a name which can cause unfortunate confusion
with the unrelated simplex method of linear programming (Appendix 2.2).
Box has done a detailed study of the modifications required for systems
with constraints and has called the resulting procedure the complex
method. His paper is highly recommended for computational details.
As an example of the simplex-complex approach the optimal tern-
perature problem for two stages was again solved, the function & repre-
sented by Eq. (5) of Sec. 2.7, and the same parameters used as for steep
descent. For simplicity only three points were used, and the starting
values were chosen using random numbers in the interval 300 < ui,
u2 < 400. The reflection parameter a was taken as 1.3 and r as 2. The
first nine iterations are shown graphically in Fig. 2.7, where the sequence
of triangles is ABC, AC1, A12, 213, 314, 145, 561, 671, 781, 798; the
first 14 are listed in Table 2.4, with the worst of each set of three points
in boldface. The consequence of random starting values is an initial
scan over the entire region of interest, followed by rapid approach to one
region and systematic descent to the neighborhood of the optimum. It
will be observed that convergence is moderately rapid; the final region
computed is a small one containing the single value found previously by
steep descent and the worst value of the objective close to the minimum
found previously.
42 OPTIMIZATION BY VARIATIONAL METHODS

APPENDIX 2.1 OPTIMALITY OF FIBONACCI SEARCH


In this appendix we wish to prove that the Fibonacci search algorithm
described and applied in Sec. 2.4 is optimal for unimodal functions in the
sense that for a given final interval of uncertainty it is the sequence of
steps requiring the fewest function evaluations. We shall actually prove
an equivalent result, that for a given number of function evaluations the
Fibonacci algorithm places the minimum within the smallest possible
interval of uncertainty or, measuring length in units of the final interval
of uncertainty, that it allows the largest possible initial interval [0,L]
such that after a given number of observations the minimum can be
located within an interval of unit length. We develop the computational
scheme by first proving the following result.
Let L. be any number with the property that the minimum of the
unimodal function s(x) on the interval 0 < x < Lp can be located within
an interval of unit length by calculating at most n values and making
comparisons. If we define

F. = sup L. (1)

then

F. = F._2 n>2 (2)


Fo = FI = 1 (3)

The notation sup in Eq. (1) stands for supremum, or least upper bound.
We use this instead of "maximum" because while L. will be able to
approach the upper bound F. arbitrarily closely, it will never in fact be
able to take on this value. Our development follows that of Bellman
and Dreyfus quite closely.
Clearly if we have made no observations, we can place the mini-
mum within a unit interval only if we have started with a unit interval,
so that Fo = 1, and since a single observation is of no use whatsoever in
placing the minimum, one observation is no better.than none and F1 = 1.
For n = 2 the minimum may lie either in the interval [0,x2] or [x1,L2], and
so neither of these may exceed unity. It is obvious, however, that each
of these intervals may be set equal to unity and the value of L2 maxi-
mized by placing x1 and x2 equal distances from the center of the interval
and as close together as possible; that is,

x1-I-e x2=1 (4)

in which case
L2=2- (5)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 43

where E is as small as we wish, and hence


F2 = sup L2=2=Fo+F1 (6)

The remainder of the development proceeds by induction. We


assume that
Fk = Fk_1 + Fk_2 k = 2, 3, . . . ,n-1 (7)
and then prove that this implies Eq. (2) for k = n, completing the proof.
Referring to Fig. 2.4, setting L = L,,, we see that if 6(x1) < g(X2), this
implies that the minimum is in the interval [J,x:1. Since we now have
only n - 2 trials in which to locate the minimum in a unit interval, we
must be left with a smaller interval than the largest possible having
n - 1 trials. Hence
x2 < F,.-1 (8)
Similarly, it is possible that the minimum might in fact be in the inter-
val [0,x11, but an additional trial would be necessary to estabJi h this,
leaving n - 3 trials. Thus,
x1 < (9)
On the other hand, if 6(X2) < &(x1), the minimum would lie in the
interval [x1,L ], and identical reasoning requires
L. - xl < Fn-1 (10)

Combining the inequalities (9) and (10), we find


L. < F,._1 + Fa_2 (11)
or
F. = sup L < Fn-1 + Fr_s (12)

Suppose we pick an interval

L. = (1 - 2) F,._2) (13)

and place x1 and x2 symmetrically,

xs = ( 1 - 2} (14)

so that the interval remaining after these two trials is as close as possible
to the largest possible interval with n - 2 evaluations left. SucF a place-
ment of points is consistent with our induction hypothesis. It follows,
then, that
F. = sup L. > (15)
64 OPTIMIZATION BY VARIATIONAL METHODS

and combining the inequalities (12) and (15), we obtain the desired result

F. = F.-, + F._2 (2)

Let us note, in fact, that the placement of starting points in Eqs.


(14) is in fact the optimum, for we have always an optimum interval

Lk=(1-E)Fk 2<k<n (16)

and an optimum placing of one point, in the position (1 - E/2)Fk_l.


The procedure is then as follows.
Choose the two starting points symmetrically, a distance
from each end of the interval 0 < x < L, and make each successive obser-
vation at a point which is symmetric in the remaining interval with the
observation which already exists in that interval. After n observations
the minimum will then be located in the smallest interval possible.
One drawback of this procedure is that it requires advance knowl-
edge of the number n of experiments. This is easily overcome by noting
that the sequence defined by Eqs. (2) and (3), known as the Fibonacci
numbers, may be found explicitly by solving the difference equation (2)
by the method outlined in Appendix 1.1. The solution with initial con-
ditions defined by Eq. (3) is

F. +1' +)
2 i/5 2 2 /5 \ 2
(17)

and for large n this is well approximated by

F.
2 -VV/ 5
` 2 J (18)

Thus
2
= 0.618
Fn .-1+V5 (19)

and a near-optimum procedure is to place the first two points symmetri-


cally in the interval 0 < x < L a distance 0.618L from each end and then
procede as above. In this way the interval of uncertainty for the loca-
tion of the minimum can be reduced by a factor of nearly 10,000 with
only 20 evaluations of the function &(x). It is this near-optimum pro-
cedure, sometimes referred to as the golden-section search and indistin-
guishable from the true optimum for n greater than about 5, which is
described and applied in Sec. 2.4.
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION iS

APPENDIX 2.2 LINEAR PROGRAMMING


A large number of problems may be cast, exactly or approximately, in
the following form:
n
Min 8 = c1x1 + C2 t2 + + Cnxn = c;x; (1)
-r
with linear constraints

(2)
-1
x;>0 j= 1,2, . . . n (3)

where the coefficients c; and a;, are constants and the notation (<, =, > )
signifies that any of the three possibilities may hold in any constraint.
This is the standard linear programming problem. We note that if there
should be an inequality in any constraint equation (2), then by defining
a new variable y; > 0 we may put

a,,x; ± y; = b; (4)
-1
Thus, provided we are willing to expand the number of variables by
introducing "slack" and "surplus" variables, we can always work with
equality constraints, and without loss of generality the standard linear
programming problem can be written
n

Min 8 = I c;x; (1)


i-1
n

a;x;=b; i=1, 2, m<n (5)


i-1
Q j= 1, 2, . . . ,n (3)
In this formulation we must generally have m < n to prevent the exist-
ence of a unique solution or contradictory constraints.
The m linear algebraic equations (5) in the n variables it, x2,
xn will generally have an infinite number of solutions if any exist at all.
Any solution to Eqs. (5) satisfying the set of nonnegative inequalities
[Eq. (3)1 will be called a feasible solution, and if one exists, an infinite
number generally will. A basic feasible solution is a feasible solution to
Eqs. (5) in which no more than m of the variables x1, x2, . . . , x;, are
nonzero. The number of basic feasible solutions ii finite and is bounded
from above by n!/[m!(n - m)!], the number of combinations of n varia-
bles taken m at a time. It can be established without difficulty, although
we shall not do so, that the optimal solution which minimizes the linear
66 OPTIMIZATION BY VARIATIONAL METHODS'

form 6 subject to the constraints is always a basic feasible solution.


Thus, from among the infinite number of possible combinations of varia-
bles satisfying the constraints only a finite number are candidates for the
optimum. The simplex method of linear programming is a systematic
procedure for examining basic feasible solutions in such a way that S is
decreased on successive iterations until the optimum is found in a finite
number of steps. The number of iterations is generally of order 2m.
Rather than devoting a great deal of space to the method we shall demon-
strate its operation by a single example used previously by Glicksman.
Let
S = -5x - 4y - 6z (6)
x+y+z<100 (7a)
3x + 2y + 4z < 210 (7b)
3x + 2y < 150 (7c)
x,y,z>0 (7d)
We first convert to equalities by introducing three honnegative slack
variables u, v, w, and for convenience we include S as a variable in the
following set of equations:
x+ y+ z+u = 100 (8a)
3x + 2y + 4z + v = 210 (8b)
3x + 2y +w = 150 (8c)
5x + 4y + 6z +S=0 (8d)
x, y) z, u, v, w > 0 (8e)
The solid line is meant to separate the "convenience" equation (8d)
from the true constraint equations. A basic feasible solution to Eqs.
(8a), (8b), and (8c) is clearly u = 100, v = 210, w = 150, with x, y, and z
all equal to zero, in which case S = 0.
Now, computing the gradient of S from Eq. (6),
8S
8x
-55 8S
8y
-4 8S
5z
- -6 (9)

so that improvement in S can be obtained by increasing x, y, and/or z.


Unlike most steep-descent procedures we choose here to move in only.a
single coordinate direction, and since the magnitude of the gradient in
the z direction is greatest, we shall arbitrarily choose that one. From
.

the point of view.of a general computer program this is simply equivalent


to comparing the coefficients in Eq. (8d) and choosing the most positive.
Since we choose to retain x and y as nonbasic (zero), Eqs. (8a) and (8b)
become
z + u = 100 (10a)
4z + v = 210 (10b)
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 67

(Equation (8c) does not involve z or it, too, would be included.) As


we are interested only in basic feasible solutions, either u or v must go
to zero (since z will be nonzero) while the other remains nonnegative.
If u goes to zero, z = 100 and v = -190, while if v goes to zero, z = 52.5
and u = 48.5. That is,
z = min (109/1,21%) = 52.5 (11)
and v is to be eliminated as z is introduced. Again, the required calcula-
tion for a general program is simply one of dividing the right-hand column
by the coefficient of z and comparing.
The Gauss-Jordan procedure is used to manipulate Eqs. (S) so
that the new basic variables, u, z, and w each appear in only a single
equation. This is done by dividing the second equation by the coefficient
of z, then multiplying it by the coefficient of z in each of the other equa-
tions, and subtracting to obtain an equivalent set of equations. The
second equation is chosen because it is the one in which v appears. The
result of this operation is
34x + 32y + U - 14v = 4712 (12a)
34x+32y+z + %v = 5212 (125)
3x + 2y +w = 150 (12c)
32x + y - 32v + F. = -315 (12d)

The basic feasible solution (x, y, v = 0) is u = 4732, z = 52;2, and


w = 150. From Eq. (12d)
g.= -315-32x-y+%v (13)
so that the value of 3 in the basic variables is -315.
Repeating the above procedure, we now find that the largest posi-
tive coefficient in Eq. (12d) is that of y, and so y is to be the new basic
(nonzero) variable. The variable to be eliminated is found from.exa_m-
ining the coefficients of the basic variables in Eq. (12) in the form
Y =min (477 52,- 150 _ 75
72,

(14)
\72 2
which corresponds to eliminating w. Thus, we now use the Gauss-Jordan
procedure again to obtain y in the third equation only, the result being
-3'zx +u-34v- -%W = 10 (15a)
Z + 34v - 34w 15 (15b)
2x + y + 32w = 75 (15c)
-x -%v-Y2w+g= -390 (15d)
The basic feasible solution is then x, v, w = 0, u = 10, z = 15, y = 75,
a OPTIMIZATION BY VARIATIONAL METHODS

and the corresponding value of & - -390. There are no positive coeffi-
cients in Eq. (15d), and so this is the minimum.. In terms of the original
variables only, then, x = 0, y = 75, z = 15, and only two of the three
original inequality constraints are at equality.
It should be clear from this example how a general computer code
using only simple algebraic operations and data comparison could be
constructed. The details of obtaining the required starting basic feasible
solution for the iterative process under general conditions, as well as other
facets of this extensive field, are left to the specialized texts on the subject.
The interested reader should establish for himself that the one-at-a-time
substitution used in the simplex method is the required result from steep
descent when, instead of the quadratic-form definition of distance,
42 (16)
a
a sum-of-absolute-value form is used,

(17)

Linear programming can be used to define directions of steep descent


in-constrained nonlinear minimization problems by linearizing constraints
and objective at each iteration and bounding the changes in the variables.
The solution to the local linear programming problem will then provide
the values at which the linearization for the next iteration occurs. Since
gradients must be calculated for the linearization, this is essentially
equivalent to finding the weighting. matrix in a constrained optimization.
The MAP procedure referred to in the bib'iographical notes is such a
method.

BIBLIOGRAPHICAL NOTES
Section t..5: Discussions of the convergence properties of the Newton-Raphson and
related techniques may be found in such books on numeripal analysis as
_

C. E. Froberg: "Introduction to Numerical Analysis," Addison-Wesley Publishing


Company, Inc., Reading, Mass., 1965
F. B. Hildebrand: "Introduction to Numerical Analysis," McGraw-Hill Book
Company, New York, 1956

Section 2.4 and Appendix t.1: The derivation of the Fibonacci search used here is
based on one in
R. E. Bellman and S. E. Dreyfus: "Applied Dynamic Programming," Princeton
University Press, Princeton, N.J., 1962
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION a
which contains references to earlier work of Kiefer and Johnson. An alternative approach
may be found in
D. J. Wilde: "Optimum Seeking Methods," Prentice-Hall, Inc., Englewood Cliffs,
N.J., 1964
and C. S. Beightler: "Foundations of Optimization," Prentice-Hall, Inc.,
Englewood Cliffs, N.J., 1967

Sections 2.5 and 2.6: The development of steep descent was by Cauchy, althougA his
priority was questioned by Sarru8:
A. Cauchy: Compt. Rend., 25:536 (1847)
F. Sarrus: Compt. Rend., 25:726 (1848)
Some discussion of the effect of geometry may be found in the books by Wilds cited above;
see also

C. B. Tompkins: in E. F. Beckenbach (ed.), "Modern Mathematics for the Engineer,


vol. I," McGraw-Hill Book Company, New York, 1956
T. L. Saaty and J. Bram: "Nonlinear Mathematics," McGraw-Hill Book Company,
New York, 1964

Section 9.8: The most powerful of the procedures for computing the weighting matrix
is probably a modification of a method of Davridon in

R. Fletcher and M. J. D. Powell: Computer J., 6:163 (1963)


G. W. Stewart, III: J. Adsoc. Comp. Mach., 14:72 (1967)
W. I. ZangwilI: Computer J., 10:293 (1967)
This has been extended to problems with constrained variables by
D. Goldfarb and L. Lapidus: Ind. Eng. Chem. Fundamentals, 7:142 (1968)
Other procedures leading to quadratic convergence near the optimum are discussed in
the books by Wilde and in
H. H. Rosenbrock and C. Storey: "Computational Techniques for Chemical Engi-
neers," Pergamon Press, New York, 1966
Many of the standard computer codes use procedures in which the best direction for
descent at each iteration is obtained as the solution to a linear programming problem.
The foundations of such procedures are discussed in
G. Hadley: "Nonlinear and Dynamic Programming," Addison-Wesley Publishing
Company, Inc., Reading, Mass., 1964

One such technique, known as MAP. (method of approximation programming) is developed


in
R. E. Griffith and R. A. Stewart: Management Sci., 7:379 (1961)
MAP has been applied to a reactor design problem in
C. W. DiBella and W. F. Stevens: Ind. Eng. Chem. Process Design Develop., 4:16
(1965)
70 OPTIMIZATION BY VARIATIONAL METHODS

Section 2.9: The references cited for Sec. 2.8 are pertinent for approximate procedures
as well, and the texts, in particular, contain extensive references to the periodical
literature. The simplex-complex procedure described here is a simplified version
for unconstrained problems of a powerful technique for constrained optimization
devised by
M. J. Box: Computer J., 8:42 (1965)
It is an outgrowth of the more elementary simplex procedure, first described in this sec-
tion, by
W. Spendley, G. R. Hext, and F. R. Himaworth: Technometrics, 4:441 (1962)

Appendix 2.2: A delightful introduction to linear programming at a most elementary


level is
A. J. Glicksman: "Introduction to Linear Programming and the Theory of Games,"
John Wiley & Sons, Inc., New York, 1963.
Among the standard texts are
G. B. Dantzig: "Linear Programming and Extensions," Princeton University Press,
Princeton, N.J., 1963
S. I. Gass: "Linear Programming: Methods and Applications," 2d ed., McGraw-Hill
Book Company, New York, 1964
G. Hadley: "Linear Programming," Addison-Wesley Publishing Company, Inc.,
Reading, Mass., 1962
Linear programming has been used in the solution of some optimal-control problems; see
G. I)antzig: SIAM J. Contr., A4:56 (1966)
G. N. T. Lack and M. Enna: Preprints 1987 Joint Autom. Contr. Conf., 474
H. A. Lesser and L. Lapidus: AIChE J., 12:143 (1966)
Y. Sakawa: IEEE Trans. Autom. Contr., AC9:420 (1964)
H. C. Torng: J. Franklin Inst., 278:28 (1964)
L. A. Zadeh and B. H. Whalen: IEEE Trans. Autom. Contr., AC7:45 (1962)
An e..:ensive review of applications of linear programming in numerical analysis is
P. Rabinowitz: SIAM Rev., 10:121 (1968)

PR03LEMS
2.1. Solve Prob. 1.3 by both Newton-Raphson and Fibonacci search for the following
values of parameters:

K=3 a=1 x0=0.05 C


P
- 0.01

2.2. Obtain the optimal heat-removal rate in Prob. 1.4 by the Fibonacci search method,
solving the nonlinear equation for T at each value of Q by Newton-Raphson for the
following rate and parameters:
p (_ 40,000
r = 2.5 X 101 exp '0"') 2.0 X 10Tex\ c

J = 10' B = 10-2
OPTIMIZATION WITH DIFFERENTIAL CALCULUS: COMPUTATION 71

2.3. Derive Eqs. (7) and (8) of Sec. 2.2 for the multidimensional Newton-Raphson
method.
2.4. The following function introduced by Roscnbrock is frequently used to test
computational methods because of its highly curved contours:
& = 100(x2 - y')2 + (1 - x)2
Compare the methods of this chapter and that of Fletcher and Powell (cited in the
bibliographical notes) for efficiency in obtaining the minimum with initial values
x = -1, y = -1.
Solve each of the following problems, when appropriate, by steep descent, Newton-
Raphson, and the complex method.
2.5. Solve Prob. 1.7 numerically and compare to the exact solution.
2.6. Using the data in Prob. 1.5, find the coefficients a and t4 which minimize both the
sum of squares of deviations and the sum of absolute values of deviations. Compare
the former to the exact solution.
2.7. The annual cost of a heavy-water plant per unit yield is given in terms of the flow
F, theoretical stages N, and temperature T, as
300F + 4,000NA + 80,000
18.3(B - 1)
where

A =2+3exp(16.875- T)
\ 14.4
4(1 - s)
B =
0.6(1 - $)(c 8 - 1) + 0.40

0 =a (as)N+t+0-1
F
0
1,400

a = exp (508 - 0.382)


Find the optimal conditions. For computation the variables may be bounded by
250 <- F < 500
1 <N < 20
223 < T < 295

(The problem is due to Rosenbrock and Storey, who give a minimum cost of S =
1.97 X 10'.)
2.8. Obtain the optimal heat-removal rate in Prob. 1.4 by including the system equa-
tion for temperature in the objective by means of a penalty function. The parameters
are given in Prob. 2.2.
2.9. Using the interpretation of Lagrange multipliers developed in See. 1.15, formulhte
a steep-descent algorithm for multistage processes such as those in See. 1.13 in which
it is not necessary to solve explicitly for the objective in terms of the stage decision
variables. Apply this algorithm to the example of Sec. 2.7.
72 OPTIMIZATION BY VARIATIONAL METHODS

2.10. Solve the following linear programming problem by the simplex method:
min E = 6x1 + 2x2 + 3x3
30x3 + 20x2 + 40x3 2!34
x1+x2+x2=1
10x, + 70x2 < 11
X1, x2, x3 > 0
Hint: Find a basic feasible solution by solving the linear programming problem
min & = z1 + z2
30x1 + 20x2 + 40x3 - w1 + 21 a 34
X1 + x7 + x$ + Z2 - 1
10x1 + 70x2 + w3 - 11
xl, x2, x3, w1, W3, zl, ZS > 0
You should be able to deduce from this a general procedure for obtaining basic feasible
solutions with which to begin the simplex method.
2.11. Formulate Prob. 1.4 for solution using a linear programming algorithm iteratively.
3
Calculus of Variations

3.1 INTRODUCTION
Until now we have been concerned with finding the optimal values of a
finite (or infinite) number of discrete variables, x1, x2, . . . , x,,, although
we have seen that we may use discrete variables to"approximate a func-
tion of a continuous variable, say time, as in the optimal-ebnitrol problem
considered in Sees. 1.4 and 1.6. In this chapter we shall begin consider-
ation of the problem of finding an optimal function, and much of the
remainder of the book will be devoted to this task. The determination
of optimal functions forms a part of the calculus of variations, and we
shall be concerned in this first chapter only with the simplest problems
of the subject, those which can be solved using the techniques of the
differential calculus developed in Chap. 1.

3.2 EULER EQUATION


Let us consider a continuous differentiable function x(t), where t has the
range 0 < t < 0, and a function 9 which, for each value of t, depends
73
74 OPTIMIZATION BY VARIATIONAL METHODS

explicitly on the value of x(t), the derivative ±(t), and t; that is,
5 = a(x,x,t) (1)

For each function x(t) we may then define the integral


s[x(t)] = f oB S(x,x,t) dt (2)

The number &[x(t)] depends not on a discrete set of variables but on an


entire function, and is commonly referred to as a functional, a function
of a function. We shall seek conditions defining the particular function
2(t) which causes s to take on its minimum value.
We shall introduce an arbitrary continuous differentiable function
n(t), with the stipulation that if x(0) is specified as part of the problem,
then n(0) must vanish, and similarly if x(6) is specified, then 17(6) must
vanish. If we then define a new function
x(t) = x(t) + en(t) (3)

we see that &[X(t) + s, (t)] depends only on the constant e, since 2(t) and
n(t) are (unknown) specified functions, and we may write
t`i(E) = 10 ff(x + En, ± + En, t) dt (4)

Since 2 is the function which minimizes 9, it follows that the function


s(e) must take on its minimum at e = 0, where its derivative must vanish,
so that
d&
T,.-0 = 0 (5)

We can differentiate under the integral 'sign in Eq. (4), noting that
dT al; dx Off dx
de = T. de + ax ae
(6)
ax '1 + ax ''
and
dP-
f I dt = 0 (7)
de Jo [()x_5 n +
We integrate the second term by parts to obtaint
e 496 . 85 0 e d ag
(8)
fo azndt = axnlo - fo n dtaxdt
t Note that we are assuming that off /a± is continuously differentiable, which will not
be the case if t(t) is not continuous. We have thus excluded the possible cases of
interest which have cusps in the solution curve.
CALCULUS OF VARIATIONS 75

where we have dropped the notation ( )Z_, for convenience but it is


understood that we are always evaluating functions at x. We thus have,
from Eq. (7),
d c?aff

10 (ax - it ai} ,(t) dt + z't 10


0 (9)

for arbitrary functions ,7(t).


If x(0) is specified, then since ,1(0) must be zero, the term (af/ax),,
must vanish at t = 0, with an identical condition at t = 0. If x(0) or
x(0) is not specified, we shall for the moment restrict our attention to
functions 71(t) which vanish, but it will be necessary to return to con-
Thus, for arbitrary differentiable
sider the effect of the term"(aff/ai),q f o.
functions ,t(t) which vanish at t = 0 and B we must have

Jo ax it ax)t)dt = 0 (10)

If we choose the particular function

n(t) = w(t)
aT - d af
(TX dt
where w(0) = w(0) = 0 and w(t) > 0, 0 < I < 0, we obtain
z

ax ax) dt = 0
w(t) (12)
Io ax
The only way in which an integral of a nonnegative function over a
positive region can be zero is for the integrand to vanish idlintically,
and so we obtain
of da9=0 0<t<0 (13)
ax dt ax
which is known as Euler's differential equation.
If we now return to Eq. (9), we are left with

axy (ax n1 =
,_o
0
(14)
If x(9) is not specified, ,,(B) need not be zero and we may choose a func-
tion such that
n(0) = E1 I- (15)

where el > 0 is nonzero only when x(0) is not specified. Similarly, if


x(0) is not specified, we may choose n(t) such that
Carl (16)
az ,_o
76 OPTIMIZATION BY VARIATIONAL METHODS

Thus, for these particular choices, we have


a5 (a5)=
0
E'
1

ax 1 + f= ax e_o - (17)

and it follows that a5/ax must vanish at an end at which x(t) is not
specified.
We have thus obtained a condition for the minimizing function
equivalent to the vanishing of the first derivative for the simple calculus
problem: the optimal function 2(t) must satisfy the Euler second-order differ-
ential equation
d a5 a25 a=5 a2S a5 ( )
x 18
at ax = at ax + ax ax x + ax= = ax
with the boundary conditions

At t = 0: x(0) specified or az = 0 (19a)

At t = 0: x(0) specified or a=0 (19b)

It is sometimes convenient to use an alternative formulation for the


Euler equation. We consider
d a5 _ as aaxs . asax as d as
5
dt - i ax) at + + x - x ai - x at az (20)

and substituting Eq. (18),


d
(21)
T (5 - x ax) a
if if does not depend explicitly on t, then a5/at is zero and we may inte-
grate Eq. (21) to obtain a first integral of the Euler equation
if - x ax = const (22)

Finally, if we seek not one but several functions xl(t), xs(t), ... ,
. . . ,x,,,il,tt, . . . ±.,t), then the Euler
x (t) and if is a function 5(xl,x=,
equations (13) and (18) are written
d a5 as
I ax; - ax; = 0 i = 1, 2, . . . , n (23)

- x,a- =.Const (24)

It should be noted that the Euler equation is obtained from the


necessary condition for a minimum, the setting of a first derivative to
CALCULUS OF VARIATIONS 77

zero. Just as in the simple problems of differential calculus, we cannot


distinguish from this condition alone between maxima, minima, and other
stationary values. We shall refer to all solutions of the Euler equation
as extremals, recognizing that not all extremals will correspond to minima.

3.3 BRACHISTOCHRONE

Solutions of Euler's equation for various functions yield the answers to


many problems of interest in geometry and mechanics. For historical
reasons we shall first consider a "control" problem studied (incorrectly}
by Galileo and later associated rith the Bernoullis, Newton, Leibniz, and
L'Hospital. We assume that a particle slides along a wire. without fric-
tional resistance, and we seek the shape, or curve in space, which will
enable a particle acted upon only by gravitational forces to travel between
two points in the minimum time. This curve is called a brachistochrone.
The system is shown in Fig. 3.1, where m is the.particle mass, 8 is
are length, t time, g the acceleration due to gravity, x the vertical coordi-
nate, and z the horizontal. The speed is dx/dt, and the acceleration
d2x/dt2. A force balance at any point along the curve then requires
d
md=mg sin /x2 «
dx
mgt (1)

Dividing out the common factoi m and multiplying both sides by ds/dt,
we find
1d ds 2 dx
2 dt dt) - g dt (2)

or, integrating,
ds
dt
= v2g(x - c) (3)

X,

!-arc length

X2

Fly. 31 Motion of a particle under the mg


influence of gravity along a frictionless
wire between two points.
7$ OPTIMIZATION BY VARIATIONAL METHODS

where the constant c is determined from the initial height and velocity as

C
(d),
= x(0) - 2- o (4)
We note that
ds2 = dx2 + dz2 (5)
or

ds = 1+
()2 dz (6)

Thus, substituting Eq. (6) into (3) and integrating, we obtain the totq l
time T to travel the path
2g T = 1 + (dx/dz) 2
dz (7)
I-=
- V x-c
This is the integral to be minimized.
Identifying Eq. (7) with our notation f Sec. 3.2, we find
+ x2
1 (8)
x - c

Because S is independent of t (or, as we have used it, z) we may use the


first integral of the Euler equation [Eq. (22) of Sec. 3.2], which becomes

1 = const = (1 (9)
(x-c)(1+x2) 2b

Equation (9) is a first-order differential equation for x, with the con-


stant b to be determined from the boundary conditions. If we presume
that the solution is available and b is known, the control problem is
solved, for we have an expression for the steering angle, arctan z, as a
function of the height x. That is, we have the relation for an optimal
nonlinear feedback controller. We do seek the complete solution, how-
ever, and to do so we utilize a standard trick and introduce a new varia-
ble i' such that
sins'
x (10)
1 +cos
It then follows from Eq. (9) that
x=c+b(1+cosr) (11)
and
dz _ dz dx
dr dx dr
= b(1 + cos t) (12)

or
z = k+b(1-+sinr) (13)
CALCULUS OF VARIATIONS 79

Equations (11) and (13) are the parametric representations of the


curve of minimum time, which is a cycloid. This curve daps out the
locus of a point fixed on the circumference of a circle of radius b as
the circle is rolled along the line x = c, as shown in Fig. 3.2. When the
boundary conditions are applied, the solution is obtained as
cosh
x=x1+(x2-xl) 1 - COS tf
0< <rf (14a)

r+sini'
z=zl'+(z2-zl) _f+sinl'f 0<<3'f (14b)

where 3'f is the solution of the algebraic equation


1 x2-xi
+ sin if z2 - zI
The; existence of such solutions is demonstrated in texts on the calculus
of variations.

3A OPTIMAL LINEAR CONTROL


As a second and perhaps more useful example of the application of the
Euler equation we shall return to the optimal-control problem formu-
lated in Sec. 1.4. We have a system which varies in time according to
the differential equation
i = Ax + w (1)
where A is a constant and w a function of time which we are free to
choose. The optimal-control problem is to choose w(t) so as to minimize
the integral
C
r (x2 + c2w2) dt (2)
2 Jo

I1 IZ

X=C

Fig. 3.2 Construction of a cycloid.


so OPTIMIZATION BY VARIATIONAL METHODS

If. we solve for w in Eq. (1) and substitute into Eq. (2), we have
g jo [x2 + c2(± - Ax)2] dt (3)
or
if = Y2[x2 + c2(± -- Ax)2] (4)

The Euler equation is then


dc2(Z-Ax) =x-Ac2(t-Ax) (5a)

or
(5b)
c (1 + A2c2)x
x(O) is given. Since x(O) is not specified, the second boundary condition
is obtained. from a9/ft = 0 at t = B, or
x-Ax=O att=B (6)
Equation (5) may be solved quite easily to obtain x(t), and hence
i(t) and the optimal control policy w(t). For purposes of implemen-
tation, however, it is convenient to have a feedback controller, where
w is a function of the present state x, for such a result will be independ-
ent of the particular initial condition x(0). Motivated by the results
obtained in Chap; 1, we seek a solution in the form of a proportional
controller
w(t) = M(t)x (7)
and Eq. (1) becomes
x = (A + M) x (8)
Differentiating once, we have
.t = Az + Mt + Mx = Mx + (A + M) 2x (9)
and this must equal the right-hand side of Eq. (5b). For x 96 0, then,
M must satisfy the Riccati ordinary differential equation
M + 2AM + M2 - =0 (10)

with the.boundary condition from Eq. (6)


M(0) = 0 or x(9) = 0 (11)

The Riccati equation is solved by use of the standard transformation


M(t) = y(t) (12)
y(t)
CALCULUS OF VARIATIONS 91

with the resulting solution


1 + A2C2 1 - k exp [2 (1 + A2c2)/c2 t]
11 (l) _ - A +
4 c2 1 + k exp [2 1/(1 + A2c2)/c2 t]}
(13)

where the constant k is determined from the condition M(B) = 0 as


k
= V/1 + A2c2 - Ac eXp 2 J1 +A2c2 e (14)
2C2 +Ac (_
In the limit as oo we have k --, 0, and M becomes a constant

M -(A+ 0
/1 +A2c2) (15)

which is the result obtained in Sec. 1.5. Here we do not satisfy M(B) = 0
but rather x(B) = 0. We have again found, then, that the optimal con-
trol for the linear system with quadratic objective function is a proportional
feedback control, with the controller gain the solution of a Riccati equation,
and a constant for an infinite control time.

3.5 A DISJOINT POLICY


It sometimes happens that a problem can be formulated so that the inte-
grand is independent of the derivative, z; that is,
&[x(t)] = fo 5(x,t) dt (1)

The Euler equation then reduces to

0 = ax (2)

In this case, however, a much stronger result is easily obtained without


any considerations of differentiability or constraints on x.
If t(t) is the function which minimizes &, then for all allowable
functions x(t)
f 5[x(t),t] dt - fo 3[x(t),t] dt < 0 (3)

In particular we shall take x(t) different from x(t) only


interval t, < t < t, + 0, for arbitrary ti. We thus have

and using the mean-value theorem,


5[x(t),t]l < 0
112 OPTIMIZATION BY VARIATIONAL METHODS

where t is somewhere in the interval ti < t < ti + A. Dividing by A


and then letting A go to zero, we find
ff[z(t),t] < Y[x(t),t] (6)
for all t, since tl was arbitrary. That is, to minimize the integral in Eq.
(1) choose x(t) to minimize the integrand at each value of t. Such an opti-
mal policy is called disjoint.
An immediate-application arises in the design and control of a batch
chemical reactor in which a single reaction is taking place. If, for sim-
plicity, we suppose that the reaction is of the form
n,A ± n2B (7)

that is, A reacting to form B with' some reverse reaction, then since there
are no inflow and outflow streams in a batch reactor, the conservation of
mass requires that the rate of change of the concentration of A, denoted
by a(t), be equal to the net rate of formation of A. That is
d = -r(a,b,u) (8)
where b is the concentration of B and u is the temperature. Since the
amount of B present is simply the initial amount of B plus the amount
formed by reaction, we have

b(t) = b(0) + [a(O) a(t)] (9)


n=
where n,/n2 represents the number of moles of B formed from a mole of A.
Thus, we can write
d = -r(a,u) (10)

where it is understood that a(O) and b(0) enter as parameters.


Suppose now that we wish to achieve a given conversion in the
minimum time by a programmed control of the reactor temperature u(t).
We can solve Eq. (10) formally for the total time as

f a(o)
ace)
da =9
r(a,u) -
(11)

We wish to choose u(t) to minimize 9, and we note that since a(t) will be
a monotonic function of time, when the solution is available, it will be
sufficient to know a if we wish to know the time t. Thus, we may con-
sider u as a function of a and write

F[u(a),a] = r(Q u) (12)

Since r > 0, we shall minimize 1/r by maximizing r. The optimal policy


CALCULUS OF VARIATIONS $3

is then to choose the temperature at each instant to maximize the instanta-


neous rate of reaction. This result is probably intuitively obvious.
The kinetics of the reaction will usually be of the form
/
r(a,b,u) = plan, exp 1 -F'\
/\ uJ1/ } - p2b", exp\ [ -u' 1
plan, exp -ll - p2 (bo
u / 12 ao
n,
- 12 a)n, exp 1 u2

(13)
An internal maximum cannot occur when E,' > Eg (an endothermic reac-
tion), since r > 0, and the maximum is attained at the highest value of u
allowable. This, too, is perhaps intuitively obvious. The only case of
interest, then, is the exothermic reaction, E2 > E.
The maximum of Eq. (13) can be found by differentiation to give
U
Ei - E; (14)
In p,E' [b(0) + (n2/nI)a(0) - (n2/n,)alnt

(PlEf a n,

By differentiating Eq. (14) we obtain


du u2
>0 (15)
da nl(E'2? E>) 'na g + b 2
and since a(t) decreases in the course of the reaction, the optimal tem-
perature is monotone decreasing.
There will, in general, be practical upper and lower limits on the
temperature which can be obtained, say u* < u < u*, and it can be seen
from Eq. (14) that a small or zero value of b(0) may lead to a tempera-
ture which exceeds u*. The starting temperature is then
E2' E1'
u = min u*, (16)
In p:E2
p1Ei b(Q)n'l
a(0)n'J
If the starting temperature is u*, this is maintained until Eq. (14) is
satisfied within the allowable bounds. In a similar way, the tempera-
ture is maintained at u* if the solution to Eq. (14) ever falls below this
value. The optimal temperature policy will thus have the form shown
in Fig. 3.3. The solution for the concentration a(t) is obtained by sub-
stituting the relation for the optimal temperature into Eq. (10) and inte-
grating the resulting nonlinear ordinary differential equation. The solu-
tions have been discussed in some detail in the literature.
The monotonicity of the function a(t) means that the policy which
provides the minimum time to a given conversion must simultaneouilly.
provide the maximum conversion for a given time. This must be tom,
$4 OPTIMIZATION BY VARIATIONAL METHODS

U*

U(f)

U*

0 Fig. 3.3 Optimal temperature schedule


in a batch reactor.

for were it possible to reach a smaller value of a(8) [larger conversion


a(O) - a(9)] in the same time 9, the required value of a(O) could have
been reached in a shorter time, which contradicts the assumption of a
minimum-time solution. Since Eq. (10) can be applied to chemical
reaction in a pipeline reactor in which diffusion is negligible when t is
interpreted as residence time in the reactor, we have thus obtained at
the same time the solution to the following important problem: find the
temperature profile in a pipeline reactor of fixed length which will maximize
the conversion.

3.6 INTEGRAL CONSTRAINTS


In applications of the calculus of variations it frequently happens that
the integral.
8[x(t)] = fo ff(x,x,t) dt (1)

must be minimized subject to a so-called isoperimetric constraint,


fa G(x,x,l) dt - b = 0 (2)

We can obtain a formal solution to this problem in the following way.


Let x(t) be the solution, and take
x(t) = x(t) + E1171(t) T E2, 2(t) (3)

where El and t2 are constants and 771(t) is an arbitrary differentiable func-


tion. For a given 'n1, E1, and e2, the function 172 will be determined by
Eq. (2). The minimum of 8 subject to the constraint must then occur
at E1 = E2 = 0, and we may write
6(E1,E2) = foe 5(2 + E1711 + E2172, ± + E1711 + E2712, t) dl (4)
B

g(E1,E2) = f4 G(x + E1771 + E2712, + E17l1 + E27f2, t) dt - b = 0 (5)

The minimum of 8 subject to the constraint equation (5) is found by


CALCULUS OF VARIATIONS $5.

forming the lagrangian


2(il,t2) = a((1,E2) + Xg(el,E2) (6)
and setting partial derivatives with respect to el and E2 equal to zero at
el = E2 = 0, we obtain
(9 + xG) - at ai (ff +
Io rax
L
XG)] qi(t) dl + (a + XG)n1 o = 0
(7a)

(5- + XG) -
at ax (if +
AG)J 'n2(t) dt +
fo c3x az (if + XG)172 to = 0
(7b)
Since ill(t) is arbitrary, we obtain, as.in Sec. 3.2, the Euler equation
d (9 =0
dcai(g+XG) -a (if+XG)
(8)

with the boundary conditions


At t = 0: x(0) given or a (if + XG) = 0 (9a)

At t = 0: x(0) given or ax (if + XG) = 0 (9b)

The constant Lagrange multiplier X is found from Eq. (2): For n "func-
tions xl(t), x2(t), ... , x,,(t) and m constraints

fo° G.(xl,x2, . ,xn)xl>x2, . . . ,2n,t) dt -- bi = 0


i = 1, 2, . . . ,m<n (10)
The Euler equation is
d
d a" (Y + j-1 axe
+ I a1G;) = 0
j-1
k=1,2,...,n (11)
It is interesting to note that the Euler equation (8) is the same
as that which would be obtained for the problem of extremalizing
Jo G(x,x,t) dt subject to the constraint j 0if(x,z,t) dt = const (after
dividing by the constant 1/X). This duality is of the same type as that
found in the previous section for the minimum-time and maximum-
conversion problems.

3.7 MAXIMUM AREA


The source of the name isoperimetric is the historically interesting prob-
lem of finding the curve of fixed length which maximizes a given area.
86 OPTIMIZATION BY VARIATIONAL METHODS

If, in particular, we choose the area between the function x(t) and the
axis x = 0 between t = 0 and t = 0, as shown in Fig. 3.4, for a curve x(t)
of fixed are length L, we then seek to minimize
E = - area = - fa x(t) dt (1)
with
Are length = 0
(1 + ±2)3`' dt = L (2)

The Euler equation (8) of Sec. 3.6 is then

Y7 [-X + X (j + x2),1] = [-x + X (I + ±2)%] (3)


dt ax
or
d i
dt 1 -+±2 = -1 (4)

This has the solution


(x-E)2+(t-r)2=X2
(5)
which is the are of a circle. The constants t, r, and X can be evaluated
from the boundary conditions and Ea. (2).

3.8 AN INVERSE PROBLEM


Th,, second-order linear ordinary differential equation
at p(t)x + h(t)x + f(t) = 0 (1)

appears frequently in physical problems, and, in fact, an arbitrary linear


second-order equation
a(t)y + b(t)y + c(t)y + d(t) = 0 (2)
may be put in the self-adjoint form of Eq. (1) by introducing the cnange
A

Fig. 3A Area enclosed by a curve of


length L and the t axis.
CALCULUS OF VARIATIONS $7

x(t) = y(t) exp (!u b a a dt (3)


\\\

Because of the wide application of Eq. (1) it is of interest to determine


under what. conditions it corresponds, to the Euler equation for some
variational problem.
We shall write h(t) and f (t) as
h(t) = t(t) - q(t) f(t) = m(t) - n(t) (4)
so that Eq. (1) can be rewritten
d [p(t)t + r(t)x + m(t)] = r(t)± + q(y)x + n(t) (5)

This will be of the form of an Euler equation


doff of (6)
dt ax ax
provided that
497
= p(t)t + r(t)x + m(t)
ax

TX
= r(t)x + q(t)x + n(t) (7b)

Integrating, from Eq. (7a)


3F = Y2[p(i)x= + 2r(t)xx + 2m(t)x] + arbitrary function of x (8a)
while from (7b)
3[q(t)x2 + 2r(t)x-4- + 2n(t)x] + arbitrary function of x (8b)

so that, within an additive constant,


%[p(t)x= + 2r(t)x± + q(t)x!] + m(t)± + n(t)x (9)

with the boundary conditions then becoming


At t = 0 and B: x fixed or px + rx + m = 0 (10),

As shown in the problems, these boundary conditions may be generalized


somewhat.
In this next section we shall discuss an approximate method for
determining stationary values for the integral fo ff(x,x,t) dt, and as a
consequence of the result of this section, such a procedure can always be
applied to the solution of a linear second-order differential equation.
U OPTIMIZATION BY VARIATIONAL METHODS

3.9 THE RITZ-GALERKIN METHOD


The Ritz-Galerkin method is an approximate procedure for the solution
of variational problems. Here a particular functional. form is assumed
for the solution with only parameters undetermined, and the integral
(x,z,t) dt (1)
fII
which now depends only on the parameters, is minimized (or more gener-
ally, made stationary) by choice of the parameters. Suppose, for exam-
ple, we seek an approximate solution of the form
N
x(t) 4,o(t) + I C.44.(t) (2)
x-1

where ko satisfies any nonzero boundary specifications and the l4. are
members of a complete set which vanish at any boundary where x(t) is
specified. The minimum of S is then found approximately by choosing
the coefficients C1, C29. . . , CN.
Substituting Eq. (2) into Eq. (1), we obtain
6 x f0 5(#o + FC.O., Oo + t) dt (3)
S is now a function of C1, Cs, . . . , CN, and upon differentiating K with
respect to each of the C. and setting the result to zero

aC
f ax + ax '*) dt = 0 n = 1, 2, ... , N (4)

Here the subscript ap refers to the fact that the partial derivatives are
evaluated for the approximate solution. Equation (4) leads to N alge-
braic equations for the N coefficients after the integration is carried out,
and is generally known as the Ritz method.
An alternative form which requires less computation when dealing
directly with the solution of Euler differential equations is obtained by
integrating Eq. (4) by parts. Thus
d
fo k az d:
ax) 4,.(t) dl + az no (5)

and since either 4.. is zero (x specified) or 8ff/81 vanishes (natural bound-
ary condition), we obtain, finally, the Galerkin form
0 BS.D dW.P n= 1,2, ... N
fo ax ' dt ea~ } ¢.(t)dt=0 (6)

Since an exact solution would satisfy the Euler equation,


8R d 8R = 0
(7)
.3x - dt 8
CALCULUS OF VARIATIONS

we may consider the quantity in parentheses in Eq. (6) as the residual at


each value of t which remains when an approximation is used in the left-
hand side of the differential equation. Thus, writing
a9;., d as:.P
ax - it ax _ R(C, C2, ... Cm,t) (8)

Galerkin's method may be written


foe R(C2,C2, . . . ,Cjv,t)4,.(t) dl = 0 n= 1, 2, ... , N (9)

This procedure is often used for the approximate solution of differential


equations which are not Euler equations but without the rigorous justi-
fication provided here.
For demonstration, let us consider the approximate solution of the
nonh6mogeneous Bessel equation
d =
(t d + tx - t = 0 x(0) finite x(1) 0 (10)
wt-
)
Comparing with Eq. (1) of Sec. 3.8 this corresponds to
p(t) = t q(t) = -t r(t) = 0 m(t) = 0 n(t) = 1 - (11)
and provided that t(0) remains finite, the boundary conditions of Eq.
(10) of Sec. 3.9 are satisfied, the condition at t = 0 being the natural
boundary condition.
We shall use for approximating functions
n=1,2, . . . , N (12a)
which vanish at t = 1 and remain finite at t = 0. A function yf'e is not
needed. Thus, substituting the approximation for x(t) into the left-hand
side of Eq. (10), the residual is written

R(C,,C2, ... ,CX,t)


I C. dt [t to-'(1 - t)+ tn(1 - t)t
dt JJ 1

(12b)
and the coefficients are found by solving the linear algebraic equations

fo
2
R(C,,C2, .. ,CN,t)tn-'(1 - t) dt = 0
n = 1, 2, . . . , N (13)
In particular, for a one-term approximation
N = 1: R(C,,t) = -C, + (C2 - 1)t - Cit2 (14)
and the solution of Eq. (13) is C, = -0.4, or
N = 1: x(t) ;; -0.4(1 --- t) (15)
OPTIMIZATION BY VARIATIONAL METHODS

Table 3.1 Comparison of the first-


and second-order approximations by the
Ritz-Calerkin method to the exact solution

- x(t)

t N=1 N-2 Exact

0.0 0.40 0.310 0.307


0.1 0.36 0.304 0.303
0.2 0.32 0.293 0.294
0.3 0.28 0.276 0.277
0.4 0.24 0.253 0.255
0.5 0.20 0.225 0.226
0.r) 0.16 0.191 0.192
0.7 0.12 0.152 0.151
0.8 0.08 0.107 0.106
0.9 0.04 0.056 0.055
1.0 0.0 0.0 0.0

j, 'or a two-term approximation,


N = 2: R(C1,C2,t) = (C2 - C1) + (C1 - 4C2 - 1)t
+ (C2 - C1)t2 - C2 t3 (16)
and Eq. (13) becomes simultaneous algebraic equations for C1 and C2,
with solutions C1 = -0.31, C2 = -0.28, or
N = 2: x(t) ..- -0.31(1 - 1) - 0.281(1 - t) (17)
The exact solution is
Jo(t)
x (t) = 1 _ (18)
Jo(1)

where Jo is the Bessel function of zero order and first kind. Table 3.1
compares the two approximations to the exact solution.

3.10 AN EIGENVALUE PROBLEM


The homogeneous equation
d
(t dt) + Xtx = 0 (1)
d1

with boundary conditions


x(0) finite x(1) = 0 (2)
clearly has the trivial solution x(t) = 0, but for certain eigenvalues, or
characteristic numbers, X, a nontrivial solution can be obtained. These
CALCULUS OF VARIATIONS 11

numbers are solutions of the equation


Jo (.\/-X) = 0 (3)
and the first two values are X1 = 5.30, X2 = 30.47.
The Ritz-Galerkin method may be used to estimate the first several
eigenvalues. If we again use the approximation
N
x(t) - I 0 (4)
n-1
then
N = 1: R(C1it) = -C1 + xC1t - C1t2 (5)

and Eq. (13) of Sec. 3.9 becomes


N= 1: C1(X-G) =0 (6)

Since if C1 vanishes we obtain only the undesired trivial solution, we can


satisfy Eq. (6) only by
X=6 (7)
Thus, by setting N = 1 we obtain an estimate of the first (smallest)
eigenvalue.
Similarly, for a two-term expansion
N = 2: R(C1iC2,t) = (C2 - C1) + (XC1 - 4C2)t
+ X(C2 - C1)t2 - )C2t3 (8)
and we obtain the two equations
C1(5X - 30) + C2(2X - 10) = 0 (9a)
C1(2X - 10) + C2(X - 10) = 0 (9b)
These homogeneous equations have a nontrivial solution only if the
determinant of coefficients vanishes, or X satisfies the quadratic equation
(5X - 30) (X - 10) - (2X - 10) 2 = 0 (10)
The two roots are
X1=5.86 A2=34.14
giving a (better) estimate of the first eigenvalue and an initial estimate
of the second. In general, an N-term expansion will lead to an Nth-order
polynomial equation whose roots will approximate the first N eigenvalues.

3.11 A DISTRIBUTED SYSTEM


In all the cases which we have considered thus far the Euler equations
have been ordinary differential equations. In fact, the extension of the
92 OPTIMIZATION BY VARIATIONAL METHODS

methods of this chapter, which are based only on differential calculus,


to the study of distributed systems is straightforward, and this section
and the next will be devoted to typical problems. We shall return to
such problems again in Chap. 11, and there is no loss in continuity in
skipping this section and the next until that time.
Let us suppose that x is a function of two independent variables,
which we shall call t and z, and that x(t,z) is completely-specified when
t = 0 or 1 for all z and when z = 0 or 1 for all t. We seek the function
x(t,z) which minimizes the double integral

g[x(t,z)l = f o1 Jo [
()2

+
(+ l
0(x) i dt dz (1)

where O(x) is any once-differentiable function of x.


If we call 2(t,z) the optimum, we may write
x(t,z) _ 2(t,z) + E,t(t,z) (2)

where a is a small number and n(t,z) is a function which must vanish at


t = 0 or 1 and z = 0 or 1. For a particular function n the integral 8
depends only on a and may be written
1 1 1 (a-f an 2 1 az an 2
8(e) = 0 fo
2 at +Eat) +(az +eaz)
+ 4(2 + 2)] dt dz (3)
The minimum of 8 occurs when e = 0 by the definition of 2, and at e = 0
the derivative of 8 with respect to a must vanish. Thus,
d3 r1 1 ax an + ax an + 0'(2)n dt dz = 0 (4)

a
at at f o n
dt (5)

and the first term vanishes by virtue of the restrictions on n. A similar


integration may be carried out on the second term with respect to z, and
we obtain
1 a22 ,(2)]
f
a2z
(1
!o o - ate W2 + n(t,z) dt dz = 0 (6)

Since n is arbitrary except for the boundary conditions and obvious differ-
entiability requirements, we may set
z- 2-
n(t,z) = w(t,x)
1
- ate azz + 0'(x)] (7)
CALCULUS OF VARIATIONS u
where
w(0,z) = w(l,z) = tv(t,O) = w(t,1) = 0
(8)
w(t,z) > 0 for t, z * 0, 1
Thus,
Jol z- 2- 2
fr

Jol
((
w(t,z)
at2 + az2 - '(z) dl dz = 0 (9)

and it follows that x must satisfy the Euler partial differential equation
491X
2

x specified at
t = 0, 1
ate + az2 -
(x) = 0
z=0,1 (10)

The partial differential equation


IX 49 2

ate + azx - kF(x) = 0


arises often in applications, as, for example, in two-dimensional mass or
heat transfer with nonlinear generation. The method discussed in Sec. 3.9
may then be extended to this case to determine approximate solutions.

3.12 CONTROL OF A DISTRIBUTED PLANT


Many control problems of interest require the control of a system which
is distributed in space by the adjustment of a variable operating only at
a physical boundary. The complete study of such systems must await
a later chapter, but we can investigate a simple situation with the ele-
mentary methods of this chapter. One of our reasons for doing so is to
demonstrate still another form of an Euler equation.
A prototype of many important problems is that of adjusting the
temperature distribution in a homogeneous slab to a desired distribution
by control of the fuel flow to the furnace. The temperature in the slab x
satisfies the linear heat-conduction equation
axla2x1
_ 0<t<0
Wt- az2 0<z<1 (1)

with a zero initial distribution and boundary conditions


ax,
0 at z = 1
c3z
(2)
_
ax1
az
P(xl-x,) atz=0
S4 OPTIMIZATION BY VARIATIONAL METHODS

Here x2 is the temperature of the furnace. The first boundary condition


is a symmetry condition, while the second reflects Newton's law of cool-
ing, that the rate of heat transfer at the surface is proportional to the
temperature difference. The furnace temperature satisfies the ordinary
differential equation

r dt + x2 = u(t)
2 x2(0) = 0 (3a)

where u(t) is the control variable, a normalized fuel feed rate.


The object of control is to obtain a temperature distribution x; (z)
in time 6 or, more precisely, to minimize a measure of the deviation
between the actual and desired profiles. Here we shall use a least-square
criterion, so that we seek to minimize
fol
ts[u] = [x; (z) - xi(6,z)l2 dz (3b)

In order to use the procedures of this chapter it will be necessary to


obtain an explicit representation of s[u] in terms of u. Later we shall
find methods for avoiding this cumbersome (and often impossible) step.
Here, however, we may use either conventional Laplace transform or
Fourier methods to obtain the solution
to
xl(8,z) = fK(6 - t, z)u(t) dt (4)

where
a2 cos a(l - z)
K(t'z) = e IT
cos a - (a/p) sin a
a
COS (1 - Z)Pi
+ 2a2
it (a2 -
Oil)
1 + 1 + P
cos +i
(5)

with a - 1// and Pi the real roots of


#tan$ = p (6)

Thus,
f01
t;[u] = [xl (z) - fo K(6 - t, z)u(t) dt,2 dz (7)

Now, in the usual manner, we assume that u(t) is the optimum,


,(t) is an arbitrary differentiable function, and a is a small number and let

u(t) = u + ert (8)


CALCULUS OF VARIATIONS ss

The function of e which results by evaluating Eq. (7) is


s[n + e>)J = f oI [x, (z)]2 clz - 2 f,' x" (z) fo K(8 - t, z)i (t) clt dz
- 2e fol x; (z) fo K(O - t, z),,(t) dt dz
+ 0f' K(O - t, z)u(t) dt]2 dz
fot
+ 2e [ f u K(8 - r, z)u(r) dr] [fo K(O - 1, z)-,(I) dl] dz
+ e2 fot [fo K(8 - t, z), (t) dt]2 dz (9)

and, evaluating d8/de ate = 0,

de I.so =
-2f g x, (r) fo K(8 - t, z)n(t) dt dz
+ 2 Jot [ fo K(0 - r, z)u(r) dr, [fo K(9 - t, z),7 (t) dt] dz = 0 (10)

There is no difficulty in changing the order of time and space integration,


ind the inner two integrals in the second term may be combined to give
fot
fo
,7(t) [ K(6 - t, z) x,* (z) dz

-f o
fo' K(O - 1, z)K(8 - r, z)u(r) dz dr] dt = 0 (11a)
or, setting the arbitrary function n(t) proportional to the quantity in
brackets,
foI
K(8 - t, z)x, (z) dz
B I

fo [ fo
K(0 - t, z)K(8 - r, z) dz] a(r) dr = 0 (11b)
Equation (11) is an integral equation for the function u(i). It is
simplified somewhat by defining
101
ow = K(8 - t, z)x; (z) dz (12a)
I
G(t,r) = 0
K(9 - 1, z)K(O - r, z) dz (12b)

so that the Euler equation becomes


fo G(t,r)u(r) dr = 41(t) (13)

This is a Fredholm integral equation of the first kind. Analytical solu-


tions can be obtained in some special cases, or numerical methods may
be used. An obvious approach is to divide the interval 0 < I < 0 into
N even increments of size At and write
G = G(i At, j At) At (14a)
u, = u(j at) (14b)
O; = 1'(i At) (14c)
!6 OPTIMIZATION BY VARIATIONAL METHODS

We obtain an approximation to u(t) by solving the linear algebraic equa-


tions approximating Eq. (13)
N
i = 1,2,...,N (15)
-x

BIBLIOGRAPHICAL NOTES

Sections 3.2 and 3.3: An extremely good introduction to the calculus of vairations by
means of detailed study of several examples, including the brachistochrone, is
G. A. Bliss: "Calculus of Variations," Carus Mathematical Monograph, Mathematical
Association of America, Open Court Publishing Co., La Salle, Ill., 1925
Other good texts on the calculus of variations include
N. I. Akhiezer: "The Calculus of Variations," Blaisdell Publishing Company, Wal-
tham, Mass., 1962
G. A. Bliss: "Lectures on the Calculus of Variations," The University of Chicago
Press, Chicago, 1946
0. Bolza: "Lectures on the Calculus of Variations," Dover Publications, Inc., New
York, 1961
It. Courant and D. Hilbert: "Methods of Mathematical Physics," vol. 1, Interacience
Publishers, Inc., New York, 1953
L. A. Pars: "An Introduction to the Calculus of Variations," John Wiley & Sons,
Inc., New York, 1962
Applications specifically directed to a wide variety of engineering problems are found in
It. S. Schechter: "The Variational Method in Engineering," McGraw-Hill Book
Company, New York, 1967

section 3.4: We shall frequently use problems in control as examples of applications of


the optimization theory, and complete references are given in later chapters. A
useful introduction to the elements of process dynamics and control is
1). R. Coughanowr and L. B. Koppel: "Process Systems Analysis and Control,"
McGraw-Hill Book Company, New York, 1965
The reduction of the optimal feedback gain to the solution of a Riccdti equation is an
elementary special case of results due to Kalman, which are discussed in detail in
later chapters. An approach to linear feedback control like the one used here, based
on the classical calculus-of-variations formulation, is contained in
P. Das: Automation Remote Contr., 27:1506 (1966)

Section 3.6: The first discussion of the single-exothermic-reaction problem is in


K. G. Denbigh: Trans. Faraday Soc., 40:352 (1944)
A fairly complete discussion is in
It. Aria: "The Optimal Design of Chemical Reactors," Academic Press, Inc., New
York, 1961
CALCULUS OF VARIATIONS 97

The approach taken here, as applied particularly to the exothermic-reaction problem, is


credited to Horn in
K. G. Denbigh: "Chemical Reactor Theory," Cambridge University Press, New
York, 1965
where there is discussion of the practical significance of the result. Some useful con-
siderations for implementation of the optimal policy when parameters are uncertain
can be found in
W. H. Ray and It. Aris: Ind. Eng. Chem. Fundamentals, 5:478 (1966)

Sections 3.6 and 3.7: The texts cited for Secs. 3.2 and 3.3 are pertinent here as well.
Section 3.8: The general inverse problem may be stated: When is a differential equation
an Euler equation? This is taken up in the text by Bolza cited above and in
J. Douglas: Trans. Am. Math. Soc., 50:71 (1941)
P. Funk: "V'ariationsrechnung and ihr Anwendung in Physik and Technik," Springer-
Verlag OHG, Berlin, 1962
F. B. Hildebrand: "Methods of Applied Mathematics," Prentice-Hall, Inc., Engle-
wood Cliffs, N.J., 1952

Sections 3.9 and 3.10: The approximate procedure outlined here is generally known as
the Ritz or Rayleigh-Ritz method and as Galerkin's method when expressed in terms
of the residual. Galerkin's method is one of a number of related procedures known
as methods of weighted residuals for obtaining approximate solutions to systems
of equations. A review of such methods with an extensive bibliography is
B. A. Finlayson and L. E. Scriven: Appl. Mech. Rev., 19:735 (1966)
See also the text by Schechter cited above and others, such as
W. F. Ames: "Nonlinear Partial Differential Equations in Engineering," Academic
Press, the., New York, 1965
L. Collatz: "The Numerical Treatment of Differential Equations," Springer-Verlag
OHG, Berlin, 1960
L. V. Kantorovich and V. I. Krylov: "Approximate Methods of Higher Analysis,"
Interscience Publishers, Inc., New York, 1938
Sections 3.11 and 3.12: Distributed-parameter systems are considered in some detail
in Chap.' 11, with particular attention to the control problem of Sec. 3.12. The
derivation of the Euler equation used here for that process follows
Y. Sakawa: IEEE Trans. Autom. Contr., AC9:420 (1964)

PROBLEMS
3.1. A system follows the equation'
x- -x + u
Find the function u(t) which takes x from an initial value xo to zero while minimizing

E= (K + us) dt
IU
!t OPTIMIZATION BY VARIATIONAL METHODS

(time plus cost of control) where 8 is unspecified. Hint: Solve for fixed 0; then deter-
mine the value of 0 which minimizes E.
3.2. A body of revolution with axis of symmetry in the x direction may be defined as
one which intersects all planes orthogonal to the x axis in a circle. Consider such a
body whose surface in any plane containing the x axis is described by the curve y(x),
y(0) -0 y(L) -R
The drag exerted by a gas stream of density p and velocity v flowing in the x direction
is approximately

J - 4rpvi JOL y dx
dx)
Find the function y(x) passing through the required end points which makes the drag
a minimum.
3.3. S is & function of t, x, and the first n derivatives of x with respect to t. Find the
Euler equation and boundary conditions for
min & - foe T(x,z, ... x(R),t) di
3.4. A second-order process described by the equation
I+ax+bx - u
is to be controlled to minimize the error integral
r oe
min E - (x' + c'u') dt
Show that the optimal control can be expressed in the feedback form
u -Mix +M:x
and find the equations for M, and M2.
3.5. Obtain the Euler equation and boundary conditions for minimization of

E- Jo [p(t)i= + 2r(t)xz + 4(i)x' + 2m(t)x + 2n(t)x] dt + ax'(e) + bxs(0)


2 theeee

and relate result to the discussion of Sec. 3.8.


3A. Steady-state diffusion with isothermal second-order chemical reaction, as well as
other phenomena, can be described by the equations

dz Dx2-0
D- - h(x - xo) atz-0
Ddx -0 atz -L
where k, D, h, and xo are constants. Find the parameters in a cubic approximation
to the solution.
3.7. Using a polynomial approximation, estimate the first two eigenvalues of
Y+ax-0
CALCULUS OF VARIATIONS

for the following boundary conditions:


(a)x-0 at t=0,x
(b)i-0 att-0
x-x-0 at t=w
Compare with the exact values.
3.8. Laminar flow of a newtonian liquid in a square duct is described by the equation
a'v+a2v-1AP v-
Oatx - ta
ax' ay' is L a0 y - to
Here v is the velocity, p the viscosity, and AP/L the constant pressure gradient.
(a) Using the Galerkin method, find the coefficients A, B, and C in the approxi-
mate form
v - (x' - a')(y' - a')(A + B(x' + y2) + Cx'y')
The solution is most conveniently expressed in terms of the average velocity,

LI -! J aQ I as v(x,y) dx dy
(The form of the approximation is due to Sparrow and Siegal. Numerical values of
the exact solution are given in the book by Schechter.)
(b) Formulate the flow problem in terms of minimization of an integral and use
the complex method to estimate values for A, B, and C.
4
Continuous Systems:

4.1 INTRODUCTION
In the previous chapter we investigated the determination of an entire
function which would minimize an objective, and we were led to the
Euler differential equation for the minimizing function. It is rare that
a problem of interest can be formulated in so simple a fashion, and we
shall require a more general theory. Consider, for example, a chemical
reactor which we wish to control in an optimal manner by changing cer-
tain flow rates as functions of time. The laws of conservation of mass
and energy in this dynamic situation are represented by ordinary differ-
ential equations, and the optimizing function must be chosen consistent
with these constraints.
We shall assume that the state of our system can be adequately
represented by N variables, which we shall denote by xi, x2, . . . , xx.
In a chemical system these variables might be concentrations of the per-
tinent species and perhaps temperature or pressure, while for a space
vehicle they would represent coordinates and velocities. In addition,
100
CONTINUOUS SYSTEMS: I 101

we suppose that certain control or design variables are at our disposal


to adjust as we wish, and we shall denote them by ul, u2, ... , UR.
These might be flow rates, temperatures, accelerations, turning angles,
etc. Finally, we suppose that the state variables satisfy ordinary differ-
ential equations

.ti = dxi = fi(xl,x2, . . . )xN,u'1,u2, . . . ,UR)


i = 1, 2, .. , N
dt o<t<8
(1)

and we wish to choose the functions Uk(t), k = 1, 2, . . . , R in order to


minimize an integral
e
C[ul,u2, . . . ,uR) = fo 5(xl,x2, . . . ,XN,UIlu2) . . . UR) dt (2)

We shall generally refer to the independent variable t as time,


although it might in fact refer to a spatial coordinate, as in a pipeline
chemical reactor. The total operating duration 0 may or may not be
specified in advance, and we may or may not wish to impose conditions
on the variables xi at time 0. A typical measure of performance in a
control problem might be a weighted sum of squares of deviations from
preset operating conditions, so that 5 would be

CJ(xl _ x18)2 + . . . + CN(XN ZNS)2


+ CN+1(ul - u18)2 + . .
. + CN+R(UR - URS)2 (3)

On the other hand, if the controls were to be set to bring x1, x2, . . . ,
xN to fixed values in the minimum time 8, we would wish to minimize
& = 0 or, equivalently,

As we shall see, there is no loss of generality in the choice of the per-


formance index, Eq. (2).

4.2 VARIATIONAL EQUATIONS


The development in this section parallels that of Sec. 1.8. For con-
venience we shall put N = R = 2, although it will be clear that the
results obtained are valid for any N and R. We thus have the state
defined by the two differential equations

±1 = f1(x1,x2,u1,u:) (14)
i2 = f2(xl,x:,u1,u1.) (lb)
me OPTIMIZATION BY VARIATIONAL METHODS

and the performance index


&[u1,u2] = 0
ff(xl,x2,ul,u2) dt (2)

Let us suppose that we have specified the decision variables ul(t),


u2(t) over the entire interval 0 < t < 0. Call these functions ul(t), u2(t).
For given initial conditions x1o, x20 we may then solve Eqs. (1) for xl(t)
and 22(t), 0 < t < 0, corresponding to the choices ul(t), 42(t). The value
of the performance index is completely determined by the choice of deci-
sion functions, and we may call the result t;[u1iu2].
We now change ul and u2 at every point by small amounts, bul(t)
and 6u2(t), where
bol(t)I, Ibu2(t)I < e 0<t<0 (3)
and a is a small positive constant. [If x1(0) and x2(0) are not specified,
we also make small changes U1(0), bx2(0).] That is,
U1(t) = ul(t) + bul(t) (4a)
u2(t) = u2(t) + W2(t) (4b)
and as a result we cause small changes bxl(() and d,z2(t) in xl and x2,
respectively, and a change 66 in the performance index. We can obtain
expressions for bxl, 6x2r and 56 by evaluating Eqs. (1) and (2) with (ul,u2)
and (ul + bul, u2 + 6u2), successively, and then subtracting. Thus,

at
(xl + axl) - at- xl = 6x1
= fl(xl + bxl, 22 + 6x2, ul + aul, u2 + but) - fl(21,22,41,u2)(5a)

it (22 + bx2) - jx2 = bx2


= f2(xl + bxl, x2 + 5x2, u1 + aul, u2 + but) - f2(xl,x2,u1,i72) (5b)
3& = g[ul + but, u2 + but] - F,[ul,u2]
= Jo bxl, x2 + 6x2, 'ul + aul, u2 + but) - ff(21,22,u'1,i22)] dt
rrg+60

+ Jd ff(21 + 6x1, x2 + 6x2, ul + bul, u2 + 8u2) dt (6)

The last term in Eq. (6) must be added in the event that the total time 0
is not specified, and a change in the decisions requires a change in the
total process time in order to meet some preset final condition. B then
represents the optimal duration and 60 the change.
If we expand the functions fl, ff, and T at every t about their.respec-
tive values at that t when the decisions ul, u2 are used, we obtain
ai; = fo
/ `
a bxl -} ax, +
a bui + a' bug dt
ax, axe aul 0U2
\ + 60 + o(E) (7)
CONTINUOUS SYSTEMS: I 103

afiSxl+ aflSx2+ aflSul+ af26u2+0(e) (8a)


axl ax2 au, au2

Sz2
af26xl+ af2Sx2+ af2Sul+ af2Su2+0(e) (8b)
ax1 ax2 aul 8U2

We now multiply Eqs. (8a) and (8b), respectively, by arbitrary continu-


ous functions X1(t) and X2(t), integrate from t = 0 to t = B, and add the
result to Eq. (7). Thus,

bS = faJl61 +Xlaxl+X2 XI 311

+1 a Y+Xlz+\2 2) ax2 - X26±2

aul
+ (aul + Xi aul + X2 of l)
+
(a + X l Al
au2 + X. au:
af2)
sue dt + I _a Sa + o(e) (9)
au2
Integrating the terms X, ft, and X2 6±2 by parts gives, finally,
fIM-5
SS= +Xlafl+X2azi+Sxl
+ (az + X1 ofy + X2 auz + ^2) 6x2
(T11l
+ + Xl aui + X2 aui1 Sul
(C of l
sue dt
+ au, + l au2 + Z au2
of t )

+5 It-S Se - X1(8) Sxl(6)


- X2(B) 6x2(9) + X1(0) 6xl(0) + X2(0) 6x2(0) + 0(t) (10)

Now, just as in Sec. 1.8, we find ourselves with terms which are at
our disposal, namely, the decision changes Sul(t), 5u2(t), and terms which
are not, the state variations Sxl(t) and Sx2(t). We therefore eliminate
these latter terms from the expression for a& by removing some of the
arbitrariness from the functions al(t), X2(t) and requiring that they satisfy
the differential equations

a-
afl aft
^1 = - axl - a l xl ,2 (h a)
axl
2 ag afl aft
Xl aZ2 - X2 (11b)
ax2 49x2

Note that we have not yet specified boundary conditions. The variation
104 OPTIMIZATION BY VARIATIONAL METHODS

8& is now

b8
= f a + Xl auafl '} X2 auafsl` 8161
g

au1 1

+`a 2+Xiaus+X,/2) 6u2]dl


+5 I1-1 ae - X1() axl(e) - X2(8) 8x2()
+ X1(0) 6xi(0) + X2(0) 5x2(0) + 0(E) (12)
It is necessary at this point to distinguish between cases when the
total duration 0 is specified and when it is not. If 0 is specified, then
58 must be zero. If x1(9) is fixed,'then 8x1(9) is zero and the term
X1(8) hl(8) vanishes. If, on the other hand, x1(9) is free to take on any
value, we have no'control over 8x1(8), so that we remove it from the
expression for SE by specifying that X1(0) = 0. Similar considerations
apply to the term X2(0) 8x2(9), and we obtain the boundary conditions:
0 specified:
x1(9) free X1(9) = 0
x1(9) fixed X1(9) unspecified
(13)
X2(0) free X2(8) = 0
x2(8) fixed X2(0) unspecified
If a is not specified, the variations 68, 6x1(8), and 5x2(9) are related.
In fact,
x1(8 + ae) = x1(6) + axl(e) + fl Il_s 59 + o(e) (14)
and similarly for x2. Thus, if x1 is fixed at the end 'of the process, we
require x1(8 + 69) 21(6), and if x2 is free, the terms
if 69 - X1 5x1 - X2 8x2 = (`if + Xlfl) 58 - X2 5x2 (15)
and similarly for x2 fixed, or both. If neither x1 nor x2 is fixed, of course
we may choose 89 = 0 and use the previous results. Thus, applying the
logic of the previous paragraph, we obtain the following conditions:
0 unspecified:
X1(0) free x2(8) free X1(8) = 0 X2(8) = 0
x1(8) fixed x2(8) free if + X1f1 = 0 X2 = 0 (16)
x1(8) free x2(8) fixed if + X2f2 = 0 X1 = 0
x1(8) fixed if + Xlfl + X2f2 = 0
x2(8) fixed
We now apply the same approach to the terms X1(0) &x1(0) and
X2(0) 5x2(0) and obtain the further boundary conditions:
x1(0) free X140) = 0
x1(0) fixed X1(0) unspecified
(17)
x2(0) free X2(0) = 0
x2(0) fixed X2(0) unspecified
CONTINUOUS SYSTEMS I -ins

For the problem with fixed 9, Eqs. (13) and (17) provide a total of four
boundary conditions for the four differential equations (1a), (1b) and
(11a), (11b). When B is variable, conditions (16) and (17) provide five
conditions, four boundary conditions and a stopping condition.t With
these conditions we finally obtain an expression for 68 which., depends
only on the variations in the decision variables,

- Jo [(au + X1 Al af, aul


aut + X= aut)
_) Susj dt + o(e) (18)
+ (aus + Claus +

4.3 FIRST NECESSARY CONDITIONS

We now introduce into the discussion, for the first time, the fact that
we are considering an optimization problem. Thus, if the choices ut(t),
til(t) are those which minimize ,8, then for all variations fti(t), 5u2(t) it is
necessary that 8 increase, or S8 > 0. We consider in this section only
situations in which the optimal decisions are unbounded and we are free
to make any (small) variations we wish. In doing so we exclude a large
class of problems of interest (indeed, most!), and we shall return shortly
to considerations of constraints on the allowable decisions.
As in Chap., 1, we choose a particular set of variations which makes
our task easy. We set .

but = _E {au + X 1 au,


i9l + = aut)
85 ' #1 12
OU2 allll Cluj

where e' is a small positive constant. Thus Eq. (18) of the preceding
section becomes

S8 = -E' for
s
,

aut + aut + aut) =

ta5 X Z

+ au + 1 au= + a= au: ,] dt + o(E) '- 0 (2)


Since e and e' are of the same order, it follows that

lim o(e) = 0
t

t For the case # unspecified and both x,(B) and x2(6) unspecified we require an addi-
tional condition. We shall find later that this condition is 5 - 0.i-e
106 OPTIMIZATION BY VARIATIONAL METHODS

and dividing by e' and taking the limit in Eq. (2), we obtain
of
I0
foe
of l afz = afl af, _
Kau, + l aul + = aui) + 1 8U2 + X1 au= + X= au=/ j dt -0
(4)
Since this is the integral over a positive region of a sum of squares, we
can satisfy the inequality only' if the integrand vanishes identically
(except, perhaps, on a set of discrete points). We therefol'e conclude
that if ui and u2 are the unconstrained functions which cause.& to take on its
minimum value, it is necessary that

aui + X, aui + a:
aui .0. (5a)

except, perhaps, on a set of discrete points. These equations represent


the extension of the multiplier rule to the problem of minimizing an
integral subject to differential-equation side conditions. It is important
to note that the optimal functions. ui(t), u2(t) are found from a set of
algebraic relations to be satisfied at each point.
Let us pause for a moment and contemplate what we have done:
In order to obtain conditions for the minimizing functions ui(t) and u=(t)
we have had to introduce two additional functions Xi(t) and a=(t) which
also satisfy a set of differential equations. The four boundary conditions
for the total of four differential eqtiations are split between the two ends,
t = 0 and t = 0, and there is an additional set of algebraic conditions
to be satisfied at each point. This is a rather formidable problem but one
which we must accept if we are to attack design and control problems
of any significance.
It is often convenient, and always elegant, to introduce the hamil-
tonian function H, defined as
H = ff + XJ, + X,f= (6)
The differential equations for xi, x=, ai, and a= can then be written in the
canonical form
aH aH
xl = x= = (7a)
sAi sa=

^1 aH x2= - OH (7b)

while the necessary conditions (5a) and (5b) can be written


aH=0 aH=0 (8)
aui au2
CONTINUOUS SYSTEMS: 1 167

Furthermore, the hamiltonian is a constant along the optimal path, for


dH aH aH aH . aH . aH .aH
7t - ax, Z1 + aX2 xz {- aX' 1 -{ Xs )\f -}- aul ul + aus 112z (9)

which equals zero after substitution of Eqs. (7) and (8).


When 8 is unspecified, the constant value of the hamiltonian is
found from the value at t = 0 to be zero. If x1(9) and x2(8) are free,
we can simply apply the calculus to find the optimal stopping time

ae " elo 5dt = III =0 (10)

Together with the first boundary condition [Sec. 4.2, Eq. (16)] this givesf
H = Hl,_, - 0 (11)
On the other hand, if x1j x2, or both are specified, Eq. (11) follows directly
from the remaining boundary conditions [Sec. 4.2, Eq. (16)].
We may summarize the results of this section in the following
statements:
The unconstrained functions ul(t) and u2(t) which minimize 6 make
the hamiltonian stationary$ for all t, 0 < t < 8. The hamiltonian is
constant along the optimal path, and the constant has the value zero
when the stopping time B is not specified.
This is a very weak form of what has come to be called Pontryagin's
minimum principle.
For the more general problem of N state and R decision variables
we require N multipliers, and the hamiltonian has the form
N
H=5+ Xnfn(x1)x2, . . ,xv,ul,uz,' . ,us) (12)
n-l
with canonical equations
x;=aaH
°f ; (13a)

OH afn
X. (13b)
ax;
ax nil
and the boundary conditions
X. = 0 x; free (14a)
a; unspecified x; fixed (14b)
t This is the miming stopping condition which we noted previously.
We shall find later that the hamiltonian is in fact a minimum at these stationary
points.
109 OPTIMIZATION BY VARIATIONAL METHODS

The hamiltonian is made stationary by all decision variables uk, k = 1, 2,


. . . , R, and is a constant, with zero value when 0 is unspecified.

4.4 EULER EQUATION


At.this point it may be useful to reconsider the problem of Sec. 3.2 from
the more general result of the previous section. We seek the function
x(t) which will minimize the integral
3= 0
a(x,t,t) dt (1)

First, we must put the problem in the form of Eqs. (1) of Sec. 4.2.
To do so, we observe that for a given (or optimal) initial condition, x(0),
the function x(t) is uniquely determined by its derivative ±(t). Thus,
t may be taken as the decision variable. ' Furthermore, the etplicit
dependence of 9 on t may be removed by the simple guise of defining a
new state variable. That is, if we let x1 denote x, the problem can be
reformulated as
e
S= %(xl,u,x2) dt (2)
o
with
i1 = U (3a)
x2 = 1 X2(0) = 0 (3b)

so that x2(t) m t. This is precisely the form which we have studied


(with the special case of a single decision function).
The hamiltonian is
H=SF+X1u+X2 (4)

with multiplier equations


aH aif aS
(5a)
axl =-ax)
aH 05 05
(5b)
ax: axs = - at
Equation (5a) can be integrated to give

al(t) = x1(0) - fax (6)

We generally do not need Eq. (5b), but we should note that it requires
that X2 be constant if 5 is independent of t.
The condition that the hamiltonian be stationary is
aU=au+a1= 01= 015
+X) (7)
CONTINUOUS SYSTEMS: I 109

or, combining with Eq. (6),


as as dt + c (8)
TX = Jo ax
where c denotes a constant. This is the equation of du Bois-Reymond.
If we differentiate with respect to t, we obtain Euler's equation
d as as
dtat -ax (9)

The necessary boundary conditions follow from Eq. (13) of Sec. 4.2 as

x(0) given or ax =0 (10a)


1 1-0
05 1

x(B) given or = 0 (10b)


az ,1-e
Equations (9) and (10) are identical to Eqs. (18) and (19) of Sec. 3.2.
It may be that the optimal function x(t) has a "corner" at some
value oft where the derivative is not continuous. The integrand 9(x,z,t)
will not be continuous at this point, and the Euler equation (9) will not
be defined. The integral in Eq. (8) will be continuous, however (recall
that the value of an integral is not changed by what happens at a single
point), and therefore aS/az must also be continuous at the corner. This
is the Erdmann-Weierstrass corner condition.

4.5 RELATION TO CLASSICAL MECHANICS

The reader familiar with classical mechanics may wish to relate the
hamiltonian defined here with the function of the same name employed
in mechanics. It will be recalled that in a system of N particles, with
masses ml, m2i . . . , mN and positions in a three-dimensional space x11,
. . . , xNl, xN2, xN3, the physical system is the one
x12, x13, x21, x22, x23,
which minimizes (or makes stationary) the action integral
N 3

z L-.----V(x11,x12)x1`3,
fo [s-li-1
L , xN1,xN2,xN3..11 (1)

where the first term is kinetic and the second potential energy. Renum-
bering, we may use a single-subscript notation
3N M2 ,_+,2
S = fo - V (x1jx2, . . . ,x3N) dt (2)
i-1 IJ
with m1 = m2 = m3, m4 = m5 = m6, etc.
no OPTIMIZATION BY VARIATIONAL METHODS

If we take the decision variable as the velocity,


x;=u; i=1,2, ... ,3N (3)
and
3N
m2 2
._1 - V(xl,x2, . . ,x3N) (4)
i-I
then the hamiltonian is
3N 3N
H =Im2 - V (x,,x2, . . . ,xaN) 1- atu; (5)

The multiplier equations are

' - axi
aH aV
ax;
while the stationary condition is
(6)

aH
=0=ma-us +X; (7a)
au,
or
X;
(7b)
m;

Defining the momentum p; as - Xi, we find

(8)

and
3N
H= pit -- V (x1,x2,
2m;
. . ,x3N) (9)
i-i

which is simply the negative of the usual hamiltonian of mechanics.


(The fact that we have obtained the negative stems from an unimpor-
tant sign convention.) This is the total energy, which is a constant in
a conservative system.

4.6 SOME PHYSICAL EQUATIONS AND USEFUL TRANSFORMATIONS


In this and subsequent chapters we shall often find it useful to relate the
mathematical development to the particular example of the control of a
continuous-flow stirred-tank chemical reactor. This example is chosen
because, while simple, it retains the basic features of many practical sys-
tems and because the basic equations have received considerable atten-
tion in the published literature. In this section we shall develop the
CONTINUOUS SYSTEMS: I 111

equations and consider several, transformations of variables, each of which


will be useful in various formulations of the control problem. One of our
purposes is to derive, through the consideration of the reactor example,
the general form of a second-order system, and the reader who is not
interested in the particular physical application may wish to begin with
Eqs. (10).
The reactor system is shown schematically in Fig. 4.1. For sim-
plicity we assume a single liquid-phase first-order reaction
A --+ products (1)

The equation of conservation of mass is then

A=V(A, - A) - kA (2)

where A denotes the concentration of reactant, the subscript f refers to


the feed stream, V the volume, and q the volumetric flow rate of the feed.
k is a temperature-dependent reaction-rate coefficient of Arrhenius form

k = koexp( T l! (3)

where ko and E' are constants and T is temperature. The equation


simply states that the net rate of accumulation of A in the reactor is
equal to the net rate at which it enters less the rate of reaction
Similarly, an energy balance leads to
4 (T, - T) - UKq`
(T - T,) -b (-AH) kA
fi = V VCpp(1 + Kq,) C,p
(4)

Here the subscript c refers to the coolant stream, p is the density, Cp the

Coolant flow rate


Flow rote
IQ,

Coolant
Feed stream
91.
O T
nX

Product stream
A, T

Fig. 4.1 Schematic of a continuous-flow stirred-tank reactor.


112 OPTIMIZATION BY VARIATIONAL METHODS

specific heat, U the overall heat-transfer coefficient times the cooling area,
AH the heat of reaction, and K a constant, defined as

K= 2CPCPC
U (5)

Equations (2) and (4) are made dimensionless by defining a dimen-


sionless concentration Z1 and temperatures Z2, Zr, and Z,, as follows:
A 1
Z1 (6a)
Af
CPpT
Z2 (6b)
(-AH)A,
CPpT1
Z' _ (-AH)A, (6c)

CPpT
Z (-AH)A1 (6d)
Thus
Z1 = V (1 - Z1) - kZ1 (7a)

Z2 (ZI - Z2) (Z2 - Z,) + kZ1 (7b)


V VCPp(K++
where k may be written in terms of Eqs. (6) as

k = ko exp - (-AH)A,
E'CPp 1
Z2 (8)

Let us now assume that the reactor has been designed to operate
at a stable steady-state condition Z18, Z25 and that we shall control the
reactor in the neighborhood of the steady state by adjusting the flow
rates q and q,. Let x1 and x2 denote variations about the steady state
in (dimensionless) concentration and temperature, respectively, and ul
and u2 in feed and coolant flow rates; that is,
x1 = Z1 - Zis X2 = Z2 - Z2s (9a)
ul = q - qs u2 = qc - qcs (9b)

where the subscript S refers to the steady state. Substituting into Eqs.
(7) and expanding the right-hand.sides in Taylor series about the steady
state, exactly as in Sec. 1.4, we obtain the following equations for the
dynamic behavior of the reactor in a neighborhood of the steady state
x1=x2=u1=u2=0:
x1 = a11x1 + a12x2 + b11u1 (10a)
z2 = a21x1 + a22x2 + b21U1 + b22U2 (10b)
CONTINUOUS SYSTEMS: I 113

where the constants are defined as follows:

ail = V + ks (lla)
E'CppksZ,s
(11b)
a12 (- AH) A! Z zsz
a21 = -ks (11c)
qs UKq,s E'C,pksZls
azz = + VC,p(1 + Kq,s) (-AH)A,Z2s2
(lld)

b11 = V (1 - Z15) (lle)


b21 = V (Z1 - Z25) (111)
UK(Z2s - Zc)
b22 = - VC,p(l + (llg)

It is clear that Eqs. (10) will apply to a large-lass of systems besides the
reactor problem considered here. Values of the parameters which are
used for subsequent numerical calculations are collected in Table 4.1 in
consistent (cgs) units.
In some cases we shall choose to hold ui at zero and control only
with the coolant flow rate. We can simplify the form of the equations by
defining new variables yl and Y2, as follows:
x1 = -yl
all _ 1
xz =
ail Y1 a12
Y2

Substituting into Eqs. (10), we then obtain


yl = y2 (13a)
y2 = (a12a21 - alia22)yi + (a11 + a22)y2 T (-a12b22u2) (13b)

or, with obvious notation,


y1 = y2 (14a)
92 = - a2y1 - a1y2 + U (14b)

Table 4.1 Parameters for continuous-flow stirred-tank reactor

V = 1,000 Af - 0.0065 T, = 350


T.-340 k0 -7.86X1012 E'-14,000
(--MI) - 27,000 p - 1.0 C, - 1.0
U-10 K-0.2 qs-10
As=15.31 X10-1 Ts-460.91 q,s=5
114 OPTIMIZATION BY VARIATIONAL METHODS

Equations (14) are equivalent to the equation of motion of a solid body


with viscous drag and a linear restoring force, the general second-order
system
y + a,y + azy = u(t) (15)
Hence, Eqs. (15) and (10) are completely equivalent when b11 = 621 = 0.
Equations (10) and (15) are also related in another way for this
case. We may wish an equation for concentration alone if the tempera-
ture is of little importance to us. Differentiating Eq. (10a), we obtain
(with b,1 = b2l = 0)
Zi = a11±1 -4- a,zx2 (16)
and substituting for t2 from Eq. (10b) and x2 from Eq. '(10a),
xl + alxl + azxl = -U(t) (17)
where al, a2, and u have the same meanings as above.
Another transformation which we shall find convenient is
xl = Y1 + yz (18a)
+ S1 all + S2
X2 = all yz (18b)
a12 a12

Here,
231 = -(all + a22) + [(all + a22)2 - 4(alla22 - alzazl)]; (19a)
2S2 = -(all + a22) - [(all + a22) 2 - 4(alla22 -. al2a2i)j/ (19b)
For the parameters in Table 4.1, S1 and S2 are negative and real. Equa-
tions (10) then become
yl = S1Y1 - M11u1 - M12u2 (20a)
y2 = 527,2 + M21u1 + M12u2 (20b)
where
alibi, + alzbzl + S2b11
(21a)
S,-St
M1, = alzb22
(21b)
Sl - SZ
M21 = M11 + bl1 (21c)

4.7 LINEAR FEEDBACK CONTROL


We are now in a position to consider the control of the chemical reactor.
We shall suppose that the linearized equations are adequate and that the
reactor is to be controlled by adjustment of the coolant flow rate. Equa-
CONTINUOUS SYSTEMS: I 113

tions (10) of the preceding section are then


z, = a12x2 (la)
x2 = a21x, + a22x2 + b22u2 (ib)

where x, and x2 are deviations in concentration and temperature, respec-


tively, and u2 the variation in coolant flow rate. The system is initially
displaced from the equilibrium values, and our control action will be
designed to keep fluctuations in all variables small. This objective may
be accomplished by minimizing the integral

g f o# + C2x22 + c:u:2) dt (2)


2

We shall make the transformation


xi = -y, (3a)
all 1
X2 = - y, - - Y2 (3b)
a,2 a,2

giving
y, = Y2 (4a)
02 = - a2y, - a,y2 + u (4b)
and the objective

g
2
fo (ci,y,2 + 2c,2y,y2 + c22y22 + c::u=) dt (5)

where
a, = - (a + a22) (6a)
a2 = a,2a2, (6b)
u = -a,2b22u2 (6c)

C C1 + C2
a,22 (6d)
all
C,2 -Cla,2
2 (6e)

_ C2
C22 (6f)
a,22
Cs
C21
a,22b2:2
(69)

The hamiltonian is then


H = 2c,2y,y2 + c22y22 + c22u2)
+ X,y2 + X2(- a2y, - a,y2 + u) (7)
116 OPTIMIZATION BY VARIATIONAL METHODS

and the canonical equations for the multipliers


aH ay=
-C11Y1 - C12y2 + a2X2 (8a)
1

aH ay=
),2
e
-c12y1 - C22y2 - Xi + a1X2 (8b)

with boundary conditions


X1(9) = X2(9) = 0 Z9)

The optimality criterion, that the hamiltonian be stationary, is


aH=0=X2+C33U (10)
au
or

U= -- X2
C33

Thus, the problem is one of finding the function-X2(t).


The optimal control for this system can be found in a rather straight-
forward manner by seeking a solution of the form
X1 = m11y1 + mi2y2 (12a)
A2 = mi2yi + m22y2 (12b)

From Eqs. (8), then,


X1 = -Clly, - Clay! + az(muuyi + m22y2) (13a)
X2 = -C12YL - C22Y2 - (miiy1, + m12y2) + ai(mi2yi + m22y2) (13b)

By differentiating Eqs. (12) with respect to time we also obtain


Xi = miiyi + m1iyi +, m12y2 + m12 r2 (14a)
X2 = m12y1 + mizyl + m22y2 + m22y2 (14b)

and, substituting Eqs. (4),


x1 = m11y1 + mlly¢ + m12Y2
T22
+ m.14 (-a2y1 - aly2 - M12 y1 - my2 (15a)

X2 = m12y1 + m12y2 + m22y2


/ m12
+ m22 l -aiy1 - a1y2 - Cb3 yl -
m22
Cs' yz (1 5b)

If a solution of the form of Eq. (12) is to exist, Eqs. (13),and (15) must
be identical for all yl and y2. That is, the coefficients of yl in Eqs. (13a)
and (15a) must be identical, as they must be in Eqs. (13b) and (15b),
and similarly for y2. Thus, equating coefficients, we obtain the three
CONTINUOUS SYSTEMS: I

differential equations
m11 = m122 -I- 2a2C3311112 - C33C11 (16a)
*22 m222 - 2c33m12 + 2a1c3391122 - Ca3C22 (16b)
m12 1n121n22 - C33nL11 + a1C33m12 + a2C33m22 - C3012 (16c)
Equations (16) are the multidimensional generalization of the
Riccati equation, which we have considered several times previously.
For finite B a solution can be obtained numerically with boundary con-
ditions m11(B) = m12(8) = m22(0) = 0, while as O--- ao, a stable constant
solution exists. We shall consider this case, and setting the time deriva-
tives in Eqs. (16) to zero, we obtain
m12 = -c3sa2 + (c$32a22 + C32C11)S4 (17a)
m22 = -C33a1 + [C332a12 + C33C22 - 2c332a2
+ c3s(C332a22 + (17b)
Combining Eqs. (11), (12), and (17), we obtain the optimal control

u= [a2 - (a22 +Cc-3. l

+ {ai - a12 - 2a2 - Ci: i] y


as + Cu, (18)

This is a multivariable linear feedback controller, whlsre the controller


gains depend on both the system parameters and relative weights. The
resulting system is stable, so that yl and y2 go to zero as 0 -> ao, and
hence Eq. (12) for X1(8) add 7 2(9) satisfies the zero boundary conditions
of Eq. (9).
The system equations (4a) and (4b), together with the control (18),
are linear, and are easily solved and then transformed back to the Z1,
Z2 variables defined in the previous section. Figure 4.2 shows the paths
in the Z.1Z2 plane for the linearized reactor equations with the param-
eters in Table 4.1, using constants
c1=84.5 c2=6.16 C3=10-' (19)
It is interesting to note that from most starting points the equilibrium
point is approached along a line in the plane. A weakness of the linear-
ized analyses becomes evident by noting that some of the paths cross into
negative concentrations, which are physically meaningless.

4.$ AN APPROXIMATE SOLUTION


The technique used in the previous section to obtain solutions of the
multiplier equations is a powerful one when the objective is of the form of
Eq. (5), i.e., a quadratic form in the state variables plus a squared cost-
in OPTIMIZATION BY VARIATIONAL METHODS

3.00

2.90

2.80

2.60

2.50

2.40

2.30

I I i
2.20 I I 1 1

0 0.04 0.08 0.12 0.16


Composition z,

Fig. 42 Temperature-concentration paths for the controlled


reactor. [From J. M. Douglas and M. M. Denn, Ind. Eng.
Chem., 67 (11): 18 (1965). Copyright 1965 by the American
Chemical Society. Reprinted by permission of the copyright
owner.]

of-control term, but it is restricted to linear systems. It is possible, how-


ever, to obtain approximate solutions to nonlinear systems with the same
objective by similar methods. In order to avoid algebraic complexity
we shall simplify our reactor example somewhat for the time being, but
we shall see later that the result is still of physical as well as mathematical
interest.
We shall assume for the reactor model described by Eqs. (6) and (7)
of Sec. 4.6 that it is possible to choose the temperature Z2 at every instant
of time, and hence temperature or, equivalently, the reaction rate k
[Sec. 4.6, Eq. (9)] is the control variable. The state of the system is
described, then, by the single equation

Z1 = (1 - Z1) - U, (1)

V
and we shall suppose that the objective is again one of keeping flue tua-
tions small, that is,
3 = 3 (Z1 - Z1a)1 + 3c2(k - ks) 2 (2)
CONTINUOUS SYSTEMS: f U9

It is convenient to define variables


x=Z1-Zis u=k - ks (3)

and so the system equation becomes, after subtracting out the steady-
state terms,
i = - (3+ ks) x - (Zis + x)u (4)

= 12 (x2 + c2u2) (5)


Note that we have not linearized. The linearized form of Eq. (4) would
not contain the term xu.
The hamiltonian for this nonlinear system is

H= iix2+12c2u2-x(3+ksl x-a(Z15+x)u (6)

and the equation for the multiplier

-aH= -x+lV+ks, X+Xu (7)

The condition that the hamiltonian be stationary is


aH
au
=cu-X(Zis+x)=0
2
(8)

or
U = c (Zis + x) (9)

so that the problem is again one of finding the solution to the multiplier
equation.
For the linear system of the previous section we were able to obtain
a solution for X proportional to x. We might look upon this as the lead
term in a series expansion and seek a solution of the form
X = mx + px2 .+. . . . (10)
Again, for simplicity, we shall take 0 ---> so that it will be possible to
obtain the solution with ni, p, . . . as constants.
Differentiating Eq. (10), we `find, using Eqs. (4) and (9),

{(# + ks) x + (Zis c2 x) 2 (mx + px2) ] + .. .

- (m + 2px)
1 (l la)
while from Eqs. (7), (9), and (10),
\1
_ -x + V + ks l (mx }px2) +
m2x2Z>. s
c2
+ ... (11b)

The coefficients of each power of x must be equal in these two equations,


120 OPTIMIZATION BY VARIATIONAL METHODS

so that we obtain from the power x'


+ 2m \V + k sl
m2Z1s2
1 = 0 (12)

and from the power` x2

P
(q + ks + mZ1s2 ) + m2Z1s = 0 ( 13 )
``
V c2 c2

Equation (12) is easily shown to be the steady-state limit of the Riccati


equation (10) of Sec. 3.4 for the linearized system when the proper
identification of coefficients is made, the stable solution being
2 2 ,_

M --
Z1s2 I V
+ k,, - (1 ks) + ZC2 ] (14)

while the solution to Eq. (13) is


P m2Z1s
W c2(q/V + ks) + mZ1s2 (15)
Coefficients of x2, x4, etc., in the series for X can be obtained in the same
manner. In terms of the original variables, then, we obtain the optimal
nonlinear feedback control
k = ks + Z' [m(Zi - Z15) + p(Z1 - Z1s)2 + .. ] (16)

Had we not let B -- , we would have obtained differential equations for


m, P, ....
Equation (16) defines a curve in the Z1Z2 plane. If we assume
that we have reached that curve in some way, we can substitute Eq. (16)
into Eqs. (7) of Sec. 4.6 to obtain the optimal coolant flow rate qc to
implement this policy, namely, the solution of
q. _ (-1H)VA,Z22 q
(1 - Z1) - kZ1 (2Z1 - ZIS)
1 + Kq, k UKE'c2 IV
[m -f- P(Z1 - Z1s) + ] + UK (Zf Z2) + kV C P 21 (17)
Hence, had we posed the problem of finding the coolant flow rate to
minimize the integralt
_ f [(Z1 - Z1S)2 + c2(k - ks)2] dt
we would have the partial solution that once having reached the line
defined by Eq. (16), the optimal control policy is the nonlinear feedback

t There is no obvious physical reason why k should be any less meaningful in the
integral than Z1, since it is the reaction rate, rather than the temperature, which
affects conversion.
CONTINUOUS SYSTEMS: I 121

controller defined by Eq. (17). As we shall see in the next chapter, the
overall policy is the one which we intuitively expect, namely, full cooling
or no cooling until the line defined by Eq. (16) is reached and then, non-
linear control as defined above.

4.9 CONTROL WITH CONTINUOUS DISTURBANCES


The control problems we have considered thus far have dealt with autono-
mous systems only; i.e., the differential equations are assumed to have
no explicit dependence on time. This will be the case, for example, when
the operating conditions have been changed and the system must be
brought to a new steady state (a set-point change) or when a large pulse-
like disturbance has affected the system but no more disturbances are
expected for a time long compared to the system's response time. In
many control situations, however, disturbances of prolonged duration
may be expected to enter the system and must be included in a rational
control analysis.
For simplicity we shall restrict our attention to linear systems with
constant properties in which the state of the system can be described by
a single variable. If the disturbance is a piecewise differentiable func-
tion D(t), the system response is described by the differential equation
x = Ax + u + D(t) x(0) = xo (1)
We shall again take the control objective to be the minimization of the
integral
& = 2 Io (x2 + c2u2) dt (2)

If we follow the procedure of Sec. 4.4 and remove the explicit time
dependence by definition of a new variable, we can write Eqs. (1) and (2)
in autonomous form as
a1 = Ax, + D(x2) + u x1(0)" = xo (3a)
x2 = 1 x2(0) = 0 (3b)
2
(x12 + c2u2) dt (4)
Ia

The hamiltonian for this system is


H = 3'(x12 + c2u2) + X1(Ax1 + D + u) + X2 (5)
with multiplier equations
1,1=-aH=-xi-AX1 X1(d)=0 (6a)

aH _ `X1dD = -X11 (6b)


122 OPTIMIZATION BY VARIATIONAL METHODS

where X2(B) will be zero if 0 is unspecified, but unspecified for 0 fixed.


The condition of optimality is
aH =c2u+X,=0 (7a)
-5-U

or

(7b)

We see, therefore, that x2 and X2 are in fact extraneous, and we may


drop the subscripts on x, and X,.
Equations (3a) and (6a) form a system
z = Ax - + D(t) x(O) = xo (8a)
x = -x - AX X(0) = 0 (8b)
Since, we already know that a homogeneous solution (D = 0) for this
system can be obtained in the form X proportional to x, we shall seek a
solution
X = -c2[M(t)x + L(t)] (9)

From Eq. (8a) we then have


X = -c2M(Ax + D + Mx + L) - c2Mx - c2L (10a)
while from Eq. (8b)
X = -x + Ac2Mx + Ac2L (10b)

Equating coefficients of x in Eqs. (10), we obtain an equation for M(t)

M + 2AM + M2 - c = 0 (Ila)
and therefore L(t) must satisfy
L + [A + M(t)]L + M(t)D(t) = 0 (llb)
Equation (Ila) is, of course, simply the Riccati equation of Sec. 3.4.
In order to establish boundary conditions for Eqs. (Ila) and (llb) we
shall assume that at some time t < 0 the disturbance vanishes and
remains identically zero.During this final period we have the problem
we have already solved, namely, a system offset from x = 0 with no
disturbances, and we know that the solution requires
M(9) = 0 or x(0) = 0 (12a)
The solution for M(t) is thus given by Eqs. (13) and "(14) of Sec. 3.4.
It follows then, from Eqs. (8b) and (9), that the proper boundary con-
CONTINUOUS SYSTEMS: I 123

dition for Eq. (11b) is


L(O) = 0 (12b)--

This boundary condition clearly points up the difficulty we face in deal-


ing with nonautonomous systems. While the formal mathematics is
straightforward, we must know the future of the disturbance function
D(t) in order to obtain a solution to Eq. (llb) which satisfies the bound-
ary condition [Eq. (12b)] at t = B.
In practice this difficulty, while serious, may be somewhat less
severe on occasion than it first appears. If we let 0 -- co, the solution
of the Riccati equation (Ila) is

M = -A - 1 le' (13)

and Eq. (lib) is

L` - J1 C2

which has the solution


L = (A + Ji
9
c2 D(t) L(w) =-0 .(14)

/
[-
cA2c= f~ }
L(t) _ -tA+ exp 1 c2 (r - t)]
\ c2 D(r) dr (15)
For bounded disturbances the integrand will effectively vanish for future
times greater than several time constants, c(1 + A2c2)-, and it is neces-
sary only to know (or estimate) the disturbance for that time into the
future. Indeed, disturbances often have the form of step changes,
D(t) = D = const 0 < t < nc(.1 + A2c2)- (16)
in which case Eq. (15) yields

L(t) _ - l 1 + 1 +AZC= (1 - e-")D (17a)

and, for n greater than 2 or 3,

L - 1+ A 2C2
1+A2c2)D (17b)

That is, the optimal control for a step disturbance is proportional to


both the system state and the disturbance
(l
u(t)
- CA + c2A2C2) x(t) - + +2A2c2 D (18)
The term c2u2 in the objective defined by Eq. (2) is meaningless for
124 OPTIMIZATION BY VARIATIONAL METHODS

many industrial situations, in which a true cost of control is negligible.


Such a term might be useful as a penalty function in order to keep the
control effort u(t) between bounds, but an immediate disadvantage of
suet' a practice is evident from the substitution of Eq. (18) into Eq. (1).
The optimal response to a step-function disturbance is then found to be
x(t)
1+cDc2+(xo+1+A2c2)expI 1+Arest) (19)
or
x(t) _ AclD (20)
1 + A 2c2
That is, the optimal control with a cost-of-control term does not return
the system to x = 0 as long as there is a step disturbance present.
This is known as steady-state offset, and is clearly undesirable in many
circumstances.

4.10 - PROPORTIONAL PLUS RESET CONTROL


Very often the serious restriction is not the available control effort but
the maximum rate at which the control setting may be changed. In
such cases the problem of steady-state offset for step disturbances can be
resolved, and the result poiryts up an interesting connection between
traditional control practice and optimal-control theory.
We shall assume that tale system is at equilibrium for t < 0, with
a step disturbance of magnitude D entering at t = 0. We thus have
z=Ax+D+u x(0)=0 (1)

with the objective

i; = 2 Jo (x2 + c242) dt (2)

where the term c2u2 is intended as a penalty function to keep the rate
of change of control action within bounds. Because D is a Constant, we
can differentiate Eq. (1) once to obtain
z = Ax + it x(0) = 0 z(0) = D (3)

or defining xl=x,x2=z,w=u,
21 = x2 x1(0) = 0 (4a)
x2 = Arxe + w x2(0) = D (4b)
Fi = 2 (x12 + c2w2) dt (5)
0

The problem defined by Eqs. (4) and '(5) is precisely the one con-
If we let 0 - ao, then, by relating coefficients. the
-sidered in Sec. 4.7.
CONTINUOUS SYSTEMS: 1 125

optimal solution is obtained from Eq. (18) of Sec. 4.7 as


w(t) = ic(t) = - c x, - (A + X2 (6)

Integrating, the control action is found to be


fo x(r) dT
u(t) = - CA + c) x(t) ^ c
(7)

That is, the optimal control is proportional to both the offset and the
integral of the offset. The integral mode is often referred to as reset.
Most industrial controllers employ proportional and reset modes.
The importance of the reset mode can be clearly seen by substitut-
ing the control action into Eq. (3). The system response is determined
by the equation
2+!i+-1X=0 x(0)=0 x(0)=D (8)

which has the solution


2cD 4c
e-1/2c sinh t e < 3/
x(t) ,/1 -4 c (9)
-D 1 e-111C sin
1/4c __1
t c>
2c

That is, the response is overdamped and without oscillation when c < j,
underdamped with oscillations for c > Y4, but. always decaying exponen-
tially to zero after the initial rise. Thus, there can be no steady-state
offset.

4.11 OPTIMAL-YIELD PROBLEMS


In many processes the quantity of interest will' not be ,a cumulative
measure of profit or loss in the form
g = ff(x1,x2,ul,u2) dt (1)
but simply the difference between initial and final values of x, and x2.
A particular case would be a chemical-reaction system, where we might
seek to maximize some weighted sum of the conversions, with the profit,
say 61, expressed as
19 = c[xi(8) - x1(0)) + [x:(#) - x:(0)l (2)
This is equivalent, however, to writing
6' = fo (cxI + 11) dt = 0
(cf, + f2) dt (3)
126 OPTIMIZATION BY VARIATIONAL METHODS

which is in the form of Eq. (1) if we obtain a minimization problem


by letting S = -(P. Hence, we wish to consider the general problem of
two nonlinear state equations, with 5 defined as
S= -Ch -f= (4)

For algebraic simplicity we shall assume a single decision function u(t).


The canonical equations are
21 = f1(xx,x=,u) (5a)
x= = f2(x1,x=,u) (5b)
1= -(X1-c)afl0x1 (5c)

X=-= -(X1-c)ax=-(A2-1)of= (5d)

with boundary conditions X1(9) _ X2(9) = 0 and the hamiltonian


H = fi(A1 - c) + f2(X= - 1) (6)

It is convenient to define new variables 4,1, A2 such that


(7a)
(7b)
(7c)
(7d)

We then have

o f 1 C1xj
(8a)

aft af=

- tG= ax= (8)


ax=
H=#Lf1+ihf= (9)

and the condition of optimality,


aH
au = 01 a
af1
+ 4,:
af=
a
=o (10)

For this class of problems it is possible to reduce the number of


variables which need to be considered. Equation (10) is true for all
time, and therefore its derivative with respect to t must vanish. Hence,

da=
jf
IYL au + 1 (\8u 0x111
= =h
+ au ax=f= + ax=
a=f1

_
(11)
+ d= au (au ax,f1 + au ax=f= + au= u = 0
CONTINUOUS SYSTEMS: I 127

or, substituting Eqs. (8),


aft afl aft afl a2fl 49 2f, 02A
4,1
au axt au + a26 axl
aX2
fl + au ax2
f2
+ aY42
0,12 -3f2 0_f2 a2f2 a2f2
(- au 5_4 - au axe + au axl f l + au ax2 f 2 + aut is =0
afl a2fl

+ 412
(12)

Equations (10) and (12) are linear homogeneous algebraic equations for
4,t and 4#t, and the condition that they have a nontrivial solution (Jj and
1r2 both not identically zero) is the vanishing of the determinant of
coefficients. Thus
Of, Of, aft aft 012 a2f2 a2ft 02f2
T (- au 5Z ax, ax, 49u,
aft 0fl Of,02ft 02fl aft Of,
a2fl
-au(rauaxt-au0xs+auaxtft+auaxef2+autu> =0
(13)
or, solving for u,
aft 02f, afl afl aft Ofl
au'au axt fl + aua2ft
axe f 2 - U ax au x2)
Of, 02f2 aft aft aft aft1
is =
au (_O'ft
au axlfl + au ax2f2 au axl au axt
(14)
Of, 02f2 aft atfl
auau2 -auau2
Equation (14), which has been called a generalized Euler equation,
is an ordinary differential equation for the optimal decision function u(t),
which must be solved together with Eqs. (5a) and (5b) for xt and X2-
A boundary condition is still required, and it is obtained by evaluating
Eq. (10) at t = 0 with the values of ¢l, 02 obtained from Eqs. (7c) and
(7d) :

cafutft+a2=0 att=8 (15)

The problem can then be solved numerically by searching over values of


u(0) and solving the three differential equations until the stopping con-
dition, Eq. (15), is satisfied. Equivalently, for each, initial value u(0)
there is a process running time for which u(O) is optimal, and this cor-
respondence is found when Eq. (15) is satisfied.
The reader will recognize that this section parallels Sec. 1.11 for
discrete systems. In each case it has been possible to eliminate the
multipliers by use of the optimality conditions. It should be clear that
128 OPTIMIZATION BY VARIATIONAL METHODS

the algebraic conditions needed to pass from Eqs. (10) and (12) to (13)
require that the number of state variables exceed the number of decision
variables by no more than 1, so that there will be at least as many Itb'mo-
geneous equations as multipliers.

4.12 OPTIMAL TEMPERATURES FOR CONSECUTIVE REACTIONS


We now make use of the results of the previous section to consider the,
continuous analog of the problem of Sec. 1.12. A system of consecutive
reactions is assumed to take place,
X1-> X2 -* decomposition products
and the reaction is to be carried out in a batch reactor. X2 is the desired
product, and we seek to maximize the increase in value of the contents
of the reactor after an operating period 0 by adjusting the reactor temper-
ature u(t) in time.
The equations describing the course of the reaction are
±1 = -k1(u)F(x1) (la)
(that is, the rate of reaction of X, depends only on temperature and con-
centration of X1)
x2 = v'kl(u)F(Z1) - k2(u)G(x2) (lb)
(The rate of change of concentration x2 is the difference between the
rate of formation from Xl and the rate of decomposition. The latter rate
depends only on temperature and concentration of X2.) The coefficient
v is introduced to account for changes in reaction stoichiometry (the
number of molecules of Xl needed to form a molecule of X2). It should
be noted that the equations describing the 'course of this reaction in a
pipeline reactor in which diffusion is negligible are identical if t is inter-
preted as residence time, or length into the reactor divided by fluid
velocity. Hence, we may look upon this problem as that of determining
the best that could be accomplished in a pipeline reactor if it were possible
to specify the temperature at every point, and we shall generally refer
to the.function u(t) as the optimal temperature profile. In the latter case
the true reactor design problem would be that, of approaching the upper
bound represented by the optimum profit with a practical heat-exchange
system, since in a real ,reactor the temperature cannot be. specified at
every point in space.
The increase in value of the product has precisely the form of
Eq. (2) of the preceding section, where c represents the value of the feed
Xl relative to the desired product X2. Clearly c < 1. Equation (14)
CONTINUOUS SYSTEMS: I in
of Sec. 4.11 for the optimal temperature is
vF(xl)G'(x2)[kl(u)ks(u) - k'(u)k2(u)}k'(u) (2)
u
G(x2)[ki (u)k2(u) - ki(u)kz (u))
where the prime denotes differentiation with respect to the argument, and
the stopping condition, Eq. (15), is
ki(u)F(x,)(v - c) - k'(u)G(x2) = 0 t=0 (3)

More specifically, if the functions k, and k2 are of Arrhenius form

k; = k;o exp - u` i = 1, 2 (4)

where k,o, k2o, E;, and E2 are constants, then

u
vF(x,)G'(x2) I - E'1J
E2'G(x2) k io exp u (5)

and the final condition on the temperature is

u
E2-E,' t=B (6 )
In E2kzoG(x2) 1

1(y - c)E;kioF(xi) J
Equation (5) requires that the optimal temperature always decrease
in time or with reactor length. When E2 > Ei, a high temperature
favors the second reaction with respect to the first, and a decreasing
temperature makes sound physical sense, for it suggests a high tempera-
ture initially to encourage the reaction X, -- X2 when there is little X2
to react and then a low temperature in the latter stages in order to prevent
the decomposition of the valuable product X2. On the other hand, if
E' > Ez, a decreasing temperature profile contradicts the physical intui-
tion that since the reaction X, --> X2 is favored with respect to the
decomposition of X2, the highest possible temperature is optimal at all
times. In Chap. 6 we shall develop a condition analogous to the second-
derivative test of Sec. 1.3 which verifies this physical reasoning and
demonstrates that Eq. (5) defines an optimum only when E2 > E.
The procedure for obtaining the optimal temperature profile and
optimal profit is as described at the end of Sec. 4.11. The feed composi-
tions x,(0) and x2(0) are presumed known, and a value is assumed for
u(0). Equations (1) and (5) are then integrated simultaneously until
Eq. (6) is satisfied, and u(0) is varied and the procedure repeated until
the solution of Eq. (6) occurs at t == 0. Amundson and Bilous have
carried out such solutions for several cases.
130 OPTIMIZATION BY VARIATIONAL METHODS

4.13 OPTIMAL CONVERSION IN A PRESSURE-CONTROLLED REACTION


As a further example of the use of the generalized Puler equation and for
purposes of reference in our later discussion of computation we shall con-
sider a second type of optimal-yield problem, the maximization of inter-
mediate conversion in a consecutive-reaction sequence carried out in a
pipeline reactor where the reaction rate is dependent not upon tempera-
ture but upon pressure. The reaction is
X, -> 2X2 -+ decomposition products
where the first reaction is first-order and the second second-order. Con-
centrations are denoted by lowercase letters and total pressure by u.
Assuming ideal gases and Dalton's law of additive partial pressures,
the state equations may be written
x, _ -2k,u A + x2 xj(0) = xio (la)
z
x2 = 4k,u A + x2 - 4k2u2 (A x2(0) = x20 (lb)
+x2)2
where k1 and k2 are positive constants and A = 2x,o + x20. To maxi-
mize the conversion of intermediate we have.
(P = x2(0) (2)
in which case the parameter c defined in Sec. 4.11 is zero. Performing
the required differentiations, Eq. (14) of Sec. 4.11 for the optimal pres-
sure is then found to be
4uz k2ux2 a
x2(A + x2)2 [k l
Ax , + (A + x2) (3)

with the boundary condition from Eq. (15) of Sec. 4.11,


U = k1x,(A + x2) at t = 0 (4)
2k2x2s
Equation (3) indicates that the optimal pressure decreases with
reactor length, and if x20 is small, very steep gradients may be required
near t = 0. It is, of course, impossible to specify the pressure at 'each
point in a pipeline reactor, so that the optimal conversion calculated by
the solution of Eqs. (1), (3), and (4) provides an upper bound for evalu-
ating the results of a practical reactor design.

BIBLIOGRAPHICAL NOTES
Sections 4.2 and 4.3: The derivation follows
J. M. Douglas and M. M. Denn: Ind. Eng. Chem., 57(11): 18 (1965)
CONTINUOUS SYSTEMS: I 131

The results obtained here are a special case of much more general ones derived in subse-
quent chapters, and a complete list of references will be included later. A funda-
mental source for Chaps. 4 to 8 is
L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko:
"Mathematical Theory of Optimal Processes," John Wiley & Sons, Inc., New
York, 1962

Section 4.4: Any of the texts on the calculus of variations noted in the bibliographical
notes for Chap. 3 will contain a discussion of the corner condition.

Section 4.6: This section is based on

L. I. Rozenoer: in Automation and Remote Control, I, Proc.1st IFAC Congr., Moscow,


1980, Butterworth & Co. (Publishers), Ltd., London, 1961
The principal of least action is discussed in books on classical mechanics, such as
H. Goldstein: "Classical Mechanics," Addison-Wesley Publishing Company, Inc.,
Reading, Mass., 1950
L. D. Landau and E. M. Lifshitz: "Mechanics," Addison-Wesley Publishing Company,
Inc., Reading, Mass., 1960

Section 4.6: The model of a stirred-tank reactor and an analysis of its transient behavior
are contained in
R. Aris: "Introduction to the Analysis of Chemical Reactors," Prentice-Hall Inc.,
Englewood Cliffs, N.J., 1965
This is also an excellent source of details on other reactor models used as examples through-
out this book.

Section 4.7: This section follows the paper by Douglas and Denn cited above. The basic
work i
R. E. Kalman: Bol. Soc. Mat. Mex., 5:102 (1960)
A more general discussion is included in Chap. 8, and an extensive survey of optimal
linear control is contained in
M. Athans and P. Faib: "Optimal Control," McGraw-Hill Book Company, New
York, 1966
The reader unfamiliar with the conventional approach to process control may wish-to
consult a text such as
D. R. Coughanowr and L. B. Koppel: "Process Systems Analysis and Control,"
McGraw-Hill Book Company, New York, 1965
'D. D. Perlmutter: "Chemical Process Control," John Wiley & Sons, Inc., New York,
1965
J. Truxal: "Automatic Feedback Control System Synthesis," McGraw-Hill Book
Company, New York, 1955

Section 4.8: The expansion technique for obtaining nonlinear feedback controls is due
to Merriam; see
132 OPTIMIZATION BY VARIATIONAL METHODS

C. W. Merriam: "Optimization Theory and the Design of Feedback Control Systems,"


McGraw-Hill Book Company, New York, 1964
A. R. M. Noton: "Introduction to Variational Methods in Control Engineering,"
Pergamon Press, New York, 1965
Sections 4.9 and 4.10: The references by Kalman and Athans and Falb cited above are
pertinent here also, and the discussion is expanded in Chap. 8. The consequences
of the use of 0 as the cost term in relating optimal-control theory to conventional
feedback control practice is the subject of research being carried on in collaboration
with G. E. O'Connor; see
G. E. O'Connor: "Optimal Linear Control of Linear Systems: An Inverse Problem,"
M. Ch. E. Thesis, Univ. of Delaware, Newark, Del., 1969
Sections 4.11 to 4.15: The generalized Euler equation was obtained in
M. M. Denn and R. Aris: Z. Angew. Math. Phys., 16:290 (1965)
Prior derivations specific to the optimal-temperature-profile problem are in
N. R. Amundson and O. Bilous: Chem. Eng. Sci., 6:81, 115 (1956)
R. Aris: "The Optimal Design of Chemical Reactors," Academic Press, Inc., New
York (1961)
F. Horn: Chem. Eng. Sci., 14:77 (1961)
Both the optimal temperature- and pressure-profile problems were studied in
E. S. Lee: AIChE J., 10:309 (1964)

PROBLEMS
4.1. The pressure-controlled chemical reaction A = 2B, carried out in a tubular
reactor, is described by the equation for the concentration of A
4(xo - x)=
z = -k,u 2xox - x + k2u!
(2xo - x)'
where x0 is the initial value of x, u is the pressure, and k, and k2 are constants. x(8)
is to be minimized. Obtain an algebraic equation for the theoretical minimum value
of x(8) in terms of e, k,, k2, and x0. For comparison in ultimate design obtain the
equation for the best yield under constant pressure. (The problem is due to Van de
. Vusse and Voetter.)
4.2. Batch binary distillation is described by the equations
z1--u
u
i2 = [x2 - F(x2,u)]
x1

Here x, denotes the total moles remaining in the still,


with initial conditions x,o, x2o.
x2 the mole fraction of more volatile component in the still, u the product withdrawal
rate, and F the overhead mole fraction of more volatile component, a known function
of x2 and u which depends upon the number of stages. The withdrawal rate is to be
found so as to maximize the total output
!o e
max 61 = u(t) dt
CONTINUOUS SYSTEMS: 1 133

while maintaining a specified average purity


e
o Fu dt
F=
u dt
IoB

Formulate the problem so that it can be solved by the methods of this chapter and
obtain the complete set of equations describing the optimum. Describe a computa-
tional procedure for efficient solution. (The problem is due to Converse and Gross.)
4.3. Consider the linear system -

x=u
X(0) = xa i(0) = yo
x(8) = 0 x(8) = 0
and the objective to be minimized,
1
min 8 = a U2(t) dt
2
(a) Find the unconstrained function u which minimizes 8 for a fixed 8.
(b) Examine the nature of the minimum 8 in part (a) as a function of 8 graphi-
cally. Comment on the sensitivity of the solution to changes in 8.
(c) Find the unconstrained function u which minimizes 8 for unspecified. B.
Comment on the significance of the solution in terms of the results of part (b). (The
problem is due to Gottlieb.)
4.4. Solve the control problem of Sec. 3.4 by the methoc of Sec. 4.7.
4.5. Consider the nonlinear system
i = f(x) + b(x)u
x(0) - xa x(8) - 0
where f and b are continuous differentiable functions of x. Show that the optimum
unconstrained function u which minimizes
8 = fo [9(x) + c'u=] dt
with c a constant and g a nonnegative continuous differentiable function of x, has th
feedback form

ua f(x) f(x) ' [b(x)1'[g(x) + j61


b(x) b(x) { x + x'c'
where d is a constant depending on xa and 8. Suppose 8 is unspecified? Compare the
solution of Sec. 3.4. (The problem is due to Johnson.)
4.6. Extend the analysis of Sec. 4.7 to the case
z, = a,zx, + biu
i, - a21X1 + a:,x,, + b,u
min 8 = 3'z[C,e,1(6) + C2e,'(8) + foe (9 + c,e;' + c,u') dt]
where
el =x, -x,` e2:x2 -x;
x,` and x, are some desired values. Obtain equations for the gains in the optimal
controller.
134 OPTIMIZATION BY VARIATIONAL METHODS

4.7. The kinetics of a one-delayed-neutron group reactor with temperature feedback


proportional to flux are

it = A (un - ant - pn) + yc

Here n is the neutron density, c the precursor concentration, u the reactivity, and the
constants A, y, a, and 0, respectively, the neutron generation time, the decay constant,
the power coefficient of reactivity, and the fraction of neutrons given off but not
emitted instantaneously. Initial conditions no and co are given, and it is desired to
bring the neutron density from no to ono in time 8, with the further condition that n(8)
be zero and the effort be a minimum,
e
minc =21 fo u'dt
Obtain the equations needed for solution. (The problem is due to Rosztoczy, Sage,
and Weaver.)
4.8. Reformulate Prob. 4.7 to include the final constraints as penalty functions,

min c a 3j { C,[n(8) - ono)' + C2[c(8) - c']2 + J0 u2 dt}

(What is Obtain the equations needed for solution. Normalize the equations
with respect to ono and obtain an approximate solution with the approach of Sec. 4.8,
utilizing the result of Prob. 4.6.
4.9. Let I be an inventory, P production rate, and S sales rate. Then
1*-P-S
Assuming quadratic marginal costs of manufacturing and holding inventories, the
excess cost of production for deviating from desired, values is

fa IC,[I(t) - !J' + C,[P(t) - PJ'[ dt


where 9 is fixed, I and P are the desired levels, and C1 and CP are constant costs.
If the sales forecast S(t) is known, determine the optimal production schedule P(t),
0 < t < 8. Would a feedback solution be helpful here? (The problem is due to
Holt, Modigliani, Aluth, and Simon.)
4.10. Let x denote the CO2 concentration in body tissue and u the pulmonary ventila-
tion. An equation relating the two can be writtent
of + (a1 + azu)x + a3ux = a4 + a6u

where the a; are constants. Find an approximate feedback solution for the "control"
u which regulates the CO2 level by minimizing

& -2f
1
o
e
(x' + c'u') de

It is commonly assumed that u - a + bx.


t The model equation is due to Grodins et al., J. Appl. Phys., 7:283 (1954).
5
Continuous Systems: II

5.1 INTRODUCTION
We now generalize our discussion of systems described by ordinary differ-
ential equations somewhat by relaxing the requirement that the optimal
decision functions be unconstrained. Complete generality must await
the next chapter, but most situations of interest will be included within
thescope of this one, in which we presume that the optimal decision
functions' may be bounded from above and below by constant values.
Typical bounds would be the open and shut settings on valves, safety
limitations on allowable temperatures and pressures, or conditions
describing the onset of unfavorable reaction products.
We again assume that the state of the system is described by the N
ordinary differential equations

fi(x1/ir4) ,zN,7L1,262, ,26R)


i = 1, 2, .x , N
-4
0<t<9 (1)
where we wish to choose the R functions ut(t), u4(t), ... , uR(t) in
135
W OPTIMIZATION BY VARIATIONAL METHODS

order to minimize an integral


e
8[u1,212, . . . ,UR) = f0 if (xl,x2) . . . ,xN,ul,u2, . . . U R) dt (2)

We now assume, however, that the functions uk(t) are bounded


uk. < Uk(t) < u,' k = 1, 2, . . . ,R (3)
where the bounds uk are constants. The absence of a lower bound
simply implies'that uk. -+ - oo,. while the absence of an upper bound
implies that u,' -- + 0o .

5.2 NECESSARY CONDITIONS


For simplicity of presentation we again restrict our attention to the special
case N = R = 2, although the results will clearly be general. Thus, we
consider the state equations
Zl = f1(x1,x2,u1,u4) (la)
x2 = f2(xl,x2,u1,u2) (lb)
with constraints
u1. < u1 < ui (2a)
U2* < u2 < u2` (2b)
and objective
Fi[u1,u2) = f o' dt (3)

In Sec. 4.2 we derived an equation for the change in E which results


from small changes in ul and u2. In doing so we were never required to'
specify those changes, Sui(t) and bus(t). Thus, as long as we stipulate
that those variations be admissible--i.e., that they not be such that ul or
u2 violates a constraint-then Eq. (18) of Sec. 4.2 remains valid, and we
can write
a = fa (au aul + O2 Su_1 dt + o(E) >_ o (4)

Here we have used the hamiltoni/an notation of Sec. 4.3


H = if + Xifl + X2f2 (5)
where the multipliers satisfy the canonical equations
aH aH
x1 = - axl x2 = - axe (6)

with boundary conditions from Eq. (13) or (16) of Sec. 4.2, depending
upon whether or not 0 is specified.
CONTINUOUS SYSTEMS: II 137

We shall presume that in some way we know the optimal functions


ul(t), u2(t) for this constrained problem. Each may be only piecewise
continuous, with segments along upper and lower bounds, as well as
within the bounds, as, for example, in Fig. 5.1. It is then necessary
that for all allowable variations SS > 0. We first choose Sul = 0 when-
ever ul is equal to either ul* or u*, and similarly for Sue. It then follows
that since we are not at a constraint, we may vary ul and u2 either posi-
tively or negatively (Fig. 5.1). Thus, whenever H is differentiable, we
may choose the particular variations
H
Sul = -e aul auR
aH (7)
R

for sufficiently small e' and obtain the same result as before:
When the optimal decision u; (i = 1 or 2) lies between the constraints
u;. and u* and the hamiltonian is differentiable with respect to u;,
it is necessary that the hamiltonian be stationary with respect to u,
(aH/au; = 0).
Let us now consider what happens when tit = u*. For convenience
we set Sue = 0 for all t and Sul = 0 whenever ul 96 u,*. Because of the
constraint all changes in ul must be negative (Fig. 5.1), and so we have
Sul < 0 (g)
Let us make the particular choice

Sul=elaH<0 (9)

where we cannot set the algebraic sign of El since we do not know the sign

u,
_T8

8u, may be
Su, moy only positive or
be negative negative when
when u,=u*
8u, may only
be positive but > 0
when u,=u,*
But 0
eui2
u, * -------------1 j- - bue20

Fig. 5.1 Allowable variations about the optimum decision


function.
13$ OPTIMIZATION BY VARIATIONAL METHODS

of all/au,. From Eq. (4) we then have


s

J
r
(3u > dt + o(f) > 0 (10)

Thus, el > 0, and from Eq. (9),


aH,<
aul- 0
Since u, decreases as we move into the interior of the allowable region, it
follows from the sign of the derivative in Eq. (11) that H is increasing
or that the hamiltonian is a minimum relative to u, when u, = u,*. (In
an exceptional case it might be stationary at u; , and the nature of the
stationary point cannot be determined.)
We can repeat this analysis for sections where u, = u,., and there-
fore &u, > 0 (Figure 5.1) and we obtain the same result. The symmetry
of the problem establishes the result for us as well. If u, or us lies along
an interior interval at a constant value where H is not differentiable but
one-sided derivatives exist, then an identical proof establishes that at these
points also H is a minimum.. Furthermore, the hamiltonian is still a
constant, for whenever the term all/au, (or OH/au:) in Eq. (9) of Sec. 4.3
does not vanish along the optimal path for a finite interval, the optimal
decision u, (or us) lies at one of its bounds or at a nondifferentiable point
and is a constant. In that case du,/dt = 0 (or dus/dt) and the products
aH du, -
aH au2ddU2
au, dt and t
- always vanish, leading to the result that,iH/dt = 0.
We summarize the results of this section in the following weak form
of the minimum principle:
Along the minimizing path the hamiltonian is made stationary by an.
optimal decision which lies at a differentiable value in the interior of the
allowable region, and it is a minimum (or stationary) with respect
to an optimal decision which lies along a constraint boundary or at a
nondifferentiable interior point. The hamiltonian is constant along
the optimal path, and the constant has the value zero when the stopping
time 0 is not specified.

5.3 A BANG-BANG CONTROL PROBLEM


As the first example of the use of the necessary conditions derived in the
previous section let us consider the control of a particularly simple
dynamical system, described by the equation
1 = u (1)
CONTINUOUS SYSTEMS: II in

or, equivalently,
it=x: (2a)
2==u (2b)

We shall suppose that the system is initially at some state x1(0), x:(0)
and that we wish to"choose the function u(t), subject to the boundedness
constraints
u*=-1<u<+1=u* (3a)
or
Jul < 1 (3b)
in order to reach the origin (x1 = x= = 0) in the minimum time; that is,
6=fu 1dt=0 (4a)
=1 (4b)

The hamiltonian for this system is


H = 1 + X1x2 + xsu (5)

and since 0 is unspecified, the constant value of H along the optimal path
is zero. The canonical equations for the multipliers are
8H = 0 (6a)

X2= -aaH= -x1 (6b)

and since four boundary conditions are given on the -state variables, the
boundary conditions for the multipliers are unspecified. Equation (6a)
has the solution
Xi. = cl = const (7a)

and Eq. ON the solution


x= _ -clt - c2 (7b)

where c1 and C2 are unknown constants of integration resulting from the


unspecified boundary conditions.
It is of interest first to investigate whether u may take on values in
the interior of the allowable region. In that case the condition for opti-
mality is
dH=0=x:= -clt - c: (8)
au
Equation (8) cannot be satisfied for any finite interval of time unless
both c, and Cl, the slope and intercept of the straight line, vanish. In
IQ OPTIMIZATION BY VARIATIONAL METHODS

that case 1\x is also zero [Eq. (7a)] and the hamiltonian, Eq. (5), has the
value unity. Since we have already noted that the optimal value of H
must be zero, it follows that the necessary conditions for a minimum
can never be satisfied by a control function which is in the interior of
the allowable region for any finite time interval.
The only possibilities for the optimum, then, are u = +1 and
u = -1. A control system of this type, which is always at one of its
extreme settings, is known as a bang-bang or relay controller. A typical
example is a thermostat-controlled heating system. We note that the,
question of when to use u = + 1 or u = -1 depends entirely upon the
algebraic sign of X2, for when X2 is positive, the hamiltonian is made a
minimum by using u = -1 (-11\2 < + 11\2, X2 > 0), while when 1\2 is
negative, the hamiltonian is minimized by u = +1 (+11\2 < -11\2,
X2 < 0). Thus, the optimal policy is
u = - sgn X2 = sgn (cxt + c2) (9)
Here the sgn (signum) function is defined as
_ (10)
sgn y Iyl = { ±1 y<0
and is undefined when y = 0.
We now have sufficient information to solve the system differential
equations, starting at xx = x2 = 0 and integrating in reverse time, i.e.,
calling the final time t = 0 and the initial time - 0. The condition
H = 0 establishes that C2 = 1, and for each value of cl, - co < cx cc,
we shall define a trajectory in the xxx2 plane, thus flooding the entire
plane.with optimal trajectories and de6ning a feedback control law.: In
this case, however; the entire, problem can -,be solved more siipply by
analytical methods.
We note first that the argument of the signum function in Eq. (9)
can change sign at most once. Thus the optimal solution may switch
from one extreme value to the other at most onee.t During an interval
in which the optimal control policy is u = +1 the system equations (2)
become
±1 = x2 (11a)
is = 1 (llb)
or
x2=t+C: (12a)
'/ + c3)2 + (C4 -
xx = %t2 + Cat + C4 = 72(t
2
(12b)
2
t It can be demonstrated that for the time-optima! control of an nth-order dynamical
system with all real characteristic roots the number of switches cannot exceed 1. less
than the order of the system.
CONTINUOUS SYSTEMS: 11 1a,

X2

Fig. 5.2 Possible responses for u = + 1.

Thus, a first integral is


x1 = 112 x22 + C (13)
which defines the family of parabolas shown in Fig. 5.2, the arrows indi-
cating the direction of motion. Note that the origin can be reached only
along the dashed line x1 = /2x22, x: < 0, so that if u = +1 forms the
last part of an optimal trajectory, this must be the path taken. In a
similar way, when u = -1, we obtain the family of parabolas
xi = -12x22 + c (14)
shown in Fig. 5.3, with the only possible approach to the origin along the
dashed line x1 = -112x22, x2 > 0.
When the two sets of trajectories are superimposed, as in Fig. 5.4,
the optimal policy becomes obvious at once. The approach to the origin
must be along the dashed line, which has the equation
x1 + 112x21x21 = 0 (15)

and at most one switch is possible. The only way in which initial states
below the dashed line can be brought to the origin in this manner is to

x2

X,

Fig. 5.3 Possible responses for u = -1.


142 OPTIMIZATION BY VARIATIONAL METHODS

XZ

XI

Fig. S.4 Superposition of all possible


responses with bang-bang control.

use the control u = +1 until the resulting trajectory intersects the line
of Eq. (15) and then to switch to u = -1 for the remainder of the
control time. Similarly, initial states above the dashed line are brought
to the origin by employing the control action u = -1 until intersection
with the dashed line [Eq. (15)] followed by u = -1. This defines the
optimal feedback control policy, and only the switching curve [Eq. (15)]
ief required for implementation. The optimal trajectories are then as
shown in Fig. 5:5.

S.4 A PROBLEM OF NONUNIQUENESS


The simple dynamical system considered in the previous section may be
used to illustrate another feature of solutions employing the minimum
principle. We now suppose that we wish to solve the minimum-time
problem to drive xL to zero, bht we do not choose to specify x2(8). The
analysis is essentially unchanged, but because x=(8) is unspecified, we now
must invoke the boundary condition
X2(8) = 0 (1)

X2

X1

Fig. S.S Time-optimal paths to the origin.


CONTINUOUS SYSTEMS: II 143

or, from Eq. (7b) of the preceding section,


X2(O) -CIO - c2 = 0 (2)

Thus,
X2(t) = cl(O - t) (3)

which cannot change algebraic sign, and therefore the optimal control
function, defined by Eq. (9) of Sec. 5.3, must always be +1 or -1, with
no switching possible.
Figure 5.6 shows the trajectories in the right half-plane. For start-
ing values above the dashed line x1 + 112x2Ix2I = 0 the line x1 = 0 can
be reached without switching only by using the policy u = -1. For
starting points below the line x1 + t3 x2Ix21 = 0, however, the x2 axis
can be reached without switching by using either u = + 1 or u = -1.
Thus, even in this simplest of problems, the minimum principle does not
lead to a unique determination, and the true optimum must be dis-
tinguished between the two candidates by other considerations.
In this case the true optimum can be determined analytically.
Setting u = ± 1 and dividing Eq. (2a) of Sec. 5.3 by (2b), we obtain the
equation for the tangent to each trajectory passing through a point

tan a = dz2 = x2 u = +1 (4a)

tan = d22 = -x2 u = -1 (` b)

Fig. 5.6 Two possible paths to the xZ axis


satisfying the necessary conditions.
144 OPTIMIZATION BY VARIATIONAL METHODS

Thus, referring to Fig. 5.6, the line segments Q,P2 and Q,P, are equal in
magnitude. But integrating Eq. (2a) of Sec. 5.3,
8 = x2(8) - x2(0) = QOQ2 u = +1 (5a)
B = x2(0) - x2(8) = QoQ1 u = - 1 (5b)

and, by inspection,
QOQ2 > QOP2 = QOP1 > Q0Q1 (6)
Thus, u = -1 leads to the shorter time in the entire right-hand plane..
By similar reasoning, when x, < 0, the optimal policy is u = +1.

5.5 TIME-OPTIMAL CONTROL OF A STIRRED-TANK REACTOR


We shall now return to the problem of the control of a stirred-tank
chemical reactor introduced in Sec. 4.6. The dynamical equations for
the reactor after an upset, linearized about the desired steady-state
operating conditions, were shown to be of the form of the general second-
order system
x1 = a11x1 + a12x2 + b11u1
12 = a21x1 + a22x2 + b21u1 + b22u2
where x1 and x2xire the deviations from steady state in reduced concen-
tration and temperature, respectively, while u1 and u2 are the variations
in process and coolant flow rates. It was also shown that after a linear
transformation of the dependent variables the system could be repre-
sented by the equations
y1 = S1y1 M11u1 - M12u2 (2a)
y2 = S2Y2 + M21u1 + M12u2 (2b)

where the parameters S1, S2, M11, M12, and M21 are defined in Sec. 4.6 by
Eqs. (19) and (21). In this section we shall consider the problem of
returning the system from some initial state y1(0), 1/2(0) to the steady
state y1 = Y2 = 0 in the minimum time by choice of the functions u1(t),
u2(t), subject to the operating constraints on the flow rates
u,. < u1 < u; (3a)
U2* < u2 < us (3b)

For the minimum-time problem the function if is equal to unity,


and so the hamiltonian is
H= 1 + A1(S1y1 - M11u1 - M12u2) + X2(S2y2 + M21u1 + M12u2)
CONTINUOUS SYSTEMS: II 143

and the equations for the multipliers are


Al _ - 4911
= -S1X1 (5a)
ay,
aH
X2 = - = - S2X2 (5b)
aye
These last equations may be integrated directly to give
X1(t) = Aloe-s,` (6a)
X2(t) = Xye-s,e (6b)
although the initial conditions X1o, X2o are unknown.
The possibility that u1 may lie somewhere between its bounds is
considered by setting aH/au1 to zero
aH
au1
= -M11A1 + M21X2 = 0 (7)

Substitution of Eqs. (6) and (7) demonstrates that this.equality can hold
for more than a single instant only if S1 = S2, a degenerate case which
we exclude. Thus, the optimal u1 must always lie at a bound, u1. or
ui , and the"same may easily be shown true for u2. The coefficient of u1
in Eq. (4) is -X1M11 + X2Mf21, so that the hamiltonian is minimized with
respect to u, by setting u1 equal to the smallest possible value when the
coefficient is positive and the largest possible value when the coefficient
is negative, and similarly for u2:
ui M21X2 < 0
U, = (8a)
u,. -M11A1 + M21A2 > 0
u2 1112(X2 - X1) < 0
U2 = (8b)
u2 M12(X2 - X1) > 0
For the parameters listed in Table 4.1 we have S1 < S2 < 0,
M21 > M11 > 0, M12 < 0. Using Eqs. (6)f we can then rewrite the
optimal control policy in Eqs. (8) as
u1 11> 0
u1 =
X20 (r A go
(9a)
u1. X20 (r ecs,-s,I: - 1 1 < 0
X 10

io
- X20 020 1 > 0
U2 = (9b)
U2* X20 (\X20
X10 ecs2-s,1t -11< 0
//
where

r=Mzi<1 (10)
146 OPTIMIZATION BY VARIATIONAL METHODS

The structure of the optimal control policy may now be deduced in a


manner similar to that in the two previous sections.
If Xlo and 1\2o have opposite signs or have the same sign with
1\1o/A20 < 1, the quantities r(A1o/A2o)e($,-s,>r - 1 and tX1o/A2o)e(S1-s,)' - 1

are both always negative, and depending on the sign of X20, it folldws
from Eqs. (9) that the optimal control must always be either the pair
(u; ,u2.) or (u1*,u2 ), with no switching possible. These pairs may also
occur when A1o and A2o have the same algebraic sign with A1o/X20 > 1 but
only when t is sufficiently large for (A1o/1\20)e(s; s=)1 to be less than unity.
If 1 < A10/A20 < 1/r, the initial policy is (ul.,u2.) if A20 > 0, fol-
lowed by (u1.,ug) if the origin has not been reached after a time 't2 such
that
A1° 1=0 (11 a)
X20

which is the criterion of switching in Eq. (9b), or

t2 _ I Ago
(llb)
S,2 S11n A1o

with no further switching possible. Similarly, if A2o <.0, the sequence is


(u; ,u; ), (u; ,u2.), with the switching time defined by Eq. (11).
The remaining possibility is that A1o/A2o > 1/r. If A20 > 0, the
initial control policy defined by Eqs. (9) is (ui ,u2.), followed, if the origin
has not been reached after a time
1 A2o 1
t1 = In (12)
S2 _ S1 A1o r

by the policy (u1.,u2.). A further switch will occur at time t2 defined


by Eq. (llb) to the policy (u1.,us ), and no further switching is possible.
Thus, the total duration for which the system may be controlled by the
policy is given as

is - t1 = (in - In 1 S11n r (13)


S2 1 S1 A1o A1o r) S2

which depends only on the system parameters. In a similar way, if


A20 < 0, the sequence of optimal policies is (u1.,us ), (u; ,uz ), and (u2 ,u2.),
with the same switching time. This exhausts all possibilities, and it
should be noted that no single control variable may switch more than
once between limits in this second-order system.
With the sequence of possible control actions available to us we are
now in a position to construct the optimal feedback control policy. Since
CONTINUOUS SYSTEMS: II 147

the control action must be piecewise constant, an integral of Egs.-(2) is


r yi - (MII/SI)ul - (M12/SJ)uzl s,
y1, - (M22/S1)u2J
ys + (M21/S2)ui + (M1:/S:}u2 St (14)
Y2, + (M21/S2)u1 -1- (M12/S:)ua
where y1 ys, are values of yi, Y2 somewhere on the path. Thus the
paths leading to the origin may be obtained by setting y1, - yzr - 0
and putting the appropriate control policy into Eq. (14).
The line marked y+_ in Fig. 5.7 is a plot of Eq. (14) passing through
the origin with the-control policy (ui ,uz#). Since we have found that
this policy must always be preceded by the policy. (ui ,us ), the line y+_
must be a switching curve for trajectories with the control (u,*,ui ), for

0.02 \ a04

F10.5.7 Time-optimal paths to the origin in transformed coordinates


for the controlled reactor. [From J. M. Douglas and M. M. Denn,
Ind. Eng. Chem., 57(11):18 (1065). Copyright 1965 by the American
Chemical Society. Reprinted by permission of the copyright owner.]
148 OPTIMIZATION BY VARIATIONAL METHODS

otherwise these trajectories could not reach the origin by an. optimal
sequence. Similarly, the line y_+, corresponding to (u1.,u2 ), must be
the switching curve for trajectories with control (ul+,u2*)
By choosing points on the y+_ curve and solving Eqs. (2) with the
constant control policy (ui ,u2) for a time interval 1/(S2 - S1) In r we
obtain the curve where the optimal control must have switched from the
policy (u1.,u2 ), the y++ switching curve, and similarly for the y__ curve.
We obtain in this way a line for the y++ (or y__) curve which stops short
a finite distance from the origin, for we have seen that we can reach the
origin along an optimal trajectory prior to switching from (us,u= ). We
obtain the remainder of the y++ curve by setting x1, = x2r = 0 and
(u1,u2) = (ui ,u=) in Eq. (14), and similarly for y_ _. These switching
curves may fail to be smooth at the intersection of the two segments.
We have now divided the x1x2 plane into four sections, in each of
which the control action is completely specified, with the change in con-
trol indicated by reaching a boundary. We have, therefore, by deter-
mining the switching curves, constructed the optimal feedback control for
the time-optimal problem. The curves in Fig. 5.7 are calculated for the
values of the parameters given in Table 4.1, together with the constraints
-8<u1<+10=ui (15a)
u2.= -5<u2<15=?42 (15b)
while Fig. 5.8 shows the trajectories after transformation to the original
dimensionless concentration (Z1) and temperature (Z2) coordinates.
Only one switching was required for most trajectories, and initial con-
ditions for trajectories requiring more than one switching generally fall
too far frgm the origin in the Z1Z2 plane for a linearized solution to be
useful, in some cases generating trajectories which, lead to negative con-
centrations. It is interesting to observe that many of the optimal tra-
jectories which approach the y++ and y__ curves do so with a common
tangent.
At this point it is useful to note again an alternative method of
solution which is well suited to automatic digital computation. We
make use of the fact that for problems with unspecified total operating
times the hamiltonian has the value zero. Thus, when the origin has
been reached, from Eq. (4),

X2(e)
_- X 1(6)[M11u1(8) + M12u2(B)} - 1 (16)
M21u1(e) + M12u2(e)
If we specify the final values u1(6) and u2(8), Eq. (16) definer a unique
relation between A1(8) and X2(8); for some range of values of A1(O) this
relation will be consistant with the requirements of Eq. (9) for the choice
of u1i u2. For example, the final policy (ui,us) requires, after some
CONTINUOUS SYSTEMS: (I 149

3.00

290

2.80

N2.70

2.60
a
E
r°- 2.50

2.40

2.30

I (
2.20 i 1 1 1

0 0.04 0.08 0.12 O.t6


Composition tti '

R9. 6.E Time-optimal temperature-concentration paths.for the con-


trolled reactor. [From J. M. Douglas and M. M. Denn, Ind. Eng.
Ch.m., 67(11):18 (1965). Copyright 1965 by the American.Chemieat
Society. Reprinted by permission of the copyright owner.)

algebra, the satisfaction of the two inequalities


M:lu; + M12u, > 0 (17a)
which is a limitation imposed by the physical properties of the system,
and

We) > (17b)


1

Similar relations can be found for other policies.


For a given X1(8) we then have values X1(6), 12(e), y1(9) = 0,
yz(e) = 0, and we can integrate the four differential equations (2) and
(5) in the negative time direction, monitoring the combinations in Eq.
(9) at all times. When the sense of an inequality changes, we need sim-
ply make the appropriate change is the control and continue. In this
way we shall map out the switching curves and optimal trajectories as
we vary the values of X1(8) . over the range -- = < X1(9) < m. This
backward tracing technique will clearly be of the greatest use in non-
150 OPTIMIZATION BY VARIATIONAL METHODS

linear systems, where the analytical methods of this section cannot be


employed.

5.6 NONLINEAR TIME-OPTIMAL CONTROL


The practical design of a time-optimal control system for the stirred-
tank reactor for any but very small upsets will require the use of the
full nonlinear equations, and we shall build on the observations of the
previous section by following a recent paper of Douglas and considering
this more general problem. The nonlinear equations describing the reac-
tor, Eqs. (7) of Sec. 4.6, are

z1= y(1-Z1)-kZl (la)


UKq.
Z, = Z,) Z.) +kZ1 (1b)
V
(Z1
- VCDP( +
Kq,) (Z: -

where Z, is dimensionless concentration, Z2 dimensionless temperature,


and k has the form
k = ko exp ( AH)A f Z=J (2)

The constants are defined in Sec. 4.6, with numerical values given in
Table 4.1.
In order to avoid complication we shall assume that the flow rate q
is fixed at qs and that control is to be carried out only by varying the
coolant flow rate q. subject to the bounds
qes+u: <qc <_ qcs+U:. (3)

where the numerical values of U2. and u: are the same as those in the
previous section. The hamiltonian for time-optimal control is then

H = 1 + X1 y(1 -Z1) - kZiJ +a: I V (Z, --Z:)


UKga
VCpp(1 +
(Z, - Z.) + kZi] (4)

with multiplier equations


1 i - azI
8Il =.

(1 +k) x1-ka,
aH E'C,pkZI
X: = - az:. (-AH)A fZ:=
[.1 UKga
E'C pkl, 1 X, (5b)
+ V + 1'Dp(1 + Kq.) (-LH)AfZ:=
CONTINUOUS SYSTEMS: II 151

We first consider the possibility of intermediate control by setting


8H/8q, to zero:
K
aq = -a= C (Z2 - Z.) 1 +1Kq. = 0 (6)

This equation can hold for a finite time interval only if Z, = Z, or X2 = 0.


In the former case, Z2 constant implies that Z, is also constant, which is
clearly impossible except at the steady state. On the other hand, if X
vanishes for a finite time interval then so must its derivative, which,
from Eq. (5b), implies that X, also vanishes. But if X, and X2 are both
zero, then, from Eq. (4), H is equal to unity, which contradicts the neces-
sary condition that H = 0 when 8 is unspecified. Thus, the control for
this nonlinear problem is also bang-bang, and we shall have the solution
by construction of the switching surfaces.
The structure of the optimal control function is
qg q.s + u=` (Z: - Z.) X, > 0
q.s + us. (Zs - Z.)X2 < 0 (7)

Because of the nonlinear nature of Eqs. (1) and (5) analytical solutions
of the type employed in the previous section cannot be used to deter-
mine the maximum number of switches or the switching curves. It is
to be expected that in a region of-the steady state the behavior of the
nonlinear system will approximate that of the linearized system, so that
a first approximation to the switching curve can be obtained, by setting
q. equal, in turn, to its upper and lower limits and obtaining, respec-
tively, the curves y+ and y_, shown in Fig. 5.9. The optimal trajectories
can then be computed, using the policy us" above the switching curve,
us. below, and switching upon intersecting y+ or y_. Because of the
manner of construction of these curves no more than one switch will ever
be made. It should be noted that many of the trajectories approach the
y_ switching curve along a common tangent, as'in the linearized solution,
although this is not true of y+. No trajectories can enter the region of
negative concentration.
The verification of this solution must be carried out by the back-
ward tracing procedure described in the previous section. When steady
state has been reached, Eqs. (4) and (7) become

H = -1 + X1(e) IV (1 - Zis) - kZ,s] + X2(8) IV (Z1 - Zss)


UK%
VCpa(1 + Kq.)
(Zis - Z.) + kZ,s] = 0 (8)
q0(8) q.s + ui (Z23 - Z.)Xs(6) > 0
;9)
1 u2. (Zss - Z.)X2(8) < 0
U2 OPTIMIZATION BY VARIATIONAL METHODS

A choice of X2(8) uniquely determines q,(8) from Eq. (9), while Eq. (8)
determines X1(8). Thus, since x1(8) = x2(8) = 0, the four equations (1)
and (5) can be integrated simultaneously in the reverse time direction,
always monitoring the algebraic sign of (Z2S - Z,)X2. When this sign
changes, qe is switched to the other extreme of the range and the process
continued. This is done for a range of values of X2(e), and the locus of
switching points is then the switching curve for the feedback control sys-
tem. The trajectories and switching curves in Fig. 5.9 were verified by
Douglas in this way, except in the region where trajectories approach y_
along a common tangent, where extremely. small changes in X2(8) (of the
order of 10-6) are required to generate new trajectories.
It is important to recognize that the nonlinear differential equa-
tions (1) may admit more than one steady-state solution; in fact, the
parameters listed in Table 4.1 are such that three solutions are possible.
Thus, there is a separatrix in the x1x2 plane which is approximately the
line Z2 = 2.4, below which no trajectory can be forced to the desired
steady state with the given control parameters. The design steady state
is controllable only subject to certain bounded upsets, a fact which would
not be evident from a strictly linearized analysis.

5.7 TIME-OPTIMAL CONTROL OF UNDERDAMPED SYSTEMS


The systems we have studied in the previous several sections had the
common property that in the absence of control, the return to the steady
state is nonoscillatory, or overdamped. This need not be the case, and,

3.00

E
r 2.60

2.40'
0 0.02 0.04 0.06 0.08 0.10
Composition r,

Ffg. &9 Time-optimal temperature-concentration paths for the non-


linear model of the controlled reactor. [From J. M. Douglas, Chem.
Eng. Sci., 21:519 (1965). Copyright 1965 by Pergamon Press. Re-
printed by permission of the copyright owner.l
CONTINUOUS SYSTEMS: II 183

indeed, oscillatory behavior is often observed in physical processes. Dif-


ferent choices of parameters for the stirred-tank-reactor model would lead
to such oscillatory, or underdamped, response, and, since the structure of
the time-optimal control is slightly changed, we shall briefly examine this
problem.
The prototype of an oscillatory system is the undamped forced
harmonic oscillator,
2+X=u (1)
or
tl=x: (2a)
x4 = -xl + U (2b)
We shall suppose that u is bounded
fuI<1 (3)

and seek the minimum time response to the origin. The hamiltonian is
then
H = 1+Xixz-X2x1+X2u (4)
with multiplier equations

aH = Xz (5a)
i
aH = -al (5b)

The vanishing of X or X2 over a finite interval implies the vanishing


of the derivative, and hence of the other multiplier as well, leading to the
contradiction H = 1 for the time-optimal ease. Thus, intermediate con-
trol, which requires X2 = 0, is impossible, and the solution is again bang-
bang, with
u= -sgnX2 (6)
The solution of Eqs. (5) may be written
Xz = -A sin (t + 0) (7)
where the constants of integration A and 0 may be adjusted so that
A > 0, in which case Eq. (6) becomes
u = sgn [sin (t + 0)] (8)
That is, rather than being limited to a single switch, the controller
changes between extremes after each time interval of duration x.
The construction of the optimal feedback control proceeds in the
same way as in Secs. 5.3 and 5.4. When u = +1, the first integral of
154 OPTIMIZATION BY VARIATIONAL METHODS

Fig. 5.10 Possible responses of an


undamped system with bang-bang
control.

Eqs. (2) is
u= +1: (x,-1)2+(x2)4=R2 (9a)
a series of concentric circles centered at x, = 1, z2 = 0, while for u = -1
the integral is a series of circles centered at x, -1, x2 = 0:
u = -1: (x1 + 1)2 + (x2)2 = R2 (9b)
All trajectories must then lie along segments of the curves shown in Fig.
5.10, and since the control action changes after every v time units, a tra-
jectory can consist of at most semicireles. Thus, the approach to the
origin must be along one of the dashed arcs, which must also form part
of the switching curves, -y+ and -y-.
We can complete the construction of the switching curves by con-
sidering, for example, any point on y+. The trajectory leading to it
must be a semicircle with center at -1, and so the corresponding point
on the y_ curve can be constructed, as shown in Fig. 5.11. In this fashion
the switching curve shown in Fig. 5.12 is built up; with some typical tra-
jectories shown. Above the switching curve and on y+ the optimal con-
trol is u = +1, while below and on y_ the optimum is u = -1.

Fig. 5.11 Construction of the switching


curve for time-optimal control of an
undamped system.
CONTINUOUS SYSTEMS: II 155

Fig. 5.12 Time-optimal paths to the origin for an undamped


system.

The general second-order system,


z + alt + a2x = u (10)

which is represented by the chemical reactor, may be thought of as a


damped forced harmonic oscillator whenever the parameters al and a2
are such that the characteristic equation
m2+aim+a2=0 (11)

has complex roots. In that case a similar construction to the one above
leads to a switching curve of the type shown in Fig. 5.13 for the time-
optimal problem. We leave the details of the construction to the inter-
ested reader.

5.t A TIME-AND-FUEL-OPTIMAL PROBLEM


Although rarely of importance in process applications, aerospace prob-
lems often involve consideration of limited or minimum fuel expenditures
to achieve an objective. If the control variable is assumed to be the
156 OPTIMIZATION BY VARIATIONAL METHODS

X2,

Fig. 5.13 Switching curve for time-


optimal control of an underdamped
system.

thrust, then to a reasonable approximation the fuel expenditure may be


taken as proportional to the magnitude of the thrust, so that the total
fuel expenditure is proportional to Io Jul dt. Besides the obvious physi-
cal importance, optimization problems which involve the magnitude of
the decision function introduce a new mathematical structure, and so we
shall consider one of the simplest of such problems as a further example
of the use of the minimum principle.
The system is again the simple one studied in Sec. 5.3

and we shall assume that the total operating time for control to the origin
is unspecified but that there is A. premium on both time and fuel. The
objective which we wish to minimize is then

s = PO + fa lul dL = fa (p + lul) dt (3)

where p represents the relative value of time Sfuel. The hamiltonian is


H = p + Jul + )tiX2 + A2U (4)

with multiplier equations

l= -aH=0 (5a)

aH
xY
axY =
-al (5b)

or
Al = C1 = const (6a)
A2 = -clt + C2 (6b)
CONTINUOUS SYSTEMS: II 157

As usual, we first consider the possibility of a stationary solution


by setting aH/au to zero, noting that alul jau = sgn u, u 76 0:
aH=sgnu+X21+X2=0 (7)
au
But if X2 is a constant for a finite interval of time, X2 =-X, is zero, so
that Eq. (4) becomes
H=P+lul -usgnu=p+lul
- lul=P>O (8)

which contradicts the necessary condition that H = 0 for unspecified 0.


We note, however, that the hamiltonian is not differentiable at u = 0
because of the presence of the absolute-value term, and so we may still
have an intermediate solution if H is minimized by It = 0.
Whenever X2 < -1, the term Jul + a2u is less than zero when
at = + 1, while it is zero for It = 0 and greater than zero for u = -1.
Thus the hamiltonian is minimized by It = +1. Similarly, when X2 >
+1, the hamiltonian is minimized by u = -1. For - I < X2 < + 1,
however, l ui + X214 is zero for u = 0 and greater than zero for u = ± 1.
Thus, the optimal solution is
+1 X2 < -1
It = 0 -1<X2<+1 (9)
-1 +1 < X2
Since 1\2 is a linear function of time, the only possible control sequences
are + 1, 0, -1 and -1, 0, + 1, or part of either, with a maximum of
two switches. This is a bang-coast-bang situation, or a relay controller
with a dead zone.
We can show further that the final optimal control action must be
u +1 or It = -1. Ifu=Oatt= 0, when x, =x2=0,then, again,
H = p 96 0, which contradicts a necessary condition for optimality.
Thus, the approach to the origin and the switching curve from coasting
operation must be the time-optimal switching curve
xl + i2x2lx21 = 0 (10)

shown as the segments yo+ and yo- in Fig. 5.14. The switching curves
y-o from u = -1 to It = 0 and y+o from It = +1 tp u = 0 can be con-
structed analytically in the following way.
Let t2 denote the time that a trajectory intersects the switching
curve yo+ and t, the prior time of intersection with y_o. From Eqs.
(6b) and (9) we write
A201) = +1 = -c1t1 - c2 (Ila)
X2(t2) = -1 = -c1t2 - c2 (1 lb)
158 OPT`MIZATiON BY VARIATIONAL METHODS
X2

Fly. 5.14 Time-and-fuel-optimal switch-


ing curves and paths to the origin.

or, solving for c1,


2
C1 = (12)
12 - tl

Evaluating the hamiltonian at t = t1, where u = 0,


H = P + clx2(tl) = 0 (13)

or, eliminating c1 in Eqs. (12) and (13),


12-t1=- 2x2(11) P
(14)

In the interval t,1 < t < t2 the solution of Eqs. (1), with u = 0, is
x2(12) = x2(tl) (15a)
x1(12) = x1(11) + x2(tl)(12 - tl) (15b)

But, from Eq. (10), the curve yo+ has the equation
x1(12) = 32x22(t2) (16)
and combining Eqs. (14) to (16), we obtain the equation for the switch-
ing curve y_o

x1(11) = P 2p 4 x22(tl) (17)

We obtain the y+o curve in a similar way, with the entire switching curve
represented by
P 4
xl + p x2Ix21 = 0 (18)
CONTINUOUS SYSTEMS: II 15I

The two switching curves, together with typical trajectories, are shown
in Fig. 5.14.
Note that as p in which case only time is important, the
two switching curves coincide and the coast period vanishes, giving, as
expected, the time-optimal solution. As p -- 0, the y+o and y_o lines
tend to the x, axis, which we anticipate will be the fuel-optimal solution
(bang-coast). In fact the minimum fuel can be obtained by more than
one control action, and the one obtained here represents the solution
requiring the minimum time. Furthermore, the reader will note that
we are asking that the system move at zero velocity between states,
which is impossible, so that the limiting solution clearly does not exist,
but we can come arbitrarily close to implementing a fuel-optimal solu-
tion as p is allowed to become arbitrarily large.

5.9 A MINIMUM-INTEGRAL-SQUARE-ERROR
CRITERION AND SINGULAR SOLUTIONS
In Sec. 4.7 we studied the optimal control of the linear second-order
system, representing the stirred-tank chemical reactor, for an objective
in the form of an integral of squares of the state deviations and the
square of the control function, leading to a linear feedback control. The
u2 term in the objective may be rationalized as a penalty function to
keep the control action within bounds, but this goal can also be accom-
plished by the methods of this chapter. We thus return to the second-
order system with quadratic objective, but we shall now eliminate the
cost-of-control term from the objective and include bounds on the con-
trol action, the coolant flow rate.
After an appropriate change of variables the reactor equations
become Eqs. (4) of Sec. 4.7
yi = Y2 (la):
J2 = - a2y1 - a1y2 + u (lb)

and the objective


f o(c11y,2 + 2c12y1y2 + c22y22) dt (2)

where we have set ca, to zero, but we now seek the optimal function u
subject to the restriction
u* < u < u* (3)
The hamiltonian may then be written
H = i2(C11y12 + 2c12y1y2 + c22y22)
+ X1y2 - a2A2y1 - a1X2y2 + X2u (4)
160 OPTIMIZATION BY VARIATIONAL METHODS

and the multiplier equations


aH
XI = - - = a2X2 - C11y1 - c12ys (5a)
ayi
aH
s = - - ay,
= -XI + aids - c12y1 - Cssy: (5b)

Because of the linearity of H in u the minimum principle then implies


the optimal solution in part as
U = {u' Xs < 0
u. a: >0 (6)

It will now become clear why we; have always been careful to
examine each 'system for the possibility of an intermediate solution.
Setting all/au to zero, we obtain
aHasO (7)
au
which is the situation not covered by Eq. (6). But if X2 is zero for a
finite time interval, so must be its derivative and Eq. (5b) becomes
0=--aI -c1sy>-C::ys (8)

This in turn implies that


X1 = -cisl%i - Cssys = ascssys + (ales: - c>s)ys - C22u (9)

which, when compared to Eq. (5a), leads to a solution for u

u = (as+Ciilyi+a1ys
\\ Call
(10)

That is, if intermediate control is possible, the form is a linear feedback


controller.
We must still satisfy the requirement that the hamiltonian be zero
along the optimal path. For the intermediate control defined by Eqs.
(7), (8), and (10) the hamiltonian, Eq. (4), becomes
H (-c&syl - cssys)ys + i2(ciiyis + 2c12y1y2 + cssyss)
_ ,(ciiy>s - cssyss) = 0 (11)
so that the intermediate solution represented by Eq. (10) is possible only
when the system lies along one of two straight lines in the ylys state space
Cll yl + css ys = 0 (12a)
Cll yl - Ca ys = 0 (12b)

We can further eliminate the second of these possibilities by substituting


CONTINUOUS SYSTEMS: II 161

Eq. (10) into Eq. (11), giving


y1 Y2 (13q)
y2 yl (13b)
C22

and, differentiating Eqs. (12),


CC11
92 ± 92 (14)
22

where the positive sign corresponds to Eq. (12b). Combining Eqs. (13b)
and (14) and solving, we obtain

C ' (t - r)
yl = yl(r) exp ± Cft (15)

with the positive exponential corresponding to" Eq. (12b). Thus the con-
troller is unstable and clearly not a candidate for the optimum along line
(12b), since the objective will grow without bound, while the solution is
stable along the line (12a) and, hence, perhaps optimal. At all other
points in the y,y2 phase plane except this single line, however, the opti-
mal solution must be at one of the extremes, u* or u*. Indeed, only the
finite segment of the line (12a) satisfying

< (a2 + ajy2 5 u* (16)


u* + e22 y,
may be optimal.
Next we must establish that when the intermediate, or singular,
control defined by Eq. (10) is possible, it is in fact optimal. That is,
when 1\2 vanishes at a point yi, y2 on the line segment described by Eqs.
(12a) and (16), we do not shift to the other extreme of control but rather
operate in such a way as to keep X2 identically zero. Because a bang-
bang trajectory would again intersect the line segment at some other
point y7, y2 , it suffices to show that the singular path between these
two points (along the straight line) leads to a smaller value of the objec-
tive than any other path. That is, if we let ai(l) and u2(t) denote the
values of y, and y2 along the singular path, for all paths between the
two points
v. - vl"
Iy,v:--v.' v:"
(e,)yi2 + 2c12y1y2 + c22y22) dt
v:- v!'
VI-VI"

ZYYe
2c,20192 4- czsv22) dt > 0 (17)
vi

This relation is most easily verified by a method due to Wouham and


162 OPTIMIZATION BY VARIATIONAL METHODS

Johnson. We note that from l:q. (la)


1d (yl2)
y1J2 = l Wt (18)

The integrand in Eq. (2) may then be written


C11y12 + 2cl2y1y2 + C22y22 = ( C11 yl + C22 Y2)2

+ (C12 - V Cucz2) dt (y12) (19a)

and, along the singular path, from Eq. (12a),


0110,2 + 2c,2Q1Q2 + 022022 = (012 - 'V C11C22) d (012) (19b)

Integrating Eqs. (19a) and (19b) and subtracting, we make use of the
fact that of and yl are identical at both end points (we are comparing
the value of the objective along different paths between the same end
points) to establish Eq. (17):
y,-y," y1-yt11
y, - y," 2 y, ' y:"
(Cllyl + 2c12y,y2 + C22y 2) dt - 2 2
(Cllul + 201401472
Jy:y, _y,'
= yt
J
y,
Y,=yi
y,'
y, - y,"
y, - yi'
+ 022022) dl = f ( CSI yl + C22 Y2)2 dt > 0 (20)
y, - y,'
y=- y1
This proves that whenever it is possible to use the linear feedback con-
trol defined by Eq. (10)-i.e., whenever Eqs. (12a) and (16) are satisfied
-it is optimal to do so.
The remainder of the optimal policy can now be easily constructed
by the backward tracing procedure. At any point on the singular line
values are known for yl, y2, Xi, and A2
yl = yi (21a)
Y2 = Y2 (21b)
X1 = - cl2yi - c22y2 (21c)
_A2 = 0 (21d)
Equations (1) and (5) can then be integrated in reverse time for u = u*,
checking at all times to be sure that X2 < 0. When A2 again returns to
zero, we have reached a point on the y_+ switching curve and u is set to
u* and the process continued. In a similar way, setting u initially to u*
will generate a point on the y+_ switching curve when X2 returns to zero.
By carrying out this procedure for all points on the singular line the
entire switching curve can be generated, just as for the time-optimal
problems considered in previous sections.
A final word concerning practical implementation is perhaps in
CONTINUOUS SYSTEMS: II 1u

order. In any real system it will be impossible to maintain the process


exactly along the singular line described by Eq. (12a), and careful exami-
nation of Eqs. (13) indicates that they are unstable if the system ever
deviates from the straight line. Thus the control law of Eq. (10) must be
replaced by one which is completely equivalent when the system is in fact
on the singular line but which will be stable if slight excursions do occur.
This can be accomplished, for example, by subtracting 3[(cl,/c22)y, +
(ci1/c2s)y21, which vanishes along line (12a), from the right-hand side
of Eq. (10), giving
(22)
u = Cat - 2 c111 yi + (ai - 3 LC22 y2

with the resulting system equations (1), the stable process


y, = Y2 (23a)

y' = -2 C22 y1 - 3 J2c2 y2 (23b)

5.10 NONLINEAR MINIMUM-INTEGRAL-SQUARE-ERROR CONTROL


A number of the ideas developed in the previous' section can be applied
with equivalent ease to nonlinear systems. In order to demonstrate this
fact we shall return again to the now-familiar example of the stirred-tank
chemical reactor with control by coolant flow rate, having state equations

Z1 = (1 - Z1) - kZ1 (1a)

V
Z= = (Z, - Z2) - VCP(1 + Kq,) (Z2 - Z.) + kZ, (1b)
V
where k = k(Z2). Since q/(1 + Kq,) is a monotonic function of qc, we
may simply define a new decision variable w as the coefficient of Z2 - Z.
in Eq. (lb) and write
Z2 = (Z, - Z2) - w(Z2 - Z.) + kZ1 (lc)
y
_

with constraints derivable from Eq. (3) of Sec. 5.6 as


w* < w < w* (2)
We shall attempt to maintain the concentration Z, and temperature Z:
near the respective steady-state values Z15, Z23 by choosing w to mini-
mize the integral

[(Z1 - Z13)2 + c2(k - k8)2] dt (3)


2 Jo
114 OPTIMIZATION BY VARIATIONAL METHODS

where ks = k(Z2S). The choice of k, rather than Z2, in Eq. (3) is one of
convenience, though it can be rationalized by the observation that it is
deviations in reaction rate, not temperature, which adversely affect the
product composition. This is, of course, the objective chosen in Sec. 4.8.
The hamiltonian for this problem is `

H = 3- (Z1 - Z15)2 + ti2c2(k - k5) 2 + X1 V (1 - Z1) - X1kZ1

+ X2 (Zf - Z2) - X2w(Z2 - Zc) + X2kZ1 (4)


V
with multiplier equations

1 = -(Z1 - Zls) + V X1 + kX1 - kX2 (5a)

X 2 = - 0 (k - ks) aZ2 +
ak
X1
ak q
Z 1 aZ2 + X2 V + X2w - X2Z1
ak
aZt
(5b)

and, from the linearity of H in w, the optimum has the form


tu* X2(Z2 -- ZI) > 0 (6)
u' = w# X2(Z2 - Z,) < 0
The possibility of a singular solution, for which X2 vanishes over a
finite interval, must still be examined. In that case the derivative must
also be zero, and since ak/aZ2 9d 0, Eq. (5b) reduces to
X1Z1 - c2(k - ks) = 0 (7a)
and Eq. (5a) to
\\
x1= -(Z1-Z1s)+(V+k)X1. (7b)

These two equations, together with Eq. (1a),-are identical to Eqs. (1),
(7), and (8) of Sec. 4.8, and the approximate solution for the singular line
is given by Eq. (16) and the corresponding singular control by Eq.. (17)
of that section. Thus, the singular solution is equivalent. to choosing
the temperature Z2 which minimizes the objective, Eq. (3), provided that
the resulting flow rate is consistant with Eq. (2).
In the previous section we utilized a mathematical argument to
prove the optimality of the singular solution. Here, we simply rely on
physical reasoning. Clearly, if we are in fact free to specify the tem-
perature directly, we shall choose to do so, since this is our real physical
goal, and whenever the choice of flow rate coincides with the optimal
choice of temperature, that choice must also lead 'to the optimal flow
rate. Only when the optimal temperature is not accessible by choice of
CONTINUOUS SYSTEMS: it 16S

flow rate need we consider the second in this hierarchy of optimization


problems, the optimal choice of flow rate itself. Thus, for this reactor
problem, the singular solution, when possible, must be optimal.
the construction of the switching curves can now be carried out by
the backward tracing technique, since values of Z2, Z2, A1, and X2 are all
known along the singular line. We should observe that a closed-form
solution for the nonlinear singular control w (but not for the singular line)
is in fact available by eliminating X1 between Eqs. (7a) and (7b).

5.11 OPTIMAL COOLING RATE IN BATCH AND TUBULAR REACTORS


In Sec. 3.5 we briefly considered the optimal specification of the temper-
ature program in a batch or plug-flow pipeline chemical reactor in order
to minimize the time of achieving a given conversion, for a single reaction
or, equivalently, of obtaining the maximum conversion in a given time.
In the batch reactor it is somewhat more realistic to suppose that we can
specify the cooling rate as a function of time, rather than the temper-
ature, and in the tubular reactor also the rate of heat removal is some-
what closer to actual design considerations than the temperature at each
position. We are now in a position to study this more realistic problem
as an example of the methods of this chapter, and we follow a discussion
of Siebenthal and Aris in doing so.
Denoting the reactant concentration as a and temperature as T,
the equations describing the state of the reactor are
a -r(a,T) (1a)
T = Jr(a,T) - u (1b)
where J is a constant (heat of reaction divided by the product of density
and specific heat) and u, the design or control variable, is the heat removal
rate divided by density and specific heat, with bounds
u* = 0 < u < u* (2)
In a batch system u will be a inonotonic function of coolant flow rate.
In a tubular reactor the independent variable is residence time, the ratio
of axial position to linear velocity. The reaction rate r(a,T) is given by
Eq. (13) of Sec. 3.5
/ / bo +n21 ao - n2
r(a,T) =plan exp (- E i\ P2 nl
a)n,

exp - T2 t (3)

where ao and bo are the initial values of reactant and product, respec-
tively. When E2 > Ei, there is a maximum value of r with respect to
1K OPTIMIZATION BY VARIATIONAL METHODS

T for fixed a satisfying ar/aT = 0, while r will go to zero for fixed T at


some finite value of a (equilibrium). These are shown in rig. 5.15 as
the lines r = 0 and x = rm.z.
We shall first consider the objective of minimizing a (that is, maxi-
mizing conversion) while placing some finite cost on operating time (or
reactor length), so that the objective is
& - a(e) + pe (4a)
or, equivalently, using Eq. (la),
s = Jo [p - r(a,T)] dt (4b)

The hamiltonian is
H = p - r(a,T) - Xir(a,T) + X,Jr(a,T) - a2u
= p - r(a,T)(1 + ai - JX2).- X2U (5)
with multiplier equations

as = a (1 + al:- At)
a,(e) = 0 (6a)

aH &
aT
= aT- (1 + X1 - JX2) x2(e) = 0 (6b)

Because of the linearity of H in u the optimal decision function is

11 =
u X2 > 0
(7)
0 1\ <0
with an intermediate solution possible only if X2 vanishes over a finite
interval.

O
Fig. 5.15 Reaction paths and switching
o° curve for optimal cooling in a batch or
tubular reactor. [After C. D. Siebenthal
and R. Aris, Chem. Eng. Sci., 19:747
(1964). Copyright 1964 by Pergamon
NM FG Press. Reprinted by permission of the
Temperature T copyright owner.]
CONTINUOUS SYSTEMS: II 167

Because 0 is not specified, the optimal value of the hamiltonian is


zero. At t = 0, using the boundary conditions for X1 and X2, Eq. (5)
reduces to the stopping condition
r[a(0),T(0)] = p (8)

That is, as common sense dictates, the process should be terminated


when the reaction rate falls to the value p and the incremental return in
the objective changes algebraic sign. The stopping curve r = p is shown
in Fig. 5.15.
Let us now consider the possibility that the final operation is inter-
mediate. In that case the vanishing of X2 implies the vanishing of its
derivative, and therefore of ar/aT. That is, if the final operation is
singular, it must be along the curve ?*,,,ax, terminating at the highest point
of the curve r = p, shown as point A. This is. precisely the policy which
we found to give the minimum time and maximum conversion operation
in Sec. 3.5, so that we may anticipate that singular operation is optimal
when possible for this combined problem as well, and we shall show that
this is indeed the case.
We first suppose that an optimal path terminates on the line r = p
to the right of rm,x. Here, ar/aT < 0, and it follows from Eq. (6b) that
K2(9) = ar/aT < 0, or 1\2 is decreasing to its final value of zero. Thus,
from Eq. (7), the final policy must be u = u*, full cooling. There is,
however, a point on the curve r = p, say C, at which a trajectory for
u = u* intersects the line r = p from the right; i.e., the slope of the line
-da/dT along the trajectory is greater than the slope of the line r = p.
From Eqs. (1),
da )u_u* _ p da ar/aT
dT Jp - u* > ( dT)f_P ar/aa (9)

r-o
and, after slight manipulation, the point C is defined by
ar/aa
u* > p J - (10)
ar/aT>
Since the integrand in Eq. (4b) is positive whenever the system is to the
right of r = p, an optimal policy cannot terminate at r = p to the right
of point C, for smaller values of S could be obtained by stopping earlier.
Thus, an optimal policy can terminate to the right of rmax only under the
policy u = u* on the segment AC of r = p.
In a similar way we find that the system can terminate to the left
of rmax only under adiabatic conditions (u = 0) on the pegment BA of
r = p, where B is defined by J > (ar/aa)/(a)-/aT). Thus, whenever the
system lies to the right of )'max and to the left of the full-cooling tra-
168 OPTIMIZATION BY VARIATIONAL METHODS

jectory AF, the optimal policy is full cooling until the intersection with
rmax, followed by the intermediate value of u necessary to remain on rmax,
for the system can never leave the region bounded by DAF, and this is
the policy which will minimize the term p6 in the objective, a(6) being
fixed. Similarly, to the left of rmax and below the adiabatic line DA the
policy is adiabatic, followed by the singular operation. Within the region
bounded by FACG the policy is only full cooling, and in EGAD only adi-
abatic, but even here some trajectories starting in regions r < p might
give positive values to the objective, indicating a loss, so that some initial
states will be completely excluded, as will all initial states outside the
envelope EBACG.
By combining Eq. (1) and Eq. (14) of Sec. 3.5 it easily follows that
the optimal policy on the singular line rmax is

u = r I J + ni[nibo + n2ao - n2a(1 - n2/n,)l T 2> 0


a(nlbo + n2ao - n2a)
There will be some point on rmax, say K, at which this value exceeds u*,
and a switching curve must be constructed by the usual backward tracing
method. This switching curve KM will be to the right of the full-cooling
line KN, for we may readily establish that KN cannot form part of an
optimal path. By integrating Eq. (6a) along rmax from t = B we find
that X (i) + 1 > 0 at K, in which case, for an approach from the left of
rmax, where ar/aT > 0, X2(0 > 0. Thus, X2 < 0 just prior to reaching
K, and only adiabatic approach is possible. Furthermore, along an adi-
abatic path both ar/aa and X2 are negative, so that X1 + 1 is always
greater than its value at rmax, which is positive. Hence X2 is always posi-
tive, so that X2 can never reach zero on an adiabatic path, and an adi-
abatic path cannot be preceded by full cooling. Thus, for starting points
to the right of the adiabatic line KL the optimal policy is adiabatic to
the switching line KM, followed by full cooling to rmax, then the policy
defined by Eq. (11) to the point A.
If we now consider the problem of achieving the maximum con-
version in a given duration (p = 0, 0 fixed), clearly the solution is identi-
cal, for the hamiltonian now has some nonzero constant value, say -A,
and we may write

H+, =A-r(a,T)(1+X1-JX2)-X2u=0 (12)

with the multiplier equations and Eq. (7) unchanged. Furthermore,


since a(t) must be a monotone decreasing function, the policy which
minimizes a(6) for fixed 6 must, as noted in Sec. 3.5, also be the policy
which reaches a specified value of a in minimum time (or reactor length).
CONTINUOUS SYSTEMS: II 169

5.12 SOME CONCLUDING COMMENTS


Before concluding this chapter of applications some comments are in
order. The bulk of the examples have been formulated as problems in
optimal control, largely because such problems admit some discussion
without requiring extensive computation. From this point on we shall
say less about the construction of feedback control systems and concen-
trate on computational methods leading to optimal open-loop functions,
which are more suitable to process design considerations, although several
chapters will contain some significant exceptions.
The reader will have observed that the several optimal-control prob-
lems which we have formulated for the second-order system, exemplified
by the stirred-tank chemical reactor, have each led to markedly different
feedback policies, ranging from relay to linear feedback. The physical
objective of each of the controls, however, is the same, the rapid elimi-
nation of disturbances, and there will often be situations in which it is
not clear that one objective is more meaningful than another, although
one "optimal" control may be far easier to implement than another.
This arbitrariness in the choice of objective for many process applications
will motivate some of our later considerations.

BIBLIOGRAPHICAL NOTES
Section 5.2: The derivation follows
J. M. Douglas and M. M. Denn: Ind. Eng. Chem., 57 (11):18 (1965)
The results are a special case of more general ones derived in Chap. 6, where a complete
list of references will be included. A fundamental source is
L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko:
"Mathematical Theory of Optimal Processes," John Wiley & Sons, Inc., New
York, 1962

Sections 6.3 to 5.5: The time-optimal control problem for linear systems is dealt with in
the book by Pontryagin and coworkers and in great detail in
M. Athans and P. Falb: "Optimal Control," McGraw-Hill Book Company, New
York, 1966 it
R. Oldenburger: "Optimal Control," Holt, Rinehart & Winston, New York, 1966
Historically, the bang-bang result was obtained by Bushaw and further developed by
Bellman and coworkers and LaSalle:
R. Bellman, I. Glicksberg, and O. Gross: Quart. Appl. Math., 14:11 (1956)
D. W. Bushaw : "Differential Equations with a Discontinuous Forcing Term,"
Stevens Inst. Tech. Expt. Towing Tank Rept. 469, Hoboken, N.J., 1953; Ph.D.
thesis, Princeton University, Princeton, N.J., 1952; also in S. Lefschetz (ed).,
"Contributions to the Theory of Nonlinear Oscillations," vol. 4, Princeton
University Press, Princeton, N.J., 1958
170 OPTIMIZATION BY VARIATIONAL METHODS

J. P. LaSalle: Proc. Natl. Acad. Sci. U.S., 45:573 (1959); reprinted in R. Bellman
and R. Kalaba (eds.), "Mathematical Trends in Control Theory," Dover Publica-
tions, Inc., New York, 1964
The calculations shown here for the reactor problem are from the paper by Douglas and
Denn. Further considerations of the problem of nonuniqueness are found in
1. Coward and R. Jackson: Chem. Eng. Sci., 20:911 (1965)
A detailed discussion of all aspects of relay control is the subject of
I. Flugge-Lotz: "Discontinuous and Optimal Control," McGraw-Hill Book Company,
New York, 1968

Section 5.6: The section is based on


J. M. Douglas: Chem. Eng. Sci., 21:519 (1966)
Other nonlinear time-optimal problems are treated in Athans and Falb and
1. Coward: Chem. Eng. Sci., 22:503 (1966)
E. B. Lee and L. Markus: "Foundations of Optimal Control Theory," John Wiley
& Sons, Inc., New York, 1967
C. I). Siebenthal and R. Aria: Chem. Eng. Sci., 19:729 (1964)
A simulation and experimental implementation of a nonlinear lime-optimal control is
described in
M. A. Javinsky and R. H. Kadlec: Optimal Control of a Continuous Flow Stirred
Tank Chemical Reactor, preprint 9C, 68d Natl. Meeting, A IChE, St. Louis, 1968
Optimal start-up of an autothermic reactor is considered in
R. Jackson: Chem. Eng. Sci., 21:241 (1966)

Section 5.7: See the books by Pontryagin and coworkers and Athans and Falb.

Section 5.8: The book by Athans and Falb contains an extensive discussion of linear and
nonlinear fuel-optimal problems.

Section 5.9: The basic paper is


W. M. Wonham and C. D. Johnson: J. Basic Eng., 86D:107 (1964)
where the problem is treated for any number of dimensions; see also
Z. V. Rekazius and T. C. Hsia: IEEE Trans. Autom. Contr., AC9:370 (1964)
R. F. Webber and R. W. Bass: Preprints 1987 Joint Autom. Contr. Confer., Phila-
delphia, p. 465
In Chap. 6 we establish some further necessary conditions for the optimality of singular
solutions and list pertinent references. One useful source is
C. D. Johnson: in C. T. Leondes (ed.), "Advances in Control Systems," vol. 2,
Academic Press, Inc., New York, 1965

Section 5.10: The use of a hierarchy of optimization problems to deduce an optimal


policy is nicely demonstrated in
CONTINUOUS SYSTEMS: II III

N. Blakemore and R. Aria: Chem. Eng. Sci., 17:591 (1962)

Section 5.11: This section follows


C. D. Siebenthal and R. Aria: Chem. Eng. Sci., 19:747 (1964)

Section 5.1P The literature contains many examples of further applications of the prin-
ciples developed here, particularly such periodicals as A IA A Journal, Automatica,
Automation and Remote Control, Chemical Engineering Science, Industrial and
Engineering Chemistry Fundamentals Quarterly, IEEE Transactions on Auto-
matic Control, International Journal of Control, Journal of Basic Engineering
(Trans. ASME, Ser. D), and Journal of Optimization Theory and Applications.
The annual reviews of applied mathematics, control, and reactor analysis in Indus-
trial and Engineering Chemistry (monthly) list engineering applications. Some
other recent reviews are
M. Athans: IEEE Trans. Autom. Contr., AC11:580 (1966)
A. T. Fuller: J. Electron. Contr., 13:589 (1962); 15:513 (1963)
B. Paiewonsky : A IAA J., 3:1985 (1965)
Applications outside the area of optimal design and control are growing, particularly in
economics. A bibliography on recent applications of variational methods to economic
and business systems, management science, and operations research is
G. S. Tracz: Operations Res., 16:174 (1968)

PROBLEMS
5.1. Extend the minimum principle of Sec. 5.2 to nth-order systems
xi - fi(x1,x2, . . . xwrul,uir . . . 1 m 1, 2, . . . ,n
Show that the hamiltonian is defined by

H-5+ 1xif,

aH as " 8fi
axi axi 7
t ' axi

5.2. Solve the optimal-temperature-profile problem of Sec. 4.12 for the case of an
upper bound on the temperature. By considering the ratio of multipliers show how
the problem can be solved by a one-dimensional search over initial values of the ratio
of multipliers. For consecutive first-order reactions
F(x1) - x, G(x2) _= x,
obtain an algebraic expression for the time of switching from the upper bound to an
intermediate temperature in terms of the initial ratio of the multipliers. Discuss the
computational effect of including a lower bound on temperature as well.
5.3. Solve the optimal-pressure-profile problem of Sec. 4.13 for the case of an upper
bound on the pressure. By considering the ratio of multipliers show how the problem
172 OPTIMIZATION BY VARIATIONAL METHODS

can be solved by a one-dimensional search over initial values of the ratio of multipliers.
Discuss the computational effect of including a lower bound on the pressure as well.
5.4. Consider the system
i, - a,tx, + a12x2 + b,u
x2 = a21x, + a2222 + b2u
Iul < 1
Establish the number of switches possible in the minimum time control to the origin
for all values of the parameters and construct the switching curve for the case of
complex roots to the characteristic equation.
S.S. A system is termed controllable to the origin (xt - 0, z: - 0) if, for each initial
state, there exists a piecewise-continuous control u(t) such that the origin can be
attained in some finite time B. We have assumed throughout that the systems with
which we are dealing are controllable. For the linear system
21 - a12x2 + b,u
x2 = a2,21 + a22x2 + b2u

show that a necessary and sufficient condition for controllability is


b,(a21b, + a22b2) - b2(a,,b, + a12b2) # 0
This is a special case of results of Kalman on controllability and the related concept
of observability. Hint: Solve for x, and x2 in terms of a convolution integral involving
u(t) by Laplace transform, variation of parameters, or any other convenient method.
"Only if" is most easily demonstrated by counterexample, "if" by construction of a
function which satisfies the requirements.
5.6. For the nonlinear system
x + g(x) = u Jul < 1,
where g is a differentiable function satisfying tg(.z) > 0, develop the feedback control
for minimum time to the origin. Show that the optimum is bang-bang and no more
than one switch is possible and indicate the procedure for constructing the switching
curve. (The problem is due to Lee and Markus.)
5.7. A simplified traffic-control problem is as follows.
Let q, and q2 represent arrival rates at a traffic light in each of two directions
during a rush period of length 9, s, and 82 the discharge rates (st > s2), L the time for
acceleration and clearing, and c the length of a cycle. If x, and x2 represent the lengths
of queues and u the service rate to direction 1, the queues satisfy the equations
xt=gt -- u
L\ s,
2t-q2-,2 1
c
+-u
Si

with initial and final conditions x,(0) - x2(0) - x,(B) - 22(9) = 0.


The service rate is to be chosen within bounds
.u,<u<u
to minimize the total holdup
e
E ,. fo (xt + 22).dt
CONTINUOUS SYSTEMS: II 173

It is helpful to approximate arrival rates as constant during the latter phases of the
rush period in obtaining a completely analytical solution. (The problem is due to
Gazis.)
5.8. The chemical reaction X Y -+ Z is to be carried out isothermally in a catalytic
reactor with a blend of two catalysts, one specific to each of the reactions. The
describing equations are
i = u(ksy - kiz)
y = u(kix - k2y) - (1 - u)kay
x+y+z - const
Here u is the fraction of catalyst specific to the reaction between X and Y and is
bounded between zero and 1. Initial conditions are 1, 0, and 0 for x, y, and z, respec-
tively. The catalyst blend is to be specified along the reactor, 0 < 1 < 0, to maximize
conversion to Z,
max 61 - z(0) - 1 - x(8) - y(8)
(a) When k, = 0, show that the optimum is a section with u = 1 followed by
the remainder of the reactor with u = 0 and obtain an equation for the switch point.
(b)r When k2 $ 0, show that the reactor consists of at most three compartments,
the two described above, possibly separated by a section of constant intermediate
blend. Obtain the value of the intermediate blend and show that the time interval
(t1,t,) for intermediate operation is defined by

t1 = log
k1 + k2 \1 + k' + ksk,
k1

v k:

(The problem is due to Gunn, Thomas and Wood, and Jackson.)


5.9. For the physiological system described in Prob. 4.10 find the optimal control
when c - 0 with the presumption that u is bounded. Examine possible singular
solutions and describe the process of construction of any switching curves.
5.10. Using the device of Sees. 4.4 and 4.9 for treating explicit time dependence,
extend the minimum principle of Sec. 5.2 to include problems of the form
i1 = f1(x1,x2,u,t)
iz - f2(x1,x2,u,t)
re
min t - J 0 a(xa,x2,u,t) dt

5.11. The system


i+ai+bx =u
is to be controlled to minimize the ITES (integral of time times error squared) criterion,
1 re

Develop the optimal feedback" control. Consider whether intermediate control is


ever possible and show bow switching curves can be constructed.
174 OPTIMIZATION BY VARIATIONAL METHODS

5.12. The chemical reactor described by Eqs. (7) of Sec. 4.6 is to be started up from
initial conditions Z,Q, Z20 and controlled to steady state Z,,, Zu by manipulation of
coolant flow rate q, and feed temperature Z, with bounds
0 < q. < q! Z,. < Z, < Z1
If heating costs are proportional to feed temperature, the loss in profit during start-up
is proportional to
&=Io,[(Z1.-Z.)+c(Z,-Z,.)1dd

where 9 is unspecified. Discuss the optimal start-up procedure. (A more complex


version of this problem has been studied by Jackson.)
6
The Minimum Principle

6.1 INTRODUCTION
In the preceding chapters we have obtained and applied necessary con-
ditions for optimality in a wide variety of optimal-design and optimal-
control problems. Greater generality is required, and that is one aim
of this chapter. A more serious deficiency of the preceding work,
however, is that while the results are certainly correct, the motivation
for several important operations in the derivations is not at all obvious.
Thus we have little direction in attacking new problems by these methods,
and, indeed, we sense that our ability to devise efficient computational
algorithms is dependent upon our understanding of the logical steps in
a proof of necessary conditions.
There is, in fact, an underlying logic which can be applied to both
the theoretical and computational aspects of variational problems. The
logic is firmly grounded in the elementary theory of linear ordinary
differential equations, and the first considerations in this chapter will
of necessity be directed to a discussion of this theory. The resulting
175
176 OPTIMIZATION BY VARIATIONAL METHODS

relations, which we shall apply to modern variational problems, were


used by Bliss in his analysis of problems of exterior ballistics in 1919.

6.2 INTEGRATING FACTORS AND GREEN'S FUNCTIONS


The principle we are seeking is best introduced by recalling the method
of integrating the linear first-order equation. Consider
± = a(t)x + b(t) (1)
In order to integrate this equation we multiply both sides by an inte-
grating factor r(t), an arbitrary differentiable function. We are inten-
tionally introducing an extra degree of freedom into the problem. Thus,
r(t)x(t) = a(t)r(t)x(t) + r(t)b(t) (2)

or, integrating from t = 0 to any time t,


fo' fo'
r(r)±(r) dr = ,,o(r)r(r)x(r) dr + r(r)b(-r) dr (3)

The term on the left can be integrated by parts to give

r(t)x(t) - r(0)x(o) - fo i'(r)x(r) dr


fot
= fat a(r)r(r)x(r) dr + r(r)b(r) dr (4)

If we now remove most of the arbitrariness from r by defining it to


be the solution of
i' _ - ar (5)
or
fot
r(t) = r(0) exp [ - a(>;) d¢] (6)

then Eq. (4) simplifies to


r(t)x(t) exp [ - fo a(s) dE] - r(00)x(o)
r
-
[ .lor a() dt] b(r) dr (7)
= r(0) f o,
Assuming that r(0) is different from zero, we can solve explicitly for
x(t)
Jt
x(t) = x(O) exp [ fo a(s) d ] + fo exp [ a(t) dA] b(r) dr (8)

The integrating factor r(t) is generally known as a weighting function


or, as we prefer, Green's function.
We can generalize to a system of n linear equations in a straight-
177
THE MINIMUM PRINCIPLE

forward manner. We consider the system


x, = a,l(t)xl + a12(t)x2 + + aln(t)x, + b,(t)
x2 = a21(t)x, + a22(t)x2 + + a2n(t)xn + b2(t) (9a)

xn = anl(t)xl + an2(t)x2 + + ann(t)xe + bn(t)


or, equivalently,

xi = I a)(t)x! + bi(t) (9b)

It is convenient to introduce n2 functions rki(t), k, i = 1, 2, . , n,


to multiply Eq. (9) by rki and sum over all i. Thus
n n

I r = I (`)L
rkixi T
l kia. jx, +
(`n)

rkibi (10)
i-I i-Ii-i i-i
Integrating from t = 0 to any time and integrating the left-hand side
by parts, we obtain
inn

rki(T)xi(T) dr 1, ki(t)xi(t) - rki(0)xi(0)


Jog
i-1 i=1 i-1
n n n
r
'ki(r)xi(r) dr = Jo rki(T)aii(T)xi(r) dr
10

rki(r)bi(r) dr (11)
i -i
As for the first-order system, we remove most of the arbitrariness
from r by partially defining Fk,(t) as a solution of
n

Pki(t) _ - I rkf(t)a11 (12)


i-1
equation (12) is called the adjoint of Eq. (9b). Thus,
n n
}rki(t)xi(t) -
`Rn)
)
(\
rki(O)xi(0) = f6 / rki(r)bi(r) dr (13)
i-1
4
Equation (13) is known as Green's identity. It is the, one-dimensional
equivalent of the familiar relation between volume and surface integrals.
The matrix of Green's functions r is known as Green's matrix, the funda-
mental matrix, or sometimes the adjoint variables. Unlike the first-order
case, it is necessary to specify the value of I'ik at some time. It is con-
178 OPTIMIZATION BY VARIATIONA! METHODS

venient to specify r at the time of interest 8 as


1 i= k
rfk(e) = Sik =
0 i k
(14)

so that Eq. (13) reduces to


n n

x:(8) _ r;,(o)x,(0) + fo ri,(t)b,(t) dt (15)

Indeed, for later use it is convenient to note explicitly the moment at


which condition (14) is specified, and we do this by specifying two
arguments, r;k(8,t), where 8 is the time of interest and t any other time.
Thus,
n n
xi(8) = ri;(e,0)x2(o) + fo ri;(e,t)b;(t) dt (16)

where r,,(e,t) satisfies Eq. (12) with respect to t and the condition
rik(e,e) = bik (17)
We shall sometimes be interested in a linear combination of the
components x; of x at time 8. We define a vector y(t) with components
Ti(t) by

NO) = Jy;(e)r;i(9,t) (18)


,-1
where y;(0) are specified numbers; then by multiplying Eqs. (12) and
(16) by appropriate components of y(8) and summing we obtain
n n n
yi(9)xi(9) _ yi(0)xi(0) + fo yi(t)bi(t) dt (19)
i-1 i -1 i-1

and

Yi = - 4 7,(t)a,i(t) (20)
i-1
As a simple illustrative example consider the linear second-order
system with constant coefficients
x + a1x + a2x = b(t) (21)
or, equivalently,
z1 = x2 (22a)
x2 = -a2x1 _ a1x2 + b2(t) (22b)
THE MINIMUM PRINCIPLE 179

The adjoint system, defined by Eq. (12), with boundary conditions from
Eq. (14), is
v11 = a2r12 T11 = 1 at t = 0 (23a)
F12 _ - r11 + a,r12 r12=0 att=0 (23b)
F21 = a2r22 r21 = 0 at t = 0 (23c)
V22 _ - r21 + a1r22 r22 = 1 at t = 0 (23d)
The solution is easily found to be
el4« 'u-e)
(ai - a,2 --4a2)
2 a12 - 4a2
exp[% a,2-4a2(t- 8))- (a1+ a12-4a2)
exp [-Y2 1/x12 - 4a2 (t - 0)) } (24a)
eSSa'(t-e)
r12(0,t) = - {exp [32 a12 - 4a2 (t - B))
a,2 - 4a2
- exp [-3z -\ a12 - 4a2 (t - 0))} (24b)
14«,cr-e)
a12 - 4a2 (t - B))
r21 (8,t) = ate { exp [Y2
1/a ,2 - 4az
-exp[-Y2 a12-4a2(t-0))} (24c)

r22(8,t) _ - { (a1 + Va12 - 4a2)


2 a,2 - 4a2
exp [32 -V a12 - 4a2 (t - 8)] - (a, - a,2 - 4a2)
exp [-% v a12 - 4a2 (t - e))) (24d)

Then, for any time 0,


x(8) = x1(9) = r11(B,O)x(O) + r12(e,O)t(O)
+ foe r12(e,t)b2(t) dt (25a)
±(e) = X2(0) = r21(8,O)x(O) + r22(e,0)x(O)
+ fo r22(e,t)b2(t) dt (25b)

or, substituting just into the equation for x(t),


e-«1e12
{[cri a12 - 4a2
x(9) x(O) -
- 1/a12 - 4a2 2

exp(-TL Val2-4a20) - f a1+ 212-4a2-z(O)]

exp (32
l
a12 - 4az 8) }
- fe eSS«,ce-e)

o ate - 4a2
(1111-

{ exp [3i 1/a12 - 4a2 8) )


- exp [-Y2 a,2 - 4a2 (t - 0))}b(t) dt (2&)
150 OPTIMIZATION BY VARIATION* METHODS

6.3 FIRST-ORDER VARIATIONAL EQUATIONS

We now consider a physical system described by the (usually nonlinear)


equations

x; = f;(x,u) 0<t<6 (1)


i = 1, 2, , n

If we specify a set of initial conditions to and specify the components


of u(t) to be particular functions uk(t), k = 1, 2, . . . , R, Eqs. (1) can
be integrated to obtain functions which we shall denote by x;(t), i = 1,
2, . . . , n. That is, we define t(t) as the vector whose components
are solutions of the equations
= f,(t,u) 0<t<a
i = 1, 2, . . . ,n (2)
t(0) = t0
Let us now suppose that we wish to specify new functions uk(t)
and initial values x;(0) as
uk(t) = uk(t) + auk(t) k = 1, 2, . . . ,R (3a)
x;(0) = z;0 + bx;0 i = 1, 2, . . . ,n (3b)
where, for some predetermined e > 0,
k = 1, 2, . . . , R
Wk(t) < E
0<t<0 (4d)
16x;ol < e i = 1, 2, . . . ,n (4b)
If we now solve Eqs. (1) and write the solution as
x,(t) = xi(t) + Sx1(t) (5)
it follows (see Appendix 6.1) that
1 Sx;(t), < Ke (6)
where K depends on 0 but is independent of the variations Su and Sxo.
Thus,

i; = ,+O± =ft(t+fix, ft +Su) 0<t<0


i=1,2, . . . n
(7)
X(0) = to + Sxo
and, subtracting Eq. (2) from (7),

0±, = f,(t + &x, 6 + Su) - f;(t,n) 0<t<0


i=1,2,...,n (8)

If f; has piecewise continuous first and bounded second derivatives


axi=
j
THE MINIMUM PRINCIPLE

with respect to components of x and u, then everywhere except at


isolated discontinuities of u we may expand the right-hand side of
Eq. (8) in a Taylor series to write

-l
af`axj+
8x;

'Yjaxi
k l
fauk+o(6)
auk

where the partial derivatives are evaluated for It and u and,lence,


known functions of t. Comparing the linear equation (9) with Eq. (9`
of the preceding section, we identify the terms 8fi/8xj with aij and intro-
duce Green's vector y from Eq. (20) of Sec. 6.2
n
dlj 0<t<8
i= 1,2,
Green's identity, Eq. (19) of that section, is then
. . . ,n
.0<lEe,
i = 1, 2, .

r
`.
n
181

(9

(10)

-Yi(8) axi(o) y,(O) 6x,0 + JO -Lf OUk dt }- U(E)


+I ry'
i I
(11)
To first order, this is the effect on system output of small changes in
initial conditions and decision functions.

6A THE MINIMIZATION PROBLEM AND FIRST VARIATION OF THE OBJECTIVE


It is convenient to formulate the optimization problem in somewhat
different form from the previous chapters. Instead of seeking to mini-
mize an integr4l we shall suppose that ours concern is with some func-
tion S which depends only upon the state of the process at time 0, x(8).
We assume that the system is described by the differential equations

-ti = Ji(x,u)
0<t<8 (1)
i= 1, 2, . . . n
that the decision variables uk(t) satisfy inequality constraints
U,(u)>0 p = 1, 2, . . . P (2)

that the initial state xo is subject to equality constraints


q.,(xo) = 0 m = 1, 2, . . . ,M (3)

and the final state to equality constraints


g.[x(e)I=0 s= 1, 2, ...,S (4)
122 OPTIMIZATION Bf VARIATIONAL METHODS

The constraint equations (2) to (4) are more general than those con-
sidered previously. We shall see later that the choice of the objective
&[x(e)] also includes the form of previous chapters as a special case.
We shall first assume that 0 is specified. If we specify u(t) and
5u(t) subject to the constraint equation (2) and to and Szo subject to
Eq. (3), we may write Eq. (11) of Sec. 6.3

i-1 i-1
(5)
Furthermore, q(8o +.Szo) and q(2o) must both equal zero, as must
g[!(0) + Sx(0)] and g[2(8)], so that
n

4m(20 + bao) - Qm(g0) = aaxi0 + 0(E) = 0


x o0

m = 1, 2, . .. , M (6)
9.[!(0) + &z(e)l - 9.[2(0)] 8x, axl(e) + o(E) = 0

s=1,2,...,S (7)
The change in & as a result of decision changes is reflected through a
change in x(9)
n

a& = &[:(e) + sa(e)] -.&[t(0)] _ ax:


axl(e) + 0(0% (8)
i-1

We have not yet specified the value of the vector y(9). Let us
write this as

71(e) = ax + .i (9)

From Eq. (8) we may then write


n
&& =
I k'i(e) - ')'d axi(B) + 0(E)
i-1
(10)

and, from Eq. (5),


n n n R
axl(e) + 'Y,(0) a:Zio + JOB I I yj auk Suk dt
i-1 i=1 i-1 k-1
+ 0(t) (11)

We can obtain an expression for a& entirely in terms of the variations in


THE MINIMUM PRINCIPLE 183

decision btt(t) by choosing particular boundary conditions for 7(t)


S
dg,
7c = (12a)
YJ ax
a-1
or
s

49X,
at;
I
a-1
ag,
(12b)

and
M

yi(O) 'hm axm (13)


in -I
Here, it and Y are undetermined multipliers which must be found as part
of the solution. Using Eqs. (6) and (7), Eq. (11) becomes, upon-substi-
tution of the boundary conditions,
x R
a afi
bE = but dt + o(e) (14)
yi auk
Io
i11 k1
The 2n differential equations for the x; and y; require a total of 2n
specified values. Equations (3) and (4) give a total of if + S conditions.
Equation (12b) specifies n - S values and Eq. (13) n - 117, so that the
total is indeed the required number. The special case in which xi(0) or
xio is directly specified is included by setting
g,[x(0)] = x,(0) - x; (0) = 0 (15a)
and
qm(xo) = xmo - X. *O = 0 (15b)

where x; (0) and xw*,a are fixed numbers. In the former case
a&
Y.(0) = ax + V. (16a)

where P. is unspecified, so that y,(0) is unspecified, while in the latter


'Ym(O) _ 77m (161,)

where i is an unspecified number.


When 0 is not fixed, we must allow for the possibility that the
changes bu(t), Sxo will require a change 60 in order to satisfy all the con-
ditions on x(0). Let a refer to the interval associated with u(t), to. Then
bs = 6[x(8 + 60)] - s[!(e)] (17a)
I" OPTIMIZATION BY VARIATIONAL METHODS

or, writing E[x(e + ae)] in terms of


n
as
as = E[x(e)] + fi[x(B),u(B)] se + o(se) - 8120)]
i -1 axi
as
= s(x(B)] - s[a(e)] + fi[2(6),u(e)] ae + o(ae) + 0(E)
i1 ax;

(17b)
and, finally,
n
as = I as
, axi
[axi(s) + fi as] + o(E) (17c)

where we have included o(SO) as o(E).


Similarly,
0
ag. [ax,(e) + fi as] + o(E) = 0 (18)

Defining yi(B) by Eq. (12b), y;(0) by Eq. (13), and substituting Eqs. (5)
and (18) into (17c), we obtain

as = n(B)f be + Jo yi auk auk do + o(E) (19)


i-1 i-lk-1

Finally, because of the extra degree of freedom resulting from the non-
specification of 0 we may set

0 unspecified: y;{9)f;[2(B),n(B)] = 0 (20)


i-1
and ag is again represented by Eq. (14). In other words, the unspecified
stopping time 9 is defined by the additional Eq. (20).

6.5 THE WEAK MINIMUM PRINCIPLE


It is convenient to introduce again the hamiltonian notation
n
H = I y; fi (1)
i-1

As previously, the system and multiplier equations may be written in


the canonical form
aH (2a)
ayi
aH (2b)
axi
THE MINIMUM PRINCIPLE 185

with the boundary conditions Eqs. (3), (4), (12), and (13) of Sec. 6.4
and the additional condition
0 unspecified: H=0 at t = 8 11 (3)
If we now assume that u(t) is in fact the set of allowable functions which
minimize &, then, from Eq. (14) of the preceding section, we may write
e
rr
aS
= Jo 1 a u auk dt +
k-1
0 (4)

The functions uk(t) must be chosen subject to the restriction


U,(u) > 0 p = 1, 2, . , P . . (5)

If, for a .particular value of k, all constraints are at inequality, we may,


choose auk (sufficiently small) as we wish, and it follows (as in Sec 5.2)
that
aH = 0
uk unconstrained - (6)
auk
If some constraint U, is at equality for the optimum, it must be
true that

auk + o(e) > 0 (7)


1L U"
k-1
aul,

In the spirit of Sec. 5.2, we choose all variations but one to be zero and
that one as
a U, aH
Suk (8)
auk auk
From Eq. (7) auk and aUD/auk must have the same algebraic sign.

a
auk `auk
-)j
Substituting into Eq. (4), we obtain
all, k 0
(9a)

or
au°>0 (9b)

Then, multiplying Eq. (8) by all/auk,


\z
a uk
auk auk aH /lk >0 (10 a)

or, equivalently,
aukaH>0
auk - (10b)
.136 OPTIMIZATION BY VARIATIONAL METHODS

Thus, when a constraint is in force, the hamiltonian is a minimum with


respect to decisions affected by the constraint. (In exceptional cases
the hamiltonian may only be stationary at a constraint.) An equivalent
result holds when the hamiltonian is not differentiable for some f1k.
For the special case considered in Sec. 5.2 we were able to establish
that the hamiltonian is a constant along the optimal path. This result,
too, carries over to the more general situation considered here. We
calculate

lY = 1 aHx;+
Z aHti.+ L aHuk (11)
ax,
i-1 ._1 a1 k-1 auk
The first two terms are equal with opposite signs as a result of the
canonical equations (2). When the optimal uk is unconstrained, all/auk
vanishes. When at a constraint for a finite interval, however, we might
move along the constraint surface with changing Uk if more than one
decision enters into the constraint. Thus, we cannot simply set &k to
zero at a constraint. However, when the constraint U, is equal to zero

(
for a finite time, its time derivative must vanish, so that

U' I
k-1 auk
uk = 0 (12)

Furthermore, if H is minimized subject to the constraint U, = 0, the


Lagrange multiplier rule requires that
D
- -X a-. 13)

or, substituting into Eq. (11),


H=_X Laukuk (14)
k-I auk

But from Eq. (12) this is equal to zero. Thus,


H = coast along optimal path (15a)
Furthermore, from Eq. (3),
0unspecified: H = 0 0<t<0 (15b)
We may summarize the necessary conditions as the weak minimum
principle:
Given C[x(B)] and

H
i-1
THE MINIMUM PRINCIPLE 117

where
aH aH
z:=ayti ti:=-az;
with 2n boundary conditions
o) = 0 m = 1, 2, . . . , i if
9.[z(O)) =0 s 1, 2, . . . S
IV
aq-
7; (0) 1 7. i = 1, 2, n
m-t az,
as
Ti(0)=W S
Lg.
az,
i= 1, 2, n
ax,

The decisions uk(t) which minimize 6[z(O)] subject to the restrictions


U,(u) > 0 p = 1, 2, . . . ,P
make the hamiltonian H stationary when a constraint is not at equality
and minimize the hamiltonian (or make it stationary) with respect
to constraints at equality or at nondifferentiable points. Along the
optimal path the hamiltonian is a constant, and if 0 is unspecified,
that constant value is zero.
One final remark should be made. Since an integral is unaffected
by values of the integrand at discrete points, the arguments made here
and in Sec. 5.2 may fail to hold at a set of discrete points (in fact, over
a set of measure zero). This in no way affects the application of the
results.

6.6 EQUIVALENT FORMULATIONS


In assuming that the objective function to be minimized depends only
upon the state z(9) we have written the Mayer form of the optimization
problem. In previous chapters we have been concerned with the
Lagrange form, in which an integral of a function of state and decision
is minimized. It is useful to establish that the latter problem, can be
treated as a special case of the results of Sec. 6.5.
We consider the system
i, = f1(z,u) i = 1, 2, ,n . . . (1)
with appropriately bounded decisions and end points, and we assume that
u is to be chosen in order to minimize
Io !Y(z,u) dt (2)
1!! OPTIMIZATION BY VARIATIONAL METHODS

By defining a new variable, say xo, such that


xo = 3(x,u) xo(0) = 0 (3)

we may write
&=foxodt=xo(9) (4)

Thus, numbering from k = 0 to n, the hamiltonian, Eq. (1) of Sec. 6.5,


becomes

H = yog(x,u) + I 7kfk(x,u) (5)


k-1
with
aH
x a7;
fi n (6a)

aH
ax;
aT
-7o ax, - 4C7k ax,
afk
(6b)
k -1
rend
all
yo= -axo= 0 (6c)

Furthermore, since xo(O) is completely free, the boundary condition at


t =.Bfor -yois

7o(9) = axo = 1 (7)

which, together with Eq. (6c), establishes that 7o is identically unity.t


The hamiltonian is therefore .
n
H=+ i-I 7;f: (8)

as in Chaps. 4 and S.
It is also useful to consider cases .in which the independent varia-
ble t appears explicitly in the system equations or in the objective.
Here we consider systems
zi = f;(x,u,t) i= 1, 2, . . . ,n (9)

and objectives
8 = 6[x(9),91. (10)

t Any positive constant multiple of an integral of 3 could be minimized, so that yo


will be any positive constant. An unstated regularity assumption, similar to the one
used for the Lagrange multiplier rule in Sec. 1.8, prevents y, from becoming zero,
and hence there is no loss of generality in taking it to be unity.
THE MINIMUM PRINCIPLE in
The device which we use here is identical to that above. We define a
new variable xo such that
xo = 1 xo(O) = 0
in which case xo and t are identical. Equations (9) and (10) now depend
explicitly on xo, x1, . . . , xn but (formally) not on t. Thus, the results
of Sec. 6.5 must apply, and we write

H=7o+ 7:f: (12)

with
n
aH
Yo=-axo -- i
7; It- (13a)

a&
0 unspecified
7o(B) = axo a8 (13b)
unspecified xo(O) = 0 specified
n
Clearly nothing is changed except that the sum yifi is not equal to
i-1
a constant when some fi depend on t, since H = const and 7o varies
with t according to Eq. (13a).

6.7 AN APPLICATION WITH TRANSVERSALITY CONDITIONS


The essential improvement in the form of the weak minimum principle
developed in the two preceding sections over shat used in Chap. 5 is the
generalization of allowable boundary conditions for the state variable
and the corresponding boundary conditions for the multipliers, or, as we
now prefer to think of them, Green's functions. These latter conditions
are usually referred to as transversality conditions. We can demonstrate
the application of the transversality conditions by again considering the
simple second-order system of Sec. 5.3

where we now seek the minimum time control not to the origin but to a
circle about the origin of radius R; that is, we impose the final condition
9[z(8)I = x12(8) + x22(0) - R2 = 0 t=9
Using the equivalent formulation of Sec. 6.6, we have 5 = 1 for
110 OPTIMIZATION BY VARIATIONAL METHODS

minimum time control, and the hamiltonian is


H = 1 + 71x2 + y2u (2)

where y, and 72 satisfy the canonical equations

yI =
aH0 (3a)
ax,
aH
12 = -axs= -7i (3b)

Thus, as previously,
y, = c, = const (4a)
y2 = -c,t - C2 (4b)

and the optimal control is


u = -sgn 72 = sgn (c,t + c2) (5)
which can switch between extremes at most once. The allowable tra-
jectories are again the parabolas
x, x22 + const (6)
Since the objective does not depend explicitly on x,(0) or x2(0),
Eq. (12b) of Sec. 6.4 for the boundary conditions of the Green's func-
tions becomes
0

a$
11(0) = + `' ax, = 2yx,(e) (7a)

0
10

72(0) = $ + I, ax = 2vx2(0) (7b)

or, eliminating the unknown constant v,


yi(0) x:(0) (8)
1'2(0) - x2(0)
Thus, evaluating Eqs. (4) at time 0, Eq. (8) becomes
c, _ x,(0) (9)
- c,0 - c2 x,(0)
or
-C1
C, (10)
= 0 + x2(8)/x,(0)
The optimal control, Eq. (5), then becomes
r
u =sgn {c2 L1 - 0 + x2(0)/x1(0)
_
THE MINIMUM PRINCIPLE 191

When x2(8) and x1(8) have the same algebraic sign (the first and
-third quadrants), the algebraic sign of the argument in Eq. (11) never
changes (t < 8 and therefore t/[8 + x2(8)/x1(8)] < 1). Thus, all tra-
jectories ending on the circle of radius R in the first and third quadrant
must do so without switching between control extremes and must con-
sist of one of the parabolas defined by Eq. (6). Inspection of Fig. 5.4
clearly indicates that for approaches from outside the circle the optimum
is

u= -I x1(8)>0
x2(8) > 0
(12a)
x1(8) < 0
U = +1 (12b)
X2(0) < 0

For trajectories ending in the fourth quadrant, where x1(8) < 0,


x2(0) > 0, there is a switch in the optimal control at a value
1. = 8-}-x2(6) <8 (13)
x1(8)
where the argument of the signum function in Eq. (11) passes through
zero. Here the optimal trajectories, which must fi ish with control
u = - 1 in order to intersect the circle (we assume R < 1 to avoid the
possibility of intersection along a line u = +1), can have utilized
u = -1 for a time equal at most to
B -=-x1(0)
t, x2(0) > 0 (14)

prior to which the control must have been u = +1. The point on the
parabolic trajectory ending at x1(8), x2(8) corresponding to a time
0 - t. units earlier is

xl(t.) =
R2
x1(0)
- 1x2 2(8)
2 x12(8)

x2(t.) = x2(0) [x1(8) - 1]


x1(0)

which then defines the switching curve as x1(8) runs from zero to -R,
x2(0) from R to zero. The switching curve in the second .
quadrant is
similarly constructed, giving the entire feedback policy.

6.8 THE STRONG MINIMUM PRINCIPLE


We have called the necessary conditions derived thus far the weak
minimum principle because, with the experience we have now developed
and little more effort, a stronger result can be obtained. Specifically,
192 OPTIMIZATION BY VARIATIONAL METHODS

it is possible to prove that the optimal function u(t) not only makes the
hamiltonian stationary but that it makes the hamiltonian an absolute
minimum. This latter result is of importance in some applications.
The essential part of the derivation of the weak minimum principle
was in obtaining an infinitesimal change in x(®) and relating this change
to the hamiltonian and to the infinitesimal change in u for all t. It is
possible to obtain an infinitesimal change in x(O) (and thus S) by making
a change in u of finite magnitude if that change is made over a suffi-
ciently small time interval, and the hamiltonian enters in a different way.
Let us suppose that the optimal decision function u(t) and optimal
state vector !(t) are available and that at time t we have effected a
change in the state such that
x(tl) = !(t1) + &x(tl) 1sx(tl)j < E (1)
If in the interval t1 < t < 8 we employ the optimal decision 6(t), we
may write

xi = f,(x,u) = fi(2,6) + ,I1 ax, Sx, + 0(t) (2a)

or

Sz; = I ax` ax; + o(e) t, < t < 0 (2b)


s1
The equation is of this form because Bu = 0, t1 < t < B. Green's
identity then becomes

1 70) Sxi(0) = 7i(tl) axi(tl) + 0(E) (3)


i-1 i-1

where 7 satisfies the equations


n

il 'y
(4)
ax,
-1
Next we presume that for all time earlier than t, - A we make
no changes at all.Thus, x(ti 1(tl 1A), or
5x(t, - A) = 0 (5)
During the interval ti - A < t < t1 we set
u(t) = u(t) + su t1 - A < t < t1 (6)

where 1bul is finite. It follows then by direct integration that


e

x; (t) = ±101 - ) + 1 _o f, (X, u + Su) at t, - A < t 5 t1


THE MINIMUM PRINCIPLE 193

while the optimal values may be written


x;(t) = x;(tt - °) + J=' f,(!,u) dt t, - n < t < tl (8)

The variation in the state is then


bx;(t) = x;(t) - -ti(t) = Jto [f;(x, u + &u) - f,(2,u) dt
tl-°<t<tt (9)

and we may note a useful bound on its magnitude,


lbx,(t)l < max If,(x, u + 5u) - f;(x,u)I(t - t, + °)
a-e<t<a
t,-°<t<t, (10)
We are now able to relate the change in the objective to the finite
change 6u. Truncating a Taylor series after one term, we write.

of
MX, u + &u) = M2, u + &u) + i Sx; (11)
ill 8x;
where the partial derivatives are evaluated somewhere between x and
x. Because Eq. (10) establishes Sx as being of the same order as °,
any integral of fix over an interval ° must be of order °2. Thus
n
((t,
Jt,-o I' ax; bx; dt = o(°)
1
Cifi
(12)

and Eq. (9) becomes, at t = ti,


dx;(ti) =
J" [f (x, u + Su) - f;(x,u)) dt+ o(i)
t (13)

Furthermore, if the region t, - ° < t < ti includes no points of dis-


continuity of u, the integrand is continuous and we can write the exact
relationship as
bx;(ti) = [f;(x, u + &u) - f;(x,u)J ° + o(°) (k4)
and Eq. (3) as
n n
y,(o) bx;(B) u + &x) - y;f;(x,u)J t_t, ° + o(°) (15)

By imposing on y the boundary conditions of Eq. (12) of Sec. 6.4, Eq.


(15) becomes

at; _ [y;f,(x, (1 + &u) - y,f;(x,u)] It-t, ° + o(°) (16)


,_1
Mr OPTIMIZATION BY VARIATIONAL METHODS

relating the finite change in u to the corresponding infinitesimal change


in &.
The condition of optimality, 5& > 0, requires
n
A 'Yrf;(=, u + Su) + o(n) >_ n 'Y;f.(=,G) t = tl (17)

or, dividing by the positive number A and taking the limit as A - 0,


n
rf,(!, u + Su) > y.f:(=,u) (18)

where tj is any point of continuity of ft. The sum in Eq. (18) is the
hamiltonian
n

I tirf;
;-i
(19)

and so we have established that:


The hamiltonian takes on its absolute minimum value when evaluated
for the optimal decision function u(t).
This result is significantly stronger than the condition that the hamil-
tonian be a minimum at boundaries and simply stationary for interior
values of the optimal decision, and we call this, together with all the
conditions of Sec. 6.5, the strong minimum principle.

63 THE STRONG MINIMUM PRINCIPLE: A SECOND DERIVATION_


Before applying the strong minimum principle to some practical prob-
lems, it is useful for future comparisons to consider a second derivation
of the strong minimum principle based on Picard's method of the solu-
tion of ordinary differential equations. We return to the idea of a con-
tinuous infinitesimal variation Su(t), ISul < E, 0 < t < B, and, as in Sec.
6.3, we write the equations describing the -corresponding infinitesimal
variation fix. Now, however, we retain terms to the second order in e
z
5z; _ 5x; + Suk + 8xk Ex, Sxk
j-1 axj k-1
auk i
k-1
ax,
n R a2f, 1 R azf
+ j-1 k-1 I az; auk 6x; 6uk +
j .k-1 au; ou,
,
5uj Suk + o(e2) (1)

The essence of Picard's method is first to integrate Eq. (1) by con-


sidering the second-order terms as nonhomogeneous forcing terms in a
linear equation. In this light we introduce the appropriate Green's func-
THE MINIMUM PRINCIPLE 1!S

tions r;k(t,rr) as defined by Eq. (12) of Sec. 6.2


\
G r;k (t,T)
.
(2a)
k-1 j
1 i azk
=j
rij(t,t) = aij = (2b)
0 i=j
Then Green's identity becomes

az;(t) = r;,(t,0) 8xj(0) + Ia r;j(t,s) auk auk ds


j-1 j-1 k-1
Cn R Zf
+ o
/
j-1 k,1-1
I r;j(t,s)
auk but
but &ul ds

+
1
2
/=
o I
j,k1
r;j(t's) a2f
azk azl
bxk dxt ds

rt a2fj
+ 0
n
j,k--I
R
r;j(t,s)
OXk aul
axk aul ds + o(e2) 0<t<9
1

which is an integral equation for &x. We next substitute 6x(t) from Eq.
(3) into the last two terms of Eq. (3). For simplicity we set &x(0) to
zero, either as a special variation or because of fixed initial values, and
we obtain the explicit representation at t = 0,
n R af
6X, (0) =
Jo
0
11 r;j(o,8) aL. auk ds
i-1 k-I
n R 2j
c
+ 2 Io j-1
11 kJ-1
r`j(B's).au Jau, auk 6u, ds
()R
n a2j'.

fo rl,,,(s,v) au, do I ds
+ jo 1 1 r;j(e,8)
1 k,r-1
auk ax, auk our
n R

+2 Jo I I
j,k,l,m,r-11 r,w-1
r,,(o,s) azk2alxl

fo' rkrn(s,a) aum our da] ko rln(s,r) a , 6u dr 1 ds + o(E2) (4)

where all products of variations higher than the second have been
included in o(e2).
We now make a special choice of the variation Bu(t). As we have
already established that the hamiltonian is a minimum for noninterior
values of the optimal decision, we set au; to zero whenever the corre-
sponding decision u; does not correspond to a stationary value of the
1% OPTIMIZATION IJY VARIATIONAL METHODS

hamiltonian. Furthermore, we allow Su to be nonzero only over an


infinitesimal interval t1 - A < t < tI
OH
0 0
au,
bu,(t) = t1-A<t<t1 (5)
y,(t)F6 0
0 otherwise
In that case each of the first two terms in Eq. (4) is a single integral and
hence of order A, while the third and fourth terms involve products of
integrals containing Su and are of order W. Thus,
n R
bx;(O) = f I I r,,(o,s) auk yk ds
i-1 k-1
n R

+ 2 Jt
azf'au, ykyl ds + o(E2) + o(0)
rij(B,s) auk (6)
Je,! o j-2 k,l-1
If we multiply Eqs. (2) and (6) by a constant, y;(8), and sum over i = 1
to n, then, as in Sec. 6.2, we define a vector ?(t) satisfying
n

ax;yj 0<t<e
i-I

and

,(B) axi(e) ° I, o yi auk yk ds


i-1 i-1k-1
n R

+2 I°` n yi auza'ut ykyi ds + o(E2) + o(A) (7)


-1 k.l-1
We shall assume 0 to be fixed for simplicity. The variation in t;
caused by the change fix(O) is then, to second order,

bs = GI ax & (0) + 2 ax)(e) + o(f2) (8)


8xi 2axj'bxi(e)

But, from Eq. (3),


6xi(8) bxj(@) = 0(0) + o(E2) (9)

so that
as
bS = bxi (9) + O (A) + o (e2) (10)
ax:
i-1
THE MINIMUM PRINCIPLE -197

Thus, if for simplicity we assume x(8) to be unspecified and write


as
ti: (e) = ate;

and introduce the hamiltonian, the combination of Eqs. (10), (11), and
(7) becomes
n
aH 1
a2H
as = f 1 y' ds + 21=,-o G au; au; ysY; ds + o(A)
:-1
(12)

The first integral vanishes by virtue of the arguments used in the


derivation of the weak minimum principle. The nontiegativity of the
variation 53 requires that
a2H
n
t,j-1
aau; yiy' ? 0 (13)

for arbitrary (small) y and, in fact, equality in Eq. (13) is an exceptional


case which oannot occur when the minimum is taken on at discrete
points, so that Eq. (13), taken together with the weak minimum prin-
ciple, implies that the hamiltonian is a (local) minimum. If equality
shou'd obtain in Eq. (13), a consideration of higher-order terms would
lead to the same conclusion.
This derivation leads to weaker results than that of the previous
section. Rather than establishing, as in Sec. 6.8, that the minimizing
function u(t) causes the hamiltonian to take on its absolute minimum,
we have shown here only that the hamiltonian is a local minimum.
The extension to constrained end points and variable 0 is straightforward.
The particular usefulness of the somewhat weaker result of this section
will become apparent in considering sufficiency and in the discussion
of discrete systems in the next chapter.

6.10 OPTIMAL TEMPERATURES FOR CONSECUTIVE REACTIONS


In Sec. 4.12 we applied the weak minimum principle to the problem.
of determining the optimal temperature program for the consecutive-
reaction scheme
X1-> X2 - decomposition products
We return to that problem now to demonstrate the usefulness of the
strong principle and to impose a limitation on the solution we have
obtained.
13$ OPTIMIZATION BY VARIATIONAL METHODS

The system is described by equations


±1 = k,(u)F(xi) (la)
x2 = vk,(u)F(x,) - k,(u)G(x2) (1b)
where F and G must be positive functions for physical reasons. and k,
and k2 have the form
kr(u) = k,oe-s,'Iu i = 1, 2
The goal is to maximize x,(9) or, equivalently, to minimize
with x,(0) unspecified. Thus, the hamiltonian is
H = - y,k,F + y2vk1F - y2k2G (3)

with multiplier equations and boundary conditions

y, aH = (yi - 71(9) = 0 (4a)

aaH
'Y = y2k2G' 72(e) _ - 1 (4b)

Equation (4b) may be integrated to give the important result

y,(t) = - exp (,e k2G' d8) < 0 (5)

If we assume that the optimal temperature function u(t) is uncon-


strained, the conditions for $ minimum are
aH
_ y,k'F + ysrk,F - y2k'G = 0 (6)
8u =
a2H
aus = - yik''F + y3rki'F - yaks G > 0 (7)

Equation (6) can be solved for y, as

It = P72 - 72 k 2
k'G
jp (8)

in which case Eq. (7) becomes


/k',k
y2G ( ks,. >0 (9)
ki
.or, making` use of the negativity of y2 and the positivity of G, a necessary
condition for optimality is
k''ksk='<0 (10)
k'
THE MINIMUM PRINCIPLE in

Now, from the defining equation (2) it follows that

k' = k:oE;
u2

k`a = k,oE; a_B,,Iu - 2k.oE;


u, e-se u (1lb)
U4

so that Eq. (10) becomes, after some simplification,

k'oE= a-$j tu(E, - Es) S 0 (12)


U4

or, eliminating the positive coefficient,


E1 < E2 (13)

Thattis, the solution derived in Sec. 4.12 is optimal only when E1 < .E2.
When El > E2, the optimum is the highest possible temperature.

6.11 OPTIMALITY OF THE STEADY STATE

It has been found recently that certain reaction and separation processes
which are normally operated in the steady state give improved perfor-
mance when deliberately forced to operate in a time-varying manner, where
the measure of performance is taken to be a time-averaged quantity such
as conversion. Horn has shown a straightforward procedure for deter-
mining under i erhh.in conditions when improved performance mA.y he
expected in the unsteady state by the obvious but profound observation
that if the problem is posed as one of finding the time-varying operating
conditions to optimize a time-averaged quantity, the best steady state
can be optimal only if it satisfies the necessary conditions for the time-
varying problem. As we shall see, the application of this principle
requires the strong minimum principle.
We shall consider an example of Horn and Lin.of parallel chemical
reactions carried out in a continuous-flow stirred-tank reactor. The
reactions are
X2

X1

X,
where the reaction X1 -+ X2 is of order n and the reaction X1--' X,
is of first order. X2 is the desired product, and the goal is to choose
the operating temperature which will maximize the amount of X2. The
200 OPTIMIZATION BY VARIATIONAL METHODS

reactor equations are


z1 = -ux1n - aurxi - xl + 1 (1a)
x2 = uxln - x2 (lb)
where x1 and X2 are dimensionless concentrations, r is the ratio of acti-
vation energies of the second reaction to the first, u is the decision varia-
ble, the temperature-dependent rate coefficient of the first reaction, .and
time is dimensionless.
For steady-state operation the time derivatives are zero, and we
obtain a single equation in x2 and u by solving Eq. (16) and substituting
into (la)
0 = -x2 - xz'In(aur-tin + U -1/n) + 1 (2)
As we wish to maximize x2 by choice of u, we differentiate Eq. (2) with
respect to u to obtain
0 = - ax;au
1
n
- ax2
au (aur-1'n + u- 1/n)
x21Jn-1

xz1/n- nr - 1 1 u-0441thl (3)


\a n n
and since 49x2/(9u must be zero at an. internal maximum, solving for u
leads to the optimum steady-state value
=
- 1)JI,r
a(nr 1
(4)

Clearly this requires


nr - 1 > 0 (5)

To establish that Eq. (4) does indeed lead to a maximum we take the
second derivative of Eq. (2) with respect to u to obtain, after using Eq.
(4) and the vanishing of ax2/au,

0
z /
dug t l + x21/n-lu l/n nr r 1) - x21)n u-(I+2n)/n(6)
Since u and x2 are both positive, it follows at once that (92x2/au2 is nega-
tive and that a maximum is obtained.
If we formulate the problem as a dynamical one, our goal is to
choose u(t) in the interval 0 < t < 0 in order to maximize the time-
average value of x2(t) or, equivalently, minimize the negative of the
time-average value. Thus, i

aJ = - e f x2(t) dt (7)
THE MINIMUM PRINCIPLE 201

or, using the equivalent formulation which we have developed,

(8)
8

Using Eqs. (la) and (lb), the hamiltonian is then

H=-B+ y1(-uxl" - au'xl - xl + 1) + y2(uxi" - x2) (9)

The multiplier equations are


all
yi = - axl
= 71(nuxl°-' + au' + 1) - ny2ux1"-' (10a)

aH 1
12
(IX 2
= B+ 72 (10b)

and the partial derivatives of H with respect to u are

au = -ylxx^ - ay1ru'-'xl + 72xln


02 H
au2 = -ay,r(r - 1)u'-2x, (12)

Since we are testing the steady state for optimality, it follows that
all time derivatives must vanish and the Green's functions y, and y2 must
also be constant. Thus, from Eqs. (10),
nuxln-'
y (13a)

y2=-e1 (13b)

(Clearly we are excluding small initial and final transients here.) Setting
aH/au to zero in Eq. (11) and using Eq. (13) leads immediately to the
solution

[anr'_ 1) (14)

the optimal steady state, so that the optimal steady state does satisfy the
first necessary condition for dynamic optimality. However, it follows
from Eq. (13a) that y, is always negative, so that a2H/au2 in Eq. (12)
has the algebraic sign of r - 1. Thus, when r > 1, the second deriva-
tive is positive and the hamiltonian is a minimum, as required. When
r < 1, however, the second derivative is negative and the hamiltonian is
a local maximum. Thus, for r < 1 the best steady-state operation can
always be improved upon by dynamic operation.
We shall return to this problem in a later chapter with regard to the
202 OPTIMIZATION BY VARIATIONAL METHODS

computational details of obtaining a dynamic operating policy. We leave


as a suggestion to the interested reader the fruitfulness of investigating
the relation between steady-state optimality and minimization of the
steady-state lagrangian, with emphasis on the meaning of steady-state
Lagrange multipliers.

6.12 OPTIMAL OPERATION OF A CATALYTIC REFORMER

One of the interesting applications of the results of this chapter to an


industrial process has been Pollock's recent preliminary study of the
operation of a catalytic reformer. The reforming process takes a feed-
stock of hydrocarbons of low octane number and carries out a dehydro-
genation over a platinum-halide catalyst to higher octane product. A
typical reaction is the conversion of cyclohexane, with octane number of
77, to benzene, with an octane number of over 100.
A simple diagram of the reformer is shown in Fig. 6.1. The feed is
combined with a hydrogen recycle gas stream and heated to about 900°F,
then passed into the first reactor. The dehydrogenation reaction is endo-
thermic, and the effluent stream from the first reactor is at about 800°F.
This is teheated and passedto the second reactor, where the temperature
drop is typically half that in the first reactor. The stream is again heated
and passed through the final reactor, where the temperature varies only
slightly and may even rise. Finally, the stream is passed to a separator
for recovery of hydrogen gas, and the liquid is debutanized to make the

Hydrogen
product

FIg. 6.1 Schematic of a catalytic reforming process. (Courtesy of A. W.


Pollock.)
0
THE MINIMUM PRINCIPLE 203

reformate product, which may have an octane number as much as 60


greater than the feed.
Numerous side reactions deposit coke on the catalyst, reducing its
efficiency, and it becomes necessary eventually to shut down the reactor
and regenerate the catalyst. The problem posed by Pollock, then, was
to determine how to operate the reformer and when to shut down in
order to maximize the profit over the total period, including both oper-
ating time and downtime.
The decision variable is taken by means of a simplified model to be
the desired octane number of the product, which is adjusted by an
operator. For convenience we define u as the octane number less 60.
Pollock has used estimates from plant and literature data to write the
equation for coke accumulation, the single state variable, as
it = b2uO xx(0) = 0 (1)

and the time-average profit as proportional to


Q +A9
W e + r fo (B + u)[1 - (N + b,x,)u2] dt - (2)

Here r is the time needed for regeneration, B + u the difference in value


between product and feed divided by the (constant) marginal return for
increased octane, 1 - (bo + bix,)u2 the fractional yield of product, Q
the fixed cost of regeneration divided by flow rate and marginal return,
and A the difference in value between other reaction products and feed
divided by marginal return. We refer the reader to the original paper
for the construction of the model, which is at best very approximate.
The following values of fixed parameters were used for numerical studies:
bo=10-' b,=2X10-8
b2=35-a+,
r=2
The objective equation (2) is not in the form which we have used,
and to convert it we require two additional state variables, defined as
x2 = (B + u)[1 - (bo + b,x,)u2] x2(0) = 0 (3a)
xa = 1 x3(0) = 0 (3b)

In that case we can express the objective as one of minimizing

= Q - Ax3(9) - x2(e) (4)


xa(a) + r
The hamiltonian is then
H = ylb2u5 + y2(B + u)[1 - (bo + blxi)u2] + ya (5)
204 OPTIMIZATION BY VARIATIONAL METHODS

with multiplier equations

aH = -biu2(B + u)y2 710) = -=0 (6a)

72 - ax2
aH _ 0 as
y2(0) =axe
1

xa(e) + r (6b)
aH as _ Q - Ar - x2(9)
=0
ye
axe
73(9) = axe - Ix:(B) rJ 2
(6c)

It is convenient to define
Z = yi(O + r) (7)

which, together with Eqs. (3), (5), and (6), leads to

H = - 9 + r'Zb2ua + (B + u)[1 - (bo + bixi)u2


l
+Q-Ar-x2(9)1
6+r J
(8)

2 = b,u2(B + u) Z(a) = 0 (9)

Eliminating constant terms, minimizing the hamiltonian is equivalent


to maximizing
H* = Zb2u' li- (B + u)[1 - (bo + bixl)u2] (10)
The optimal 9 is determined by the vanishing of the hamiltonian
(a + r)(B + u)[1 - (bo + bIx1)u2]
= fa (B + u)[1 - (bo + blxl)u2J dt - (Q - Ar) t=9 (11)

Because of the preliminary nature of the model and the uncertainty


of the parameters, particularly A, B, Q, and S, a full study was not
deemed warranted. Instead, in order to obtain some idea of the nature
of the optimal policy, the value of Z at t = 0 was arbitrarily set at
-0.1. Equations (1) and (9) were then integrated numerically from
t 0, choosing u at each time instant by maximizing Eq. (10), subject
to constraints
30 < u < 40 (12)
(octane number between 90 and 100). 9 was determined by the point
at which Z reached zero. Equation (11) then determined the values of
Q and A for which the policy was optimal. A complete study would
require a one-parameter search over all values Z(0) to obtain optimal
policies for all Q and A. Figures 6.2, 6.3, and 6.4 show the optimal
policies for 0 = 1, 2, and 4, respectively, with values of B from 30 to
THE MINIMUM PRINCIPLE Zos

R°I 8
30
40

50

90
5 10 15 20 25 30 35 40 45
t
Fig. 6.2 Optimum target octane schedule for 15 - 1.
(Courtesy of A. W. Pollock.)

90. The near constancy of the policies for $ = 2 is of particular interest,


for conventional industrial practice is to operate at constant target
octane.
The remarkable agreement for some values of the parameters
between Pollock's preliminary investigation of optimal policies and
industrial practice suggests further study. We consider, for example,
the possibility that constant octane might sometimes represent a rigor-
ously optimal policy. Differentiating H* in Eq. (10) with respect to
u and setting the derivative to zero for an internal maximum yields,
after some rearrangement,

x1 _
Nb2_-1 Z + B - bol(B + 2)u2 + 2Bu] ( 13)
b1[(B + 2)u 2 + 2Bu] b1[(B + 2)u 2 + 2Bu]

8
2
0 100 30
U
40
98

E 96
E 60
94
70
92 0
O 90
90
0 5 10 15 20'. '25 30 35 40 45

Fig. 6.3 Optimum target octane schedule for l4 s 2.


(Courtesy of A. W. Pollock.)
206 OPTIMIZATION BY VARIATIONAL METHODS

f
Fig. 6.4 Optimum target octane schedule for B a 4. (Cour-
tesy of A. W. Pollock.)

Now, if u is a constant, then differentiating Eq. (13) with respect to time


$b2u8-'
(14)
x' b,[(B + 2)u2 + 2Bu) Z
and combining this result with Eqs. (1) and (9), we obtain a value for u

u B($-2)
B+2-$ (15)

We test for a maximum of H* by calculating the second derivative


0211 *
8u2
-2(bo + bixi)[(B + 2)u + B] + Zb20($ - OUP-2
(16)

Since Z < 0, it follows that we always have a maximum for u > 0,


or, from Eq. (15),
2<0<B+2 (17)

Equation (11) is easily shown to be satisfied for positive 0.


For S near 2 and large B (small marginal return), the rigorously
constant policy corresponds to very small u, which is verified by examina-
tion of Fig. 6.3. For small values of B (B near R - 2) constant values
of u of the order of 35 to 40 can be accommodated. For very large
$ ($ > 10) and large B corresponding large values of u are also obtained.
It is clear from Figs. 6.2 to.6.4 that only minor changes in shape occur
over wide ranges of B,, so that essentially constant policies will generally
be expected. Thus, we conclude that within the accuracy of Pollock's
model the industrial practice of constant-octane operation is close to
optimal for a significant range of parameters.
THE MINIMUM PRINCIPLE 207

6.13 THE WEIERSTRASS CONDITION


The minimum principle is a generalization of a well-known necessary
condition of Weierstrass in the calculus of variations. In Sec. 4.4 we
considered the problem of finding the function x(t) which minimized the
integral
& = Io Y(x,.t,t) dt (1)

where the hamiltonian was found to be


H=iF+x1x-1-X2 (2)
and l is the decision variable. The minimum condition is then
V2,14) + X1± < 5(2,±,t) + X1± (3)

where x denotes the optimal function. From the stationary condition


Eq. (7) of Sec. 4.4,

(4)

or

iT(211,t) - ff(z,x;t) ax (I - x) < 0 (5)

which is the Weierstrass inequality.


The somewhat weaker condition that the second derivative of the
hamiltonian be positive for a minimum becomes, here,
025
TX >
-0 (6)

which is known as the Legendre condition. Both these results are gener-
alized to situations with differential-equation side conditions.

6.14 NECESSARY CONDITION FOR SINGULAR SOLUTIONS

In Chap. 5 we considered several applications in which a portion of the


optimal solution lay along a singular curve. These were internal optima
.

in problems in which the decision appeared linearly, so that first deriva-


tives of the hamiltonian were independent of the decision. Consequently
'the second-derivative test for a minimum cannot be applied, and differ-
ent conditions are needed. To this end we return to the analysis in
Sec. 6.9.
Our starting point is Eq. (3) of Sec. 6.9, which we multiply by y;(9)
an OPTIMIZATION BY VARIATIONAL METHODS

and sum over i to obtain

as - 1 o'; a2H axk axj dt + fo ° kI axk au axk au dt


o k axk axj
"
+ 0 (1)

Here we have assumed a single decision variable and made use of the
fact that all/au is zero for an optimum and a2H/au2 vanishes for singular
solutions. fix(O) was taken to be zero, and we have deleted the term
a2S
axj(9) axk(e)
axj axk

leaving to the reader the task of demonstrating that for the special vari-
ation which we shall choose this term is of negligible magnitude compared
to the terms retained. We now assume the symmetric special variation

au = +E to < t < to + 0 (2)


-E to - A<t < to
in which case both integrals in Eq. (1) need be evaluated only over to -
A < t < to + O to within the specified order. Now we expand the inte-
.

grands in Taylor series about their values at to as follows:


a:H axkaxj = a:H + d a2H (t -to) + CAafk
ax-' ax, [ax. ozj I ax-" OZj au
(CA
IOU 0) + CA L'
au) (t

+ au.- au t - to)+
(CA afjaf'eafk! (3a)

aH
axk au
axkau fe [axkaH. +d: aH (t-to)+
au dt axle au ] [ Eoafk
au
+1 eG1 axj au auk) (t - to) + d ED j i
clu

ft ) (t 21to) + (3b)
a
Here all derivatives are evaluated at to, and ax has been calculated from
the equation

dik =
41I
akkax;+au
f
6U (4)

The symbol ± denotes + for t > to, - fort < to.


THE MINIMUM PRINCIPLE 209

Substituting Eqs. (3) into Eq. (1) and integrating, we obtain


E2A3 _a2H af, af, a2H (d af; _ af; af;
68
3 L ..
ax; ax; au au+ axi au dt au ` L ax, au
_ (d a2H.) al;] + o(E203) >'0 {5}
v dt ax; au au
in which case the term in square brackets must be nonnegative. This
may be shown to be equivalent to the statement
a d2 aH
au dt2 au
<0 (6)

which is, in fact, a special case of the more general relation which we
shall not derive
(d2k OH)
(-1) a VU d12k au ?0 k = 0, 1, 2, . . . (7)

Consider, for example, the batch- and tubular-reactor temperature-


profile problem of Sec. 5.11. The hamiltonian is.
H = p - r(a,T)(1 + X1 - JX2) - X2u (8)
where a and T are the state variables, and the singular arc corresponds
to X2 = 0, ar/aT = 0, a2r/aT2 < 0, the line of maximum reaction rate.
The state equations are
a = -r(a,T) (9a)
T = Jr(a,T) - u (9b)
so that the bracketed term in Eq. (5) becomes
- aT22

(1 + X1 - JX2) > 0 (10)

But X2 = 0, a2r/aT2 < 0, and since the singular are has been shown. to
be a final arc with \,(O) = 0, it easily follows from Eq. (6a) of Sec. 5.11
that a, > 0. Thus Eq. (10), is satisfied, and the singular are does
satisfy the further necessary condition.

6.15 MIXED CONSTRAINTS


Our considerations thus far have been limited to processes in which only
the decision variables are constrained during operation. In fact, it
may be necessary to include limitations on state variables or combina-
tions of state and decision variables. The latter problem is easier, and
we shall consider it first.
We suppose that the minimization problem is as described in
210 OPTIMIZATION BY VARIATIONAL METHODS

Sec. 6.4 but with added restrictions of the form


Qk(x,u) >_ 0 k = 1, 2, ... , K (1)

When every constraint is at strict inequality, Eq. (1) places no restric-


tions on variations in the optimal decision, and the minimum principle
as derived in Sec. 6.8 clearly applies. We need only ask, then, what
changes are required when one or more of the constraints is at equality
for a finite interval.
It is instructive to consider first the case in which there is a single
decision variable u and only one constraint is at equality. Dropping
the subscript on the constraint, we then have
Q(x,u) = 0 (2)

If the state is specified, Eq. (2) determines the decision. Any change
in x or u must satisfy, to first order,

bQ = IG ax:ay ax` + au au = 0
:-1
a6l
(3)

while, since the differential equations must be satisfied,

5x;= aiax1+a`bu (4)


-1
Substituting for bu from Eq. (3),

f 1 aQ
ax;, bx;
lax; - au (au) (5)

Thus, using the results of Sec. 6.2, along this section of the optimal
path the Green's vector must satisfy the equation
af, af a- Q 1 aQ

y' - ax; au au ax; y'


(6)

Although the canonical equations are not valid over the constrained
part of the trajectory we define the hamiltoniai as before, and it is
easily shown that H is constant over the constrained section. If con-
tinuity of H is required over the entrance to the constrained section,
the multipliers are continuous and H retains the same constant value
throughout and, in particular, H = 0 when 0 is unspecified.
In the general case of K1 independent constraints at equality
involving R1 >_ K1 components of the decision vector we solve the K1
THE MINIMUM PRINCIPLE 211

equations

aQkk aui
sui + I aQk axi = 0 k = 1, 2, . . . , Kl (7)

for the first K1 components Sui. (It is assumed that aQk/aui is of rank
K1.) The equation for the Green's vector is then
Ki

- axi -
af; 8J aQk
is = Spk - 1'i (8)
P.I au, Ox,

Here Sk is the inverse of the matrix consisting of the first Kl elements


oQk/aui; that is,
KI

Z S,k auk = a;, i,j=1,2,...Kl (9)


k-i

Upon substitution into the variational equations it follows that the


R, - K, independent components of u must be chosen to satisfy the
weak minimum principle and that H is again constant.

6.16 STATE-VARIABLE CONSTRAINTS


We are now in a position to consider constraints on state variables
only. Clearly we need only consider the time intervals when such
constraints are actually at equality. For simplicity we consider here
only a single active constraint
Q(x) = 0 (1)
The generalization is straightforward.
Let us consider an interval tl < t < t2 during which the optimal
trajectory lies along the constraint. Since Eq. (1) holds as an identity,
we can differentiate it with respect to t as often as we wish to obtain
Q Q(-) =0 ti<t<t2 (2)
In particular, we assume that the decision vector u first enters explicitly
in the mth derivative. If we then let
Q(m) (x) = Q(x,u) = 0 t, < t < is (3)
we can apply the results of the previous section to the mixed constraint
0 in order to obtain the multiplier equations. In addition, however,
we now have m additional constraints
QU)=0 j=0,1,2,...,m-1 (4)
which must be satisfied at t = tl.
212 OPTIMIZATION BY VARIATIONAL METHODS

If we apply Green's identity across the vanishingly small interval


ti < t < t1+, we obtain
n n

t. -
I yi ax: (5)
i-1 i-1

The Green's vector y(ti-) may be expressed as the sum of m + 1 vectors,


the first in being normal to the surfaces defined by Eq. (4). Thus
m-1 aQ(i)
yi(t1 )_ + ai (6)
io µi ax;
or, from Eq. (5),
m-1 aQ(j) n n
n
GG µ, a.'xi + 1 Qi ax; _ y; ax; is (7)
i-Oi-1 i-I i-1

But the constraints imply that


6Q(j) n
aQci)
= axi = 0 j = 0, 1,- (8)
1
i-1
8x;

so that Eq. (7) becomes


n n

I Gi axi Its- _ Ii-1


yi axi It1+ (9)
i-1
Since ax is continuous, Eq. (9) has a solution
d = y(t1+) (10)
and the Green's vector may then have a discontinuity of the form
n<-1
Qia)
ye(ti) yi(tl+) +
C1X
F1i (11)
-o
This jump con&ion is unique if we require that ?(t2) be continuous, and
the m components of i are found from the m extra conditions of Eq. (4).
As before, the hamiltonian has a continuous constant value along the
optimal path.

6.17 CONTROL WITH INERTIA


In all the control problems we have studied we have assumed that instan-
taneous switching is possible between extremes. A more realistic approx-
imation might be an upper limit on the rate of change of a control setting.
Consider again, for example, the optimal-cooling problem of Sec. 5.11.
THE MINIMUM PRINCIPLE

The state equations are


a = -r(a,T) (la)
T = Jr(a,T) - q (lb)
where we have written q for the cooling rate. Now we wish to bound
the rate of change of q, and so we write a third equation
¢ = u (1c)

with constraints

U* < u < u* (2a)


0 < q < q* (2b)
and perhaps a temperature constraint
T* <T <T* (2c)

The objective is again to minimize


& = f o' - r(a,T)] dt (3)

We may consider. u to be the decision variable, in which case Eqs.


(2b) arld (2c) denote constraints on the state. The hafniltonian is
H = -'Yir+y2(Jr-q)+yau+p-r (4)

and if neither constraints (2b) nor (2c) are in effect, the multiplier equa-
tions are
_ aH Or
7r '(1+Yr-J72) (5a)
as aa
_ aH Or
tie
aT aT
(l + yt - Jy2) (5b)

aH
Ya = - = y2 (5c)
aq

If control is not intermediate, u lies at one of its extremes, so that the


optimum is to change controls as fast as possible. If the constraint sur-g
face q = 0 or q = q* is reached, the control appears in the first derivative
and m =.1:
Q(I)=
a-t (q.*-q)u=0 (6)

Since Q is independent of x, the multiplirr equations are unchanged and


we simply reduce to the problem of Sec. 5.11.
Intermediate control is possible only if ya = 0, which, from Eq.
(k), implies 72 = 0. But y2 vanishes only along the line of maximum
!U OPTIMIZATION BY VARIATIONAL METHODS

reaction rate, so that the same intermediate policy is optimal. If a tem-


perature constraint is reached, say T = T*, we have

(T* - T) q - Jr(a,T*) = 0 (7a)


Qu> = dt
e(=)
(7b)
= dt
so that m = 2 and the coolant is varied at a rate
8r(a,T*)
u = Jr(a,T ' )
8a
(8)

It will be impossible to satisfy equality constraints on both temperature


and rate of cooling, as seen by comparing Eqs: (6) and (8), and so some
initial states will be excluded.
We wish to emphasize here the manner in which we have developed
the optimal policy for this problem. In See. 3.5 we studied the optimal
temperature policy. We found in Sec. 5.11 that, when possible, the
optimalSoolant policy reduces to one of choosing the optimal tempera-
ture policy. Here we have found that, except when switching, the opti-
mal coolant rate of change is one which gives the optimal coolant policy.
This hierarchy of optimization problems is typical of applications, and
the physical understanding eliminates some of the need for rigorous
establishment of optimality of intermediate policies.

5.18 DISCONTINUOUS MULTIPLIERS


One of the most striking features of the presence of state-variable con-
straints is the'possibility of a discontinuity in the Green's functions. We
shall demonstrate this by a problem of the utmost simplicity, a first-
order process,
±1 = U (1)

We assume that xl must he within the region bounded by the straight lines

Q1=xi-(ai--$it)>0 (2a)

Q2 = -xi + (a= - s=t) > 0 (2b)

and that we choose u to minimize

5--210 (x1'+c'u')dl (3)

while obtaining xi(8) = 0. The reader seeking a physical motivation


THE MINIMUM PRINCIPLE 215

might think of this as a power-limited mixing process in which the con-


straints are blending restrictions.
It is convenient to define a new state variable
x2 = 3 (x12 + c2u2) x2(0) = 0 (4)

and, because time enters explicitly into the constraints


za = 1 x3(0) = 0 x3(8) = 0 (5)

We then have
g = x2(0) (6)
Q1 = x1 + $1x3 - al 1 0 (7a)
Q2 = -x1 - $2x3 + a2 0 (7b)
The hamiltonian is
H = Y1u + i2Y2(x12 + c2u2) + 73 (g)

and, along unconstrained sections of the trajectory, ,

tit = - Y2x1
y2 = 0 Y2(8)

'Y3=0
Furthermore, along unconstrained sections the optimal path is defined by
c2x1-xl=0 (12)

From the time t1, when xl intersects the constraint Q1 = 0, until leav-
ing at t2, we have
1=0=u+$1 t1 <t<t2 (13)

and since this is independent of x, the multiplier equations are unchanged.


In particular, 72 and Y3 are constants with possible jumps at intersections
of the free curve and the constraint.
The jump condition, Eq. (11) of Sec. 6.16, requires that
Y1(ti) = Y1(t1+) + µ (14a)
Y2(t1-) = Y2(t1+) (14b)
7301) = Y3(tl+) + µ01 (14c)

and together with the continuity of the hamiltonian it follows that the
control is continuous
u(tl) = -$1 = u(t1+) (15)
216 OPTIMIZATION BY VARIATIONAL METHODS

In the interval 0 < t < t, Eq. (12) must be satisfied, and the solution with
x,(t,) = al - Iglti (16)
is

XI [a, - Olt, - x,(0)e-11'1 sinh (t/c) + XI(0)e_ 1c (17a)


sinh (t,/c)
cosh (t/c)
u = [al - Olt, - x,(0)e-'d°] c sinh (tile) - x1(0) e _,1c
(17b)
c

Because of the continuous concave curvature of x,(t) the point t, is the


first at which u = -t,. Thus t, is the solution of
(a, - alt,) cosh t, + Ole sirrh ' - x,(0) = 0 (18)

The solution curve x,(4) leaves the constraint Q, = 0 at t = t2. In


the interval between constraints Eq. (12) must again be satisfied, and it
is a simple consequence of the curvature of the solution of that equation
and the continuity of u(t2) that the solution curve x,(t) can never inter-
sect the line Q2 = 0 for any t < 0. The solution of Eq. (12) satisfying
x,(!) = a, - $,t2 and x,(0) = 0 is
sinh [(0 - t)/c]
x,(t) _ (al - a,t2) sinh (19a)
[(0 - t2)/c]
cosh [(0 - t) /c]
u = - (a, - alt2) c sinh [(0 - t2)/c] (19b)

The contribution to the objective from this final section is made as small
as possible by choosing 0 as large as allowable, 0 = a2/02 Evaluating
Eq. (19b) at t2, the continuity of u(t2) requires that t2 be the solution of
at $2t2
(a, - Nlt2) coth
- - CO, = 0 (20)

Finally, we return to the multipliers. Since Y2 is continuous, it fol-


lows from Eq. (10) that Y2 = 1. Thus, setting all/au to zero, we obtain
for unconstrained sections

Yl = -C2u (21)

In particular, at = t, and, from Eq. (13), t = t2


tt
YI(tl-) = c2R1 = Yl(t2) (22)

But since Eq. (9) applies along the constraint,


'Y,= -x, =alt - ca, tl <t<t2 (23)
THE MINIMUM PRINCIPLE 217

and integrating with the boundary condition at t2,

71(ti+) = c2$1 + i2Nl(t12 - t22) - al(tl - t2) (24)

Thus, from Eqs. (14a), (22), and (24),

Fu = al(tl - t2) - %ZSl(tl2 - t22) (25)

and 7 is discontinuous at ti.

6.19 BOTTLENECK PROBLEMS


Bottleneck problems are a class of control problem typified by constraints
of the form
u < 0(x) (1)
When there-is little of the state variable and .0 is small, a bottleneck
exists and little effort can be exerted. When 0 has increased and the
bottleneck has been removed, large effort can be used. Many economic
problems are of this type, and a bottleneck model has recently been
employed to describe the growth of a bacterial culture, suggesting an
optimum-seeking mechanism in the growth pattern.
The example of a bottleneck problem we shall examine has been
used by Bellman to demonstrate the application of dynamic program-
ming. We have
xi = alu1 (la)
x2 = a2u2 - U1 (lb)

with
U1iu2>0 (2)

Q1 = x2 - ul - U2 >0 (3a)
Q2 =xl-u2>0 (3b)

and we seek to maximize x2(9). In this case the multiplier boundary


conditions are
71(0) = 0 72(0) = - 1 (4)

and the hamiltonian is


H = ylalul + 72a2u2 - 72u1 (5)

At t = 8 the hamiltonian becomes


H = -a2u2 -I- u1 t=0 (6)
218 OPTIMIZATION BY VARIATIONAL METHODS

in which case we require u2 to be a maximum, ul a minimum. Thus,


ul(9) = 0 (7a)
and, from Eqs. (3),
u2(9) = min j x'(8) (7b)
11
x2(8)

We shall carry out our analysis for x2(9) > x1(9), in which case
u2(9) = x1(9) (8)
Then Q2 = 0 is in force, and just prior to t = 0 the state satisfies the
equations
x1 = 0 (9a)
z2 = a2x1 (9b)

or
x1(t) = x1(8) (103)
x2(t) = a2x1(8)(t - 0) + x2(9) (104)

The Green's functions satisfy Eq. (5) of Sec. 6.15 when Q2 = 0 is in effect,
which leads to

71 = - [afI afl (-9Q2\-' aQ2


ryl
axl au2 au2/ 8xI

aft aft aQ2 -' aQ2 72 = - a272 (11a)


Cax1 - au2 (au2) axl

[afl _ afl aQ2 -' aQ2


'Y2 = - ax2
71
ax2 au2 Cau2)
raft afs -L-(,q)' 72=0 (11b)
ax2 au2 au2 ax2

or
72 = 1
(12a)
71 = a2(8 - t) (12b)

This final interval is preceded by one in which either


72(t) - a71(t) > 0 x2(t) > xl(t) (13a)
or
72(t) - a,71(t) < 0 X1(t) > x2(t) (13b)

The former condition ends at time t1, when 72 equals a171, which is, from
Eq. (12),
tl = 9 _ 1
alas
(14a)
THE MINIMUM PRINCIPLE

The latter ends at t2, when x, = x2, from Eq. (10)

it
__

0
_ I' - 11
x,(8) at
1
(14b)

The simpler case encompasses final states for which it > t,. I or times
just preceding t2i then, the optimal policy is
u1 = 0 (15a)
X1(t)
U2 =min x2 (15b)
1X2(t)) =
The state equations are
z,=0 (16a)
x2 = a2x2 (16b)

in which case
x1(t < t2) = x,(t2) (17a)
x2(t < t2) < x2(t2) (17b)

so that x2 < x, for all t < it during which this policy is in effect. But
now Q, = 0 is in effect, leading to multiplier equations
7', = 0 71 = 71(12) = a2(0 - t2) (18a)
72(t2)e-°,(e-y) = e-%(-y)
7'2 = -a272 72 = (18b)

and

72(t) - a,71(t) > 72(12) - a,7,(12) > 0 t < it (19)

so that the optimal policy is unchanged.for all t < t2.


The second possibility is t, > t2. For times just prior to t, the
coefficient of u2 in the hamiltonian is negative, while x2 > x1. If the
coefficient of u1 is zero and u, is intermediate while u2 is at its maximum,
the multiplier equations are unchanged and 72 - a7, < 0, which is a
contradiction. Thus, the coefficient of u1 is also negative, requiring u1
to be at a maximum. The only possibilities compatable with x= > x1 are
u2=x1 ul=x2-x, (20a)
U2 = 0 u, = x1 (20b)

The second case corresponds to Q, = 0, Q2 not in effect, in which case it


follows from the previous discussion that the coefficient of u1 is positive,
resulting in a contradiction. Thus, for t < t, the policy is Eq. (20a).
It follows directly by substitution into the state and multiplier
equations that this policy remains unchanged for all t < t,. Thus, the
220 OPTIMIZATION BY VARIATIONAL METHODS

optimal policy can be summarized as


1 x2 S XI, u1 = 0, u2 = x2
0<t<B- a1a2 x2 > x1, u1 = x2 - x1, u2 = XI
1
0- a1a2 <t<0 U2 = min (x1,x2)
(21)

Bellman has established the optimality of this policy by an interesting


duality relation which establishes strong connections between this type of
linear bottleneck problem and a continuous analog of linear programming.

6.20 SUFFICIENCY
In Sec. 1.3 we found that only .a slight strengthening of the necessary
conditions for optimality was required in order to obtain sufficient con-
ditions for a policy that was at least locally optimal. Sufficient con-
ditions in the calculus of variations are far more difficult to establish,
and, in general, only limited results can be obtained.
Our starting- point is Eq. (4) of Sec. 6.9, which, after multiplication
by -yi(6) and summation from i = 1 to n yields
n R
e aH 82H
/i
1
yi(9) 8xi(B) = Ia Si6k + R` Su; but
auk .41 49% auk
i-1 k 1

Suky; + 2 axza kYIYk A + o(E2) (1)


+ k-1 i-1
L auk a ;

where y'is defined as

IOa
y7(3) = auk(t) dt (2)
i-1 k-1 r"(s,t)
L.l L.l auk

If we restrict attention to situations in which g is a linear function of


components of a(9) so that the second derivative vanishes identically,
Eq. (1) is an expression of 53 for an arbitrary (small) variation in u.
We require conditions, then, for which a& is always strictly positive for
nonzero 5u.
When a function satisfying the minimum principle lies along a
boundary, the first-order terms dominate. Thus we need only consider
interior values at which aH/auk vanishes and the hessian is positive
definite. The two remaining terms in Eq. (1) prevent us from stating
that bg is positive for any change in u, rather than the very special
changes which we have considered. A special case of importance is that
THE MINIMUM PRINCIPLE 221

of minimizing
e
fo ff(x,u) dt (3)

with linear state equations


R

xi Aijxj + I "ikuk DD
(4)

By introduction of a new state variable this ,is equivalent to a form in


which S is a linear function of the final state. The hamiltonian is then
n n R'r

+ + Y L riHikuk (5)
=1 k=1
and for a function satisfyh: the minimum principle Eq. (1) becomes
R 25
R
SS-1J\1
r //
0 au auk 444
a2
axj auk 6%yi

j,k - l j-1 k+1


+ I- a yjyk
j,k 1
ax; axk /j de + o(E2) (6)

If if is strictly convex, that is,


R a2 n R a2_
p
Ld auk auk
ajak + 2 ftj auk Nj«k
j,k-1 j-1 k=1
n
a25
+ axi axk
Pjlek > 0 (7)
j,k-1
at all differentiable points for arbitrary nonzero a, g, then 6& is positive.
Thus, the minimum principle is sufficient for local optimality in a linear
system iethe objective is the integral of a strictly convex function. An
important case is the positive definite quadratic objective, which we have
considered several times.

APPENDIX 6.1 CONTINUOUS DEPENDENCE OF SOLUTIONS


The linearizations in this chapter are all justified by a fundamental result
about the behavior of the solutions of differential equations when small
changes are made in initial conditions or arguments. We establish that
result here.
Consider
x = f(x,u) 0 < It < 0 (1)
OPTIMIZATION BY VARIATIONAL METHODS

where u(t) is a piecewise-continuous vector function. We assume that f


satisfies condition with respect to x and u
If(zi,ut) - f(x2,us)I < Lp([xI - z1I + Iul - u2I)
. (2)
where Lp is a constant and any definition of the magnitude of a vector
may be used. If we specify a function u(t) and initial condition 2o, Eq.
(2) is sufficient to ensure a unique solution 2(t) to Eq. (1). Similarly,
if we specify u = u + Bu and xo = 20 + Sao, we obtain a unique solution
2 + Sx. We seek the magnitude of Sx at some finite terminal time 9 for
which both solutions remain finite
Let
16u(t)1, ISxoI < C. (3)

If we integrate Eq. (1) for the two choices of xo and u and subtract, we
obtain
fo'
. fix (t) = 5xo + [f(2 + &x, u + &u) - f(2,u)I ds 0 < t < B (4)
Making use of Eqs. (2) and (3), this leads to
Isx(t)I < e + Lp fog (ISxI + 16u() ds 0<t<e (5)

If ISzI. is equal to the least upper bound of ISxI in 0 < t < 0, then
[fix (t)( <- e(1 + Lpt) + (6)

Substituting Eq. (6) back into Eq. (5), we obtain


I&x(t)I < e(1 + Lpt) + et + Y2Lpt2 + i2LpI5a1mt2 (7)

and by continued substitution we obtain, finally,


I5z(t)I < e(e=La' - 1) (8)

or
Isz(e)I < Ke (9)

where
K = 2eLD° - 1

and depends on 0 but not on Sz0 or Su(t).

BIBLIOGRAPHICAL NOTES
Section 6.2: Solution proxdures for linear ordinary differential equations are discussed
in any good text, such as
E. A. Coddington and N. Levinson: "Theory of Ordinary Differential Equations,"
McGraw-Hill Book Company, New York, 1955
THE MINIMUM PRINCIPLE

The linear analysis used here is covered in


M. M. Denn and R. Aria: Ind. Eng. Chem. Fundamentals, 4:7 (1965)
L. A. Zadeh and C. A. Desoer: "Linear System Theory," McGraw-Hill Book Com-
pany, New York, 1963
Green's identity introduced here is a one-dimensional version of the familiar surface
integral-volume integral relation of the same name, which is established in any book
on advanced calculus, such as
R. C. Buck: "Advanced Calculus," 2d ed., McGraw-Hill Book Company, New York,
1965

Section 8.5: The properties of the adjoint system of the variational equations were exploited
in pioneering studies of Bliss in 1919 on problems of exterior ballistics and subse-
quently in control studies of Laning and Battin; see
G. A. Bliss: "Mathematics for Exterior Ballistics," John Wiley & Saps, Inc., New
York, 1944
J. H. Laning, Jr., and R. H. Battin: "Random Processes in Automatic Control,"
McGraw-Hill Book Company, New York, 1956

Sections 6.4 and 6.5: The development is similar to


M. M. Denn and R. Aris: AIChE J., 11:367 (1965)
The derivation in that paper is correct ealy for the case of fixed 0.

Section 6.8: The earliest derivation of a result equivalent to the strong minimum principle
was by Valentine, using the Weierstrass condition of the classical calculus of varia-
tions and slack variables to account for inequality constraints; see
F. A. Valentine: in "Contributions to the Theory of Calculus of Variation, 1933-37,"
The University of Chicago Press, Chicago, 1937
The result was later obtained independently by Pontryagrin and coworkers under some-
what weaker assumptions and is generally known as the Pontryagin maximum (or
minimum) principle; see
V. G. Boltyanskii, R. V. Gamkrelidze, and L. S. Pontryagin: Rept. Acad. Sci. USSR,
110:7 (1956); reprinted in translation in R. Bellman and It Kalaba (eds.),
"Mathematical Trends in Control Theory," Dover Publications, Inc., New York,
1964
L. S. Pontryagin, V. A. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko:
"The Mathematical Theory of Optimal Processes," John Wiley & Sons, Inc.,
New York, 1962
A number of differ.mt approaches can be taken to the derivation of the minimum principle.
These are typified in the following references, some of which duplicate each other in
approach:
M. Athans and P. L. Falb : "Optimal Control," McGraw-Hill Book Company, New
York, 1966
L. D. Berkovitz: J. Math. Anal. Appl., 3:145 (1961)
A. Blaquiere and G. Leitmann: in G. Leitmann (ed.), "Topics in Optimization,"
Academic Press, Inc., New York, 1967
224 OPTIMIZATION BY VARIATIONAL METHODS

S. Dreyfus: "Dynamic Programming and the Calculus of Variations," Academic


Preen, Inc., New York, 1965
H. Halkin: in G. Leitmann (ed.), "Topics in Optimization," Academic Press, Inc.,
New York, 1967
1. Hestenes: "Calculus of Variations and Optimal Control Theory," John Wiley &
Sons, Inc., New York, 1966
R. E. Kalman: in R. Bellman (ed.), "Mathematical Optimization Techniques," Uni-
versity of California Press, Berkeley, 1963
E. B. Lee and L. Markus: "Foundations of Optimal Control Theory," John Wiley &
Sons, Inc., New York, 1967
U. Leitmann: "An Introduction to Optimal Control," McGraw-Hill Book Company,
New York, 1966
L. Neustadt: SIAM J. Contr., 4:505 (1966); 5:90 (1967)
J. Warga: J. Math. Anal. Appl., 4:129 (1962)
The derivation used here is not rigorous for the constrained final condition, but the trans-
versality conditions can be obtained by using penalty functions for end-point con-
straints and taking limits as the penalty constant becomes infinite. We have assumed
here and throughout this book the existence of an optimum. This important ques-
tion is discussed in several of the above references.

Section 6.9: We follow here

M. M. Denn and R. Aris: Chem. Eng. Sci., 20:373 (1965)


The result is equivalent to the Legendre-Clebsch condition of classical calculus of variations.

Section 6.10: This result was obtained by Aris; see


R. Aris: "Optimal Design of Chemical Reactors," Academic Press, Inc., New York,
1961

Section 6.11: The example and mathematical development follow

F. Horn and R. C. Lin: Ind. Eng. Chem. Process Design Develop., 6:21 (1967)
A good introduction to the notion of process improvement by unsteady operation is

J. M. Douglas and D. W. T. Rippin: Chem. Eng. Sci., 21:305 (1966)

Section 6.12: The example is from


A. W. Pollock: Applying Pontryagin's Maximum Principle to the Operation of a
Catalytic Reformer, 16th Chem. Eng. Conf. Chem. Inst. Can., Windsor, Ontario,
1966

Applications to other problems with decaying catalysts have been carried out by

A. Chou, W. H. Ray, and R. Aris: Trans. Inst. ('hem. Engre. (London), 45:T153 (1967)
S. Szepe and O. Levenspiel: unpublished research, Illinois Institute of Technology,
Chicago

Section 6.13: The classical Weierstrass and Legendre conditions are treated in any of the
texts on calculus of variations cited for Sec. 3.2.
THE MINIMUM PRINCIPLE 225

Section 6.14: See

B. S. Goh: SIAM J. Cont., 4:3091(1966)


C. D. Johnson: in C. T. Leondes (ed.), "Advances in Control Systems," vol. 2,
Academic Press, Inc., New York, 1965
H. J. Kelley, R. E. Kopp, and H. G. Moyer: in G. Leitmann (ed.), "Topics in Opti-
mization," Academic Press, Inc., New York, 1967
These papers develop the necessary conditions in some detail and deal with several non-
trivial applications. A very different approach applicable to certain linear problems
is taken in
A. Miele: in G. Leitmann (ed.), "Optimization Techniques with Applications, to
Aerospace Systems," Academic Press, Inc., New York, 1962

Sections 6.15 and 6.16: The approach is motivated by


A. E. Bryson, Jr., W. F. Denliam, and S. E. Dreyfus: AIAA J., 1:2544 (1963)
State-variable constraints are included in Valentine's formulation cited above and are
treated in most of the other references for Sec. B.S. See also the review paper

J. McIntyre and B. Paiewonsky: in C. T. Leondes (ed.), "Advances in Control Sys-


tems," vol. 5, Academic Press, Inc., New York, 1967

Section 6.17: This problem was solved using other methods by

N. Blakemore and R. Aria: Chem. Eng. Sci., 17:591 (1962)


Other applications of the theory can be found in the references cited for Secs. 6.8 and 6.15
and

J. Y. S. Luh, J. S. Shafran, and C. A. Harvey: Preprints 1987 Joint Autom. Contr.


Conf., Philadelphia, p. 144
C. L. Partain and R. E. Bailey: Preprints 1987 Joint Autom. Contr. Conf., Philadelphia,
p. 71
C. D. Siebenthal and R. Aris: Chem. Eng. Sci., 19:729 (1964)

Section 6.19: The example was solved by other methods in

R. E. Bellman: "Dynamic Programming," Princeton University Press, Princeton,


N.J., 1957

An intriguing analysis of batch microbial growth as an optimum-seeking bottleneck


process is in
C. H. Swanson, R. Aris, A. G. Fredrickson, and H. M. Tsuchiya: J. Theoret. Biol.,
12:228 (1966)

Section 8.20: Sufficiency for more general situations is considered in several of the refer-
ences cited for Sec. 6.8.

Appendix 6.1: This is a straightforward] extension of a well-known result in differential


equations; see, for example, the book by Coddinglon and Levinson cited above.
226 OPTIMIZATION BY VARIATIONAL METHODS

PROBLEMS
6.1. Show that the equation

d p(t)E + h(t)i s u(t)

is Self-adjoint in that the adjoint equation is


d
p(t)i' + h(t) r - 0

6.2. Obtain the control which takes the second-order system


x+ax+bx - u
Jul < 1
from some initial state to a circle about the origin while minimizing
(a) time
(b) E-2Io (x'+cu')dt
U. Determine the relation between optimality of the steady state and the character
of stationary points of the steady-state lagrangian. Interpret this result in terms of
the meaning of steady-state Lagrange multipliers.
6.4. Examine the singular solutions of Secs. 5.9 and 5.10 in the context of the necessary
condition of Sec. 6.14.
6.5. Goddard's problem of maximizing the height of a vertical flight rocket is described
by
h-v
o I (T - D) -
m
rh s -
-T
c

Here h is the altitude, v the velocity, m the mass, g the acceleration due to gravity,
and c the exhaust velocity. D is the drag, a f unction of v and h. The control variable
is the thrust T, to be chosen subject to
B<T <T'
to maximize the terminal value of h for fixed m. Show that intermediate thrust is
possible only for drag laws satisfying
aD 2a
c= 0
av! c 0,V

6.6. Given
=u
X(0) - z(1) 0
x(0) - vo > 0 z(l) + -vl < 0
Jx(t) I < L
THE MINIMUM PRINCIPLE 227

find u(t), 0 < t < 1 to minimize

u!(t) dt
2 Jo
(The problem is due to Bryson, Denham, and Dreyfus.)
6.7. Consider the problem of determining the curve of minimum length between point
(x,o,x20) and the origin which cannot pass through a closed circular region. The
equations are
i, = sin u
it = Cog U
(xi-a):+xs:>_R'

B
min 6 = (o dt

Find u(t). (The problem is due to Leitmann.)


6.8. In some practical problems state variables may be discontinuous at discrete
points (e.g., the mass in a multistage rocket) and the form of the system equations
might change. Consider the system
xti(") = f (")(x("),u) 4-1 < t < t"
with discontinuities of the form
z("+1%) = lim z(t" + E) = lim x(tn - E) + {« =

where t" is a specified constant and t" is defined by the condition


+y"(z,t} = 0
For specified initial conditions zo and decision and final state constraints
U. <U <u
Ot[z(N)(1N)1 = 0
find the control which minimizes
H
5[x ")(t),u(t)i dt
nil f :.
(The problem is due to Graham.)
7
Staged Systems

7.1 INTRODUCTION
In Chap. 1 we briefly examined some properties of optimal systems in
which the state is described by finite difference equations, while the pre-
ceding four chapters have involved the detailed study of continuous sys-
tems. We shall now return to the consideration of staged systems in
order to generalize some of the results of the first chapter and to place
the optimization of discrete systems within the mathematical framework
developed in Chap. 6 for continuous systems. We shall find that the
conditions we develop are conveniently represented in a hamiltonian for-
mulation and that many analogies to continuous variational problems
exist, as well as significant differences.
In the study of continuous systems it has often been possible to
suppress the explicit dependence of variables on the independent varia-
ble t so that inconvenient notational problems have rarely arisen. This
is not the case with staged processes, where it is essential to state the'
location in space or discretized time precisely. For typographical rea-
sons we shall denote the location at which a particular variable is con-
sidered by a superscript. Thus, if x is the vector representing the state
xn
STAGED SYSTEMS 229

of the system, the state at position n is x", and the ith component of
The systems under consideration are represented by the block
x" is xi".
diagram in Fig. 7.1, and the relation between the input and output of
any stage is
x" = f"(x"-1,u") n = 1, 2, . . . , N (la)
or
1 2
xi" = fi"(x"-1,un)

(lb)
n= 1, 2,, N
S is the number of variables needed to represent the state. The func-
tions f" need not be the same for each stage in the process.

7.2 GREEN'S FUNCTIONS


Linear difference equations may be summed in a manner identical to the
integration of linear differential equations by means of Green's functions.
We consider the system
S
x,"= Aij"x,^+bi-
i=1,2, S
n = 1, 2, N (1)
j-1

We now introduce S1 functions rkiN", i, k = 1, 2, 8, at each stage, . . ,

denoting both the stage number n and the final stage N, with which we
multiply Eq. (1) and sum over n from 1 to N and over i from 1 to S.
Thus,
N S N S N S
I I rki Nnxin= 4 r ) kiNnAijxjn-1 + I 4 rki Nnbi n
n-1 i-1 n-1 i.j-1 n-1 i-1
k= 1, 2, . . . S (2)

The left-hand side of Eq. (2) may be rewritten (summed by parts) as


N S N S S

I I rkiN"x," = InI-IrkjN."-'xjn-1
n-lei-1
j-1 + I rkiNNxiN
ILLL

i-I
s
- I rkiNOXie (3)
{-1

X'=f1(XO, u1)
=f0(X"-1
U") + XN=fN(IN-1, UN)

-
X11

Xa X"
XN-1 IN
Sta g e n .. . j. St age N

Decision u'
i
Decisi on u" Decision uN

Fig. 7.1 Schematic of a staged system.


230 OPTIMIZATION BY VARIATIONAL METHODS

which yields, upon substitution into Eq. (2),


S S

1,kiNNxiN - I rkiNOxi0
i-1 i-1
N S S N S
(`j ``J

1N
n-1 j-1
I \I t/
i-1
rkiNnAijn '- 1'A.ji"n-1
/ xjn-1 + n-1i-1
rkiN nb,n

k = 1, 2, . . , S (4)

Since the goal of summing the equations is to express the output x"'
in terms of the input x° and the forcing functions bn, we define the Green's
functions by the adjoint difference equations

rkjN.n-1 =
(S
\ rkiNnAijn k,j=1,2,...,5
i-L I n=1,2,...,N (5)

rkjNN
= akj =
1 k - `6)
o k

Thus, Eq. (4) becomes a special form of Green's identity


S N S
xiN = I rjNOxj0 + I I riiNnbjn (7)
j-1 n-ij-1
These equations are analogous to Eqs. (12), (14), and (15) of Sec. -6.2,
but it should be noted that the difference equation (5) has an algebraic
sign opposite that of the differential equation (12) of that section.
We shall generally be interested in a linear combination of the com-
ponents xiN. If we define a vector yn with components y,n by

,yin = yjNrj,Nn i=1,2,...,S (8)


i-1
where y; ' are specified numbers, then by multiplying Eqs. (5) to (7) by
yiN and summing over i we obtain the scalar form of Green's identity
s S N S
y'NX,N
(9)
i-1 I 7i0A0 +n-1
i-1 I I i-1
y, nbin
and
s
yin-1 _ i = 1, 2, . . . ,S
--
j 1
n=1,2, .,N (10)
STAGED SYSTEMS Uu

7.3 THE FIRST VARIATION


We now turn to the optimization problem. The state is described by
the equations

xin = fin(xn-I,un)
i=1,2,...,5
n= 1,2, . . . N (1)

with the objective the minimization of a function of the final state S(xN).
The decision functions u" may be restricted by inequality constraints
Upn(un) > 0 p = 1, 21 P
n=1,2,...,N (2)

and the initial and final states by equality constraints,


qk(x°) = 0 k = 1, 2, K . . , '(3)
gi(XN) = 0 1 = 1, 2, ... , L (4).

We assume that we have chosen a sequence of decisions fin and an


initial state 2°, after which we determine a sequence of states 2" from
Eq. (1). We now make small changes consistant with the constraints.

uk" = ukn + &Ukn I auknl _< E (5a)


xi° = xio + axi° 15xi°I :5 E (5b)

for some predetermined t > 0. If the functions f° are continuous, it fol-


lows that the outputs from stage N change only to order E. For piece-
wise continuously differentiable functions fn the variational equations for
axn are then
S ft pn
af 6xn-1 + I
axis = I axjn-1
n
aJ,
OUkn + o(E) (6)
LLL (3Zdkn
j.l k-I

where the partial derivatives are evaluated at xn, nn. Equation (6) is
linear and has Green's functions defined by Eq. (10) of the preceding
section as

,Yn-I = S yjn (7)


L
j-i
ax n-I
'

in which case Green's identity, Eq. (9), becomes


S 3 N 3 R
afi"
yiN axiN = yi° axi° + sukn + 0(E) (8)
yin aukn
i-l i-1 n.1 i-1 k-1
For staged systems we need only consider the case where N is fixed,
232 OPTIMIZATION BY VARIATIONAL METHODS

so that the change in ti brought about by the changes Bu-, r,)xo is


S
ar;
6& _ aX,N SX,N + o(E) (9)
i-1
If, exactly as in Sec. 6.4 for continuous systems, we define the boundary
conditions for the difference equations for the Green's functions as
N _ as ag1
Yl (10a)
ry
axiN az;N
i-1
K
° aqk
Yi ° L ?7k ax;U (10b)
k-I

then Eqs. (8) to (10) combine to give


N 3 R
_ af;"
SF, Suk" + 0(t)
n-1 i-1 k-1 y," aut"

7.4 THE WEAK MINIMUM PRINCIPLE


In order to reinforce the analogy to continuous systems we introduce the
stage hamiltonian H" as
s
Hn = y;"fn (1)
i-1

We then have the canonical equations


= OH" (2a)
xIn ay;n
aH"
7' n-1 = (2b)
a..in-1

and Eq. (11) of Sec. 7.3 for SS may be written


N R
OHM
Ss = Sukn + o(e) > 0 (3)
auk"
n-1 k-1
where the inequality follows from the fact that the sequence On mini-
mizes E. By a proof identical in every respect to that in Sec. 6.5 for
continuous systems we obtain the following weak minimum principle:
The decisions ukn which minimize 6(XN) subject to the constraints
17,"(u") > 0 make the stage hamiltonian H" stationary when a con-
straint is not at equality and minimize the hamiltonian (or make it
STAGED SYSTEMS 233

stationary) with respect to constraints at equality or at nondifferentiable


points.
It should be noted that, unlike continuous systems, it is not true
that H" is a constant for all n or, as we shall subsequently prove, that
H" is a minimum at stationary points. Equivalent formulations of the
objective may be accommodated by defining new state equations as for
continuous systems.

7.5 LAGRANGE MULTIPLIERS


The only essential generalization other than notation which we .1 have
introduced beyond the discussion of staged systems in Chap. is the
restriction on the allowable values of the decision. It follows, then, that
in situations in which the optimal decisions are unconstrained the sta-
tionary condition on the staged hamiltonian should be derivable from the
Lagrange multiplier rule. We show here that this is so.
We shall write the system equations and boundary restrictions as
1, 2, . . . ,S (1)
-xi" + fn(xn-',un) = 0 n 1, 2, N
g1(xN) = 0 1 = 1, 2, . . . , L (2)
-qk(x°) = 0 k = 1, 2, ... , K (3)
The choice of positive or negative signs is motivated by a desire to relate
the results directly to those of the previous section. The minimizing
values of u" for the objective S(xN) are found from stationary values of
the lagrangian
N s

2=C($N)+ II
n-1 i-1
Xi"[-xi" + fn(t"-1,u;./J
L K
+ I v,gi(x'') - I 1kgk(x°) (4)
1-1 k-1

Here we have introduced a multiplier Xe" for each of the constraint Eqs.
(1), a multiplier vi for each Eq. (2), and 'hk for each Eq. (3).
Setting partial derivatives with respect to u;" to zero, we obtain
s of, n
Yin au n= 0 j = 1, 2, , R (5)

For n 1 the partial derivatives with respect to xi"-' yield


s
A+ L i=1
x;" of in
axe-' =0 2
n =V 2,
1' 3,
2' '
,N
S
03)
LU OPTIMIZATION BY VARIATIONAL METHODS

Partial derivatives with respect to x,N give the equations


L
y1
X iN + /
61xiNL..I/ Vl
ax{N
0 (7)
i-1
while derivatives with respect to x;° give

I3
j-l
J1 ou'
ax;
- K
k-1
a
u ax;° = (8a)

By defining X,° through Eq. (6) this last relation may be written
K
Xc°- 1 7,, C9
a=0 (8b)
k-1

Equations (5) to (8) are identical to the equations for the weak
minimum principle when the decisions are unconstrained if the Lagrange
multipliers Xi" are identified with the Green's functions y;". Surprisingly
some authors have failed to recognize this simple relation and have
devoted considerable space in the literature to the complicated deri-
vation (or rederivation) through a "minimum principle" of results in
unconstrained systems more easily (and frequently, previously) obtained
through application of lagrangian methods. Indeed, the interpretation
of the multipliers in Sec. 1.15 in terms of partial derivatives of the objec-
tive leads directly to the weak minimum principle with constraints, and
it is only to simplify and unify later considerations of computation that
we have adopted the form of presentation in this chapter.

7.6 OPTIMAL TEMPERATURES FOR CONSECUTIVE REACTIONS


In order to illustrate the difficulties which arise in even the simplest
situations when constraints are imposed on the decisions in a staged sys-
tem it is helpful to return to the example of Sec. 1.12. We desire the
optimum sequence of operating temperatures in carrying out the chemi-
cal reaction
X --> Y -- products
in N stirred-tank reactors. We shall retain x and y to denote concen-
trations and u for temperature, but we now use superscripts to signify
the stage number.
Equations (6) defining the state in Sec. 1.12 are
0= - x" - 8"k1(un)F(xn)
xn-1
(1a)
0 = yn-1 - y" + v0nk1(u")F(xn) - B"k2(u")G(y") (lb)
STAGED SYSTEMS us

and the goal is to choose u', u2, . . . , uN in order to minimize

- pxI - yrr (2)

In Chap. 1 we found that if u" is not constrained, the design problem is


reduced to the simultaneous solution of two coupled nonlinear difference
equations with an iterative search over one initial condition, a rather
simple calculation. We now add the restriction that u" lie within bounds
u* <u" <u* (3)

Equations (1) are not of the form for which we have developed the
theory, which would be

x" = f"(x"-',y"-',u") (4a)


y" = gn(x"-1,y"-1,u") (4b)

Since we need only partial derivatives of f" and g" in order to apply the
theory, however, this causes no difficulty. For example, partially differ-
entiating Eq. (la) with respect to x"-' yields
a f"
1 - ax"''
af" - O"k1(u")F'(x")
ax"-3
=0 (5a)

or

af" __ 1
(5b)
ax"'' 1 + 8"kl(un)F'(xn)

The other partial derivatives are similarly obtained, and we can write
the multiplier equations
af" ag" yl"
yl"-i = 'Y1' ax"-1 + ax"-' 1 + 8"kl(u")F'(x")
'Y 2 "pO"k l (u")F'(x")
+ [1 + O"kl(u")F'(x"))[1 + 8"k2(u")G'(y")]
afn ay "
72"-' - 71" ayn_1 + 72" 49yn_1
y2 "
1 + 9"k2(u*)G'(y")
(6b)

with boundary conditions


as
tilN=ay= -p (7a)

y2"'=ayN= -1 (7b)
Z36 OPTIMIZATION BY VARIATIONAL METHODS

Also,
all- _ af" yl"9"k1(un)F(x")
T--
` 7i" au- + 72 nag-
au" 1 -}- 9"kl(un)F'(x")
72np8"kl(u")F(x")
+ [1 + 9"k2(u")G'(y"))[1 + 9"ki(u")F'(x°)]
72"9"k2(u")G(y")
1 + 9"k2(u")G'(y") (8)

The equation aH"/au" = 0 will have a unique solution for variables of


interest, in which case the optimum u" for solutions which lie outside the
bounds established by Eq. (3) is at the nearest bound.
Because Eq. (6b) is independent of yin and 72" cannot go to zero,
we may divide Eqs. (6) to (8) by 72" and by defining

" 72n (9)

we obtain
1 + 9"kl(u")F'(x") p9"kl(u")F'(x")
(10)
1+ 9"k2(u")G'(y") 1 + 9"k2(u")G'(y")
N=P (11)
with solutions of aHn/au" = 0 at solutions of
9"k1(u")F(x") p9"k; (u")F(x")
1 + 9"ki(u")F'(x") + [1 + 9"k2(u")G'(y")][1 + 9"kl(u")F'(x"))

9"k2(u")G(y") =0 (12)
1 + 9"k2(u")G'(y")
The required computational procedure is then as follows:

1. Assume r'.
2. Solve Eqs. (1) and (12) simultaneously for x', y', u'.
3. If u' exceeds a bound, set u' to the nearest bound and recompute
x', y' from Eqs. (1).
4. Compute x2, y:, us, r simultaneously from Eqs. (1), (10), and (12).
5. If u2 exceeds a bound, set u2 to the nearest bound and recompute
x2, y2, ?.2 from Eqs. (1) and (10).
6. Repeat steps 4 and 5 for n = 3, 4, . . . , N.

7. Repeat steps 1 to 6 for changing f' until YN = p.


This is substantially more computation than required for the uncon-
strained case. Note that we could have chosen zN and yN and com-
puted backward, adjusting xN and yN until matching the specified x°, yo,
but that without some rational iteration procedure this requires substan-
tially more calculations than a one-dimensional search on f''.
STAGED SYSTEMS

In the special case that


F(xn) = xn (13a)
G(yn) = yn (13b)

the problem simplifies greatly. Defining


y"
17
(14)
xn

we combine Eqs. (1) to yield


n_1 _ 1 + 9nk2(un) n- vOnk,(un)
(15)
11
1 + 9nk1(un) 1 + 9nk,(un)
while Eqs. (10) and (12) become, respectively,
I + 9nk2(un) "Bnk,(un)
n t = (16)
1 + 9"kl(un) + 1 + Onk,(un)
kl(u") k2(un)
n
1 + 9"k,(un) I + enk2(un)
vk,(un) 0 (17)
[1 + 9nk1(u")J[1 + 9"k2(u"))
We may now either follow the procedure outlined above or choose 1f N,
compute since we are given i-N, then compute 1 N-1, rN-1, um-1, etc.,
by means of Eqs. (15) to (17), iterating on ,IN until matching the speci-
fied feed ratio n°. The advantage of this latter calculation is that what-
ever the , ° corresponding to the chosen 71N, the complete set of necessary
conditions has been used and the policy is optimal for that particular 11°.,
Thus a complete set of optimal policies is mapped out in the course of
the computation, which is not the case in calculations which assume r°.

7.7 THE STRONG MINIMUM PRINCIPLE: A COUNTEREXAMPLE


At the end of Sec. 7.4 we stated that it is not generally true for discrete
systems that the stage hamiltonian assumes a minimum at stationary
points. From the discussion in Sec. 7.5 it may be seen that such a
minimization would correspond to minimizing the lagrangian, which we
showed to be incorrect in Sec. 1.8, but because of substantial confusion
about this point in the engineering literature we shall pursue it some-
what more. In this section we shall demonstrate a counterexample to
a strong minimum principle; in the next section we shall construct some
of those situations for which a strong minimum principle does exist.
Consider the system
xln = x1n-1(1 + x2"_1) 12(u")2 x1° _ 74 (la)
x2n = 4x1"-1 - 2x2n-1 + u" x2 = +1 (1b)
no OPTIMIZATION BY VARIATIONAL METHODS

where u' and u2 are to be chosen subject to


0<u*<u" n=1,2 (2)

in order to minimize
S = _x,2 (3)

By direct substitution
S= + (u1)21(2 - u') + 12(U2)2 (4)

-and the optimal values are u' =+ 1, ult = u*.


Now
H" = 'Y1"(x,"-1 + xl"-'x2'-' - 7l('u")zl
+ 72"(4x1"'' - 2x,"-' + u") (5)
aH,
au' = -U171' + 72' (6)
a2H'
a(u')2 = -y1' (7)

But
s
'rll = axil = -(I + x21) (-2 + 'u') (8a)

72'
axaH2

=1=l
= -xl1 7'l 11 + (499
9(499 (8b)

Thus, all'/au' does indeed vanish at u' = 1. But from Eqs. (7) and
(8a),
a'H' -1
2 = -1 < 0 (9)
a(u1)2

which corresponds to a maximum of H' rather than a minimum.

7.8 SECOND-ORDER VARIATIONAL EQUATIONS


We shall now examine circumstances under which a strong minimum
principle, in which the stage hamiltonian is in fact minimized by the
optimum values of u", can be shown to exist. The significance of such
a result is clear in the light of the equivalent results for continuous sys-
tems discussed in the previous chapter, and we shall find subsequently
that there are other computational implications. Our procedure is again
the application of Picard's iteration method.
We suppose that z° and n° have been specified and that we now
consider variations Bus, n 1, 2, . . . , N such that I du;"J < E. &x0 will
be taken to be zero. The variational equations corresponding to Eqs.
STAGED SYSTEMS 239

(1) of Sec. 7.1 are then, to second order in e,


sn
Ik ai, n
axin =
j-1 axjn-1 + k-lI Sukn

S 2f n if
2n S R
Sukn axjn-1
+21 axjna' axkn-1 axn-1 axkn-1 +
j-1 k=1
axja
I c`) 3If.n
Sujn bUk n +UO(J)
O/( (1)
+ . L1
J,k - 1
all n

If we look upon this as a linear nonhomogeneous system for which the


appropriate Green's functions I'ijnm satisfy the equation

r{_n,m-1 = \ rijnm afkm (2)


' G ax ln-1
kal
then Green's identity becomes

axis rijnm (a26 m auk- + a2fjm


SUkm SUP- }
k a,ukm aUPm
mmljglk=1 p=1
n S
a2f.m
S.rjm-1 axpm-1
+ 2 m - I j,k.p = 1 rijnm axkm-1 ax
P m-l
n S R

./ j,k=1 p-1
+ m=1 I rijnm 32",m axk,
au,- Sxkm-1 + O(E2) (3)

Evaluating Eq. (3) for n = m - 1 and substituting back into the right-
hand side, we obtain, finally, an explicit representation for Sxv
N S R R
2 Ln

SxiN =
rJ rijNn
(Li auk~ + `I S2lkn aupn
pal aua . P
LI n
n-1j+1k-1
N S n -1 S R

+ n-I j.k,pI
n-i S R
Nn _ J7

axkn al dxpn-1
2f.. n

111
q1 v1
rk n-l,m
Q
{{

a2lvm
a2cvm

{r \ N S R
2 fn
X rp r n
n
axk
n-1 j,kml allp

n-1 S R a{19m

X rjqn-l.m
auvm) + O(E2) (4)
allvm
,n-1 q =1 v-1

We shall assume, for simplicity, that the objective to be minimized,


S(xN), is linear in the components xiN. By suitable definition of addi-
tional variables we can always accomplish this. If we further assume
240 OPTIMIZATION BY VARIATIONAL METHODS

that xN is completely unconstrained, we obtain

SS = bx,N N ax;N
a.Z,N (5)

and multiplying Eqs. (2) and (3) by y,,' and summing we obtain, with
the usual definition of the stage hamiltonian,
I 2 n
SF = I an aukn +
2 LI 3unfaukn bu7n aukn
n=1 k=1 j,k=1 7
N S n n-1 S R
fi{m
II
2

rkj n-l,m
v
Suym/J1
+ n=1 k,p=1
8X"al axPn-1 (m=1 7-1 v=1 U
n-1 S
/`'1
/C T
R
, t
LL
N S' CR auPn 2 n

X (I I V 1
r=l q -1 s=1
P4n-l r
euzr} + n =1 k-l p= I
axka Suyn
n-1 S P
m

XII M-1 q-1 j-1


Tk,,n-I.m
i qm aujml + o(e2) > 0
7
C
(6)

Unlike the continuous case, in which At can be made arbitrarily small,


there is no general way in which the last two terms can be made to van-
ish, and this is the source of the difference between discrete and continu-
ous systems. Several special cases, however, can be treated. Because
we are interested in the character of stationary points, we shall restrict
nonvanishing Sukn to those corresponding to interior values of the
optimal f, n.
Linear separable equations have the form
s
x;n A,,nxjn-1 + bin(un) (7)
j=1

where the A,," are constants. In that case both the second partial deriv-
atives of Hn with respect to components only of xn-1 and xn-1 and un
vanish identically, and Eq. (6) reduces to
N` R

II
n 2 n
SS
= n-1 k=1 aukn
sukn +
2 n-1 I
L j,k-1 8u n aukn Sujn 7
Sukn + 0(E2) > o

(8)

The first term is zero by virtue of the weak minimum principle and the
assumption concerning admissable bun. It follows, then, that the hessian
matrix of second derivatives of Hn must he positive definite, correspond-
ing to a minimum. Furthermore, since the Sun are arbitrary, Eq. (8)
STAGED SYSTEMS 241

establishes the sufficiency of the strong minimum principle for the linear
separable case as well.
Linear nonseparable equations are of the form
s
x," = I A;;" (u") x"-' + b;" (u") (9)
j-1
Here the mixed second partial derivatives do not vanish. By choosing
the special variation, however,
bu* m = n*
bum _ (10)
0 m n*

for some specified n* the last term in Eq. (6) also vanishes, and we obtain,
using the weak minimum principle,

SE = 2
I

j R

I
asg"*
au "* auk"*
Su* Su,* + o(EZ) > 0

which establishes the necessity of the strong minimum principle for this
case also. That it is not sufficient is apparent from Eq. (6).
If there is a single state variable, with the scalar equation

x" = f"(x"-',u") (12)

then, since y" will be nonzero, the vanishing of all"/auk" implies the van-
ishing of 49f"/auk". It follows from Eq. (6), then, that the strong mini-
mum principle is both necessary and sufficient. Furthermore, f" always
takes on an extreme value, and if of"/ax"-' is positive for all n, then y". is
always of one sign. If, in addition, x" must always be positive for physi-
cal reasons, the policy is disjoint, which means that minimizing H" in
order to minimize or maximize x" is equivalent to minimizing or maxi-
mizing x" at each stage by choice of u". This should recall the discussion
of optimal temperature profiles for single reactions in Sec. 3.5, and, indeed,
an identical result holds for the choice of temperatures for a single reaction
occurring in a sequence of staged reactors.
We emphasize that the results obtained here are applicable only for
a linear objective.t If S is nonlinear, an additional term is required in
Eq. (6). In particular, a linear system with a nonlinear objective is
formally equivalent to a nonlinear system with a linear objective, for
which the strong minimum principle does not apply. A counterexample
t With the exception of those for a single state variable.
242 OPTIMIZATION BY VARIATIONAL METHODS

which is linear and separable is


xln = - 3- (un) 2
-3x1n-1
x1° = given (13a)
x2n = X2n-1 + un x2° = 0 (13b)
Fi = -x12 - (x22)2 (13c)
We leave the details of the demonstration' to the reader.

7.9 MIXED AND STATE-VARIABLE CONSTRAINTS


For staged systems it is not necessary to distinguish between constraints
on the state of the system
Qkn(xn) >- 0 (1)

and mixed constraints


Qkn(xn,un) > 0 (2)
Through the use of the state equation both types of constraint can always
be put in the form
Qkn(xn-l,un) > 0 (3)
If the constraint is not violated by the optimal trajectory, the theory
developed in Sees. 7.4 and 7.8 remains valid. Thus, we need only con-
sider the case where equality must hold in Eq. (3).
As with the continuouY case, we first suppose that there is a single
stage decision variable u^ and only one constraint is at equality. We
then have
Qn(xn-l,un) = 0 (4)
and the variational relationship

aQn = L aQn Sxcn-l + aQnn Sun


axin-1
=0 (5)
t
Coupled with the variational equation,
s
afi" inin
axin = axjn-1 + sun (6)
. j-i 44-I axin-1 clu"

we may solve for Sun and write


s
af'n
axjn-1
- aftn
aun
NA-1 aQn 1
aun
axjn-l (7)

1
The decision un is determined by Eq. (4), and the Green's function corre-
STAGED SYSTEMS 243

sponding to the difference equation (7) satisfies


af;n - af,* aQn -1 aQn
y, "-` - ; [axin-1 au" %aun) axfn-1 yn
(O)

Equation (3) of Sec. 7.4 for 63 remains unchanged provided we exclude


values of n at which Eq. (4) holds, and the weak minimum principle
must be satisfied for all stages where a constraint is not at equality.
In the general case of K1 independent constraints and R1 > K1
components of U" we find, as in Sec. 6.15, that the Green's vector must
satisfy the equation
s K,
aQkn
)
af'n
n
axin-1 7i n (9)
ax{n-1
aupn Syk
i=1 p,k-1
where Spkn is the matrix such that
K,
a n

I Sp/c
au; = atp i,p= 1, 2, . . . ,K1 (10)
k-1

It follows then that the R1 - K1 independent components of un must be


chosen to satisfy the weak minimum principle.

BIBLIOGRAPHICAL NOTES
Section 7.1: Finite difference representations might arise either because of time or space
discretization of continuous processes or because of a natural staging. For the for-
mer see, for example,

G. A. Bekey: in C. T. Leondes (ed.), "Modern Control Systems Theory," McGraw-


Hill Book Company, New York, 1965
J. Coste, D. Rudd, and N. R. Amundson: Can. J. Chem. Eng., 39:149 (1961)
Staged reaction and separation processes are discussed in many chemical engineering
texts, such as

R. Aris: "Introduction to the Analysis of Chemical Reactors," Prentice-Hall, Inc.,


Englewood-Cliffs, N.J., 1965
B. D. Smith: "Design of Equilibrium Stage Processes," McGraw-Hill Book Company,
New York, 1963

Section 7.2: The linear analysis for difference equations used here is discussed in
M. M. Denn and R. Aria: Ind. Eng. Chem. Fundamentals, 4:7 (1965)
T. Fort: "Finite Differences and Difference Equations in the Real Domain," Oxford
University Press, Fair Lawn, N.J., 1948
L. A. Zadeh and C. A. Desoer: "Linear System Theory," McGraw-Hill Book Com-
pany, New York, 1963
244 OPTIMIZATION BY VARIATIONAL METHODS

Sections 7.3 and 7.4: The analysis follows the paper by Denn and Aria cited above.
Similar variational developments may be found in

S. S. L. Chang: IRE Intern. Conv. Record, 9(4):48 (1961)


L. T. Fan and C. S. Wang: "Discrete Maximum Principle," John Wiley & Sons, Inc.,
New York, 1964
S. Katz: Ind. Eng. Chem. Fundamentals, 1:226 (1962)
The paper by Katz contains a subtle error and erroneously proves a strong minimum
principle. This incorrect strong result is cited, but not necessarily required, in several
of the examples and references in the book by Fan and Wang.

Section 7.5: The lagrangian development was used by Horn prior to the work cited above:
F. Horn: Chem. Eng. Sci., 15:176 (1961)

Section 7.6: This reactor problem has been studied in the papers cited above by Horn and
Denn and Aris and in the book ,
R. Aria: "The Optimal Design of Chemical Reactors," Academic Press, Inc., New
York, 1961

A number of other applications are collected in these references and in the book by Fan
and'Wang.

Section 7.7: The observation that a strong minimum principle is not available in general
for discrete systems is due to Rozenoer:

L. Rozenoer: Automation Remote Contr., 20:1517 (1959)

A counterexample to a strong minimum principle was first published by Horn and


Jackson in
F. Horn and R. Jackson: Ind. Eng. Chem. Fundamentals, 4:110 (1965)

Section 7.8: The development follows

M. M. Denn and R. Aria: Chem. Eng. Sci., 20:373 (1965)

Similar results are obtained in

F. Horn and R. Jackson: Intern. J. Contr., 1:389 (1965)

Several workers, notably Holtzman and Halkin, have examined the set-theoretic foun-
dations of the necessary conditions for optimal discrete systems with care. Some of
their work and further references are contained in

H. Halkin: SIAM J. Contr., 4:90 (1966)


J. M. Holtzman and H. Halkin: SIAM J. Contr., 4:263 (1966)

Section 7.9: These results are obtained in the paper by Denn and Aria cited for Sec. 7.8.
STAGED SYSTEMS 245

PROBLEMS
7.1. Consider the necessity and sufficiency of a strong minimum principle for
S
xi" - j-1I aijxjw-1 + biu"

N jS

min
I[
[ La xi"Qijxj" + P(u")']
2 n11 i,j-1
7.2. Denbigh has introduced the pseudo-first-order reaction sequence
ki k,
X, X, -, X,
Lk, J,k,
Yl Y2

where X, is the desired product. Taking the reaction-rate coefficients as

ki(T) - ki0 exp


(:Pi)
where T is the temperature, the equations describing reaction in a sequence of stirred-
tank reactors are

xin-1 - xl"[l + u1"(1 + U24))


x:"-i - -uj"xl" + xt" 1 + 0.01uu1" l l
L \ + .u'w
XI"-1 - -0.01u1"xt" + x""

Here, ul is the product of reactor residence time and k1, and u2 is the ratio k1/k1.
Taken together, they uniquely define temperature and residence time. The following
values of Denbigh's have been used:

k2 = 10'k1 exp (::-


Tom) ks - 10-2k1 k4 -
f 00
7,

3, T 0

Aris has introduced bounds

0 < T < 394 0 _< ui < 2,100

For initial conditions x° - 1, x2° - x3° - 0 determine the maximum conversion to


X3 in a three-stage reactor. Hint: Computation will be simplified by deriving bounds
on initial values Yi° and/or final values The optimal conversion is 0.549.
7.3. Obtain the equations defining an approximate nonlinear feedback control, for the
system described by the nonlinear difference equation

xw+: = F(xw+l,xw,uw)

min S =2 - 1N (z,, + Puw')


nsl
246 OPTIMIZATION BY VARIATIONAL METHODS

7.4. Repeat Prob. 1.13 using the formalism of the discrete minimum principle. Extend
to the case in which T. is bounded from above and below.
7.5. Consider the reaction X Y -, Z in a sequence of stirred tanks. This is defined
by Eqs. (1) of Sec. 7.6 with the added equation -
X0 + y* + z* - coast
Derive the procedure for maximizing z"'. Specialize to the case
F(x*) - x* G(y*) - y*
and compare your procedure for computational complexity with the one used in the
book by Fan and Wang for studying an enzymatic reaction.
8
Optimal and Feedback Control

8.1 INTRODUCTION
Many of the examples considered in the preceding chapters have been
problems in process control, where we have attempted to use the opti-
mization theory in order to derive a feedback control system. Even in
some of the simple examples which we have studied the derivation of a
feedback law has been extremely difficult or possible only as an approxi-
mation; in more complicated systems it is generally impossible. Further-
more, we have seen that the optimal feedback control for the same sys-
tem under different objectives will have a significantly different form,
although the objectives might appear physically equivalent.
In this chapter we shall briefly touch upon some practical attacks
on these difficulties. We first consider the linear servomechanism prob-
lem and show how, for a linear system, the optimization theory com-
pletely determines the feedback and feedforward gains in a linear control
system and how classical three-mode feedback control may be viewed as
a natural consequence of optimal control. The problem of ambiguity of
247'
248 OPTIMIZATION BY VARIATIONAL METHODS

objective will be similarly attacked by considering another particularly


easily implementable feedback system and solving the inverse problem,
thus defining a class of problems for which a reasonable optimum control
is known.

8.2 LINEAR SERVOMECHANISM PROBLEM

The linear servomechanism problem is the control of a linear system


subject to outside disturbances in such a way that it follows a prescribed
motion as closely as possible. We shall limit our discussion here to sys-
tems with a single control variable and disturbances which are approxi-
mately constant for a time much longer than the controlled system
response time, and we shall assume that the "motion" we wish to follow
is the equilibrium value x = 0. Thus, the state is described by
zi = Ai;x; + biu + di (1)

with A, b, and d consisting of constant components, and we shall use a


quadratic-error criterion,

2 Io ' xiCi;x; + u2) dt (2)

C is symmetric, with Ci; = C;i. The u2 term may be looked upon as a


penalty function or simply a mathematical device in order to obtain the
desired result. Multiplying by a constant does not alter the result, so
that there is no loss of generality in setting the coefficient of u2 to unity.
The linear regulator problem is the special case with d = 0.
The hamiltonian for this system is

H= Z xiCi1x; + %u2 + yiAi,x, + q yibi + yid; (3)

with multiplier equations


aH (\
yi = Oxi
_ G Ci,x, - I ykAki
} (4)
k

The partial derivatives of H with respect to u are


aH
au
= u + L yibi (5a)

a2H=1>0 (5b)
au2
Equation (5a) defines the optimum when set to zero
OPTIMAL AND FEEDBACK CONTROL

(6)

while Eq. (5b) ensures a minimum. We have already established in Sec.


6.20 that the minimum principle is sufficient for an optimum here..
We seek a solution which is a linear combination of the state varia-
bles and forcing functions. Thus,

7i Mijxj + D;,dj (7)


i
Differentiating,

Mijx, + M;jxj + Dijdj (8)

and substituting Eqs. (1), (6), and (7),

M;jxj + MijAjkzk

I Mijbj Q Muxkb,
j k.!
+ kjI Dudb,) + IjMijdj + Ej Aidj , (9a)

while Eq. (4) becomes

-ii = - I Cijxj - I Q Mklzt + I Dkd:) Aki (9b)


j k I 1

Equating the coefficients of each component of x and of d, we then obtain


the two equations
M;j + I (MikAkj + MkjAk;) - rj' btMij) + C,j = 0
k k
Mikbk) \Lt
(10)
Aj + I Ak;D:i
k
- \ M ,kbk) \ b,D:j) + Mij = 0
((

k
1 ((
i
(11)

Equation (10) is a quadratic Riccati differential equation, while Eq. (11)


is a linear nonhomogeneous differential equation with variable coefficients.
It should be observed that the symmetry of C implies a symmetric solu-
tion to Eq. (10), Mij = Mj;.
We shall not consider transient solutions of Eqs. (10) and (11),
for it can be established that as & -- oo, we may obtain solutions which
are constants. Thus, the optimal control is

u=- b,Miizj - b;D;,dj (12)


!!!@ OPTIMIZATION BY VARIATIONAL METHODS

where M and D are solutions of the algebraic equations

- (M;kAk, + Mk;Ak;) + (I Mikb.) \ M1,b,) = C,, (13)

(Ak: - bk Mob) Dk; M(; (14)


-k 1

We have previously solved special cases of these equations.


The obvious difficulty of implementing the feedback control defined
by Eqs. (12) to (14) is that the matrix C is not known with certainty.
It will often be possible to estimate C to within a scale factor, however
(the coefficient of u2!), in which case Eqs. (13) and (14) define the inter-
actions between variables and only a single parameter needs to be chosen.

$.3 THREE-MODE CONTROL

An interesting application of the linear servomechanism problem arises


when one postulates that it is not the u2 term which should be retained
in the objective but a term 42. This would follow from the fact that
the rate of change of controller settings is often the limiting factor in a
design, rather than the magnitude of the setting. We might have, for
example, for the forced second-order system
x+bx+ax=u+d (1)
or
zl = x2 (2a)
z2= -ax,-bx2+u+d (2b)

the objective

S = 2 fom (C,x12 + C2x22 + u=) dt (3)

By defining u as a new state variable x:, we may rewrite Eqs. (2)


and (3) as
ZI = x2 (4a)
it = -axe - bx2 + x, + d (4b)
.is = W (4c)
(0x14 + C2x2' + w2) dt (5)
2 !o
where w is taken as the control variable. This is of the form of the linear
servomechanism problem, with the degenerate solution
w = -MIIxt - M12x2 - M13x3 - D32d (6)
OPTIMAL AND FEEDBACK CONTROL M
Equations (13) and (14) of Sec. 8.2 reduce to the nine equations

C,1 = M132 + 2bM12 (7a)


0 = M13M23 - M11 + aM12 + bM22 (7b)
0 = MIZM33 --- MM12 + bM23 (7c)
C2 = 'M232 - 2M12 + 2aM22 (7d)
0 = M23M33 - M13 + aM23 - M22 (7e)
0 = M332 - 2M23 (7f)
Mil = bD22 + M13D32 (7g)
M22 = -D12 + aD22 + M23D32 (7h)
M23 = -D22 + M33D32 (7i)

The solution may be shown to be

D32 = M33 (8a)


M13 = -bM33 + N C1 (8b)

M23 = i2M332 (Sc)

where M33 is the unique positive solution of the quartic equation

M33' + 4aM333 + 4(a2 + b)M332 + 8(ab - )M33 - 4C2


-8a =0 (9)
Thus,
w = is = - (/ - bM3,)x - i2M332i - M33u - Maad (10)

We may substitute for u + d in Eq. (10) from Eq. (1) to obtain


x - (3 M33,2 + aM33)i - M33x (11)

or, integrating,

v Cl f of x(T) dr - (%M332 + aM3a)x(t) - Msai(t) (12)


u(t)

This is a three-mode controller, in which the control action is taken as a


weighted sum of the offset and the derivative and the integral of the
offset. Such controls are widely used in industrial applications, the inte-
gral mode having the effect of forcing the system back to zero offset in
the presence of a persistent disturbance. Standard control notation
would be of the form
t)
-K I x(t) + r Jot x(r) dr + TD ad (13)
u(t) _
252 OPTIMIZATION BY VARIATIONAL METHODS

where
K = 12Ma3(Maa + 2a) (14a)
M33(M33 + 2a)
TI = (14b)
2 VC1
2
TD = (14c)
M33 + 2a
These equations can be rearranged to yield

K=
2(1 2
- aTD) (15)
TD

and the nonnegativity of K then bounds the derivative time constant

aTD<1 (16)
The governing equation for the controlled system subject to step
disturbances is

i'+ (a+KrD)x+ (b+K)±+Kx


TI
=0 (17)

or, defining the ratio of time constants by .


a = TD-Tr
(18)

and substituting Eq. (15) into (17), the governing equation becomes

+
[2
TD rD] x + [b+_(1 - aTD)] x
+L D3(1 amD)] x = 0 (19)

For a stable uncontrolled system (a, b > 0) the necessary and sufficient
condition that Eq. (19) have characteristic roots with negative real parts
and that the controlled system be stable is
(coefficient of t) (coefficient of t) > (coefficient of x) (20a)
or
2-arDrb+ 2a(1-aTD)] (20b)
TD L TD Q7D

The inequality can be converted to equality by introducing a stability


factor o > 1 and writing
2-amDI b-}Tt)2 s (1-arD)I-i =°- r 2 all-arD)] (21)
TD art)
Here o = 1 corresponds to marginal stability.
OPTIMAL AND FEEDBACK CONTROL 253

Equation (21) can be rewritten as \a cubic equation inrD

-2 rp3+(a2+b)rp2+(a3{arn+i2-aJ=0 (22)

It will often happen in practice that a2 > b. For example, in the reactor-
control problem used in Chaps. 4 and 5 and again in a subsequent section
of this chapter a = 0.295, b = 0.005. In that case Eq. (22) approxi-
mately factors to
\\ // \\
rD2+arp-1arn-2+a)=0 (23)
C
and the unique root satisfying the inequality arD is

arp=2-- (24)

,
The nonnegativity of TD :iliu the upper-
tuoullua vu.cll
.. .V..., n
..u 1,
:.. '111LUtS-

al1Ow-
able values of u/a to
2 > -° > 1 (25)
-a
Equation (15) for the gain can now be rewritten in terms of the single
parameter o/a as
K _ 2a(o - a)
(26)
P (2a - 0')2
Further restrictions on the allowable controller design parameters
are obtained by substituting Eqs. (14), (18), (24), and (26) into the
quartic equation (9) and rearranging to solve for C2, yielding
C2 = 4K 2 [_a2 - c + av + (2a 0)2] (27)
a J
A meaningful optimum requires that the objective be positive definite,
or that C2 be nonnegative. Using the approximation b >> a2 for con-
sistancy, a and c are then further related by the inequality
a2-ao+a, <0 (28a)

An equivalent form obtained by completing the square is


(a-32u)2+ u(1-%a)<0 (28b)

from which it follows that


0>4 (29)
That is, optimality requires at least 4 times the minimum stability con-
254 OPTIMIZATION BY VARIATIONAL METHODS

dition! It further follows, then, from Eq. (25) that a is bounded from
below

a2i2o>2 (30)

A sharp upper bound on the allowable ratio of time constants can


be obtained by rewriting Eq. (28b) as

(a il42 411 (28c)

or, taking the square root,

a- j2v<2j1 -

2<<rp<211-+ -(32)
Thus, the ratio rr/rn isstrictly bounded by the one-parameter inequality
(31)

For a in the range 4 to 6 the allowable values of rr/rn are restricted


to lie between 2 and 4.7. Standard control practice for settings for three-
mode systems is a value of rr/rD of approximately 4.

8.4 INSTANTANEOUSLY OPTIMAL RELAY CONTROL


In order to circumvent the difficulty of constructing a feedback control
system from optimization theory a number of investigators have used
ad hoc methods to construct feedback systems which are, in some sense,
instantaneously optimal, rather than optimal in.an overall sense. For
example, suppose that the positive definite quadratic form

XiQrsx; (1)
E=2
i.J

is used as a measure of the deviation from the equilibrium x = 0, being


positive for ar v offset. Then an instantaneously optimal control would
be one which drives E to zero as rapidly as possible, disregarding the
future consequences of such action.
We shall restrict attention to systems which are linear in a single
bounded control variable
x, = f,(x) + b;(x)u (2)
U* < u < u* (3)

(Linearity is essential, but the single control is not.) This form is typi-
cal of many processes. The criterion of driving E to zero as rapidly as
OPTIMAL AND FEEDBACK CONTROL 23

possible is equivalent to minimizing the time derivative E


E _ xiQ:ixi = xiQiifi + ( x:Qijb) U (4)
_. is +a

(We have made use of the symmetry of Q.) Since Eq. (4) is linear in u,
the minimum will always occur at an extreme when the coefficient of u;
does not vanish identically
u* I x.Qiibi > 0
u= `'' (5)
u* x;Q;ibi < 0
.

Thus we immediately obtain a feedback control law. Note that when


b is independent of x, the switching criterion is linear.
This mode of control can be illustrated by returning to the example
of the stirred-tank chemical reactor, defined in Sec. 4.6.as

d (A - A,) = (A, - A) - kA (6)


V
I (T- T.) = q
d
V C9P VCDP
(7)

where x1 is taken as A - A X2 as T - T control is by coolant flow


rate q,, and we have defined u as the monotonic function ('Kq,/(1 + Kq,).
All parameters will be identical to those used in Sees. 4.6 and 5.6, in which
case the bounds on u are
0<u<8 (8)
Comparison of Eqs. (6) and (7) with Eq. (2) indicates that the
switching function has the form
T T`
.L r xiQ
rbi = - V C'P IQ12(A - A.) + Q22(T - T,)l (9)

As T will always exceed the coolant temperature T,, the coefficient of the
bracketed term in Eq. (9) is always negative and Eq. (5). for the control
law reduces to
U _ 8 (T - T.) + a(A - A,) > 0 (10)
0 (T-T,)+a(A-A,)<0
where a = Q12/Q22 - This, is a linear switching law for the nonlinear proc-
ess. An even simpler result is obtained when cross terms are excluded
from the objective, in which case a = 0. Then the control is

u-_ 8 T > T,
0 T<T,
256 OPTIMIZATION BY VARIA' ONAL METHODS

That is, adiabatic operation when the temperature is below the steady-
state value, full cooling when above, irrespective of the relative weight-
ing placed on concentration and temperature deviations. It is interest-
ing to compare this result with the time-optimal control shown in Fig.
5.9, in which the switching curve does not differ significantly from the
steady-state temperature.
Paradis and Perlmutter have computed the response of this system
under the control equation (11) with an initial offset of T - T. = -20,
A - A. = 2 X 10-4. The phase plane in Fig. 5.9 indicates that away
from equilibrium the temperature should approach the steady-state value
immediately, first slowly and then quite rapidly, while the concentration
deviation should first grow and then approach zero. Figures 8.1 and 8.2
show that this is precisely what happens, where curve a is the controlled
response and curve bthe uncontrolled. The first two switches occur at
23.70 and 25.65 sec, after which the controller switches between extremes
rapidly. Such "chattering" near the steady state is a common charac-
teristic of relay controllers. It should be noted that the system is asymp-
totically stable and returns to steady state eventually even in the absence
of control.
In order to avoid chatteringt some criterion must be introduced for
t Chattering may sometimes be desirable, as -discussed in detail in the book by
Flilgge-Lots.

0
3
Q

1
Desire
operating
level
I I I I I I 1 I I 1

10 20 30 40 50 60 70 80 90 100
Time t

Fig. $.1 Concentration response of the controlled and


uncontrolled reactor using instantaneously optimal control.
[From W. 0. Paradis and D. D. Perlmutter, AIChE J.,
12:876 (1966).Copyright 1966 by the American Institute
of Chemical Engineers. Reprinted by permission of the
copyright owner.]
OPTIMAL AND FEEDBACK CONTROL 257

460

455

-450
K
445

440

10 20 30 40 50 60 70 80 90
Time t

Fig. 8.2 Temperature response of the controlled and


uncontrolled reactor using instantaneously optimal con-
trol. [Front W. 0. Paradis and D. D. Perlmutter, AIChE
J., 12:876 (1966). Copyright 1966 by the American Insti-
tute of Chemical Engineers. Reprinted by permission of the
copyright owner.)

changing from relay to another form of control. In analogy with the


time-optimal case Paradis and Perlmutter simply set u to its steady-state
value of 5 at the time of the second switch and allowed the natural sta-
11

bility of the system to complete the control. These results are shown as
curve c in Figs. 8.1 and 8.2 and indicate quite satisfactory performance.

8.5 AN INVERSE PROBLEM


The results of the preceding section suggest the fruitfulness from a practi-
cal control point of view of pursuing the subject of instantaneously opti-
mal controls somewhat further. In particular, since the one serious draw-
back is the possibility that rapid return toward equilibrium might cause
serious future difficulties,. we are led to enquire whether this ad hoc policy
might also be the solution of a standard optimal-control problem, in which
case we would know precisely what overall criterion is being minimized,
if any. To that end we are motivated to study the inverse problem.
We shall restrict our attention for simplicity to linear systems with
constant coefficients
zi = Ai,x1 + b.'u (1)

and we shall take u to be a deviation from steady-state control with sym-


metric bounds, in which case b may be normalized so that the hounds are
OPTIMIZATION BY VARIATIONAL METHODS

unity
Jul < 1 (2)
If the bounds on u are not symmetric, the subsequent algebra is slightly
more cumbersome iut the essential conclusions are unchanged. Under
these assumptions the feedback control law of Eq. (5) of the preceding
section is
u = - sgn Q b;Q;;x;) (3a)

or

u = - sgn Q a;x;) (3b)


i
a linear switching law. We shall suppose that the cost of control is
negligible and consider an overall objective of the form
3 = Io T(x) dt (4)

where
5(x) > 9(0) > 0 x 96 0 (5)

The inverse problem which we wish to solve is for a function F(x) satis-
fying Eq. (5) such that a control of the form of Eq. (3) minimizes 6.
The hamiltonian for the linear stationary system described by Eq.
(1) and the objective equation (4) is
H = F(x) + I y;A;;x; + y;b;u (6)

and, since u enters linearly, whenever the'coefficient of u does not vanish,


the optimal control is
u = - sgn (7)

Equations (3) and (7) will define the same control if (but not only if!)
we take
yi = Q;;x; (8)

We need, then, to determine whether such a relation is compatable with


the minimum principle and, if so, what the resulting functions is.
The equations for the Green's functions are
ax; - (9)
x; L. y'A;;
OPTIMAL AND FEEDBACK CONTROL 26!

or, upon substituting Eq. (8),


ag - I xkQkjA;; (10)
ax;
j.k

But differentiating Eq. (8),


ti; _ Qiixj = I QiA;kxk - I Qiibi sgn C xkQktbt> (11)
ilk j k.t

where we have used Eqs. (1), (7), and (8). The right-hand sides of
Eqs. (10) and (11) must be identical, leading to a family of partial differ-
ential equations for t(x):

ax; -
- (QriAikxk
+ xkQkiAii) + Qiibj sgn (I xkQktbt} (12)
j.k 7 k.t

Integration of Eq. (12) is straightforward, yielding

F(x)
- 11 xj(Q;iAjk + Qk,Ai;)xk + I x,Q;;bj + const
I (13)
c.j.k

If 8 is fixed, the value of the constant is irrelevant. If 0 is unspecified,


the condition H = 0 establishes that the constant is zero. We obtain,
then,

(x)
=2 x;C;ixi +
I x;Qi;b; (14)
i., i.i
where

Cij = - (QikAk; + QikAki) (15)


k

The absolute-value term in Eq. (14) is a linear combination and can


vanish for x 5- 0, so that we ensure satisfaction of Eq. (5) by requiring
that the quadratic form be positive definite. This places an
to
interesting restriction on the uncontrolled system, for the time derivative
of the positive definite quadratic form E = 3 x;Q;ix; without control is
simply
E= - I x,C,;x; (16)
;.j
which is required to be negative definite. According to Liapunov sta-
bility theory, then (see Appendix 8.1), the quadratic form E must be a
Liapunov function for the uncontrolled system and the uncontrolled sys-
tem must be asymptotically stable. In that case t is also negative defi-
nite for the controlled system, and the controlled system is also stable.
M OPTIMIZATION BY VARIATIONAL METHODS

The asymptotic stability of the uncontrolled system is sufficient to ensure


the existence of a positive definite solution Q to Eq. (15) for arbitrary
positive definite C, although the converse is not true.
We now have a solution to the inverse problem, which establishes
that for an asymptotically stable linear stationary system the relay con-
troller with linear switching corresponds to an objective which is the
integral of a positive definite quadratic form plus a second positive semi-
definite term. The use of a quadratic form as an integrand is, of course,
now well established, and we anticipate that the additional term will
simply have the effect of bending the trajectories somewhat. Thus, the
instantaneously optimal policy does correspond to a very meaningful
overall objective. We are not quite finished, however, for we must still
establish that the minimum of the integral of Eq. (14) does indeed occur
for the control defined by Eq. (3). We must do this because of the possi-
bility that a singular solution may exist or that the minimum principle
leads to multiple solutions, another of which is in fact the optimum.
The possibility of a singular solution in which the switching cri-
terion vanishes for a finite time interval is most easily dismissed by con-
sidering the second-order system
21 = X2 (17a)
x2 = A21x1 + A22x2 + bu (17b)
for which the switching criterion defined by Eq. (3) is
Q12x1 + Q22x2 = 0 (18)
If this is differentiated with respect to time and Eqs. (17) substituted,
the resulting control is

u
- - (A21 Q222) xl - A22x2 (19)

On the other hand, the vanishing of the switching function means that
the integrand of the objective is simply 3' (Cllxl2 + 2C12x1x2 + C22x22y,
and the criterion for singular control was found in Sec. 5.9 for this case
to be
u = - (A ell
\ 21-C22)xl-Az2xz (20)

and

ell X1 + C22 x2 = 0 (21)


Thus, singular control is possible and, in fact, optimal if and only if
ell _ Q122
(22)
C22 Q222
OPTIMAL AND FEEDBACK CONTROL $1

Together with Eq. (15) this yields only discrete values of the ratio
C matrix is at the disposal of the designer, only infinitesimal
changes are needed to avoid the possibility of intermediate control. The
generalization to higher dimensions is straightforward and yields the
same result.
Finally, it remains to be shown that there cannot be another con-
trol policy which satisfies the minimum principle. Here, for the first
time, we make use of our wish to avoid chattering and presume that
within some neighborhood of the origin we want to switch from the relay
controller to some other form of control, perhaps linear. We shall choose
that region to be an ellipsoidal surface such that the control effort is to
terminate upon some manifold
g(x) = I x;(0)Q;;x;(6) - const = 0 (23)
:.,
where the final time 0 is unspecified. The boundary condition for the
Green's functions is then
y;(0) = v ax = 2v I Qi,x, (24)

where v is some constant and the condition H = 0 immediately estab-


lishes that 2v = I. Hence, for any final condition x(0) a complete set of
conditions is available for the coupled differential equations (1) and (9),
and the solution over any interval between switches is unique. The con-
trol given by Eq. (3) must, then, be the minimizing control for the objec-
tive defined by Eq. (14).
Any asymptotically stable linear system, then, may be controlled
optimally with respect to Eq. (14) by a particularly simple feedback
policy. Because of the ambiguity of defining a precise mathematical
objective it will often be the case that ff as defined here meets all physical
requirements for a control criterion, in which case the instantaneously
optimal policy will provide excellent control to within some predeter-,
mined region of the desired operating conditions.
The extension to nonlinear systems of the form
x: = f;(x) + b.(x)u (25)
is trivial, and again the condition for optimality is that the quadratic
form E be a Liapunov function for the uncontrolled system and, there-
fore, for the controlled system. Now, however, this requirement is more
restrictive than simply demanding asymptotic stability, for the existence
of a quadratic Liapunov function is ensured only in the region in which
linearization is valid, and large regions of asymptotic stability may exist
in which no quadratic Liapunov function can be found.
262 OPTIMIZATION BY VARIATIONAL METHODS

One final comment is required on the procedure used to solve the


inverse problem. Equation (8) can be shown to be the consequence of
any linear relation between y; and Q;;x,, but any sign-preserving non-
linear relation would also retain the same control law. Thus, by con-
sidering nonlinear transformations for the one-dimensional system
z=Ax+bu (26)
Thau has found that the control
u = - sgn Qbx (27)

is optimal not only for the objective found here


S + IQbxI) dt (28)
= Jo (AQx2
but also for
S = fo (Ab2Q3x4 + Ib=Q=x'I) dt (29)

and

T; = fa
(b x sinh bQx + Isinh bQxl) dt (30)

One way in which this equivalence of objectives can be established is to


use the relation

sgn bQx = sgn [ k (bQx)2k+11 K2 > Ki > 0 (31)


k-Ri J
and then to procede in the manner of this section. The desirability of
extending this approach to multidimensional systems is obvious, as are
the difficulties.

8.6 DISCRETE LINEAR REGULATOR


We have observed earlier that we may often be interested in controlling
systems which evolve discretely in time and are described by difference
equations. The difference-equation representation might be the natural
description, or it might represent an approximation resulting from com-
putational considerations. The procedures discussed in this chapter can
all be extended to the control of discrete systems, but we shall restrict
ourselves to a consideration of the analog of the problem of Sec. 8.2.
We consider a system described by the linear difference equations

x;" _ , Atiixi"-1 + b,u" (1) .


OPTIMAL AND FEEDBACK CONTROL

The coefficients Aij and bi are taken as constants, though the extension
to functions of n is direct. We have not included a disturbance term,
though this, too, causes no difficulties, and we seek only to find the
sequence of controls Jun 1 which regulates the system following an initial
upset or change in desired operating point. The minimum-square-error
criterion is again used
N`

S R(u")2]1
`l l (2)

Unlike the continuous case, we include a coefficient of the (u")2 term in


the objective, for R can be allowed to go to zero for discrete control.
A Lagrange multiplier formulation is the most convenient for solv-
ing this problem. The lagrangian is

=2 [I R(u")2]

+ I../n ffi x4" Qi A,,x,n_1 + biun - xi"/ J (3)

Setting partial derivatives with respect to xi" and u" to zero, respectively,
we obtain
xinCij - xjn + X;"+'Aii = 0 (4)

Run + Xi-bi = 0 (5)

Equation (4) allows us to define a variable Xi'+' as zero. Unlike the


analysis for the continuous problem, we shall not solve Eq. (5) for u"
at this point, for further manipulation will enable us to obtain a feed-
back solution which will be valid as R - 0.
If Eq. (4) is multiplied by bj and summed over j, we obtain
xin+'Ai7.bj =0
I xiiC1jbj - Xi"bi + (6)
ij 1,3

Substitution of Eq. (5) then leads to


I xi"Ciibi + Run + I Xin+'Aijbj = 0 (7)
+.j id
We now seek to express the multipliers in terms of the state variables as
Xi"+1
= M1k"xkn (8)
k

Since X '+' is zero, Mk"" must be zero, except for N -- oo, in which case
xkN - 0. ' Substitution of Eq. (8) into Eq. (7) then leads to the required
264 OPTIMIZATION BY VARIATIONAL METHODS

feedback form
u" = I K1"x "-I (9)

where
I CikbkAij + I blAklAlki"A,;
K." i.l.k
(10)
R + I b,Cikbk + b1Ak1:llk,"bi
i,k

The feedback dependence must be on x"-', since the state at the begin-
ning of the control interval is the quantity that can be measured.
We still need a means of calculating 211,," in order to compute the
feedback gains Kj". This is done by first substituting Eq. (9) into Eq. (1)

x," = I (Aij + b,Kj").rj"-1 (11)


J

Substitution of Eq. (11)1 into Eq. (4) yields


(I Aik + biltk") Cijx,"-I - il!jk"-Ixk"-I
k k

+ Ai;
[1 .1f, " I (AIk + b1Kk")] xk"-' = 0 (12)
1 k

If Eq. (12) is to hold for all values of x"-1, the coefficient of xk"-1 must
be identically zero, in which case i1ij" must be a solution to the difference
equation
:11jk"-' _ CijAik +N' Aijifil"AIk + biCij + Aijhlil"bzKk"
1l1;kN = 0 (13)
Equation (13), with Kj" defined by Eq. (10), is the generalization of the
Riccati difference equation first encountered in Sec. 1.7. For the impor-
tant case that N ---* co, a constant solution is obtained. Clearly there is
no difficulty here in allowing R to go to zero.
The instantaneously optimal approach of Sec. 8.5 may be applied
to the discrete system to obtain an interesting result. We define a posi-
tive definite quadratic error over the next control interval
E (14)
= l xi"Q11xj" +
2 P(u")2

The control which makes E as small as possible over the following inter-
val is found by setting the derivative of E with respect to u" to zero
O1' _ ax" (15)
fail" - I
x,nQ'j
au."
+ Pu" = 0
OPTIMAL AND FEEDBACK CONTROL 265

From Eq. (1),


ax;" (16)
au" = bi

so that Eq. (15) becomes


I biQi;x;" + Pu" = 0 (17)
i, j

The required feedback form for u" is obtained by substituting Eq. (1) for
x;" into Eq. (17) and solving
u" = I kx;"-' (18)
s

I / biQikAki
k
i
k, c/` (19)

/-I I biQikbk+P
k i

This linear feedback control can be conveniently compared to the


discrete-regulator solution for N - by defining a new variable
µik = A,kMti (20)

It can be shown from Eqs. (10) and (13) that µik is symmetric(µik = µki).
Equation (10) for the regulator feedback gain can then be written
I I bi(Cik + Uik)Aki
k i (21)
K1=-
11 1 bi(Cik + Iik)bJ + R
k i

The results are identical if we make the following identifications:


Qik = Cik + µik (22)
P=R (23)
Qik defined by Eq. (22) is symmetric, as it must be. We find, therefore,
that the discrete-regulator problem is disjoint and has a solution corre-
sponding to an instantaneous optimum over each control cycle. The
parameters in the instantaneous objective will have physical significance,
however, only when computed by means of Eq. (22).

APPENDIX 8.1 LIAPUNOV STABILITY


We shall review briefly here the elementary principles of Liapunov sta-
bility needed in this chapter. Consider any function V(x) which is
positive definite in a neighborhood of x = 0. The values of V define a
distance from the origin, although the closed contours might be quite
OPTIMIZATION BY VARIATIONAL METHODS

irregular. Thus, if the system is displaced from equilibrium to a value x0


and there exists any positive definite function V(x) such that V(x) < 0
in a region containing both Ko and the origin, the system can never escape
beyond the contour V(x) = V(xo) and the origin is stable. Furthermore,
if there exists a function such that V(x) < 0 for x 0, the system must
continue to pass through smaller and smaller values of V and ultimately
return to equilibrium. In that case the origin is asymptotically stable.
A function V(x) such as that described above, positive definite with
a negative semidefinite derivative, is called a Liapunov function. For a
system satisfying the differential equations
xi = Fi(K) Fi(0) = 0 (1)
the derivative of V is computed by
V(x) _ aV aV
zi = (2)
i axi i axi
If a Liapunov function can be found in some region including the origin,
the system is stable with respect to disturbances within that region. If
V is negative definite, the system is asymptotically stable. Construc-
tion of a Liapunov function is generally quite difficult.

BIBLIOGRAPHICAL NOTES
Section 8.1: The conventional approach to the design of feedback control systems is treated
extensively in texts such as
P. S. Buckley: "Techniques of Process Control," John Wiley & Sons, inc., New
York, 1964
D. R. Coughanowr and L. B. Koppel: "Process Systems Analysis and Control,"
McGraw-Hill Book Company, New York, 1965
D. D. Perlmutter: "Chemical Process Control," John Wiley & Sons, Inc., New York,
1965
J. Truxal: "Automatic Feedback Control System Synthesis," McGraw-Hill Book
Company, New York, 1957
The design of optimal control systems based on a classical frequency-domain analysis
is treated in, for example,
S. S. L. Chang: "Synthesis of Optimum Control Systems," McGraw-Hill Book
Company, New York, 1961
A modern point of view somewhat different from that adopted here is utilized in
C. W. Merriam: "Optimization Theory and the Design of Feedback Control Systems,"
McGraw-Hill Book Company, New York, 1964
1

The two approaches are reconciled in our Chap. 12; see also
L. B. Koppel: "Introduction to Control Theory with Applications to Process Control,"
Prentice-Hall, Inc., Englewood Cliffs, N.J., 1968
i
OPTIMAL AND FEEDBACK CONTROL 267

L. Lapidus and R. Luus: "Optimal Control of Engineering Processes," Blaisdell


Publishing Company, Waltham, Mass., 1967
and a forthcoming book by J. M. Douglas for parallel discussions pertinent to this entire
chapter.

Section 8.8: The properties of the linear system with quadratic-error criterion have been
investigated extensively by Kalman, with particular attention to the asymptotic
properties of the Riccati equation. In particular see
It. E. Kalman: Bol. Soc. Mat. Mex., 5:102 (1960)
in It. Bellman (ed.), "Mathematical Optimization Techniques," University
of California Press, Berkeley, 1963
: J. Basic Eng., 86:51 (1964)

A detailed discussion is contained in

M. Athans and P. Falb: "Optimal Control," McGraw-Hill Book Company, New


York, 1966
and some useful examples are treated in the books by Koppel and Lapidus and Luus and
A. R. M. Noton: "Introduction to Variational Methods in Control Engineering,"
Pergamon Press, New York, 1965

A computer code for the solution of the Riccati equation, as well as an excellent and
detailed discussion of much of the basic theory of linear control, is contained in

It. E. Kalman and T. S. Englar: "A User's Manual for the Automatic Synthesis
Program," NASA Contractor Rept. NASA CR-475, June, 1966, available from
Clearinghouse for Federal Scientific and Technical Information, Springfield,
Va. 22151

Numerical solution of the Riccati equation by sucessive approximations is discussed in

N. N. Puri and W. A. Gruver: Preprints 1967 Joint Autorn. Contr. Conf., Philadelphia,
p. 335

Though not readily apparent, the procedure used in this paper is equivalent to that dis-
cussed in Sec. 9.6 for the numerical solution of nonlinear differential equations.

Section 8.3: The general relationship between the linear servomechanism problem with
a us cost-of-control term and classical control is part of a research program being
carried out in collaboration with G. E. O'Connor. See

G. E. O'Connor: "Optimal Linear Control of Linear Systems: An Inverse Problem,"


M.Ch.E. Thesis, University of Delaware, Newark, Del., 1969

Section 8.4: The particular development is based upon

W. O. Paradis and D. D. Perlmutter: AIChE J., 12:876, 883 (1966)

Similar procedures, generally coupled with Liapunov stability theory (Appendix 8.1),
268 OPTIMIZATION BY VARIATIONAL METHODS

have been applied by many authors; see, for example,


C. D. Brosilow and K. R. Handley: A IChE J., 14:467 (1968)
R. E. Kalman and J. E. Bertram: J. Basic Eng., 82:371 (1960)
D. P. Lindorff: Preprints 1967 Joint Autom. Contr. Conf., Philadelphia, p. 394
A. K. Newman: Preprints 1967 Joint Autom. Contr. Conf., Philadelphia, p. 91
The papers by Lindorf and Newman contain additional references. A detailed dis-
cussion of the properties of relay control systems will be found in
1. Flugge-Lotz: "Discontinuous and Optimal Control," McGraw-Hill Book Company,
New York, 1968

Section 8.5: This section is based on


M. M. Denn: Preprints 1967 Joint Autom. Contr. Conf., Philadelphia, p. 308; A IChE
J., 13:926 (1967)
The generalization noted by Thau was presented in a prepared discussion of the paper,
at the Joint Automatic Control Conference. The inverse problem has been studied
in the context of classical calculus of variations since at least 1904; see
0. Bolza: "Lectures on the Calculus of Variations," Dover Publications, Inc., New
York, 1960
J. Douglas: Trans. Am. Math. Soc., 60:71 (1941)
P. Funk: "Variationarechnung and ihr Anwendung in Physik and Technik," Springer-
Verlag OHG, Berlin, 1962
F. B. Hildebrand: "Methods of Applied Mathematics," Prentice-Hall, Inc., Engle-
wood Cliffs, N.J., 1952
The consideration of an inverse problem in control was first carried out by Kalman for
the linear-quadratic case,
R. E. Kalman: J. Basic Eng., 86:51 (1964)
See also

A. G. Aleksandrov: Eng. Cybernetics, 4:112 (1967)


P. Das: Automation Remote Contr., 27:1506 (1966)
R. W. Obermayer and F. A. Muckler: IEEE Conv. Rec., pt. 6, 153 (1965)
Z. V. Rekazius and T. C. Hsia: IEEE Trans. Autom. Contr., AC9:370 (1964)
F. E. Thau: IEEE Trans. Autom. Contr., AC12:674 (1967) '

Several authors have recently studied the related problem of comparing performance of
simple feedback controllers to the optimal control for specified performance indices.
See
A. T. Fuller: Intern. J. Contr., 5:197 (1967)
M. G. Millman and S. Katz: Ind. Eng. Chem. Proc. Des. Develop., 6477 (1967)

Section 8.6: The Riccati equation for the feedback gains is obtained in a different manner
in the monograph by Kalman and Englar cited for Sec. 8.2, together with a computer
code for solution. See also the books by Koppel and Lapidus and Luus and

S. M. Roberts: "Dynamic Programming in Chemical Engineering and Process


Control," Academic Press, Inc., New York, 1964
OPTIMAL AND FEEDBACK CONTROL 268

W. G. Tuel, Jr.: Preprints 1967 Joint Autom. Contr. Conf., Philadelphia, p. 549
J. Tou: "Optimum Design of Digital Control Systems," Academic Press, Inc., New
York, 1963
"Modern Control Theory," McGraw-Hill Book Company, New York, 1964
The book by Roberts contains further references. Instantaneously optimal methods have
been applied to discrete systems by
R. Koepcke and L. Lapidus: Chem. Eng. Sci., 16:252 (1961)
W. F. Stevens and L. A. Wanniger: Can. J. Chem. Eng., 44:158 (1966)

Appendix 8.1: A good introductory. treatment of Liapunov stability theory can be found
in most of the texts on control noted above and in
J. P. LaSalle and S. Lefschetz: "Stability by Liapunov's Direct Method with Applica-
tions," Academic Press, Inc., New York, 1961
For an alternative approach see
M. M. Denn: "A Macroscopic Condition for Stability," AIChE J, in press

PROBLEMS
8.1. For systems described by the equations
2i - f,(x) + b,(x)u
use the methods of Secs. 8.2 and 4.8 to obtain the linear and quadratic terms in the
nonlinear feedback control which minimizes

s
e (j' xiC,;z; + u' dt
3 2 fo \4
Extend to the feedback-feedforward control for a step disturbance d which enters as

ii - f,(x) + bi(x)u + g,(x)d


8.2. Extend the results of Sec. 8.3 to a second-order system with numerator dynamics,

z+at+bx-u+eii.+d
8.3. Extend the control approach of Sec. 8.4 and the optimization analysis to the case
when E(x) is an arbitrary convex positive definite function.
M. The unconstrained control problem

x = f(x,u,P)
x(0) = xo
s
min E a fo 5(x,u,c) di

where p and c are parameters, has been solved for a given set of values of xo, p, and c.
Obtain equations for the change du(t) in the optimal control when xo, p, and c are
changed by small amounts Sxo, bp, and Be, respectively. In particular, show that au
276 OPTIMIZATION BY VARIATIONAL METHODS

may be expressed as

au - I Ki(t) axi + 191k(t) apk + Z 92.0) ac,"


i k

where ax is the change in x and Ki(t) and g ,(t) are solutions of initial-value problems.
(Hint: The method of Sec. 8.2 can be used to solve the coupled equations for changes
in state and Green's functions.) Comment on the application of this result to the
following control problems:
(a) Feedback control when the system state is to be maintained near an optimal
trajectory.
(b) Feedback-feedforward control when small, relatively constant disturbances
can enter the system and the optimal feedback control for the case of no disturbance
can be obtained.
8.6. The system
x(t) + at(t) + bx(t) - u(t)
is to be regulated by piecewise constant controls with changes in u every r time units
to minimize
1
S- Ioe (z' + ci') dt
2
Obtain the equivalent form
xl" - 2="-1

x7" -0yl"-1 - axf"-1 + w"


H
min s - I [(x1")' + C(z:")11
2 n-1

Obtain explicit values for the parameters in the optimal control for N -- co
W. - -K1z1"-1 - Ktx:"-1

Extend these results to the system with pure delay T


2(t) + at(l) + bx(t) = u(t - T)
(The analytical solution to this problem has been obtained by Koppel.)
9
Numerical Computation

9.1 INTRODUCTION
The optimization problems studied in the preceding six chapters are
prototypes which, because they are amenable to analytical solution or
simple computation, help to elucidate the structure to be anticipated in
certain classes of variational problems. As in Chaps. I and 2, however,
where we dealt with optimization problems involving only differential
calculus, we must recognize that the necessary conditions for optimality
will lead to serious computational difficulties if rational procedures for
numerical solution are not developed. In this chapter we.shall consider
several methods of computation which are analogs of the techniques
introduced in Chap. 2. In most cases we shall rely heavily, upon the,
Green's function treatment of linear differential and difference equations
developed in Sees. 6.2 and 7.2.

9.2 NEWTON-RAPHSON BOUNDARY ITERATION


The computational difficulties to be anticipated in the solution of a vari-
ational problem are best illustrated by recalling the necessary conditions
2n
272 OPTIMIZATION BY VARIATIONAL METHODS

for minimizing the function 8[x(0)J in a continuous system when x(O) is


unconstrained, x(0) = xo is given, and 0 is specified. We must simul-
taneously solve the S state equations with initial conditions
0 < t < 0
- f,(x,u)
i= 1, 2, S
x(0) = xo (1)

the S multiplier equations with final conditions


s
af,
'ax; 0<t<e
a& i = 1, 2, .. .

7i(e) = axi (2)

together with the minimization of the hamiltonian at each value of t


s
min H = yi f, (3)
u(Q i-1

An obvious procedure for obtaining the solution is to assume either


x(e) or 7(0) and integrate Eqs. (1) and (2), choosing u at each value of t
to satisfy the minimum principle. By choosing x(0) we compute an
optimal solution for whatever the resulting value of x(0), but we must
continue the process until we find the x(9) which leads to the required xo.
Let us suppose that the possible range over which each component xi(0)
may vary can be adequately covered by choosing M values, xi(1)(0),
xi(2)(9),
. . , xi(M)(0). There are, then, MB possible combinations of
.

final conditions to be evaluated, and for each the 2S differential equa-


tions (1) and (2) must be integrated, or a total of 2SM8 differential equa-
tions are to be solved. A very modest number for M might be 10, in
which case a system described by only three differential equations would
require the numerical integration of 6,000 differential equations in order
to reduce the interval of uncertainty by a factor of 10. For S. = 4 the
number is 80,000. This exponential dependence of the number of com-
putational steps on the dimension of the system is generally referred to
as the curse of dimensionality.
For certain types of problems the curse may be exorcised by a
linearization procedure for improving upon estimates of x(8). We make
a first estimate 8(8) and integrate Eqs. (1) and (2) with respect to t from
t = 0 to t = 0, determining u at each step in the numerical integration
from the minimum condition. The valug of x so computed at t = 0,
x(0), will generally not correspond to the required value xo. A new value
of x(6) will produce new functions x(t), u(t) in the same way, and the
NUMERICAL COMPUTATION 273

first-order variational equation must be

bii = ax' bxj + auk auk (4)

k-1
where fix = x - 2, &u = u - n, and partial derivatives are evaluated
along the trajectory determined by u. Defining the Green's functions
rij(B,t) by
s
I
k-1
k
ax. (5)

rri(B,B) = b+j = { 0 iyd j


(6)

Green's identity, Eq. (15) of Sec. 6.2, may be written


s
VIN Is r.3 <,
R
af
b?Lk dt (7)
a4Ak
j-1 j-1 k-1
At this point we make the critical assumption that the optimal
policy does not differ significantly for neighboring values of z(8), in which
case Bu will be approximately zero. Equation (7) then provides an algo-
rithm for determining the new estimate of x(6)
S

z (9) _ fti(®) + r0(8,0)[xjo - .4(0)J (8)


j-1

Equation (8) may be looked upon as an approximation to the first two


terms of a Taylor series expansion, in which case we might interpret
r(6,0) as an array of partial derivatives
ax;(B)
r:j(9,0) = (9)
axj (0)
which is quite consistent with the notion of an influence function intro-
duced earlier. Equation (8) is then analogous to the Newton-Raphson
procedure of Sec. 2.2, where the function to be driven to zero by choice of
z(8) is z(0) - z0.
This procedure is equally applicable to systems described by differ=
ence equations
n is n-1 n n = 1, 2, . . . , N

The variational equations are


s R
ax;" = 6x`n-1
+ k-1 b1Lk" (11)
G
j-1
xj n_ 1
adJ, 4 a2lk"
274 OPTIMIZATION BY VARIATIONAL METHODS

with Green's functions defined by


St afkn
riix.n-1 rikNn

1r"
(12)
L/d
k-1

NN
_
-a" ij0
j (13)

The first-order correction to an estimate 2N is then


s
xiN = ZiN + rijNo[xjo - x-ol 11
(14)
j-i

As outlined, this Newton-Raphson procedure requires the calcu-


lation-of S2 + S Green's functions, the S2 functions rij(B,t)(r;, 'n) needed
for the iteration and the S functions yi(t)(yin) for the optimization prob-
lem. By use of Eq. (18) of Sec. 6.2 or Eq. (8) of Sec. 7.2 it follows from
the linearity and homogeneity of the Green's function equations that y
can be calculated from r

Yi(t) _ rji(9,t) (15)


8xj

with an identical relation for staged systems. Thus, only S2 equations


need be solved on each iteration.
It is likely that the linearization used in the derivation of the algo-
rithm will be poor during the early stages of computation, so that using
the full correction called for in Eqs. (8) and (14) might not result in con-
vergence. To overcome this possibility the algorithm must be written as
s
xi(B) = zi(o) + 1 rij(o,0)[xj, - x(0)) (16)
i-1
r
S

x'N = x'N + r
rijNO(xjo - xio) (17)
j-1
where r > 1 is a parameter controlling the step size. r must be taken
large initially and allowed to approach unity during the final stages of
convergence.

9.3 OPTIMAL TEMPERATURE PROFILE BY


NEWTON-RAPHSON BOUNDARY ITERATION
As an example of the Newton-Raphson boundary-iteration algorithm we
shall consider the problem of computing the optimal temperature profile
in a plug-flow tubular reactor or batch reactor for the consecutive-
NUMERICAL COMPUTATION 275

reaction sequence
X, --> X2 - > products
We have examined this system previously in Sees. 4.12 and 6.10 and have
some appreciation of the type of behavior to be anticipated. Taking
v = 1, F(x1) = x12 (second-order reaction), and G(x2) = x2 (first-order
reaction), the state is described by the two equations
z, = -k1oe-E,'""x,2 x1(0) = x10 (1a)
x2 = k10e-E"'l"x12 - k20e-E"/' x2 x2(0) = X20 (lb)
with u(t) bounded from above and below
U. < U < u* (2)

The objective is the maximization of value of the product stream, or


minimization of
-e[xl(9) - x,oj - [x2(9) - x2o1 (3)
where c reflects the value of feed x, relative to desired product x2.
The Green's functions r;;(9,t) for the iterative algorithm are defined
by Eqs. (5) and (6) of the previous section, which become

-- -
af,
- r,, axl r,2
af2

ax,
= 2x,k,oe-E,'I"(r - r,2)
r11(9,9) = I
t12 = - r , ,
axt
-
af-t
- r 12
af2
ax2
= k20e-Ei',"r,2 r12(8,0) = 0
(4a)

(4b)

af,
1'21 = -r21ax, - r22 af2

ax,
r,,(9,9) = 0 (4c)

1'22 = - r2, ax2 - r22ax2- k2oe-


Of,
=
of,
E,-," r22 r22(9,9) = 1 (4d)

It easily follows that r12(9,t) =_ 0 and will not enter the computation,
although we shall carry it along in the discussion for the sake of generality.
The hamiltonian for the optimization problem is
H= -Y,ktoe-E-'i"xt2
+ Y2(ktoe-E,'1"x12 - k2pe-Ei'i"x2)
(5)
where the Y; are computed from the r;; by
a6 as
71 = W r,, + ax2 r2, _ -crl, - r21 (6a)
as as
ax, r,2 + ax2 r22 = -cr,2 - r22
Y2 = (6b)
276 OPTIMIZATION BY VARIATIONAL METHODS

Minimization of the hamiltonian leads to the equation for the optimal u(t)
u* v(t) > u*
u(t) = v(t) u* < v(t) < u* (7)

IU* v(t) < u*


where the unconstrained optimum is

U (t) =
E' - E'
2 1
(8)
y2x2k2o
In
('Y2 - yi)xi2kio
The computational procedure is now to choose trial values x1(6),
22(6) and integrate the six equations (1) and (4) numerically from
t = 6 to t = 0, evaluating u at each step of the integration by Eq. (7)
in conjunction with Eqs. (8) and (6). At t = 0 the computed values
21(0), 22(0) are compared with the desired values x10, x20 and a new trial
carried out with values

x1(6) = 21(6) + {r11(6,0)[x1° - 21(0)] + r12(6,0)[x20 - 22(0)11

r (9a)

X2(0) = 22(6) + {r2,(0,0)[x10 - 21(o)J + r22(e,0)[x2° - 22(0)11

(9b)

In carrying out this process it is found that convergence cannot be


obtained when xi and x2 are allowed to penetrate too far into physically
impossible regions (negative concentrations), so that a further practical
modification is to apply Eqs. (9) at any value of t for which a preset
'bound on one of the state variables has been exceeded and begin a new
iteration, rather than integrate all the way to t = 0.
The values of the parameters used in the calculation are as follows:
k1o=5X1010 k20=3.33X 1017
E;=9X103 E2=17X103
u* = 335 u* = 355
6=6 c=0.3
x1o=1 x20=0
The iteration parameter r was taken initially as 2 and increased by 1
each time the error, defined as Ix1(0) - x1ol + 1x2(0) - x2ol, did not
decrease. In the linear region, where the ratio of errors on successive
iterations is approximately I - 1/r, the value of r was increased by 0.5
for each iteration for which an improvement was obtained. Starting
values of x1(6) and x2(6) were taken as the four combinations of
NUMERICAL COMPUTATION 277

x1(9) = 0.254, 0.647 and x2(0) = 0, 0.746, calculated from the limiting
isothermal policies as defining the extreme values.
The approach to the values of x1(9) and x2(9) which satisfy the
two-point boundary-value problem (x1 = 0.421, x2 = 0.497) are shown
in Fig. 9.1, where the necessity of maintaining r > 1 during the early
stages to prevent serious overshoot or divergence is evident. Successive
values of the optimal temperature profile for the iterations starting from
the point xl = 0.647, x2 = 0 are shown in Fig. 9.2, where the ultimate
profile, shown as a broken line, is approached with some oscillation. In
all four cases convergence to within 0.1 in u(t) at all t was obtained with
between 12 and 20 iterations.
As described in this section and the preceding one, the algorithm
is restricted to problems with unconstrained final values. This restric-
tion can be removed by the use of penalty functions, although it is found
that the sensitivity is too great to obtain convergence for large values
of the penalty constant, so that only approximate solutions ,can be
ests in part
realized. The usefulness of the Newton-Raphson method rests'
upon the ability to obtain an explicit representation of the optimal
decision, such as Eq. (7), for if the hamiltonian had to be minimized by
use of the search methods of Chap. 2 at each integration step of every
iteration to find the proper u(t), the computing time would be excessive.

0.7

0.6

0.5

m 0.4

0.3

Fig. 9.1 Successive approximations to 0.2


final conditions using Newton-Raph-
son boundary iteration. [From M. M.
Denn and R. Aris, Ind. Eng. Chem. 0.1
Fundamentals, 4:7 (1965). Copyright
1965bYthe American Chemical Societ y. 6/' " I I I e i

Reprinted by permission of the copyright 0.3 0.4 0.5 0.6 0.7


owner.] x, (8)
27$ OPTIMIZATION BY VARIATIONAL METHODS

Fig. 9.2 Successive approximations to the optimal


temperature profile using Newton-Raphson boundary
iteration. [From M. M. Denn and R. Aris, Ind. Eng.
Chem. Fundamentals, 4:7 (1965). Copyright 1965 by
the American Chemical Society. Reprinted by permission
of the copyright owner.]

It is this latter consideration, rather than convergence difficulty, which


has proved to be the primary drawback in our application of this method
to several optimization problems.

9.4 STEEP-DESCENT BOUNDARY ITERATION


The Newton-Raphson boundary-iteration algorithm convetges well near
the solution to the two-point boundary-value problem resulting from the
minimum principle, but it requires the solution of S(S - 1) additional
differential or difference equations for each iteration. An approach
which requires the solution of fewer equations per iteration, but which
might be expected to have poorer convergence properties, is solution of
the boundary-value problem by steep descent. The error in satisfying
the specified final (initial) conditions is taken as a function of the assumed
initial (final) conditions, and the minimum of this error is then found.
To demonstrate this procedure we shall consider the optimal-
pressiire-profile problem for the gas-phase reaction
X1 - 2X2 -* decomposition products
where the conversion to intermediate X2 is to be maximized. This
problem was studied in Sec. 4.13, where the system equations were
written as
xl = -2klu A +xl x2 x1(0) = x10 (1a)
2
xz = 4klu A + X2 - 4k2u2 (A
+ X2)2
X2(0) = x20
NUMERICAL COMPUTATION Z!!

Here A = 2x,o + x2o, and the objective is to minimize


g = -X2(0) (2)

For simplicity u(t) is taken to be unconstrained.


The hamiltonian for optimization is.
x1 x, x22
H = yi (_2kiu A + + y2 4k,u A + xt - 4kzu2
x2) (A + x2)21
(3)
with multiplier equations -

aH _ 2k,u CM
yl _ - 5x, A + X2
(y1 - 2yz) 7j(e) = ax, = 0 (4a)
aH 2k,,ux, 8k2uzyzAx2
12 = - ax2 (A +x2)2 (yl
2X:} + (A '+ x2)3
'at = -1
'12(8) _ xz
(4b)

The optimal pressure u(t) is obtained by setting aH/au to zero to obtain


A + x2 k,x,(y, - 272)
U
4 k2y2x22
5 t)
It is readily verified that yz < 0, in which case a2H/au2 > 0 and the
condition for a minimum is met. We shall attempt to choose initial
values y,(0), y2(0) in order to match the boundary conditions on y at
t = 8, and we shall do this by using Box's approximate (complex) steep-
descent method, discussed in Sec. 2.9, to find y,(0), y2(0) which minimize
E = (71(8))2 + 112(8) 4- 1)2 (6)
We begin with at least three pairs y,(0), yz(0) and integrate Eqs. (1)
and (3), calculating u at each step of the numerical integration from
Eq. (5), and determine the value of the error E from Eq. (6) in each case.
By reflecting the worst point through the centroid new values are found
until E is minimized.
The parameters used for computation are
k, = 1.035 X 10-2 k2 = 4.530 X 10-2
x10=0.010 x20=0.002
8=8.0 A=0.022
For these parameters the solution of the boundary-value problem-is at
y,(0) = -0.8201, 72(0) = -0.4563, with a value of x2(8) of 0.0132.
Table 9.1 shows a sequence of iterations using a' three-point simplex
with one starting point close to the minimizing point, and convergence
to 1 part in 101 in y(8) is obtained in 17 iterations. Computations `frog
other initial triangles are shown in Tables 9.2 and 9.3, and it is evident
that poor results might be obtained without a good first estimate.
Table 9.1Successive approximations to the Initial values of multipliers using the complex
method for steerdescent boundary Iteration to minimize final error in boundary conditions

xs(B) X2(8) as(a


E X 10' X10=
Iteration - y,(0) --y2(0) E X 106 - y1(0) - y:(0) E X 10' --Y1(O) -,Y2(O)
X10' X10

0.8000 0.4500 5.2 X 10' 1.130 0.5000 1.0000 3.9 X 1010 0.759 0.2000 0.9000 6.4 X 1010 0.732
1 0.8000 0.4500 5.2 X 10' 1.130 0.6000 1.0000 3.9 X 1010 0.759 0.8900 0.6317 2.3 X 100 0.955
2 0.8000 0.4500 5.2 X 10' 1.130 0.8879 0.4837 1.4 X 10' 1.123 0.8900 0.8317 2.3 X 10' 0.965
3 0.8000 0.4500 5.2 X 10' 1.130 0.8879 0.4837 1.4 X 10' 1.123 0.8470 0.4778 1.1 X 10' 1.129
4 0.8000 0.4500 5.2 X 10' 1.130 0.8385 0.4685 1.2 X 10' 1.132 0.8470 0.4778 1.1 X 10' 1.129
5 0.8000 0.4500 6.2 X 10' 1.150 0,8385 0.4685 1.2 X 10' 1.132 0.8045 0.4494 6.3 X 10' 1.132
6 0.8330 0.4637 7.6 X 10' 1.132 0.8386 0.4885 1.2 X 10' 1.132 0.8045 0.4494 6.3 X 10' 1.132
7 0.8330 0.4637 7.6 X 102 1.132 0.8082 0.4502 6.0 X 10' 1.132 0.8045 0.4494 6.3 X 10' 1.132
8 0.8330 0.4637 7.6 X 10' 1.132 0.8082 0.4502 6.0 X 10' 1.132 0.8291 0.4610 133 1.132
9 0.8110 0.4512 95.6 1.132 0.8082 0.4502 6.0 X 10' 1.132 0.8291 0.4810 133 1.132
10 0.8110 0.4512 95.6 1.132 0.8167 0.4544 22.3 1.132 0.8291 0.4610 188 1.132
11 0.8110 0.4512 95.6 1.132 0.8167 0.4544 22.3 1.132 0:8174 0.4547 8.6 1.132
12 0.8203 0.4563 6.5 1.132 0.4187 0.4544 22.3 1.132 0.8174 0.4547 8.6 1.132
13 0.8203 0.4563 6.5 1.132 0.8200 0:4561 0.14 1.132 0.8174 0.4547 8.6 1.132
14 0.8203 0.4563 6.6 1.132 0.8200 (r:4561 ' 0,14 .1.132 0.8194 0.4558 0.78 1.132
15 0.8199 0.4561 0.63 1.132 0.8200 0.4561 0.14 1.132 0.8194 0.4568 0.78 1.132
16 0.8199 0.4561 0.63 1.132 0.8200 0.4561 0.14 1.132 b.8202 0.4563 0.04 1.132
17 0.8200 0.4562 0.01 1.132 0.8200 0.4611 0.14 .1.132 0.8202 0.4563 0.04 1.132
Table 9.2 Successive approximations to the Initial values of multipliers using the complex
method for steep-descent boundary iteration to minimize final error in boundary conditions

x2(0) $2(e) x2(0)


Iteration -71(0) -1'2(0) E X 106 -1''(0) -1'2(0) .E X 10 6 X10= -1',(0) -72(0) E X 10 6' X102
X102

0.7000 0.4000 2.5 X 106 1.123 0.6000 0.5000 6.1 X 1010 0.882 1.0000 1.0000 6.8 X 101, 0.834
1 0.7000 0.4000 2.5 X 105 1.123 0.6000 0.5000 6.1 X 1010 0.882 0,4633 0:1567 7.9 X 106 - 1.916
2 0.7000 0.4000 2.5 X 10' 1.123 0.5869 0.3414 5.8 X 106 1.111 0.4633 0.1667 7.9 X 10, -1.916
3 0.7000 0.4000 2.5 X 105 1.123 0.5869 0.3414 5.8 X 1011, 1.111 0.6014 0.3208 4.4 X 106 1.087
4 0.7000 0.4000 2-5 X 105 1.123 0.6847 0.3705 2.5 X 105 1.116 0.6014 0.3208 4.4 X 105 1.087
5 0.7000 0.4000 2.5 X 106 1_123 0.6847 0.3706 2.6 X 106 1.116 0.7409 0.4197 1.2 X 106 1.127
6 0.7900 0.4000 2.6 X 105 1.123 0.7121 0.4007,. 3,7 X 10' 1.130 0.7409 0.4197 1.2 X 106. 1.127
7 0.7406 0.4156 2.2 X 10' 1.131 0.7121 0.4007 3.7 X 10' 1.130 0.7409 0.4197 1.2 X 105 1.127
8 0.4156 2.2 X 10' 1.131 0.7121 0.4007 8.7 X 10' 1.130 0.7186 0.4020 1.4 X 10' 1.131
9 0.7406 0.4156 2.2 X 10' 1.131 0.7390 0.4131 9.7 X 10' 1.131 0.7186 0.4020 1.4 X 10' 1.131
10 0.7225 0.4032 1.1 X 10' 1.132 0.7390 0.4131 9.7 X 105 1.131 0.7186 0.4020 1.4 X 10' 1.131
11 0.7225 0.4082 1.1 X 10' 1.132 0.7390 0.4131 9.7 X 10' 1.131 0.7372 0.4115 8.1 X 10' 1.132
12 0.7464 0.4171 8.3 X 10' 1.132 0.7390 0.4181 9.7 X 10' 1.131. 0.7372 0.4115 8.1 X 10' 1.132
13 0.7464 0.4171 8.3 X 10' 1.132 0.7433 0.4149 7.2 X 10' 1.132 0.7372 0.4115 8.1 X 10' 1.132
14 0.7370 0.4111 7.9 X 10' 1.132 0.7433 0.4149 7.2 X 10' 1.132 0.7372 0.4115 8.1 X 10' 1.132
15 0.7370 0.4111 7.9 X 10' 1.182 0.7433 0.4149 7.2 X 10' 1.132 0.7417 0.4138 7.2 X 10' 1.132
16 0.7454 0.4161 6.9 X 10' 1.132 0.7433 0.4149 7.2 X 10' 1.132 0.7417 0.4138 7.2 X 10' 1.132
17 0.7454 0.4161 6.9 X 10' 1.132 .0.7437.. 0.415p. 6.9 X 10' 11.132 0.7417 0.4138 7.2 X 10' 1.132
Table 9.3 Successive approximations to the initial values of multipliers using the complex
method for steep-descent boundary iteration to minimize final error in boundary conditions

=zX10(6) xxX10(6)
Iteration -yt(0) - y,(0) E X 10' E X 108 x2 (8)
-y'(0) -ys(U) --`(U) - y,(0) E X 106
= = X102

0.4000 0.6000 8.4 X 10' 0.780 0.5000 0.8000 1.7 X 1010 0.774 0.7000 0.7000 3.3 X 10' 0.834
1 0.4000 0.6000 8.4 X 10' 0.780 0.5767 0.5700 2.1 X 10' 0.836 0.7000 0.7000 3.3 X 10' 0.834
2 0.7654 0.6537 1.2 X 10' 0.874 0.5767 0.5700 2.1 X 10' 0.836 0.7000 0.7000 3.3 X 10' 0.834
3 0.7654 0.6537 1.2 X 10' 0.874 0.5767 0.5700 2.1 X 10' 0.836 0.6566 0.5648 9.8 X 108 0.871
4 0.7654 0.6537 1.2 X 10' 0.874 0.7819 0.6302 7.7 X 108 0.894 0. 6-566 0.5648 9.8 X 108 0.871
5 .0.6939 0.5675 6.9 X 108 0.889 0.7819 0.6302 7.7.x 108 0.894 0.6566 0.5648 9.8 X 108 0.871
6 0.6939 0.5675 6.9 X 108 0.889 0.7819 0.6302 7.7 X 108 0.894 0.7818 0.6170 6.2 X 108 0.902
7 0.6939 0.5676 6.9 X 108 0.889 0.7143 0.5720 6.0 X 108 0.897 0.7818 0.6170 6.2 X 108 0.902
8 0.7769 0.6089 5.7 X 108 0.905 0.7143 0.5720 6.0 X 108 0.897 0.7818 0.6170 6.2 )( 108 0.902
9 0.7769 0.6089 5.7 X 108 0.903 0.7143 0.5720 6.0 X 108 0.897 0.7263 0.5763 5.7 X 10' 0.900
10 0.7769 0.6089 5,.7 X 108 0.906 0.7716 0.6035 5.5 X 108 0.906 0.7263 0.5763 5.7 X 108 0.900
11 0.7340 0.5799 5.5 X 10' 0.902 0.7716 0.6035 5.5 X 108 0.906 0.7263 0.5763 5.7 X 108 0.900
12 0.7340 0.5799 5.5 X 108 0.902 0.7716 0.6086 6.6 X 10' 0.906 0.7669 0.6000 5.5 X 108 0.906
13 0.7340 0.5799 6.6 X 108 0.902 0.7392 0.5826 5.5 X 108 0.903 0.7669 0.6000 5.5 X 108 0.906
14 0.7632 0.5974 5.4 X 108 0.906 0.7392 0.5826 6.5 X 108 0.903 0.7669 0.6000 5.5 x 108 0.906
15 0.7632 0.5974 5.4 X 108 0.906 0.7788 0.6072 5.4 X 108 0.908 0.7669 0.6000 6.6 X 108 0.906
16 0.7632 0.5974 5.4 X 108 0.906 0.7788 0.6072 5.4 X 10' 0.908 0.7732 0.6035 5.4 X 108 0.907
17 0,7632 0.5974 5.4 X 10' 0.906 0.7625 0.5968 5:4 X 10' 0.906 0.7732 0.6035 5.4 X 108 0.907
NUMERICAL COMPUTATION 283

For problems of the specific type considered here, where all initial
values of the state variables are known and final values unspecified, we
can use a more direct approach to steep-descent boundary iteration.. The
value of the objective 6 depends only upon the choice of y(0), for every-
thing else is determined from the minimum-principle equations if all initial
conditions are specified. Instead of minimizing E, the error in final con-
ditions, it is reasonable simply to seek the minimum of S directly by steep-
descent iteration on the initial conditions. Tables 9.4 and 9.5 show
the results of such a calculation using the simplex-complex procedure.
In neither case are the values of yi(O) = -0.8201, yz(0) _ -0.4563
approached, although the same ratio 1.80 of these values is obtained.
It is evident from Eq. (5) that only the ratio,-yl/,y2 is required for defining
the optimum, and Eqs. (4) can be combined to give a single equation for
this ratio. Hence the optimum is obtained for any initial pair in the
ratio 1.80 and, had we so desired, we might have reduced this particular
problem to a one-dimensional search.

9.5 NEWTON-RAPHSON FUNCTION ITERATION: A SPECIAL CASE

We have already seen how a computational' scheme of the Newton-


Raphson type can be applied to boundary-value problems by lineari-
zation of the boundary conditions. The theory of linear differential and
difference equations is highly developed, and we know that the principle
of superposition can be used to solve linear boundary-value problems.
Thus, we might anticipate that a Newton-Raphson linearization approach
to the solution of differential equations would be practical. Before devel-
oping the procedure in general it is helpful in this case to study a specific
simple example.
A convenient demonstration problem of some importance in physics
is the minimization of the integral
(2 [l + (x)11 dt
1 x (1)

where the function x(t) is to be chosen in the interval 1 < t < 2 subject
to boundary conditions x(1) = 1, x(2) = 2. This is a special case of
Fermat's minimum-time principle for the path of a light ray through an
optically inhomogeneous medium. In the notation of Sec. 3.2 we write
ff(x,x,t) = x-I[1 + (±)2J 4 (2)

and the Euler equation


day a3
dt ax = ex (3)
Table 9.4Successive approximations to the Initial values of multipliers using the
complex method for steep-descent boundary Iteration to minimise the objective

(0)(0) ys(0)(0)
Iteration -y,(0) -,y2(0) 'Y-Y12
-h X 10' -71(0) -Y:(0) (0) --E X 10' -y,(0) -y:(0) -S X 10'
Yt (0) W(O)

0.5000 0.5000 1.00 0.834 0.5000 1.0000 0.50 0.759 0.2000 0.9000 0.22 0.732
1 0.5000 0.5000 1.00 0.834 0.5000 1.0000 0.50 0.759 0.6600 0.6700 0.99 0.831
2 0.5000 0.5000 1.00 0.834 0.6227 0.3637 1.71 1.107 0.6600 0.6700 0.99 0.831
3 0.5000 0.5000 1.00 0.834 0.6227 0.3637 1.71 1.107 "0.5087 0.3048 1.67 1.086
4 0.5739 0.3136 1.83 1.126 0.6227 0.3637 1.71 1.107 0.5087 0.3048 1.67 1.086
5 0.5739 0.3136 1.83 1.126 0.8227 0.8637 1.71 1.107 0.6460 0.3567 1.81 1.131
6 0.5739 0.3138 1.83 1.126 0.6129 0.3418 1.79 1.132 0.8468 0.3567 1.81 1.131
.
7 0.6591 0.3682 1.79 1.132 0.6129 0.3418 1.79 1.132 0.6460 0.3587 1.81 1.131
8 0.6591 0.8682 1.79 1.132 0.6129 0.3418 1.79 1.132 0.6384 0.3544 1.80 1.132
9 0.6078 0.3381 1.80 -1.132 0.8129 0.3418 1.79 1.132 0.6384 .0.3544 1.80 1.132
10 0.6078 0.3381 1.80 1.132 0.6243 0.3474 1.80 1.132 0.6384 0.3544 1.80 1.132
Table 9.5Successive approximations to the initial values of multipliers using the
complex method for steep-descent boundary iteration to minimize the objective

n(0) y'(0) - S X 10'


Iteration --y,(0) --Ys(O) 'Yi(0) - 9 X 10' -7L(O) -'YS(O) - X 10' -.1(0) --Y%(O)
72(0) (0)
72(0)

0.9000 0.5500 1.64 1.068 0.5000 1.0000 0.50 0.759 0.2000 0.9000 0.22 0.732
1 0.9000 0.5500 1.64 1.068 0.5000 1.0000 0.50 0.759 0.9667 0.7083 1.36 0.937
2 0.9000 0.5500 1.64 1.068 0.9873 0.5830 1.69 1.098 0.9667 0.7083 1.36 0.937
3 0.9000 0.5500 1.64 1.068 0.9873 0.5830 1.69 1.098 0.9408 0.5489 1.71 1.108
4 0.9982 0.5744 1.74 1.118 0.9873 0.6830 1.69 1.098 0.9408 0.5489 1.71 1.108
5 0.9982 0.5744 1.74 1.118 0.9600 0.5503 1.75 1.121 0.9408 0.6489 1.71 1.108
6 .0.9982 0.5744 1.74 1.118 0.9600 0.5503 1.75 1.121 0.9995 0.5695 1.75 1.124
7 0.9699 0.5521 1.76 1.125 0.9600 0'.6603 1.76 1.121 0.9995 0.5695 1.75 1.124
8 0.9699 0.5521 1.76 1.125 0.9979 0.5665 1.76 1.126 0.9996 0.5696 1.75 1.124
9 0.9699 0.6521 1.76 1.126 0.9979 0.5665 1.76 1.126 0.9756 0.5539 1.76 1.126
10 0.9957 0.5645 1.76 1.127 0.9979 0.5665 1.76 1.126 0.9766 0.5689 1.76 1.126
I
2116 OPTIMIZATION BY VARIATIONAL METHODS

reduces to the nonlinear second-order equation


x2 +(z)+1=0 x(1)=1
(4)
x(2) = 2
By noting that
xx + (x)2 = xx
at 2 it (d x) (5)

Eq. (4) can be integrated directly to obtain the solution satisfying the
boundary conditions
x(t) = (6t - t= - 4))s (6)
We wish now to solve Eq. (4) iteratively by a Newton-Raphson
expansion analogous to that developed for nonlinear algebraic equations
in Sec. 2.2. We suppose that we have an estimate of the solution,
x(*) (t), and that the solution is the result of the (k + 1)st iteration,
x(k+')(t). The nonlinear terms in Eq. (4) may be written
x(k+1)x(k+1) = x(k)x(k) + x(k) \\(x(k+l) - x(k)) + x(k) (i(k+1) - x(k))
+ higher-order terms (7a)
(Z(k+1))2 = (±(k))2 + 21(k)(t(k+1) _, t(k))
+ higher-order terms (7b)
With some rearranging and the dropping of higher-order terms Eq. (5)
then becomes a linear ordinary differential equation in x(k+1)

Z(k+1) + xtk) ) p+1) + 2(k) (k+1) x(k) + (x(k)x(k) 1


x(k+1)(1) =1
x(k+1) (2) = 2 (8)
A particularly convenient starting estimate which satisfies both
boundary conditions is
x(o) (t) =t (9)
in which case Eq. (8) for x(l) simplifies to

z(1) + x(n = 0 (10)

This linear homogeneous equation has two solutions, x(') = 1/t and
x(1) = 1. Using the principle of superposition, the general solution is
a linear combination of the two
x(1) = c1 + C2t-1 (ii)
and the constants are evaluated from the boundary conditions at t 1
NUMERICAL COMPUTATION Z$7

andt=2
x(1)(1) = 1 = C, + C2 (12a.)
x(1)(2) = 2 = C1 + %C2 (12b)
The solution is then
x(') (t) = 3 - 2t-' (13)
Table 9.6 shows the agreement between the exact solution and these
first two Newton-Raphson approximations.
The starting approximation need hot satisfy all or any of the
boundary conditions, though by use of superposition all subsequent
approximations will. For example, with the constant starting value
x(0) = 1.5 Eq. (8) for x(1) becomes
x(u = -23 (14)
The homogeneous solutions are x(') = 1, x(') = t, while the particular
solution obtained from the method of undetermined coefficients is -%t2.
By superposition, then, the general solution is
x(1) = C1 + C2t - /St2 (15)

Solving for the. constants from the boundary conditions,


x(1)(1)
= 1 = c1 + c2 _- % (16a)
X(1) (2)= 2 = c, + 2c2 - % .(16b)

so that the solution is


x(1) = -2j3 + 2t - lj.3t2 (17)

Table 9.6 Comparison of exact


solution and approximation using
Newton-Raphson function iteration

t x(°)(t) xttl

1.0 1.000 1:000 1.000


1.1 1.100 1.182 1.179
1.2 1.200 1.334 1.327
1.3 1.300 1.462 1.453.
1.4 1.400 1.572 1.562
1.5 1.500 1.667 1.658
1.6 1.600 1.750 i.744
1.7 1.700 1.824 1.819
1.8 1.800 1.889 1.887
1.9 1.900 1.947 1.947
2.0 2.000 2.000 2.000
2ta OPTIMIZATION BY VARIATIONAL METHODS

Table 9.7 Comparison of exact


solution and approximation using
Newton-Raphson function iteration

t x(0)(t) x(t)(t) x(t)

1.0 1.500 1.000 1.000


1.1 1.500 1.130 1.179
1.2 1.500 1.253 1.327
1.3 1.500 1.370 1.453
1.4 1.500 1.480 1.562
1.5 1.500 1.5&3 1.658
1.6 1.500 1.680 1.744
1.7 1.500 1.770 1.819
1.8 1.500 1.853 1.887
1.9 1.500 1.930 1.947
2.0 1.500 2.000 2.000

Table 9.7 shows the start of convergence for this sequence of -Newton-
Raphson approximations, and considering the crude starting value, the
agreement on the first iteration is excellent.
It is helpful to observe here that the principle of superposition can
be used in such a way as to reduce the subsequent algebra. We can
construct a particular solution satisfying the boundary condition at t = 1,
x('D) = 3 - %J2 (1S)
To this we add a multiple of a nontrivial homogeneous solution which
vanishes at t = I
x(Ih) = t -1 (19)

so that the solution is written


x(1) = x(IP) + clx<u) - Y3 - 13t2 + cl(1 - t) (20)

The boundary condition at t = 1 is automatically satisfied, and the


single coefficient is now evaluated from the one remaining boundary
condition at t = 2, leading again to the result in Eq. (17).

9.6 NEWTON-RAPHSON FUNCTION ITERATION: GENERAL ALGORITHM

The example of the preceding section is a graphic demonstration of the


use of a Newton-Raphson linearization of the differential equation to
obtain an approximate solution to a nonlinear boundary-value problem,
but it is peculiar in that explicit analytical solutions are available for
both the exact and approximate problems. The general variational
NUMERICAL COMPUTATION 2$S

problem in which the decision function u(t) is unconstrained leads to 2S


nonlinear first-order equations with S conditions specified at t = 0 (it = 0)
and S conditions at t = B (n = N). We now consider the extension of
the Newton-Raphson approach to this problem. This is frequently
referred to as quasilinearization.
We shall suppose that we have a system of 2S differential equations
yi = Fi(y) i=1,2,...,2S (1)

In a variational problem S of the variables would correspond to the state


variables x,(t) and the remaining S to the Green's functions y,(t). It is
assumed that the condition all/au = 0 has been used to obtain u
explicitly in terms of x and y. If we number the components of y such
that the first S components are specified at t = 0, we have the condition
NO i = 1, 2, . . . , S
Yi(0) (2)
unspecified i = S + 1, S + 2, . . . , 2S
Any S variables or combination might be specified at t = 0, depending
upon the nature of the problem.
We now assume that we have an nth approximation to the solution
y(n). If the (n + 1)st result is assumed to be the exact solution, we
write Eq. (1) as
yi(n+l) = F,(Y(n+l))
(3)

and the right-hand side may be expanded in a Taylor series about the
function y(n) as
s
zC
aFi(Y(n)) - yi(n)) +
0i(n+1) = Fi(Y(n)) + / (y1(n+l)
(4)
i_1 ayi

or, regrouping and neglecting higher-order terms, ,

2S zs
C aFi(y(n)) aF,(y(n))
a yj(n+n + [F() _ ay y,(n)1 (5)
is1 y, 1

Eq uation (5) is a set of linear nonhomogeneous ordinary differential


equations to which the principle of superposition applies. Thus, we
need obtain only a single particular solution which satisfies the S boundary
conditions at t = 0 and add to this, with undetermined coefficients, S
solutions of the homogeneous equation
zs

I
F,(y(n))
cn+n,n = --- y1(n+1).h (g)
yi
2" OPTIMIZATION BY VARIATIONAL METHODS

The S coefficients are then found from the S boundary conditions at


t = 8.
To be more precise, let us denote by yi(n+').p the solution of Eq. (5)
which satisfies the initial condition

yi(n+l).p(0) Yio 21,2,...,5


l
// arbitrary i=S+1,S+2, . . . 2S (7)

and by k = 1, 2, . . . , S, the linearly independent solutions


of Eq. (6) with zero initial conditions for i = 1, 2, . . . , S. The general
solution of Eq. (5) may then be written
s
y,(n+l)(t) = yi(n+1).p(t) + I ckyi(n+1).hh(t)
i = 1, 2, , 2S (8\
k-1

and the S constants ck are determined from the S specified conditions


at t = 8. For example, if y1(8), 112(8), and the last S - 2 values are
specified, the ck would be obtained from the S linear algebraic equations
s
yi(e) = yi(n+1).P(8) + I
k-1
i = 1, 3, S + 2, 8 + 3, . . . , 2S (9)

As outlined, the method is confined to variational problems in


which there are no trajectory constraints on u or x but only specifications
at t = 0 and B. Since u must be completely eliminated by means of the
stationary condition for the hamiltonian, constraints on u can be imposed
only if they are convertable to constraints on a and y, which can be
incorporated by means of penalty functions. Finally, we note that a
completely analogous treatment can be applied to difference equations.

9.7 OPTIMAL PRESSURE PROFILE BY


NEWTON-RAPHSON FUNCTION ITERATION

In order to demonstrate the use of Newton-Raphson function iteration


we shall again consider the optimal-pressure problem. The equations
are given in Sec. 9.4 as

i1 = -2k1u A x1(0) = x10


+ xs
2
12 = 4k1u A + - 4k2u2 (A + x2(0) = x20
X2 x2)2
S = -x2(8)
NUMERICAL COMPUTATION M
The Green's functions satisfy
2k1u
tit=A+X2(f1-27s) til(e)=0 (3a)
2k1ux1 8k2u272A'x2
rs = - (A + x2) 2 (11 - 272) + (A + x2)'
72(8) = - 1 (3b)

and the optimal pressure satisfies

u(t) = _ A +4 x2 kjx&yl - 272)


k272x22
(4)

Substitution of Eq. (4) into Eqs. (1) and (3) leads to the four equations
in four variables

x1 =
k12x12 _1 +272
-
71
(5a)
k2x2 2

k12x12 712
z2 = (5b)
ksx22 1- 7 2
k12x1
tit = - 2ksysx22 (71 - 272)2 (5c)

k12x12
72 (71 - 272) s (5d)
= 2k27 2x2a

In the notation of the previous section x1; x2, 71i and 72 correspond,
respectively, to y1, y2, y,, y4 Values at t = 0 are given for x1 and z2
(y1 and y2) and at t = 0 for 71 and 72 (y: and y4). The linearized equa-
tions for the iteration become, after some simplification,

k12(x1("))2 2 71(n)

k2(x2(n))2 x1(n) 272(n)


2 71("> 1
7i(n+u
X2(ft)
-1 + 272(")) x2(n+1) +
272(n)
k12(x1(n) ) 2 7 i(n)
71(")
2(7i(n))2
72(n+l)
+ ) (6a)
k2(X2( ( 72W

±2('1+0 = k12(x1("))2 2
1-
(71(n))2
] x1(.+,)
k2(x2(n))2
x100 4(72("))2
2
x2%) 1
- (71(n) 2 (n)
x2(n+l) - 2(y2(n))2 y1(n+1)

(71(n))2

+ k12(x1(n))2 _ (71(n)))

+ 2(72(n)2 72 (n+1) } k2(x2(n))2 L1 (sb)


4(72(n) 2 J
1
292 OPTIMIZATION BY VARIATIONAL METHODS

k12x1(n) (7'(n) - 272(n))2


k2(x2(n)) 27z(n) 2x1(n)
(71(n)
- 2v2(n))2
x2(ntl) - (71(n) - 27,(n))71("+1)
2X2 (n)
(7i(n))2 - 4(72(n))2
72(n+1)
27x(n)

-
k12x1(n)
2k2(x2(n))27s(n)
(7 i(n) - 27z("))2 (6c)

k12(x1(n))2 f(7i(n) - 2,,,(n))2


.y2(n+1) xl(n+1)
k2(x2(n))172 (n) xl(n)

(71(n) - 2y2')2 x2(n+l) + (71(n)


- 272(n)) 71(n+1)
2x (n)
4(72(n))2

27x(")
- (71(n))2

72(n+1)

k12(xl(n))2
(71(n) - 272("))2 (6d)
2k2(x2(n))i72(n)

The particular solution, with components x1(n+li.P, x2(n+1).P, 71(n+1).P,

72(n+l).P, is the solution of Eqs. (6) with initial conditions xlo, x20, 0, 1.
(The latter two are arbitrary.) The homogeneous equations are obtained
by deleting the last term in each of Eqs. (6), and the homogeneous solu-
tions, with components xl(n+l).hk, x2(n+l).Ak, 71(n+1).h,, 7:(n+1),hk, k = 1, 2,
are solutions of the homogeneous equations with initial conditions 0, 0,
1, 0 and 0, 0, 1, 1. (The latter two are arbitrary but must be inde-
pendent for the two solutions and not identically zero.) The general
solution can then be written
xl("+1)(t) = x1('+I).P(t) + c1x1(n+l).A,(1) + C2x1(n+1).k(t) (74)
x2(n+1)(t) = x2(n+1).P(t) + Cax2(n+l).A,(t) -4- c2x2(n+l).A,(t) (7b)
= 7t(n+1),P(J) +
71(n+1)(t) C171(n+1).A,(t) + c271(n+l).k0)
(7c)
72(ntll(t) = .2(n+1).P(t) + C172(n+1).A,(t) + C272(n+1).)y(t)
(7d)

The initial conditions on xl and x2 are automatically satisfied, and the


coefficients cl and c2 are evaluated from the boundary conditions on 71
and 72 at t = 0
71(8) = 0= 71(n+l>.P(0) + c171(n+1).4,(0) + C271(n+l)Jy(0)
(8a)
72(0) = -1 = 72(n+1).P(0) + C172(n+l).A,(0) + C272 (0) (8b)

We show here some calculations of Lee using the parameters in


Sec. 9.4 with the following constant initial choices:
xl(0) = 0.01 x2(0) = 0.01
71(0) = 0 72(0) = -.1.0
NUMERICAL COMPUTATION 293

Convergence of xi and x2 is shown in Fig. 9.3, and the results of the first
five iterations are listed in Table 9.8.. The rather poor starting values
result in rapid convergence, though with some oscillation, and it can,
in fact, be established that when convergence occurs for this Newton-
Raphson procedure, it is quadratic.

9.8 GENERAL COMMENTS ON INDIRECT METHODS

The computational techniques discussed thus far make use of the neces-
sary conditiops for the optimal decision function at each iteration and
are often called indirect methods. Reasonable starting estimates can
often be obtained by first assuming a function u(t) and` then solving
state and multiplier equations to obtain functions or boundary con-
ditions, as required, or by the use of the direct methods to be developed
in the following sections. .

A common feature of. indirect methods is the necessity of solving


state and Green's function equations simultaneously, integrating both

Fig. 9.3 Successive approximations to optimal concen-


tration profiles using Newton-Raphson function itera-
tion. [From E. S. Lee, Chem. Eng. Sci., 21:183 (1966).
Copyright 1966 by Pergamon Press. Reprinted by per-
mission of the copyright owner.]
$4 OPTIMIZATION BY VARIATIONAL METHODS

Table 9.8 Successive approximations to optimal concentration and


multiplier profiles using Newton-Raphson function Iteration t

i x((0) X 10' XI(" X 10' x;(') X 10' X 102 Xs(') X 10' x,(') X 10'
0 1.0000 1.0000 1.0000 .1.0000 1.0000 1.0000
2 1.0000 0.6434 0.6372 0.6620 0.6634 0.6634
4 1.0000 0.6370 0.5119 0.5332 0.5342 0.5343
6 1.0000 0.6607 0.4393 0.4486 0.4493 0.4493
8 1.0000 0.6602 0.3927 0.3860 0.3864 0.3864

i x:(0) X 10' x:(') X 10' x:(') X 10' x2(') X 102 xs(`) X 10' x:(') X 10'

0 1.0000 0.2000 0.2000 0.2000 0.2000 0.2000


2 1.0000 0.8471 0.8060 0.7795 0.7771 0.7770
4 1.0000 1.0215 0.9757 0.9610 0.9597 0.9597
6 1.0000 1.1011 1.0648 1.0651 1.0643 1.0643
8 1.0000 1.1392 1.1147 1.1324 1.1320 1.1320

t -ys(0) -yl(s) -yl(s) -71(3) - In(4)


0 0 1.2736 0.8555 0.8188 0.8202 0.8202
2 0 0.8612 0.6945 0.6785 0.6780 0.6780
4 0 0.5696 0.4868 0.4911 0.4913 0.4914
6 0 0.3032 0.2687 0.2648 0.2654 0.2654
8 0 0 0 0 0 0

I -ys(o) -y:(') -yea) -y:(') -y:(') -72(6)

0 1.0000 1.0908 0.5492 0.4625 0.4563 0.4582


2 1.0000 1.2665 0.7399 0.6647 0.6650 0.6650
4 1.0000 1.2592 0.8636 0.7915 0.7911 0.7911
6 1.0000 1.1320 0.9345 0.9002 0.8997 0.8997
8 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000

t From E. S. Lee, Chem. Eng. Sci., 21:183 (1966). Copyright 1966 by Pergamon
Press. Reprinted by permission of the copyright owner.

from either t = 0 to 0 or 0 to 0. A potentially serious computational


problem not evident in the examples can be observed by considering the
simplest of systems

z=ax+u (1)

for which the Green's function equation is

y = -ay (2)
NUMERICAL COMPUTATION 2!S

Solving both equations,


x(t) = x(0)e°' + f o' e°('-')u(r) dr (3a)
-y(t) = y(O)e-' (3b)

Thus, when either equation has decreasing exponential behavior, the


other must be an increasing exponential. It is: for this reason that the
statement is often made that the optimization equations are unstable.
In any indirect method extreme sensitivity and poor convergence will
result whenever 0 is significantly greater than the smallest time constant
of the system.

9.9 STEEP DESCENT

The difficulties associated with the solution of the two-point boundary-


value problem may be circumvented in many instances by adopting a
direct approach which generalizes the steep-descent technique developed
in Sees. 2.5 and 2.6. The necessary conditions for optimality are not
used, and instead we obtain equations for adjusting estimates of the
decision function in order to improve the value of the objective. For sim-
plicity we shall carry out the development with only a single decision
variable.
The system is described by the ordinary differential equations
with known initial values
z; = f:(x,u) x1(0) = xco (1)

where u(t) may be bounded from above and below


u* < u(t) < u* (2)

The goal is to minimize 8[x(0)1, and for the present we shall assume.that
the values of x(0) are entirely unconstrained.
If we choose any function u(t) which satisfies the upper- and lower-
bound constraint, we may integrate Eq. (1) and obtain a value of 8.
The effect of a small change ou(t) in the entire decision function is then
described to first order by the linear variational equations

ax` ax, + au` au ax1(0) = 0 (3)

and the corresponding first-order bhange in 8 is

ax,(0) (4)
8x;
ti
2!6 OPTIMIZATION BY VARIATIONAL' METHODS

All partial derivatives are evaluated along the solution determined by


u. The Green's function for Eq. (3), as discussed in Sec. 6.2, must
satisfy the adjoint equation
af;
yJax;

Green's identity is then, noting that 6x;(0) = 0,

I y,(9) ax;(e) = fo` L su dt (6)


-Y.

By defining boundary conditions for Eq. (5) as


as
-Mo) = (7)
ax;

we can combine Eqs. (4) and (6) to write

a6 =o fy. u' au dt (8)

Until this point the approach does not differ from the development
of the weak minimum principle in Sec. 6.5. Now, as in Sec. 2.5, we make
use of the fact that u(t) is not the optimum, and we seek au(t) so that t
is made smaller, or S8 < 0. An obvious choice is

au(t) = -w(t) y; aut w(t) > 0 (9)

where w(t) is sufficiently small to avoid violation of the linearity assump-


tion. Then

SS = - fo w(t)
af;z dt < 0 (10)
y` au

At a bound we must take w equal to zero to avoid leaving the allowable


region.
It is helpful to examine an alternative approach which follows
the geometrical development in Sec. 2.6. We define distance in the
decision space as

nZ = fo g(t)[au(t)]2 dt g(t) > 0


NUMERICAL COMPUTATION 297

and we seek on which minimizes (makes as negative as possible)

as = y; u' au (t) dt (12)


Io

This is an isoperimetric problem in the calculus of variations (Sec. 3.6)


for which the Euler equation is

(au)
y d ii au + xg(su)'] = 0 (13)

Here X is a constant Lagrange multiplier. .

Differentiating and solving for au, we obtain


1
au = - 2xg(t) y'
Of,
T (14)
;
From Eq. (11), then,
A2 = 12 r,9 l af, 2
dt (15)
4, o y` au

or, substituting back into Eq. (10),

G(t) y; af;/au
au = -A (;

af,/au)2 (16)
[J0 G(r)
fr
dr
\ y'

where G(t) is the inverse of g(t) and the positive square root has been
used. For later purposes it is helpful to introduce the notation
\2
rEE = f G(r) y; a') dr (17)

in which case
G(t) I y; af;/au
au = -A _
(18)
1.",
EEE
Thus,
w(t)
G( (19)
1881,
t

where G is the inverse of the metric (weighting function) of the space.


If we agree to define the hamiltonian

H= y;f: (20)
298 OPTIMIZATION BY VARIATIONAL METHODS

for any decision function, not just the optimum, the improvements in
u may be conveniently written

Su = -zv(t) (21)
au
In the case of more than one decision it easily follows that
61H
5u, wi; (22)
au

For systems described by difference equations


xtn = fn (x n-',un) (23)

where S(XN) is to be minimized, an analogous development leads to

aw - win C1Hft
au;n (24)

with
Hn = \ 1tinlia (25)
Ls

af" as
y'n-1 = yin axtp;-1 7tiN = aXN
(26)

In that case the equivalent to Eq. (19) is


Gn
Zwn = - 0 (27)
It.6 4
N
IE
I Gn ( 2 (20)

The significant feature of steep descent should be noted here.


The boundary-value problem is completely uncoupled, for the state
equations are first solved from t = 0 to 9. (n = 0 to N) and then the
multiplier equations solved with known final condition from t = 9 to
t = 0. If the state equations are stable when integrated from t = 0 in
the forward direction, the multiplier equations are stable in the direction
in which they are integrated, from 0 to zero. One practical note of
precaution is in order. When stored functions are used for numerical
integration, a means must be provided for interpolation, for most numeri-
cal-integration algorithms require function evaluation at several loca-
tloqs between grid points.
NUMERICAL COMPUTATION 2!!

9.10 STEEP DESCENT: OPTIMAL PRESSURE PROFILE

The method of steep descent is the most important of the computational


techniques we shall discuss, and we demonstrate its use with several
examples. The first of these is the optimal-pressure problem for con-
secutive reactions studied previously in this chapter. The pertinent
equations are

it = -2k,u A X1 xl (1a)

xz = 4k,u A
+ - 4kzu2 2
(lb)
x2 (A + x2)2
S = -X2(0) (2)
2k,u
A+x2 (Y1 - 2Y2) Y1(0) = 0 (3a)

2k,ux1 8k2u2Y2Ax2
'Y2 = - (A + x2) 2 (y I - 272) + (A + X2)' 72(0) _ -1 (3b)
aH x22

T -2k, A +XIX2 (-I, - 2Y2) - 8k2 (A + x2)2 u (4)

The values of the parameters are those used for previous calcu-
lations. Following a choice of u, a(t), the new decision function is caleu-
lated from the equation

unew(t) = u - W (5)
au
where Eq. (4) for all/au is evaluated for u and the solutions of Eqs. (1)
and (3) are calculated using u. For these calculations w was initially
set equal to the constant value of 25. The program was written so as
to reduce w by a factor of 2 whenever improvement was not obtained,
but effective convergence was obtained before this feature was needed.
Figures 9.4 and 9.5 show successive pressure profiles computed from con-

0.7

0.6
0
0.5
1

0.4 2
0.3
3
0.2
Fig. 9.4 Successive approximations to
0.1
the optimal pressure profile using steep 5
descenti, starting from the constant 00 1 2 3 4 5 6 7 8
policy u = 0.5. Residence time t
300 OPTIMIZATION BY VARIATIONAL METHODS

Fig. 9.5 Successive approximations to


the optimal pressure profile using steep
2 3 4 5 6 descent, starting from the constant
Residence time t policy u - 0.2.

stant starting profiles of u = 0.5 and u = 0.2, respectively, with the


vertical scale on the latter expanded to avoid cluttering. The dashed
line is the solution obtained from the necessary conditions by the indirect
techniques discussed previously. The rapid convergence is shown in Fig.
9.6, where the lower points are those for the starting value u = 0.5. The
near-optimal values of the objective, 0.011305 after seven iterations and
0.011292 after four, respectively, compared with an optimum of 0.011318,
are obtained with profiles which differ markedly from the optimum over
the first portion of the reactor. This insensitivity would be an aid in
ultimate reactor design.

Fig. 9.6 Improvement of the objective


I

I
I

2
i

3 4
I

5
function on successive iterations using
Iteration number steep descent.
NUMERICAL COMPUTATION 301

9.11 STEEP DESCENT: OPTIMAL TEMPERATURE PROFILE


As a second example of steep descent we consider the tubular-reactor
optimal-temperature-profile problem solved by Newton-Raphson bound-
ary iteration in Sec. 9.3. The equations for the system are
±1 = -kloe-E,'1ux12 (la)
22 = kloe El'I"xl - k2oe E='I"x2 (lb)
U* < u < u* (2)
s = -c[xl(O) - x1o1 - [x2(e) - x20] (3)

The Green's function equations are then


til = 2xlkloe-E,'1,(y1 - 72) 71(9) = -c (4a)
72 = 72(0)
k2oe-Ei"/°y2
= -1 (4b)

and the decision derivative of the hamiltonian is readily computed as


aH kloEixl2e-E,'tn k2oEZx2e-Ez'I'

au - u2 (71 - 72) u2
y2 (5)

The parameters for computation are the same as those in Sec. 9.3. The
new decision is calculated from the equation

uuew(t) = u(t) - w(t)


a (6)

A reasonable estimate of w(t) may be obtained in a systematic way.


Following the geometric interpretation, we may write, for uniform
weighting,
w(t) = a (7)
[fa (a
If data are stored at N uniform grid points spaced At apart, then,
approximately,
A
w(t) = (tn1
135

) IZ At
In', [ a
or, combining (At)3h into the step size,
S
w(t) = (9)
(tn)]T
In I1 [",
302 OPTIMIZATION BY VARIATIONAL METHODS

If au is to be of the order of one degree in magnitude then


b(aH/au)
N (10)
[IH 2 ;4

or
S N34 (11)
For simplicity b is taken as an integer. Equation (6) is written for
computation as
unew = u - b (t)
ell/au
N lI(in)ls}1
(12)
[_5_U
{ 1,
where b(t) is a constant b, unless Eq. (12) would cause violation of one
of the constraints, in which case b(t) is taken as the largest value which
does not violate the constraint. The constant b is taken initially as the
smallest integer greater than N4 and halved each time a step does not
lead to improvement in the value of S. For the calculations shown here
N = 60, and the initial value of b is 8.
Figure 9.7 shows 'successive iterations startijig from an initial con-
stant policy of u = u* except over the first integration step, where u is
linear between u* and u.. This initial segment is motivated by the

10

6II
4
2

i
I 2
i
3 4 5
i
6
O

Residence time f

Fig. 9.7 Successive approximations to the optimal tem-


perature profile using steep descent, starting from the con-
stant policy u a 335. [From J. M. Douglas and M. M.
Denn, Ind. Eng. Chem., 57(11):18 (1965). Copyright 1965
by the American Chemical Society. Reprinted by permission
of the copyright owner.]
NUMERICAL COMPUTATION 303

355

a
E

340

3351 I l I 1 I

0 1 2 3 4 5 6
Residence time f
Fig. 9.8 Successive approximations to the optimal tem-
perature profile using steep descent, starting from the con-
stant policy u = 355. [From J. M. Douglas and M. M.
Denn, Ind. Eng. Chem., 57(11):18 (1965). Copyright 1965
by the American Chemical Society. Reprinted by permission
of the copyright owner.]

solution to the generalized Euler equation in Sec. 4.12, which requires


an infinite initial slope for profiles not at an upper bound. The method
of choosing the weighting w(t) in Eq. (12) is seen to produce changes in
u(t) of order unity, as desired. The dashed line is the solution obtained
using Newton-Raphson boundary iteration. Successive values of the
profit are shown as the upper line in Fig. 9.9, and it can be seen that
convergence is essentially obtained on the eleventh iteration, though the
temperature profile differs from that obtained using the necessary
conditions.

0
' 0.30

00.25
Fig. 9.9 Improvement of the objec-
tive function on successive iterations
using steep descent. [From J. M.
Douglas and M. M. Denn, Ind. Eng.
Chem., 57(11):18 (1965). Copyright
1965 by the American Chemical Society.
Reprinted by permission of the copy- 2 4 6 8 10 12 14
right owner. ] Iteration number
3" OPTIMIZATION BY VARIATIONAL METHODS

Calculations starting from an initial constant policy u = u* are


shown in Fig. 9.8 and the lower curve in Fig. 9.9, and the conclusions are
similar. The two calculations lead to values of the objective of 0.3236
and 0.3237, respectively, compared to a true optimum of 0.3238. It is
typical of steep descent procedures that convergence is rapid far from
the optimum and very slow near an insensitive minimum or maximum.
Though a frustration in calculation, this insensitivity of the optimum is,
of course, an aid in actual engineering design considerations.

9.12 STEEP DESCENT: OPTIMAL STAGED TEMPERATURES


We can examine the use of steep descent for a staged system by
considering the problem of specifying optimal temperatures in a sequence
of continuous-flow reactors of equal residence time. The reaction
sequence is the one considered for the tabular reactor
X1--+ X2 -+ products
with all rate parameters identical to those in Sec. 9.3. We have con-
sidered this problem in Secs. 1.12 and 7.6, where,. with F quadratic and
G linear, the state equations and objective are written
0 = xIn-1 x1n - 8k1oe-E,'1u"(xln)2 (1a)
0 = x2w-1 - x2" + 8kioe-&,'lu'(x1n)2 - 8k2oe-8'"Ir"x2n (lb)
t; = -c(x1N - x10) - (x2'' - x20) (2)
U* < un < u* (3)
By implicit differentiation the Green's function equations are shown in
Sec. 7.6 to be
.Y,.-I = -/In
1 + 28k10e e,'/u"x1n
28y2nk1oe 8,'l u'xln
+ (1 + 28k1oe 8,'/u"x1")(1 + e,'lue) y1" = -c (4a)

y "
ye-1 = 1 + 8k2oe- J,/4-* 72N = -1 (4b)

The decision derivative of the stage hamiltonian is


aH"
au"
T2"EIk10e 8ilu+(x1")2 - y1#E1kioe-,R,'/v*(X1n)1(1 + ek29e

8 - y2"E=k2oe-a,'l""x2"(1 + 28k1oe-8-'lu"xi%)
(un)2 (1 + 8k2oe-e-'/u")(1 + 28kioe'80"x1")
(5)
Following the specification of a temperature sequence {in I and the
successive solution of Eqs. (1) and Eqs. (4), the new decision is obtained
NUMERICAL COMPUTATION 305

from the algorithm of Eqs. (20) and (23) of Sec. 9.9


aH^/au"
(6)
n NJ )2]½
H

where we have taken the weighting as uniform. If a bound is exceeded


by this calculation, that value of u" is simply moved to the bound, as
in the previous section. The starting value of A is taken as the smallest
integer greater than N" in order to obtain starting-temperature changes
of the order of unity. A is halved each time an improvement does not
occur on an iteration.
The residence time 9 was chosen so that No equals the tubular-
reactor residence time of 6 used in the preceding numerical study.
Figure 9.10 shows the convergence for a three-stage reactor system
(N = 3, 0 = 2) for starting values entirely at the upper and lower bounds.

355

350

U,

u2
E U3

340

335

0 5 10 15
Iteration number
Fig. 9.10 Successive approximations to the optimal tem-
perature sequence in three reactors using steep descent.
[From M. M. Denn and R. Aria, Ind. Eng. Chem. Funda-
mentals, 4:213 (1965). Copyright 1965 by the American
Chemical Society. Reprinted by permission of the copyright
owner.]
306 OPTIMIZATION BY VARIATIONAL METHODS

The calculations in Fig. 9.11 show convergence for a 60-stage reactor


(N = 60, 0 = 0.1) from the lower bounds, with the discrete profile
represented as a continuous curve not only for convenience but also to,
emphasize the usefulness of a large but finite number of stages to approxi-
mate a tubular reactor with diffusive transport. For the latter case 22
iterations were required for convergence, defined as a step size of less
than 10-3.
This process is a convenient one with which to demonstrate the
use of penalty functions for end-point constraints. If we take the
objective as simply one of maximizing the amount of intermediate
g = -x2N (7)

and impose the constraint of 60 percent conversion of raw material


x1N = 0.4 (8)

then the penalty-function approach described in Sec. 1.16 would sub-


stitute the modified objective
= -x21 + %K(x1N -0.4) 2
(9)

In the minimization of 9 the multiplier equations (4) and direction of


steep descent described by Eqs. (5) and (6) do not change, but the

355

335 ___ 1 I I I I I

1 5 10 20 30 40 50 60
Stoge numbern

Fig. v.11 Successive approximations to the optimal tem-


perature sequence in a 60-stage reactor using steep descent,
starting from the constant policy u = 335. [From M. M.
Denn and R. Aria, Ind. Eng. Chem. Fundamentals, 4:213
(1965). Copyright 1965 by the American Chemical Society.
Reprinted by permission of the copyright owner.!
NUMERICAL COMPUTATION 707

boundary conditions on the multipliers do. We now have


aaat
'Y ]N 1N = K(z1N - 0.4) (10a)
= ai;
72
N
- ax2N = - 1 (10b)

Following convergence of the iterations for a given value of the penalty


constant K and a corresponding error in the constraint a new value of
K can be estimated by the relation
x1N - 0.4
Knew _ K°Id tolerance

For these calculations the tolerance in the constraint was set at 10-1.
Figure 9.12 shows the result of a sequence of penalty-function
calculations for N = 60 starting with K = 1. Convergence was obtained
in 17 iterations from the linearly decreasing starting-temperature sequence
to the curve shown as K = 1, with an error in the constraint x1N = 0.4
of 1.53 X 10-1. Following Eq. (11), the new value of K was set at
1.53 X 102, and 107 iterations were required for convergence, with a
constraint error of 8.3 X 10-1. A value of K of 1.27 X 10' was then
used for 24 further iterations to the dashed line in Fig. 9.12 and an error
of 10-1. This extremely slow convergence appears to be typical of the
penalty-function approach and is due in part to the sensitivity of the
boundary conditions for y to small changes in the constraint error, as
may be seen from Eq. (10a).

355

K'I.53 X102
340
5 t0 20 30 40 50 60
Stage number n

Fig. 9.12 Approximations to the optimal temperature


sequence with constrained output using steep descent and
penalty functions. (From M. M. Denn and R. Aris, Ind.
Eng. Chem. Fundamentals, 4:213 (1965). Copyright 1965
by the American Chemical Society. Reprinted by permission
of the copyright owner.]
me OPTIMIZATION BY VARIATIONAL METHODS

9.13 GRADIENT PROJECTION FOR CONSTRAINED END POINTS


By a direct extension of the analysis in Sec. 9.9 we can generalize the
steep-descent procedure to account for constrained end points. We again
consider a system described by the differential equations
z; = f;(x,u) x;(0) = x;o (1)

with bounds on u,
u* <u<u* (2)

The objective is to minimize S[x(6)], but we presume that there is also a


single (for simplicity) constraint on the final state
41 (0)] = 0 (3)

For any function u(t) which lies between the upper and lower bound
we can integrate Eqs. (1) to obtain a value of & and of 0 which will
generally not be zero. The first-order effect of a small change is then
described by the equations

ai; = af' ax, + a' su ax;(0) = 0 (4)

with corresponding first-order changes in g and 0


a , ax,(e)
aa; _
ax,

ax,(9) (6)
ax;

We now define two sets of Green's functions for Eq. (4), y and 4, satis-
fying the adjoint differential equation but different boundary conditions

(7)
axi
af, a11
(8)
ax; ax;

Using Green's identity, Eqs. (5) and (6) will then become, respectively,

SS = fo y; au au dt (9)

a= Io
E ,y; af' su dt (10)
NUMERICAL COMPUTATION

We now choose a distance in the decision space


02 = fo g(t)[Su(t)]2 dt

and ask for the function Su which minimizes M in Eq. (9) for a fixed
distance 0' and a specified correction to the constraint 34. This is an iso-
perimetric problem with two integral constraints, and hence two constant
Lagrange multipliers, denoted by X and is. The Euler equation is

a(Su) I ry afSU af;


au Su + Xg(Su)2 = 0 (12)

or

au - -TX-('Y.+i4)au
where G(t) is the inverse of g(t).
We need to use the constraint equations (10) and (11) to evaluate
X and P. By substituting Eq. (13) for Su into Eq. (10) for the fixed
value 6.0 we obtain
a
-y, T dr

- fo G(r)
aui
2
dr (14)

It is convenient to define three integrals


2

IEE = fa G(r) yi au dT (15a)

IEm = Ja G(r) ti; a'} ( 'G` au' l dr (15b)

I,, = fa G(r) I ', au' dr (15c)

The subscripts denote the appropriate boundary conditions for the


Green's functions used in the integral. Equation (14) then becomes,
slightly rearranged,

P = - Icm + A so
Ioe
(16)

If Eqs. (13) and (16) are now substituted into Eq. (11) for A2, we obtain,
after some algebraic manipulation,
1
TX-
- ± 1IImmlL2 - (6.0)2
1
(17)
L ee - J 2J
310 OPTIMIZATION BY VARIATIONAL METHODS

Finally, then, Eq. (13) for bu becomes

bu = -G(t) I loaf -(I&m2 (y'


e&
1
I## #') au
+ G(t) 1o #i a '` (18)

The presence of the square root imposes an upper limit on the correction
30 which can be sought, and a full correction cannot generally be obtained
in a single iteration. The procedure outlined here is geometrically equiv-
alent to projecting the gradient onto the subspace defined by the con-
straint. Identical equations are obtained for systems with difference
equations if integrals are replaced by sums and continuous variables by
their discrete analogs.
If the gradient-projection procedure is applied to the reactor-tem-
perature problem of the preceding section with ¢(xN) = x1N - 0.4, the
only additional computations required are the recalculation of Eqs. (4)
of that section, substituting 4'1", 412" for yl", y2", with boundary conditions
1, 0 on *N and 0, -1 on yN, followed by calculation of the sums Is., Is*f
and Ii,. Computations were carried out using a maximum correction
60 of 0.01 in magnitude for each iteration. Lines I and II in Fig. 9.13

0.25

0.20

0.15
0

00.10

Fig. 9.23 Approach to final value con-


straint using steep descent and gradi-
w ent projection. [From M. M. Denn
-0.05
and R. Aris, Ind. Eng. Chem. Funda-
mentals, 4:213 (1965). Copyright 1965
by the American Chemical Society. Re-
-0.10 11 1 1 1 1 1

printed by permission of the copyright


0 5 10 15 20 25 30
Iteration number owner.]
NUMERICAL COMPUTATION 311

show the convergence of the constraint for N = 3. 0 = 2 from starting


policies at the lower and upper bounds, respectively. The correction is
seen to be linear. Line III shows the convergence for the 60-stage sys-
tem, N = 60, 0 = 0.1, from the starting policy in Fig. 9.12. Twenty-
two iterations were required here for convergence to the same optimal
policy as that shown in Fig. 9.12 for K = 1.27 X 103. In all cases the
final error in the constraint was less than 10-5.

9.14 MIN H
Steep descent provides a rapidly convergent method of obtaining approxi-
mate solutions to variational problems, but it is evident from the examples
that ultimate convergence to the solution of the minimum-principle
necessary conditions is slow and computationally unfeasible. The min-H
procedure is one which can be used only in the latter stages of solution
but which will, with only minor modifications of the computer code,
lead to a solution satisfying the necessary conditions.
The system and multiplier equations are
x; = f;(x,u) x;(0) = x;o (1)

y; - - Yi
ax,
y;(0)
ax;
(2)

For a given decision function f4(t) the solutions to Eqs. (1) and (2) are
denoted as Z(t), Y(t). Writing the hamiltonian in terms of these latter
variables, we have

(3)

The new value of u is chosen so that H is minimized. In that way


convergence is forced to the solution of the necessary conditions. In
practice severe oscillations and divergence may occur if the full correc-
tion is used, so that if we denote the function obtained by minimizing H
in Eq. (3) as il, the new value of u is obtained from the relation

(4)
r
A value of r = 2 has generally been found to be satisfactory. Modi-
fications similar to those in Sec. 9.13 can be made to accommodate
end-point constraints.
The relation to steep descent may be observed by considering the.
special case for which the minimum of H occurs at an interior value.
312 OPTIMIZATION BY VARIATIONAL METHODS

Then the minimum, t!, satisfies the equation

5
y' au
This can be expanded about u for a Newton-Raphson solution as
`Jt
at;(31,a
au + L. ry`
a2};(2,u)
au2
0 - u) + .. . =0 (6)
a

and if the higher-order terms can be neglected, the min-H procedure is


equivalent to
Ca2H J aH
uOCM u- au2J au
(7)

That is, we get nearly the same result as a steep-descent procedure with
the inverse of a2H/au2 as the weighting factor w. At the minimum the
hessian of H is positive, and so Eq. (7) will allow convergence, and this
must be true by continuity arguments in a neighborhood of the minimum.
For staged systems we have no strong minimum principle in general, and
a2H/au2 need not be positive near the optimum, so that this procedure
might lead to divergence arbitrarily close to the minimum in a staged
system.
As an example of the convergence properties of the min-H algorithm
we consider the problem of the optimal tubular-reactor temperature
profile for the consecutive reactions.The state and multiplier equations
(1) and (4) of Sec. 9.11 are solved sequentially to obtain xl, 22, tit, tit
for the specified u. Using the necessary conditions in Sec. 9.3, we then
find il from the relation
u* u(t) > u*
tZ = v(t) u* < v(t) < u* (8)
U* u(t) < u*
with
u (t)
E_ - E; (9)
12 x 2 k2o
In
(tit - 'Y1)xl2klo
For these computations the initial profile was taken as the constant
u = 345 and r was set at 2. The results are shown in Table 9.9, where e
is the integral of the absolute value of the difference between old and
new temperature. The stabilizing factor r = 2 clearly slows convergence
for the small values of t. Convergence is not uniform and, indeed, S is
not uniformly decreasing, but very rapid convergence to a solution
satisfying the necessary conditions is obtained in this way. Numerical
Table 9.9 Successive approximations to optimal temperature profile using the min-H method

Iteration Number

t 0 1 2 4 6 8 10 12 14

0 345.00 350.00 352.50 354.38 354.84 354.96 354.99 355.00 3:15.00


0.5 345.00 350.00 352.50 354.38 354.84 354.96 354.99 355.00 335.00
1 345.00 350.00 349.88 349.20 348.86 348.76 348.70 348.74 348.74
2' 345.00 345.76 344.61 344.19 344.15 344.16 344.16 344.16 344.16
3 345.00 343.89 342.50 342.10 342.11 342.12 342.13 342.13 342.13
4 345.00 342.94 , 341.36 340.89 340.89 340.90 340.91 340.91 340.91
5 345.00 342.38 340.67 340.09 340.06 340.07 340.07 340.07 340.07
6 345.00 342.05 340.21 339.52 339.46 339.46 339.46 339.46 339.46

x,(6) 0.4147 0.4088 0.4186 0.4196 0.4196 0.4196 0,4196 0.4196 0.4196
x,(6) 0.4952 0.4999 0.4977 0.4973 0.4974 0.4974 0.4974 0.4974 0.4974
-s; 0.3196 0.3225 0.3233 0.3232 0.3233 0.3233 0.3233 0.3233 0.3233
29.7 18.1 4.3 0.70 0.29 0,086 0.022 0.0054 0.0013
314 OPTIMIZATION BY VARIATIONAL METHODS

values differ somewhat from those indicated in earlier calculations for


this system because of the use of a different numerical-integration
procedure and a constant value of u over an integration step instead of
using interpolation.

9.15 SECOND-ORDER EFFECTS

In looking at the convergence properties of steep descent for problems in


differential calculus in Sec. 2.8 we found that by retaining second-order
terms in the steep-descent analysis we were led to a relation with quad-
ratic convergence which was, in fact, equivalent to the Newton-Raphson
solution. We can adopt the same approach for the variational problem
in order to find what the pertinent second-order effects are.
It helps to examine the case of a single state variable first. We
have
x = J(x,u) (1)

and we seek to minimize 6[x(9)]. If we choose u(t) and expand Eq. (1)
to second order, we have

bx-axbx+aubu+ax2bx2+axaubxbu+au bu2
bx(0) = 0 (2)
The corresponding second-order change in E is
Sa; = g' bx(9) + 3E" 5x(9)2 (3)

where the prime denotes differentiation. Our goal is to choose bu so


that we minimize bs. The quadratic nature of the problem prevents an
unbounded solution, and it is not necessary to specify a step size in the
decision space.
Equations (2) and (3) define a variational problem of the type we
have studied extensively. Here the state variable is ax and the decision
Su, and we wish to choose bu to minimize a function [Eq. (3)] of bx(9).
The hamiltonian, which we denote by h, is
of of 1
'92f a2f 1 '92f
h
ax
bx
+ au Su + 2 axe bx2 + ax au bx bu + 2 5u2 Sue (4)
where the Green's function is denoted by ¢. For unconstrained bu, then,
we can set ah/8(bu) to zero to obtain
af2J
bu = -
GOf
aJ u2) au + ax 8)
The Newton-Raphson approximation to min H, Eq. (7) of Sec. 9.14,
(5)

would correspond to Eq. (5) without the term a'f/(ax au) Ox, so that
NUMERICAL COMPUTATION 31S

min H is clearly not the complete second-order correction for' steep


descent.
We can carry the analysis somewhat further. The first-order solu-
tion to Eq. (2) is
bx =
ft exp I j ax (Q) da] au (r) au(,) dr
{
(6)

Equation (5) can then be written as a Volterra-type integral equation


for the correction Bu(t)

Bu(t) + (L\
auz_I ax a 1o exp Cllr ax (o)] au (r) bu(r) dr
__ a2f -1 of (7)
au2> au
There is no computational advantage in doing so, however, for the solu-
tion of an integral equation is not, an easy task and Eq. (5), although it
contains the unknown ox, can be used directly. Noting that ax is simply
x - x, it follows that the best second-order correction bu is a known
explicit function of x, which we denote as bu(x). Equation (1) can then
be written
x = fix, a + bu(x)] (8)

which can be integrated numerically in the usual fashion. The new


value of u is then constructed from Eq. (5).

9.16 SECOND VARIATION


A complete second-order approach to the numerical solution of vari-
ational problems requires that we generalize the result of the preceding
section to a system of equations. We consider
i, = fi(x,u) (1)
with u(t) to be chosen to minimize s(x(e)]. We shall assume x(O) to be
completely unspecified, though the extension to constrained final values
can be carried out. Unlike the quasi-second-order min-H procedure, we
must assume here that u is unconstrained. Fdr notational convenience
we shall use the hamiltonian formulation in what follows, so that we define

H= Yf, (2)

(3)
316 OPTIMIZATION BY VARIATIONAL METHODS

Now, if we expand Eq. (1) about some choice u(t) and retain terms
to second order, we obtain
a2 =
64 = ax;
aui 6U + ,-,
OXJ OX
ax; axk +
8x,
t9u ox, Ou
).k
1 2f,
+ a au= azi(o) = 0 (4)

The second-order expansion of the objective is


z8
as = a axt(e) + - ax ax;
ax1(a) az;(B) (5)

This defines a variational problem, for we must choose bu(t), subject to


the differential-equation restriction, Eq. (4), so that we minimize SE in
Eq. (5). Denoting the hamiltonian for this subsidiary problem by h and
the multiplier by 4, we have
z
44 ax` ax; ax; axk
0 ,
+ I.C
i
u au + ij.k
axaxk
'
a=f; - 1 a=fi
ax; au + 2- au= (6)
+ ax1 au au=

C
_ - a(ax;)
A _
Y'`
af ;
ax; - /C, f axk
k Y' ax; axk
a ,
.

4'i
aft
2

ax; au
alb

(7)
a(as) aE
7( ) a(ax;) = ax; + k ax, axk
12-'8- axk (8)

By assuming that u is unconstrained we can allow au to take on any


value, so that the minimum of h is found by setting the derivative with
respect to au to zero
ah cc of i a=f; c a=f i
au = o
a(bu)
i
au + jJ ax, au ax' + 1
t
V"' =
(9)

or
a=ft 4'i a=f
su = - auz I au
aft + id ax au s
ax; j (10)
i

There are notational advantages to using the hamiltonian and


multiplier for the original problem, defined by Eqs. (2) and (3). We
define 57 by

5Yi=4',-Y; (11)
NUMERICAL COMPUTATION 9vi

From Eqs. (3) and (7),


a2H a2f;
axk
k az; axk
axk -
;k
Sy;
ax; axk

a2H a2f
ax; au
au - ay;
ax; au
au (12)

while the boundary condition is obtained from Eqs. (3) and (8)
I a2& I a2&
Sy;(B) = axk - ax; (13)
ax; + ax; axk k ax, axk 6xk(9)

Equation (10) for au becomes


a2H
I
2

+G ax
uLfi

au
Lfi ) au ay` auSx'
1

'
aax;2fau ax;) (14)
+ I.1ay`
We require a solution to the system of Eqs. (1) and (12) to (14) which
can be implemented for computation without the necessity of solving a
boundary-value problem.
The fact that we have only quadratic nonlinearities suggests the
use of the approximation technique developed in Sec. 4.8. We shall
seek a solution of the form
ay;(t) = g;(t) + I M;k(t) axk(t) + higher-order terms (15)
k

The functions g; and M;k must be obtained, but it is evident from Eq. (13)
that they must satisfy the final conditions
g,(8) = 0 (16a)
2

M'k(B) (16b)
= ax, axk
We can also express Eqs. (4), (12), and (14) to comparable order as

ax; _ I az` ax; + au` au + higher-order terms (17)

I
:
a; a2
S' au
y' ay` ax; - ax; axk axk - ax; au
+ higher-order terms (18)
au
(a2H)-l
aH + ayt aJ; + a2H
au au au ax; au
-- higher-order terms (19)
311 OPTIMIZATION BY VARIATIONAL METHODS

Substituting Eq. (15) into Eqs. (18) and,.(19), we obtain


{ f
a Yi = - 9i
ik
M". ax, afi
ax; - I ax; axk
k
a_
axk

612H ,2H)-1t0H + I 9s afi afi


ax; au au au i au + Ii,kMik axk au
(20)
azk)
+ axk au

On the other hand it must be possible to differentiate Eq. (15) with respect
to t and write
aii=#;+I M;kaxk+I mo,61k (21)
k k

or, substituting from Eqs. (15) and (17) to (19),

ox; _ +M;kaxk+\'M;k afk axi


k k,i
ax;
=-1 /aH afi afi
M",LA
au(` 9i au + M`' axp
+ k
au au + au
I .32H
+- axi (22)
i axi au
The coefficients of (&x)° and (6x)1 in Eqs. (20) and (22) must be the
same, leading to the equations
(8211)_I a2H afk
Mii + [r afk afk

k axi au= au au axi I Mki + Mik axj

- (!)'
a=H Ma=H 1
of
au=au au ax; -( k 'k au) (au=> ( au'i
M
aH
a2 H aH aH _ -1 aH
2
=0 Mi;(9) =
a8
:
(23)

gi
+ azi ax;
_ (
L
k
axi au au=
afk (a=H\-' at; _ af,
Mik au au= J au ax; + au=
axi au
af, a=H 1
axi ax;

au au axi J 9i
of 0H
H 82H -1 aH
a=H-1 aH
0 gi(9) = 0
au (au=) au - axi au au=) au -
(24)

Equation (23) for the "feedback gain" is simply the Riccati equation
obtained for the optimal control problem in Sec. 8.2, and Mi; is symmetric
NUMERICAL COMPUTATION 31!

(M;; = M;;). As convergence occurs and 8H/au goes to zero, the forcing
term in Eq. (24) also goes to zero and the "feedforward gain" vanishes.
The correction to the decision, bu(t), is then obtained from Eqs. (19)
and (15) as
a2 /-1 M
bu(t) = au / + g' aut
\
+ au' M,:1 (x: _f:) (25)

where x; is the (as yet unknown) value of the state corresponding to the
new decision function.
The computational algorithm which arises from these equations
is then as follows:

1. Choose u(t), solve Eqs. (1) for x and then (3) for 7 in succession, and
evaluate all partial derivatives of H.
2. Solve Eqs. (23) and (24), where all coefficients depend upon 2(t),
!(t), and Y(t).
3. Compute u(t) from the expression

unew(x,t) = t(t) + T
bu(t) (26)

where bu is defined by Eq. (25) and r is a relaxation parameter,


r > 1. Note that unew is an explicit function of x.
4. Solve Eq. (1) in the form
x: _ filx,unew(x,t)J (27)

simultaneously calculating u(t), and repeat until convergence is


obtained.

It is interesting to note that after convergence is obtained, Eq. (25)


provides the feedback gains for optimal linear control about the optimal
path.
The second-variation method was applied to the optimal-pressure-
profile problem studied several times earlier in this chapter. There are
two state variables, hence five auxiliary functions to be computed,
M11, M12, M22, g1, 92- We shall not write down the cumbersome set
of specific equations but simply present the results for an initial choice
of decision function u(t) = const = 0.2, one of the starting values used
for steep descent, with r = 1. Solutions to the Riccati equation are
shown in Fig. 9.14, where differences beyond the second iteration were
too small to be seen on that scale. From the first iteration on, the
3m OPTIMIZATION BY VARIATIONAL METHODS

3 4 5 6
Residence time t
Fig. 9.14 Successive, values of feedback gains using the
second-variation method.

Table 9.10 Successive approximations to optimal pressure profile


and outlet concentrations using the second-variation method

Iteration number

t 0 1 2 4 5

0 0.2000 1.1253 0.5713 0.7248 0.7068 0.7174


0.2 0.2000 0.8880 0.4221 0.3944 0.3902 0.3902
0.4 0.2000 0.4914 0.3527 0.3208 0.3194 0.3196
0.8 0.2000 0.3846 0.3014 0.2798 0.2786 0.2786
0.8 0.2000 0.3189 0.2888 0.2529 0.2522 0.2521
1.0 0.2000 0.2749 0.2421 0.2337 0.2332 0.2332
2.0 0.2000 0.1743 0.1819 0.1829 0.1829 0.1829
3.0 0.2000 0.1358 0.1566 0.1590 0.1590 0.1590
4.0 0.2000 0.1169 0.1421 0.1442 0.1442 0.1442
5.0 0.2000 0.1082 0.1324 0.1339 0.1339 0.1339
6.0 0.2000 0.1056 0.1251 0.1260 0.1260 0.1280
7.0 0.2000 0.1086 0.1191 0.1198 0.1198 0.1198
8.0 0.2000 0.1092 0.1140 0.1148 0.1148 0.1147

x (8) 1 3-3338 X 10-1 1 3.6683 X 10-1 3.&309 X 10-1 3.8825 X 10-1 3.8624 X 10-+ 3.8622 X 10-1
z 1(8) 1.0823 X 10-1` 1.1115 X 10-1 1.1315 X 10-1 1.1319 X 10-1 1.1319 X 10-1 1.1319 X 10-1
NUMERICAL COMPUTATION 321

functions gl and 92 were at least one order of magnitude smaller than


-y, and 72, to which they act as correction terms, and could have been
neglected without affecting the convergence. Successive pressure pro-
files are given in Table 9.10, where, following some initial overshoot
resulting from using r = 1, the convergence is extremely rapid. With
the exception of the value right at t = 0 the agreement is exact with
solutions obtained earlier using the necessary conditions.

9.17 GENERAL REMARKS


Though we have not enumerated every technique available for the
numerical solution of variational problems, this sketch of computational
methods has touched upon the most important and frequently used
classes of techniques. By far the most reliable is the method of steep
descent for improving functional values of u. From extremely poor
initial estimates convergence to near minimal values of the objective are
obtained by a stable sequence of calculations. As a first-order method,
however, it generally demonstrates poor ultimate convergence to the
exact solution of the necessary conditions.
All other methods require a "good" initial estimate of either bound-
ary values or one or more functions. Such estimates are the end'product
of a steep-descent solution, and a rational computational procedure would
normally include this easily coded phase I regardless of the ultimate
scheme to be used. The final calculational procedure, if any, must
depend upon the particular problem and local circumstances. Min H,
for example, is more easily coded than second variation, but it lacks the
complete second-order convergence properties of the latter. Further-
more, if second derivatives are easily obtained-and if a general code for
integrating the Riccati equation is available, the coding differences are
not significant. On the other hand, if u(t) is constrained, second vari-
ation simply cannot be used. Similarly, steep-descent boundary iter-
ation is attractive if, but only if, a highly reliable automatic routine of
function minimization without calculation of derivatives is available,
while all the indirect methods require relative stability of the equations.

BIBLIOGRAPHICAL NOTES
Section 9.1: We shall include pertinent references for individual techniques as they are
discussed. We list here, however, several studies which parallel all or large parts
of this chapter in that they develop and compare several computational procedures:
R. E. Kopp and H. G. Moyer: in C. T. Leondes (ed.), "Advances in Control Sys-
tems," vol. 4, Academic Press, Inc., New York, 1966
L. Lapidus: Chem. Eng. Progr., 63(12):64 (1967)
L. Lapidus and R. Luus: "Optimal Control of Engineering Processes," Blaisdell
Publishing Company, Waltham, Mass., 1967
322 OPTIMIZATION BY VARIATIONAL METHODS

A. R. M, Noton: "Introduction to Variational Methods in Control Engineering,"


Pergamon Press, New York, 1965
B. D. Tapley and J. M. Lewallen: J. Optimization Theory Appl., 1:1 (1967)
Useful comparisons are also made in papers by Storey and Rosenbrock and Kopp,
McGill, Moyer, and Pinkham in
A. V. Balakrishnan and L. W. Neustadt: "Computing Methods in Optimization
Problems," Academic Press, Inc., New York, 1964
Most procedures have their foundation in perturbation analysis for the computation of
ballistic trajectories by Bliss in 1919 and 1920, summarized in
G. A. Bliss: "Mathematics for Exterior Ballistics," John Wiley & Sons, Inc., New
York, 1944

Sections 9,2 and 9.3: The development and example follow


M. M. Denn and R. Aria: Ind. Eng. Chem. Fundamentals, 4:7 (1965)
The scheme is based on a procedure for the solution of boundary-value problems without
decision variables by
T. R. Goodman and G. N. Lance: Math. Tables Other Aids Comp., 10:82 (1956)
See also
J. V. Breakwell, J. L. Speyer, and A. E. Bryson: SIAM J. Contr., 1:193 (1963)
R. R. Greenly, AIAA J., 1:1463 (1963)
A. H. Jazwinski: AIAA J., 1:2674 (1963) -

AIAA J., 2:1371 (1964)


S. A. Jurovics and J. E. McIntyre: ARS J., 32:1354 (1962)
D. K. Scharmack: in C. T. Leondes (ed.), "Advances in Control Systems," vol. 4,
Academic Press, Inc., New York, 1966

Section 9.4: Little computational experience is available for th4, rather obvious approach.
Some pertinent remarks are contained in the reviews of Noton and .Storey and
Rosenbrock cited above; see also
J. W. Sutherland and E. V. Bohn: Preprints 1966 Joint Autom. Contr. Conf., Seattle,
p. 177
Neustadt has developed a different steep-descent boundary-interation procedure for
solving linear time-optimal and similar problems. See papers by Fndden and
Gilbert, Gilbert, and Paiewonsky and coworkers in the collection edited by Bala-
krishnan and Neustadt and a review by Paiewonsky,
B. Paiewonsky: in G. Leitmann (ed.), "Topics in Optimization," Academic Press,
Inc., New York, 1967

Sections 9.5 to 9.7: The Newton-Raphson (quasilinearization) procedure for the solution
of boundary-value problems is developed in detail in
R. E. Bellman and R. E. Kalaba: "Quasilinearization and Nonlinear Boundary-
value Problems," American Elsevier Publishing Company, New York, 1965
NUMERICAL COMPUTATION 323

Convergence proofs may be found in


R. E. Kalaba: J. Math. Mech., 8:519 (1959)
C. T. Leondes and G. Paine: J. Optimization Theory Appl., 2:316 (1968)
It. McGill and P. Kenneth: Proc. 14th Intern. Astron. Federation Congr., Paris, 1963
The reactor example is from
E. S. Lee: Chem. Eng. Sci., 21:183 (1966)
and the calculations shown were done by Lee. There is now an extensive literdture of
applications, some recorded in the book by Bellman and Kalaba and in the reviews
cited above. Still further references may be found in
P. Kenneth and R. McGill: in C. T. Leondes (ed.), "Advances in Control Systems,"
vol. 3, Academic Press, Inc., New York, 1966
E. S. Lee: AIChE J., 14:467 (1968)
--: Ind. Eng. Chem. Fundamentals, 7:152, 164 (1968)
R. McGill: SIAM J. Contr., 3:291 (1965)
--- and P. Kenneth: AIAA J., 2:1761 (1964)
C. H. Schley and I. Lee: IEEE Trans. Autom. Contr., AC12:139 (1967)
It. J. Sylvester and F. Meyer: J. SIAM, 13:586 (1965)
The papers by Kenneth and McGill and McGill use penalty functions to incorporate
state and decision constraints.

Section 9.8: Pertinent comments on computational effort are in


R. E. Kalman: Proc. IBM Sci. Comp. Symp. Contr. Theory Appl., While Plains, N.Y.,
1966, p. 25.

Sections 9.9 to 9.13: The idea of using steep descent to solve variational problems origi-
nated with Hadamard; see
R. Courant: Bull. Am. Math. Soc., 49:1 (1943)
The first practical application to a variational problem appears to be in
J. H. Laning, Jr., and R. H. Battin: "Random Processes in Automatic Control,"
McGraw-Hill Book Company, New York, 1956
Subsequently, practical implementation was accomplished, independently about 1960 by
Bryson, Horn, and Kelley; see

A. E. Bryson, W. F. Denham, F. J. Carroll, and K. Mikami: J. Aerospace Sci.,


29:420 (1962)
F. Horn and U. Troltenier: Chem. Ing. Tech., 32:382 (1960)
H. J. Kelley: in G. Leitmann (ed.), "Optimization Techniques with Applications to
Aerospace Systems," Academic Press, Inc., New York, 1962
The most comprehensive treatment of various types of constraints is contained in

W. F. 1)enham: Steepest-ascent Solution of Optimal Programming Problems, Raytheon


Co. Rept. BR-2393, Bedford, Mass., 1963; also Ph.D. thesis, Harvard University,.
Cambridge, Mass., 1963
324 OPTIMIZATION BY VARIATIONAL METHODS

Some of this work has appeared as


W. F. Denham and A. E. Bryson: AIAA J., 2:25 (1964)
An interesting discussion of convergence is contained in
D. E. Johansen: in C. T. Leondes (ed.), "Advances in Control Systems," vol. 4,
Academic Press, Inc., New York, 1966
Johansen's comment& on convergence of singular controls are not totally in agreement with
our own experiences. The formalism for staged systems was developed by Bryson,
Horn, Lee, and by Denn and Aris for all classes of constraints; see
A. E. Bryson: in A. G. Oettinger (ed.), "Proceedings of the Harvard Symposium on
Digital Computers and Their Applications," Harvard University Press, Cam-
bridge, Mass., 1962
M. M. Denn and R. Aris: Ind. Eng. Chem. Fundamentals, 4:213 (1965)
E. S. Lee: Ind. Eng. Chem. Fundamentals, 3:373 (1964)
F. Horn and U. Troltenier: Chem. Ing. Tech., 35:11 (1963)
The example in Sec. 9.11 is from
J. M. Douglas and M. M. Denn: Ind. Eng. Chem., 57(11):18 (1965)
while that in Secs. 9.12 and 9.13 is from the paper by Denn and Aris cited above. Other
examples of steep descent and further references are contained in these papers and
the reviews. An interesting use of penalty functions is described in
L. S. Lasdon, A. D. Waren, and R. K. Rice: Preprints 1967 Joint Autom. Cont. Conf.,
Philadelphia, p. 538

Section 9.14: The min-H approach was suggested by Kelley, in the article cited above,
and by
S. Katz: Ind. Eng. Chem. Fundamentals, 1:226 (1962)
For implementation, including the extension to systems with final constraints, see
R. G. Gottlieb: AIAA J., 5:(1967)
R. T. Stancil: AIAA J., 2:I365 (1964)
The examination of convergence was-in
M. M. Denn: Ind. Eng. Chem. Fundamentals, 4:231 (1965)
The example is from
R. D. Megee, III: Computational Techniques in the Theory of Optimal Processes,
B. S. thesis, University of Delaware, Newark, Deli, 1965

Sections 9.15 and 9.16: The basic development of a second-vao'iatioh procedure is in


H. J. Kelley, R. E. Kopp, and H. G. Moyer: Progr. Astron. Aero., 14:559 (1964)
C. W. Merriam: "Optimization Theory and the Design of Feedback Control Sys-
tems," McGraw-Hill Book Company, New York, 1964
The second variation is studied numerically in several of the reviews cited above; see also
D. Isaacs, C. T. Leondes, and R. A. Niemann: Preprints 1966 Joint Autom. Contr.
Conf., Seattle, p. 158
NUMERICAL COMPUTATION 325

S. R. McReynolds and A. E. Bryson: Preprints 1965 Joint Autom. Contr. Conf.,


Troy, N.Y., p. 551
S. K. Mitter: Automatica, 3:135 (1966)
A. R. M. Noton, P. Dyer, and C. A. Markland: Preprints 1966 Joint Autom. Contr.
Conj., Seattle, p. 193
The equations for discrete systems have been obtained in
F. A. Fine and S. G. Bankoff: Ind. Eng. Chem. Fundamentals, 6:293 (1967)
D. Mayne: Intern. J. Contr., 3:85 (1966)
Efficient implementation of the second-variation technique requires an algorithm for solv-
ing the Riccati equation, such as the computer code in
R. E. Kalman and T. S. Englar: "A User's Manual for the Automatic Synthesis
Program," NASA Contractor Rept. 475, June, 1966, available from the Clearing-
house for Federal Scientific and Technical Information, Springfield, Va. 22151.

PROBLEMS
9.1. Solve the following problem by each of the, methods of this chapter.If a computer
is not available, carry the formulation to the point where a skilled programmer with
no knowledge of optimization theory could code the program. Include a detailed
logical flow sheet.
2
i1 = - arctan u - x1
x
is = xI - x:
is = x2 - xa
x1(0) = 0 x2(0) _ -0.4 xa(0) = 1.5
min & = 103 1(2x:)sa + xa= + 0.01u'1 dt

Take a = 2 and 10. The latter case represents a penalty function approximation
for Ix,! < 5J. The arctangent is an approximation to a term linear in u with the
constraint Jul < 1. (This problem is due to Noton, and numerical results for some
cases may be found in his book.) Repeat for the same system but with
i1 -U-x1
Ju l < 1
9.2. Solve Prob. 5.8 numerically for parameter values
k1=ka=1 k2 -10
9-0.4 and 9-1.0
Compare with the analytical solution.
9.3. Solve Prob. 7.2 using appropriate methods of this chapter. (Boundary iteration
is difficult for this problem. Why?)
9.6. Develop an extension of each of the algorithms of t%is chapter to the case in
which a is specified only implicitly by a final constraint of the form
#(x(e)l = 0
Note that it might sometimes be helpful to employ a duality of the type described in
Sec. 3.5.
10
Nonserial Processes

10.1 INTRODUCTION
A large number of industrial processes have a structure in which the
flow of material and energy does not occur in a single direction because
of the presence of bypass and recycle streams. Hence, decisions made
at one point in a process can affect the behavior at a previous point.
Though we have included the spatial dependence of decisions in processes
such as the plug-flow reactor, our analyses thus far have been couched
in the language and concepts of systems which evolve in time. For
such systems the principle of causality prevents future actions from
influencing the present, and in order to deal with the effect of feedback
interactions in spatially complex systems we shall have to modify our
previous analyses slightly.
The optimization of systems with complex structure involves only
a single minor generalization over the treatment of simple systems in
that the Green's functions satisfy a different set of boundary conditions.
There has been some confusion in the engineering literature over this
326
NONSERIAL PROCESSES 327

problem, however, and we shall proceed slowly and from several different
points of view. The results of the analysis will include not only the
optimization of systems with spatial interactions but also, because of
mathematical similarities, the optimal operation of certain unsteady-state
processes.

10.2 RECYCLE PROCESSES


We can begin our analysis of complex processes by a consideration of the
recycle system shown in Fig. 10.1. A continuous process, such as a
tubular chemical reactor, is described by the equations
z; = f;(x,u) 0<t<0 (1)
where the independent variable t represents length or residence time.
The initial state x(0) is made up of a mixture of a specified feed x1 and
the effluent x(O) in the form
x;(0) = G,[xf,x(8)] (2)
The goal is to choose u(t), 0 < t < 0, in order to minimize a function
of the effluent, 6[x(8)).
We carry out an analysis here identical to that used in previous
chapters. A function u(t) is specified, and Eqs. (1) and (2) are solved
for the corresponding 8(t). We now change u(t) by a small amount
Su(t) and obtain first-order variational equations for &x
s
Ox, = 1 afi 0x, + a
ax,
usu ` (3)
1-I

The corresponding first-order change in the mixing boundary condition,


Eq. (2), is
s
aG,
0x;(0) = I ax;(e) 0x1(8) (4)
f-I

Green's identity can be written for the linear equations (3) as

Y6) Ox,(B) _ y:(0) 0x,(0) +ofI 'Y; a i Su dt (5).


ial i-I :.1

x(0) x(8)
L> x(0)-G[x,,x(8) x=f(x,u)
t
Fig. 10.1 Schematic of a continuous recycle process.
32i OPTIMIZATION BY VARIATIONAL METHODS

where the Green's functions satisfy the equations

0<t<0 (6)
y`
ill ax;

Following substitution of Eq. (4), Green's identity, Eq. (5), becomes

NO) ax;(8) = 1 NO
[I aG, ax;(B) ] +fp y: a ' au dt
0 (0)
(7)

or

aG ; to aH
I y:(B) - l 70) axi (e)] ax; (9) =
fo au
su do (8)

In Eq. (8) we have substituted the usual hamiltonian notation


s
H= Iyif: (9)
i-l
The first-order change in the objective, g[x(e)j, as a result of the
change su in decision, may be written
s
S& _ axi(B) (10)
a x;

This can be related explicitly to su by means of Eq. (8), for if we write


S

y' (o) ax19) az


(11)

Eq. (8) becomes

afi
- fo au au dt (12)

If a is the optimal decision, the weak minimum principle follows immedi-


ately from Eq. (12) by a proof identical to those used previously. The
only change resulting from the recycle is that in the pair of boundary
conditions, Eqs. (2) and (11).
An identical analysis can be carried out for a sequence of staged
operations with recycle of the type shown in Fig. 10.2. The state is
described by difference equations
x:^ = f;"(x^-',u^) n = 1, 2, . . . , N (13)

where the input to the first stage is related to the effluent x' and feed
NONSERIAL PROCESSES 32!

if xa= G(xr, x")


x0
x,=If t(xa,u')
I
x2=f2(x1,u2)

2
N=fN(xN-t,uN)
N
N

Fig. 10.2 Schematic of a staged recycle process.

x, by the equation
xie = G,(xi,xN) (14)

and the objective is the minimization of S(XN). The stage hamiltonian


is defined by
s
Hn = yinfin (15)

with Green's functions satisfying


s r
afire
y; n fz = 1, 2, , N (16)
i-1

, Q,, _ aS
(17)
ysN ^fj ax;N ax:N
i-t

As previously, only a weak minimum principle is generally satisfied.


For continuous recycle systems a strong principle is proved in the usual
way.

10.3 CHEMICAL REACTION WITH RECYCLE


We can apply the minimum principle to an elementary recycle problem
by considering the process shown in Fig. 10.3. The irreversible reaction
X -* Y

Choose temperotur profile


Feed X I Product Y
Mixer Reactor X -Y Separator

Unreocted X

Fig. 10.3 Schematic of a reactor with recycle of unreacted feed.


330 OPTIMIZATION BY VARIATIONAL METHODS

is carried out in a tubular reactor, after which unreacted X is separated


and recycled. We wish to find the temperature profile in the reactor
which will maximize conversion less the cost of operation.
The concentration of X is denoted by x, and the temperature by u.
With residence time as independent variable the governing equation
for the reactor is then
x, _ -k(u)F(x,) 0<t<0 (1)

The mixing condition is


x1(0) _ (1 - p)xj + px,(B) (2)

where p is the fraction of the total volumetric flow which is recycled.


The cost of operation is taken as the heating, which may be approxi-
mated as proportional to the integral of a function of temperature.
Thus, we seek to maximize
(P = x1(0) - x,(0) - fo g[u(t)] dt (3)

or, equivalently,
= fo [k(u)F(x1) - g(u)] dt (4)

This is put into the required form by defining a variable x2,


22 = -k(u)F(x1) + j(u) X2(0) = 0 (5)

Then we wish to minimize


S = x2(8) (6)

The hamiltonian is
H = -y,k(u)F(x,) + y2[-k( u)F(x1) + 9(u)] (7)

with multiplier equations

aH = (y1 + 'v2)k(u)F'(x1) (8a)

aH
y2° -ax2=0 (8b)

The prime denotes differentiation with respect to the argument. The


boundary conditions, from Eq. (17) of the preceding section, are
y,(8) = Py,(0) (9a)
72(0) = 1 (9b)

Equations (Sb) and (9b) imply that 72 = 1. If the optimum is taken to


NONSERIAL PROCESSES 331

be interior, then
aH = 0 = - (y, + 1)F(xi)k'(u) + g'(u) (10)
au
or
g'(u) = (y, + 1)F(xi) (11)
k' (u)
It is convenient to differentiate both sides of Eq. (11) with respect
to t to obtain
d
g(u) = 1,F(x,) + (71 + 1)F'(x,)x,
Wt k,( u )
_ (y, + 1)k(u)F'(x1)F(x,) + (y, + 1)F'(x,)[-k(u)F(x,)]
=0 (12)

Thus, g'(u)/k'(u), a function only of u, is a constant, so that if the equa-


tion g'(u)/k'(u) = const has a unique solution, the optimal temperature
profile must be a constant. The problem then reduces to a one-dimen-
sional search for the single constant u which maximizes (Y.
For definiteness we take F(xi) = xl. Then Eqs. (1) and (2) can
readily be solved for constant u and substituted into Eq. (3) to obtain
p)(1 - e-k( )
xf(1

The maximization of 61 with respect to u can be carried out for given


values of the parameters and functions by the methods of Chap. 2.

10.4 AN EQUIVALENT FORMULATION


The essential similarity between the results for recycle systems obtained
in See. 10.2 and those for simple straight-chain systems obtained pre-
viously in Chaps. 4 to 7 suggests that the former might be directly deriv-
able from the earlier results by finding an equivalent formulation of the
problem in the spirit of Sec. G.G. This can be done, though it is awkward
for more complex situations. We shall carry out the manipulations for
the continuous recycle system for demonstration purposes, but it is
obvious that equivalent operations could be applied to staged processes."
The system satisfies the equations and recycle boundary condition
z,=f;(x,u) 0<t<0 (1)
x;(0) = G;[xf,x(0)] (2)

and the objective is the minimization of &[x(0)]. We shall define 8


332 OPTIMIZATION BY VARIATIONAL METHODS

new variables, yl, yz, . , y as fol lows:


y; = 0 (3)

Y'(0) - x;(0) = 0 (4)


y;(9) - G;[x,,x(e)) = 0 (5)
Equations (1) and (3) to (5) define a system of 2S equations with S
initial constraints and S final constraints but with the recycle condition
formally removed. The transversality conditions, Eqs. (12) and (13)
of Sec. 6.4, can then be applied to the multipliers.
For convenience we denote the Green's functions corresponding to
the x; as y; and to the y; as 1';. The y, satisfy the equations
s
ti; Ii-1 yjA-i
dx,
0<t<6 (6)

The transversality conditions resulting from the constraint equations (4)


and (5) are, respectively,
y;(0) (7)

y;(B) = (8)
ax; - 4 vi
where the 'n; and v; are undetermined multipliers. The functions r;
satisfy the following equations and transversality conditions:
t'; = 0 r, = const (9)

r;(e) = v; (11)
Combining Eqs. (7) to (11), we are led to the mixed boundary condition
for the y; obtained previously
S
d(i d6
W ax,
(12)
i-1

The hamiltonian and strong minimum principle,. of course, carry over


directly to this equivalent formulation, but fqr the staged problem we
obtain only the weak minimum principle.

10.5 LAGRANGE MULTIPLIERS


The Lagrange multiplier rule, first applied in Chap. 1 to the study of
staged systems, is a particularly convenient tool for a limited class of
problems. One of the tragedies of the minimum-principle-oriented
analyses of recent years has been the failure to retain historical per-
NONSERIAL PROCESSES 333

spective through the relation to Lagrange -multipliers as, for example,


in Sec. 7.5. We shall repeat that analysis here for the staged recycle
problem as a final demonstration of an alternative method of obtaining
the proper multiplier boundary conditions.
The stage difference equations and recycle mixing condition may be
written as.
-x,n + fin(xn_l,un) = 0 n = 1, 2, . . . , N (1)
-xi° + Gi(x,,xN) = 0 (2)
For the minimization of F(x") the lagrangian is then written
N S
£ = &
+ n-1
4I i-1
/-.l
X,' (-xtin + ./in(an-l,u°)1
S
+ Xi°(-xi° + Gi(x,,xN)) (3)
i-1

Setting partial derivatives with respect to each of the variables u', u2,
. . UN, zi°, xi', . . . , x;N to zero, we obtain
a. _ S
VU_
:n afin
aun
= o (4)
i-1
S
a.C
ax'°
X,n 1 T, Xn+1
ax n
=O n=0,1,2,...,N-1
i-1
(5)
aae as X.N + ° aG; _o
aX7°5-ZW - ill I ax:N
(6)

Equations (4) and (5) are the usual weak-minimum-principle equations


for an unconstrained decision, while Eq. (6) is the multiplier boundary
condition obtained previously for the recycle problem.

10.6 THE GENERAL ANALYSIS


We can now generalize the recycle analysis to include plants with an
arbitrary structure of interactions. The essence of the analysis is the
observation that any complex structure can be broken down into a set of
serial, or straight-chain, structures of the type shown in Fig. 10.4, with

Fig. 10.4 Serial subsystems.


334 OPTIMIZATION BY VARIATIONAL*METHODS

mixing conditions expressing the interactions between serial subsystems.


For notational purposes we shall denote the particular subsystem
of interest by a parenthetical superscript. Thus x(k) refers to the state
of the kth serial subsystem. If it is continuous, the dependence on the
continuous independent variable is expressed as x(k)(t), where t ranges
from zero (inlet) to 9(k) (outlet). In a staged serial subsystem we would
indicate stage number in the usual way, z(4)", where n ranges from zero
to N(k). It is convenient to express the inlet and outlet of continuous
subsystems in the same way as the discrete, so that instead of z(k)(0)
we shall write z()0, and X(k)111 for x(4)(9(4)).
The state of the kth serial subsystem is described by the difference
or differential equations
xi (k)" = (k)" (x(k)w-1 'U (k)" ) n = 1, 2, . . . , NW
(la)
s
i = 1, 2, . . . , S(k)

xs(k) = f1(k)(x(k),u(k)) 0 < t < 9(k)


S(k) (lb)
i = 1, 2, ,

where S(k) is the number of state variables required in the kth subsystem
and a single decision is assumed for convenience. The subsystems are
related to one another (by mixing equations of the form
x(4)0 = G(k)({z,1,fz(1)N1) (2)

Equation (2) is a statement that the input to subsystem k depends in


some known manner on the set of all external feeds to the system {z,}
and the set of outflow streams from all serial subsystems {x(1)N). The
simple recycle mixing condition, Eqs. (2) and (14) of Sec. 10.2, is a special
case of Eq. (2). The objective is presumed to depend on all outflow
streams
8 = &{x(1)N{ (3)

and for simplicity the outflows are presumed unconstrained.


Taking the usual variational approach, we suppose that a sequence
of decisions {u(k)j has been made for all parts of all subprocesses and that
the system equations (1) have been solved subject to the mixing boundary
conditions, Eq. (2). A small change in decisions then leads to the linear
variational equations
s«,
ax(k)n = a1s(k)s sx(k)"-1 +
af+ (k)" au(k)"
4 - axf(4)w-1 ' a.u(k)"
j-1
n = 1, 2, N(k)
S(k) (4a)
i = 1, 2,

az,(k) = axj(k) + aVk) au(k) 0 < t < 9(k) (4b)


2M! au i = 1, 2, . . . , S(k)
j-1
NONSERIAL PROCESSES 33S

The corresponding first-order changes in the mixing conditions and


objective are
So)
aG(k)
Sxi(k) 0 ax (t' N (5)
axj(1) N
t i-1
S())

a8 = axj(i)N (6)
axj(1'N

The summations over 1 indicate summation over all subsystems.


The linear variational equations have associated Green's functions
which satisfy difference or differential equations as follows:
S(k)

,yi(k)n-1 yi(k)n n = 1, 2, . . , N(k)


Y
8(k) (7a)
i-1
LL..II
(3xi(k)n-1
i = 1, 2, . . . ,

S(k)
0f.(k)
ax/i(k) 0<t<0(k)
- 1 77(k) i = 1, 2, . .
S(k) (7b)
i-1

Green's identity for each subsystem is then


S(k) S(k) Ar(k) 3(k)
(k)n
.Yi(k)N axi(k)N = \ ,Yi(k)0 &xi(k)0 + I I a '(k)n au(k)n (8a)

S(k) So) S(k)


i(k)N axi(k)N = Yi(k)0 axi(k)0 +
rO tt) ( ''i(k) aau(k)(t)
dt
.Jf
(8b)
The notation is made more compact by introducing the hamiltonian,
So)
H(k)n o ,ii(k)nfi(k)n staged subsystem (9a)
i-1
S
H(k) = ,Yi(k) f.(k) continuous subsystem (9b)

and the summation operator $(k), defined by

au(k)n staged subsystem


oH(k) au(k) _
8(k) au(k)
Bo) 8H(k)
au(k)(t) dt continuous subsystem
(10)

Green's identity can then be written conveniently for staged or continuous


336 OPTIMIZATION BY VARIATIONAL METHODS

subsystems as
8(k) S(k)
)
)i(k).V bxi(k)N I .yi(k)O Sxi(k)o + s(k)
L.r au(k) bu(k) (11
i-1 i-1

We now substitute Eq. (5) for bx,(k)0 into Eq. (11), which is written
as
S(k) s(k) S,)) aGi(k>
7i(k)N bxi(k)N - p aH(k)
axi(I)N = cl(k)
4 ,,i(k)O j..1
II axj(!)N
u
a(k) 50)
i-1 i-I I

(12)
Equation (12) is then summed over all subsystems
3(k) S(k) S(,)

Ik i-1 . y,(k)N axi(k)N - .) (k)o I I ax;(!)N


"j"),
k i-1 I j-1
aH(k)
_ ++
J(k) 5U(k) au(k) (13)
k

In the first term the dummy indices for summation, i and k, may be
replaced by j and 1, respectively, and order of finite summation inter-
changed in the second such that Eq. (13) is rewritten
S(n Su) S(k)
a(y,
yj (i)N axi(!)N yf(k)U 11 axj (1)N
1 j-1 1 j-1 ( k i-1
ax(I)N/
OH(k)
c7 (k) au(k) a2E(k) (14)
A;

or, finally,
Su) S(k )

yi(k)o aGi(k)1
all(k)
1 j-1
(ywv

- k i-1
ax (I)N J/
bx.(°N =
1

k
eS<k)
au(k)
au(k) (15)

Comparison with Eq. (6) for & dictates the boundary conditions for
the Green's functions
So)
Yi(1)N - .ri(k)0 6Cii
(R,

azi(pN
_ dE
ax.(I)N
(16)
k i-1
This is a generalization of Eqs. (11) and (17) of Sec. 10.2. The first-
order variation in the objective can then be written explicitly in terms
of the variations in decisions by combining Eqs. (6), (15), and (16)
aH(k)
59 _ (k) bulk) au(k) (17)
k
NONSERIAL PROCESSES 337

We may now adopt either of the two points of view which we have
developed for variational problems. If we are examining variations
about a presumed optimum, SS must be nonnegative and analyses identi-
cal to those in Sees. 6.5 and 7.4 lead to the weak minimum principle:
The Hamiltonian H(k)n or H(k) in each subprocess is made stationary
by interior optimal decisions and a minimum (or stationary) by
optimal decisions at a boundary or nondifferentiable point.

For continuous subsystems H(k) is a constant, and by a more refined


analysis of the type in either Sec. 6.8 or 6.9, a strong minimum principle
for continuous subsystems can be established. The necessary conditions
for an optimum. then, require the simultaneous solution of Eqs. (1) and
(7) with mixing conditions defined by Eqs. (2) and (16) and the optimum
decision determined by means of the weak minimum principle. .

Should we choose to develop a direct computational method based


upon steep descent, as in Sec. 9.9, we wish to choose Su(k)(t) and Eu(k)n
to make & nonpositive in order to drive S to a minimum. It follows
from Eq. (171, then, that the changes in decision should be of the form

gu(k)n = _w(k)n aH(k)n (18a)


au(k)n

au(k)(t) = _w(k)(t) as (18b)


k>

where w(k)11 and w(k)(t) are nonnegative. A geometric analysis leads to


the normalization analogous to Eqs. (12) and (23) of Sec. 9.9.

au (k) n OG(k)n 3H(k)n/49u(k)n


(19a)
2
H(k)

zG(k)(t) aH(k)/au(k)
5.'(k)(t) (19b)
all(k)
S(k)G(k)
k
au(k)
)7'
where A is a step size and G(k) n, G(k) (t) are nonnegative weighting functions.

10.7 REACTION, EXTRACTION, AND RECYCLE


The results of the preceding section can be applied to the optimization
of the process shown in Fig. 10.5. The reaction

X1 --" Xz -- products
338 OPTIMIZATION BY VARIATIONAL METHODS

Solvent
E7
Solvent
I II
Prod ct
Extraction of x,
Solvent (choose amount of solvent)
removal

I 2 3
Feed
pure x, Mixing Reaction x,-x2--x3
(choose temperatures)

FIg.10.5 Schematic of a reactor system with countercurrent extrac-


tion and recycle of unreacted feed. (From M. M. Denn and R. Aris,
Ind. Eng. Chem. Fundamentals, 4:248 (1965). Copyright 1965 by
the American Chemical Society. Reprinted by permission of the
copyright owner.]

is carried out in a sequence of three stirred-tank reactors. The product


stream is passed through a two-staged countercurrent extraction unit,
where the immiscible solvent extracts pure X1i which is recycled to the
feed to the first reactor. The reactor system will be denoted by arabic
stage numbers and the extractor by roman numerals in order to avoid
the use of parenthetical superscripts.
The kinetics of the reactor sequence are taken as second ' and
first order, in which case the reactor material-balance equations are
those used in Sec. 9.12
0 = xin-' - x1n - 6kioe-8"'1-'(x1")2 n = 1, 2, 3 (1a)
0 = x2n-' - x1; + 6?k, oe-E,'/"1(x1") 2
- 9k2oe 8i "1x2" n = 1, 2, 3 (1b)

The material-balance equations about the first and second stage of the
extractor are, respectively,
x11 -I- u11[4'(x11)
- 0111)1 - x18 = 0 (2)
X111 + 4111(x111) - x11 = 0 (3)

Here u11 is the ratio of volumetric flow rates of solvent to product stream
and #(x1) is the equilibrium distribution between the concentration of
x1 in solvent and reactant stream. Because the solvent extracts X1
only, there is no material-balance equation needed for x2 about the
extractor, for x23 = x211. The external feed is pure X1i so that the
NONSERIAL PROCESSES

feed to reactor 1 is
x1° = xu + u"I '(xi') (4a)
x2° = 0 (4b)
We wish to maximize the production of X2 while allowing for costs of
raw material and extraction, and hence we seek to minimize
8 = -x23 - CxIII + OuII (p)

The temperatures u', u2, u3 and solvent ratio u11 are to be chosen subject
to constraints
U* < u', U2, u3 < u* (6a)
0<u"I (6b)

Equations (1) to (5) are not in the form required for application
of the theory, for, though we need not have done so, we have restricted
the analysis to situations in which a decision appears only in one stage
transformation and not in mixing conditions or objective. This is
easily rectified by defining a variablet x3 with
x3' = x3' (7)
x31I = x3I + uII (8)
Equations (2) and (3) are then rewritten, after some manipulation,
4,(xII) - xI` = 0 (9)
xIII + uII4,(xi"1) - X11
=0 (10)

where the mixing boundary conditions are now rearranged as


xl _ xIf + x,1 - -xIII (lla)
x2° = 0 (llb)
i II
xIs (1lc)
x1 x3II I

23' = 0 (lld)
The system is then defined by Eqs. (1) and (7) to (11), and the objective
is rewritten
8 = --:X1 a - X1 11 + Ox3II (12)

The structure defining the interactions represented by Eqs. (11) is shown


in Fig. 10.6.
The equations for the Green's functions are defined by Eq. (7) of
f The superscript z (for zero) will denote the input to the first stage for the roman-
numeraled variables.
340 OPTIMIZATION BY VARIATIONAL METHODS

r- II
1 Hi 2
1 3 I

Fig. 10.6 Structure of the interactions in the reaction-


extraction-recycle system. [From M. M. Denn and R.
Aris, Ind. Eng. Chem. Fundamentals, 4:248 (1965).
Copyright 1965 by the American Chemical Society. Re-
printed by permission of the copyright owner.)

Sec. 10.6 and, using implicit differentiation as in Sec. 7.6, are


n-1 y1n
y 1
1 + 20k 10e-E,"lu"x In

2e,Y2nk1pe E,"/u"xin

+ (1 + 28k1oe-E''lu'x1")(1 + 8k20e_E='!
n = 1,2,3 (13a)

72"-1 = 1 + k620e-E="'U'
ry2" n = 1, 2, 3 (13b)

yI = ,(xll)
?1
(14a)
171I1
i
= 1 + ull#'(xi') (14b)

yin-1=7'3" n=I,II (14c)


The prime denotes differentiation with respect to the argument. The
boundary conditions, obtained from Eqs. (11) and (12) by means of the
defining equation (16) of the preceding section, are

.yiII + x31L + 71 = -c (15a)

YI3- iii -y10=0 (15b)

y23 = -1 (15c)
.I
7Y31I + 3t1 , (X19 ' or (15d)

Finally, the partial derivatives of the hamiltonians with respect to the


decisions are 9k29e_R="/u")jE'kloe-E,"lu"(xiT
Lf72n

- Y1"(1 +
aHn _ 0 - y2nE2k2oe-E="'"X2"(1 + 20kjoe-E'''""x1")
au". (u")2 (1 + 9k2oe-E! I"")(1 + 20k10e-E"''""xl")
n = 1, 2, 3 (16a)
OHII 11 0111 11 0311
aull - 71 aull + 13 auli
7i ll_(x 1 i1) II (16b)
1 + u' I.l" (x i11) + '1' 3
NONSERIAL PROCESSES -341

or, substituting Eq. (15d) into (16b),


allil
^ uII 4011) 1 +I U1 y (XIII) (16c)
U

It is evident that the artificial variables x3', x3II and -y3I, 73 11 are never
needed in actual computation.
The simultaneous solution of the material-balance relations, Eqs.
(1) to (4), and Green's function, Eqs. (13) to (15), with the optimal tem-
peratures and solvent ratio determined from the weak minimum principle
by means of Eqs. (5) and (16), is a difficult task. Application of the
indirect methods developed in Chap. 9 would require iterative solution
of both the state and multiplier equations for any given set of boundary
conditions because of the mixing conditions for both sets of variables, and
even with stable computational methods.this would be a time-consuming
operation. Steep descent, on the other hand, is quite attractive, for
although the material-balance equations must be solved iteratively for
each assumed set of decisions, the multiplier equations are then linear
with fixed coefficients and can be solved by superposition.
To illustrate this last point let us suppose that the decisions and
resulting state variables have been obtained. Addition of Eqs. (15a) and
(15b) leads to

VIII + 'v'3 = -c (17)

and so it is evident that a knowledge of yl' is sufficient to solve Eqs.


(13a), (14), (15a), (15b), and (15d). We assume a value 9I' and compute
the set (91^), n = 1, 2, 3, z, I, If. The correct values are denoted by
191" + 3yi" J . From Eq. (17)
Sy1II - by1' (18)

in which case it follows from Eqs. (14a) and (14b) that


&y1'
(19)
67", 4'(x,')[1 + u114"(x1I))
From Eq. (13a),
3 1

ay1° = b YI (20)
1 + 29xt'"k3oe-E,'l"

But it follows from Eq. (15b) that

-
a
8
by1' uII - b7'I° -113 + uII + '11° (21)
342 OPTIMIZATION BY VARIATIONAL METHODS

and combining Eqs. (19) to (21), it follows that the correct value of yl' is
11 - "T IC + ION'
yl = lla + 3

1+
u"I"(x,')[1 + u"IL"(xi)] X111 1 + 28xI kloe-1W1u°
(22)
Thus, once the temperatures and solvent ratio have been specified and
the material-balance equations solved iteratively, the equations for the
Green's functions need be solved only twice with one application of Eq.
(22).
The corrections to the values of u', us, u', u"I are calculated using
Eqs. (16a) and (16c) from the relations
bun = -A

= -A[ 3
3
()2 Gn aHn/aun

GII aH"/aull
u'Ibull (23a)

(23b)
G. aI (OH-)2
+G II au"'CauII)J

The physical parameters used in the calculations in Secs. 9.3 and 9.12
were used here, with 0 = 2 and a total reactor residence time of 6. The
function 4,(x,) is shown. in Fig. 10.7 and has the analytical form

(xI) = 2.5xi-- 2(xi)1 0 < xI < 0.6 (24)


1.08 - 1.lxi+ (xl)2 0.6 <xl <0.9
The cost of extraction o was allowed to range over all values, and GI,
Gs, G3 were set equal to unity. Following some preliminary experimen-

0.9

0.8

0.7

0.6
0.5

0.4
0.3
Fig. 10.7 Equilibrium distribution func-
tion of feed between solvent and reactant
0.2 streams. [From M. M. Denn and R.
0.1 Aris, Ind. Eng. Chem. Fundamentals,
4:248 (1965). Copyright 1965 by the
0 0.2 0.4 0.6 0.8
American Chemical Society. Reprinted
X1 by permission of the copyright owner.]
NONSERIAL PROCESSES 343

tation, G" was taken as 0.005. As in Sec. 9.12 the initial step size a
was set equal to 2, with the step size halved for each move not resulting
in a decrease in the value of S. The criterion for convergence was taken
to be a step size less than 10-1. In all calculations the material-balance
equations were solved by a one-dimensional direct search. Figure 10.8
shows a typical convergence sequence, in this case for o = 0.30.
The profit -S is plotted in Fig. 10.9 as a function of extraction cost.
The horizontal line corresponds to the optimal nonrecycle solution found
in Sec. 9.12, and in the neighborhood of the intersection two locally opti-
mal solutions were found. Figure 10.10, for example, shows another set
of calculations for a = 0.30 starting at the same initial temperature policy

0.4 344

0.3 343

0.2 342

0.1 341

0 340
11 I 1 1

0 5 10 15
Iteration number

Fig. 10.8 Successive approximations to the optimal


temperature sequence and solvent ratio using steep
descent. [From M. M. Denn and R. Aris, Ind.
Eng. Chem. Fundamentals, 4:248 (1965). Copy-
right 1965 by the American Chemical Society. Re-
printed by permission of the copyright owner.]
344 OPTIMIZATION BY VARIATIONAL METHODS

as in Fig. 10.8 but at a different solvent ratio, and the resulting tempera-
tures are those for the optimal three-stage serial process. The optimal
temperature and solvent policies as functions of o are shown in Figs.
10.11 and 10.12. For sufficiently inexpensive separation the tempera-
tures go to u1., indicating low conversion and large recycle, while for suf-
ficiently costly separation the optimal policy is nonrecycle. Multiple
solutions are shown by dashed lines. The discontinuous nature of the
extraction process with separation cost has obvious economic implications
if capital investment costs have not yet been taken into account in deriv-
ing the cost factors for the process.
A mixed continuous staged process with the structure shown in Figs.
10.5 and 10.6 is obtained by replacing the three stirred-tank reactors
with a plug-flow tubular reactor of residence time 0 = 6. The material-
balance equations for the continuous serial process are then
z1 = -kloe-E,'I"(x,)2 0<t<6 (25a)
x2 =
ki0e_E,11u(x1)2
- kgpe_E''1ux2
0<t<6 (25b)

with corresponding Green's functions

'y, = 2k,oe-$"'ux1(y1 - 72) 0<t<6 (26a)


y2 = k2oe-E"'uy2 0 < t <.6 (26b)

0.43

0.41

0.39

0.37

0.35
0
a
0.33

0.31 Fig. 10.9 Profit as a function of cost


of extraction. [From M. M. Denn
0.29 and R. Aris, Ind. Eng. Chem. Funda-
mentals, 4:248 (1965). Copyright 1965
by the American Chemical Society. Re-
0.270.15 0.30 0.35 printed by permission of the copyright
0.20 0.25
Cost of extraction o owner. ]
NONSERIAL PROCESSES 345

0.75 345

0.50 344

0.25 343

0 342
I l I I I 1 I I
0 1 3 4 5 6 7
Iteration number

Fig. 10.10 Successive approximations to the opti-


mal temperature sequence and solvent ratio using
steep descent. {From M. M. Denn and R. Aris,
Ind. Eng. Chem. Fundamentals, 4:248 (1965).
Copyright 1965 by the American Chemical Society.
Reprinted by permission of the copyright owner.]

345

340
Fig. 10.11 Optimal temperature se-
E
quence as a function of cost of extrac- ,!
tion. From M. M. Denn and R. Aris,
Ind. Eng. Chem. Fundamentals, 4:248
(1965). 'Copyright 1965 by the American 335
Chemical Society. Reprinted by permis- 0.10 0.15 0.20 0.25 0.30
sion of the copyright owner.) Cost of extraction c
346 OPTIMIZATION BY VARIATIONAL METHODS

1.1

1.0

0.9

0.8
0.7
0:6
0N
-0.5
0
0.4
c
00 .0.3
E Fig. 10.12 Optimal solvent ratio as a
a 0.2 function of cost of extraction. [From
0.1 M. M. Denn and R. Aris, Ind. Eng.
Chem. Fundamentals, 4:248 (1965).
0
Copyright 1965 by the American Chemi-
0.15 0.20 0.25 0.30 0.35 cal Society. Reprinted by permission of
Cost of extraction o the copyright owner.]

The partial derivative of the continuous hamiltonian with respect to the


decision is
aH 1
[x12k`oe t 11E,(y1 - -12) - xsksoe N-1uE2y2] (27)
au u2

If we denote the effluent at t = .6 by the superscript n = 3, Eqs. (2) to


(4), (7) to (11), (14), (15), and (16c) remain valid. The Green's function
equations can still be solved with a single correction, but instead of Eq.
(22) we must use
y1(6) = 91(6)
+ 91(0) - 91(6) + !ls/u1I
r(e
1+ 1 - exp ( - 2x,kloe-B,'!- dl)
uI1G (xI,)1
> Jo
(28)

The corrections in temperature and solvent ratio are


su(c) -o G(t) G(t) aH/au dt + G" (au)2
(29a)
OHlI/au11
[j0 (aHIt1)2T,
gull =- G11
(29b)
f 121'5

au)2 dt + G" ( u11


[JOB G(t)

Following the results of Sec. 9.11, the continuous variables were


.stored at intervals of 0.1 with linear interpolation, and an initial step size
of S was taken. G(t) was set equal to unity and GII to 0.0005. A typi-
NONSERIAL PROCESSES 347

Residence time t
Fig. 10.13 Successive approximations to the optimal temperature
profile in a continuous reactor using steep descent. (From M. M.
Denn and R. Aris, Ind. Eng. Chem. Fundamentals, 4:248 (1965).
Copyright 1965 by the American Chemical Society. Reprinted by
permission of the copyright owner.)

cal convergence sequence is shown in rigs. 10.13 and 10.14 for o = 0.25.
The solid starting-temperature profile is the optimal nonrecycle solution
found in Sec. 9.3, while the shape of the dashed starring curve was dic-
tated by the fact that the unconstrained solution for the temperature
profile can be shown from the necessary conditions to require an infinite

Fig. 10.14 Successive approximations to


the optimal solvent ratio following a
continuous reactor using steep descent.
[From M. M. Denn and R. Aris, Ind.
Eng. Chem. Fundamentals, 4:248 (1965).
Copyright 1965 by the American Chemi-
cal Society. Reprinted by permission of 5 10
0
the copyright owner.] Iteration number
348 OPTIMIZATION BY VARIATIONAL METHODS

355

Z 350
v
c 345
CL
E
340

335

F1g.10.15 Optimal temperature profile as a function of cost


of extraction. [From M. M. Denn and R. Aris, Ind. Eng.
Chem. Fundamentals, 4:248 (1965). Copyright 1965 by the
American Chemical Society. Reprinted by permission of the
copyright owner.)

slope at t = 0 (compare Sec. 4.12). Convergence was obtained in 9 and


11 iterations from the solid and dashed curves, respectively, with some
difference in the ultimate profiles. The apparent discontinuity in slope
at t = 0.1 is a consequence of the use of a finite number of storage loca-
tions and linear interpolation.
The profit is shown as a function of o in Fig. 10.9, the solvent allo-
cation in Fig. 10.12, and the optimal temperature profiles in Fig. 10.15.
As in the staged system there is a region in which multiple solutions were
found. For o = 0.325, for example, the same profit was obtained by the
curve shown in Fig. 10.15 and solvent in Fig. 10.12 and by the nonrecycle
solution, shown as ar = oo. It is evident that multiple solutions must be
expected and sought in complex systems in which the amount of recycle
or bypass is itself a decision which has monetary value.

10.8 PERIODIC PROCESSES

The observation that the efficiency of some separation and reaction sys-
tems can be enhanced by requiring the system to operate in the unsteady
state, as demonstrated, for example, in Sec. 6.11, has led to substantial
interest in the properties of periodic processes. These are processes in
which a regular cyclic behavior is established and therefore, in terms of
time-aver6ged behavior, allows the overall operation to 'be considered
from a production point of view in steady-state terms. As we shall see,
the fact that a decision made during one cycle has an effect on an
NONSERIAL PROCESSES 349

earlier part of the successive cycle is equivalent to feedback of infor-


mation, and such processes are formally equivalent to recycle processes.
We consider a process which evolve6 in time according to the diff er-
ential equations
o<t<8
-ii = f;(x,u) i=1,2,...,S (1)

The process is to be operated periodically, so that we have boundary


conditions
xi(0) = xi(8) i = 1, 2, . . . , S (2)

This is clearly a special case of the recycle boundary conditions. For


simplicity we shall assume that 8 is specified. The decision function u(t)
is to be chosen to minimize some time-averaged performance criterion

S= fo 5(x,u) dt (3)

This can be put in the form of a function of the state at t = 8 by defining


a new variable

x.+1= 9(x,u) 0<t<8 (4)

x,+1(0) = 0 (5)

In that case
&[x(o)J = x,+1(e) (6)

The hamiltonian for this problem is


s
H = y.+1 T + y;fi (7)
i-i

where the multiplier equations are

1 as: aj
'Yj i=1,2, ...,5 (8a)

(8b)

The boundary conditions, following Eq. (11) of Sec. 10.2, are

yi(8) - yi(0) = 0 i = 1, 2, . . . , S (9a)


74+1(8) = 1 (9b)
350 OPTIMIZATION BY VARIATIONAL METHODS

Thus_ we obtain, finally,


s
H e£+ Yifi (10)

1 05 aft
e axi axi i = 1, 2, ... 'S
1
j-1
'y (0) _ 'yi(B) i = 1, 2, .,S (12)

The partial derivative of the hamiltonian with respect to the decision


function is, of course,
aH 1 M; afi
(13)
au au + i-1 y` T.
The calculation of the periodic control function u(t) for optimal
periodic operation is easily carried out by steep descent by exploiting
the periodic boundary conditions on x and y. An initial periodic decision
function is specified and Eqs. (1) integrated from some initial state until
periodic response is approached. Next, Eqs. (11) for y are integrated in
reverse time from some starting value until a periodic response results.
Reverse-time integration is used because of the stability problems dis-
cussed in Sec. 9.8. Finally, the new decision is chosen by the usual
relation
Su = -w(t) u 0<t<B (14)

and,, the process is repeated. Convergence to periodic response might


sometimes be slow using this simulation procedure, and a Newton-
Raphson scheme with second-order convergence properties can be
developed.
We have already seen for the special case in Sec. 6.11 that all/au
vanishes at an interior optimal steady state, necessitating the use of the
strong minimum principle in the analysis of optimality. This is, in fact,
true in general, and it then follows that a steady state cannot be used as
the starting value for a steep-descent calculation, for then du in Eq. (14)
would not lead to an improvement beyond the optimal steady state.
We prove the general validity of this property of the steady state by
noting that in the steady state we seek to minimize ff(x,u) subject to
the restrictions
f;(x,u) = 0 i = 1, 2, . . . , S (15)

The lagrangian is then


s
(X' U) + XJ1(x,u) (16)
i-1
NONSERIAL PROCESSES 351

and it is stationary at the solutions of the following equations:

'
S

T au + i-1I aui
0 (17)

= ax; + jsl
S
afax;
i=1,2,...,5
x; = o (18)

Identifying X; with B-y;, Eq. (17) is equivalent to the vanishing of all/au


in Eq. (13), while Eqs. (15) and (17) are the steady-state equivalents of
Eqs. (1) and (11), whose solutions trivially satisfy the periodicity bound-
ary conditions.
As a computational example of the development of an optimal peri-
odic operating policy using steep descent we shall again consider the reac-
tor example of Horn and Lin introduced in Sec. 6.11. Parallel reactions
are carried out in a stirred-tank reactor with material-balance equations
xl = -ux1" - aurxi - x1 + 1 (19a)
x2=ux1"-x2 (19b)
x, and x2 are periodic over the interval 0. The temperature-dependent
rate coefficient u(t) is to be chosen periodically, subject to constraints
u* <u<u* (20)
to maximize the time-average conversion of X2, that is, to minimize
S
B f
o
x2(t) dt (21)
Then
Bx2 (22)

The periodic Green's functions satisfy the differential equations


y, = -y,(nux,"-l + au' + 1) - ny2ux1"-1 (23a)
72 = 8 + 72 (23b)

The hamiltonian is
1
H=- B
x2 - yl(uxl" + au'x1 + x1 - 1) + 7'2(uxln - x2) (24)

with a partial derivative with respect to u


aH
au = - (71x1" + aylru'-1x1 + 72x1")
(25)

It is shown in Sec. 6.1 that when


nr-1>0 r<1 (26)
improvement can be obtained over the best steady-state solution.
3S2 OPTIMIZATION BY VARIATIONAL METHODS

For this particular problem some simplification results. Equation


(19a) involves only u and x1 and can be driven to a periodic solution for
periodic u(t) in the manner described above. Equations (19b), (23a),
and (23b) can then be solved in terms of u and x1 with periodic behavior
by quadrature, as follows:
x2(t) = x2(0)e-e + fo t u(r)x1"(r) dr (27a)

l
e

X2(0)
1 f
e-0 o
dr (27b)

72(t) const (28)

71(t) _ 'Y1(O) exp [fo (nuxln-1 + au' + 1) dv]

+ fo exp [f' (nuxln-1 + au' + 1) do] u(T)xln(T) dT (29a)

71(0) =
f o exp [f' (nuxln-1 + au' + 1) dv] u(T)xin(r) dr
e (29b)
1 - exp [f (nuxln-1 + au' + 1) da]

The adjustment in u(t) is then computed from Eq. (14) using all/au in
Eq. (25).
The numerical values used are as follows:
n =2 r =0.75
a=1 a =0.1
u*=1 u*=5
The optimal steady state, computed from Eq. (15) of Sec. 6.11, is
u = 2.5198, with corresponding values
x1 = 0.27144
x2(= -&) = 0.18567
71 = X3.1319
The starting value of u(t) for the steep-descent calculation was taken as
u(t) = 2.5198 + 0.40 sin 20irt (30)
The weighting factor w(t) for steep descent was based on the normalized
form and taken as
w(t) = G(t) _
(31)
dtj34
[f0 au}
with G(t) equal to 0.1 or the maximum required to reach a bound and
halved in case of no improvement in 8.
NONSERIAL PROCESSES 353

0 3
V
Q
v
0
0

0 0.01 0.02 0.03 0 04 0.05 0 06 0.07 0.08 0.09 0.10


Time f

Fig. 10.16 Successive approximations to the optimal peri-


odic control using steep descent.

A response for 'x1(t) which was periodic to within 1 part in 1,000


was reached in no more than eight cycles for each iteration, using the
value of x1(9) from the previous iteration as, the starting value. This
error generally corresponded to less than 1 percent of the amplitude of
the oscillation. Successive values of the periodic temperature function
u(t) are shown in Fig. 10.16 and corresponding values of the objective in
Fig. 10.17. The dynamic behavior of x1 and x2 is shown in Figs. 10.18

0.18 5'
Fig.10.17 Time-averaged conversion on
successive iterations.
0 1
I
2 3 4 55 6
I 4

7 8 9 10 11
Iterations number
354 OPTIMIZATION BY VARIATIONAL METHODS

0 280

0270

0 260
0 001 0.02 0 03 0.04 0.05 006 0 07 0.08 0.09 0.10
I
Fig. 10.18 Dynamic response of x, on successive iterations to peri-
odic forcing.

and 10.19, respectively. Note that following the first iteration the entire
time response of x2(t) lies above the optimal steady-state value. The
course of the iterations is clearly influenced by the sinusoidal starting
function and appears to be approaching a bang-bang policy, though this
was not obtained for these calculations. Horn and Lin report that the
optimum forcing function for this problem is in fact one of irJiinitely rapid
switching between bounds or, equivalently, one for which the length of
the period goes to zero. The steep-descent calculation evidently cannot
suggest this result for a fixed value of 0.
In realistic applications the decision function will normally not
influence the system response directly, as the temperature does in this

0 195

X20)0.190

0
t
0
0.01 0.02 003 004 005 0.06 007 008 0.09 0...

Fig. 10.19 Dynamic response of x2 on successive iterations to peri-


odic forcing.
NONSERIAL PROCESSES 355

example, but will do so through a mechanism with which damping will be


associated. Coolant flow-rate variations, for example, must be trans-
mitted to the output conversion through the energy equation for tem-
perature, and high-frequency oscillations will be'masked and therefore
have the same effect as steady-state operation. An optimal period will
therefore exist, and can most easily be found by a one-dimensional search.

10.9 DECOMPOSITION
The computational procedure which we have developed for the solution
of variational problems in complex structures has depended upon adjust-
ing the boundary conditions of the Green's function in order to incorpo-
rate the mixing boundary conditions. There is an alternative approach
that has sometimes been suggested which we can briefly illustrate by
means of the examples used in this chapter.
We consider first the reactor problem from Sec. 10.3. The problem
is formulated in terms of two variables
it = -k(u)F(xl) (1a)
x2 = -k(u)F(x1) + g(u) (lb)

The boundary conditions are


x1(0) _ (1 - P)x/ + Px1(0) (2a)
X2(0) = 0 (2b)
with objective
C = x2(0) (3)

If we knew the value of the input x1(0), this would specify the output
x1(8). For any fixed input, say x*, we can find the function u(1) which
minimizes E subject to Eqs. (1) and (2b) and
x1(0) = xl (4a)

x 1 (0) = P x1 - 1 P xf ( 4b )
P

This is a problem for a serial process with constrained output. We call


the optimum C(xl ). For some physically realizable value of x,*, & takes
on its minimum, and this must then represent the solution to the original
recycle problem. The sequence of operations is then:

1. Choose x,*. Solve a constrained-output minimization problem to find


S(x*).
2. Search over values of x; to find the minimum of S(xi) using, for
example, the Fibonacci method of Sec. 2.4.
384 OPTIMIZATION BY VARIATIONAL MtTNIDDS

In this case the ultimate computation by either this decomposition


method or the use of recycle boundary conditions for the Green's func-
tions is identical, and no saving occurs by use of one method or the other.
A somewhat more revealing example is the plant studied in Sec.
10.7. The reactor equations are
0 = x1"-1 - xl" - Okloe-E,'"""(xi")2 n = 1, 2, 3 (5a)
0 = x2"-' - x2" + Bk10e-e,'Iu'(x1")2 - n = 1, 2, 3
(5b)
The extractor ((eq,//uationsand the boundary conditions are
x11 + u1 [.(x11) - 0(x1i1)] - x13 = 0 (6)
X111 + ull0(x111').(- x11 = 0 (7)
x10 = x11 + 1613#\x11) (8a)
x20 = 0 (8b)

and the objective is


& = -x23 - Cxll + null (9)
The key to the decomposition of this problem is the observation that
[Eq. (11a) of Sec. 10.7]
X10 x13 - x111 (10)
= x11 +
Thus, by setting x13 and x111 to fixed values x13# and xIII* we completely
fix the feed and effluent of X1 from the reactor system. We have already
seen in Sec. 9.13 how to solve the problem of minimizing -x23 subject to
fixed inputs and outputs of X1. Call this minimum -x23*(x13*,x11I*).

Furthermore, Eqs. (6) and (7) can be solved for ull in terms of x13* and
x111# Call this value uuI*(x13*,xlll*). After carrying out these two
operations we then seek the proper values of x13 and xlll by the operation
min [-x23*(X18*,xlIi*) - Cxi11* + vu1I*(xl3*,xlli*)] (11)
x1*a.z,.n

.The decomposition approach appears to offer no advantage in this


latter case. The computation of a single optimum temperature sequence
for a constrained output in Sec. 9.13 requires nearly as extensive a set
of calculations as the complete analysis in Sec. 10.7. The subminimiza-
tion would have to be carried out for each pair of values xi3, x1II in the
search for the optimum pair using, for example, an approximation to steep
descent such as that in Sec. 2.9. The approach of breaking a large
minimization problem down into a repeated sequence of smaller problems
has been applied usefully in other areas, however, particularly certain
types of linear programming problems, and might find application in
some problems of the type treated here.
NONSERIAL PROCESSES

BIBLIOGRAPHICAL NOTES

Section 10.2: The development here was contained in


M. M. Denn and R. Aris: AIChE J., 11:367 (1965)
and : Ind. Eng. Chem. Fundamentals, 4:7 (1965)
A proof of a strong minimum principle for continuous systems is in
M. M. Denn and R. Aris: Chem. Eng. Sci., 20:373 (1965)
A first incorrect attempt to extend results for straight-chain processes to recycle processes
was by
D. F. Rudd and E. D. Blum: Chem. Eng. Sci., 17:277 (1962)
The error was demonstrated by counterexample in
R. Jackson: Chem. Eng. Sci., 18:215 (1963)
A correct development of the multiplier boundary conditions for recycle was also obtained by
L. T. Fan and C. S. Wang: Chem. Eng. Sci., 19:86 (1964)

Section 10.3: The example was introduced by


G. S. G. Beveridge and R. S. Schechter: Ind. Eng. Chem. Fundamentals, 4:2:17 (1965)

Section 10.4: The reformulation to permit use of the straight-chain process result is
pointed out in
F. J. M. Horn and R. C. Lin: Ind. Eng. Chem. Process Design Develop., 6:21 (1967)

Section 10.5: The lagrangian approach was taken by Jackson to obtain results applicable
to the more general problem of the following section in
R. Jackson: Chem. Eng. Sci., 19:19, 253 (1964)

Sections 10.6 and 10.7: The general development and the example follow
M. M. 1)enn and R. Aris: Ind. Eng. Chem. Fundamentals, 4:248 (1965)
Further examples of the optimization of nonserial processes may be found in
L. T. Fan and C. S. Wang: "The Discrete Maximum Principle," John Wiley & Sons,
Inc., New York, 1964
-- and associates: "The Continuous Maximum Principle," John Wiley & Sons,
Inc., New York, 1966
J. D. Paynter and S. G. Bankoff: Can. J. Chem. Eng., 44:340 (1966); 45:226 (1967)
A particularly interesting use of the Green's functions to investigate the effects on reaction
systems of local mixing and global mixing by recycle and bypass is in
F. J. M. Horn and M. J. Tsai: J. Optimization Theory Appl., 1:131 (1967)
R. Jackson: J. Optimization Theory Appl., 2:240 (1968)
Section 10.8: This section is based on the paper by Horn and Lin cited above. The paper
was presented at a symposium on periodic processes at the 151st meeting of the
American Chemical Society in Pittsburgh, March, 1966, and many of the other
papers were also published in the same issue of Industrial and Engineering Chemis-
try Process Design and Development. Another symposium was held at the 60th
31$ OPTIMIZATION BY VARIATIONAL METHODS

annual meeting of the AIChE in New York, November, 1967. An enlightening


introduction to the notion of periodic process operation is
J. M. Douglas and D. W. T. Rippin: Chem. Eng. Sci., 21:305 (1966)
A practical demonstration of periodic operation is described in

R. H. Wilhelm, A. Rice, and A. Bendelius: Ind. Eng. Chem. Fundamentals, 6:141


(1966)

Section 10.9: The basic paper on decomposition of recycle processes is


L. G. Mitten and G. L. Nemhauser: Chem. Eng. Progr., 59:52 (1963)
A formalism is developed in
R. Aris, G. L. Nemhauser, and D. J. Wilde: AIChE J., 10:913 (1964)
and an extensive discussion of strategies is contained in
D. J. Wilde and C. S. Beightler: "Foundations of Optimization," Prentice-Hall, Inc.,
Englewood-Cliffs, N.J., 1967
See also
D. F. Rudd and C. C. Watson: "Strategy of Process Engineering," John Wiley &
Sons, Inc., New York, 1968

PROBLEMS
10.1. The autocatalytic reaction X, + X, = 2X2 in a tubular reactor is described
by the equations
C_Et\
.t,i - kio exp C-EIJ xIxt - kto exp x,2
uJ u
x, + xt - coast
The reaction is to be carried out in a reactor of length L with product recycle such that
x,(0) = (1 - p)xi + px,(L)
x2(0) - (1 - p)xi + px,(L)
Develop an equation for the maximum conversion to x*(L) when
E,
=2 xj = 1 - E Zq = E
E,
(The problem is due to Fan and associates.)
10.2. A chemical reaction system in a tubular reactor is described by the differential
equations
z, = f: (z)
x;(0) - xio

with an objective 61[z(B)]. Show that an improvement in tP can be obtained by


removing a small stream at a point ti and returning it at t: only if
[Yi(tt) - Yi(tl)k:(ti) > 0

Obtain the equivalent result for recycle of a stream from t, to ti. (The problem is
du: to Jackson and Horn and Tsai.)
11

Distributed -parameter Systems

21.1 INTRODUCTION
A great many physical processes must be described by functional equa
tions of a more general nature than the ordinary differential and differ-
ence equations we have considered thus far. The two final examples in
Chap. 3 are illustrative of such situations. The procedures we have
developed in Chaps. 6, 7, and 9 for obtaining necessary conditions and
computational algorithms, however, based upon the construction of
Green's function for the linear variational equations, may be readily
extended to these more general processes. We shall illustrate the essen-
tially straightforward nature of the extension in this chapter, where we
focus attention on distributed-parameter systems describable by partial
differential equation.; in two independent variables.

11.2 A DIFFUSION PROCESS


As an introduction to the study of distributed-parameter systems we
shall consider the generalization of the slab-heating problem of Sec.
359
360 OPTIMIZATION BY VARIATIONAL METHODS

3.12, for the results which we shall obtain will be applicable not only to a
more realistic version of that problem but to problems in chemical
reaction analysis as well. The state of the system is describedt by the
(generally nonlinear) diffusion-type partial differential equation
ax ax a2x 0<z<1
at = I x,
az' az2
t z,tl
0<t<e (1)

with a symmetry conditionat z = 1


ax=0 atz=Iforallt (2)
az
and a "cooling" condition at z = 0,
ax
az = g(x,v) at z = 0 for all t (3)

where v(t) represents the conditions in the su*ronnding environment.


The initial distribution is given
x = 0(z) at t = 0 (4)
The environmental condition v(t) may be the decision variable, or it
may be related to the decision variable through the first-order ordinary
differential equation

r dt = h(v,u) (5)

The former case (lagless control) is included by taking the time constant r
to be zero and h as u - v. It is assumed that the purpose of control
is to minimize some function of the state at time B, together with the
possibility of a cost-of-control term; i.e., we seek an admissible function
u(t), 0 < t < 0, in order to minimize
s[u] = f01 E[x(0,z);z] dz + fo C[u(t)] dt (6)

11.3 VARIATIONAL EQUATIONS


We shall construct the equations describing changes in the objective
as a result of changes in the decision in the usual manner. We assume
that we have available functions u(t), 0(1), and x(t,z) which satisfy Eqs.
(1) to (5) of the preceding section. If we suppose that in some way we
have caused an infinitesimal change Sv(t) in v(t) over all or part of the
range 0 < t < 0, this change is transmitted through the boundary condi-
tions to the function x(t,z) and we can write the variational equations to

t Note that the state at any time t" is represented by an entire function of z, z(t',z),
0 < z < 1.
DISTRIBUTED-PARAMETER SYSTEMS 361

first order asf


(bx), = f= ax + (ax), + J=..(ax): (1)
(bx), = g= ax + g, av at z = 0 for all t (2)
(bx), = 0 at z = 1 for all t (3)
ax = 0 when t = 0 for all z (4)
We construct the Green's function y(t,z) for this linear partial differ-
ential equation as in Sees. 6.2 and 7.2. We consider first the product
y ax and write
(y bx), = y, ax + y (Sx),
= 'y ax + yf= bx + 7f=.(ax)= + yJ=..(ax)s= (5)
or, integrating with respect to z,
(y ax) dz = fo' [ye bx + yf= ax + 'Yf:.(ax). + YJ:..(ax)s:] dz
at fol
(6)
The last two terms can be integrated by parts
fo yf.,(ax). dz = - fl (yJ=)= ax dz + yf=. ax Ioo (7a)
fol I
yJ=.,(ax).s dz = fo' (yf=..),, ax dz + [yf.,,(ax)= - ('f=..)= ax] 0
(7b)
so that Eq. (6) becomes

at f o' (y ax) dz = fog [y, + yJ= - (yJ=,)= + (yJ=ax dz


+ { bx[yf=, - (yJ=..)=] + (bx).yf=..1 0
(8)

Integrating with >;espect tot from 0 to B and using Eqs. (2) to (4), we
then obtain
foI y(B,z) bx(9,z) dz = fo fo' [ye + 'Yf= - (yf=.)=
+ (yJ=,.)=_] ax dz dt + fo ax[-(f., - dt 1', - 1

fo ax[-ff., + yf=..g= - (yf=.) } dt IL-o


- fo y(t,0)f.,,(t,0)g.(t) av(t) dt (9)

t In accordance with common practice for partial differential equations and in order
to avoid some cumbersome notation we shall use subscripts to denote partial dif-
ferentiation in Sees. 11.3 to 11.7. Thus,
ax
X.
ax of
x` at - as f=" - a0=x/M)
and so forth. Since we are dealing here with a single state variable and a single
decision variable, and hence have no other need for subscripts, there should be no
confusion.
362 OPTIMIZATION BY VARIATIONAL METHODS

As before, we shall define the Green's function so as to remove


explicit dependence upon 6x from the right-hand side of Eq. (9). Thus,
we require y(t,z) to satisfy the equation
0<z<1
ye = -yf= + (yf:.)= - (l'f=..):: 0<t<B (10)

y(f1, + f." g.) - (yfZ )s = 0 at z = 0 for all t (11)


if., -(yf')s=0 at z = 1 for all t (12)
Equation (9) then becomes
f01
y(O,z) ax(9,z) dz = - fo 8v(t) dt (13)

Finally, we note that the variation in the first term of the objective
[Eq. (6), Sec. 11.2] as a.result of the infinitesimal change Sv(t) is
fo E2[x(O,z);z} Ex(9,z) dz

If we complete the definition of 'y with


y=E2 att=0forallz (14)
then we may write
foI E=[x(9,z);z] ax(9,z) dz = - fo av(t) dt (15)

11.4 THE MINIMUM PRINCIPLE


In order to establish a minimum principle analogous to that obtained in
Sec. 6.8 we must consider the effects on the objective of a finite perturba-
tion in the optimal decision function over a very small interval. If u(t)
is the optimal decision, we choose the particular variation

Su(t) = 0
0 < t < t,
t1+A<t<tl (1)
Su(t) = finite t, < t < t, + 0
where A is infinitesimal. Thus, during the interval 0 < t < t1 the value
of v(t) is unchanged, and Sv(t1) = 0.
During the interval t1 < t < t1 + A it follows from Eq. (5) of
Sec. 11.2 that Sv must satisfy the equation
T(bv), = h(v + Sv, u + du) - h(f,u) (2)
or

dv(t) = T I [h(v + Sv, u + Su) - h(v,u)] dl (3)

The integral may be expressed to within o(d) as the integrand evaluated


at l1 multiplied by the integration interval. Thus, using the fact that
DISTRIBUTED-PARAMETER SYSTEMS 363

Wt,) is zero we may write


t - t,
av(t) = {h[v(ti), u(ti) + au(tl)1 - o(o)
t1 < t < t, + A (4)
and, in particular,
av(tI + o) = (h[v(t,), u(ti) + bu(rl)] - h[v(t,),u(ts)]I + o(o)
T
(5)
Because A is infinitesimal this change in v(t) resulting from the finite
change bu is infinitesimal.
Following time t, + A the variation au is again zero. The varia-
tional equation describing the infinitesimal change in v is then, to within
o(ff),
T(av), = h,(v,u) av t, + o < t < 0 (6)
which has the solution, subject to Eq. (5),

av(t) = [h(i,u) - h(v,u)] exp {T f+ ho[vO,uO] d2 } + o(o)


ti+A<"t<8 (7)
where the term in brackets preceding the exponential is evaluated at
t=t,.
Combining Eq. (7) with Eqs. (6) and (15) of Sees. 11.2 and 11.3,
respectively, we obtain the expression for the change in S resulting from
the finite change in u -

[h(v,u) - h(v,u)] f-f (OW ." (8,0) g' (8)


fe
exp [T h,(t) dEJ ds + A[C(u) - C(u)} + o(A) > 0 (5)
where the inequality follows from the fact that u(t) is the function which
minimizes &. Dividing by A and then taking the limit as d -- 0 and not-
ing that t, is completely arbitrary,'we obtain the inequality

C(u) f y(s,0)f=..(s,0)ge(s) exp [T f d 1 ds


J
[f
T

(e
> C(u) Je
y(s,0)f._(s,0)g.(s) exp T had) dE] ds (9)
T

That is, it is necessary that the optimal function u(t) minimize


min C(u)
U(t) T t exp I 1 f d li,(t) dEj ds
T t
(10)
364 OPTIMIZATION BY VARIATIONAL METHODS

i
everywhere except possibly at a set of discrete points. If the final time 9
is not specified, we obtain a further necessary condition by differentiating
Eq. (6) of Sec. 11.2 with respect to 0 and equating the result to zero to
obtain
0 unspecified: foI y(B,z)f(B,z) dz + C[u(9)] = 0 (11)

In the important case of the lagless controller, where


h(v,u) = u - v
and r goes to zero, we can utilize the fact that for small r the integrand in
Eq. (10) is negligibly small for all time not close to t. Taking the limit
as r -+ 0 then leads to the result
u = v: min C(u) - uy(t,0)f(t,0)g (t) (12)
u(t)

with v equal to the minimizing value of u.

11.5 LINEAR HEAT CONDUCTION


The linear heat-conduction system examined in Sec. 3.12 with a quad-
ratic objective has the property that y and x can be obtained explicitly
in terms of u. If we consider

-
xt _ x :`
0<z<1 (1)
0<t<0
X. = 0 at z = 1 for all t (2)
xz = p(x - v) at z = 0 for all t (3)
x=0 att=0forallz (4)
rv,u - V 0 <t<0 (5)
fol
min E = [x-(z) - x(B,z)]2 dz + fo C[u(t)] dt (6)

where 0 is specified, 17 a constant, and x*(z) a specified function, Eqs. (10)


to (12) and (14) of Sec. 11.3 for y then become
0<z<1 (7)
y`--y" o<<t<0
py - y. = 0 at z = 0 for all t (8)
y, = 0 at z = 1 for all t (9)
y = -2[x*(z) - x(@,z)] at t = 0 for all z (10)
Equations (1) to, (5) and (7) to (10) are solved in terms of u(t) by
elementary methods such that the minimum principle, Eq. (10) of the
preceding section, becomes
min C[u(t)] + u(t) [fo G(l,s)u(s) dx - kt)] (11)
u(t)
DISTRIBUTED-PARAMETER SYSTEMS 366

where, as in Sec. 3.12,


41 (0 = foI x"(z)K(0 - t, z) dz (12)

G((s) = foI K(8 - t, z)K(O - s, z) dz (13)


and
a2 cos all - z) a-a,t
K(t ,z)
Cos a - a/q sin a
Cos (1 - z)/ii e-5"t
+ 2a2 (14)
.L1 (a2 - t.2)[1/P + (1 + P)/O,2J cos t,
with a = 1/Vr and tic the real roots of
0 tan 0 = p (15)
Thus, for example, if C(u) = %c1u2 and u(t) is unconstrained, the
minimum of Eq. (11) is obtained by setting the partial derivative with
respect to u(t) to zero. The optimal control is then found as the solution
of the Fredholm integral equation of the second kind
fo G(t,s)u(s) ds + c2u(t) = 4,(t) (16)

If, on the other hand, C(u) = 0 and u(t) is constrained by limits


u, < u < u', Eq. (11) is linear in u and the optimal control is at a
limit defined by
= u fa G(t,s)u(s) ds - '(t) < 0
u(t) (17)
U* G(t,s)u(s) ds - ¢(t) > 0
f0
An intermediate (singular) solution is possible as the solution of the
Fredholm integral equation of the first kind
fo G(t,s)u(s) ds = '(t) (18)

Equation (18) is the result obtained as a solution to the simplest problem


in the calculus of variations in Sec. 3.12.

11.6 STEEP DESCENT


For determination of the optimum by means of a steep-descent calcu-
lation we first choose a trial function u(t) and then seek a small change
ft(t) over all t which leads to an improved value of the objective C. We
need, then, the effect on C of a continuous small variation in u, rather
than the finite sharply localized variation used in the development of
the minimum principle.
3" OPTIMIZATION BY VARIATIONAL METHODS

From Eqs. (6) and (15) of Sees. 11.2 ajid 11.3, respectively, we have,
for any small change bu, the first-order expression for the corresponding
change in Is,

bs = fo Cu(t) bu(t) dt - fo y(t,O)f...(t,O)g.(t) bv(t) dt (1)

Here, by can be -related to a continuous small change bu by the vari-


ational equation derived from Eq.' (4) of Sec. 11.2
r 6v, = h, by + h. bu (2)
or

bv(t) = T fo exp LT f` h,(E) dE] bu(s) ds (3)

Thus, by substituting for by in Eq. (1) and changing the order of inte-
gration we obtain the expression for the variation in the objective
SE=fo {C
hu f ° y(s,O)f...(s,O)g.(s) exp [T f ' h,(E) dt] I bu(t) dt (4)

The choice of 'bu(t) which leads to a decrease in S is then

au(t) = -w(t) I C.

- hfT
9
'(s,0)f=..(s,0)g.(s) exp [
f. d>:] ds} (5)

where w(t) > 0 reflects the geometry of the space and is chosen suf-
ficiently small to ensure that a constraint is not violated. For r = Of
with u equal to v, we obtain
bu(t) = -w(t)[C - y(t,0)f (t,0)g.(t)J (6)

11.7 COMPUTATION FOR LINEAR HEAT CONDUCTION


As a first application of the computational procedure to a distributed
system we return to the linear heat-conduction problem described in Sec.
11.5 without the cost-of-control term. The system equations ere
_
x,-x:: 0<z<1
0<t<0 (1)

xy=0 atz= lfor all t (2)


x, = p(x - v) at z = 0 for all i (3)
x=0 att=0foralIz (4)
rv,=u - V 0 <t<0 (5)
DISTRIBUTED-PARAMETER SYSTEMS 397

u* <u <u* (6)


E = 12[x*(z) - X(O,Z)]2 (7)
C=0 (8)
_ 0 < z < 1
0<t<0 (9)

py-ye=0 atz=0forallt (10)


y.=0 at z= 1 for all (11)
y = -2[x*(z) - x(O,z)) at t = 0 for all z (12)

The necessary conditions for an optimum are given in Sec. 11.5 by Eqs.
(17) and (18), while the direction of steep descent is determined by
e
ou(t) _ -w(t) p f y(8,0)e(t )!T ds (13)

The linearity of the system makes it possible to obtain and use an


analytical expression for y(s,0).
The parameters used in the numerical solution are
p=10 r=0.04
x*(z) = constant = 0.2
u*=0<u<1=u*
Two cases are considered, 0 = 0.2 and 0 = 0.4, with the starting func-
tions u(t) taken as constant values of u(t) = 0, 0.5, and 1.0 in each case.
Numerical integration was carried out using Simpson's rule, with both z
and t coordinates divided into 50 increments. w(t) was arbitrarily set as
10, or the maximum less than 10 which would carry u to a boundary
of the admissible region, and halved whenever a decrease in the objective
was not obtained.
Figure 11.1 shows several of the successive control policies calcu-
lated from Eq. (13) for 0 = 0.2 with a constant starting policy of u = 0.5,
no improvement being found after 10 iterations. The policy is seen to
approach a bang-bang controller, with the exception of a small time
increment at the end, and the final policies from the other two starting
values were essentially the same. The successive values of the objective
are shown in Fig. 11.2, with curves I, II, and III corresponding, respec-
tively, to starting values u(t) = 0, 0.5, and 1. The three temperature
profiles x(0,z) at t = 0 corresponding to the starting control policies,
together with the final profile, are shown in Fig. 11.3, where it can be
observed from curve III' that no control policy bounded by unity can
raise the temperature at z = 1 to a value of 0.2 in a time period 0 = 0.2.
Figure 11.4 shows several of the iterations for the optimal control
with 0 = 0.4, starting with the constant policy u(t) = 0.5, with no.
improvement possible after 11 descents. The curve for the second itera
368 OPTIMIZATION BY VARIATIONAL METHODS

1.0

0.8

00.6
CL

0.4
0
U
0.2
2 3

0.4 0.8 0.12 0.16 0.20


Time t
Fig. 11.1 Successive approximations to the optimal control policy
using steep descent, 8 = 0.2. [From M. M. Denn, Intern. J. Control,
4:167 (1966). Copyright 1966 by Taylor and Francis, Ltd. Re-
printed by permission of the copyright owner.]

tion is not shown beyond t = 0.25 because it differs in general by too


little from the optimal curve to be visible on this scale. The sharp oscilla-
tions in the optimal controller at alternate points in the spatial grid were
felt to have no physical basis, and a crude smoothed approximation to
the optimum was then used as a starting function for the steep-descent
program. This converged to the dashed curve shown in Fig. 11.4, which
has very nearly the same value of the objective as the oscillating result.
Figure 11.5 shows the reduction in the objective for steep-descent

Fig. 11.2 Reduction in the objective on


successive iterations, 8 = 0.2. [From
M. M. Denn, Intern. J. Control, 4:167
(1966). Copyright 1966 by Taylor and
Francis, Ltd. ' Reprinted by permission
I I I

2 4 6 8
Iteration number of the copyright owner.]
DISTRIBUTED-PARAMETER SYSTEMS 36!

0.2 04 0.6 1.0


Position z

Fig. 11.3 Temperature profiles at t = 8 using initial and


optimal control policies, 8 - 0.2. ]From M. M. Denn,
Intern. J. Control, 4:167 (1966). Copyright 1966 by Taylor
and Francis, Ltd. Reprinted by permission of the copyright
owner.l

calculations starting from constant control policies of u(t) = 0, 0.5, and 1,


corresponding to curves I, II, and III, respectively, with the temperature
profile at t = 0 for each of these starting policies shown in Fig. 11.6. The
optimum is indistinguishable from 0.2 on the scale used. The three
unsmoothed policies are shown in Fig. 11.7. They indicate substantial
latitude in choosing the control, but with the exception of the oscillations
in the later stages, which can be smoothed by the same computer program,
all are smooth, intermediate policies, corresponding to approximations
to the singular solution defined by the integral equation (18) of. Sec. 11.5.

1.0

0
a
60.4
c
0
U
0.2

0.04 0.08 0.12 0.16 0.20 0.24 0.28 0.32 0.36 0.40
Time t

Fig. 11.4 Successive approximations to the optimal control policy


using steep descent, 8 - 0.4. [Front M. M. Denn, Intern. J. Control,
4:167 (1966). Copyright 1966 by Taylor and Francis, Ltd. Re-
printed by permission of the copyright owner.]
IE OPTIMIZATION BY VARIATIONAL METHODS

10-

to-

10-
Fig. 11.5 Reduction in the objective on
successive iterations, 0 = 0.4. [From
M. M. Denn, Intern. J. Control, 4:167
10- (1966). Copyright 1966 by Taylor and
4 6 8 10 12
Francis, Ltd. Reprinted by permission
Iteration number of the copyright owner.)

It is interesting to note the change in the optimal control strategy from


bang-bang to intermediate as more time is available. It has sometimes
been suggested in the control literature that steep descent cannot be
used to obtain singular solutions to variational problems, but it is clear
from this example that such is not the case.
1.0

ti
m 0.8

0.6
0
w 0.4
E
0.2

0 0.2 0.4 0.6 0.8 1.0


Distance z

Pig. 11.6 Temperature profiles at t = 0 using initial and


optimal control policies, 0 - 0.4. [From M. M. Denn,
Intern: J. Control, 4:167 (1966). Copyright 1966 by Taylor
and Francis, Ltd. Reprinted by permission of the copyright
owner. I
DISTRIBUTED-PARAMETER SYSTEMS 371

0.04 0.08 0.12 0.16 0.20 0.24 0.28 0.32 0.36 0.40
Time f
Fig. 11.7 Unsmoothed intermediate optimal control policies for three
starting functions, 0 - 0.4. [From M. M. Denn, Intern. J. Control,
4:167 (1966). Copyright 1966 by Taylor and Francis, Ltd. Reprinted
by permission of the copyright owner.]

11.8 CHEMICAL REACTION WITH RADIAL DIFFUSION


We have considered the optimal operation of a batch or pipeline chemical
reactor in Sec. 3.5 with respect to the choice of operating temperature and
in Sec. 5.11 with respect to the optimal heat flux. In any realistic con-
sideration of a packed tubular reactor it will generally be necessary to
take radial variations of temperature and concentration into account,
requiring even for the steady state a description in terms of two inde-
pendent variables. The study of optimal heat-removal rates in such a
reactor provides an interesting extension of the results of the previous
sections and illustrates as well how an optimization study can be used in
arriving at a practical engineering design.
In describing the two-dimensional reactor it is convenient to define
x(t,z) as the degree of conversion and y(t,z) as the true temperature
divided by the feed temperature. The coordinates t and z, the axial
residence time (position/velocity) and radial coordinate, ;espectively, are
normalized to vary from zero to unity. With suitable physical approxi-
mations the system is then described by the equations
ax _ A (1 a /z L 0<t<1 (1)
-[- Darr(x,y)
cat Pe zaz` az 0<z<1
ay A 1a z ay + Dairlr(x,y)
0< t< 1
at X7 z r az 0< z< 1 (2)

Here A is the ratio of length to radius, Pe and Pe' the radial Peclet num-
bers for mass and heat transfer, respectively, Dal and Datii the first and.
third Damkohler numbers, and r(x,y) the dimensionless reaction rate.
372 OPTIMIZATION BY VARIATIONAL METHODS

The boundary conditions are


x=0 att=0forallz (3a)
y=1 at t = 0forallz (3b)
ax =
a 0 at z = 0 for all t (3c)

az = 0 at z = 0 for alit (3d)


ax az=0
atz=lforallt (3e)

ay
az = - u(t) at z = I for all t (3f)

The dimensionless wall heat flux u(t) is to he chosen subject to bounds


u* < u(t) < u* (4)

in order to maximize the total conversion in the effluent

t(1) = 2 foI zx(l,z) dz (5)


41
The generalization of the 'inalysis of Secs. 11.2 to 11.4 and
11.6 to include two dependent variables is quite straightforward, and we
shall simply use the results. We shall denote partial derivatives explic-
itly here and use subscripts only to distinguish between components y,
and y2 of the Green's vector. The multiplier equations corresponding to
Eq. (10) of Sec. 11.3 are
ayi = ar + A a yi _ a2y1
at
- (Daiyi + Daliiy2) ax Pe az z
4922 (6)
at- 2
5-,-)
= -(Dajyi + Daiij72) ay + PP Gz 72 (7)

while the boundary conditions corresponding to Eqs. (11), (12), and (14)
are
yl = -2z at t = 1 for all z (8a)
72 = 0 at t = 1 for all z (8b)
y,=y20 atz=0foralf t (8c)
z2
y, azl = y2 - =0 at z = l for all t (8d)

The minimum principle corresponding to Eq. (12) of Sec. 11.4 is


min
u(()
- Pei 720100)
A
(9)
DISTRIBUTED-PARAMETER SYSTEMS .37

The linearity of the minimum criterion requires that the optimal


cooling function have the form
u* 72(1,1) > 0
U11 (10)
u* 72(t,l) < 0
with intermediate cooling possible only if 72(t,l) vanishes for some finite
interval. This is the structure found for the optimal operation of the
reactor without radial effects in Sec. 5.11. Indeed, the similarity can
be extended further by noting that if Eq. (8d) is taken to imply that 12 is
approximately proportional to z near z = 1, then the second parenthesis
on the right side of Eq. (7) must be close to zero near z = 1. Thus
12(1,1) can vanish for a finite interval only if the reaction rate near the
wall is close to a maximum; i.e., the singular solution implies
8r
0 z=1 (11)
ay
which must be identically true without radial effects. If the parameters
are chosen such that u* = 0 and 12(1,1) is initially zero, then since when
u = 0 initially there are no radial effects, the switch to intermediate
cooling should be close to the switch for the one-dimensional reactor.
Computations for this system were carried out using steep descent,
the direction corresponding to Eq. (6) of Sec. 11.6 being
BU(t) =. w(t)72(t,l) (12)
For the computations the reaction was taken to be first order, in which
case the dimensionless reaction rate has the form
r(x,y) = eE,'ITo[(1 _ x)e-E,'IYTo - kxe-E,'/VToJ (13)
where To is the temperature of the feed. The physical parameters were
chosen as follows:
Pe = 110 Pe' = 84.5
Ei = 12,000 E' =,25,000
A=50
Dal = 0.25 Darn = 0.50
To = 620 u* = 0
u* was not needed for the parameters used here. The solution, for the
case without radial effects, following the analysis of See. 5.11, gives an
initially adiabatic policy (u = 0) with a switch to an intermediate policy
at t = 0.095.
Solutions of the nonlinear partial differential equations were
obtained using an implicit Crank-Nicholson procedure with various grid
sizes. Figure 11.8 shows successive values of the heat flux starting from
374 OPTIMIZATION BY VARIATIONAL METHODS

Fig. 11.8 Successive approximations to


the optimal heat flux using steep descent.
[From M. M. Denn, R. D. Gray, Jr., and
J. R. Ferran, Ind. Eng. Chem. Funda-
mentals, 5:59 (1966). Copyright 1966
by the American Chemical Society. Re-
0.5 0 printed by permission of the copyright
Axial distance t owner.)

a nominal curve arbitrarily chosen so that a reactor without diffusion


would operate at a dimensionless temperature y, of 1.076. w(t) was taken
initially as 0.11f' [y2(t,l)J2 dt and halved when no improvement was
obtained. For these computations a coarse integration grid was used
initially, with a fine grid for later computations. No improvement could
be obtained after the fourth iteration, and the technique.-vas evidently
unable to approach a discontinuous profile, although the sharp peak at
approximately t = 0.075 is suggestive of the possibility'of such a solution.
The successive improvements in the conversion are shown by curve I in
Fig. 11.9.
The next series of calculations was carried out by assuming that
the optimal policy would indeed be discontinuous and fixing the point
at which a switch occurs from adiabatic to intermediate operation. The
steep-descent calculations were then carried out for the intermediate
section only. Figures 11.10 and 11.11 show such calculations for the
switch at t = 0.095, the location in the optimal reactor without radial
variations. The starting policy in Fig. 11.10 is such that tOsection
beyond t = 0.095 in a reactor without radial effects would remain at a
constant value of y of 1.076 [this is the value of y at t = 0.095 for u(t) = 0,
i<0.0951. The values of the objective are shown in Fig. 11.9 as curve II,
with no improvement after two iterations. The starting policy in
Fig. 11.11 is the optimum for a reactor without radial effects, with suc-
cessive values of the conversion shown as curve III in Fig. 11.9. It is
significant that an optimal design which neglects radial effects when they
should be included can result in a conversion far from the best possible.
The maximum conversion was found to occur for a switch to intermediate
DISTRIBUTED-PARAMETER SYSTEMS 375

0.16

0.14

0
U
0 10

0
Fig. 11.9 Improvement in outlet con-
version on successive iterations. [From
0.08
M. M. Denn, R. D. Gray, Jr., and
J. R. Ferron, Ind. Eng. Chem. Funda-
mentals, 5:59 (1966). Copyright 1966
by the American Chemical Society. Re- 0.06
1 1

printed by permission of the copyright


1

0 1 2
owner.] Iteration number

operation at t = 0.086, but the improvement beyond the values shown


-here for a switch at t = 0.095 is within the probable error of the computa-
tionaat-seh eme.
One interesting feature of these 'ellealations was the discovery that
further small improvements could be obtained-followcing the convergence
of the steep-descent algorithm by seeking changes in u(0-in-_the direction
opposite that predicted by Eq. (12). This indicates that the cumulative

15

Fig. 11.10 Successive approximations to 0.5


the optimal heat flux using steep descent
with switch point specified at t = 0.095.
[From M. M. Denn, R. D. Gray, Jr., and
J. R. Ferron, Ind. Eng. Chem. Funda-
mentals, 5:59 (1966). Copyright 1966 by
the American Chemical Society. Reprint- 0 L; J

0 0.5 I.0
ed by permission of the copyright owner.] Axial distance t
376 OPTIMIZATION BY VARIATIONAL METHODS

Fig. 11.11 Successive approximations to


the. optimal heat flux using steep descent
with switch point. specified at t e 0.095.
[From M. M. Denn, R. D.+Gray, Jr., and
J. R. Ferron, Ind. Eng. Chem. Funda-
I i
mentals, 5:59 (1966). Copyright 1906 by
0.5 1.0 the American Chemical Society. Reprint-
Axial distance f ed by permission of the copyright owner.)

error associated with solving Eqs. (1) and (2) followed by Eqs. (6) and (7)
was sufficiently large to give an incorrect sign to the small quantity
y2(t,1) in the region of the optimum. Such numerical difficulties must
be expected in the neighborhood of,the optimum in the. solution of varia-
tional problems for nonlinear distributed systems, and precise results
will not be obtainable.
From an engineering point of view the results obtained thus far
represent a goal to be sought by a practical heat-exchanger design. The
coolant temperature required to produce the calculated beat-removal
rate u(t) can be determined by the relation
ay(t_1)
aZ
_ -u(t) = n[yJt) - y(t,l)l (14)

where ye is the reduced coolant temperature and +1 a dimensionless overall


heat-transfer coefficient times surface area. Using a value of n = 10,
the functions y,(t) were calculated for the values of u(t) obtained from
the steep-descent calculation. The variation of ye as a function of t was
generally found to be small, and a final series of calculations was carried
out to find the best constant value of y,,, since an isothermal coolant is
easily obtained in practice. Here the best switch from adiabatic opera-
tion was found at t = 0.07i and yc = 0.927, but the results were essen-
tially independent of the switch point in the range, studied. For a switch
at.t = 0.095, corresponding to the calculations discussed here, the best
value of ye was found to be 0.925. The corresponding function u(t),
DISTRIBUTED-PARAMETER SYSTEMS 377

calculated from Eq. (14), is plotted as the dashed line in Fig. 11.12,
together with the results from the steep-descent calculations. The cor-
respondence with the results from the rigorous optimization procedure
is striking, indicating for this case the true optimality of the conventional
heat-exchanger design, and, in fact, as a consequence of the numerical
difficulties in the more complicated steep-descent procedure, the con-
version for the best constant coolant is slightly above that obtained from
the steep-descent computation.
This example is a useful demonstration of several of the practical
engineering aspects of the use of optimization theory. An optimal design
based on an incomplete physical model, such as neglect of important
radial effects, might give extremely poor results in practice. A careful
optimization study, however, can be used to justify a practical engineer-
ing design by providing a theoretical guideline for required performance.
Finally, in complicated systems the application of the theory is limited
by the sophistication and accuracy of the numerical analysis, and all
results in such'situations must be taken as approximate.

11.9 LINEAR FEEDFORWARD-FEEDBACK CONTROL


The two examples of optimization problems in distributed-parameter
systems which we have examined thus far have both been situations in
which the decision function has appeared in the boundary conditions.
In a number of important applications the decision function enters the
partial differential equation directly. The simplest such situation, which

Fig. 11.12 Heakflux profiles obtained


using steep descent from three starting
functions. The dashed line corresponds
to an isothermal coolant. [After M. 111.
Denn, R. D. Gray, Jr., and J. R. Ferron,
Ind. Eng. Chem. Fundamentals, 5:59
(1966). Copyright 1966 by the Ameri-
can Chemical Society. Reprinted by per-
mission of the copyright owner.)
378 OPTIMIZATION BY VARIATIONAL METHODS

includes as a special case the regulation of outflow temperature in a cross-


flow heat exchanger, would be described by the single linear hyperbolic
equation
ax. 0<t < a
V a x = Ax + Bu (1)
T+ 0<z<1
where the coefficients V, A, and B are constants and u is a function only
of t. We shall restrict attention to this case, although the generalization
to several equations and spatially varying coefficients and control func-
tions, as well as to higher-order spatial differential operators, is direct.
The system is presumed to be in some known state at time t = 0.
We shall suppose that disturbances enter with the feed stream at z = 0,
so that the boundary condition has the form
x(t,0) = d(t) (2)
and that the object of control u(t) is to maintain a weighted position
average of x2 as small as possible, together with a cost-of-control term.
That is, we seek the function u(t) which minimizes the cumulative error

S fo [ f of C(z)x2(t,z) dz + u2(t) ] dt (3)

This formulation includes * the special case in which we wish only to


regulate x at z = 1, for the special choice C(z) = C6(1 - z), where S(¢)
is the Dirac delta, leads to

g = 2 fo [Cx2(t,1) + u2(t) dl] (4)

The necessary conditions for optimality are obtained in the usual


manner, by constructing the Green's function for the variational equa-
tions and examining the effect of a finite change in u(t) over an infinites-
imal interval. The details, which are sufficiently familiar to be left as a
problem, lead to a result expressible in terms of a hamiltonian,
H = 3zCx2 + 322u2 + y(Ax + Bu) (5)
as
CIX ax
at 8 = Ax + Bu
+ V az -
(6)

ay+VayaHCx-Ay
at az ax
y(O,z) = y(t,1) = 0
min 0
H de
u(t)
DISTRIBUTED-PARAMETER SYSTEMS 379

The minimum condition in turn implies


u(t) = -B fol y(t,t) dt (10)
with a minimum ensured by requiring C(z) > 0.
The system of Eqs. (6), (7), and (10) are analogous to the lumped-
parameter system studied in Sec. 8.2, where we found that the Green's
functions could be expressed as a linear combination of the state and dis-
turbance variables, provided the disturbances could be approximated as
constant for intervals long with respect to the system response time.
We shall make the same assumption here concerning d(t), in which case,
interpreting x and y at each value of z as corresponding to one component
of the vectors x and y in the lumped system,'the relation analogous to
Eq. (7) of Sec. 8.2 is
101
y(t,z) = M(t,z,E)x(i,E) A + D(t,z) d(t) (11)

The boundary conditions of Eq. (8) require


M(t,l,:;) = M(e,z,t) = D(t,l) = D(8,z) = 0 (12)

It follows from the results for the lumped system (and will be established
independently below) that M is symmetric in its spatial arguments
M(t,z,) = M(t,E,z) (13)

Substitution of Eq. (11) into the partial differential equation (7)


yields

+ V ay = -C(z)x(z) - A f 1 M(z,E)x() dt - AD(z)d (14a)


at
Here and henceforth we shall suppress any explicit dependence on t, but
it will frequently be necessary to denote the spatial dependence. It is
convenient to rewrite Eq. (14a) in the equivalent form

at + V az =- fo' [C(l:)b(z - ) + AM(z,>;)}x() dt - AD(z)d


(14b)
On the other hand, Eq. (11) requires that
fo1 a]l a(z,E)
at + V or = x(t) dt + foI M(z,E) dt
+ad+fol (15)

which becomes, following an integration by parts and substitution of


300 OPTIMIZATION BY VARIATIONAL METHODS

Eqs. (2), (6), (7), (11), and (13),


ay ay art(z,t aM(z,E
+ All1(z,E)
at + V az = Jo at- + ` oz +V at
- B2 [Jo1 M(z,o) do]
[ J' df] } x(t) dt + JaDt
+ V aD
- B2 [J0' M(z,a) do] [Jo D(E) dt] + VM(z 0)} d (16)
Equations (14) and (16) represent the same quantity, and thus the
coefficients of x and d must be identical. We therefore obtain the two
equations

I
+ v(/ a+ a3f J+ 2AM
B2 [ fo' M(z,o) do] [ fo' df] + C(t)S(z - E) = 0 (17)
OD
+ V aZ + AD - B2 [fo' M(z,v) do]
[Joy D(Z) dE]
+ VM(z,0) = 0 (18)
Any solution of Eq. (17) clearly satisfies the symmetry condition, Eq.
(13). In the special case V = 0 Eq. (17) reduces with two integrations
to the usual Riccati equation for lumped systems, Eq. (10) of Sec. 8.2.
For the industrially important case of 0 - oo Eqs. (17) and (18), as in
the lumped-parameter analogs, tend to solutions which are independent
of time. We shall consider regulation only of the exit stream, so that
C(E) = C6(1 - E), and we let 0 - oo. The optimal control is computed
from Eqs. (10) and (11) as
u(t) = fo' GFB(z)x(t,z) dz + GFF d(t) (19)

where the feedback (FB) and feedforward (FF) gains are written in
terms of the solutions of Eqs. (17) and (18) as
GFB(z) = -B f0' M(z,E) dE (20)
GFF = -B fO' D(t) dt (21)
Because of the term 5(1 - ) the steady-state solution to Eq. (17)
is discontinuous at the values r = 1, t = 1. By using the method of
characteristics a solution valid everywhere except at these points can be
obtained in the implicit form

M(z,E) _ - 1 i-(F-:)s(E-z) e-(2A/V)(z-w)GFB(0)GFB(,7 - z + ) dj


f
+ ti e(2a(V)(1-t)5(t - z) (22)
DISTRIBUTED-PARAMETER SYSTEMS 781

Here the Heaviside step function is defined as

S(Z (23)
- z) = {1 >z
Integration of Eq. (22) then leads to an etuation for the feedback gain
CB e(2AIV)(1-0
GFB(z)
V
+ B j' GFB(,j)e-(2A1V)(t-0 do rL' GFB(t) dt (24)
Equation (24) is in a form which is amenable to iterative solution for the
optimal gain. Equation (18) in the steady state is simply a linear first-
order equation which can be solved explicitly for the feedforward gain
GFF _ B Jo' dz e-(AI')( t) d Jfl e-(2A1v)E-,)GF 1(,?)G1B(n - t) do
V - B f ' dz 1',;e-(AIV)(:-e'GFB(t) dl;

(25)
This result can be easily generalized to include nonconstant dis-
turbances which are random with known statistical properties, in which
case the feedforward gain GFF depends upon.the statistics.of the dis-
turbance. In particular, if d(t) is uncorrelated ("white") noise, GFF is
zero and the optimal control is entirely feedback. The feedback section
of the optimal control requires the value of x at every position z, which is
clearly impossible. The practical design problem then becomes one of
locating a minimum number of sensing elements in order to obtain an
adequate approximation to the integral in Eq. (19) for a wide range of
disturbance states x(t,t).

11.10 OPTIMAL FEED DISTRIBUTION IN PARAMETRIC PUMPING


Several separation processes developed in recent years operate periodi-
cally in order to obtain improved mass transfer. One.such process,
developed by Wilhelm and his associates and termed parametric pump-
ing, exploits the temperature dependence of adsorption; equilibria to
achieve separation, as shown schematically in Fig. 11.13. A stream
containing dissolved solute is alternately fed to the top and bottom of
the column. When the feed is to the top of the column, which we
denote by the time interval 71, the stream is heated, while during feed
to the bottom in the interval ra the stream is cooled. There may be
intervals 'r2 and 74 in which no feed enters. A temperature gradient is
During r1 cold product lean in the
therefore established in the column.
solute is removed from the bottom, and during ra hot product rich in
312 OPTIMIZATION BY VARIATIONAL METHODS

Hot feed Hot product (rich)


311

Valve open during r3


during r,

Feed

Cold feed old product (lean)


911

Valve open
during r,
during r3

Fig.11.13 Schematic of a parametric-pumping separation


process.

solute is removed from the top. This physical process leads to an inter-
esting variational problem when we seek the distribution of hot and cold
feed over a cycle of periodic operation which maximizes the separation.
Denoting normalized temperatures by T and concentrations by c,
with subscripts f and s referring to fluid and solid phases, respectively,
the mass and energy balances inside the column (0 < z < 1) are
82T, aT, aT, aT.
+ OK =o (la)
az2 + u(t) a + at
aat' + -t (T, - T,) =0 (lb)
a2c, ac, ac, ac,
u(t) 0 (lc)
'' aZ2 + a + at + X
at
ar+a(c*-c.)=0

Here, c* is the equilibrium concentration of solute which, for the salt


solutions studied by Rice, is related to solution concentration and tem-
perature by the empirical equation
C. )$.I G-0. 447.
c* = 111
88

u(t) is the fluid velocity, which is negative during TI (downflow, hot feed),
positive during Ta (upflow, cold feed), and zero during 72 and 7 4 (periods
DISTRIBUTED-PARAMETER SYSTEMS 393

of no flow). The values used for the other physical parameters are
#K 1.10 K=1.38
y=200 a=0.3
,,n - 10'
In order to simplify the model for this preliminary study two
approximations were made based on these values. First, y was taken
as approaching infinity, implying instantaneous temperature equilibra-
tion. In that case T. = T1 = T. Secondly, dispersion of heat and mass
were neglected compared to convection and ' and 71 set equal to zero.
This latter approximation changes the order of the equations and was
retained only during the development of the optimization conditions.
All calculations were carried out including the dispersion terms. The
simplified equations describing the process behavior are then
OT
(1 + RK) = -u aT (3a)
act ac, ac, (3b)
at az at
a at- = X (C!
- C*) (3c

with boundary conditions


r,, u < 0: T=1
cf = 1 at z = 1 (4a)
ra, u > 0: T=0
cj = 1 at z = 0 (4b)
All variables are periodic in time with period 0. An immediate conse-
quence of this simplification is that Eq. (3a) can be formally solved to
yield
T 1z - l 1 OK fo u(r) drl = const (5)

Periodicity then requires

fe
o
u(r) dr = 0 (6)

The amount of dissolved solute obtained in each stream during one


cycle is
Rich product stream: fT,u(t)c1(t,l) dt (7a)

Lean produet Scream: - f u(t)cj(t,O) dt (7b)

Thus, the net separation can be expressed as


(P = f u(t)c;(t,1) (it + f u(l)cf(t,O) dt
I (8)
384 OPTIMIZATION BY VARIATIONAL METHODS

The total feed during one cycle is


V= 0
ju(t)j dt = f u(t) dt - ff, u(t) dt (9)

The optimization problem is that of choosing u(t) such that (P is maxi-


mized (-(P minimized) for a fixed value of V. To complete the specifi-
cation of parameters we take 8 = 2, V = 2.
The linearization and construction of Green's functions proceeds in
the usual manner, though it is necessary here to show that terms result-
ing from changes in the ranges of rl and r3 are of sicond order and that
Eqs. (6) and (9) combine to require
f, au(t) dt = 3u(t) dt = 0 (10)
Ifs
The Green's functions are then found to satisfy

(1 + OK) aatl + u az1 - ay, aT = 0 (11a)

aat2
+ u a22 + Xy3 = 0 (11b)

ay3 aC + K 1372 = 0 (11c)


X73
at aC. at

71: y1(t,0) = 0 72(t,0) = -1 (12a)

r3:' 71(t,1) = 0 y2(t,1) _ +1 (12b)

All three functions are periodic over 8. The direction of steep descent
'for maximizing the separation (P is
1 aT ac
w(t) I y 4 1 + c/(t,0) - 10 (y1 az + 72 a J) dz in r1
r 1 aT + aCf)
2U(t) 743 + Cf(0) - f y1 y2 az J dz in ra
o az
(13)

Here w(t) > 0, and 741 and 743 are constants chosen to satisfy the restric-
tions of Eq. (10) on au.
The initial choice for the feed distribution was. taken to be sinusoidal
u(t) = -0.5 sin t (14)

The periodic solution to the system model was obtained by an implicit


finite difference method using the single approximation T. = T1 = T
(y oo) and with dispersion terms retained. The multiplier equations
were solved only approximately in order to save computational time, for
an extremely small grid size appeared necessary for numerical stability.
DISTRIBUTED-PARAMETER SYSTEMS 3M

First Eqs. (llb) and (lle) were combined to give a single equation in -Y2,
and the following form was assumed for application of the Galerkin
method introduced in Sec. 3.9:
72 = (2z - 1) + Clz(1 - z) cos t + C2[$(1 - z) + (1 - $)z] sin t
+ C3z(1 - z) cos 2t + C4[$(1 - z) + (1 - $)z] sin 2t (15)
where $ = +I when u > 0, j6 = 0 when u < 0. Evaluating the con-
stants in the manner described in Sec. 3.9 shows them to be
C, = - 5.54 C2 = 0.461
C3 = 0.056 C4 = -0.046
indicating rapid convergence, and, in fact, the C3 and C4 terms were
neglected for computation. y3 was then found by analytically integrating
Eq. (llc), treating the equation as a linear first-order equation with vari-
able coefficients. -y, was also estimated using the Galerkin method,
leading to the solution
y, = 0.0245[z(1 - z) cos t + [0(1 - z) + (1 - Q)z} sin t} (16)
The function Su(t) was computed from Eq. (13) for a constant
weighting function w(t) = k, and a function w(t) = k1sin t!. These are
shown normalized with respect to k in Fig. 11.14 as a dotted and solid
line, respectively. The dashed line in the figure is the function 0.5 u(t).
Both weighting functions indicate the same essential features when the

0.2

0.5

k
sa
0 0

.0.1
-0.5

-0.2
V
-1.0
a*
0 2 2 2

Fig. 11.14 Normalized correction to velocity. Solid line w(t) = klsint1; dotted line
w(t) = k; dashed line 0.5u(t) _ -0.25 sin t.
386 OPTIMIZATION BY VARIATIONAL METHODS

starting velocity function a(t) is compared with the change indicated


by the optimization theory in the form of Su. Amplification of the flow
is required in some regions for improved performance and reduction in
others. The value of the separation obtained for the initial velocity dis-
tribution (k = 0) was 61 = 0.150. For sinusoidal weighting and values
of k = 0.2, 0.4, and 0.8 the values of (Q were, respectively, 0.160, 0.170,
and 0.188. For constant weighting the value of (P for k = 0.1 was 0.160.
Further increase in k in either case would have required a new computer
program to solve the 'state equations, which did not appear to be a
fruitful path in a preliminary study of this kind. The dashed sinusoid
in Fig. 11.15 is the initial velocity and the solid line the value for sinus-
oidal weighting and k = 0.8.
The trend suggested by the calculated function 6u(t) is clearly
towards regions of no flow followed by maximal flow. Such a flow pat-
tern was estimated by taking the sign changes of du for sinusoidal weight-
ing to define the onset and end of flow, leading to the rectangular wave
function shown in Fig. 11.15. The separation is 0.234, an increase of
36 percent over the starting sinusoidal separation.
No further study was undertaken because the approximate nature

0.8 r-

0.6

04

0.2

u (f) 0

-0.2

-0.4

-0.6

0
I
3w
J
2
2w

Fig. 11.15Velocity functions. Dashed line u(t) = -0.5 sin t; solid line sinusoidal
weighting with k = 0.8; rectangular wave final estimate of optimum.
DISTRIBUTED-PARAMETER SYSTEMS 387

of the model did not appear to justify the additional computational effort.
The preliminary results obtained here do suggest a fruitful area of appli-
cation-of optimization theory when physical principles of the type con-
sidered here have been carried to the point of process implementation.

11.11 CONCLUDING REMARKS


This concludes our study of the use of Green's functions in the solution
of variational problems.. In addition to the examination of several
examples which are of considerable practical interest in themselves, the
purpose of this chapter has been to demonstrate the general applicability
of the procedures first introduced in Chaps. 6, 7, and 9. The variational
solution of optimization problems inevitably reduces to a consideration
of linearized forms of the state equations, whether they are ordinary
differential, difference, integral, partial differential, difference-differential,
differential-integral, or any other form. The rational treatment of these
linearized systems requires the use of Green's functions.
As we have attempted to indicate, in all situations the construction
of the Green's function follows in a logical fashion from the application
of the timelike operator to the inner product of Green's function and
state function. It is then a straightforward matter to express the varia-
tion in objective explicitly in terms of finite or infinitesimal variations in
the decisions, thus leading to both necessary conditions and computa-
tional procedures. As long as this application of standard procedures
in the theory of linear systems is followed, there is no inherent difficulty
in applying variational methods to classes of optimization problems not
examined in this book.

BIBLIOGRAPHICAL NOTES
Sections 111 to 11.7: These sections follow quite closely
M. M. Denn: Intern. J. Contr., 4:167 (1966)
ThePartieular problem of temperature control has been considered in terms of a minimum
principle for integral formulations by
A. G. Butkovskii: Proc. sd Intern. Congr. IFAC, paper' 513 (1963)
and by classical variational methods by
Y. Sakawa: IEEE Trans. Aulom. Cont., AC9:420 (1964); AC11:35 (1966)
Modal analysis, approximation by a lumped system, and a procedure for approximating
the solution curves are used by
1. McCausland: J. Electron. Contr., 14:635 (1963)
Proc. Inst. Elec. Engrs. (London), 112:543 (1965)
Sit OPTIMIZATION BY VARIATIONAL METHODS

Section 11.8: The essential results are obtained in the paper

M. M. Denn, R. D. Gray, Jr., and J. It. Ferron: Ind. Eng. Chem. Fundamentals,
6:59 (1966)
Details of the construction of the model and numerical analysis are in
R. D. Gray, Jr.: Two-dimensional Effects in Optimal Tubular Reactor Design, Ph.D.
thesis, University of Delaware, Newark, 1)el., 1965
The usefulness of an isothermal coolant and the effect of design parameters are investigated in
A. R. Hoge: Some Aspects of the Optimization of a Two-dimensional Tubular Reactor,
B.S. thesis, University of Delaware, Newark, Del., 1965
J. D. Robinson: A Parametric Study of an Optimal Two-dimensional Tubular Reactor
Design, M.Ch.E. thesis, University of Delaware, Newark, Del., I966

Section 11.9: This section is based on,

M. M. Denn: Ind. Eng. Chem. Fundamentals, 7:410 (1968)


where somewhat more general results are obtained, including control at the boundary z = 0.
An equation for the feedback gain corresponding to Eq. (17) for more general spatial
differential operators was first presented by Wang in his review paper
P. K. C. Wang: in C. T. Leondes (ed.), "Advances in Control Systems," vol. 1,
Academic Press, Inc., New York, 1964
A procedure for obtaining the feedback law using modal analysis is given by
D. M. Wiberg: J. Basic Eng., 89D:379 (1967)
Wiberg's results require a discrete spectrum of eigenvalues and are not applicable to
hyperbolic systems of the type studied here. An approach similar to that used here
leading to an approximate but easily calculated form for the feedback gain is in
L. B. Koppel, Y. P. Shih, and D. R. Coughanowr: Ind. Eng. Chem. Fundamentals,
7:296 (1968)

Section 11.10: This section follows a paper with A. K. Wagle presented at the 1989
Joint Automatic Control Conference and published in the preprints of the meeting.
More details are contained in
A. K. Wagle: Optimal Periodic Separation Processes, M.Ch.E. Thesis, University of
Delaware, Newark, Del., 1969
Separation by parametric pumping was developed by Wilhelm and coworkirs and reported
in
R. H. Wilhelm, A. W. Rice, and A. R. Bendelius: Ind. Eng. Chem. Fundamentals,
5:141 (1966)
R. W. Rolke, and N. H. Sweed: Ind. Eng. Chem. Fundamentals,
7:337 (1968)
The basic equations are developed and compared with experiment in the thesis of Rice,
from which the parameters used here were taken:
DISTRIBUTED-PARAMETER SYSTEMS 389

A. W. Rice: Some Aspects of Separation by Parametric Pumping, Ph.D. thesis,


Princeton University, Princeton, N.J., 1966

Section 11.11: Variational methods for distributed-parameter systems have received


attention in recent years. The paper by Wang noted above contains an extensive
bibliography through 1964, although it omits a paper fundamental to all that we have
done, namely,

S. Katz: J. Electron. Contr., 16:189 (1964)


More current bibliographies are contained in

E. B. Lee and. L. Markus: "Foundations of Optimal Control Theory," John Wiley &
Sons, Inc., New York, 1967
P. K. C. Wang: Intern. J. Contr., 7:101 (1968)
An excellent survey of Soviet publications in this area is
A. G. Butkovsky, A. I. Egorov, and K. A. Lurie: SIAM J. Contr., 6:437 (1968)
Many pertinent recent papers can be found in the SIAM Journal on Control; IEEE
Transactions on Automatic Control; Automation and Remote Control; and Auto-
matica; in the annual preprints of the Joint Automatic Control Conference; and in the
University of Southern California conference proceedings:

A. V. Balakrishnan and L. W. Neustadt (eds.): "Mathematical Theory of Control,"


Academic Press, Inc., New York, 1967
The following are of particular interest in process applications:

R. Jackson: Proc. Inst. Chem. Engrs. AIChE Joint Meeting, 4:32 (1965)
Intern. J. Contr., 4:127, 585 (1966)
Trans. Inst. Chem. Engrs. (London), 46:T160 (1967)
K. A. Lurie: in G. Leitmann (ed.), "Topics in Optimization," Academic Press, Inc.,
New York, 1967
A minimum principle fyir systems described by integral equations is outlined in the paper
by Butkovskii li4fd above. Results for difference-differential equations, together
with further refeonces, are in
D. H. Chyung: Pr: prints 1967 Joint Autom. Contr. Conf., p. 470,
M. M. Denn and R. Aris: Ind. Eng. Chem. Fundamentals, 4:213 (1965)
M. N. Ogflztoreli: "Time-lag Control Systems," Academic Press, Inc., New York, 1966
H. R. Sebesta and L. G. Clark: Preprints 1967 Joint Autom. Contr. Conf., p. 326
Results due to. Kharatishvili can be found in the book by Pontryagin and coworkers and
in the collection above edited by Balakrishnan and Neustadt.

PROBLEMS
11.1. Obtain the minimum-principle and steep-descent direction used in See. 11.8.
11.2. Obtain necessary conditions and the direction of steep descent for the problem in
Sec. 11.10. What changes result when the objective is maximization of the separation
390 OPTIMIZATION BY VARIATIONAL METHODS

factor,
I u(t)c1(t,l) dt

f u(i)cj(t,0) dt
11.3. Optimal control of a heat exchanger to a new set point by adjustment of wall
temperature is approximated by the equation
ax ax
at + az = P(u - x)
x(0,t) = 0
Obtain the optimal feedback gain as a function of P and C for the objective

E - 2 Iu- [Cx2(l,t) + u2(t)1 dt

Compare with the approximate solution of Koppel, Shih, and Coughanowr,


GFB(z) _ { -C exp (- (2P + PK)(1 - z)] 0<z<1
0 z=1
where K is the solution of a transcendental equation

K =2±K[1 -exp(-2P-PK)]
11.4. The reversible exothermic reaction X = Y in a tubular catalytic reactor is
approximated by the equations
ax,
az
x:rkoe [(1 - yo - (yo + x01 0<z<1
ax,
at
-«ux: 0<r<1
where x, is the extent of reaction, x2 the catalyst efficiency, u the temperature, r the
reactor residence time, and yo the fraction of reaction product in the feed. Boundary
and initial conditions are
x, =0 atz =0forall t
x==1 att -Oforallz
The temperature profile u(z) is to be chosen at each time t to maximize average
conversion
fo,
tP = dt

Obtain all the relevant equations for solution. Carry out a numerical solution if
necessary. Parameters are
rko3X10' Ko-2.3X10-6 yo=0.06
E_ = 10,000 E; - 5,000 « = 4 X 10'6
(Th(s problem is due to Jackson.)
DISTRIBUTED-PARAMETER SYSTEMS

11.5. Discuss the minimum time control of the function x(z,t) to zero by adjusting
surroundings temperature u(t) in the following radiation problem:
ax a'x 0<z<1
at - aZ2 0<t<a
x=0 att=0forall z
-=0
ax
aZ
atz=Oforallt
ax
=k[u4(t) -x'] atz = 1for all t
az
(The problem is due to Uzgiris apd D'Souza.)
11.6. Obtain necessary conditions and the direction of steep descent for a system
described by the difference-differential equations
ii(t) =l,fx(t),:(t - r), u(t), u(t - r)]
x,(t) = x,o(t) -r < t < 0
min E = fo 5[x(t),u(t)] dt
12
Dynamic Programming and
Hamilton-Jacobi Theory

12.1 INTRODUCTION
The major part of this book has been concerned with the application of
classical variational methods to sequential decision making, where the
sequence was either continuous in a timelike variable or, as in Chap. 7,
over discrete stages. Simultaneously with the refinement of these meth-
ods over the past two decades an alternative approach to such problems
has been developed and studied, primarily by Bellman and his coworkers.
Known as dynamic programming, this approach has strong similarities to
variational methods and, indeed, for the types of problems we have
studied often leads to the same set of equations for ultimate solution.
In this brief introduction to dynamic programming we shall first examine
the computational aspects which differ from those previously developed
and then demonstrate the essential equivalence of the two approaches to
sequential optimization for many problems.
DYNAMIC PROGRAMMING AND HAMILTON-JACOBI THEORY 393

1L2 THE PRINCIPLE OF OPTIMALITY AND COMPUTATION


For illustrative purposes let us consider the process shown schematically
in Fig. 12.1. The sequence of decisions u1, u2i . . . , uN is to be made
to minimize some function 8(xN),- where there is a known input-output
relation at each stage, say
x" = f^(x°-',un) n = 1, 2, . . , N .
(1)
The dynamic programming approach is based on the principle of opti-
mality, as formulated by Bellman:
An optimal policy has the property that whatever the initial state and
initial decision are, the remaining decisions must constitute. an opti-
mal policy with regard to the state resulting from the first decision.
That is, having chosen u' apd thug determined x1, the remaining deci-
sions u2, u', .. , uN must be chosen so that C(xN) is minimized for that
particular x'. Similarly, having chosen u', u2, . . , uN-1 and thus .

determined xN-1, the remaining decision uN must be chosen so that


&(xN) is a minimum for that xN-1. The proof of the principle of opti-
mality is clear by contradiction, for if the choice u' happened to be the
optimal first choice and the remaining decisions were not optimal with
respect to that x1, we could always make &(xN) smaller by choosing a
new set of remaining decisions.
The principle of optimality leads immediately to an interesting
computational algorithm. If we suppose that we have somehow deter-
mined xN-1, the choice of the remaining decision : simply involves the
search over all values of.uN to minimize C(xN) or, substituting Eq. (1),
min E[fN(xN-1,uN)1 (2)
UN

Since we do not know what the proper value of xN`1 is, however, we must
do this for all values of xN-' or, more realistically, for a representative
selection of values. We can then tabulate for, each XN-1 the minimizing
value of uN and the corresponding minimum value of E.
We now move back one stage and suppose that we have available

Fig. 12:1 Schematic of a sequential decision process.


391 OPTIMIZATION BY VARIATIONAL METHODS

xN-2, for which we must find the decisions uN-1 and uN that minimize
S(XN). A specification of uN-1 will determine xN-1, and for any given
xN-1 we already have tabulated the optimal uN and value of C. Thus,
we need simply search over UN-1 to find the tabulatedt xN-1 that results
in the minimum value of S. That is, we search over uN-1 only, and not
simultaneously over UN-1 and uN. The dimensionality of the problem is
thus reduced, but again we must carry out this procedure and tabulate
the results for all (representative) values of xN-2, since we do not. know
which value is the optimal one.
We now repeat the process for xN-1, choosing uN-2 by means of the
table for xN-2, etc., until finally we reach x°. Since x° is known, we can
then choose u' by means of the table ford This gives the optimal
value for x', so that u2 can then be found from-Ug table for x2, etc.
In this way the optimal sequence u1, U2, . . .. , UN is cdtp d for a
given x° by means of a sequence of minimizations over a single variable.
Note that we have made no assumptions concerning differentiability of
the functions or bounds on the decisions. Thus, this algorithm can be
used when those outlined in Chap. 9 might be difficult or inapplicable.
We shall comment further on the computational efficiency following an
example.

12.3 OPTIMAL TEMPERATURE SEQUENCES


As an example of the computational procedure we shall again consider
the optimal temperature sequence for consecutive chemical reactions
introduced in Secs. 1.12 and 7.6 and studied computationally in Chaps.
2 and 9. Taking the functions F and G as linear for simplicity, the state
equations have the form
xn-1
xn = (1a)
1 + 0k1oe-E''/u*
yn-1 +
k1oe-E-'/U,(yn-1

+ xn-')
yn = (1 + 8kjoeE,"/°")(1 + 02oe-E:'I"") (1b)

where we wish to choose u', u2, . . . , uN in order to minimize


fi= - yN - pxN (2)

We shall use the values of the parameters as follows:


k1o = 5.4 X 1010 k20 = 4.6 X 1011
E1 = 9 X 10' E_ = 15 X 10'
0=5 p0.3
x°=1 y°=0
t Clearly some interpolation among the tabulated representative values of zN-' will
be required.
DYNAMIC PROGRAMMING AND THEORY 3"

Table 12.1 Optimal decisions and values of the objective at


stage N for various Inputs resulting from stage N - 1

XN-1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
1/N-1

0 340 340 340 340 340 340 340 340 340 340
0.056 0.112 0.168 0.224 0.281 0.337 0.393 0.449 0.505 0.561

330 334 338 340 340 340 340 340 340 340 340
0.1
0.096 0.145 0.199 0.255 0.310 0.367 0.423 0.479 0.535 0.591 0.647
330 330 334 336 338 338 340 340 340 340 340
0.2
0.192 0.240 0.291 0.344 0.399 0.454 0.509 0.565 0.621 0.677 0.733

330 330 332 334 336 338 338 338 338 340 340
0.3
0.288 0.336 0.385 0.436 0.489 0.543 0.598 0.654 0.709 0.764 0.820
330 330 330 332 334 336 336 338 338 338 338
0.4
0.384 0.432 0.481 0.530 0.582 .0.634 0.688 0.742 0.798 0.853 0.908
330 330 330 332 334 336 336 336 338 338
0.5 / 330
0.480 0.528 0.577 0.625 0.675 0.727 0.779 0.833 0.887 0.942 0.997
330 330 330 330 332 334 334 334 336 336 338
0.6
0.576 0.624 0.673 0.721 0.770 0.820 0.872 0.925 0.978 1.032 1.086

330 330 330 330 330 332 332 334 334 336 336
0.7
0.672 0.720 0.769 0.817 0.866 0.913 0.966 1.018 1.070 1.123 1.177
330 330 330 330 330 330 332 334 334 334 336
0.8
0.768 0.816 0.865 0.913 0.962 1.010 1.060 1.111 1.163 1.215 1.268
330 330 330 330 330 330 332 332 332 334 334
0.9
0.864 0.912 0.961 1.009 1.058 1.106 1.155 1.205 1.25. .309 1.361
330 330 330 330 330 330 330 330 332 3341E
1.0
0.960 1.008 1.057 1.105 1.154 1.202 1.250 1.329 1.354 1.11 1.44

Furthermore, we shall restrict u° by the constraints


330 < u^ < 340 (3)

and, to demonstrate a situation where techniques based on the calculus


are not applicable, we shall further restrict each u° to values which are
even integers. That is, we shall restrict the choice to the values 330,
332, 334, 336, 338, and 340.
We begin by considering the last (Nth) stage. For each value of
xN-1, yN-1 we can compute xN and yN by means of Eqs. (1) for the six
possible choices of UN. The particular value of uN which minimizes S'
and the corresponding value of -g is recorded in Table 12.1, where it is
3% OPTIMIZATION BY VARIATIONAL METHODS

assumed that increments of 0.1 in xN-' and yN`' are sufficient for purposes
of interpolation.
Next we consider the penultimate [(N - 1)stj stage. For example,
for xN-2 = 0.7, yN-2 = 0.2, we obtain:

uN-1 xN-1 yN-1 -E

330 0.504 0.380 0.617


332 0.481 0.397 0.621
334 0.455 0.415 0.626
336 0.429 0.430 0.625
338 0.403 0.443 0.624
340 0.375 0.452 0.617

where the values of -S are obtained by linear interpolation in Table 12.1.


Thus, for this pair of values xN-2, yv-2 the optima, UN-' is 334 with a
corresponding value of S of -0.626. In this way Table 12.2 is con-
structed for all values of xN-2, yN-2, where, for simplicity, the range has
now been reduced.
In a similar manner we move on to the second from last stage and
consider values of x'''-3, yN-s, now finding the optimum.by interpolation
in Table 12.2. The only entry we note here is xN-3 = 1, yN-3 = 0, with
UN-2 = 336 and S = -0.706. When we have finally reached the first
stage, we can reconstruct the optimal policy. For example, if N = 1,
we find from Table 12.1 that for x° = 1, y° = 0 the optimum is -0.56
with u' = 340. For N = 2t we begin with Table 12.2, where we find
for x° = 1, y° = 0 the optimum is -0.662 and u' = 340. Equations (1)
then indicate that x' = 0.536, y' = 0.400, and Table 12.1 for the last
(second) stage indicates the optimal decision u2 = 336. For N = 3 our
sole tabular entry recorded above indicates u' = 336 and S = -0.71.
Then, from Eqs. (1), x' = 0.613 and y' = 0.354, and Table 12.2 indicates
that u2 = 334. Applying Eqs. (1) again, x2 = 0.395, y2 = 0.530, and
from Table 12.1 we find u3 = 332. Finally, then, x3 = 0.274, y3 = 0.624.
This process can be continued for N as large as we wish.
We can now evaluate the strong and weak points of this computa-
tional procedure. The existence of constraints increases efficiency, as
does the restriction to discrete values of the decisions. The former is

t For this case (No = 10) we found in Chap. 2 that u' = 338, u2 - 336.5, and
l; - -0.655 when there were no restrictions on u', u2. Since the restricted optimum
cannot be better than the unrestricted, we see some small error resulting from the
interpolation.
DYNAMIC PROGRAMMING AND HAMILTON-JACOBI THEORY 3!7

Table 12.2 Optimal decisions at stage N - 1


and optimal values of the objective for
various Inputs resulting from stage N - 2
ZN-4
0.5 0.6 0.7 0.8 0.9 1.0

0 336 338 338 338 338 340


0.332 0.404 0.460 0.524 0.590 0.662
334 336 338 336 336 338
0.1
0.414 0.475 0.541 0.605 0.672 0.736
334 336 336 334 36 334
0.2
0.509 0.559 0.626 0.696 0.757 0.818

0.3
334 334 336 334 " 336 334
0.555 0.648 0.713 0.776 0.840 0.905
334 334 336 334 334 334
0.4
0.649 0.735 0.799 0.863 0.926 0.992
330 332 334 334 334 334
0.5
0.743 0.826 0.887 0.951 1.014 1.078

not a major limitation on procedures derivable from the calculus, but the
latter effectively eliminates their use. Thus, if such a restriction is
physically meaningful, an algorithm such as this one is essential, while
if the restriction is used simply as a means of obtaining first estimates by
use of a coarse grid, we might expect to obtain first estimates as easily
using steep descent as outlined in Chap. 9. In reducing the N-dimen-
sional search problem to a sequence of one-dimensional searches we have
traded a single difficult problem for a large number of simpler ones, and
indeed, we have simultaneously solved not only the problem of interest
but related problems for a range of initial values. If we are truly inter-
ested in a range of initial values, this is quite useful, but if we care only
about a single initial condition, the additional information is of little use
and seemingly wasteful. Dynamic programming does not. eliminate
the ty o-point boundary-value problem which has occupied us to such a
great extent, but rather solves it by considering a complete range of final
values.
The fact that the optimum is automatically computed as a feedback
policy, depending only upon the present state and number of remaining
decisions, suggests utility in control applications. The storage prob-
lem, however, is a serious restriction, for with two state variables we have
been able to tabulate data in a two-dimensional array, but three variables
Us OPTIMIZATION BY VARIATIONAL METHODS

would require a three-dimensional array, four variables four dimensions,


etc. Fast memory capabilities in present computers effectively limit
direct storage to systems with three state variables, and approximation
techniques must be used for higher-dimensional problems. Thus, the
algorithm resulting from direct application of the principle of optimality
will be of greatest use in systems with many stages but few state variables,
particularly when variables are constrained or limited to discrete values.

12.4 THE HAM I LTON-JACOBI-BELLMAN EQUATION


For analytical purposes it is helpful to develop the mathematical formal-
ism which describes the tabulation procedure used in the previous section.
At the last stage the minimum value of g(xN) depends only upon the
value of zN-1. Thus, we define a function SN of 1N-1 as
SN(zN-1) min E[fN(xN_i,uN)J (1)
UM

which is simply a symbolic representation of the construction of Table


12.1 in the example. Similarly with two stages to go we can define
SN`I(XN-2) = min min C(xN) (2)
UN-1 UN

or, using Eq. (1),


SN-1(EN-2) = mm SN[fN-1(XN_2,uN-1)] (3)
UN-1

which is the symbolic representation of the construction of Table 12.2.


In general, then, we obtain the recursive relation
8"(z"-1) = min S"+'[f"(z"-',u")J (4)
U.

Consistent with this definition we can define -

S'N+1(XN) = g(x") (5)


Equation (4) is a difference equation for the function S",. with the
boundary condition Eq. (5). It is somewhat distinct from the type
of difference equation with which we normally deal, for it contains a
minimization operation, but we can conceive of first carrying 'out the
minimization and then using the minimizing value of u" to convert
Eq. (4) to a classical difference equation in the variable n, with S" depend-
ing explicitly upon x"-'. By its definition the optimum must satisfy
Eq. (4), and furthermore, when we have found the functions S", we have,
by definition, found the optimum. Thus, solution of the difference equa-
tion (4) with boundary-condition equation (5) is a necessary and suffi-
cient condition for an optimum. We shall call Eq. (4) a Hamilton-Jacobi-
DYNAMIC PROGRAMMING AND HAMILTON-JACOBI THEORY 3!!

Bellman equation, for it is closely related to the Hamilton-Jacobi equa-


tion of mechanics.
If we assume that the functions S" are differentiable with respect to
x»-I and, further, that fn is differentiable with respect to both x"-I and u",
we can relate the Hamilton-Jacobi theory to the results of Chap. 7. For
simplicity we shall temporarily presume that u" is unconstrained. Then,
since S"+' depends explicitly upon xn, the. minimum of the right-hand side
of Eq. (4) occurs at the solution of
asn+l 0 (6)
au" ax" au"
The value of u" satisfying this equation, un, is, of course, a function of the
particular value of x"-', and Eq. (4) can now be written as
Sn(xn-1) = Sn+l[f"(x"-l,u")1 (7)
Equation (7) is an identity, valid for all x"-'. The partial deriva-
tives in Eq. (6) can then be evaluated by differentiating both sides of
Eq. (7) with respect to each component of x"-', giving
aS" aSn+I
a fn asn+lOf n au"au"

ax;"-1 - [, ax;. 8x;"-1 Z { ax;" ) ax;"-'


where the second term is required because of the dependence of u" on
x"-1. From Eq. (6), however, the quantity in parenthesis vanishes, and
as" v a,3"+1 af;" (9)
ax," ax"-'
From Eq. (5), if there are no relations among the components of z' ,
aSN+1 ar, (10)
ax;N ax/'
Now, it is convenient to define a vector y" as
aS"+1
y," = (11)
ax ^

that is, as the rate of change of the optimum with respect to the state
at the nth stage. Then Eqs. (9) and (10) may be written
,y,n-I Yin af;" (12)
(( ax'n-1

aS
?iN o (13)
axiN
400 OPTIMIZATION BY VARIATIONAL METHODS

and Eq. (6) for the optimum u^


y"af,"=0
(14)
aul

These are the equations for the weak minimum principle derived in
Sec. 7.4, and they relate the Green's functions to the sensitivity interpre-
tation of the Lagrange multiplier in Sec. 1.15.
Even if u" is bounded, the minimization indicated in Eq. (4) can
be carried out and the minimizing value an found as a function of z"-'.
Equation (8) is still valid, and if the optimum occurs at an interior value,
the quantity in parentheses will vanish. If, on the other hand, the
optimal u" lies at a constraint, small variations in the stage input will
not cause any change and au"/ax;"'' will be zero. Thus, Eqs. (9) and
(10), or equivalently (12) and (13), are unchanged by constraints.
The minimum in Eq. (4) at a bound is defined by
()S"+' _ aS"+' af;^ >0 u" at lower bound (15a)
au" -L ti
ax,." au" <0 u" at upper bound
or
aS^+1
ax;
f ;^ y;"f" =min u^ at bound (15b)

Thus we have the complete weak minimum principle. We must reiter-


ate, however, that while the Hamilton-Jacobi-Bellman equation is both
necessary and sufficient, the operations following Eq. (5) merely define
conditions that the solution of Eq. (4) must satisfy and are therefore only
necessary.

12.5 A SOLUTION OF THE HAMILTON-JACOBI-BELLMAN EQUATION


It is sometimes possible to construct a solution to the Hamilton-Jacobi-
Bellman equation directly, thus obtaining a result known to be sufficient
for an optimum. We can demonstrate. this procedure for the case of a
system described by the linear separable equations

x;" _ A;i"x,"-' + b;"(u") (1)

where we wish to minimize a linear function of the final state

(2)

The Hamilton-Jacobi-Bellman equation, Eqs. (4) and (5) of the


DYNAMIC PROGRAMMING AND HAMILTON-JACOBI THEORY 401

preceding section, is
Sn(x"-1) = min S"+I[f"(x"-Ir"u")] (3)
U.

SN+1(xN) _ & _
I CjxjN
(4)

The linearity of f" and 8 suggest a linear solution


yn-1xj"-1 + t"
Sn(x"-1)
=I (5)

where _y n-1 and t" are to be determined. Substituting into Eq. (3), then,

yjn--1xj"-1 + c" = min 11 nxjn-1 + yinbi"(u") + V +I].JJ (6)


..II U. y,nAj LL....ll
J i,j i
or, since only one term on the right depends upon u",

j
(yjn-1
- i
yinA,jn) xi"-I = min [G yi"bin(u")J + n+t - t n (7)
u
i

The left-hand side depends upon x"-1 and the right does not, so that the
solution must satisfy the difference equation
yj"-I
yinA ,n (8)

with the optimal u" chosen by


min ytinbi"(u") (9)
U.

The variable Z" is computed from the recursive relation


Zn = n+1 + min yi"bin(u") (10)
u i

Comparison of Eqs. (4) and (5) provides the boundary conditions


yiN = ci (11)
tN+1 = 0 (12)

Equations (8), (9), and (11) are, of course, the strong. form of the
minimum principle, which we established in Sec. 7.8 as both necessary
and sufficient for the linear separable system with linear objective. We
have not required here the differentiability of b" assumed in the earlier
proof. This special situation is of interest in that the optimal policy is
completely independent of the state as a consequence of the uncoupling
of multiplier and state equations, so that Eqs. (8), (9), and (11) may be
solved once and for all for any value of x" and an optimum defined only
in terms of the number of stages remaining.
402 OPTIMIZATION BY VARIATIONAL METHODS

12.6 THE CONTINUOUS HAMILTON-JACOBI-BELLMAN EQUATION


The analytical aspects of dynamic programming and the resulting
Hamilton-Jacobi theory for continuous processes described by ordinary
differential equations are most easily obtained by applying a careful
limiting process to the definitions of Sec. 12.4. We shall take the sys-
tem to be described by the equations
ti = fi(x,u) 0<t<6 (1)
and the objective as 8[x(6)]. If we divide 0 into N increments of length
At and denote x(n At) as xn and u(n At) as un, then a first-order approxi-
mation to Eq. (1) is
zi" = zin-1 + Mx"-``',u") At + o(At) (2)
Then Eq. (4) of Sec. 124, the discrete Hamilton-Jacobi-Bellman equa-
tion, becomes
S"(x") = min Sn+1[x"-' + f(x"-',u") At + o(At)] (3)
U.

or, writing S as an explicit function of the time variable n At


S"(x"-1) = S(x"-1, n At)
we have
S(x"-1, n At) = min S[x"-' + f(x"-',u") At + o(At), n AL + Qt] (4)
u+

For sufficiently small At we can expand S, in the right-hand side of


Eq. (4) about its-4alue at xn-1, n At as follows:

+
"
S[x"-' + f At + o(At), n At + At] = S(x"-1, n At)

ax fi At +
1 a(n At)
At + o(At) (5)

or

S(x"-', n At) = min [S(xut_1, n At) +


as fi At
4. I az ,
-Al + o(At)] (6)
+ a(n At)
Since S(x"-1, n At) does not'depend upon un, it is not involved in the
minimization and can be canceled between the two sides of the equation.
Thus,
0 = min
s , fi At + At + 0 (At) ] (7)
ax a (n At)

As At gets very small, the distinction between x"-' and x", u"-' and u"
gets correspondingly small and both x"-1 and x" approach x(t) and simi-
DYNAMIC PROGRAMMING AND HAMILTON-JACOBI THEORY 403

larly for u^. Again letting t = n At, dividing by At, and taking the limit
as At -. 0, we obtain the Hamilton-Jacobi-Bellman partial differential
equation for the optimum objective S(x,t)
aSfi(x,u) asl
0 = min U(t) LL. ax,
+ at J (8)

with boundary condition from Eq. (5) of Sec. 12.4


S[x(B),B] = 8[0)J (9)
The partial differential equation can be written in classical form as
as
fr(x,u) + at = 0 (10)

where u is an explicit function of x obtained from the operation


as
tiax; f;(x,u)
m.n (11)

When u is unconstrained or u lies inside the bounds, this minimum is


found from the solution of the equation
as af; _0
(12)
T, au -
Equations (9) to (11) form the basis of a means of computation of
the optimum, and in the next section we shall demonstrate a direct solu-
tion of the Hamilton-Jacobi-Bellman equation. Here we shall perform
some manipulations analogous to those in Sec. 12.4 to show how the
minimum principle can be deduced by means of dynamic programming.
The reader familiar with the properties of hyperbolic partial differential
equations will recognize that we are following a rather circuitous path to
arrive at the characteristic ordinary differential equations for Eq. (10).
First, we take the partial derivative of Eq. (10) with respect to the
jth component z; of x. Since S depends explicitly only upon x and t and
u depends explicitly upon x through Eq. (11) we obtain
j` 2
f as af; au a2s
(13)
Lax ax;f`+L, axa;+LOx,Ouaz;+ax;at=0

The third term vanishes, because of Eq. (12) if u is interior or because a


small change in x cannot move u from a constraint when at a bound,
in which case su/ax; = 0. Thus,
a2S as af; a2S
(14)
axax;f`+Gaxax;;+ax;at = o
404 OPTIMIZATION BY VARIATIONAL METHODS

Next, we take the total time derivative of aS/ax;


d as a2S a2S
at ax; ax; ax; f` + (15)
at ax;

Assuming that the order of taking partial derivatives can be interchanged,


we substitute Eq. (15) into Eq. (14) to obtain
das aSafi
(16)
dt ax; ax; ax;

or, defining
__ as
axi
ys (17)

Eq. (16) m ay be w ritten


afi
Yi - ax (18)

while Eq. ( 11) becomes


min Yifi (19)
ti

If 0 is not specified, it is readily established that S is independent of t and


Eq. (10) is

7ifi(a,u) = 0 (20)

Equations (18) to (20) are, of course, the strong minimum principle for
this system, and Eq. (17) establishes the Green's function y as a sensi-
tivity variable.
It is apparent from these- results and those of Sec. 12.4 that if we
so wished, we could derive the results of the preceding chapters from a
dynamic programming point of view. We have not followed this course
for several reasons. First, the variational approach which we have gen-
erally adopted is closer to the usual engineering experience of successive
estimation of solutions. Second, computational procedures are more
naturally and easily obtained within the same mathematical framework
by variational methods. Finally (and though much emphasized in the
literature, of lesser importance), the dynamic programming derivation
of the minimum principle is not internally consistent, for it must be
assumed in the derivation that the partial derivatives aS/axi are con-
tinuous in x when in fact the solution then obtained for even the ele-
mentary minimum-time problem of Sec. 5.3 has derivatives which are
discontinuous at the switching surface.
DYNAMIC PROGRAMMING AND THEORY 405

12.7 THE LINEAR REGULATOR PROBLEM


To complete this brief sketch of the dynamic programming approach to
process optimization it is helpful to examine a direct solution to the con-
tinuous Hamilton-Jacobi-Bellman equation. This can be demonstrated
simply for the linear regulator problem, a special case of the control prob-
lem studied in Sec. 8.2. The system is described by the linear equations

A;ixi + b,u i = 1, 2, . . . ,s (1)


i-1
with the quadratic objective
.
8
2
ff r
(,1J-1 x1C;ixi + u2) dt (2)

or, defining
1 1
i.+1 = x;C;ixi + u2 x,+1(0) = 0 (3)
2
we have
C[a(e)) = x,+1(e) (4)

The control variable u is assumed to be unconstrained.


The Hamilton-Jacobi-Bellman partial differential equation is

0 = min
+1 asl; asl
u axi + at) (5)
i-1
or, from Eqs. (1) and (3),

0 = min
' as as 1 as
A;ixt + i-1
L, ax; b'u + 2 ax.+ 1 x;C,ixi
Y axi

i,7-1
1 as 2 Ls) (6)
+ 2 ax__ u + at

From the minimization we obtain


as )-1; ' as bi
u = _ ( ax,+1 (7)
axi
11

so that, upon substitution, Eq. (6) becomes a classical but nonlinear


partial differential equation
as i 1 ' as
b,
as
b,
id-1
axi A`ixi - 2 Gas as 1) \ i-1 Ti \i-1 axi /
1 as as
xicoxi + at = 0 (8)
+ id-1
IOi OPTIMIZATION BY VARIATIONAL METHODS

with boundary condition


S[x(8),8] = &[x(8)] = x,+1(8) (9)

The function S(x,t) satisfying Eqs. (8) and (9) will be of the form

8(x'0 = x,+1 + i71 M;;(0) = 0 (10)


i4L"- I
G

in which case

k = 1, 2,
C'xk 2 \i-1 x,Mik + ,-1 Mt;x;) ,s (11)

Equation (8) then becomes

2 xix1(MikAkj + AkiMkf)
i,,.k-I

2
+ x;x; (Mik + Mki)bk] 12 b1(Mii + M11)]
+.i. -I L

+ x;C;;x; + x;lll;;x; = 0 (12)

or, since it must hold for all x,

I (MikAk, + Ak;Mk,) - I [2 (Mik + Mk;)bk] [2 b1(Ml, + M;1)]


+ C;; + M;; = 0 M;;(8) = 0 (13)
The solution to Eq. (13) is clearly symmetric, Mij = M;i, so that we may
write it finally in the form of the Riccati equation (10) of Sec. 8.2

Mi, + I (MikAk, + Mk;Aki)


k
- Q Mikbk) (
k I
btM1j)

+ Ci, = 0 Mi;(8) = 0 (14)

The optimal feedback control is then found directly from Eqs. (7) and
(10) as
,

u = - I b;M,,x, (15)
i.,-1
Since this result was obtained in the form of a solution to the Hamilton-
Jacobi-Bellman equation, we know that it is sufficient for a minimum,
a result also established by other means in Sec. 6.20.
DYNAMIC PROGRAMMING AND THEORY 407

BIBLIOGRAPHICAL NOTES
We shall not attempt a survey of the extensive periodical literature on dynamic program-
ming but content ourselves with citing several texts. Excellent introductions may be
found in
R. Aris: "Discrete Dynamic Programming," Blaisdell Publishing Ccmpany, Wal-
tham, Mass., 1964
R. E. Bellman and S. E. Dreyfus: "Applied Dynamic Programming," Princeton Uni-
versity Press, Princeton, N.J., 1962
G. Hadley: "Nonlinear and Dynamic Programming," Addison-Wesley Publishing
Company, Inc., Reading, Mass., 1964
G. L. Nemhauser: "Introduction to Dynamic Programming," John Wiley & Sons,
Inc., New York, 1966
Problems of the type considered in Sec. 12.6 are treated in
S. E. Dreyfus: "Dynamic Programming and the Calculus of Variations," Academic
Press, Inc., New York, 1965
and numerous topics of fundamental interest are treated in the original book on the subject:
R. Bellman: "Dynamic Programming," Princeton University Press, Princeton, N.J.,
1957
Extensive applications are examined in
R. Aris: "The Optimal Design of Chemical Reactors: A Study in Dynamic Program-
ming," Academic Press, Inc., New York, 1961
R. Bellman: "Adaptive Control Processes: A Guided Tour," Princeton University
Press, Princeton, N.J., 1961
S. M. Roberts: "Dynamic Programming in Chemical Engineering and Process Con-
trol," Academic Press, Inc., New York, 1964
D. F. Rudd and C. C. Watson: "Strategy of Process Engineering," John Wiley &
Sons Inc., New York, 1968
J. Tou: "Optimum Design of Digital Control via Dynamic Programming," Academic
Press, Inc., New York, 1963

PROBLEMS
12.1. Develop the Hamilton-Jacobi-Bellman equation for the system
z" = f^(z"-' u")
N
min s = E 61"(x"-',u") + F(1N)
n-1
Apply this directly to the linear one-dimensional case
x" = A(u^)x"-' + 14(A(u") - 11
N
t = a(xN - x°) + E M(u")
n-1

and establish that the optimal decision u" is identical at each stage.
OPTIMIZATION BY VARIATIONAL METHODS

12.2. Formulate Prob. 7.2 for direct solution by dynamic programming. Draw a
complete logical flow diagram and if a computer is available, solve and compare
the effort to previous methods used. Suppose holding times u," are restricted to
certain discrete values?
12.3. The system
x=u lul <1
is to be taken from initial conditions x(O) = 1, i(0) = 1 to the origin to minimize

E - 5(x,i,u) dt
0

Show how the optimum can be computed by direct application of the dynamic pro-
gramming approach for discrete systems. (Hint: Write
ij - x: x1(t - A) = xi(t) - x3(t)A
is = u x:(t - A) = X2(t) - u(t)A
and obtain the Hamilton-Jacobi-Bellman difference equation for recursive calculation.)
Assume that u can take on only nine evenly spaced values and obtain the solution for
(a)5
(b) if = %(xs= + u=)
(c) if - 3Vx' + V)
Compare the value of the objective with the exact solution for continuously variable
u(t) obtained with the minimum principle.
1L4. Using the dynamic programming approach, derive the minimum principle for
the distributed system in Sec. 11.4.
Indexes
Name Index

Akhiezer, N., 96 Carroll, F. J., 323


Aleksandrov, A. G., 268 Cauchy, A., 52, 69
Ames, W. F., 2, 97 Chang, S. S. L., 244, 266
Amundson, N. R., 129, 132, 243 Chou, A., 224
Aris, R., 39, 96, 97, 131, 132, 165, 170, Chyung, D. H., 389
171, 223-225, 243, 244, 322, 324, Clark, L. G., 389
357, 358, 389, 407 Coddington, E. A., 222, 224
Athans, M., 131, 132, 169-171, 223, 267 Collatz, L., 97
Connolly, T. W., 2
Coste, J., 243
Bailey, R. E., 225 Coughanowr, D. R., 39, 96, 131, 266,
Balakrishnan, A. V., 322, 389 388
Bankoff, S. G., 325 Courant, R., 39, 96, 323
Bass, R. W., 170 Coward, I., 170
Battin, R. H., 223, 323 Cunningham, W. J., 2
Beightler, C. S., 38, 39, 69, 358
Bekey, G. A., 243
Bellman, R. E., 2, 39, 62, 68, 169, 220,
Dantzig, G. B., 70
225, 322, 323, 392, 407
Das, P., 96, 268
Bendelius, A. R., 358, 388 Denbigh, D. G., 96, 97
Berkovitz, L. D., 223 Denham, W. IF., 225, 323, 324
Bertram, J. E., 268 Denn, M. M., 39, 130-132, 169, 170,
Beveridge, G. S. G., 357 223, 224, 243, 244, 268, 269, 322,
Bilous, 0., 129, 132 324, 357, 387-389
Blakemore, N., 171, 225 Qesoer, C. A., 223, 243
Blaquiere, A., 223
DiBella, C. W., 69
Bliss, G. A., 96, 223, 322
Douglas, J., 97, 268
Blum, E. D., 357 Douglas, J. M., 130, 131, 150, 169, 170,
Bohn, E."V., 322
224, 267, 324, 358
Boltyanskii, V., 131, 169, 223 Dreyfus, S. E., 39, 62, 68, 224, 225, 407
Bolza, 0., 96, 97, 268 Duffin, R. J., 39
Box, M. J., 61, 70 Dyer, P., 325
Bram, J., 69
Breakwell, J. V., 322
Brosilow, C. D., 268
Bryson, A. E., 225, 322-325 Eben, C. D., 39
Buck, R. C., 223 Edelbaum, T. N., 38
Buckley, P. S., 266 Egorov, A. I., 389
Bushaw, D. W., 169 Englar, T. S., 268, 325
Butkovskii, A. G., 387, 389 Enns, M., 70
412 OPTIMIZATION BY VARIATIONAL METHODS

Fadden, R. J., 322 Holtzman, J. M., 244


Falb, P., 131, 132, 169, 170, 223, 267 Horn, F. J. M., 39, 132, 199, 224, 244,
Fan, L. T., 30, 39, 244, 357 323, 324, 351, 357
Ferron, J. R., 39, 388 Hsia, T. C., 170, 268
Fiacco, A. V., 40
Fine, F. A., 325
Finlayson, B. A., 97 Isaacs, D., 324.
Fletcher, R., 69
Fliigge-Lotz, I., 170, 256, 268
Fort, T., 40, 243 Jackson, R., 39, 170, 244, 357, 389
Franks, R. G. E., 2 Javinsky, M. A:, 170
Fredrickson, A. G., 225 Jazwinsky, A. H., 322
Froberg, C. E., 68 Jeffreys, G. V., 40
Fuller, A. T., 171, 268
Jenson, V: G., 40
Funk, P., 97, 268 Johansen, D. E., 324
Johnson, C. D., 162, 170, 225
Jurovics, S. A., 322
Gamkrelidze, R., 131, 169, 223
Gass, S. I., 70
Gilbert, E. G., 322 Kadlec, R. H., 170
Glicksberg, 1., 169 Kalaba, R. E., 2, 322, 323
Glicksman, A. J., 66, 70 Kalman, R. E., 39, 96, 131, 132, 224,
Goh, B. S., 225 267, 268, 323, 325
Goldfarb, D., 69 Kantorovich, L. V., 97
Goldstein, H., 131 Katz, S., 244, 268, 324, 389
Goodman, T. R., 322 Kelley, H. J., 40, 225, 323, 324
Gottlieb, R. G., 324 Kenneth, P., 323
Gray, R. D., Jr., 388 Kgepck, R., 269
Greenly, R. R., 322 Kopp, R. E., 225, 321, 322, 324
Griffith, R. E., 69 Koppel, L., 39, 96, 131,- 266, 268, 388
Gross, 0., 169 Krelle, W., 39
Gruver, W. A., 267 Kruskel, M., 39
Krylov, V. I., 97
Kunzi, H. P., 39
Hadley, G., 38, 39, 69, 70, 407
Halkin, H., 224, 244
Hancock, H., 38 Lack, G. N. T., 70
Handley, K. R., 268 Lance, G. N., 322
Harvey, C. A., 225 Landau, L. D., 131
Hestenes, M., 224 Laning, J. H., Jr., 223, 323
Hext, G. R., 70 Lapidus, L.,. 69, 70, 267-269, 321,
Hilbert, D., 96 LaSalle, J. P., 169, 170, 269
Hildebrand, F. B., 68, 97, 268 Lasdon, L. S., 324
Himsworth, F. R., 70 Lee, E. B., 170, 224, 389
Hoge, A. R., 388 Lee, E. S., 132, 292, 323, 324
NAME INDEX 413

Lee, I., 323 Obermayer, R. W., 268


Lefschetz, S., 269 O'Conner, G. E., 132, 267
Leitmann, G., 224 Oguztoreli, M. N., 389
Leondes, C. T., 323, 324 Oldenburger, R., 169
Lesser, H. A., 70
Levenspiel, 0., 224
Levinson, N., 222, 224 Paiewonsky, B., 171, 225, 322
Lewallen, J. M., 322 Paine, G., 323
Lifshitz, L. D., 131 Paradis, W. 0., 256, 267
Lin, R. C., 199, 224, 351, 357 Pars, L. A., 96
Lindorff, D. P., 268 Partain, C. L., 225
Luh, J. Y. S., 225 Paynter, J. D., 357
Lurie, K. A., 389 Perlmutter, D. D., 131, 256, 266, 267
Luus, R., 267, 268, 321 Peterson, E. L., 39
Pigford, R. L., 40
Pinkham, G., 322
McCausland, I., 387 Pollock, A. W., 202, 224
McCormick, G. P., 40 Pontryagin, L. S., 131, 169, 170, 223,
McGill, R., 322, 323 389
McIntyre, J. E., 225, 322 Powell, M. J. D., 69
McReynolds, S. R., 325 Puri, N. N., 267
Markland, C. A., 325
Markus, L., 170, 224, 389
Marshall, W. R., Jr., 40 Rabinowitz, P., 70
Mayne, D., 325 Ray, W. H., 97, 224
Megee, R. D., III, 324 Reed, C. E., 40
Merriam, C. W., 131, 132, 266, 324 Rekazius, Z. V., 170, 268
Meyer, F., 323 Rice, A. W., 358, 388, 389
Mickley, H. S., 40 Rice, R. K., 324
Miele, A., 225 Rippin, D. W. T., 224, 358
Mikami, K., 323 Roberts, S. M., 268, 269, 407
Millman, M. G., 268 Robinson, J. D., 388
Minorsky, N., 2 Rogers, A. E., 2
Mishchenko, E., 131, 169, 223 Rolke, R. W., 388
Mitten, L. G., 358 Rosenbrock, H. H., 69, 322
Hitter, S. K., 325 Rozenoer, L. I., 131, 244
Moser, J., 39 Rubin, H., 39
Moyer, H. G., 225, 321, 322, 324 Rudd, D. F., 2, 243, 357, 358, 407
Muckler, F. A., 268

Saaty, T. L., 69
Nemhauser, G. L., 358, 407 Sakawa, Y., 76, 97, 387
Neustadt, L. W., 224, 322, 389 Sarrus, F., 69
Newman, A. K., 268 Scharmack, D. K., 322
Nieman, R. A., 324 Schechter, R. S., 96, 357
Noton, A. R. M., 132, 267, 322, 325 Schley, C. H., 323
OPTIMIZATION BY VARIATIONAL METHODS
414

Scriven, L. E., 97 Troltenier, U., 323, 324


Sebesta, H. R., 389 Truxal, J. G., 131, 266
Shafran, J. S., 225 Tsai, M. J., 357
Sherwood, T. K., 40 Tsuchiya, H. M., 225
Shih, Y. P., 388 Tuel, W. G., Jr., 269
Siebenthal, C. D., 165, 170, 171, 225
Smith, B. D., 243
Spendley, W., 70 Valentine, F. A., 223, 225
Speyer, J. L., 322
Stancir, R. T., 324 Wagle, A. K., 388
Stevens, W. F., 69, 269 Wang, C. S., 30, 39, 244, 357
Stewart, G. W., III, 69 Wang, P. K. C., 388, 389
Stewart, R. A., 69 Wanniger, L. A., 269
Storey, C., 69, 322 Waren, A. D., 324
Sutherland, J. W., 322 Warga, J., 224
Swanson, C. H., 225 Watson, C. C., 2, 358, 407
Sweed, N. H., 388 Webber, R. F., 170
Sylvester, R. J., 323 Whalen, B. H., 70
Szepe, S., 224 Wiberg, D. M., 388
Wilde, D. J., 38, 39, 69, 358
Wilhelm, R. H., 358, 381, 388
Tapley, B. D., 322 Wonham, W. M., 161, 170
Thau, F. E., 262, 268
Tompkins, C. B., 69
Torng, H. C., 70 Zadeh, L. A., 70, 223, 243
Tou, J. T., 269, 407 Zangwill, W. I., 69
Tracz, G. S., 171 Zener, C., 39
Subject Index

Absolute minimum (see Global Chemical reactor (see Batch reactor;


minimum) Continuous-flow stirred-tank
Action integral, 109 reactor; Tubular reactor)
Adiabatic bed, 43 Classical mechanics, 109
Adjoint, 177, 223 Complex method, 61, 279
(See also Green's function; Condensor, 42
Lagrange multiplier) Constraint, 6, 18, 34
Algebraic equations, 5, 45 on decisions, 135, 181, 231
Approximate solution, 59, 88, 117, on state variables, 181, 209, 211,
164, 287, 317 231, 242, 308
Autocatalytic reaction, 358 Continuity, 5, 212, 214, 221
Continuous-flow stirred-tank reactor,
27, 30, 41, 46, 55, 111, 144, 150,
Bang-bang control, 140, 145, 151, 163, 199, 234, 304, 337, 351
153, 255, 258, 370 Control, 1, 10, 38, 212, 247, 266
Batch reactor, 40, 82, 128, 165, 212, (See also Feedback control;
274 Feedforward control; Time-
Bottleneck problem, 217, 225 optimal control)
Bound (see Constraint) Control variation (see Variation)
Boundary condition (see Convergence, 54, 57, 61, 64, 68, 69,
Transversality condition) 277-279, 299, 302, 305, 307, 311,
Boundary iteration, 272, 278, 321 312, 319, 321, 343, 347, 353, 367,
Boundary-value problem (see 374
Two-point boundary-value Cooling rate, 165, 212, 371
problem) Corner condition, 109
Brachistochrone, 77 Cost-of-control term, 117, 124, 132,
Bypass, 326 159, 250
Curse of dimensionality, 272

Calculus of variations:
isoperimetric problem, 84, 297, 309 Data fitting, 41
"simplest" problem, 73, 108 Decomposition, 355
(See also Euler equation) Definite function, 9, 21, 43, 259
Canonical equations, 106, 136, 184, Design, 1, 2, 24, 128, 130, 165, 235,
232, 378 300, 304, 371, 376, 386
Catalyst, 173, 224, 390 Difference-differential equation, 389
-Catalytic reformer, 202 Difference equation, 36, 64, 228, 236,
Chatter, 256 262, 398
Chemical reaction, 27, 40, 41, 98, Differential calculus, 4, 34
111, 199, 202, 329, 337, 351, 371 Differential equation, 176, 221
415 '
416 OPTIMIZATION BY VARIATIONAL METHODS

Diffusion, 98, 359, 371 Geometry, 21, 53, 54, 296


Direct methods, 295, 337 Global minimum, 6, 197
Discrete system (see Staged system) (See also Necessary conditions;
Discrete variable, 11, 13, 15, 23, 36, Sufficient conditions)
73, 262, 395 Goddard's problem, 226
Disjoint policy, 82 Golden section, 64
Distance in decision space, 54, 68, 296 Gradient, 54, 59, 66
Distillation, 132 (See also Steel) descent)
Distributed-parameter system, 92, Gradient projection, 308
359, 389 Green's function, 176, 177, 181, 210,
Disturbance, 121, 124, 248, 379, 381 214, 229, 273, 296, 308, 328, 335,
Drag, minimum, 98 361, 372, 378, 384, 387, 400
Dynamic programming, 217, 392 (See also Adjoint; Lagrange
computation by, 393 multiplier)
and the minimum principle, 40Y), 404 Green's identity, 177, 181, 230, 273,
296, 308, 328, 335

Eigenvalue, estimation of, 90


Equivalent formulation (see Objective
Hamilton-Jacobi-Bellman equation,
f unction)
398, 400, 402
Euler equation, 27, 28, 30, 39, 75, 85.
Hamiltonian, 106, 109, 136, 184, 232,
93, 109, 127, 283, 297, 309
297, 316, 328, 335, 378
Existence, 224 Harmonic oscillator, 153
Extractor, 30, 40, 337 Heat conduction, 93, 364, 366
Extremal, 77 Heat exchanger, 30, 376, 378, 390
Heavy-water plant, 71
Hessian, 9, 21, 46, 57
Feedback control, 11, 17, 80, 114,
Hierarchy of optimization
120, 123, 125, 132, 142, 147, 160, problems, 165, 214
247, 249, 251, 255, 264, 319, 380,
406
Feedforward control, 123, 247, 249, 380
Fibonacci search, 49, 62, 68, 355 Index of performance (see
Final time unspecified, 104, 105, 184, Objective function)
364 Indirect methods, 293
Finite variation, 192, 362 Instantaneously optimal control,
First variation, 181, 231 254, 264
Fredholm equation, 95, 365 (See also Disjoint policy;
Fuel-optimal control, 159 Inverse problem)
Function iteration, 283, 288, 295, 321 Integral control (see Reset mode)
Functional, 74 Integrating factor, 176
Fundamental matrix, 177 Intermediate solution, 139, 145, 151,
153, 157, 160, 213, 365, 369
Interval of uncertainty, 50, 62
Galerkin's method, 88, 97, 385 Inverse matrix, 46
Geometric programming, 39, 42 Inverse problem, 30, 86, 97, 205, 257
SUBJECT INDEX 417

Jump condition, 212, 214 Minimum principle:


strong, 194, 197, 223, 237, 238,
312, 329, 350, 401, 404
Kuhn-Tucker theorem, 39 weak, 107, 138, 186, 232, 328, 337,
400
Minimum-time control (see Time-
Lagrange form, 187 optimal control)
Lagrange multiplier, 18, 22, 23, 25, Mixing, 215, 327, 334, 357
32, 85, 103, 234, 263, 297, 309. Modal analysis, 387, 388
333, 400 Model, 2, 10, 27, 29, 110, 131, 203,
(See also Adjoint; Green's function) 333, 371, 382
Lagrange multiplier rule, 20, 233, 332 Multipliers (see Adjoint; Green's
Lagrangian, 20, 202 function; Lagrange multiplier)
Laminar flow, 99 Multistage system (see Staged system)
Legendre condition, 207
Liapunov function (see Stability)
Linear programming, 65, 70, 220, 356 Necessary conditions, 5, 10, 20, 76,
Linear-quadratic problem (see 85, 106, 138, 186, 194, 207, 232,
Quadratic objective) 240, 241, 311, 363, 378, 398
Lipschitz condition, 222 (See also Lagrange multiplier rule;
Local minimum, 6, 197 Minimum principle)
(Sec also Necessary conditions; Negative feedback (see Stability)
Sufficient conditions) Newton-Raphson iteration, 3, 45, 57,
68, 273, 283, 288, 312, 314, 321,
350
MAP, 68, 69 Nonlinear programming, 21, 39, 68
Maximum conversion (see Nonlinear system, optimal control of,
Optimal yield) 118, 131, 150, 163, 170, 254
Maximum principle (see Nuclear reactor, 134
Minimum principle) Numerical analysis, 70, 298, 367,
Mayer form, 187 373, 376
Metric (see Weighting function)
min 11, 311, 314, 321
Minimum (see Global minimum; Objective function, 10, 29, 101, 120,
Local minimum; Necessary 125, 159, 169, 181, 187, 203, 247,
conditions; Sufficient conditions) 250, 258, 268, 331, 349, 384
Minimum fuel-plus-time control, 155 One-dimensional process, 29, 30, 241
Minimum integral square-error Operations research, 171
criterion (see Quadratic objective) Optimal yield, 28, 46, 84, 125, 128,
Minimum principle: 130, 275, 278, 351, 374
for complex systems, 328, 337 Orthogonal function, 41
for continuous systems, 107, 138,
186, 194, 304, 328, 337
for distributed-parameter systems, Parametric pumping, 381
363, 372, 378 Particular variation (see Special
for staged systems, 232, 237, 400 variation)
418 OPTIMIZATION BY VARIATIONAL METHODS

Penalty function, 34, 124, 224, 248, Second-order variational equations,


277,306 194, 238, 314, 316
Periodic process, 348, 381 Second variation, 315, 321
(See also Steady state, Self-adjoint, 86, 226
optimality of) Sensitivity variable, 33, 34, 400
Perturbation, 2, 3 (See also Adjoint; Green's'function;
equations (see Variational, equation) Lagrange multiplier)
(See also Variation) Servomechanism problem, 248
Picard iteration, 194, 238 Signum function, 140
Pipeline reactor (see Tubular reactor) Simplex method:
Pontryagin's maximum principle of linear programming, 66
(see Minimum principle) of steep descent, 61
Pressure profile, 130, 171, 278, 290, Simulation, 2, 350
299, 319 Singular solution, 160, 164, 207, 213,
Principle of optimality, 393 260, 324, 365, 369, 370, 373
Production scheduling, 134 Special variation, 8, 19, 75, 105, 137,
Proportional control, 12, 13, 17, 23, 195, 208, 362
80, 117, 125 Stability, 12-14, 163, 252, 259, 265,
Pulmonary ventilation, control of, 295, 2
134 Staged system, 24, 228, 393
Steady state:
control of, 10, 112, 144, 152
optimality of, 199, 350
Quadratic objective, 17, 21, 79, 115,
Steep descent, 52, 57, 66, 69, 278,
117, 121, 124, 159, 163, 248, 263, 295, 308, 311, 321, 337, 341, 350,
364, 378, 405 365, 367, 370, 373, 384
Quasi linearization, 3, 289 Step size, 53, 57, 301
Stopping condition (see Final time
unspecified)
Reaction (see Chemical reaction) Strong minimum principle (see
Reactor (see Batch reactor; Minimum principle)
Continuous-flow stirred-tank Structure, 326, 333, 339, 355
reactor; Nuclear reactor; Sufficient conditions, 10, 220, 241,
Tubular reactor) 398, 406
Recycle, 326, 327, 329, 337 Switching surface, 142, 147, 151, 154,
Reformer, 202 157, 162, 168, 191, 255, 258
Regularity, 20, 188
Regulator problem (see Quadratic
objective) Taylor series, 7, 11, 19, 45, 102, 112,
Relay control (see Bang-bang 181, 193, 208, 273, 286, 289
control) Temperature profile, 27, 46, 55, 84,
Reset mode, 125 128, 171, 197, 234, 274, 301, 304,
(See also Three-mode control) 312, 337, 394
Riccati equation, 17, 24, 80, 117, 122, Terminal condition (see Tranaversality
249, 264, 318, 321, 380, 406 condition)
Ritz-Galerkin method, 88 Three-mode control, 247, 251
SUBJECT INDEX 411

Time-and-fuel-optimal control, 155 Variation, 7, 102, 136, 180, 192, 194,


Time-optimal control, 77, 82, 139, 144, 296
150, 153, 168, 189, 322 Variational equation, 52, 103, 181,
Traffic control, 172 194, 231, 239, 273, 295, 327, 334,
Transversality condition, 23, 104, 183, 360
189, 224, 261, 332, 336 Variational method, 2, 4, 387, 404
Tubular reactor, 84, 128, 130, 165,
274, 344, 371
Two-point boundary-value problem, Weak minimum principle (we
23, 26, 76, 236, 272, 289, 298, Minimum principle)
321, 397 Weierstrass condition, 207, 223
Weighting function, 53, 57, 69, 297,
301, 312, 385
Underdamped system, control of, 155'
Unimodal function, 49, 62
Uniqueness, 143, 170 Yield (see Optimal yield)

This book was set in Modern by The Maple Press Company, and printed on
-permanent paper and bound by The Maple Press Company. The designer was
J. E. O'Connor; the drawings were done by J. & R. Technical Services, Inc.
The editors were B. J. Clark and Maureen McMahon. William P. Weiss
supervised the production.

You might also like