Admissible directions in optimal control
under uniqueness assumptions
JAVIER FROSENBLUETH
IIMAS, Universidad Nacional Aut´
onma de M´
exico
Apartado Postal 20-126, CDMX 01000
MEXICO
Abstract: It is well-known that, for a mathematical programming problem involving equality and inequality con-
straints, the uniqueness of a Lagrange multiplier associated with a local solution implies, under certain smoothness
assumptions, second order necessary optimality conditions. Those conditions hold on a set of critical directions de-
fined by those points satisfying the constraints and for which the minimizing function and the standard Lagrangian
coincide. No similar links between uniqueness of multipliers and second order conditions seem to have been
established for optimal control problems. In this paper, we provide some results in this direction. In particular,
we study and completely solve a natural conjecture which provides, under uniqueness assumptions, nonnegative
second variations on a classical cone of admissible directions.
Key–Words: Optimal control, uniqueness of Lagrange multipliers, second order conditions, normality
Received: January 10, 2023. Revised: September 13, 2023. Accepted: October 16, 2023. Published: November 15, 2023.
1 Introduction
It is well-known that, for a mathematical program-
ming problem involving equality and inequality con-
straints, a second order necessary condition for a local
solution x0holds, under certain smoothness assump-
tions, on the tangent cone TS1(x0)at x0of the set S1
of points satisfying the constraints and for which the
minimizing function and the standard Lagrangian co-
incide. Thus, if x0is a regular point of S1(in the sense
that the tangent cone and the set of tangential con-
straints coincide), the second order condition holds on
the set RS1(x0)of tangential constraints (or critical
directions) at x0with respect to S1.
Regularity can be achieved in different ways. In
particular it holds if there is only one Lagrange multi-
plier associated with the local minimizer. A simple
line of reasoning, to support this statement, can be
given as follows. First of all, the uniqueness of the
multiplier is equivalent to the Mangasarian-Fromovitz
constraint qualification with respect to S1at x0. Sec-
ond, this constraint qualification is equivalent to the
condition of normality of x0relative to the set S1.
Finally (and this is a crucial result in the theory of
mathematical programming), normality implies regu-
larity. Thus, if the set of Lagrange multipliers associ-
ated with x0is a singleton, then x0is a regular point
of S1and a second order condition holds on RS1(x0).
For clarity of exposition, and for comparison rea-
sons, we shall find convenient to “unfold” in the next
few lines some of these concepts, main characters,
statements and implications.
Consider the nonlinear programming problem,
which we label (N), of minimizing fon the set S,
where f, gi:RnR(iAB)are given func-
tions, A={1, . . . , p},B={p+ 1, . . . , m}, and
S:= {xRn|gα(x)0 (αA),
gβ(x) = 0 (βB)}.
We assume, as in [1], [2], [3], that the functions delim-
iting the problem are continuously differentiable and,
when second derivatives occur, they are twice contin-
uously differentiable (for weaker assumptions see, for
example, [4], [5], [6], [7]).
Let us begin with the Karush-Kuhn-Tucker
(KKT) conditions or first order Lagrange multiplier
rule. Denote by Λ(f, x0)the set of all λ=
(λ1, . . . , λm)Rmsatisfying
i. λα0and λαgα(x0) = 0 (αA).
ii. If F(x) := f(x) + hλ, g(x)ithen F0(x0) = 0.
Here, the function Fis the standard Lagrangian,
gis the function mapping Rnto Rmwhose compo-
nents are g1, . . . , gm,hλ, g(x)i=Pm
1λigi(x)is the
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
382
Volume 18, 2023
standard inner product, and λ1, . . . , λmare the Kuhn-
Tucker or Lagrange multipliers.
The KKT conditions become first order neces-
sary conditions for a local solution if a certain con-
straint qualification is imposed. In other words, if x0
is a local solution to the problem, the nonemptiness of
Λ(f, x0)can be assured if some extra assumption on
x0and the constraints is added to the hypothesis.
Denote by
I(x) := {αA|gα(x)=0}(xS)
the set of active (or effective or binding)indices at x.
From the theory of convex cones (see, for example,
[6], [7]) or using the Farkas-Minkowski theorem of
the alternative (see [5]), it can be shown that, if
RS(x0) := {hRn|g0
α(x0;h)0 (αI(x0)),
g0
β(x0;h) = 0 (βB)}
denotes the set of vectors satisfying the tangential
constraints at x0(see [6], [7]), also called the lin-
earized tangent cone or the cone of locally con-
strained directions (see [5]), then
Λ(f, x0)6= f0(x0;h)0for all hRS(x0)
or, equivalently, Λ(f, x0)6=∅⇔−f0(x0)R
S(x0)
where B:= {zRn| hy, zi 0for all yB}is
the (closed convex) dual or polar cone of BRn.
One can find in the literature different answers to
the question of how, if x0is a local minimum, the re-
lation f0(x0)R
S(x0)can be assured. In other
words, there are different assumptions on the con-
straints which ensure that the condition f0(x0)
R
S(x0)is a necessary optimality condition for our
problem. Not all constraint qualifications coincide,
indeed, and a rather intricate web of implications and
equivalences has been established (see, for example,
[5]).
We shall first give to that question a simple an-
swer (that is, a constraint qualification) by making use
of the tangent cone. Moreover, as we shall see, this
approach will lead us also to the derivation of second
order conditions.
The definition we choose of tangent cone is the
one given by Hestenes in [6]. As shown in [5], it is
equivalent to the one introduced by Bouligand (also
known as contingent cone). A sequence {xq} Rn
is said to converge to x0in the direction hif his a unit
vector, xq6=x0, and
lim
q→∞ |xqx0|= 0,lim
q→∞
xqx0
|xqx0|=h.
The tangent cone of Sat x0, denoted by TS(x0), is
the (closed) cone determined by the unit vectors hfor
which there exists a sequence {xq}in Sconverging to
x0in the direction h. Equivalently (see [7]), TS(x0)is
the set of all hRnfor which there exist sequences
{xq}in Sand {tq}of positive numbers, such that
lim
q→∞ tq= 0,lim
q→∞
xqx0
tq
=h.
Note that, if {xq}converges to x0in the direction
hand fhas a differential at x0, then
lim
q→∞
f(xq)f(x0)
|xqx0|=f0(x0;h).
Also, if fhas a second differential at x0, then
lim
q→∞
f(xq)f(x0)f0(x0;xqx0)
|xqx0|2=1
2f00(x0;h).
Suppose that x0solves (N) locally, hTS(x0)is a
unit vector, and {xq} Sis a sequence converging to
x0in the direction h. Hence, f(xq)f(x0)for large
values of qand, therefore,
0lim
q→∞
f(xq)f(x0)
|xqx0|=f0(x0;h).
If also f0(x0)=0, then
0lim
q→∞
f(xq)f(x0)
|xqx0|2=1
2f00(x0;h).
This proves the following basic result on first and
second order necessary conditions (see, for example,
[5, p 303], [6, p 27-28], [7, p 214, 220]).
Theorem 1. Suppose x0solves (N) locally. Then
the following holds.
a. f0(x0;h)0for all hTS(x0).
b. If f0(x0) = 0, then f00(x0;h)0for all h
TS(x0).
The dual cone T
S(x0)of the tangent cone of S
at x0is called the normal cone of Sat x0. By the
first part of Theorem 1, if x0is a local minimum point
of a C1function fon a set S(actually, merely dif-
ferentiability at x0is required) then the negative gra-
dient f0(x0)is an outer normal of Sat x0, that is,
f0(x0)T
S(x0).
Note that TS(x0)RS(x0)since, if hTS(x0)
is a unit vector, and {xq} Sa sequence converging
to x0in the direction h, then
lim
q→∞
gγ(xq)gγ(x0)
|xqx0|=g0
γ(x0;h) (γAB)
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
383
Volume 18, 2023
and, therefore, g0
α(x0;h)0 (αI(x0)) and
g0
β(x0;h) = 0 (βB). Of course, the converse may
not hold. For example, if S={x:x30}, then
TS(0) = {h:h0}but RS(0) = R.
Hence, we have R
S(x0)T
S(x0). If these two
dual cones coincide and x0solves (N) locally, then
f0(x0)T
S(x0) = R
S(x0)
and so Λ(f, x0)6=. The condition
R
S(x0) = T
S(x0)
is referred to in [7] as quasi-regularity and in [5] as the
Guignard-Gould-Tolle constraint qualification. Ac-
cording to [5, p 264], it “enjoys the best situation
with respect to the problem, being the most general
constraint qualification for the feasible set of problem
(N).
The notion of “regularity” as defined in [6, p 35],
[7, p 221], known also as Abadie’s constraint qualifi-
cation (see, for example, [5]), is a crucial one. A point
x0Swill be said to be regular with respect to Sif
TS(x0)and RS(x0)coincide. Clearly, regularity im-
plies quasi-regularity, but one may find quasi-regular
points of Swhich are not regular. These cases, how-
ever, are exceptional. To give a simple example (see
[7]), this occurs with the origin with respect to the set
S={(x, y) : x0, x +y0, x2=y2}.
The same arguments given above yield the fol-
lowing result, the first order Lagrange multiplier rule
for a regular solution. It is a direct consequence of
Theorem 1(a) and the definition of regularity.
Theorem 2. If x0solves (N) locally and is a reg-
ular point of S, then Λ(f, x0)6=.
The second order Lagrange multiplier rule is a
straightforward consequence of Theorem 1(b). In
what follows, F(x) = f(x) + hλ, g(x)idenotes (as
before) the Lagrangian with respect to λΛ(f, x0).
Theorem 3. Suppose that x0Sand λ
Λ(f, x0). If x0solves (N) locally, then F00(x0;h)0
for all hTS1(x0)where S1:= {xS|F(x) =
f(x)}. In particular, if x0is a regular point of S1, then
F00(x0;h)0for all hRS1(x0).
Note that, if λΛ(f, x0)and Γ := {αA|
λα>0}, then
S1[ = S1(λ) ]
={xRn|gα(x)0 (αA, λα= 0),
gβ(x) = 0 (βΓB)}
={xS|gα(x) = 0 (αΓ)}.
Therefore, by definition of tangential constraints, we
have
RS1(x0)
={hRn|g0
α(x0;h)0 (αI(x0), λα= 0),
g0
β(x0;h) = 0 (βΓB)}
={hRS(x0)|g0
α(x0;h) = 0 (αΓ)}
={hRS(x0)|f0(x0;h)=0}.
In general, it may be difficult to test for quasi-
regularity or even regularity, and some criteria imply-
ing these conditions is required. As pointed out in [6],
“it is customary in the calculus of variations to call a
condition on the gradients g0
1(x0), . . . , g0
m(x0)anor-
mality condition if it implies regularity at x0.
In [7], a point x0Sis said to be normal relative
to Sif λ= 0 is the only solution of
i. λα0and λαgα(x0) = 0 (αA).
ii. Pm
1λig0
i(x0)=0.
Normality implies regularity. A proof of this fact
can be found in [7] where another condition, called
properness, is used to establish this implication. A
point x0Sis said to be proper relative to Sif the
set {g0
β(x0)|βB}is linearly independent and, if
p > 0, there exists hsuch that
g0
α(x0;h)<0 (αI(x0)), g0
β(x0;h) = 0 (βB).
As shown in [7, p 241], properness and normality are
equivalent. This is also proved in [5, p 256] by using
theorems of alternative (see Motzkin in [5, Theorem
2.4.19]), and in [4, p 43] for inequality constraints, in
terms of positive linear independence, by an applica-
tion of the Hahn-Banach separation theorem.
Since normality is equivalent to properness and
they imply regularity, Theorem 2 tells us that both
are constraint qualifications. They are also referred
to in the literature as the Cottle-Dragomirescu and
the Mangasarian-Fromovitz constraint qualifications
respectively. In view of Theorem 2, we have the fol-
lowing classical result.
Theorem 4. If x0solves (N) locally and is a nor-
mal point of S, then Λ(f, x0)6=.
It is important to point out that the conclusion that
normality is a constraint qualification can be reached
without making use of Theorem 2 and the notion of
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
384
Volume 18, 2023
regularity. The well-known Fritz John necessary op-
timality condition (see, for example, [1], [5], [7], [8]
and, for a nonsmooth multiplier rule, see [4]) allows
the cost multiplier to vanish. It states that, if x0solves
(N) locally, then there exist λ00and λRm, not
both zero, such that
i. λα0and λαgα(x0) = 0 (αA).
ii. F0(x) := λ0f(x) + hλ, g(x)i F0
0(x0)=0.
In view of this result, if x0solves (N) locally and
is a normal point of S, then λ0>0(since, by normal-
ity relative to S, if λ0= 0 then λ= 0, contradicting
the nontriviality condition) and the multipliers can be
chosen so that λ0= 1, thus implying nonemptiness of
Λ(f, x0).
We are now in a position to state a fundamental
result on second order necessary conditions under nor-
mality assumptions. It is an immediate consequence
of Theorem 3 and the well-known link between nor-
mality and regularity. For the application of this result
note that, given λΛ(f, x0),x0is a normal point of
S1(λ)if µ= 0 is the only solution of
i. µα0and µαgα(x0) = 0 (αA, λα= 0).
ii. Pm
1µig0
i(x0) = 0.
Theorem 5. Suppose that x0Sand λ
Λ(f, x0). If x0solves (N) locally and is a normal point
of S1(λ), then F00(x0;h)0for all hRS1(x0).
Let us now introduce one of the main characters
which plays a leading role in the theory: the linear
independence constraint qualification (LI). For a given
point x0S, it asks the set {g0
γ(x0)|γI(x0)B}
to be linearly independent.
This is, by the way, the definition of normal-
ity given in [6], in contrast with the definition in [7]
which we introduced before. As one readily verifies,
it corresponds to normality of x0with respect to (nei-
ther Snor S1but) the set of equality constraints for
active indices:
S0[ = S0(x0) ]
:= {xRn|gγ(x) = 0 (γI(x0)B)}.
This follows since, by definition, xS0(x0)is nor-
mal relative to S0(x0)if λ= 0 is the only solution
of
i. λαgα(x0) = 0 (αA).
ii. Pm
1λig0
i(x) = 0.
This is equivalent to LI, the condition that the lin-
ear equations g0
γ(x0;h) = 0 (γI(x0)B)in hbe
linearly independent.
Note that, given x0Sand λRmwith λα0
(αA), we have RS0(x0)RS1(x0)RS(x0)
where, by definition,
RS0(x0) = {hRn|g0
γ(x0;h)=0
(γI(x0)B)}.
Also, if x0is a normal point of S0, then it is a normal
point of S1, and hence a normal point of S.
In most textbooks (see a thorough explanation
and references in [1]) the main result on second or-
der necessary conditions differs from Theorem 5 in
two fundamental aspects: it is derived under the as-
sumption of the linear independence constraint quali-
fication (that is, normality relative to S0instead of S1)
and the second order condition holds on the set of crit-
ical directions RS0(x0)(instead of RS1(x0)). Thus, in
this result, the assumptions are stronger and the con-
clusions weaker than those of Theorem 5 since, usu-
ally, normality relative to S0is stronger than normality
relative to S1and RS1(x0)contains properly the set
RS0(x0). As pointed out in [1], “The source of this
weaker result can be attributed to the traditional way
of treating the active inequality constraints as equality
constraints. Explicitly, this rather “well-worn result”
(see [1]) is the following.
Theorem 6. Suppose x0Sand λΛ(f, x0).
If x0solves (N) locally and is a normal point of
S0(x0), then F00(x0;h)0for all hRS0(x0).
The set of critical directions in Theorem 5 is then,
in general, bigger than that of Theorem 6. How-
ever, as the following examples show, even under the
strong assumption of Theorem 6 (normality relative
to S0(x0)or the LI constraint qualification), the set of
critical directions cannot be “too big”: we may very
well have negative second variations on RS(x0).
Example 1. Consider the problem of minimizing
f(x1, x2) = x1subject to
g1(x1, x2) = x2
1x20,
g2(x1, x2) = x1+x2= 0.
Clearly x0= (1,1) is a local solution to
the problem and LI is satisfied since the gradients
g0
1(x0) = (2,1) and g0
2(x0) = (1,1) are linearly
independent. We have
F(x1, x2) = x1+λ1(x2
1x2) + λ2(x1+x2)
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
385
Volume 18, 2023
and so F0(x1, x2) = (1 2λ1x1+λ2,λ1+λ2).
Thus F0(x0)=0implies λ1=λ2= 1. Also,
F00(x1, x2) = 2λ10
0 0 =2 0
0 0
and so F00(x0;h, k) = 2h2. Now, we have
g0
1(x1, x2)=(2x1,1), g0
1(x1, x2) = (1,1)
and so
RS(x0) = {(h, k)| 2hk0, h +k= 0}.
Therefore (1,1) RS(x0)but F00(x0; 1,1) =
2<0.
Example 2. Consider the problem of minimizing
f(x) = x3subject to g(x) = x10.
Clearly x0= 1 is a (global) solution and LI holds
since g0(x0)=16= 0. We have
F(x) = f(x) + λg(x) = x3+λx λ
and so F0(x) = 3x2+λand F0(x0)=0implies
that λ= 3. Also F00(x0;h) = 6h2. Since
RS(x0) = {h|g0(x0;h) = h0}
we have 1RS(x0)and F00(x0;1) = 6<0.
We can now explain, in a clear and succinct way,
the role played by the uniqueness of Lagrange multi-
pliers in the intricate web of constraint qualifications.
Let us begin with a simple argument which im-
plies uniqueness. Suppose x0Sand λΛ(f, x0).
If also ¯
λΛ(f, x0), set µ:= ¯
λλand observe that
µαgα(x0) = ¯
λαgα(x0)λαgα(x0) = 0 (αA),
m
X
1
µig0
i(x0) =
m
X
1
¯
λig0
i(x0)
m
X
1
λig0
i(x0) = 0.
Thus, if x0is normal relative to S0(x0), then µ= 0
and Λ(f, x0)would be a singleton. In other words, the
following result holds.
Theorem 7. Suppose x0Sand λΛ(f, x0).
If x0is normal relative to S0(x0), then Λ(f, x0) =
{λ}.
In view of Theorem 4, if x0affords a local mini-
mum to fon Sand x0is normal relative to S0(and so
normal relative to S) then there exists λΛ(f, x0).
We have just proved that, in this event, λΛ(f, x0)
is unique. As one readily verifies, if normality is as-
sumed relative to S, the existence of λΛ(f, x0)can
be assured, but it may not be unique. Here we arrive
at a crucial result in the theory.
In [2] it is shown that uniqueness of the La-
grange multiplier associated with the point x0can be
achieved by an assumption weaker than that of nor-
mality relative to S0(x0), namely, normality relative
to S1(λ). Moreover, this assumption is not only suffi-
cient for the uniqueness of the multiplier but also nec-
essary. This is the content of the following result.
Theorem 8. Suppose that x0Sand λ
Λ(f, x0). Then x0is normal relative to S1(λ)if and
only if Λ(f, x0) = {λ}.
In [2], the statement of this result is expressed in
terms of a condition called the strict Mangasarian-
Fromovitz constraint qualification which is no other
than properness relative to S1(λ). The proof given
in that paper, however, relies precisely on the equiva-
lence between normality and properness.
Combining this result with Theorem 5, we finally
reach the main result on second order necessary con-
ditions under uniqueness assumptions.
Theorem 9. Suppose that x0Sand λ
Λ(f, x0). If x0solves (N) locally and Λ(f, x0) =
{λ}, then F00(x0;h)0for all hRS1(x0).
It is worth mentioning that an entirely different
approach, found in [3], is based on the idea that, since
constraint qualifications are independent of the objec-
tive function f, if a constraint qualification implies
a certain property for the Lagrange multipliers, this
property will hold for all objective functions for which
x0affords a local minimum. If we define F(x0)as the
set of all fC1(Rn,R)such that x0affords a local
minimum to fon S, the result on uniqueness of La-
grange multipliers given in [3] states that, if x0S,
then Λ(f, x0)is a singleton for all f F(x0)x0
is normal relative to S0(x0).
Uniqueness of multipliers in different areas of
constrained optimization has been studied from differ-
ent points of view in order to deal with, to mention a
few, problems subject to cone constraints [9], sensitiv-
ity analysis of optimization problems [10], composite
optimization [11], sets of functions with a common
local minimum [3], or derivation of second order con-
ditions [2], [5], [6], [7], [12], [13], [14].
In this paper we shall be concerned with unique-
ness of multipliers for certain classes of optimal con-
trol problems. Our main objective will be to state the
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
386
Volume 18, 2023
main aspects of the theory in that context and give
some answers to questions related to possible links
between uniqueness of multipliers, admissible direc-
tions, and second order optimality conditions.
The stage for such problems, involving equality
and inequality constraints in the control functions, is
rather different than that for the finite dimensional
case. In particular, it has been proved that the cor-
responding Mangasarian-Fromovitz constraint quali-
fication (also called properness) and the condition of
normality are equivalent [15], as in the mathemat-
ical programming problem, but these conditions do
not necessary imply regularity [16]. Moreover, the
Mangasarian-Fromovitz constraint qualification with
respect to S1implies uniqueness of the Lagrange mul-
tiplier, but the converse may not hold [17], [18].
It is of interest to know if, for optimal control
problems, the uniqueness of Lagrange multipliers im-
plies or not second order conditions on a specific set of
critical directions. Let us move forward to that ques-
tion.
2 Statement of the problem
In the remaining of this paper we shall deal with an
optimal control problem with fixed endpoints, posed
over piecewise C1trajectories and piecewise continu-
ous controls, and involving inequalities and equalities
in the control functions.
We believe this problem, though (relatively) sim-
ple compared with other formulations, captures the
essence of the question connecting uniqueness of mul-
tipliers with second order conditions. The problem is
not simple, by any means. Actually, the difficulties
encountered for this kind of optimal control problems
are of a much subtler nature than those for (N). As ex-
plained in [4, p 335], the type of constraints we shall
now deal with (even for the classical Lagrange prob-
lem in the calculus of variations with equality con-
straints) “make the constrained optimal control prob-
lem much more complex than (N), or even the isoperi-
metric problem. In part, this is because we now have
infinitely many constraints, one for each t.
To state the problem, suppose we are given an in-
terval T:= [t0, t1]in R, two points ξ0,ξ1in Rn, and
functions Land fmapping T×Rn×Rmto Rand
Rnrespectively, and ϕ= (ϕ1, . . . , ϕq)mapping Rm
to Rq. Denote by Xthe space of piecewise C1func-
tions mapping Tto Rn, and by Ukthe space of piece-
wise continuous functions mapping Tto Rk(kN).
Let Z:= X× Um, and set
D:= {(x, u)Z|˙x(t) = f(t, x(t), u(t)) (tT),
x(t0) = ξ0, x(t1) = ξ1},
S:= {(x, u)D|ϕα(u(t)) 0,
ϕβ(u(t)) = 0 (αR, β Q, t T)}
where R={1, . . . , r}and Q={r+ 1, . . . , q}. For
all (x, u)in Z, let
I(x, u) := Zt1
t0
L(t, x(t), u(t))dt.
The problem we shall deal with, which we label (P),
is that of minimizing Iover S.
Acontrol function uis an element of Umand a
state trajectory xcorresponding to a control function
uis an element of Xwhich solves the differential
equation ˙x(t) = f(t, x(t), u(t)) (tT). A pair
(x, u)comprising a state trajectory xand an associ-
ated control function uis referred to as a process. If
usatisfies the control constraints u(t)U(tT),
where
U:= {uRm|ϕα(u)0 (αR),
ϕβ(u) = 0 (βQ)},
and xsatisfies the endpoint constraints x(t0) = ξ0and
x(t1) = ξ1, then the process (x, u)is said to be ad-
missible, and it solves (P) if it achieves the minimum
value of Iover all admissible processes.
With respect to the functions delimiting the prob-
lem, we assume that, if F:= (L, f), then F(t, ·,·)is
C2for all tTand ϕis C2;F(·, x, u)and its deriva-
tives in (x, u)are piecewise continuous and there ex-
ists an integrable function α:TRsuch that, at any
point (t, x, u)T×Rn×Rm,
|F(t, x, u)|+|∇(x,u)F(t, x, u)|+
|∇2
(x,u)F(t, x, u)| α(t).
Moreover, the q×(m+r)-dimensional matrix
ϕi
ukδϕα
(i= 1, . . . , q;α= 1, . . . , r;k= 1, . . . , m), has
rank qon U(here δαα = 1,δαβ = 0 (α6=β)). This
condition is equivalent to the condition that, at each
point uin U, the matrix
ϕi
uk(i=i1, . . . , ip;k= 1, . . . , m)
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
387
Volume 18, 2023
has rank p, where i1, . . . , ipare the indices i
{1, . . . , q}such that ϕi(u) = 0 (see [12] for details).
Finally, given (x0, u0)S, we shall denote by [t]
the point (t, x0(t), u0(t)) and set A(t) := fx[t]and
B(t) := fu[t]. In what follows, the notation will
be used to denote transpose.
3 Normality and first order condi-
tions
First order necessary conditions are well established
and one version (consequence of the Maximum Prin-
ciple) can be stated as follows (see, for example, [6]).
This result can be seen as the analogous of the Fritz
John necessary optimality condition in mathematical
programming mentioned in the introduction.
For all (t, x, u, p, µ, λ)in T×Rn×Rm×Rn×
Rq×Rdefine the Hamiltonian function as
H(t, x, u, p, µ, λ) :=
hp, f(t, x, u)i λL(t, x, u) hµ, ϕ(u)i.
Theorem 10. Suppose (x0, u0)solves (P). Then
there exist λ00,pX, and µ Uq, not vanishing
simultaneously on T, such that
a. µα(t)0and µα(t)ϕα(u0(t)) = 0 (α
R, t T);
b. ˙p(t) = H
x(t, x0(t), u0(t), p(t), µ(t), λ0)on
every interval of continuity of u0;
c. Hu(t, x0(t), u0(t), p(t), µ(t), λ0) = 0 (tT).
In this theorem, the case λ0= 1, as in the theory
of mathematical programming, is particuarly relevant.
Denote by Ethe set of all (x0, u0, p, µ)Z×X×Uq
satisfying
a. µα(t)0and µα(t)ϕα(u0(t)) = 0 (α
R, t T);
b. ˙p(t) = A(t)p(t) + L
x[t] (tT);
c. B(t)p(t) = L
u[t] + ϕ0∗(u0(t))µ(t) (tT).
Given (x0, u0)S, let Λ(x0, u0)be the set of all
(p, µ)X× Uqsuch that (x0, u0, p, µ) E. The el-
ements of Ewill be called extremals and of Λ(x0, u0)
Lagrange multipliers (no confusion should arise with
the notation and terminology used in Section 1).
As for the finite dimensional case, given a so-
lution (x0, u0)to the problem, nonemptiness of
Λ(x0, u0)requires the assumption of a certain con-
straint qualification applied to the constraints and the
admissible process. By analogy with the mathemati-
cal programming problem, we define normality rela-
tive to Sby imposing the null solution as the unique
solution to the system given in Theorem 10 when the
cost multiplier vanishes. This is a natural extension of
the definition of normality given for problem (N).
We shall say that (x0, u0)Sis normal relative
to Sif, given (p, µ)X× Uqsatisfying
i. µα(t)0and µα(t)ϕα(u0(t)) = 0 (α
R, t T);
ii. ˙p(t) = A(t)p(t)
[ = H
x(t, x0(t), u0(t), p(t), µ(t),0) ] (tT);
iii. 0 = B(t)p(t)ϕ0∗(u0(t))µ(t)
[ = H
u(t, x0(t), u0(t), p(t), µ(t),0) ] (tT),
then p0. Note that, in this event, also µ0.
From Theorem 10 we conclude that, if (x0, u0)
solves (P) and is a normal process relative to S, then
Λ(x0, u0)is nonempty, that is, there exists (p, µ)
X× Uqsuch that (x0, u0, p, µ)is an extremal.
The sets S0and S1, which played a fundamen-
tal role before, have also their counterpart in optimal
control. Denote the set of active indices at uby
Ia(u) := {αR|ϕα(u)=0}(uRm).
Given u0 Um, let
S0[ = S0(u0) ] := {(x, u)D|ϕi(u(t)) = 0
(iIa(u0(t)) Q, t T)}.
For µ Uqwith µα(t)0 (αR, t T), define
S1[ = S1(µ) ] := {(x, u)D|
ϕα(u(t)) 0 (αR, µα(t) = 0, t T),
ϕβ(u(t)) = 0 (βRwith µβ(t)>0,
or βQ, t T)}.
Note that
S1(µ) = {(x, u)S|
ϕα(u(t)) = 0 (αR, µα(t)>0, t T)}.
Since normality is defined relative to any set of pro-
cesses given by inequality and equality constraints, it
can be applied to the sets S0(u0)and S1(µ). By defi-
nition, we obtain the following conditions.
An admissible process (x0, u0)is normal relative
to S0(u0)if, given (p, µ)X× Uqsatisfying
i. µα(t)ϕα(u0(t)) = 0 (αR, t T);
ii. ˙p(t) = A(t)p(t) (tT);
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
388
Volume 18, 2023
iii. B(t)p(t) = ϕ0∗(u0(t))µ(t) (tT),
then p0. As before, this implies that µ0.
Similarly, an admissible process (x0, u0)is nor-
mal relative to S1(µ)if, given (q, ν)X× Uqsatis-
fying
i. να(t)0and να(t)ϕα(u0(t)) = 0 (α
R, µα(t)=0, t T);
ii. ˙q(t) = A(t)q(t) (tT);
iii. B(t)q(t) = ϕ0∗(u0(t))ν(t) (tT),
then q0. In this event, again, we also have ν0.
As one readily verifies, if (x0, u0)Sis normal
relative to S0(u0), then Λ(x0, u0)is a singleton. This
assumption, however, can be weakened, and the same
conclusion will follow if, given (p, µ)Λ(x0, u0),
the admissible process is normal relative to S1(µ).
The converse, however, may not hold. There are prob-
lems for which the pair (p, µ)is unique in Λ(x0, u0)
but (x0, u0)fails to be normal relative to S1(µ). A
full account of these results can be found in [18]. For
a characterization of the uniqueness of multipliers in
terms of normality of yet another set of constraints we
refer to [17].
4 Uniqueness and second order con-
ditions
We shall find convenient to express the normality con-
ditions of the previous section in terms of certain con-
vex cones. Given µRqdefine the following subsets
of indices of R:
Γ0(µ) := {αR|µα= 0},
Γp(µ) := {αR|µα>0}.
For all uRmand µRq, consider the following
sets
τ0(u) := {hRm|ϕ0
i(u)h= 0 (iIa(u)Q)},
τ1(u, µ) := {hRm|ϕ0
i(u)h0
(iIa(u)Γ0(µ)),
ϕ0
j(u)h= 0 (jΓp(µ)Q)},
τ(u) := {hRm|ϕ0
i(u)h0 (iIa(u)),
ϕ0
j(u)h= 0 (jQ)}.
As shown in [13], [14], (x0, u0)is normal with
respect to S0(u0)if z0is the only solution of the
system
˙z(t) = A(t)z(t), z(t)B(t)h= 0
for all hτ0(u0(t)) (tT).
Similarly, (x0, u0, µ)is normal with respect to S1(µ)
if z0is the only solution of the system
˙z(t) = A(t)z(t), z(t)B(t)h0
for all hτ1(u0(t), µ(t)) (tT).
Finally, (x0, u0)is normal with respect to Sif z0
is the only solution of the system
˙z(t) = A(t)z(t), z(t)B(t)h0
for all hτ(u0(t)) (tT).
For second order necessary conditions, we
consider the quadratic integral defined, for all
(x, u, p, µ)Z×X× Uq, by
J((x, u, p, µ); (y, v)) :=
Zt1
t0
2Ω(t, y(t), v(t))dt ((y, v)Z)
where, for all (t, y, v)T×Rn×Rm,
2Ω(t, y, v) :=
[hy, Hxx(t)yi+ 2hy, Hxu(t)vi+hv, Huu(t)vi]
and H(t)denotes H(t, x(t), u(t), p(t), µ(t),1).
The next result is well-known in the literature
(see, for example, [12]). It guarantees uniqueness
of the Lagrange multiplier and provides second order
necessary conditions.
Theorem 11. Suppose (x0, u0)solves (P)
and (p, µ)Λ(x0, u0). If (x0, u0)is nor-
mal relative to S0(u0)then (p, µ)is unique and
J((x0, u0, p, µ); (y, v)) 0for all (y, v)Zsat-
isfying
i. ˙y(t) = A(t)y(t) + B(t)v(t) (tT);
ii. y(t0) = y(t1)=0;
iii. v(t)τ0(u0(t)) (tT).
This result is, clearly, the counterpart of Theorem
6 which deals with normality with respect to S0and
guarantees nonnegativity on the set RS0.
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
389
Volume 18, 2023
It is worth mentioning that some examples found
in the literature (see [13], [14]) show that we may have
a solution (x0, u0)to the problem which is normal rel-
ative to S, a pair (p, µ)such that (x0, u0, p, µ) E,
and (y, v)Zsatisfying (i), (ii) and (iii) of this theo-
rem, but J((x0, u0, p, µ); (y, v)) <0. In other words,
the conclusion of Theorem 11 may not hold if we as-
sume that the solution to the problem is normal with
respect to S.
Those references provide also examples where a
solution which is normal relative to S1(µ)may yield
a negative second variation on τ(u0(t)). In other
words, there exists an admissible process (x0, u0)
which solves the problem and is normal with re-
spect to S1(µ)with (p, µ)Λ(x0, u0)but, for some
(y, v)Zsatisfying (i), (ii) of Theorem 11 with
v(t)τ(u0(t)), we have J((x0, u0, p, µ); (y, v)) <
0. It is natural to ask if the same conclusion can be
reached assuming only uniqueness of the multiplier.
More generally, it is of interest to know if the sec-
ond order necessary condition given in Theorem 11
holds in a larger set and/or under weaker assumptions.
In particular, we would like to know if the theorem re-
mains valid assuming uniqueness of the multiplier in-
stead of the strong normality assumption on (x0, u0).
Explicitly, the question is if the following result holds.
Conjecture 1. Suppose (x0, u0)solves (P) and
(p, µ)Λ(x0, u0). If (p, µ)is unique, that is, if
Λ(x0, u0) = {(p, µ)}, then J((x0, u0, p, µ); (y, v))
0for all (y, v)Zsatisfying
i. ˙y(t) = A(t)y(t) + B(t)v(t) (tT);
ii. y(t0) = y(t1)=0;
iii. v(t)τ(u0(t)) (tT).
5 The example
In this final section we provide a new and illustra-
tive result in the direction signaled by the conjec-
ture. It corresponds to an example where a solu-
tion (x0, u0)to the problem has a unique multiplier
(p, µ)Λ(x0, u0)but J((x0, u0, p, µ); (y, v)) <0
for some (y, v)Zsatisfying
˙y(t) = A(t)y(t) + B(t)v(t) (tT),
y(t0) = y(t1)=0,
v(t)τ(u0(t)) (tT).
In other words, the conclusion of the conjecture may
not be true.
Example 3. Consider the problem of minimizing
I(x, u) = R1
1b(t)u3(t)dt subject to
˙x(t) = u(t) (t[1,1]), x(1) = x(1) = 0,
u2(t)1 (t[1,1])
where
b(t) = 0if t[1,0]
t2if t[0,1].
In this case T= [1,1],n=m= 1,ξ0=ξ1=
0and, for all (t, x, u)T×R×R,
L(t, x, u) = b(t)u3, f(t, x, u) = u, ϕ(u) = u21.
Let
x0(t) := t+ 1 if t[1,0]
1tif t[0,1]
u0(t) := 1if t[1,0]
1if t(0,1].
Clearly (x0, u0)is a solution to the problem.
Now, if (x0, u0, p, µ) E, then µ(t)0,˙p(t) =
0,
p(t) = 3b(t)u2
0(t)+2u0(t)µ(t) (t[1,1]).
Thus pis a constant satisfying
p=2µ(t)if t[1,0]
3t22µ(t)if t(0,1].
Since µ(t)0for all tT, from the first relation
we have p0and, from the second, p3t2for all
t(0,1] and so p0. Thus p0and therefore
µ(t) = 0if t[1,0]
(3/2)t2if t[0,1].
This implies that (p, µ)is the only pair such that
(x0, u0, p, µ) E.
We have H(t, x, u, p, µ) = pu b(t)u3(u2
1)µand so
Hu(t, x, u, p, µ) = p3b(t)u22µu,
Huu(t, x, u, p, µ) = 6b(t)u2µ.
Therefore
Huu(t, x0(t), u0(t), p(t), µ(t))
=0if t[1,0]
3t2if t[0,1].
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
390
Volume 18, 2023
This implies that, for all (x, u, p, µ)Z×X× U1
and (y, v)Z,
J((x, u, p, µ); (y, v)) = Z1
0
3t2v2(t)dt.
Since ϕ0(u) = 2u, we have
τ0(u0(t)) = {h|2u0(t)h= 0}={0},
τ1(u0(t), µ(t)) is given by
{h|2u0(t)h0}={h|h0}if t[1,0],
{h|2u0(t)h= 0}={0}if t(0,1],
and τ(u0(t)) is given by
{h|2u0(t)h0}={h|h0}if t[1,0],
{h|2u0(t)h0}={h|h0}if t(0,1].
Let
y(t) := t1if t[1,0]
t1if t[0,1]
v(t) := 1if t[1,0]
1if t(0,1].
Then (y, v)solves ˙y(t) = v(t) (tT),y(1) =
y(1) = 0,v(t)τ(u0(t)), and
J((x0, u0, p, µ); (y, v)) = Z1
0
3t2v2(t)dt < 0.
Let us end this paper with an open question, a new
conjecture. As mentioned before, it is of interest to
know if Theorem 11 holds in a larger set and/or under
weaker assumptions. In particular, we would like to
know if Conjecture 1 is valid if we replace condition
(iii) by
v(t)τ1(u0(t), µ(t)) (tT).
Explicitly, the question is if the following result
holds.
Conjecture 2. Suppose (x0, u0)solves (P) and
(p, µ)Λ(x0, u0). If (p, µ)is unique, that is, if
Λ(x0, u0) = {(p, µ)}, then J((x0, u0, p, µ); (y, v))
0for all (y, v)Zsatisfying
i. ˙y(t) = A(t)y(t) + B(t)v(t) (tT);
ii. y(t0) = y(t1) = 0;
iii. v(t)τ1(u0(t), µ(t)) (tT).
References:
[1] Ben-Tal A (1980) Second-order and related ex-
tremality conditions in nonlinear programming,
Journal of Optimization Theory & Applications,
31: 143-165
[2] Kyparisis J (1985) On uniqueness of Kuhn-
Tucker multipliers in nonlinear programming,
Mathematical Programming, 32: 242-246
[3] Wachsmuth G (2013) On LICQ and the unique-
ness of Lagrange multipliers, Operations Re-
search Letters, 41: 78-80
[4] Clarke F (2013) Functional Analysis, Calculus
of Variations and Optimal Control, Springer-
Verlag, London
[5] Giorgi G, Guerraggio A, Thierfelder J (2004)
Mathematics of Optimization: Smooth and Non-
smooth Case, Elsevier, Amsterdam
[6] Hestenes MR (1966) Calculus of Variations and
Optimal Control Theory, John Wiley, New York
[7] Hestenes MR (1975) Optimization Theory, The
Finite Dimensional Case, John Wiley, New York
[8] McShane EJ (1973) The Lagrange Multiplier
Rule, The American Mathematical Monthly, 80:
922-925
[9] Shapiro A (1997) On uniqueness of Lagrange
multipliers in optimization problems subject to
cone constraints, SIAM Journal of Optimization,
7: 508-518
[10] Kyparisis J (1990) Sensitivity analysis for non-
linear programs and variational inequalities with
non-unique multipliers, Mathematics of Opera-
tions Research, 15: 286-298
[11] Deng S (1996) On uniqueness of Lagrange mul-
tipliers in composite optimization, Journal of
Mathematical Analysis & Applications, 201:
689-696
[12] de Pinho MR, Rosenblueth JF (2007) Mixed
constraints in optimal control: an implicit func-
tion theorem approach, IMA Journal of Mathe-
matical Control and Information, 24: 197-218
[13] Rosenblueth JF, S´
anchez Licea G (2012) Neg-
ative second variations for problems with in-
equality control constraints, 3rd European Con-
ference for the Applied Mathematics and Infor-
matics, Montreux, Suiza, WSEAS Press, Math-
ematical Methods for Information Science and
Economics, 150-155
[14] Rosenblueth JF, S´
anchez Licea G (2013) Cones
of critical directions in optimal control, Interna-
tional Journal of Applied Mathematics & Infor-
matics, 7: 55-67
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
391
Volume 18, 2023
[15] Cortez KL, Rosenblueth JF (2021) Normality,
controllability and properness in optimal con-
trol, Applied Mathematics & Optimization, 84:
159-173
[16] Cortez KL, Rosenblueth JF (2019) The broken
link between normality and regularity in the cal-
culus of variations, Systems & Control Letters,
124: 27-32
[17] Becerril JA, Cortez KL, Rosenblueth JF (2019)
Uniqueness of multipliers in optimal control:
the missing piece, IMA Journal of Mathemati-
cal Control & Information, 36: 1395-1411
[18] Cortez KL, Rosenblueth JF (2018) Normality
and uniqueness of Lagrange multipliers, Dis-
crete & Continuous Dynamical Systems Series
A, 38: 3271-3290
Contribution of Individual Authors to the Creation
of a Scientific Article (Ghostwriting Policy)
The author carried out all the research of this article.
Sources of Funding for Research Presented in a
Scientific Article or Scientific Article Itself
No funding was received for conducting this study.
Conflicts of Interest
The author has no conflicts of interest to declare that
are relevant to the content of this article.
Creative Commons Attribution License 4.0
(Attribution 4.0 International, CC BY 4.0)
This article is published under the terms of the
Creative Commons Attribution License 4.0
https://creativecommons.org/licenses/by/4.0/deed.en
_US
WSEAS TRANSACTIONS on SYSTEMS and CONTROL
DOI: 10.37394/23203.2023.18.40
Javier F. Rosenblueth
E-ISSN: 2224-2856
392
Volume 18, 2023