We observe that, for t∈[0,1], this function is always
negative (i.e., ε > 1); moreover, [z(t)]−≤ −z(0) =
(1 −3e3)/(1 + 3e2)<1, because the function is
strictly negative and strictly increasing. Hence, the
constraint on the control u∈[0,2] is not active and
the optimal feedback is feasible. Using the optimal
feedback control
u∗(t, x) = e2t−3e2
e2t+ 3e2·x−
,
we can find the evolution of the expected value of the
process. Let us define m∗
x(t) := Eu∗
0,x(Xt); using the
infinitesimal generator we obtain
˙m∗
x(t)=−2Eu∗
0,xXt1+Xt+e2t−3e2
e2t+ 3e2·Xt−
,
which becomes
˙m∗
x(t) = e2t−3e2
e2t+ 3e2−2(m∗
x(t) + 1) (9)
with initial condition:
m∗
x(0) = Eu∗
0,x(X0) = x .
Equation (9) is a linear ODE whose coefficients are
continuous functions for all t∈[0,1]; therefore, there
exists a unique solution m∗
x(t)to the previous Cauchy
problem. For x=−1the solution is constant:
m∗
−1(t)≡ −1. On the other hand, for x= 1 we can
explicitly find the analytical form of this function, but
it is long and inexpressive. We prefer to plot its graph
in Figure 1. Moreover, using the evolution of the
expected value of the optimal process starting from
x= 1, we can find the evolution of the probability of
each state: p(t) := Pu∗
0,1(Xt= 1) = (1 + m∗
1(t))/2.
The probability is displayed in Figure 1.
Figure 1: p(t)and m∗
1(t).
We notice that, using the optimal feedback control,
the process moves towards the state −1. When the
initial position is −1, the process remains in this state;
otherwise, when the initial position is 1, the process
changes its state with a strictly positive probability
rate.
6 Conclusion
In this paper, we analyse the evolution of a
continuous-time Markov decision process charac-
terised by a binary state. We introduce the standard
Hamilton–Jacobi–Bellman equation and prove that,
under suitable analytical formulations of the rate of
transition and of the cost function, we can replace the
HJB equation with a backward ODE. Then, all in-
formation useful to characterise an optimal feedback
control is now contained in the solution of the back-
ward ODE. Using a numerical example, we show how
to find an optimal feedback control through the results
shown in this paper.
This research can be improved in various direc-
tions. First, we can try to extend the family of func-
tions that satisfies the hypotheses of Theorem 2. Sub-
sequently, we can investigate whether what is proven
in this paper is valid for analogous problems with an
infinite-time horizon, too. Finally, it may be inter-
esting to study whether this approach is useful for
analysing the interaction between multiple players.
References:
[1] A. Cecchin, P. Dai Pra, M. Fischer & G. Pelino,
On the Convergence Problem in Mean Field
Games: A Two State Model without Uniqueness,
SIAM Journal on Control and Optimization, Vol.
57, No. 4, 2019, pp. 2443-2466.
[2] P. Dai Pra, E. Sartori & M. Tolotti, Climb on
the Bandwagon: Consensus and Periodicity in
a Lifetime Utility Model with Strategic Interac-
tions, Dynamic Games and Applications, Vol. 9,
No. 4, 2019, pp. 1061-1075.
[3] M.K. Ghosh & S. Saha, Risk-sensitive Control
of Continuous-time Markov Chains, Stochastics,
Vol. 86, No.4, 2014, pp. 655-675.
[4] X. Guo & O. Hernández-Lerma, Continuous-
Time Markov Decision Processes, Springer
Berlin Heidelberg, 2009.
[5] M. Kochmański, T. Paszkiewicz, & S. Wolski,
Curie–Weiss magnet — A Simple Model of Phase
Transition, European Journal of Physics, Vol.34,
No.6, 2013, pp. 1555-1578.
[6] W. Wang, X. Su, S. Gan, & L. Qian, Pricing
Vulnerable European Options Under a Markov-
Modulated Jump Diffusion Process, WSEAS
Transactions on Mathematics, Vol.16, 2017, pp.
123-132.
WSEAS TRANSACTIONS on MATHEMATICS
DOI: 10.37394/23206.2023.22.17
Chiara Brambilla, Luca Grosset, Elena Sartori
Contribution of Individual Authors to the
Creation of a Scientific Article (Ghostwriting
Policy)
The authors equally contributed in the present
research, at all stages from the formulation of the
problem to the final findings and solution.
Sources of Funding for Research Presented in a
Scientific Article or Scientific Article Itself
No funding was received for conducting this study.
Conflict of Interest
The authors have no conflicts of interest to declare
that are relevant to the content of this article.
Creative Commons Attribution License 4.0
(Attribution 4.0 International, CC BY 4.0)
This article is published under the terms of the
Creative Commons Attribution License 4.0
https://creativecommons.org/licenses/by/4.0/deed.en
_US