We consider an optimal control problem with ergodic (long term average) reward for a McKean-Vlasov dynamics, where the coefficients of a controlled stochastic ifferential equation depend on the marginal law of the solution. Starting from the associated infinite time horizon expected discounted reward, we construct both the value λ of the ergodic problem and an associated function ϕ, which provide a viscosity solution to an ergodic Hamilton-Jacobi-Bellman (HJB) equation of elliptic type. In contrast to previous results, we consider the function ϕ and the HJB equation on the Wasserstein space, using concepts of derivatives with respect to probability measures. The pair (λ, ϕ) also provides information on limit behavior of related optimization problems, for instance, results of Abelian-Tauberian type or limits of value functions of control problems for finite time horizon when the latter tends to infinity. Many arguments are simplified by the use of a functional relation for ϕ in the form of a suitable dynamic programming principle.
Ergodic Control of McKean–Vlasov Systems on the Wasserstein Space / M. Fuhrman, S. Ruda'. - In: SIAM JOURNAL ON CONTROL AND OPTIMIZATION. - ISSN 0363-0129. - 63:6(2025 Nov 21), pp. 4018-4043. [10.1137/25m1755205]
Ergodic Control of McKean–Vlasov Systems on the Wasserstein Space
M. FuhrmanPrimo
;S. Ruda'
Secondo
2025
Abstract
We consider an optimal control problem with ergodic (long term average) reward for a McKean-Vlasov dynamics, where the coefficients of a controlled stochastic ifferential equation depend on the marginal law of the solution. Starting from the associated infinite time horizon expected discounted reward, we construct both the value λ of the ergodic problem and an associated function ϕ, which provide a viscosity solution to an ergodic Hamilton-Jacobi-Bellman (HJB) equation of elliptic type. In contrast to previous results, we consider the function ϕ and the HJB equation on the Wasserstein space, using concepts of derivatives with respect to probability measures. The pair (λ, ϕ) also provides information on limit behavior of related optimization problems, for instance, results of Abelian-Tauberian type or limits of value functions of control problems for finite time horizon when the latter tends to infinity. Many arguments are simplified by the use of a functional relation for ϕ in the form of a suitable dynamic programming principle.| File | Dimensione | Formato | |
|---|---|---|---|
|
25m1755205.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Licenza:
Nessuna licenza
Dimensione
467.9 kB
Formato
Adobe PDF
|
467.9 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
|
2504.17958v2.pdf
accesso aperto
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza:
Creative commons
Dimensione
537.96 kB
Formato
Adobe PDF
|
537.96 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




