We consider an optimal control problem with ergodic (long term average) reward for a McKean-Vlasov dynamics, where the coefficients of a controlled stochastic ifferential equation depend on the marginal law of the solution. Starting from the associated infinite time horizon expected discounted reward, we construct both the value λ of the ergodic problem and an associated function ϕ, which provide a viscosity solution to an ergodic Hamilton-Jacobi-Bellman (HJB) equation of elliptic type. In contrast to previous results, we consider the function ϕ and the HJB equation on the Wasserstein space, using concepts of derivatives with respect to probability measures. The pair (λ, ϕ) also provides information on limit behavior of related optimization problems, for instance, results of Abelian-Tauberian type or limits of value functions of control problems for finite time horizon when the latter tends to infinity. Many arguments are simplified by the use of a functional relation for ϕ in the form of a suitable dynamic programming principle.

Ergodic Control of McKean–Vlasov Systems on the Wasserstein Space / M. Fuhrman, S. Ruda'. - In: SIAM JOURNAL ON CONTROL AND OPTIMIZATION. - ISSN 0363-0129. - 63:6(2025 Nov 21), pp. 4018-4043. [10.1137/25m1755205]

Ergodic Control of McKean–Vlasov Systems on the Wasserstein Space

M. Fuhrman
Primo
;
S. Ruda'
Secondo
2025

Abstract

We consider an optimal control problem with ergodic (long term average) reward for a McKean-Vlasov dynamics, where the coefficients of a controlled stochastic ifferential equation depend on the marginal law of the solution. Starting from the associated infinite time horizon expected discounted reward, we construct both the value λ of the ergodic problem and an associated function ϕ, which provide a viscosity solution to an ergodic Hamilton-Jacobi-Bellman (HJB) equation of elliptic type. In contrast to previous results, we consider the function ϕ and the HJB equation on the Wasserstein space, using concepts of derivatives with respect to probability measures. The pair (λ, ϕ) also provides information on limit behavior of related optimization problems, for instance, results of Abelian-Tauberian type or limits of value functions of control problems for finite time horizon when the latter tends to infinity. Many arguments are simplified by the use of a functional relation for ϕ in the form of a suitable dynamic programming principle.
stochastic optimal control; ergodic control; McKean-Vlasov differential equations; mean-field control; Bellman equations on the Wasserstein space;
Settore MATH-03/B - Probabilità e statistica matematica
21-nov-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
25m1755205.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 467.9 kB
Formato Adobe PDF
467.9 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
2504.17958v2.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza: Creative commons
Dimensione 537.96 kB
Formato Adobe PDF
537.96 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1199835
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact