IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

Randomized dynamic programming principle and Feynman-Kac representation for optimal control of McKean-Vlasov dynamics / E. Bayraktar, A. Cosso, H. Pham. - In: TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY. - ISSN 0002-9947. - 370:3(2018 Mar), pp. 2115-2160. [10.1090/tran/7118]

Randomized dynamic programming principle and Feynman-Kac representation for optimal control of McKean-Vlasov dynamics

Bayraktar, Erhan; Cosso, A. (second author); Pham, Huyên

2018

Abstract

We analyze a stochastic optimal control problem in which the state process follows McKean-Vlasov dynamics and the diffusion coefficient can be degenerate. We prove that its value function V admits a nonlinear Feynman-Kac representation in terms of a class of forward-backward stochastic differential equations with an autonomous forward process. We exploit this probabilistic representation to rigorously prove the dynamic programming principle (DPP) for V. The Feynman-Kac representation is more than an intermediate step toward our main result: it would also be useful in developing probabilistic numerical schemes for V. The DPP is important in characterizing the value function as a solution of a nonlinear partial differential equation (the so-called Hamilton-Jacobi-Bellman equation), in this case on the Wasserstein space of measures. We note that the usual approach to these control problems is through the Pontryagin maximum principle, which requires some convexity assumptions. There have been earlier attempts at using the dynamic programming approach, but these works assumed a priori that the controls were of Markovian feedback type, which allows the problem to be written only in terms of the distribution of the state process (so that the control problem becomes deterministic). In this paper, we consider open-loop controls and derive the dynamic programming principle in this most general case. To obtain the Feynman-Kac representation and the randomized dynamic programming principle, we implement the so-called randomization method, which consists of formulating a new McKean-Vlasov control problem, expressed in weak form by taking the supremum over a family of equivalent probability measures. One of the main results of the paper is the proof that this latter control problem has the same value function V as the original control problem.

Keywords: Controlled McKean-Vlasov stochastic differential equations; dynamic programming principle; forward-backward stochastic differential equations; randomization method
Scientific-disciplinary sectors of the article (display only): Sector MAT/06 - Probability and Mathematical Statistics
Publication date: Mar 2018
Journal in ANCE: TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY
DOI: https://dx.doi.org/10.1090/tran/7118
URL: https://www.ams.org/journals/tran/2018-370-03/S0002-9947-2017-07118-X/
Type: Article (author)
Appears in types: 01 - Journal article

Files in this item:

File	Access	Type	Size	Format
Bayraktar, Cosso, Pham - TAMS.pdf	Restricted access	Publisher's version/PDF	580.96 kB	Adobe PDF
tran7118_AM.pdf	Open access	Post-print, accepted manuscript, etc. (version accepted by the publisher)	642.04 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/931978
