We introduce a novel theoretical framework for Return On Investment (ROI) maximization in repeated decision-making. Our setting is motivated by the use case of companies that regularly receive proposals for technological innovations and want to quickly decide whether they are worth implementing. We design an algorithm for learning ROI-maximizing decision-making policies over a sequence of innovation proposals. Our algorithm provably converges to an optimal policy in class Π at a rate of order min {1/(N ∆2), N−1/3}, where N is the number of innovations and ∆ is the suboptimality gap in Π. A significant hurdle of our formulation, which sets it aside from other online learning problems such as bandits, is that running a policy does not provide an unbiased estimate of its performance.

ROI Maximization in Stochastic Online Decision-Making / N. Cesa Bianchi, T. Cesari, Y. Mansour, V. Perchet (ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS). - In: Advances in Neural Information Processing Systems / [a cura di] M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan. - [s.l] : Curran Associates, 2021. - ISBN 9781713845393. - pp. 9152-9166 (( Intervento presentato al 34. convegno Neural Information Processing Systems tenutosi a virtual nel 2021.

ROI Maximization in Stochastic Online Decision-Making

N. Cesa Bianchi
Primo
;
T. Cesari
Secondo
;
2021

Abstract

We introduce a novel theoretical framework for Return On Investment (ROI) maximization in repeated decision-making. Our setting is motivated by the use case of companies that regularly receive proposals for technological innovations and want to quickly decide whether they are worth implementing. We design an algorithm for learning ROI-maximizing decision-making policies over a sequence of innovation proposals. Our algorithm provably converges to an optimal policy in class Π at a rate of order min {1/(N ∆2), N−1/3}, where N is the number of innovations and ∆ is the suboptimality gap in Π. A significant hurdle of our formulation, which sets it aside from other online learning problems such as bandits, is that running a policy does not provide an unbiased estimate of its performance.
English
Settore INF/01 - Informatica
Intervento a convegno
Esperti anonimi
Ricerca di base
Pubblicazione scientifica
   European Learning and Intelligent Systems Excellence (ELISE)
   ELISE
   EUROPEAN COMMISSION
   H2020
   951847
Advances in Neural Information Processing Systems
M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan
Curran Associates
2021
9152
9166
15
9781713845393
34
Volume a diffusione internazionale
Gold
0
Neural Information Processing Systems
virtual
2021
34
Convegno internazionale
Intervento inviato
https://papers.nips.cc/paper/2021/hash/4c4ea5258ef3fb3fb1fc48fee9b4408c-Abstract.html
DSRC - Data science research center
manual
Aderisco
N. Cesa Bianchi, T. Cesari, Y. Mansour, V. Perchet
Book Part (author)
open
273
ROI Maximization in Stochastic Online Decision-Making / N. Cesa Bianchi, T. Cesari, Y. Mansour, V. Perchet (ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS). - In: Advances in Neural Information Processing Systems / [a cura di] M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan. - [s.l] : Curran Associates, 2021. - ISBN 9781713845393. - pp. 9152-9166 (( Intervento presentato al 34. convegno Neural Information Processing Systems tenutosi a virtual nel 2021.
info:eu-repo/semantics/bookPart
4
Prodotti della ricerca::03 - Contributo in volume
File in questo prodotto:
File Dimensione Formato  
NeurIPS-2021-roi-maximization-in-stochastic-online-decision-making-Paper.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 386.99 kB
Formato Adobe PDF
386.99 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/905994
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact