We consider a sequential decision-making setting where, at every round t, the learner (a market maker) posts a bid price Bt and an ask price At to an incoming trader (the taker) with a private valuation for some asset. If the trader’s valuation is lower than the bid price, or higher than the ask price, then a trade (sell or buy) occurs. Letting Mt be the market price (observed only at the end of round t), the maker’s utility is Mt − Bt if the maker bought the asset, it is At − Mt if they sold it, and it is 0 if no trade occurred. We characterize the maker’s regret with respect to the best fixed choice of bid and ask pairs under a variety of assumptions (adversarial, i.i.d., and their variants) on the sequence of market prices and valuations. Our upper bound analysis unveils an intriguing connection relating market making to first-price auctions and dynamic pricing. Our main technical contribution is a lower bound for the i.i.d. case with Lipschitz distributions and independence between market prices and takers’ valuations. The difficulty in the analysis stems from a unique relationship between the reward and feedback functions that allows learning algorithms to trade off reward for information in a continuous way.
Market Making without Regret / N. Cesa Bianchi, T. Cesari, R. Colomboni, L. Foscari, V. Pathak (PROCEEDINGS OF MACHINE LEARNING RESEARCH). - In: The Thirty Eighth Annual Conference on Learning Theory / [a cura di] N. Haghtalab, A. Moitra. - [s.l] : PMLR, 2025. - pp. 799-837 (( Intervento presentato al 38. convegno Conference on Learning Theory tenutosi a Lyon nel 2025.
Market Making without Regret
N. Cesa Bianchi;T. Cesari;R. Colomboni;L. Foscari;
2025
Abstract
We consider a sequential decision-making setting where, at every round t, the learner (a market maker) posts a bid price Bt and an ask price At to an incoming trader (the taker) with a private valuation for some asset. If the trader’s valuation is lower than the bid price, or higher than the ask price, then a trade (sell or buy) occurs. Letting Mt be the market price (observed only at the end of round t), the maker’s utility is Mt − Bt if the maker bought the asset, it is At − Mt if they sold it, and it is 0 if no trade occurred. We characterize the maker’s regret with respect to the best fixed choice of bid and ask pairs under a variety of assumptions (adversarial, i.i.d., and their variants) on the sequence of market prices and valuations. Our upper bound analysis unveils an intriguing connection relating market making to first-price auctions and dynamic pricing. Our main technical contribution is a lower bound for the i.i.d. case with Lipschitz distributions and independence between market prices and takers’ valuations. The difficulty in the analysis stems from a unique relationship between the reward and feedback functions that allows learning algorithms to trade off reward for information in a continuous way.| File | Dimensione | Formato | |
|---|---|---|---|
|
cesa-bianchi25a.pdf
accesso aperto
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Licenza:
Creative commons
Dimensione
754.1 kB
Formato
Adobe PDF
|
754.1 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




