We study repeated bilateral trade where an adaptive $\sigma$-smooth adversary generates the valuations of sellers and buyers. We provide a complete characterization of the regret regimes for fixed-price mechanisms under different feedback models in the two cases where the learner can post either the same or different prices to buyers and sellers.We begin by showing that the minimax regret after $T$ rounds is of order $\sqrt{T}$ in the full-feedback scenario. Under partial feedback, any algorithm that has to post the same price to buyers and sellers suffers worst-case linear regret. However, when the learner can post two different prices at each round, we design an algorithm enjoying regret of order $T^{3/4}$ ignoring log factors.We prove that this rate is optimal by presenting a surprising $T^{3/4}$ lower bound, which is the main technical contribution of the paper.
Repeated Bilateral Trade Against a Smoothed Adversary / N. Cesa Bianchi, T. Cesari, R. Colomboni, F. Fusco, S. Leonardi (PROCEEDINGS OF MACHINE LEARNING RESEARCH). - In: The Thirty Sixth Annual Conference on Learning Theory / [a cura di] G. Neu, L. Rosasco. - [s.l] : PMLR, 2023. - pp. 1095-1130 (( Intervento presentato al 6. convegno Annual Conference on Learning Theory tenutosi a Bangalore nel 2023.
Repeated Bilateral Trade Against a Smoothed Adversary
N. Cesa Bianchi;T. Cesari;R. Colomboni;
2023
Abstract
We study repeated bilateral trade where an adaptive $\sigma$-smooth adversary generates the valuations of sellers and buyers. We provide a complete characterization of the regret regimes for fixed-price mechanisms under different feedback models in the two cases where the learner can post either the same or different prices to buyers and sellers.We begin by showing that the minimax regret after $T$ rounds is of order $\sqrt{T}$ in the full-feedback scenario. Under partial feedback, any algorithm that has to post the same price to buyers and sellers suffers worst-case linear regret. However, when the learner can post two different prices at each round, we design an algorithm enjoying regret of order $T^{3/4}$ ignoring log factors.We prove that this rate is optimal by presenting a surprising $T^{3/4}$ lower bound, which is the main technical contribution of the paper.File | Dimensione | Formato | |
---|---|---|---|
cesa-bianchi23a.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
470.56 kB
Formato
Adobe PDF
|
470.56 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.