Cooperative games in a stochastic environment

Apolloni, B.; Bassis, S.; Gaito, S.; Malchiodi, D.

doi:10.1007/978-3-540-45216-4_2

We introduce a very complex game based on an approximate solution of a NP-hard problem, so that the probability of victory grows monotonically, but of an unknown amount, with the resources each player employs. We formulate this model in the computational learning framework and focus on the problem of computing a confidence interval for the losing probability. We deal with the problem of reducing the width of this interval under a given threshold in both batch and on-line modality. While the former leads to a feasible polynomial complexity, the on-line learning strategy may get stuck in an indeterminacy trap: the more we play the game the broader becomes the confidence interval. In order to avoid this indeterminacy we organise in a better way the knowledge, introducing the notion of virtual game to achieve the goal efficiently. Then we extend the one-player to a team mode game. Namely, we improve the success of a team by redistributing the resources among the players and exploiting their mutual cooperation to treat the indeterminacy phenomenon suitably.

Cooperative games in a stochastic environment / B. Apolloni, S. Bassis, S. Gaito, D. Malchiodi (LECTURE NOTES IN COMPUTER SCIENCE). - In: Neural nets : 14th Italian Workshop on Neural Nets, WIRN VIETRI 2003, Vietri sul Mare, Italy, June 4 - 7, 2003; proceedings / [a cura di] B. Apolloni, M. Marinaro and R. Tagliaferri. - Berlin : Springer, 2003. - ISBN 978-3-540-20227-1. - pp. 25-34 (( Intervento presentato al 14. convegno 14th Italian Workshop on Neural Nets, WIRN 2003 tenutosi a Vietri sul mare nel 2003 [10.1007/978-3-540-45216-4_2].

Cooperative games in a stochastic environment

B. Apolloni^Primo;S. Bassis^Secondo;S. Gaito^Penultimo;D. Malchiodi^Ultimo

2003

Abstract

We introduce a very complex game based on an approximate solution of a NP-hard problem, so that the probability of victory grows monotonically, but of an unknown amount, with the resources each player employs. We formulate this model in the computational learning framework and focus on the problem of computing a confidence interval for the losing probability. We deal with the problem of reducing the width of this interval under a given threshold in both batch and on-line modality. While the former leads to a feasible polynomial complexity, the on-line learning strategy may get stuck in an indeterminacy trap: the more we play the game the broader becomes the confidence interval. In order to avoid this indeterminacy we organise in a better way the knowledge, introducing the notion of virtual game to achieve the goal efficiently. Then we extend the one-player to a team mode game. Namely, we improve the success of a team by redistributing the resources among the players and exploiting their mutual cooperation to treat the indeterminacy phenomenon suitably.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
			Knapsack Problem; Cooperative Game; Algorithmic Inference; Stochastic Environment; Team Game
		
	Settori scientifico-disciplinari del contributo
	
			Settore INF/01 - Informatica
		
	Data di pubblicazione
	
			2003
		
	DOI
	
			https://dx.doi.org/10.1007/978-3-540-45216-4_2
		
	Tipologia
	
			Book Part (author)
		
	Appare nelle tipologie:
	
			03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
978-3-540-45216-4_2.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 229.12 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	229.12 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/56220

Citazioni

ND

3

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca