IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

We present a procedure for the optimal implementation of public policies that involve predicting an individual behavior or characteristic. By linking prediction errors of any given classification model to the resulting social welfare, we provide a simple measure to rank different models and select the optimal one. Such measure is defined as the difference between the social welfare of a given policy and that of an error-free policy, and it is related to the ROC curve employed in the Machine Learning literature. We extend the cost isometrics approach described in the literature by considering the case of heterogeneous costs of type I and II errors. We apply our approach to the prediction of inaccurate tax returns issued by Italian self-employed and sole proprietorships. We show that the approach can result in substantial increases in revenues, and that random forest models, beyond providing comparatively good predictions, yield important insights. In our case, they both provide empirical support for existing theories on tax evasion — highlighting, for instance, cross-sectoral heterogeneity — and extend our understanding of the phenomenon — such as the role of bunching.

Machine learning and the optimization of prediction-based policies / P. Battiston, S. Gamba, A. Santoro. - In: TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE. - ISSN 0040-1625. - 199:(2024 Feb), pp. 123080.1-123080.16. [10.1016/j.techfore.2023.123080]

Machine learning and the optimization of prediction-based policies

Battiston P.;S. Gamba^Secondo;Santoro A.

2024

Abstract

We present a procedure for the optimal implementation of public policies that involve predicting an individual behavior or characteristic. By linking prediction errors of any given classification model to the resulting social welfare, we provide a simple measure to rank different models and select the optimal one. Such measure is defined as the difference between the social welfare of a given policy and that of an error-free policy, and it is related to the ROC curve employed in the Machine Learning literature. We extend the cost isometrics approach described in the literature by considering the case of heterogeneous costs of type I and II errors. We apply our approach to the prediction of inaccurate tax returns issued by Italian self-employed and sole proprietorships. We show that the approach can result in substantial increases in revenues, and that random forest models, beyond providing comparatively good predictions, yield important insights. In our case, they both provide empirical support for existing theories on tax evasion — highlighting, for instance, cross-sectoral heterogeneity — and extend our understanding of the phenomenon — such as the role of bunching.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Machine learning; Prediction; Public policy; ROC curve; Tax behavior
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore SECS-P/03 - Scienza delle Finanze
			
	Settori scientifico-disciplinari dell'articolo (validi dal 09/05/2024)
	
				Settore ECON-03/A - Scienza delle finanze
			
	Data di pubblicazione
	
				feb-2024
			
	Data ahead of print o data di stampa
	
				14-dic-2023
			
	Rivista in ANCE
	
				TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE
			
	DOI
	
				https://dx.doi.org/10.1016/j.techfore.2023.123080
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
2024_Technological forecasting_machine learning.pdf accesso aperto Descrizione: Article Tipologia: Publisher's version/PDF Dimensione 1.09 MB Formato Adobe PDF Visualizza/Apri	1.09 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1024552

Citazioni

ND

17

11

ND

social impact