Feature Selection via Mutual Information: New Theoretical Insights

Beraha, M.; Metelli, A.M.; Papini, M.; Tirinzoni, A.; Restelli, M.

doi:10.1109/IJCNN.2019.8852410

Mutual information has been successfully adopted in filter feature-selection methods to assess both the relevancy of a subset of features in predicting the target variable and the redundancy with respect to other variables. However, existing algorithms are mostly heuristic and do not offer any guarantee on the proposed solution. In this paper, we provide novel theoretical results showing that conditional mutual information naturally arises when bounding the ideal regression/classification errors achieved by different subsets of features. Leveraging on these insights, we propose a novel stopping condition for backward and forward greedy methods which ensures that the ideal prediction error using the selected feature subset remains bounded by a user-specified threshold. We provide numerical simulations to support our theoretical claims and compare to common heuristic methods.

Feature Selection via Mutual Information: New Theoretical Insights / M. Beraha, A.M. Metelli, M. Papini, A. Tirinzoni, M. Restelli - In: International Joint Conference on Neural Networks[s.l] : IEEE, 2019. - ISBN 978-1-7281-1985-4. - pp. 1-9 (( International Joint Conference on Neural Networks, IJCNN 2019 Budapest 2019 [10.1109/IJCNN.2019.8852410].

Feature Selection via Mutual Information: New Theoretical Insights

Beraha M.;Metelli A. M.;M. Papini;Tirinzoni A.;Restelli M.

2019

Abstract

Mutual information has been successfully adopted in filter feature-selection methods to assess both the relevancy of a subset of features in predicting the target variable and the redundancy with respect to other variables. However, existing algorithms are mostly heuristic and do not offer any guarantee on the proposed solution. In this paper, we provide novel theoretical results showing that conditional mutual information naturally arises when bounding the ideal regression/classification errors achieved by different subsets of features. Leveraging on these insights, we propose a novel stopping condition for backward and forward greedy methods which ensures that the ideal prediction error using the selected feature subset remains bounded by a user-specified threshold. We provide numerical simulations to support our theoretical claims and compare to common heuristic methods.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				classification; feature selection; machine learning; mutual information; regression; supervised learning
			
	Settori scientifico-disciplinari del contributo (validi dal 09/05/2024)
	
				Settore IINF-05/A - Sistemi di elaborazione delle informazioni
Settore INFO-01/A - Informatica
			
	Data di pubblicazione
	
				2019
			
	DOI
	
				https://dx.doi.org/10.1109/IJCNN.2019.8852410
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
1907.07384v1.pdf accesso aperto Tipologia: Pre-print (manoscritto inviato all'editore) Licenza: Creative commons Dimensione 577.78 kB Formato Adobe PDF Visualizza/Apri	577.78 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1225939

Citazioni

ND

82

48

8

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca