IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

Imputation of missing data has always represented a problem for researchers from every field. A biased imputation may strongly affect research findings, leading to wrong conclusions. Furthermore, an imputation method may be satisfactory with certain types of data, but unsatisfactory with others. Recently, an imputation method based on the recursive use of nonlinear principal component analysis has been proposed for the specific case of constructing a composite indicator from ordinal data. This method alternates the nonlinear principal component analysis with the nearest neighbour method in order to detect the most appropriate donor for the imputation of missing categories of incomplete objects. In this work, we present an extension of this imputation method to the general case of constructing a composite indicator from mixed-type data, that is, data including both quantitative and qualitative variables. The problem of the evaluation of the most suitable distance between objects to be used in the nearest neighbour method, together with the choice of the set of the most appropriate donors, is considered. The proposed procedure is implemented using the R software. A simulation study is also implemented to test the relative performance of the proposed method when compared with other methods used in similar situations

An imputation method for mixed-type data using nonlinear principal component analysis / N. Solaro, A. Barbiero, G. Manzi, P.A. Ferrari - In: 5th CSDA international conference on Computational and Financial Econometrics and 4th international conference of the ERCIM Working Group on Computing & Statistics "Programme and abstract" bookLondon : ERCIM, 2011. - pp. 20-21 (( convegno 5th CSDA international conference on Computational and Financial Econometrics and 4th international conference of the ERCIM Working Group on Computing & Statistics tenutosi a London nel 2011.

An imputation method for mixed-type data using nonlinear principal component analysis

N. Solaro;A. Barbiero^Secondo;G. Manzi^Penultimo;P.A. Ferrari^Ultimo

2011

Abstract

Imputation of missing data has always represented a problem for researchers from every field. A biased imputation may strongly affect research findings, leading to wrong conclusions. Furthermore, an imputation method may be satisfactory with certain types of data, but unsatisfactory with others. Recently, an imputation method based on the recursive use of nonlinear principal component analysis has been proposed for the specific case of constructing a composite indicator from ordinal data. This method alternates the nonlinear principal component analysis with the nearest neighbour method in order to detect the most appropriate donor for the imputation of missing categories of incomplete objects. In this work, we present an extension of this imputation method to the general case of constructing a composite indicator from mixed-type data, that is, data including both quantitative and qualitative variables. The problem of the evaluation of the most suitable distance between objects to be used in the nearest neighbour method, together with the choice of the set of the most appropriate donors, is considered. The proposed procedure is implemented using the R software. A simulation study is also implemented to test the relative performance of the proposed method when compared with other methods used in similar situations

Scheda breve

Scheda completa

Scheda completa (DC)

	Settori scientifico-disciplinari del contributo (sola visualizzazione)
	
				Settore SECS-S/01 - Statistica
			
	Data di pubblicazione
	
				2011
			
	Enti collegati al convegno
	
				Queen Mary university of London
London school of Economics and Political science
Birkbeck university of London
			
	URL
	
				http://www.cfe-csda.org/ercim11/London2011BoA.pdf
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
London2011BoA.pdf accesso aperto Tipologia: Publisher's version/PDF Dimensione 1.44 MB Formato Adobe PDF Visualizza/Apri	1.44 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/194769

Citazioni

ND

ND

ND

ND

social impact