The problem of missing data in building multidimensional composite indicators is a delicate problem which is often underrated. An imputation method particularly suitable for categorical data is proposed. This method is discussed in detail in the framework of nonlinear principal component analysis and compared to other missing data treatments which are commonly used in this analysis. Its performance vs. these other methods is evaluated throughout a simulation procedure performed on both an artificial case, varying the experimental conditions, and a real case. The proposed procedure is implemented using R.
An imputation method for categorical variables with application to nonlinear principal component analysis / P.A. Ferrari, P. Annoni, A. Barbiero, G. Manzi. - In: COMPUTATIONAL STATISTICS & DATA ANALYSIS. - ISSN 0167-9473. - 55:7(2011 Jul), pp. 2410-2420. [10.1016/j.csda.2011.02.007]
An imputation method for categorical variables with application to nonlinear principal component analysis
P.A. FerrariPrimo
;A. BarbieroPenultimo
;G. ManziUltimo
2011
Abstract
The problem of missing data in building multidimensional composite indicators is a delicate problem which is often underrated. An imputation method particularly suitable for categorical data is proposed. This method is discussed in detail in the framework of nonlinear principal component analysis and compared to other missing data treatments which are commonly used in this analysis. Its performance vs. these other methods is evaluated throughout a simulation procedure performed on both an artificial case, varying the experimental conditions, and a real case. The proposed procedure is implemented using R.File | Dimensione | Formato | |
---|---|---|---|
11_paper_CSDA.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
951.65 kB
Formato
Adobe PDF
|
951.65 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.