In recent years, with the spread availability of large datasets from multiple sources, increasing attention has been devoted to the treatment of missing information. Recent approaches have paved the way to the development of new powerful algorithmic techniques, in which imputation is performed through computer-intensive procedures. Although most of these approaches are attractive for many reasons, less attention has been paid to the problem of which method should be preferred according to the data structure at hand. This work addresses the problem by comparing the two methods missForest and IPCA with a new method we developed within the forward imputation approach. We carried out comparisons by considering different data patterns with varying skewness and correlation of variables, in order to ascertain in which situations a given method produces more satisfying results

Algorithmic-type imputation techniques with different data structures : alternative approaches in comparison / N. Solaro, A. Barbiero, G. Manzi, P.A. Ferrari - In: Analysis and modeling of complex data in behavioral and social sciences / [a cura di] D. Vicari, A. Okada, G. Ragozini, C. Wehis. - Zurich : Springer, 2014. - ISBN 978-3-319-06692-9. - pp. 253-261

Algorithmic-type imputation techniques with different data structures : alternative approaches in comparison

A. Barbiero
Secondo
;
G. Manzi
Penultimo
;
P.A. Ferrari
Ultimo
2014

Abstract

In recent years, with the spread availability of large datasets from multiple sources, increasing attention has been devoted to the treatment of missing information. Recent approaches have paved the way to the development of new powerful algorithmic techniques, in which imputation is performed through computer-intensive procedures. Although most of these approaches are attractive for many reasons, less attention has been paid to the problem of which method should be preferred according to the data structure at hand. This work addresses the problem by comparing the two methods missForest and IPCA with a new method we developed within the forward imputation approach. We carried out comparisons by considering different data patterns with varying skewness and correlation of variables, in order to ascertain in which situations a given method produces more satisfying results
Forward imputation ; Iterative PCA ; missForest ; Missing data
Settore SECS-S/01 - Statistica
2014
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Solaro et al 2014.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 226.19 kB
Formato Adobe PDF
226.19 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/237480
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
social impact