Algorithmic-type imputation techniques with different data structures: alternative approaches in comparison / N. Solaro, A. Barbiero, G. Manzi, P.A. Ferrari - In: Analysis and modeling of complex data in behavioral and social sciences / edited by D. Vicari, A. Okada, G. Ragozini, C. Weihs. - Zurich : Springer, 2014. - ISBN 978-3-319-06692-9. - pp. 253-261
Algorithmic-type imputation techniques with different data structures: alternative approaches in comparison
N. Solaro; A. Barbiero; G. Manzi; P.A. Ferrari
2014
Abstract
In recent years, with the widespread availability of large datasets from multiple sources, increasing attention has been devoted to the treatment of missing information. Recent approaches have paved the way for the development of powerful new algorithmic techniques, in which imputation is performed through computer-intensive procedures. Although most of these approaches are attractive for many reasons, less attention has been paid to the problem of which method should be preferred according to the data structure at hand. This work addresses the problem by comparing two methods, missForest and IPCA, with a new method we developed within the forward imputation approach. We carried out comparisons by considering different data patterns with varying skewness and correlation of variables, in order to ascertain in which situations a given method produces more satisfactory results.