A recurring problem in multivariate data analysis (MVDA), potentially sparing no field of application, is the treatment of incomplete information. The subject is vast and complex, and has originated a literature rich of very different approaches. In an exploratory framework distance-based methods and procedures involving MVDA techniques can treat the problem properly. The nearest-neighbour imputation (NNI) method is distancebased in that detects sets of “donors” for incomplete units on the basis of their mutual nearness measured by a specific metric. MVDA techniques, such as PCA, through an iterative minimization of a loss-function, can recover values for incomplete units taking into account associations between variables. Both approaches have attractive features. In NNI, the metric and the number of donors can be chosen at will. The MVDA-based approach expressly accounts for variable associations. The approach here proposed, called forward imputation, ideally meets these features. It is developed as a distance-based approach that imputes missing values sequentially by alternating a MVDA technique and the NNI method. The MVDA technique could be any. Given the wide range of possibilities, attention here is confined to PCA. Comparisons with alternative imputation methods are then performed in presence of different data patterns

A sequential distance-based approach for imputing missing data : the forward imputation / N. Solaro, A. Barbiero, G. Manzi, P.A. Ferrari - In: 6th CSDA International conference on Computational and financial econometrics and 5th International conference of the ERCIM Working group on Computing & Statistics programme and abstract bookOviedo : ERCIM, 2012. - pp. 118-118 (( convegno 6th CSDA international conference on Computational and financial econometrics and 5th international conference of the ERCIM Working group on Computing & Statistics tenutosi a Oviedo nel 2012.

A sequential distance-based approach for imputing missing data : the forward imputation

A. Barbiero
Secondo
;
G. Manzi
Penultimo
;
P.A. Ferrari
Ultimo
2012

Abstract

A recurring problem in multivariate data analysis (MVDA), potentially sparing no field of application, is the treatment of incomplete information. The subject is vast and complex, and has originated a literature rich of very different approaches. In an exploratory framework distance-based methods and procedures involving MVDA techniques can treat the problem properly. The nearest-neighbour imputation (NNI) method is distancebased in that detects sets of “donors” for incomplete units on the basis of their mutual nearness measured by a specific metric. MVDA techniques, such as PCA, through an iterative minimization of a loss-function, can recover values for incomplete units taking into account associations between variables. Both approaches have attractive features. In NNI, the metric and the number of donors can be chosen at will. The MVDA-based approach expressly accounts for variable associations. The approach here proposed, called forward imputation, ideally meets these features. It is developed as a distance-based approach that imputes missing values sequentially by alternating a MVDA technique and the NNI method. The MVDA technique could be any. Given the wide range of possibilities, attention here is confined to PCA. Comparisons with alternative imputation methods are then performed in presence of different data patterns
Settore SECS-S/01 - Statistica
2012
Queen Mary university of London
Universidad de Oviedo
http://www.cfe-csda.org/cfe12/BoA.pdf
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
BoA.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 3.06 MB
Formato Adobe PDF
3.06 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/212942
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact