The Hat (H) matrix and in particular the elements of its principal diagonal (leverages) have a paramount importance in multiple regression analysis in order to pinpoint possible outliers and/or influential points as components of several regression diagnostics. This note presents some features of the H matrix and residuals for ANOVA models of experimental designs. For fixed effects models, the values of the elements of H are discussed in completely randomized, randomized complete block and Latin squares designs. The increasing complexity of the design structure leads to different patterns, with increasing values of the corresponding leverages (hii). For mixed effects models, developments on leverage and residuals for marginal and conditional estimates are illustrated. The application of H matrix and residuals in fixed effects and mixed effects model is shown in a worked example. It is concluded that for H matrix in mixed models, an important role is played by the values of the variances of the random effects and the error term, and, consequently, by their method of estimation. Marginal and conditional studentized residuals provide different information about the data, and thus should be both used for model checking.

Pinpointing outliers in experimental data: the Hat matrix in Anova for fixed and mixed effects models / A. Orenti, G. Marano, P. Boracchi, E. Marubini. - In: ITALIAN JOURNAL OF PUBLIC HEALTH. - ISSN 1723-7807. - 9:4(2012), pp. e8663.1-e8663.13. [10.2427/8663]

Pinpointing outliers in experimental data: the Hat matrix in Anova for fixed and mixed effects models

A. Orenti
Primo
;
G. Marano
Secondo
;
P. Boracchi
Penultimo
;
E. Marubini
Ultimo
2012

Abstract

The Hat (H) matrix and in particular the elements of its principal diagonal (leverages) have a paramount importance in multiple regression analysis in order to pinpoint possible outliers and/or influential points as components of several regression diagnostics. This note presents some features of the H matrix and residuals for ANOVA models of experimental designs. For fixed effects models, the values of the elements of H are discussed in completely randomized, randomized complete block and Latin squares designs. The increasing complexity of the design structure leads to different patterns, with increasing values of the corresponding leverages (hii). For mixed effects models, developments on leverage and residuals for marginal and conditional estimates are illustrated. The application of H matrix and residuals in fixed effects and mixed effects model is shown in a worked example. It is concluded that for H matrix in mixed models, an important role is played by the values of the variances of the random effects and the error term, and, consequently, by their method of estimation. Marginal and conditional studentized residuals provide different information about the data, and thus should be both used for model checking.
Anova; Hat matrix; Mixed effects model; Outliers
Settore MED/01 - Statistica Medica
2012
Article (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/217608
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact