In a very stimulating paper, Preece gives an artificial dataset useful to illustrate the hazard of multiple regression and challenges the reader to spot the simple inbuilt features of these data. The present note aims at finding how Preece generated the whole set of data. First of all OLS regression model is fitted to the data; after checking for model assumptions some doubts arise on the validity of OLS regression; thus robust regression estimators are considered as a proper alternative. The latter give discordant coefficient estimates, but after a deep analysis, they agree in highlighting the presence of two subsets within the dataset: 9 cases being generated by one model, and the remaining 8 cases being generated by a second model. This particular pattern of the data is recognized by the mixture model as well.

A reaction to a challenging example in multiple regression analysis / E. Marubini, A. Orenti. - In: STATISTICA APPLICATA. - ISSN 2038-5587. - 29:1(2017 Jul), pp. 95-106.

A reaction to a challenging example in multiple regression analysis

E. Marubini
Primo
;
A. Orenti
Ultimo
2017

Abstract

In a very stimulating paper, Preece gives an artificial dataset useful to illustrate the hazard of multiple regression and challenges the reader to spot the simple inbuilt features of these data. The present note aims at finding how Preece generated the whole set of data. First of all OLS regression model is fitted to the data; after checking for model assumptions some doubts arise on the validity of OLS regression; thus robust regression estimators are considered as a proper alternative. The latter give discordant coefficient estimates, but after a deep analysis, they agree in highlighting the presence of two subsets within the dataset: 9 cases being generated by one model, and the remaining 8 cases being generated by a second model. This particular pattern of the data is recognized by the mixture model as well.
linear regression analysis; regression illustrative example; hazards in regression analysis; robust regression, mixture models
Settore SECS-S/01 - Statistica
lug-2017
http://sa-ijas.stat.unipd.it/sites/sa-ijas.stat.unipd.it/files/MARUBINI-ORENTI_0.pdf
Article (author)
File in questo prodotto:
File Dimensione Formato  
A reaction to a challenging example in multiple regression analysis.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 257.83 kB
Formato Adobe PDF
257.83 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/519669
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact