In the era of Big Data, several design based subsampling methods are proposed to reduce costs (and time) and to help in informed decision making. Most of these approaches require the specification of a model. A wrong model assumption and/or the possible presence of outliers represent a limitation for the most commonly applied subsampling criteria. Through a simulation study, we explore if a subsampling method, originally introduced by [1] to avoid outliers, works well to account for model uncertainty and, on the other side, if the subsampling approach introduced by [2] to account for model misspecification, is robust to the presence of outliers.

Optimal Subsampling from Big Datasets in Presence of Misspecification / L. Deldossi, C. Tommasi (ITALIAN STATISTICAL SOCIETY SERIES ON ADVANCES IN STATISTICS). - In: Methodological and Applied Statistics and Demography II / [a cura di] A. Pollice, P. Mariani. - Prima edizione. - [s.l] : Springer, 2025. - ISBN 978-3-031-64350-7. - pp. 458-464 (( 52. SIS2024 Bari 2024 [10.1007/978-3-031-64350-7_77].

Optimal Subsampling from Big Datasets in Presence of Misspecification

C. Tommasi
2025

Abstract

In the era of Big Data, several design based subsampling methods are proposed to reduce costs (and time) and to help in informed decision making. Most of these approaches require the specification of a model. A wrong model assumption and/or the possible presence of outliers represent a limitation for the most commonly applied subsampling criteria. Through a simulation study, we explore if a subsampling method, originally introduced by [1] to avoid outliers, works well to account for model uncertainty and, on the other side, if the subsampling approach introduced by [2] to account for model misspecification, is robust to the presence of outliers.
D-optimality; model misspecification; outliers; subsampling
Settore STAT-01/A - Statistica
   Optimal and adaptive designs for modern medical experimentation
   MINISTERO DELL'UNIVERSITA' E DELLA RICERCA
   2022TRB44L_002
2025
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Deldossi-Tommasi.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 154.73 kB
Formato Adobe PDF
154.73 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1202695
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact