Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.
Optimal design subsampling from Big Datasets / L. Deldossi, C. Tommasi. - In: JOURNAL OF QUALITY TECHNOLOGY. - ISSN 0022-4065. - (2021), pp. 1-25. [Epub ahead of print] [10.1080/00224065.2021.1889418]
Optimal design subsampling from Big Datasets
C. TommasiUltimo
2021
Abstract
Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.File | Dimensione | Formato | |
---|---|---|---|
Optimal design subsampling from Big Datasets.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
1.11 MB
Formato
Adobe PDF
|
1.11 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
pre-print.pdf
accesso aperto
Descrizione: Online first
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
541.33 kB
Formato
Adobe PDF
|
541.33 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.