Given a sample of unlabeled observations, the goal of a novelty detec- tion method is to identify which units substantially deviate from the observed la- beled patterns. Therefore, in a model-based framework, it is firstly of paramount importance to learn the components that correspond to the manifest groups in the training set. Secondly, one needs to take into account the lack of knowledge regard- ing the statistical novelties. Thirdly, contaminated elements in the known classes could greatly jeopardize the identification of new groups. Motivated by these chal- lenges, we propose a two-stage Bayesian non-parametric novelty detector. At stage one, robust estimates are extracted from the training set and, subsequently, such in- formation is employed to elicit informative priors within a flexible semiparametric mixture. This general paradigm can be easily adapted to complex modeling frame- works: we provide here an application to functional data from a food authenticity study.

Outlier and novelty detection for Functional data: a semiparametric Bayesian approach / F. Denti, A. Cappozzo, F. Greselin - In: Models and Learning for Clustering and Classification / [a cura di] S. Ingrassia, A. Punzo, R. Rocci. - [s.l] : Ledizioni, 2021. - ISBN 9788855265393. - pp. 33-38 (( Intervento presentato al 5. convegno International workshop on Models and Learning for Clustering and Classification tenutosi a Catania nel 2020.

Outlier and novelty detection for Functional data: a semiparametric Bayesian approach

A. Cappozzo;
2021

Abstract

Given a sample of unlabeled observations, the goal of a novelty detec- tion method is to identify which units substantially deviate from the observed la- beled patterns. Therefore, in a model-based framework, it is firstly of paramount importance to learn the components that correspond to the manifest groups in the training set. Secondly, one needs to take into account the lack of knowledge regard- ing the statistical novelties. Thirdly, contaminated elements in the known classes could greatly jeopardize the identification of new groups. Motivated by these chal- lenges, we propose a two-stage Bayesian non-parametric novelty detector. At stage one, robust estimates are extracted from the training set and, subsequently, such in- formation is employed to elicit informative priors within a flexible semiparametric mixture. This general paradigm can be easily adapted to complex modeling frame- works: we provide here an application to functional data from a food authenticity study.
Bayesian mixture model; Dirichlet Process Mixture Model; Functional data; Minimum Regularized Covariance Determinant
Settore SECS-S/01 - Statistica
2021
https://zenodo.org/records/5598945
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Book-Short-Papers-MBC2-2020.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.77 MB
Formato Adobe PDF
1.77 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1039291
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact