Three important issues are often encountered in Supervised Classifica- tion: class-memberships are unreliable for some training units (Label Noise), a pro- portion of observations might depart from the bulk of the data structure (Outliers) and groups represented in the test set may have not been encountered earlier in the learn- ing phase (Unobserved Classes). The present work introduces a Robust and Adaptive Eigenvalue-Decomposition Discriminant Analysis (RAEDDA) capable of handling situations in which one or more of the afore described problems occur. Transductive and inductive robust EM-based procedures are proposed for parameter estimation and experiments on real data, artificially adulterated, are provided to underline the benefits of the proposed method.

Supervised learning in presence of outliers, label noise and unobserved classes / A. Cappozzo, F. Greselin, B. Murphy - In: Cladag2019 : Book of short papers[s.l] : Centro Editoriale di Ateneo Università di Cassino e del Lazio Meridionale, 2019. - ISBN 978-88-8317-108-6. - pp. 104-107 (( Intervento presentato al 12. convegno Scientific Meeting Classification and Data Analysis Group tenutosi a Cassino nel 2019.

Supervised learning in presence of outliers, label noise and unobserved classes

A. Cappozzo;
2019

Abstract

Three important issues are often encountered in Supervised Classifica- tion: class-memberships are unreliable for some training units (Label Noise), a pro- portion of observations might depart from the bulk of the data structure (Outliers) and groups represented in the test set may have not been encountered earlier in the learn- ing phase (Unobserved Classes). The present work introduces a Robust and Adaptive Eigenvalue-Decomposition Discriminant Analysis (RAEDDA) capable of handling situations in which one or more of the afore described problems occur. Transductive and inductive robust EM-based procedures are proposed for parameter estimation and experiments on real data, artificially adulterated, are provided to underline the benefits of the proposed method.
model-based classification; unobserved classes; label noise; outliers detection; impartial trimming; robust estimation
Settore SECS-S/01 - Statistica
2019
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
cappozzo_greselin_murphy_cladag2019.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 110.72 kB
Formato Adobe PDF
110.72 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1039367
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact