This paper proposes a novel two-class classifier, called IPCAC, based on the Isotropic Principal Component Analysis technique; it allows to deal with training data drawn from Mixture of Gaussian distributions, by projecting the data on the Fisher subspace that separates the two classes. The obtained results demonstrate that IPCAC is a promising technique; furthermore, to cope with training datasets being dynamically supplied, and to work with non-linearly separable classes, two improvements of this classifier are defined: a model merging algorithm, and a kernel version of IPCAC. The effectiveness of the proposed methods is shown by their application to the spam classification problem, and by the comparison of the achieved results with those obtained by Support Vector Machines SVM, and K-Nearest Neighbors KNN.

Novel IPCA-Based Classifiers and Their Application to Spam Filtering / A. Rozza, G. Lombardi, E. Casiraghi - In: Proceedings of the ninth international conference on intelligent systems design and applications : 30 November-2 December 2009, Pisa, ItalyLos Alamitos : IEEE Computer Society, 2009. - ISBN 9781424447350. - pp. 792-802 (( Intervento presentato al 9th. convegno International Conference on Intelligent Systems Design and Applications tenutosi a Pisa. Italy nel 2009 [10.1109/ISDA.2009.21].

Novel IPCA-Based Classifiers and Their Application to Spam Filtering

A. Rozza;G. Lombardi;E. Casiraghi
2009

Abstract

This paper proposes a novel two-class classifier, called IPCAC, based on the Isotropic Principal Component Analysis technique; it allows to deal with training data drawn from Mixture of Gaussian distributions, by projecting the data on the Fisher subspace that separates the two classes. The obtained results demonstrate that IPCAC is a promising technique; furthermore, to cope with training datasets being dynamically supplied, and to work with non-linearly separable classes, two improvements of this classifier are defined: a model merging algorithm, and a kernel version of IPCAC. The effectiveness of the proposed methods is shown by their application to the spam classification problem, and by the comparison of the achieved results with those obtained by Support Vector Machines SVM, and K-Nearest Neighbors KNN.
Classification ; Isotropic PCA ; Kernel methods ; Model-Merging
Settore INF/01 - Informatica
2009
Book Part (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/141707
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 7
social impact