Pottery classification based on chemical composition characterization through non-invasive analytical techniques is a well-known method typically adopted to solve the problem of pottery provenance attribution. Machine learning approaches have been recently introduced as a tool to develop models for archaeological classification directly inferred from data. This classification is quite often given in terms of local or non-local samples: in these cases, if the hypothesized provenance is to be validated, one-class models could be more adequate than binary or multi-class predictors. Indeed, one-class classifiers are trained only using positive examples of the class to be learned, and they can be subsequently tested on positive and negative examples in order to evaluate their generalization capability. They are thus in principle more apt to efficiently classify local samples, as the non-local ones do not naturally gather in a well-defined class. In this paper, we tested a one-class classifier on a dataset of 112 examples representing pottery fragments described in terms of nine chemical elements. Different examples of the dataset can correspond to measurements done on a same physical fragment, thus the hypothesis of independence among observations might be violated. For this reason, we investigated the use of three data stratification techniques, based on physical fragments and on measures. We employed the support vector one-class classification algorithm, finding the smallest sphere in a feature space that contains most of the training points. The obtained classification performances were compared with those of several machine learning algorithms for binary classification. All the models were trained by a nested cross validation technique, separately taking into account the fine-tuning of hyperparameters and the robust estimation of generalization performance. Comparisons were done on the same dataset according to different performance metrics. The obtained results show that one-class classification attains a similar sensitivity of binary classification approaches, meanwhile improving the performance in terms of specificity, therefore showing a good behavior both on positive and negative examples.

One-class vs binary machine learning classification of ceramic samples described by chemical element concentrations / D. Malchiodi, A.M. Zanaboni, A. Di Gioacchino, L. Bonizzoni. - In: JOURNAL OF CULTURAL HERITAGE. - ISSN 1296-2074. - 71:(2025 Feb), pp. 234-241. [10.1016/j.culher.2024.11.015]

One-class vs binary machine learning classification of ceramic samples described by chemical element concentrations

D. Malchiodi
Primo
;
A.M. Zanaboni
Secondo
;
L. Bonizzoni
Ultimo
2025

Abstract

Pottery classification based on chemical composition characterization through non-invasive analytical techniques is a well-known method typically adopted to solve the problem of pottery provenance attribution. Machine learning approaches have been recently introduced as a tool to develop models for archaeological classification directly inferred from data. This classification is quite often given in terms of local or non-local samples: in these cases, if the hypothesized provenance is to be validated, one-class models could be more adequate than binary or multi-class predictors. Indeed, one-class classifiers are trained only using positive examples of the class to be learned, and they can be subsequently tested on positive and negative examples in order to evaluate their generalization capability. They are thus in principle more apt to efficiently classify local samples, as the non-local ones do not naturally gather in a well-defined class. In this paper, we tested a one-class classifier on a dataset of 112 examples representing pottery fragments described in terms of nine chemical elements. Different examples of the dataset can correspond to measurements done on a same physical fragment, thus the hypothesis of independence among observations might be violated. For this reason, we investigated the use of three data stratification techniques, based on physical fragments and on measures. We employed the support vector one-class classification algorithm, finding the smallest sphere in a feature space that contains most of the training points. The obtained classification performances were compared with those of several machine learning algorithms for binary classification. All the models were trained by a nested cross validation technique, separately taking into account the fine-tuning of hyperparameters and the robust estimation of generalization performance. Comparisons were done on the same dataset according to different performance metrics. The obtained results show that one-class classification attains a similar sensitivity of binary classification approaches, meanwhile improving the performance in terms of specificity, therefore showing a good behavior both on positive and negative examples.
Machine learning; One-class classification; Ancient pottery classification
Settore INFO-01/A - Informatica
Settore PHYS-06/A - Fisica per le scienze della vita, l'ambiente e i beni culturali
feb-2025
19-dic-2024
Article (author)
File in questo prodotto:
File Dimensione Formato  
jch-zanaboni.pdf

accesso aperto

Descrizione: Article
Tipologia: Publisher's version/PDF
Dimensione 1.03 MB
Formato Adobe PDF
1.03 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1125838
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact