Identifying the minimal number of parameters needed to describe a dataset is a challenging problem known in the literature as intrinsic dimension estimation. All the existing intrinsic dimension estimators are not reliable whenever the dataset is locally undersampled, and this is at the core of the so called curse of dimensionality. Here we introduce a new intrinsic dimension estimator that leverages on simple properties of the tangent space of a manifold and extends the usual correlation integral estimator to alleviate the extreme undersampling problem. Based on this insight, we explore a multiscale generalization of the algorithm that is capable of (i) identifying multiple dimensionalities in a dataset, and (ii) providing accurate estimates of the intrinsic dimension of extremely curved manifolds. We test the method on manifolds generated from global transformations of high-contrast images, relevant for invariant object recognition and considered a challenge for state-of-the-art intrinsic dimension estimators.

Intrinsic dimension estimation for locally undersampled data / V. Erba, M. Gherardi, P. Rotondo. - In: SCIENTIFIC REPORTS. - ISSN 2045-2322. - 9:1(2019 Nov 20), pp. 17133.1-17133.9. [10.1038/s41598-019-53549-9]

Intrinsic dimension estimation for locally undersampled data

V. Erba
;
M. Gherardi;P. Rotondo
2019

Abstract

Identifying the minimal number of parameters needed to describe a dataset is a challenging problem known in the literature as intrinsic dimension estimation. All the existing intrinsic dimension estimators are not reliable whenever the dataset is locally undersampled, and this is at the core of the so called curse of dimensionality. Here we introduce a new intrinsic dimension estimator that leverages on simple properties of the tangent space of a manifold and extends the usual correlation integral estimator to alleviate the extreme undersampling problem. Based on this insight, we explore a multiscale generalization of the algorithm that is capable of (i) identifying multiple dimensionalities in a dataset, and (ii) providing accurate estimates of the intrinsic dimension of extremely curved manifolds. We test the method on manifolds generated from global transformations of high-contrast images, relevant for invariant object recognition and considered a challenge for state-of-the-art intrinsic dimension estimators.
Settore FIS/02 - Fisica Teorica, Modelli e Metodi Matematici
20-nov-2019
Article (author)
File in questo prodotto:
File Dimensione Formato  
s41598-019-53549-9.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.58 MB
Formato Adobe PDF
1.58 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/692125
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 18
social impact