Data structure has a dramatic impact on the properties of neural networks, yet its significance in the established theoretical frameworks is poorly understood. Here we compute the Vapnik-Chervonenkis entropy of a kernel machine operating on data grouped into equally labeled subsets. At variance with the unstructured scenario, entropy is nonmonotonic in the size of the training set, and displays an additional critical point besides the storage capacity. Remarkably, the same behavior occurs in margin classifiers even with randomly labeled data, as is elucidated by identifying the synaptic volume encoding the transition. These findings reveal aspects of expressivity lying beyond the condensed description provided by the storage capacity, and they indicate the path towards more realistic bounds for the generalization error of neural networks.

Beyond the Storage Capacity: Data-Driven Satisfiability Transition / P. Rotondo, M. Pastore, M. Gherardi. - In: PHYSICAL REVIEW LETTERS. - ISSN 0031-9007. - 125:12(2020 Sep 14). [10.1103/PhysRevLett.125.120601]

Beyond the Storage Capacity: Data-Driven Satisfiability Transition

P. Rotondo
Primo
;
M. Pastore
Secondo
;
M. Gherardi
Ultimo
2020

Abstract

Data structure has a dramatic impact on the properties of neural networks, yet its significance in the established theoretical frameworks is poorly understood. Here we compute the Vapnik-Chervonenkis entropy of a kernel machine operating on data grouped into equally labeled subsets. At variance with the unstructured scenario, entropy is nonmonotonic in the size of the training set, and displays an additional critical point besides the storage capacity. Remarkably, the same behavior occurs in margin classifiers even with randomly labeled data, as is elucidated by identifying the synaptic volume encoding the transition. These findings reveal aspects of expressivity lying beyond the condensed description provided by the storage capacity, and they indicate the path towards more realistic bounds for the generalization error of neural networks.
Settore FIS/02 - Fisica Teorica, Modelli e Metodi Matematici
14-set-2020
2020
Article (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/776164
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 11
social impact