The existence of multiple solutions in clustering, and in hierarchical clustering in particular, is often ignored in practical applications. However, this is a non-trivial problem, as different data orderings can result in different cluster sets that, in turns, may lead to different interpretations of the same data. The method presented here offers a solution to this issue. It is based on the definition of an equivalence relation over dendrograms that allows developing all and only the significantly different dendrograms for the same dataset, thus reducing the computational complexity to polynomial from the exponential obtained when all possible dendrograms are considered. Experimental results in the neuroimaging and bioinformatics domains show the effectiveness of the proposed method.
A novel approach to the problem of non-uniqueness of the solution in hierarchical clustering / I. Cattinelli, G. Valentini, E. Paulesu, N.A. Borghese. - In: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS. - ISSN 2162-237X. - 24:7(2013 Jul), pp. 6497531.1166-6497531.1173.
Titolo: | A novel approach to the problem of non-uniqueness of the solution in hierarchical clustering |
Autori: | CATTINELLI, ISABELLA (Primo) VALENTINI, GIORGIO (Secondo) BORGHESE, NUNZIO ALBERTO (Ultimo) |
Parole Chiave: | Bioinformatics; dendrogram equivalence relation; hierarchical clustering (HC); neuroimaging |
Settore Scientifico Disciplinare: | Settore INF/01 - Informatica Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni |
Data di pubblicazione: | lug-2013 |
Rivista: | |
Tipologia: | Article (author) |
Digital Object Identifier (DOI): | http://dx.doi.org/10.1109/TNNLS.2013.2247058 |
Appare nelle tipologie: | 01 - Articolo su periodico |