Given n points in ℝd and a maximum allowed tolerance ε >0, the minimum hyperplanes clustering problem consists in finding a minimum number of hyperplanes such that the Euclidean distance between each point and the nearest hyperplane is at most .. We present a column generation approach for this problem based on a mixed integer nonlinear formulation in which the master is a set covering problem and the pricing subproblem is a mixed integer program with a nonconvex normalization constraint. We propose different ways of generating the initial pool of columns and investigate their impact on the overall algorithm. Since the pricing subproblem is substantially complicated by the ℓ2-norm constraint, we consider approximate pricing subproblems involving different norms. Some strategies for refining the solution and speeding-up the overall method are also discussed. The performance of our column generation algorithm is assessed on realistic randomly generated instances as well as on real-world instances.
Column generation for the minimum hyperplanes clustering problem / E. Amaldi, K. Dhyani, A. Ceselli. - In: INFORMS JOURNAL ON COMPUTING. - ISSN 1091-9856. - 25:3(2013), pp. 446-460. [10.1287/ijoc.1120.0513]
Column generation for the minimum hyperplanes clustering problem
A. CeselliUltimo
2013
Abstract
Given n points in ℝd and a maximum allowed tolerance ε >0, the minimum hyperplanes clustering problem consists in finding a minimum number of hyperplanes such that the Euclidean distance between each point and the nearest hyperplane is at most .. We present a column generation approach for this problem based on a mixed integer nonlinear formulation in which the master is a set covering problem and the pricing subproblem is a mixed integer program with a nonconvex normalization constraint. We propose different ways of generating the initial pool of columns and investigate their impact on the overall algorithm. Since the pricing subproblem is substantially complicated by the ℓ2-norm constraint, we consider approximate pricing subproblems involving different norms. Some strategies for refining the solution and speeding-up the overall method are also discussed. The performance of our column generation algorithm is assessed on realistic randomly generated instances as well as on real-world instances.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.