MMKK++ algorithm for clustering heterogeneous images into an unknown number of clusters

Authors

  • Dávid Papp Budapest University of Technology and Economics
  • Gábor Szűcs Budapest University of Technology and Economics

Abstract

In this paper we present a suggested automatic clustering procedure with the main aim to predict the number of clusters of unknown, heterogeneous images. We used the state-of-the-art Fisher-vector for mathematical representation of the images and these vectors were considered as input data points for the clustering algorithm. We implemented a novel variant of K-means, the kernel K-means++, furthermore the min-max kernel K-means plusplus (MMKK++) as clustering method. The proposed approach examines some candidate cluster numbers and uses the law of large numbers in order to choose the optimal cluster size. We conducted experiments on four image sets to demonstrate the efficiency of our solution. The first two image sets are subsets of different popular collections; the third is their union; the fourth is the complete Caltech101 image set.

Keywords

image clustering, kernel K-means, cluster number, Fisher-vector

Published

2018-01-10

Downloads

Download data is not yet available.