MMKK++ algorithm for clustering heterogeneous images into an unknown number of clusters
Abstract
In this paper we present a suggested automatic clustering procedure with the main aim to predict the number of clusters of unknown, heterogeneous images. We used the state-of-the-art Fisher-vector for mathematical representation of the images and these vectors were considered as input data points for the clustering algorithm. We implemented a novel variant of K-means, the kernel K-means++, furthermore the min-max kernel K-means plusplus (MMKK++) as clustering method. The proposed approach examines some candidate cluster numbers and uses the law of large numbers in order to choose the optimal cluster size. We conducted experiments on four image sets to demonstrate the efficiency of our solution. The first two image sets are subsets of different popular collections; the third is their union; the fourth is the complete Caltech101 image set.Keywords
image clustering, kernel K-means, cluster number, Fisher-vectorPublished
2018-01-10
Downloads
Download data is not yet available.
Copyright (c) 2018 Dávid Papp, Gábor Szűcs
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.