Страница публикации

A Computational Comparison of Parallel and Distributed K-median Clustering Algorithms on Large-Scale Image Data

Авторы: Ushakov A., Vasilyev I.

Журнал: Communications in Computer and Information Science: Proc. of the Intern. Conf. on Mathematical Optimization Theory and Operations Research (MOTOR'2019)

Том: 1090


Год: 2019

Отчётный год: 2019


Местоположение издательства:


Аннотация: Most commonly used clustering algorithms are those aimed at solving the well-known k-median problem. Their main advantage is that they are simple to implement and use, and they are flexible in choosing dissimilarity measures (not necessarily metrics). K-median algorithms are also known to be more robust to noise and outliers in comparison with k-means algorithms. In spite of that, they have been of limited use for large-scale clustering problems due to their high computational and space complexity. This work aims at computational comparison of k-median clustering algorithms in a specific large-scale setting—clustering large image collections. We implement distributed versions of the most common k-median clustering algorithms and compare them with our parallel heuristic for solving large-scale k-median problem instances. We analyze clustering results with respect to external evaluation measures and run time.

Индексируется WOS: 1

Индексируется Scopus: 1

Индексируется РИНЦ: 0

Публикация в печати: 0

Добавил в систему: