项目作者: hiroyuki-kasai

项目描述 :
Sparse simplex projection-based Wasserstein k-means
高级语言: MATLAB
项目地址: git://github.com/hiroyuki-kasai/SSPW-kmeans.git
创建时间: 2021-06-08T05:44:15Z
项目社区:https://github.com/hiroyuki-kasai/SSPW-kmeans

开源协议:MIT License

下载


SSPW k-means: Sparse simplex projection-based Wasserstein k-means


Authors: Hiroyuki Kasai and Takumi Fukunaga

Last page update: June 08, 2021

Latest version: 1.0.0 (see Release notes for more info)


Introduction

This repository contains the code of sparse simplex projection-based Wasserstein k-means, called SSPW k-means, that is a faster Wasserstein k-means algorithm for histogram
data by reducing Wasserstein distance computations and exploiting sparse simplex projection. We shrink data samples, centroids, and the ground cost matrix, which
leads to considerable reduction of the computations used to solve optimal transport problems without loss of clustering quality. Furthermore, SSPW k-means dynamically
reduced the computational complexity by removing lower-valued data samples and harnessing sparse simplex projection while keeping the degradation of clustering quality lower.


Paper

T. Fukunaga and H. Kasai, “Wasserstein k-means with sparse simplex projection,” ICPR2020. Publisher’s site, arXiv.


Folders and files

  1. ./ - Top directory.
  2. ./README.md - This readme file.
  3. ./run_me_first.m - The scipt that you need to run first.
  4. ./demo.m - A demonstration script.
  5. |algorithms - Contains the implementation file of the proposed SSPW k-means
  6. |tools - Contains some files for execution.
  7. |datasets - Contains some datasets.


First to do

Run run_me_first for path configurations.

  1. %% First run the setup script
  2. run_me_first;


Demonstration

Run demo for a demonstration.

  1. %% Execute a demonstration script.
  2. demo;


Notes

  • Some parts are borrowed from below:

    • Staib, Matthew and Jegelka, Stefanie, “Wasserstein k-means++ for Cloud Regime Histogram Clustering,” Proceedings of the Seventh International Workshop on Climate Informatics: CI 2017, 2017, Code.


Problems or questions

If you have any problems or questions, please contact the author: Hiroyuki Kasai (email: hiroyuki dot kasai at waseda dot jp)


Release Notes

  • Version 1.0.0 (June 08, 2021)
    • Initial version.