Fast approximation of similarity graphs with kernel density estimation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Constructing a similarity graph from a set X of data points in ℝd is the first step of many modern clustering algorithms. However, typical constructions of a similarity graph have high time complexity, and a quadratic space dependency with respect to |X|. We address this limitation and present a new algorithmic framework that constructs a sparse approximation of the fully connected similarity graph while preserving its cluster structure. Our presented algorithm is based on the kernel density estimation problem, and is applicable for arbitrary kernel functions. We compare our designed algorithm with the well-known implementations from the scikit-learn library and the FAISS library, and find that our method significantly outperforms the implementation from both libraries on a variety of datasets.
Original languageEnglish
Title of host publicationNeurIPS Proceedings - Advances in Neural Information Processing Systems 36 (NeurIPS 2023)
EditorsA. Oh, T. Naumann, A. Globeron, K. Saenko, M. Hardt, S. Levine
PublisherCurran Associates, Inc.
Pages67603-67624
Number of pages22
Volume36
ISBN (Print)9781713899921
DOIs
Publication statusPublished - 10 Dec 2023

Fingerprint

Dive into the research topics of 'Fast approximation of similarity graphs with kernel density estimation'. Together they form a unique fingerprint.

Cite this