Variance matrix priors for Dirichlet process mixture models with Gaussian kernels

Wei Jing, Michail Papathomas*, Silvia Liverani

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Bayesian mixture modelling is widely used for density estimation and clustering. The Dirichlet process mixture model (DPMM) is the most popular Bayesian non-parametric mixture modelling approach. In this manuscript, we study the choice of prior for the variance or precision matrix when Gaussian kernels are adopted. Typically, in the relevant literature, the assessment of mixture models is done by considering observations in a space of only a handful of dimensions. Instead, we are concerned with more realistic problems of higher dimensionality, in a space of up to 20 dimensions. We observe that the choice of prior is increasingly important as the dimensionality of the problem increases. After identifying certain undesirable properties of standard priors in problems of higher dimensionality, we review and implement possible alternative priors. The most promising priors are identified, as well as other factors that affect the convergence of MCMC samplers. Our results show that the choice of prior is critical for deriving reliable posterior inferences. This manuscript offers a thorough overview and comparative investigation into possible priors, with detailed guidelines for their implementation. Although our work focuses on the use of the DPMM in clustering, it is also applicable to density estimation.
Original languageEnglish
Number of pages25
JournalInternational Statistical Review
VolumeEarly View
Early online date15 Sept 2024
DOIs
Publication statusE-pub ahead of print - 15 Sept 2024

Keywords

  • Bayesian non-parametrics
  • Clustering

Fingerprint

Dive into the research topics of 'Variance matrix priors for Dirichlet process mixture models with Gaussian kernels'. Together they form a unique fingerprint.

Cite this