pivmet: an R package proposing pivotal methods for consensus clustering and mixture modelling

We introduce the R package pivmet, a software that performs different pivotal methods for identifying, extracting, and using the so-called pivotal units that are chosen from a partition of data points to represent the groups to which they belong. Such units turn out be very useful in both unsupervised and supervised learning frameworks such as clustering, classification and mixture modelling. More specifically, applications of pivotal methods include, among the others: a Markov-Chain Monte Carlo (MCMC) relabelling procedure to deal with the well-known label-switching problem occurring during Bayesian estimation of mixture models (Egidi et al., 2018; Frühwirth-Schnatter, 2001; Richardson & Green, 1997; Stephens, 2000); model-based clustering through sparse finite mixture models (SFMM) (Frühwirth-Schnatter & Malsiner-Walli, 2019; Malsiner-Walli et al., 2016); consensus clustering (Strehl & Ghosh, 2002), which may allow to improve classical clustering techniques -e.g. the classical k-means- via a careful seeding; and Dirichlet process mixture models (DPMM) in Bayesian nonparametrics (Escobar & West, 1995; Ferguson, 1973; Neal, 2000).

pivmet: an R package proposing pivotal methods for consensus clustering and mixture modelling

Egidi Leonardo;Pappada Roberta;Pauli Francesco;Torelli Nicola

2024-01-01

Abstract

We introduce the R package pivmet, a software that performs different pivotal methods for identifying, extracting, and using the so-called pivotal units that are chosen from a partition of data points to represent the groups to which they belong. Such units turn out be very useful in both unsupervised and supervised learning frameworks such as clustering, classification and mixture modelling. More specifically, applications of pivotal methods include, among the others: a Markov-Chain Monte Carlo (MCMC) relabelling procedure to deal with the well-known label-switching problem occurring during Bayesian estimation of mixture models (Egidi et al., 2018; Frühwirth-Schnatter, 2001; Richardson & Green, 1997; Stephens, 2000); model-based clustering through sparse finite mixture models (SFMM) (Frühwirth-Schnatter & Malsiner-Walli, 2019; Malsiner-Walli et al., 2016); consensus clustering (Strehl & Ghosh, 2002), which may allow to improve classical clustering techniques -e.g. the classical k-means- via a careful seeding; and Dirichlet process mixture models (DPMM) in Bayesian nonparametrics (Escobar & West, 1995; Ferguson, 1973; Neal, 2000).

Scheda breve

Scheda completa

	Anno
	
				2024
			
	Data ahead of print
	
				12-giu-2024
			
	Stato di pubblicazione
	
				Pubblicato
			
	Rivista
	
				JOURNAL OF OPEN SOURCE SOFTWARE
			
	DOI
	
				https://dx.doi.org/10.21105/joss.06461
			
	URL
	
				https://doi.org/10.21105/joss.06461
			
	Appare nelle tipologie:
	
				1.1 Articolo in Rivista

File in questo prodotto:

File	Dimensione	Formato
paper_pivmet_final.pdf accesso aperto Descrizione: articolo Tipologia: Documento in Versione Editoriale Licenza: Creative commons Dimensione 1.14 MB Formato Adobe PDF Visualizza/Apri	1.14 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11368/3077638

Citazioni

ND

ND

ND

social impact