This work introduces different approaches to model mean structures in matrix-variate normals, addressing the over-parameterization issue commonly encountered when exploiting these distributions in the context of model-based clustering. The methodology employs parsimonious parameterizations of the means using additive, interaction and/or polynomial terms to reveal the intricate multivariate interdependence underlying the data’s mean structure. Identifiability issues related to the proposed parameterizations are discussed and expressions to compute maximum likelihood estimates of the parameters of the resulting mean-restricted matrix normal are derived. In order to exploit the proposed parameterizations in a model-based clustering setting, finite mixtures of mean-restricted matrix normals are considered. Integrating structured covariance matrices, the approach maintains model flexibility without succumbing to overfitting. An Expectation-Maximization (EM) algorithm is developed to estimate all model parameters. Through a comprehensive simulation study and a real-world example on climate data, the efficacy of the proposed solutions in capturing complex data relationships is demonstrated, offering significant improvements over traditional methods.

Berrettini, M., Galimberti, G., Viroli, C. (2025). Mean-restricted Matrix-variate Normals with an application to clustering. STATISTICS AND COMPUTING, 35(100), 1-20 [10.1007/s11222-025-10641-7].

Mean-restricted Matrix-variate Normals with an application to clustering

Berrettini Marco
;
Galimberti Giuliano;Viroli Cinzia
2025

Abstract

This work introduces different approaches to model mean structures in matrix-variate normals, addressing the over-parameterization issue commonly encountered when exploiting these distributions in the context of model-based clustering. The methodology employs parsimonious parameterizations of the means using additive, interaction and/or polynomial terms to reveal the intricate multivariate interdependence underlying the data’s mean structure. Identifiability issues related to the proposed parameterizations are discussed and expressions to compute maximum likelihood estimates of the parameters of the resulting mean-restricted matrix normal are derived. In order to exploit the proposed parameterizations in a model-based clustering setting, finite mixtures of mean-restricted matrix normals are considered. Integrating structured covariance matrices, the approach maintains model flexibility without succumbing to overfitting. An Expectation-Maximization (EM) algorithm is developed to estimate all model parameters. Through a comprehensive simulation study and a real-world example on climate data, the efficacy of the proposed solutions in capturing complex data relationships is demonstrated, offering significant improvements over traditional methods.
2025
Berrettini, M., Galimberti, G., Viroli, C. (2025). Mean-restricted Matrix-variate Normals with an application to clustering. STATISTICS AND COMPUTING, 35(100), 1-20 [10.1007/s11222-025-10641-7].
Berrettini, Marco; Galimberti, Giuliano; Viroli, Cinzia
File in questo prodotto:
File Dimensione Formato  
s11222-025-10641-7.pdf

accesso aperto

Tipo: Versione (PDF) editoriale / Version Of Record
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 1.24 MB
Formato Adobe PDF
1.24 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1022415
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact