Normal cluster-weighted models constitute a modern approach to linear regression which simultaneously perform model-based cluster analysis and multivariate linear regression analysis with random quantitative regressors. Robustified models have been recently developed, based on the use of the contaminated normal distribution, which can manage the presence of mildly atypical observations. A more flexible class of contaminated normal linear cluster-weighted models is specified here, in which the researcher is free to use a different vector of regressors for each response. The novel class also includes parsimonious models, where parsimony is attained by imposing suitable constraints on the component-covariance matrices of either the responses or the regressors. Identifiability conditions are illustrated and discussed. An expectation-conditional maximisation algorithm is provided for the maximum likelihood estimation of the model parameters. The effectiveness and usefulness of the proposed models are shown through the analysis of simulated and real datasets.

Perrone, G., Soffritti, G. (2024). Parsimonious Seemingly Unrelated Contaminated Normal Cluster-Weighted Models. JOURNAL OF CLASSIFICATION, 41(November), 533-567 [10.1007/s00357-023-09458-8].

Parsimonious Seemingly Unrelated Contaminated Normal Cluster-Weighted Models

Perrone G.
Primo
;
Soffritti G.
Secondo
2024

Abstract

Normal cluster-weighted models constitute a modern approach to linear regression which simultaneously perform model-based cluster analysis and multivariate linear regression analysis with random quantitative regressors. Robustified models have been recently developed, based on the use of the contaminated normal distribution, which can manage the presence of mildly atypical observations. A more flexible class of contaminated normal linear cluster-weighted models is specified here, in which the researcher is free to use a different vector of regressors for each response. The novel class also includes parsimonious models, where parsimony is attained by imposing suitable constraints on the component-covariance matrices of either the responses or the regressors. Identifiability conditions are illustrated and discussed. An expectation-conditional maximisation algorithm is provided for the maximum likelihood estimation of the model parameters. The effectiveness and usefulness of the proposed models are shown through the analysis of simulated and real datasets.
2024
Perrone, G., Soffritti, G. (2024). Parsimonious Seemingly Unrelated Contaminated Normal Cluster-Weighted Models. JOURNAL OF CLASSIFICATION, 41(November), 533-567 [10.1007/s00357-023-09458-8].
Perrone, G.; Soffritti, G.
File in questo prodotto:
File Dimensione Formato  
preprint per iris senza supplementary material.pdf

embargo fino al 08/01/2025

Descrizione: paper accettato per la pubblicazione
Tipo: Postprint
Licenza: Licenza per accesso libero gratuito
Dimensione 2.58 MB
Formato Adobe PDF
2.58 MB Adobe PDF   Visualizza/Apri   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/994494
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact