Normal cluster-weighted models constitute a modern approach to linear regression which simultaneously perform model-based cluster analysis and multivariate linear regression analysis with random quantitative regressors. Robustified models have been recently developed, based on the use of the contaminated normal distribution, which can manage the presence of mildly atypical observations. A more flexible class of contaminated normal linear cluster-weighted models is specified here, in which the researcher is free to use a different vector of regressors for each response. The novel class also includes parsimonious models, where parsimony is attained by imposing suitable constraints on the component-covariance matrices of either the responses or the regressors. Identifiability conditions are illustrated and discussed. An expectation-conditional maximisation algorithm is provided for the maximum likelihood estimation of the model parameters. The effectiveness and usefulness of the proposed models are shown through the analysis of simulated and real datasets.
Perrone G., Soffritti G. (In stampa/Attività in corso). Parsimonious Seemingly Unrelated Contaminated Normal Cluster-Weighted Models. JOURNAL OF CLASSIFICATION, Accepted 05 December 2023 - Published online 08 January 2024, 1-35 [10.1007/s00357-023-09458-8].
Parsimonious Seemingly Unrelated Contaminated Normal Cluster-Weighted Models
Perrone G.Primo
;Soffritti G.
Secondo
In corso di stampa
Abstract
Normal cluster-weighted models constitute a modern approach to linear regression which simultaneously perform model-based cluster analysis and multivariate linear regression analysis with random quantitative regressors. Robustified models have been recently developed, based on the use of the contaminated normal distribution, which can manage the presence of mildly atypical observations. A more flexible class of contaminated normal linear cluster-weighted models is specified here, in which the researcher is free to use a different vector of regressors for each response. The novel class also includes parsimonious models, where parsimony is attained by imposing suitable constraints on the component-covariance matrices of either the responses or the regressors. Identifiability conditions are illustrated and discussed. An expectation-conditional maximisation algorithm is provided for the maximum likelihood estimation of the model parameters. The effectiveness and usefulness of the proposed models are shown through the analysis of simulated and real datasets.File | Dimensione | Formato | |
---|---|---|---|
preprint per iris senza supplementary material.pdf
embargo fino al 08/01/2025
Descrizione: paper accettato per la pubblicazione
Tipo:
Postprint
Licenza:
Licenza per Accesso Aperto. Altra tipologia di licenza compatibile con Open Access
Dimensione
2.58 MB
Formato
Adobe PDF
|
2.58 MB | Adobe PDF | Visualizza/Apri Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.