We propose a method for imputing missing data by using conditional copula functions. Copulas are a powerful tool for multivariate analysis especially because they allow to i) fit any combination of marginal distribution functions, ii) model the marginal distributions and the dependence structure separately and iii) take into account complex dependence relationships. We present the method and perform a simulation study in order to compare it with two well–known imputation techniques: the regression imputation by EM algorithm and the nearest neighbour donor imputation. By varying different parameters we evaluate the performance of our proposal. Finally, we propose a generalization of our method by using non parametric estimation and inversion algorithms to generate random variates for conditional distributions.

Exploring copulas for the imputation of missing nonlinearly dependent data / Bianchi G.; Di Lascio F.M.L.; Giannerini S.; Manzari A.; Reale A.; Ruocco G.. - STAMPA. - (2009), pp. 429-432. (Intervento presentato al convegno Seventh Meeting of the Classification and Data Analysis Group of the Italian Statistical Society (Cladag) tenutosi a Università di Catania, Italy nel 9-11 settembre 2009).

Exploring copulas for the imputation of missing nonlinearly dependent data

DI LASCIO, FRANCESCA MARTA LILJA;GIANNERINI, SIMONE;
2009

Abstract

We propose a method for imputing missing data by using conditional copula functions. Copulas are a powerful tool for multivariate analysis especially because they allow to i) fit any combination of marginal distribution functions, ii) model the marginal distributions and the dependence structure separately and iii) take into account complex dependence relationships. We present the method and perform a simulation study in order to compare it with two well–known imputation techniques: the regression imputation by EM algorithm and the nearest neighbour donor imputation. By varying different parameters we evaluate the performance of our proposal. Finally, we propose a generalization of our method by using non parametric estimation and inversion algorithms to generate random variates for conditional distributions.
2009
Classification and Data Analysis 2009. Book of Short Papers
429
432
Exploring copulas for the imputation of missing nonlinearly dependent data / Bianchi G.; Di Lascio F.M.L.; Giannerini S.; Manzari A.; Reale A.; Ruocco G.. - STAMPA. - (2009), pp. 429-432. (Intervento presentato al convegno Seventh Meeting of the Classification and Data Analysis Group of the Italian Statistical Society (Cladag) tenutosi a Università di Catania, Italy nel 9-11 settembre 2009).
Bianchi G.; Di Lascio F.M.L.; Giannerini S.; Manzari A.; Reale A.; Ruocco G.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/103853
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact