In this work, we study the asymptotic behavior of mixture of experts (MoE) trained via gradient flow on supervised learning problems. Our main result establishes the propagation of chaos for a MoE as the number of experts diverges. We demonstrate that the corresponding empirical measure of their parameters is close to a probability measure that solves a nonlinear continuity equation, and we provide an explicit convergence rate that depends solely on the number of experts and on the dimensionality of the parameter space. We apply our results to a MoE generated by a quantum neural network.

Melchor Hernandez, A., Pastorello, D., De Palma, G. (2026). Mean-field limit from general mixtures of experts to quantum neural networks. LETTERS IN MATHEMATICAL PHYSICS, 116, 1-23 [10.1007/s11005-026-02065-9].

Mean-field limit from general mixtures of experts to quantum neural networks

Melchor Hernandez Anderson
;
Pastorello Davide;De Palma Giacomo
2026

Abstract

In this work, we study the asymptotic behavior of mixture of experts (MoE) trained via gradient flow on supervised learning problems. Our main result establishes the propagation of chaos for a MoE as the number of experts diverges. We demonstrate that the corresponding empirical measure of their parameters is close to a probability measure that solves a nonlinear continuity equation, and we provide an explicit convergence rate that depends solely on the number of experts and on the dimensionality of the parameter space. We apply our results to a MoE generated by a quantum neural network.
2026
Melchor Hernandez, A., Pastorello, D., De Palma, G. (2026). Mean-field limit from general mixtures of experts to quantum neural networks. LETTERS IN MATHEMATICAL PHYSICS, 116, 1-23 [10.1007/s11005-026-02065-9].
Melchor Hernandez, Anderson; Pastorello, Davide; De Palma, Giacomo
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1060879
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact