CRIS Current Research Information System

In this paper, we propose and analyze Block-wise Incremental Gradient with Averaging (BIG-A), i.e., a novel optimization algorithm tailored for large-scale and bigdata nonconvex optimization problems with a composite cost function. At each iteration, the algorithm uses a block of a single function gradient to properly update auxiliary variables providing a proxy of a descent direction. We interpret BIG-A as a dynamical system arising from the interconnection between a fast, time-varying scheme and a slow, time-invariant one. This interpretation allows us to prove the convergence properties by using system theory results relying on the LaSalle-Yoshizawa invariance principle and singular perturbations. The solution estimate sequence generated by BIG-A is shown to converge toward the set of stationary points of the problem, which is not assumed to be convex nor satisfying the Polyak-Łojasiewicz condition. If strong convexity is also assumed, linear convergence toward the unique optimal solution is established. Finally, numerical simulations confirm the theoretical findings.

Carnevale, G., Notarnicola, I., Notarstefano, G. (2024). Nonconvex Big-Data Optimization via System Theory: a Block-wise Incremental Gradient Method. Piscataway : Institute of Electrical and Electronics Engineers Inc. [10.1109/cdc56724.2024.10885905].

Nonconvex Big-Data Optimization via System Theory: a Block-wise Incremental Gradient Method

Carnevale, Guido;Notarnicola, Ivano;Notarstefano, Giuseppe

2024

Abstract

In this paper, we propose and analyze Block-wise Incremental Gradient with Averaging (BIG-A), i.e., a novel optimization algorithm tailored for large-scale and bigdata nonconvex optimization problems with a composite cost function. At each iteration, the algorithm uses a block of a single function gradient to properly update auxiliary variables providing a proxy of a descent direction. We interpret BIG-A as a dynamical system arising from the interconnection between a fast, time-varying scheme and a slow, time-invariant one. This interpretation allows us to prove the convergence properties by using system theory results relying on the LaSalle-Yoshizawa invariance principle and singular perturbations. The solution estimate sequence generated by BIG-A is shown to converge toward the set of stationary points of the problem, which is not assumed to be convex nor satisfying the Polyak-Łojasiewicz condition. If strong convexity is also assumed, linear convergence toward the unique optimal solution is established. Finally, numerical simulations confirm the theoretical findings.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Titolo del volume
	
				Proceedings of the IEEE Conference on Decision and Control
			
	Pagina iniziale
	
				4746
			
	Pagina finale
	
				4751
			
	Collana/Serie
	
				PROCEEDINGS OF THE ... IEEE CONFERENCE ON DECISION & CONTROL.
			
	Codice DOI
	
				https://dx.doi.org/10.1109/cdc56724.2024.10885905
			
	Citazione
	
				Carnevale, G., Notarnicola, I., Notarstefano, G. (2024). Nonconvex Big-Data Optimization via System Theory: a Block-wise Incremental Gradient Method. Piscataway : Institute of Electrical and Electronics Engineers Inc. [10.1109/cdc56724.2024.10885905].
			
	Tutti gli autori
	
						Carnevale, Guido; Notarnicola, Ivano; Notarstefano, Giuseppe
					
	Appare nelle tipologie:
	
				4.01 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
final_sbm_cdc_sgd_with_averaging.pdf accesso aperto Tipo: Postprint / Author's Accepted Manuscript (AAM) - versione accettata per la pubblicazione dopo la peer-review Licenza: Licenza per accesso libero gratuito Dimensione 313.66 kB Formato Adobe PDF Visualizza/Apri	313.66 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/1013320

Citazioni

ND

0

0

social impact