Aggressive thermal management is a critical feature for high-end computing platforms, as worst-case thermal budgeting is becoming unaffordable. Reactive thermal management, which sets temperature thresholds to trigger thermal capping actions, is too "near-sighted", and it may lead to severe performance degradation and thermal overshoots. More aggressive proactive thermal management minimizes performance penalty with smooth optimal control, but it requires the knowledge of the system thermal models to be precise. Unfortunately, in practice these models are not provided by equipment manufacturers, and they strongly depend on the deployment environment. Hence, we need to develop procedures to derive thermal models automatically in the field. In this paper, we focus on static thermal model learning. We tackle the problem in a real-life context: we developed a complete infrastructure for model-building and thermal data collection in the Linux environment, and we tested it on an Intel Nehalem-based server CPU. Model building is based on a least-square procedure which extracts the model linking power dissipation with temperature in steady-state conditions. Our results show high accuracy and robustness even in presence of a complex thermal environment and limited-precision power and temperature measurements typical of today's commercial servers.

Static Thermal Model Learning for High-Performance Multicore Servers / Beneventi F.; Bartolini A.; Benini L.. - STAMPA. - (2011), pp. 1-6. (Intervento presentato al convegno 20th International Conference on Computer Communications and Networks (ICCCN), 2011 tenutosi a Lahaina, HI, USA nel July 31 2011-Aug. 4 2011) [10.1109/ICCCN.2011.6006065].

Static Thermal Model Learning for High-Performance Multicore Servers

BENEVENTI, FRANCESCO;BARTOLINI, ANDREA;BENINI, LUCA
2011

Abstract

Aggressive thermal management is a critical feature for high-end computing platforms, as worst-case thermal budgeting is becoming unaffordable. Reactive thermal management, which sets temperature thresholds to trigger thermal capping actions, is too "near-sighted", and it may lead to severe performance degradation and thermal overshoots. More aggressive proactive thermal management minimizes performance penalty with smooth optimal control, but it requires the knowledge of the system thermal models to be precise. Unfortunately, in practice these models are not provided by equipment manufacturers, and they strongly depend on the deployment environment. Hence, we need to develop procedures to derive thermal models automatically in the field. In this paper, we focus on static thermal model learning. We tackle the problem in a real-life context: we developed a complete infrastructure for model-building and thermal data collection in the Linux environment, and we tested it on an Intel Nehalem-based server CPU. Model building is based on a least-square procedure which extracts the model linking power dissipation with temperature in steady-state conditions. Our results show high accuracy and robustness even in presence of a complex thermal environment and limited-precision power and temperature measurements typical of today's commercial servers.
2011
Computer Communications and Networks (ICCCN), 2011 Proceedings of 20th International Conference on
1
6
Static Thermal Model Learning for High-Performance Multicore Servers / Beneventi F.; Bartolini A.; Benini L.. - STAMPA. - (2011), pp. 1-6. (Intervento presentato al convegno 20th International Conference on Computer Communications and Networks (ICCCN), 2011 tenutosi a Lahaina, HI, USA nel July 31 2011-Aug. 4 2011) [10.1109/ICCCN.2011.6006065].
Beneventi F.; Bartolini A.; Benini L.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/106156
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? ND
social impact