The power requirements of modern High-Performance Computing (HPC) systems pose environmental and financial challenges, as they contribute to carbon emissions and strain power grids. Optimizing power consumption together with system performance has thus become a crucial goal for HPC centers. As jobs running on a system contribute to the whole system’s power usage, predicting their power requirements before execution on the system would allow to forecast the overall power consumption and perform techniques like power capping at the workload manager level. Such predictive studies need quality data, which is limited due to the inherent complexity of collecting structured data for job power characterization in a production system. This paper aims to fill the lack of resources for job power prediction and provide the HPC community with (i) a methodology to create a job power consumption dataset from workload manager data and node power metrics logs, and (ii) a novel and large dataset comprising around 230K jobs and their corresponding power consumption values. The dataset is derived from M100, a holistic dataset extracted from a production supercomputer hosted at the HPC centre CINECA in Italy.

Antici, F., Seyedkazemi Ardebili, M., Bartolini, A., Kiziltan, Z. (2023). PM100: A Job Power Consumption Dataset of a Large-scale Production HPC System. New York : Association for Computing Machinery [10.1145/3624062.3624263].

PM100: A Job Power Consumption Dataset of a Large-scale Production HPC System

Antici, Francesco
Primo
;
Seyedkazemi Ardebili, Mohsen;Bartolini, Andrea;Kiziltan, Zeynep
2023

Abstract

The power requirements of modern High-Performance Computing (HPC) systems pose environmental and financial challenges, as they contribute to carbon emissions and strain power grids. Optimizing power consumption together with system performance has thus become a crucial goal for HPC centers. As jobs running on a system contribute to the whole system’s power usage, predicting their power requirements before execution on the system would allow to forecast the overall power consumption and perform techniques like power capping at the workload manager level. Such predictive studies need quality data, which is limited due to the inherent complexity of collecting structured data for job power characterization in a production system. This paper aims to fill the lack of resources for job power prediction and provide the HPC community with (i) a methodology to create a job power consumption dataset from workload manager data and node power metrics logs, and (ii) a novel and large dataset comprising around 230K jobs and their corresponding power consumption values. The dataset is derived from M100, a holistic dataset extracted from a production supercomputer hosted at the HPC centre CINECA in Italy.
2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis (SC-W '23)
1812
1819
Antici, F., Seyedkazemi Ardebili, M., Bartolini, A., Kiziltan, Z. (2023). PM100: A Job Power Consumption Dataset of a Large-scale Production HPC System. New York : Association for Computing Machinery [10.1145/3624062.3624263].
Antici, Francesco; Seyedkazemi Ardebili, Mohsen; Bartolini, Andrea; Kiziltan, Zeynep
File in questo prodotto:
File Dimensione Formato  
3624062.3624263.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 739.59 kB
Formato Adobe PDF
739.59 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/954579
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact