The automation of operations is essential to reduce manpower costs and improve the reliability of the system. The Site Status Board (SSB) is a framework which allows Virtual Organizations to monitor their computing activities at distributed sites and to evaluate site performance. The ATLAS experiment intensively uses the SSB for the distributed computing shifts, for estimating data processing and data transfer efficiencies at a particular site, and for implementing automatic exclusion of sites from computing activities, in case of potential problems. The ATLAS SSB provides a real-time aggregated monitoring view and keeps the history of the monitoring metrics. Based on this history, usability of a site from the perspective of ATLAS is calculated. The paper will describe how the SSB is integrated in the ATLAS operations and computing infrastructure and will cover implementation details of the ATLAS SSB sensors and alarm system, based on the information in the SSB. It will demonstrate the positive impact of the use of the SSB on the overall performance of ATLAS computing activities and will overview future plans.

Automating ATLAS Computing Operations using the Site Status Board / J Andreeva;C Borrego Iglesias;S Campana;A Di Girolamo;I Dzhunov;X Espinal Curull;S Gayazov;E Magradze;M M Nowotka;L Rinaldi;P Saiz;J Schovancova;G A Stewart;M Wright. - In: JOURNAL OF PHYSICS. CONFERENCE SERIES. - ISSN 1742-6588. - STAMPA. - 396:(2012), pp. 032072-032078. [10.1088/1742-6596/396/3/032072]

Automating ATLAS Computing Operations using the Site Status Board

RINALDI, LORENZO;
2012

Abstract

The automation of operations is essential to reduce manpower costs and improve the reliability of the system. The Site Status Board (SSB) is a framework which allows Virtual Organizations to monitor their computing activities at distributed sites and to evaluate site performance. The ATLAS experiment intensively uses the SSB for the distributed computing shifts, for estimating data processing and data transfer efficiencies at a particular site, and for implementing automatic exclusion of sites from computing activities, in case of potential problems. The ATLAS SSB provides a real-time aggregated monitoring view and keeps the history of the monitoring metrics. Based on this history, usability of a site from the perspective of ATLAS is calculated. The paper will describe how the SSB is integrated in the ATLAS operations and computing infrastructure and will cover implementation details of the ATLAS SSB sensors and alarm system, based on the information in the SSB. It will demonstrate the positive impact of the use of the SSB on the overall performance of ATLAS computing activities and will overview future plans.
2012
Automating ATLAS Computing Operations using the Site Status Board / J Andreeva;C Borrego Iglesias;S Campana;A Di Girolamo;I Dzhunov;X Espinal Curull;S Gayazov;E Magradze;M M Nowotka;L Rinaldi;P Saiz;J Schovancova;G A Stewart;M Wright. - In: JOURNAL OF PHYSICS. CONFERENCE SERIES. - ISSN 1742-6588. - STAMPA. - 396:(2012), pp. 032072-032078. [10.1088/1742-6596/396/3/032072]
J Andreeva;C Borrego Iglesias;S Campana;A Di Girolamo;I Dzhunov;X Espinal Curull;S Gayazov;E Magradze;M M Nowotka;L Rinaldi;P Saiz;J Schovancova;G A Stewart;M Wright
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/394920
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 3
social impact