Distributed data management at LHC scales is a stagering task, accompained by equally challenging pratical management iussues with storage systems and wide-area networks. CMS data transfer management system, PhEDEx, is designed to handle this task with minimum operator effort, automating the workflows from large scale distribution of HEP experiment datasets down to reliable and scalable transfers of individual files over frequentlly unreliable infrastructures. PhEDEx has been designed and proven to scale beyond the current CMS needs. Few of the techniques we have used are novel, but rarely documented in HEP. We describe many of the techniques we have used to make the system robust and able to deliver high performance. On schema and data organisation we describe our use of hierarchical data organisation, separation of active and inactive data, and tuning the database for the data and access patterns. Regarding monitoring we describe our use of optimised queries, moving queries away from hot tables, and using multi-level performance histograms to precalculate partial aggregated results. Robustness applies both detecting and recovering from local errors, and robustness in the distributed environment. We describe the coding patterns we use for error-resilient and selfhealing agents for the former, and the breakdown of handshakes in file transfer, routing files to destinations, and in managing site presence for the later.

Techniques fro High-Throughput, Reliable Transfer Systems: Break-Down of PhEDEx Design / T.Barrass; D.Bonacorsi; J.Hernandez; J.Rhen; L.Tuura; Y.Wu. - STAMPA. - II:(2006), pp. 1030-1034. (Intervento presentato al convegno CHEP 2006 Computing in High Energy and Nuclear Physics tenutosi a Mumbai,India nel 13-17 Febbraio 2006).

Techniques fro High-Throughput, Reliable Transfer Systems: Break-Down of PhEDEx Design

BONACORSI, DANIELE;
2006

Abstract

Distributed data management at LHC scales is a stagering task, accompained by equally challenging pratical management iussues with storage systems and wide-area networks. CMS data transfer management system, PhEDEx, is designed to handle this task with minimum operator effort, automating the workflows from large scale distribution of HEP experiment datasets down to reliable and scalable transfers of individual files over frequentlly unreliable infrastructures. PhEDEx has been designed and proven to scale beyond the current CMS needs. Few of the techniques we have used are novel, but rarely documented in HEP. We describe many of the techniques we have used to make the system robust and able to deliver high performance. On schema and data organisation we describe our use of hierarchical data organisation, separation of active and inactive data, and tuning the database for the data and access patterns. Regarding monitoring we describe our use of optimised queries, moving queries away from hot tables, and using multi-level performance histograms to precalculate partial aggregated results. Robustness applies both detecting and recovering from local errors, and robustness in the distributed environment. We describe the coding patterns we use for error-resilient and selfhealing agents for the former, and the breakdown of handshakes in file transfer, routing files to destinations, and in managing site presence for the later.
2006
Computing in High Energy and Nuclear Physics(CHEP-2006)
1030
1034
Techniques fro High-Throughput, Reliable Transfer Systems: Break-Down of PhEDEx Design / T.Barrass; D.Bonacorsi; J.Hernandez; J.Rhen; L.Tuura; Y.Wu. - STAMPA. - II:(2006), pp. 1030-1034. (Intervento presentato al convegno CHEP 2006 Computing in High Energy and Nuclear Physics tenutosi a Mumbai,India nel 13-17 Febbraio 2006).
T.Barrass; D.Bonacorsi; J.Hernandez; J.Rhen; L.Tuura; Y.Wu
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/74182
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact