Performance, reliability and scalability in data-access are key issues in the context of the computing Grid and High Energy Physics data processing and analysis applications, in particular considering the large data size and I/O load that a Large Hadron Collider data centre has to support. In this paper we present the technical details and the results of a large scale validation and performance measurement employing different data-access platforms—namely CASTOR, dCache, GPFS and Scalla/Xrootd. The tests have been performed at the CNAF Tier-1, the central computing facility of the Italian National Institute for Nuclear Research (INFN). Our storage back-end was based on Fibre Channel disk-servers organized in a Storage Area Network, being the disk-servers connected to the computing farm via Gigabit LAN. We used 24 disk-servers, 260 TB of raw-disk space and 280 worker nodes as computing clients, able to run concurrently up to about 1100 jobs. The aim of the test was to perform sequential and random read/write accesses to the data, as well as more realistic access patterns, in order to evaluate efficiency, availability, robustness and performance of the various data-access solutions.

M. Bencivenni, A. Carbone, A. Chierici, A. D’Apice, D. De Girolamo, L. dell’Agnello, et al. (2008). A Comparison of Data-Access Platforms for the Computing of Large Hadron Collider Experiments. IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 55(3), 1621-1630 [10.1109/TNS.2008.924087].

A Comparison of Data-Access Platforms for the Computing of Large Hadron Collider Experiments

CARBONE, ANGELO;A. Chierici;GALLI, DOMENICO;VAGNONI, VINCENZO MARIA;
2008

Abstract

Performance, reliability and scalability in data-access are key issues in the context of the computing Grid and High Energy Physics data processing and analysis applications, in particular considering the large data size and I/O load that a Large Hadron Collider data centre has to support. In this paper we present the technical details and the results of a large scale validation and performance measurement employing different data-access platforms—namely CASTOR, dCache, GPFS and Scalla/Xrootd. The tests have been performed at the CNAF Tier-1, the central computing facility of the Italian National Institute for Nuclear Research (INFN). Our storage back-end was based on Fibre Channel disk-servers organized in a Storage Area Network, being the disk-servers connected to the computing farm via Gigabit LAN. We used 24 disk-servers, 260 TB of raw-disk space and 280 worker nodes as computing clients, able to run concurrently up to about 1100 jobs. The aim of the test was to perform sequential and random read/write accesses to the data, as well as more realistic access patterns, in order to evaluate efficiency, availability, robustness and performance of the various data-access solutions.
2008
M. Bencivenni, A. Carbone, A. Chierici, A. D’Apice, D. De Girolamo, L. dell’Agnello, et al. (2008). A Comparison of Data-Access Platforms for the Computing of Large Hadron Collider Experiments. IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 55(3), 1621-1630 [10.1109/TNS.2008.924087].
M. Bencivenni; A. Carbone; A. Chierici; A. D’Apice; D. De Girolamo; L. dell’Agnello; M. Donatelli; G. Donvito; A. Fella; A. Forti; F. Furano; D. Galli...espandi
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/61347
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 12
social impact