Performance, reliability and scalability in data-access are key issues in the context of the computing Grid and High Energy Physics data processing and analysis applications, in particular considering the large data size and I/O load that a Large Hadron Collider data centre has to support. In this paper we present the technical details and the results of a large scale validation and performance measurement employing different data-access platforms—namely CASTOR, dCache, GPFS and Scalla/Xrootd. The tests have been performed at the CNAF Tier-1, the central computing facility of the Italian National Institute for Nuclear Research (INFN). Our storage back-end was based on Fibre Channel disk-servers organized in a Storage Area Network, being the disk-servers connected to the computing farm via Gigabit LAN. We used 24 disk-servers, 260 TB of raw-disk space and 280 worker nodes as computing clients, able to run concurrently up to about 1100 jobs. The aim of the test was to perform sequential and random read/write accesses to the data, as well as more realistic access patterns, in order to evaluate efficiency, availability, robustness and performance of the various data-access solutions.
A Comparison of Data-Access Platforms for the Computing of Large Hadron Collider Experiments / M. Bencivenni; A. Carbone; A. Chierici; A. D’Apice; D. De Girolamo; L. dell’Agnello; M. Donatelli; G. Donvito; A. Fella; A. Forti; F. Furano; D. Galli; A. Ghiselli; A. Italiano; E. Lanciotti; G. Lo Re; L. Magnoni; U. Marconi; B. Martelli; M. Mazzucato; P. P. Ricci; F. Rosso; D. Salomoni; R. Santinelli; V. Sapunenko; V. Vagnoni; R. Veraldi; D. Vitlacil; S. Zani; R. Zappi. - In: IEEE TRANSACTIONS ON NUCLEAR SCIENCE. - ISSN 0018-9499. - STAMPA. - 55:3(2008), pp. 4545073.1621-4545073.1630. [10.1109/TNS.2008.924087]
A Comparison of Data-Access Platforms for the Computing of Large Hadron Collider Experiments
CARBONE, ANGELO;A. Chierici;GALLI, DOMENICO;VAGNONI, VINCENZO MARIA;
2008
Abstract
Performance, reliability and scalability in data-access are key issues in the context of the computing Grid and High Energy Physics data processing and analysis applications, in particular considering the large data size and I/O load that a Large Hadron Collider data centre has to support. In this paper we present the technical details and the results of a large scale validation and performance measurement employing different data-access platforms—namely CASTOR, dCache, GPFS and Scalla/Xrootd. The tests have been performed at the CNAF Tier-1, the central computing facility of the Italian National Institute for Nuclear Research (INFN). Our storage back-end was based on Fibre Channel disk-servers organized in a Storage Area Network, being the disk-servers connected to the computing farm via Gigabit LAN. We used 24 disk-servers, 260 TB of raw-disk space and 280 worker nodes as computing clients, able to run concurrently up to about 1100 jobs. The aim of the test was to perform sequential and random read/write accesses to the data, as well as more realistic access patterns, in order to evaluate efficiency, availability, robustness and performance of the various data-access solutions.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.