The last few years have seen a growing demand of distributed Cloud infrastructures able to process big data generated by geographically scattered sources. A key challenge of this environment is how to manage big data across multiple heterogeneous datacenters interconnected through imbalanced network links. We designed a Hierarchical Hadoop Framework (H2F) where a top-level business logic smartly schedules bottom-level computing tasks capable of exploiting the potential of the MapReduce within each datacenter. In this work we discuss on the opportunity of fragmenting the big data into small pieces so that better workload configurations may be devised for the bottom-level tasks. Several case study experiments were run on a testbed where a software prototype of the designed framework was deployed. The test results are reported and discussed in the last part of the paper.

Cavallo M., Di Modica G., Polito C., Tomarchio O. (2017). Fragmenting Big Data to Boost the Performance of MapReduce in Geographical Computing Contexts. 345 E 47TH ST, NEW YORK, NY 10017 USA : Institute of Electrical and Electronics Engineers Inc. [10.1109/Innovate-Data.2017.12].

Fragmenting Big Data to Boost the Performance of MapReduce in Geographical Computing Contexts

Di Modica G.;
2017

Abstract

The last few years have seen a growing demand of distributed Cloud infrastructures able to process big data generated by geographically scattered sources. A key challenge of this environment is how to manage big data across multiple heterogeneous datacenters interconnected through imbalanced network links. We designed a Hierarchical Hadoop Framework (H2F) where a top-level business logic smartly schedules bottom-level computing tasks capable of exploiting the potential of the MapReduce within each datacenter. In this work we discuss on the opportunity of fragmenting the big data into small pieces so that better workload configurations may be devised for the bottom-level tasks. Several case study experiments were run on a testbed where a software prototype of the designed framework was deployed. The test results are reported and discussed in the last part of the paper.
2017
Proceedings - 2017 International Conference on Big Data Innovations and Applications, Innovate-Data 2017
17
24
Cavallo M., Di Modica G., Polito C., Tomarchio O. (2017). Fragmenting Big Data to Boost the Performance of MapReduce in Geographical Computing Contexts. 345 E 47TH ST, NEW YORK, NY 10017 USA : Institute of Electrical and Electronics Engineers Inc. [10.1109/Innovate-Data.2017.12].
Cavallo M.; Di Modica G.; Polito C.; Tomarchio O.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/736225
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 2
social impact