Fault tolerance management in IaaS clouds / R. Jhawar, V. Piuri. - In: Proceedings of the 2012 IEEE Conference in Europe about space and satellite telecommunications : Rome, 2-5 October 2012. - Piscataway : Institute of Electrical and Electronics Engineers, 2012. - ISBN 9781467346870. - pp. 1-6. (Paper presented at the 1st European Conference on Satellite Telecommunications (ESTEL), held in Rome in 2012) [10.1109/ESTEL.2012.6400113].

Fault tolerance management in IaaS clouds

R. Jhawar; V. Piuri
2012

Abstract

Fault tolerance, reliability and availability in Cloud computing are critical to ensure correct and continuous system operation even in the presence of failures. In this paper, we present an approach to evaluate fault tolerance mechanisms that use virtualization technology to transparently increase the reliability and availability of applications deployed in virtual machines in a Cloud. In contrast to several existing solutions that assume independent failures, we take into account the failure behavior of various server components, the network and the power distribution in a typical Cloud computing infrastructure, the correlation between individual failures, and the impact of each failure on users' applications. We use this evaluation to study fault tolerance mechanisms under different deployment contexts, and as the basis for a methodology to identify and select mechanisms that match users' fault tolerance requirements.
Fault tolerance as a service; Fault tolerance management; Infrastructure clouds
Sector ING-INF/05 - Information Processing Systems
Book Part (author)

Use this identifier to cite or link to this document: http://hdl.handle.net/2434/228418