Back to Search Start Over

Impact of pacemaker failover configuration on mean time to recovery for small cloud clusters

Authors :
Benz, Konstantin
Bohnert, Thomas Michael
Benz, Konstantin
Bohnert, Thomas Michael
Publication Year :
2019

Abstract

In cloud environments High Availability characteristics are established by the usage of failover software (like e.g. HAProxy, Keepalive or Pacemaker). Though these tools enable automatic recovery of cloud services from outages, the recovery can still be very slow if it is not configured adequately. In this paper we developed a "Recovery Time Test" to determine if recovery time depends on configuration of the failover software and how recovery time depends on configuration settings. Another goal of the Recovery Time Test is to determine the factor by which recovery time can be decreased by a given configuration. As proof of concept, we applied the Recovery Time Test to an OpenStack cloud environment which is controlled by the Pacemaker failover software. Pacemaker mean recovery time can take a value between 110 and 160 seconds, if the tool is configured badly. We found that with a proper configuration Pacemaker mean recovery time can be reduced significantly to a value between 15 and 20 seconds.

Details

Database :
OAIster
Notes :
application/pdf, 2014 IEEE 7th International Conference on Cloud Computing, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1097648628
Document Type :
Electronic Resource