This success story overviews how an American investment bank used Cutover runbook automation software to test disaster recovery scenarios and implement a comprehensive resilience solution.
The need: Reducing the time to test thousands of disaster recovery plans
Like other financial services firms, the American bank had an obligation to demonstrate their ability to deal with a catastrophic scenario in a timely manner. They needed the capability to pull together thousands of standardized technical, disaster recovery plans into test scenarios in minutes rather than weeks. The resilience testing function involved thousands of people globally and up to 2,000 applications that needed to be tested every year.
The existing home-grown system for this process was inadequate and did not provide the level of IT disaster recovery planning, visibility, communication, orchestration, or observability that was needed to ensure success. It lacked the required level of resilience assurance they needed to meet regulatory requirements. Ultimately, they needed to find a better way to store and execute the bank’s disaster recovery plans for their 2,000 services.
The solution: A single data recovery test for improved disaster recovery
Cutover’s SaaS platform and automated runbooks provides an effective disaster recovery planning software for banks. The U.S. bank’s disaster recovery plan included thousands of individual plans and Cutover enabled them to be configured into various test scenarios, in minutes.
Approach taken: Cutover runbooks implementation
Now that they are using Cutover, a single data center recovery (DCR) test that can encompass more than 300 bank disaster recovery plans can be prepared and managed independently and then merged into Cutover for orchestration and enterprise observability. Having the bank’s disaster recovery plans in Cutover automated runbooks also allows them to standardize with templates so that there is minimal work to finalize DCR test runbooks. Cutover provides users with the status information and updates they need during the test itself to ensure success without manual effort, such as being able to visualize the critical path, which is highlighted in Cutover.
To increase efficiency and resilience, the bank also created templated disaster recovery plans in Cutover. This made it quicker and easier for users to find, review, edit, and execute the plans and build new ones based on templates, and to collect data on executed disaster recovery plans for the bank.
Other ways Cutover improved the process:
- Auto-calculation of recovery time objective through structured disaster recovery plans for the bank
- Standardized and observable event execution led to fewer issues and better decision making
- Providing the ability to benchmark performance in data center tests against previous runs
- Integrated with existing apps to provide observability across the entire process
- Better compliance as there was demonstrable evidence of testing and the associated timings
- Robust auditability and reporting resulted in meeting all audit requirements
Results: Better informed, faster disaster recovery
Using Cutover, the bank was able to reduce event planning time by 70% and easily facilitated and recorded all 143,000 completed tasks across its 10,000 users.
The bank is now in the process of decommissioning an existing system in favor of Cutover. The team running the event is now better informed during data center tests, helping to improve decision making. They were also better able to collaborate with the auditor and meet audit points, as a record of all activity was automatically provided by Cutover.
Due to the success of past and current resilience activities, Cutover is also being used to support building power downs and to orchestrate some of the infrastructure parts of resilience testing, including the application testing part of DCR tests.
Let Cutover runbooks help you!
Cutover helps organizations recover applications more confidently to meet regulatory requirements. Contact Cutover to learn how we can help you effectively plan and execute large-scale IT disaster recovery scenarios. Schedule a demo today to learn more.