Disaster Recovery vs High Availability

What is high availability?

High availability is a method of designing your storage infrastructure to minimize or even eliminate costly downtime by ensuring you have a failover solution in place. It is meant to address periodic outages that could be caused by hardware failure or routine downtime.

Generally, it is measured as a percentage, with 100% being always available. However, there are diminishing returns as the percentage goes higher. A common target is 99.999% available (or downtime of around 5 minutes a year) for those looking for extremely high availability.

Redundant in the server world means something that can take the place of another to keep your infrastructure running when there is a failure. Adding redundancy is the primary method of designing highly available systems.

At 45Drives, we achieve this by clustering servers. One of the protocols for designing highly available solutions is to eliminate any single point of failure. This is an important aspect that ensures any single component that is failing or has failed won't stop your whole system from working. With a multiple server setup, it is also important that any failures are detected and workloads redirected. With high available clustered solutions - if you're performing maintenance or an entire server goes down, your data will still be available.

*Redundancy also exists on the component level. Disaster Hardware such as redundant power supplies or switches, for example, can keep a single server from going down and prevent you from needing to failover.

So what is disaster recovery?

High availability won't save you from data loss; that is what disaster recovery is for. Having a way to deal with a flood, fire, theft, cyber-attack, an IT admin who makes a catastrophic mistake, or any other way you could take down your entire server infrastructure is important. Losing all your servers and data is rare but can be the end of many businesses, and at best, will likely be expensive. Disaster recovery is all about being prepared for the worst-case scenario when your infrastructure is destroyed. It is a planned strategy, combined with how you designed your system, to make sure recovery is within your acceptable downtime for your business to survive.

Some think disaster recovery is like having a backup, but it is more than that. Disaster recovery also includes the plan for how to handle those situations and minimize additional or surprise expenses. In addition, it means ensuring you have your backup data kept in a geographically separate location so it won't be destroyed with the rest of your data, which can be done over your local network or the internet. Unlike highly available clusters, these copies aren't meant to be quickly failed over when the primary server(s) fail. Instead, they are meant to keep your mission-critical data safe.

Your budget, how quickly you need your data back, and how much data loss you can tolerate (how old your backups can be) will determine what sort of solution you require.

How quickly you get the backup implemented and how old it is will depend on your recovery time objective (RTO) and recovery point objective (RPO).

RPO is how old your data can be when it is back up and running; this determines how often you should back up your server. RTO is how long you can wait for post-catastrophe to have your servers back up and running normally. This can inform how you should design your system to balance costs with recovery time. With these together, you can understand your needs for creating a disaster recovery solution.

So, let's summarize the differences between the two:

High Availability

High availability is a way you can design your storage system to minimize downtime.

Eliminating single points of failure is the core protocol of high availability.

High availability protects you from hardware failure but no data loss. Therefore, it is useful for planned outages such as maintenance.

High availability - Synchronous

Disaster Recovery

Disaster recovery is central for dealing with worst-case scenarios to get your storage systems up as quickly as possible. It is meant to give you protection from situations that could otherwise be lethal to your business.

Having a geographically separated backup is at the core of disaster recovery.

Disaster recovery solutions often contain high availability in their design, especially if they are clustered. Unfortunately, availability is something many with the forethought to plan for disaster recovery also plan for.

Disaster recovery is a higher-level implementation that consists of a combination of a plan and technology design. High availability is much more about the technology design, combining failovers and redundancy to eliminate single points of failure.

Disaster recovery - Asynchronous

Interested in learning more about one or both of these solutions? Talk with an account manager today.