High Availability (HA)

Definition

High availability means a system is continuously operational and accessible with minimal downtime (often 99.99% uptime or better).

Core concept

HA designs avoid single points of failure by using redundancy (multiple nodes, replicas), automatic failover, and distributed architectures. The goal is close to “always on” operation.

Use cases

Critical services like banking, healthcare, cloud services (AWS, Azure) use HA to meet SLAs. Databases can use clustering, replication, and load balancing for HA.

Trade-offs

HA adds cost and complexity: extra hardware, more complex architecture, and sophisticated monitoring. But it prevents service outages.

Example

A highly available web app might run on multiple servers behind a load balancer; if one fails, traffic shifts to others seamlessly.

References

HA defined as being “accessible and reliable close to 100% of the time”[[37]](https://www.ibm.com/think/topics/high-availability "What is High Availability? | IBM").