5 Self Healing Patterns: Important for Distributed Systems

  • Recover itself from an outage.
  • If there is a failure at one part of the system, make sure other healthy parts are running.
  • Automatic resource reduction for faulty parts of the system to reduce unnecessary computing resource utilization.
  • During high load, the ability to prioritize critical functionality to be running, if require de-prioritize or take offline non-critical functionalities.

1. Retry Pattern

Something from History ….

2. Circuit Breaker

Threadpool starvation without Circuit Breaker
Circuit Breaker in Action

3. Load Leveling

Service getting crushed

4. Checkpoint

5. Throttling Pattern



Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Mainak Saha

Mainak Saha


Cloud / Artificial Intelligence / Financial Services Enthusiast ..