System Design Problem

Design a Backup and Disaster Recovery System

Commonly Asked By:AWSGoogleMicrosoftDropbox

  • Full backups: Complete copy of all data (databases, object storage, config)
  • Incremental backups: Only changes since last backup
  • Point-in-time recovery (PITR): Restore to any second within retention window
  • Cross-region replication: Backups stored in geographically separate region
  • Backup scheduling: Configurable policies (hourly incremental, daily full, weekly archive)
  • Restore testing: Automated periodic restore verification
  • Multi-tier storage: Recent on fast storage, older on cold/archive
  • RPO and RTO enforcement
  • Disaster recovery runbook: Automated failover to DR region
Loading...

DR Strategy Tiers

StrategyRPORTOCostDescription
Backup & RestoreHoursHours$Restore from S3 on demand
Pilot LightMinutes30 min$$Minimal infra, scale on failover
Warm StandbySeconds15 min$$$Reduced capacity, always running
Active-Active0~0$$$$Both regions serve traffic