System Design Interview Patterns

What:

A catalog of recurring solution shapes and a delivery rhythm for 45-minute system design interviews.

Primary purpose:

Recognize which pattern applies, name it confidently, and allocate time so the interviewer sees requirements, architecture, and depth — not just one area.

Usually used for:

Product systems (feeds, chat, commerce), infra-adjacent designs (queues, storage), and any problem where the same building blocks reappear with different nouns.

Most interview problems reduce to one or two of these axes:

📡 Push vs Pull

Does the client need live updates? If yes, plan fan-out and connection management. If no, cache + pagination on pull is enough.

⚡ Sync vs Async

Can the user wait? Short requests stay on the API path. Anything over ~200 ms of CPU or I/O belongs on a queue with status polling or webhooks.

📈 Read vs Write Hot

Identify the hot dimension first. Read-heavy → replicas and CDN. Write-heavy → shard keys, batch writes, or aggregate counters.

Seven recurring patterns — map the problem statement to one or two before choosing databases:

Pattern	Core Mechanic	Primary Role
Real-Time Updates	WebSocket, SSE, or long-poll with pub/sub fan-out to connected clients.	Push live state (chat, bids, dashboards) without polling every resource.
Long-Running Tasks	Enqueue work to a durable queue; stateless workers process asynchronously.	Keep API latency low for transcoding, email, reports, and batch jobs.
Contention Control	Distributed locks, compare-and-swap (CAS), or single-writer queues.	Prevent double-booking, overselling inventory, or duplicate payments.
Scaling Reads	Read replicas, layered cache, and CDN edge delivery.	Absorb read-heavy traffic without overloading primary databases.
Scaling Writes	Sharding, write batching, and pre-aggregation counters.	Spread write load and reduce hot-row pressure on a single node.
Large Blob Handling	Presigned multipart upload directly to object storage.	Offload multi-GB files from application servers and API gateways.
Multi-Step Processes	Saga compensations or workflow orchestrators (Temporal-style).	Coordinate checkout, booking, and onboarding across multiple services.

Benefit	Cost
Pattern reuse — once you recognize the shape (feed, chat, checkout), you spend less time inventing from scratch	Over-application — forcing WebSockets or sharding when a simple REST + cache design suffices loses credibility
Structured pacing — a time budget prevents drowning in API details before drawing architecture	Rigid scripts — interviewers may jump to deep dives early; adapt while keeping scope explicit

Interview Phase	Time Budget	What to Cover
Requirements & scope	~5 min	Functional vs non-functional, scale assumptions, in/out of scope
Entities & relationships	~2 min	Core nouns, ownership boundaries, read vs write paths
API surface	~5 min	Key endpoints, idempotency keys, pagination, error contracts
High-level design	~15 min	Boxes-and-arrows diagram, data flow, bottleneck callouts
Deep dives	~10 min	Interviewer-chosen topics: sharding, fan-out, failure modes
Buffer / trade-offs	~8 min	Explicit trade-offs, evolution path, monitoring hooks

Treat the table as a default — senior interviewers often allocate more time to deep dives if your HLD is crisp. Always leave ~2 minutes to summarize trade-offs and next evolution steps.

Pattern Composition: A Live Auction Example

Geographic Proximity Routing