Core Concept

Load Balancing Algorithms

Load balancers distribute traffic across server fleets to maximize throughput and ensure high availability, operating at either the L4 (network) or L7 (application) layers.


What:

Load balancers act as central gateways, routing incoming traffic streams across a fleet of backend servers to prevent hotspots and ensure high availability.

Primary purpose:

Preventing single-server exhaustion, providing failover routing, and maximizing overall system QPS capacity.

Usually used for:

Stateless web server pools, API gateways, reverse proxies, and microservices internal meshes.

How should I think about this inside system architectures?

🔗 Layer 4 Transport

Low overhead routing. Looks only at IP addresses and TCP port numbers, forwarding raw packets to servers without opening them.

🧠 Layer 7 Smart Routing

High intelligence. Decrypts HTTP/TLS traffic to steer requests dynamically based on URL paths (`/feed`), cookies, or headers.

🏥 Active Health Checks

Continuously ping downstream nodes. Automatically eject crashed servers from the active routing pool within seconds.