Code-Memo

a. What is Load Balancing?

Load balancing involves distributing incoming network or application traffic across multiple servers or resources to ensure that no single server becomes a bottleneck. This technique improves the availability, reliability, and responsiveness of applications by preventing any one server from being overwhelmed.

b. Benefits of Load Balancing:

Enhanced Availability: By distributing traffic across multiple servers, load balancing helps ensure that if one server fails, others can continue to handle requests, thereby improving application availability.
Improved Performance: Load balancing can reduce latency and response times by directing traffic to the least busy servers, which improves the overall user experience.
Scalability: It allows applications to handle increased load by adding more servers to the pool, facilitating horizontal scaling.
Fault Tolerance: Load balancing can detect server failures and reroute traffic to healthy servers, enhancing fault tolerance.

a. Round Robin:

How It Works: Distributes incoming requests sequentially across a pool of servers in a cyclic order.
Advantages:
- Simple and easy to implement.
- Distributes traffic evenly assuming all servers have similar capabilities.
Disadvantages:
- Does not account for server load or performance differences.
- May not be effective if servers have varying capacities.

b. Least Connections:

How It Works: Routes traffic to the server with the fewest active connections.
Advantages:
- Balances load based on server utilization, which can be more effective than round robin.
- Suitable for scenarios where request processing times vary significantly.
Disadvantages:
- May lead to uneven distribution if connection duration varies.

c. Least Response Time:

How It Works: Directs traffic to the server with the lowest response time or latency.
Advantages:
- Improves user experience by minimizing response times.
- Adapts to servers with varying performance characteristics.
Disadvantages:
- Requires continuous monitoring of server response times, which can add overhead.

d. Weighted Round Robin:

How It Works: Assigns weights to servers based on their capacity and distributes requests accordingly. Servers with higher weights receive more traffic.
Advantages:
- Allows for traffic distribution based on server capabilities.
- Useful in environments where servers have different performance levels.
Disadvantages:
- Requires configuration and maintenance of server weights.

e. IP Hash:

How It Works: Routes requests based on the hash of the client’s IP address, ensuring that the same client is directed to the same server.
Advantages:
- Ensures session persistence for clients, as they consistently hit the same server.
- Useful for applications requiring session affinity.
Disadvantages:
- Can lead to uneven distribution if client IP addresses are not well-distributed.

f. Random:

a. Hardware Load Balancers:

How They Work: Physical devices dedicated to distributing network traffic across servers. They are often used in enterprise environments.
Advantages:
- High performance and reliability.
- Feature-rich with advanced capabilities such as SSL offloading and DDoS protection.
Disadvantages:
- Expensive and require dedicated infrastructure.

b. Software Load Balancers:

How They Work: Load balancing is performed by software running on standard servers. Examples include HAProxy and NGINX.
Advantages:
- Cost-effective and flexible.
- Easier to scale and configure.
Disadvantages:
- Performance may be limited by the underlying server’s capabilities.

c. Cloud-Based Load Balancers:

How They Work: Provided by cloud service providers (e.g., AWS Elastic Load Balancing, Azure Load Balancer) and integrate with cloud infrastructure.
Advantages:
- Scalable and easy to deploy.
- Integrated with other cloud services and often includes features like auto-scaling and monitoring.
Disadvantages:
- Dependency on cloud provider’s infrastructure and pricing model.

a. Network Layer Load Balancing:

How It Works: Operates at the transport layer (Layer 4) of the OSI model, distributing traffic based on IP addresses and ports.
Use Cases: Suitable for TCP/UDP traffic and applications where layer 7 (application) information is not needed for routing.

b. Application Layer Load Balancing:

How It Works: Operates at the application layer (Layer 7) of the OSI model, distributing traffic based on application-level information such as HTTP headers or URLs.
Use Cases: Ideal for web applications and services where routing decisions can be based on URL paths, HTTP methods, or other application-specific data.

c. Global Load Balancing:

How It Works: Distributes traffic across multiple geographic locations or data centers based on factors like geographic location, server health, and latency.
Use Cases: Enhances global application performance and provides redundancy across regions.