Introduction
Why Load Balancer?
- It splits the traffic of multiple services among multiple machines
- When one node dies, you still have another one
- It supports auto-scaling when a pre-set benchmark is reached (not covered in this article)
- With proper CI/CD and health checks, it can achieve rolling upgrades
- In this article, we are going to build an unmanaged Load Balancer, the easiest kind
- Detailed explanation of every component
The concept in a graph (image source: Google):
- Users on IPv4 and IPv6 make requests to our service
- The IPv4 and IPv6 `forwarding rules` lead the requests to the `HTTP(S) proxy`
- When requests reach the `HTTP(S) proxy`, they are led to a `backend-service` according to the `url-map` we set. For example, requests for domain 'test1' would be led to `backend-service 1`, and requests for 'test2' to `backend-service 2`
- A `backend-service` consists of `instance groups`. For example, we could specify that `backend-service A` leads requests to port 8000 of `instance group A`, and `backend-service B` leads requests to port 6000 of `instance group B`
- An `instance group`, as its name suggests, consists of `instances`. If we set up the `instance group` and `backend-service` properly, a request will reach the `backend-service` and be led to the designated port of an `instance` via the `instance group` and its balancing condition
- Every `backend-service` can have a `health-check`, which periodically probes the specified port and expects a response. If there is no response, or the response is slower than the benchmark we've set, the `instance` is diagnosed as unhealthy, and requests will not be led to unhealthy instances
- If SSL is needed, an `SSL certificate` can be created and attached to the `HTTPS proxy`
- Let’s get our hands dirty now!
Google Cloud SDK Installation
In this article, we are going to use the Google Cloud SDK in every section, so before we start, let's install it first.
The installation method varies by operating system. We could refer to the official documentation.
Create an instance
- Build two instances
gcloud compute instances create test-01 \
--image-project=ubuntu-os-cloud \
--image-family=ubuntu-1804-lts \
--boot-disk-size=30GB \
--boot-disk-type=pd-standard \
--machine-type=f1-micro \
--tags=test-01,http-server,https-server \
--zone=asia-east1-c
gcloud compute instances create test-02 \
--image-project=ubuntu-os-cloud \
--image-family=ubuntu-1804-lts \
--boot-disk-size=30GB \
--boot-disk-type=pd-standard \
--machine-type=f1-micro \
--tags=test-02,http-server,https-server \
--zone=asia-east1-c
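Since the commands for the two instances differ only in the name and tag, a small loop can generate them. This is just a sketch: it only prints the commands, so you can review them before running them (for example by piping the output to `bash`):

```shell
# Print the creation command for one instance; run the output with gcloud
# yourself once it looks right. This does not call gcloud itself.
instance_create_cmd() {
  printf 'gcloud compute instances create %s --image-project=ubuntu-os-cloud --image-family=ubuntu-1804-lts --boot-disk-size=30GB --boot-disk-type=pd-standard --machine-type=f1-micro --tags=%s,http-server,https-server --zone=asia-east1-c\n' "$1" "$1"
}

for name in test-01 test-02; do
  instance_create_cmd "$name"
done
```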
- Build two machines, named test-01 and test-02
- Boot-disk capacity is 30 GB
- Pull the image from `ubuntu-os-cloud`
- Use `ubuntu-1804-lts` as the image family
- The disk type is `pd-standard`; you could check all disk types by running `gcloud compute disk-types list`
- The machine type is `f1-micro`; you could check all machine types by running `gcloud compute machine-types list`
- `tags` serve as identifiers of the `instance`, which we are going to use later when creating `firewall-rules`
- `zone` specifies the zone of the `instance`. Be aware that some resources are limited by `zone` and `region`
- Reference: official documentation
Instance Environment Installation
The following sets up the instance environment. You could simply skip it, because it doesn't have much to do with our subject.
apt-get update -y
apt-get install -y curl git build-essential
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.34.0/install.sh | bash
export NVM_DIR="$HOME/.nvm"
[ -s "$NVM_DIR/nvm.sh" ] && \. "$NVM_DIR/nvm.sh"
[ -s "$NVM_DIR/bash_completion" ] && \. "$NVM_DIR/bash_completion"
nvm install yourNodeVersion
apt-get install -y npm
npm install pm2 -g
pm2 update
Create firewall-rules
We need to create `firewall-rules` for the ports you use, so requests can be led to the instances.
We could refer to the official documentation
gcloud compute firewall-rules create test-01 --allow tcp:123,tcp:456,tcp:789 --target-tags test-01
gcloud compute firewall-rules create test-02 --allow tcp:123,tcp:456,tcp:789 --target-tags test-02
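To make the effect of `--allow` concrete, here is a toy sketch of the decision the firewall makes for a packet aimed at a tagged instance. This is only an illustration of the concept, not how GCP evaluates rules internally:

```shell
# Toy model of the rule above: a TCP packet reaches the instance only if
# its destination port is in the allow list of a rule matching the tag.
allowed_ports="123 456 789"   # from --allow tcp:123,tcp:456,tcp:789

is_allowed() {
  local port=$1
  for p in $allowed_ports; do
    if [ "$p" = "$port" ]; then echo allow; return; fi
  done
  echo deny
}

is_allowed 456   # allow
is_allowed 8080  # deny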
Static External IP
We are going to create a static IP for later use. We could refer to the official documentation
gcloud compute addresses create lb-test --global
Instance group
An `instance group` can consist of multiple `instances`, and it is the building block of a `backend-service`. We could refer to the official documentation.
With `instance groups`, we could create different `backend-services` later on the Load Balancer.
gcloud compute instance-groups unmanaged create test --description 'run test project' --zone asia-east1-c
Set named ports, which could be used by different `backend-services`
gcloud compute instance-groups unmanaged set-named-ports test --named-ports port1:3000,port2:6000,port3:9000,port4:12000,port5:15000 --zone asia-east1-c
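A named port is just a service-name-to-port map carried by the instance group: a backend-service asks for `port2` and gets 6000. A minimal sketch of that lookup, using the names and numbers set above:

```shell
# The named-port map set above: a backend-service refers to "port2" and
# the instance group resolves it to 6000.
named_ports="port1:3000 port2:6000 port3:9000 port4:12000 port5:15000"

resolve_named_port() {
  local pair
  for pair in $named_ports; do
    case "$pair" in
      "$1":*) echo "${pair#*:}"; return 0 ;;
    esac
  done
  return 1   # unknown named port
}

resolve_named_port port2   # 6000
```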
Add the existing `instances` into the `instance group`
gcloud compute instance-groups unmanaged add-instances test \
--instances test-01,test-02 \
--zone asia-east1-c
Health check
A `health-check` probes the specified port at a specified frequency. If there is no response from the port, the `health-check` diagnoses it as unhealthy, and the `backend-service` will not send requests to an unhealthy destination.
We could refer to the official documentation
gcloud compute health-checks create tcp test-tcp-3000 --port 3000
gcloud compute health-checks create tcp test-tcp-6000 --port 6000
gcloud compute health-checks create tcp test-tcp-9000 --port 9000
gcloud compute health-checks create tcp test-tcp-12000 --port 12000
gcloud compute health-checks create tcp test-tcp-15000 --port 15000
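What a TCP health check does boils down to: open a connection to the port within a timeout, and call the target healthy if it succeeds. A rough local imitation (bash-only, using the `/dev/tcp` feature; real GCP health checks probe from Google's address ranges at a configured interval):

```shell
# Minimal imitation of a TCP health check: healthy if host:port accepts a
# connection within the timeout, unhealthy otherwise (bash /dev/tcp feature).
check_tcp() {
  local host=$1 port=$2 timeout_s=${3:-2}
  if timeout "$timeout_s" bash -c "exec 3<>/dev/tcp/$host/$port" 2>/dev/null; then
    echo healthy
  else
    echo unhealthy
  fi
}

check_tcp 127.0.0.1 3000   # healthy only if something is listening on 3000
```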
Backend service
The `--port-name` here is what we set up above. In this example, different `backend-services` will send requests to different ports. The `instance group` has not been mentioned yet? Don't worry: in the next step, we will add the `instance group` into the `backend-service`.
Also, a `health-check` is attached to each `backend-service`, because the `backend-service` is going to decide which `instance` the request should be sent to.
We could refer to the official documentation
- Build backend-service
gcloud compute backend-services create backend-service-port1 \
--protocol http \
--port-name port1 \
--health-checks test-tcp-3000 \
--global
gcloud compute backend-services create backend-service-port2 \
--protocol http \
--port-name port2 \
--health-checks test-tcp-6000 \
--global
gcloud compute backend-services create backend-service-port3 \
--protocol http \
--port-name port3 \
--health-checks test-tcp-9000 \
--global
gcloud compute backend-services create backend-service-port4 \
--protocol http \
--port-name port4 \
--health-checks test-tcp-12000 \
--global
gcloud compute backend-services create backend-service-port5 \
--protocol http \
--port-name port5 \
--health-checks test-tcp-15000 \
--global
Next, we are going to add the `instance group` into the `backend-services` we just built. Since we designated a port when creating each `backend-service`, the `backend-service` will lead requests to that port of the `instance group`.
Besides setting which port requests are sent to, we also set a benchmark for `instance` utilisation. `UTILIZATION` means the percentage of usage: when it reaches 80%, the `backend-service` stops sending requests to this `instance`.
The effective limit is `capacity-scaler * max-utilization`, here 1 * 0.8. So if multiple `backend-services` share one `instance-group` and you want to reserve capacity for some of them, give the others a lower `capacity-scaler`. Once a `backend-service` already uses `capacity-scaler * max-utilization` of the capacity, its requests will no longer be sent to this `instance-group`, which saves the capacity of the `instance-group` for the other `backend-services`.
We could refer to the example below, and also the official documentation
- Add the `instance group` into the `backend-service`
gcloud compute backend-services add-backend backend-service-port1 \
--balancing-mode UTILIZATION \
--max-utilization 0.8 \
--capacity-scaler 1 \
--instance-group test \
--instance-group-zone asia-east1-c \
--global
gcloud compute backend-services add-backend backend-service-port2 \
--balancing-mode UTILIZATION \
--max-utilization 0.8 \
--capacity-scaler 1 \
--instance-group test \
--instance-group-zone asia-east1-c \
--global
gcloud compute backend-services add-backend backend-service-port3 \
--balancing-mode UTILIZATION \
--max-utilization 0.8 \
--capacity-scaler 1 \
--instance-group test \
--instance-group-zone asia-east1-c \
--global
gcloud compute backend-services add-backend backend-service-port4 \
--balancing-mode UTILIZATION \
--max-utilization 0.8 \
--capacity-scaler 1 \
--instance-group test \
--instance-group-zone asia-east1-c \
--global
gcloud compute backend-services add-backend backend-service-port5 \
--balancing-mode UTILIZATION \
--max-utilization 0.8 \
--capacity-scaler 1 \
--instance-group test \
--instance-group-zone asia-east1-c \
--global
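The effective ceiling for a backend is the product `max-utilization * capacity-scaler`; with the values above that is 0.8 * 1 = 80%. A quick sketch of the arithmetic (awk is used because plain shell has no floating-point math):

```shell
# Effective utilisation ceiling for a backend: max_utilization * capacity_scaler.
effective_ceiling() {
  awk -v mu="$1" -v cs="$2" 'BEGIN { printf "%.2f\n", mu * cs }'
}

effective_ceiling 0.8 1     # 0.80 - the settings above: stop at 80% utilisation
effective_ceiling 0.8 0.5   # 0.40 - halve the scaler to reserve capacity
```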
URL map
We’ve covered backend-service
above, and url-map
is what leads request to backend-service
Firstly, let’s create a url-map
, and specify a default backend-service
. It means that if destination is not specified, request would be sent to this default backend-service
We could refer to the example below, also the official documentation
- Create a `url-map`
gcloud compute url-maps create web-map \
--default-service backend-service-port1
After we've created a `url-map` and specified a default `backend-service`, we could specify more rules for which `backend-service` a request should be led to.
We should use a `path-matcher` to specify the path, as in the example below:
- `path-matcher`: creates a `path-matcher` and specifies the rule
- `new-hosts`: requests for the host sunday.com.tw will have this rule applied
This says that requests for sunday.com.tw would be led to `backend-service-port1`.
We could refer to the example below, and also the official documentation
- Add a `path-matcher`
gcloud compute url-maps add-path-matcher web-map \
--default-service backend-service-port1 \
--path-matcher-name pathmap-port1 \
--new-hosts=sunday.com.tw
Here you will find a new component called `path-rules`:
- When the requested host is monday.com.tw and the path is the default `/`, the request would be sent to `backend-service-port2`
- When the requested path is `/happy`, like monday.com.tw/happy, the request would be sent to `backend-service-port1`
- When the requested path is `/unhappy`, like monday.com.tw/unhappy, the request would be sent to `backend-service-port2`
- When the requested path is `/sad`, like monday.com.tw/sad, the request would be sent to `backend-service-port3`
gcloud compute url-maps add-path-matcher web-map \
--default-service backend-service-port2 \
--path-matcher-name pathmap-port2 \
--new-hosts=monday.com.tw \
--path-rules="/happy=backend-service-port1,/unhappy=backend-service-port2,/sad=backend-service-port3"
Same as the example above: requests to tuesday.com.tw would be sent to `backend-service-port3`
gcloud compute url-maps add-path-matcher web-map \
--default-service backend-service-port3 \
--path-matcher-name pathmap-port3 \
--new-hosts=tuesday.com.tw
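Putting the url-map rules together, the routing decision can be imitated with a toy function. This only mirrors the rules configured above for illustration; it is not how GCP implements url-map matching:

```shell
# Toy imitation of the url-map above: match the host, then the path rules,
# falling back to the matcher's default service, then the map's default.
route() {
  local host=$1 path=$2
  case "$host" in
    sunday.com.tw) echo backend-service-port1 ;;
    monday.com.tw)
      case "$path" in
        /happy)   echo backend-service-port1 ;;
        /unhappy) echo backend-service-port2 ;;
        /sad)     echo backend-service-port3 ;;
        *)        echo backend-service-port2 ;;   # matcher default
      esac ;;
    tuesday.com.tw) echo backend-service-port3 ;;
    *) echo backend-service-port1 ;;              # url-map default service
  esac
}

route monday.com.tw /sad   # backend-service-port3
```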
Create an SSL certificate
In order to let our service support HTTPS, we need to create `ssl-certificates`. They could be either `self-managed` or `google-managed`: `self-managed` means you manage the SSL certificate on your own. In the following example, we will use `google-managed`.
We could refer to the official documentation
gcloud beta compute ssl-certificates create www-ssl-cert \
--domains sunday.com.tw,monday.com.tw,tuesday.com.tw
HTTP proxy
All HTTP requests arrive here and are sent to a `backend-service` via the `url-map`.
We could refer to the official documentation
- Create an HTTP proxy
gcloud compute target-http-proxies create http-lb-proxy \
--url-map web-map
HTTPS proxy
All HTTPS requests arrive here and are sent to a `backend-service` via the `url-map`. Also, we are going to attach the `ssl-certificates` we just created, so the `target-https-proxies` can support HTTPS.
We could refer to the official documentation
- Create HTTPS proxy
gcloud compute target-https-proxies create https-lb-proxy \
--url-map web-map \
--ssl-certificates www-ssl-cert
Check Static External IP
List the addresses we’ve created
We could refer to official documentation
gcloud compute addresses list
Forwarding rules
When the requested address and port match the `forwarding-rules`, the request is led to the designated `target-http-proxy`.
Replace [LB_IP_ADDRESS] below with the static external IP we just created.
We could refer to the official documentation
- Create HTTP `forwarding-rules`
gcloud compute forwarding-rules create http-content-rule \
--address [LB_IP_ADDRESS] \
--global \
--target-http-proxy http-lb-proxy \
--ports 80
When the requested address and port match the `forwarding-rules`, the request is led to the designated `target-https-proxy`.
Replace [LB_IP_ADDRESS] below with the static external IP we just created.
We could refer to the official documentation
- Create HTTPS `forwarding-rules`
gcloud compute forwarding-rules create https-content-rule \
--address [LB_IP_ADDRESS] \
--global \
--target-https-proxy https-lb-proxy \
--ports 443
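The two forwarding rules reduce to a port-to-proxy table on the static IP. A toy sketch of that dispatch, again only to illustrate the concept:

```shell
# Toy model of the forwarding rules: traffic to the LB address on port 80
# goes to the HTTP proxy, on 443 to the HTTPS proxy, otherwise no rule matches.
forward() {
  case "$1" in
    80)  echo http-lb-proxy ;;
    443) echo https-lb-proxy ;;
    *)   echo no-rule ;;
  esac
}

forward 443   # https-lb-proxy
```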
Conclusion
The flow of the GCP Load Balancer covered above is as follows:
request => forwarding-rules => target-http(s)-proxy => url-map => backend-service => instance-group => instance
Following the example above, you should be able to run services on your instances and receive, process, and respond to requests properly.
I spent quite a lot of time writing this article, and I hope it helps whoever needs it. If you've read this far, thank you.
Finally, if you find this article helpful, your clap is the best reward for me.
Also, if you find anything incorrect, feel free to let me know.