doc/architecture.txt - haproxy - Gitiles

                            -------------------
                              H A - P r o x y
                            Architecture  Guide
                            -------------------
                              version 1.1.32
                               willy tarreau
                                2005/07/17


 This document provides real world examples with working configurations.
 Please note that except stated otherwise, global configuration parameters
 such as logging, chrooting, limits and time-outs are not described here.

 ===================================================
 1. Simple HTTP load-balancing with cookie insertion
 ===================================================

 A web application often saturates the front-end server with high CPU loads,
 due to the scripting language involved. It also relies on a back-end database
 which is not much loaded. User contexts are stored on the server itself, and
 not in the database, so that simply adding another server with simple IP/TCP
 load-balancing would not work.

                 +-------+
                 |clients|  clients and/or reverse-proxy
                 +---+---+
                     |
                    -+-----+--------+----
                           |       _|_db
                        +--+--+   (___)
                        | web |   (___)
                        +-----+   (___)
                    192.168.1.1   192.168.1.2


 Replacing the web server with a bigger SMP system would cost much more than
 adding low-cost pizza boxes. The solution is to buy N cheap boxes and install
 the application on them. Install haproxy on the old one which will spread the
 load across the new boxes.

   192.168.1.1    192.168.1.11-192.168.1.14   192.168.1.2
  -------+-----------+-----+-----+-----+--------+----
         |           |     |     |     |       _|_db
      +--+--+      +-+-+ +-+-+ +-+-+ +-+-+    (___)
      | LB1 |      | A | | B | | C | | D |    (___)
      +-----+      +---+ +---+ +---+ +---+    (___)
      haproxy        4 cheap web servers


 Config on haproxy (LB1) :
 -------------------------

     listen webfarm 192.168.1.1:80
        mode http
        balance roundrobin
        cookie SERVERID insert indirect
        option httpchk HEAD /index.html HTTP/1.0
        server webA 192.168.1.11:80 cookie A check
        server webB 192.168.1.12:80 cookie B check
        server webC 192.168.1.13:80 cookie C check
        server webD 192.168.1.14:80 cookie D check


 Description :
 -------------
  - LB1 will receive clients requests.
  - if a request does not contain a cookie, it will be forwarded to a valid
    server
  - in return, a cookie "SERVERID" will be inserted in the response holding the
    server name (eg: "A").
  - when the client comes again with the cookie "SERVERID=A", LB1 will know that
    it must be forwarded to server A. The cookie will be removed so that the
    server does not see it.
  - if server "webA" dies, the requests will be sent to another valid server
    and a cookie will be reassigned.


 Flows :
 -------

 (client)                           (haproxy)                         (server A)
   >-- GET /URI1 HTTP/1.0 ------------> |
                ( no cookie, haproxy forwards in load-balancing mode. )
                                        | >-- GET /URI1 HTTP/1.0 ---------->
                                        | <-- HTTP/1.0 200 OK -------------<
                ( the proxy now adds the server cookie in return )
   <-- HTTP/1.0 200 OK ---------------< |
       Set-Cookie: SERVERID=A           |
   >-- GET /URI2 HTTP/1.0 ------------> |
       Cookie: SERVERID=A               |
       ( the proxy sees the cookie. it forwards to server A and deletes it )
                                        | >-- GET /URI2 HTTP/1.0 ---------->
                                        | <-- HTTP/1.0 200 OK -------------<
    ( the proxy does not add the cookie in return because the client knows it )
   <-- HTTP/1.0 200 OK ---------------< |
   >-- GET /URI3 HTTP/1.0 ------------> |
       Cookie: SERVERID=A               |
                                     ( ... )


 Limits :
 --------
  - if clients use keep-alive (HTTP/1.1), only the first response will have
    a cookie inserted, and only the first request of each session will be
    analyzed. This does not cause trouble in insertion mode because the cookie
    is put immediately in the first response, and the session is maintained to
    the same server for all subsequent requests in the same session. However,
    the cookie will not be removed from the requests forwarded to the servers,
    so the server must not be sensitive to unknown cookies. If this causes
    trouble, you can disable keep-alive by adding the following option :

         option httpclose

  - if for some reason the clients cannot learn more than one cookie (eg: the
    clients are indeed some home-made applications or gateways), and the
    application already produces a cookie, you can use the "prefix" mode (see
    below).

  - LB1 becomes a very sensible server. If LB1 dies, nothing works anymore.
    => you can back it up using keepalived.

  - if the application needs to log the original client's IP, use the
    "forwardfor" option which will add an "X-Forwarded-For" header with the
    original client's IP address. You must also use "httpclose" to ensure
    that you will rewrite every requests and not only the first one of each
    session :

         option httpclose
         option forwardfor

    The web server will have to be configured to use this header instead.
    For example, on apache, you can use LogFormat for this :

         LogFormat "%{X-Forwarded-For}i %l %u %t \"%r\" %>s %b " combined
         CustomLog /var/log/httpd/access_log combined


 ==================================================================
 2. HTTP load-balancing with cookie prefixing and high availability
 ==================================================================

 Now you don't want to add more cookies, but rather use existing ones. The
 application already generates a "JSESSIONID" cookie which is enough to track
 sessions, so we'll prefix this cookie with the server name when we see it.
 Since the load-balancer becomes critical, it will be backed up with a second
 one in VRRP mode using keepalived under Linux.

 Download the latest version of keepalived from this site and install it
 on each load-balancer LB1 and LB2 :

        http://www.keepalived.org/

 You then have a shared IP between the two load-balancers (we will still use the
 original IP). It is active only on one of them at any moment. To allow the
 proxy to bind to the shared IP on Linux 2.4, you must enable it in /proc :

 # echo 1 >/proc/sys/net/ipv4/ip_nonlocal_bind


     shared IP=192.168.1.1
   192.168.1.3  192.168.1.4    192.168.1.11-192.168.1.14   192.168.1.2
  -------+------------+-----------+-----+-----+-----+--------+----
         |            |           |     |     |     |       _|_db
      +--+--+      +--+--+      +-+-+ +-+-+ +-+-+ +-+-+    (___)
      | LB1 |      | LB2 |      | A | | B | | C | | D |    (___)
      +-----+      +-----+      +---+ +---+ +---+ +---+    (___)
      haproxy      haproxy        4 cheap web servers
      keepalived   keepalived


 Config on both proxies (LB1 and LB2) :
 --------------------------------------

     listen webfarm 192.168.1.1:80
        mode http
        balance roundrobin
        cookie JSESSIONID prefix
        option httpclose
        option forwardfor
        option httpchk HEAD /index.html HTTP/1.0
        server webA 192.168.1.11:80 cookie A check
        server webB 192.168.1.12:80 cookie B check
        server webC 192.168.1.13:80 cookie C check
        server webD 192.168.1.14:80 cookie D check


 Notes: the proxy will modify EVERY cookie sent by the client and the server,
 so it is important that it can access to ALL cookies in ALL requests for
 each session. This implies that there is no keep-alive (HTTP/1.1), thus the
 "httpclose" option. Only if you know for sure that the client(s) will never
 use keep-alive (eg: Apache 1.3 in reverse-proxy mode), you can remove this
 option.


 Description :
 -------------
  - LB1 is VRRP master (keepalived), LB2 is backup.
  - LB1 will receive clients requests on IP 192.168.1.1.
  - both load-balancers send their checks from their native IP.
  - if a request does not contain a cookie, it will be forwarded to a valid
    server
  - in return, if a JESSIONID cookie is seen, the server name will be prefixed
    into it, followed by a delimitor ('~')
  - when the client comes again with the cookie "JSESSIONID=A~xxx", LB1 will
    know that it must be forwarded to server A. The server name will then be
    extracted from cookie before it is sent to the server.
  - if server "webA" dies, the requests will be sent to another valid server
    and a cookie will be reassigned.


 Flows :
 -------

 (client)                           (haproxy)                         (server A)
   >-- GET /URI1 HTTP/1.0 ------------> |
                ( no cookie, haproxy forwards in load-balancing mode. )
                                        | >-- GET /URI1 HTTP/1.0 ---------->
                                        |     X-Forwarded-For: 10.1.2.3
                                        | <-- HTTP/1.0 200 OK -------------<
                         ( no cookie, nothing changed )
   <-- HTTP/1.0 200 OK ---------------< |
   >-- GET /URI2 HTTP/1.0 ------------> |
     ( no cookie, haproxy forwards in lb mode, possibly to another server. )
                                        | >-- GET /URI2 HTTP/1.0 ---------->
                                        |     X-Forwarded-For: 10.1.2.3
                                        | <-- HTTP/1.0 200 OK -------------<
                                        |     Set-Cookie: JSESSIONID=123
     ( the cookie is identified, it will be prefixed with the server name )
   <-- HTTP/1.0 200 OK ---------------< |
       Set-Cookie: JSESSIONID=A~123     |
   >-- GET /URI3 HTTP/1.0 ------------> |
       Cookie: JSESSIONID=A~123         |
        ( the proxy sees the cookie, removes the server name and forwards
           to server A which sees the same cookie as it previously sent )
                                        | >-- GET /URI3 HTTP/1.0 ---------->
                                        |     Cookie: JSESSIONID=123
                                        |     X-Forwarded-For: 10.1.2.3
                                        | <-- HTTP/1.0 200 OK -------------<
                         ( no cookie, nothing changed )
   <-- HTTP/1.0 200 OK ---------------< |
                                     ( ... )


 ========================================================
 2.1 Variations involving external layer 4 load-balancers
 ========================================================

 Instead of using a VRRP-based active/backup solution for the proxies,
 they can also be load-balanced by a layer4 load-balancer (eg: Alteon)
 which will also check that the services run fine on both proxies :

               | VIP=192.168.1.1
          +----+----+
          | Alteon  |
          +----+----+
               |
  192.168.1.3  |  192.168.1.4  192.168.1.11-192.168.1.14   192.168.1.2
  -------+-----+------+-----------+-----+-----+-----+--------+----
         |            |           |     |     |     |       _|_db
      +--+--+      +--+--+      +-+-+ +-+-+ +-+-+ +-+-+    (___)
      | LB1 |      | LB2 |      | A | | B | | C | | D |    (___)
      +-----+      +-----+      +---+ +---+ +---+ +---+    (___)
      haproxy      haproxy        4 cheap web servers


 Config on both proxies (LB1 and LB2) :
 --------------------------------------

     listen webfarm 0.0.0.0:80
        mode http
        balance roundrobin
        cookie JSESSIONID prefix
        option httpclose
        option forwardfor
        option httplog
        option dontlognull
        option httpchk HEAD /index.html HTTP/1.0
        server webA 192.168.1.11:80 cookie A check
        server webB 192.168.1.12:80 cookie B check
        server webC 192.168.1.13:80 cookie C check
        server webD 192.168.1.14:80 cookie D check

 The "dontlognull" option is used to prevent the proxy from logging the health
 checks from the Alteon. If a session exchanges no data, then it will not be
 logged.

 Config on the Alteon :
 ----------------------

     /c/slb/real  11
            ena
            name "LB1"
            rip 192.168.1.3
     /c/slb/real  12
            ena
            name "LB2"
            rip 192.168.1.4
     /c/slb/group 10
            name "LB1-2"
            metric roundrobin
            health tcp
            add 11
            add 12
     /c/slb/virt 10
            ena
            vip 192.168.1.1
     /c/slb/virt 10/service http
            group 10


 Note: the health-check on the Alteon is set to "tcp" to prevent the proxy from
 forwarding the connections. It can also be set to "http", but for this the
 proxy must specify a "monitor-net" with the Alteons' addresses, so that the
 Alteon can really check that the proxies can talk HTTP but without forwarding
 the connections to the end servers. Check next section for an example on how to
 use monitor-net.


 ============================================================
 2.2 Generic TCP relaying and external layer 4 load-balancers
 ============================================================

 Sometimes it's useful to be able to relay generic TCP protocols (SMTP, TSE,
 VNC, etc...), for example to interconnect private networks. The problem comes
 when you use external load-balancers which need to send periodic health-checks
 to the proxies, because these health-checks get forwarded to the end servers.
 The solution is to specify a network which will be dedicated to monitoring
 systems and must not lead to a forwarding connection nor to any log, using the
 "monitor-net" keyword. Note: this feature expects a version of haproxy greater
 than or equal to 1.1.32 or 1.2.6.


                 |  VIP=172.16.1.1   |
            +----+----+         +----+----+
            | Alteon1 |         | Alteon2 |
            +----+----+         +----+----+
  192.168.1.252  |  GW=192.168.1.254 |  192.168.1.253
                 |                   |
           ------+---+------------+--+-----------------> TSE farm : 192.168.1.10
        192.168.1.1  |            | 192.168.1.2
                  +--+--+      +--+--+
                  | LB1 |      | LB2 |
                  +-----+      +-----+
                  haproxy      haproxy


 Config on both proxies (LB1 and LB2) :
 --------------------------------------

     listen tse-proxy
        bind :3389,:1494,:5900  # TSE, ICA and VNC at once.
        mode tcp
        balance roundrobin
        server tse-farm 192.168.1.10
        monitor-net 192.168.1.252/31

 The "monitor-net" option instructs the proxies that any connection coming from
 192.168.1.252 or 192.168.1.253 will not be logged nor forwarded and will be
 closed immediately. The Alteon load-balancers will then see the proxies alive
 without perturbating the service.

 Config on the Alteon :
 ----------------------

     /c/l3/if 1
            ena
            addr 192.168.1.252
            mask 255.255.255.0
     /c/slb/real  11
            ena
            name "LB1"
            rip 192.168.1.1
     /c/slb/real  12
            ena
            name "LB2"
            rip 192.168.1.2
     /c/slb/group 10
            name "LB1-2"
            metric roundrobin
            health tcp
            add 11
            add 12
     /c/slb/virt 10
            ena
            vip 172.16.1.1
     /c/slb/virt 10/service 1494
            group 10
     /c/slb/virt 10/service 3389
            group 10
     /c/slb/virt 10/service 5900
            group 10


 =========================================================
 3. Simple HTTP/HTTPS load-balancing with cookie insertion
 =========================================================

 This is the same context as in example 1 above, but the web
 server uses HTTPS.

                 +-------+
                 |clients|  clients
                 +---+---+
                     |
                    -+-----+--------+----
                           |       _|_db
                        +--+--+   (___)
                        | SSL |   (___)
                        | web |   (___)
                        +-----+
                    192.168.1.1   192.168.1.2


 Since haproxy does not handle SSL, this part will have to be extracted from the
 servers (freeing even more ressources) and installed on the load-balancer
 itself. Install haproxy and apache+mod_ssl on the old box which will spread the
 load between the new boxes. Apache will work in SSL reverse-proxy-cache. If the
 application is correctly developped, it might even lower its load. However,
 since there now is a cache between the clients and haproxy, some security
 measures must be taken to ensure that inserted cookies will not be cached.


   192.168.1.1    192.168.1.11-192.168.1.14   192.168.1.2
  -------+-----------+-----+-----+-----+--------+----
         |           |     |     |     |       _|_db
      +--+--+      +-+-+ +-+-+ +-+-+ +-+-+    (___)
      | LB1 |      | A | | B | | C | | D |    (___)
      +-----+      +---+ +---+ +---+ +---+    (___)
      apache         4 cheap web servers
      mod_ssl
      haproxy


 Config on haproxy (LB1) :
 -------------------------

     listen 127.0.0.1:8000
        mode http
        balance roundrobin
        cookie SERVERID insert indirect nocache
        option httpchk HEAD /index.html HTTP/1.0
        server webA 192.168.1.11:80 cookie A check
        server webB 192.168.1.12:80 cookie B check
        server webC 192.168.1.13:80 cookie C check
        server webD 192.168.1.14:80 cookie D check


 Description :
 -------------
  - apache on LB1 will receive clients requests on port 443
  - it forwards it to haproxy bound to 127.0.0.1:8000
  - if a request does not contain a cookie, it will be forwarded to a valid
    server
  - in return, a cookie "SERVERID" will be inserted in the response holding the
    server name (eg: "A"), and a "Cache-control: private" header will be added
    so that the apache does not cache any page containing such cookie.
  - when the client comes again with the cookie "SERVERID=A", LB1 will know that
    it must be forwarded to server A. The cookie will be removed so that the
    server does not see it.
  - if server "webA" dies, the requests will be sent to another valid server
    and a cookie will be reassigned.

 Notes :
 -------
  - if the cookie works in "prefix" mode, there is no need to add the "nocache"
    option because it is an application cookie which will be modified, and the
    application flags will be preserved.
  - if apache 1.3 is used as a front-end before haproxy, it always disables
    HTTP keep-alive on the back-end, so there is no need for the "httpclose"
    option on haproxy.
  - configure apache to set the X-Forwarded-For header itself, and do not do
    it on haproxy if you need the application to know about the client's IP.


 Flows :
 -------

 (apache)                           (haproxy)                         (server A)
   >-- GET /URI1 HTTP/1.0 ------------> |
                ( no cookie, haproxy forwards in load-balancing mode. )
                                        | >-- GET /URI1 HTTP/1.0 ---------->
                                        | <-- HTTP/1.0 200 OK -------------<
                ( the proxy now adds the server cookie in return )
   <-- HTTP/1.0 200 OK ---------------< |
       Set-Cookie: SERVERID=A           |
       Cache-Control: private           |
   >-- GET /URI2 HTTP/1.0 ------------> |
       Cookie: SERVERID=A               |
       ( the proxy sees the cookie. it forwards to server A and deletes it )
                                        | >-- GET /URI2 HTTP/1.0 ---------->
                                        | <-- HTTP/1.0 200 OK -------------<
    ( the proxy does not add the cookie in return because the client knows it )
   <-- HTTP/1.0 200 OK ---------------< |
   >-- GET /URI3 HTTP/1.0 ------------> |
       Cookie: SERVERID=A               |
                                     ( ... )


 ========================================
 4. Soft-stop for application maintenance
 ========================================

 When an application is spread across several severs, the time to update all
 instances increases, so the application seems jerky for a longer period.

 HAproxy offers several solutions for this. Although it cannot be reconfigured
 without being stopped, nor does it offer any external command, there are other
 working solutions.


 =========================================
 4.1 Soft-stop using a file on the servers
 =========================================

 This trick is quite common and very simple: put a file on the server which will
 be checked by the proxy. When you want to stop the server, first remove this
 file. The proxy will see the server as failed, and will not send it any new
 session, only the old ones if the "persist" option is used. Wait a bit then
 stop the server when it does not receive anymore connections.


     listen 192.168.1.1:80
        mode http
        balance roundrobin
        cookie SERVERID insert indirect
        option httpchk HEAD /running HTTP/1.0
        server webA 192.168.1.11:80 cookie A check inter 2000 rise 2 fall 2
        server webB 192.168.1.12:80 cookie B check inter 2000 rise 2 fall 2
        server webC 192.168.1.13:80 cookie C check inter 2000 rise 2 fall 2
        server webD 192.168.1.14:80 cookie D check inter 2000 rise 2 fall 2
        option persist
        redispatch
        contimeout 5000


 Description :
 -------------
  - every 2 seconds, haproxy will try to access the file "/running" on the
    servers, and declare the server as down after 2 attempts (4 seconds).
  - only the servers which respond with a 200 or 3XX response will be used.
  - if a request does not contain a cookie, it will be forwarded to a valid
    server
  - if a request contains a cookie for a failed server, haproxy will insist
    on trying to reach the server anyway, to let the user finish what he was
    doing. ("persist" option)
  - if the server is totally stopped, the connection will fail and the proxy
    will rebalance the client to another server ("redispatch")

 Usage on the web servers :
 --------------------------
 - to start the server :
     # /etc/init.d/httpd start
     # touch /home/httpd/www/running

 - to soft-stop the server
     # rm -f /home/httpd/www/running

 - to completely stop the server :
     # /etc/init.d/httpd stop

 Limits
 ------
 If the server is totally powered down, the proxy will still try to reach it
 for those clients who still have a cookie referencing it, and the connection
 attempt will expire after 5 seconds ("contimeout"), and only after that, the
 client will be redispatched to another server. So this mode is only useful
 for software updates where the server will suddenly refuse the connection
 because the process is stopped. The problem is the same if the server suddenly
 crashes. All of its users will be fairly perturbated.


 ==================================
 4.2 Soft-stop using backup servers
 ==================================

 A better solution which covers every situation is to use backup servers.
 Version 1.1.30 fixed a bug which prevented a backup server from sharing
 the same cookie as a standard server.


     listen 192.168.1.1:80
        mode http
        balance roundrobin
        redispatch
        cookie SERVERID insert indirect
        option httpchk HEAD / HTTP/1.0
        server webA 192.168.1.11:80 cookie A check port 81 inter 2000
        server webB 192.168.1.12:80 cookie B check port 81 inter 2000
        server webC 192.168.1.13:80 cookie C check port 81 inter 2000
        server webD 192.168.1.14:80 cookie D check port 81 inter 2000

        server bkpA 192.168.1.11:80 cookie A check port 80 inter 2000 backup
        server bkpB 192.168.1.12:80 cookie B check port 80 inter 2000 backup
        server bkpC 192.168.1.13:80 cookie C check port 80 inter 2000 backup
        server bkpD 192.168.1.14:80 cookie D check port 80 inter 2000 backup

 Description
 -----------
 Four servers webA..D are checked on their port 81 every 2 seconds. The same
 servers named bkpA..D are checked on the port 80, and share the exact same
 cookies. Those servers will only be used when no other server is available
 for the same cookie.

 When the web servers are started, only the backup servers are seen as
 available. On the web servers, you need to redirect port 81 to local
 port 80, either with a local proxy (eg: a simple haproxy tcp instance),
 or with iptables (linux) or pf (openbsd). This is because we want the
 real web server to reply on this port, and not a fake one. Eg, with
 iptables :

   # /etc/init.d/httpd start
   # iptables -t nat -A PREROUTING -p tcp --dport 81 -j REDIRECT --to-port 80

 A few seconds later, the standard server is seen up and haproxy starts to send
 it new requests on its real port 80 (only new users with no cookie, of course).

 If a server completely crashes (even if it does not respond at the IP level),
 both the standard and backup servers will fail, so clients associated to this
 server will be redispatched to other live servers and will lose their sessions.

 Now if you want to enter a server into maintenance, simply stop it from
 responding on port 81 so that its standard instance will be seen as failed,
 but the backup will still work. Users will not notice anything since the
 service is still operational :

   # iptables -t nat -D PREROUTING -p tcp --dport 81 -j REDIRECT --to-port 80

 The health checks on port 81 for this server will quickly fail, and the
 standard server will be seen as failed. No new session will be sent to this
 server, and existing clients with a valid cookie will still reach it because
 the backup server will still be up.

 Now wait as long as you want for the old users to stop using the service, and
 once you see that the server does not receive any traffic, simply stop it :

   # /etc/init.d/httpd stop

 The associated backup server will in turn fail, and if any client still tries
 to access this particular server, he will be redispatched to any other valid
 server because of the "redispatch" option.

 This method has an advantage : you never touch the proxy when doing server
 maintenance. The people managing the servers can make them disappear smoothly.


 4.2.1 Variations for operating systems without any firewall software
 --------------------------------------------------------------------

 The downside is that you need a redirection solution on the server just for
 the health-checks. If the server OS does not support any firewall software,
 this redirection can also be handled by a simple haproxy in tcp mode :

     global
         daemon
         quiet
         pidfile /var/run/haproxy-checks.pid
     listen 0.0.0.0:81
         mode tcp
         dispatch 127.0.0.1:80
         contimeout 1000
         clitimeout 10000
         srvtimeout 10000

 To start the web service :

   # /etc/init.d/httpd start
   # haproxy -f /etc/haproxy/haproxy-checks.cfg

 To soft-stop the service :

   # kill $(</var/run/haproxy-checks.pid)

 The port 81 will stop responding and the load-balancer will notice the failure.


 4.2.2 Centralizing the server management
 ----------------------------------------

 If one finds it preferable to manage the servers from the load-balancer itself,
 the port redirector can be installed on the load-balancer itself. See the
 example with iptables below.

 Make the servers appear as operational :
   # iptables -t nat -A OUTPUT -d 192.168.1.11 -p tcp --dport 81 -j DNAT --to-dest :80
   # iptables -t nat -A OUTPUT -d 192.168.1.12 -p tcp --dport 81 -j DNAT --to-dest :80
   # iptables -t nat -A OUTPUT -d 192.168.1.13 -p tcp --dport 81 -j DNAT --to-dest :80
   # iptables -t nat -A OUTPUT -d 192.168.1.14 -p tcp --dport 81 -j DNAT --to-dest :80

 Soft stop one server :
   # iptables -t nat -D OUTPUT -d 192.168.1.12 -p tcp --dport 81 -j DNAT --to-dest :80

 Another solution is to use the "COMAFILE" patch provided by Alexander Lazic,
 which is available for download here :

    http://w.ods.org/tools/haproxy/contrib/


 4.2.3 Notes :
 -------------
   - Never, ever, start a fake service on port 81 for the health-checks, because
     a real web service failure will not be detected as long as the fake service
     runs. You must really forward the check port to the real application.

   - health-checks will be sent twice as often, once for each standard server,
     and once for reach backup server. All this will be multiplicated by the
     number of processes if you use multi-process mode. You will have to ensure
     that all the checks sent to the server do not overload it.


 ==================================================
 5. Multi-site load-balancing with local preference
 ==================================================

 5.1 Description of the problem
 ==============================

 Consider a world-wide company with sites on several continents. There are two
 production sites SITE1 and SITE2 which host identical applications. There are
 many offices around the world. For speed and communication cost reasons, each
 office uses the nearest site by default, but can switch to the backup site in
 the event of a site or application failure. There also are users on the
 production sites, which use their local sites by default, but can switch to the
 other site in case of a local application failure.

 The main constraints are :

   - application persistence : although the application is the same on both
     sites, there is no session synchronisation between the sites. A failure
     of one server or one site can cause a user to switch to another server
     or site, but when the server or site comes back, the user must not switch
     again.

   - communication costs : inter-site communication should be reduced to the
     minimum. Specifically, in case of a local application failure, every
     office should be able to switch to the other site without continuing to
     use the default site.

 5.2 Solution
 ============
   - Each production site will have two haproxy load-balancers in front of its
     application servers to balance the load across them and provide local HA.
     We will call them "S1L1" and "S1L2" on site 1, and "S2L1" and "S2L2" on
     site 2. These proxies will extend the application's JSESSIONID cookie to
     put the server name as a prefix.

   - Each production site will have one front-end haproxy director to provide
     the service to local users and to remote offices. It will load-balance
     across the two local load-balancers, and will use the other site's
     load-balancers as backup servers. It will insert the local site identifier
     in a SITE cookie for the local load-balancers, and the remote site
     identifier for the remote load-balancers. These front-end directors will
     be called "SD1" and "SD2" for "Site Director".

   - Each office will have one haproxy near the border gateway which will direct
     local users to their preference site by default, or to the backup site in
     the event of a previous failure. It will also analyze the SITE cookie, and
     direct the users to the site referenced in the cookie. Thus, the preferred
     site will be declared as a normal server, and the backup site will be
     declared as a backup server only, which will only be used when the primary
     site is unreachable, or when the primary site's director has forwarded
     traffic to the second site. These proxies will be called "OP1".."OPXX"
     for "Office Proxy #XX".


 5.3 Network diagram
 ===================

 Note : offices 1 and 2 are on the same continent as site 1, while
        office 3 is on the same continent as site 3. Each production
        site can reach the second one either through the WAN or through
        a dedicated link.


         Office1         Office2                          Office3
          users           users                            users
 192.168  # # #   192.168 # # #                            # # #
 .1.0/24  | | |   .2.0/24 | | |             192.168.3.0/24 | | |
   --+----+-+-+-   --+----+-+-+-                   ---+----+-+-+-
     |      | .1     |      | .1                      |      | .1
     |    +-+-+      |    +-+-+                       |    +-+-+
     |    |OP1|      |    |OP2|                       |    |OP3|  ...
   ,-:-.  +---+    ,-:-.  +---+                     ,-:-.  +---+
  (  X  )         (  X  )                          (  X  )
   `-:-'           `-:-'             ,---.          `-:-'
   --+---------------+------+----~~~(  X  )~~~~-------+---------+-
                            |        `---'                      |
                            |                                   |
                  +---+   ,-:-.                       +---+   ,-:-.
                  |SD1|  (  X  )                      |SD2|  (  X  )
    ( SITE 1 )    +-+-+   `-:-'         ( SITE 2 )    +-+-+   `-:-'
                    |.1     |                           |.1     |
    10.1.1.0/24     |       |     ,---. 10.2.1.0/24     |       |
         -+-+-+-+-+-+-+-----+-+--(  X  )------+-+-+-+-+-+-+-----+-+--
          | | | | |   |       |   `---'       | | | | |   |       |
       ...# # # # #   |.11    |.12         ...# # # # #   |.11    |.12
           Site 1   +-+--+  +-+--+              Site 2  +-+--+  +-+--+
           Local    |S1L1|  |S1L2|              Local   |S2L1|  |S2L2|
           users    +-+--+  +--+-+              users   +-+--+  +--+-+
                      |        |	                         |        |
    10.1.2.0/24    -+-+-+--+--++--      10.2.2.0/24    -+-+-+--+--++--
                    |.1       |.4                       |.1       |.4
                  +-+-+     +-+-+                     +-+-+     +-+-+
                  |W11| ~~~ |W14|                     |W21| ~~~ |W24|
                  +---+     +---+                     +---+     +---+
               4 application servers               4 application servers
                     on site 1                           on site 2


 5.4 Description
 ===============

 5.4.1 Local users
 -----------------
  - Office 1 users connect to OP1 = 192.168.1.1
  - Office 2 users connect to OP2 = 192.168.2.1
  - Office 3 users connect to OP3 = 192.168.3.1
  - Site 1 users connect to SD1 = 10.1.1.1
  - Site 2 users connect to SD2 = 10.2.1.1

 5.4.2 Office proxies
 --------------------
  - Office 1 connects to site 1 by default and uses site 2 as a backup.
  - Office 2 connects to site 1 by default and uses site 2 as a backup.
  - Office 3 connects to site 2 by default and uses site 1 as a backup.

 The offices check the local site's SD proxy every 30 seconds, and the
 remote one every 60 seconds.


 Configuration for Office Proxy OP1
 ----------------------------------

     listen 192.168.1.1:80
        mode http
        balance roundrobin
        redispatch
        cookie SITE
        option httpchk HEAD / HTTP/1.0
        server SD1 10.1.1.1:80 cookie SITE1 check inter 30000
        server SD2 10.2.1.1:80 cookie SITE2 check inter 60000 backup


 Configuration for Office Proxy OP2
 ----------------------------------

     listen 192.168.2.1:80
        mode http
        balance roundrobin
        redispatch
        cookie SITE
        option httpchk HEAD / HTTP/1.0
        server SD1 10.1.1.1:80 cookie SITE1 check inter 30000
        server SD2 10.2.1.1:80 cookie SITE2 check inter 60000 backup


 Configuration for Office Proxy OP3
 ----------------------------------

     listen 192.168.3.1:80
        mode http
        balance roundrobin
        redispatch
        cookie SITE
        option httpchk HEAD / HTTP/1.0
        server SD2 10.2.1.1:80 cookie SITE2 check inter 30000
        server SD1 10.1.1.1:80 cookie SITE1 check inter 60000 backup


 5.4.3 Site directors ( SD1 and SD2 )
 ------------------------------------
 The site directors forward traffic to the local load-balancers, and set a
 cookie to identify the site. If no local load-balancer is available, or if
 the local application servers are all down, it will redirect traffic to the
 remote site, and report this in the SITE cookie. In order not to uselessly
 load each site's WAN link, each SD will check the other site at a lower
 rate. The site directors will also insert their client's address so that
 the application server knows which local user or remote site accesses it.

 The SITE cookie which is set by these directors will also be understood
 by the office proxies. This is important because if SD1 decides to forward
 traffic to site 2, it will write "SITE2" in the "SITE" cookie, and on next
 request, the office proxy will automatically and directly talk to SITE2 if
 it can reach it. If it cannot, it will still send the traffic to SITE1
 where SD1 will in turn try to reach SITE2.

 The load-balancers checks are performed on port 81. As we'll see further,
 the load-balancers provide a health monitoring port 81 which reroutes to
 port 80 but which allows them to tell the SD that they are going down soon
 and that the SD must not use them anymore.


 Configuration for SD1
 ---------------------

     listen 10.1.1.1:80
        mode http
        balance roundrobin
        redispatch
        cookie SITE insert indirect
        option httpchk HEAD / HTTP/1.0
        option forwardfor
        server S1L1 10.1.1.11:80 cookie SITE1 check port 81 inter 4000
        server S1L2 10.1.1.12:80 cookie SITE1 check port 81 inter 4000
        server S2L1 10.2.1.11:80 cookie SITE2 check port 81 inter 8000 backup
        server S2L2 10.2.1.12:80 cookie SITE2 check port 81 inter 8000 backup

 Configuration for SD2
 ---------------------

     listen 10.2.1.1:80
        mode http
        balance roundrobin
        redispatch
        cookie SITE insert indirect
        option httpchk HEAD / HTTP/1.0
        option forwardfor
        server S2L1 10.2.1.11:80 cookie SITE2 check port 81 inter 4000
        server S2L2 10.2.1.12:80 cookie SITE2 check port 81 inter 4000
        server S1L1 10.1.1.11:80 cookie SITE1 check port 81 inter 8000 backup
        server S1L2 10.1.1.12:80 cookie SITE1 check port 81 inter 8000 backup


 5.4.4 Local load-balancers S1L1, S1L2, S2L1, S2L2
 -------------------------------------------------
 Please first note that because SD1 and SD2 use the same cookie for both
 servers on a same site, the second load-balancer of each site will only
 receive load-balanced requests, but as soon as the SITE cookie will be
 set, only the first LB will receive the requests because it will be the
 first one to match the cookie.

 The load-balancers will spread the load across 4 local web servers, and
 use the JSESSIONID provided by the application to provide server persistence
 using the new 'prefix' method. Soft-stop will also be implemented as described
 in section 4 above. Moreover, these proxies will provide their own maintenance
 soft-stop. Port 80 will be used for application traffic, while port 81 will
 only be used for health-checks and locally rerouted to port 80. A grace time
 will be specified to service on port 80, but not on port 81. This way, a soft
 kill (kill -USR1) on the proxy will only kill the health-check forwarder so
 that the site director knows it must not use this load-balancer anymore. But
 the service will still work for 20 seconds and as long as there are established
 sessions.

 These proxies will also be the only ones to disable HTTP keep-alive in the
 chain, because it is enough to do it at one place, and it's necessary to do
 it with 'prefix' cookies.

 Configuration for S1L1/S1L2
 ---------------------------

     listen 10.1.1.11:80 # 10.1.1.12:80 for S1L2
        grace 20000  # don't kill us until 20 seconds have elapsed
        mode http
        balance roundrobin
        cookie JSESSIONID prefix
        option httpclose
        option forwardfor
        option httpchk HEAD / HTTP/1.0
        server W11 10.1.2.1:80 cookie W11 check port 81 inter 2000
        server W12 10.1.2.2:80 cookie W12 check port 81 inter 2000
        server W13 10.1.2.3:80 cookie W13 check port 81 inter 2000
        server W14 10.1.2.4:80 cookie W14 check port 81 inter 2000

        server B11 10.1.2.1:80 cookie W11 check port 80 inter 4000 backup
        server B12 10.1.2.2:80 cookie W12 check port 80 inter 4000 backup
        server B13 10.1.2.3:80 cookie W13 check port 80 inter 4000 backup
        server B14 10.1.2.4:80 cookie W14 check port 80 inter 4000 backup

     listen 10.1.1.11:81 # 10.1.1.12:81 for S1L2
        mode tcp
        dispatch 10.1.1.11:80  # 10.1.1.12:80 for S1L2


 Configuration for S2L1/S2L2
 ---------------------------

     listen 10.2.1.11:80 # 10.2.1.12:80 for S2L2
        grace 20000  # don't kill us until 20 seconds have elapsed
        mode http
        balance roundrobin
        cookie JSESSIONID prefix
        option httpclose
        option forwardfor
        option httpchk HEAD / HTTP/1.0
        server W21 10.2.2.1:80 cookie W21 check port 81 inter 2000
        server W22 10.2.2.2:80 cookie W22 check port 81 inter 2000
        server W23 10.2.2.3:80 cookie W23 check port 81 inter 2000
        server W24 10.2.2.4:80 cookie W24 check port 81 inter 2000

        server B21 10.2.2.1:80 cookie W21 check port 80 inter 4000 backup
        server B22 10.2.2.2:80 cookie W22 check port 80 inter 4000 backup
        server B23 10.2.2.3:80 cookie W23 check port 80 inter 4000 backup
        server B24 10.2.2.4:80 cookie W24 check port 80 inter 4000 backup

     listen 10.2.1.11:81 # 10.2.1.12:81 for S2L2
        mode tcp
        dispatch 10.2.1.11:80  # 10.2.1.12:80 for S2L2


 5.5 Comments
 ------------
 Since each site director sets a cookie identifying the site, remote office
 users will have their office proxies direct them to the right site and stick
 to this site as long as the user still uses the application and the site is
 available. Users on production sites will be directed to the right site by the
 site directors depending on the SITE cookie.

 If the WAN link dies on a production site, the remote office users will not
 see their site anymore, so they will redirect the traffic to the second site.
 If there are dedicated inter-site links as on the diagram above, the second
 SD will see the cookie and still be able to reach the original site. For
 example :

 Office 1 user sends the following to OP1 :
   GET / HTTP/1.0
   Cookie: SITE=SITE1; JSESSIONID=W14~123;

 OP1 cannot reach site 1 because its external router is dead. So the SD1 server
 is seen as dead, and OP1 will then forward the request to SD2 on site 2,
 regardless of the SITE cookie.

 SD2 on site 2 receives a SITE cookie containing "SITE1". Fortunately, it
 can reach Site 1's load balancers S1L1 and S1L2. So it forwards the request
 so S1L1 (the first one with the same cookie).

 S1L1 (on site 1) finds "W14" in the JSESSIONID cookie, so it can forward the
 request to the right server, and the user session will continue to work. Once
 the Site 1's WAN link comes back, OP1 will see SD1 again, and will not route
 through SITE 2 anymore.

 However, when a new user on Office 1 connects to the application during a
 site 1 failure, it does not contain any cookie. Since OP1 does not see SD1
 because of the network failure, it will direct the request to SD2 on site 2,
 which will by default direct the traffic to the local load-balancers, S2L1 and
 S2L2. So only initial users will load the inter-site link, not the new ones.


 ===================
 6. Source balancing
 ===================

 Sometimes it may reveal useful to access servers from a pool of IP addresses
 instead of only one or two. Some equipments (NAT firewalls, load-balancers)
 are sensible to source address, and often need many sources to distribute the
 load evenly amongst their internal hash buckets.

 To do this, you simply have to use several times the same server with a
 different source. Example :

     listen 0.0.0.0:80
        mode tcp
        balance roundrobin
        server from1to1 10.1.1.1:80 source 10.1.2.1
        server from2to1 10.1.1.1:80 source 10.1.2.2
        server from3to1 10.1.1.1:80 source 10.1.2.3
        server from4to1 10.1.1.1:80 source 10.1.2.4
        server from5to1 10.1.1.1:80 source 10.1.2.5
        server from6to1 10.1.1.1:80 source 10.1.2.6
        server from7to1 10.1.1.1:80 source 10.1.2.7
        server from8to1 10.1.1.1:80 source 10.1.2.8