Blame - doc/architecture.txt - haproxy

blob: b875b8480701e77a273f61096466c56074deed82 [file] [log] [blame]

willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	1	-------------------
				2	H A - P r o x y
				3	Architecture Guide
				4	-------------------
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	5	version 1.1.32
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	6	willy tarreau
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	7	2005/07/17
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	8
				9
				10	This document provides real world examples with working configurations.
				11	Please note that except stated otherwise, global configuration parameters
				12	such as logging, chrooting, limits and time-outs are not described here.
				13
				14	===================================================
				15	1. Simple HTTP load-balancing with cookie insertion
				16	===================================================
				17
				18	A web application often saturates the front-end server with high CPU loads,
				19	due to the scripting language involved. It also relies on a back-end database
				20	which is not much loaded. User contexts are stored on the server itself, and
				21	not in the database, so that simply adding another server with simple IP/TCP
				22	load-balancing would not work.
				23
				24	+-------+
				25	\|clients\| clients and/or reverse-proxy
				26	+---+---+
				27	\|
				28	-+-----+--------+----
				29	\| _\|_db
				30	+--+--+ (___)
				31	\| web \| (___)
				32	+-----+ (___)
				33	192.168.1.1 192.168.1.2
				34
				35
				36	Replacing the web server with a bigger SMP system would cost much more than
				37	adding low-cost pizza boxes. The solution is to buy N cheap boxes and install
				38	the application on them. Install haproxy on the old one which will spread the
				39	load across the new boxes.
				40
				41	192.168.1.1 192.168.1.11-192.168.1.14 192.168.1.2
				42	-------+-----------+-----+-----+-----+--------+----
				43	\| \| \| \| \| _\|_db
				44	+--+--+ +-+-+ +-+-+ +-+-+ +-+-+ (___)
				45	\| LB1 \| \| A \| \| B \| \| C \| \| D \| (___)
				46	+-----+ +---+ +---+ +---+ +---+ (___)
				47	haproxy 4 cheap web servers
				48
				49
				50	Config on haproxy (LB1) :
				51	-------------------------
				52
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	53	listen webfarm 192.168.1.1:80
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	54	mode http
				55	balance roundrobin
				56	cookie SERVERID insert indirect
				57	option httpchk HEAD /index.html HTTP/1.0
				58	server webA 192.168.1.11:80 cookie A check
				59	server webB 192.168.1.12:80 cookie B check
				60	server webC 192.168.1.13:80 cookie C check
				61	server webD 192.168.1.14:80 cookie D check
				62
				63
				64	Description :
				65	-------------
				66	- LB1 will receive clients requests.
				67	- if a request does not contain a cookie, it will be forwarded to a valid
				68	server
				69	- in return, a cookie "SERVERID" will be inserted in the response holding the
				70	server name (eg: "A").
				71	- when the client comes again with the cookie "SERVERID=A", LB1 will know that
				72	it must be forwarded to server A. The cookie will be removed so that the
				73	server does not see it.
				74	- if server "webA" dies, the requests will be sent to another valid server
				75	and a cookie will be reassigned.
				76
				77
				78	Flows :
				79	-------
				80
				81	(client) (haproxy) (server A)
				82	>-- GET /URI1 HTTP/1.0 ------------> \|
				83	( no cookie, haproxy forwards in load-balancing mode. )
				84	\| >-- GET /URI1 HTTP/1.0 ---------->
				85	\| <-- HTTP/1.0 200 OK -------------<
				86	( the proxy now adds the server cookie in return )
				87	<-- HTTP/1.0 200 OK ---------------< \|
				88	Set-Cookie: SERVERID=A \|
				89	>-- GET /URI2 HTTP/1.0 ------------> \|
				90	Cookie: SERVERID=A \|
				91	( the proxy sees the cookie. it forwards to server A and deletes it )
				92	\| >-- GET /URI2 HTTP/1.0 ---------->
				93	\| <-- HTTP/1.0 200 OK -------------<
				94	( the proxy does not add the cookie in return because the client knows it )
				95	<-- HTTP/1.0 200 OK ---------------< \|
				96	>-- GET /URI3 HTTP/1.0 ------------> \|
				97	Cookie: SERVERID=A \|
				98	( ... )
				99
				100
				101	Limits :
				102	--------
				103	- if clients use keep-alive (HTTP/1.1), only the first response will have
				104	a cookie inserted, and only the first request of each session will be
				105	analyzed. This does not cause trouble in insertion mode because the cookie
				106	is put immediately in the first response, and the session is maintained to
				107	the same server for all subsequent requests in the same session. However,
				108	the cookie will not be removed from the requests forwarded to the servers,
				109	so the server must not be sensitive to unknown cookies. If this causes
				110	trouble, you can disable keep-alive by adding the following option :
				111
				112	option httpclose
				113
				114	- if for some reason the clients cannot learn more than one cookie (eg: the
				115	clients are indeed some home-made applications or gateways), and the
				116	application already produces a cookie, you can use the "prefix" mode (see
				117	below).
				118
				119	- LB1 becomes a very sensible server. If LB1 dies, nothing works anymore.
				120	=> you can back it up using keepalived.
				121
				122	- if the application needs to log the original client's IP, use the
				123	"forwardfor" option which will add an "X-Forwarded-For" header with the
				124	original client's IP address. You must also use "httpclose" to ensure
				125	that you will rewrite every requests and not only the first one of each
				126	session :
				127
				128	option httpclose
				129	option forwardfor
				130
				131	The web server will have to be configured to use this header instead.
				132	For example, on apache, you can use LogFormat for this :
				133
				134	LogFormat "%{X-Forwarded-For}i %l %u %t \"%r\" %>s %b " combined
				135	CustomLog /var/log/httpd/access_log combined
				136
				137
				138	==================================================================
				139	2. HTTP load-balancing with cookie prefixing and high availability
				140	==================================================================
				141
				142	Now you don't want to add more cookies, but rather use existing ones. The
				143	application already generates a "JSESSIONID" cookie which is enough to track
				144	sessions, so we'll prefix this cookie with the server name when we see it.
				145	Since the load-balancer becomes critical, it will be backed up with a second
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	146	one in VRRP mode using keepalived under Linux.
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	147
				148	Download the latest version of keepalived from this site and install it
				149	on each load-balancer LB1 and LB2 :
				150
				151	http://www.keepalived.org/
				152
				153	You then have a shared IP between the two load-balancers (we will still use the
				154	original IP). It is active only on one of them at any moment. To allow the
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	155	proxy to bind to the shared IP on Linux 2.4, you must enable it in /proc :
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	156
				157	# echo 1 >/proc/sys/net/ipv4/ip_nonlocal_bind
				158
				159
				160	shared IP=192.168.1.1
				161	192.168.1.3 192.168.1.4 192.168.1.11-192.168.1.14 192.168.1.2
				162	-------+------------+-----------+-----+-----+-----+--------+----
				163	\| \| \| \| \| \| _\|_db
				164	+--+--+ +--+--+ +-+-+ +-+-+ +-+-+ +-+-+ (___)
				165	\| LB1 \| \| LB2 \| \| A \| \| B \| \| C \| \| D \| (___)
				166	+-----+ +-----+ +---+ +---+ +---+ +---+ (___)
				167	haproxy haproxy 4 cheap web servers
				168	keepalived keepalived
				169
				170
				171	Config on both proxies (LB1 and LB2) :
				172	--------------------------------------
				173
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	174	listen webfarm 192.168.1.1:80
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	175	mode http
				176	balance roundrobin
				177	cookie JSESSIONID prefix
				178	option httpclose
				179	option forwardfor
				180	option httpchk HEAD /index.html HTTP/1.0
				181	server webA 192.168.1.11:80 cookie A check
				182	server webB 192.168.1.12:80 cookie B check
				183	server webC 192.168.1.13:80 cookie C check
				184	server webD 192.168.1.14:80 cookie D check
				185
				186
				187	Notes: the proxy will modify EVERY cookie sent by the client and the server,
				188	so it is important that it can access to ALL cookies in ALL requests for
				189	each session. This implies that there is no keep-alive (HTTP/1.1), thus the
				190	"httpclose" option. Only if you know for sure that the client(s) will never
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	191	use keep-alive (eg: Apache 1.3 in reverse-proxy mode), you can remove this
				192	option.
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	193
				194
				195	Description :
				196	-------------
				197	- LB1 is VRRP master (keepalived), LB2 is backup.
				198	- LB1 will receive clients requests on IP 192.168.1.1.
				199	- both load-balancers send their checks from their native IP.
				200	- if a request does not contain a cookie, it will be forwarded to a valid
				201	server
				202	- in return, if a JESSIONID cookie is seen, the server name will be prefixed
				203	into it, followed by a delimitor ('~')
				204	- when the client comes again with the cookie "JSESSIONID=A~xxx", LB1 will
				205	know that it must be forwarded to server A. The server name will then be
				206	extracted from cookie before it is sent to the server.
				207	- if server "webA" dies, the requests will be sent to another valid server
				208	and a cookie will be reassigned.
				209
				210
				211	Flows :
				212	-------
				213
				214	(client) (haproxy) (server A)
				215	>-- GET /URI1 HTTP/1.0 ------------> \|
				216	( no cookie, haproxy forwards in load-balancing mode. )
				217	\| >-- GET /URI1 HTTP/1.0 ---------->
				218	\| X-Forwarded-For: 10.1.2.3
				219	\| <-- HTTP/1.0 200 OK -------------<
				220	( no cookie, nothing changed )
				221	<-- HTTP/1.0 200 OK ---------------< \|
				222	>-- GET /URI2 HTTP/1.0 ------------> \|
				223	( no cookie, haproxy forwards in lb mode, possibly to another server. )
				224	\| >-- GET /URI2 HTTP/1.0 ---------->
				225	\| X-Forwarded-For: 10.1.2.3
				226	\| <-- HTTP/1.0 200 OK -------------<
				227	\| Set-Cookie: JSESSIONID=123
				228	( the cookie is identified, it will be prefixed with the server name )
				229	<-- HTTP/1.0 200 OK ---------------< \|
				230	Set-Cookie: JSESSIONID=A~123 \|
				231	>-- GET /URI3 HTTP/1.0 ------------> \|
				232	Cookie: JSESSIONID=A~123 \|
				233	( the proxy sees the cookie, removes the server name and forwards
				234	to server A which sees the same cookie as it previously sent )
				235	\| >-- GET /URI3 HTTP/1.0 ---------->
				236	\| Cookie: JSESSIONID=123
				237	\| X-Forwarded-For: 10.1.2.3
				238	\| <-- HTTP/1.0 200 OK -------------<
				239	( no cookie, nothing changed )
				240	<-- HTTP/1.0 200 OK ---------------< \|
				241	( ... )
				242
				243
				244
				245	========================================================
				246	2.1 Variations involving external layer 4 load-balancers
				247	========================================================
				248
				249	Instead of using a VRRP-based active/backup solution for the proxies,
				250	they can also be load-balanced by a layer4 load-balancer (eg: Alteon)
				251	which will also check that the services run fine on both proxies :
				252
				253	\| VIP=192.168.1.1
				254	+----+----+
				255	\| Alteon \|
				256	+----+----+
				257	\|
				258	192.168.1.3 \| 192.168.1.4 192.168.1.11-192.168.1.14 192.168.1.2
				259	-------+-----+------+-----------+-----+-----+-----+--------+----
				260	\| \| \| \| \| \| _\|_db
				261	+--+--+ +--+--+ +-+-+ +-+-+ +-+-+ +-+-+ (___)
				262	\| LB1 \| \| LB2 \| \| A \| \| B \| \| C \| \| D \| (___)
				263	+-----+ +-----+ +---+ +---+ +---+ +---+ (___)
				264	haproxy haproxy 4 cheap web servers
				265
				266
				267	Config on both proxies (LB1 and LB2) :
				268	--------------------------------------
				269
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	270	listen webfarm 0.0.0.0:80
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	271	mode http
				272	balance roundrobin
				273	cookie JSESSIONID prefix
				274	option httpclose
				275	option forwardfor
				276	option httplog
				277	option dontlognull
				278	option httpchk HEAD /index.html HTTP/1.0
				279	server webA 192.168.1.11:80 cookie A check
				280	server webB 192.168.1.12:80 cookie B check
				281	server webC 192.168.1.13:80 cookie C check
				282	server webD 192.168.1.14:80 cookie D check
				283
				284	The "dontlognull" option is used to prevent the proxy from logging the health
				285	checks from the Alteon. If a session exchanges no data, then it will not be
				286	logged.
				287
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	288	Config on the Alteon :
				289	----------------------
				290
				291	/c/slb/real 11
				292	ena
				293	name "LB1"
				294	rip 192.168.1.3
				295	/c/slb/real 12
				296	ena
				297	name "LB2"
				298	rip 192.168.1.4
				299	/c/slb/group 10
				300	name "LB1-2"
				301	metric roundrobin
				302	health tcp
				303	add 11
				304	add 12
				305	/c/slb/virt 10
				306	ena
				307	vip 192.168.1.1
				308	/c/slb/virt 10/service http
				309	group 10
				310
				311
				312	Note: the health-check on the Alteon is set to "tcp" to prevent the proxy from
				313	forwarding the connections. It can also be set to "http", but for this the
				314	proxy must specify a "monitor-net" with the Alteons' addresses, so that the
				315	Alteon can really check that the proxies can talk HTTP but without forwarding
				316	the connections to the end servers. Check next section for an example on how to
				317	use monitor-net.
				318
				319
				320	============================================================
				321	2.2 Generic TCP relaying and external layer 4 load-balancers
				322	============================================================
				323
				324	Sometimes it's useful to be able to relay generic TCP protocols (SMTP, TSE,
				325	VNC, etc...), for example to interconnect private networks. The problem comes
				326	when you use external load-balancers which need to send periodic health-checks
				327	to the proxies, because these health-checks get forwarded to the end servers.
				328	The solution is to specify a network which will be dedicated to monitoring
				329	systems and must not lead to a forwarding connection nor to any log, using the
				330	"monitor-net" keyword. Note: this feature expects a version of haproxy greater
				331	than or equal to 1.1.32 or 1.2.6.
				332
				333
				334	\| VIP=172.16.1.1 \|
				335	+----+----+ +----+----+
				336	\| Alteon1 \| \| Alteon2 \|
				337	+----+----+ +----+----+
				338	192.168.1.252 \| GW=192.168.1.254 \| 192.168.1.253
				339	\| \|
				340	------+---+------------+--+-----------------> TSE farm : 192.168.1.10
				341	192.168.1.1 \| \| 192.168.1.2
				342	+--+--+ +--+--+
				343	\| LB1 \| \| LB2 \|
				344	+-----+ +-----+
				345	haproxy haproxy
				346
				347
				348	Config on both proxies (LB1 and LB2) :
				349	--------------------------------------
				350
				351	listen tse-proxy
				352	bind :3389,:1494,:5900 # TSE, ICA and VNC at once.
				353	mode tcp
				354	balance roundrobin
				355	server tse-farm 192.168.1.10
				356	monitor-net 192.168.1.252/31
				357
				358	The "monitor-net" option instructs the proxies that any connection coming from
				359	192.168.1.252 or 192.168.1.253 will not be logged nor forwarded and will be
				360	closed immediately. The Alteon load-balancers will then see the proxies alive
				361	without perturbating the service.
				362
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	363	Config on the Alteon :
				364	----------------------
				365
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	366	/c/l3/if 1
				367	ena
				368	addr 192.168.1.252
				369	mask 255.255.255.0
				370	/c/slb/real 11
				371	ena
				372	name "LB1"
				373	rip 192.168.1.1
				374	/c/slb/real 12
				375	ena
				376	name "LB2"
				377	rip 192.168.1.2
				378	/c/slb/group 10
				379	name "LB1-2"
				380	metric roundrobin
				381	health tcp
				382	add 11
				383	add 12
				384	/c/slb/virt 10
				385	ena
				386	vip 172.16.1.1
				387	/c/slb/virt 10/service 1494
				388	group 10
				389	/c/slb/virt 10/service 3389
				390	group 10
				391	/c/slb/virt 10/service 5900
				392	group 10
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	393
				394
				395	=========================================================
				396	3. Simple HTTP/HTTPS load-balancing with cookie insertion
				397	=========================================================
				398
				399	This is the same context as in example 1 above, but the web
				400	server uses HTTPS.
				401
				402	+-------+
				403	\|clients\| clients
				404	+---+---+
				405	\|
				406	-+-----+--------+----
				407	\| _\|_db
				408	+--+--+ (___)
				409	\| SSL \| (___)
				410	\| web \| (___)
				411	+-----+
				412	192.168.1.1 192.168.1.2
				413
				414
				415	Since haproxy does not handle SSL, this part will have to be extracted from the
				416	servers (freeing even more ressources) and installed on the load-balancer
				417	itself. Install haproxy and apache+mod_ssl on the old box which will spread the
				418	load between the new boxes. Apache will work in SSL reverse-proxy-cache. If the
				419	application is correctly developped, it might even lower its load. However,
				420	since there now is a cache between the clients and haproxy, some security
				421	measures must be taken to ensure that inserted cookies will not be cached.
				422
				423
				424	192.168.1.1 192.168.1.11-192.168.1.14 192.168.1.2
				425	-------+-----------+-----+-----+-----+--------+----
				426	\| \| \| \| \| _\|_db
				427	+--+--+ +-+-+ +-+-+ +-+-+ +-+-+ (___)
				428	\| LB1 \| \| A \| \| B \| \| C \| \| D \| (___)
				429	+-----+ +---+ +---+ +---+ +---+ (___)
				430	apache 4 cheap web servers
				431	mod_ssl
				432	haproxy
				433
				434
				435	Config on haproxy (LB1) :
				436	-------------------------
				437
				438	listen 127.0.0.1:8000
				439	mode http
				440	balance roundrobin
				441	cookie SERVERID insert indirect nocache
				442	option httpchk HEAD /index.html HTTP/1.0
				443	server webA 192.168.1.11:80 cookie A check
				444	server webB 192.168.1.12:80 cookie B check
				445	server webC 192.168.1.13:80 cookie C check
				446	server webD 192.168.1.14:80 cookie D check
				447
				448
				449	Description :
				450	-------------
				451	- apache on LB1 will receive clients requests on port 443
				452	- it forwards it to haproxy bound to 127.0.0.1:8000
				453	- if a request does not contain a cookie, it will be forwarded to a valid
				454	server
				455	- in return, a cookie "SERVERID" will be inserted in the response holding the
				456	server name (eg: "A"), and a "Cache-control: private" header will be added
				457	so that the apache does not cache any page containing such cookie.
				458	- when the client comes again with the cookie "SERVERID=A", LB1 will know that
				459	it must be forwarded to server A. The cookie will be removed so that the
				460	server does not see it.
				461	- if server "webA" dies, the requests will be sent to another valid server
				462	and a cookie will be reassigned.
				463
				464	Notes :
				465	-------
				466	- if the cookie works in "prefix" mode, there is no need to add the "nocache"
				467	option because it is an application cookie which will be modified, and the
				468	application flags will be preserved.
				469	- if apache 1.3 is used as a front-end before haproxy, it always disables
				470	HTTP keep-alive on the back-end, so there is no need for the "httpclose"
				471	option on haproxy.
				472	- configure apache to set the X-Forwarded-For header itself, and do not do
				473	it on haproxy if you need the application to know about the client's IP.
				474
				475
				476	Flows :
				477	-------
				478
				479	(apache) (haproxy) (server A)
				480	>-- GET /URI1 HTTP/1.0 ------------> \|
				481	( no cookie, haproxy forwards in load-balancing mode. )
				482	\| >-- GET /URI1 HTTP/1.0 ---------->
				483	\| <-- HTTP/1.0 200 OK -------------<
				484	( the proxy now adds the server cookie in return )
				485	<-- HTTP/1.0 200 OK ---------------< \|
				486	Set-Cookie: SERVERID=A \|
				487	Cache-Control: private \|
				488	>-- GET /URI2 HTTP/1.0 ------------> \|
				489	Cookie: SERVERID=A \|
				490	( the proxy sees the cookie. it forwards to server A and deletes it )
				491	\| >-- GET /URI2 HTTP/1.0 ---------->
				492	\| <-- HTTP/1.0 200 OK -------------<
				493	( the proxy does not add the cookie in return because the client knows it )
				494	<-- HTTP/1.0 200 OK ---------------< \|
				495	>-- GET /URI3 HTTP/1.0 ------------> \|
				496	Cookie: SERVERID=A \|
				497	( ... )
				498
				499
				500
				501	========================================
				502	4. Soft-stop for application maintenance
				503	========================================
				504
				505	When an application is spread across several severs, the time to update all
				506	instances increases, so the application seems jerky for a longer period.
				507
				508	HAproxy offers several solutions for this. Although it cannot be reconfigured
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	509	without being stopped, nor does it offer any external command, there are other
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	510	working solutions.
				511
				512
				513	=========================================
				514	4.1 Soft-stop using a file on the servers
				515	=========================================
				516
				517	This trick is quite common and very simple: put a file on the server which will
				518	be checked by the proxy. When you want to stop the server, first remove this
				519	file. The proxy will see the server as failed, and will not send it any new
				520	session, only the old ones if the "persist" option is used. Wait a bit then
				521	stop the server when it does not receive anymore connections.
				522
				523
				524	listen 192.168.1.1:80
				525	mode http
				526	balance roundrobin
				527	cookie SERVERID insert indirect
				528	option httpchk HEAD /running HTTP/1.0
				529	server webA 192.168.1.11:80 cookie A check inter 2000 rise 2 fall 2
				530	server webB 192.168.1.12:80 cookie B check inter 2000 rise 2 fall 2
				531	server webC 192.168.1.13:80 cookie C check inter 2000 rise 2 fall 2
				532	server webD 192.168.1.14:80 cookie D check inter 2000 rise 2 fall 2
				533	option persist
				534	redispatch
				535	contimeout 5000
				536
				537
				538	Description :
				539	-------------
				540	- every 2 seconds, haproxy will try to access the file "/running" on the
				541	servers, and declare the server as down after 2 attempts (4 seconds).
				542	- only the servers which respond with a 200 or 3XX response will be used.
				543	- if a request does not contain a cookie, it will be forwarded to a valid
				544	server
				545	- if a request contains a cookie for a failed server, haproxy will insist
				546	on trying to reach the server anyway, to let the user finish what he was
				547	doing. ("persist" option)
				548	- if the server is totally stopped, the connection will fail and the proxy
				549	will rebalance the client to another server ("redispatch")
				550
				551	Usage on the web servers :
				552	--------------------------
				553	- to start the server :
				554	# /etc/init.d/httpd start
				555	# touch /home/httpd/www/running
				556
				557	- to soft-stop the server
				558	# rm -f /home/httpd/www/running
				559
				560	- to completely stop the server :
				561	# /etc/init.d/httpd stop
				562
				563	Limits
				564	------
				565	If the server is totally powered down, the proxy will still try to reach it
				566	for those clients who still have a cookie referencing it, and the connection
				567	attempt will expire after 5 seconds ("contimeout"), and only after that, the
				568	client will be redispatched to another server. So this mode is only useful
				569	for software updates where the server will suddenly refuse the connection
				570	because the process is stopped. The problem is the same if the server suddenly
				571	crashes. All of its users will be fairly perturbated.
				572
				573
				574	==================================
				575	4.2 Soft-stop using backup servers
				576	==================================
				577
				578	A better solution which covers every situation is to use backup servers.
				579	Version 1.1.30 fixed a bug which prevented a backup server from sharing
				580	the same cookie as a standard server.
				581
				582
				583	listen 192.168.1.1:80
				584	mode http
				585	balance roundrobin
				586	redispatch
				587	cookie SERVERID insert indirect
				588	option httpchk HEAD / HTTP/1.0
				589	server webA 192.168.1.11:80 cookie A check port 81 inter 2000
				590	server webB 192.168.1.12:80 cookie B check port 81 inter 2000
				591	server webC 192.168.1.13:80 cookie C check port 81 inter 2000
				592	server webD 192.168.1.14:80 cookie D check port 81 inter 2000
				593
				594	server bkpA 192.168.1.11:80 cookie A check port 80 inter 2000 backup
				595	server bkpB 192.168.1.12:80 cookie B check port 80 inter 2000 backup
				596	server bkpC 192.168.1.13:80 cookie C check port 80 inter 2000 backup
				597	server bkpD 192.168.1.14:80 cookie D check port 80 inter 2000 backup
				598
				599	Description
				600	-----------
				601	Four servers webA..D are checked on their port 81 every 2 seconds. The same
				602	servers named bkpA..D are checked on the port 80, and share the exact same
				603	cookies. Those servers will only be used when no other server is available
				604	for the same cookie.
				605
				606	When the web servers are started, only the backup servers are seen as
				607	available. On the web servers, you need to redirect port 81 to local
				608	port 80, either with a local proxy (eg: a simple haproxy tcp instance),
				609	or with iptables (linux) or pf (openbsd). This is because we want the
				610	real web server to reply on this port, and not a fake one. Eg, with
				611	iptables :
				612
				613	# /etc/init.d/httpd start
				614	# iptables -t nat -A PREROUTING -p tcp --dport 81 -j REDIRECT --to-port 80
				615
				616	A few seconds later, the standard server is seen up and haproxy starts to send
				617	it new requests on its real port 80 (only new users with no cookie, of course).
				618
				619	If a server completely crashes (even if it does not respond at the IP level),
				620	both the standard and backup servers will fail, so clients associated to this
				621	server will be redispatched to other live servers and will lose their sessions.
				622
				623	Now if you want to enter a server into maintenance, simply stop it from
				624	responding on port 81 so that its standard instance will be seen as failed,
				625	but the backup will still work. Users will not notice anything since the
				626	service is still operational :
				627
				628	# iptables -t nat -D PREROUTING -p tcp --dport 81 -j REDIRECT --to-port 80
				629
				630	The health checks on port 81 for this server will quickly fail, and the
				631	standard server will be seen as failed. No new session will be sent to this
				632	server, and existing clients with a valid cookie will still reach it because
				633	the backup server will still be up.
				634
				635	Now wait as long as you want for the old users to stop using the service, and
				636	once you see that the server does not receive any traffic, simply stop it :
				637
				638	# /etc/init.d/httpd stop
				639
				640	The associated backup server will in turn fail, and if any client still tries
				641	to access this particular server, he will be redispatched to any other valid
				642	server because of the "redispatch" option.
				643
				644	This method has an advantage : you never touch the proxy when doing server
				645	maintenance. The people managing the servers can make them disappear smoothly.
				646
				647
				648	4.2.1 Variations for operating systems without any firewall software
				649	--------------------------------------------------------------------
				650
				651	The downside is that you need a redirection solution on the server just for
				652	the health-checks. If the server OS does not support any firewall software,
				653	this redirection can also be handled by a simple haproxy in tcp mode :
				654
				655	global
				656	daemon
				657	quiet
				658	pidfile /var/run/haproxy-checks.pid
				659	listen 0.0.0.0:81
				660	mode tcp
				661	dispatch 127.0.0.1:80
				662	contimeout 1000
				663	clitimeout 10000
				664	srvtimeout 10000
				665
				666	To start the web service :
				667
				668	# /etc/init.d/httpd start
				669	# haproxy -f /etc/haproxy/haproxy-checks.cfg
				670
				671	To soft-stop the service :
				672
				673	# kill $(</var/run/haproxy-checks.pid)
				674
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	675	The port 81 will stop responding and the load-balancer will notice the failure.
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	676
				677
				678	4.2.2 Centralizing the server management
				679	----------------------------------------
				680
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	681	If one finds it preferable to manage the servers from the load-balancer itself,
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	682	the port redirector can be installed on the load-balancer itself. See the
				683	example with iptables below.
				684
				685	Make the servers appear as operational :
				686	# iptables -t nat -A OUTPUT -d 192.168.1.11 -p tcp --dport 81 -j DNAT --to-dest :80
				687	# iptables -t nat -A OUTPUT -d 192.168.1.12 -p tcp --dport 81 -j DNAT --to-dest :80
				688	# iptables -t nat -A OUTPUT -d 192.168.1.13 -p tcp --dport 81 -j DNAT --to-dest :80
				689	# iptables -t nat -A OUTPUT -d 192.168.1.14 -p tcp --dport 81 -j DNAT --to-dest :80
				690
				691	Soft stop one server :
				692	# iptables -t nat -D OUTPUT -d 192.168.1.12 -p tcp --dport 81 -j DNAT --to-dest :80
				693
				694	Another solution is to use the "COMAFILE" patch provided by Alexander Lazic,
				695	which is available for download here :
				696
				697	http://w.ods.org/tools/haproxy/contrib/
				698
				699
				700	4.2.3 Notes :
				701	-------------
				702	- Never, ever, start a fake service on port 81 for the health-checks, because
				703	a real web service failure will not be detected as long as the fake service
				704	runs. You must really forward the check port to the real application.
				705
				706	- health-checks will be sent twice as often, once for each standard server,
				707	and once for reach backup server. All this will be multiplicated by the
willy tarreau	c5f73ed	2005-12-18 01:26:38 +0100	[diff] [blame^]	708	number of processes if you use multi-process mode. You will have to ensure
				709	that all the checks sent to the server do not overload it.
willy tarreau	0174f31	2005-12-18 01:02:42 +0100	[diff] [blame]	710
				711
				712	==================================================
				713	5. Multi-site load-balancing with local preference
				714	==================================================
				715
				716	5.1 Description of the problem
				717	==============================
				718
				719	Consider a world-wide company with sites on several continents. There are two
				720	production sites SITE1 and SITE2 which host identical applications. There are
				721	many offices around the world. For speed and communication cost reasons, each
				722	office uses the nearest site by default, but can switch to the backup site in
				723	the event of a site or application failure. There also are users on the
				724	production sites, which use their local sites by default, but can switch to the
				725	other site in case of a local application failure.
				726
				727	The main constraints are :
				728
				729	- application persistence : although the application is the same on both
				730	sites, there is no session synchronisation between the sites. A failure
				731	of one server or one site can cause a user to switch to another server
				732	or site, but when the server or site comes back, the user must not switch
				733	again.
				734
				735	- communication costs : inter-site communication should be reduced to the
				736	minimum. Specifically, in case of a local application failure, every
				737	office should be able to switch to the other site without continuing to
				738	use the default site.
				739
				740	5.2 Solution
				741	============
				742	- Each production site will have two haproxy load-balancers in front of its
				743	application servers to balance the load across them and provide local HA.
				744	We will call them "S1L1" and "S1L2" on site 1, and "S2L1" and "S2L2" on
				745	site 2. These proxies will extend the application's JSESSIONID cookie to
				746	put the server name as a prefix.
				747
				748	- Each production site will have one front-end haproxy director to provide
				749	the service to local users and to remote offices. It will load-balance
				750	across the two local load-balancers, and will use the other site's
				751	load-balancers as backup servers. It will insert the local site identifier
				752	in a SITE cookie for the local load-balancers, and the remote site
				753	identifier for the remote load-balancers. These front-end directors will
				754	be called "SD1" and "SD2" for "Site Director".
				755
				756	- Each office will have one haproxy near the border gateway which will direct
				757	local users to their preference site by default, or to the backup site in
				758	the event of a previous failure. It will also analyze the SITE cookie, and
				759	direct the users to the site referenced in the cookie. Thus, the preferred
				760	site will be declared as a normal server, and the backup site will be
				761	declared as a backup server only, which will only be used when the primary
				762	site is unreachable, or when the primary site's director has forwarded
				763	traffic to the second site. These proxies will be called "OP1".."OPXX"
				764	for "Office Proxy #XX".
				765
				766
				767	5.3 Network diagram
				768	===================
				769
				770	Note : offices 1 and 2 are on the same continent as site 1, while
				771	office 3 is on the same continent as site 3. Each production
				772	site can reach the second one either through the WAN or through
				773	a dedicated link.
				774
				775
				776	Office1 Office2 Office3
				777	users users users
				778	192.168 # # # 192.168 # # # # # #
				779	.1.0/24 \| \| \| .2.0/24 \| \| \| 192.168.3.0/24 \| \| \|
				780	--+----+-+-+- --+----+-+-+- ---+----+-+-+-
				781	\| \| .1 \| \| .1 \| \| .1
				782	\| +-+-+ \| +-+-+ \| +-+-+
				783	\| \|OP1\| \| \|OP2\| \| \|OP3\| ...
				784	,-:-. +---+ ,-:-. +---+ ,-:-. +---+
				785	( X ) ( X ) ( X )
				786	`-:-' `-:-' ,---. `-:-'
				787	--+---------------+------+----~~~( X )~~~~-------+---------+-
				788	\| `---' \|
				789	\| \|
				790	+---+ ,-:-. +---+ ,-:-.
				791	\|SD1\| ( X ) \|SD2\| ( X )
				792	( SITE 1 ) +-+-+ `-:-' ( SITE 2 ) +-+-+ `-:-'
				793	\|.1 \| \|.1 \|
				794	10.1.1.0/24 \| \| ,---. 10.2.1.0/24 \| \|
				795	-+-+-+-+-+-+-+-----+-+--( X )------+-+-+-+-+-+-+-----+-+--
				796	\| \| \| \| \| \| \| `---' \| \| \| \| \| \| \|
				797	...# # # # # \|.11 \|.12 ...# # # # # \|.11 \|.12
				798	Site 1 +-+--+ +-+--+ Site 2 +-+--+ +-+--+
				799	Local \|S1L1\| \|S1L2\| Local \|S2L1\| \|S2L2\|
				800	users +-+--+ +--+-+ users +-+--+ +--+-+
				801	\| \| \| \|
				802	10.1.2.0/24 -+-+-+--+--++-- 10.2.2.0/24 -+-+-+--+--++--
				803	\|.1 \|.4 \|.1 \|.4
				804	+-+-+ +-+-+ +-+-+ +-+-+
				805	\|W11\| ~~~ \|W14\| \|W21\| ~~~ \|W24\|
				806	+---+ +---+ +---+ +---+
				807	4 application servers 4 application servers
				808	on site 1 on site 2
				809
				810
				811
				812	5.4 Description
				813	===============
				814
				815	5.4.1 Local users
				816	-----------------
				817	- Office 1 users connect to OP1 = 192.168.1.1
				818	- Office 2 users connect to OP2 = 192.168.2.1
				819	- Office 3 users connect to OP3 = 192.168.3.1
				820	- Site 1 users connect to SD1 = 10.1.1.1
				821	- Site 2 users connect to SD2 = 10.2.1.1
				822
				823	5.4.2 Office proxies
				824	--------------------
				825	- Office 1 connects to site 1 by default and uses site 2 as a backup.
				826	- Office 2 connects to site 1 by default and uses site 2 as a backup.
				827	- Office 3 connects to site 2 by default and uses site 1 as a backup.
				828
				829	The offices check the local site's SD proxy every 30 seconds, and the
				830	remote one every 60 seconds.
				831
				832
				833	Configuration for Office Proxy OP1
				834	----------------------------------
				835
				836	listen 192.168.1.1:80
				837	mode http
				838	balance roundrobin
				839	redispatch
				840	cookie SITE
				841	option httpchk HEAD / HTTP/1.0
				842	server SD1 10.1.1.1:80 cookie SITE1 check inter 30000
				843	server SD2 10.2.1.1:80 cookie SITE2 check inter 60000 backup
				844
				845
				846	Configuration for Office Proxy OP2
				847	----------------------------------
				848
				849	listen 192.168.2.1:80
				850	mode http
				851	balance roundrobin
				852	redispatch
				853	cookie SITE
				854	option httpchk HEAD / HTTP/1.0
				855	server SD1 10.1.1.1:80 cookie SITE1 check inter 30000
				856	server SD2 10.2.1.1:80 cookie SITE2 check inter 60000 backup
				857
				858
				859	Configuration for Office Proxy OP3
				860	----------------------------------
				861
				862	listen 192.168.3.1:80
				863	mode http
				864	balance roundrobin
				865	redispatch
				866	cookie SITE
				867	option httpchk HEAD / HTTP/1.0
				868	server SD2 10.2.1.1:80 cookie SITE2 check inter 30000
				869	server SD1 10.1.1.1:80 cookie SITE1 check inter 60000 backup
				870
				871
				872	5.4.3 Site directors ( SD1 and SD2 )
				873	------------------------------------
				874	The site directors forward traffic to the local load-balancers, and set a
				875	cookie to identify the site. If no local load-balancer is available, or if
				876	the local application servers are all down, it will redirect traffic to the
				877	remote site, and report this in the SITE cookie. In order not to uselessly
				878	load each site's WAN link, each SD will check the other site at a lower
				879	rate. The site directors will also insert their client's address so that
				880	the application server knows which local user or remote site accesses it.
				881
				882	The SITE cookie which is set by these directors will also be understood
				883	by the office proxies. This is important because if SD1 decides to forward
				884	traffic to site 2, it will write "SITE2" in the "SITE" cookie, and on next
				885	request, the office proxy will automatically and directly talk to SITE2 if
				886	it can reach it. If it cannot, it will still send the traffic to SITE1
				887	where SD1 will in turn try to reach SITE2.
				888
				889	The load-balancers checks are performed on port 81. As we'll see further,
				890	the load-balancers provide a health monitoring port 81 which reroutes to
				891	port 80 but which allows them to tell the SD that they are going down soon
				892	and that the SD must not use them anymore.
				893
				894
				895	Configuration for SD1
				896	---------------------
				897
				898	listen 10.1.1.1:80
				899	mode http
				900	balance roundrobin
				901	redispatch
				902	cookie SITE insert indirect
				903	option httpchk HEAD / HTTP/1.0
				904	option forwardfor
				905	server S1L1 10.1.1.11:80 cookie SITE1 check port 81 inter 4000
				906	server S1L2 10.1.1.12:80 cookie SITE1 check port 81 inter 4000
				907	server S2L1 10.2.1.11:80 cookie SITE2 check port 81 inter 8000 backup
				908	server S2L2 10.2.1.12:80 cookie SITE2 check port 81 inter 8000 backup
				909
				910	Configuration for SD2
				911	---------------------
				912
				913	listen 10.2.1.1:80
				914	mode http
				915	balance roundrobin
				916	redispatch
				917	cookie SITE insert indirect
				918	option httpchk HEAD / HTTP/1.0
				919	option forwardfor
				920	server S2L1 10.2.1.11:80 cookie SITE2 check port 81 inter 4000
				921	server S2L2 10.2.1.12:80 cookie SITE2 check port 81 inter 4000
				922	server S1L1 10.1.1.11:80 cookie SITE1 check port 81 inter 8000 backup
				923	server S1L2 10.1.1.12:80 cookie SITE1 check port 81 inter 8000 backup
				924
				925
				926	5.4.4 Local load-balancers S1L1, S1L2, S2L1, S2L2
				927	-------------------------------------------------
				928	Please first note that because SD1 and SD2 use the same cookie for both
				929	servers on a same site, the second load-balancer of each site will only
				930	receive load-balanced requests, but as soon as the SITE cookie will be
				931	set, only the first LB will receive the requests because it will be the
				932	first one to match the cookie.
				933
				934	The load-balancers will spread the load across 4 local web servers, and
				935	use the JSESSIONID provided by the application to provide server persistence
				936	using the new 'prefix' method. Soft-stop will also be implemented as described
				937	in section 4 above. Moreover, these proxies will provide their own maintenance
				938	soft-stop. Port 80 will be used for application traffic, while port 81 will
				939	only be used for health-checks and locally rerouted to port 80. A grace time
				940	will be specified to service on port 80, but not on port 81. This way, a soft
				941	kill (kill -USR1) on the proxy will only kill the health-check forwarder so
				942	that the site director knows it must not use this load-balancer anymore. But
				943	the service will still work for 20 seconds and as long as there are established
				944	sessions.
				945
				946	These proxies will also be the only ones to disable HTTP keep-alive in the
				947	chain, because it is enough to do it at one place, and it's necessary to do
				948	it with 'prefix' cookies.
				949
				950	Configuration for S1L1/S1L2
				951	---------------------------
				952
				953	listen 10.1.1.11:80 # 10.1.1.12:80 for S1L2
				954	grace 20000 # don't kill us until 20 seconds have elapsed
				955	mode http
				956	balance roundrobin
				957	cookie JSESSIONID prefix
				958	option httpclose
				959	option forwardfor
				960	option httpchk HEAD / HTTP/1.0
				961	server W11 10.1.2.1:80 cookie W11 check port 81 inter 2000
				962	server W12 10.1.2.2:80 cookie W12 check port 81 inter 2000
				963	server W13 10.1.2.3:80 cookie W13 check port 81 inter 2000
				964	server W14 10.1.2.4:80 cookie W14 check port 81 inter 2000
				965
				966	server B11 10.1.2.1:80 cookie W11 check port 80 inter 4000 backup
				967	server B12 10.1.2.2:80 cookie W12 check port 80 inter 4000 backup
				968	server B13 10.1.2.3:80 cookie W13 check port 80 inter 4000 backup
				969	server B14 10.1.2.4:80 cookie W14 check port 80 inter 4000 backup
				970
				971	listen 10.1.1.11:81 # 10.1.1.12:81 for S1L2
				972	mode tcp
				973	dispatch 10.1.1.11:80 # 10.1.1.12:80 for S1L2
				974
				975
				976	Configuration for S2L1/S2L2
				977	---------------------------
				978
				979	listen 10.2.1.11:80 # 10.2.1.12:80 for S2L2
				980	grace 20000 # don't kill us until 20 seconds have elapsed
				981	mode http
				982	balance roundrobin
				983	cookie JSESSIONID prefix
				984	option httpclose
				985	option forwardfor
				986	option httpchk HEAD / HTTP/1.0
				987	server W21 10.2.2.1:80 cookie W21 check port 81 inter 2000
				988	server W22 10.2.2.2:80 cookie W22 check port 81 inter 2000
				989	server W23 10.2.2.3:80 cookie W23 check port 81 inter 2000
				990	server W24 10.2.2.4:80 cookie W24 check port 81 inter 2000
				991
				992	server B21 10.2.2.1:80 cookie W21 check port 80 inter 4000 backup
				993	server B22 10.2.2.2:80 cookie W22 check port 80 inter 4000 backup
				994	server B23 10.2.2.3:80 cookie W23 check port 80 inter 4000 backup
				995	server B24 10.2.2.4:80 cookie W24 check port 80 inter 4000 backup
				996
				997	listen 10.2.1.11:81 # 10.2.1.12:81 for S2L2
				998	mode tcp
				999	dispatch 10.2.1.11:80 # 10.2.1.12:80 for S2L2
				1000
				1001
				1002	5.5 Comments
				1003	------------
				1004	Since each site director sets a cookie identifying the site, remote office
				1005	users will have their office proxies direct them to the right site and stick
				1006	to this site as long as the user still uses the application and the site is
				1007	available. Users on production sites will be directed to the right site by the
				1008	site directors depending on the SITE cookie.
				1009
				1010	If the WAN link dies on a production site, the remote office users will not
				1011	see their site anymore, so they will redirect the traffic to the second site.
				1012	If there are dedicated inter-site links as on the diagram above, the second
				1013	SD will see the cookie and still be able to reach the original site. For
				1014	example :
				1015
				1016	Office 1 user sends the following to OP1 :
				1017	GET / HTTP/1.0
				1018	Cookie: SITE=SITE1; JSESSIONID=W14~123;
				1019
				1020	OP1 cannot reach site 1 because its external router is dead. So the SD1 server
				1021	is seen as dead, and OP1 will then forward the request to SD2 on site 2,
				1022	regardless of the SITE cookie.
				1023
				1024	SD2 on site 2 receives a SITE cookie containing "SITE1". Fortunately, it
				1025	can reach Site 1's load balancers S1L1 and S1L2. So it forwards the request
				1026	so S1L1 (the first one with the same cookie).
				1027
				1028	S1L1 (on site 1) finds "W14" in the JSESSIONID cookie, so it can forward the
				1029	request to the right server, and the user session will continue to work. Once
				1030	the Site 1's WAN link comes back, OP1 will see SD1 again, and will not route
				1031	through SITE 2 anymore.
				1032
				1033	However, when a new user on Office 1 connects to the application during a
				1034	site 1 failure, it does not contain any cookie. Since OP1 does not see SD1
				1035	because of the network failure, it will direct the request to SD2 on site 2,
				1036	which will by default direct the traffic to the local load-balancers, S2L1 and
				1037	S2L2. So only initial users will load the inter-site link, not the new ones.
				1038
				1039
				1040	===================
				1041	6. Source balancing
				1042	===================
				1043
				1044	Sometimes it may reveal useful to access servers from a pool of IP addresses
				1045	instead of only one or two. Some equipments (NAT firewalls, load-balancers)
				1046	are sensible to source address, and often need many sources to distribute the
				1047	load evenly amongst their internal hash buckets.
				1048
				1049	To do this, you simply have to use several times the same server with a
				1050	different source. Example :
				1051
				1052	listen 0.0.0.0:80
				1053	mode tcp
				1054	balance roundrobin
				1055	server from1to1 10.1.1.1:80 source 10.1.2.1
				1056	server from2to1 10.1.1.1:80 source 10.1.2.2
				1057	server from3to1 10.1.1.1:80 source 10.1.2.3
				1058	server from4to1 10.1.1.1:80 source 10.1.2.4
				1059	server from5to1 10.1.1.1:80 source 10.1.2.5
				1060	server from6to1 10.1.1.1:80 source 10.1.2.6
				1061	server from7to1 10.1.1.1:80 source 10.1.2.7
				1062	server from8to1 10.1.1.1:80 source 10.1.2.8
				1063