MINOR: polling: add an option to support busy polling

In some situations, especially when dealing with low latency on processors
supporting variable frequencies or when running inside virtual machines,
each time the process waits for an I/O event in the poller, the processor
goes back to sleep or is handed to another VM for a long time, causing
excessively high latencies.

The solution provided by this patch is a global option to enable busy
polling. When busy polling is enabled, the pollers never sleep; instead
they loop, waiting for an I/O event to happen or for a timeout to expire.
On multi-processor machines this can significantly overheat the processor,
but it usually results in much lower latencies.

A typical test consisting of injecting traffic over a single connection at
a time over the loopback shows a bump from 4640 to 8540 connections per
second on forwarded connections, indicating a latency reduction of 98
microseconds for each connection, and a bump from 12500 to 21250 for
locally terminated connections (redirects), indicating a reduction of
33 microseconds.

It is only usable with epoll and kqueue, because the select() and poll()
APIs are not suited to this usage, and the performance levels at which
these pollers are used would not benefit from it anyway.

The option, which obviously remains disabled by default, can be turned
on using "busy-polling" in the global section, and turned off later
using "no busy-polling". Its status is reported in "show info" to help
troubleshoot suspicious CPU spikes.
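For illustration, enabling the option in a configuration would look like
the sketch below (the surrounding layout is illustrative, not part of this
patch):

```
global
    # enable busy polling (disabled by default)
    busy-polling

    # it may later be forcibly disabled again with:
    # no busy-polling
```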
diff --git a/doc/configuration.txt b/doc/configuration.txt
index 2ce9616..b349464 100644
--- a/doc/configuration.txt
+++ b/doc/configuration.txt
@@ -1294,6 +1294,25 @@
 3.2. Performance tuning
 -----------------------
 
+busy-polling
+  In some situations, especially when dealing with low latency on processors
+  supporting a variable frequency or when running inside virtual machines, each
+  time the process waits for an I/O using the poller, the processor goes back
+  to sleep or is offered to another VM for a long time, and it causes
+  excessively high latencies. This option provides a solution preventing the
+  processor from sleeping by always using a null timeout on the pollers. This
+  results in a significant latency reduction (30 to 100 microseconds observed)
+  at the expense of a risk to overheat the processor. It may even be used with
+  threads, in which case improperly bound threads may heavily conflict,
+  resulting in worse performance and high values for the CPU stolen fields
+  in "show info" output, indicating which threads are misconfigured. It is
+  important not to let the process run on the same processor as the network
+  interrupts when this option is used. It is also better to avoid using it on
+  multiple CPU threads sharing the same core. This option is disabled by
+  default. If it has been enabled, it may still be forcibly disabled by
+  prefixing it with the "no" keyword. It is ignored by the "select" and
+  "poll" pollers.
+
 max-spread-checks <delay in milliseconds>
   By default, haproxy tries to spread the start of health checks across the
   smallest health check interval of all the servers in a farm. The principle is