Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 1 | ------------------------ |
| 2 | HAProxy Management Guide |
| 3 | ------------------------ |
Willy Tarreau | eaded98 | 2022-12-01 15:25:34 +0100 | [diff] [blame] | 4 | version 2.8 |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 5 | |
| 6 | |
| 7 | This document describes how to start, stop, manage, and troubleshoot HAProxy, |
| 8 | as well as some known limitations and traps to avoid. It does not describe how |
| 9 | to configure it (for this please read configuration.txt). |
| 10 | |
| 11 | Note to documentation contributors : |
| 12 | This document is formatted with 80 columns per line, with even number of |
| 13 | spaces for indentation and without tabs. Please follow these rules strictly |
| 14 | so that it remains easily printable everywhere. If you add sections, please |
| 15 | update the summary below for easier searching. |
| 16 | |
| 17 | |
| 18 | Summary |
| 19 | ------- |
| 20 | |
| 21 | 1. Prerequisites |
| 22 | 2. Quick reminder about HAProxy's architecture |
| 23 | 3. Starting HAProxy |
| 24 | 4. Stopping and restarting HAProxy |
| 25 | 5. File-descriptor limitations |
| 26 | 6. Memory management |
| 27 | 7. CPU usage |
| 28 | 8. Logging |
| 29 | 9. Statistics and monitoring |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 30 | 9.1. CSV format |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 31 | 9.2. Typed output format |
| 32 | 9.3. Unix Socket commands |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 33 | 9.4. Master CLI |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 34 | 9.4.1. Master CLI commands |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 35 | 10. Tricks for easier configuration management |
| 36 | 11. Well-known traps to avoid |
| 37 | 12. Debugging and performance issues |
| 38 | 13. Security considerations |
| 39 | |
| 40 | |
| 41 | 1. Prerequisites |
| 42 | ---------------- |
| 43 | |
| 44 | In this document it is assumed that the reader has sufficient administration |
| 45 | skills on a UNIX-like operating system, uses the shell on a daily basis and is |
| 46 | familiar with troubleshooting utilities such as strace and tcpdump. |
| 47 | |
| 48 | |
| 49 | 2. Quick reminder about HAProxy's architecture |
| 50 | ---------------------------------------------- |
| 51 | |
Willy Tarreau | 3f36448 | 2019-02-27 15:01:46 +0100 | [diff] [blame] | 52 | HAProxy is a multi-threaded, event-driven, non-blocking daemon. This means is |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 53 | uses event multiplexing to schedule all of its activities instead of relying on |
| 54 | the system to schedule between multiple activities. Most of the time it runs as |
| 55 | a single process, so the output of "ps aux" on a system will report only one |
| 56 | "haproxy" process, unless a soft reload is in progress and an older process is |
| 57 | finishing its job in parallel to the new one. It is thus always easy to trace |
Willy Tarreau | 3f36448 | 2019-02-27 15:01:46 +0100 | [diff] [blame] | 58 | its activity using the strace utility. In order to scale with the number of |
| 59 | available processors, by default haproxy will start one worker thread per |
| 60 | processor it is allowed to run on. Unless explicitly configured differently, |
| 61 | the incoming traffic is spread over all these threads, all running the same |
| 62 | event loop. A great care is taken to limit inter-thread dependencies to the |
| 63 | strict minimum, so as to try to achieve near-linear scalability. This has some |
| 64 | impacts such as the fact that a given connection is served by a single thread. |
| 65 | Thus in order to use all available processing capacity, it is needed to have at |
| 66 | least as many connections as there are threads, which is almost always granted. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 67 | |
| 68 | HAProxy is designed to isolate itself into a chroot jail during startup, where |
| 69 | it cannot perform any file-system access at all. This is also true for the |
| 70 | libraries it depends on (eg: libc, libssl, etc). The immediate effect is that |
| 71 | a running process will not be able to reload a configuration file to apply |
| 72 | changes, instead a new process will be started using the updated configuration |
| 73 | file. Some other less obvious effects are that some timezone files or resolver |
| 74 | files the libc might attempt to access at run time will not be found, though |
| 75 | this should generally not happen as they're not needed after startup. A nice |
| 76 | consequence of this principle is that the HAProxy process is totally stateless, |
| 77 | and no cleanup is needed after it's killed, so any killing method that works |
| 78 | will do the right thing. |
| 79 | |
| 80 | HAProxy doesn't write log files, but it relies on the standard syslog protocol |
| 81 | to send logs to a remote server (which is often located on the same system). |
| 82 | |
| 83 | HAProxy uses its internal clock to enforce timeouts, that is derived from the |
| 84 | system's time but where unexpected drift is corrected. This is done by limiting |
| 85 | the time spent waiting in poll() for an event, and measuring the time it really |
| 86 | took. In practice it never waits more than one second. This explains why, when |
| 87 | running strace over a completely idle process, periodic calls to poll() (or any |
| 88 | of its variants) surrounded by two gettimeofday() calls are noticed. They are |
| 89 | normal, completely harmless and so cheap that the load they imply is totally |
| 90 | undetectable at the system scale, so there's nothing abnormal there. Example : |
| 91 | |
| 92 | 16:35:40.002320 gettimeofday({1442759740, 2605}, NULL) = 0 |
| 93 | 16:35:40.002942 epoll_wait(0, {}, 200, 1000) = 0 |
| 94 | 16:35:41.007542 gettimeofday({1442759741, 7641}, NULL) = 0 |
| 95 | 16:35:41.007998 gettimeofday({1442759741, 8114}, NULL) = 0 |
| 96 | 16:35:41.008391 epoll_wait(0, {}, 200, 1000) = 0 |
| 97 | 16:35:42.011313 gettimeofday({1442759742, 11411}, NULL) = 0 |
| 98 | |
| 99 | HAProxy is a TCP proxy, not a router. It deals with established connections that |
| 100 | have been validated by the kernel, and not with packets of any form nor with |
| 101 | sockets in other states (eg: no SYN_RECV nor TIME_WAIT), though their existence |
| 102 | may prevent it from binding a port. It relies on the system to accept incoming |
| 103 | connections and to initiate outgoing connections. An immediate effect of this is |
| 104 | that there is no relation between packets observed on the two sides of a |
| 105 | forwarded connection, which can be of different size, numbers and even family. |
| 106 | Since a connection may only be accepted from a socket in LISTEN state, all the |
| 107 | sockets it is listening to are necessarily visible using the "netstat" utility |
| 108 | to show listening sockets. Example : |
| 109 | |
| 110 | # netstat -ltnp |
| 111 | Active Internet connections (only servers) |
| 112 | Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name |
| 113 | tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN 1629/sshd |
| 114 | tcp 0 0 0.0.0.0:80 0.0.0.0:* LISTEN 2847/haproxy |
| 115 | tcp 0 0 0.0.0.0:443 0.0.0.0:* LISTEN 2847/haproxy |
| 116 | |
| 117 | |
| 118 | 3. Starting HAProxy |
| 119 | ------------------- |
| 120 | |
| 121 | HAProxy is started by invoking the "haproxy" program with a number of arguments |
| 122 | passed on the command line. The actual syntax is : |
| 123 | |
| 124 | $ haproxy [<options>]* |
| 125 | |
| 126 | where [<options>]* is any number of options. An option always starts with '-' |
| 127 | followed by one of more letters, and possibly followed by one or multiple extra |
| 128 | arguments. Without any option, HAProxy displays the help page with a reminder |
| 129 | about supported options. Available options may vary slightly based on the |
| 130 | operating system. A fair number of these options overlap with an equivalent one |
| 131 | if the "global" section. In this case, the command line always has precedence |
| 132 | over the configuration file, so that the command line can be used to quickly |
| 133 | enforce some settings without touching the configuration files. The current |
| 134 | list of options is : |
| 135 | |
| 136 | -- <cfgfile>* : all the arguments following "--" are paths to configuration |
Maxime de Roucy | 379d9c7 | 2016-05-13 23:52:56 +0200 | [diff] [blame] | 137 | file/directory to be loaded and processed in the declaration order. It is |
| 138 | mostly useful when relying on the shell to load many files that are |
| 139 | numerically ordered. See also "-f". The difference between "--" and "-f" is |
| 140 | that one "-f" must be placed before each file name, while a single "--" is |
| 141 | needed before all file names. Both options can be used together, the |
| 142 | command line ordering still applies. When more than one file is specified, |
| 143 | each file must start on a section boundary, so the first keyword of each |
| 144 | file must be one of "global", "defaults", "peers", "listen", "frontend", |
| 145 | "backend", and so on. A file cannot contain just a server list for example. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 146 | |
Maxime de Roucy | 379d9c7 | 2016-05-13 23:52:56 +0200 | [diff] [blame] | 147 | -f <cfgfile|cfgdir> : adds <cfgfile> to the list of configuration files to be |
| 148 | loaded. If <cfgdir> is a directory, all the files (and only files) it |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 149 | contains are added in lexical order (using LC_COLLATE=C) to the list of |
Maxime de Roucy | 379d9c7 | 2016-05-13 23:52:56 +0200 | [diff] [blame] | 150 | configuration files to be loaded ; only files with ".cfg" extension are |
| 151 | added, only non hidden files (not prefixed with ".") are added. |
| 152 | Configuration files are loaded and processed in their declaration order. |
| 153 | This option may be specified multiple times to load multiple files. See |
| 154 | also "--". The difference between "--" and "-f" is that one "-f" must be |
| 155 | placed before each file name, while a single "--" is needed before all file |
| 156 | names. Both options can be used together, the command line ordering still |
| 157 | applies. When more than one file is specified, each file must start on a |
| 158 | section boundary, so the first keyword of each file must be one of |
| 159 | "global", "defaults", "peers", "listen", "frontend", "backend", and so on. |
| 160 | A file cannot contain just a server list for example. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 161 | |
| 162 | -C <dir> : changes to directory <dir> before loading configuration |
| 163 | files. This is useful when using relative paths. Warning when using |
| 164 | wildcards after "--" which are in fact replaced by the shell before |
| 165 | starting haproxy. |
| 166 | |
| 167 | -D : start as a daemon. The process detaches from the current terminal after |
| 168 | forking, and errors are not reported anymore in the terminal. It is |
| 169 | equivalent to the "daemon" keyword in the "global" section of the |
| 170 | configuration. It is recommended to always force it in any init script so |
| 171 | that a faulty configuration doesn't prevent the system from booting. |
| 172 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 173 | -L <name> : change the local peer name to <name>, which defaults to the local |
William Lallemand | daf4cd2 | 2018-04-17 16:46:13 +0200 | [diff] [blame] | 174 | hostname. This is used only with peers replication. You can use the |
| 175 | variable $HAPROXY_LOCALPEER in the configuration file to reference the |
| 176 | peer name. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 177 | |
| 178 | -N <limit> : sets the default per-proxy maxconn to <limit> instead of the |
| 179 | builtin default value (usually 2000). Only useful for debugging. |
| 180 | |
| 181 | -V : enable verbose mode (disables quiet mode). Reverts the effect of "-q" or |
| 182 | "quiet". |
| 183 | |
William Lallemand | e202b1e | 2017-06-01 17:38:56 +0200 | [diff] [blame] | 184 | -W : master-worker mode. It is equivalent to the "master-worker" keyword in |
| 185 | the "global" section of the configuration. This mode will launch a "master" |
| 186 | which will monitor the "workers". Using this mode, you can reload HAProxy |
| 187 | directly by sending a SIGUSR2 signal to the master. The master-worker mode |
| 188 | is compatible either with the foreground or daemon mode. It is |
| 189 | recommended to use this mode with multiprocess and systemd. |
| 190 | |
Pavlos Parissis | f65f257 | 2018-02-07 21:42:16 +0100 | [diff] [blame] | 191 | -Ws : master-worker mode with support of `notify` type of systemd service. |
| 192 | This option is only available when HAProxy was built with `USE_SYSTEMD` |
| 193 | build option enabled. |
| 194 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 195 | -c : only performs a check of the configuration files and exits before trying |
| 196 | to bind. The exit status is zero if everything is OK, or non-zero if an |
Willy Tarreau | bebd212 | 2020-04-15 16:06:11 +0200 | [diff] [blame] | 197 | error is encountered. Presence of warnings will be reported if any. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 198 | |
Maximilian Mader | fc0cceb | 2021-06-06 00:50:22 +0200 | [diff] [blame] | 199 | -cc : evaluates a condition as used within a conditional block of the |
| 200 | configuration. The exit status is zero if the condition is true, 1 if the |
| 201 | condition is false or 2 if an error is encountered. |
| 202 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 203 | -d : enable debug mode. This disables daemon mode, forces the process to stay |
Willy Tarreau | ccf4299 | 2020-10-09 19:15:03 +0200 | [diff] [blame] | 204 | in foreground and to show incoming and outgoing events. It must never be |
| 205 | used in an init script. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 206 | |
Erwan Le Goas | b0c0501 | 2022-09-14 17:51:55 +0200 | [diff] [blame] | 207 | -dC[key] : dump the configuration file. It is performed after the lines are |
| 208 | tokenized, so comments are stripped and indenting is forced. If a non-zero |
| 209 | key is specified, lines are truncated before sensitive/confidential fields, |
| 210 | and identifiers and addresses are emitted hashed with this key using the |
| 211 | same algorithmm as the one used by the anonymized mode on the CLI. This |
| 212 | means that the output may safely be shared with a developer who needs it |
| 213 | to figure what's happening in a dump that was anonymized using the same |
| 214 | key. Please also see the CLI's "set anon" command. |
| 215 | |
Amaury Denoyelle | 7b01a8d | 2021-03-29 10:29:07 +0200 | [diff] [blame] | 216 | -dD : enable diagnostic mode. This mode will output extra warnings about |
| 217 | suspicious configuration statements. This will never prevent startup even in |
| 218 | "zero-warning" mode nor change the exit status code. |
| 219 | |
Christopher Faulet | 678a4ce | 2023-02-14 16:12:54 +0100 | [diff] [blame] | 220 | -dF : disable data fast-forward. It is a mechanism to optimize the data |
| 221 | forwarding by passing data directly from a side to the other one without |
| 222 | waking the stream up. Thanks to this directive, it is possible to disable |
| 223 | this optimization. Note it also disable any kernel tcp splicing. This |
| 224 | command is not meant for regular use, it will generally only be suggested by |
| 225 | developers along complex debugging sessions. |
| 226 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 227 | -dG : disable use of getaddrinfo() to resolve host names into addresses. It |
| 228 | can be used when suspecting that getaddrinfo() doesn't work as expected. |
| 229 | This option was made available because many bogus implementations of |
| 230 | getaddrinfo() exist on various systems and cause anomalies that are |
| 231 | difficult to troubleshoot. |
| 232 | |
Willy Tarreau | 76871a4 | 2022-03-08 16:01:40 +0100 | [diff] [blame] | 233 | -dK<class[,class]*> : dumps the list of registered keywords in each class. |
| 234 | The list of classes is available with "-dKhelp". All classes may be dumped |
| 235 | using "-dKall", otherwise a selection of those shown in the help can be |
| 236 | specified as a comma-delimited list. The output format will vary depending |
| 237 | on what class of keywords is being dumped (e.g. "cfg" will show the known |
Willy Tarreau | 55b9689 | 2022-05-31 08:07:43 +0200 | [diff] [blame] | 238 | configuration keywords in a format resembling the config file format while |
Willy Tarreau | 76871a4 | 2022-03-08 16:01:40 +0100 | [diff] [blame] | 239 | "smp" will show sample fetch functions prefixed with a compatibility matrix |
| 240 | with each rule set). These may rarely be used as-is by humans but can be of |
| 241 | great help for external tools that try to detect the appearance of new |
| 242 | keywords at certain places to automatically update some documentation, |
| 243 | syntax highlighting files, configuration parsers, API etc. The output |
| 244 | format may evolve a bit over time so it is really recommended to use this |
| 245 | output mostly to detect differences with previous archives. Note that not |
| 246 | all keywords are listed because many keywords have existed long before the |
| 247 | different keyword registration subsystems were created, and they do not |
| 248 | appear there. However since new keywords are only added via the modern |
| 249 | mechanisms, it's reasonably safe to assume that this output may be used to |
| 250 | detect language additions with a good accuracy. The keywords are only |
| 251 | dumped after the configuration is fully parsed, so that even dynamically |
| 252 | created keywords can be dumped. A good way to dump and exit is to run a |
| 253 | silent config check on an existing configuration: |
| 254 | |
| 255 | ./haproxy -dKall -q -c -f foo.cfg |
| 256 | |
| 257 | If no configuration file is available, using "-f /dev/null" will work as |
| 258 | well to dump all default keywords, but then the return status will not be |
| 259 | zero since there will be no listener, and will have to be ignored. |
| 260 | |
Willy Tarreau | 654726d | 2021-12-28 15:43:11 +0100 | [diff] [blame] | 261 | -dL : dumps the list of dynamic shared libraries that are loaded at the end |
| 262 | of the config processing. This will generally also include deep dependencies |
| 263 | such as anything loaded from Lua code for example, as well as the executable |
| 264 | itself. The list is printed in a format that ought to be easy enough to |
| 265 | sanitize to directly produce a tarball of all dependencies. Since it doesn't |
| 266 | stop the program's startup, it is recommended to only use it in combination |
| 267 | with "-c" and "-q" where only the list of loaded objects will be displayed |
| 268 | (or nothing in case of error). In addition, keep in mind that when providing |
| 269 | such a package to help with a core file analysis, most libraries are in fact |
| 270 | symbolic links that need to be dereferenced when creating the archive: |
| 271 | |
| 272 | ./haproxy -W -q -c -dL -f foo.cfg | tar -T - -hzcf archive.tgz |
| 273 | |
Willy Tarreau | f4b79c4 | 2022-02-23 15:20:53 +0100 | [diff] [blame] | 274 | -dM[<byte>[,]][help|options,...] : forces memory poisoning, and/or changes |
| 275 | memory other debugging options. Memory poisonning means that each and every |
Willy Tarreau | bafbe01 | 2017-11-24 17:34:44 +0100 | [diff] [blame] | 276 | memory region allocated with malloc() or pool_alloc() will be filled with |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 277 | <byte> before being passed to the caller. When <byte> is not specified, it |
| 278 | defaults to 0x50 ('P'). While this slightly slows down operations, it is |
| 279 | useful to reliably trigger issues resulting from missing initializations in |
| 280 | the code that cause random crashes. Note that -dM0 has the effect of |
| 281 | turning any malloc() into a calloc(). In any case if a bug appears or |
| 282 | disappears when using this option it means there is a bug in haproxy, so |
Willy Tarreau | f4b79c4 | 2022-02-23 15:20:53 +0100 | [diff] [blame] | 283 | please report it. A number of other options are available either alone or |
| 284 | after a comma following the byte. The special option "help" will list the |
| 285 | currently supported options and their current value. Each debugging option |
| 286 | may be forced on or off. The most optimal options are usually chosen at |
| 287 | build time based on the operating system and do not need to be adjusted, |
| 288 | unless suggested by a developer. Supported debugging options include |
| 289 | (set/clear): |
| 290 | - fail / no-fail: |
| 291 | This enables randomly failing memory allocations, in conjunction with |
| 292 | the global "tune.fail-alloc" setting. This is used to detect missing |
| 293 | error checks in the code. |
| 294 | |
| 295 | - no-merge / merge: |
| 296 | By default, pools of very similar sizes are merged, resulting in more |
| 297 | efficiency, but this complicates the analysis of certain memory dumps. |
| 298 | This option allows to disable this mechanism, and may slightly increase |
| 299 | the memory usage. |
| 300 | |
| 301 | - cold-first / hot-first: |
| 302 | In order to optimize the CPU cache hit ratio, by default the most |
| 303 | recently released objects ("hot") are recycled for new allocations. |
| 304 | But doing so also complicates analysis of memory dumps and may hide |
| 305 | use-after-free bugs. This option allows to instead pick the coldest |
| 306 | objects first, which may result in a slight increase of CPU usage. |
| 307 | |
| 308 | - integrity / no-integrity: |
| 309 | When this option is enabled, memory integrity checks are enabled on |
| 310 | the allocated area to verify that it hasn't been modified since it was |
| 311 | last released. This works best with "no-merge", "cold-first" and "tag". |
| 312 | Enabling this option will slightly increase the CPU usage. |
| 313 | |
| 314 | - no-global / global: |
| 315 | Depending on the operating system, a process-wide global memory cache |
| 316 | may be enabled if it is estimated that the standard allocator is too |
| 317 | slow or inefficient with threads. This option allows to forcefully |
| 318 | disable it or enable it. Disabling it may result in a CPU usage |
| 319 | increase with inefficient allocators. Enabling it may result in a |
| 320 | higher memory usage with efficient allocators. |
| 321 | |
| 322 | - no-cache / cache: |
| 323 | Each thread uses a very fast local object cache for allocations, which |
| 324 | is always enabled by default. This option allows to disable it. Since |
| 325 | the global cache also passes via the local caches, this will |
| 326 | effectively result in disabling all caches and allocating directly from |
| 327 | the default allocator. This may result in a significant increase of CPU |
| 328 | usage, but may also result in small memory savings on tiny systems. |
| 329 | |
| 330 | - caller / no-caller: |
| 331 | Enabling this option reserves some extra space in each allocated object |
| 332 | to store the address of the last caller that allocated or released it. |
| 333 | This helps developers go back in time when analysing memory dumps and |
| 334 | to guess how something unexpected happened. |
| 335 | |
| 336 | - tag / no-tag: |
| 337 | Enabling this option reserves some extra space in each allocated object |
| 338 | to store a tag that allows to detect bugs such as double-free, freeing |
| 339 | an invalid object, and buffer overflows. It offers much stronger |
| 340 | reliability guarantees at the expense of 4 or 8 extra bytes per |
| 341 | allocation. It usually is the first step to detect memory corruption. |
| 342 | |
| 343 | - poison / no-poison: |
| 344 | Enabling this option will fill allocated objects with a fixed pattern |
| 345 | that will make sure that some accidental values such as 0 will not be |
| 346 | present if a newly added field was mistakenly forgotten in an |
| 347 | initialization routine. Such bugs tend to rarely reproduce, especially |
| 348 | when pools are not merged. This is normally enabled by directly passing |
| 349 | the byte's value to -dM but using this option allows to disable/enable |
| 350 | use of a previously set value. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 351 | |
| 352 | -dS : disable use of the splice() system call. It is equivalent to the |
| 353 | "global" section's "nosplice" keyword. This may be used when splice() is |
| 354 | suspected to behave improperly or to cause performance issues, or when |
| 355 | using strace to see the forwarded data (which do not appear when using |
| 356 | splice()). |
| 357 | |
| 358 | -dV : disable SSL verify on the server side. It is equivalent to having |
| 359 | "ssl-server-verify none" in the "global" section. This is useful when |
| 360 | trying to reproduce production issues out of the production |
| 361 | environment. Never use this in an init script as it degrades SSL security |
| 362 | to the servers. |
| 363 | |
Willy Tarreau | 3eb10b8 | 2020-04-15 16:42:39 +0200 | [diff] [blame] | 364 | -dW : if set, haproxy will refuse to start if any warning was emitted while |
| 365 | processing the configuration. This helps detect subtle mistakes and keep the |
| 366 | configuration clean and portable across versions. It is recommended to set |
| 367 | this option in service scripts when configurations are managed by humans, |
| 368 | but it is recommended not to use it with generated configurations, which |
| 369 | tend to emit more warnings. It may be combined with "-c" to cause warnings |
| 370 | in checked configurations to fail. This is equivalent to global option |
| 371 | "zero-warning". |
| 372 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 373 | -db : disable background mode and multi-process mode. The process remains in |
| 374 | foreground. It is mainly used during development or during small tests, as |
| 375 | Ctrl-C is enough to stop the process. Never use it in an init script. |
| 376 | |
| 377 | -de : disable the use of the "epoll" poller. It is equivalent to the "global" |
| 378 | section's keyword "noepoll". It is mostly useful when suspecting a bug |
| 379 | related to this poller. On systems supporting epoll, the fallback will |
| 380 | generally be the "poll" poller. |
| 381 | |
| 382 | -dk : disable the use of the "kqueue" poller. It is equivalent to the |
| 383 | "global" section's keyword "nokqueue". It is mostly useful when suspecting |
| 384 | a bug related to this poller. On systems supporting kqueue, the fallback |
| 385 | will generally be the "poll" poller. |
| 386 | |
| 387 | -dp : disable the use of the "poll" poller. It is equivalent to the "global" |
| 388 | section's keyword "nopoll". It is mostly useful when suspecting a bug |
| 389 | related to this poller. On systems supporting poll, the fallback will |
| 390 | generally be the "select" poller, which cannot be disabled and is limited |
| 391 | to 1024 file descriptors. |
| 392 | |
Willy Tarreau | 3eed10e | 2016-11-07 21:03:16 +0100 | [diff] [blame] | 393 | -dr : ignore server address resolution failures. It is very common when |
| 394 | validating a configuration out of production not to have access to the same |
| 395 | resolvers and to fail on server address resolution, making it difficult to |
| 396 | test a configuration. This option simply appends the "none" method to the |
| 397 | list of address resolution methods for all servers, ensuring that even if |
| 398 | the libc fails to resolve an address, the startup sequence is not |
| 399 | interrupted. |
| 400 | |
Willy Tarreau | 7006045 | 2015-12-14 12:46:07 +0100 | [diff] [blame] | 401 | -m <limit> : limit the total allocatable memory to <limit> megabytes across |
| 402 | all processes. This may cause some connection refusals or some slowdowns |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 403 | depending on the amount of memory needed for normal operations. This is |
Willy Tarreau | 7006045 | 2015-12-14 12:46:07 +0100 | [diff] [blame] | 404 | mostly used to force the processes to work in a constrained resource usage |
| 405 | scenario. It is important to note that the memory is not shared between |
| 406 | processes, so in a multi-process scenario, this value is first divided by |
| 407 | global.nbproc before forking. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 408 | |
| 409 | -n <limit> : limits the per-process connection limit to <limit>. This is |
| 410 | equivalent to the global section's keyword "maxconn". It has precedence |
| 411 | over this keyword. This may be used to quickly force lower limits to avoid |
| 412 | a service outage on systems where resource limits are too low. |
| 413 | |
| 414 | -p <file> : write all processes' pids into <file> during startup. This is |
| 415 | equivalent to the "global" section's keyword "pidfile". The file is opened |
| 416 | before entering the chroot jail, and after doing the chdir() implied by |
| 417 | "-C". Each pid appears on its own line. |
| 418 | |
| 419 | -q : set "quiet" mode. This disables some messages during the configuration |
| 420 | parsing and during startup. It can be used in combination with "-c" to |
| 421 | just check if a configuration file is valid or not. |
| 422 | |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 423 | -S <bind>[,bind_options...]: in master-worker mode, bind a master CLI, which |
| 424 | allows the access to every processes, running or leaving ones. |
| 425 | For security reasons, it is recommended to bind the master CLI to a local |
| 426 | UNIX socket. The bind options are the same as the keyword "bind" in |
| 427 | the configuration file with words separated by commas instead of spaces. |
| 428 | |
| 429 | Note that this socket can't be used to retrieve the listening sockets from |
| 430 | an old process during a seamless reload. |
| 431 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 432 | -sf <pid>* : send the "finish" signal (SIGUSR1) to older processes after boot |
| 433 | completion to ask them to finish what they are doing and to leave. <pid> |
| 434 | is a list of pids to signal (one per argument). The list ends on any |
| 435 | option starting with a "-". It is not a problem if the list of pids is |
| 436 | empty, so that it can be built on the fly based on the result of a command |
Amaury Denoyelle | fb37557 | 2023-02-01 09:28:32 +0100 | [diff] [blame] | 437 | like "pidof" or "pgrep". |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 438 | |
| 439 | -st <pid>* : send the "terminate" signal (SIGTERM) to older processes after |
| 440 | boot completion to terminate them immediately without finishing what they |
| 441 | were doing. <pid> is a list of pids to signal (one per argument). The list |
| 442 | is ends on any option starting with a "-". It is not a problem if the list |
| 443 | of pids is empty, so that it can be built on the fly based on the result of |
| 444 | a command like "pidof" or "pgrep". |
| 445 | |
| 446 | -v : report the version and build date. |
| 447 | |
| 448 | -vv : display the version, build options, libraries versions and usable |
| 449 | pollers. This output is systematically requested when filing a bug report. |
| 450 | |
Olivier Houchard | d33fc3a | 2017-04-05 22:50:59 +0200 | [diff] [blame] | 451 | -x <unix_socket> : connect to the specified socket and try to retrieve any |
| 452 | listening sockets from the old process, and use them instead of trying to |
| 453 | bind new ones. This is useful to avoid missing any new connection when |
William Lallemand | f6975e9 | 2017-05-26 17:42:10 +0200 | [diff] [blame] | 454 | reloading the configuration on Linux. The capability must be enable on the |
| 455 | stats socket using "expose-fd listeners" in your configuration. |
William Lallemand | 2be557f | 2021-11-24 18:45:37 +0100 | [diff] [blame] | 456 | In master-worker mode, the master will use this option upon a reload with |
| 457 | the "sockpair@" syntax, which allows the master to connect directly to a |
| 458 | worker without using stats socket declared in the configuration. |
Olivier Houchard | d33fc3a | 2017-04-05 22:50:59 +0200 | [diff] [blame] | 459 | |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 460 | A safe way to start HAProxy from an init file consists in forcing the daemon |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 461 | mode, storing existing pids to a pid file and using this pid file to notify |
| 462 | older processes to finish before leaving : |
| 463 | |
| 464 | haproxy -f /etc/haproxy.cfg \ |
| 465 | -D -p /var/run/haproxy.pid -sf $(cat /var/run/haproxy.pid) |
| 466 | |
| 467 | When the configuration is split into a few specific files (eg: tcp vs http), |
| 468 | it is recommended to use the "-f" option : |
| 469 | |
| 470 | haproxy -f /etc/haproxy/global.cfg -f /etc/haproxy/stats.cfg \ |
| 471 | -f /etc/haproxy/default-tcp.cfg -f /etc/haproxy/tcp.cfg \ |
| 472 | -f /etc/haproxy/default-http.cfg -f /etc/haproxy/http.cfg \ |
| 473 | -D -p /var/run/haproxy.pid -sf $(cat /var/run/haproxy.pid) |
| 474 | |
| 475 | When an unknown number of files is expected, such as customer-specific files, |
| 476 | it is recommended to assign them a name starting with a fixed-size sequence |
| 477 | number and to use "--" to load them, possibly after loading some defaults : |
| 478 | |
| 479 | haproxy -f /etc/haproxy/global.cfg -f /etc/haproxy/stats.cfg \ |
| 480 | -f /etc/haproxy/default-tcp.cfg -f /etc/haproxy/tcp.cfg \ |
| 481 | -f /etc/haproxy/default-http.cfg -f /etc/haproxy/http.cfg \ |
| 482 | -D -p /var/run/haproxy.pid -sf $(cat /var/run/haproxy.pid) \ |
| 483 | -f /etc/haproxy/default-customers.cfg -- /etc/haproxy/customers/* |
| 484 | |
| 485 | Sometimes a failure to start may happen for whatever reason. Then it is |
| 486 | important to verify if the version of HAProxy you are invoking is the expected |
| 487 | version and if it supports the features you are expecting (eg: SSL, PCRE, |
| 488 | compression, Lua, etc). This can be verified using "haproxy -vv". Some |
| 489 | important information such as certain build options, the target system and |
| 490 | the versions of the libraries being used are reported there. It is also what |
| 491 | you will systematically be asked for when posting a bug report : |
| 492 | |
| 493 | $ haproxy -vv |
Willy Tarreau | 58000fe | 2021-05-09 06:25:16 +0200 | [diff] [blame] | 494 | HAProxy version 1.6-dev7-a088d3-4 2015/10/08 |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 495 | Copyright 2000-2015 Willy Tarreau <willy@haproxy.org> |
| 496 | |
| 497 | Build options : |
| 498 | TARGET = linux2628 |
| 499 | CPU = generic |
| 500 | CC = gcc |
| 501 | CFLAGS = -pg -O0 -g -fno-strict-aliasing -Wdeclaration-after-statement \ |
| 502 | -DBUFSIZE=8030 -DMAXREWRITE=1030 -DSO_MARK=36 -DTCP_REPAIR=19 |
| 503 | OPTIONS = USE_ZLIB=1 USE_DLMALLOC=1 USE_OPENSSL=1 USE_LUA=1 USE_PCRE=1 |
| 504 | |
| 505 | Default settings : |
| 506 | maxconn = 2000, bufsize = 8030, maxrewrite = 1030, maxpollevents = 200 |
| 507 | |
| 508 | Encrypted password support via crypt(3): yes |
| 509 | Built with zlib version : 1.2.6 |
| 510 | Compression algorithms supported : identity("identity"), deflate("deflate"), \ |
| 511 | raw-deflate("deflate"), gzip("gzip") |
| 512 | Built with OpenSSL version : OpenSSL 1.0.1o 12 Jun 2015 |
| 513 | Running on OpenSSL version : OpenSSL 1.0.1o 12 Jun 2015 |
| 514 | OpenSSL library supports TLS extensions : yes |
| 515 | OpenSSL library supports SNI : yes |
| 516 | OpenSSL library supports prefer-server-ciphers : yes |
| 517 | Built with PCRE version : 8.12 2011-01-15 |
| 518 | PCRE library supports JIT : no (USE_PCRE_JIT not set) |
| 519 | Built with Lua version : Lua 5.3.1 |
| 520 | Built with transparent proxy support using: IP_TRANSPARENT IP_FREEBIND |
| 521 | |
| 522 | Available polling systems : |
| 523 | epoll : pref=300, test result OK |
| 524 | poll : pref=200, test result OK |
| 525 | select : pref=150, test result OK |
| 526 | Total: 3 (3 usable), will use epoll. |
| 527 | |
| 528 | The relevant information that many non-developer users can verify here are : |
| 529 | - the version : 1.6-dev7-a088d3-4 above means the code is currently at commit |
| 530 | ID "a088d3" which is the 4th one after after official version "1.6-dev7". |
| 531 | Version 1.6-dev7 would show as "1.6-dev7-8c1ad7". What matters here is in |
| 532 | fact "1.6-dev7". This is the 7th development version of what will become |
| 533 | version 1.6 in the future. A development version not suitable for use in |
| 534 | production (unless you know exactly what you are doing). A stable version |
| 535 | will show as a 3-numbers version, such as "1.5.14-16f863", indicating the |
| 536 | 14th level of fix on top of version 1.5. This is a production-ready version. |
| 537 | |
| 538 | - the release date : 2015/10/08. It is represented in the universal |
| 539 | year/month/day format. Here this means August 8th, 2015. Given that stable |
| 540 | releases are issued every few months (1-2 months at the beginning, sometimes |
| 541 | 6 months once the product becomes very stable), if you're seeing an old date |
| 542 | here, it means you're probably affected by a number of bugs or security |
| 543 | issues that have since been fixed and that it might be worth checking on the |
| 544 | official site. |
| 545 | |
| 546 | - build options : they are relevant to people who build their packages |
| 547 | themselves, they can explain why things are not behaving as expected. For |
| 548 | example the development version above was built for Linux 2.6.28 or later, |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 549 | targeting a generic CPU (no CPU-specific optimizations), and lacks any |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 550 | code optimization (-O0) so it will perform poorly in terms of performance. |
| 551 | |
| 552 | - libraries versions : zlib version is reported as found in the library |
| 553 | itself. In general zlib is considered a very stable product and upgrades |
| 554 | are almost never needed. OpenSSL reports two versions, the version used at |
| 555 | build time and the one being used, as found on the system. These ones may |
| 556 | differ by the last letter but never by the numbers. The build date is also |
| 557 | reported because most OpenSSL bugs are security issues and need to be taken |
| 558 | seriously, so this library absolutely needs to be kept up to date. Seeing a |
| 559 | 4-months old version here is highly suspicious and indeed an update was |
| 560 | missed. PCRE provides very fast regular expressions and is highly |
| 561 | recommended. Certain of its extensions such as JIT are not present in all |
| 562 | versions and still young so some people prefer not to build with them, |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 563 | which is why the build status is reported as well. Regarding the Lua |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 564 | scripting language, HAProxy expects version 5.3 which is very young since |
| 565 | it was released a little time before HAProxy 1.6. It is important to check |
| 566 | on the Lua web site if some fixes are proposed for this branch. |
| 567 | |
| 568 | - Available polling systems will affect the process's scalability when |
| 569 | dealing with more than about one thousand of concurrent connections. These |
| 570 | ones are only available when the correct system was indicated in the TARGET |
| 571 | variable during the build. The "epoll" mechanism is highly recommended on |
| 572 | Linux, and the kqueue mechanism is highly recommended on BSD. Lacking them |
| 573 | will result in poll() or even select() being used, causing a high CPU usage |
| 574 | when dealing with a lot of connections. |
| 575 | |
| 576 | |
| 577 | 4. Stopping and restarting HAProxy |
| 578 | ---------------------------------- |
| 579 | |
| 580 | HAProxy supports a graceful and a hard stop. The hard stop is simple, when the |
| 581 | SIGTERM signal is sent to the haproxy process, it immediately quits and all |
| 582 | established connections are closed. The graceful stop is triggered when the |
| 583 | SIGUSR1 signal is sent to the haproxy process. It consists in only unbinding |
| 584 | from listening ports, but continue to process existing connections until they |
| 585 | close. Once the last connection is closed, the process leaves. |
| 586 | |
| 587 | The hard stop method is used for the "stop" or "restart" actions of the service |
| 588 | management script. The graceful stop is used for the "reload" action which |
| 589 | tries to seamlessly reload a new configuration in a new process. |
| 590 | |
| 591 | Both of these signals may be sent by the new haproxy process itself during a |
| 592 | reload or restart, so that they are sent at the latest possible moment and only |
| 593 | if absolutely required. This is what is performed by the "-st" (hard) and "-sf" |
| 594 | (graceful) options respectively. |
| 595 | |
William Lallemand | e202b1e | 2017-06-01 17:38:56 +0200 | [diff] [blame] | 596 | In master-worker mode, it is not needed to start a new haproxy process in |
| 597 | order to reload the configuration. The master process reacts to the SIGUSR2 |
| 598 | signal by reexecuting itself with the -sf parameter followed by the PIDs of |
| 599 | the workers. The master will then parse the configuration file and fork new |
| 600 | workers. |
| 601 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 602 | To understand better how these signals are used, it is important to understand |
| 603 | the whole restart mechanism. |
| 604 | |
| 605 | First, an existing haproxy process is running. The administrator uses a system |
Jackie Tapia | 749f74c | 2020-07-22 18:59:40 -0500 | [diff] [blame] | 606 | specific command such as "/etc/init.d/haproxy reload" to indicate they want to |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 607 | take the new configuration file into effect. What happens then is the following. |
| 608 | First, the service script (/etc/init.d/haproxy or equivalent) will verify that |
| 609 | the configuration file parses correctly using "haproxy -c". After that it will |
| 610 | try to start haproxy with this configuration file, using "-st" or "-sf". |
| 611 | |
| 612 | Then HAProxy tries to bind to all listening ports. If some fatal errors happen |
| 613 | (eg: address not present on the system, permission denied), the process quits |
| 614 | with an error. If a socket binding fails because a port is already in use, then |
| 615 | the process will first send a SIGTTOU signal to all the pids specified in the |
| 616 | "-st" or "-sf" pid list. This is what is called the "pause" signal. It instructs |
| 617 | all existing haproxy processes to temporarily stop listening to their ports so |
| 618 | that the new process can try to bind again. During this time, the old process |
| 619 | continues to process existing connections. If the binding still fails (because |
| 620 | for example a port is shared with another daemon), then the new process sends a |
| 621 | SIGTTIN signal to the old processes to instruct them to resume operations just |
| 622 | as if nothing happened. The old processes will then restart listening to the |
Jonathon Lacher | c5b5e7b | 2021-08-04 00:29:05 -0500 | [diff] [blame] | 623 | ports and continue to accept connections. Note that this mechanism is system |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 624 | dependent and some operating systems may not support it in multi-process mode. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 625 | |
| 626 | If the new process manages to bind correctly to all ports, then it sends either |
| 627 | the SIGTERM (hard stop in case of "-st") or the SIGUSR1 (graceful stop in case |
| 628 | of "-sf") to all processes to notify them that it is now in charge of operations |
| 629 | and that the old processes will have to leave, either immediately or once they |
| 630 | have finished their job. |
| 631 | |
| 632 | It is important to note that during this timeframe, there are two small windows |
| 633 | of a few milliseconds each where it is possible that a few connection failures |
| 634 | will be noticed during high loads. Typically observed failure rates are around |
| 635 | 1 failure during a reload operation every 10000 new connections per second, |
| 636 | which means that a heavily loaded site running at 30000 new connections per |
| 637 | second may see about 3 failed connection upon every reload. The two situations |
| 638 | where this happens are : |
| 639 | |
| 640 | - if the new process fails to bind due to the presence of the old process, |
| 641 | it will first have to go through the SIGTTOU+SIGTTIN sequence, which |
| 642 | typically lasts about one millisecond for a few tens of frontends, and |
| 643 | during which some ports will not be bound to the old process and not yet |
| 644 | bound to the new one. HAProxy works around this on systems that support the |
| 645 | SO_REUSEPORT socket options, as it allows the new process to bind without |
| 646 | first asking the old one to unbind. Most BSD systems have been supporting |
| 647 | this almost forever. Linux has been supporting this in version 2.0 and |
| 648 | dropped it around 2.2, but some patches were floating around by then. It |
| 649 | was reintroduced in kernel 3.9, so if you are observing a connection |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 650 | failure rate above the one mentioned above, please ensure that your kernel |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 651 | is 3.9 or newer, or that relevant patches were backported to your kernel |
| 652 | (less likely). |
| 653 | |
| 654 | - when the old processes close the listening ports, the kernel may not always |
| 655 | redistribute any pending connection that was remaining in the socket's |
| 656 | backlog. Under high loads, a SYN packet may happen just before the socket |
| 657 | is closed, and will lead to an RST packet being sent to the client. In some |
| 658 | critical environments where even one drop is not acceptable, these ones are |
| 659 | sometimes dealt with using firewall rules to block SYN packets during the |
| 660 | reload, forcing the client to retransmit. This is totally system-dependent, |
| 661 | as some systems might be able to visit other listening queues and avoid |
| 662 | this RST. A second case concerns the ACK from the client on a local socket |
| 663 | that was in SYN_RECV state just before the close. This ACK will lead to an |
| 664 | RST packet while the haproxy process is still not aware of it. This one is |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 665 | harder to get rid of, though the firewall filtering rules mentioned above |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 666 | will work well if applied one second or so before restarting the process. |
| 667 | |
| 668 | For the vast majority of users, such drops will never ever happen since they |
| 669 | don't have enough load to trigger the race conditions. And for most high traffic |
| 670 | users, the failure rate is still fairly within the noise margin provided that at |
| 671 | least SO_REUSEPORT is properly supported on their systems. |
| 672 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 673 | 5. File-descriptor limitations |
| 674 | ------------------------------ |
| 675 | |
| 676 | In order to ensure that all incoming connections will successfully be served, |
| 677 | HAProxy computes at load time the total number of file descriptors that will be |
| 678 | needed during the process's life. A regular Unix process is generally granted |
| 679 | 1024 file descriptors by default, and a privileged process can raise this limit |
| 680 | itself. This is one reason for starting HAProxy as root and letting it adjust |
| 681 | the limit. The default limit of 1024 file descriptors roughly allow about 500 |
| 682 | concurrent connections to be processed. The computation is based on the global |
| 683 | maxconn parameter which limits the total number of connections per process, the |
| 684 | number of listeners, the number of servers which have a health check enabled, |
| 685 | the agent checks, the peers, the loggers and possibly a few other technical |
| 686 | requirements. A simple rough estimate of this number consists in simply |
| 687 | doubling the maxconn value and adding a few tens to get the approximate number |
| 688 | of file descriptors needed. |
| 689 | |
| 690 | Originally HAProxy did not know how to compute this value, and it was necessary |
| 691 | to pass the value using the "ulimit-n" setting in the global section. This |
| 692 | explains why even today a lot of configurations are seen with this setting |
| 693 | present. Unfortunately it was often miscalculated resulting in connection |
| 694 | failures when approaching maxconn instead of throttling incoming connection |
| 695 | while waiting for the needed resources. For this reason it is important to |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 696 | remove any vestigial "ulimit-n" setting that can remain from very old versions. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 697 | |
| 698 | Raising the number of file descriptors to accept even moderate loads is |
| 699 | mandatory but comes with some OS-specific adjustments. First, the select() |
| 700 | polling system is limited to 1024 file descriptors. In fact on Linux it used |
| 701 | to be capable of handling more but since certain OS ship with excessively |
| 702 | restrictive SELinux policies forbidding the use of select() with more than |
| 703 | 1024 file descriptors, HAProxy now refuses to start in this case in order to |
| 704 | avoid any issue at run time. On all supported operating systems, poll() is |
| 705 | available and will not suffer from this limitation. It is automatically picked |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 706 | so there is nothing to do to get a working configuration. But poll's becomes |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 707 | very slow when the number of file descriptors increases. While HAProxy does its |
| 708 | best to limit this performance impact (eg: via the use of the internal file |
| 709 | descriptor cache and batched processing), a good rule of thumb is that using |
| 710 | poll() with more than a thousand concurrent connections will use a lot of CPU. |
| 711 | |
| 712 | For Linux systems base on kernels 2.6 and above, the epoll() system call will |
| 713 | be used. It's a much more scalable mechanism relying on callbacks in the kernel |
| 714 | that guarantee a constant wake up time regardless of the number of registered |
| 715 | monitored file descriptors. It is automatically used where detected, provided |
| 716 | that HAProxy had been built for one of the Linux flavors. Its presence and |
| 717 | support can be verified using "haproxy -vv". |
| 718 | |
| 719 | For BSD systems which support it, kqueue() is available as an alternative. It |
| 720 | is much faster than poll() and even slightly faster than epoll() thanks to its |
| 721 | batched handling of changes. At least FreeBSD and OpenBSD support it. Just like |
| 722 | with Linux's epoll(), its support and availability are reported in the output |
| 723 | of "haproxy -vv". |
| 724 | |
| 725 | Having a good poller is one thing, but it is mandatory that the process can |
| 726 | reach the limits. When HAProxy starts, it immediately sets the new process's |
| 727 | file descriptor limits and verifies if it succeeds. In case of failure, it |
| 728 | reports it before forking so that the administrator can see the problem. As |
| 729 | long as the process is started by as root, there should be no reason for this |
| 730 | setting to fail. However, it can fail if the process is started by an |
| 731 | unprivileged user. If there is a compelling reason for *not* starting haproxy |
| 732 | as root (eg: started by end users, or by a per-application account), then the |
| 733 | file descriptor limit can be raised by the system administrator for this |
| 734 | specific user. The effectiveness of the setting can be verified by issuing |
| 735 | "ulimit -n" from the user's command line. It should reflect the new limit. |
| 736 | |
| 737 | Warning: when an unprivileged user's limits are changed in this user's account, |
| 738 | it is fairly common that these values are only considered when the user logs in |
| 739 | and not at all in some scripts run at system boot time nor in crontabs. This is |
| 740 | totally dependent on the operating system, keep in mind to check "ulimit -n" |
| 741 | before starting haproxy when running this way. The general advice is never to |
| 742 | start haproxy as an unprivileged user for production purposes. Another good |
| 743 | reason is that it prevents haproxy from enabling some security protections. |
| 744 | |
| 745 | Once it is certain that the system will allow the haproxy process to use the |
| 746 | requested number of file descriptors, two new system-specific limits may be |
| 747 | encountered. The first one is the system-wide file descriptor limit, which is |
| 748 | the total number of file descriptors opened on the system, covering all |
| 749 | processes. When this limit is reached, accept() or socket() will typically |
| 750 | return ENFILE. The second one is the per-process hard limit on the number of |
| 751 | file descriptors, it prevents setrlimit() from being set higher. Both are very |
| 752 | dependent on the operating system. On Linux, the system limit is set at boot |
| 753 | based on the amount of memory. It can be changed with the "fs.file-max" sysctl. |
| 754 | And the per-process hard limit is set to 1048576 by default, but it can be |
| 755 | changed using the "fs.nr_open" sysctl. |
| 756 | |
| 757 | File descriptor limitations may be observed on a running process when they are |
| 758 | set too low. The strace utility will report that accept() and socket() return |
| 759 | "-1 EMFILE" when the process's limits have been reached. In this case, simply |
| 760 | raising the "ulimit-n" value (or removing it) will solve the problem. If these |
| 761 | system calls return "-1 ENFILE" then it means that the kernel's limits have |
| 762 | been reached and that something must be done on a system-wide parameter. These |
| 763 | trouble must absolutely be addressed, as they result in high CPU usage (when |
| 764 | accept() fails) and failed connections that are generally visible to the user. |
| 765 | One solution also consists in lowering the global maxconn value to enforce |
| 766 | serialization, and possibly to disable HTTP keep-alive to force connections |
| 767 | to be released and reused faster. |
| 768 | |
| 769 | |
| 770 | 6. Memory management |
| 771 | -------------------- |
| 772 | |
| 773 | HAProxy uses a simple and fast pool-based memory management. Since it relies on |
| 774 | a small number of different object types, it's much more efficient to pick new |
| 775 | objects from a pool which already contains objects of the appropriate size than |
| 776 | to call malloc() for each different size. The pools are organized as a stack or |
| 777 | LIFO, so that newly allocated objects are taken from recently released objects |
| 778 | still hot in the CPU caches. Pools of similar sizes are merged together, in |
| 779 | order to limit memory fragmentation. |
| 780 | |
| 781 | By default, since the focus is set on performance, each released object is put |
| 782 | back into the pool it came from, and allocated objects are never freed since |
| 783 | they are expected to be reused very soon. |
| 784 | |
| 785 | On the CLI, it is possible to check how memory is being used in pools thanks to |
| 786 | the "show pools" command : |
| 787 | |
| 788 | > show pools |
| 789 | Dumping pools usage. Use SIGQUIT to flush them. |
Willy Tarreau | 0a93b64 | 2018-10-16 07:58:39 +0200 | [diff] [blame] | 790 | - Pool cache_st (16 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 1 users, @0x9ccc40=03 [SHARED] |
| 791 | - Pool pipe (32 bytes) : 5 allocated (160 bytes), 5 used, 0 failures, 2 users, @0x9ccac0=00 [SHARED] |
| 792 | - Pool comp_state (48 bytes) : 3 allocated (144 bytes), 3 used, 0 failures, 5 users, @0x9cccc0=04 [SHARED] |
| 793 | - Pool filter (64 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 3 users, @0x9ccbc0=02 [SHARED] |
| 794 | - Pool vars (80 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 2 users, @0x9ccb40=01 [SHARED] |
| 795 | - Pool uniqueid (128 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 2 users, @0x9cd240=15 [SHARED] |
| 796 | - Pool task (144 bytes) : 55 allocated (7920 bytes), 55 used, 0 failures, 1 users, @0x9cd040=11 [SHARED] |
| 797 | - Pool session (160 bytes) : 1 allocated (160 bytes), 1 used, 0 failures, 1 users, @0x9cd140=13 [SHARED] |
| 798 | - Pool h2s (208 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 2 users, @0x9ccec0=08 [SHARED] |
| 799 | - Pool h2c (288 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 1 users, @0x9cce40=07 [SHARED] |
| 800 | - Pool spoe_ctx (304 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 2 users, @0x9ccf40=09 [SHARED] |
| 801 | - Pool connection (400 bytes) : 2 allocated (800 bytes), 2 used, 0 failures, 1 users, @0x9cd1c0=14 [SHARED] |
| 802 | - Pool hdr_idx (416 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 1 users, @0x9cd340=17 [SHARED] |
| 803 | - Pool dns_resolut (480 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 1 users, @0x9ccdc0=06 [SHARED] |
| 804 | - Pool dns_answer_ (576 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 1 users, @0x9ccd40=05 [SHARED] |
| 805 | - Pool stream (960 bytes) : 1 allocated (960 bytes), 1 used, 0 failures, 1 users, @0x9cd0c0=12 [SHARED] |
| 806 | - Pool requri (1024 bytes) : 0 allocated (0 bytes), 0 used, 0 failures, 1 users, @0x9cd2c0=16 [SHARED] |
| 807 | - Pool buffer (8030 bytes) : 3 allocated (24090 bytes), 2 used, 0 failures, 1 users, @0x9cd3c0=18 [SHARED] |
| 808 | - Pool trash (8062 bytes) : 1 allocated (8062 bytes), 1 used, 0 failures, 1 users, @0x9cd440=19 |
| 809 | Total: 19 pools, 42296 bytes allocated, 34266 used. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 810 | |
| 811 | The pool name is only indicative, it's the name of the first object type using |
| 812 | this pool. The size in parenthesis is the object size for objects in this pool. |
| 813 | Object sizes are always rounded up to the closest multiple of 16 bytes. The |
| 814 | number of objects currently allocated and the equivalent number of bytes is |
| 815 | reported so that it is easy to know which pool is responsible for the highest |
| 816 | memory usage. The number of objects currently in use is reported as well in the |
| 817 | "used" field. The difference between "allocated" and "used" corresponds to the |
Willy Tarreau | 0a93b64 | 2018-10-16 07:58:39 +0200 | [diff] [blame] | 818 | objects that have been freed and are available for immediate use. The address |
| 819 | at the end of the line is the pool's address, and the following number is the |
| 820 | pool index when it exists, or is reported as -1 if no index was assigned. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 821 | |
| 822 | It is possible to limit the amount of memory allocated per process using the |
| 823 | "-m" command line option, followed by a number of megabytes. It covers all of |
| 824 | the process's addressable space, so that includes memory used by some libraries |
| 825 | as well as the stack, but it is a reliable limit when building a resource |
| 826 | constrained system. It works the same way as "ulimit -v" on systems which have |
| 827 | it, or "ulimit -d" for the other ones. |
| 828 | |
| 829 | If a memory allocation fails due to the memory limit being reached or because |
| 830 | the system doesn't have any enough memory, then haproxy will first start to |
| 831 | free all available objects from all pools before attempting to allocate memory |
| 832 | again. This mechanism of releasing unused memory can be triggered by sending |
| 833 | the signal SIGQUIT to the haproxy process. When doing so, the pools state prior |
| 834 | to the flush will also be reported to stderr when the process runs in |
| 835 | foreground. |
| 836 | |
| 837 | During a reload operation, the process switched to the graceful stop state also |
| 838 | automatically performs some flushes after releasing any connection so that all |
| 839 | possible memory is released to save it for the new process. |
| 840 | |
| 841 | |
| 842 | 7. CPU usage |
| 843 | ------------ |
| 844 | |
| 845 | HAProxy normally spends most of its time in the system and a smaller part in |
| 846 | userland. A finely tuned 3.5 GHz CPU can sustain a rate about 80000 end-to-end |
| 847 | connection setups and closes per second at 100% CPU on a single core. When one |
| 848 | core is saturated, typical figures are : |
| 849 | - 95% system, 5% user for long TCP connections or large HTTP objects |
| 850 | - 85% system and 15% user for short TCP connections or small HTTP objects in |
| 851 | close mode |
| 852 | - 70% system and 30% user for small HTTP objects in keep-alive mode |
| 853 | |
| 854 | The amount of rules processing and regular expressions will increase the user |
| 855 | land part. The presence of firewall rules, connection tracking, complex routing |
| 856 | tables in the system will instead increase the system part. |
| 857 | |
| 858 | On most systems, the CPU time observed during network transfers can be cut in 4 |
| 859 | parts : |
| 860 | - the interrupt part, which concerns all the processing performed upon I/O |
| 861 | receipt, before the target process is even known. Typically Rx packets are |
| 862 | accounted for in interrupt. On some systems such as Linux where interrupt |
| 863 | processing may be deferred to a dedicated thread, it can appear as softirq, |
| 864 | and the thread is called ksoftirqd/0 (for CPU 0). The CPU taking care of |
| 865 | this load is generally defined by the hardware settings, though in the case |
| 866 | of softirq it is often possible to remap the processing to another CPU. |
| 867 | This interrupt part will often be perceived as parasitic since it's not |
| 868 | associated with any process, but it actually is some processing being done |
| 869 | to prepare the work for the process. |
| 870 | |
| 871 | - the system part, which concerns all the processing done using kernel code |
| 872 | called from userland. System calls are accounted as system for example. All |
| 873 | synchronously delivered Tx packets will be accounted for as system time. If |
| 874 | some packets have to be deferred due to queues filling up, they may then be |
| 875 | processed in interrupt context later (eg: upon receipt of an ACK opening a |
| 876 | TCP window). |
| 877 | |
| 878 | - the user part, which exclusively runs application code in userland. HAProxy |
| 879 | runs exclusively in this part, though it makes heavy use of system calls. |
| 880 | Rules processing, regular expressions, compression, encryption all add to |
| 881 | the user portion of CPU consumption. |
| 882 | |
| 883 | - the idle part, which is what the CPU does when there is nothing to do. For |
| 884 | example HAProxy waits for an incoming connection, or waits for some data to |
| 885 | leave, meaning the system is waiting for an ACK from the client to push |
| 886 | these data. |
| 887 | |
| 888 | In practice regarding HAProxy's activity, it is in general reasonably accurate |
| 889 | (but totally inexact) to consider that interrupt/softirq are caused by Rx |
| 890 | processing in kernel drivers, that user-land is caused by layer 7 processing |
| 891 | in HAProxy, and that system time is caused by network processing on the Tx |
| 892 | path. |
| 893 | |
| 894 | Since HAProxy runs around an event loop, it waits for new events using poll() |
| 895 | (or any alternative) and processes all these events as fast as possible before |
| 896 | going back to poll() waiting for new events. It measures the time spent waiting |
| 897 | in poll() compared to the time spent doing processing events. The ratio of |
| 898 | polling time vs total time is called the "idle" time, it's the amount of time |
| 899 | spent waiting for something to happen. This ratio is reported in the stats page |
| 900 | on the "idle" line, or "Idle_pct" on the CLI. When it's close to 100%, it means |
| 901 | the load is extremely low. When it's close to 0%, it means that there is |
| 902 | constantly some activity. While it cannot be very accurate on an overloaded |
| 903 | system due to other processes possibly preempting the CPU from the haproxy |
| 904 | process, it still provides a good estimate about how HAProxy considers it is |
| 905 | working : if the load is low and the idle ratio is low as well, it may indicate |
| 906 | that HAProxy has a lot of work to do, possibly due to very expensive rules that |
| 907 | have to be processed. Conversely, if HAProxy indicates the idle is close to |
| 908 | 100% while things are slow, it means that it cannot do anything to speed things |
| 909 | up because it is already waiting for incoming data to process. In the example |
| 910 | below, haproxy is completely idle : |
| 911 | |
| 912 | $ echo "show info" | socat - /var/run/haproxy.sock | grep ^Idle |
| 913 | Idle_pct: 100 |
| 914 | |
| 915 | When the idle ratio starts to become very low, it is important to tune the |
| 916 | system and place processes and interrupts correctly to save the most possible |
| 917 | CPU resources for all tasks. If a firewall is present, it may be worth trying |
| 918 | to disable it or to tune it to ensure it is not responsible for a large part |
| 919 | of the performance limitation. It's worth noting that unloading a stateful |
| 920 | firewall generally reduces both the amount of interrupt/softirq and of system |
| 921 | usage since such firewalls act both on the Rx and the Tx paths. On Linux, |
| 922 | unloading the nf_conntrack and ip_conntrack modules will show whether there is |
| 923 | anything to gain. If so, then the module runs with default settings and you'll |
| 924 | have to figure how to tune it for better performance. In general this consists |
| 925 | in considerably increasing the hash table size. On FreeBSD, "pfctl -d" will |
| 926 | disable the "pf" firewall and its stateful engine at the same time. |
| 927 | |
| 928 | If it is observed that a lot of time is spent in interrupt/softirq, it is |
| 929 | important to ensure that they don't run on the same CPU. Most systems tend to |
| 930 | pin the tasks on the CPU where they receive the network traffic because for |
| 931 | certain workloads it improves things. But with heavily network-bound workloads |
| 932 | it is the opposite as the haproxy process will have to fight against its kernel |
| 933 | counterpart. Pinning haproxy to one CPU core and the interrupts to another one, |
| 934 | all sharing the same L3 cache tends to sensibly increase network performance |
| 935 | because in practice the amount of work for haproxy and the network stack are |
| 936 | quite close, so they can almost fill an entire CPU each. On Linux this is done |
| 937 | using taskset (for haproxy) or using cpu-map (from the haproxy config), and the |
| 938 | interrupts are assigned under /proc/irq. Many network interfaces support |
| 939 | multiple queues and multiple interrupts. In general it helps to spread them |
| 940 | across a small number of CPU cores provided they all share the same L3 cache. |
| 941 | Please always stop irq_balance which always does the worst possible thing on |
| 942 | such workloads. |
| 943 | |
| 944 | For CPU-bound workloads consisting in a lot of SSL traffic or a lot of |
| 945 | compression, it may be worth using multiple processes dedicated to certain |
| 946 | tasks, though there is no universal rule here and experimentation will have to |
| 947 | be performed. |
| 948 | |
| 949 | In order to increase the CPU capacity, it is possible to make HAProxy run as |
| 950 | several processes, using the "nbproc" directive in the global section. There |
| 951 | are some limitations though : |
| 952 | - health checks are run per process, so the target servers will get as many |
| 953 | checks as there are running processes ; |
| 954 | - maxconn values and queues are per-process so the correct value must be set |
| 955 | to avoid overloading the servers ; |
| 956 | - outgoing connections should avoid using port ranges to avoid conflicts |
| 957 | - stick-tables are per process and are not shared between processes ; |
| 958 | - each peers section may only run on a single process at a time ; |
| 959 | - the CLI operations will only act on a single process at a time. |
| 960 | |
| 961 | With this in mind, it appears that the easiest setup often consists in having |
| 962 | one first layer running on multiple processes and in charge for the heavy |
| 963 | processing, passing the traffic to a second layer running in a single process. |
| 964 | This mechanism is suited to SSL and compression which are the two CPU-heavy |
| 965 | features. Instances can easily be chained over UNIX sockets (which are cheaper |
fengpeiyuan | cc123c6 | 2016-01-15 16:40:53 +0800 | [diff] [blame] | 966 | than TCP sockets and which do not waste ports), and the proxy protocol which is |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 967 | useful to pass client information to the next stage. When doing so, it is |
| 968 | generally a good idea to bind all the single-process tasks to process number 1 |
| 969 | and extra tasks to next processes, as this will make it easier to generate |
| 970 | similar configurations for different machines. |
| 971 | |
| 972 | On Linux versions 3.9 and above, running HAProxy in multi-process mode is much |
| 973 | more efficient when each process uses a distinct listening socket on the same |
| 974 | IP:port ; this will make the kernel evenly distribute the load across all |
| 975 | processes instead of waking them all up. Please check the "process" option of |
| 976 | the "bind" keyword lines in the configuration manual for more information. |
| 977 | |
| 978 | |
| 979 | 8. Logging |
| 980 | ---------- |
| 981 | |
| 982 | For logging, HAProxy always relies on a syslog server since it does not perform |
| 983 | any file-system access. The standard way of using it is to send logs over UDP |
| 984 | to the log server (by default on port 514). Very commonly this is configured to |
| 985 | 127.0.0.1 where the local syslog daemon is running, but it's also used over the |
| 986 | network to log to a central server. The central server provides additional |
| 987 | benefits especially in active-active scenarios where it is desirable to keep |
| 988 | the logs merged in arrival order. HAProxy may also make use of a UNIX socket to |
| 989 | send its logs to the local syslog daemon, but it is not recommended at all, |
| 990 | because if the syslog server is restarted while haproxy runs, the socket will |
| 991 | be replaced and new logs will be lost. Since HAProxy will be isolated inside a |
| 992 | chroot jail, it will not have the ability to reconnect to the new socket. It |
| 993 | has also been observed in field that the log buffers in use on UNIX sockets are |
| 994 | very small and lead to lost messages even at very light loads. But this can be |
| 995 | fine for testing however. |
| 996 | |
| 997 | It is recommended to add the following directive to the "global" section to |
| 998 | make HAProxy log to the local daemon using facility "local0" : |
| 999 | |
| 1000 | log 127.0.0.1:514 local0 |
| 1001 | |
| 1002 | and then to add the following one to each "defaults" section or to each frontend |
| 1003 | and backend section : |
| 1004 | |
| 1005 | log global |
| 1006 | |
| 1007 | This way, all logs will be centralized through the global definition of where |
| 1008 | the log server is. |
| 1009 | |
| 1010 | Some syslog daemons do not listen to UDP traffic by default, so depending on |
| 1011 | the daemon being used, the syntax to enable this will vary : |
| 1012 | |
| 1013 | - on sysklogd, you need to pass argument "-r" on the daemon's command line |
| 1014 | so that it listens to a UDP socket for "remote" logs ; note that there is |
| 1015 | no way to limit it to address 127.0.0.1 so it will also receive logs from |
| 1016 | remote systems ; |
| 1017 | |
| 1018 | - on rsyslogd, the following lines must be added to the configuration file : |
| 1019 | |
| 1020 | $ModLoad imudp |
| 1021 | $UDPServerAddress * |
| 1022 | $UDPServerRun 514 |
| 1023 | |
| 1024 | - on syslog-ng, a new source can be created the following way, it then needs |
| 1025 | to be added as a valid source in one of the "log" directives : |
| 1026 | |
| 1027 | source s_udp { |
| 1028 | udp(ip(127.0.0.1) port(514)); |
| 1029 | }; |
| 1030 | |
| 1031 | Please consult your syslog daemon's manual for more information. If no logs are |
| 1032 | seen in the system's log files, please consider the following tests : |
| 1033 | |
| 1034 | - restart haproxy. Each frontend and backend logs one line indicating it's |
| 1035 | starting. If these logs are received, it means logs are working. |
| 1036 | |
| 1037 | - run "strace -tt -s100 -etrace=sendmsg -p <haproxy's pid>" and perform some |
| 1038 | activity that you expect to be logged. You should see the log messages |
| 1039 | being sent using sendmsg() there. If they don't appear, restart using |
| 1040 | strace on top of haproxy. If you still see no logs, it definitely means |
| 1041 | that something is wrong in your configuration. |
| 1042 | |
| 1043 | - run tcpdump to watch for port 514, for example on the loopback interface if |
| 1044 | the traffic is being sent locally : "tcpdump -As0 -ni lo port 514". If the |
| 1045 | packets are seen there, it's the proof they're sent then the syslogd daemon |
| 1046 | needs to be troubleshooted. |
| 1047 | |
| 1048 | While traffic logs are sent from the frontends (where the incoming connections |
| 1049 | are accepted), backends also need to be able to send logs in order to report a |
| 1050 | server state change consecutive to a health check. Please consult HAProxy's |
| 1051 | configuration manual for more information regarding all possible log settings. |
| 1052 | |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 1053 | It is convenient to chose a facility that is not used by other daemons. HAProxy |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 1054 | examples often suggest "local0" for traffic logs and "local1" for admin logs |
| 1055 | because they're never seen in field. A single facility would be enough as well. |
| 1056 | Having separate logs is convenient for log analysis, but it's also important to |
| 1057 | remember that logs may sometimes convey confidential information, and as such |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 1058 | they must not be mixed with other logs that may accidentally be handed out to |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 1059 | unauthorized people. |
| 1060 | |
| 1061 | For in-field troubleshooting without impacting the server's capacity too much, |
| 1062 | it is recommended to make use of the "halog" utility provided with HAProxy. |
| 1063 | This is sort of a grep-like utility designed to process HAProxy log files at |
| 1064 | a very fast data rate. Typical figures range between 1 and 2 GB of logs per |
| 1065 | second. It is capable of extracting only certain logs (eg: search for some |
| 1066 | classes of HTTP status codes, connection termination status, search by response |
| 1067 | time ranges, look for errors only), count lines, limit the output to a number |
| 1068 | of lines, and perform some more advanced statistics such as sorting servers |
| 1069 | by response time or error counts, sorting URLs by time or count, sorting client |
| 1070 | addresses by access count, and so on. It is pretty convenient to quickly spot |
| 1071 | anomalies such as a bot looping on the site, and block them. |
| 1072 | |
| 1073 | |
| 1074 | 9. Statistics and monitoring |
| 1075 | ---------------------------- |
| 1076 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1077 | It is possible to query HAProxy about its status. The most commonly used |
| 1078 | mechanism is the HTTP statistics page. This page also exposes an alternative |
| 1079 | CSV output format for monitoring tools. The same format is provided on the |
| 1080 | Unix socket. |
| 1081 | |
Amaury Denoyelle | 072f97e | 2020-10-05 11:49:37 +0200 | [diff] [blame] | 1082 | Statistics are regroup in categories labelled as domains, corresponding to the |
Ilya Shipitsin | 2272d8a | 2020-12-21 01:22:40 +0500 | [diff] [blame] | 1083 | multiple components of HAProxy. There are two domains available: proxy and dns. |
Amaury Denoyelle | fbd0bc9 | 2020-10-05 11:49:46 +0200 | [diff] [blame] | 1084 | If not specified, the proxy domain is selected. Note that only the proxy |
| 1085 | statistics are printed on the HTTP page. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1086 | |
| 1087 | 9.1. CSV format |
| 1088 | --------------- |
| 1089 | |
| 1090 | The statistics may be consulted either from the unix socket or from the HTTP |
| 1091 | page. Both means provide a CSV format whose fields follow. The first line |
| 1092 | begins with a sharp ('#') and has one word per comma-delimited field which |
| 1093 | represents the title of the column. All other lines starting at the second one |
| 1094 | use a classical CSV format using a comma as the delimiter, and the double quote |
| 1095 | ('"') as an optional text delimiter, but only if the enclosed text is ambiguous |
| 1096 | (if it contains a quote or a comma). The double-quote character ('"') in the |
| 1097 | text is doubled ('""'), which is the format that most tools recognize. Please |
| 1098 | do not insert any column before these ones in order not to break tools which |
| 1099 | use hard-coded column positions. |
| 1100 | |
Amaury Denoyelle | 50660a8 | 2020-10-05 11:49:39 +0200 | [diff] [blame] | 1101 | For proxy statistics, after each field name, the types which may have a value |
| 1102 | for that field are specified in brackets. The types are L (Listeners), F |
| 1103 | (Frontends), B (Backends), and S (Servers). There is a fixed set of static |
| 1104 | fields that are always available in the same order. A column containing the |
| 1105 | character '-' delimits the end of the static fields, after which presence or |
| 1106 | order of the fields are not guaranteed. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1107 | |
Amaury Denoyelle | 50660a8 | 2020-10-05 11:49:39 +0200 | [diff] [blame] | 1108 | Here is the list of static fields using the proxy statistics domain: |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1109 | 0. pxname [LFBS]: proxy name |
| 1110 | 1. svname [LFBS]: service name (FRONTEND for frontend, BACKEND for backend, |
| 1111 | any name for server/listener) |
| 1112 | 2. qcur [..BS]: current queued requests. For the backend this reports the |
| 1113 | number queued without a server assigned. |
| 1114 | 3. qmax [..BS]: max value of qcur |
| 1115 | 4. scur [LFBS]: current sessions |
| 1116 | 5. smax [LFBS]: max sessions |
| 1117 | 6. slim [LFBS]: configured session limit |
Willy Tarreau | c73810f | 2016-01-11 13:52:04 +0100 | [diff] [blame] | 1118 | 7. stot [LFBS]: cumulative number of sessions |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1119 | 8. bin [LFBS]: bytes in |
| 1120 | 9. bout [LFBS]: bytes out |
| 1121 | 10. dreq [LFB.]: requests denied because of security concerns. |
| 1122 | - For tcp this is because of a matched tcp-request content rule. |
| 1123 | - For http this is because of a matched http-request or tarpit rule. |
| 1124 | 11. dresp [LFBS]: responses denied because of security concerns. |
| 1125 | - For http this is because of a matched http-request rule, or |
| 1126 | "option checkcache". |
| 1127 | 12. ereq [LF..]: request errors. Some of the possible causes are: |
| 1128 | - early termination from the client, before the request has been sent. |
| 1129 | - read error from the client |
| 1130 | - client timeout |
| 1131 | - client closed connection |
| 1132 | - various bad requests from the client. |
| 1133 | - request was tarpitted. |
| 1134 | 13. econ [..BS]: number of requests that encountered an error trying to |
| 1135 | connect to a backend server. The backend stat is the sum of the stat |
| 1136 | for all servers of that backend, plus any connection errors not |
| 1137 | associated with a particular server (such as the backend having no |
| 1138 | active servers). |
| 1139 | 14. eresp [..BS]: response errors. srv_abrt will be counted here also. |
| 1140 | Some other errors are: |
| 1141 | - write error on the client socket (won't be counted for the server stat) |
| 1142 | - failure applying filters to the response. |
| 1143 | 15. wretr [..BS]: number of times a connection to a server was retried. |
| 1144 | 16. wredis [..BS]: number of times a request was redispatched to another |
| 1145 | server. The server value counts the number of times that server was |
| 1146 | switched away from. |
Willy Tarreau | b96dd28 | 2016-11-09 14:45:51 +0100 | [diff] [blame] | 1147 | 17. status [LFBS]: status (UP/DOWN/NOLB/MAINT/MAINT(via)/MAINT(resolution)...) |
Willy Tarreau | bd71510 | 2020-10-23 22:44:30 +0200 | [diff] [blame] | 1148 | 18. weight [..BS]: total effective weight (backend), effective weight (server) |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1149 | 19. act [..BS]: number of active servers (backend), server is active (server) |
| 1150 | 20. bck [..BS]: number of backup servers (backend), server is backup (server) |
| 1151 | 21. chkfail [...S]: number of failed checks. (Only counts checks failed when |
| 1152 | the server is up.) |
| 1153 | 22. chkdown [..BS]: number of UP->DOWN transitions. The backend counter counts |
| 1154 | transitions to the whole backend being down, rather than the sum of the |
| 1155 | counters for each server. |
| 1156 | 23. lastchg [..BS]: number of seconds since the last UP<->DOWN transition |
| 1157 | 24. downtime [..BS]: total downtime (in seconds). The value for the backend |
| 1158 | is the downtime for the whole backend, not the sum of the server downtime. |
| 1159 | 25. qlimit [...S]: configured maxqueue for the server, or nothing in the |
| 1160 | value is 0 (default, meaning no limit) |
| 1161 | 26. pid [LFBS]: process id (0 for first instance, 1 for second, ...) |
| 1162 | 27. iid [LFBS]: unique proxy id |
| 1163 | 28. sid [L..S]: server id (unique inside a proxy) |
| 1164 | 29. throttle [...S]: current throttle percentage for the server, when |
| 1165 | slowstart is active, or no value if not in slowstart. |
| 1166 | 30. lbtot [..BS]: total number of times a server was selected, either for new |
| 1167 | sessions, or when re-dispatching. The server counter is the number |
| 1168 | of times that server was selected. |
| 1169 | 31. tracked [...S]: id of proxy/server if tracking is enabled. |
| 1170 | 32. type [LFBS]: (0=frontend, 1=backend, 2=server, 3=socket/listener) |
| 1171 | 33. rate [.FBS]: number of sessions per second over last elapsed second |
| 1172 | 34. rate_lim [.F..]: configured limit on new sessions per second |
| 1173 | 35. rate_max [.FBS]: max number of new sessions per second |
| 1174 | 36. check_status [...S]: status of last health check, one of: |
| 1175 | UNK -> unknown |
| 1176 | INI -> initializing |
| 1177 | SOCKERR -> socket error |
| 1178 | L4OK -> check passed on layer 4, no upper layers testing enabled |
| 1179 | L4TOUT -> layer 1-4 timeout |
| 1180 | L4CON -> layer 1-4 connection problem, for example |
| 1181 | "Connection refused" (tcp rst) or "No route to host" (icmp) |
| 1182 | L6OK -> check passed on layer 6 |
| 1183 | L6TOUT -> layer 6 (SSL) timeout |
| 1184 | L6RSP -> layer 6 invalid response - protocol error |
| 1185 | L7OK -> check passed on layer 7 |
| 1186 | L7OKC -> check conditionally passed on layer 7, for example 404 with |
| 1187 | disable-on-404 |
| 1188 | L7TOUT -> layer 7 (HTTP/SMTP) timeout |
| 1189 | L7RSP -> layer 7 invalid response - protocol error |
| 1190 | L7STS -> layer 7 response error, for example HTTP 5xx |
Daniel Schneller | b6c8b0d | 2017-09-01 19:13:55 +0200 | [diff] [blame] | 1191 | Notice: If a check is currently running, the last known status will be |
| 1192 | reported, prefixed with "* ". e. g. "* L7OK". |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1193 | 37. check_code [...S]: layer5-7 code, if available |
| 1194 | 38. check_duration [...S]: time in ms took to finish last health check |
| 1195 | 39. hrsp_1xx [.FBS]: http responses with 1xx code |
| 1196 | 40. hrsp_2xx [.FBS]: http responses with 2xx code |
| 1197 | 41. hrsp_3xx [.FBS]: http responses with 3xx code |
| 1198 | 42. hrsp_4xx [.FBS]: http responses with 4xx code |
| 1199 | 43. hrsp_5xx [.FBS]: http responses with 5xx code |
| 1200 | 44. hrsp_other [.FBS]: http responses with other codes (protocol error) |
| 1201 | 45. hanafail [...S]: failed health checks details |
| 1202 | 46. req_rate [.F..]: HTTP requests per second over last elapsed second |
| 1203 | 47. req_rate_max [.F..]: max number of HTTP requests per second observed |
Willy Tarreau | fb981bd | 2016-12-12 14:31:46 +0100 | [diff] [blame] | 1204 | 48. req_tot [.FB.]: total number of HTTP requests received |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1205 | 49. cli_abrt [..BS]: number of data transfers aborted by the client |
| 1206 | 50. srv_abrt [..BS]: number of data transfers aborted by the server |
| 1207 | (inc. in eresp) |
| 1208 | 51. comp_in [.FB.]: number of HTTP response bytes fed to the compressor |
| 1209 | 52. comp_out [.FB.]: number of HTTP response bytes emitted by the compressor |
| 1210 | 53. comp_byp [.FB.]: number of bytes that bypassed the HTTP compressor |
| 1211 | (CPU/BW limit) |
| 1212 | 54. comp_rsp [.FB.]: number of HTTP responses that were compressed |
| 1213 | 55. lastsess [..BS]: number of seconds since last session assigned to |
| 1214 | server/backend |
| 1215 | 56. last_chk [...S]: last health check contents or textual error |
| 1216 | 57. last_agt [...S]: last agent check contents or textual error |
| 1217 | 58. qtime [..BS]: the average queue time in ms over the 1024 last requests |
| 1218 | 59. ctime [..BS]: the average connect time in ms over the 1024 last requests |
| 1219 | 60. rtime [..BS]: the average response time in ms over the 1024 last requests |
| 1220 | (0 for TCP) |
| 1221 | 61. ttime [..BS]: the average total session time in ms over the 1024 last |
| 1222 | requests |
Willy Tarreau | 7f61884 | 2016-01-08 11:40:03 +0100 | [diff] [blame] | 1223 | 62. agent_status [...S]: status of last agent check, one of: |
| 1224 | UNK -> unknown |
| 1225 | INI -> initializing |
| 1226 | SOCKERR -> socket error |
| 1227 | L4OK -> check passed on layer 4, no upper layers testing enabled |
| 1228 | L4TOUT -> layer 1-4 timeout |
| 1229 | L4CON -> layer 1-4 connection problem, for example |
| 1230 | "Connection refused" (tcp rst) or "No route to host" (icmp) |
| 1231 | L7OK -> agent reported "up" |
| 1232 | L7STS -> agent reported "fail", "stop", or "down" |
| 1233 | 63. agent_code [...S]: numeric code reported by agent if any (unused for now) |
| 1234 | 64. agent_duration [...S]: time in ms taken to finish last check |
Willy Tarreau | dd7354b | 2016-01-08 13:47:26 +0100 | [diff] [blame] | 1235 | 65. check_desc [...S]: short human-readable description of check_status |
| 1236 | 66. agent_desc [...S]: short human-readable description of agent_status |
Willy Tarreau | 3141f59 | 2016-01-08 14:25:28 +0100 | [diff] [blame] | 1237 | 67. check_rise [...S]: server's "rise" parameter used by checks |
| 1238 | 68. check_fall [...S]: server's "fall" parameter used by checks |
| 1239 | 69. check_health [...S]: server's health check value between 0 and rise+fall-1 |
| 1240 | 70. agent_rise [...S]: agent's "rise" parameter, normally 1 |
| 1241 | 71. agent_fall [...S]: agent's "fall" parameter, normally 1 |
| 1242 | 72. agent_health [...S]: agent's health parameter, between 0 and rise+fall-1 |
Willy Tarreau | a6f5a73 | 2016-01-08 16:59:56 +0100 | [diff] [blame] | 1243 | 73. addr [L..S]: address:port or "unix". IPv6 has brackets around the address. |
Willy Tarreau | e4847c6 | 2016-01-08 15:43:54 +0100 | [diff] [blame] | 1244 | 74: cookie [..BS]: server's cookie value or backend's cookie name |
Willy Tarreau | f8211df | 2016-01-11 14:09:38 +0100 | [diff] [blame] | 1245 | 75: mode [LFBS]: proxy mode (tcp, http, health, unknown) |
Willy Tarreau | f1516d9 | 2016-01-11 14:48:36 +0100 | [diff] [blame] | 1246 | 76: algo [..B.]: load balancing algorithm |
Willy Tarreau | c73810f | 2016-01-11 13:52:04 +0100 | [diff] [blame] | 1247 | 77: conn_rate [.F..]: number of connections over the last elapsed second |
| 1248 | 78: conn_rate_max [.F..]: highest known conn_rate |
| 1249 | 79: conn_tot [.F..]: cumulative number of connections |
Willy Tarreau | 5b9bdff | 2016-01-11 14:40:47 +0100 | [diff] [blame] | 1250 | 80: intercepted [.FB.]: cum. number of intercepted requests (monitor, stats) |
Willy Tarreau | 8a90b8e | 2016-10-21 18:15:32 +0200 | [diff] [blame] | 1251 | 81: dcon [LF..]: requests denied by "tcp-request connection" rules |
Willy Tarreau | a5bc36b | 2016-10-21 18:16:27 +0200 | [diff] [blame] | 1252 | 82: dses [LF..]: requests denied by "tcp-request session" rules |
Willy Tarreau | ea96a82 | 2018-05-28 15:15:43 +0200 | [diff] [blame] | 1253 | 83: wrew [LFBS]: cumulative number of failed header rewriting warnings |
Jérôme Magnin | 708eb88 | 2019-07-17 09:24:46 +0200 | [diff] [blame] | 1254 | 84: connect [..BS]: cumulative number of connection establishment attempts |
| 1255 | 85: reuse [..BS]: cumulative number of connection reuses |
Willy Tarreau | 7297429 | 2019-11-08 07:29:34 +0100 | [diff] [blame] | 1256 | 86: cache_lookups [.FB.]: cumulative number of cache lookups |
Jérôme Magnin | 34ebb5c | 2019-07-17 14:04:40 +0200 | [diff] [blame] | 1257 | 87: cache_hits [.FB.]: cumulative number of cache hits |
Christopher Faulet | 2ac2574 | 2019-11-08 15:27:27 +0100 | [diff] [blame] | 1258 | 88: srv_icur [...S]: current number of idle connections available for reuse |
| 1259 | 89: src_ilim [...S]: limit on the number of available idle connections |
| 1260 | 90. qtime_max [..BS]: the maximum observed queue time in ms |
| 1261 | 91. ctime_max [..BS]: the maximum observed connect time in ms |
| 1262 | 92. rtime_max [..BS]: the maximum observed response time in ms (0 for TCP) |
| 1263 | 93. ttime_max [..BS]: the maximum observed total session time in ms |
Christopher Faulet | 0159ee4 | 2019-12-16 14:40:39 +0100 | [diff] [blame] | 1264 | 94. eint [LFBS]: cumulative number of internal errors |
Pierre Cheynier | 08eb718 | 2020-10-08 16:37:14 +0200 | [diff] [blame] | 1265 | 95. idle_conn_cur [...S]: current number of unsafe idle connections |
| 1266 | 96. safe_conn_cur [...S]: current number of safe idle connections |
| 1267 | 97. used_conn_cur [...S]: current number of connections in use |
| 1268 | 98. need_conn_est [...S]: estimated needed number of connections |
Willy Tarreau | bd71510 | 2020-10-23 22:44:30 +0200 | [diff] [blame] | 1269 | 99. uweight [..BS]: total user weight (backend), server user weight (server) |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1270 | |
Amaury Denoyelle | 50660a8 | 2020-10-05 11:49:39 +0200 | [diff] [blame] | 1271 | For all other statistics domains, the presence or the order of the fields are |
| 1272 | not guaranteed. In this case, the header line should always be used to parse |
| 1273 | the CSV data. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1274 | |
Phil Scherer | b931f96 | 2020-12-02 19:36:08 +0000 | [diff] [blame] | 1275 | 9.2. Typed output format |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 1276 | ------------------------ |
| 1277 | |
| 1278 | Both "show info" and "show stat" support a mode where each output value comes |
| 1279 | with its type and sufficient information to know how the value is supposed to |
| 1280 | be aggregated between processes and how it evolves. |
| 1281 | |
| 1282 | In all cases, the output consists in having a single value per line with all |
| 1283 | the information split into fields delimited by colons (':'). |
| 1284 | |
| 1285 | The first column designates the object or metric being dumped. Its format is |
| 1286 | specific to the command producing this output and will not be described in this |
| 1287 | section. Usually it will consist in a series of identifiers and field names. |
| 1288 | |
| 1289 | The second column contains 3 characters respectively indicating the origin, the |
| 1290 | nature and the scope of the value being reported. The first character (the |
| 1291 | origin) indicates where the value was extracted from. Possible characters are : |
| 1292 | |
| 1293 | M The value is a metric. It is valid at one instant any may change depending |
| 1294 | on its nature . |
| 1295 | |
| 1296 | S The value is a status. It represents a discrete value which by definition |
| 1297 | cannot be aggregated. It may be the status of a server ("UP" or "DOWN"), |
| 1298 | the PID of the process, etc. |
| 1299 | |
| 1300 | K The value is a sorting key. It represents an identifier which may be used |
| 1301 | to group some values together because it is unique among its class. All |
| 1302 | internal identifiers are keys. Some names can be listed as keys if they |
| 1303 | are unique (eg: a frontend name is unique). In general keys come from the |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 1304 | configuration, even though some of them may automatically be assigned. For |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 1305 | most purposes keys may be considered as equivalent to configuration. |
| 1306 | |
| 1307 | C The value comes from the configuration. Certain configuration values make |
| 1308 | sense on the output, for example a concurrent connection limit or a cookie |
| 1309 | name. By definition these values are the same in all processes started |
| 1310 | from the same configuration file. |
| 1311 | |
| 1312 | P The value comes from the product itself. There are very few such values, |
| 1313 | most common use is to report the product name, version and release date. |
| 1314 | These elements are also the same between all processes. |
| 1315 | |
| 1316 | The second character (the nature) indicates the nature of the information |
| 1317 | carried by the field in order to let an aggregator decide on what operation to |
| 1318 | use to aggregate multiple values. Possible characters are : |
| 1319 | |
| 1320 | A The value represents an age since a last event. This is a bit different |
| 1321 | from the duration in that an age is automatically computed based on the |
| 1322 | current date. A typical example is how long ago did the last session |
| 1323 | happen on a server. Ages are generally aggregated by taking the minimum |
| 1324 | value and do not need to be stored. |
| 1325 | |
| 1326 | a The value represents an already averaged value. The average response times |
| 1327 | and server weights are of this nature. Averages can typically be averaged |
| 1328 | between processes. |
| 1329 | |
| 1330 | C The value represents a cumulative counter. Such measures perpetually |
| 1331 | increase until they wrap around. Some monitoring protocols need to tell |
| 1332 | the difference between a counter and a gauge to report a different type. |
| 1333 | In general counters may simply be summed since they represent events or |
| 1334 | volumes. Examples of metrics of this nature are connection counts or byte |
| 1335 | counts. |
| 1336 | |
| 1337 | D The value represents a duration for a status. There are a few usages of |
| 1338 | this, most of them include the time taken by the last health check and |
| 1339 | the time a server has spent down. Durations are generally not summed, |
| 1340 | most of the time the maximum will be retained to compute an SLA. |
| 1341 | |
| 1342 | G The value represents a gauge. It's a measure at one instant. The memory |
| 1343 | usage or the current number of active connections are of this nature. |
| 1344 | Metrics of this type are typically summed during aggregation. |
| 1345 | |
| 1346 | L The value represents a limit (generally a configured one). By nature, |
| 1347 | limits are harder to aggregate since they are specific to the point where |
| 1348 | they were retrieved. In certain situations they may be summed or be kept |
| 1349 | separate. |
| 1350 | |
| 1351 | M The value represents a maximum. In general it will apply to a gauge and |
| 1352 | keep the highest known value. An example of such a metric could be the |
| 1353 | maximum amount of concurrent connections that was encountered in the |
| 1354 | product's life time. To correctly aggregate maxima, you are supposed to |
| 1355 | output a range going from the maximum of all maxima and the sum of all |
| 1356 | of them. There is indeed no way to know if they were encountered |
| 1357 | simultaneously or not. |
| 1358 | |
| 1359 | m The value represents a minimum. In general it will apply to a gauge and |
| 1360 | keep the lowest known value. An example of such a metric could be the |
| 1361 | minimum amount of free memory pools that was encountered in the product's |
| 1362 | life time. To correctly aggregate minima, you are supposed to output a |
| 1363 | range going from the minimum of all minima and the sum of all of them. |
| 1364 | There is indeed no way to know if they were encountered simultaneously |
| 1365 | or not. |
| 1366 | |
| 1367 | N The value represents a name, so it is a string. It is used to report |
| 1368 | proxy names, server names and cookie names. Names have configuration or |
| 1369 | keys as their origin and are supposed to be the same among all processes. |
| 1370 | |
| 1371 | O The value represents a free text output. Outputs from various commands, |
| 1372 | returns from health checks, node descriptions are of such nature. |
| 1373 | |
| 1374 | R The value represents an event rate. It's a measure at one instant. It is |
| 1375 | quite similar to a gauge except that the recipient knows that this measure |
| 1376 | moves slowly and may decide not to keep all values. An example of such a |
| 1377 | metric is the measured amount of connections per second. Metrics of this |
| 1378 | type are typically summed during aggregation. |
| 1379 | |
| 1380 | T The value represents a date or time. A field emitting the current date |
| 1381 | would be of this type. The method to aggregate such information is left |
| 1382 | as an implementation choice. For now no field uses this type. |
| 1383 | |
| 1384 | The third character (the scope) indicates what extent the value reflects. Some |
| 1385 | elements may be per process while others may be per configuration or per system. |
| 1386 | The distinction is important to know whether or not a single value should be |
| 1387 | kept during aggregation or if values have to be aggregated. The following |
| 1388 | characters are currently supported : |
| 1389 | |
| 1390 | C The value is valid for a whole cluster of nodes, which is the set of nodes |
| 1391 | communicating over the peers protocol. An example could be the amount of |
| 1392 | entries present in a stick table that is replicated with other peers. At |
| 1393 | the moment no metric use this scope. |
| 1394 | |
| 1395 | P The value is valid only for the process reporting it. Most metrics use |
| 1396 | this scope. |
| 1397 | |
| 1398 | S The value is valid for the whole service, which is the set of processes |
| 1399 | started together from the same configuration file. All metrics originating |
| 1400 | from the configuration use this scope. Some other metrics may use it as |
| 1401 | well for some shared resources (eg: shared SSL cache statistics). |
| 1402 | |
| 1403 | s The value is valid for the whole system, such as the system's hostname, |
| 1404 | current date or resource usage. At the moment this scope is not used by |
| 1405 | any metric. |
| 1406 | |
| 1407 | Consumers of these information will generally have enough of these 3 characters |
| 1408 | to determine how to accurately report aggregated information across multiple |
| 1409 | processes. |
| 1410 | |
| 1411 | After this column, the third column indicates the type of the field, among "s32" |
| 1412 | (signed 32-bit integer), "s64" (signed 64-bit integer), "u32" (unsigned 32-bit |
| 1413 | integer), "u64" (unsigned 64-bit integer), "str" (string). It is important to |
| 1414 | know the type before parsing the value in order to properly read it. For example |
| 1415 | a string containing only digits is still a string an not an integer (eg: an |
| 1416 | error code extracted by a check). |
| 1417 | |
| 1418 | Then the fourth column is the value itself, encoded according to its type. |
| 1419 | Strings are dumped as-is immediately after the colon without any leading space. |
| 1420 | If a string contains a colon, it will appear normally. This means that the |
| 1421 | output should not be exclusively split around colons or some check outputs |
| 1422 | or server addresses might be truncated. |
| 1423 | |
| 1424 | |
| 1425 | 9.3. Unix Socket commands |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1426 | ------------------------- |
| 1427 | |
| 1428 | The stats socket is not enabled by default. In order to enable it, it is |
| 1429 | necessary to add one line in the global section of the haproxy configuration. |
| 1430 | A second line is recommended to set a larger timeout, always appreciated when |
| 1431 | issuing commands by hand : |
| 1432 | |
| 1433 | global |
| 1434 | stats socket /var/run/haproxy.sock mode 600 level admin |
| 1435 | stats timeout 2m |
| 1436 | |
| 1437 | It is also possible to add multiple instances of the stats socket by repeating |
| 1438 | the line, and make them listen to a TCP port instead of a UNIX socket. This is |
| 1439 | never done by default because this is dangerous, but can be handy in some |
| 1440 | situations : |
| 1441 | |
| 1442 | global |
| 1443 | stats socket /var/run/haproxy.sock mode 600 level admin |
| 1444 | stats socket ipv4@192.168.0.1:9999 level admin |
| 1445 | stats timeout 2m |
| 1446 | |
| 1447 | To access the socket, an external utility such as "socat" is required. Socat is |
| 1448 | a swiss-army knife to connect anything to anything. We use it to connect |
| 1449 | terminals to the socket, or a couple of stdin/stdout pipes to it for scripts. |
| 1450 | The two main syntaxes we'll use are the following : |
| 1451 | |
| 1452 | # socat /var/run/haproxy.sock stdio |
| 1453 | # socat /var/run/haproxy.sock readline |
| 1454 | |
| 1455 | The first one is used with scripts. It is possible to send the output of a |
| 1456 | script to haproxy, and pass haproxy's output to another script. That's useful |
| 1457 | for retrieving counters or attack traces for example. |
| 1458 | |
| 1459 | The second one is only useful for issuing commands by hand. It has the benefit |
| 1460 | that the terminal is handled by the readline library which supports line |
| 1461 | editing and history, which is very convenient when issuing repeated commands |
| 1462 | (eg: watch a counter). |
| 1463 | |
| 1464 | The socket supports two operation modes : |
| 1465 | - interactive |
| 1466 | - non-interactive |
| 1467 | |
| 1468 | The non-interactive mode is the default when socat connects to the socket. In |
| 1469 | this mode, a single line may be sent. It is processed as a whole, responses are |
| 1470 | sent back, and the connection closes after the end of the response. This is the |
| 1471 | mode that scripts and monitoring tools use. It is possible to send multiple |
| 1472 | commands in this mode, they need to be delimited by a semi-colon (';'). For |
| 1473 | example : |
| 1474 | |
| 1475 | # echo "show info;show stat;show table" | socat /var/run/haproxy stdio |
| 1476 | |
Dragan Dosen | a1c35ab | 2016-11-24 11:33:12 +0100 | [diff] [blame] | 1477 | If a command needs to use a semi-colon or a backslash (eg: in a value), it |
Joseph Herlant | 71b4b15 | 2018-11-13 16:55:16 -0800 | [diff] [blame] | 1478 | must be preceded by a backslash ('\'). |
Chad Lavoie | e3f5031 | 2016-05-26 16:42:25 -0400 | [diff] [blame] | 1479 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1480 | The interactive mode displays a prompt ('>') and waits for commands to be |
| 1481 | entered on the line, then processes them, and displays the prompt again to wait |
| 1482 | for a new command. This mode is entered via the "prompt" command which must be |
| 1483 | sent on the first line in non-interactive mode. The mode is a flip switch, if |
| 1484 | "prompt" is sent in interactive mode, it is disabled and the connection closes |
| 1485 | after processing the last command of the same line. |
| 1486 | |
| 1487 | For this reason, when debugging by hand, it's quite common to start with the |
| 1488 | "prompt" command : |
| 1489 | |
| 1490 | # socat /var/run/haproxy readline |
| 1491 | prompt |
| 1492 | > show info |
| 1493 | ... |
| 1494 | > |
| 1495 | |
| 1496 | Since multiple commands may be issued at once, haproxy uses the empty line as a |
| 1497 | delimiter to mark an end of output for each command, and takes care of ensuring |
| 1498 | that no command can emit an empty line on output. A script can thus easily |
| 1499 | parse the output even when multiple commands were pipelined on a single line. |
| 1500 | |
Aurélien Nephtali | abbf607 | 2018-04-18 13:26:46 +0200 | [diff] [blame] | 1501 | Some commands may take an optional payload. To add one to a command, the first |
| 1502 | line needs to end with the "<<\n" pattern. The next lines will be treated as |
| 1503 | the payload and can contain as many lines as needed. To validate a command with |
| 1504 | a payload, it needs to end with an empty line. |
| 1505 | |
| 1506 | Limitations do exist: the length of the whole buffer passed to the CLI must |
| 1507 | not be greater than tune.bfsize and the pattern "<<" must not be glued to the |
| 1508 | last word of the line. |
| 1509 | |
| 1510 | When entering a paylod while in interactive mode, the prompt will change from |
| 1511 | "> " to "+ ". |
| 1512 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1513 | It is important to understand that when multiple haproxy processes are started |
| 1514 | on the same sockets, any process may pick up the request and will output its |
| 1515 | own stats. |
| 1516 | |
| 1517 | The list of commands currently supported on the stats socket is provided below. |
| 1518 | If an unknown command is sent, haproxy displays the usage message which reminds |
| 1519 | all supported commands. Some commands support a more complex syntax, generally |
| 1520 | it will explain what part of the command is invalid when this happens. |
| 1521 | |
Olivier Doucet | d8703e8 | 2017-08-31 11:05:10 +0200 | [diff] [blame] | 1522 | Some commands require a higher level of privilege to work. If you do not have |
| 1523 | enough privilege, you will get an error "Permission denied". Please check |
| 1524 | the "level" option of the "bind" keyword lines in the configuration manual |
| 1525 | for more information. |
| 1526 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 1527 | abort ssl ca-file <cafile> |
| 1528 | Abort and destroy a temporary CA file update transaction. |
| 1529 | |
| 1530 | See also "set ssl ca-file" and "commit ssl ca-file". |
| 1531 | |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 1532 | abort ssl cert <filename> |
| 1533 | Abort and destroy a temporary SSL certificate update transaction. |
| 1534 | |
| 1535 | See also "set ssl cert" and "commit ssl cert". |
| 1536 | |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 1537 | abort ssl crl-file <crlfile> |
| 1538 | Abort and destroy a temporary CRL file update transaction. |
| 1539 | |
| 1540 | See also "set ssl crl-file" and "commit ssl crl-file". |
| 1541 | |
Willy Tarreau | bb51c44 | 2021-04-30 15:23:36 +0200 | [diff] [blame] | 1542 | add acl [@<ver>] <acl> <pattern> |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1543 | Add an entry into the acl <acl>. <acl> is the #<id> or the <file> returned by |
Willy Tarreau | bb51c44 | 2021-04-30 15:23:36 +0200 | [diff] [blame] | 1544 | "show acl". This command does not verify if the entry already exists. Entries |
| 1545 | are added to the current version of the ACL, unless a specific version is |
| 1546 | specified with "@<ver>". This version number must have preliminary been |
| 1547 | allocated by "prepare acl", and it will be comprised between the versions |
| 1548 | reported in "curr_ver" and "next_ver" on the output of "show acl". Entries |
| 1549 | added with a specific version number will not match until a "commit acl" |
| 1550 | operation is performed on them. They may however be consulted using the |
| 1551 | "show acl @<ver>" command, and cleared using a "clear acl @<ver>" command. |
| 1552 | This command cannot be used if the reference <acl> is a file also used with |
| 1553 | a map. In this case, the "add map" command must be used instead. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1554 | |
Willy Tarreau | bb51c44 | 2021-04-30 15:23:36 +0200 | [diff] [blame] | 1555 | add map [@<ver>] <map> <key> <value> |
| 1556 | add map [@<ver>] <map> <payload> |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1557 | Add an entry into the map <map> to associate the value <value> to the key |
| 1558 | <key>. This command does not verify if the entry already exists. It is |
Willy Tarreau | bb51c44 | 2021-04-30 15:23:36 +0200 | [diff] [blame] | 1559 | mainly used to fill a map after a "clear" or "prepare" operation. Entries |
| 1560 | are added to the current version of the ACL, unless a specific version is |
| 1561 | specified with "@<ver>". This version number must have preliminary been |
| 1562 | allocated by "prepare acl", and it will be comprised between the versions |
| 1563 | reported in "curr_ver" and "next_ver" on the output of "show acl". Entries |
| 1564 | added with a specific version number will not match until a "commit map" |
| 1565 | operation is performed on them. They may however be consulted using the |
| 1566 | "show map @<ver>" command, and cleared using a "clear acl @<ver>" command. |
| 1567 | If the designated map is also used as an ACL, the ACL will only match the |
| 1568 | <key> part and will ignore the <value> part. Using the payload syntax it is |
| 1569 | possible to add multiple key/value pairs by entering them on separate lines. |
| 1570 | On each new line, the first word is the key and the rest of the line is |
| 1571 | considered to be the value which can even contains spaces. |
Aurélien Nephtali | 25650ce | 2018-04-18 14:04:47 +0200 | [diff] [blame] | 1572 | |
| 1573 | Example: |
| 1574 | |
| 1575 | # socat /tmp/sock1 - |
| 1576 | prompt |
| 1577 | |
| 1578 | > add map #-1 << |
| 1579 | + key1 value1 |
| 1580 | + key2 value2 with spaces |
| 1581 | + key3 value3 also with spaces |
| 1582 | + key4 value4 |
| 1583 | |
| 1584 | > |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1585 | |
Amaury Denoyelle | f99f77a | 2021-03-08 17:13:32 +0100 | [diff] [blame] | 1586 | add server <backend>/<server> [args]* |
Amaury Denoyelle | 76e8b70 | 2022-03-09 15:07:31 +0100 | [diff] [blame] | 1587 | Instantiate a new server attached to the backend <backend>. |
Amaury Denoyelle | f99f77a | 2021-03-08 17:13:32 +0100 | [diff] [blame] | 1588 | |
| 1589 | The <server> name must not be already used in the backend. A special |
Amaury Denoyelle | eafd701 | 2021-04-29 14:59:42 +0200 | [diff] [blame] | 1590 | restriction is put on the backend which must used a dynamic load-balancing |
| 1591 | algorithm. A subset of keywords from the server config file statement can be |
| 1592 | used to configure the server behavior. Also note that no settings will be |
| 1593 | reused from an hypothetical 'default-server' statement in the same backend. |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1594 | |
Amaury Denoyelle | efbf35c | 2021-06-10 17:34:10 +0200 | [diff] [blame] | 1595 | Currently a dynamic server is statically initialized with the "none" |
| 1596 | init-addr method. This means that no resolution will be undertaken if a FQDN |
| 1597 | is specified as an address, even if the server creation will be validated. |
| 1598 | |
Amaury Denoyelle | 14c3c5c | 2021-08-23 14:10:51 +0200 | [diff] [blame] | 1599 | To support the reload operations, it is expected that the server created via |
| 1600 | the CLI is also manually inserted in the relevant haproxy configuration file. |
| 1601 | A dynamic server not present in the configuration won't be restored after a |
| 1602 | reload operation. |
| 1603 | |
Amaury Denoyelle | 56eb8ed | 2021-07-13 10:36:03 +0200 | [diff] [blame] | 1604 | A dynamic server may use the "track" keyword to follow the check status of |
| 1605 | another server from the configuration. However, it is not possible to track |
| 1606 | another dynamic server. This is to ensure that the tracking chain is kept |
| 1607 | consistent even in the case of dynamic servers deletion. |
| 1608 | |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1609 | Use the "check" keyword to enable health-check support. Note that the |
| 1610 | health-check is disabled by default and must be enabled independently from |
Amaury Denoyelle | b65f4ca | 2021-08-04 11:33:14 +0200 | [diff] [blame] | 1611 | the server using the "enable health" command. For agent checks, use the |
| 1612 | "agent-check" keyword and the "enable agent" command. Note that in this case |
| 1613 | the server may be activated via the agent depending on the status reported, |
| 1614 | without an explicit "enable server" command. This also means that extra care |
| 1615 | is required when removing a dynamic server with agent check. The agent should |
| 1616 | be first deactivated via "disable agent" to be able to put the server in the |
| 1617 | required maintenance mode before removal. |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1618 | |
Amaury Denoyelle | 414a612 | 2021-08-06 10:25:32 +0200 | [diff] [blame] | 1619 | It may be possible to reach the fd limit when using a large number of dynamic |
| 1620 | servers. Please refer to the "u-limit" global keyword documentation in this |
| 1621 | case. |
| 1622 | |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1623 | Here is the list of the currently supported keywords : |
| 1624 | |
Amaury Denoyelle | b65f4ca | 2021-08-04 11:33:14 +0200 | [diff] [blame] | 1625 | - agent-addr |
| 1626 | - agent-check |
| 1627 | - agent-inter |
| 1628 | - agent-port |
| 1629 | - agent-send |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1630 | - allow-0rtt |
| 1631 | - alpn |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1632 | - addr |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1633 | - backup |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1634 | - ca-file |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1635 | - check |
Amaury Denoyelle | 79b90e8 | 2021-09-20 15:15:19 +0200 | [diff] [blame] | 1636 | - check-alpn |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1637 | - check-proto |
| 1638 | - check-send-proxy |
Amaury Denoyelle | 79b90e8 | 2021-09-20 15:15:19 +0200 | [diff] [blame] | 1639 | - check-sni |
| 1640 | - check-ssl |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1641 | - check-via-socks4 |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1642 | - ciphers |
| 1643 | - ciphersuites |
| 1644 | - crl-file |
| 1645 | - crt |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1646 | - disabled |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1647 | - downinter |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1648 | - enabled |
Amaury Denoyelle | 725f8d2 | 2021-09-20 15:16:12 +0200 | [diff] [blame] | 1649 | - error-limit |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1650 | - fall |
| 1651 | - fastinter |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1652 | - force-sslv3/tlsv10/tlsv11/tlsv12/tlsv13 |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1653 | - id |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1654 | - inter |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1655 | - maxconn |
| 1656 | - maxqueue |
| 1657 | - minconn |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1658 | - no-ssl-reuse |
| 1659 | - no-sslv3/tlsv10/tlsv11/tlsv12/tlsv13 |
| 1660 | - no-tls-tickets |
| 1661 | - npn |
Amaury Denoyelle | 725f8d2 | 2021-09-20 15:16:12 +0200 | [diff] [blame] | 1662 | - observe |
| 1663 | - on-error |
| 1664 | - on-marked-down |
| 1665 | - on-marked-up |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1666 | - pool-low-conn |
| 1667 | - pool-max-conn |
| 1668 | - pool-purge-delay |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1669 | - port |
Amaury Denoyelle | 3046723 | 2021-03-12 18:03:27 +0100 | [diff] [blame] | 1670 | - proto |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1671 | - proxy-v2-options |
Amaury Denoyelle | 2fc4d39 | 2021-07-22 16:04:59 +0200 | [diff] [blame] | 1672 | - rise |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1673 | - send-proxy |
| 1674 | - send-proxy-v2 |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1675 | - send-proxy-v2-ssl |
| 1676 | - send-proxy-v2-ssl-cn |
Amaury Denoyelle | cd8a6f2 | 2021-09-21 11:51:54 +0200 | [diff] [blame] | 1677 | - slowstart |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1678 | - sni |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1679 | - source |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1680 | - ssl |
| 1681 | - ssl-max-ver |
| 1682 | - ssl-min-ver |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1683 | - tfo |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1684 | - tls-tickets |
Amaury Denoyelle | 56eb8ed | 2021-07-13 10:36:03 +0200 | [diff] [blame] | 1685 | - track |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1686 | - usesrc |
Amaury Denoyelle | 34897d2 | 2021-05-19 09:49:41 +0200 | [diff] [blame] | 1687 | - verify |
| 1688 | - verifyhost |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1689 | - weight |
Amaury Denoyelle | f9d5957 | 2021-10-18 14:40:29 +0200 | [diff] [blame] | 1690 | - ws |
Amaury Denoyelle | fc465a5 | 2021-03-09 17:36:23 +0100 | [diff] [blame] | 1691 | |
| 1692 | Their syntax is similar to the server line from the configuration file, |
| 1693 | please refer to their individual documentation for details. |
Amaury Denoyelle | f99f77a | 2021-03-08 17:13:32 +0100 | [diff] [blame] | 1694 | |
William Lallemand | 62c0b99 | 2022-07-29 17:50:58 +0200 | [diff] [blame] | 1695 | add ssl ca-file <cafile> <payload> |
| 1696 | Add a new certificate to a ca-file. This command is useful when you reached |
| 1697 | the buffer size limit on the CLI and want to add multiple certicates. |
| 1698 | Instead of doing a "set" with all the certificates you are able to add each |
| 1699 | certificate individually. A "set ssl ca-file" will reset the ca-file. |
| 1700 | |
| 1701 | Example: |
| 1702 | echo -e "set ssl ca-file cafile.pem <<\n$(cat rootCA.crt)\n" | \ |
| 1703 | socat /var/run/haproxy.stat - |
| 1704 | echo -e "add ssl ca-file cafile.pem <<\n$(cat intermediate1.crt)\n" | \ |
| 1705 | socat /var/run/haproxy.stat - |
| 1706 | echo -e "add ssl ca-file cafile.pem <<\n$(cat intermediate2.crt)\n" | \ |
| 1707 | socat /var/run/haproxy.stat - |
| 1708 | echo "commit ssl ca-file cafile.pem" | socat /var/run/haproxy.stat - |
| 1709 | |
William Lallemand | accac23 | 2020-04-02 17:42:51 +0200 | [diff] [blame] | 1710 | add ssl crt-list <crtlist> <certificate> |
| 1711 | add ssl crt-list <crtlist> <payload> |
| 1712 | Add an certificate in a crt-list. It can also be used for directories since |
| 1713 | directories are now loaded the same way as the crt-lists. This command allow |
| 1714 | you to use a certificate name in parameter, to use SSL options or filters a |
| 1715 | crt-list line must sent as a payload instead. Only one crt-list line is |
| 1716 | supported in the payload. This command will load the certificate for every |
| 1717 | bind lines using the crt-list. To push a new certificate to HAProxy the |
| 1718 | commands "new ssl cert" and "set ssl cert" must be used. |
| 1719 | |
| 1720 | Example: |
| 1721 | $ echo "new ssl cert foobar.pem" | socat /tmp/sock1 - |
| 1722 | $ echo -e "set ssl cert foobar.pem <<\n$(cat foobar.pem)\n" | socat |
| 1723 | /tmp/sock1 - |
| 1724 | $ echo "commit ssl cert foobar.pem" | socat /tmp/sock1 - |
| 1725 | $ echo "add ssl crt-list certlist1 foobar.pem" | socat /tmp/sock1 - |
| 1726 | |
| 1727 | $ echo -e 'add ssl crt-list certlist1 <<\nfoobar.pem [allow-0rtt] foo.bar.com |
| 1728 | !test1.com\n' | socat /tmp/sock1 - |
| 1729 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1730 | clear counters |
| 1731 | Clear the max values of the statistics counters in each proxy (frontend & |
Willy Tarreau | d80cb4e | 2018-01-20 19:30:13 +0100 | [diff] [blame] | 1732 | backend) and in each server. The accumulated counters are not affected. The |
| 1733 | internal activity counters reported by "show activity" are also reset. This |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1734 | can be used to get clean counters after an incident, without having to |
| 1735 | restart nor to clear traffic counters. This command is restricted and can |
| 1736 | only be issued on sockets configured for levels "operator" or "admin". |
| 1737 | |
| 1738 | clear counters all |
| 1739 | Clear all statistics counters in each proxy (frontend & backend) and in each |
| 1740 | server. This has the same effect as restarting. This command is restricted |
| 1741 | and can only be issued on sockets configured for level "admin". |
| 1742 | |
Willy Tarreau | ff3feeb | 2021-04-30 13:31:43 +0200 | [diff] [blame] | 1743 | clear acl [@<ver>] <acl> |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1744 | Remove all entries from the acl <acl>. <acl> is the #<id> or the <file> |
| 1745 | returned by "show acl". Note that if the reference <acl> is a file and is |
Willy Tarreau | ff3feeb | 2021-04-30 13:31:43 +0200 | [diff] [blame] | 1746 | shared with a map, this map will be also cleared. By default only the current |
| 1747 | version of the ACL is cleared (the one being matched against). However it is |
| 1748 | possible to specify another version using '@' followed by this version. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1749 | |
Willy Tarreau | ff3feeb | 2021-04-30 13:31:43 +0200 | [diff] [blame] | 1750 | clear map [@<ver>] <map> |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1751 | Remove all entries from the map <map>. <map> is the #<id> or the <file> |
| 1752 | returned by "show map". Note that if the reference <map> is a file and is |
Willy Tarreau | ff3feeb | 2021-04-30 13:31:43 +0200 | [diff] [blame] | 1753 | shared with a acl, this acl will be also cleared. By default only the current |
| 1754 | version of the map is cleared (the one being matched against). However it is |
| 1755 | possible to specify another version using '@' followed by this version. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1756 | |
| 1757 | clear table <table> [ data.<type> <operator> <value> ] | [ key <key> ] |
| 1758 | Remove entries from the stick-table <table>. |
| 1759 | |
| 1760 | This is typically used to unblock some users complaining they have been |
| 1761 | abusively denied access to a service, but this can also be used to clear some |
| 1762 | stickiness entries matching a server that is going to be replaced (see "show |
| 1763 | table" below for details). Note that sometimes, removal of an entry will be |
| 1764 | refused because it is currently tracked by a session. Retrying a few seconds |
| 1765 | later after the session ends is usual enough. |
| 1766 | |
| 1767 | In the case where no options arguments are given all entries will be removed. |
| 1768 | |
| 1769 | When the "data." form is used entries matching a filter applied using the |
| 1770 | stored data (see "stick-table" in section 4.2) are removed. A stored data |
| 1771 | type must be specified in <type>, and this data type must be stored in the |
| 1772 | table otherwise an error is reported. The data is compared according to |
| 1773 | <operator> with the 64-bit integer <value>. Operators are the same as with |
| 1774 | the ACLs : |
| 1775 | |
| 1776 | - eq : match entries whose data is equal to this value |
| 1777 | - ne : match entries whose data is not equal to this value |
| 1778 | - le : match entries whose data is less than or equal to this value |
| 1779 | - ge : match entries whose data is greater than or equal to this value |
| 1780 | - lt : match entries whose data is less than this value |
| 1781 | - gt : match entries whose data is greater than this value |
| 1782 | |
| 1783 | When the key form is used the entry <key> is removed. The key must be of the |
| 1784 | same type as the table, which currently is limited to IPv4, IPv6, integer and |
| 1785 | string. |
| 1786 | |
| 1787 | Example : |
| 1788 | $ echo "show table http_proxy" | socat stdio /tmp/sock1 |
| 1789 | >>> # table: http_proxy, type: ip, size:204800, used:2 |
| 1790 | >>> 0x80e6a4c: key=127.0.0.1 use=0 exp=3594729 gpc0=0 conn_rate(30000)=1 \ |
| 1791 | bytes_out_rate(60000)=187 |
| 1792 | >>> 0x80e6a80: key=127.0.0.2 use=0 exp=3594740 gpc0=1 conn_rate(30000)=10 \ |
| 1793 | bytes_out_rate(60000)=191 |
| 1794 | |
| 1795 | $ echo "clear table http_proxy key 127.0.0.1" | socat stdio /tmp/sock1 |
| 1796 | |
| 1797 | $ echo "show table http_proxy" | socat stdio /tmp/sock1 |
| 1798 | >>> # table: http_proxy, type: ip, size:204800, used:1 |
| 1799 | >>> 0x80e6a80: key=127.0.0.2 use=0 exp=3594740 gpc0=1 conn_rate(30000)=10 \ |
| 1800 | bytes_out_rate(60000)=191 |
| 1801 | $ echo "clear table http_proxy data.gpc0 eq 1" | socat stdio /tmp/sock1 |
| 1802 | $ echo "show table http_proxy" | socat stdio /tmp/sock1 |
| 1803 | >>> # table: http_proxy, type: ip, size:204800, used:1 |
| 1804 | |
Willy Tarreau | 7a562ca | 2021-04-30 15:10:01 +0200 | [diff] [blame] | 1805 | commit acl @<ver> <acl> |
| 1806 | Commit all changes made to version <ver> of ACL <acl>, and deletes all past |
| 1807 | versions. <acl> is the #<id> or the <file> returned by "show acl". The |
| 1808 | version number must be between "curr_ver"+1 and "next_ver" as reported in |
| 1809 | "show acl". The contents to be committed to the ACL can be consulted with |
| 1810 | "show acl @<ver> <acl>" if desired. The specified version number has normally |
| 1811 | been created with the "prepare acl" command. The replacement is atomic. It |
| 1812 | consists in atomically updating the current version to the specified version, |
| 1813 | which will instantly cause all entries in other versions to become invisible, |
| 1814 | and all entries in the new version to become visible. It is also possible to |
| 1815 | use this command to perform an atomic removal of all visible entries of an |
| 1816 | ACL by calling "prepare acl" first then committing without adding any |
| 1817 | entries. This command cannot be used if the reference <acl> is a file also |
| 1818 | used as a map. In this case, the "commit map" command must be used instead. |
| 1819 | |
| 1820 | commit map @<ver> <map> |
| 1821 | Commit all changes made to version <ver> of map <map>, and deletes all past |
| 1822 | versions. <map> is the #<id> or the <file> returned by "show map". The |
| 1823 | version number must be between "curr_ver"+1 and "next_ver" as reported in |
| 1824 | "show map". The contents to be committed to the map can be consulted with |
| 1825 | "show map @<ver> <map>" if desired. The specified version number has normally |
| 1826 | been created with the "prepare map" command. The replacement is atomic. It |
| 1827 | consists in atomically updating the current version to the specified version, |
| 1828 | which will instantly cause all entries in other versions to become invisible, |
| 1829 | and all entries in the new version to become visible. It is also possible to |
| 1830 | use this command to perform an atomic removal of all visible entries of an |
| 1831 | map by calling "prepare map" first then committing without adding any |
| 1832 | entries. |
| 1833 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 1834 | commit ssl ca-file <cafile> |
| 1835 | Commit a temporary SSL CA file update transaction. |
| 1836 | |
| 1837 | In the case of an existing CA file (in a "Used" state in "show ssl ca-file"), |
| 1838 | the new CA file tree entry is inserted in the CA file tree and every instance |
| 1839 | that used the CA file entry is rebuilt, along with the SSL contexts it needs. |
| 1840 | All the contexts previously used by the rebuilt instances are removed. |
| 1841 | Upon success, the previous CA file entry is removed from the tree. |
| 1842 | Upon failure, nothing is removed or deleted, and all the original SSL |
| 1843 | contexts are kept and used. |
| 1844 | Once the temporary transaction is committed, it is destroyed. |
| 1845 | |
| 1846 | In the case of a new CA file (after a "new ssl ca-file" and in a "Unused" |
| 1847 | state in "show ssl ca-file"), the CA file will be inserted in the CA file |
| 1848 | tree but it won't be used anywhere in HAProxy. To use it and generate SSL |
| 1849 | contexts that use it, you will need to add it to a crt-list with "add ssl |
| 1850 | crt-list". |
| 1851 | |
William Lallemand | 62c0b99 | 2022-07-29 17:50:58 +0200 | [diff] [blame] | 1852 | See also "new ssl ca-file", "set ssl ca-file", "add ssl ca-file", |
| 1853 | "abort ssl ca-file" and "add ssl crt-list". |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 1854 | |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 1855 | commit ssl cert <filename> |
William Lallemand | c184d87 | 2020-06-26 15:39:57 +0200 | [diff] [blame] | 1856 | Commit a temporary SSL certificate update transaction. |
| 1857 | |
| 1858 | In the case of an existing certificate (in a "Used" state in "show ssl |
| 1859 | cert"), generate every SSL contextes and SNIs it need, insert them, and |
| 1860 | remove the previous ones. Replace in memory the previous SSL certificates |
| 1861 | everywhere the <filename> was used in the configuration. Upon failure it |
| 1862 | doesn't remove or insert anything. Once the temporary transaction is |
| 1863 | committed, it is destroyed. |
| 1864 | |
| 1865 | In the case of a new certificate (after a "new ssl cert" and in a "Unused" |
Ilya Shipitsin | 2272d8a | 2020-12-21 01:22:40 +0500 | [diff] [blame] | 1866 | state in "show ssl cert"), the certificate will be committed in a certificate |
William Lallemand | c184d87 | 2020-06-26 15:39:57 +0200 | [diff] [blame] | 1867 | storage, but it won't be used anywhere in haproxy. To use it and generate |
| 1868 | its SNIs you will need to add it to a crt-list or a directory with "add ssl |
| 1869 | crt-list". |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 1870 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 1871 | See also "new ssl cert", "set ssl cert", "abort ssl cert" and |
William Lallemand | c184d87 | 2020-06-26 15:39:57 +0200 | [diff] [blame] | 1872 | "add ssl crt-list". |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 1873 | |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 1874 | commit ssl crl-file <crlfile> |
| 1875 | Commit a temporary SSL CRL file update transaction. |
| 1876 | |
| 1877 | In the case of an existing CRL file (in a "Used" state in "show ssl |
| 1878 | crl-file"), the new CRL file entry is inserted in the CA file tree (which |
| 1879 | holds both the CA files and the CRL files) and every instance that used the |
| 1880 | CRL file entry is rebuilt, along with the SSL contexts it needs. |
| 1881 | All the contexts previously used by the rebuilt instances are removed. |
| 1882 | Upon success, the previous CRL file entry is removed from the tree. |
| 1883 | Upon failure, nothing is removed or deleted, and all the original SSL |
| 1884 | contexts are kept and used. |
| 1885 | Once the temporary transaction is committed, it is destroyed. |
| 1886 | |
| 1887 | In the case of a new CRL file (after a "new ssl crl-file" and in a "Unused" |
| 1888 | state in "show ssl crl-file"), the CRL file will be inserted in the CRL file |
| 1889 | tree but it won't be used anywhere in HAProxy. To use it and generate SSL |
| 1890 | contexts that use it, you will need to add it to a crt-list with "add ssl |
| 1891 | crt-list". |
| 1892 | |
| 1893 | See also "new ssl crl-file", "set ssl crl-file", "abort ssl crl-file" and |
| 1894 | "add ssl crt-list". |
| 1895 | |
Willy Tarreau | 6bdf3e9 | 2019-05-20 14:25:05 +0200 | [diff] [blame] | 1896 | debug dev <command> [args]* |
Willy Tarreau | b24ab22 | 2019-10-24 18:03:39 +0200 | [diff] [blame] | 1897 | Call a developer-specific command. Only supported on a CLI connection running |
| 1898 | in expert mode (see "expert-mode on"). Such commands are extremely dangerous |
| 1899 | and not forgiving, any misuse may result in a crash of the process. They are |
| 1900 | intended for experts only, and must really not be used unless told to do so. |
| 1901 | Some of them are only available when haproxy is built with DEBUG_DEV defined |
| 1902 | because they may have security implications. All of these commands require |
| 1903 | admin privileges, and are purposely not documented to avoid encouraging their |
| 1904 | use by people who are not at ease with the source code. |
Willy Tarreau | 6bdf3e9 | 2019-05-20 14:25:05 +0200 | [diff] [blame] | 1905 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1906 | del acl <acl> [<key>|#<ref>] |
| 1907 | Delete all the acl entries from the acl <acl> corresponding to the key <key>. |
| 1908 | <acl> is the #<id> or the <file> returned by "show acl". If the <ref> is used, |
| 1909 | this command delete only the listed reference. The reference can be found with |
| 1910 | listing the content of the acl. Note that if the reference <acl> is a file and |
| 1911 | is shared with a map, the entry will be also deleted in the map. |
| 1912 | |
| 1913 | del map <map> [<key>|#<ref>] |
| 1914 | Delete all the map entries from the map <map> corresponding to the key <key>. |
| 1915 | <map> is the #<id> or the <file> returned by "show map". If the <ref> is used, |
| 1916 | this command delete only the listed reference. The reference can be found with |
| 1917 | listing the content of the map. Note that if the reference <map> is a file and |
| 1918 | is shared with a acl, the entry will be also deleted in the map. |
| 1919 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 1920 | del ssl ca-file <cafile> |
| 1921 | Delete a CA file tree entry from HAProxy. The CA file must be unused and |
| 1922 | removed from any crt-list. "show ssl ca-file" displays the status of the CA |
| 1923 | files. The deletion doesn't work with a certificate referenced directly with |
| 1924 | the "ca-file" or "ca-verify-file" directives in the configuration. |
| 1925 | |
William Lallemand | 419e634 | 2020-04-08 12:05:39 +0200 | [diff] [blame] | 1926 | del ssl cert <certfile> |
| 1927 | Delete a certificate store from HAProxy. The certificate must be unused and |
| 1928 | removed from any crt-list or directory. "show ssl cert" displays the status |
| 1929 | of the certificate. The deletion doesn't work with a certificate referenced |
| 1930 | directly with the "crt" directive in the configuration. |
| 1931 | |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 1932 | del ssl crl-file <crlfile> |
| 1933 | Delete a CRL file tree entry from HAProxy. The CRL file must be unused and |
| 1934 | removed from any crt-list. "show ssl crl-file" displays the status of the CRL |
| 1935 | files. The deletion doesn't work with a certificate referenced directly with |
| 1936 | the "crl-file" directive in the configuration. |
| 1937 | |
William Lallemand | 0a9b941 | 2020-04-06 17:43:05 +0200 | [diff] [blame] | 1938 | del ssl crt-list <filename> <certfile[:line]> |
| 1939 | Delete an entry in a crt-list. This will delete every SNIs used for this |
| 1940 | entry in the frontends. If a certificate is used several time in a crt-list, |
| 1941 | you will need to provide which line you want to delete. To display the line |
| 1942 | numbers, use "show ssl crt-list -n <crtlist>". |
| 1943 | |
Amaury Denoyelle | e558043 | 2021-04-15 14:41:20 +0200 | [diff] [blame] | 1944 | del server <backend>/<server> |
Amaury Denoyelle | 14c3c5c | 2021-08-23 14:10:51 +0200 | [diff] [blame] | 1945 | Remove a server attached to the backend <backend>. All servers are eligible, |
| 1946 | except servers which are referenced by other configuration elements. The |
| 1947 | server must be put in maintenance mode prior to its deletion. The operation |
| 1948 | is cancelled if the serveur still has active or idle connection or its |
| 1949 | connection queue is not empty. |
Amaury Denoyelle | e558043 | 2021-04-15 14:41:20 +0200 | [diff] [blame] | 1950 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1951 | disable agent <backend>/<server> |
| 1952 | Mark the auxiliary agent check as temporarily stopped. |
| 1953 | |
| 1954 | In the case where an agent check is being run as a auxiliary check, due |
| 1955 | to the agent-check parameter of a server directive, new checks are only |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 1956 | initialized when the agent is in the enabled. Thus, disable agent will |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1957 | prevent any new agent checks from begin initiated until the agent |
| 1958 | re-enabled using enable agent. |
| 1959 | |
| 1960 | When an agent is disabled the processing of an auxiliary agent check that |
| 1961 | was initiated while the agent was set as enabled is as follows: All |
| 1962 | results that would alter the weight, specifically "drain" or a weight |
| 1963 | returned by the agent, are ignored. The processing of agent check is |
| 1964 | otherwise unchanged. |
| 1965 | |
| 1966 | The motivation for this feature is to allow the weight changing effects |
| 1967 | of the agent checks to be paused to allow the weight of a server to be |
| 1968 | configured using set weight without being overridden by the agent. |
| 1969 | |
| 1970 | This command is restricted and can only be issued on sockets configured for |
| 1971 | level "admin". |
| 1972 | |
Olivier Houchard | 614f8d7 | 2017-03-14 20:08:46 +0100 | [diff] [blame] | 1973 | disable dynamic-cookie backend <backend> |
Ilya Shipitsin | 2a950d0 | 2020-03-06 13:07:38 +0500 | [diff] [blame] | 1974 | Disable the generation of dynamic cookies for the backend <backend> |
Olivier Houchard | 614f8d7 | 2017-03-14 20:08:46 +0100 | [diff] [blame] | 1975 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 1976 | disable frontend <frontend> |
| 1977 | Mark the frontend as temporarily stopped. This corresponds to the mode which |
| 1978 | is used during a soft restart : the frontend releases the port but can be |
| 1979 | enabled again if needed. This should be used with care as some non-Linux OSes |
| 1980 | are unable to enable it back. This is intended to be used in environments |
| 1981 | where stopping a proxy is not even imaginable but a misconfigured proxy must |
| 1982 | be fixed. That way it's possible to release the port and bind it into another |
| 1983 | process to restore operations. The frontend will appear with status "STOP" |
| 1984 | on the stats page. |
| 1985 | |
| 1986 | The frontend may be specified either by its name or by its numeric ID, |
| 1987 | prefixed with a sharp ('#'). |
| 1988 | |
| 1989 | This command is restricted and can only be issued on sockets configured for |
| 1990 | level "admin". |
| 1991 | |
| 1992 | disable health <backend>/<server> |
| 1993 | Mark the primary health check as temporarily stopped. This will disable |
| 1994 | sending of health checks, and the last health check result will be ignored. |
| 1995 | The server will be in unchecked state and considered UP unless an auxiliary |
| 1996 | agent check forces it down. |
| 1997 | |
| 1998 | This command is restricted and can only be issued on sockets configured for |
| 1999 | level "admin". |
| 2000 | |
| 2001 | disable server <backend>/<server> |
| 2002 | Mark the server DOWN for maintenance. In this mode, no more checks will be |
| 2003 | performed on the server until it leaves maintenance. |
| 2004 | If the server is tracked by other servers, those servers will be set to DOWN |
| 2005 | during the maintenance. |
| 2006 | |
| 2007 | In the statistics page, a server DOWN for maintenance will appear with a |
| 2008 | "MAINT" status, its tracking servers with the "MAINT(via)" one. |
| 2009 | |
| 2010 | Both the backend and the server may be specified either by their name or by |
| 2011 | their numeric ID, prefixed with a sharp ('#'). |
| 2012 | |
| 2013 | This command is restricted and can only be issued on sockets configured for |
| 2014 | level "admin". |
| 2015 | |
| 2016 | enable agent <backend>/<server> |
| 2017 | Resume auxiliary agent check that was temporarily stopped. |
| 2018 | |
| 2019 | See "disable agent" for details of the effect of temporarily starting |
| 2020 | and stopping an auxiliary agent. |
| 2021 | |
| 2022 | This command is restricted and can only be issued on sockets configured for |
| 2023 | level "admin". |
| 2024 | |
Olivier Houchard | 614f8d7 | 2017-03-14 20:08:46 +0100 | [diff] [blame] | 2025 | enable dynamic-cookie backend <backend> |
n9@users.noreply.github.com | 25a1c8e | 2019-08-23 11:21:05 +0200 | [diff] [blame] | 2026 | Enable the generation of dynamic cookies for the backend <backend>. |
| 2027 | A secret key must also be provided. |
Olivier Houchard | 614f8d7 | 2017-03-14 20:08:46 +0100 | [diff] [blame] | 2028 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2029 | enable frontend <frontend> |
| 2030 | Resume a frontend which was temporarily stopped. It is possible that some of |
| 2031 | the listening ports won't be able to bind anymore (eg: if another process |
| 2032 | took them since the 'disable frontend' operation). If this happens, an error |
| 2033 | is displayed. Some operating systems might not be able to resume a frontend |
| 2034 | which was disabled. |
| 2035 | |
| 2036 | The frontend may be specified either by its name or by its numeric ID, |
| 2037 | prefixed with a sharp ('#'). |
| 2038 | |
| 2039 | This command is restricted and can only be issued on sockets configured for |
| 2040 | level "admin". |
| 2041 | |
| 2042 | enable health <backend>/<server> |
| 2043 | Resume a primary health check that was temporarily stopped. This will enable |
| 2044 | sending of health checks again. Please see "disable health" for details. |
| 2045 | |
| 2046 | This command is restricted and can only be issued on sockets configured for |
| 2047 | level "admin". |
| 2048 | |
| 2049 | enable server <backend>/<server> |
| 2050 | If the server was previously marked as DOWN for maintenance, this marks the |
| 2051 | server UP and checks are re-enabled. |
| 2052 | |
| 2053 | Both the backend and the server may be specified either by their name or by |
| 2054 | their numeric ID, prefixed with a sharp ('#'). |
| 2055 | |
| 2056 | This command is restricted and can only be issued on sockets configured for |
| 2057 | level "admin". |
| 2058 | |
Amaury Denoyelle | 18487fb | 2021-03-18 15:32:53 +0100 | [diff] [blame] | 2059 | experimental-mode [on|off] |
| 2060 | Without options, this indicates whether the experimental mode is enabled or |
| 2061 | disabled on the current connection. When passed "on", it turns the |
| 2062 | experimental mode on for the current CLI connection only. With "off" it turns |
| 2063 | it off. |
| 2064 | |
| 2065 | The experimental mode is used to access to extra features still in |
| 2066 | development. These features are currently not stable and should be used with |
Ilya Shipitsin | ba13f16 | 2021-03-19 22:21:44 +0500 | [diff] [blame] | 2067 | care. They may be subject to breaking changes across versions. |
Amaury Denoyelle | 18487fb | 2021-03-18 15:32:53 +0100 | [diff] [blame] | 2068 | |
William Lallemand | 7267f78 | 2022-02-01 16:08:50 +0100 | [diff] [blame] | 2069 | When used from the master CLI, this command shouldn't be prefixed, as it will |
| 2070 | set the mode for any worker when connecting to its CLI. |
| 2071 | |
| 2072 | Example: |
Amaury Denoyelle | 76e8b70 | 2022-03-09 15:07:31 +0100 | [diff] [blame] | 2073 | echo "@1; experimental-mode on; <experimental_cmd>..." | socat /var/run/haproxy.master - |
| 2074 | echo "experimental-mode on; @1 <experimental_cmd>..." | socat /var/run/haproxy.master - |
William Lallemand | 7267f78 | 2022-02-01 16:08:50 +0100 | [diff] [blame] | 2075 | |
Willy Tarreau | abb9f9b | 2019-10-24 17:55:53 +0200 | [diff] [blame] | 2076 | expert-mode [on|off] |
Amaury Denoyelle | 18487fb | 2021-03-18 15:32:53 +0100 | [diff] [blame] | 2077 | This command is similar to experimental-mode but is used to toggle the |
| 2078 | expert mode. |
| 2079 | |
| 2080 | The expert mode enables displaying of expert commands that can be extremely |
Willy Tarreau | abb9f9b | 2019-10-24 17:55:53 +0200 | [diff] [blame] | 2081 | dangerous for the process and which may occasionally help developers collect |
| 2082 | important information about complex bugs. Any misuse of these features will |
| 2083 | likely lead to a process crash. Do not use this option without being invited |
| 2084 | to do so. Note that this command is purposely not listed in the help message. |
| 2085 | This command is only accessible in admin level. Changing to another level |
| 2086 | automatically resets the expert mode. |
| 2087 | |
William Lallemand | 7267f78 | 2022-02-01 16:08:50 +0100 | [diff] [blame] | 2088 | When used from the master CLI, this command shouldn't be prefixed, as it will |
| 2089 | set the mode for any worker when connecting to its CLI. |
| 2090 | |
| 2091 | Example: |
| 2092 | echo "@1; expert-mode on; debug dev exit 1" | socat /var/run/haproxy.master - |
| 2093 | echo "expert-mode on; @1 debug dev exit 1" | socat /var/run/haproxy.master - |
| 2094 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2095 | get map <map> <value> |
| 2096 | get acl <acl> <value> |
| 2097 | Lookup the value <value> in the map <map> or in the ACL <acl>. <map> or <acl> |
| 2098 | are the #<id> or the <file> returned by "show map" or "show acl". This command |
| 2099 | returns all the matching patterns associated with this map. This is useful for |
| 2100 | debugging maps and ACLs. The output format is composed by one line par |
| 2101 | matching type. Each line is composed by space-delimited series of words. |
| 2102 | |
| 2103 | The first two words are: |
| 2104 | |
| 2105 | <match method>: The match method applied. It can be "found", "bool", |
| 2106 | "int", "ip", "bin", "len", "str", "beg", "sub", "dir", |
| 2107 | "dom", "end" or "reg". |
| 2108 | |
| 2109 | <match result>: The result. Can be "match" or "no-match". |
| 2110 | |
| 2111 | The following words are returned only if the pattern matches an entry. |
| 2112 | |
| 2113 | <index type>: "tree" or "list". The internal lookup algorithm. |
| 2114 | |
| 2115 | <case>: "case-insensitive" or "case-sensitive". The |
| 2116 | interpretation of the case. |
| 2117 | |
| 2118 | <entry matched>: match="<entry>". Return the matched pattern. It is |
| 2119 | useful with regular expressions. |
| 2120 | |
| 2121 | The two last word are used to show the returned value and its type. With the |
| 2122 | "acl" case, the pattern doesn't exist. |
| 2123 | |
| 2124 | return=nothing: No return because there are no "map". |
| 2125 | return="<value>": The value returned in the string format. |
| 2126 | return=cannot-display: The value cannot be converted as string. |
| 2127 | |
| 2128 | type="<type>": The type of the returned sample. |
| 2129 | |
Willy Tarreau | c35eb38 | 2021-03-26 14:51:31 +0100 | [diff] [blame] | 2130 | get var <name> |
| 2131 | Show the existence, type and contents of the process-wide variable 'name'. |
| 2132 | Only process-wide variables are readable, so the name must begin with |
| 2133 | 'proc.' otherwise no variable will be found. This command requires levels |
| 2134 | "operator" or "admin". |
| 2135 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2136 | get weight <backend>/<server> |
| 2137 | Report the current weight and the initial weight of server <server> in |
| 2138 | backend <backend> or an error if either doesn't exist. The initial weight is |
| 2139 | the one that appears in the configuration file. Both are normally equal |
| 2140 | unless the current weight has been changed. Both the backend and the server |
| 2141 | may be specified either by their name or by their numeric ID, prefixed with a |
| 2142 | sharp ('#'). |
| 2143 | |
Willy Tarreau | 0b1b830 | 2021-05-09 20:59:23 +0200 | [diff] [blame] | 2144 | help [<command>] |
| 2145 | Print the list of known keywords and their basic usage, or commands matching |
| 2146 | the requested one. The same help screen is also displayed for unknown |
| 2147 | commands. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2148 | |
William Lallemand | b175c23 | 2021-10-19 14:53:55 +0200 | [diff] [blame] | 2149 | httpclient <method> <URI> |
| 2150 | Launch an HTTP client request and print the response on the CLI. Only |
| 2151 | supported on a CLI connection running in expert mode (see "expert-mode on"). |
William Lallemand | 9ae05bb | 2022-09-29 15:00:15 +0200 | [diff] [blame] | 2152 | It's only meant for debugging. The httpclient is able to resolve a server |
| 2153 | name in the URL using the "default" resolvers section, which is populated |
| 2154 | with the DNS servers of your /etc/resolv.conf by default. However it won't be |
| 2155 | able to resolve an host from /etc/hosts if you don't use a local dns daemon |
| 2156 | which can resolve those. |
William Lallemand | b175c23 | 2021-10-19 14:53:55 +0200 | [diff] [blame] | 2157 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 2158 | new ssl ca-file <cafile> |
| 2159 | Create a new empty CA file tree entry to be filled with a set of CA |
| 2160 | certificates and added to a crt-list. This command should be used in |
William Lallemand | 62c0b99 | 2022-07-29 17:50:58 +0200 | [diff] [blame] | 2161 | combination with "set ssl ca-file", "add ssl ca-file" and "add ssl crt-list". |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 2162 | |
William Lallemand | accac23 | 2020-04-02 17:42:51 +0200 | [diff] [blame] | 2163 | new ssl cert <filename> |
| 2164 | Create a new empty SSL certificate store to be filled with a certificate and |
| 2165 | added to a directory or a crt-list. This command should be used in |
| 2166 | combination with "set ssl cert" and "add ssl crt-list". |
| 2167 | |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 2168 | new ssl crl-file <crlfile> |
| 2169 | Create a new empty CRL file tree entry to be filled with a set of CRLs |
| 2170 | and added to a crt-list. This command should be used in combination with "set |
| 2171 | ssl crl-file" and "add ssl crt-list". |
| 2172 | |
Willy Tarreau | 97218ce | 2021-04-30 14:57:03 +0200 | [diff] [blame] | 2173 | prepare acl <acl> |
| 2174 | Allocate a new version number in ACL <acl> for atomic replacement. <acl> is |
| 2175 | the #<id> or the <file> returned by "show acl". The new version number is |
| 2176 | shown in response after "New version created:". This number will then be |
| 2177 | usable to prepare additions of new entries into the ACL which will then |
| 2178 | atomically replace the current ones once committed. It is reported as |
| 2179 | "next_ver" in "show acl". There is no impact of allocating new versions, as |
| 2180 | unused versions will automatically be removed once a more recent version is |
| 2181 | committed. Version numbers are unsigned 32-bit values which wrap at the end, |
| 2182 | so care must be taken when comparing them in an external program. This |
| 2183 | command cannot be used if the reference <acl> is a file also used as a map. |
| 2184 | In this case, the "prepare map" command must be used instead. |
| 2185 | |
| 2186 | prepare map <map> |
| 2187 | Allocate a new version number in map <map> for atomic replacement. <map> is |
| 2188 | the #<id> or the <file> returned by "show map". The new version number is |
| 2189 | shown in response after "New version created:". This number will then be |
| 2190 | usable to prepare additions of new entries into the map which will then |
| 2191 | atomically replace the current ones once committed. It is reported as |
| 2192 | "next_ver" in "show map". There is no impact of allocating new versions, as |
| 2193 | unused versions will automatically be removed once a more recent version is |
| 2194 | committed. Version numbers are unsigned 32-bit values which wrap at the end, |
| 2195 | so care must be taken when comparing them in an external program. |
| 2196 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2197 | prompt |
| 2198 | Toggle the prompt at the beginning of the line and enter or leave interactive |
| 2199 | mode. In interactive mode, the connection is not closed after a command |
| 2200 | completes. Instead, the prompt will appear again, indicating the user that |
| 2201 | the interpreter is waiting for a new command. The prompt consists in a right |
| 2202 | angle bracket followed by a space "> ". This mode is particularly convenient |
| 2203 | when one wants to periodically check information such as stats or errors. |
| 2204 | It is also a good idea to enter interactive mode before issuing a "help" |
| 2205 | command. |
| 2206 | |
| 2207 | quit |
| 2208 | Close the connection when in interactive mode. |
| 2209 | |
Erwan Le Goas | 54966df | 2022-09-14 17:24:22 +0200 | [diff] [blame] | 2210 | set anon [on|off] [<key>] |
| 2211 | This command enables or disables the "anonymized mode" for the current CLI |
| 2212 | session, which replaces certain fields considered sensitive or confidential |
| 2213 | in command outputs with hashes that preserve sufficient consistency between |
| 2214 | elements to help developers identify relations between elements when trying |
| 2215 | to spot bugs, but a low enough bit count (24) to make them non-reversible due |
| 2216 | to the high number of possible matches. When turned on, if no key is |
| 2217 | specified, the global key will be used (either specified in the configuration |
Erwan Le Goas | d786931 | 2022-09-29 10:36:11 +0200 | [diff] [blame] | 2218 | file by "anonkey" or set via the CLI command "set anon global-key"). If no such |
Erwan Le Goas | 54966df | 2022-09-14 17:24:22 +0200 | [diff] [blame] | 2219 | key was set, a random one will be generated. Otherwise it's possible to |
| 2220 | specify the 32-bit key to be used for the current session, for example, to |
| 2221 | reuse the key that was used in a previous dump to help compare outputs. |
| 2222 | Developers will never need this key and it's recommended never to share it as |
| 2223 | it could allow to confirm/infirm some guesses about what certain hashes could |
| 2224 | be hiding. |
| 2225 | |
Olivier Houchard | 614f8d7 | 2017-03-14 20:08:46 +0100 | [diff] [blame] | 2226 | set dynamic-cookie-key backend <backend> <value> |
| 2227 | Modify the secret key used to generate the dynamic persistent cookies. |
| 2228 | This will break the existing sessions. |
| 2229 | |
Erwan Le Goas | d786931 | 2022-09-29 10:36:11 +0200 | [diff] [blame] | 2230 | set anon global-key <key> |
Erwan Le Goas | fad9da8 | 2022-09-14 17:24:22 +0200 | [diff] [blame] | 2231 | This sets the global anonymizing key to <key>, which must be a 32-bit |
| 2232 | integer between 0 and 4294967295 (0 disables the global key). This command |
| 2233 | requires admin privilege. |
| 2234 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2235 | set map <map> [<key>|#<ref>] <value> |
| 2236 | Modify the value corresponding to each key <key> in a map <map>. <map> is the |
| 2237 | #<id> or <file> returned by "show map". If the <ref> is used in place of |
| 2238 | <key>, only the entry pointed by <ref> is changed. The new value is <value>. |
| 2239 | |
| 2240 | set maxconn frontend <frontend> <value> |
| 2241 | Dynamically change the specified frontend's maxconn setting. Any positive |
| 2242 | value is allowed including zero, but setting values larger than the global |
| 2243 | maxconn does not make much sense. If the limit is increased and connections |
| 2244 | were pending, they will immediately be accepted. If it is lowered to a value |
| 2245 | below the current number of connections, new connections acceptation will be |
| 2246 | delayed until the threshold is reached. The frontend might be specified by |
| 2247 | either its name or its numeric ID prefixed with a sharp ('#'). |
| 2248 | |
Andrew Hayworth | edb93a7 | 2015-10-27 21:46:25 +0000 | [diff] [blame] | 2249 | set maxconn server <backend/server> <value> |
| 2250 | Dynamically change the specified server's maxconn setting. Any positive |
| 2251 | value is allowed including zero, but setting values larger than the global |
| 2252 | maxconn does not make much sense. |
| 2253 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2254 | set maxconn global <maxconn> |
| 2255 | Dynamically change the global maxconn setting within the range defined by the |
| 2256 | initial global maxconn setting. If it is increased and connections were |
| 2257 | pending, they will immediately be accepted. If it is lowered to a value below |
| 2258 | the current number of connections, new connections acceptation will be |
| 2259 | delayed until the threshold is reached. A value of zero restores the initial |
| 2260 | setting. |
| 2261 | |
Willy Tarreau | 00dd44f | 2021-05-05 16:44:23 +0200 | [diff] [blame] | 2262 | set profiling { tasks | memory } { auto | on | off } |
| 2263 | Enables or disables CPU or memory profiling for the indicated subsystem. This |
| 2264 | is equivalent to setting or clearing the "profiling" settings in the "global" |
Willy Tarreau | cfa7101 | 2021-01-29 11:56:21 +0100 | [diff] [blame] | 2265 | section of the configuration file. Please also see "show profiling". Note |
| 2266 | that manually setting the tasks profiling to "on" automatically resets the |
| 2267 | scheduler statistics, thus allows to check activity over a given interval. |
Willy Tarreau | 00dd44f | 2021-05-05 16:44:23 +0200 | [diff] [blame] | 2268 | The memory profiling is limited to certain operating systems (known to work |
| 2269 | on the linux-glibc target), and requires USE_MEMORY_PROFILING to be set at |
| 2270 | compile time. |
Willy Tarreau | 75c62c2 | 2018-11-22 11:02:09 +0100 | [diff] [blame] | 2271 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2272 | set rate-limit connections global <value> |
| 2273 | Change the process-wide connection rate limit, which is set by the global |
| 2274 | 'maxconnrate' setting. A value of zero disables the limitation. This limit |
| 2275 | applies to all frontends and the change has an immediate effect. The value |
| 2276 | is passed in number of connections per second. |
| 2277 | |
| 2278 | set rate-limit http-compression global <value> |
| 2279 | Change the maximum input compression rate, which is set by the global |
| 2280 | 'maxcomprate' setting. A value of zero disables the limitation. The value is |
| 2281 | passed in number of kilobytes per second. The value is available in the "show |
| 2282 | info" on the line "CompressBpsRateLim" in bytes. |
| 2283 | |
| 2284 | set rate-limit sessions global <value> |
| 2285 | Change the process-wide session rate limit, which is set by the global |
| 2286 | 'maxsessrate' setting. A value of zero disables the limitation. This limit |
| 2287 | applies to all frontends and the change has an immediate effect. The value |
| 2288 | is passed in number of sessions per second. |
| 2289 | |
| 2290 | set rate-limit ssl-sessions global <value> |
| 2291 | Change the process-wide SSL session rate limit, which is set by the global |
| 2292 | 'maxsslrate' setting. A value of zero disables the limitation. This limit |
| 2293 | applies to all frontends and the change has an immediate effect. The value |
| 2294 | is passed in number of sessions per second sent to the SSL stack. It applies |
| 2295 | before the handshake in order to protect the stack against handshake abuses. |
| 2296 | |
Baptiste Assmann | 3749ebf | 2016-08-03 22:34:12 +0200 | [diff] [blame] | 2297 | set server <backend>/<server> addr <ip4 or ip6 address> [port <port>] |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2298 | Replace the current IP address of a server by the one provided. |
Michael Prokop | 4438c60 | 2019-05-24 10:25:45 +0200 | [diff] [blame] | 2299 | Optionally, the port can be changed using the 'port' parameter. |
Baptiste Assmann | 3749ebf | 2016-08-03 22:34:12 +0200 | [diff] [blame] | 2300 | Note that changing the port also support switching from/to port mapping |
| 2301 | (notation with +X or -Y), only if a port is configured for the health check. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2302 | |
| 2303 | set server <backend>/<server> agent [ up | down ] |
| 2304 | Force a server's agent to a new state. This can be useful to immediately |
| 2305 | switch a server's state regardless of some slow agent checks for example. |
| 2306 | Note that the change is propagated to tracking servers if any. |
| 2307 | |
William Dauchy | 7cabc06 | 2021-02-11 22:51:24 +0100 | [diff] [blame] | 2308 | set server <backend>/<server> agent-addr <addr> [port <port>] |
Misiek | 4397290 | 2017-01-09 09:53:06 +0100 | [diff] [blame] | 2309 | Change addr for servers agent checks. Allows to migrate agent-checks to |
| 2310 | another address at runtime. You can specify both IP and hostname, it will be |
| 2311 | resolved. |
William Dauchy | 7cabc06 | 2021-02-11 22:51:24 +0100 | [diff] [blame] | 2312 | Optionally, change the port agent. |
| 2313 | |
| 2314 | set server <backend>/<server> agent-port <port> |
| 2315 | Change the port used for agent checks. |
Misiek | 4397290 | 2017-01-09 09:53:06 +0100 | [diff] [blame] | 2316 | |
| 2317 | set server <backend>/<server> agent-send <value> |
| 2318 | Change agent string sent to agent check target. Allows to update string while |
| 2319 | changing server address to keep those two matching. |
| 2320 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2321 | set server <backend>/<server> health [ up | stopping | down ] |
| 2322 | Force a server's health to a new state. This can be useful to immediately |
| 2323 | switch a server's state regardless of some slow health checks for example. |
| 2324 | Note that the change is propagated to tracking servers if any. |
| 2325 | |
William Dauchy | b456e1f | 2021-02-11 22:51:23 +0100 | [diff] [blame] | 2326 | set server <backend>/<server> check-addr <ip4 | ip6> [port <port>] |
| 2327 | Change the IP address used for server health checks. |
| 2328 | Optionally, change the port used for server health checks. |
| 2329 | |
Baptiste Assmann | 5094656 | 2016-08-31 23:26:29 +0200 | [diff] [blame] | 2330 | set server <backend>/<server> check-port <port> |
| 2331 | Change the port used for health checking to <port> |
| 2332 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2333 | set server <backend>/<server> state [ ready | drain | maint ] |
| 2334 | Force a server's administrative state to a new state. This can be useful to |
| 2335 | disable load balancing and/or any traffic to a server. Setting the state to |
| 2336 | "ready" puts the server in normal mode, and the command is the equivalent of |
| 2337 | the "enable server" command. Setting the state to "maint" disables any traffic |
| 2338 | to the server as well as any health checks. This is the equivalent of the |
| 2339 | "disable server" command. Setting the mode to "drain" only removes the server |
| 2340 | from load balancing but still allows it to be checked and to accept new |
| 2341 | persistent connections. Changes are propagated to tracking servers if any. |
| 2342 | |
| 2343 | set server <backend>/<server> weight <weight>[%] |
| 2344 | Change a server's weight to the value passed in argument. This is the exact |
| 2345 | equivalent of the "set weight" command below. |
| 2346 | |
Frédéric Lécaille | b418c12 | 2017-04-26 11:24:02 +0200 | [diff] [blame] | 2347 | set server <backend>/<server> fqdn <FQDN> |
Lukas Tribus | c5dd5a5 | 2018-08-14 11:39:35 +0200 | [diff] [blame] | 2348 | Change a server's FQDN to the value passed in argument. This requires the |
| 2349 | internal run-time DNS resolver to be configured and enabled for this server. |
Frédéric Lécaille | b418c12 | 2017-04-26 11:24:02 +0200 | [diff] [blame] | 2350 | |
William Lallemand | 9998a33 | 2022-01-19 15:17:08 +0100 | [diff] [blame] | 2351 | set server <backend>/<server> ssl [ on | off ] (deprecated) |
William Dauchy | f637044 | 2020-11-14 19:25:33 +0100 | [diff] [blame] | 2352 | This option configures SSL ciphering on outgoing connections to the server. |
William Dauchy | a087f87 | 2022-01-06 16:57:15 +0100 | [diff] [blame] | 2353 | When switch off, all traffic becomes plain text; health check path is not |
| 2354 | changed. |
William Dauchy | f637044 | 2020-11-14 19:25:33 +0100 | [diff] [blame] | 2355 | |
William Lallemand | 9998a33 | 2022-01-19 15:17:08 +0100 | [diff] [blame] | 2356 | This command is deprecated, create a new server dynamically with or without |
| 2357 | SSL instead, using the "add server" command. |
| 2358 | |
Andjelko Iharos | c4df59e | 2017-07-20 11:59:48 +0200 | [diff] [blame] | 2359 | set severity-output [ none | number | string ] |
| 2360 | Change the severity output format of the stats socket connected to for the |
| 2361 | duration of the current session. |
| 2362 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 2363 | set ssl ca-file <cafile> <payload> |
William Lallemand | 62c0b99 | 2022-07-29 17:50:58 +0200 | [diff] [blame] | 2364 | this command is part of a transaction system, the "commit ssl ca-file" and |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 2365 | "abort ssl ca-file" commands could be required. |
William Lallemand | 62c0b99 | 2022-07-29 17:50:58 +0200 | [diff] [blame] | 2366 | if there is no on-going transaction, it will create a ca file tree entry into |
| 2367 | which the certificates contained in the payload will be stored. the ca file |
| 2368 | entry will not be stored in the ca file tree and will only be kept in a |
| 2369 | temporary transaction. if a transaction with the same filename already exists, |
| 2370 | the previous ca file entry will be deleted and replaced by the new one. |
| 2371 | once the modifications are done, you have to commit the transaction through |
| 2372 | a "commit ssl ca-file" call. If you want to add multiple certificates |
| 2373 | separately, you can use the "add ssl ca-file" command |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 2374 | |
| 2375 | Example: |
| 2376 | echo -e "set ssl ca-file cafile.pem <<\n$(cat rootCA.crt)\n" | \ |
| 2377 | socat /var/run/haproxy.stat - |
| 2378 | echo "commit ssl ca-file cafile.pem" | socat /var/run/haproxy.stat - |
| 2379 | |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 2380 | set ssl cert <filename> <payload> |
| 2381 | This command is part of a transaction system, the "commit ssl cert" and |
| 2382 | "abort ssl cert" commands could be required. |
Remi Tricot-Le Breton | 3445909 | 2021-04-14 16:19:28 +0200 | [diff] [blame] | 2383 | This whole transaction system works on any certificate displayed by the |
Remi Tricot-Le Breton | b5f0fac | 2021-04-14 16:19:29 +0200 | [diff] [blame] | 2384 | "show ssl cert" command, so on any frontend or backend certificate. |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 2385 | If there is no on-going transaction, it will duplicate the certificate |
| 2386 | <filename> in memory to a temporary transaction, then update this |
| 2387 | transaction with the PEM file in the payload. If a transaction exists with |
| 2388 | the same filename, it will update this transaction. It's also possible to |
| 2389 | update the files linked to a certificate (.issuer, .sctl, .oscp etc.) |
| 2390 | Once the modification are done, you have to "commit ssl cert" the |
| 2391 | transaction. |
| 2392 | |
William Lallemand | ed8bfad | 2021-09-16 17:30:51 +0200 | [diff] [blame] | 2393 | Injection of files over the CLI must be done with caution since an empty line |
| 2394 | is used to notify the end of the payload. It is recommended to inject a PEM |
| 2395 | file which has been sanitized. A simple method would be to remove every empty |
| 2396 | line and only leave what are in the PEM sections. It could be achieved with a |
| 2397 | sed command. |
| 2398 | |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 2399 | Example: |
William Lallemand | ed8bfad | 2021-09-16 17:30:51 +0200 | [diff] [blame] | 2400 | |
| 2401 | # With some simple sanitizing |
| 2402 | echo -e "set ssl cert localhost.pem <<\n$(sed -n '/^$/d;/-BEGIN/,/-END/p' 127.0.0.1.pem)\n" | \ |
| 2403 | socat /var/run/haproxy.stat - |
| 2404 | |
| 2405 | # Complete example with commit |
William Lallemand | 6ab08b3 | 2019-11-29 16:48:43 +0100 | [diff] [blame] | 2406 | echo -e "set ssl cert localhost.pem <<\n$(cat 127.0.0.1.pem)\n" | \ |
| 2407 | socat /var/run/haproxy.stat - |
| 2408 | echo -e \ |
| 2409 | "set ssl cert localhost.pem.issuer <<\n $(cat 127.0.0.1.pem.issuer)\n" | \ |
| 2410 | socat /var/run/haproxy.stat - |
| 2411 | echo -e \ |
| 2412 | "set ssl cert localhost.pem.ocsp <<\n$(base64 -w 1000 127.0.0.1.pem.ocsp)\n" | \ |
| 2413 | socat /var/run/haproxy.stat - |
| 2414 | echo "commit ssl cert localhost.pem" | socat /var/run/haproxy.stat - |
| 2415 | |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 2416 | set ssl crl-file <crlfile> <payload> |
| 2417 | This command is part of a transaction system, the "commit ssl crl-file" and |
| 2418 | "abort ssl crl-file" commands could be required. |
| 2419 | If there is no on-going transaction, it will create a CRL file tree entry into |
| 2420 | which the Revocation Lists contained in the payload will be stored. The CRL |
| 2421 | file entry will not be stored in the CRL file tree and will only be kept in a |
| 2422 | temporary transaction. If a transaction with the same filename already exists, |
| 2423 | the previous CRL file entry will be deleted and replaced by the new one. |
| 2424 | Once the modifications are done, you have to commit the transaction through |
| 2425 | a "commit ssl crl-file" call. |
| 2426 | |
| 2427 | Example: |
| 2428 | echo -e "set ssl crl-file crlfile.pem <<\n$(cat rootCRL.pem)\n" | \ |
| 2429 | socat /var/run/haproxy.stat - |
| 2430 | echo "commit ssl crl-file crlfile.pem" | socat /var/run/haproxy.stat - |
| 2431 | |
Aurélien Nephtali | 1e0867c | 2018-04-18 14:04:58 +0200 | [diff] [blame] | 2432 | set ssl ocsp-response <response | payload> |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2433 | This command is used to update an OCSP Response for a certificate (see "crt" |
| 2434 | on "bind" lines). Same controls are performed as during the initial loading of |
| 2435 | the response. The <response> must be passed as a base64 encoded string of the |
Emmanuel Hocdet | 2c32d8f | 2017-05-22 14:58:00 +0200 | [diff] [blame] | 2436 | DER encoded response from the OCSP server. This command is not supported with |
| 2437 | BoringSSL. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2438 | |
| 2439 | Example: |
| 2440 | openssl ocsp -issuer issuer.pem -cert server.pem \ |
| 2441 | -host ocsp.issuer.com:80 -respout resp.der |
| 2442 | echo "set ssl ocsp-response $(base64 -w 10000 resp.der)" | \ |
| 2443 | socat stdio /var/run/haproxy.stat |
| 2444 | |
Aurélien Nephtali | 1e0867c | 2018-04-18 14:04:58 +0200 | [diff] [blame] | 2445 | using the payload syntax: |
| 2446 | echo -e "set ssl ocsp-response <<\n$(base64 resp.der)\n" | \ |
| 2447 | socat stdio /var/run/haproxy.stat |
| 2448 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2449 | set ssl tls-key <id> <tlskey> |
| 2450 | Set the next TLS key for the <id> listener to <tlskey>. This key becomes the |
| 2451 | ultimate key, while the penultimate one is used for encryption (others just |
| 2452 | decrypt). The oldest TLS key present is overwritten. <id> is either a numeric |
| 2453 | #<id> or <file> returned by "show tls-keys". <tlskey> is a base64 encoded 48 |
Emeric Brun | 9e75477 | 2019-01-10 17:51:55 +0100 | [diff] [blame] | 2454 | or 80 bits TLS ticket key (ex. openssl rand 80 | openssl base64 -A). |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2455 | |
| 2456 | set table <table> key <key> [data.<data_type> <value>]* |
| 2457 | Create or update a stick-table entry in the table. If the key is not present, |
| 2458 | an entry is inserted. See stick-table in section 4.2 to find all possible |
| 2459 | values for <data_type>. The most likely use consists in dynamically entering |
| 2460 | entries for source IP addresses, with a flag in gpc0 to dynamically block an |
| 2461 | IP address or affect its quality of service. It is possible to pass multiple |
| 2462 | data_types in a single call. |
| 2463 | |
| 2464 | set timeout cli <delay> |
| 2465 | Change the CLI interface timeout for current connection. This can be useful |
| 2466 | during long debugging sessions where the user needs to constantly inspect |
| 2467 | some indicators without being disconnected. The delay is passed in seconds. |
| 2468 | |
Willy Tarreau | 4000ff0 | 2021-04-30 14:45:53 +0200 | [diff] [blame] | 2469 | set var <name> <expression> |
Willy Tarreau | e93bff4 | 2021-09-03 09:47:37 +0200 | [diff] [blame] | 2470 | set var <name> expr <expression> |
| 2471 | set var <name> fmt <format> |
Willy Tarreau | 4000ff0 | 2021-04-30 14:45:53 +0200 | [diff] [blame] | 2472 | Allows to set or overwrite the process-wide variable 'name' with the result |
Willy Tarreau | e93bff4 | 2021-09-03 09:47:37 +0200 | [diff] [blame] | 2473 | of expression <expression> or format string <format>. Only process-wide |
| 2474 | variables may be used, so the name must begin with 'proc.' otherwise no |
| 2475 | variable will be set. The <expression> and <format> may only involve |
| 2476 | "internal" sample fetch keywords and converters even though the most likely |
| 2477 | useful ones will be str('something'), int(), simple strings or references to |
| 2478 | other variables. Note that the command line parser doesn't know about quotes, |
| 2479 | so any space in the expression must be preceded by a backslash. This command |
| 2480 | requires levels "operator" or "admin". This command is only supported on a |
| 2481 | CLI connection running in experimental mode (see "experimental-mode on"). |
Willy Tarreau | 4000ff0 | 2021-04-30 14:45:53 +0200 | [diff] [blame] | 2482 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2483 | set weight <backend>/<server> <weight>[%] |
| 2484 | Change a server's weight to the value passed in argument. If the value ends |
| 2485 | with the '%' sign, then the new weight will be relative to the initially |
| 2486 | configured weight. Absolute weights are permitted between 0 and 256. |
| 2487 | Relative weights must be positive with the resulting absolute weight is |
| 2488 | capped at 256. Servers which are part of a farm running a static |
| 2489 | load-balancing algorithm have stricter limitations because the weight |
| 2490 | cannot change once set. Thus for these servers, the only accepted values |
| 2491 | are 0 and 100% (or 0 and the initial weight). Changes take effect |
| 2492 | immediately, though certain LB algorithms require a certain amount of |
| 2493 | requests to consider changes. A typical usage of this command is to |
| 2494 | disable a server during an update by setting its weight to zero, then to |
| 2495 | enable it again after the update by setting it back to 100%. This command |
| 2496 | is restricted and can only be issued on sockets configured for level |
| 2497 | "admin". Both the backend and the server may be specified either by their |
| 2498 | name or by their numeric ID, prefixed with a sharp ('#'). |
| 2499 | |
Willy Tarreau | 95f753e | 2021-04-30 12:09:54 +0200 | [diff] [blame] | 2500 | show acl [[@<ver>] <acl>] |
Willy Tarreau | d6129fc | 2017-07-28 16:52:23 +0200 | [diff] [blame] | 2501 | Dump info about acl converters. Without argument, the list of all available |
Willy Tarreau | 95f753e | 2021-04-30 12:09:54 +0200 | [diff] [blame] | 2502 | acls is returned. If a <acl> is specified, its contents are dumped. <acl> is |
| 2503 | the #<id> or <file>. By default the current version of the ACL is shown (the |
| 2504 | version currently being matched against and reported as 'curr_ver' in the ACL |
| 2505 | list). It is possible to instead dump other versions by prepending '@<ver>' |
| 2506 | before the ACL's identifier. The version works as a filter and non-existing |
| 2507 | versions will simply report no result. The dump format is the same as for the |
| 2508 | maps even for the sample values. The data returned are not a list of |
| 2509 | available ACL, but are the list of all patterns composing any ACL. Many of |
Dragan Dosen | a75eea7 | 2021-05-21 16:59:15 +0200 | [diff] [blame] | 2510 | these patterns can be shared with maps. The 'entry_cnt' value represents the |
| 2511 | count of all the ACL entries, not just the active ones, which means that it |
| 2512 | also includes entries currently being added. |
Willy Tarreau | d6129fc | 2017-07-28 16:52:23 +0200 | [diff] [blame] | 2513 | |
Erwan Le Goas | 54966df | 2022-09-14 17:24:22 +0200 | [diff] [blame] | 2514 | show anon |
| 2515 | Display the current state of the anonymized mode (enabled or disabled) and |
| 2516 | the current session's key. |
| 2517 | |
Willy Tarreau | d6129fc | 2017-07-28 16:52:23 +0200 | [diff] [blame] | 2518 | show backend |
| 2519 | Dump the list of backends available in the running process |
| 2520 | |
William Lallemand | 67a234f | 2018-12-13 09:05:45 +0100 | [diff] [blame] | 2521 | show cli level |
| 2522 | Display the CLI level of the current CLI session. The result could be |
| 2523 | 'admin', 'operator' or 'user'. See also the 'operator' and 'user' commands. |
| 2524 | |
| 2525 | Example : |
| 2526 | |
| 2527 | $ socat /tmp/sock1 readline |
| 2528 | prompt |
| 2529 | > operator |
| 2530 | > show cli level |
| 2531 | operator |
| 2532 | > user |
| 2533 | > show cli level |
| 2534 | user |
| 2535 | > operator |
| 2536 | Permission denied |
| 2537 | |
| 2538 | operator |
| 2539 | Decrease the CLI level of the current CLI session to operator. It can't be |
Amaury Denoyelle | 18487fb | 2021-03-18 15:32:53 +0100 | [diff] [blame] | 2540 | increased. It also drops expert and experimental mode. See also "show cli |
| 2541 | level". |
William Lallemand | 67a234f | 2018-12-13 09:05:45 +0100 | [diff] [blame] | 2542 | |
| 2543 | user |
| 2544 | Decrease the CLI level of the current CLI session to user. It can't be |
Amaury Denoyelle | 18487fb | 2021-03-18 15:32:53 +0100 | [diff] [blame] | 2545 | increased. It also drops expert and experimental mode. See also "show cli |
| 2546 | level". |
William Lallemand | 67a234f | 2018-12-13 09:05:45 +0100 | [diff] [blame] | 2547 | |
Willy Tarreau | 9a7fa90 | 2022-07-15 16:51:16 +0200 | [diff] [blame] | 2548 | show activity [-1 | 0 | thread_num] |
Willy Tarreau | 4c35693 | 2019-05-16 17:39:32 +0200 | [diff] [blame] | 2549 | Reports some counters about internal events that will help developers and |
| 2550 | more generally people who know haproxy well enough to narrow down the causes |
| 2551 | of reports of abnormal behaviours. A typical example would be a properly |
| 2552 | running process never sleeping and eating 100% of the CPU. The output fields |
| 2553 | will be made of one line per metric, and per-thread counters on the same |
Thayne McCombs | cdbcca9 | 2021-01-07 21:24:41 -0700 | [diff] [blame] | 2554 | line. These counters are 32-bit and will wrap during the process's life, which |
Willy Tarreau | 4c35693 | 2019-05-16 17:39:32 +0200 | [diff] [blame] | 2555 | is not a problem since calls to this command will typically be performed |
| 2556 | twice. The fields are purposely not documented so that their exact meaning is |
| 2557 | verified in the code where the counters are fed. These values are also reset |
Willy Tarreau | 9a7fa90 | 2022-07-15 16:51:16 +0200 | [diff] [blame] | 2558 | by the "clear counters" command. On multi-threaded deployments, the first |
| 2559 | column will indicate the total (or average depending on the nature of the |
| 2560 | metric) for all threads, and the list of all threads' values will be |
| 2561 | represented between square brackets in the thread order. Optionally the |
| 2562 | thread number to be dumped may be specified in argument. The special value |
| 2563 | "0" will report the aggregated value (first column), and "-1", which is the |
| 2564 | default, will display all the columns. Note that just like in single-threaded |
| 2565 | mode, there will be no brackets when a single column is requested. |
Willy Tarreau | 4c35693 | 2019-05-16 17:39:32 +0200 | [diff] [blame] | 2566 | |
William Lallemand | 5113216 | 2016-12-16 16:38:58 +0100 | [diff] [blame] | 2567 | show cli sockets |
| 2568 | List CLI sockets. The output format is composed of 3 fields separated by |
| 2569 | spaces. The first field is the socket address, it can be a unix socket, a |
| 2570 | ipv4 address:port couple or a ipv6 one. Socket of other types won't be dump. |
| 2571 | The second field describe the level of the socket: 'admin', 'user' or |
| 2572 | 'operator'. The last field list the processes on which the socket is bound, |
| 2573 | separated by commas, it can be numbers or 'all'. |
| 2574 | |
| 2575 | Example : |
| 2576 | |
| 2577 | $ echo 'show cli sockets' | socat stdio /tmp/sock1 |
| 2578 | # socket lvl processes |
| 2579 | /tmp/sock1 admin all |
| 2580 | 127.0.0.1:9999 user 2,3,4 |
| 2581 | 127.0.0.2:9969 user 2 |
| 2582 | [::1]:9999 operator 2 |
| 2583 | |
William Lallemand | 86d0df0 | 2017-11-24 21:36:45 +0100 | [diff] [blame] | 2584 | show cache |
Cyril Bonté | 7b888f1 | 2017-11-26 22:24:31 +0100 | [diff] [blame] | 2585 | List the configured caches and the objects stored in each cache tree. |
William Lallemand | 86d0df0 | 2017-11-24 21:36:45 +0100 | [diff] [blame] | 2586 | |
| 2587 | $ echo 'show cache' | socat stdio /tmp/sock1 |
| 2588 | 0x7f6ac6c5b03a: foobar (shctx:0x7f6ac6c5b000, available blocks:3918) |
| 2589 | 1 2 3 4 |
| 2590 | |
| 2591 | 1. pointer to the cache structure |
| 2592 | 2. cache name |
| 2593 | 3. pointer to the mmap area (shctx) |
| 2594 | 4. number of blocks available for reuse in the shctx |
| 2595 | |
Remi Tricot-Le Breton | e3e1e5f | 2020-11-27 15:48:40 +0100 | [diff] [blame] | 2596 | 0x7f6ac6c5b4cc hash:286881868 vary:0x0011223344556677 size:39114 (39 blocks), refcount:9, expire:237 |
| 2597 | 1 2 3 4 5 6 7 |
William Lallemand | 86d0df0 | 2017-11-24 21:36:45 +0100 | [diff] [blame] | 2598 | |
| 2599 | 1. pointer to the cache entry |
| 2600 | 2. first 32 bits of the hash |
Remi Tricot-Le Breton | e3e1e5f | 2020-11-27 15:48:40 +0100 | [diff] [blame] | 2601 | 3. secondary hash of the entry in case of vary |
| 2602 | 4. size of the object in bytes |
| 2603 | 5. number of blocks used for the object |
| 2604 | 6. number of transactions using the entry |
| 2605 | 7. expiration time, can be negative if already expired |
William Lallemand | 86d0df0 | 2017-11-24 21:36:45 +0100 | [diff] [blame] | 2606 | |
Willy Tarreau | ae79572 | 2016-02-16 11:27:28 +0100 | [diff] [blame] | 2607 | show env [<name>] |
| 2608 | Dump one or all environment variables known by the process. Without any |
| 2609 | argument, all variables are dumped. With an argument, only the specified |
| 2610 | variable is dumped if it exists. Otherwise "Variable not found" is emitted. |
| 2611 | Variables are dumped in the same format as they are stored or returned by the |
| 2612 | "env" utility, that is, "<name>=<value>". This can be handy when debugging |
| 2613 | certain configuration files making heavy use of environment variables to |
| 2614 | ensure that they contain the expected values. This command is restricted and |
| 2615 | can only be issued on sockets configured for levels "operator" or "admin". |
| 2616 | |
Willy Tarreau | 35069f8 | 2016-11-25 09:16:37 +0100 | [diff] [blame] | 2617 | show errors [<iid>|<proxy>] [request|response] |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2618 | Dump last known request and response errors collected by frontends and |
| 2619 | backends. If <iid> is specified, the limit the dump to errors concerning |
Willy Tarreau | 234ba2d | 2016-11-25 08:39:10 +0100 | [diff] [blame] | 2620 | either frontend or backend whose ID is <iid>. Proxy ID "-1" will cause |
| 2621 | all instances to be dumped. If a proxy name is specified instead, its ID |
Willy Tarreau | 35069f8 | 2016-11-25 09:16:37 +0100 | [diff] [blame] | 2622 | will be used as the filter. If "request" or "response" is added after the |
| 2623 | proxy name or ID, only request or response errors will be dumped. This |
| 2624 | command is restricted and can only be issued on sockets configured for |
| 2625 | levels "operator" or "admin". |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2626 | |
| 2627 | The errors which may be collected are the last request and response errors |
| 2628 | caused by protocol violations, often due to invalid characters in header |
| 2629 | names. The report precisely indicates what exact character violated the |
| 2630 | protocol. Other important information such as the exact date the error was |
| 2631 | detected, frontend and backend names, the server name (when known), the |
| 2632 | internal session ID and the source address which has initiated the session |
| 2633 | are reported too. |
| 2634 | |
| 2635 | All characters are returned, and non-printable characters are encoded. The |
| 2636 | most common ones (\t = 9, \n = 10, \r = 13 and \e = 27) are encoded as one |
| 2637 | letter following a backslash. The backslash itself is encoded as '\\' to |
| 2638 | avoid confusion. Other non-printable characters are encoded '\xNN' where |
| 2639 | NN is the two-digits hexadecimal representation of the character's ASCII |
| 2640 | code. |
| 2641 | |
| 2642 | Lines are prefixed with the position of their first character, starting at 0 |
| 2643 | for the beginning of the buffer. At most one input line is printed per line, |
| 2644 | and large lines will be broken into multiple consecutive output lines so that |
| 2645 | the output never goes beyond 79 characters wide. It is easy to detect if a |
| 2646 | line was broken, because it will not end with '\n' and the next line's offset |
| 2647 | will be followed by a '+' sign, indicating it is a continuation of previous |
| 2648 | line. |
| 2649 | |
| 2650 | Example : |
Willy Tarreau | 35069f8 | 2016-11-25 09:16:37 +0100 | [diff] [blame] | 2651 | $ echo "show errors -1 response" | socat stdio /tmp/sock1 |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2652 | >>> [04/Mar/2009:15:46:56.081] backend http-in (#2) : invalid response |
| 2653 | src 127.0.0.1, session #54, frontend fe-eth0 (#1), server s2 (#1) |
| 2654 | response length 213 bytes, error at position 23: |
| 2655 | |
| 2656 | 00000 HTTP/1.0 200 OK\r\n |
| 2657 | 00017 header/bizarre:blah\r\n |
| 2658 | 00038 Location: blah\r\n |
| 2659 | 00054 Long-line: this is a very long line which should b |
| 2660 | 00104+ e broken into multiple lines on the output buffer, |
| 2661 | 00154+ otherwise it would be too large to print in a ter |
| 2662 | 00204+ minal\r\n |
| 2663 | 00211 \r\n |
| 2664 | |
| 2665 | In the example above, we see that the backend "http-in" which has internal |
| 2666 | ID 2 has blocked an invalid response from its server s2 which has internal |
| 2667 | ID 1. The request was on session 54 initiated by source 127.0.0.1 and |
| 2668 | received by frontend fe-eth0 whose ID is 1. The total response length was |
| 2669 | 213 bytes when the error was detected, and the error was at byte 23. This |
| 2670 | is the slash ('/') in header name "header/bizarre", which is not a valid |
| 2671 | HTTP character for a header name. |
| 2672 | |
Willy Tarreau | 1d181e4 | 2019-08-30 11:17:01 +0200 | [diff] [blame] | 2673 | show events [<sink>] [-w] [-n] |
Willy Tarreau | 9f830d7 | 2019-08-26 18:17:04 +0200 | [diff] [blame] | 2674 | With no option, this lists all known event sinks and their types. With an |
| 2675 | option, it will dump all available events in the designated sink if it is of |
Willy Tarreau | 1d181e4 | 2019-08-30 11:17:01 +0200 | [diff] [blame] | 2676 | type buffer. If option "-w" is passed after the sink name, then once the end |
| 2677 | of the buffer is reached, the command will wait for new events and display |
| 2678 | them. It is possible to stop the operation by entering any input (which will |
| 2679 | be discarded) or by closing the session. Finally, option "-n" is used to |
| 2680 | directly seek to the end of the buffer, which is often convenient when |
| 2681 | combined with "-w" to only report new events. For convenience, "-wn" or "-nw" |
| 2682 | may be used to enable both options at once. |
Willy Tarreau | 9f830d7 | 2019-08-26 18:17:04 +0200 | [diff] [blame] | 2683 | |
Willy Tarreau | 7a4a0ac | 2017-07-25 19:32:50 +0200 | [diff] [blame] | 2684 | show fd [<fd>] |
| 2685 | Dump the list of either all open file descriptors or just the one number <fd> |
| 2686 | if specified. This is only aimed at developers who need to observe internal |
| 2687 | states in order to debug complex issues such as abnormal CPU usages. One fd |
| 2688 | is reported per lines, and for each of them, its state in the poller using |
| 2689 | upper case letters for enabled flags and lower case for disabled flags, using |
| 2690 | "P" for "polled", "R" for "ready", "A" for "active", the events status using |
| 2691 | "H" for "hangup", "E" for "error", "O" for "output", "P" for "priority" and |
| 2692 | "I" for "input", a few other flags like "N" for "new" (just added into the fd |
| 2693 | cache), "U" for "updated" (received an update in the fd cache), "L" for |
| 2694 | "linger_risk", "C" for "cloned", then the cached entry position, the pointer |
| 2695 | to the internal owner, the pointer to the I/O callback and its name when |
| 2696 | known. When the owner is a connection, the connection flags, and the target |
| 2697 | are reported (frontend, proxy or server). When the owner is a listener, the |
| 2698 | listener's state and its frontend are reported. There is no point in using |
| 2699 | this command without a good knowledge of the internals. It's worth noting |
| 2700 | that the output format may evolve over time so this output must not be parsed |
Willy Tarreau | 8050efe | 2021-01-21 08:26:06 +0100 | [diff] [blame] | 2701 | by tools designed to be durable. Some internal structure states may look |
| 2702 | suspicious to the function listing them, in this case the output line will be |
| 2703 | suffixed with an exclamation mark ('!'). This may help find a starting point |
| 2704 | when trying to diagnose an incident. |
Willy Tarreau | 7a4a0ac | 2017-07-25 19:32:50 +0200 | [diff] [blame] | 2705 | |
Willy Tarreau | 2745620 | 2021-05-08 07:54:24 +0200 | [diff] [blame] | 2706 | show info [typed|json] [desc] [float] |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 2707 | Dump info about haproxy status on current process. If "typed" is passed as an |
| 2708 | optional argument, field numbers, names and types are emitted as well so that |
| 2709 | external monitoring products can easily retrieve, possibly aggregate, then |
| 2710 | report information found in fields they don't know. Each field is dumped on |
Simon Horman | 05ee213 | 2017-01-04 09:37:25 +0100 | [diff] [blame] | 2711 | its own line. If "json" is passed as an optional argument then |
| 2712 | information provided by "typed" output is provided in JSON format as a |
| 2713 | list of JSON objects. By default, the format contains only two columns |
| 2714 | delimited by a colon (':'). The left one is the field name and the right |
| 2715 | one is the value. It is very important to note that in typed output |
| 2716 | format, the dump for a single object is contiguous so that there is no |
Willy Tarreau | 2745620 | 2021-05-08 07:54:24 +0200 | [diff] [blame] | 2717 | need for a consumer to store everything at once. If "float" is passed as an |
| 2718 | optional argument, some fields usually emitted as integers may switch to |
| 2719 | floats for higher accuracy. It is purposely unspecified which ones are |
| 2720 | concerned as this might evolve over time. Using this option implies that the |
| 2721 | consumer is able to process floats. The output format used is sprintf("%f"). |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 2722 | |
| 2723 | When using the typed output format, each line is made of 4 columns delimited |
| 2724 | by colons (':'). The first column is a dot-delimited series of 3 elements. The |
| 2725 | first element is the numeric position of the field in the list (starting at |
| 2726 | zero). This position shall not change over time, but holes are to be expected, |
| 2727 | depending on build options or if some fields are deleted in the future. The |
| 2728 | second element is the field name as it appears in the default "show info" |
| 2729 | output. The third element is the relative process number starting at 1. |
| 2730 | |
| 2731 | The rest of the line starting after the first colon follows the "typed output |
| 2732 | format" described in the section above. In short, the second column (after the |
| 2733 | first ':') indicates the origin, nature and scope of the variable. The third |
| 2734 | column indicates the type of the field, among "s32", "s64", "u32", "u64" and |
| 2735 | "str". Then the fourth column is the value itself, which the consumer knows |
| 2736 | how to parse thanks to column 3 and how to process thanks to column 2. |
| 2737 | |
| 2738 | Thus the overall line format in typed mode is : |
| 2739 | |
| 2740 | <field_pos>.<field_name>.<process_num>:<tags>:<type>:<value> |
| 2741 | |
Willy Tarreau | 6b19b14 | 2019-10-09 15:44:21 +0200 | [diff] [blame] | 2742 | When "desc" is appended to the command, one extra colon followed by a quoted |
| 2743 | string is appended with a description for the metric. At the time of writing, |
| 2744 | this is only supported for the "typed" and default output formats. |
| 2745 | |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 2746 | Example : |
| 2747 | |
| 2748 | > show info |
| 2749 | Name: HAProxy |
| 2750 | Version: 1.7-dev1-de52ea-146 |
| 2751 | Release_date: 2016/03/11 |
| 2752 | Nbproc: 1 |
| 2753 | Process_num: 1 |
| 2754 | Pid: 28105 |
| 2755 | Uptime: 0d 0h00m04s |
| 2756 | Uptime_sec: 4 |
| 2757 | Memmax_MB: 0 |
| 2758 | PoolAlloc_MB: 0 |
| 2759 | PoolUsed_MB: 0 |
| 2760 | PoolFailed: 0 |
| 2761 | (...) |
| 2762 | |
| 2763 | > show info typed |
| 2764 | 0.Name.1:POS:str:HAProxy |
| 2765 | 1.Version.1:POS:str:1.7-dev1-de52ea-146 |
| 2766 | 2.Release_date.1:POS:str:2016/03/11 |
| 2767 | 3.Nbproc.1:CGS:u32:1 |
| 2768 | 4.Process_num.1:KGP:u32:1 |
| 2769 | 5.Pid.1:SGP:u32:28105 |
| 2770 | 6.Uptime.1:MDP:str:0d 0h00m08s |
| 2771 | 7.Uptime_sec.1:MDP:u32:8 |
| 2772 | 8.Memmax_MB.1:CLP:u32:0 |
| 2773 | 9.PoolAlloc_MB.1:MGP:u32:0 |
| 2774 | 10.PoolUsed_MB.1:MGP:u32:0 |
| 2775 | 11.PoolFailed.1:MCP:u32:0 |
| 2776 | (...) |
| 2777 | |
Simon Horman | 1084a36 | 2016-11-21 17:00:24 +0100 | [diff] [blame] | 2778 | In the typed format, the presence of the process ID at the end of the |
| 2779 | first column makes it very easy to visually aggregate outputs from |
| 2780 | multiple processes. |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 2781 | Example : |
| 2782 | |
| 2783 | $ ( echo show info typed | socat /var/run/haproxy.sock1 ; \ |
| 2784 | echo show info typed | socat /var/run/haproxy.sock2 ) | \ |
| 2785 | sort -t . -k 1,1n -k 2,2 -k 3,3n |
| 2786 | 0.Name.1:POS:str:HAProxy |
| 2787 | 0.Name.2:POS:str:HAProxy |
| 2788 | 1.Version.1:POS:str:1.7-dev1-868ab3-148 |
| 2789 | 1.Version.2:POS:str:1.7-dev1-868ab3-148 |
| 2790 | 2.Release_date.1:POS:str:2016/03/11 |
| 2791 | 2.Release_date.2:POS:str:2016/03/11 |
| 2792 | 3.Nbproc.1:CGS:u32:2 |
| 2793 | 3.Nbproc.2:CGS:u32:2 |
| 2794 | 4.Process_num.1:KGP:u32:1 |
| 2795 | 4.Process_num.2:KGP:u32:2 |
| 2796 | 5.Pid.1:SGP:u32:30120 |
| 2797 | 5.Pid.2:SGP:u32:30121 |
| 2798 | 6.Uptime.1:MDP:str:0d 0h01m28s |
| 2799 | 6.Uptime.2:MDP:str:0d 0h01m28s |
| 2800 | (...) |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2801 | |
Simon Horman | 05ee213 | 2017-01-04 09:37:25 +0100 | [diff] [blame] | 2802 | The format of JSON output is described in a schema which may be output |
Simon Horman | 6f6bb38 | 2017-01-04 09:37:26 +0100 | [diff] [blame] | 2803 | using "show schema json". |
Simon Horman | 05ee213 | 2017-01-04 09:37:25 +0100 | [diff] [blame] | 2804 | |
| 2805 | The JSON output contains no extra whitespace in order to reduce the |
| 2806 | volume of output. For human consumption passing the output through a |
| 2807 | pretty printer may be helpful. Example : |
| 2808 | |
| 2809 | $ echo "show info json" | socat /var/run/haproxy.sock stdio | \ |
| 2810 | python -m json.tool |
| 2811 | |
Simon Horman | 6f6bb38 | 2017-01-04 09:37:26 +0100 | [diff] [blame] | 2812 | The JSON output contains no extra whitespace in order to reduce the |
| 2813 | volume of output. For human consumption passing the output through a |
| 2814 | pretty printer may be helpful. Example : |
| 2815 | |
| 2816 | $ echo "show info json" | socat /var/run/haproxy.sock stdio | \ |
| 2817 | python -m json.tool |
| 2818 | |
Willy Tarreau | 6ab7b21 | 2021-12-28 09:57:10 +0100 | [diff] [blame] | 2819 | show libs |
| 2820 | Dump the list of loaded shared dynamic libraries and object files, on systems |
| 2821 | that support it. When available, for each shared object the range of virtual |
| 2822 | addresses will be indicated, the size and the path to the object. This can be |
| 2823 | used for example to try to estimate what library provides a function that |
| 2824 | appears in a dump. Note that on many systems, addresses will change upon each |
| 2825 | restart (address space randomization), so that this list would need to be |
| 2826 | retrieved upon startup if it is expected to be used to analyse a core file. |
| 2827 | This command may only be issued on sockets configured for levels "operator" |
| 2828 | or "admin". Note that the output format may vary between operating systems, |
| 2829 | architectures and even haproxy versions, and ought not to be relied on in |
| 2830 | scripts. |
| 2831 | |
Willy Tarreau | 95f753e | 2021-04-30 12:09:54 +0200 | [diff] [blame] | 2832 | show map [[@<ver>] <map>] |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2833 | Dump info about map converters. Without argument, the list of all available |
| 2834 | maps is returned. If a <map> is specified, its contents are dumped. <map> is |
Willy Tarreau | 95f753e | 2021-04-30 12:09:54 +0200 | [diff] [blame] | 2835 | the #<id> or <file>. By default the current version of the map is shown (the |
| 2836 | version currently being matched against and reported as 'curr_ver' in the map |
| 2837 | list). It is possible to instead dump other versions by prepending '@<ver>' |
| 2838 | before the map's identifier. The version works as a filter and non-existing |
Dragan Dosen | a75eea7 | 2021-05-21 16:59:15 +0200 | [diff] [blame] | 2839 | versions will simply report no result. The 'entry_cnt' value represents the |
| 2840 | count of all the map entries, not just the active ones, which means that it |
| 2841 | also includes entries currently being added. |
Willy Tarreau | 95f753e | 2021-04-30 12:09:54 +0200 | [diff] [blame] | 2842 | |
| 2843 | In the output, the first column is a unique entry identifier, which is usable |
| 2844 | as a reference for operations "del map" and "set map". The second column is |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2845 | the pattern and the third column is the sample if available. The data returned |
| 2846 | are not directly a list of available maps, but are the list of all patterns |
| 2847 | composing any map. Many of these patterns can be shared with ACL. |
| 2848 | |
Willy Tarreau | 49962b5 | 2021-02-12 16:56:22 +0100 | [diff] [blame] | 2849 | show peers [dict|-] [<peers section>] |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2850 | Dump info about the peers configured in "peers" sections. Without argument, |
| 2851 | the list of the peers belonging to all the "peers" sections are listed. If |
| 2852 | <peers section> is specified, only the information about the peers belonging |
Willy Tarreau | 49962b5 | 2021-02-12 16:56:22 +0100 | [diff] [blame] | 2853 | to this "peers" section are dumped. When "dict" is specified before the peers |
| 2854 | section name, the entire Tx/Rx dictionary caches will also be dumped (very |
| 2855 | large). Passing "-" may be required to dump a peers section called "dict". |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2856 | |
Michael Prokop | 4438c60 | 2019-05-24 10:25:45 +0200 | [diff] [blame] | 2857 | Here are two examples of outputs where hostA, hostB and hostC peers belong to |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2858 | "sharedlb" peers sections. Only hostA and hostB are connected. Only hostA has |
| 2859 | sent data to hostB. |
| 2860 | |
| 2861 | $ echo "show peers" | socat - /tmp/hostA |
| 2862 | 0x55deb0224320: [15/Apr/2019:11:28:01] id=sharedlb state=0 flags=0x3 \ |
Emeric Brun | 0bbec0f | 2019-04-18 11:39:43 +0200 | [diff] [blame] | 2863 | resync_timeout=<PAST> task_calls=45122 |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2864 | 0x55deb022b540: id=hostC(remote) addr=127.0.0.12:10002 status=CONN \ |
| 2865 | reconnect=4s confirm=0 |
| 2866 | flags=0x0 |
| 2867 | 0x55deb022a440: id=hostA(local) addr=127.0.0.10:10000 status=NONE \ |
| 2868 | reconnect=<NEVER> confirm=0 |
| 2869 | flags=0x0 |
| 2870 | 0x55deb0227d70: id=hostB(remote) addr=127.0.0.11:10001 status=ESTA |
| 2871 | reconnect=2s confirm=0 |
Emeric Brun | 0bbec0f | 2019-04-18 11:39:43 +0200 | [diff] [blame] | 2872 | flags=0x20000200 appctx:0x55deb028fba0 st0=7 st1=0 task_calls=14456 \ |
| 2873 | state=EST |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2874 | xprt=RAW src=127.0.0.1:37257 addr=127.0.0.10:10000 |
| 2875 | remote_table:0x55deb0224a10 id=stkt local_id=1 remote_id=1 |
| 2876 | last_local_table:0x55deb0224a10 id=stkt local_id=1 remote_id=1 |
| 2877 | shared tables: |
| 2878 | 0x55deb0224a10 local_id=1 remote_id=1 flags=0x0 remote_data=0x65 |
| 2879 | last_acked=0 last_pushed=3 last_get=0 teaching_origin=0 update=3 |
| 2880 | table:0x55deb022d6a0 id=stkt update=3 localupdate=3 \ |
| 2881 | commitupdate=3 syncing=0 |
| 2882 | |
| 2883 | $ echo "show peers" | socat - /tmp/hostB |
| 2884 | 0x55871b5ab320: [15/Apr/2019:11:28:03] id=sharedlb state=0 flags=0x3 \ |
Emeric Brun | 0bbec0f | 2019-04-18 11:39:43 +0200 | [diff] [blame] | 2885 | resync_timeout=<PAST> task_calls=3 |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2886 | 0x55871b5b2540: id=hostC(remote) addr=127.0.0.12:10002 status=CONN \ |
| 2887 | reconnect=3s confirm=0 |
| 2888 | flags=0x0 |
| 2889 | 0x55871b5b1440: id=hostB(local) addr=127.0.0.11:10001 status=NONE \ |
| 2890 | reconnect=<NEVER> confirm=0 |
| 2891 | flags=0x0 |
| 2892 | 0x55871b5aed70: id=hostA(remote) addr=127.0.0.10:10000 status=ESTA \ |
| 2893 | reconnect=2s confirm=0 |
Emeric Brun | 0bbec0f | 2019-04-18 11:39:43 +0200 | [diff] [blame] | 2894 | flags=0x20000200 appctx:0x7fa46800ee00 st0=7 st1=0 task_calls=62356 \ |
| 2895 | state=EST |
Frédéric Lécaille | 21dde50 | 2019-04-15 13:50:23 +0200 | [diff] [blame] | 2896 | remote_table:0x55871b5ab960 id=stkt local_id=1 remote_id=1 |
| 2897 | last_local_table:0x55871b5ab960 id=stkt local_id=1 remote_id=1 |
| 2898 | shared tables: |
| 2899 | 0x55871b5ab960 local_id=1 remote_id=1 flags=0x0 remote_data=0x65 |
| 2900 | last_acked=3 last_pushed=0 last_get=3 teaching_origin=0 update=0 |
| 2901 | table:0x55871b5b46a0 id=stkt update=1 localupdate=0 \ |
| 2902 | commitupdate=0 syncing=0 |
| 2903 | |
Willy Tarreau | 7583c36 | 2022-11-21 10:02:29 +0100 | [diff] [blame] | 2904 | show pools [byname|bysize|byusage] [match <pfx>] [<nb>] |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2905 | Dump the status of internal memory pools. This is useful to track memory |
| 2906 | usage when suspecting a memory leak for example. It does exactly the same |
Willy Tarreau | 2fba08f | 2022-11-21 09:34:02 +0100 | [diff] [blame] | 2907 | as the SIGQUIT when running in foreground except that it does not flush the |
| 2908 | pools. The output is not sorted by default. If "byname" is specified, it is |
| 2909 | sorted by pool name; if "bysize" is specified, it is sorted by item size in |
| 2910 | reverse order; if "byusage" is specified, it is sorted by total usage in |
| 2911 | reverse order, and only used entries are shown. It is also possible to limit |
Willy Tarreau | 7583c36 | 2022-11-21 10:02:29 +0100 | [diff] [blame] | 2912 | the output to the <nb> first entries (e.g. when sorting by usage). Finally, |
| 2913 | if "match" followed by a prefix is specified, then only pools whose name |
| 2914 | starts with this prefix will be shown. The reported total only concerns pools |
| 2915 | matching the filtering criteria. Example: |
| 2916 | |
| 2917 | $ socat - /tmp/haproxy.sock <<< "show pools match quic byusage" |
| 2918 | Dumping pools usage. Use SIGQUIT to flush them. |
| 2919 | - Pool quic_conn_r (65560 bytes) : 1337 allocated (87653720 bytes), ... |
| 2920 | - Pool quic_crypto (1048 bytes) : 6685 allocated (7005880 bytes), ... |
| 2921 | - Pool quic_conn (4056 bytes) : 1337 allocated (5422872 bytes), ... |
| 2922 | - Pool quic_rxbuf (262168 bytes) : 8 allocated (2097344 bytes), ... |
| 2923 | - Pool quic_connne (184 bytes) : 9359 allocated (1722056 bytes), ... |
| 2924 | - Pool quic_frame (184 bytes) : 7938 allocated (1460592 bytes), ... |
| 2925 | - Pool quic_tx_pac (152 bytes) : 6454 allocated (981008 bytes), ... |
| 2926 | - Pool quic_tls_ke (56 bytes) : 12033 allocated (673848 bytes), ... |
| 2927 | - Pool quic_rx_pac (408 bytes) : 1596 allocated (651168 bytes), ... |
| 2928 | - Pool quic_tls_se (88 bytes) : 6685 allocated (588280 bytes), ... |
| 2929 | - Pool quic_cstrea (88 bytes) : 4011 allocated (352968 bytes), ... |
| 2930 | - Pool quic_tls_iv (24 bytes) : 12033 allocated (288792 bytes), ... |
| 2931 | - Pool quic_dgram (344 bytes) : 732 allocated (251808 bytes), ... |
| 2932 | - Pool quic_arng (56 bytes) : 4011 allocated (224616 bytes), ... |
| 2933 | - Pool quic_conn_c (152 bytes) : 1337 allocated (203224 bytes), ... |
| 2934 | Total: 15 pools, 109578176 bytes allocated, 109578176 used ... |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2935 | |
Willy Tarreau | e86bc35 | 2022-09-08 16:38:10 +0200 | [diff] [blame] | 2936 | show profiling [{all | status | tasks | memory}] [byaddr|bytime|aggr|<max_lines>]* |
Willy Tarreau | 75c62c2 | 2018-11-22 11:02:09 +0100 | [diff] [blame] | 2937 | Dumps the current profiling settings, one per line, as well as the command |
Willy Tarreau | 1bd67e9 | 2021-01-29 00:07:40 +0100 | [diff] [blame] | 2938 | needed to change them. When tasks profiling is enabled, some per-function |
| 2939 | statistics collected by the scheduler will also be emitted, with a summary |
Willy Tarreau | 42712cb | 2021-05-05 17:48:13 +0200 | [diff] [blame] | 2940 | covering the number of calls, total/avg CPU time and total/avg latency. When |
| 2941 | memory profiling is enabled, some information such as the number of |
| 2942 | allocations/releases and their sizes will be reported. It is possible to |
| 2943 | limit the dump to only the profiling status, the tasks, or the memory |
| 2944 | profiling by specifying the respective keywords; by default all profiling |
| 2945 | information are dumped. It is also possible to limit the number of lines |
Willy Tarreau | f1c8a38 | 2021-05-13 10:00:17 +0200 | [diff] [blame] | 2946 | of output of each category by specifying a numeric limit. If is possible to |
Willy Tarreau | e86bc35 | 2022-09-08 16:38:10 +0200 | [diff] [blame] | 2947 | request that the output is sorted by address or by total execution time |
| 2948 | instead of usage, e.g. to ease comparisons between subsequent calls or to |
| 2949 | check what needs to be optimized, and to aggregate task activity by called |
| 2950 | function instead of seeing the details. Please note that profiling is |
Willy Tarreau | f1c8a38 | 2021-05-13 10:00:17 +0200 | [diff] [blame] | 2951 | essentially aimed at developers since it gives hints about where CPU cycles |
| 2952 | or memory are wasted in the code. There is nothing useful to monitor there. |
Willy Tarreau | 75c62c2 | 2018-11-22 11:02:09 +0100 | [diff] [blame] | 2953 | |
Willy Tarreau | 87ef323 | 2021-01-29 12:01:46 +0100 | [diff] [blame] | 2954 | show resolvers [<resolvers section id>] |
| 2955 | Dump statistics for the given resolvers section, or all resolvers sections |
| 2956 | if no section is supplied. |
| 2957 | |
| 2958 | For each name server, the following counters are reported: |
| 2959 | sent: number of DNS requests sent to this server |
| 2960 | valid: number of DNS valid responses received from this server |
| 2961 | update: number of DNS responses used to update the server's IP address |
| 2962 | cname: number of CNAME responses |
| 2963 | cname_error: CNAME errors encountered with this server |
| 2964 | any_err: number of empty response (IE: server does not support ANY type) |
| 2965 | nx: non existent domain response received from this server |
| 2966 | timeout: how many time this server did not answer in time |
| 2967 | refused: number of requests refused by this server |
| 2968 | other: any other DNS errors |
| 2969 | invalid: invalid DNS response (from a protocol point of view) |
| 2970 | too_big: too big response |
| 2971 | outdated: number of response arrived too late (after an other name server) |
| 2972 | |
Amaury Denoyelle | 3f9758e | 2023-02-01 17:31:02 +0100 | [diff] [blame] | 2973 | show quic [all] |
Amaury Denoyelle | 15c7470 | 2023-02-01 10:18:26 +0100 | [diff] [blame] | 2974 | Dump information on all active QUIC frontend connections. This command is |
| 2975 | restricted and can only be issued on sockets configured for levels "operator" |
Amaury Denoyelle | 3f9758e | 2023-02-01 17:31:02 +0100 | [diff] [blame] | 2976 | or "admin". By default, connections on closing or draining state are not |
| 2977 | displayed. Use the extra argument "all" to include them in the output. |
Amaury Denoyelle | 15c7470 | 2023-02-01 10:18:26 +0100 | [diff] [blame] | 2978 | |
Willy Tarreau | 69f591e | 2020-07-01 07:00:59 +0200 | [diff] [blame] | 2979 | show servers conn [<backend>] |
| 2980 | Dump the current and idle connections state of the servers belonging to the |
| 2981 | designated backend (or all backends if none specified). A backend name or |
| 2982 | identifier may be used. |
| 2983 | |
| 2984 | The output consists in a header line showing the fields titles, then one |
| 2985 | server per line with for each, the backend name and ID, server name and ID, |
| 2986 | the address, port and a series or values. The number of fields varies |
| 2987 | depending on thread count. |
| 2988 | |
| 2989 | Given the threaded nature of idle connections, it's important to understand |
| 2990 | that some values may change once read, and that as such, consistency within a |
| 2991 | line isn't granted. This output is mostly provided as a debugging tool and is |
| 2992 | not relevant to be routinely monitored nor graphed. |
| 2993 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 2994 | show servers state [<backend>] |
| 2995 | Dump the state of the servers found in the running configuration. A backend |
| 2996 | name or identifier may be provided to limit the output to this backend only. |
| 2997 | |
| 2998 | The dump has the following format: |
| 2999 | - first line contains the format version (1 in this specification); |
| 3000 | - second line contains the column headers, prefixed by a sharp ('#'); |
| 3001 | - third line and next ones contain data; |
| 3002 | - each line starting by a sharp ('#') is considered as a comment. |
| 3003 | |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 3004 | Since multiple versions of the output may co-exist, below is the list of |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3005 | fields and their order per file format version : |
| 3006 | 1: |
| 3007 | be_id: Backend unique id. |
| 3008 | be_name: Backend label. |
| 3009 | srv_id: Server unique id (in the backend). |
| 3010 | srv_name: Server label. |
| 3011 | srv_addr: Server IP address. |
| 3012 | srv_op_state: Server operational state (UP/DOWN/...). |
Cyril Bonté | 5b2ce8a | 2016-11-02 00:19:58 +0100 | [diff] [blame] | 3013 | 0 = SRV_ST_STOPPED |
| 3014 | The server is down. |
| 3015 | 1 = SRV_ST_STARTING |
| 3016 | The server is warming up (up but |
| 3017 | throttled). |
| 3018 | 2 = SRV_ST_RUNNING |
| 3019 | The server is fully up. |
| 3020 | 3 = SRV_ST_STOPPING |
| 3021 | The server is up but soft-stopping |
| 3022 | (eg: 404). |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3023 | srv_admin_state: Server administrative state (MAINT/DRAIN/...). |
Cyril Bonté | 5b2ce8a | 2016-11-02 00:19:58 +0100 | [diff] [blame] | 3024 | The state is actually a mask of values : |
| 3025 | 0x01 = SRV_ADMF_FMAINT |
| 3026 | The server was explicitly forced into |
| 3027 | maintenance. |
| 3028 | 0x02 = SRV_ADMF_IMAINT |
| 3029 | The server has inherited the maintenance |
| 3030 | status from a tracked server. |
| 3031 | 0x04 = SRV_ADMF_CMAINT |
| 3032 | The server is in maintenance because of |
| 3033 | the configuration. |
| 3034 | 0x08 = SRV_ADMF_FDRAIN |
| 3035 | The server was explicitly forced into |
| 3036 | drain state. |
| 3037 | 0x10 = SRV_ADMF_IDRAIN |
| 3038 | The server has inherited the drain status |
| 3039 | from a tracked server. |
Baptiste Assmann | 89aa7f3 | 2016-11-02 21:31:27 +0100 | [diff] [blame] | 3040 | 0x20 = SRV_ADMF_RMAINT |
| 3041 | The server is in maintenance because of an |
| 3042 | IP address resolution failure. |
Frédéric Lécaille | b418c12 | 2017-04-26 11:24:02 +0200 | [diff] [blame] | 3043 | 0x40 = SRV_ADMF_HMAINT |
| 3044 | The server FQDN was set from stats socket. |
| 3045 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3046 | srv_uweight: User visible server's weight. |
| 3047 | srv_iweight: Server's initial weight. |
| 3048 | srv_time_since_last_change: Time since last operational change. |
| 3049 | srv_check_status: Last health check status. |
| 3050 | srv_check_result: Last check result (FAILED/PASSED/...). |
Cyril Bonté | 5b2ce8a | 2016-11-02 00:19:58 +0100 | [diff] [blame] | 3051 | 0 = CHK_RES_UNKNOWN |
| 3052 | Initialized to this by default. |
| 3053 | 1 = CHK_RES_NEUTRAL |
| 3054 | Valid check but no status information. |
| 3055 | 2 = CHK_RES_FAILED |
| 3056 | Check failed. |
| 3057 | 3 = CHK_RES_PASSED |
| 3058 | Check succeeded and server is fully up |
| 3059 | again. |
| 3060 | 4 = CHK_RES_CONDPASS |
| 3061 | Check reports the server doesn't want new |
| 3062 | sessions. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3063 | srv_check_health: Checks rise / fall current counter. |
| 3064 | srv_check_state: State of the check (ENABLED/PAUSED/...). |
Cyril Bonté | 5b2ce8a | 2016-11-02 00:19:58 +0100 | [diff] [blame] | 3065 | The state is actually a mask of values : |
| 3066 | 0x01 = CHK_ST_INPROGRESS |
| 3067 | A check is currently running. |
| 3068 | 0x02 = CHK_ST_CONFIGURED |
| 3069 | This check is configured and may be |
| 3070 | enabled. |
| 3071 | 0x04 = CHK_ST_ENABLED |
| 3072 | This check is currently administratively |
| 3073 | enabled. |
| 3074 | 0x08 = CHK_ST_PAUSED |
| 3075 | Checks are paused because of maintenance |
| 3076 | (health only). |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3077 | srv_agent_state: State of the agent check (ENABLED/PAUSED/...). |
Cyril Bonté | 5b2ce8a | 2016-11-02 00:19:58 +0100 | [diff] [blame] | 3078 | This state uses the same mask values as |
| 3079 | "srv_check_state", adding this specific one : |
| 3080 | 0x10 = CHK_ST_AGENT |
| 3081 | Check is an agent check (otherwise it's a |
| 3082 | health check). |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3083 | bk_f_forced_id: Flag to know if the backend ID is forced by |
| 3084 | configuration. |
| 3085 | srv_f_forced_id: Flag to know if the server's ID is forced by |
| 3086 | configuration. |
Frédéric Lécaille | b418c12 | 2017-04-26 11:24:02 +0200 | [diff] [blame] | 3087 | srv_fqdn: Server FQDN. |
Frédéric Lécaille | 3169471 | 2017-08-01 08:47:19 +0200 | [diff] [blame] | 3088 | srv_port: Server port. |
Baptiste Assmann | 6d0f38f | 2018-07-02 17:00:54 +0200 | [diff] [blame] | 3089 | srvrecord: DNS SRV record associated to this SRV. |
William Dauchy | f637044 | 2020-11-14 19:25:33 +0100 | [diff] [blame] | 3090 | srv_use_ssl: use ssl for server connections. |
William Dauchy | d1a7b85 | 2021-02-11 22:51:26 +0100 | [diff] [blame] | 3091 | srv_check_port: Server health check port. |
| 3092 | srv_check_addr: Server health check address. |
| 3093 | srv_agent_addr: Server health agent address. |
| 3094 | srv_agent_port: Server health agent port. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3095 | |
| 3096 | show sess |
| 3097 | Dump all known sessions. Avoid doing this on slow connections as this can |
| 3098 | be huge. This command is restricted and can only be issued on sockets |
Willy Tarreau | c6e7a1b | 2020-06-28 01:24:12 +0200 | [diff] [blame] | 3099 | configured for levels "operator" or "admin". Note that on machines with |
| 3100 | quickly recycled connections, it is possible that this output reports less |
| 3101 | entries than really exist because it will dump all existing sessions up to |
| 3102 | the last one that was created before the command was entered; those which |
| 3103 | die in the mean time will not appear. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3104 | |
| 3105 | show sess <id> |
| 3106 | Display a lot of internal information about the specified session identifier. |
| 3107 | This identifier is the first field at the beginning of the lines in the dumps |
| 3108 | of "show sess" (it corresponds to the session pointer). Those information are |
| 3109 | useless to most users but may be used by haproxy developers to troubleshoot a |
| 3110 | complex bug. The output format is intentionally not documented so that it can |
| 3111 | freely evolve depending on demands. You may find a description of all fields |
| 3112 | returned in src/dumpstats.c |
| 3113 | |
| 3114 | The special id "all" dumps the states of all sessions, which must be avoided |
| 3115 | as much as possible as it is highly CPU intensive and can take a lot of time. |
| 3116 | |
Daniel Corbett | c40edac | 2020-11-01 10:54:17 -0500 | [diff] [blame] | 3117 | show stat [domain <dns|proxy>] [{<iid>|<proxy>} <type> <sid>] [typed|json] \ |
Willy Tarreau | 698097b | 2020-10-23 20:19:47 +0200 | [diff] [blame] | 3118 | [desc] [up|no-maint] |
Daniel Corbett | c40edac | 2020-11-01 10:54:17 -0500 | [diff] [blame] | 3119 | Dump statistics. The domain is used to select which statistics to print; dns |
| 3120 | and proxy are available for now. By default, the CSV format is used; you can |
Amaury Denoyelle | 072f97e | 2020-10-05 11:49:37 +0200 | [diff] [blame] | 3121 | activate the extended typed output format described in the section above if |
| 3122 | "typed" is passed after the other arguments; or in JSON if "json" is passed |
| 3123 | after the other arguments. By passing <id>, <type> and <sid>, it is possible |
| 3124 | to dump only selected items : |
Willy Tarreau | a1b1ed5 | 2016-11-25 08:50:58 +0100 | [diff] [blame] | 3125 | - <iid> is a proxy ID, -1 to dump everything. Alternatively, a proxy name |
| 3126 | <proxy> may be specified. In this case, this proxy's ID will be used as |
| 3127 | the ID selector. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3128 | - <type> selects the type of dumpable objects : 1 for frontends, 2 for |
| 3129 | backends, 4 for servers, -1 for everything. These values can be ORed, |
| 3130 | for example: |
| 3131 | 1 + 2 = 3 -> frontend + backend. |
| 3132 | 1 + 2 + 4 = 7 -> frontend + backend + server. |
| 3133 | - <sid> is a server ID, -1 to dump everything from the selected proxy. |
| 3134 | |
| 3135 | Example : |
| 3136 | $ echo "show info;show stat" | socat stdio unix-connect:/tmp/sock1 |
| 3137 | >>> Name: HAProxy |
| 3138 | Version: 1.4-dev2-49 |
| 3139 | Release_date: 2009/09/23 |
| 3140 | Nbproc: 1 |
| 3141 | Process_num: 1 |
| 3142 | (...) |
| 3143 | |
| 3144 | # pxname,svname,qcur,qmax,scur,smax,slim,stot,bin,bout,dreq, (...) |
| 3145 | stats,FRONTEND,,,0,0,1000,0,0,0,0,0,0,,,,,OPEN,,,,,,,,,1,1,0, (...) |
| 3146 | stats,BACKEND,0,0,0,0,1000,0,0,0,0,0,,0,0,0,0,UP,0,0,0,,0,250,(...) |
| 3147 | (...) |
| 3148 | www1,BACKEND,0,0,0,0,1000,0,0,0,0,0,,0,0,0,0,UP,1,1,0,,0,250, (...) |
| 3149 | |
| 3150 | $ |
| 3151 | |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 3152 | In this example, two commands have been issued at once. That way it's easy to |
| 3153 | find which process the stats apply to in multi-process mode. This is not |
| 3154 | needed in the typed output format as the process number is reported on each |
| 3155 | line. Notice the empty line after the information output which marks the end |
| 3156 | of the first block. A similar empty line appears at the end of the second |
| 3157 | block (stats) so that the reader knows the output has not been truncated. |
| 3158 | |
| 3159 | When "typed" is specified, the output format is more suitable to monitoring |
| 3160 | tools because it provides numeric positions and indicates the type of each |
| 3161 | output field. Each value stands on its own line with process number, element |
| 3162 | number, nature, origin and scope. This same format is available via the HTTP |
| 3163 | stats by passing ";typed" after the URI. It is very important to note that in |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 3164 | typed output format, the dump for a single object is contiguous so that there |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 3165 | is no need for a consumer to store everything at once. |
| 3166 | |
Willy Tarreau | 698097b | 2020-10-23 20:19:47 +0200 | [diff] [blame] | 3167 | The "up" modifier will result in listing only servers which reportedly up or |
| 3168 | not checked. Those down, unresolved, or in maintenance will not be listed. |
| 3169 | This is analogous to the ";up" option on the HTTP stats. Similarly, the |
| 3170 | "no-maint" modifier will act like the ";no-maint" HTTP modifier and will |
| 3171 | result in disabled servers not to be listed. The difference is that those |
| 3172 | which are enabled but down will not be evicted. |
| 3173 | |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 3174 | When using the typed output format, each line is made of 4 columns delimited |
| 3175 | by colons (':'). The first column is a dot-delimited series of 5 elements. The |
| 3176 | first element is a letter indicating the type of the object being described. |
| 3177 | At the moment the following object types are known : 'F' for a frontend, 'B' |
| 3178 | for a backend, 'L' for a listener, and 'S' for a server. The second element |
| 3179 | The second element is a positive integer representing the unique identifier of |
| 3180 | the proxy the object belongs to. It is equivalent to the "iid" column of the |
| 3181 | CSV output and matches the value in front of the optional "id" directive found |
| 3182 | in the frontend or backend section. The third element is a positive integer |
| 3183 | containing the unique object identifier inside the proxy, and corresponds to |
| 3184 | the "sid" column of the CSV output. ID 0 is reported when dumping a frontend |
| 3185 | or a backend. For a listener or a server, this corresponds to their respective |
| 3186 | ID inside the proxy. The fourth element is the numeric position of the field |
| 3187 | in the list (starting at zero). This position shall not change over time, but |
| 3188 | holes are to be expected, depending on build options or if some fields are |
| 3189 | deleted in the future. The fifth element is the field name as it appears in |
| 3190 | the CSV output. The sixth element is a positive integer and is the relative |
| 3191 | process number starting at 1. |
| 3192 | |
| 3193 | The rest of the line starting after the first colon follows the "typed output |
| 3194 | format" described in the section above. In short, the second column (after the |
| 3195 | first ':') indicates the origin, nature and scope of the variable. The third |
Willy Tarreau | 589722e | 2021-05-08 07:46:44 +0200 | [diff] [blame] | 3196 | column indicates the field type, among "s32", "s64", "u32", "u64", "flt' and |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 3197 | "str". Then the fourth column is the value itself, which the consumer knows |
| 3198 | how to parse thanks to column 3 and how to process thanks to column 2. |
| 3199 | |
Willy Tarreau | 6b19b14 | 2019-10-09 15:44:21 +0200 | [diff] [blame] | 3200 | When "desc" is appended to the command, one extra colon followed by a quoted |
| 3201 | string is appended with a description for the metric. At the time of writing, |
| 3202 | this is only supported for the "typed" output format. |
| 3203 | |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 3204 | Thus the overall line format in typed mode is : |
| 3205 | |
| 3206 | <obj>.<px_id>.<id>.<fpos>.<fname>.<process_num>:<tags>:<type>:<value> |
| 3207 | |
| 3208 | Here's an example of typed output format : |
| 3209 | |
| 3210 | $ echo "show stat typed" | socat stdio unix-connect:/tmp/sock1 |
| 3211 | F.2.0.0.pxname.1:MGP:str:private-frontend |
| 3212 | F.2.0.1.svname.1:MGP:str:FRONTEND |
| 3213 | F.2.0.8.bin.1:MGP:u64:0 |
| 3214 | F.2.0.9.bout.1:MGP:u64:0 |
| 3215 | F.2.0.40.hrsp_2xx.1:MGP:u64:0 |
| 3216 | L.2.1.0.pxname.1:MGP:str:private-frontend |
| 3217 | L.2.1.1.svname.1:MGP:str:sock-1 |
| 3218 | L.2.1.17.status.1:MGP:str:OPEN |
| 3219 | L.2.1.73.addr.1:MGP:str:0.0.0.0:8001 |
| 3220 | S.3.13.60.rtime.1:MCP:u32:0 |
| 3221 | S.3.13.61.ttime.1:MCP:u32:0 |
| 3222 | S.3.13.62.agent_status.1:MGP:str:L4TOUT |
| 3223 | S.3.13.64.agent_duration.1:MGP:u64:2001 |
| 3224 | S.3.13.65.check_desc.1:MCP:str:Layer4 timeout |
| 3225 | S.3.13.66.agent_desc.1:MCP:str:Layer4 timeout |
| 3226 | S.3.13.67.check_rise.1:MCP:u32:2 |
| 3227 | S.3.13.68.check_fall.1:MCP:u32:3 |
| 3228 | S.3.13.69.check_health.1:SGP:u32:0 |
| 3229 | S.3.13.70.agent_rise.1:MaP:u32:1 |
| 3230 | S.3.13.71.agent_fall.1:SGP:u32:1 |
| 3231 | S.3.13.72.agent_health.1:SGP:u32:1 |
| 3232 | S.3.13.73.addr.1:MCP:str:1.255.255.255:8888 |
| 3233 | S.3.13.75.mode.1:MAP:str:http |
| 3234 | B.3.0.0.pxname.1:MGP:str:private-backend |
| 3235 | B.3.0.1.svname.1:MGP:str:BACKEND |
| 3236 | B.3.0.2.qcur.1:MGP:u32:0 |
| 3237 | B.3.0.3.qmax.1:MGP:u32:0 |
| 3238 | B.3.0.4.scur.1:MGP:u32:0 |
| 3239 | B.3.0.5.smax.1:MGP:u32:0 |
| 3240 | B.3.0.6.slim.1:MGP:u32:1000 |
| 3241 | B.3.0.55.lastsess.1:MMP:s32:-1 |
| 3242 | (...) |
| 3243 | |
Simon Horman | 1084a36 | 2016-11-21 17:00:24 +0100 | [diff] [blame] | 3244 | In the typed format, the presence of the process ID at the end of the |
| 3245 | first column makes it very easy to visually aggregate outputs from |
| 3246 | multiple processes, as show in the example below where each line appears |
| 3247 | for each process : |
Willy Tarreau | 5d8b979 | 2016-03-11 11:09:34 +0100 | [diff] [blame] | 3248 | |
| 3249 | $ ( echo show stat typed | socat /var/run/haproxy.sock1 - ; \ |
| 3250 | echo show stat typed | socat /var/run/haproxy.sock2 - ) | \ |
| 3251 | sort -t . -k 1,1 -k 2,2n -k 3,3n -k 4,4n -k 5,5 -k 6,6n |
| 3252 | B.3.0.0.pxname.1:MGP:str:private-backend |
| 3253 | B.3.0.0.pxname.2:MGP:str:private-backend |
| 3254 | B.3.0.1.svname.1:MGP:str:BACKEND |
| 3255 | B.3.0.1.svname.2:MGP:str:BACKEND |
| 3256 | B.3.0.2.qcur.1:MGP:u32:0 |
| 3257 | B.3.0.2.qcur.2:MGP:u32:0 |
| 3258 | B.3.0.3.qmax.1:MGP:u32:0 |
| 3259 | B.3.0.3.qmax.2:MGP:u32:0 |
| 3260 | B.3.0.4.scur.1:MGP:u32:0 |
| 3261 | B.3.0.4.scur.2:MGP:u32:0 |
| 3262 | B.3.0.5.smax.1:MGP:u32:0 |
| 3263 | B.3.0.5.smax.2:MGP:u32:0 |
| 3264 | B.3.0.6.slim.1:MGP:u32:1000 |
| 3265 | B.3.0.6.slim.2:MGP:u32:1000 |
| 3266 | (...) |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3267 | |
Simon Horman | 05ee213 | 2017-01-04 09:37:25 +0100 | [diff] [blame] | 3268 | The format of JSON output is described in a schema which may be output |
Simon Horman | 6f6bb38 | 2017-01-04 09:37:26 +0100 | [diff] [blame] | 3269 | using "show schema json". |
| 3270 | |
| 3271 | The JSON output contains no extra whitespace in order to reduce the |
| 3272 | volume of output. For human consumption passing the output through a |
| 3273 | pretty printer may be helpful. Example : |
| 3274 | |
| 3275 | $ echo "show stat json" | socat /var/run/haproxy.sock stdio | \ |
| 3276 | python -m json.tool |
Simon Horman | 05ee213 | 2017-01-04 09:37:25 +0100 | [diff] [blame] | 3277 | |
| 3278 | The JSON output contains no extra whitespace in order to reduce the |
| 3279 | volume of output. For human consumption passing the output through a |
| 3280 | pretty printer may be helpful. Example : |
| 3281 | |
| 3282 | $ echo "show stat json" | socat /var/run/haproxy.sock stdio | \ |
| 3283 | python -m json.tool |
| 3284 | |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 3285 | show ssl ca-file [<cafile>[:<index>]] |
William Lallemand | 0c39526 | 2023-01-10 14:44:27 +0100 | [diff] [blame] | 3286 | Display the list of CA files loaded into the process and their respective |
| 3287 | certificate counts. The certificates are not used by any frontend or backend |
| 3288 | until their status is "Used". |
William Lallemand | f29c415 | 2023-01-10 15:07:12 +0100 | [diff] [blame] | 3289 | A "@system-ca" entry can appear in the list, it is loaded by the httpclient |
| 3290 | by default. It contains the list of trusted CA of your system returned by |
| 3291 | OpenSSL. |
William Lallemand | 0c39526 | 2023-01-10 14:44:27 +0100 | [diff] [blame] | 3292 | If a filename is prefixed by an asterisk, it is a transaction which |
Remi Tricot-Le Breton | e88a2ca | 2021-04-08 15:30:23 +0200 | [diff] [blame] | 3293 | is not committed yet. If a <cafile> is specified without <index>, it will show |
| 3294 | the status of the CA file ("Used"/"Unused") followed by details about all the |
| 3295 | certificates contained in the CA file. The details displayed for every |
| 3296 | certificate are the same as the ones displayed by a "show ssl cert" command. |
| 3297 | If a <cafile> is specified followed by an <index>, it will only display the |
| 3298 | details of the certificate having the specified index. Indexes start from 1. |
| 3299 | If the index is invalid (too big for instance), nothing will be displayed. |
| 3300 | This command can be useful to check if a CA file was properly updated. |
| 3301 | You can also display the details of an ongoing transaction by prefixing the |
| 3302 | filename by an asterisk. |
| 3303 | |
| 3304 | Example : |
| 3305 | |
| 3306 | $ echo "show ssl ca-file" | socat /var/run/haproxy.master - |
| 3307 | # transaction |
| 3308 | *cafile.crt - 2 certificate(s) |
| 3309 | # filename |
| 3310 | cafile.crt - 1 certificate(s) |
| 3311 | |
| 3312 | $ echo "show ssl ca-file cafile.crt" | socat /var/run/haproxy.master - |
| 3313 | Filename: /home/tricot/work/haproxy/reg-tests/ssl/set_cafile_ca2.crt |
| 3314 | Status: Used |
| 3315 | |
| 3316 | Certificate #1: |
| 3317 | Serial: 11A4D2200DC84376E7D233CAFF39DF44BF8D1211 |
| 3318 | notBefore: Apr 1 07:40:53 2021 GMT |
| 3319 | notAfter: Aug 17 07:40:53 2048 GMT |
| 3320 | Subject Alternative Name: |
| 3321 | Algorithm: RSA4096 |
| 3322 | SHA1 FingerPrint: A111EF0FEFCDE11D47FE3F33ADCA8435EBEA4864 |
| 3323 | Subject: /C=FR/ST=Some-State/O=HAProxy Technologies/CN=HAProxy Technologies CA |
| 3324 | Issuer: /C=FR/ST=Some-State/O=HAProxy Technologies/CN=HAProxy Technologies CA |
| 3325 | |
| 3326 | $ echo "show ssl ca-file *cafile.crt:2" | socat /var/run/haproxy.master - |
| 3327 | Filename: */home/tricot/work/haproxy/reg-tests/ssl/set_cafile_ca2.crt |
| 3328 | Status: Unused |
| 3329 | |
| 3330 | Certificate #2: |
| 3331 | Serial: 587A1CE5ED855040A0C82BF255FF300ADB7C8136 |
| 3332 | [...] |
| 3333 | |
William Lallemand | d4f946c | 2019-12-05 10:26:40 +0100 | [diff] [blame] | 3334 | show ssl cert [<filename>] |
William Lallemand | 0c39526 | 2023-01-10 14:44:27 +0100 | [diff] [blame] | 3335 | Display the list of certificates loaded into the process. They are not used |
| 3336 | by any frontend or backend until their status is "Used". |
Remi Tricot-Le Breton | b5f0fac | 2021-04-14 16:19:29 +0200 | [diff] [blame] | 3337 | If a filename is prefixed by an asterisk, it is a transaction which is not |
| 3338 | committed yet. If a filename is specified, it will show details about the |
| 3339 | certificate. This command can be useful to check if a certificate was well |
| 3340 | updated. You can also display details on a transaction by prefixing the |
| 3341 | filename by an asterisk. |
Remi Tricot-Le Breton | 6056e61 | 2021-06-10 13:51:15 +0200 | [diff] [blame] | 3342 | This command can also be used to display the details of a certificate's OCSP |
| 3343 | response by suffixing the filename with a ".ocsp" extension. It works for |
| 3344 | committed certificates as well as for ongoing transactions. On a committed |
| 3345 | certificate, this command is equivalent to calling "show ssl ocsp-response" |
| 3346 | with the certificate's corresponding OCSP response ID. |
William Lallemand | d4f946c | 2019-12-05 10:26:40 +0100 | [diff] [blame] | 3347 | |
| 3348 | Example : |
| 3349 | |
| 3350 | $ echo "@1 show ssl cert" | socat /var/run/haproxy.master - |
| 3351 | # transaction |
| 3352 | *test.local.pem |
| 3353 | # filename |
| 3354 | test.local.pem |
| 3355 | |
| 3356 | $ echo "@1 show ssl cert test.local.pem" | socat /var/run/haproxy.master - |
| 3357 | Filename: test.local.pem |
William Lallemand | 0c39526 | 2023-01-10 14:44:27 +0100 | [diff] [blame] | 3358 | Status: Used |
William Lallemand | d4f946c | 2019-12-05 10:26:40 +0100 | [diff] [blame] | 3359 | Serial: 03ECC19BA54B25E85ABA46EE561B9A10D26F |
| 3360 | notBefore: Sep 13 21:20:24 2019 GMT |
| 3361 | notAfter: Dec 12 21:20:24 2019 GMT |
| 3362 | Issuer: /C=US/O=Let's Encrypt/CN=Let's Encrypt Authority X3 |
| 3363 | Subject: /CN=test.local |
| 3364 | Subject Alternative Name: DNS:test.local, DNS:imap.test.local |
| 3365 | Algorithm: RSA2048 |
| 3366 | SHA1 FingerPrint: 417A11CAE25F607B24F638B4A8AEE51D1E211477 |
| 3367 | |
| 3368 | $ echo "@1 show ssl cert *test.local.pem" | socat /var/run/haproxy.master - |
| 3369 | Filename: *test.local.pem |
William Lallemand | 0c39526 | 2023-01-10 14:44:27 +0100 | [diff] [blame] | 3370 | Status: Unused |
William Lallemand | d4f946c | 2019-12-05 10:26:40 +0100 | [diff] [blame] | 3371 | [...] |
| 3372 | |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 3373 | show ssl crl-file [<crlfile>[:<index>]] |
William Lallemand | 0c39526 | 2023-01-10 14:44:27 +0100 | [diff] [blame] | 3374 | Display the list of CRL files loaded into the process. They are not used |
| 3375 | by any frontend or backend until their status is "Used". |
Remi Tricot-Le Breton | 3c222bd | 2021-04-27 16:28:25 +0200 | [diff] [blame] | 3376 | If a filename is prefixed by an asterisk, it is a transaction which is not |
| 3377 | committed yet. If a <crlfile> is specified without <index>, it will show the |
| 3378 | status of the CRL file ("Used"/"Unused") followed by details about all the |
| 3379 | Revocation Lists contained in the CRL file. The details displayed for every |
| 3380 | list are based on the output of "openssl crl -text -noout -in <file>". |
| 3381 | If a <crlfile> is specified followed by an <index>, it will only display the |
| 3382 | details of the list having the specified index. Indexes start from 1. |
| 3383 | If the index is invalid (too big for instance), nothing will be displayed. |
| 3384 | This command can be useful to check if a CRL file was properly updated. |
| 3385 | You can also display the details of an ongoing transaction by prefixing the |
| 3386 | filename by an asterisk. |
| 3387 | |
| 3388 | Example : |
| 3389 | |
| 3390 | $ echo "show ssl crl-file" | socat /var/run/haproxy.master - |
| 3391 | # transaction |
| 3392 | *crlfile.pem |
| 3393 | # filename |
| 3394 | crlfile.pem |
| 3395 | |
| 3396 | $ echo "show ssl crl-file crlfile.pem" | socat /var/run/haproxy.master - |
| 3397 | Filename: /home/tricot/work/haproxy/reg-tests/ssl/crlfile.pem |
| 3398 | Status: Used |
| 3399 | |
| 3400 | Certificate Revocation List #1: |
| 3401 | Version 1 |
| 3402 | Signature Algorithm: sha256WithRSAEncryption |
| 3403 | Issuer: /C=FR/O=HAProxy Technologies/CN=Intermediate CA2 |
| 3404 | Last Update: Apr 23 14:45:39 2021 GMT |
| 3405 | Next Update: Sep 8 14:45:39 2048 GMT |
| 3406 | Revoked Certificates: |
| 3407 | Serial Number: 1008 |
| 3408 | Revocation Date: Apr 23 14:45:36 2021 GMT |
| 3409 | |
| 3410 | Certificate Revocation List #2: |
| 3411 | Version 1 |
| 3412 | Signature Algorithm: sha256WithRSAEncryption |
| 3413 | Issuer: /C=FR/O=HAProxy Technologies/CN=Root CA |
| 3414 | Last Update: Apr 23 14:30:44 2021 GMT |
| 3415 | Next Update: Sep 8 14:30:44 2048 GMT |
| 3416 | No Revoked Certificates. |
| 3417 | |
William Lallemand | c69f02d | 2020-04-06 19:07:03 +0200 | [diff] [blame] | 3418 | show ssl crt-list [-n] [<filename>] |
William Lallemand | accac23 | 2020-04-02 17:42:51 +0200 | [diff] [blame] | 3419 | Display the list of crt-list and directories used in the HAProxy |
William Lallemand | c69f02d | 2020-04-06 19:07:03 +0200 | [diff] [blame] | 3420 | configuration. If a filename is specified, dump the content of a crt-list or |
| 3421 | a directory. Once dumped the output can be used as a crt-list file. |
| 3422 | The '-n' option can be used to display the line number, which is useful when |
| 3423 | combined with the 'del ssl crt-list' option when a entry is duplicated. The |
| 3424 | output with the '-n' option is not compatible with the crt-list format and |
| 3425 | not loadable by haproxy. |
William Lallemand | accac23 | 2020-04-02 17:42:51 +0200 | [diff] [blame] | 3426 | |
| 3427 | Example: |
William Lallemand | c69f02d | 2020-04-06 19:07:03 +0200 | [diff] [blame] | 3428 | echo "show ssl crt-list -n localhost.crt-list" | socat /tmp/sock1 - |
William Lallemand | accac23 | 2020-04-02 17:42:51 +0200 | [diff] [blame] | 3429 | # localhost.crt-list |
William Lallemand | c69f02d | 2020-04-06 19:07:03 +0200 | [diff] [blame] | 3430 | common.pem:1 !not.test1.com *.test1.com !localhost |
| 3431 | common.pem:2 |
| 3432 | ecdsa.pem:3 [verify none allow-0rtt ssl-min-ver TLSv1.0 ssl-max-ver TLSv1.3] localhost !www.test1.com |
| 3433 | ecdsa.pem:4 [verify none allow-0rtt ssl-min-ver TLSv1.0 ssl-max-ver TLSv1.3] |
William Lallemand | accac23 | 2020-04-02 17:42:51 +0200 | [diff] [blame] | 3434 | |
Remi Tricot-Le Breton | d92fd11 | 2021-06-10 13:51:13 +0200 | [diff] [blame] | 3435 | show ssl ocsp-response [<id>] |
| 3436 | Display the IDs of the OCSP tree entries corresponding to all the OCSP |
| 3437 | responses used in HAProxy, as well as the issuer's name and key hash and the |
| 3438 | serial number of the certificate for which the OCSP response was built. |
| 3439 | If a valid <id> is provided, display the contents of the corresponding OCSP |
| 3440 | response. The information displayed is the same as in an "openssl ocsp -respin |
| 3441 | <ocsp-response> -text" call. |
| 3442 | |
| 3443 | Example : |
| 3444 | |
| 3445 | $ echo "show ssl ocsp-response" | socat /var/run/haproxy.master - |
| 3446 | # Certificate IDs |
| 3447 | Certificate ID key : 303b300906052b0e03021a050004148a83e0060faff709ca7e9b95522a2e81635fda0a0414f652b0e435d5ea923851508f0adbe92d85de007a0202100a |
| 3448 | Certificate ID: |
Remi Tricot-Le Breton | d92fd11 | 2021-06-10 13:51:13 +0200 | [diff] [blame] | 3449 | Issuer Name Hash: 8A83E0060FAFF709CA7E9B95522A2E81635FDA0A |
| 3450 | Issuer Key Hash: F652B0E435D5EA923851508F0ADBE92D85DE007A |
| 3451 | Serial Number: 100A |
| 3452 | |
| 3453 | $ echo "show ssl ocsp-response 303b300906052b0e03021a050004148a83e0060faff709ca7e9b95522a2e81635fda0a0414f652b0e435d5ea923851508f0adbe92d85de007a0202100a" | socat /var/run/haproxy.master - |
| 3454 | OCSP Response Data: |
| 3455 | OCSP Response Status: successful (0x0) |
| 3456 | Response Type: Basic OCSP Response |
| 3457 | Version: 1 (0x0) |
| 3458 | Responder Id: C = FR, O = HAProxy Technologies, CN = ocsp.haproxy.com |
| 3459 | Produced At: May 27 15:43:38 2021 GMT |
| 3460 | Responses: |
| 3461 | Certificate ID: |
| 3462 | Hash Algorithm: sha1 |
| 3463 | Issuer Name Hash: 8A83E0060FAFF709CA7E9B95522A2E81635FDA0A |
| 3464 | Issuer Key Hash: F652B0E435D5EA923851508F0ADBE92D85DE007A |
| 3465 | Serial Number: 100A |
| 3466 | Cert Status: good |
| 3467 | This Update: May 27 15:43:38 2021 GMT |
| 3468 | Next Update: Oct 12 15:43:38 2048 GMT |
| 3469 | [...] |
| 3470 | |
Remi Tricot-Le Breton | f87c67e | 2022-04-21 12:06:41 +0200 | [diff] [blame] | 3471 | show ssl providers |
| 3472 | Display the names of the providers loaded by OpenSSL during init. Provider |
| 3473 | loading can indeed be configured via the OpenSSL configuration file and this |
| 3474 | option allows to check that the right providers were loaded. This command is |
| 3475 | only available with OpenSSL v3. |
| 3476 | |
| 3477 | Example : |
| 3478 | $ echo "show ssl providers" | socat /var/run/haproxy.master - |
| 3479 | Loaded providers : |
| 3480 | - fips |
| 3481 | - base |
| 3482 | |
William Lallemand | f76b3b4 | 2022-10-14 15:29:07 +0200 | [diff] [blame] | 3483 | show startup-logs |
| 3484 | Dump all messages emitted during the startup of the current haproxy process, |
| 3485 | each startup-logs buffer is unique to its haproxy worker. |
| 3486 | |
William Lallemand | 5d1e131 | 2022-10-14 15:41:55 +0200 | [diff] [blame] | 3487 | This keyword also exists on the master CLI, which shows the latest startup or |
| 3488 | reload tentative. |
| 3489 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3490 | show table |
| 3491 | Dump general information on all known stick-tables. Their name is returned |
| 3492 | (the name of the proxy which holds them), their type (currently zero, always |
| 3493 | IP), their size in maximum possible number of entries, and the number of |
| 3494 | entries currently in use. |
| 3495 | |
| 3496 | Example : |
| 3497 | $ echo "show table" | socat stdio /tmp/sock1 |
| 3498 | >>> # table: front_pub, type: ip, size:204800, used:171454 |
| 3499 | >>> # table: back_rdp, type: ip, size:204800, used:0 |
| 3500 | |
Adis Nezirovic | 1a693fc | 2020-01-16 15:19:29 +0100 | [diff] [blame] | 3501 | show table <name> [ data.<type> <operator> <value> [data.<type> ...]] | [ key <key> ] |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3502 | Dump contents of stick-table <name>. In this mode, a first line of generic |
| 3503 | information about the table is reported as with "show table", then all |
| 3504 | entries are dumped. Since this can be quite heavy, it is possible to specify |
| 3505 | a filter in order to specify what entries to display. |
| 3506 | |
| 3507 | When the "data." form is used the filter applies to the stored data (see |
| 3508 | "stick-table" in section 4.2). A stored data type must be specified |
| 3509 | in <type>, and this data type must be stored in the table otherwise an |
| 3510 | error is reported. The data is compared according to <operator> with the |
| 3511 | 64-bit integer <value>. Operators are the same as with the ACLs : |
| 3512 | |
| 3513 | - eq : match entries whose data is equal to this value |
| 3514 | - ne : match entries whose data is not equal to this value |
| 3515 | - le : match entries whose data is less than or equal to this value |
| 3516 | - ge : match entries whose data is greater than or equal to this value |
| 3517 | - lt : match entries whose data is less than this value |
| 3518 | - gt : match entries whose data is greater than this value |
| 3519 | |
Adis Nezirovic | 1a693fc | 2020-01-16 15:19:29 +0100 | [diff] [blame] | 3520 | In this form, you can use multiple data filter entries, up to a maximum |
| 3521 | defined during build time (4 by default). |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3522 | |
| 3523 | When the key form is used the entry <key> is shown. The key must be of the |
| 3524 | same type as the table, which currently is limited to IPv4, IPv6, integer, |
| 3525 | and string. |
| 3526 | |
| 3527 | Example : |
| 3528 | $ echo "show table http_proxy" | socat stdio /tmp/sock1 |
| 3529 | >>> # table: http_proxy, type: ip, size:204800, used:2 |
| 3530 | >>> 0x80e6a4c: key=127.0.0.1 use=0 exp=3594729 gpc0=0 conn_rate(30000)=1 \ |
| 3531 | bytes_out_rate(60000)=187 |
| 3532 | >>> 0x80e6a80: key=127.0.0.2 use=0 exp=3594740 gpc0=1 conn_rate(30000)=10 \ |
| 3533 | bytes_out_rate(60000)=191 |
| 3534 | |
| 3535 | $ echo "show table http_proxy data.gpc0 gt 0" | socat stdio /tmp/sock1 |
| 3536 | >>> # table: http_proxy, type: ip, size:204800, used:2 |
| 3537 | >>> 0x80e6a80: key=127.0.0.2 use=0 exp=3594740 gpc0=1 conn_rate(30000)=10 \ |
| 3538 | bytes_out_rate(60000)=191 |
| 3539 | |
| 3540 | $ echo "show table http_proxy data.conn_rate gt 5" | \ |
| 3541 | socat stdio /tmp/sock1 |
| 3542 | >>> # table: http_proxy, type: ip, size:204800, used:2 |
| 3543 | >>> 0x80e6a80: key=127.0.0.2 use=0 exp=3594740 gpc0=1 conn_rate(30000)=10 \ |
| 3544 | bytes_out_rate(60000)=191 |
| 3545 | |
| 3546 | $ echo "show table http_proxy key 127.0.0.2" | \ |
| 3547 | socat stdio /tmp/sock1 |
| 3548 | >>> # table: http_proxy, type: ip, size:204800, used:2 |
| 3549 | >>> 0x80e6a80: key=127.0.0.2 use=0 exp=3594740 gpc0=1 conn_rate(30000)=10 \ |
| 3550 | bytes_out_rate(60000)=191 |
| 3551 | |
| 3552 | When the data criterion applies to a dynamic value dependent on time such as |
| 3553 | a bytes rate, the value is dynamically computed during the evaluation of the |
| 3554 | entry in order to decide whether it has to be dumped or not. This means that |
| 3555 | such a filter could match for some time then not match anymore because as |
| 3556 | time goes, the average event rate drops. |
| 3557 | |
| 3558 | It is possible to use this to extract lists of IP addresses abusing the |
| 3559 | service, in order to monitor them or even blacklist them in a firewall. |
| 3560 | Example : |
| 3561 | $ echo "show table http_proxy data.gpc0 gt 0" \ |
| 3562 | | socat stdio /tmp/sock1 \ |
| 3563 | | fgrep 'key=' | cut -d' ' -f2 | cut -d= -f2 > abusers-ip.txt |
| 3564 | ( or | awk '/key/{ print a[split($2,a,"=")]; }' ) |
| 3565 | |
Willy Tarreau | 16b282f | 2022-11-29 11:55:18 +0100 | [diff] [blame] | 3566 | When the stick-table is synchronized to a peers section supporting sharding, |
| 3567 | the shard number will be displayed for each key (otherwise '0' is reported). |
| 3568 | This allows to know which peers will receive this key. |
| 3569 | Example: |
| 3570 | $ echo "show table http_proxy" | socat stdio /tmp/sock1 | fgrep shard= |
| 3571 | 0x7f23b0c822a8: key=10.0.0.2 use=0 exp=296398 shard=9 gpc0=0 |
| 3572 | 0x7f23a063f948: key=10.0.0.6 use=0 exp=296075 shard=12 gpc0=0 |
| 3573 | 0x7f23b03920b8: key=10.0.0.8 use=0 exp=296766 shard=1 gpc0=0 |
| 3574 | 0x7f23a43c09e8: key=10.0.0.12 use=0 exp=295368 shard=8 gpc0=0 |
| 3575 | |
Willy Tarreau | 7eff06e | 2021-01-29 11:32:55 +0100 | [diff] [blame] | 3576 | show tasks |
| 3577 | Dumps the number of tasks currently in the run queue, with the number of |
| 3578 | occurrences for each function, and their average latency when it's known |
| 3579 | (for pure tasks with task profiling enabled). The dump is a snapshot of the |
| 3580 | instant it's done, and there may be variations depending on what tasks are |
| 3581 | left in the queue at the moment it happens, especially in mono-thread mode |
| 3582 | as there's less chance that I/Os can refill the queue (unless the queue is |
| 3583 | full). This command takes exclusive access to the process and can cause |
| 3584 | minor but measurable latencies when issued on a highly loaded process, so |
| 3585 | it must not be abused by monitoring bots. |
| 3586 | |
Willy Tarreau | 4e2b646 | 2019-05-16 17:44:30 +0200 | [diff] [blame] | 3587 | show threads |
| 3588 | Dumps some internal states and structures for each thread, that may be useful |
| 3589 | to help developers understand a problem. The output tries to be readable by |
Willy Tarreau | c7091d8 | 2019-05-17 10:08:49 +0200 | [diff] [blame] | 3590 | showing one block per thread. When haproxy is built with USE_THREAD_DUMP=1, |
| 3591 | an advanced dump mechanism involving thread signals is used so that each |
| 3592 | thread can dump its own state in turn. Without this option, the thread |
| 3593 | processing the command shows all its details but the other ones are less |
Willy Tarreau | e6a02fa | 2019-05-22 07:06:44 +0200 | [diff] [blame] | 3594 | detailed. A star ('*') is displayed in front of the thread handling the |
| 3595 | command. A right angle bracket ('>') may also be displayed in front of |
| 3596 | threads which didn't make any progress since last invocation of this command, |
| 3597 | indicating a bug in the code which must absolutely be reported. When this |
| 3598 | happens between two threads it usually indicates a deadlock. If a thread is |
| 3599 | alone, it's a different bug like a corrupted list. In all cases the process |
| 3600 | needs is not fully functional anymore and needs to be restarted. |
| 3601 | |
| 3602 | The output format is purposely not documented so that it can easily evolve as |
| 3603 | new needs are identified, without having to maintain any form of backwards |
| 3604 | compatibility, and just like with "show activity", the values are meaningless |
| 3605 | without the code at hand. |
Willy Tarreau | 4e2b646 | 2019-05-16 17:44:30 +0200 | [diff] [blame] | 3606 | |
William Lallemand | bb93346 | 2016-05-31 21:09:53 +0200 | [diff] [blame] | 3607 | show tls-keys [id|*] |
| 3608 | Dump all loaded TLS ticket keys references. The TLS ticket key reference ID |
| 3609 | and the file from which the keys have been loaded is shown. Both of those |
| 3610 | can be used to update the TLS keys using "set ssl tls-key". If an ID is |
| 3611 | specified as parameter, it will dump the tickets, using * it will dump every |
| 3612 | keys from every references. |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3613 | |
Simon Horman | 6f6bb38 | 2017-01-04 09:37:26 +0100 | [diff] [blame] | 3614 | show schema json |
| 3615 | Dump the schema used for the output of "show info json" and "show stat json". |
| 3616 | |
| 3617 | The contains no extra whitespace in order to reduce the volume of output. |
| 3618 | For human consumption passing the output through a pretty printer may be |
| 3619 | helpful. Example : |
| 3620 | |
| 3621 | $ echo "show schema json" | socat /var/run/haproxy.sock stdio | \ |
| 3622 | python -m json.tool |
| 3623 | |
| 3624 | The schema follows "JSON Schema" (json-schema.org) and accordingly |
| 3625 | verifiers may be used to verify the output of "show info json" and "show |
| 3626 | stat json" against the schema. |
| 3627 | |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3628 | show trace [<source>] |
| 3629 | Show the current trace status. For each source a line is displayed with a |
| 3630 | single-character status indicating if the trace is stopped, waiting, or |
| 3631 | running. The output sink used by the trace is indicated (or "none" if none |
| 3632 | was set), as well as the number of dropped events in this sink, followed by a |
| 3633 | brief description of the source. If a source name is specified, a detailed |
| 3634 | list of all events supported by the source, and their status for each action |
| 3635 | (report, start, pause, stop), indicated by a "+" if they are enabled, or a |
| 3636 | "-" otherwise. All these events are independent and an event might trigger |
| 3637 | a start without being reported and conversely. |
Simon Horman | 6f6bb38 | 2017-01-04 09:37:26 +0100 | [diff] [blame] | 3638 | |
William Lallemand | 740629e | 2021-12-14 15:22:29 +0100 | [diff] [blame] | 3639 | show version |
| 3640 | Show the version of the current HAProxy process. This is available from |
| 3641 | master and workers CLI. |
| 3642 | Example: |
| 3643 | |
| 3644 | $ echo "show version" | socat /var/run/haproxy.sock stdio |
| 3645 | 2.4.9 |
| 3646 | |
| 3647 | $ echo "show version" | socat /var/run/haproxy-master.sock stdio |
| 3648 | 2.5.0 |
| 3649 | |
Willy Tarreau | 44aed90 | 2015-10-13 14:45:29 +0200 | [diff] [blame] | 3650 | shutdown frontend <frontend> |
| 3651 | Completely delete the specified frontend. All the ports it was bound to will |
| 3652 | be released. It will not be possible to enable the frontend anymore after |
| 3653 | this operation. This is intended to be used in environments where stopping a |
| 3654 | proxy is not even imaginable but a misconfigured proxy must be fixed. That |
| 3655 | way it's possible to release the port and bind it into another process to |
| 3656 | restore operations. The frontend will not appear at all on the stats page |
| 3657 | once it is terminated. |
| 3658 | |
| 3659 | The frontend may be specified either by its name or by its numeric ID, |
| 3660 | prefixed with a sharp ('#'). |
| 3661 | |
| 3662 | This command is restricted and can only be issued on sockets configured for |
| 3663 | level "admin". |
| 3664 | |
| 3665 | shutdown session <id> |
| 3666 | Immediately terminate the session matching the specified session identifier. |
| 3667 | This identifier is the first field at the beginning of the lines in the dumps |
| 3668 | of "show sess" (it corresponds to the session pointer). This can be used to |
| 3669 | terminate a long-running session without waiting for a timeout or when an |
| 3670 | endless transfer is ongoing. Such terminated sessions are reported with a 'K' |
| 3671 | flag in the logs. |
| 3672 | |
| 3673 | shutdown sessions server <backend>/<server> |
| 3674 | Immediately terminate all the sessions attached to the specified server. This |
| 3675 | can be used to terminate long-running sessions after a server is put into |
| 3676 | maintenance mode, for instance. Such terminated sessions are reported with a |
| 3677 | 'K' flag in the logs. |
| 3678 | |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3679 | trace |
| 3680 | The "trace" command alone lists the trace sources, their current status, and |
| 3681 | their brief descriptions. It is only meant as a menu to enter next levels, |
| 3682 | see other "trace" commands below. |
| 3683 | |
| 3684 | trace 0 |
| 3685 | Immediately stops all traces. This is made to be used as a quick solution |
| 3686 | to terminate a debugging session or as an emergency action to be used in case |
| 3687 | complex traces were enabled on multiple sources and impact the service. |
| 3688 | |
| 3689 | trace <source> event [ [+|-|!]<name> ] |
| 3690 | Without argument, this will list all the events supported by the designated |
| 3691 | source. They are prefixed with a "-" if they are not enabled, or a "+" if |
| 3692 | they are enabled. It is important to note that a single trace may be labelled |
| 3693 | with multiple events, and as long as any of the enabled events matches one of |
| 3694 | the events labelled on the trace, the event will be passed to the trace |
| 3695 | subsystem. For example, receiving an HTTP/2 frame of type HEADERS may trigger |
| 3696 | a frame event and a stream event since the frame creates a new stream. If |
| 3697 | either the frame event or the stream event are enabled for this source, the |
| 3698 | frame will be passed to the trace framework. |
| 3699 | |
| 3700 | With an argument, it is possible to toggle the state of each event and |
| 3701 | individually enable or disable them. Two special keywords are supported, |
| 3702 | "none", which matches no event, and is used to disable all events at once, |
| 3703 | and "any" which matches all events, and is used to enable all events at |
| 3704 | once. Other events are specific to the event source. It is possible to |
| 3705 | enable one event by specifying its name, optionally prefixed with '+' for |
| 3706 | better readability. It is possible to disable one event by specifying its |
| 3707 | name prefixed by a '-' or a '!'. |
| 3708 | |
| 3709 | One way to completely disable a trace source is to pass "event none", and |
| 3710 | this source will instantly be totally ignored. |
| 3711 | |
| 3712 | trace <source> level [<level>] |
Willy Tarreau | 2ea549b | 2019-08-29 08:01:48 +0200 | [diff] [blame] | 3713 | Without argument, this will list all trace levels for this source, and the |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3714 | current one will be indicated by a star ('*') prepended in front of it. With |
Willy Tarreau | 2ea549b | 2019-08-29 08:01:48 +0200 | [diff] [blame] | 3715 | an argument, this will change the trace level to the specified level. Detail |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3716 | levels are a form of filters that are applied before reporting the events. |
Willy Tarreau | 2ea549b | 2019-08-29 08:01:48 +0200 | [diff] [blame] | 3717 | These filters are used to selectively include or exclude events depending on |
| 3718 | their level of importance. For example a developer might need to know |
| 3719 | precisely where in the code an HTTP header was considered invalid while the |
| 3720 | end user may not even care about this header's validity at all. There are |
| 3721 | currently 5 distinct levels for a trace : |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3722 | |
| 3723 | user this will report information that are suitable for use by a |
| 3724 | regular haproxy user who wants to observe his traffic. |
| 3725 | Typically some HTTP requests and responses will be reported |
| 3726 | without much detail. Most sources will set this as the |
| 3727 | default level to ease operations. |
| 3728 | |
Willy Tarreau | 2ea549b | 2019-08-29 08:01:48 +0200 | [diff] [blame] | 3729 | proto in addition to what is reported at the "user" level, it also |
| 3730 | displays protocol-level updates. This can for example be the |
| 3731 | frame types or HTTP headers after decoding. |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3732 | |
| 3733 | state in addition to what is reported at the "proto" level, it |
| 3734 | will also display state transitions (or failed transitions) |
| 3735 | which happen in parsers, so this will show attempts to |
| 3736 | perform an operation while the "proto" level only shows |
| 3737 | the final operation. |
| 3738 | |
Willy Tarreau | 2ea549b | 2019-08-29 08:01:48 +0200 | [diff] [blame] | 3739 | data in addition to what is reported at the "state" level, it |
| 3740 | will also include data transfers between the various layers. |
| 3741 | |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3742 | developer it reports everything available, which can include advanced |
| 3743 | information such as "breaking out of this loop" that are |
| 3744 | only relevant to a developer trying to understand a bug that |
Willy Tarreau | 09fb0df | 2019-08-29 08:40:59 +0200 | [diff] [blame] | 3745 | only happens once in a while in field. Function names are |
| 3746 | only reported at this level. |
Willy Tarreau | f909c91 | 2019-08-22 20:06:04 +0200 | [diff] [blame] | 3747 | |
| 3748 | It is highly recommended to always use the "user" level only and switch to |
| 3749 | other levels only if instructed to do so by a developer. Also it is a good |
| 3750 | idea to first configure the events before switching to higher levels, as it |
| 3751 | may save from dumping many lines if no filter is applied. |
| 3752 | |
| 3753 | trace <source> lock [criterion] |
| 3754 | Without argument, this will list all the criteria supported by this source |
| 3755 | for lock-on processing, and display the current choice by a star ('*') in |
| 3756 | front of it. Lock-on means that the source will focus on the first matching |
| 3757 | event and only stick to the criterion which triggered this event, and ignore |
| 3758 | all other ones until the trace stops. This allows for example to take a trace |
| 3759 | on a single connection or on a single stream. The following criteria are |
| 3760 | supported by some traces, though not necessarily all, since some of them |
| 3761 | might not be available to the source : |
| 3762 | |
| 3763 | backend lock on the backend that started the trace |
| 3764 | connection lock on the connection that started the trace |
| 3765 | frontend lock on the frontend that started the trace |
| 3766 | listener lock on the listener that started the trace |
| 3767 | nothing do not lock on anything |
| 3768 | server lock on the server that started the trace |
| 3769 | session lock on the session that started the trace |
| 3770 | thread lock on the thread that started the trace |
| 3771 | |
| 3772 | In addition to this, each source may provide up to 4 specific criteria such |
| 3773 | as internal states or connection IDs. For example in HTTP/2 it is possible |
| 3774 | to lock on the H2 stream and ignore other streams once a strace starts. |
| 3775 | |
| 3776 | When a criterion is passed in argument, this one is used instead of the |
| 3777 | other ones and any existing tracking is immediately terminated so that it can |
| 3778 | restart with the new criterion. The special keyword "nothing" is supported by |
| 3779 | all sources to permanently disable tracking. |
| 3780 | |
| 3781 | trace <source> { pause | start | stop } [ [+|-|!]event] |
| 3782 | Without argument, this will list the events enabled to automatically pause, |
| 3783 | start, or stop a trace for this source. These events are specific to each |
| 3784 | trace source. With an argument, this will either enable the event for the |
| 3785 | specified action (if optionally prefixed by a '+') or disable it (if |
| 3786 | prefixed by a '-' or '!'). The special keyword "now" is not an event and |
| 3787 | requests to take the action immediately. The keywords "none" and "any" are |
| 3788 | supported just like in "trace event". |
| 3789 | |
| 3790 | The 3 supported actions are respectively "pause", "start" and "stop". The |
| 3791 | "pause" action enumerates events which will cause a running trace to stop and |
| 3792 | wait for a new start event to restart it. The "start" action enumerates the |
| 3793 | events which switch the trace into the waiting mode until one of the start |
| 3794 | events appears. And the "stop" action enumerates the events which definitely |
| 3795 | stop the trace until it is manually enabled again. In practice it makes sense |
| 3796 | to manually start a trace using "start now" without caring about events, and |
| 3797 | to stop it using "stop now". In order to capture more subtle event sequences, |
| 3798 | setting "start" to a normal event (like receiving an HTTP request) and "stop" |
| 3799 | to a very rare event like emitting a certain error, will ensure that the last |
| 3800 | captured events will match the desired criteria. And the pause event is |
| 3801 | useful to detect the end of a sequence, disable the lock-on and wait for |
| 3802 | another opportunity to take a capture. In this case it can make sense to |
| 3803 | enable lock-on to spot only one specific criterion (e.g. a stream), and have |
| 3804 | "start" set to anything that starts this criterion (e.g. all events which |
| 3805 | create a stream), "stop" set to the expected anomaly, and "pause" to anything |
| 3806 | that ends that criterion (e.g. any end of stream event). In this case the |
| 3807 | trace log will contain complete sequences of perfectly clean series affecting |
| 3808 | a single object, until the last sequence containing everything from the |
| 3809 | beginning to the anomaly. |
| 3810 | |
| 3811 | trace <source> sink [<sink>] |
| 3812 | Without argument, this will list all event sinks available for this source, |
| 3813 | and the currently configured one will have a star ('*') prepended in front |
| 3814 | of it. Sink "none" is always available and means that all events are simply |
| 3815 | dropped, though their processing is not ignored (e.g. lock-on does occur). |
| 3816 | Other sinks are available depending on configuration and build options, but |
| 3817 | typically "stdout" and "stderr" will be usable in debug mode, and in-memory |
| 3818 | ring buffers should be available as well. When a name is specified, the sink |
| 3819 | instantly changes for the specified source. Events are not changed during a |
| 3820 | sink change. In the worst case some may be lost if an invalid sink is used |
| 3821 | (or "none"), but operations do continue to a different destination. |
| 3822 | |
Willy Tarreau | 370a694 | 2019-08-29 08:24:16 +0200 | [diff] [blame] | 3823 | trace <source> verbosity [<level>] |
| 3824 | Without argument, this will list all verbosity levels for this source, and the |
| 3825 | current one will be indicated by a star ('*') prepended in front of it. With |
| 3826 | an argument, this will change the verbosity level to the specified one. |
| 3827 | |
| 3828 | Verbosity levels indicate how far the trace decoder should go to provide |
| 3829 | detailed information. It depends on the trace source, since some sources will |
| 3830 | not even provide a specific decoder. Level "quiet" is always available and |
| 3831 | disables any decoding. It can be useful when trying to figure what's |
| 3832 | happening before trying to understand the details, since it will have a very |
| 3833 | low impact on performance and trace size. When no verbosity levels are |
| 3834 | declared by a source, level "default" is available and will cause a decoder |
| 3835 | to be called when specified in the traces. It is an opportunistic decoding. |
| 3836 | When the source declares some verbosity levels, these ones are listed with |
| 3837 | a description of what they correspond to. In this case the trace decoder |
| 3838 | provided by the source will be as accurate as possible based on the |
| 3839 | information available at the trace point. The first level above "quiet" is |
| 3840 | set by default. |
| 3841 | |
Remi Tricot-Le Breton | eeaa29b | 2022-12-20 11:11:07 +0100 | [diff] [blame] | 3842 | update ssl ocsp-response <certfile> |
| 3843 | Create an OCSP request for the specified <certfile> and send it to the OCSP |
| 3844 | responder whose URI should be specified in the "Authority Information Access" |
| 3845 | section of the certificate. Only the first URI is taken into account. The |
| 3846 | OCSP response that we should receive in return is then checked and inserted |
| 3847 | in the local OCSP response tree. This command will only work for certificates |
| 3848 | that already had a stored OCSP response, either because it was provided |
| 3849 | during init or if it was previously set through the "set ssl cert" or "set |
| 3850 | ssl ocsp-response" commands. |
| 3851 | If the received OCSP response is valid and was properly inserted into the |
| 3852 | local tree, its contents will be displayed on the standard output. The format |
| 3853 | is the same as the one described in "show ssl ocsp-response". |
| 3854 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 3855 | |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3856 | 9.4. Master CLI |
| 3857 | --------------- |
| 3858 | |
| 3859 | The master CLI is a socket bound to the master process in master-worker mode. |
| 3860 | This CLI gives access to the unix socket commands in every running or leaving |
| 3861 | processes and allows a basic supervision of those processes. |
| 3862 | |
| 3863 | The master CLI is configurable only from the haproxy program arguments with |
| 3864 | the -S option. This option also takes bind options separated by commas. |
| 3865 | |
| 3866 | Example: |
| 3867 | |
| 3868 | # haproxy -W -S 127.0.0.1:1234 -f test1.cfg |
| 3869 | # haproxy -Ws -S /tmp/master-socket,uid,1000,gid,1000,mode,600 -f test1.cfg |
William Lallemand | b7ea141 | 2018-12-13 09:05:47 +0100 | [diff] [blame] | 3870 | # haproxy -W -S /tmp/master-socket,level,user -f test1.cfg |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3871 | |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3872 | |
William Lallemand | a662275 | 2022-03-31 15:26:51 +0200 | [diff] [blame] | 3873 | 9.4.1. Master CLI commands |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3874 | -------------------------- |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3875 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3876 | @<[!]pid> |
| 3877 | The master CLI uses a special prefix notation to access the multiple |
| 3878 | processes. This notation is easily identifiable as it begins by a @. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3879 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3880 | A @ prefix can be followed by a relative process number or by an exclamation |
| 3881 | point and a PID. (e.g. @1 or @!1271). A @ alone could be use to specify the |
| 3882 | master. Leaving processes are only accessible with the PID as relative process |
| 3883 | number are only usable with the current processes. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3884 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3885 | Examples: |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3886 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3887 | $ socat /var/run/haproxy-master.sock readline |
| 3888 | prompt |
| 3889 | master> @1 show info; @2 show info |
| 3890 | [...] |
| 3891 | Process_num: 1 |
| 3892 | Pid: 1271 |
| 3893 | [...] |
| 3894 | Process_num: 2 |
| 3895 | Pid: 1272 |
| 3896 | [...] |
| 3897 | master> |
Willy Tarreau | 52880f9 | 2018-12-15 13:30:03 +0100 | [diff] [blame] | 3898 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3899 | $ echo '@!1271 show info; @!1272 show info' | socat /var/run/haproxy-master.sock - |
| 3900 | [...] |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3901 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3902 | A prefix could be use as a command, which will send every next commands to |
| 3903 | the specified process. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3904 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3905 | Examples: |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3906 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3907 | $ socat /var/run/haproxy-master.sock readline |
| 3908 | prompt |
| 3909 | master> @1 |
| 3910 | 1271> show info |
| 3911 | [...] |
| 3912 | 1271> show stat |
| 3913 | [...] |
| 3914 | 1271> @ |
| 3915 | master> |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3916 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3917 | $ echo '@1; show info; show stat; @2; show info; show stat' | socat /var/run/haproxy-master.sock - |
| 3918 | [...] |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3919 | |
William Lallemand | a5ce28b | 2022-02-02 15:29:21 +0100 | [diff] [blame] | 3920 | expert-mode [on|off] |
| 3921 | This command activates the "expert-mode" for every worker accessed from the |
William Lallemand | 2a17191 | 2022-02-02 11:43:20 +0100 | [diff] [blame] | 3922 | master CLI. Combined with "mcli-debug-mode" it also activates the command on |
William Lallemand | dae12c7 | 2022-02-02 14:13:54 +0100 | [diff] [blame] | 3923 | the master. Display the flag "e" in the master CLI prompt. |
William Lallemand | a5ce28b | 2022-02-02 15:29:21 +0100 | [diff] [blame] | 3924 | |
William Lallemand | 2a17191 | 2022-02-02 11:43:20 +0100 | [diff] [blame] | 3925 | See also "expert-mode" in Section 9.3 and "mcli-debug-mode" in 9.4.1. |
William Lallemand | a5ce28b | 2022-02-02 15:29:21 +0100 | [diff] [blame] | 3926 | |
| 3927 | experimental-mode [on|off] |
| 3928 | This command activates the "experimental-mode" for every worker accessed from |
William Lallemand | 2a17191 | 2022-02-02 11:43:20 +0100 | [diff] [blame] | 3929 | the master CLI. Combined with "mcli-debug-mode" it also activates the command on |
William Lallemand | dae12c7 | 2022-02-02 14:13:54 +0100 | [diff] [blame] | 3930 | the master. Display the flag "x" in the master CLI prompt. |
William Lallemand | 2a17191 | 2022-02-02 11:43:20 +0100 | [diff] [blame] | 3931 | |
| 3932 | See also "experimental-mode" in Section 9.3 and "mcli-debug-mode" in 9.4.1. |
William Lallemand | a5ce28b | 2022-02-02 15:29:21 +0100 | [diff] [blame] | 3933 | |
William Lallemand | 2a17191 | 2022-02-02 11:43:20 +0100 | [diff] [blame] | 3934 | mcli-debug-mode [on|off] |
| 3935 | This keyword allows a special mode in the master CLI which enables every |
| 3936 | keywords that were meant for a worker CLI on the master CLI, allowing to debug |
| 3937 | the master process. Once activated, you list the new available keywords with |
| 3938 | "help". Combined with "experimental-mode" or "expert-mode" it enables even |
William Lallemand | dae12c7 | 2022-02-02 14:13:54 +0100 | [diff] [blame] | 3939 | more keywords. Display the flag "d" in the master CLI prompt. |
William Lallemand | a5ce28b | 2022-02-02 15:29:21 +0100 | [diff] [blame] | 3940 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3941 | prompt |
| 3942 | When the prompt is enabled (via the "prompt" command), the context the CLI is |
| 3943 | working on is displayed in the prompt. The master is identified by the "master" |
| 3944 | string, and other processes are identified with their PID. In case the last |
| 3945 | reload failed, the master prompt will be changed to "master[ReloadFailed]>" so |
| 3946 | that it becomes visible that the process is still running on the previous |
| 3947 | configuration and that the new configuration is not operational. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3948 | |
William Lallemand | dae12c7 | 2022-02-02 14:13:54 +0100 | [diff] [blame] | 3949 | The prompt of the master CLI is able to display several flags which are the |
| 3950 | enable modes. "d" for mcli-debug-mode, "e" for expert-mode, "x" for |
| 3951 | experimental-mode. |
| 3952 | |
| 3953 | Example: |
| 3954 | $ socat /var/run/haproxy-master.sock - |
| 3955 | prompt |
| 3956 | master> expert-mode on |
| 3957 | master(e)> experimental-mode on |
| 3958 | master(xe)> mcli-debug-mode on |
| 3959 | master(xed)> @1 |
| 3960 | 95191(xed)> |
| 3961 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3962 | reload |
| 3963 | You can also reload the HAProxy master process with the "reload" command which |
| 3964 | does the same as a `kill -USR2` on the master process, provided that the user |
| 3965 | has at least "operator" or "admin" privileges. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3966 | |
William Lallemand | 70c5ad4 | 2022-09-24 16:44:44 +0200 | [diff] [blame] | 3967 | This command allows you to perform a synchronous reload, the command will |
| 3968 | return a reload status, once the reload was performed. Be careful with the |
| 3969 | timeout if a tool is used to parse it, it is only returned once the |
William Lallemand | bb650f2 | 2022-09-27 11:38:10 +0200 | [diff] [blame] | 3970 | configuration is parsed and the new worker is forked. The "socat" command uses |
| 3971 | a timeout of 0.5s by default so it will quits before showing the message if |
| 3972 | the reload is too long. "ncat" does not have a timeout by default. |
William Lallemand | ef3e5a1 | 2022-10-13 18:14:55 +0200 | [diff] [blame] | 3973 | When compiled with USE_SHM_OPEN=1, the reload command is also able to dump |
| 3974 | the startup-logs of the master. |
William Lallemand | 70c5ad4 | 2022-09-24 16:44:44 +0200 | [diff] [blame] | 3975 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 3976 | Example: |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 3977 | |
William Lallemand | bb650f2 | 2022-09-27 11:38:10 +0200 | [diff] [blame] | 3978 | $ echo "reload" | socat -t300 /var/run/haproxy-master.sock stdin |
William Lallemand | ef3e5a1 | 2022-10-13 18:14:55 +0200 | [diff] [blame] | 3979 | Success=1 |
| 3980 | -- |
| 3981 | [NOTICE] (482713) : haproxy version is 2.7-dev7-4827fb-69 |
| 3982 | [NOTICE] (482713) : path to executable is ./haproxy |
| 3983 | [WARNING] (482713) : config : 'http-request' rules ignored for proxy 'frt1' as they require HTTP mode. |
| 3984 | [NOTICE] (482713) : New worker (482720) forked |
| 3985 | [NOTICE] (482713) : Loading success. |
William Lallemand | 70c5ad4 | 2022-09-24 16:44:44 +0200 | [diff] [blame] | 3986 | |
William Lallemand | bb650f2 | 2022-09-27 11:38:10 +0200 | [diff] [blame] | 3987 | $ echo "reload" | socat -t300 /var/run/haproxy-master.sock stdin |
William Lallemand | ef3e5a1 | 2022-10-13 18:14:55 +0200 | [diff] [blame] | 3988 | Success=0 |
| 3989 | -- |
| 3990 | [NOTICE] (482886) : haproxy version is 2.7-dev7-4827fb-69 |
| 3991 | [NOTICE] (482886) : path to executable is ./haproxy |
| 3992 | [ALERT] (482886) : config : parsing [test3.cfg:1]: unknown keyword 'Aglobal' out of section. |
| 3993 | [ALERT] (482886) : config : Fatal errors found in configuration. |
| 3994 | [WARNING] (482886) : Loading failure! |
| 3995 | |
| 3996 | $ |
William Lallemand | 70c5ad4 | 2022-09-24 16:44:44 +0200 | [diff] [blame] | 3997 | |
| 3998 | The reload command is the last executed on the master CLI, every other |
| 3999 | command after it are ignored. Once the reload command returns its status, it |
| 4000 | will close the connection to the CLI. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 4001 | |
William Lallemand | 70c5ad4 | 2022-09-24 16:44:44 +0200 | [diff] [blame] | 4002 | Note that a reload will close all connections to the master CLI. |
William Lallemand | a57b7e3 | 2018-12-14 21:11:31 +0100 | [diff] [blame] | 4003 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 4004 | show proc |
| 4005 | The master CLI introduces a 'show proc' command to surpervise the |
| 4006 | processe. |
William Lallemand | a57b7e3 | 2018-12-14 21:11:31 +0100 | [diff] [blame] | 4007 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 4008 | Example: |
William Lallemand | a57b7e3 | 2018-12-14 21:11:31 +0100 | [diff] [blame] | 4009 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 4010 | $ echo 'show proc' | socat /var/run/haproxy-master.sock - |
| 4011 | #<PID> <type> <reloads> <uptime> <version> |
| 4012 | 1162 master 5 [failed: 0] 0d00h02m07s 2.5-dev13 |
| 4013 | # workers |
| 4014 | 1271 worker 1 0d00h00m00s 2.5-dev13 |
| 4015 | # old workers |
| 4016 | 1233 worker 3 0d00h00m43s 2.0-dev3-6019f6-289 |
| 4017 | # programs |
| 4018 | 1244 foo 0 0d00h00m00s - |
| 4019 | 1255 bar 0 0d00h00m00s - |
William Lallemand | a57b7e3 | 2018-12-14 21:11:31 +0100 | [diff] [blame] | 4020 | |
William Lallemand | af140ab | 2022-02-02 14:44:19 +0100 | [diff] [blame] | 4021 | In this example, the master has been reloaded 5 times but one of the old |
| 4022 | worker is still running and survived 3 reloads. You could access the CLI of |
| 4023 | this worker to understand what's going on. |
William Lallemand | 142db37 | 2018-12-11 18:56:45 +0100 | [diff] [blame] | 4024 | |
William Lallemand | 5d1e131 | 2022-10-14 15:41:55 +0200 | [diff] [blame] | 4025 | show startup-logs |
| 4026 | HAProxy needs to be compiled with USE_SHM_OPEN=1 to be used correctly on the |
| 4027 | master CLI or all messages won't be visible. |
| 4028 | |
| 4029 | Like its counterpart on the stats socket, this command is able to show the |
| 4030 | startup messages of HAProxy. However it does not dump the startup messages |
| 4031 | of the current worker, but the startup messages of the latest startup or |
| 4032 | reload, which means it is able to dump the parsing messages of a failed |
| 4033 | reload. |
| 4034 | |
| 4035 | Those messages are also dumped with the "reload" command. |
| 4036 | |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4037 | 10. Tricks for easier configuration management |
| 4038 | ---------------------------------------------- |
| 4039 | |
| 4040 | It is very common that two HAProxy nodes constituting a cluster share exactly |
| 4041 | the same configuration modulo a few addresses. Instead of having to maintain a |
| 4042 | duplicate configuration for each node, which will inevitably diverge, it is |
| 4043 | possible to include environment variables in the configuration. Thus multiple |
| 4044 | configuration may share the exact same file with only a few different system |
| 4045 | wide environment variables. This started in version 1.5 where only addresses |
| 4046 | were allowed to include environment variables, and 1.6 goes further by |
| 4047 | supporting environment variables everywhere. The syntax is the same as in the |
| 4048 | UNIX shell, a variable starts with a dollar sign ('$'), followed by an opening |
| 4049 | curly brace ('{'), then the variable name followed by the closing brace ('}'). |
| 4050 | Except for addresses, environment variables are only interpreted in arguments |
| 4051 | surrounded with double quotes (this was necessary not to break existing setups |
| 4052 | using regular expressions involving the dollar symbol). |
| 4053 | |
| 4054 | Environment variables also make it convenient to write configurations which are |
| 4055 | expected to work on various sites where only the address changes. It can also |
| 4056 | permit to remove passwords from some configs. Example below where the the file |
| 4057 | "site1.env" file is sourced by the init script upon startup : |
| 4058 | |
| 4059 | $ cat site1.env |
| 4060 | LISTEN=192.168.1.1 |
| 4061 | CACHE_PFX=192.168.11 |
| 4062 | SERVER_PFX=192.168.22 |
| 4063 | LOGGER=192.168.33.1 |
| 4064 | STATSLP=admin:pa$$w0rd |
| 4065 | ABUSERS=/etc/haproxy/abuse.lst |
| 4066 | TIMEOUT=10s |
| 4067 | |
| 4068 | $ cat haproxy.cfg |
| 4069 | global |
| 4070 | log "${LOGGER}:514" local0 |
| 4071 | |
| 4072 | defaults |
| 4073 | mode http |
| 4074 | timeout client "${TIMEOUT}" |
| 4075 | timeout server "${TIMEOUT}" |
| 4076 | timeout connect 5s |
| 4077 | |
| 4078 | frontend public |
| 4079 | bind "${LISTEN}:80" |
| 4080 | http-request reject if { src -f "${ABUSERS}" } |
| 4081 | stats uri /stats |
| 4082 | stats auth "${STATSLP}" |
| 4083 | use_backend cache if { path_end .jpg .css .ico } |
| 4084 | default_backend server |
| 4085 | |
| 4086 | backend cache |
| 4087 | server cache1 "${CACHE_PFX}.1:18080" check |
| 4088 | server cache2 "${CACHE_PFX}.2:18080" check |
| 4089 | |
| 4090 | backend server |
| 4091 | server cache1 "${SERVER_PFX}.1:8080" check |
| 4092 | server cache2 "${SERVER_PFX}.2:8080" check |
| 4093 | |
| 4094 | |
| 4095 | 11. Well-known traps to avoid |
| 4096 | ----------------------------- |
| 4097 | |
| 4098 | Once in a while, someone reports that after a system reboot, the haproxy |
| 4099 | service wasn't started, and that once they start it by hand it works. Most |
| 4100 | often, these people are running a clustered IP address mechanism such as |
| 4101 | keepalived, to assign the service IP address to the master node only, and while |
| 4102 | it used to work when they used to bind haproxy to address 0.0.0.0, it stopped |
| 4103 | working after they bound it to the virtual IP address. What happens here is |
| 4104 | that when the service starts, the virtual IP address is not yet owned by the |
| 4105 | local node, so when HAProxy wants to bind to it, the system rejects this |
| 4106 | because it is not a local IP address. The fix doesn't consist in delaying the |
| 4107 | haproxy service startup (since it wouldn't stand a restart), but instead to |
| 4108 | properly configure the system to allow binding to non-local addresses. This is |
| 4109 | easily done on Linux by setting the net.ipv4.ip_nonlocal_bind sysctl to 1. This |
| 4110 | is also needed in order to transparently intercept the IP traffic that passes |
| 4111 | through HAProxy for a specific target address. |
| 4112 | |
| 4113 | Multi-process configurations involving source port ranges may apparently seem |
| 4114 | to work but they will cause some random failures under high loads because more |
| 4115 | than one process may try to use the same source port to connect to the same |
| 4116 | server, which is not possible. The system will report an error and a retry will |
| 4117 | happen, picking another port. A high value in the "retries" parameter may hide |
| 4118 | the effect to a certain extent but this also comes with increased CPU usage and |
| 4119 | processing time. Logs will also report a certain number of retries. For this |
| 4120 | reason, port ranges should be avoided in multi-process configurations. |
| 4121 | |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4122 | Since HAProxy uses SO_REUSEPORT and supports having multiple independent |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4123 | processes bound to the same IP:port, during troubleshooting it can happen that |
| 4124 | an old process was not stopped before a new one was started. This provides |
| 4125 | absurd test results which tend to indicate that any change to the configuration |
| 4126 | is ignored. The reason is that in fact even the new process is restarted with a |
| 4127 | new configuration, the old one also gets some incoming connections and |
| 4128 | processes them, returning unexpected results. When in doubt, just stop the new |
| 4129 | process and try again. If it still works, it very likely means that an old |
| 4130 | process remains alive and has to be stopped. Linux's "netstat -lntp" is of good |
| 4131 | help here. |
| 4132 | |
| 4133 | When adding entries to an ACL from the command line (eg: when blacklisting a |
| 4134 | source address), it is important to keep in mind that these entries are not |
| 4135 | synchronized to the file and that if someone reloads the configuration, these |
| 4136 | updates will be lost. While this is often the desired effect (for blacklisting) |
| 4137 | it may not necessarily match expectations when the change was made as a fix for |
| 4138 | a problem. See the "add acl" action of the CLI interface. |
| 4139 | |
| 4140 | |
| 4141 | 12. Debugging and performance issues |
| 4142 | ------------------------------------ |
| 4143 | |
| 4144 | When HAProxy is started with the "-d" option, it will stay in the foreground |
| 4145 | and will print one line per event, such as an incoming connection, the end of a |
| 4146 | connection, and for each request or response header line seen. This debug |
| 4147 | output is emitted before the contents are processed, so they don't consider the |
| 4148 | local modifications. The main use is to show the request and response without |
| 4149 | having to run a network sniffer. The output is less readable when multiple |
| 4150 | connections are handled in parallel, though the "debug2ansi" and "debug2html" |
| 4151 | scripts found in the examples/ directory definitely help here by coloring the |
| 4152 | output. |
| 4153 | |
| 4154 | If a request or response is rejected because HAProxy finds it is malformed, the |
| 4155 | best thing to do is to connect to the CLI and issue "show errors", which will |
| 4156 | report the last captured faulty request and response for each frontend and |
| 4157 | backend, with all the necessary information to indicate precisely the first |
| 4158 | character of the input stream that was rejected. This is sometimes needed to |
| 4159 | prove to customers or to developers that a bug is present in their code. In |
| 4160 | this case it is often possible to relax the checks (but still keep the |
| 4161 | captures) using "option accept-invalid-http-request" or its equivalent for |
| 4162 | responses coming from the server "option accept-invalid-http-response". Please |
| 4163 | see the configuration manual for more details. |
| 4164 | |
| 4165 | Example : |
| 4166 | |
| 4167 | > show errors |
| 4168 | Total events captured on [13/Oct/2015:13:43:47.169] : 1 |
| 4169 | |
| 4170 | [13/Oct/2015:13:43:40.918] frontend HAProxyLocalStats (#2): invalid request |
| 4171 | backend <NONE> (#-1), server <NONE> (#-1), event #0 |
| 4172 | src 127.0.0.1:51981, session #0, session flags 0x00000080 |
| 4173 | HTTP msg state 26, msg flags 0x00000000, tx flags 0x00000000 |
| 4174 | HTTP chunk len 0 bytes, HTTP body len 0 bytes |
| 4175 | buffer flags 0x00808002, out 0 bytes, total 31 bytes |
| 4176 | pending 31 bytes, wrapping at 8040, error at position 13: |
| 4177 | |
| 4178 | 00000 GET /invalid request HTTP/1.1\r\n |
| 4179 | |
| 4180 | |
| 4181 | The output of "show info" on the CLI provides a number of useful information |
| 4182 | regarding the maximum connection rate ever reached, maximum SSL key rate ever |
| 4183 | reached, and in general all information which can help to explain temporary |
| 4184 | issues regarding CPU or memory usage. Example : |
| 4185 | |
| 4186 | > show info |
| 4187 | Name: HAProxy |
| 4188 | Version: 1.6-dev7-e32d18-17 |
| 4189 | Release_date: 2015/10/12 |
| 4190 | Nbproc: 1 |
| 4191 | Process_num: 1 |
| 4192 | Pid: 7949 |
| 4193 | Uptime: 0d 0h02m39s |
| 4194 | Uptime_sec: 159 |
| 4195 | Memmax_MB: 0 |
| 4196 | Ulimit-n: 120032 |
| 4197 | Maxsock: 120032 |
| 4198 | Maxconn: 60000 |
| 4199 | Hard_maxconn: 60000 |
| 4200 | CurrConns: 0 |
| 4201 | CumConns: 3 |
| 4202 | CumReq: 3 |
| 4203 | MaxSslConns: 0 |
| 4204 | CurrSslConns: 0 |
| 4205 | CumSslConns: 0 |
| 4206 | Maxpipes: 0 |
| 4207 | PipesUsed: 0 |
| 4208 | PipesFree: 0 |
| 4209 | ConnRate: 0 |
| 4210 | ConnRateLimit: 0 |
| 4211 | MaxConnRate: 1 |
| 4212 | SessRate: 0 |
| 4213 | SessRateLimit: 0 |
| 4214 | MaxSessRate: 1 |
| 4215 | SslRate: 0 |
| 4216 | SslRateLimit: 0 |
| 4217 | MaxSslRate: 0 |
| 4218 | SslFrontendKeyRate: 0 |
| 4219 | SslFrontendMaxKeyRate: 0 |
| 4220 | SslFrontendSessionReuse_pct: 0 |
| 4221 | SslBackendKeyRate: 0 |
| 4222 | SslBackendMaxKeyRate: 0 |
| 4223 | SslCacheLookups: 0 |
| 4224 | SslCacheMisses: 0 |
| 4225 | CompressBpsIn: 0 |
| 4226 | CompressBpsOut: 0 |
| 4227 | CompressBpsRateLim: 0 |
| 4228 | ZlibMemUsage: 0 |
| 4229 | MaxZlibMemUsage: 0 |
| 4230 | Tasks: 5 |
| 4231 | Run_queue: 1 |
| 4232 | Idle_pct: 100 |
| 4233 | node: wtap |
| 4234 | description: |
| 4235 | |
| 4236 | When an issue seems to randomly appear on a new version of HAProxy (eg: every |
| 4237 | second request is aborted, occasional crash, etc), it is worth trying to enable |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4238 | memory poisoning so that each call to malloc() is immediately followed by the |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4239 | filling of the memory area with a configurable byte. By default this byte is |
| 4240 | 0x50 (ASCII for 'P'), but any other byte can be used, including zero (which |
| 4241 | will have the same effect as a calloc() and which may make issues disappear). |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4242 | Memory poisoning is enabled on the command line using the "-dM" option. It |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4243 | slightly hurts performance and is not recommended for use in production. If |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4244 | an issue happens all the time with it or never happens when poisoning uses |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4245 | byte zero, it clearly means you've found a bug and you definitely need to |
| 4246 | report it. Otherwise if there's no clear change, the problem it is not related. |
| 4247 | |
| 4248 | When debugging some latency issues, it is important to use both strace and |
| 4249 | tcpdump on the local machine, and another tcpdump on the remote system. The |
| 4250 | reason for this is that there are delays everywhere in the processing chain and |
| 4251 | it is important to know which one is causing latency to know where to act. In |
| 4252 | practice, the local tcpdump will indicate when the input data come in. Strace |
| 4253 | will indicate when haproxy receives these data (using recv/recvfrom). Warning, |
| 4254 | openssl uses read()/write() syscalls instead of recv()/send(). Strace will also |
| 4255 | show when haproxy sends the data, and tcpdump will show when the system sends |
| 4256 | these data to the interface. Then the external tcpdump will show when the data |
| 4257 | sent are really received (since the local one only shows when the packets are |
| 4258 | queued). The benefit of sniffing on the local system is that strace and tcpdump |
| 4259 | will use the same reference clock. Strace should be used with "-tts200" to get |
| 4260 | complete timestamps and report large enough chunks of data to read them. |
| 4261 | Tcpdump should be used with "-nvvttSs0" to report full packets, real sequence |
| 4262 | numbers and complete timestamps. |
| 4263 | |
| 4264 | In practice, received data are almost always immediately received by haproxy |
| 4265 | (unless the machine has a saturated CPU or these data are invalid and not |
| 4266 | delivered). If these data are received but not sent, it generally is because |
| 4267 | the output buffer is saturated (ie: recipient doesn't consume the data fast |
| 4268 | enough). This can be confirmed by seeing that the polling doesn't notify of |
| 4269 | the ability to write on the output file descriptor for some time (it's often |
| 4270 | easier to spot in the strace output when the data finally leave and then roll |
| 4271 | back to see when the write event was notified). It generally matches an ACK |
| 4272 | received from the recipient, and detected by tcpdump. Once the data are sent, |
| 4273 | they may spend some time in the system doing nothing. Here again, the TCP |
| 4274 | congestion window may be limited and not allow these data to leave, waiting for |
| 4275 | an ACK to open the window. If the traffic is idle and the data take 40 ms or |
| 4276 | 200 ms to leave, it's a different issue (which is not an issue), it's the fact |
| 4277 | that the Nagle algorithm prevents empty packets from leaving immediately, in |
| 4278 | hope that they will be merged with subsequent data. HAProxy automatically |
| 4279 | disables Nagle in pure TCP mode and in tunnels. However it definitely remains |
| 4280 | enabled when forwarding an HTTP body (and this contributes to the performance |
| 4281 | improvement there by reducing the number of packets). Some HTTP non-compliant |
| 4282 | applications may be sensitive to the latency when delivering incomplete HTTP |
| 4283 | response messages. In this case you will have to enable "option http-no-delay" |
| 4284 | to disable Nagle in order to work around their design, keeping in mind that any |
| 4285 | other proxy in the chain may similarly be impacted. If tcpdump reports that data |
| 4286 | leave immediately but the other end doesn't see them quickly, it can mean there |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4287 | is a congested WAN link, a congested LAN with flow control enabled and |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4288 | preventing the data from leaving, or more commonly that HAProxy is in fact |
| 4289 | running in a virtual machine and that for whatever reason the hypervisor has |
| 4290 | decided that the data didn't need to be sent immediately. In virtualized |
| 4291 | environments, latency issues are almost always caused by the virtualization |
| 4292 | layer, so in order to save time, it's worth first comparing tcpdump in the VM |
| 4293 | and on the external components. Any difference has to be credited to the |
| 4294 | hypervisor and its accompanying drivers. |
| 4295 | |
| 4296 | When some TCP SACK segments are seen in tcpdump traces (using -vv), it always |
| 4297 | means that the side sending them has got the proof of a lost packet. While not |
| 4298 | seeing them doesn't mean there are no losses, seeing them definitely means the |
| 4299 | network is lossy. Losses are normal on a network, but at a rate where SACKs are |
| 4300 | not noticeable at the naked eye. If they appear a lot in the traces, it is |
| 4301 | worth investigating exactly what happens and where the packets are lost. HTTP |
| 4302 | doesn't cope well with TCP losses, which introduce huge latencies. |
| 4303 | |
| 4304 | The "netstat -i" command will report statistics per interface. An interface |
| 4305 | where the Rx-Ovr counter grows indicates that the system doesn't have enough |
| 4306 | resources to receive all incoming packets and that they're lost before being |
| 4307 | processed by the network driver. Rx-Drp indicates that some received packets |
| 4308 | were lost in the network stack because the application doesn't process them |
| 4309 | fast enough. This can happen during some attacks as well. Tx-Drp means that |
| 4310 | the output queues were full and packets had to be dropped. When using TCP it |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4311 | should be very rare, but will possibly indicate a saturated outgoing link. |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4312 | |
| 4313 | |
| 4314 | 13. Security considerations |
| 4315 | --------------------------- |
| 4316 | |
| 4317 | HAProxy is designed to run with very limited privileges. The standard way to |
| 4318 | use it is to isolate it into a chroot jail and to drop its privileges to a |
| 4319 | non-root user without any permissions inside this jail so that if any future |
| 4320 | vulnerability were to be discovered, its compromise would not affect the rest |
| 4321 | of the system. |
| 4322 | |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4323 | In order to perform a chroot, it first needs to be started as a root user. It is |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4324 | pointless to build hand-made chroots to start the process there, these ones are |
| 4325 | painful to build, are never properly maintained and always contain way more |
| 4326 | bugs than the main file-system. And in case of compromise, the intruder can use |
| 4327 | the purposely built file-system. Unfortunately many administrators confuse |
| 4328 | "start as root" and "run as root", resulting in the uid change to be done prior |
| 4329 | to starting haproxy, and reducing the effective security restrictions. |
| 4330 | |
| 4331 | HAProxy will need to be started as root in order to : |
| 4332 | - adjust the file descriptor limits |
| 4333 | - bind to privileged port numbers |
| 4334 | - bind to a specific network interface |
| 4335 | - transparently listen to a foreign address |
| 4336 | - isolate itself inside the chroot jail |
| 4337 | - drop to another non-privileged UID |
| 4338 | |
| 4339 | HAProxy may require to be run as root in order to : |
| 4340 | - bind to an interface for outgoing connections |
| 4341 | - bind to privileged source ports for outgoing connections |
Dan Lloyd | 8e48b87 | 2016-07-01 21:01:18 -0400 | [diff] [blame] | 4342 | - transparently bind to a foreign address for outgoing connections |
Willy Tarreau | 2212e6a | 2015-10-13 14:40:55 +0200 | [diff] [blame] | 4343 | |
| 4344 | Most users will never need the "run as root" case. But the "start as root" |
| 4345 | covers most usages. |
| 4346 | |
| 4347 | A safe configuration will have : |
| 4348 | |
| 4349 | - a chroot statement pointing to an empty location without any access |
| 4350 | permissions. This can be prepared this way on the UNIX command line : |
| 4351 | |
| 4352 | # mkdir /var/empty && chmod 0 /var/empty || echo "Failed" |
| 4353 | |
| 4354 | and referenced like this in the HAProxy configuration's global section : |
| 4355 | |
| 4356 | chroot /var/empty |
| 4357 | |
| 4358 | - both a uid/user and gid/group statements in the global section : |
| 4359 | |
| 4360 | user haproxy |
| 4361 | group haproxy |
| 4362 | |
| 4363 | - a stats socket whose mode, uid and gid are set to match the user and/or |
| 4364 | group allowed to access the CLI so that nobody may access it : |
| 4365 | |
| 4366 | stats socket /var/run/haproxy.stat uid hatop gid hatop mode 600 |
| 4367 | |