Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 1 | PROMEX: A Prometheus exporter for HAProxy |
| 2 | ------------------------------------------- |
| 3 | |
| 4 | Prometheus is a monitoring and alerting system. More and more people use it to |
| 5 | monitor their environment (this is written February 2019). It collects metrics |
| 6 | from monitored targets by scraping metrics HTTP endpoints on these targets. For |
Ilya Shipitsin | 1fae8db | 2020-03-14 17:47:28 +0500 | [diff] [blame] | 7 | HAProxy, The Prometheus team officially supports an exporter written in Go |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 8 | (https://github.com/prometheus/haproxy_exporter). But it requires an extra |
| 9 | software to deploy and monitor. PROMEX, on its side, is a built-in Prometheus |
| 10 | exporter for HAProxy. It was developed as a service and is directly available in |
| 11 | HAProxy, like the stats applet. |
| 12 | |
| 13 | However, PROMEX is not built by default with HAProxy. It is provided as an extra |
Ilya Shipitsin | 1fae8db | 2020-03-14 17:47:28 +0500 | [diff] [blame] | 14 | component for everyone want to use it. So you need to explicitly build HAProxy |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 15 | with the PROMEX service, using the Makefile variable "EXTRA_OBJS". For instance: |
| 16 | |
Willy Tarreau | d254aa8 | 2019-06-14 18:40:48 +0200 | [diff] [blame] | 17 | > make TARGET=linux-glibc EXTRA_OBJS="contrib/prometheus-exporter/service-prometheus.o" |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 18 | |
| 19 | if HAProxy provides the PROMEX service, the following build option will be |
| 20 | reported by the command "haproxy -vv": |
| 21 | |
| 22 | Built with the Prometheus exporter as a service |
| 23 | |
| 24 | To be used, it must be enabled in the configuration with an "http-request" rule |
| 25 | and the corresponding HTTP proxy must enable the HTX support. For instance: |
| 26 | |
| 27 | frontend test |
| 28 | mode http |
| 29 | ... |
| 30 | option http-use-htx |
| 31 | http-request use-service prometheus-exporter if { path /metrics } |
| 32 | ... |
| 33 | |
| 34 | |
| 35 | This service has been developed as a third-party component because it could |
| 36 | become obsolete, depending on how much time Prometheus will remain heavily |
| 37 | used. This is said with no ulterior motive of course. Prometheus is a great |
| 38 | software and I hope all the well for it. But we involve in a environment moving |
| 39 | quickly and a solution may be obvious today could be deprecated the next |
| 40 | year. And because PROMEX is not integrated by default into the HAProxy codebase, |
| 41 | it will need some interest to be actively supported. All contribution of any |
| 42 | kind are welcome. |
| 43 | |
| 44 | You must also be careful if you use with huge configurations. Unlike the stats |
| 45 | applet, all metrics are not grouped by service (proxy, listener or server). With |
| 46 | PROMEX, all lines for a given metric are provided as one single group. So |
| 47 | instead of collecting all metrics for a proxy before moving to the next one, we |
| 48 | must loop on all proxies for each metric. Same for the servers. Thus, it will |
Ilya Shipitsin | 1fae8db | 2020-03-14 17:47:28 +0500 | [diff] [blame] | 49 | spend much more resources to produce the Prometheus metrics than the CSV export |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 50 | through the stats page. To give a comparison order, quick benchmarks shown that |
| 51 | a PROMEX dump is 5x slower and 20x more verbose than a CSV export. |
| 52 | |
Christopher Faulet | 78407ce | 2019-11-18 14:47:08 +0100 | [diff] [blame] | 53 | |
| 54 | metrics filtering |
| 55 | ------------------- |
| 56 | |
| 57 | It is possible to dynamically select the metrics to export if you don't use all |
| 58 | of them passing parameters in the query-string. |
| 59 | |
| 60 | * Filtering on scopes |
| 61 | |
| 62 | The metrics may be filtered by scopes. Multiple parameters with "scope" as name |
| 63 | may be passed in the query-string to filter exported metrics, with one of those |
| 64 | values: global, frontend, backend, server or '*' (means all). A scope parameter |
| 65 | with no value means to filter out all scopes (nothing is returned). The scope |
| 66 | parameters are parsed in their appearance order in the query-string. So an empty |
| 67 | scope will reset all scopes already parsed. But it can be overridden by |
| 68 | following scope parameters in the query-string. By default everything is |
| 69 | exported. Here are examples: |
| 70 | |
| 71 | /metrics?scope=server # ==> server metrics will be exported |
| 72 | /metrics?scope=frontend&scope=backend # ==> Frontend and backend metrics will be exported |
| 73 | /metrics?scope=*&scope= # ==> no metrics will be exported |
| 74 | /metrics?scope=&scope=global # ==> global metrics will be exported |
| 75 | |
William Dauchy | de3c326 | 2021-02-01 13:11:51 +0100 | [diff] [blame] | 76 | * How do I prevent my prometheus instance to explode? |
| 77 | |
| 78 | ** Filtering on servers state |
Christopher Faulet | eba2294 | 2019-11-19 14:18:24 +0100 | [diff] [blame] | 79 | |
| 80 | It is possible to exclude from returned metrics all servers in maintenance mode |
| 81 | passing the parameter "no-maint" in the query-string. This parameter may help to |
| 82 | solve performance issues of configuration that use the server templates to |
| 83 | manage dynamic provisionning. Note there is no consistency check on the servers |
| 84 | state. So, if the state of a server changes while the exporter is running, only |
| 85 | a part of the metrics for this server will be dumped. |
Christopher Faulet | 78407ce | 2019-11-18 14:47:08 +0100 | [diff] [blame] | 86 | |
William Dauchy | de3c326 | 2021-02-01 13:11:51 +0100 | [diff] [blame] | 87 | prometheus example config: |
| 88 | |
| 89 | For server-template users: |
| 90 | - <job> |
| 91 | params: |
| 92 | no-maint: |
| 93 | - empty |
| 94 | |
| 95 | ** Scrap server health checks only |
| 96 | |
| 97 | All health checks status are dump through `state` label values. If you want to |
| 98 | scrap server health check status but prevent all server metrics to be saved, |
| 99 | except the server_check_status, you may configure prometheus that way: |
| 100 | |
| 101 | - <job> |
| 102 | metric_relabel_configs: |
| 103 | - source_labels: ['__name__'] |
| 104 | regex: 'haproxy_(process_|frontend_|backend_|server_check_status).*' |
| 105 | action: keep |
| 106 | |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 107 | Exported metrics |
| 108 | ------------------ |
| 109 | |
William Dauchy | 4b7bf7e | 2021-02-01 13:12:00 +0100 | [diff] [blame^] | 110 | See prometheus export for the description of each field. |
| 111 | |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 112 | * Globals metrics |
| 113 | |
William Dauchy | 4b7bf7e | 2021-02-01 13:12:00 +0100 | [diff] [blame^] | 114 | +------------------------------------------------+ |
| 115 | | Metric name | |
| 116 | +------------------------------------------------+ |
| 117 | | haproxy_process_nbthread | |
| 118 | | haproxy_process_nbproc | |
| 119 | | haproxy_process_relative_process_id | |
| 120 | | haproxy_process_start_time_seconds | |
| 121 | | haproxy_process_max_memory_bytes | |
| 122 | | haproxy_process_pool_allocated_bytes | |
| 123 | | haproxy_process_pool_used_bytes | |
| 124 | | haproxy_process_pool_failures_total | |
| 125 | | haproxy_process_max_fds | |
| 126 | | haproxy_process_max_sockets | |
| 127 | | haproxy_process_max_connections | |
| 128 | | haproxy_process_hard_max_connections | |
| 129 | | haproxy_process_current_connections | |
| 130 | | haproxy_process_connections_total | |
| 131 | | haproxy_process_requests_total | |
| 132 | | haproxy_process_max_ssl_connections | |
| 133 | | haproxy_process_current_ssl_connections | |
| 134 | | haproxy_process_ssl_connections_total | |
| 135 | | haproxy_process_max_pipes | |
| 136 | | haproxy_process_pipes_used_total | |
| 137 | | haproxy_process_pipes_free_total | |
| 138 | | haproxy_process_current_connection_rate | |
| 139 | | haproxy_process_limit_connection_rate | |
| 140 | | haproxy_process_max_connection_rate | |
| 141 | | haproxy_process_current_session_rate | |
| 142 | | haproxy_process_limit_session_rate | |
| 143 | | haproxy_process_max_session_rate | |
| 144 | | haproxy_process_current_ssl_rate | |
| 145 | | haproxy_process_limit_ssl_rate | |
| 146 | | haproxy_process_max_ssl_rate | |
| 147 | | haproxy_process_current_frontend_ssl_key_rate | |
| 148 | | haproxy_process_max_frontend_ssl_key_rate | |
| 149 | | haproxy_process_frontend_ssl_reuse | |
| 150 | | haproxy_process_current_backend_ssl_key_rate | |
| 151 | | haproxy_process_max_backend_ssl_key_rate | |
| 152 | | haproxy_process_ssl_cache_lookups_total | |
| 153 | | haproxy_process_ssl_cache_misses_total | |
| 154 | | haproxy_process_http_comp_bytes_in_total | |
| 155 | | haproxy_process_http_comp_bytes_out_total | |
| 156 | | haproxy_process_limit_http_comp | |
| 157 | | haproxy_process_current_zlib_memory | |
| 158 | | haproxy_process_max_zlib_memory | |
| 159 | | haproxy_process_current_tasks | |
| 160 | | haproxy_process_current_run_queue | |
| 161 | | haproxy_process_idle_time_percent | |
| 162 | | haproxy_process_stopping | |
| 163 | | haproxy_process_jobs | |
| 164 | | haproxy_process_unstoppable_jobs | |
| 165 | | haproxy_process_listeners | |
| 166 | | haproxy_process_active_peers | |
| 167 | | haproxy_process_connected_peers | |
| 168 | | haproxy_process_dropped_logs_total | |
| 169 | | haproxy_process_busy_polling_enabled | |
| 170 | | haproxy_process_failed_resolutions | |
| 171 | | haproxy_process_bytes_out_total | |
| 172 | | haproxy_process_spliced_bytes_out_total | |
| 173 | | haproxy_process_bytes_out_rate | |
| 174 | | haproxy_process_recv_logs_total | |
| 175 | +------------------------------------------------+ |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 176 | |
| 177 | * Frontend metrics |
| 178 | |
William Dauchy | 4b7bf7e | 2021-02-01 13:12:00 +0100 | [diff] [blame^] | 179 | +-------------------------------------------------+ |
| 180 | | Metric name | |
| 181 | +-------------------------------------------------+ |
| 182 | | haproxy_frontend_status | |
| 183 | | haproxy_frontend_current_sessions | |
| 184 | | haproxy_frontend_max_sessions | |
| 185 | | haproxy_frontend_limit_sessions | |
| 186 | | haproxy_frontend_sessions_total | |
| 187 | | haproxy_frontend_limit_session_rate | |
| 188 | | haproxy_frontend_max_session_rate | |
| 189 | | haproxy_frontend_connections_rate_current | |
| 190 | | haproxy_frontend_connections_rate_max | |
| 191 | | haproxy_frontend_connections_total | |
| 192 | | haproxy_frontend_bytes_in_total | |
| 193 | | haproxy_frontend_bytes_out_total | |
| 194 | | haproxy_frontend_requests_denied_total | |
| 195 | | haproxy_frontend_responses_denied_total | |
| 196 | | haproxy_frontend_request_errors_total | |
| 197 | | haproxy_frontend_denied_connections_total | |
| 198 | | haproxy_frontend_denied_sessions_total | |
| 199 | | haproxy_frontend_failed_header_rewriting_total | |
| 200 | | haproxy_frontend_http_requests_rate_max | |
| 201 | | haproxy_frontend_http_requests_total | |
| 202 | | haproxy_frontend_http_responses_total | |
| 203 | | haproxy_frontend_intercepted_requests_total | |
| 204 | | haproxy_frontend_http_cache_lookups_total | |
| 205 | | haproxy_frontend_http_cache_hits_total | |
| 206 | | haproxy_frontend_http_comp_bytes_in_total | |
| 207 | | haproxy_frontend_http_comp_bytes_out_total | |
| 208 | | haproxy_frontend_http_comp_bytes_bypassed_total | |
| 209 | | haproxy_frontend_http_comp_responses_total | |
| 210 | | haproxy_frontend_internal_errors_total | |
| 211 | +-------------------------------------------------+ |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 212 | |
Rick Rackow | 35efbe2 | 2019-10-08 11:28:06 +0200 | [diff] [blame] | 213 | * Backend metrics |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 214 | |
William Dauchy | 4b7bf7e | 2021-02-01 13:12:00 +0100 | [diff] [blame^] | 215 | +-----------------------------------------------------+ |
| 216 | | Metric name | |
| 217 | +-----------------------------------------------------+ |
| 218 | | haproxy_backend_status | |
| 219 | | haproxy_backend_current_sessions | |
| 220 | | haproxy_backend_max_sessions | |
| 221 | | haproxy_backend_limit_sessions | |
| 222 | | haproxy_backend_sessions_total | |
| 223 | | haproxy_backend_max_session_rate | |
| 224 | | haproxy_backend_last_session_seconds | |
| 225 | | haproxy_backend_current_queue | |
| 226 | | haproxy_backend_max_queue | |
| 227 | | haproxy_backend_connection_attempts_total | |
| 228 | | haproxy_backend_connection_reuses_total | |
| 229 | | haproxy_backend_bytes_in_total | |
| 230 | | haproxy_backend_bytes_out_total | |
| 231 | | haproxy_backend_queue_time_average_seconds | |
| 232 | | haproxy_backend_connect_time_average_seconds | |
| 233 | | haproxy_backend_response_time_average_seconds | |
| 234 | | haproxy_backend_total_time_average_seconds | |
| 235 | | haproxy_backend_max_queue_time_seconds | |
| 236 | | haproxy_backend_max_connect_time_seconds | |
| 237 | | haproxy_backend_max_response_time_seconds | |
| 238 | | haproxy_backend_max_total_time_seconds | |
| 239 | | haproxy_backend_requests_denied_total | |
| 240 | | haproxy_backend_responses_denied_total | |
| 241 | | haproxy_backend_connection_errors_total | |
| 242 | | haproxy_backend_response_errors_total | |
| 243 | | haproxy_backend_retry_warnings_total | |
| 244 | | haproxy_backend_redispatch_warnings_total | |
| 245 | | haproxy_backend_failed_header_rewriting_total | |
| 246 | | haproxy_backend_client_aborts_total | |
| 247 | | haproxy_backend_server_aborts_total | |
| 248 | | haproxy_backend_weight | |
| 249 | | haproxy_backend_uweight | |
| 250 | | haproxy_backend_active_servers | |
| 251 | | haproxy_backend_backup_servers | |
| 252 | | haproxy_backend_check_up_down_total | |
| 253 | | haproxy_backend_check_last_change_seconds | |
| 254 | | haproxy_backend_downtime_seconds_total | |
| 255 | | haproxy_backend_loadbalanced_total | |
| 256 | | haproxy_backend_http_requests_total | |
| 257 | | haproxy_backend_http_responses_total | |
| 258 | | haproxy_backend_http_cache_lookups_total | |
| 259 | | haproxy_backend_http_cache_hits_total | |
| 260 | | haproxy_backend_http_comp_bytes_in_total | |
| 261 | | haproxy_backend_http_comp_bytes_out_total | |
| 262 | | haproxy_backend_http_comp_bytes_bypassed_total | |
| 263 | | haproxy_backend_http_comp_responses_total | |
| 264 | | haproxy_backend_internal_errors_total | |
| 265 | +-----------------------------------------------------+ |
Christopher Faulet | f959d08 | 2019-02-07 15:38:42 +0100 | [diff] [blame] | 266 | |
| 267 | * Server metrics |
| 268 | |
William Dauchy | 4b7bf7e | 2021-02-01 13:12:00 +0100 | [diff] [blame^] | 269 | +----------------------------------------------------+ |
| 270 | | Metric name | |
| 271 | +----------------------------------------------------+ |
| 272 | | haproxy_server_status | |
| 273 | | haproxy_server_current_sessions | |
| 274 | | haproxy_server_max_sessions | |
| 275 | | haproxy_server_limit_sessions | |
| 276 | | haproxy_server_sessions_total | |
| 277 | | haproxy_server_max_session_rate | |
| 278 | | haproxy_server_last_session_seconds | |
| 279 | | haproxy_server_current_queue | |
| 280 | | haproxy_server_max_queue | |
| 281 | | haproxy_server_queue_limit | |
| 282 | | haproxy_server_bytes_in_total | |
| 283 | | haproxy_server_bytes_out_total | |
| 284 | | haproxy_server_queue_time_average_seconds | |
| 285 | | haproxy_server_connect_time_average_seconds | |
| 286 | | haproxy_server_response_time_average_seconds | |
| 287 | | haproxy_server_total_time_average_seconds | |
| 288 | | haproxy_server_max_queue_time_seconds | |
| 289 | | haproxy_server_max_connect_time_seconds | |
| 290 | | haproxy_server_max_response_time_seconds | |
| 291 | | haproxy_server_max_total_time_seconds | |
| 292 | | haproxy_server_connection_attempts_total | |
| 293 | | haproxy_server_connection_reuses_total | |
| 294 | | haproxy_server_responses_denied_total | |
| 295 | | haproxy_server_connection_errors_total | |
| 296 | | haproxy_server_response_errors_total | |
| 297 | | haproxy_server_retry_warnings_total | |
| 298 | | haproxy_server_redispatch_warnings_total | |
| 299 | | haproxy_server_failed_header_rewriting_total | |
| 300 | | haproxy_server_client_aborts_total | |
| 301 | | haproxy_server_server_aborts_total | |
| 302 | | haproxy_server_weight | |
| 303 | | haproxy_server_uweight | |
| 304 | | haproxy_server_check_status | |
| 305 | | haproxy_server_check_code | |
| 306 | | haproxy_server_check_duration_seconds | |
| 307 | | haproxy_server_check_failures_total | |
| 308 | | haproxy_server_check_up_down_total | |
| 309 | | haproxy_server_downtime_seconds_total | |
| 310 | | haproxy_server_check_last_change_seconds | |
| 311 | | haproxy_server_current_throttle | |
| 312 | | haproxy_server_loadbalanced_total | |
| 313 | | haproxy_server_http_responses_total | |
| 314 | | haproxy_server_idle_connections_current | |
| 315 | | haproxy_server_idle_connections_limit | |
| 316 | | haproxy_server_internal_errors_total | |
| 317 | | haproxy_server_unsafe_idle_connections_current | |
| 318 | | haproxy_server_safe_idle_connections_current | |
| 319 | | haproxy_server_used_connections_current | |
| 320 | | haproxy_server_need_connections_current | |
| 321 | +----------------------------------------------------+ |