30a5895daa74b3e2102f7f86c5101e7c25eb70e5 - haproxy

commit	30a5895daa74b3e2102f7f86c5101e7c25eb70e5	[log] [tgz]
author	Willy Tarreau <w@1wt.eu>	Wed Apr 24 11:37:06 2024 +0200
committer	Amaury Denoyelle <adenoyelle@haproxy.com>	Thu Jun 06 17:28:13 2024 +0200
tree	6a7c8b759dc841334f757514ff5708948a7d6112
parent	1c8372b1714961f11f943fe2ef6d3e26cf2774ce [diff]

BUG/MINOR: h1: fix detection of upper bytes in the URI

In 1.7 with commit 5f10ea30f4 ("OPTIM: http: improve parsing performance
of long URIs") we improved the URI parser's performance on platforms
supporting unaligned accesses by reading 4 chars at a time in a 32-bit
word. However, as reported in GH issue #2545, there's a bug in the way
the top bytes are checked, as the parser will stop when all 4 of them
are above 7e instead of when one of them is, so certain patterns can be
accepted through if the last ones are all valid. The fix requires to
negate the value but on the other hand it allows to parallelize some of
the tests and fuse the masks, which could even end up slightly faster.

This needs to be backported to all stable versions, but be careful, this
code moved a lot over time, from proto_http.c to h1.c, to http_msg.c, to
h1.c again. Better just grep for "24242424" or "21212121" in each version
to find it.

Big kudos to Martijn van Oosterhout (@kleptog) for spotting this problem
while analyzing that piece of code, and reporting it.

(cherry picked from commit b9bf16b3827c58e61bf4cf27b30e14484fd2b6e5)
Signed-off-by: Amaury Denoyelle <adenoyelle@haproxy.com>
(cherry picked from commit 59f36c5e95591a9f256a39bac910f6dc9716ccc9)
Signed-off-by: Amaury Denoyelle <adenoyelle@haproxy.com>

src/h1.c[diff]

1 file changed

tree: 6a7c8b759dc841334f757514ff5708948a7d6112