5bd37fa625e95130b4f576983943ef9a4b04d73c - haproxy

commit	5bd37fa625e95130b4f576983943ef9a4b04d73c	[log] [tgz]
author	Willy Tarreau <w@1wt.eu>	Wed Apr 04 20:17:03 2018 +0200
committer	Willy Tarreau <w@1wt.eu>	Wed Apr 04 20:17:03 2018 +0200
tree	e82cc286e9ecad7ce5a57a9fbcdcad0571ef57c4
parent	afe1de5d98f2f4eb3d1fa495f090a1f99551d61a [diff]

BUG/MAJOR: cache: fix random crashes caused by incorrect delete() on non-first blocks Several segfaults were reported in the cache, each time in eb_delete() called from cache_free_blocks() itself called from shctx_row_reserve_hot(). Each time the tree node was corrupted with random cached data (often JS or HTML contents). The problem comes from an incompatibility between the cache's expectations and the recycling algorithm used in the shctx. The shctx allocates and releases a chain of blocks at once. And when it needs to allocate N blocks from the avail list while a chain of M>N is found, it picks the first N from the list, moves them to the hot list, and marks all remaining M-N blocks as isolated blocks (chains of 1). For each such released block, the shctx->free_block() callback is used and passed a pointer to the first and current block of the chain. For the cache, it's cache_free_blocks(). What this function does is check that the current block is the first one, and in this case delete the object from the tree and mark it as not in tree by setting key to zero. The problem this causes is that the tail blocks when M>N become first blocks for the next call to shctx_row_reserve_hot(), these ones will be passed to cache_free_blocks() as list heads, and will be sent to eb_delete() despite containing only cached data. The simplest solution for now is to mark each block as holding no cache object by setting key to zero all the time. It keeps the principle used elsewhere in the code. The SSL code is not subject to this problem because it relies on the block's len not being null, which happens immediately after a block was released. It was uncertain however whether this method is suitable for the cache. It is not critical though since this code is going to change soon in 1.9 to dynamically allocate only the number of required blocks. This fix must be backported to 1.8. Thanks to Thierry for providing exploitable cores.