scylladb/db/commitlog/commitlog.hh at master

Files

Calle Wilund a7cdb602e1 db::commitlog: Fix sanity check error on race between segment flushing and oversized alloc

Fixes #27992

When doing a commit log oversized allocation, we lock out all other writers by grabbing
the _request_controller semaphore fully (max capacity).
We thereafter assert that the semaphore is in fact zero. However, due to how things work
with the bookkeep here, the semaphore can in fact become negative (some paths will not
actually wait for the semaphore, because this could deadlock).

Thus, if, after we grab the semaphore and execution actually returns to us (task schedule),
new_buffer via segment::allocate is called (due to a non-fully-full segment), we might
in fact grab the segment overhead from zero, resulting in a negative semaphore.

The same problem applies later when we try to sanity check the return of our permits.

Fix is trivial, just accept less-than-zero values, and take same possible ltz-value
into account in exit check (returning units)

Added whitebox (special callback interface for sync) unit test that provokes/creates
the race condition explicitly (and reliably).

Closes scylladb/scylladb#27998

2026-01-09 14:06:58 +02:00

16 KiB

Raw Permalink Blame History

View Raw

16 KiB Raw Permalink Blame History

16 KiB

Raw Permalink Blame History