mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-26 01:30:47 +00:00

Tomasz Grabiec 1256a9faa7 tablets: Fix deadlock in background storage group merge fiber

When it deadlocks, groups stop merging and the compaction group merge
backlog runs away.

Also, graceful shutdown will be blocked on it.

Found by the flaky unit test
test_merge_chooses_best_replica_with_odd_count, which timed out in 1
in 100 runs.

Reason for deadlock:

When storage groups are merged, the main compaction group of the new
storage group takes a compaction lock, which is appended to
_compaction_reenablers_for_merging, and released when the merge
completion fiber is done with the whole batch.

If more than one merge cycle accumulates for the fiber, a deadlock
occurs. The lock order is as follows:

Initial state:

cg0: main
cg1: main
cg2: main
cg3: main

After 1st merge:

cg0': main [locked], merging_groups=[cg0.main, cg1.main]
cg1': main [locked], merging_groups=[cg2.main, cg3.main]

After 2nd merge:

cg0'': main [locked], merging_groups=[cg0'.main [locked], cg0.main, cg1.main, cg1'.main [locked], cg2.main, cg3.main]

The merge completion fiber will try to stop cg0'.main, which blocks
on the compaction lock. That lock is held by the reenabler in
_compaction_reenablers_for_merging, hence the deadlock.
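
The dependency chain above can be restated as a small wait-for graph. This is a toy Python model for illustration only, not Scylla code: the helper find_cycle and the node labels are hypothetical, and each edge only restates a sentence from the description above.

```python
# Toy model of the wait-for relationships described above; not Scylla code.
# find_cycle and the node labels are hypothetical names for illustration.

def find_cycle(wait_for, start):
    """Follow wait-for edges from start; return the cycle as a list, or None."""
    path = []
    node = start
    while node is not None and node not in path:
        path.append(node)
        node = wait_for.get(node)
    if node is None:
        return None  # the chain ended, so nothing waits on itself
    return path[path.index(node):] + [node]

# Each entry reads: "key cannot make progress until value does".
two_merge_cycles = {
    # The fiber stops every group in the batch, including cg0'.main.
    "merge completion fiber": "stop cg0'.main",
    # Stopping cg0'.main blocks on its compaction lock.
    "stop cg0'.main": "compaction lock of cg0'.main",
    # The lock is held by a reenabler in _compaction_reenablers_for_merging.
    "compaction lock of cg0'.main": "reenabler for cg0'.main",
    # Reenablers are released only when the fiber is done with the batch.
    "reenabler for cg0'.main": "merge completion fiber",
}

# With a single merge cycle the batch holds only unlocked groups.
one_merge_cycle = {
    "merge completion fiber": "stop cg0.main",
}

cycle = find_cycle(two_merge_cycles, "merge completion fiber")
no_cycle = find_cycle(one_merge_cycle, "merge completion fiber")
print(" -> ".join(cycle))
print(no_cycle)
```

In this model the two-cycle case loops back to the fiber itself, while the single-cycle case terminates, which matches the claim that the deadlock needs more than one accumulated merge cycle.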

The fix is to wait for the background merge to finish before we start
the next merge. This is achieved by holding the old erm in the
background merge and doing a topology barrier from the merge
finalizing transition.

Background merge is supposed to be a relatively quick operation: it
stops compaction groups, so it may wait for active requests, but it
shouldn't prolong the barrier indefinitely.
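
The ordering the fix enforces can be sketched with asyncio. This is a simplified model under stated assumptions, not the actual implementation: the Erm class, its hold/release/barrier methods, and background_merge are hypothetical stand-ins, and the sleep merely represents stopping compaction groups.

```python
import asyncio

class Erm:
    """Hypothetical stand-in for the old erm held by the background merge."""

    def __init__(self):
        self._holders = 0
        self._released = asyncio.Event()
        self._released.set()  # nobody holds it yet

    def hold(self):
        self._holders += 1
        self._released.clear()

    def release(self):
        self._holders -= 1
        if self._holders == 0:
            self._released.set()

    async def barrier(self):
        # Models the topology barrier: wait until every holder is gone.
        await self._released.wait()

async def background_merge(erm, log, name):
    try:
        await asyncio.sleep(0.01)  # stands in for stopping compaction groups
        log.append(f"{name} done")
    finally:
        erm.release()

async def main():
    log = []
    for n in (1, 2):
        erm = Erm()
        erm.hold()  # the background merge holds the old erm
        task = asyncio.create_task(background_merge(erm, log, f"merge {n}"))
        await erm.barrier()  # merge finalization waits here
        log.append(f"barrier {n} passed")
        await task
    return log

log = asyncio.run(main())
print(log)
```

Because the barrier cannot pass while the background merge still holds the erm, the second merge in this model never starts before the first has finished, so merge cycles cannot accumulate.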

Tablet boost unit tests which trigger merge need to be adjusted to
call the barrier, otherwise they will be vulnerable to the deadlock.

Two cluster tests were removed because they assumed that merge happens
in the background. Now that it happens as part of merge finalization
and blocks the topology state machine, those tests deadlock because
they are unable to make topology changes (node bootstrap) while
background merge is blocked.

The test "test_tablets_merge_waits_for_lwt" needed to be adjusted. It
assumed that merge finalization doesn't wait for the erm held by the
LWT operation, and triggered tablet movement afterwards, and assumed
that this migration will issue a barrier which will block on the LWT
operation. After this commit, it's the barrier in merge finalization
which is blocked. The test was adjusted to use an earlier log mark
when waiting for "Got raft_topology_cmd::barrier_and_drain", which
will catch the barrier in merge finalization.

Fixes SCYLLADB-928

2026-03-12 22:45:01 +01:00

alternator

test/alternator,cqlpy: avoid xfail_strict against DynamoDB/Cassandra

2026-03-11 09:29:30 +02:00

boost

tablets: Fix deadlock in background storage group merge fiber

2026-03-12 22:45:01 +01:00

broadcast_tables

…

cluster

tablets: Fix deadlock in background storage group merge fiber

2026-03-12 22:45:01 +01:00

cql

…

cqlpy

test/alternator,cqlpy: avoid xfail_strict against DynamoDB/Cassandra

2026-03-11 09:29:30 +02:00

ldap

auth: ldap: add permissions reload to unified cache

2026-02-17 17:56:27 +01:00

lib

Merge 'raft: add global read barrier to group0_batch::commit and switch auth and service levels' from Marcin Maliszkiewicz

2026-03-11 10:37:19 +01:00

manual

gossiper: remove the code that was only used in gossiper topology

2026-03-10 10:39:58 +02:00

nodetool

nodetool: cluster repair: do not fail if a table was dropped

2026-03-11 16:35:04 +02:00

perf

test: move away from tombstone_gc_state(nullptr) ctor

2026-03-03 14:09:28 +02:00

pylib

test.py: eliminite drivers exception

2026-03-10 14:31:36 +02:00

pylib_test

test/pylib: introduce scale_timeout fixture helper

2026-03-05 13:07:09 +02:00

raft

Merge 'raft: Throw stopped_error if server aborted' from Dawid Mędrek

2026-03-05 10:47:39 +01:00

resource

schema: remove calculate_schema_digest function

2026-03-10 10:46:47 +02:00

rest_api

test: Keep test_gossiper_live_endpoints checks togethger

2026-01-23 16:53:48 +02:00

scylla_gdb

test/scylla_gdb: skip coroutine tests if coroutine frame is not found

2026-02-24 10:12:03 +01:00

unit

…

vector_search

vector_store_client: Return HTTP error description, not just code

2026-03-10 17:22:30 +01:00

__init__.py

test/pylib: introduce scale_timeout fixture helper

2026-03-05 13:07:09 +02:00

CMakeLists.txt

Revert "Merge 'vector_search: add validator tests' from Pawel Pery"

2026-02-08 16:29:58 +02:00

conftest.py

…

pytest.ini

test.py: fix strict-config argument.

2026-03-08 16:09:29 +02:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utilities and libraries are in lib/; shared Python code is in pylib/.

alternator - Python tests which connect to a single server and use the DynamoDB API

unit, boost, raft - unit tests in C++

cqlpy - Python tests which connect to a single server and use CQL

topology* - tests that set up clusters and add/remove nodes

cql - approval tests that use CQL and pre-recorded output

rest_api - tests for Scylla REST API Port 9000

scylla-gdb - tests for scylla-gdb.py helper script

nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, then copy a suite.ini from an existing folder into it and edit it.