scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 12:06:44 +00:00


Avi Kivity 63e3a22f2e Merge 'group0_state_machine: don't update in-memory state machine until start' from Piotr Dulikowski

Group0 commands consist of one or more mutations and are supposed to be
atomic, i.e. the data structures that reflect the state of the group0
tables are not supposed to be updated while only some mutations of a
command are applied, and the logic responsible for updating them is not
supposed to observe an inconsistent state of the group0 tables.

It turns out that this assumption can be broken if a node crashes in the
middle of applying a multi-mutation group0 command. Because these
mutations are, in general, applied separately, only some mutations might
survive a crash and a restart, so the group0 tables might be in an
inconsistent state. The current logic of group0_state_machine will
attempt to read the group0 tables' state as it was left after restart,
so it may observe inconsistent state.

This can confuse the node as it may observe a state that it was not
supposed to observe, or the state will just outright break some
invariants and trigger some sanity checks. One of those was observed in
https://github.com/scylladb/scylladb/issues/26945, where a command from the CDC generation
publisher fiber was partially applied. In addition to publishing
generations, the fiber also removes old, expired generations.
Removal is done by removing data that describes the generation from
cdc_generations_v3 and by removing the generation's ID from the
committed generation list in the topology table. If only the first
mutation gets through but not the other one, on reload the node will see
a committed CDC generation without data, which will trigger an
on_internal_error check.

Fix this by delaying the moment when the in-memory data structures are
first loaded. In 579dcf187a, a mechanism was introduced which persists the
commit index before applying commands that are considered committed.
Starting a raft server waits until commands are replayed up to that
point. The fix is to start the group0_state_machine in a mode which only
applies mutations - the aforementioned mechanism will re-apply the
commands which will, thanks to the mutation idempotency, bring the
group0 to a consistent state. After the group0 is known to be in a
consistent state (so, after raft::server_impl::start) the in-memory data
structures of group0 are loaded for the first time.
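
The following is a toy Python model of the idea above, not ScyllaDB code. All names (`Group0Tables`, `load_in_memory_state`, `replay`, the mutation tuples) are hypothetical and only stand in for the real tables and state machine. It sketches, under those assumptions, why loading in-memory state right after a crash can trip a sanity check, and why replaying idempotent mutations up to the persisted commit index before the first load avoids it.

```python
# Toy model only: every name here is hypothetical, not a ScyllaDB API.

class Group0Tables:
    """Persistent tables; each mutation is applied separately and idempotently."""

    def __init__(self):
        # Stands in for cdc_generations_v3.
        self.generation_data = {"gen1": "data1", "gen2": "data2"}
        # Stands in for the committed generation list in the topology table.
        self.committed_generations = {"gen1", "gen2"}

    def apply(self, mutation):
        kind, gen_id = mutation
        if kind == "delete_data":
            self.generation_data.pop(gen_id, None)       # no-op if already gone
        elif kind == "delete_committed_id":
            self.committed_generations.discard(gen_id)   # no-op if already gone
        else:
            raise ValueError(f"unknown mutation kind: {kind}")


def load_in_memory_state(tables):
    """Build the in-memory view; fail if a committed generation has no data."""
    missing = {g for g in tables.committed_generations
               if g not in tables.generation_data}
    if missing:
        raise RuntimeError(f"committed generation without data: {sorted(missing)}")
    return {g: tables.generation_data[g] for g in tables.committed_generations}


def replay(tables, log, commit_index):
    """Re-apply every command up to the persisted commit index."""
    for command in log[:commit_index]:
        for mutation in command:
            tables.apply(mutation)


# One command with two mutations that removes the expired generation gen1.
remove_gen1 = [("delete_data", "gen1"), ("delete_committed_id", "gen1")]
log = [remove_gen1]
commit_index = 1  # persisted before the command was applied

tables = Group0Tables()
tables.apply(remove_gen1[0])  # the node "crashes" after the first mutation only

# Loading immediately after restart observes the inconsistent state.
try:
    load_in_memory_state(tables)
    loaded_before_replay = True
except RuntimeError:
    loaded_before_replay = False

# Replaying first, then loading, observes a consistent state.
replay(tables, log, commit_index)
state = load_in_memory_state(tables)
```

In this model the first load attempt fails, while the load after `replay` succeeds because re-applying an already-applied mutation changes nothing.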

There is an exception, however: schema tables. Information about schema
is actually loaded into memory earlier than the moment when group0 is
started. Applying changes to schema is done through the migration
manager module which compares the persisted state before and after the
schema mutations are applied and acts on that. Refactoring migration
manager is out of scope of this PR. However, this is not a problem
because the migration manager takes care to apply all of the mutations
given in a command in a single commitlog segment, so the initial schema
loading code should not see an inconsistent state due to the state being
partially applied.

The fix is accompanied by a reproducer of scylladb/scylladb#26945.

Fixes: scylladb/scylladb#26945

This is not a regression, so no need to backport.

Closes scylladb/scylladb#27528

* github.com:scylladb/scylladb:
  test: cluster: test for recovery after partial group0 command
  group0_state_machine: remove obsolete comment about group0 consistency
  group0_state_machine: don't update in-memory state machine until start
  group0_state_machine: move reloading out of std::visit
  service: raft: add state machine ref to raft_server_for_group

2025-12-28 13:59:26 +02:00

alternator

test/alternator: remove unused imports

2025-12-24 13:44:28 +02:00

boost

test/boost: fix flaky test_inject_future_disabled

2025-12-25 20:46:31 +02:00

broadcast_tables

…

cluster

Merge 'group0_state_machine: don't update in-memory state machine until start' from Piotr Dulikowski

2025-12-28 13:59:26 +02:00

cql

vector_search: Restrict vector index tests to tablets only

2025-11-25 09:26:16 +02:00

cqlpy

cql3/statements/batch_statement: make size error message more verbose

2025-12-24 15:27:01 +02:00

ldap

main: auth: add auth cache dependency to auth service

2025-11-26 12:01:31 +01:00

lib

raft: auth: add semaphore to auth_cache::load_all

2025-12-24 10:56:24 +02:00

manual

replica/table: keep track of total pre-compression file size

2025-11-13 00:49:57 +01:00

nodetool

Fix for Statement has no effect

2025-12-24 06:43:26 +02:00

perf

service: pass topology and system_keyspace to load_balancer ctor

2025-12-16 13:25:38 +01:00

pylib

test: pylib: log_browsing: fix infinite loop in find_backtraces()

2025-12-25 20:22:17 +02:00

pylib_test

…

raft

test/raft: improve reporting in the randomized_nemesis_test digest functions

2025-12-22 20:02:40 +02:00

resource

build: apply sccache to rust builds too

2025-12-22 15:36:15 +02:00

rest_api

test: add API tests for client_routes endpoints

2025-12-15 17:46:14 +01:00

scylla_gdb

test/scylla_gdb: use gcore instead of signal SIGSEGV to generate a coredump on failure

2025-12-16 06:53:43 +02:00

storage

Merge 'Synchronize incremental repair and tablet split' from Raphael Raph Carvalho

2025-12-23 07:28:56 +02:00

unit

…

vector_search

vector_search: test: Fix flaky DNS resolution test

2025-12-21 20:02:16 +02:00

vector_search_validator

Revert "vector_search_validator: move high availability tests from vector-store.git"

2025-12-25 12:30:22 +00:00

__init__.py

…

CMakeLists.txt

vector_search: implement building vector-search-validator

2025-11-24 17:26:04 +01:00

conftest.py

…

pytest.ini

…

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utilities and libraries are in lib/; shared Python code is in pylib/.

* alternator - Python tests which connect to a single server and use the DynamoDB API
* unit, boost, raft - unit tests in C++
* cqlpy - Python tests which connect to a single server and use CQL
* topology* - tests that set up clusters and add/remove nodes
* cql - approval tests that use CQL and pre-recorded output
* rest_api - tests for Scylla REST API Port 9000
* scylla-gdb - tests for scylla-gdb.py helper script
* nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. Create a new folder for a new large category or subsystem, or when the test environment differs significantly from every existing suite: for example, you plan to start scylladb with a different configuration, intend to add many tests, and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.