scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 04:06:59 +00:00


Petr Gusev 8b8b7adbe5 raft_group0: split shutdown into abort_and_drain and destroy

Previously, raft_group0::abort() was called in
storage_service::do_drain (introduced in #24418) to
stop the group0 Raft server before destroying local storage.
This was necessary because raft::server depends on storage
(via raft_sys_table_storage and group0_state_machine).

However, this caused issues: services like
sstable_dict_autotrainer and auth::service, which use
group0_client but are not stopped by storage_service,
could trigger use-after-free if raft_group0 was destroyed
too early. This can happen both during normal shutdown
and when 'nodetool drain' is used.

This commit reworks the shutdown logic:
* Introduces abort_and_drain(), which aborts the server
and waits for background tasks to finish, but keeps the
server object alive. Clients will see raft::stopped_error if
they try to access group0 after abort_and_drain().
* Final destruction happens in a separate method destroy(),
called later from main.cc.
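
The two-phase shutdown described above can be modelled in a few lines. This is only an illustrative Python sketch, not ScyllaDB code: `Group0Model`, `StoppedError`, `start_background()` and `read()` are hypothetical stand-ins for raft_group0, raft::stopped_error and a client call made through group0_client.

```python
import asyncio


class StoppedError(Exception):
    """Stand-in for raft::stopped_error in this toy model."""


class Group0Model:
    """Toy service with a two-phase shutdown (hypothetical, not ScyllaDB code)."""

    def __init__(self):
        self._aborted = False
        self._background = []  # tasks that must finish before drain returns
        self.destroyed = False

    def start_background(self, coro):
        # Must be called from inside a running event loop.
        self._background.append(asyncio.create_task(coro))

    async def abort_and_drain(self):
        # Phase 1: refuse new requests and wait for background work,
        # but keep the object alive so late clients fail cleanly.
        self._aborted = True
        if self._background:
            await asyncio.gather(*self._background, return_exceptions=True)
        self._background.clear()

    def read(self):
        # A client call: after phase 1 it gets an error, not a dangling object.
        if self._aborted:
            raise StoppedError("group0 is stopped")
        return "ok"

    def destroy(self):
        # Phase 2: final teardown, only valid after abort_and_drain().
        if not self._aborted:
            raise RuntimeError("destroy() called before abort_and_drain()")
        self.destroyed = True


async def demo():
    g = Group0Model()
    done = []

    async def background_job():
        await asyncio.sleep(0)
        done.append(True)

    g.start_background(background_job())
    before = g.read()
    await g.abort_and_drain()
    try:
        g.read()
        after = "ok"
    except StoppedError:
        after = "stopped"
    g.destroy()
    return before, done, after, g.destroyed
```

In this model, a client that calls `read()` between the two phases gets a `StoppedError` from a live object, which is the property the commit relies on to avoid use-after-free.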

raft_server_for_group::aborted is changed to a
shared_future. abort_server() now returns a future so that
abort_and_drain() can wait for it. If abort_server() was
already called earlier, which can happen in the
on_background_error callback, it returns the future from
that earlier call.
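
As a rough analogy for the shared-future behaviour, the Python sketch below uses an asyncio task in place of a Seastar shared_future. `ServerEntry` and `_do_abort()` are hypothetical names; only abort_server and the aborted member come from the commit message.

```python
import asyncio


class ServerEntry:
    """Toy stand-in for raft_server_for_group (hypothetical, not ScyllaDB code)."""

    def __init__(self):
        self.aborted = None  # set to a shared task by the first abort
        self.abort_runs = 0  # how many times the abort work actually ran

    async def _do_abort(self):
        self.abort_runs += 1
        await asyncio.sleep(0)  # pretend to wait for the server to stop

    def abort_server(self):
        # The first caller starts the abort; later callers get the same task,
        # so everyone waits for one and the same completion.
        if self.aborted is None:
            self.aborted = asyncio.create_task(self._do_abort())
        return self.aborted


async def demo():
    entry = ServerEntry()
    first = entry.abort_server()   # e.g. from an error callback
    second = entry.abort_server()  # e.g. from the drain path
    await second
    return first is second, entry.abort_runs
```

Both callers receive the same awaitable and the abort work runs once, which is what lets the drain path wait on an abort that some other path started.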

Node startup can fail before reaching storage_service,
in which case ss.drain_on_shutdown() and abort_and_drain()
are never called. To ensure proper cleanup,
abort_and_drain() is called from main.cc before destroy().
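
The ordering can be sketched as follows, assuming that a repeated abort_and_drain() call is a harmless no-op. `Group0Stub` and `run_node()` are hypothetical and only mimic the call order; they do not reproduce main.cc.

```python
class Group0Stub:
    """Records shutdown calls (hypothetical, not ScyllaDB code)."""

    def __init__(self, log):
        self._log = log
        self._drained = False

    def abort_and_drain(self):
        # Assumed idempotent: only the first call does any work.
        if self._drained:
            self._log.append("abort_and_drain: no-op")
            return
        self._drained = True
        self._log.append("abort_and_drain: drained")

    def destroy(self):
        assert self._drained, "destroy() requires a prior abort_and_drain()"
        self._log.append("destroy")


def run_node(fail_early):
    log = []
    group0 = Group0Stub(log)
    try:
        if fail_early:
            raise RuntimeError("startup failed before storage_service")
        # Normal path: the storage-service drain stops group0 first.
        group0.abort_and_drain()
    except RuntimeError:
        log.append("startup error")
    finally:
        # Top-level cleanup always drains before destroying, so the early
        # failure path is covered as well.
        group0.abort_and_drain()
        group0.destroy()
    return log
```

With `fail_early=True` the only drain happens in the cleanup step; on the normal path the cleanup call finds the work already done.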

Clients of raft_group_registry are expected to call
destroy_server() for the servers they own. Currently,
the only such client is raft_group0, which satisfies
this requirement. As a result,
raft_group_registry::stop_servers() is no longer needed.
Instead, raft_group_registry::stop() now verifies that all
servers have been properly destroyed.
If any remain, it calls on_internal_error().
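
The ownership contract can be illustrated with a small Python model. `RegistryModel`, `add_server()` and `InternalError` are hypothetical; the exception merely stands in for whatever on_internal_error() does in the real code.

```python
class InternalError(Exception):
    """Stand-in for the failure reported via on_internal_error()."""


class RegistryModel:
    """Toy registry that expects owners to destroy their servers."""

    def __init__(self):
        self._servers = {}

    def add_server(self, group_id, server):
        self._servers[group_id] = server

    def destroy_server(self, group_id):
        # Called by the owner of the server (raft_group0 in the commit).
        del self._servers[group_id]

    def stop(self):
        # The registry no longer stops servers itself; it only checks
        # that every owner has cleaned up.
        if self._servers:
            leaked = sorted(self._servers)
            raise InternalError(f"servers not destroyed: {leaked}")
```

Calling `stop()` while a server is still registered raises; after the owner calls `destroy_server()`, `stop()` returns quietly.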

The call to drain_on_shutdown() in cql_test_env.cc
appears redundant. The only source of raft::server
instances in raft_group_registry is group0_service, and
if group0_service.start() succeeds, both abort_and_drain()
and destroy() are guaranteed to be called during shutdown.

2025-07-25 17:16:14 +02:00

alternator

Merge 'alternator: avoid oversized allocation in Query/Scan' from Nadav Har'El

2025-07-17 11:30:40 +03:00

boost

treewide: Move misc files to utils directory

2025-07-21 11:56:40 +03:00

broadcast_tables

test.py: cql: run tests using bare pytest command

2025-06-03 07:54:51 +00:00

cluster

raft_group0: split shutdown into abort_and_drain and destroy

2025-07-25 17:16:14 +02:00

cql

test.py: cql: don't exit from pytest session on failed CQL

2025-06-03 07:54:51 +00:00

cqlpy

test/cqlpy: in README.md, remind users of run-cassandra to set NODETOOL

2025-07-22 12:39:00 +02:00

ldap

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

lib

raft_group0: split shutdown into abort_and_drain and destroy

2025-07-25 17:16:14 +02:00

manual

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

nodetool

tools/scylla-nodetool: backup: add --move-files parameter

2025-06-27 16:21:39 +03:00

perf

Merge 'Atomic in-memory schema changes application' from Marcin Maliszkiewicz

2025-07-13 20:47:55 +03:00

pylib

test.py: don't crash on early cleanup of ScyllaServer

2025-07-22 12:39:01 +02:00

pylib_test

test.py: remove pylib_test from test.py/CI run

2025-04-01 16:43:45 +03:00

raft

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

redis

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

resource

types/comparable_bytes: add testcase to verify compatibility with cassandra

2025-07-01 22:19:08 +05:30

rest_api

test.py: Fix test_compactionhistory_rows_merged_time_window_compaction_strategy

2025-07-01 15:01:21 +03:00

scylla_gdb

test/scylla_gdb: better error message when running on dev build mode

2025-04-22 15:02:06 +03:00

unit

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

__init__.py

test.py: allow cmake configuration and ./configure.py configuration to coexist

2025-06-03 16:46:41 +03:00

CMakeLists.txt

Introduce LDAP role manager & saslauthd authenticator

2025-01-12 14:50:29 +02:00

conftest.py

test.py: add bypassing x_log2_compaction_groups to boost tests

2025-07-11 12:30:09 +02:00

pytest.ini

test.py: dtest: add missed markers to pytest.ini

2025-06-30 10:06:32 +00:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utilities and libraries are in lib/; shared Python code is in pylib/.

* alternator - Python tests which connect to a single server and use the DynamoDB API
* unit, boost, raft - unit tests in C++
* cqlpy - Python tests which connect to a single server and use CQL
* topology* - tests that set up clusters and add/remove nodes
* cql - approval tests that use CQL and pre-recorded output
* rest_api - tests for Scylla REST API Port 9000
* scylla-gdb - tests for scylla-gdb.py helper script
* nodetool - tests for C++ implementation of nodetool

If an existing folder fits your test, add the test there. Create a new folder only for a new large category or subsystem, or when the test environment differs significantly from every existing suite: for example, when you plan to start ScyllaDB with a different configuration, intend to add many tests, and want them to reuse an existing Scylla cluster (clusters can be reused by tests within the same folder).

To add a new folder, create a new directory, then copy a suite.ini from an existing folder into it and edit it.
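
A minimal sketch of that step, assuming the new folder is seeded from an existing suite's suite.ini. The helper name `create_suite` and its parameters are hypothetical, and the contents of suite.ini are not modelled here; edit the copied file by hand afterwards.

```python
import shutil
from pathlib import Path


def create_suite(test_root, template_suite, new_suite):
    """Create test_root/new_suite and seed it with the template's suite.ini."""
    root = Path(test_root)
    src = root / template_suite / "suite.ini"
    dst_dir = root / new_suite
    dst_dir.mkdir()  # raises FileExistsError if the folder already exists
    dst = dst_dir / "suite.ini"
    shutil.copy(src, dst)  # copy only; the file still needs manual editing
    return dst
```

For example, `create_suite("test", "cqlpy", "my_suite")` would create test/my_suite/suite.ini as a copy of test/cqlpy/suite.ini, provided that file exists.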