mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Go to file

Tomasz Grabiec 6bffc0d2e0 Merge 'utils/serialized_action: harden shutdown synchronization' from Piotr Szymaniak

`serialized_action::join()` is used as a shutdown barrier. After it returns, callers commonly destroy the owning object, and action lambdas often capture that owner by `this`.

The previous implementation waited for the internal semaphore once. This handles actions that are already running or triggers already queued before `join()`, because Seastar semaphores serve waiters FIFO. The problematic case is a late `trigger()` after `join()` has started while an older action is still running. Such a trigger can queue behind `join()`, allowing `join()` to return before that late trigger runs.

Review also found a separate semaphore bookkeeping bug in `trigger()`. The code manually waited on the semaphore and later signaled it through the caller-visible pending future. If the wait itself completed exceptionally, the signal path could still run and give back a semaphore unit that had never been acquired.

Make `join()` a terminal operation for `serialized_action`. Once `join()` starts, new `trigger()` calls fail with `broken_semaphore`. `join()` still waits for work that was accepted before it started, and only then breaks the semaphore so later waiters are rejected.

I audited the existing `serialized_action` users. Some callers explicitly remove trigger sources before `join()`, such as audit and topology_coordinator. Others rely on observer destruction or broader shutdown ordering, such as database, compaction_manager, io_throughput_updater, and schema_push. The least locally fenced case is `migration_manager::_group0_barrier`, which is reachable through several external paths, including task status lookup and other services. That makes this better enforced in `serialized_action` itself rather than relying on each caller to prove all trigger entrances are closed.

This is generic hardening of the shutdown contract, not a fix for a confirmed topology_coordinator-specific reproducer.

Also restore acquire/release ownership in `trigger()` by using `with_semaphore()`. This keeps semaphore release tied to successful acquisition while preserving the existing behavior where action completion and action errors are reported through the shared pending future.

Refs SCYLLADB-1904

No backport: this is generic shutdown hardening without a confirmed user-visible reproducer. The semaphore bookkeeping fix closes a latent exceptional wait path noticed during review, not a known production failure.

Closes scylladb/scylladb#29991

* github.com:scylladb/scylladb:
  utils/serialized_action: pair semaphore release with acquisition
  utils/serialized_action: harden join() against late triggers

2026-05-23 00:45:24 +02:00

.github

call_backport_with_jira.yaml: add missing workflow permissions

2026-05-18 15:50:00 +03:00

abseil @ 255c84dadd

abseil: update to lts_2026_01_07

2026-04-08 12:19:54 +03:00

alternator

tree: move away from collection_mutation_description

2026-05-21 10:23:29 +03:00

api

api: failure_detector: Introduce convict-node API

2026-05-21 21:13:54 +02:00

audit

tree: add missing -present to copyright headers

2026-05-21 10:57:42 +02:00

auth

tree: add missing -present to copyright headers

2026-05-21 10:57:42 +02:00

bin

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

cdc

tree: move away from collection_mutation_description

2026-05-21 10:23:29 +03:00

cmake

build: add -ftime-trace support for compilation profiling

2026-05-11 08:55:33 +03:00

compaction

Merge 'Don't materialize collections into intermediate representations' from Botond Dénes

2026-05-21 17:10:40 +03:00

conf

config: add live audit_rules option

2026-05-20 06:55:14 +02:00

cql3

Merge 'Don't materialize collections into intermediate representations' from Botond Dénes

2026-05-21 17:10:40 +03:00

data_dictionary

data_dictionary: fix swapped arguments in extraneous options error

2026-05-10 17:51:20 +03:00

Merge 'Don't materialize collections into intermediate representations' from Botond Dénes

2026-05-21 17:10:40 +03:00

debug

…

dht

locator: tablets: Support arbitrary tablet boundaries

2026-04-15 01:25:14 +02:00

dist

fix: raise scylla-helper.slice CPUWeight from 10 to 100 to prevent node_exporter CPU starvation

2026-05-18 11:55:14 +03:00

docs

Merge 'logstor: compare records by timestamp and segment sequence number' from Michael Litvak

2026-05-21 08:44:18 +03:00

ent

tree: add missing -present to copyright headers

2026-05-21 10:57:42 +02:00

exceptions

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

gms

gms: gossiper: Make convict() public and safe to call from any scheduling group

2026-05-21 21:13:54 +02:00

idl

Merge 'logstor: compare records by timestamp and segment sequence number' from Michael Litvak

2026-05-21 08:44:18 +03:00

index

external_index: fix require CDC options for disabled CDC

2026-05-19 08:53:15 +02:00

keys

cql: return InvalidRequest for oversized partition/clustering keys

2026-05-11 16:56:35 +03:00

lang

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

licenses

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

locator

cql: fix missing TABLETS_ROUTING_V1 payload after CAS shard bounce

2026-05-15 11:56:14 +02:00

message

strong_consistency: wait for raft servers to start in create table

2026-05-13 08:43:24 +02:00

mutation

Merge 'Don't materialize collections into intermediate representations' from Botond Dénes

2026-05-21 17:10:40 +03:00

mutation_writer

tree: move away from collection_mutation_view::with_deserialized()

2026-05-21 10:23:29 +03:00

node_ops

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

pgo

Update pgo profiles - aarch64

2026-05-15 05:49:12 +03:00

query

Merge 'query: result_set: change row member to a chunked vector' from Benny Halevy

2026-04-15 14:40:15 +03:00

raft

raft: fix send_snapshot abort_source lifetime

2026-05-18 21:49:37 +00:00

readers

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

reloc

…

repair

tasks: fix busy-spin and shutdown hang in tablet_virtual_task::wait() for repair tasks

2026-05-22 16:47:48 +03:00

replica

replica: Fix use-after-free in get_sstables_from_object_store

2026-05-22 15:05:21 +03:00

rust

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

schema

schema: column_computation: move away from collection_mutation_view::with_deserialized()

2026-05-21 10:23:29 +03:00

scripts

tree: add missing -present to copyright headers

2026-05-21 10:57:42 +02:00

seastar @ 510f314870

Update seastar submodule

2026-05-20 13:47:12 +03:00

service

tablet_allocator: use chunked_vector in cluster_resize_load to avoid oversized allocations

2026-05-22 16:52:12 +03:00

sstables

sstables: include SSTable filename in Stats metadata error messages

2026-05-22 16:49:37 +03:00

streaming

Merge 'streaming: add oos protection in mutation based streaming' from Łukasz Paszkowski

2026-04-20 17:56:36 +03:00

swagger-ui @ 12f1da1082

…

tasks

service: Add virtual task for vnodes-to-tablets migrations

2026-04-17 20:59:05 +03:00

test

sstables: include SSTable filename in Stats metadata error messages

2026-05-22 16:49:37 +03:00

tools

tree: move away from collection_mutation_view::with_deserialized()

2026-05-21 10:23:29 +03:00

tracing

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

transport

Merge 'cql: request-side custom payload parsing' from Dario Mirovic

2026-05-22 12:18:26 +02:00

types

types: fix indendation, left broken by previous commit

2026-05-21 10:23:29 +03:00

unified

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

utils

Merge 'utils/serialized_action: harden shutdown synchronization' from Piotr Szymaniak

2026-05-23 00:45:24 +02:00

vector_search

Update seastar submodule

2026-05-20 13:47:12 +03:00

.clang-format

…

.dockerignore

…

.gitattributes

…

.gitignore

gitignore: add missing rust build artifacts

2026-05-11 07:06:26 +03:00

.gitmodules

…

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

absl-flat_hash_map.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

AGENTS.md

tree: add AGENTS.md router and improve AI instruction files

2026-04-19 21:59:52 +03:00

amplify.yml

…

backlog_controller_fwd.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

backlog_controller.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

build_mode.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

bytes_fwd.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

bytes_ostream.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

bytes.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

bytes.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

cartesian_product.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

client_data.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

client_data.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

clocks-impl.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

clocks-impl.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

CMakeLists.txt

cmake: reuse precompiled header in scylla-main target

2026-05-14 19:46:51 +03:00

configure.py

audit: add preprocessed rule matching cache

2026-05-20 06:55:15 +02:00

CONTRIBUTING.md

…

coverage_excludes.txt

…

coverage_sources.list

…

db_clock.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

debug.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

debug.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

default.nix

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

Doxyfile

…

encoding_stats.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

enum_set.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

exported_templates.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

exported_templates.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

fix_system_distributed_tables.py

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

flake.lock

…

flake.nix

…

gc_clock.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

gdbinit

…

gen_segmented_compress_params.py

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

HACKING.md

…

hashing_partition_visitor.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

idl-compiler.py

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

inet_address_vectors.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

init.cc

alternator: Graduate Alternator Streams from experimental

2026-04-22 15:22:15 +02:00

init.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

install-dependencies.sh

build: add slirp4netns to dependencies

2026-03-05 17:44:17 +02:00

install.sh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

LICENSE-ScyllaDB-Source-Available.md

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

main.cc

Merge 'Introduce auth::config to decouple auth modules from db::config' from Pavel Emelyanov

2026-05-18 11:32:11 +02:00

marshal_exception.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

mutation_query.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

mutation_query.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

mutation/collection_mutation: collection_mutation(): remove unused abstract_type param

2026-05-21 08:34:21 +03:00

partition_range_compat.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

partition_slice_builder.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

partition_slice_builder.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

query_ranges_to_vnodes.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

query_ranges_to_vnodes.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

reader_concurrency_semaphore_group.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

reader_concurrency_semaphore_group.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: drop unused stop_ext_{pre,post}()

2026-04-15 14:40:15 +03:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: drop unused stop_ext_{pre,post}()

2026-04-15 14:40:15 +03:00

reader_permit.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

README.md

docs: fix link to docker build README.MD

2026-02-18 12:12:46 +01:00

real_dirty_memory_accounter.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

release.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

release.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

reversibly_mergeable.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

schema_upgrader.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

scylla_post_install.sh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

scylla-gdb.py

scylla-gdb: display ms-format sstable summary from partitions db footer

2026-05-11 16:58:22 +03:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 2026.3.0-dev

2026-04-26 15:30:13 +03:00

seastarx.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

serialization_visitors.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

serializer_impl.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

serializer.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

serializer.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

service_permit.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

shell.nix

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

sstable_dict_autotrainer.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

sstable_dict_autotrainer.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

sstables_loader_helpers.cc

sstables_loader: Fail tablet-restore task if not all sstables were downloaded

2026-05-12 10:40:24 +03:00

sstables_loader_helpers.hh

sstables_loader_helpers: just reformat the code

2026-05-12 10:40:22 +03:00

sstables_loader.cc

sstables_loader: hold token_metadata_ptr to prevent use-after-free in tablet_restore_task_impl::run()

2026-05-22 01:10:25 +02:00

sstables_loader.hh

sstables_loader: return shared_sstable from attach_sstable

2026-05-12 10:40:24 +03:00

stdafx.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

stdafx.hh

build: drop utils/rolling_max_tracker.hh from precompiled header

2026-04-22 15:46:50 +03:00

supervisor.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

table_helper.cc

test/boost: add regression test for table_helper insert() UAF

2026-04-30 11:45:12 +02:00

table_helper.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

test.py

docker: fix coredump collection when host uses pipe-based core_pattern

2026-05-12 14:16:22 +03:00

timeout_config.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

timeout_config.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

tombstone_gc_extension.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

tombstone_gc_options.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

tombstone_gc_options.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

tombstone_gc-internals.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

tombstone_gc.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

tombstone_gc.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

ubsan-suppressions.supp

…

unimplemented.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

unimplemented.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

validation.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

validation.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

version.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

view_info.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

vint-serialization.cc

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

vint-serialization.hh

LICENSE: Update to version 1.1

2026-04-12 19:46:33 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++23 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%