mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 10:41:12 +00:00

Go to file

Kamil Braun db734cd74f service: storage_service: make leaving node a non-voter before removing it from group 0 in decommission/removenode

removenode currently works roughly like this:
1. stream/repair data so it ends up on new replica sets (calculated
   without the node we want to remove)
2. remove the node from the token ring
3. remove the node from group 0 configuration.

If the procedure fails before after step 2 but before step 3 finishes,
we're in trouble: the cluster is left with an additional voting group 0
member, which reduces group 0's availability, and there is no way to
remove this member because `removenode` no longer considers it to be
part of the cluster (it consults the token ring to decide).

Improve this failure scenario by including a new step at the beginning:
make the node a non-voter in group 0 configuration. Then, even if we
fail after removing the node from the token ring but before removing it
from group 0, we'll only be left with a non-voter which doesn't reduce
availability.

We make a similar change for `decommission`: between `unbootstrap()` (which
streams data) and `leave_ring()` (which removes our tokens from the
ring), become a non-voter. The difference here is that we don't become a
non-voter at the beginning, but only after streaming/repair. In
`removenode` it's desirable to make the node a non-voter as soon as
possible because it's already dead. In decommission it may be desirable
for us to remain a voter if we fail during streaming because we're still
alive and functional in that case.

In a later commit we'll also make it possible to retry `removenode` to
remove a node that is only a group 0 member and not a token ring member.

2023-01-17 12:28:00 +01:00

.github

Update CODEOWNERS file

2022-12-06 19:26:03 +02:00

alternator

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

api

api: get task statuses recursively

2023-01-11 12:34:06 +01:00

auth

api: Add API for resetting authorization cache

2022-06-28 19:58:06 -03:00

cdc

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

compaction

compaction: LCS: don't reshape all levels if only a single breaks disjointness

2023-01-17 09:55:15 +02:00

conf

config: Change wording of "none" in encryption options to maybe reduce user confusion

2022-12-12 16:14:53 +02:00

cql3

cql: allow disabling of USING TIMESTAMP sanity checking

2023-01-16 23:18:56 +02:00

data_dictionary

data_dictonary: add get_all_keyspaces() and get_user_keyspaces()

2022-12-10 12:51:05 +01:00

cql: allow disabling of USING TIMESTAMP sanity checking

2023-01-16 23:18:56 +02:00

debug

…

dht

dht/i_partitioner.hh: ring_position_ext: add weight() accessor

2023-01-09 09:46:57 -05:00

direct_failure_detector

direct_failure_detector: don't change meaning of endpoint_liveness

2022-11-28 21:58:30 +02:00

dist

dist/docker: support --replace-node-first-boot

2023-01-13 18:36:09 +02:00

docs

Merge 'Add replace-node-first-boot option' from Benny Halevy

2023-01-16 15:08:31 +01:00

exceptions

exception: fix the error code used for rate_limit_exception

2022-09-13 11:46:15 +02:00

gms

raft: replace experimental raft option with dedicated flag

2023-01-03 11:15:11 +02:00

idl

forward_service: fix timeout support in parallel aggregates

2023-01-16 12:08:13 +02:00

index

secondary_index_manager: don't add clustering key columns to index table of static column index

2022-12-06 11:21:16 +01:00

interface

…

lang

Merge 'tools/scylla-sstable: add lua scripting support' from Botond Dénes

2023-01-09 20:54:42 +02:00

licenses

…

locator

azure_snitch: Handle empty zone returned from IMDS

2023-01-09 11:57:45 +02:00

message

messaging: check that a node knows its own topology before accessing it

2023-01-02 11:53:14 +02:00

mutation_writer

position_in_partition: Make after_key() work with non-full keys

2022-12-14 14:47:33 +01:00

raft

Merge 'raft: raft_group0, register RPC verbs on all shards' from Gusev Petr

2023-01-04 11:11:21 +01:00

readers

readers/multishard: shard_reader::close() silence read-ahead timeouts

2023-01-04 16:10:09 +02:00

redis

storage_proxy.hh: Remove unused headers

2022-10-02 20:48:50 +03:00

reloc

SCYLLA-VERSION-GEN: use semver-compatible version

2022-07-25 18:06:28 +03:00

repair

Merge 'Abort repair tasks' from Aleksandra Martyniuk

2023-01-05 15:21:35 +01:00

replica

functions: initialize aggregates on scylla start

2023-01-10 17:44:18 +02:00

rust

test: assert that WASM allocations can fail without crashing

2023-01-06 14:07:29 +01:00

scripts

open-coredump.sh: handle dev versions

2023-01-12 19:28:58 +02:00

seastar @ 8889cbc198

Update seastar submodule

2023-01-08 18:56:00 +02:00

service

service: storage_service: make leaving node a non-voter before removing it from group 0 in decommission/removenode

2023-01-17 12:28:00 +01:00

sstables

Merge 'replica/database: fix read related metrics' from Botond Dénes

2023-01-09 12:18:49 +02:00

streaming

streaming: Enable offstrategy for all classic streaming based node ops

2022-12-28 11:12:02 +02:00

swagger-ui @ 12f1da1082

…

tasks

Merge 'Abort repair tasks' from Aleksandra Martyniuk

2023-01-05 15:21:35 +01:00

test

test: test_raft_upgrade: remove test_raft_upgrade_with_node_remove

2023-01-17 12:28:00 +01:00

thrift

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

tools

Update python3 submodule (license file fix)

2023-01-15 17:59:27 +02:00

tracing

cql3, transport, tests: remove "unset" from value type system

2023-01-16 21:10:56 +02:00

transport

cql3, transport, tests: remove "unset" from value type system

2023-01-16 21:10:56 +02:00

types

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

unified

install.sh: Skip systemd existance check when --without-systemd

2022-11-14 14:07:46 +02:00

utils

utils: small_vector: mark throw_out_of_range() const

2023-01-11 20:58:53 +02:00

.dockerignore

…

.gitattributes

gitattributes: Mark *.svg as binary

2022-07-31 15:25:24 +03:00

.gitignore

Add rust/Cargo.lock to .gitignore

2022-10-14 13:54:50 +03:00

.gitmodules

build: drop abseil submodule, replace with distribution abseil

2022-12-28 19:02:23 +02:00

.gitorderfile

…

.mailmap

Add .mailmap

2022-07-04 13:44:28 +03:00

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

amplify.yml

docs: automatic previews configuration

2022-11-04 15:44:22 +02:00

atomic_cell_hash.hh

…

atomic_cell_or_collection.hh

…

atomic_cell.cc

…

atomic_cell.hh

…

backlog_controller.hh

backlog_controller: keep scheduling_group by value

2022-08-02 07:38:40 +03:00

build_mode.hh

release: properly evaluate SCYLLA_BUILD_MODE_* macros

2022-08-29 10:20:19 +03:00

bytes_ostream.hh

bytes_ostream: don't take reference to packed variable

2022-11-28 21:40:18 +02:00

bytes.cc

…

bytes.hh

…

cache_flat_mutation_reader.hh

position_in_partition: Make after_key() work with non-full keys

2022-12-14 14:47:33 +01:00

cache_temperature.hh

…

caching_options.cc

…

caching_options.hh

…

canonical_mutation.cc

mutation_partition_view: make mutation_partition_view_virtual_visitor stoppable

2022-08-22 20:12:58 +03:00

canonical_mutation.hh

schema, everywhere: define and use table_id as a strong type

2022-08-08 08:09:41 +03:00

cartesian_product.hh

…

cell_locking.hh

…

checked-file-impl.hh

…

client_data.cc

…

client_data.hh

…

clocks-impl.cc

…

clocks-impl.hh

…

clustering_bounds_comparator.hh

…

clustering_interval_set.hh

…

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

build: drop abseil submodule, replace with distribution abseil

2022-12-28 19:02:23 +02:00

collection_mutation.cc

…

collection_mutation.hh

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

column_computation.hh

column_computation: adjust to use clustering_or_static_row

2022-12-06 11:21:16 +01:00

combine.hh

…

compatible_ring_position.hh

compatible_ring_position_or_view: make it cheap to copy

2022-10-04 12:00:21 +03:00

compound_compat.hh

…

compound.hh

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

compress.cc

…

compress.hh

…

concrete_types.hh

…

configure.py

Merge 'configure.py: a bunch of clean-up changes' from Michał Chojnowski

2023-01-12 16:40:02 +02:00

CONTRIBUTING.md

Add redirections

2022-06-28 09:39:14 +01:00

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

everywhere: define locator::host_id as a strong tagged_uuid type

2022-08-12 06:01:44 +03:00

counters.hh

everywhere: define locator::host_id as a strong tagged_uuid type

2022-08-12 06:01:44 +03:00

cql_serialization_format.hh

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

db_clock.hh

gc_clock, db_clock: mark functions noexcept

2022-07-27 13:17:01 +03:00

debug.hh

…

default.nix

build: remove references to unused c bindings of wasmtime

2023-01-06 14:07:29 +01:00

digest_algorithm.hh

…

digester.hh

…

Doxyfile

…

duration.cc

…

duration.hh

…

encoding_stats.hh

encoding_state: mark functions noexcept

2022-07-27 13:43:17 +03:00

enum_set.hh

…

fix_system_distributed_tables.py

…

flake.lock

build: fix Nix devenv

2022-12-19 20:53:07 +02:00

flake.nix

build: fix Nix devenv

2022-12-19 20:53:07 +02:00

frozen_mutation.cc

schema, everywhere: define and use table_schema_version as a strong type

2022-08-08 08:09:45 +03:00

frozen_mutation.hh

mutation_partition_view: make mutation_partition_view_virtual_visitor stoppable

2022-08-22 20:12:58 +03:00

frozen_schema.cc

idl: make idl headers self-sufficient

2022-08-08 08:02:27 +03:00

frozen_schema.hh

…

full_position.hh

service/storage_proxy: set smallest continue pos as query's continue pos

2022-08-10 06:03:38 +03:00

gc_clock.hh

gc_clock, db_clock: mark functions noexcept

2022-07-27 13:17:01 +03:00

gdbinit

gdbinit: add ignore clause for SIG35

2023-01-12 12:13:04 +02:00

gen_segmented_compress_params.py

…

generic_server.cc

…

generic_server.hh

…

HACKING.md

Move dev docs to docs/dev

2022-06-24 18:07:08 +01:00

hashers.cc

…

hashers.hh

…

hashing_partition_visitor.hh

…

hashing.hh

…

idl-compiler.py

idl-compiler: introduce cancellable verbs

2022-08-19 19:15:18 +02:00

inet_address_vectors.hh

…

init.cc

init: do not allow cfg.replace_node_first_boot of seed node

2023-01-13 18:30:48 +02:00

init.hh

…

install-dependencies.sh

build: remove references to unused c bindings of wasmtime

2023-01-06 14:07:29 +01:00

install.sh

install.sh: drop locale workaround from python3 thunk

2022-11-28 13:07:03 +02:00

interval.hh

…

intrusive_set_external_comparator.hh

…

keys.cc

add utf8:validate to operator<< partition_key with_schema.

2022-09-22 16:42:31 +03:00

keys.hh

…

LICENSE.AGPL

…

log.hh

…

main.cc

main: use std::shift_left() to consume tool name

2023-01-16 21:01:34 +02:00

map_difference.hh

…

marshal_exception.hh

…

multishard_mutation_query.cc

database.hh: Remove unused headers

2022-10-04 09:01:38 +03:00

multishard_mutation_query.hh

…

mutation_cleaner.hh

db: mutation_cleaner: Enqueue new snapshots at the back

2022-06-28 18:29:29 +03:00

mutation_compactor.hh

mutation_compactor: reset stop flag on page start

2022-12-24 13:52:45 +02:00

mutation_consumer_concepts.hh

…

mutation_consumer.hh

mutation{,_consumer,_partition}: remove consume_in_reverse::legacy_half_reverse

2023-01-05 18:48:55 +01:00

mutation_fragment_fwd.hh

…

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: avoid allocation when stream is correct

2022-11-22 19:19:18 +02:00

mutation_fragment_v2.hh

…

mutation_fragment.cc

position_in_partition: Make after_key() work with non-full keys

2022-12-14 14:47:33 +01:00

mutation_fragment.hh

Merge 'Fix handling of non-full clustering keys in the read path' from Tomasz Grabiec

2022-12-15 10:47:12 +02:00

mutation_partition_serializer.cc

idl: make idl headers self-sufficient

2022-08-08 08:02:27 +03:00

mutation_partition_serializer.hh

…

mutation_partition_view.cc

mutation_partition_view: make mutation_partition_view_virtual_visitor stoppable

2022-08-22 20:12:58 +03:00

mutation_partition_view.hh

mutation_partition_view: make mutation_partition_view_virtual_visitor stoppable

2022-08-22 20:12:58 +03:00

mutation_partition_visitor.hh

…

mutation_partition.cc

Merge 'Drop support for cql binary protocols versions 1 and 2' from Avi Kivity

2023-01-09 18:52:41 +02:00

mutation_partition.hh

deletable_row: add column_kind parameter to is_live

2022-12-06 11:21:16 +01:00

mutation_query.cc

…

mutation_query.hh

query: coroutinize to_data_query_result

2022-05-05 13:32:25 +03:00

mutation_rebuilder.hh

…

mutation_source_metadata.hh

…

mutation.cc

mutation_partition: compact_for_compaction: get tombstone_gc_state

2022-09-07 07:43:15 +03:00

mutation.hh

mutation{,_consumer,_partition}: remove consume_in_reverse::legacy_half_reverse

2023-01-05 18:48:55 +01:00

noexcept_traits.hh

…

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

…

partition_slice_builder.cc

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

partition_slice_builder.hh

…

partition_snapshot_reader.hh

utils/logalloc: allocating_section: don't use the global tracker

2022-08-23 10:38:58 +03:00

partition_snapshot_row_cursor.hh

row_cache: Fix missing row if upper bound of population range is evicted and has adjacent dummy

2022-08-09 02:28:56 +02:00

partition_version_list.hh

row_cache: Fix undefined behavior during eviction under some conditions

2022-08-01 23:53:15 +02:00

partition_version.cc

mvcc: Add snapshot details to the printout of partition_entry

2022-10-16 14:22:14 +03:00

partition_version.hh

mvcc: Add snapshot details to the printout of partition_entry

2022-10-16 14:22:14 +03:00

position_in_partition.hh

cache: Fix undefined behavior when populating with non-full keys

2023-01-10 12:51:54 +02:00

protocol_server.hh

…

querier.cc

Show warn message if tombstone_warn_threshold reached on querier.

2022-09-22 16:42:31 +03:00

querier.hh

querier: consume_page(): use partition_start as the sentinel value

2022-11-11 09:58:18 +02:00

query_class_config.hh

…

query_ranges_to_vnodes.cc

…

query_ranges_to_vnodes.hh

…

query_result_merger.hh

…

query-request.hh

forward_service: fix timeout support in parallel aggregates

2023-01-16 12:08:13 +02:00

query-result-reader.hh

treewide: use ::for_partition_start() instead of ::partition_start_tag_t{}

2022-11-11 09:58:18 +02:00

query-result-set.cc

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

query-result-set.hh

…

query-result-writer.hh

query-result-writer: stop when tombstone-limit is reached

2022-08-10 06:03:38 +03:00

query-result.hh

service/storage_proxy: set smallest continue pos as query's continue pos

2022-08-10 06:03:38 +03:00

query.cc

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

range_tombstone_assembler.hh

…

range_tombstone_change_generator.hh

…

range_tombstone_list.cc

range_tombstone_list: Avoid amortized_reserve()

2022-08-09 11:34:16 +03:00

range_tombstone_list.hh

db: range_tombstone_list: Avoid quadratic behavior when applying

2022-08-05 20:34:07 +03:00

range_tombstone_splitter.hh

…

range_tombstone.cc

…

range_tombstone.hh

Move dev docs to docs/dev

2022-06-24 18:07:08 +01:00

range.hh

…

read_context.hh

…

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: add disk_reads and sstables_read stats

2023-01-03 09:37:29 -05:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: add disk_reads and sstables_read stats

2023-01-03 09:37:29 -05:00

reader_permit.hh

reader_concurrency_semaphore: add disk_reads and sstables_read stats

2023-01-03 09:37:29 -05:00

README.md

Fix broken links

2022-06-28 15:19:36 +01:00

real_dirty_memory_accounter.hh

dirty_memory_manager: move to replica module

2022-12-06 22:24:17 +02:00

release.cc

release: define SCYLLA_BUILD_MODE_STR by stringifying SCYLLA_BUILD_MODE

2022-08-25 16:50:42 +02:00

release.hh

release: define SCYLLA_BUILD_MODE_STR by stringifying SCYLLA_BUILD_MODE

2022-08-25 16:50:42 +02:00

reversibly_mergeable.hh

…

row_cache.cc

row_cache: Fix violation of the "oldest version are evicted first" when evicting last dummy

2023-01-09 16:10:52 +02:00

row_cache.hh

…

schema_builder.hh

schema, everywhere: define and use table_id as a strong type

2022-08-08 08:09:41 +03:00

schema_fwd.hh

schema, everywhere: define and use table_schema_version as a strong type

2022-08-08 08:09:45 +03:00

schema_mutations.cc

db: schema_mutations: Make operator<<() print all mutations

2022-08-26 16:48:15 +02:00

schema_mutations.hh

schema_mutations: Make it a monoid by defining appropriate += operator

2022-08-26 16:48:15 +02:00

schema_registry.cc

schema_registry: fix abandoned feature warning

2022-08-11 15:11:21 +03:00

schema_registry.hh

…

schema_upgrader.hh

…

schema.cc

schema: operator<<: print also tombstone_gc_options

2022-12-22 16:40:18 +02:00

schema.hh

implement keyspace_element interface

2022-12-10 12:34:09 +01:00

scylla_post_install.sh

…

scylla-gdb.py

Merge 'scylla-gdb.py: introduce scylla get-config-value' from Botond Dénes

2022-12-21 18:38:23 +02:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN: remove unnecessary bashism

2023-01-16 20:34:01 +02:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

serializer_impl.hh: add reverse vector serializer

2022-11-14 16:06:24 +01:00

serializer.cc

…

serializer.hh

…

service_permit.hh

…

setup.py

…

shell.nix

build: improvements & upgrades to Nix dev environment

2022-10-02 11:47:16 +03:00

sstables_loader.cc

streaming: define plan_id as a strong tagged_uuid type

2022-08-22 19:45:30 +03:00

sstables_loader.hh

schema, everywhere: define and use table_id as a strong type

2022-08-08 08:09:41 +03:00

supervisor.hh

…

table_helper.cc

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

table_helper.hh

…

test.py

test/pylib: prefix cluster/manager logs with the current test name

2023-01-11 10:09:39 +01:00

timeout_config.cc

…

timeout_config.hh

…

timestamp.hh

…

to_string.hh

to_string: generalize operator<< for unordered_set

2022-07-18 18:20:33 +02:00

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

…

tombstone_gc_options.hh

…

tombstone_gc.cc

tombstone_gc: deglobalize repair_history_maps

2022-09-07 07:43:15 +03:00

tombstone_gc.hh

tombstone_gc: deglobalize repair_history_maps

2022-09-07 07:43:15 +03:00

tombstone.hh

…

tox.ini

…

types.cc

types: add some missing explicit instantiations

2023-01-17 10:46:01 +02:00

types.hh

treewide: drop cql_serialization_format

2023-01-03 19:54:13 +02:00

ubsan-suppressions.supp

…

unimplemented.cc

…

unimplemented.hh

…

validation.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

validation.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

version.hh

version: Reverse version increase

2022-12-12 18:45:32 +02:00

view_info.hh

view_info: adjust view_column to accept column_kind

2022-12-06 11:21:16 +01:00

vint-serialization.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

vint-serialization.hh

…

xx_hasher.hh

…

zstd.cc

…

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%