mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 04:56:58 +00:00

Go to file

Tomasz Grabiec 809ddd7f79 Merge 'Move pending_ranges and endpoints_for_reading from token_metadata to erm' from Gusev Petr

This refactoring is a follow-up for https://github.com/scylladb/scylladb/pull/13376, move per keyspace data structures related to topology changes from `token_metadata` to `erm`.

We move `pending_endpoints` and `read_endpoints`, along with their computation logic, from `token_metadata` to `vnode_effective_replication_map`. The `vnode_effective_replication_map` seems more appropriate for them since it contains functionally similar `replication_map` and we will be able to reuse `pending_endpoints/read_endpoints` across keyspaces sharing the same `factory_key`.

At present, `pending_endpoints` and `read_endpoints` are updated in the `update_pending_ranges` function. The update logic comprises two parts - preparing data common to all keyspaces/replication_strategies, and calculating the `migration_info` for specific keyspaces. In this PR we introduce a new `topology_change_info` structure to hold the first part's data and create an `update_topology_change_info` function to update it. This structure will be used in `vnode_effective_replication_map` to compute `pending_endpoints` and `read_endpoints`. This enables the reuse of `topology_change_info` across all keyspaces, unlike the current `update_pending_ranges` implementation, which is another benefit of this refactoring.

The PR also optimises `replication_map` memory usage for the case `natural_endpoints_depend_on_token == false`. We store endpoints list only once with special key
instead of duplicating them for each `vnode` token.

The original `update_pending_ranges` remains unchanged during the PR commits, and will be removed entirely upon transitioning to the new implementation.

Closes #13715

* github.com:scylladb/scylladb:
  token_metadata_test: add a test for everywhere strategy
  token_metadata_test: check read_endpoints when bootstrapping first node
  token_metadata_test: refactor tests, extract create_erm
  token_metadata: drop has_pending_ranges and migration_info
  effective_replication_map: add has_pending_ranges
  token_metadata: drop update_pending_ranges
  effective_replication_map: use new get_pending_endpoints and get_endpoints_for_reading
  token_metadata_test.cc: create token_metadata and replication_strategy as shared pointers
  vnode_effective_replication_map: get_pending_endpoints and get_endpoints_for_reading
  calculate_effective_replication_map: compute pending_endpoints and read_endpoints
  vnode_erm: optimize replication_map
  vnode_erm::get_range_addresses: use sorted_tokens
  abstract_replication_strategy.hh: de-virtualize natural_endpoints_depend_on_token
  sequenced_set: add extract_vector method
  effective_replication_map: clone_endpoints_gently -> clone_data_gently
  vnode_erm: gentle destruction of _pending_endpoints and _read_endpoints
  stall_free.hh: add clear_gently for rvalues
  stall_free.hh: relax Container requirement
  token_metadata: add pending_endpoints and read_endpoints to vnode_effective_replication_map
  token_metadata: introduce topology_change_info
  token_metadata: replace set_topology_transition_state with set_read_new

2023-05-22 21:37:06 +02:00

.github

docs: Separate conf.py

2023-03-27 13:42:58 +03:00

alternator

Merge 'alternator,config: make alternator_timeout_in_ms live-updateable' from Kefu Chai

2023-05-15 10:16:29 +03:00

api

Merge 'raft topology: implement check_and_repair_cdc_streams API' from Kamil Braun

2023-05-22 11:33:58 +02:00

auth

auth: disallow CREATE permission on a specific function

2023-05-14 18:40:34 +03:00

cdc

Merge 'cdc, db_clock: specialize fmt::formatter<{db_clock::time_point, generation_id}>' from Kefu Chai

2023-05-01 22:56:33 +03:00

cmake

build: cmake: disable deprecated warning

2023-05-05 15:31:39 +08:00

compaction

Merge 'perform_cleanup: wait until all candidates are cleaned up' from Benny Halevy

2023-05-19 12:35:59 +03:00

conf

db: config: Introduce experimental "TABLETS" feature

2023-04-24 10:49:36 +02:00

cql3

cql3/expr: print expressions in user-friendly way by default

2023-05-18 20:57:00 +03:00

data_dictionary

data_dictionary: define helpers in options

2023-05-09 21:51:52 +08:00

Merge 'Move pending_ranges and endpoints_for_reading from token_metadata to erm' from Gusev Petr

2023-05-22 21:37:06 +02:00

debug

…

dht

Revert "dht: incremental_owned_ranges_checker: use lower_bound()"

2023-05-02 08:01:44 +03:00

direct_failure_detector

direct_failure_detector: Avoid throwing exceptions in the success path

2023-03-31 12:40:43 +02:00

dist

scylla_raid_setup: wipe filesystem signatures from specified disks

2023-05-08 16:53:43 +03:00

docs

Merge 'raft topology: implement check_and_repair_cdc_streams API' from Kamil Braun

2023-05-22 11:33:58 +02:00

exceptions

…

gms

Merge 'Cut feature_service -> system_keyspace dependency' from Pavel Emelyanov

2023-05-18 18:21:06 +02:00

idl

raft, idl: restore internal::tagged_uint64 type

2023-05-09 12:38:20 +03:00

index

index: s/std::regex/boost::regex/

2023-04-06 09:50:41 -04:00

interface

…

lang

db, cql3: functions: pass function parameters as a span instead of a vector

2023-04-19 20:38:55 +03:00

licenses

scripts: remove git-archive-all

2023-03-29 18:59:23 +03:00

locator

token_metadata: drop has_pending_ranges and migration_info

2023-05-21 13:17:42 +04:00

message

gossiper: version_generator: add {debug_,}validate_gossip_generation

2023-04-23 08:48:01 +03:00

mutation

Resurrect optimization to avoid bloom filter checks during compaction

2023-05-18 09:01:50 +03:00

mutation_writer

reader_permit: keep trace_state pointer on permit

2023-03-22 04:58:01 -04:00

raft

raft, idl: restore internal::tagged_uint64 type

2023-05-09 12:38:20 +03:00

readers

readers,mutation: move mutation_fragment_stream_validator to mutation/

2023-05-09 07:55:13 -04:00

redis

redis,thrift,transport: make timeout_config live-updateable

2023-03-29 20:17:45 +08:00

reloc

…

repair

repair: Add per peer node error for get_sync_boundary and friends

2023-05-15 09:52:27 +03:00

replica

Merge 'perform_cleanup: wait until all candidates are cleaned up' from Benny Halevy

2023-05-19 12:35:59 +03:00

rust

rust: update wasmtime dependency

2023-05-16 13:03:29 +03:00

schema

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

scripts

scripts/refresh-submodules.sh: use the correct sha1 in title

2023-05-22 12:10:03 +03:00

seastar @ f94b1bb9cb

Update seastar submodule

2023-05-05 00:32:11 +03:00

service

Merge 'Move pending_ranges and endpoints_for_reading from token_metadata to erm' from Gusev Petr

2023-05-22 21:37:06 +02:00

sstables

Merge 'perform_cleanup: wait until all candidates are cleaned up' from Benny Halevy

2023-05-19 12:35:59 +03:00

streaming

migration manager: Make schema pull abortable.

2023-05-11 16:31:23 +03:00

swagger-ui @ 12f1da1082

…

tasks

repair: rename repair_module

2023-03-27 16:33:39 +02:00

test

Merge 'Move pending_ranges and endpoints_for_reading from token_metadata to erm' from Gusev Petr

2023-05-22 21:37:06 +02:00

thrift

cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt

2023-05-07 17:17:36 +03:00

tools

sstables: move get_components_lister() into sstable_directory

2023-05-18 08:43:35 +03:00

tracing

tracing: List-initialize trace_state::_records

2023-05-12 16:15:58 +03:00

transport

cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt

2023-05-07 17:17:36 +03:00

types

cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt

2023-05-07 17:17:36 +03:00

unified

install.sh: use scylla-jmx for detecting JRE

2023-05-19 11:22:57 +03:00

utils

sequenced_set: add extract_vector method

2023-05-21 11:33:38 +04:00

.dockerignore

…

.gitattributes

…

.gitignore

…

.gitmodules

Repackaging cqlsh

2023-03-12 20:22:33 +02:00

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

amplify.yml

…

backlog_controller.hh

…

build_mode.hh

release: correct a typo in comment

2023-03-29 13:42:38 +03:00

bytes_ostream.hh

utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view

2023-05-07 17:17:34 +03:00

bytes.cc

bytes: implement formatting helpers using formatter

2023-03-27 20:06:45 +08:00

bytes.hh

utils: hashing: use simple_xx_hasher

2023-04-24 14:07:25 +03:00

cache_flat_mutation_reader.hh

…

cache_temperature.hh

…

cartesian_product.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

cell_locking.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

checked-file-impl.hh

…

client_data.cc

…

client_data.hh

…

clocks-impl.cc

db_clock: specialize fmt::formatter<db_clock::time_point>

2023-04-28 15:48:06 +08:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

…

clustering_interval_set.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

build: cmake: add Scylla_USE_LINKER option

2023-05-16 11:30:18 +03:00

collection_mutation.cc

…

collection_mutation.hh

…

column_computation.hh

…

combine.hh

…

compatible_ring_position.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

compound_compat.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

compound.hh

keys: change from_optional_exploded to accept a span instead of a vector

2023-04-19 20:18:50 +03:00

compress.cc

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

compress.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

concrete_types.hh

…

configure.py

Merge 'Fix heart_beat_state::force_highest_possible_version_unsafe' from Benny Halevy

2023-05-16 13:59:41 +02:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

counters: specialize fmt::formatter<counter_{shard,cell}_view>

2023-05-21 17:13:06 +03:00

counters.hh

counters: specialize fmt::formatter<counter_{shard,cell}_view>

2023-05-21 17:13:06 +03:00

cql_serialization_format.hh

…

db_clock.hh

db_clock: specialize fmt::formatter<db_clock::time_point>

2023-04-28 15:48:06 +08:00

debug.cc

…

debug.hh

…

default.nix

…

Doxyfile

…

duration.cc

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

duration.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

encoding_stats.hh

…

enum_set.hh

…

fix_system_distributed_tables.py

…

flake.lock

…

flake.nix

…

frozen_schema.cc

…

frozen_schema.hh

…

full_position.hh

…

gc_clock.hh

…

gdbinit

…

gen_segmented_compress_params.py

…

generic_server.cc

…

generic_server.hh

…

HACKING.md

commitlog: use separate directory for schema commitlog

2023-03-30 21:55:50 +04:00

hashing_partition_visitor.hh

…

idl-compiler.py

…

inet_address_vectors.hh

…

init.cc

gms: get rid of unused failure_detector

2023-04-21 09:08:27 +03:00

init.hh

configurables: Add optional service lookup to init callback

2023-03-14 17:13:52 +02:00

install-dependencies.sh

install-dependencies.sh: don't use fgrep

2023-05-02 11:15:40 +03:00

install.sh

…

interval.hh

keys: specialize fmt::formatter<partition_key> and friends

2023-04-14 13:21:30 +08:00

keys.cc

keys: specialize fmt::formatter<partition_key> and friends

2023-04-14 13:21:30 +08:00

keys.hh

keys: print "non-utf8-key" when clustering_key is not UTF-8

2023-04-24 10:40:23 +03:00

LICENSE.AGPL

…

log.hh

…

main.cc

main: Load tablet metadata after schema commit log replay

2023-05-21 18:50:11 +03:00

map_difference.hh

…

marshal_exception.hh

…

multishard_mutation_query.cc

readers/multishard: reader_lifecycle_policy: add get_read_range()

2023-03-24 08:40:11 -04:00

multishard_mutation_query.hh

…

mutation_query.cc

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

mutation_query.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

noexcept_traits.hh

…

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

…

partition_slice_builder.cc

…

partition_slice_builder.hh

…

partition_snapshot_reader.hh

…

partition_snapshot_row_cursor.hh

mutation: specialize fmt::formatter<tombstone> and fmt::formatter<shadowable_tombstone>

2023-04-12 10:57:03 +08:00

protocol_server.hh

…

querier.cc

…

querier.hh

…

query_class_config.hh

…

query_id.hh

…

query_ranges_to_vnodes.cc

dht, storage_proxy: Abstract token space splitting

2023-04-24 10:49:36 +02:00

query_ranges_to_vnodes.hh

dht, storage_proxy: Abstract token space splitting

2023-04-24 10:49:36 +02:00

query_result_merger.hh

…

query-request.hh

…

query-result-reader.hh

…

query-result-set.cc

…

query-result-set.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

query-result-writer.hh

…

query-result.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

query.cc

treewide: use fmtlib when printing UUID

2023-03-20 15:38:45 +08:00

range.hh

…

read_context.hh

…

reader_concurrency_semaphore.cc

Merge 'reader_permit: minor improvements to resource consume/release safety' from Botond Dénes

2023-05-14 14:14:23 +03:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: update RAII state guard classes w.r.t. recent permit state name changes

2023-04-19 05:20:42 -04:00

reader_permit.hh

Merge 'reader_permit: minor improvements to resource consume/release safety' from Botond Dénes

2023-05-14 14:14:23 +03:00

README.md

…

real_dirty_memory_accounter.hh

…

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

Fix use-after-move when initializing row cache with dummy entry

2023-03-31 19:46:53 +03:00

row_cache.hh

row_cache: pass "const cache_entry" to operator<<

2023-03-16 07:46:11 +08:00

schema_mutations.cc

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

schema_mutations.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

schema_upgrader.hh

…

scylla_post_install.sh

…

scylla-gdb.py

Merge 'De-globalize storage proxy' from Pavel Emelyanov

2023-04-24 09:38:00 +03:00

SCYLLA-VERSION-GEN

release: prepare for 5.4.0-dev

2023-05-21 10:39:21 +03:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

…

serializer.cc

utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view

2023-05-07 17:17:34 +03:00

serializer.hh

utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view

2023-05-07 17:17:34 +03:00

service_permit.hh

…

setup.py

…

shell.nix

…

sstables_loader.cc

locator: Introduce per-table replication strategy

2023-04-24 10:49:36 +02:00

sstables_loader.hh

…

supervisor.hh

…

table_helper.cc

…

table_helper.hh

…

test.py

test.py: warn and skip for missing unit/boost tests

2023-05-22 12:49:32 +03:00

timeout_config.cc

timeout_config: correct the misconfigured {truncate, other}_timeout

2023-04-24 12:26:14 +03:00

timeout_config.hh

timeout_config: remove unused make_timeout_config()

2023-03-29 20:17:45 +08:00

timestamp.hh

…

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

tombstone_gc_options.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

tombstone_gc.cc

Resurrect optimization to avoid bloom filter checks during compaction

2023-05-18 09:01:50 +03:00

tombstone_gc.hh

Resurrect optimization to avoid bloom filter checks during compaction

2023-05-18 09:01:50 +03:00

tox.ini

…

ubsan-suppressions.supp

…

unimplemented.cc

…

unimplemented.hh

…

validation.cc

validation: Avoid throwing schema lookup

2023-03-24 08:43:48 +02:00

validation.hh

…

version.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

view_info.hh

view_info: Add data_dictionary argument to select_statement()

2023-04-20 11:17:46 +03:00

vint-serialization.cc

…

vint-serialization.hh

…

zstd.cc

zstd: share buffers between compressor instances

2023-04-26 22:09:17 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.2%

Python 26.6%

CMake 0.3%

GAP 0.3%

Shell 0.3%