mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00


Avi Kivity 4f3b8f38e2 Merge "Add effective_replication_map" from Benny

"
The current api design of abstract_replication_strategy
provides a can_yield parameter to calls that may stall
when traversing the token metadata in O(n^2) and even
in O(n) for a large number of token ranges.

But, to use this option the caller must run in a seastar thread.
It can't be used if the caller runs a coroutine or plain
async tasks.

Rather than keep adding threads (e.g. in storage_service::load_and_stream
or storage_service::describe_ring), the series offers an infrastructure
change: precalculating the token->endpoints map once, using an async task,
and keeping the results in an `effective_replication_map` object.
The latter can be used for efficient and stall-free calls, like
get_natural_endpoints, or get_ranges/get_primary_range, replacing their
equivalents in abstract_replication_strategy, and dropping the public
abstract_replication_strategy::calculate_natural_endpoints and its
internal cached_endpoints map.

Other than the performance benefits of:
1. The current calls require running a thread to yield.
Precalculating the map (using async task) allows us to use synchronous calls
without stalling the reactor.

2. The replication maps can and should be shared
between keyspaces that use the same replication strategy.
(Will be sent as a follow-up to the series)

The bigger benefits (courtesy of Avi Kivity) are laying the groundwork for:
1. atomic replication metadata - an operation can capture a replication map once, and then use consistent information from the map without worrying that it changes under its feet. We may even be able to s/inet_address/replica_ptr/ later.

2. establish boundaries on the use of replication information - by making a replication map not visible, and observing when its reference count drops to zero, we can tell when the new replication map is fully in use. When we start writing to a new node we'll be able to locate a point in time where all writes that were not aware of the new node were completed (this is the point where we should start streaming).

Notes:
* The get_natural_endpoints method that uses the effective_replication_map
  is still provided as an abstract_replication_strategy virtual method
  so that local_strategy can override it and provide natural endpoints
  for any search token, even in the absence of token_metadata, when
  called early-on, before token_metadata has been established.

  The effective_replication_map materializes the replication strategy
  over a given set of replication strategy options and token_metadata.
  Whenever either of those changes for a keyspace, we make a new
  effective_replication_map and keep it in the keyspace for later use.

  Methods that depend on an ad-hoc token_metadata (e.g. during
  node operations like bootstrap or replace) are still provided
  by abstract_replication_strategy.

TODO:
- effective_replication_map registry
- Move pending ranges from token_metadata to replication map
- get rid of abstract_replication_strategy::get_range_addresses(token_metadata&)
  - calculate replication map and use it instead.

Test: unit(dev, debug)
Dtest: next-gating, bootstrap_test.py update_cluster_layout_tests.py alternator_tests.py -a 'dtest-full,!dtest-heavy' (release)
"
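
The idea described in the message above (compute the token->endpoints map once, then serve lookups synchronously from an immutable snapshot) can be illustrated with a minimal, self-contained sketch. Everything in it is an assumption made for illustration: `token`, `endpoint`, `replication_map_sketch` and `make_replication_map` are hypothetical stand-ins, not the types or signatures introduced by the series, and the toy "next rf distinct nodes on the ring" rule merely stands in for a real replication strategy. In the actual series the precalculation runs as an async task; here it is a plain loop.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-ins for illustration only; not ScyllaDB's real types.
using token = std::int64_t;
using endpoint = std::string;
using endpoint_list = std::vector<endpoint>;

// Immutable snapshot of token -> replicas, computed once up front.
class replication_map_sketch {
    // Keyed by the end token of each range on the ring.
    std::map<token, endpoint_list> _replicas;
public:
    explicit replication_map_sketch(std::map<token, endpoint_list> replicas)
        : _replicas(std::move(replicas)) {
        assert(!_replicas.empty());
    }

    // O(log n) lookup with no need to yield: pick the first ring token
    // that is >= the search token, wrapping around to the first token.
    const endpoint_list& get_natural_endpoints(token t) const {
        auto it = _replicas.lower_bound(t);
        if (it == _replicas.end()) {
            it = _replicas.begin();
        }
        return it->second;
    }
};

// Callers hold a shared pointer, so a snapshot stays valid and unchanged
// for as long as any operation still uses it.
using replication_map_ptr = std::shared_ptr<const replication_map_sketch>;

// Precalculates the replicas of every ring token using a toy rule:
// walk the ring clockwise and collect the next `rf` distinct nodes.
replication_map_ptr make_replication_map(const std::map<token, endpoint>& ring,
                                         std::size_t rf) {
    std::map<token, endpoint_list> replicas;
    for (auto it = ring.begin(); it != ring.end(); ++it) {
        endpoint_list eps;
        auto walk = it;
        for (std::size_t steps = 0; steps < ring.size() && eps.size() < rf; ++steps) {
            if (std::find(eps.begin(), eps.end(), walk->second) == eps.end()) {
                eps.push_back(walk->second);
            }
            if (++walk == ring.end()) {
                walk = ring.begin();
            }
        }
        replicas.emplace(it->first, std::move(eps));
    }
    return std::make_shared<replication_map_sketch>(std::move(replicas));
}
```

An operation would capture a `replication_map_ptr` once and perform all of its lookups against that snapshot; publishing a newer map does not affect operations that still hold the older one, which is the property the "atomic replication metadata" point relies on.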

* tag 'effective_replication_strategy-v6' of github.com:bhalevy/scylla: (44 commits)
  effective_replication_map: add get_range_addresses
  abstract_replication_strategy: get rid of shared_token_metadata member and ctor param
  abstract_replication_strategy: recognized_options: pass const topology&
  abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map
  token_metadata: get rid of now-unused sync methods
  abstract_replication_strategy: get rid of do_calculate_natural_endpoints
  abstract_replication_strategy: futurize get_*address_ranges
  abstract_replication_strategy: futurize get_range_addresses
  abstract_replication_strategy: futurize get_ranges(inet_address ep, token_metadata_ptr)
  abstract_replication_strategy: move get_ranges and get_primary_ranges* to effective_replication_map
  compaction_manager: pass owned_ranges via cleanup/upgrade options
  abstract_replication_strategy: get rid of cached_endpoints
  all replication strategies: get rid of do_get_natural_endpoints
  storage_proxy: use effective_replication_map token_metadata_ptr along with endpoints
  abstract_replication_strategy: move get_natural_endpoints_without_node_being_replaced to effective_replication_map
  storage_service: bootstrap: add log messages
  storage_service: get_mutable_token_metadata_ptr: always invalidate_cached_rings
  shared_token_metadata: set: check version monotonicity
  token_metadata: use static ring version
  token_metadata: get rid of copy constructor and assignment operator
  ...

2021-10-13 20:28:30 +03:00

.github

CODEOWNERS: some fixes and additions

2021-09-29 18:07:07 +03:00

abseil @ 9c6a50fdd8

Update abseil submodule

2021-02-08 15:41:46 +02:00

alternator

alternator: disambiguate attrs_to_get in table_requests

2021-10-06 14:55:48 +03:00

api

storage_service, api: Move set-tables-autocompaction back into API

2021-10-11 11:13:59 +03:00

auth

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

cdc

cdc: adjust type of streams_count

2021-10-06 14:56:00 +03:00

compaction

compaction_manager: pass owned_ranges via cleanup/upgrade options

2021-10-13 14:17:46 +03:00

conf

scylla.yaml: refresh list of experimental features

2021-10-13 20:24:02 +03:00

cql3

abstract_replication_strategy: get rid of shared_token_metadata member and ctor param

2021-10-13 16:10:06 +03:00

abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map

2021-10-13 16:10:06 +03:00

debug

…

dht

effective_replication_map: add get_range_addresses

2021-10-13 16:10:06 +03:00

dist

dist: raise fs.file-max and fs.nr_open to enough size for scylla

2021-10-12 12:47:35 +03:00

docs

Merge 'IDL: support generating boilerplate code for RPC verbs' from Pavel Solodovnikov

2021-10-05 18:05:24 +03:00

exceptions

utils: exceptions: convert sprint() to format()

2021-07-12 11:17:57 +03:00

gms

gossiper: Send generation number with shutdown message

2021-09-27 11:08:43 +03:00

idl

system_keyspace: prepare forward-declared members

2021-09-13 15:11:26 +03:00

index

flat_mutation_reader: get rid of timeout parameter

2021-08-24 16:30:51 +03:00

interface

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

lang

wasm: Localize it to database usage

2021-09-15 17:35:17 +03:00

libdeflate @ e7e54eab42

…

licenses

…

locator

effective_replication_map: add get_range_addresses

2021-10-13 16:10:06 +03:00

message

Merge 'IDL: support generating boilerplate code for RPC verbs' from Pavel Solodovnikov

2021-10-05 18:05:24 +03:00

mutation_writer

mutation_writer: partition_based_splitting_writer: limit number of max buckets

2021-09-29 16:31:29 +03:00

raft

raft: disambiguate promise name in raft::active_read

2021-10-10 18:16:50 +03:00

redis

build: enable -Winconsistent-missing-override warning

2021-09-15 12:55:54 +03:00

reloc

reloc: stop removing entire BUILDDIR

2021-09-19 10:33:33 +03:00

repair

abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map

2021-10-13 16:10:06 +03:00

scripts

scripts/pull_github_pr.sh: don't guess git remote name

2021-10-04 12:32:39 +03:00

seastar @ 994b4b5a0c

Update seastar submodule

2021-10-04 15:36:45 +03:00

service

Merge "Add effective_replication_map" from Benny

2021-10-13 20:28:30 +03:00

sstables

Merge 'tools/scylla-sstable: allow opening sstables from any path' from Botond Dénes

2021-10-12 12:50:11 +03:00

streaming

Merge "flat_mutation_reader: keep timeout in permit" from Benny

2021-08-25 17:51:10 +03:00

swagger-ui @ 12f1da1082

…

test

Merge "Add effective_replication_map" from Benny

2021-10-13 20:28:30 +03:00

thrift

cql3: deinline non-trivial methods in selection.hh

2021-10-05 12:58:55 +02:00

tools

token_metadata, storage_service: unify token_metadata_lock and merge_lock.

2021-10-13 13:01:25 +03:00

tracing

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

transport

transport: respond with overloaded exception during shedding

2021-10-07 15:38:40 +03:00

types

cql3: types: Optimize abstract_type::contains_collection

2021-09-24 13:45:38 +02:00

unified

unified: fix handling --supervisor option

2021-08-18 13:17:08 +03:00

utils

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: add compile_commands.json

2021-09-02 13:37:35 +03:00

.gitmodules

…

.gitorderfile

…

absl-flat_hash_map.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

absl-flat_hash_map.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell_hash.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell_or_collection.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell.hh

atomic_cell: change compare_atomic_cell_for_merge() to std::strong_ordering

2021-07-28 13:26:27 +03:00

backlog_controller.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes_ostream.hh

repair: row_level: clear_gently: clear_gently each repair_row

2021-07-01 19:16:11 +03:00

bytes.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes.hh

bytes: compare_unsigned(): change to std::strong_ordering

2021-07-28 13:21:01 +03:00

cache_flat_mutation_reader.hh

build: enable -Winconsistent-missing-override warning

2021-09-15 12:55:54 +03:00

cache_temperature.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

caching_options.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

caching_options.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

canonical_mutation.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

canonical_mutation.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

cartesian_product.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cell_locking.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

checked-file-impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clocks-impl.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clocks-impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_bounds_comparator.hh

clustering_bounds_comparator: add reverse_kind()

2021-09-09 11:49:05 +03:00

clustering_interval_set.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_key_filter.hh

clustering_key_filter: clustering_key_filter_ranges owning constructor

2021-09-30 12:10:52 +02:00

clustering_ranges_walker.hh

sstables: remove unused uppermost_bound from clustering_ranges_walker and mutation_fragment_filter

2021-08-11 10:54:59 +02:00

CMakeLists.txt

lua: move to lang/ directory

2021-09-13 11:01:33 +02:00

collection_mutation.cc

compaction: Move compaction_garbage_collector.hh to compaction dir

2021-08-07 08:07:09 +08:00

collection_mutation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

column_computation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

combine.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compatible_ring_position.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

compound_compat.hh

treewide: remove redundant "x <=> 0" compares

2021-07-28 13:30:32 +03:00

compound.hh

treewide: remove redundant "x <=> 0" compares

2021-07-28 13:30:32 +03:00

compress.cc

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

compress.hh

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

concrete_types.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

configure.py

Merge 'treewide: improve compatibility with gcc 11' from Avi Kivity

2021-10-11 16:54:01 +03:00

connection_notifier.cc

system_keyspace: prepare forward-declared members

2021-09-13 15:11:26 +03:00

connection_notifier.hh

system_keyspace: prepare forward-declared members

2021-09-13 15:11:26 +03:00

CONTRIBUTING.md

CONTRIBUTING.md: add the requirement for self-contained headers

2021-05-05 15:10:46 +03:00

converting_mutation_partition_applier.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

converting_mutation_partition_applier.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

counters.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

counters.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cql_serialization_format.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

database_fwd.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

database.cc

abstract_replication_strategy: get rid of shared_token_metadata member and ctor param

2021-10-13 16:10:06 +03:00

database.hh

abstract_replication_strategy: get rid of shared_token_metadata member and ctor param

2021-10-13 16:10:06 +03:00

db_clock.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

debug.hh

main, scylla-gdb, cql-test-env: Unify debug::the_database

2021-09-15 17:35:30 +03:00

default.nix

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

digest_algorithm.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

digester.hh

hasher: More picky noexcept marking of feed_hash()

2021-07-07 12:00:16 +03:00

dirty_memory_manager.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

distributed_loader.cc

distributed_loader, utils: Move verify_owner_and_mode

2021-10-11 11:03:51 +03:00

distributed_loader.hh

distributed_loader, utils: Move verify_owner_and_mode

2021-10-11 11:03:51 +03:00

Doxyfile

…

duration.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

duration.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

encoding_stats.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

enum_set.hh

enum_set: add toggle()

2021-09-13 18:05:11 +03:00

fix_system_distributed_tables.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

flat_mutation_reader_v2.hh

flat_mutation_reader: remove unused reserve_one method

2021-09-29 17:22:29 +02:00

flat_mutation_reader.cc

flat_mutation_reader: make_reversing_reader(): add convenience stored slice

2021-09-28 17:03:57 +03:00

flat_mutation_reader.hh

flat_mutation_reader: mention reversed schema in make_reversing_reader docstring

2021-09-30 12:10:52 +02:00

frozen_mutation.cc

flat_mutation_reader: get rid of timeout parameter

2021-08-24 16:30:51 +03:00

frozen_mutation.hh

repair: row_level: clear_gently: clear_gently each repair_row

2021-07-01 19:16:11 +03:00

frozen_schema.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

frozen_schema.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gc_clock.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gen_segmented_compress_params.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

generic_server.cc

transport, generic_server: Remove no longer used functionality

2021-07-22 18:41:32 +03:00

generic_server.hh

transport, generic_server: Remove no longer used functionality

2021-07-22 18:41:32 +03:00

HACKING.md

docs: clean up codeowners

2021-09-22 18:55:25 +03:00

hashers.cc

build, treewide: enable -Wpessimizing-move warning

2021-07-08 17:52:34 +03:00

hashers.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

hashing_partition_visitor.hh

build: enable -Winconsistent-missing-override warning

2021-09-15 12:55:54 +03:00

hashing.hh

hasher: More picky noexcept marking of feed_hash()

2021-07-07 12:00:16 +03:00

idl-compiler.py

idl: support generating boilerplate code for RPC verbs

2021-09-30 02:21:57 +03:00

inet_address_vectors.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

init.cc

gossiper, main: Turn init_gossiper into get_seeds_from_config

2021-09-22 13:13:06 +03:00

init.hh

gossiper, main: Turn init_gossiper into get_seeds_from_config

2021-09-22 13:13:06 +03:00

install-dependencies.sh

install-dependencies.sh: add scylla-driver to relocatable python3

2021-09-02 11:52:47 +03:00

install.sh

install.sh: add supervisor support

2021-07-27 12:51:29 +03:00

interval.hh

interval: constrain comparator parameters

2021-09-10 16:43:16 +02:00

intrusive_set_external_comparator.hh

everywhere: make deferred actions noexcept

2021-08-22 21:11:52 +03:00

keys.cc

clustering_bounds_comparator: add reverse_kind()

2021-09-09 11:49:05 +03:00

keys.hh

treewide: remove redundant "x <=> 0" compares

2021-07-28 13:30:32 +03:00

LICENSE.AGPL

…

lister.cc

…

lister.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

log.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

main.cc

token_metadata, storage_service: unify token_metadata_lock and merge_lock.

2021-10-13 13:01:25 +03:00

map_difference.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

marshal_exception.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

memtable-sstable.hh

memtable: migrate off the global reader concurrency semaphore

2021-07-08 12:31:36 +03:00

memtable.cc

memtable: enable native reversing

2021-10-10 20:38:18 +02:00

memtable.hh

memtable: enable native reversing

2021-10-10 20:38:18 +02:00

multishard_mutation_query.cc

querier: consume_page(): remove now unused max_size parameter

2021-09-29 12:15:48 +03:00

multishard_mutation_query.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_cleaner.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_compactor.hh

mutation_compactor: collect stats about compacted data

2021-09-22 13:59:19 +03:00

mutation_consumer_concepts.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: disambiguate schema member definition

2021-07-27 11:55:42 +03:00

mutation_fragment_v2.hh

mutation_fragment{_v2}: MutationFragmentConsumer: allow for abstract consumer

2021-08-25 13:12:41 +03:00

mutation_fragment.cc

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

mutation_fragment.hh

mutation_fragment{_v2}: MutationFragmentConsumer: allow for abstract consumer

2021-08-25 13:12:41 +03:00

mutation_partition_serializer.cc

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

mutation_partition_serializer.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition_view.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_partition_view.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition_visitor.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition.cc

cache_tracker: remove unused parameter from on_remove

2021-09-30 13:03:13 +03:00

mutation_partition.hh

mutation_partition: Return immutable collection for range tombstones

2021-07-27 20:06:53 +03:00

mutation_query.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_query.hh

mutation_query: reconcilable_result_builder: document reverse query preconditions

2021-09-28 17:03:57 +03:00

mutation_reader.cc

mutation_reader: evictable_reader: add reverse read support

2021-09-28 17:03:57 +03:00

mutation_reader.hh

sstables: mx: implement reversed single-partition reads

2021-10-04 15:24:12 +02:00

mutation_rebuilder.hh

mutation_rebuilder: make it standalone

2021-09-09 15:42:15 +03:00

mutation_source_metadata.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation.cc

mutation: introduce reverse()

2021-09-09 15:42:15 +03:00

mutation.hh

mutation: introduce reverse()

2021-09-09 15:42:15 +03:00

noexcept_traits.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

NOTICE.txt

import wasmtime.hh

2021-09-13 11:01:33 +02:00

ORIGIN

…

partition_builder.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

partition_range_compat.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_slice_builder.cc

partition_slice_builder: add range mutating methods

2021-09-09 14:16:21 +03:00

partition_slice_builder.hh

partition_slice_builder(): add with_option_toggled()

2021-09-13 18:05:11 +03:00

partition_snapshot_reader.hh

partition_snapshot_reader: pop_range_tombstone returns reference

2021-10-10 20:38:18 +02:00

partition_snapshot_row_cursor.hh

mutation_partition: Return immutable collection for rows

2021-07-27 20:06:53 +03:00

partition_version_list.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_version.cc

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

partition_version.hh

row_cache: Consume range tombstones incrementally

2021-07-26 17:48:05 +02:00

position_in_partition.hh

treewide: handle switch statements that return

2021-10-10 18:16:50 +03:00

querier.cc

reader_concurrency_semaphore: adjust reactivated reader timeout

2021-08-24 16:30:51 +03:00

querier.hh

querier: consume_page(): remove now unused max_size parameter

2021-09-29 12:15:48 +03:00

query_class_config.hh

table, database: query, mutation_query: remove unnecessary class_config param

2021-09-14 13:39:56 +02:00

query_result_merger.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-request.hh

query: reverse clustering_range

2021-10-05 16:47:04 +02:00

query-result-reader.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

query-result-set.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result-set.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

query-result-writer.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query.cc

query: reverse clustering_range

2021-10-05 16:47:04 +02:00

range_tombstone_assembler.hh

flat_mutation_reader: Introduce adaptors between v1 and v2 of mutation fragment stream

2021-06-15 13:10:47 +02:00

range_tombstone_change_generator.hh

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

range_tombstone_list.cc

range_tombstone: Move linkage into range_tombstone_entry

2021-09-03 19:34:45 +03:00

range_tombstone_list.hh

range_tombstone: Move linkage into range_tombstone_entry

2021-09-03 19:34:45 +03:00

range_tombstone_splitter.hh

flat_mutation_reader: Trim range tombstones in make_flat_mutation_reader_from_fragments()

2021-06-16 00:23:49 +02:00

range_tombstone.cc

range_tombstone_accumulator: drop _reversed flag

2021-09-09 15:42:15 +03:00

range_tombstone.hh

range_tombstone_accumulator: drop _reversed flag

2021-09-09 15:42:15 +03:00

range.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

read_context.hh

flat_mutation_reader: get rid of timeout parameter

2021-08-24 16:30:51 +03:00

reader_concurrency_semaphore.cc

Prepare for inheriting from reader_concurrency_semaphore

2021-09-26 12:57:48 +03:00

reader_concurrency_semaphore.hh

Prepare for inheriting from reader_concurrency_semaphore

2021-09-26 12:57:48 +03:00

reader_permit.hh

reader_permit: make query max result size accessible from the permit

2021-09-14 13:27:25 +02:00

README.md

README.md: update link to docker build instructions

2021-09-01 11:50:11 +03:00

real_dirty_memory_accounter.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

release.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

release.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reversibly_mergeable.hh

everywhere: cleanup defer.hh includes

2021-08-22 21:11:39 +03:00

row_cache.cc

treewide: move reversing to the mutation sources

2021-09-29 12:15:45 +03:00

row_cache.hh

treewide: move reversing to the mutation sources

2021-09-29 12:15:45 +03:00

schema_builder.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_fwd.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_mutations.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_mutations.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_registry.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_registry.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_upgrader.hh

Adapt flat_mutation_reader_v2 to the new version of the API

2021-06-15 13:10:47 +02:00

schema.cc

schema: add get_reversed()

2021-09-22 18:55:25 +03:00

schema.hh

schema: add get_reversed()

2021-09-22 18:55:25 +03:00

scylla_post_install.sh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

scylla-gdb.py

sstables, gdb: Retire usage of sstable_tracker

2021-10-07 14:40:47 +02:00

SCYLLA-VERSION-GEN

build: allow to run SCYLLA-VERSION-GEN utility out of source

2021-09-02 13:04:34 +03:00

seastarx.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serialization_visitors.hh

…

serializer_impl.hh

serialize: add serialized for std::monostate

2021-08-25 08:19:25 +03:00

serializer.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serializer.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

service_permit.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

setup.py

…

shell.nix

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

sstables_loader.cc

storage_service, sstables_loader: use effective_replication_map to get_natural_endpoints

2021-10-13 13:50:27 +03:00

sstables_loader.hh

sstables_loader: Accept the sstables loading code

2021-10-11 11:08:21 +03:00

supervisor.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

table_helper.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

table_helper.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

table.cc

avoid race between compaction and table stop

2021-10-07 14:36:39 +03:00

test.py

test.py: Always disable boost colored output

2021-07-30 12:22:31 +03:00

timeout_config.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timeout_config.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timestamp.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

to_string.hh

to_string: Add formatter for strong_ordering

2021-06-08 11:33:04 +03:00

tombstone.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

tox.ini

…

types.cc

types: remove recursive constraint in deserialize_value

2021-10-10 18:16:50 +03:00

types.hh

cql3: types: Optimize abstract_type::contains_collection

2021-09-24 13:45:38 +02:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

unimplemented.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

user_types_metadata.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

validation.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

validation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

version.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

view_info.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

vint-serialization.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

vint-serialization.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

xx_hasher.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

zstd.cc

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operation of open-source ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%