mirror of https://github.com/scylladb/scylladb.git synced 2026-06-09 00:13:31 +00:00

Go to file

Avi Kivity aa8f135f64 Merge 'Block flush until compaction finishes if sstables accumulate' from Mikołaj Sielużycki

If we reach a situation where flush rate exceeds compaction rate, we may
end up with arbitrarily large number of sstables on disk. If a read is
executed in such case, the amount of memory required is proportional to
the number of sstables for the given shard, which in extreme cases can
lead to OOM.

In the wild, this was observed in 2 scenarios:
- A node with >10 shards creates a keyspace with thousands of tables,
  drops the keyspace and shuts down before compaction finishes. Dropping
  keyspace drops tables, and each dropped table is smp::count writes to
  system.local table with flush after write, which creates tens of
  thousands of sstables. Bootstrap read from system.local will run OOM.
- A failure to agree on table schema (due to a code bug) between nodes
  during repair resulted in excessive flushing of small sstables which
  compaction couldn't keep up with.

In the unit test introduced in this patch series it can be proved that
even hard setting maximum shares for compaction and minimum shares for
flushing doesn't tilt the balance towards compaction enough to prevent
the problem. Since it's a fast producer, slow consumer problem, the
remaining solution is to block producer until the consumer catches up.
If there are too many table runs originating from memtable, we block the
current flush until the number of sstables is reduced (via ongoing
compaction or a truncate operation).

Fixes https://github.com/scylladb/scylla/issues/4116

Changelog:
v5:
- added a nicer way of timing the stalls caused by waiting for flush
- added predicate on signal when waiting for reduction of the number of sstables to correctly handle spurious wake ups
- added comment why we trigger compaction before waiting for sstable count reduction
- removed unnecessary cv.signal from table::stop

v4:
- removed conversion of table::stop to coroutines. It's an orthogonal change and doesn't need to go into this patchset

v3:
- removed unnecessary change to scheduling groups from v2
- moved sstables_changed signalling to suggested place in table::stop
- added log how long the table flush was blocked for
- changed the threshold to max(schema()->max_compaction_threshold(), 32) and comparison to <=

v2:
- Reimplemented waiting algorithm based on reviewers' feedback. It's confined to the table class and it waits in a loop until the number of sstable runs goes below threshold. It uses condition variable which is signaled on sstable set refresh. It handles node shutdown as well.
- Converted table::stop to coroutines.
- Reordered commits so that test is committed after fix, so it doesn't trip up bisection.

Closes #10717

* github.com:scylladb/scylla:
  table: Add test where compaction doesn't keep up with flush rate.
  random_mutation_generator: Add option to specify ks_name and cf_name
  table: Prevent creating unbounded number of sstables

2022-06-15 14:51:08 +03:00

.github

docs: disable link checker

2022-05-09 12:45:28 +02:00

abseil @ 9e408e050f

Update abseil submodule

2022-05-22 23:46:33 +03:00

alternator

alternator: improve error handling when trying to tag a GSI or LSI

2022-06-13 18:14:42 +03:00

api

Update seastar submodule. Unfortunately, also requires two changes

2022-05-11 14:46:30 +02:00

auth

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

cdc

cdc/log.hh: expose is_log_name()

2022-06-10 10:57:12 +03:00

compaction

compaction_manager: task: convert semaphore_aborted to compaction_stopped exception

2022-06-13 16:20:39 +03:00

conf

conf: update the description of the seeds parameter in scylla.yaml

2022-06-02 18:45:11 +03:00

cql3

cql3: expr: make evaluate() return a cql3::raw_value rather than an expr::constant

2022-06-15 08:47:24 +02:00

data_dictionary

data_dictionary: Introduce user types storage

2022-05-05 09:44:26 +03:00

Merge 'various group0 start/stop issues' from Gleb

2022-06-15 11:44:03 +03:00

debug

…

dht

range_streamer: Disable restream logic

2022-05-24 11:24:25 +03:00

direct_failure_detector

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

dist

scylla_util.py: fix "systemctl is-active" causes error

2022-06-13 13:45:50 +03:00

docs

raft: add Raft design nodes to the docs

2022-06-08 12:33:51 +02:00

exceptions

cql3: Remove relation class

2022-05-16 18:17:58 +02:00

gms

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

idl

tracing: Trace slow queries on replicas wrt. parent's clock

2022-02-10 12:03:53 +01:00

index

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

interface

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

lang

wasm: add wasm ABI version 2

2022-03-30 20:49:35 +02:00

libdeflate @ e7e54eab42

…

licenses

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

locator

snitch: Use invoke_on_others() to replicate

2022-05-20 18:16:22 +03:00

message

messaging: add boilerplate to rpc_protocol_impl.hh

2022-06-13 07:29:32 +02:00

mutation_writer

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

raft

raft: server: if add_entry with wait_type::applied successfully returns, ensure state_machine::apply is called for this entry

2022-05-27 12:06:18 +02:00

readers

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

redis

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

reloc

reloc: stop removing entire BUILDDIR

2021-09-19 10:33:33 +03:00

repair

Merge 'Fix stalls during repair with rbno' from Asias He

2022-06-08 06:41:51 +03:00

replica

table: Prevent creating unbounded number of sstables

2022-06-15 10:57:28 +02:00

rust

tests: add rust example

2022-05-11 16:49:31 +02:00

scripts

scripts: make pull_github_pr.sh more universally usable

2022-06-13 08:15:40 +03:00

seastar @ 443e6a9b77

Update seastar submodule

2022-06-15 08:36:08 +03:00

service

Merge 'various group0 start/stop issues' from Gleb

2022-06-15 11:44:03 +03:00

sstables

sstables: processing_result_generator: prefer standard coroutines over the technical specification with clang 14

2022-06-12 20:05:28 +03:00

streaming

streaming: Enable auto off strategy compaction trigger for all rbno ops

2022-06-09 17:10:14 +03:00

swagger-ui @ 12f1da1082

…

test

Merge 'Block flush until compaction finishes if sstables accumulate' from Mikołaj Sielużycki

2022-06-15 14:51:08 +03:00

thrift

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

tools

Update tools/python3 submodule (/usr/lib/sysimage filtering)

2022-06-15 09:27:06 +03:00

tracing

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

transport

treewide: change metric calls from make_derive to make_counter

2022-05-14 12:53:55 +02:00

types

fix "ninja dev-headers"

2022-05-31 23:42:34 +03:00

unified

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

utils

build_id: cache the value

2022-06-02 11:21:05 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: ignore mypy_cache, the python lint cache

2022-04-19 16:48:47 +03:00

.gitmodules

…

.gitorderfile

…

absl-flat_hash_map.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

absl-flat_hash_map.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell_hash.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell_or_collection.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell.cc

atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal

2022-03-07 11:05:30 +02:00

atomic_cell.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

backlog_controller.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes_ostream.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cache_flat_mutation_reader.hh

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

cache_temperature.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

caching_options.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

caching_options.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

canonical_mutation.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

canonical_mutation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cartesian_product.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cell_locking.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

checked-file-impl.hh

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

client_data.cc

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

client_data.hh

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

clocks-impl.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clocks-impl.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_bounds_comparator.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_interval_set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_key_filter.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_ranges_walker.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

CMakeLists.txt

cql3: Remove relation class

2022-05-16 18:17:58 +02:00

collection_mutation.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

collection_mutation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

column_computation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

combine.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compatible_ring_position.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compound_compat.hh

compound_compat.hh: add missing methods of iterator

2022-03-08 15:37:03 +02:00

compound.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compress.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compress.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

concrete_types.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

configure.py

Update seastar submodule

2022-06-15 08:36:08 +03:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

converting_mutation_partition_applier.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

counters.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

counters.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cql_serialization_format.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

db_clock.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

debug.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

default.nix

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

digest_algorithm.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

digester.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

dirty_memory_manager.hh

table: clear: serialize with ongoing flush

2022-04-25 18:57:07 +03:00

Doxyfile

…

duration.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

duration.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

encoding_stats.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

enum_set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

fix_system_distributed_tables.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

frozen_mutation.cc

frozen_mutation: add unfreeze_gently

2022-05-05 13:32:25 +03:00

frozen_mutation.hh

messaging: forward-declare types in messaging_service.hh

2022-06-09 15:52:12 +03:00

frozen_schema.cc

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

frozen_schema.hh

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

gc_clock.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

gdbinit

docs: debugging.md: add a sample gdbinit file

2022-05-11 10:23:08 +03:00

gen_segmented_compress_params.py

treewide: clean up stray license blurbs

2022-02-13 14:16:16 +02:00

generic_server.cc

generic_server: Gentle iterator

2022-02-18 14:25:08 +03:00

generic_server.hh

generic_server.hh: add missing include

2022-04-04 17:31:55 +03:00

HACKING.md

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

hashers.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashers.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashing_partition_visitor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashing.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

idl-compiler.py

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

inet_address_vectors.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

init.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

init.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

install-dependencies.sh

install-dependencies.sh: add scylla-api-client PIP package

2022-06-07 09:43:50 +03:00

install.sh

docker: revert scylla-server.conf service name change

2022-04-03 19:18:18 +03:00

interval.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

intrusive_set_external_comparator.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

keys.cc

replica, partition_snapshot_reader, keys: replace boost::any with std::any

2022-04-28 07:18:53 +03:00

keys.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

LICENSE.AGPL

…

log.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

main.cc

main: stop raft before the migration manager

2022-06-09 09:40:55 +03:00

map_difference.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

marshal_exception.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

multishard_mutation_query.cc

multishard_mutation_query: do_query: couroutinize save_readers lambda

2022-06-08 09:31:17 +03:00

multishard_mutation_query.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_cleaner.hh

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

mutation_compactor.hh

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

mutation_consumer_concepts.hh

introduce the MutationConsumer concept

2022-02-28 17:11:54 +02:00

mutation_fragment_fwd.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: validate range tombstone changes

2022-03-29 13:19:05 +03:00

mutation_fragment_v2.hh

mutation_fragment_v2: range_tombstone_change: add minimal_memory_usage()

2022-04-28 14:11:51 +03:00

mutation_fragment.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_fragment.hh

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

mutation_partition_serializer.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition_serializer.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition_view.cc

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

mutation_partition_view.hh

mutation_partition_view: add accept_gently methods

2022-05-05 13:32:25 +03:00

mutation_partition_visitor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition.cc

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

mutation_partition.hh

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

mutation_query.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_query.hh

query: coroutinize to_data_query_result

2022-05-05 13:32:25 +03:00

mutation_rebuilder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_source_metadata.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation.cc

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

mutation.hh

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

noexcept_traits.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

NOTICE.txt

import wasmtime.hh

2021-09-13 11:01:33 +02:00

ORIGIN

…

partition_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_range_compat.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_slice_builder.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_slice_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_snapshot_reader.hh

fix "ninja dev-headers"

2022-05-31 23:42:34 +03:00

partition_snapshot_row_cursor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_version_list.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_version.cc

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

partition_version.hh

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

position_in_partition.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

protocol_server.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

querier.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

querier.hh

mutation_reader: move mutation source into readers/

2022-03-30 15:42:51 +03:00

query_class_config.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query_ranges_to_vnodes.cc

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_ranges_to_vnodes.hh

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_result_merger.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-request.hh

messaging_service: add verb for count(*) request forwarding

2022-02-01 21:14:41 +01:00

query-result-reader.hh

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

query-result-set.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-writer.hh

query_result_builder: remove v1 support

2022-03-11 09:24:17 +02:00

query-result.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query.cc

query: do not assert in operator<<(ostream&, const forward_result::printer&)

2022-03-09 14:58:11 +01:00

range_tombstone_assembler.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone_change_generator.hh

range_tombstone_change_generator: flush(): add end_of_range

2022-04-21 14:37:10 +03:00

range_tombstone_list.cc

range_tombstone_list: insert_from: correct rev.update range_tombstone in not overlapping case

2022-04-04 22:26:29 +02:00

range_tombstone_list.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone_splitter.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

read_context.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

reader_concurrency_semaphore.cc

treewide: fix compilation issues with fmtlib 8.1.0+

2022-03-16 12:31:50 +03:00

reader_concurrency_semaphore.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

reader_permit.hh

evicatble_reader: avoid preemption pitfall around waiting for readmission

2022-03-15 14:37:22 +02:00

README.md

README.md: update link to docker build instructions

2021-09-01 11:50:11 +03:00

real_dirty_memory_accounter.hh

memtable: move to replica module and namespace

2022-02-23 09:05:16 +02:00

release.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

release.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

reversibly_mergeable.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

row_cache.cc

Revert "Merge 'memtable, cache: Eagerly compact data with tombstones' from Tomasz Grabiec"

2022-06-14 18:06:22 +03:00

row_cache.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

schema_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_fwd.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_mutations.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_mutations.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_registry.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_registry.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_upgrader.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

schema.cc

secondary index: avoid special characters in default index names

2022-03-20 18:33:48 +02:00

schema.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

scylla_post_install.sh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

scylla-gdb.py

scylla-gdb.py: make scylla-threads more flexible

2022-06-13 13:05:27 +03:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN:set release-version value length

2022-02-21 13:28:04 +02:00

seastarx.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serialization_visitors.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer_impl.hh

serializer_impl: add vector_deserializer

2022-05-18 19:10:13 +03:00

serializer.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer.hh

code: Convert is_integral assertions to concepts

2022-02-24 19:44:29 +03:00

service_permit.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

setup.py

…

shell.nix

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

sstables_loader.cc

messaging: forward-declare types in messaging_service.hh

2022-06-09 15:52:12 +03:00

sstables_loader.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

supervisor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

table_helper.cc

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

table_helper.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test.py

test.py: add support for flaky tests

2022-06-10 14:10:21 +03:00

timeout_config.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

timeout_config.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

timestamp.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

to_string.hh

to_string.hh: include <map>

2022-02-17 08:53:48 +02:00

tombstone_gc_extension.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc_options.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc_options.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc.cc

gms: feature_service: remove variable/helper function duplication

2022-05-04 18:59:56 +03:00

tombstone_gc.hh

Merge "tools: cut schema loader free of replica::database" from Botond

2022-03-27 17:01:05 +03:00

tombstone.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tox.ini

…

types.cc

Merge 'cql: Add proper validation for null and unset inside collections send as bound values' from Jan Ciołek

2022-05-19 11:25:24 +03:00

types.hh

Merge 'cql: Add proper validation for null and unset inside collections send as bound values' from Jan Ciołek

2022-05-19 11:25:24 +03:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

unimplemented.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

validation.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

validation.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

version.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

view_info.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

vint-serialization.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

vint-serialization.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

xx_hasher.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

zstd.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%