mirror of https://github.com/scylladb/scylladb.git synced 2026-05-12 19:02:12 +00:00

Raphael S. Carvalho 284dd21ef7 compaction_manager: Fix race when selecting sstables for rewrite operations

Rewrite operations are scrub, cleanup and upgrade.

Race can happen because 'selection of sstables' and 'mark sstables as
compacting' are decoupled. So any deferring point in between can lead
to a parallel compaction picking the same files. After commit 2cf0c4bbf,
files are marked as compacting before rewrite starts, but it didn't
take into account the commit c84217ad which moved retrieval of
candidates to a deferring thread, before rewrite_sstables() is even
called.

Scrub isn't affected by this because it uses a coarse grained approach
where whole operation is run with compaction disabled, which isn't good
because regular compaction cannot run until its completion.

From now on, selection of files and marking them as compacting will
be serialized by running them with compaction disabled.
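
A minimal sketch of the serialization described above, not the actual ScyllaDB code. All names here (`table_state`, `get_candidates`, `mark_compacting`, `regular_compaction_pick`, `select_for_rewrite`) are hypothetical, and the deferring point is modelled by a plain callback rather than a Seastar future. It only illustrates why keeping compaction disabled across both selection and marking stops a parallel compaction from picking the same files.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for a table's sstable bookkeeping (not ScyllaDB types).
struct table_state {
    std::set<std::string> sstables;    // every sstable that belongs to the table
    std::set<std::string> compacting;  // sstables already claimed by a compaction
    bool compaction_disabled = false;  // while true, regular compaction picks nothing
};

// Candidates are the sstables that no compaction has claimed yet.
inline std::vector<std::string> get_candidates(const table_state& t) {
    std::vector<std::string> out;
    for (const auto& name : t.sstables) {
        if (t.compacting.count(name) == 0) {
            out.push_back(name);
        }
    }
    return out;
}

// Claim files for one compaction. Claiming a file twice is the race being modelled.
inline void mark_compacting(table_state& t, const std::vector<std::string>& files) {
    for (const auto& name : files) {
        bool inserted = t.compacting.insert(name).second;
        assert(inserted && "sstable picked by two compactions");
        (void)inserted;  // silence unused-variable warnings when NDEBUG is set
    }
}

// Regular compaction claims whatever is unclaimed, unless compaction is disabled.
inline std::vector<std::string> regular_compaction_pick(table_state& t) {
    if (t.compaction_disabled) {
        return {};
    }
    auto files = get_candidates(t);
    mark_compacting(t, files);
    return files;
}

// Selection and marking both run with compaction disabled, so the deferring
// point between them (the yield callback) cannot let regular compaction in.
template <typename Yield>
std::vector<std::string> select_for_rewrite(table_state& t, Yield&& yield) {
    t.compaction_disabled = true;
    auto files = get_candidates(t);
    yield();  // deferring point: other tasks may run here
    mark_compacting(t, files);
    t.compaction_disabled = false;
    return files;
}
```

If `select_for_rewrite` is called with a callback that invokes `regular_compaction_pick` on the same `table_state`, the regular compaction comes back empty-handed and the rewrite claims every candidate. Without the `compaction_disabled` flag, the same callback would claim the files first and the assertion in `mark_compacting` would fire.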

Now cleanup will also retrieve sstables with compaction disabled,
meaning it will no longer leave uncleaned files behind, which is
important to avoid data resurrection if node regains ownership of
data in uncleaned files.

Fixes #8168.
Refs #8155.

[backport notes:
- minor conflict around run_with_compaction_disabled()
- bumped into our old friend
  https://gcc.gnu.org/bugzilla/show_bug.cgi?id=95111,
so I had to use std::ref() on local copy of lambda
- with the yielding part of candidate retrieval now happening in
rewrite_sstables(), task registration is moved to after run_with_
compaction_disabled() call, so the latter won't incorrectly try
to stop the task that called it, which triggers an assert in
debug mode.
]

Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20211129133107.53011-1-raphaelsc@scylladb.com>
(cherry picked from commit 80a1ebf0f3)

Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>

Closes #10963

2022-07-13 18:45:36 +03:00

.github

Review docs config

2021-10-22 13:34:56 +01:00

abseil @ f70eadadd7

Update abseil submodule

2021-10-28 16:22:18 +03:00

alternator

alternator: forbid empty AttributesToGet

2022-07-03 13:36:02 +03:00

api

repair: Return HTTP 400 when repiar id is not found

2021-11-03 17:15:40 +02:00

auth

auth: replace seastar::sprint() with fmt::format()

2021-10-27 14:29:32 +03:00

cdc

cdc: check_and_repair_cdc_streams: regenerate if too many streams are present

2022-07-07 18:53:14 +02:00

compaction

compaction_manager: Fix race when selecting sstables for rewrite operations

2022-07-13 18:45:36 +03:00

conf

scylla.yaml: refresh list of experimental features

2021-10-13 20:24:02 +03:00

cql3

prepared_statements: Invalidate batch statement too

2022-05-08 12:33:00 +03:00

view: Fix trace-state pointer use after move

2022-07-12 14:21:11 +03:00

debug

…

dht

effective_replication_map: add get_range_addresses

2021-10-13 16:10:06 +03:00

dist

scylla_coredump_setup: support new format of Storage field

2022-07-03 13:55:25 +03:00

docs

fix some typo in docs

2021-11-02 19:59:16 +03:00

exceptions

utils: exceptions: convert sprint() to format()

2021-07-12 11:17:57 +03:00

gms

Merge "Run gossiper message handlers in a gate" from Pavel E

2021-11-19 07:25:26 +02:00

idl

system_keyspace: prepare forward-declared members

2021-09-13 15:11:26 +03:00

index

rjson, alternator: rename set() functions add()

2021-11-04 16:35:38 +01:00

interface

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

lang

wasm: Localize it to database usage

2021-09-15 17:35:17 +03:00

libdeflate @ e7e54eab42

…

licenses

…

locator

locator: replace seastar::sprint() with fmt::format()

2021-10-27 17:02:00 +03:00

message

messaging: do isolate default tenants

2022-07-05 13:42:10 +03:00

mutation_writer

mutation_writer/feed_writer: don't drop readers with small amount of content

2021-11-09 14:13:21 +02:00

raft

raft: disambiguate promise name in raft::active_read

2021-10-10 18:16:50 +03:00

redis

redis: replace seastar::sprint() with fmt::format()

2021-10-27 17:02:00 +03:00

reloc

reloc: stop removing entire BUILDDIR

2021-09-19 10:33:33 +03:00

repair

Merge "repair: make sure there is one permit per repair with count res" from Botond

2022-01-17 16:02:55 +02:00

scripts

build: have configure.py create compile_commands.json

2021-11-05 11:28:37 +02:00

seastar @ 6217d6ff4e

Update seastar submodule (json crash in describe_ring)

2022-06-08 16:49:53 +03:00

service

Merge 'service: storage_service: announce new CDC generation immediately with RBNO' from Kamil Braun

2022-03-16 12:27:24 +01:00

sstables

sstable: partition_index_cache: Fix abort on bad_alloc during page loading

2022-05-30 13:00:46 +03:00

streaming

streaming: replace seastar::sprint() with fmt::format()

2021-10-27 17:02:00 +03:00

swagger-ui @ 12f1da1082

…

test

view: exclude using static columns in the view filter

2022-07-11 17:07:22 +03:00

thrift

Merge 'Convert last uses of sprint() to fmt::format()' from Avi Kivity

2021-10-28 22:33:23 +03:00

tools

Update tools/java submodule (bad IPv6 addresses in nodetool)

2022-04-28 11:35:09 +03:00

tracing

tracing: replace seastar::sprint() with fmt::format()

2021-10-27 17:02:00 +03:00

transport

CQL: Replace assert by exception on invalid auth opcode

2022-05-10 14:03:03 +02:00

types

cql3: types: Optimize abstract_type::contains_collection

2021-09-24 13:45:38 +02:00

unified

unified: fix handling --supervisor option

2021-08-18 13:17:08 +03:00

utils

loading_cache: Make invalidation take immediate effect

2022-05-04 15:38:11 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

build: have configure.py create compile_commands.json

2021-11-05 11:28:37 +02:00

.gitmodules

Point seastar submodule at scylla-seastar.git

2022-01-30 20:01:12 +02:00

.gitorderfile

…

absl-flat_hash_map.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

absl-flat_hash_map.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell_hash.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell_or_collection.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell.cc

atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal

2022-03-24 18:08:07 +02:00

atomic_cell.hh

atomic_cell: change compare_atomic_cell_for_merge() to std::strong_ordering

2021-07-28 13:26:27 +03:00

backlog_controller.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes_ostream.hh

repair: row_level: clear_gently: clear_gently each repair_row

2021-07-01 19:16:11 +03:00

bytes.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes.hh

bytes: compare_unsigned(): change to std::strong_ordering

2021-07-28 13:21:01 +03:00

cache_flat_mutation_reader.hh

row-cache: Handle exception (un)safety of rows_entry insertion

2021-12-14 15:53:42 +02:00

cache_temperature.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

caching_options.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

caching_options.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

canonical_mutation.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

canonical_mutation.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

cartesian_product.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cell_locking.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

checked-file-impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clocks-impl.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clocks-impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_bounds_comparator.hh

clustering_bounds_comparator: add reverse_kind()

2021-09-09 11:49:05 +03:00

clustering_interval_set.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_key_filter.hh

database, treewide: Introduce partition_slice::is_reversed()

2021-10-14 12:39:16 +03:00

clustering_ranges_walker.hh

sstables: remove unused uppermost_bound from clustering_ranges_walker and mutation_fragment_filter

2021-08-11 10:54:59 +02:00

CMakeLists.txt

lua: move to lang/ directory

2021-09-13 11:01:33 +02:00

collection_mutation.cc

compaction: Move compaction_garbage_collector.hh to compaction dir

2021-08-07 08:07:09 +08:00

collection_mutation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

column_computation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

combine.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compatible_ring_position.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

compound_compat.hh

treewide: remove redundant "x <=> 0" compares

2021-07-28 13:30:32 +03:00

compound.hh

treewide: remove redundant "x <=> 0" compares

2021-07-28 13:30:32 +03:00

compress.cc

compress: convert fmt::sprintf() to fmt::format()

2021-10-27 17:02:00 +03:00

compress.hh

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

concrete_types.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

configure.py

build: have configure.py create compile_commands.json

2021-11-05 11:28:37 +02:00

connection_notifier.cc

system_keyspace: prepare forward-declared members

2021-09-13 15:11:26 +03:00

connection_notifier.hh

system_keyspace: prepare forward-declared members

2021-09-13 15:11:26 +03:00

CONTRIBUTING.md

CONTRIBUTING.md: add the requirement for self-contained headers

2021-05-05 15:10:46 +03:00

converting_mutation_partition_applier.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

converting_mutation_partition_applier.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

counters.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

counters.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cql_serialization_format.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

database_fwd.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

database.cc

Merge 'replica/database: drop_column_family(): properly cleanup stale querier cache entries' from Botond Dénes

2022-05-01 17:11:52 +03:00

database.hh

Merge 'replica/database: drop_column_family(): properly cleanup stale querier cache entries' from Botond Dénes

2022-05-01 17:11:52 +03:00

db_clock.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

debug.hh

main, scylla-gdb, cql-test-env: Unify debug::the_database

2021-09-15 17:35:30 +03:00

default.nix

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

digest_algorithm.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

digester.hh

hasher: More picky noexcept marking of feed_hash()

2021-07-07 12:00:16 +03:00

dirty_memory_manager.hh

table: clear: serialize with ongoing flush

2022-05-15 13:43:43 +03:00

distributed_loader.cc

distributed_loader, utils: Move verify_owner_and_mode

2021-10-11 11:03:51 +03:00

distributed_loader.hh

distributed_loader, utils: Move verify_owner_and_mode

2021-10-11 11:03:51 +03:00

Doxyfile

…

duration.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

duration.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

encoding_stats.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

enum_set.hh

enum_set: add toggle()

2021-09-13 18:05:11 +03:00

fix_system_distributed_tables.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

flat_mutation_reader_v2.hh

flat_mutation_reader: remove unused reserve_one method

2021-09-29 17:22:29 +02:00

flat_mutation_reader.cc

database, treewide: Introduce partition_slice::is_reversed()

2021-10-14 12:39:16 +03:00

flat_mutation_reader.hh

flat_mutation_reader: mention reversed schema in make_reversing_reader docstring

2021-09-30 12:10:52 +02:00

frozen_mutation.cc

flat_mutation_reader: get rid of timeout parameter

2021-08-24 16:30:51 +03:00

frozen_mutation.hh

repair: row_level: clear_gently: clear_gently each repair_row

2021-07-01 19:16:11 +03:00

frozen_schema.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

frozen_schema.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gc_clock.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gen_segmented_compress_params.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

generic_server.cc

generic_server: Keep server alive during conn background processing

2021-11-17 10:21:11 +02:00

generic_server.hh

transport, generic_server: Remove no longer used functionality

2021-07-22 18:41:32 +03:00

HACKING.md

fix runtime errors

2021-10-13 15:08:24 +03:00

hashers.cc

build, treewide: enable -Wpessimizing-move warning

2021-07-08 17:52:34 +03:00

hashers.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

hashing_partition_visitor.hh

build: enable -Winconsistent-missing-override warning

2021-09-15 12:55:54 +03:00

hashing.hh

hasher: More picky noexcept marking of feed_hash()

2021-07-07 12:00:16 +03:00

idl-compiler.py

idl: support generating boilerplate code for RPC verbs

2021-09-30 02:21:57 +03:00

inet_address_vectors.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

init.cc

gossiper, main: Turn init_gossiper into get_seeds_from_config

2021-09-22 13:13:06 +03:00

init.hh

gossiper, main: Turn init_gossiper into get_seeds_from_config

2021-09-22 13:13:06 +03:00

install-dependencies.sh

install-dependencies.sh: add scylla-driver to relocatable python3

2021-09-02 11:52:47 +03:00

install.sh

docker: revert scylla-server.conf service name change

2022-04-05 12:42:36 +03:00

interval.hh

interval: constrain comparator parameters

2021-09-10 16:43:16 +02:00

intrusive_set_external_comparator.hh

everywhere: make deferred actions noexcept

2021-08-22 21:11:52 +03:00

keys.cc

clustering_bounds_comparator: add reverse_kind()

2021-09-09 11:49:05 +03:00

keys.hh

treewide: remove redundant "x <=> 0" compares

2021-07-28 13:30:32 +03:00

LICENSE.AGPL

…

lister.cc

…

lister.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

log.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

main.cc

main: shutdown: do not abort on certain system errors

2022-03-24 14:49:24 +02:00

map_difference.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

marshal_exception.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

memtable-sstable.hh

memtable-sstable: Extend interface to allow adjustment of estimated partitions

2021-11-03 17:51:03 +02:00

memtable.cc

memtable: fix gcc function argument evaluation order induced use after move

2021-11-10 08:58:09 +02:00

memtable.hh

memtable: enable native reversing

2021-10-10 20:38:18 +02:00

multishard_mutation_query.cc

database, treewide: Introduce partition_slice::is_reversed()

2021-10-14 12:39:16 +03:00

multishard_mutation_query.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_cleaner.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_compactor.hh

mutation_compactor: collect stats about compacted data

2021-09-22 13:59:19 +03:00

mutation_consumer_concepts.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: disambiguate schema member definition

2021-07-27 11:55:42 +03:00

mutation_fragment_v2.hh

mutation_fragment{_v2}: MutationFragmentConsumer: allow for abstract consumer

2021-08-25 13:12:41 +03:00

mutation_fragment.cc

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

mutation_fragment.hh

mutation_fragment{_v2}: MutationFragmentConsumer: allow for abstract consumer

2021-08-25 13:12:41 +03:00

mutation_partition_serializer.cc

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

mutation_partition_serializer.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition_view.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_partition_view.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition_visitor.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition.cc

treewide: replace seastar::fmt_print() with fmt::print()

2021-11-01 10:05:16 +02:00

mutation_partition.hh

mutation_partition: row: make row marker shadowing symmetric

2021-10-26 20:40:31 +02:00

mutation_query.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_query.hh

mutation_query: reconcilable_result_builder: document reverse query preconditions

2021-09-28 17:03:57 +03:00

mutation_reader.cc

shard_reader: check that _reader is valid before dereferencing

2022-02-07 10:10:58 +02:00

mutation_reader.hh

mutation_reader: queue_reader_handle: make abandoned() exception safe

2021-10-21 06:50:22 +03:00

mutation_rebuilder.hh

mutation_rebuilder: make it standalone

2021-09-09 15:42:15 +03:00

mutation_source_metadata.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation.cc

treewide: replace seastar::fmt_print() with fmt::print()

2021-11-01 10:05:16 +02:00

mutation.hh

mutation: introduce reverse()

2021-09-09 15:42:15 +03:00

noexcept_traits.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

NOTICE.txt

import wasmtime.hh

2021-09-13 11:01:33 +02:00

ORIGIN

…

partition_builder.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

partition_range_compat.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_slice_builder.cc

partition_slice_builder: add range mutating methods

2021-09-09 14:16:21 +03:00

partition_slice_builder.hh

partition_slice_builder(): add with_option_toggled()

2021-09-13 18:05:11 +03:00

partition_snapshot_reader.hh

partition_snapshot_reader: fix indentation in fill_buffer

2021-11-05 10:51:58 +01:00

partition_snapshot_row_cursor.hh

row-cache: Handle exception (un)safety of rows_entry insertion

2021-12-14 15:53:42 +02:00

partition_version_list.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_version.cc

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

partition_version.hh

row_cache: Consume range tombstones incrementally

2021-07-26 17:48:05 +02:00

position_in_partition.hh

treewide: handle switch statements that return

2021-10-10 18:16:50 +03:00

querier.cc

database, treewide: Introduce partition_slice::is_reversed()

2021-10-14 12:39:16 +03:00

querier.hh

querier: consume_page(): remove now unused max_size parameter

2021-09-29 12:15:48 +03:00

query_class_config.hh

table, database: query, mutation_query: remove unnecessary class_config param

2021-09-14 13:39:56 +02:00

query_result_merger.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-request.hh

database, treewide: Introduce partition_slice::is_reversed()

2021-10-14 12:39:16 +03:00

query-result-reader.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

query-result-set.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result-set.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

query-result-writer.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query.cc

query: reverse clustering_range

2021-10-05 16:47:04 +02:00

range_tombstone_assembler.hh

flat_mutation_reader: Introduce adaptors between v1 and v2 of mutation fragment stream

2021-06-15 13:10:47 +02:00

range_tombstone_change_generator.hh

range_tombstone, code: Add range_tombstone& getters

2021-09-03 19:34:45 +03:00

range_tombstone_list.cc

Merge 'db: range_tombstone_list: Deoverlap empty range tombstones' from Tomasz Grabiec

2022-01-20 12:35:21 +02:00

range_tombstone_list.hh

Merge 'db: range_tombstone_list: Deoverlap empty range tombstones' from Tomasz Grabiec

2022-01-20 12:35:21 +02:00

range_tombstone_splitter.hh

flat_mutation_reader: Trim range tombstones in make_flat_mutation_reader_from_fragments()

2021-06-16 00:23:49 +02:00

range_tombstone.cc

range_tombstone_accumulator: drop _reversed flag

2021-09-09 15:42:15 +03:00

range_tombstone.hh

range_tombstone_accumulator: drop _reversed flag

2021-09-09 15:42:15 +03:00

range.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

read_context.hh

flat_mutation_reader: get rid of timeout parameter

2021-08-24 16:30:51 +03:00

reader_concurrency_semaphore.cc

reader_permit: release_base_resources(): also update _resources

2022-01-20 18:39:25 +02:00

reader_concurrency_semaphore.hh

Prepare for inheriting from reader_concurrency_semaphore

2021-09-26 12:57:48 +03:00

reader_permit.hh

Merge "repair: make sure there is one permit per repair with count res" from Botond

2022-01-17 16:02:55 +02:00

README.md

README.md: update link to docker build instructions

2021-09-01 11:50:11 +03:00

real_dirty_memory_accounter.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

release.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

release.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reversibly_mergeable.hh

everywhere: cleanup defer.hh includes

2021-08-22 21:11:39 +03:00

row_cache.cc

treewide: move reversing to the mutation sources

2021-09-29 12:15:45 +03:00

row_cache.hh

treewide: move reversing to the mutation sources

2021-09-29 12:15:45 +03:00

schema_builder.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_fwd.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_mutations.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_mutations.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_registry.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_registry.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_upgrader.hh

Adapt flat_mutation_reader_v2 to the new version of the API

2021-06-15 13:10:47 +02:00

schema.cc

schema: make private constructor invokable via make_lw_shared

2021-11-07 12:51:09 +02:00

schema.hh

schema: make private constructor invokable via make_lw_shared

2021-11-07 12:51:09 +02:00

scylla_post_install.sh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

scylla-gdb.py

sstables, gdb: Retire usage of sstable_tracker

2021-10-07 14:40:47 +02:00

SCYLLA-VERSION-GEN

release: prepare for 4.6.4

2022-05-16 15:20:35 +03:00

seastarx.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serialization_visitors.hh

treewide: distinguish truncated frame errors

2021-10-27 12:27:16 +02:00

serializer_impl.hh

serialize: add serialized for std::monostate

2021-08-25 08:19:25 +03:00

serializer.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serializer.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

service_permit.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

setup.py

…

shell.nix

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

sstables_loader.cc

storage_service, sstables_loader: use effective_replication_map to get_natural_endpoints

2021-10-13 13:50:27 +03:00

sstables_loader.hh

sstables_loader: Accept the sstables loading code

2021-10-11 11:08:21 +03:00

supervisor.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

table_helper.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

table_helper.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

table.cc

table: clear: serialize with ongoing flush

2022-05-15 13:43:43 +03:00

test.py

test: refine test suite names exposed via xunit format

2021-12-05 19:58:22 +02:00

timeout_config.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timeout_config.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timestamp.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

to_string.hh

to_string: Add formatter for strong_ordering

2021-06-08 11:33:04 +03:00

tombstone.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

tox.ini

…

types.cc

Merge 'types: fix is_string for reversed types' from Piotr Sarna

2022-07-03 17:59:56 +03:00

types.hh

cql3: types: Optimize abstract_type::contains_collection

2021-09-24 13:45:38 +02:00

ubsan-suppressions.supp

suppress ubsan error in boost::deque::clear()

2020-11-09 11:25:19 +02:00

unimplemented.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

unimplemented.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

user_types_metadata.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

validation.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

validation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

version.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

view_info.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

vint-serialization.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

vint-serialization.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

xx_hasher.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

zstd.cc

abstract_replication_strategy: use shared_ptr in registry

2021-10-13 12:39:36 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring a very recent C++20 compiler and recent versions of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (these checks are not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operation of open-source ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.5%

Python 26.2%

CMake 0.4%

GAP 0.3%

Shell 0.3%