mirror of https://github.com/scylladb/scylladb.git synced 2026-05-12 19:02:12 +00:00

Go to file

Raphael S. Carvalho 0f59deffaa replica: Fix truncate and drop table after tablet migration happens

When running those operations after a tablet replica is migrated away from
a shard, an assert can fail resulting in a crash.

Status quo (around the assert in truncate procedure):

1) Highest RP seen by table is saved in low_mark, and the current time in
low_mark_at.
2) Then compaction is disabled in order to not mix data written before truncate,
and data written later.
3) Then memtable is flushed in order for the data written before truncate to be
available in sstables and then removed.
4) Now, current time is saved in truncated_at, which is supposedly the time of
truncate to decide which sstables to remove.

Note: truncated_at is likely above low_mark_at due to steps 2 and 3.

The interesting part of the assert is:
    (truncated_at <= low_mark_at ? rp <= low_mark : low_mark <= rp)

Note: RP in the assert above is the highest RP among all sstables generated
before truncated_at. RP is retrieved by table::discard_sstables().

If truncated_at > low_mark_at, maybe newer data was written during steps 2 and
3, and memtable's RP becomes greater than low_mark, resulting in a SSTable with
RP > low_mark.
So assert's 2nd condition is there to defend against the scenario above.

truncated_at and low_mark_at uses millisecond granularity, so even if
truncated_at == low_mark_at, data could have been written in steps 2 and 3
(during same MS window), failing the assert. This is fragile.

Reproducer:

To reproduce the problem, truncated_at must be > low_mark_at, which can easily
happen with both drop table and truncate due to steps 2 and 3.

If a shard has 2 or more tablets, the table's highest RP refer to just one
tablet in that shard.
If the tablet with the highest RP is migrated away, then the sstables in that
shard will have lower RP than the recorded highest RP (it's a table wide state,
which makes sense since CL is shared among tablets).

So when either drop table or truncate runs, low_mark will be potentially bigger
than highest RP retrieved from sstables.

Proposed solution:

The current assert is hacked to not fail if writes sneak in, during steps 2 and
3, but it's still fragile and seems not to serve its real purpose, since it's
allowing for RP > low_mark.

We should be able to say that low_mark >= RP, as a way of asserting we're not
leaving data targeted by truncate behind (or that we're not removing the wrong
data).

But the problem is that we're saving low_mark in step 1, before preparation
steps (2 and 3). When truncated_at is recorded in step 4, it's a way of saying
all data written so far is targeted for removal. But as of today, low_mark
refers to all data written up to step 1. So low_mark is now only one set
before issuing flush, and also accounts for all potentially flushed data.

Fixes #18059.

Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>

Closes scylladb/scylladb#23560

2025-04-08 07:32:58 +03:00

.github

.github: add delay before checking for required PR labels

2025-04-02 19:28:15 +03:00

abseil @ d7aaad83b4

…

alternator

alternator: in GetRecords, enforce Limit to be <= 1000

2025-04-07 12:52:03 +03:00

api

gossiper: move force_remove_endpoint to work on host id

2025-04-06 18:39:24 +03:00

audit

audit/syslog: escape quotes and add explicit section names

2025-03-20 19:55:51 +03:00

auth

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

bin

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

cdc

treewide: drop id parameter from gossiper::for_each_endpoint_state

2025-03-31 16:50:50 +03:00

cmake

cmake: add the -dynamic-linker=... form to the -dynamic-linker regex

2025-03-30 11:58:47 +03:00

compaction

database, compaction_manager, large_data_handler: use pluggable<system_keysapce>

2025-03-05 08:27:23 +02:00

conf

Merge 'Add tablet enforcing option' from Benny Halevy

2025-04-03 16:32:19 +03:00

cql3

cql: Remove unused "initial_tablets" mention from guardrails

2025-04-06 16:52:07 +03:00

data_dictionary

Merge 'scylla-sstable: add native S3 support' from Ernest Zaslavsky

2025-03-14 15:05:52 +02:00

Merge 'Add tablet enforcing option' from Benny Halevy

2025-04-03 16:32:19 +03:00

debug

…

dht

gossiper: move _live_endpoints and _unreachable_endpoints endpoint to host_id

2025-03-11 12:09:21 +02:00

direct_failure_detector

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

dist

dist/docker: run the container as non-root user

2025-03-05 23:39:56 +09:00

docs

build(deps): bump sphinx-scylladb-theme from 1.8.5 to 1.8.6 in /docs

2025-04-07 13:42:19 +03:00

ent

encryption::gcp: Use seastar http client wrapper

2025-04-01 08:18:05 +00:00

exceptions

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gms

Merge "Fixes for gossiper conversion to host id" from Gleb

2025-04-07 17:04:28 +03:00

idl

Merge "Convert gossiper's endpoint state map to be host id based" from Gleb

2025-04-02 12:30:00 +03:00

index

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

lang

treewide: Reduce db/config.hh header fanout

2025-02-25 15:16:40 +01:00

licenses

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

locator

Merge 'tablets: Make tablet allocation equalize per-shard load ' from Tomasz Grabiec

2025-04-03 16:32:53 +03:00

message

messaging_service: add SAMPLE_SSTABLES and ESTIMATE_SSTABLE_VOLUME verbs

2025-04-01 00:07:29 +02:00

mutation

mutation: fold FragmentConsumer[V2] into FlattenedConsumer[V2]

2025-03-18 09:24:49 -04:00

mutation_writer

feed_writers: optimize error path

2025-02-23 18:22:39 +02:00

node_ops

tree: migrate from boost::find to std::ranges algorithms

2025-02-20 09:28:57 +03:00

pgo

Update pgo profiles - aarch64

2025-04-01 04:45:44 +03:00

raft

fms: extract entry_size to log_entry::get_size

2025-02-12 14:33:41 +01:00

readers

readers/mutation_readers: queue_reader_handle_v2::push_end_of_stream() raise _ex if set

2025-04-03 10:39:56 +03:00

redis

service: do not include unused headers

2025-03-20 11:18:16 +08:00

reloc

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

repair

Merge 'repair: release erm in repair_writer_impl::create_writer when possible' from Aleksandra Martyniuk

2025-04-03 11:15:08 +02:00

replica

replica: Fix truncate and drop table after tablet migration happens

2025-04-08 07:32:58 +03:00

rust

rust: update dependencies

2025-03-04 09:45:23 +02:00

schema

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

scripts

scripts/open-coredump.sh: use the remote repo containing given sha1

2025-03-03 08:22:41 +02:00

seastar @ ed8952fbc6

Update seastar submodule

2025-04-03 19:45:37 +03:00

service

Merge "Fixes for gossiper conversion to host id" from Gleb

2025-04-07 17:04:28 +03:00

sstables

sstable_set: incremental_reader_selector: be more careful when filtering out already engaged sstables

2025-04-07 12:49:04 +03:00

streaming

streaming: Relax load_sstable_for_tablet()

2025-03-14 15:26:48 +02:00

swagger-ui @ 12f1da1082

…

tasks

tasks: make release_resources() a coroutine

2025-02-14 11:13:58 +08:00

test

replica: Fix truncate and drop table after tablet migration happens

2025-04-08 07:32:58 +03:00

tools

Merge 'Remove object_storage.yaml and move the endpoints to scylla.yaml' from Robert Bindar

2025-04-01 16:01:44 +03:00

tracing

CQL Tracing: set common query parameters in a single function

2025-03-06 09:30:51 -05:00

transport

Merge "Convert gossiper's endpoint state map to be host id based" from Gleb

2025-04-02 12:30:00 +03:00

types

allow "UTC" and "GMT" in string format of timestamp

2025-02-12 09:38:28 +02:00

unified

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

utils

s3/client: Optimize file streaming with zero-copy multipart uploads

2025-04-07 12:50:06 +03:00

.clang-format

…

.dockerignore

…

.gitattributes

configure.py: prepare the build for a default PGO profile in version control

2024-12-27 16:16:04 +08:00

.gitignore

…

.gitmodules

…

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

absl-flat_hash_map.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

amplify.yml

…

backlog_controller.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

build_mode.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_fwd.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_ostream.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes.hh

bytes: adapt fmt_hex to std::span<const std::byte>

2025-04-01 00:07:27 +02:00

cache_temperature.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cartesian_product.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cell_locking.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.hh

transport/server: use scheduling group assigned to current user

2025-01-02 07:13:34 +01:00

clocks-impl.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clocks-impl.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_bounds_comparator.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_interval_set.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_key_filter.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_ranges_walker.hh

clustering_range_walker: drop boost iterator_range dependency

2025-02-17 11:34:46 +03:00

CMakeLists.txt

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

collection_mutation.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

collection_mutation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

column_computation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

combine.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

compound_compat.hh

utils: do not include unused headers

2025-01-14 07:56:39 -05:00

compound.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

compress.cc

compress: add ZstdWithDictsCompressor and LZ4WithDictsCompressor

2025-04-01 00:07:30 +02:00

compress.hh

compress: add ZstdWithDictsCompressor and LZ4WithDictsCompressor

2025-04-01 00:07:30 +02:00

concrete_types.hh

types: implement vector_type_impl

2025-01-26 19:36:41 +01:00

configure.py

test: remove alternator code from perf-simple-query

2025-04-06 18:15:16 +03:00

CONTRIBUTING.md

Fix typos

2025-02-11 00:17:43 +02:00

converting_mutation_partition_applier.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

converting_mutation_partition_applier.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counters.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counters.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

coverage_excludes.txt

…

coverage_sources.list

…

cql_serialization_format.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

db_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

debug.cc

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

debug.hh

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

default.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

Doxyfile

…

duration.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

duration.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

encoding_stats.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

enum_set.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

fix_system_distributed_tables.py

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

flake.lock

…

flake.nix

…

frozen_schema.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

frozen_schema.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

full_position.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gc_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gdbinit

…

gen_segmented_compress_params.py

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

generic_server.cc

generic_server: Update conditions for is_broken_pipe_or_connection_reset

2025-02-25 10:35:11 +02:00

generic_server.hh

generic_server: Allow sharing reloadability of certificates across shards

2025-01-27 16:16:23 +00:00

HACKING.md

HACKING.md: Provide step-by-step support to enable development with CLion

2025-03-09 16:22:24 +02:00

hashing_partition_visitor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

idl-compiler.py

idl: generate ip based version of a verb only for verbs that need it

2025-03-11 12:09:21 +02:00

inet_address_vectors.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

init.cc

Move object_storage.yaml endpoints to scylla.yaml

2025-03-31 13:39:39 +03:00

init.hh

Move object_storage.yaml endpoints to scylla.yaml

2025-03-31 13:39:39 +03:00

install-dependencies.sh

install-dependencies.sh: disabiguate python magic package

2025-03-24 10:18:27 +03:00

install.sh

install.sh: simplify check_usermode_support()

2025-02-24 11:29:30 +03:00

interval.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

keys.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

keys.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

LICENSE-ScyllaDB-Source-Available.md

Fix typos

2025-02-13 01:54:08 +02:00

main.cc

main: Remove unused member variable _sys_ks

2025-04-02 20:07:39 +03:00

map_difference.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

marshal_exception.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

multishard_mutation_query.cc

treewide: Reduce db/config.hh header fanout

2025-02-25 15:16:40 +01:00

multishard_mutation_query.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_query.cc

schema: deinline some speculative_retry methods

2025-01-02 12:28:33 +01:00

mutation_query.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_range_compat.hh

partition_range_compat: drop dependency on boost ranges

2025-01-20 16:43:21 +02:00

partition_slice_builder.cc

tree: Remove unused boost headers

2025-02-25 10:32:32 +03:00

partition_slice_builder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_snapshot_reader.hh

moved cache files to db

2025-02-04 12:21:31 +03:00

protocol_server.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

querier.cc

querier: demote tombstone warning for range-scans to debug level

2025-03-04 10:38:06 +03:00

querier.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_id.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_ranges_to_vnodes.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_ranges_to_vnodes.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_result_merger.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-request.hh

db: cql3: add comments regarding unsafe interval<clustering_key_prefix>

2025-02-26 12:01:28 +01:00

query-result-reader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-result-set.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-result-set.hh

db: prevent accidental copies of result_set_row by making it move-only

2025-02-17 09:48:08 +02:00

query-result-writer.hh

query-result-writer: reorder initialization to prevent use-after-move

2025-02-17 13:45:35 +03:00

query-result.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reader_concurrency_semaphore_group.cc

treewide: fix misspellings

2025-01-05 16:13:09 +02:00

reader_concurrency_semaphore_group.hh

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: register_inactive_read(): handle aborted permit

2025-02-28 01:32:46 -05:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: with_permit(): proper clean-up after queue overload

2025-02-04 21:27:16 +02:00

reader_permit.hh

reader_permit: mark check_abort() as const

2025-02-07 01:32:35 -05:00

README.md

README: adjust to reflect license change

2025-01-30 10:28:32 +03:00

real_dirty_memory_accounter.hh

moved cache files to db

2025-02-04 12:21:31 +03:00

release.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

release.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reversibly_mergeable.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_mutations.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_mutations.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_upgrader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla_post_install.sh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla-gdb.py

treewide: move gossiper to index nodes by host id

2025-03-31 16:50:50 +03:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 2025.2.0-dev

2025-01-27 13:13:41 +01:00

seastarx.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serialization_visitors.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer_impl.hh

serialization: fix std::map de-serializer to not invoke value's default constructor

2025-03-31 15:42:07 +03:00

serializer.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

service_permit.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

shell.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstable_dict_autotrainer.cc

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

sstable_dict_autotrainer.hh

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

sstables_loader.cc

sstables_loader: Do not stop sharded<progress_monitor> unconditionally

2025-04-02 12:09:02 +03:00

sstables_loader.hh

sstable_loader: fix cross-shard resource cleanup in download_task_impl

2025-02-14 11:13:58 +08:00

supervisor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

table_helper.cc

audit: Add service level support to CQL login process

2025-01-15 11:10:36 +01:00

table_helper.hh

audit: Add the audit subsystem

2025-01-15 11:10:35 +01:00

test.py

test.py: refactor paths constants and options

2025-03-30 03:19:29 +00:00

timeout_config.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timeout_config.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timestamp.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_extension.hh

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

tombstone_gc_options.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_options.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc-internals.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc.cc

Merge 'scylla sstable: Add standard extensions and propagate to schema load ' from Calle Wilund

2025-02-26 13:52:47 +02:00

tombstone_gc.hh

repair: Wire repair_time in system.tablets for tombstone gc

2025-01-17 16:12:05 +08:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

unimplemented.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

validation.cc

tree: Make values mutable to enable move semantics

2025-03-03 13:53:02 +03:00

validation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

version.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

view_info.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

vint-serialization.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

vint-serialization.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++23 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.5%

Python 26.2%

CMake 0.4%

GAP 0.3%

Shell 0.3%