mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 18:50:53 +00:00

Wojciech Mitros 055a6c2cee storage_proxy: send hints to pending replicas

Consider the following scenario:
- Current replica set is [A, B, C]
- write succeeds on [A, B], and a hint is logged for node C
- before the hint is replayed, D bootstraps and the token migrates from C to D
- hint is replayed to node C while D is pending, but it's too late, since streaming for that token is already done
- C is cleaned up, replayed data is lost, and D has a stale copy until next repair.
In this scenario we effectively fail to send the hint. This scenario is also more likely to happen with tablets,
as it can happen for every tablet migration.

This issue is particularly detrimental to materialized views. View updates use hints by default and a specific
view update may be sent to just one view replica (when a single base replica has a different row state due to
reordering or missed writes). When we lose a hint for such a view update, we can generate a persistent inconsistency
between the base and view - ghost rows can appear due to a lost tombstone and rows may be missing in the view due
to a lost row update. Such inconsistencies can be fixed neither by repairing the view nor by repairing the base table.

To handle this, in this patch we add the pending replicas to the list of targets of each hint, even if the original
target is still alive.
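
The rule described above can be sketched in a few lines. This is only an illustrative model, not the code from the patch: `TokenReplicas` and `hint_targets` are hypothetical names invented here, and nodes are represented as plain strings.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class TokenReplicas:
    """Replica view for one token range (hypothetical model, not a ScyllaDB type)."""
    natural: tuple[str, ...]      # current replica set, e.g. ("A", "B", "C")
    pending: tuple[str, ...] = ()  # replicas still being streamed to, e.g. ("D",)


def hint_targets(original_target: str, replicas: TokenReplicas) -> list[str]:
    """Return the nodes a replayed hint is written to.

    The original target is always kept, even if it is still alive, and every
    pending replica for the token is appended so the write is not lost when
    the token finishes migrating.
    """
    targets = [original_target]
    for node in replicas.pending:
        if node not in targets:
            targets.append(node)
    return targets


# The scenario from the commit message: the hint was logged for C, and D is
# pending while the token migrates from C to D.
during_migration = TokenReplicas(natural=("A", "B", "C"), pending=("D",))
print(hint_targets("C", during_migration))
```

With no pending replicas the function returns only the original target, so the behaviour outside of migrations and rebuilds is unchanged in this model.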

This will cause some updates to be redundant. These updates are probably unavoidable for now, but they shouldn't
be too common either. The scenarios for them are:
1. managing to send the hint to the source of a migrating replica before its token is streamed - the write will
arrive on the pending replica anyway during streaming
2. the hint target not being the source of the migration - if we managed to apply the original write of the hint to
the actual source of the migration, the pending replica will get it during streaming
3. sending the same hint to many targets at a similar time - while sending to each target, we'll see the same pending
replica for the hint so we'll send it multiple times
4. possible retries where even though the hint was successfully sent to the main target, we failed to send it to the
pending replica, so we need to retry the entire write

This patch handles both tablet migrations and tablet rebuilds. In the future, for tablet migrations, we can avoid
sending the hint to pending replicas if the hint target is not the source of the migration, which would allow us to
avoid the redundant writes 2 and 3. For rack-aware RF, this will be as simple as checking whether the replicas are
in the same rack.
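
The possible future optimization mentioned above could look roughly like the sketch below. It is not part of this patch. The function names, the `rack_of` mapping, and the use of `None` to mean "tablet rebuild, no single source" are all assumptions made for illustration. The rack variant also assumes that, with rack-aware RF, a tablet migrates within one rack, so the pending replica shares a rack only with the migration source.

```python
from __future__ import annotations


def pending_needs_hint(hint_target: str, migration_source: str | None) -> bool:
    """Decide whether a pending replica should also receive the hint.

    migration_source is None for a rebuild (assumption of this sketch); in
    that case the hint is always forwarded, as in the current patch.
    """
    if migration_source is None:
        return True
    # Only a hint aimed at the migration source can be missed by streaming.
    return hint_target == migration_source


def pending_needs_hint_rack_aware(hint_target: str, pending: str,
                                  rack_of: dict[str, str]) -> bool:
    """Rack-based variant: the target is treated as the source iff it shares
    a rack with the pending replica (assumes one replica per rack)."""
    return rack_of[hint_target] == rack_of[pending]


racks = {"A": "rack1", "B": "rack2", "C": "rack3", "D": "rack3"}
print(pending_needs_hint("C", "C"))                     # hint for the source
print(pending_needs_hint_rack_aware("A", "D", racks))   # hint for another replica
```

In this model a hint for A or B would no longer be copied to D, which corresponds to avoiding the redundant writes in scenarios 2 and 3.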

We also add a test case reproducing the issue.

Co-Authored-By: Raphael S. Carvalho <raphaelsc@scylladb.com>

Fixes https://github.com/scylladb/scylladb/issues/19835

Closes scylladb/scylladb#25590

(cherry picked from commit 10b8e1c51c)

Closes scylladb/scylladb#25882

2025-09-10 10:29:52 +03:00

.github

auto-backport.py: Avoid bot push to existing backport branches

2025-07-15 10:25:30 +02:00

abseil @ d7aaad83b4

…

alternator

alternator: clean up by co-routinizing

2025-07-27 07:42:01 +00:00

api

api: storage_service: fix token_range documentation

2025-09-01 09:11:12 +03:00

audit

audit: introduce debug level logs on happy path

2025-06-27 16:27:27 +02:00

auth

Merge 'auth: move passwords::check call to alien thread' from Andrzej Jackowski

2025-09-07 13:38:33 +03:00

bin

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

cdc

test: cdc: add test_cdc_with_alter

2025-07-20 09:04:00 +02:00

cmake

build: cmake: Use LINKER: prefix for consistent linker option handling

2025-06-25 11:17:15 +03:00

compaction

compaction_manager: cancel submission timer on drain

2025-06-20 11:33:49 +03:00

conf

scylla.yaml: add recommended value for stream_io_throughput_mb_per_sec

2025-08-01 15:02:01 +03:00

cql3

select_statement: check for access to CDC base table

2025-09-08 07:39:52 +00:00

data_dictionary

Merge 'scylla-sstable: add native S3 support' from Ernest Zaslavsky

2025-03-14 15:05:52 +02:00

storage_proxy: send hints to pending replicas

2025-09-10 10:29:52 +03:00

debug

…

dht

interval: reduce sizeof

2025-06-14 21:29:43 +03:00

dist

build_docker.sh: enable debug symboles installation

2025-09-09 11:41:17 +03:00

docs

doc: add support for i7i instances

2025-09-08 10:40:52 +03:00

ent

encryption: gcp: Fix the grant type for user credentials

2025-08-07 21:46:24 +00:00

exceptions

transport: storage_proxy: release ERM when waiting for query timeout

2025-04-23 09:29:47 +02:00

gms

gossiper: fix empty initial local node state

2025-09-08 21:55:16 +00:00

idl

gossip: add recovery_leader to gossip_digest_syn

2025-08-05 10:59:39 +00:00

index

interval: rename start() to start_ref() (and end() etc).

2025-06-14 21:26:16 +03:00

lang

treewide: Reduce db/config.hh header fanout

2025-02-25 15:16:40 +01:00

licenses

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

locator

token_metadata_impl: clear_gently: release version tracker early

2025-07-07 09:42:29 +03:00

message

dht: fragment token_range_vector

2025-05-27 14:47:24 +03:00

mutation

Merge 'sstables/mx/writer: handle non-full prefix row keys' from Botond Dénes

2025-06-29 18:18:36 +03:00

mutation_writer

readers/mutation_reader: s/reader_consumer_v2/mutation_reader_consumer/

2025-05-09 07:53:29 -04:00

node_ops

topology request: make it possible to hold global request types in request_type field

2025-06-09 13:38:49 +03:00

pgo

Update pgo profiles - aarch64

2025-09-01 05:24:53 +03:00

raft

raft: server_impl: use named gate

2025-04-12 11:28:48 +03:00

readers

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

redis

generic_server: Two-step connection shutdown.

2025-08-18 15:46:46 +02:00

reloc

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

repair

repair: distribute tablet_repair_task_meta among shards

2025-07-31 15:56:53 +02:00

replica

system_keyspace: Prune dropped tables from truncation on start/drop

2025-09-05 19:02:39 +03:00

rust

rust: update dependencies

2025-03-04 09:45:23 +02:00

schema

cql, schema: Extend name length limit from 48 to 192 bytes

2025-06-18 14:08:38 +02:00

scripts

docs: expose alternator metrics

2025-09-01 09:10:41 +03:00

seastar @ 26badcb14c

Update seastar submodule

2025-06-03 13:47:05 +03:00

service

storage_proxy: send hints to pending replicas

2025-09-10 10:29:52 +03:00

sstables

s3: Export memory usage gauge (metrics)

2025-09-10 10:28:39 +03:00

streaming

streaming: close sink when exception is thrown

2025-07-31 12:17:05 +03:00

swagger-ui @ 12f1da1082

…

tasks

tasks: change _finished_children type

2025-08-06 07:36:04 +03:00

test

storage_proxy: send hints to pending replicas

2025-09-10 10:29:52 +03:00

tools

s3: Export memory usage gauge (metrics)

2025-09-10 10:28:39 +03:00

tracing

tracing: trace_keyspace_helper: use named gate

2025-04-12 11:29:48 +03:00

transport

generic_server: Two-step connection shutdown.

2025-08-18 15:46:46 +02:00

types

allow "UTC" and "GMT" in string format of timestamp

2025-02-12 09:38:28 +02:00

unified

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

utils

Merge 'auth: move passwords::check call to alien thread' from Andrzej Jackowski

2025-09-07 13:38:33 +03:00

.clang-format

clang-format: argument and function packing

2024-10-04 14:52:41 +02:00

.dockerignore

…

.gitattributes

configure.py: prepare the build for a default PGO profile in version control

2024-12-27 16:16:04 +08:00

.gitignore

Add .idea folder to .gitignore

2024-09-20 11:49:41 +03:00

.gitmodules

build: replace tools/java submodule with packaged cassandra-stress

2025-04-15 10:11:28 +03:00

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

absl-flat_hash_map.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

amplify.yml

…

backlog_controller.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

build_mode.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_fwd.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_ostream.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes.hh

bytes: adapt fmt_hex to std::span<const std::byte>

2025-04-01 00:07:27 +02:00

cache_temperature.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cartesian_product.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cell_locking.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.hh

transport/server: use scheduling group assigned to current user

2025-01-02 07:13:34 +01:00

clocks-impl.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clocks-impl.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_bounds_comparator.hh

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

clustering_interval_set.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_key_filter.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clustering_ranges_walker.hh

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

CMakeLists.txt

tools: add patchelf utility

2025-06-30 07:24:05 +03:00

collection_mutation.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

collection_mutation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

column_computation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

combine.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

compound_compat.hh

utils: do not include unused headers

2025-01-14 07:56:39 -05:00

compound.hh

compound: optimize is_full() for single-component types

2025-06-23 09:38:45 +03:00

compress.cc

transport/server: silence the oversized allocation warning in snappy_compress

2025-06-10 19:13:26 +03:00

compress.hh

db/config: add an option that disables dict-aware sstable compressors in DDL statements

2025-06-09 13:30:40 +03:00

concrete_types.hh

types: implement vector_type_impl

2025-01-26 19:36:41 +01:00

configure.py

tools: add patchelf utility

2025-06-30 07:24:05 +03:00

CONTRIBUTING.md

Fix typos

2025-02-11 00:17:43 +02:00

converting_mutation_partition_applier.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

converting_mutation_partition_applier.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counters.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counters.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

coverage_excludes.txt

…

coverage_sources.list

…

cql_serialization_format.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

db_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

debug.cc

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

debug.hh

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

default.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

Doxyfile

…

duration.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

duration.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

encoding_stats.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

enum_set.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

fix_system_distributed_tables.py

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

flake.lock

…

flake.nix

…

frozen_schema.cc

Revert "Merge 'Atomic in-memory schema changes application' from Marcin Maliszkiewicz"

2025-06-16 22:38:12 +03:00

frozen_schema.hh

Revert "Merge 'Atomic in-memory schema changes application' from Marcin Maliszkiewicz"

2025-06-16 22:38:12 +03:00

full_position.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gc_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gdbinit

…

gen_segmented_compress_params.py

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

generic_server.cc

generic_server: Two-step connection shutdown.

2025-08-18 15:46:46 +02:00

generic_server.hh

generic_server: Two-step connection shutdown.

2025-08-18 15:46:46 +02:00

HACKING.md

build: replace tools/java submodule with packaged cassandra-stress

2025-04-15 10:11:28 +03:00

hashing_partition_visitor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

idl-compiler.py

idl-compiler.py: generate skip() definition for enums serializers

2025-06-24 11:05:31 +03:00

inet_address_vectors.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

init.cc

compress: distribute compression dictionaries over shards

2025-05-07 14:43:18 +02:00

init.hh

Move object_storage.yaml endpoints to scylla.yaml

2025-03-31 13:39:39 +03:00

install-dependencies.sh

toolchain: set scylla-driver release based on tools/cqlsh

2025-05-15 06:08:14 +03:00

install.sh

install.sh: simplify check_usermode_support()

2025-02-24 11:29:30 +03:00

interval.hh

interval: reduce sizeof

2025-06-14 21:29:43 +03:00

keys.cc

keys: from_nodetool_style_string don't split single partition keys

2025-09-01 15:36:56 +03:00

keys.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

LICENSE-ScyllaDB-Source-Available.md

Fix typos

2025-02-13 01:54:08 +02:00

main.cc

Merge 'auth: move passwords::check call to alien thread' from Andrzej Jackowski

2025-09-07 13:38:33 +03:00

map_difference.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

marshal_exception.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

multishard_mutation_query.cc

readers/mutation_source: s/make_reader_v2/make_mutation_reader/

2025-05-09 07:53:29 -04:00

multishard_mutation_query.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_query.cc

schema: deinline some speculative_retry methods

2025-01-02 12:28:33 +01:00

mutation_query.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_range_compat.hh

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

partition_slice_builder.cc

tree: Remove unused boost headers

2025-02-25 10:32:32 +03:00

partition_slice_builder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_snapshot_reader.hh

moved cache files to db

2025-02-04 12:21:31 +03:00

protocol_server.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

querier.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

querier.hh

readers/mutation_source: s/make_reader_v2/make_mutation_reader/

2025-05-09 07:53:29 -04:00

query_id.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_ranges_to_vnodes.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

query_ranges_to_vnodes.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_result_merger.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-request.hh

mapreduce: add shard_id_hint to mapreduce request

2025-06-25 19:23:07 +02:00

query-result-reader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-result-set.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-result-set.hh

db: prevent accidental copies of result_set_row by making it move-only

2025-02-17 09:48:08 +02:00

query-result-writer.hh

query-result-writer: reorder initialization to prevent use-after-move

2025-02-17 13:45:35 +03:00

query-result.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query.cc

mapreduce: add missing comma and space in mapreduce_request operator<<

2025-06-25 19:23:07 +02:00

reader_concurrency_semaphore_group.cc

treewide: fix misspellings

2025-01-05 16:13:09 +02:00

reader_concurrency_semaphore_group.hh

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: use named gate

2025-04-12 11:28:48 +03:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: use named gate

2025-04-12 11:28:48 +03:00

reader_permit.hh

reader_permit: mark check_abort() as const

2025-02-07 01:32:35 -05:00

README.md

README: adjust to reflect license change

2025-01-30 10:28:32 +03:00

real_dirty_memory_accounter.hh

moved cache files to db

2025-02-04 12:21:31 +03:00

release.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

release.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reversibly_mergeable.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_mutations.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_mutations.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_upgrader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla_post_install.sh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla-gdb.py

gdb: handle zero-size reads in managed_bytes

2025-07-24 16:34:04 +03:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 2025.3.1

2025-08-31 11:05:24 +03:00

seastarx.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serialization_visitors.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer_impl.hh

serializer_impl.hh: add as_input_stream(managed_bytes_view) overload

2025-05-13 10:32:32 +02:00

serializer.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

service_permit.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

shell.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstable_dict_autotrainer.cc

compress: distribute compression dictionaries over shards

2025-05-07 14:43:18 +02:00

sstable_dict_autotrainer.hh

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

sstables_loader.cc

Add support for nodetool refresh --skip-reshape

2025-06-10 12:52:13 +03:00

sstables_loader.hh

Add support for nodetool refresh --skip-reshape

2025-06-10 12:52:13 +03:00

supervisor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

table_helper.cc

audit: Add service level support to CQL login process

2025-01-15 11:10:36 +01:00

table_helper.hh

audit: Add the audit subsystem

2025-01-15 11:10:35 +01:00

test.py

test.py: add bypassing x_log2_compaction_groups to boost tests

2025-08-25 15:15:30 +02:00

timeout_config.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timeout_config.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timestamp.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_extension.hh

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

tombstone_gc_options.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_options.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc-internals.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc.cc

Merge 'scylla sstable: Add standard extensions and propagate to schema load ' from Calle Wilund

2025-02-26 13:52:47 +02:00

tombstone_gc.hh

repair: Wire repair_time in system.tablets for tombstone gc

2025-01-17 16:12:05 +08:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

unimplemented.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

validation.cc

tree: Make values mutable to enable move semantics

2025-03-03 13:53:02 +03:00

validation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

version.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

view_info.hh

base_info: remove the lw_shared_ptr variant

2025-04-24 01:08:40 +02:00

vint-serialization.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

vint-serialization.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring a very recent C++23 compiler and very recent versions of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (these checks are not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%