mirror of https://github.com/scylladb/scylladb.git synced 2026-05-12 19:02:12 +00:00

Avi Kivity bc5773f777 Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

When scaling out is delayed or fails, it is crucial to ensure that clusters remain operational
and recoverable even under extreme conditions. To achieve this, the following proactive measures
are implemented:
- reject writes
      - includes: inserts, updates, deletes, counter updates, hints, read+repair and lwt writes
      - applicable to: user tables, views, CDC log, audit, cql tracing
- stop running compactions/repairs and prevent new ones from starting
- reject incoming tablet migrations

The aforementioned mechanisms are automatically enabled when a node's disk utilization reaches
the critical level (default: 98%) and disabled when the utilization drops below the threshold.
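
The enable/disable behaviour described above can be pictured with a small Python model. This is only an illustrative sketch, not the series' C++ implementation: `ThresholdMonitor`, `subscribe` and `update` are hypothetical names, and the only values taken from the description are the 98% default and the rule that the mode is switched off again once utilization drops below the threshold.

```python
from typing import Callable, List


class ThresholdMonitor:
    """Toy model of a threshold-based disk space subscription (hypothetical API)."""

    def __init__(self, critical_level: float = 0.98) -> None:
        self.critical_level = critical_level  # 98% by default, as in the description
        self.above = False                    # current side of the threshold
        self._subscribers: List[Callable[[bool], None]] = []

    def subscribe(self, callback: Callable[[bool], None]) -> None:
        """Register a callback that receives True on entering and False on leaving."""
        self._subscribers.append(callback)

    def update(self, used_bytes: int, total_bytes: int) -> None:
        """Feed a new disk usage sample; notify subscribers only on a crossing."""
        above = used_bytes / total_bytes >= self.critical_level
        if above != self.above:
            self.above = above
            for callback in self._subscribers:
                callback(above)


if __name__ == "__main__":
    monitor = ThresholdMonitor()
    monitor.subscribe(lambda above: print("critical mode:", "on" if above else "off"))
    for used in (90, 97, 98, 99, 95):  # percent of a 100-unit disk
        monitor.update(used, 100)
```

In this model a subscriber such as the write path, compaction or repair would flip its own "reject/stop" flag in the callback, so the per-operation check stays a cheap boolean test.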

Apart from that, the series adds tests that require mounted volumes to simulate running out of space.
The paths to the volumes can be provided using a pytest argument, `--space-limited-dirs`.
When not provided, tests are skipped.
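
A minimal `conftest.py`-style sketch of how such an option could be wired up with standard pytest hooks. The option name comes from the description; the comma-separated value format, the helper `parse_space_limited_dirs` and the fixture name `space_limited_dirs` are assumptions made for illustration and may differ from what the series actually does.

```python
import pytest


def parse_space_limited_dirs(raw):
    """Split the option value into a list of paths (comma-separated format is assumed)."""
    if not raw:
        return []
    return [part.strip() for part in raw.split(",") if part.strip()]


def pytest_addoption(parser):
    # Register the command-line option named in the description.
    parser.addoption(
        "--space-limited-dirs",
        action="store",
        default=None,
        help="Mounted, size-limited volumes used to simulate running out of space",
    )


@pytest.fixture
def space_limited_dirs(request):
    """Yield the configured volumes, or skip the test when none were given."""
    dirs = parse_space_limited_dirs(request.config.getoption("--space-limited-dirs"))
    if not dirs:
        pytest.skip("--space-limited-dirs not provided")
    return dirs
```

A test that takes `space_limited_dirs` as an argument would then run only when the volumes are supplied on the command line and be reported as skipped otherwise.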

Test scenarios:

1. Start a cluster and write data until one of the nodes reaches 90% of the disk utilization
2. Perform an **operation** that would take the nodes over 100%
3. The nodes should not exceed the critical disk utilization (98% by default)
4. Scale out the cluster by adding one node per rack
5. Retry or wait for the **operation** from step 2

The **operation** is: writing data, running compactions, building materialized views, running repair,
migrating tablets (caused by RF change, decommission).

The test is successful if no node runs out of space, the **operation** from step 2 is
aborted/paused/timed out, and the **operation** from step 5 succeeds.
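
The pass criterion can be written down as a small predicate. This is a sketch under stated assumptions: `ScenarioResult`, its field names and the outcome strings are hypothetical, and "no node runs out of space" is interpreted here through step 3, i.e. no node exceeds the critical level.

```python
from dataclasses import dataclass
from typing import List

# Outcomes of the first attempt (step 2) that the description accepts.
BLOCKED_OUTCOMES = {"aborted", "paused", "timed_out"}


@dataclass
class ScenarioResult:
    peak_utilization: List[float]  # highest observed utilization per node, as a fraction
    first_attempt: str             # outcome of the operation in step 2
    retry: str                     # outcome of the operation in step 5


def scenario_passed(result: ScenarioResult, critical_level: float = 0.98) -> bool:
    """Return True when all three conditions from the description hold."""
    within_limit = all(u <= critical_level for u in result.peak_utilization)
    first_blocked = result.first_attempt in BLOCKED_OUTCOMES
    retry_ok = result.retry == "succeeded"
    return within_limit and first_blocked and retry_ok


if __name__ == "__main__":
    ok = ScenarioResult([0.97, 0.98, 0.91], "timed_out", "succeeded")
    print(scenario_passed(ok))
```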

`perf-simple-query --smp 1 -m 1G` results obtained for fixed 400MHz frequency:

Read path (before):

```
instructions_per_op:
	mean=   39661.51 standard-deviation=34.53
	median= 39655.39 median-absolute-deviation=23.33
	maximum=39708.71 minimum=39622.61
```

Read path (after):

```
instructions_per_op:
	mean=   39691.68 standard-deviation=34.54
	median= 39683.14 median-absolute-deviation=11.94
	maximum=39749.32 minimum=39656.63
```

Write path (before):

```
instructions_per_op:
	mean=   50942.86 standard-deviation=97.69
	median= 50974.11 median-absolute-deviation=34.25
	maximum=51019.23 minimum=50771.60
```

Write path (after):

```
instructions_per_op:
	mean=   51000.15 standard-deviation=115.04
	median= 51043.93 median-absolute-deviation=52.19
	maximum=51065.81 minimum=50795.00
```
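
To put the four measurements side by side, the relative change in mean `instructions_per_op` can be computed directly from the figures above. The snippet below is plain arithmetic on the reported means (the helper name is illustrative); it comes out at roughly 0.08% for the read path and 0.11% for the write path, which is comparable in size to the reported standard deviations.

```python
# Mean instructions_per_op copied from the results above: (before, after).
MEANS = {
    "read": (39661.51, 39691.68),
    "write": (50942.86, 51000.15),
}


def relative_change_percent(before: float, after: float) -> float:
    """Relative change of `after` with respect to `before`, in percent."""
    return (after - before) / before * 100.0


if __name__ == "__main__":
    for path, (before, after) in MEANS.items():
        delta = after - before
        pct = relative_change_percent(before, after)
        print(f"{path}: {delta:+.2f} instructions/op ({pct:+.3f}%)")
```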

Fixes: https://github.com/scylladb/scylladb/issues/14067
Refs: https://github.com/scylladb/scylladb/issues/2871

No backport, as it is a new feature.

Closes scylladb/scylladb#23917

* github.com:scylladb/scylladb:
  tests/cluster: Add new storage tests
  test/scylla_cluster: Override workdir when passed via cmdline
  streaming: Reject incoming migrations
  storage_service: extend locator::load_stats to collect per-node critical disk utilization flag
  repair_service: Add a facility to disable the service
  compaction_manager: Subscribe to out of space controller
  compaction_manager: Replace enabled/disabled states with running state
  database: Add critical_disk_utilization mode database can be moved to
  disk_space_monitor: add subscription API for threshold-based disk space monitoring
  docs: Add feature documentation
  config: Add critical_disk_utilization_level option
  replica/exceptions: Add a new custom replica exception

2025-08-30 18:47:57 +03:00

.github

trigger scylla-ci Jenkins job by command

2025-08-27 14:12:28 +03:00

abseil @ d7aaad83b4

…

alternator

Merge 'Introduce view building coordinator' from Michał Jadwiszczak

2025-08-29 17:28:44 +02:00

api

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

audit

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

auth

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

bin

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

cdc

Merge 'Introduce view building coordinator' from Michał Jadwiszczak

2025-08-29 17:28:44 +02:00

cmake

PowerPC: remove ppc stuff

2025-07-08 10:38:23 +03:00

compaction

compaction_manager: Subscribe to out of space controller

2025-08-29 14:56:07 +02:00

conf

scylla.yaml: add recommended value for stream_io_throughput_mb_per_sec

2025-07-25 10:45:32 +03:00

cql3

Merge 'Introduce view building coordinator' from Michał Jadwiszczak

2025-08-29 17:28:44 +02:00

data_dictionary

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

debug

…

dht

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

dist

dist/docker: add dc and rack arguments

2025-08-24 17:48:07 +03:00

docs

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

ent

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

exceptions

exceptions.hh: fix message argument passing

2025-08-13 13:39:52 +02:00

gms

Merge 'Introduce view building coordinator' from Michał Jadwiszczak

2025-08-29 17:28:44 +02:00

idl

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

index

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

keys

keys: from_nodetool_style_string don't split single partition keys

2025-08-14 19:52:04 +03:00

lang

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

licenses

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

locator

storage_service: extend locator::load_stats to collect per-node critical disk utilization flag

2025-08-29 14:56:13 +02:00

message

Merge 'Introduce view building coordinator' from Michał Jadwiszczak

2025-08-29 17:28:44 +02:00

mutation

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

mutation_writer

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

node_ops

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

pgo

Update pgo profiles - aarch64

2025-08-15 05:03:28 +03:00

raft

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

readers

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

reloc

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

repair

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

replica

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

rust

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

schema

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

scripts

docs: expose alternator metrics

2025-08-22 09:49:52 +03:00

seastar @ c2d9893334

Update seastar submodule

2025-08-30 14:53:34 +03:00

service

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

sstables

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

streaming

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

swagger-ui @ 12f1da1082

…

tasks

tasks: do not use binary progress for task manager tasks

2025-07-25 10:45:32 +03:00

test

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

tools

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

tracing

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

transport

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

types

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

unified

treewide: improve bash error reporting

2025-02-10 18:28:52 +03:00

utils

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

.clang-format

…

.dockerignore

…

.gitattributes

configure.py: prepare the build for a default PGO profile in version control

2024-12-27 16:16:04 +08:00

.gitignore

.gitignore: add rust target

2025-08-19 13:09:18 +03:00

.gitmodules

build: replace tools/java submodule with packaged cassandra-stress

2025-04-15 10:11:28 +03:00

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

absl-flat_hash_map.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

amplify.yml

…

backlog_controller.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

build_mode.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_fwd.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_ostream.hh

treewide: Replace __builtin_expect with (un)likely

2025-07-03 13:34:04 +03:00

bytes.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes.hh

bytes: adapt fmt_hex to std::span<const std::byte>

2025-04-01 00:07:27 +02:00

cache_temperature.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cartesian_product.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cell_locking.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

client_data.hh

transport/server: use scheduling group assigned to current user

2025-01-02 07:13:34 +01:00

clocks-impl.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

clocks-impl.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

CMakeLists.txt

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

collection_mutation.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

collection_mutation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

column_computation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

combine.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

concrete_types.hh

types: implement vector_type_impl

2025-01-26 19:36:41 +01:00

configure.py

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

CONTRIBUTING.md

Fix typos

2025-02-11 00:17:43 +02:00

converting_mutation_partition_applier.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

converting_mutation_partition_applier.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counters.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counters.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

coverage_excludes.txt

…

coverage_sources.list

…

cql_serialization_format.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

db_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

debug.cc

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

debug.hh

gdb: protect debug::the_database from lto

2025-01-23 22:26:04 +02:00

default.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

Doxyfile

…

duration.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

duration.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

encoding_stats.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

enum_set.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

fix_system_distributed_tables.py

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

flake.lock

…

flake.nix

…

frozen_schema.cc

db: atomically apply changes to tables and views

2025-07-10 10:46:55 +02:00

frozen_schema.hh

db: atomically apply changes to tables and views

2025-07-10 10:46:55 +02:00

gc_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gdbinit

…

gen_segmented_compress_params.py

compress: move compress.cc/hh to sstables/compressor

2025-07-31 13:10:41 +03:00

generic_server.cc

generic_server: use utils::scoped_item_list

2025-08-01 02:32:14 +03:00

generic_server.hh

generic_server: use utils::scoped_item_list

2025-08-01 02:32:14 +03:00

HACKING.md

build: replace tools/java submodule with packaged cassandra-stress

2025-04-15 10:11:28 +03:00

hashing_partition_visitor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

idl-compiler.py

idl-compiler.py: generate skip() definition for enums serializers

2025-06-24 11:05:31 +03:00

inet_address_vectors.hh

storage_proxy: handle node_local_only in mutate

2025-07-24 19:48:08 +02:00

init.cc

gms,init: Move get_disabled_features_from_db_config() from gms

2025-07-21 19:20:17 +03:00

init.hh

Merge 'Move feature-service config creation code out of feature-service itself' from Pavel Emelyanov

2025-07-29 08:17:49 +03:00

install-dependencies.sh

remove ./redis and dependencies

2025-08-20 17:53:23 +03:00

install.sh

install.sh: simplify check_usermode_support()

2025-02-24 11:29:30 +03:00

LICENSE-ScyllaDB-Source-Available.md

Fix typos

2025-02-13 01:54:08 +02:00

main.cc

Merge 'Add out of space prevention mechanisms' from Łukasz Paszkowski

2025-08-30 18:47:57 +03:00

marshal_exception.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

multishard_mutation_query.cc

readers/mutation_source: s/make_reader_v2/make_mutation_reader/

2025-05-09 07:53:29 -04:00

multishard_mutation_query.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_query.cc

schema: deinline some speculative_retry methods

2025-01-02 12:28:33 +01:00

mutation_query.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

NOTICE.txt

PowerPC: remove ppc stuff

2025-07-08 10:38:23 +03:00

ORIGIN

…

partition_builder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_range_compat.hh

treewide: Move misc files to utils directory

2025-07-21 11:56:40 +03:00

partition_slice_builder.cc

tree: Remove unused boost headers

2025-02-25 10:32:32 +03:00

partition_slice_builder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_snapshot_reader.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

protocol_server.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

querier.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

querier.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

query_id.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_ranges_to_vnodes.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

query_ranges_to_vnodes.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query_result_merger.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-request.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

query-result-reader.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

query-result-set.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

query-result-set.hh

db: prevent accidental copies of result_set_row by making it move-only

2025-02-17 09:48:08 +02:00

query-result-writer.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

query-result.hh

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

query.cc

mapreduce: add missing comma and space in mapreduce_request operator<<

2025-06-25 19:23:07 +02:00

reader_concurrency_semaphore_group.cc

treewide: fix misspellings

2025-01-05 16:13:09 +02:00

reader_concurrency_semaphore_group.hh

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: use named gate

2025-04-12 11:28:48 +03:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: use named gate

2025-04-12 11:28:48 +03:00

reader_permit.hh

reader_permit: mark check_abort() as const

2025-02-07 01:32:35 -05:00

README.md

README: adjust to reflect license change

2025-01-30 10:28:32 +03:00

real_dirty_memory_accounter.hh

moved cache files to db

2025-02-04 12:21:31 +03:00

release.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

release.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reversibly_mergeable.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

schema_mutations.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

schema_mutations.hh

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

schema_upgrader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla_post_install.sh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scylla-gdb.py

Update seastar submodule

2025-07-22 18:19:58 +02:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 2025.4.0-dev

2025-07-01 11:33:20 +03:00

seastarx.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serialization_visitors.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer_impl.hh

serializer_impl.hh: add as_input_stream(managed_bytes_view) overload

2025-05-13 10:32:32 +02:00

serializer.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serializer.hh

treewide: include boost headers as "system" headers

2025-08-22 17:21:24 +03:00

service_permit.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

shell.nix

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstable_dict_autotrainer.cc

compress: distribute compression dictionaries over shards

2025-05-07 14:43:18 +02:00

sstable_dict_autotrainer.hh

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

sstables_loader.cc

db/view/view_building_worker: register staging sstable to view building coordinator when needed

2025-08-27 10:23:03 +02:00

sstables_loader.hh

db/view/view_building_worker: register staging sstable to view building coordinator when needed

2025-08-27 10:23:03 +02:00

stdafx.cc

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

stdafx.hh

build: add precompiled headers to CMakeLists.txt

2025-08-27 21:37:54 +03:00

supervisor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

table_helper.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

table_helper.hh

audit: Add the audit subsystem

2025-01-15 11:10:35 +01:00

test.py

test.py: metrics: add host_id suffix to .db file

2025-08-19 11:33:11 +00:00

timeout_config.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timeout_config.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

timestamp.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_extension.hh

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

tombstone_gc_options.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc_options.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc-internals.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone_gc.cc

tombstone_gc: introduce tombstone_gc_state_snapshot

2025-08-11 07:09:14 +03:00

tombstone_gc.hh

tombstone_gc: introduce tombstone_gc_state_snapshot

2025-08-11 07:09:14 +03:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

unimplemented.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

validation.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

validation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

version.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

view_info.hh

base_info: remove the lw_shared_ptr variant

2025-04-24 01:08:40 +02:00

vint-serialization.cc

treewide: Replace __builtin_expect with (un)likely

2025-07-03 13:34:04 +03:00

vint-serialization.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring a very recent C++23 compiler and recent versions of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (these checks are not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.5%

Python 26.2%

CMake 0.4%

GAP 0.3%

Shell 0.3%