This series is an alternative approach to https://github.com/scylladb/scylladb/pull/18646 and https://github.com/scylladb/scylladb/pull/19181. In this series we only change where the view backlog gets updated - we do not ensure that the view update backlog returned in a response is the backlog that increased due to the corresponding write; the returned backlog may be outdated by up to 10ms. Because this series does not include that change, it is considerably less complex and it doesn't modify the common write path, so no particular performance considerations were needed in that context. The issue being fixed is still the same; the full description can be seen below.

When a replica applies a write on a table which has a materialized view, it generates view updates. These updates take memory, which is tracked by `database::_view_update_concurrency_sem`, separately on each shard. The fraction of units taken from the semaphore, relative to the semaphore limit, is the shard's view update backlog. Based on these backlogs, we want to estimate how busy a node is with its view update work. We do that by taking the max backlog across all shards.
To avoid excessive cross-shard operations, the node's (max) backlog isn't calculated each time we need it, but at most once per 10ms (the `_interval`), with an optimization where the backlog of the calculating shard is always up to date (we don't need cross-shard operations for it):

```
update_backlog node_update_backlog::fetch() {
    auto now = clock::now();
    if (now >= _last_update.load(std::memory_order_relaxed) + _interval) {
        _last_update.store(now, std::memory_order_relaxed);
        auto new_max = boost::accumulate(
                _backlogs,
                update_backlog::no_backlog(),
                [] (const update_backlog& lhs, const per_shard_backlog& rhs) {
                    return std::max(lhs, rhs.load());
                });
        _max.store(new_max, std::memory_order_relaxed);
        return new_max;
    }
    return std::max(fetch_shard(this_shard_id()), _max.load(std::memory_order_relaxed));
}
```

For the same reason, even when we do calculate the new node backlog, we don't read from `_view_update_concurrency_sem`. Instead, for each shard we also store an `update_backlog` atomic which we use for the calculation:

```
struct per_shard_backlog {
    // Multiply by 2 to defeat the prefetcher
    alignas(seastar::cache_line_size * 2) std::atomic<update_backlog> backlog = update_backlog::no_backlog();
    need_publishing need_publishing = need_publishing::no;

    update_backlog load() const {
        return backlog.load(std::memory_order_relaxed);
    }
};
std::vector<per_shard_backlog> _backlogs;
```

Due to this distinction, the `update_backlog` atomic needs to be updated separately, whenever `_view_update_concurrency_sem` changes.
This is done by calling `storage_proxy::update_view_update_backlog`, which reads the `_view_update_concurrency_sem` of the shard (in `database::get_view_update_backlog`) and then calls `node_update_backlog::add`, where the backlog that was read is stored in the atomic:

```
void storage_proxy::update_view_update_backlog() {
    _max_view_update_backlog.add(get_db().local().get_view_update_backlog());
}

void node_update_backlog::add(update_backlog backlog) {
    _backlogs[this_shard_id()].backlog.store(backlog, std::memory_order_relaxed);
    _backlogs[this_shard_id()].need_publishing = need_publishing::yes;
}
```

For this implementation of calculating the node's view update backlog to work, we need the atomics to be updated correctly when the semaphores of the corresponding shards change. The main event where the view update backlog changes is an incoming write request. That's why, when handling the request and preparing a response, we update the backlog by calling `storage_proxy::get_view_update_backlog` (also because we want to read the backlog and send it in the response).

Backlog update after local view updates (`storage_proxy::send_to_live_endpoints` in `mutate_begin`):

```
auto lmutate = [handler_ptr, response_id, this, my_address, timeout] () mutable {
    return handler_ptr->apply_locally(timeout, handler_ptr->get_trace_state())
            .then([response_id, this, my_address, h = std::move(handler_ptr), p = shared_from_this()] {
        // make mutation alive until it is processed locally, otherwise it
        // may disappear if write timeouts before this future is ready
        got_response(response_id, my_address, get_view_update_backlog());
    });
};
```

Backlog update after remote view updates (`storage_proxy::remote::handle_write`):

```
auto f = co_await coroutine::as_future(send_mutation_done(netw::messaging_service::msg_addr{reply_to, shard},
        trace_state_ptr, shard, response_id, p->get_view_update_backlog()));
```

Now assume that on a certain node we have a write request received on shard A, which updates a row on shard B (A != B).
As a result, shard B will generate view updates and consume units from its `_view_update_concurrency_sem`, but will not update its atomic in `_backlogs` yet. Because both shards in the example are on the same node, shard A will perform a local write by calling the `lmutate` shown above. In the `lmutate` call, `apply_locally` will initiate the actual write on shard B, and `storage_proxy::update_view_update_backlog` will be called back on shard A. At no point does the backlog atomic on shard B get updated, even though it grew due to the view updates generated there.

Currently, what we calculate there doesn't matter much - it's only used for the MV flow control delays, so in this scenario we may only overload a replica, causing failed replica writes which will later be retried as hints. However, when we add MV admission control, the calculated backlog will be the difference between an accepted and a rejected request.

Fixes: https://github.com/scylladb/scylladb/issues/18542

Without admission control (https://github.com/scylladb/scylladb/pull/18334), this patch doesn't affect much, so I'm marking it as backport/none

Closes scylladb/scylladb#19341

* github.com:scylladb/scylladb:
  test: add test for view backlog not being updated on correct shard
  test: move auxiliary methods for waiting until a view is built to util
  mv: update view update backlog when it increases on correct shard
Scylla
What is Scylla?
Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.
For more information, please see the ScyllaDB web site.
Build Prerequisites
Scylla is fairly fussy about its build environment, requiring very recent versions of a C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).
Building Scylla
Building Scylla with the frozen toolchain dbuild is as easy as:
$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla
For further information, please see:
- Developer documentation for more information on building Scylla.
- Build documentation on how to build Scylla binaries, tests, and packages.
- Docker image build documentation for information on how to build Docker images.
Running Scylla
To start the Scylla server, run:
$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1
This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory.
The --developer-mode flag is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (these checks are not relevant on development workstations).
Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.
For more run options, run:
$ ./tools/toolchain/dbuild ./build/release/scylla --help
Testing
See test.py manual.
Scylla APIs and compatibility
By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.
Documentation
Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.
Training
Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.
Contributing to Scylla
If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.
If you are a developer working on Scylla, please read the developer guidelines.
Contact
- The community forum and Slack channel are for users to discuss configuration, management, and operations of open-source ScyllaDB.
- The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.