mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Go to file

Kamil Braun 9e85921006 storage_proxy: remove a feedback loop from the speculative retry latency metric

To handle a read request from a client, the coordinator node must send
data and digest requests to replicas, reconcile the obtained results
(by merging the obtained mutations and comparing digests), and possibly
send more requests to replicas if the digests turned out to be different
in order to perform read repair and preserve consistency of observed reads.

In contrast to writes, where coordinators send their mutation write requests
to all replicas in the replica set, for reads the coordinators send
their requests only to as many replicas as is required to achieve
the desired CL.

For example consider RF=3 and a CL=QUORUM read. Then the coordinator sends
its request to a subset of 2 nodes out of the 3 possible replicas. The
choice of the 2-node subset is random; the distribution used for the
random roll is affected by certain things such as the "cache hitrate"
metric. The details are not that relevant for this discussion.

If not all of the the initially chosen replicas
answer within a certain time period, the coordinator may send an
additional request to one more replica, hoping that this replica helps
achieving the desired CL so the entire client request succeeds. This
mechanism is called "speculative retry" and is enabled by default.

This time period - call it `T` - is chosen based on keyspace
configuration. The default value is "99.0PERCENTILE", which means that
`T` is roughly equal to the 99th percentile of the latency distribution
of previous requests (or at least the most recent requests; the
algorithm uses an exponential decay strategy to make old request less
relevant for the metric). The latencies used are the durations of whole
coordinator read requests: each such duration measurement starts before
the first replica request is sent and ends after the last replica
request is answered, among the replica requests whose results were used
for the reconciled result returned to the client (there may be more
requests sent later "in the background" - they don't affect the client
result and are not taken into account for the latency measurement).

This strategy, however, gives an undesired effect which appears
when a significant part of all requests require a speculative retry to
succeed. To explain this effect it's best to consider a scenario which
takes this to the extreme - where *all* requests require a speculative retry.

Consider RF=3 and CL=QUORUM so each read request initially uses 2
replicas. Let {A, B, C} be the set of replicas. We run a uniformly
distributed read workload.

Initially the cluster operates normally. Roughly 1/3 of all requests go
to replicas {A, B}, 1/3 go to {A, C}, and 1/3 go to {B, C}. The 99th
percentile of read request latencies is 50ms. Suppose that the average
round-trip latency between a coordinator and any replica is 10ms.

Suddenly replica C is hard-killed: non-graceful shutdown, e.g. power
outage. This means that other nodes are initially not aware that C is down,
they must wait for the failure detector to convict C as unavailable
which happens after a configurable amount of time. The current default
is 20s, meaning that by default coordinators will still attempt to send
requests to C for 20s after it is hard-killed.

During this period the following happens:
- About 2/3 of all requests - the ones which were routed to {A, C} and
  {B, C} - do not finish within 50ms because C does not answer. For
  these requests to finish, the coordinator performs a speculative retry
  to the third replica which finishes after ~10ms (the average round-trip
  latency). Thus the entire request, from the coordinator's POV, takes ~60ms.
- Eventually (very quickly in fact - assuming there are many concurrent
  requests) the P99 latency rises to 60ms.
- Furthermore, the requests which initially use {A, C} and {B, C} start
  taking more than 2/3 of all requests because they are stuck in the foreground
  longer than the {A, B} requests (since their latencies are higher).
- These requests do not finish within 60ms. Thus coordinators perform
  speculative retries. Thus they finish after ~70ms.
- Eventually the P99 latency rises to 70ms.
- These bad requests take an even longer portion of all requests.
- These requests do not finish within 70ms. They finish after ~80ms.
- Eventually the P99 latency rises to 80ms.
- And so on.

In metrics, we observe the following:
- Latencies rise roughly linearly. They rise until they hit a certain limit;
  this limit comes from the fact that `T` is upper-bounded by the
  read request timeout parameter divided by 2. Thus if the read request
  timeout is `5s` and P99 latencies are `3s`, `T` will be `2.5s`, not `3s`.
  Thus eventually all requests will take about `2.5s + 10ms` to finish
  (`2.5s` until speculative retry happens, `10ms` for the last round-trip),
  unless the node is marked as DOWN before we reach that limit.
- Throughput decreases roughly proportionally to the y = 1/x function, as
  expected from Little's law.

Everything goes back to normal when nodes mark C as DOWN, which happens
after ~20s by default as explained above. Then coordinators start
routing all requests to {A, B} only.

This does not happen for graceful shutdowns, where C announces to the
cluster that it's shutting down before shutting down, causing other
nodes to mark it as DOWN almost immediately.

The root cause of the issue is a feedback loop in the metric used to
calculate `T`: we perform a speculative retry after `T` -> P99 request
latencies rise above `T + 10ms` -> `T` rises above `T + 10ms` -> etc.

We fix the problem by changing the measurements used for calculating
`T`. Instead of measuring the entire coordinator read latency, we
measure each replica request separately and take the maximum over these
measurements. We only take into account the measurements for requests
that actually contributed to the request's result.

The previous statistic would also measure failed requests latencies. Now we
measure only latencies of successful replica requests. Indeed this makes
sense for the speculative retry use case; the idea behind speculative retry
is that we assume that requests usually succeed within a certain time
period, and we should perform the retry if they take longer than that.
To measure this time period, taking failed requests into account doesn't
make much sense.

In the scenario above, for a request that initially goes to {A, C}, the
following would happen after applying the fix:
- We send the requests to A and C.
- After ~10ms A responds. We record the ~10ms measurement.
- After ~50ms we perform speculative retry, sending a request to B.
- After ~10ms B responds. We record the ~10ms measurement.

The maximum over recorded measurements is ~10ms, not ~60ms.
The feedback loop is removed.

Experiments show that the solution is effective: in scenarios like
above, after C is killed, latencies only rise slightly by a constant
amount and then maintain their level, as expected. Throughput also drops
by a constant amount and maintains its level instead of continuously
dropping with an asymptote at 0.

Fixes #3746.
Fixes #7342.

Closes #8783

2021-06-13 16:19:11 +03:00

.github

docs: added multiversion_regex_builder

2021-01-13 11:07:29 +02:00

abseil @ 9c6a50fdd8

Update abseil submodule

2021-02-08 15:41:46 +02:00

alternator

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

api

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

auth

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cdc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

conf

config: relax batch size warning and failure thresholds

2021-04-06 20:56:06 +03:00

cql3

cql: create_keyspace_statement: move logger out of header file

2021-06-13 14:45:40 +03:00

config: add configuration option restrict_replication_simplestrategy

2021-06-13 14:45:16 +03:00

debug

…

dht

Merge 'dht: token: make some cosmetic changes' from Michał Chojnowski

2021-06-07 15:41:15 +03:00

dist

dist: rpm: Add specific versioning and python3 dependency

2021-06-09 20:02:43 +03:00

docs

docs/guides/debugging.md: expand section on libthread-db

2021-06-12 21:36:47 +03:00

exceptions

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gms

gossip: Handle nodes removed from live endpoints directly

2021-06-09 15:02:25 +02:00

idl

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

index

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

interface

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

libdeflate @ e7e54eab42

…

licenses

Add abseil as a submodule

2020-06-14 08:18:37 -07:00

locator

Merge 'locator: token_metadata: simplify tokens_iterator' from Michał Chojnowski

2021-06-08 15:42:41 +03:00

message

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_writer

flat_mutation_reader: unify reader_consumer declarations

2021-06-07 16:11:18 +03:00

raft

raft: (testing) test receiving a confchange in a snapshot

2021-06-11 17:16:56 +03:00

redis

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reloc

reloc: Remove "build_reloc.sh" script as obsolete

2020-11-20 22:41:26 +02:00

repair

repair: get_sharder_for_tables: throw no_such_column_family

2021-06-08 14:45:44 +03:00

scripts

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

seastar @ 4506b8784a

Update seastar submodule

2021-05-28 11:47:54 +03:00

service

storage_proxy: remove a feedback loop from the speculative retry latency metric

2021-06-13 16:19:11 +03:00

sstables

compaction_manager: stop_ongoing_compactions: print reason for stopping

2021-06-10 11:52:57 +03:00

streaming

Merge 'streaming: make_streaming_consumer: close reader on errors' from Benny Halevy

2021-06-09 15:02:36 +03:00

swagger-ui @ 12f1da1082

…

test

Merge "raft: add tests for non-voters and fix related bugs" from Kostja

2021-06-12 21:36:47 +03:00

thrift

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

tools

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

tracing

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

transport

transport: Untie transport and database

2021-06-09 20:04:12 +03:00

types

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

unified

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

utils

utils/enum_option.hh: add implicit converter to the underlying enum

2021-06-13 13:18:49 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

docs: added theme

2020-12-03 17:37:18 +01:00

.gitmodules

scylla-python3: move scylla-python3 to separated repository

2020-08-18 09:34:08 +03:00

.gitorderfile

…

absl-flat_hash_map.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

absl-flat_hash_map.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell_hash.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell_or_collection.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

atomic_cell.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

backlog_controller.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes_ostream.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

bytes.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cache_flat_mutation_reader.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

cache_temperature.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

caching_options.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

caching_options.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

canonical_mutation.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

canonical_mutation.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

cartesian_product.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cell_locking.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

checked-file-impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clocks-impl.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clocks-impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_bounds_comparator.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_interval_set.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_key_filter.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

clustering_ranges_walker.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

CMakeLists.txt

db: Add virtual tables interface

2021-05-12 17:05:34 +02:00

collection_mutation.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

collection_mutation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

column_computation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

combine.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compaction_garbage_collector.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

compaction_strategy_type.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compaction_strategy.hh

flat_mutation_reader: unify reader_consumer declarations

2021-06-07 16:11:18 +03:00

compatible_ring_position.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

compound_compat.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compound.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compress.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

compress.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

concrete_types.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

configure.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

connection_notifier.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

connection_notifier.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

CONTRIBUTING.md

CONTRIBUTING.md: add the requirement for self-contained headers

2021-05-05 15:10:46 +03:00

converting_mutation_partition_applier.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

converting_mutation_partition_applier.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

counters.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

counters.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

cql_serialization_format.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

database_fwd.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

database.cc

repair: get_sharder_for_tables: throw no_such_column_family

2021-06-08 14:45:44 +03:00

database.hh

repair: get_sharder_for_tables: throw no_such_column_family

2021-06-08 14:45:44 +03:00

db_clock.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

debug.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

default.nix

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

digest_algorithm.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

digester.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

dirty_memory_manager.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

distributed_loader.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

distributed_loader.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

Doxyfile

…

duration.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

duration.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

encoding_stats.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

enum_set.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

fix_system_distributed_tables.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

flat_mutation_reader.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

flat_mutation_reader.hh

flat_mutation_reader: unify reader_consumer declarations

2021-06-07 16:11:18 +03:00

frozen_mutation.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

frozen_mutation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

frozen_schema.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

frozen_schema.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gc_clock.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

gen_segmented_compress_params.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

generic_server.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

generic_server.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

HACKING.md

HACKING.md: redirect to ./coverage.py for more details

2021-05-21 11:50:39 +03:00

hashers.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

hashers.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

hashing_partition_visitor.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

hashing.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

idl-compiler.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

inet_address_vectors.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

init.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

init.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

install-dependencies.sh

build: drop lld from install-dependencies.sh on s390x

2021-04-12 09:46:33 +03:00

install.sh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

interval.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

intrusive_set_external_comparator.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

keys.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

keys.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

LICENSE.AGPL

…

lister.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

lister.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

log.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

lua.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

lua.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

main.cc

main: improve process file limit handling

2021-06-13 09:19:35 +03:00

map_difference.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

marshal_exception.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

memtable-sstable.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

memtable.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

memtable.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

multishard_mutation_query.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

multishard_mutation_query.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_cleaner.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_compactor.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_consumer_concepts.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_fragment_stream_validator.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_fragment.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_fragment.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_partition_serializer.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_partition_serializer.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition_view.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_partition_view.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition_visitor.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_partition.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_query.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_query.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

mutation_reader.cc

queue_reader_handle: mark copy constructor noexcept

2021-06-09 20:09:01 +03:00

mutation_reader.hh

queue_reader_handle: mark copy constructor noexcept

2021-06-09 20:09:01 +03:00

mutation_rebuilder.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation_source_metadata.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

mutation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

noexcept_traits.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

NOTICE.txt

raft: etcd unit tests: initial boost tests

2021-01-18 12:33:12 -04:00

ORIGIN

…

partition_builder.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

partition_range_compat.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_slice_builder.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_slice_builder.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_snapshot_reader.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

partition_snapshot_row_cursor.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_version_list.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_version.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

partition_version.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

position_in_partition.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

querier.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

querier.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query_class_config.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query_result_merger.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-request.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result-reader.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

query-result-set.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result-set.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

query-result-writer.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query-result.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

query.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

range_tombstone_list.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

range_tombstone_list.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

range_tombstone.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

range_tombstone.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

range.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

read_context.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reader_concurrency_semaphore.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reader_concurrency_semaphore.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reader_permit.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

README.md

docs: fix invalid path in README.mds

2021-02-21 13:49:12 +02:00

real_dirty_memory_accounter.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

release.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

release.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

reversibly_mergeable.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

row_cache.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

row_cache.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

schema_builder.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_fwd.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_mutations.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_mutations.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_registry.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_registry.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema_upgrader.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

schema.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

scylla_post_install.sh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

scylla-gdb.py

scylla-gdb: Remove maximum-request-size report

2021-06-11 19:06:43 +02:00

SCYLLA-VERSION-GEN

version: prepare for the 4.6 cycle

2021-04-01 20:40:52 +03:00

seastarx.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serialization_visitors.hh

…

serializer_impl.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serializer.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

serializer.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

service_permit.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

setup.py

…

shell.nix

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

supervisor.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

table_helper.cc

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

table_helper.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

table.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

test.py

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timeout_config.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timeout_config.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

timestamp.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

to_string.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

tombstone.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

tox.ini

…

types.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

types.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

ubsan-suppressions.supp

suppress ubsan error in boost::deque::clear()

2020-11-09 11:25:19 +02:00

unimplemented.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

unimplemented.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

user_types_metadata.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

validation.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

validation.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

version.hh

treewide: reduce header interdependencies

2021-06-07 15:58:35 +03:00

view_info.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

vint-serialization.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

vint-serialization.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

xx_hasher.hh

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

zstd.cc

treewide: extent copyright statements to present day

2021-06-06 19:18:49 +03:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.3%

Python 26.5%

CMake 0.3%

GAP 0.3%

Shell 0.3%