mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 15:03:06 +00:00

Go to file

Nadav Har'El d61513c41c Merge 'reader_concurrency_semaphore: make CPU concurrency configurable' from Botond Dénes

The reader concurrency semaphore restricts the concurrency of reads that require CPU (intention: they read from the cache) to 1, meaning that if there is even a single active read which declares that it needs just CPU to proceed, no new read is admitted. This is meant to keep the concurrency of reads in the cache at 1. The idea is that concurrency in the cache is not useful: it just leads to the reactor rotating between these reads, all of the finishing later then they could if they were the only active read in the cache.
This was observed to backfire in the case where there reads from a single table are mostly very fast, but on some keys are very slow (hint: collection full of tombstones). In this case the slow read keeps up the fast reads in the queue, increasing the 99th percentile latencies significantly.

This series proposes to fix this, by making the CPU concurrency configurable. We don't like tunables like this and this is not a proper fix, but a workaround. The proper fix would be to allow to cut any page early, but we cannot cut a page in the middle of a row. We could maybe have a way of detecting slow reads and excluding them from the CPU concurrency. This would be a heuristic and it would be hard to get right. So in this series a robust and simple configurable is offered, which can be used on those few clusters which do suffer from the too strict concurrency limit. We have seen it in very few cases so far, so this doesn't seem to be wide-spread.

Fixes: https://github.com/scylladb/scylladb/issues/19017

This fixes a regression introduced in 5.0, so we have to backport to all currently supported releases

Closes scylladb/scylladb#19018

* github.com:scylladb/scylladb:
  test/boost/reader_concurrency_semaphore_test: add test for live-configurable cpu concurrenc  Please enter the commit message for your changes. Lines starting
  test/boost/reader_concurrency_semaphore_test: hoist require_can_admit
  reader_concurrency_semaphore: wire in the configurable cpu concurrency
  reader_concurrency_semaphore: add cpu_concurrency constructor parameter
  db/config: introduce reader_concurrency_semahore_cpu_concurrency

2024-07-02 13:39:00 +03:00

.github

.github/scripts/label_promoted_commits.py: fix adding labels when PR is closed

2024-06-27 14:00:44 +03:00

abseil @ d7aaad83b4

build: bring abseil submodule back

2024-05-05 23:31:09 +03:00

alternator

bytes: drop unused operator<<

2024-06-25 12:11:28 +03:00

api

Merge 'Close output stream in task manager's API get_tasks handler' from Pavel Emelyanov

2024-06-30 19:34:00 +03:00

auth

auth: do not include unused headers

2024-06-25 12:11:28 +03:00

bin

install.sh: use the native nodetool directly

2024-04-25 22:52:00 +03:00

cdc

auth: do not include unused headers

2024-06-25 12:11:28 +03:00

cmake

build: remove aarch64 workarounds

2024-06-28 17:53:51 +03:00

compaction

compaction_manager: define compaction_manager::strategy_control earlier

2024-06-27 17:54:12 +03:00

conf

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

cql3

cql3: define dtor of modification_statement in .cc file

2024-06-30 19:35:05 +03:00

data_dictionary

data_dictionary: keyspace_metadata: format: print also initial_tablets

2024-05-31 10:09:58 +03:00

Merge 'reader_concurrency_semaphore: make CPU concurrency configurable' from Botond Dénes

2024-07-02 13:39:00 +03:00

debug

…

dht

dht: remove unused operator<<

2024-06-18 11:26:20 +08:00

direct_failure_detector

direct_failure_detector: increase ping timeout and make it tunable

2024-05-07 23:40:23 +02:00

dist

cqlsh: update cqlsh submodule

2024-06-26 12:07:21 +03:00

docs

Docs: Fix a typo in sstable-corruption.rst

2024-07-02 11:58:27 +02:00

exceptions

Merge '.github: change severity to error in clang-include-cleaner ' from Kefu Chai

2024-06-12 10:16:17 +03:00

gms

Merge notify other nodes on boot from Gleb

2024-06-25 17:58:17 +02:00

idl

Merge notify other nodes on boot from Gleb

2024-06-25 17:58:17 +02:00

index

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

lang

lang: remove unused operator<<

2024-06-18 15:55:22 +08:00

licenses

…

locator

locator/topology: update_node: format also shard_count in debug log message

2024-06-12 10:04:23 +03:00

message

gossiper: move gossip verbs to the idl

2024-06-17 12:47:17 +03:00

mutation

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

mutation_writer

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

node_ops

node_ops: remove unused operator<<

2024-06-18 15:55:22 +08:00

raft

raft: add more raft metrics to make debug easier

2024-07-01 10:55:22 +02:00

readers

readers: define query::partition_slice before using it in default argument

2024-06-27 19:36:13 +03:00

redis

code: Switch to sched group in request_stop_server()

2024-05-24 18:00:01 +03:00

reloc

reloc: create $BUILDDIR for getting its path

2024-05-01 09:52:17 +03:00

repair

repair: remove unused operator<<

2024-06-26 21:57:03 +03:00

replica

Merge 'reader_concurrency_semaphore: make CPU concurrency configurable' from Botond Dénes

2024-07-02 13:39:00 +03:00

rust

rust: disable incremental build for release build

2024-06-20 12:01:14 +03:00

schema

Merge 'schema: Make "describe" use extensions to string' from Calle Wilund

2024-06-18 11:28:11 +03:00

scripts

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

seastar @ 908ccd936a

Update seastar submodule

2024-06-21 18:52:58 +03:00

service

Merge 'co-routinize paxos_state functions' from Gleb

2024-07-02 11:54:13 +02:00

sstables

sstables/mx/writer: rebuild bloom filters with bad partition estimates

2024-06-24 12:06:02 +05:30

streaming

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

swagger-ui @ 12f1da1082

…

tasks

treewide: include seastar headers with brackets

2024-06-21 19:20:27 +03:00

test

Merge 'reader_concurrency_semaphore: make CPU concurrency configurable' from Botond Dénes

2024-07-02 13:39:00 +03:00

tools

Update tools/python3 submodule

2024-06-30 19:31:23 +03:00

tracing

cql3: Define prepared_statement weak pointer as const

2024-05-25 16:40:35 +03:00

transport

transport: Use sharded<>::invoke_on_others()

2024-06-25 22:17:59 +03:00

types

treewide: include seastar headers with brackets

2024-06-21 19:20:27 +03:00

unified

cqlsh: update cqlsh submodule

2024-06-26 12:07:21 +03:00

utils

config: avoid binding an lvalue reference to an rvalue reference

2024-06-27 19:36:13 +03:00

.dockerignore

…

.gitattributes

gitattributes: Mark swagger .js files as binary

2024-06-19 15:07:56 +03:00

.gitignore

git: add build.ninja.new to .gitignore

2024-06-24 16:48:50 +03:00

.gitmodules

build: bring abseil submodule back

2024-05-05 23:31:09 +03:00

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

amplify.yml

…

backlog_controller.hh

…

build_mode.hh

…

bytes_ostream.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

bytes.cc

bytes: drop unused operator<<

2024-06-25 12:11:28 +03:00

bytes.hh

bytes: drop unused operator<<

2024-06-25 12:11:28 +03:00

cache_mutation_reader.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

cache_temperature.hh

…

cartesian_product.hh

…

cell_locking.hh

…

checked-file-impl.hh

…

client_data.cc

…

client_data.hh

transport: do not return client_type from cql_server::connection::make_client_key()

2024-06-07 09:23:06 +08:00

clocks-impl.cc

…

clocks-impl.hh

…

clustering_bounds_comparator.hh

clustering_bounds_comparator: drop operator<< for bound_kind

2024-06-11 18:01:06 +02:00

clustering_interval_set.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

clustering_key_filter.hh

…

clustering_ranges_walker.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

CMakeLists.txt

build: switch to C++23

2024-06-27 19:36:13 +03:00

collection_mutation.cc

collection_mutation: improve collection_mutation_view formatting

2024-05-02 18:42:41 +03:00

collection_mutation.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

column_computation.hh

…

combine.hh

…

compound_compat.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

compound.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

compress.cc

…

compress.hh

compress, auth: include used headers

2024-05-30 09:16:23 +03:00

concrete_types.hh

use fmt::to_string() for seastar::net::inet_address

2024-02-05 16:56:40 +01:00

configure.py

Merge 'build: update C++ standard to C++23' from Avi Kivity

2024-06-28 18:02:33 +03:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

…

counters.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

coverage_excludes.txt

…

coverage_sources.list

…

cql_serialization_format.hh

…

db_clock.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

debug.cc

…

debug.hh

…

default.nix

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

Doxyfile

…

duration.cc

…

duration.hh

…

encoding_stats.hh

…

enum_set.hh

…

fix_system_distributed_tables.py

…

flake.lock

…

flake.nix

…

frozen_schema.cc

…

frozen_schema.hh

…

full_position.hh

…

gc_clock.hh

db: add formatter for gc_clock::time_point

2024-02-11 16:39:25 +02:00

gdbinit

…

gen_segmented_compress_params.py

…

generic_server.cc

generic_server: Fix indentation after previous patch

2024-05-03 12:29:08 +03:00

generic_server.hh

transport/controller: pass unix_domain_socket_permissions to generic_server::listen

2024-02-05 14:22:03 +01:00

HACKING.md

HACKING.md: fix typo in "--overprovisioned" option name

2024-06-25 12:11:28 +03:00

hashing_partition_visitor.hh

…

idl-compiler.py

idl-compiler: generate async serialization functions for stub members

2024-05-02 19:27:56 +03:00

inet_address_vectors.hh

…

init.cc

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

init.hh

…

install-dependencies.sh

install-dependencies.sh: set file mode creation mask to 0022

2024-06-24 19:46:15 +03:00

install.sh

install.sh: use the native nodetool directly

2024-04-25 22:52:00 +03:00

interval.hh

treewide: replace std::result_of_t with std::invoke_result_t

2024-05-26 16:45:42 +03:00

keys.cc

clustering_bounds_comparator: drop operator<< for bound_kind

2024-06-11 18:01:06 +02:00

keys.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

LICENSE.AGPL

…

log.hh

…

main.cc

config: avoid binding an lvalue reference to an rvalue reference

2024-06-27 19:36:13 +03:00

map_difference.hh

…

marshal_exception.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

multishard_mutation_query.cc

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

multishard_mutation_query.hh

…

mutation_query.cc

mutation_query: reconcilable_result: add merge_disjoint()

2024-02-21 02:08:48 -05:00

mutation_query.hh

treewide: Use partition_slice::is_reversed()

2024-03-13 08:52:46 +02:00

noexcept_traits.hh

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

interval: rename nonwrapping_interval to interval

2024-02-21 19:43:17 +02:00

partition_slice_builder.cc

…

partition_slice_builder.hh

…

partition_snapshot_reader.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

partition_snapshot_row_cursor.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

protocol_server.hh

protocol_server: Keep scheduling group on board

2024-05-24 17:54:29 +03:00

querier.cc

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

querier.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

query_id.hh

…

query_ranges_to_vnodes.cc

./: not include unused headers

2024-03-20 09:16:46 +02:00

query_ranges_to_vnodes.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

query_result_merger.hh

…

query-request.hh

query-request: use default-generated operator==

2024-03-07 09:02:42 +03:00

query-result-reader.hh

…

query-result-set.cc

…

query-result-set.hh

query-result-set: add formatter for query-result-set.hh types

2024-02-21 17:54:48 +08:00

query-result-writer.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

query-result.hh

query-result.hh: add formatter for query::result::printer

2024-02-21 17:57:18 +08:00

query.cc

treewide: do not define FMT_DEPRECATED_OSTREAM

2024-04-19 22:57:36 +08:00

read_context.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: wire in the configurable cpu concurrency

2024-06-27 09:57:11 -04:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: wire in the configurable cpu concurrency

2024-06-27 09:57:11 -04:00

reader_permit.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

README.md

README.md: add badges for cron jobs

2024-06-23 19:24:40 +03:00

real_dirty_memory_accounter.hh

…

release.cc

release: introduce doc_link()

2024-05-08 09:41:17 -04:00

release.hh

release: introduce doc_link()

2024-05-08 09:41:17 -04:00

reversibly_mergeable.hh

…

row_cache.cc

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

row_cache.hh

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

schema_mutations.cc

schema_mutations: add fmt::formatter for schema_mutations

2024-03-15 09:49:56 +02:00

schema_mutations.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

schema_upgrader.hh

…

scylla_post_install.sh

…

scylla-gdb.py

scylla-gdb.py: add line information to coroutine names in scylla fiber

2024-06-25 13:55:10 +03:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 6.1.0-dev

2024-05-22 14:08:56 +03:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

serializer_impl, sstables: fix build failure due to missing includes

2024-04-23 12:03:51 +03:00

serializer.cc

…

serializer.hh

…

service_permit.hh

…

setup.py

…

shell.nix

…

sstables_loader.cc

sstables-loader: Run loading in its scheduling group

2024-05-28 11:07:58 +03:00

sstables_loader.hh

sstables-loader: Add scheduling group to constructor

2024-05-28 11:07:22 +03:00

supervisor.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

table_helper.cc

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

table_helper.hh

…

test.py

Merge '[test.py] add --extra-scylla-cmdline-options argument for test.py' from Artsiom Mishuta

2024-06-28 11:11:29 +02:00

timeout_config.cc

…

timeout_config.hh

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

timestamp.hh

…

tombstone_gc_extension.hh

./: not include unused headers

2024-03-20 09:16:46 +02:00

tombstone_gc_options.cc

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

tombstone_gc_options.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

tombstone_gc.cc

cql3: statements: change default tombstone_gc mode for tablets

2024-04-24 10:42:10 +02:00

tombstone_gc.hh

cql3: statements: change default tombstone_gc mode for tablets

2024-04-24 10:42:10 +02:00

tox.ini

…

ubsan-suppressions.supp

…

unimplemented.cc

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

unimplemented.hh

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

validation.cc

…

validation.hh

…

version.hh

…

view_info.hh

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

vint-serialization.cc

…

vint-serialization.hh

…

zstd.cc

…

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain, This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of the ScyllaDB open source.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.1%

Python 26.7%

CMake 0.3%

GAP 0.3%

Shell 0.3%