scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-29 20:57:00 +00:00

Author	SHA1	Message	Date
Kamil Braun	e01cef01a6	Merge 'Ignore seed name resolution errors during the restart of a cluster member node.' from Sergey Zolotukhin All seeds hostname resolution errors will be ignored during a node restart in case the node had already joined a cluster. This will prevent restart errors if some seed names are not resolvable. Fixes scylladb/scylladb#14945 Closes scylladb/scylladb#20292 * github.com:scylladb/scylladb: Ignore seed name resolution errors on restart. Add a test for starting with a wrong seed.	2024-08-30 11:33:44 +02:00
Avi Kivity	7da3314deb	Merge 'Integrated restore' from Ernest Zaslavsky Handed over from https://github.com/scylladb/scylladb/pull/20149 This adds minimal implementation of the start-restore API call. The method starts a task that runs load-and-stream functionality against sstables from S3 bucket. Arguments are: ``` endpoint -- the ID in object_store.yaml config file bucket -- the target bucket to get objects from keyspace -- the keyspace to work on table -- the table to work on snapshot -- the name of the snapshot from which the backup was taken ``` The task runs in the background, its task_id is returned from the method once it's spawned and it should be used via /task_manager API to track the task execution and completion. Remote sstables components are scanned as if they were placed in local upload/ directory. Then colelcted sstables are fed into load-and-stream. This branch has https://github.com/scylladb/scylladb/pull/19890 (Integrated backup), https://github.com/scylladb/scylladb/pull/20120 (S3 lister) and few more minor PRs merged in. The restore branch itself starts with [utils: Introduce abstract (directory) lister](`29c867b54d`) commit. refs: https://github.com/scylladb/scylladb/issues/18392 Closes scylladb/scylladb#20305 * github.com:scylladb/scylladb: tools/scylla-nodetool: add restore integration test/object_store: Add simple restore test test/object_store: Generalize prepare_snapshot_for_backup() code: Introduce restore API method sstable_loader: Add sstables::storage_manager dependency sstable_loader: Maintain task manager module sstable_loader: Out-line constructor distributed_loader: Split get_sstables_from_upload_dir() sstables/storage: Compose uploaded sstable path simpler sstable_directory: Prepare FS lister to scan files on S3 sstable_directory: Parse sstable component without full path s3-client: Add support for lister::filter utils: Introduce abstract (directory) lister	2024-08-29 18:25:30 +03:00
Patryk Jędrzejczak	22d907e721	treewide: introduce support for zero-token nodes in Raft topology We revive the `join_ring` option. We support it only in the Raft-based topology, as we plan to remove the gossip-based topology when we fix the last blocker - the implementation of the manual recovery tool. In the Raft-based topology, a node can be assigned tokens only once when it joins the cluster. Hence, we disallow joining the ring later, which is possible in Cassandra. The main idea behind the solution is simple. We make the unsupported special case of zero tokens a supported normal case. Nodes with zero tokens assigned are called "zero-token nodes" from now on. From the topology point of view, zero-token nodes are the same as token-owning nodes. They can be in the same states, etc. From the data point of view, they are different. They are not members of the token ring, so they are not present in `token_metadata::_normal_token_owners`. Hence, they are ignored in all non-local replication strategies. The tablet load balancer also ignores them. Topology operations involving zero-token nodes are simplified: - `add` and `replace` finish in the `join_group0` state, so creating a new CDC generation and streaming are skipped, - `removenode` and `decommission` skip streaming, - `rebuild` does not even contact the topology coordinator as there is nothing to rebuild, Also, if the topology operation involves a token-owning node, zero-token nodes are ignored in streaming. Zero-token nodes can be used as coordinator-only nodes, just like in Cassandra. They can handle requests just like token-owning nodes. The main motivation behind zero-token nodes is that they can prevent the Raft majority loss efficiently. Zero-token nodes are group 0 voters, but they can run on much weaker and cheaper machines because they do not replicate data and handle client requests by default (drivers ignore them). For example, if there are two DCs, one with 4 nodes and one with 5 nodes, if we add a DC with 2 zero-token nodes, every DC will contain less than half of the nodes, so we won't lose the majority when any DC dies. Another way of preventing the Raft majority loss is changing the voter set, which is tracked by scylladb/scylladb#18793. That approach can be used together with zero-token nodes. In the example above, if we choose equal numbers of voters in both DCs, then a DC with one zero-token node will be sufficient. However, in the typical setup of 2 DCs with the same number of nodes it is enough to add a DC with only one zero-token node without changing the voter set. Zero-token nodes could also be used as load balancers in the Alternator.	2024-08-29 10:37:07 +02:00
Sergey Zolotukhin	65f37f3ba6	Ignore seed name resolution errors on restart. Gossiper seeds host name resolution failures are ignored during restart if a node is already boostrapped (i.e. it has successfully joined the cluster). Fixes scylladb/scylladb#14945	2024-08-28 14:01:04 +02:00
Pavel Emelyanov	1f3f0b1926	sstable_loader: Add sstables::storage_manager dependency The storage_manager maintains set of clients to configured object storage(s). The sstables loader is going to spawn tasks that will talk to to those storages, thus it needs the storage manager to get the clients clients from. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-08-27 16:15:41 +03:00
Pavel Emelyanov	06c3c53deb	sstable_loader: Maintain task manager module This service is going to start tasks managed by task manager. For that, it should have its module set up and registered. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-08-27 16:15:41 +03:00
Avi Kivity	72a85e3812	Merge 'Integrated backup' from Pavel Emelyanov This adds minimal implementation of the start-backup API call. The method starts a task that uploads all files from the given keyspace's snapshot to the requested endpoint/bucket. Arguments are: - endpoint -- the ID in object_store.yaml config file - bucket -- the target bucket to put objects into - keyspace -- the keyspace to work on - snapshot -- the method assumes that the snapshot had been already taken and only copies sstables from it The task runs in the background, its task_id is returned from the method once it's spawned and it should be used via /task_manager API to track the task execution and completion (hint: it's good to have non-zero TTL value to make sure fast backups don't finish before the caller manages to call wait_task API). Sstables components are scanned for all tables in the keyspace and are uploaded into the /bucket/${cf_name}/${snapshot_name}/ path. refs: #18391 Closes scylladb/scylladb#19890 * github.com:scylladb/scylladb: tools/scylla-nodetool: add backup integration docs: Document the new backup method test/object_store: Test that backup task is abortable test/object_store: Add simple backup test test/object_store: Move format_tuples() test/pylib: Add more methods to rest client backup-task: Make it abortable (almost) code: Introduce backup API method database: Export parse_table_directory_name() helper database: Introduce format_table_directory_name() helper snapshot-ctl: Add config to snapshot_ctl snapshot-ctl: Add sstables::storage_manager dependency snapshot-ctl: Maintain task manager module snapshot-ctl: Add "snapshots" logger snapshot-ctl: Outline stop() method and constructor snapshot-ctl: Inline run_snapshot_list<> test/cql_test_env: Export task manager from cql test env task_manager: Print task ttl on start (for debugging) docs: Update object_storage.md with AWS_ environment docs: Restructure object_storage.md	2024-08-25 20:19:10 +03:00
Pavel Emelyanov	38edbebb10	compaction_manager: Keep flush-all-before-major option on own config Currently the major compaction task impl grabs this (non-updateable) value from db::config. That's not good, all services including compaction manager have their own configs from which they take options. Said that, this patch puts the said option onto compaction_manager::config, makes use of it and configures one from db::config on start (and tests). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20174	2024-08-23 10:31:55 +03:00
Pavel Emelyanov	a812f13ddd	code: Introduce backup API method The method starts a task that uploads all files from the given keyspace's snapshot to the requested endpoint/bucket. The task runs in the background, its task_id is returned from the method once it's spawned and it should be used via /task_manager API to track the task execution and completion (hint: it's good to have non-zero TTL value to make sure fast backups don't finish before the caller manages to call wait_task API). If snapshot doesn't exist, nothing happens (FIXME, need to return back an error in that case). If endpoint is not configured locally, the API call resolves with bad-request instantly. Sstables components are scanned for all tables in the keyspace and are uploaded into the /bucket/${cf_name}/${snapshot_name}/ path. Task is not abortable (FIXME -- to be added) and doesn't really report its progress other than running/done state (FIXME -- to be added too). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-08-22 19:47:06 +03:00
Pavel Emelyanov	dff51fd58c	snapshot-ctl: Add config to snapshot_ctl Pretty much all services in Scylla have their own config. Add one to snapshot-ctl too, it will be populated later. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-08-22 14:57:20 +03:00
Pavel Emelyanov	f37857e20a	snapshot-ctl: Add sstables::storage_manager dependency The storage_manager maintains set of clients to configured object storage(s). The snapshot ctl is going to spawn tasks that will talk to those storages, thus it needs the storage manager to get the clients from. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-08-22 14:08:21 +03:00
Pavel Emelyanov	362331c89b	snapshot-ctl: Maintain task manager module This service is going to start tasks managed by task manager. For that, it should have its module set up and registered. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-08-22 14:08:21 +03:00
Botond Dénes	5bff422b54	service/storage_service: load_tablet_metadata(): add hint parameter Allowing for reloading only those parts of the tablet metadata that were actually changed.	2024-08-11 09:53:19 -04:00
Michał Jadwiszczak	870bdaa6b1	api/cql_server_test: add CQL server testing API Add a CQL server testing API with and endpoint to dump service level parameters of all CQL connections. This endpoint will be later used to test functionality of automated updating CQL connections parameters.	2024-08-08 10:42:09 +02:00
Piotr Dulikowski	1963619803	Merge 'Use cross shard barrier to start view builder' from Pavel Emelyanov When starting, view builder wants all shards to synchronize with each other in the middle of initialization. For that they all synchronize via shard-0's instance counter and a shared future. There's cross-shard barrier in utils/ that provides the same facility. Closes scylladb/scylladb#19954 * github.com:scylladb/scylladb: view_builder: Drop unused members view_builder: Use cross-shard barrier on start view_builder: Add cross-shard barrier to its .start() method	2024-08-07 08:54:15 +02:00
Avi Kivity	aa1270a00c	treewide: change assert() to SCYLLA_ASSERT() assert() is traditionally disabled in release builds, but not in scylladb. This hasn't caused problems so far, but the latest abseil release includes a commit [1] that causes a 1000 insn/op regression when NDEBUG is not defined. Clearly, we must move towards a build system where NDEBUG is defined in release builds. But we can't just define it blindly without vetting all the assert() calls, as some were written with the expectation that they are enabled in release mode. To solve the conundrum, change all assert() calls to a new SCYLLA_ASSERT() macro in utils/assert.hh. This macro is always defined and is not conditional on NDEBUG, so we can later (after vetting Seastar) enable NDEBUG in release mode. [1] `66ef711d68` Closes scylladb/scylladb#20006	2024-08-05 08:23:35 +03:00
Pavel Emelyanov	fb1b749445	view_builder: Add cross-shard barrier to its .start() method The barrier will be used by next patch to synchronize shards with each other. When passed to invoke_on_all() lambda like this, each lambda gets its its copy of the barrier "handler" that maintains shared state across shards. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-31 12:54:28 +03:00
Pavel Emelyanov	aaad2bbeaf	storage_service: Remote gossiper argument from join_cluster() This pointer was only needed to pull all the way down the hints resource manager start() method. It's no longer needed for that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-26 16:29:58 +03:00
Pavel Emelyanov	456dbc122b	api: Unset cache_service endpoints on stop They currently stay registered long after the dependent services get stopped. There's a need for batch unsetting (scylladb/seastar#1620), so currently only this explicit listing :( Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-24 18:51:32 +03:00
Pavel Emelyanov	61fb0ad996	main: Don't ignore set_cache_service() future The call itself seem to be in wrong place -- there's no "cache service" also the API uses database and snapshot_ctl to work on. So it deserves more cleanup, but at least don't throw the returned future<> away. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-24 18:51:32 +03:00
Pavel Emelyanov	e1eb48f9c2	api: Move storage API few steps above The sequence currently is sharded<storage_service>.start() sharded<query_processor>.invoke_on_all(start_remote) api::set_server_storage_service() The last two steps can be safely swapped to keep storage service API next to its service. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-24 18:51:32 +03:00
Pavel Emelyanov	6ae09cc6bf	api: Register token-metadata API next to token-metadata itsels Right now API registration happens quite late because it waits storage service to register its "function" first. This can be done beforeheand and the t.m. API can be moved to where it should be. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-24 18:51:32 +03:00
Pavel Emelyanov	29738f0cb6	api: Move snitch API registration next to snitch itself Once sharded<snitch> is started, it can register its handlers Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-07-24 18:51:07 +03:00
Aleksandra Martyniuk	8e56913fdf	service: node_ops: keep node ops module in storage service Keep task manager node ops module in storage service. It will be used to create and manage tasks related to topology changes. The module is created and registered in storage service constructor. In storage_service::stop() the module is stopped and so all the remaining tasks would be unregistered immediately after they are finished.	2024-07-23 13:35:01 +02:00
Aleksandra Martyniuk	6029936665	tasks: implement task_manager::virtual_task::impl::get_children Return a vector of task_identity of all children of a virtual task in a cluster.	2024-07-23 13:35:01 +02:00
Lakshmi Narayanan Sreethar	e2142974f8	replica/database: pass abort_source to database constructor This is in preparation for the following patch that adds abort_source variable to the sstables_manager. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2024-07-16 20:36:06 +05:30
Michał Jadwiszczak	85119b90df	service/qos/service_level_controller: maybe start and stop legacy update loop In previous commit, we marked the update loop as legacy. For compatibility reasons, we need to start legacy update loop when the cluster is in recovery mode or it hasn't been upgraded to raft topology. Then, in the update loop we check if all conditions are met and stop the loop. This commit also moves start of update loop later (after topology state is loaded) in main.cc. There is no risk in doing it later.	2024-07-10 10:23:04 +02:00
Michał Jadwiszczak	b0f76db9f2	service/qos/service_level_controller: make update loop legacy Rename method which started update loop to better reflect what it does. Previously the method was named `update_from_distributed_data`, however it doesn't update anything but only start the update loop, which we are making legacy.	2024-07-10 10:23:04 +02:00
Avi Kivity	3fc4e23a36	forward_service: rename to mapreduce_service forward_service is nondescriptive and misnamed, as it does more than forward requests. It's a classic map/reduce algorithm (and in fact one of its parameters is "reducer"), so name it accordingly. The name "forward" leaked into the wire protocol for the messaging service RPC isolation cookie, so it's kept there. It's also maintained in the name of the logger (for "nodetool setlogginglevel") for compatibility with tests. Closes scylladb/scylladb#19444	2024-07-03 19:29:47 +03:00
Avi Kivity	d14eec8160	config: avoid binding an lvalue reference to an rvalue reference config_file::add_deprecated_options() returns an lvalue reference to a parameter which itself is an rvalue reference. In C++20 this is bad practice (but not a bug in this case) as rvalue references are not expected to live past the call. In C++23, it fails to compile. Fix by accepting an lvalue reference for the parameter, and adjust the caller.	2024-06-27 19:36:13 +03:00
Pavel Emelyanov	6c1e5c248f	main,proxy: Drain proxy in its stop_remote Currently proxy initialization is pretty disperse, in particular it's stopped in several steps -- first drain_on_shutdown() then stop_remote(). In between there's nothing that needs proxy in any particular sate, so those two steps can be merged into one. refs: scylladb/scylladb#2737 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#19344	2024-06-27 12:26:51 +02:00
Nadav Har'El	35ace0af5c	Merge 'Move some /storage_proxy API endpoints to config.cc' from Pavel Emelyanov API endpoints that need a particular service to get data from are registered next to this service (#2737). In /storage_proxy function there live some endpoints that work with config, so this PR moves them to the existing config.cc with config-related endpoints. The path these endpoints are registered with remains intact, so some tweak in proxy API registration is also here. Closes scylladb/scylladb#19417 * github.com:scylladb/scylladb: api: Use provided db::config, not the one from ctx api: Move some config endpoints from proxy to config api: Split storage_proxy api registration api: Unset config endpoints	2024-06-25 13:55:58 +03:00
Pavel Emelyanov	755be887a6	api: Remove dedicated failure_detector registration method It's now empty and can be dropped Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-21 19:30:54 +03:00
Pavel Emelyanov	f84694166e	api: (Un)Register gossiper API in correct place Each service's endpoints are to be registered just after the service itself, so should gossiper's Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-21 19:30:53 +03:00
Pavel Emelyanov	19f3a9805a	api: Unset gossiper endpoints on stop Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-21 19:30:53 +03:00
Pavel Emelyanov	473cb62a9a	api: Unset config endpoints The set_server_config() needs the stop-time peer, here it is. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-21 13:28:06 +03:00
Pavel Emelyanov	873d76c02b	api: Remove ctx->load_meter dependency Now the API uses captured reference and the explicit dependency is not needed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:38:28 +03:00
Pavel Emelyanov	724d62aa87	api: Add set/unset methods for load_meter The meter is pretty small sevice and its API is also tiny. Still, it's a standalone top-level service, and its API should come next to it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:35:58 +03:00
Nadav Har'El	4faceeaa33	Merge 'treewide: drop thrift support' from Kefu Chai thrift support was deprecated since ScyllaDB 5.2 > Thrift API - legacy ScyllaDB (and Apache Cassandra) API is > deprecated and will be removed in followup release. Thrift has > been disabled by default. so let's drop it. in this change, * thrift protocol support is dropped * all references to thrift support in document are dropped * the "thrift_version" column in system.local table is preserved for backward compatibility, as we could load from an existing system.local table which still contains this clolumn, so we need to write this column as well. * "/storage_service/rpc_server" is only preserved for backward compatibility with java-based nodetool. Fixes #3811 Fixes #18416 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> - [x] not a fix, no need to backport Closes scylladb/scylladb#18453 * github.com:scylladb/scylladb: config: expand on rpc_keepalive's description api: s/rpc/thrift/ db/system_keyspace: drop thrift_version from system.local table transport: do not return client_type from cql_server::connection::make_client_key() treewide: drop thrift support	2024-06-17 22:36:49 +03:00
Botond Dénes	aa27f8f365	Merge 'Improve handling of outdated --experimental-features' from Pavel Emelyanov Some time ago it turned out that if unrecognized feature name is met in scylla.yaml, the whole experimental features list is ignored, but scylla continues to boot. There's UNUSED feature which is the proper way to deprecate a feature, and this PR improves its handling in several ways. 1. The recently removed "tablets" feature is partially brought back, but marked as UNUSED 2. Any UNUSED features met while parsing are printed into logs 3. The enum_option<> helper is enlightened along the way refs: #18968 Closes scylladb/scylladb#19230 * github.com:scylladb/scylladb: config: Mark tablets feature as unused main: Warn unused features enum_option: Carry optional key on board enum_option: Remove on-board _map member	2024-06-12 17:33:14 +03:00
Botond Dénes	d2a4cd9cae	Merge 'Register API endpoints next to corresponding services' from Pavel Emelyanov The API endpoints are registered for particular services (with rare exceptions), and once the corresponding service is ready, its endpoints section can be registered too. Same but reversed is for shutdown, and it's automatic with deferred actions. refs: #2737 Closes scylladb/scylladb#19208 * github.com:scylladb/scylladb: main: Register task manager API next to task manager itself main: Register messaging API next to messaging service main: Register repair API next to repair service	2024-06-12 17:31:30 +03:00
Pavel Emelyanov	24c818453d	main: Start view builder earlier Commit `47dbf23773` (Rework view services and system-distributed-keyspace dependencies) made streaming and repair services depend on view builder, but missed the fact that the builder itself starts much later. Move view builder earlier, that's safe, no activity is started upon that, real building is kicked much later when invoke_on_all(start) happens. Other than than, start system distributed keyspace earlier, which also looks safe, as it's also started "for real" later, by storage service when it joins the ring. fixes: #19133 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#19250	2024-06-12 16:46:55 +03:00
Pavel Emelyanov	b85a02a3fe	main: Warn unused features When seeing an UNUSED feature -- print it into log. This is where the enum_option::key is in use. The thing is that experimental features map different unused feature names into the single UNUSED feature enum value, so once the feature is parsed its configured name only persists in the option's key member (saved by previous patch). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-11 12:56:51 +03:00
Calle Wilund	51c53d8db6	main/minio_server.py: Respect any preexisting AWS_ACCESS_KEY_ID/AWS_SECRET_ACCESS_KEY vars Fixes scylladb/scylla-pkg#3845 Don't overwrite (or rather change) AWS credentials variables if already set in enclosing environment. Ensures EAR tests for AWS KMS can run properly in CI. v2: * Allow environment variables in reading obj storage config - allows CI to use real credentials in env without risking putting them info less seure files * Don't write credentials info from miniserver into config, instead use said environment vars to propagate creds. v3: * Fix python launch scripts to not clear environment, thus retaining above aws envs. Closes scylladb/scylladb#19086	2024-06-11 06:59:04 +03:00
Pavel Emelyanov	b10ddcfd18	main: Register task manager API next to task manager itself Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-10 12:49:11 +03:00
Pavel Emelyanov	02c36ebd2e	main: Register messaging API next to messaging service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-10 12:49:02 +03:00
Pavel Emelyanov	f7e4724770	main: Register repair API next to repair service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-10 12:48:51 +03:00
Avi Kivity	7b301f0cb9	Merge 'Encapsulate wasm and lua management in lang::manager service' from Pavel Emelyanov After wasm udf appeared, code in main, create_function_statement and schema_tables got some involvements into details of wasm engine management. Also, even prior to this, there was duplication in how function context is created by statement code and schema_tables code. This PR generalizes function context creation and encapsulates the management in sharded<lang::manager> service. Also it removes the wasm::startup_context thing and makes wasm start/stop be "classical" (see #2737) Closes scylladb/scylladb#19166 * github.com:scylladb/scylladb: code: Enlighten wasm headers usage lang: Unfriend wasm context from manager lang, cql3, schema_tables: Don't mess with db::config lang: Don't use db::config to create lua context lang: Don't use db::config to create wasm context lang: Drop manager::precompile() method cql3, schema_tables: Generalize function creation wasm: Replace startup_context with wasm_config lang: Add manager::start() method lang: Move manager to lang namespace lang: Move wasm::manager to its .cc/.hh files	2024-06-09 19:32:26 +03:00
Avi Kivity	b2a500a9a1	Merge 'alternator: keep TTL work in the maintenance scheduling group' from Botond Dénes Alternator has a custom TTL implementation. This is based on a loop, which scans existing rows in the table, then decides whether each row have reached its end-of-life and deletes it if it did. This work is done in the background, and therefore it uses the maintenance (streaming) scheduling group. However, it was observed that part of this work leaks into the statement scheduling group, competing with user workloads, negatively affecting its latencies. This was found to be causes by the reads and writes done on behalf of the alternator TTL, which looses its maintenance scheduling group when these have to go to a remote node. This is because the messaging service was not configured to recognize the streaming scheduling group, when statement verbs like read or writes are invoked. The messaging service currently recognizes two statement "tenants": the user tenant (statement scheduling group) and system (default scheduling group), as we used to have only user-initiated operations and sytsem (internal) ones. With alternator TTL, there is now a need to distinguish between two kinds of system operation: foreground and background ones. The former should use the system tenant while the latter will use the new maintenance tenant (streaming scheduling group). This series adds a streaming tenant to the messaging service configuration and it adds a test which confirms that with this change, alternator TTL is entirely contained in the maintenance scheduling group. Fixes: #18719 - [x] Scans executed on behalf of alternator TTL are running in the statement group, disturbing user-workloads, this PR has to be backported to fix this. Closes scylladb/scylladb#18729 * github.com:scylladb/scylladb: alternator, scheduler: test reproducing RPC scheduling group bug main: add maintenance tenant to messaging_service's scheduling config	2024-06-09 19:20:18 +03:00
Gleb Natapov	34cf5c81f6	group0, topology coordinator: run group0 and the topology coordinator in gossiper scheduling group Currently they both run in streaming group and it may become busy during repair/mv building and affect group0 functionality. Move it to the gossiper group where it should have more time to run. Fixes scylladb/scylladb#18863 Closes scylladb/scylladb#19138	2024-06-07 15:31:44 +02:00

1 2 3 4 5 ...

1321 Commits