scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 11:30:36 +00:00

Author	SHA1	Message	Date
Andrzej Jackowski	452313f5a5	test: add test to verify use of `sl:driver` `sl:driver` is expected to be used for new and control connections, but other connections that run user load should not use it after the user is authenticated. Refs: scylladb/scylladb#24411	2025-09-18 09:29:37 +02:00
Andrzej Jackowski	c02535635e	transport: use `sl:driver` to handle driver's control connections Before `sl:driver` was introduced, service levels were assigned as follows: 1. New connections were processed in `main`. 2. After user authentication was completed, the connection's SL was changed to the user's SL (or `sl:default` if the user had no SL). This commit introduces `service_level_state` to `client_state` and implements the following logic in `transport/server`: 1. If `sl:driver` is not present in the system (for example, it was removed), service levels behave as described above. 2. If `sl:driver` is present, the flow is: I. New connections use `sl:driver`. II. After user authentication is completed, the connection's SL is changed to the user's SL (or `sl:default`). III. If a REGISTER (to events) request is handled, the client is processing the control connection. We mark the client_state to permanently use `sl:driver`. The aforementioned state `2.III` is represented by `_control_connection` flag in `client_state`. Fixes: scylladb/scylladb#24411	2025-09-18 09:29:37 +02:00
Andrzej Jackowski	49aa7613ae	transport: whitespace only change in update_scheduling_group The indentation is changed because it will be required in the next commit of this patch series.	2025-09-18 09:29:37 +02:00
Andrzej Jackowski	43472e8633	transport: call update_scheduling_group for non-auth connections Before this change, unauthenticated connections stayed in the `main` scheduling group. This is not ideal: `sl:default` should be used instead, for consistency with the scenario where a user is authenticated but has no service level assigned. This commit adds a call to `update_scheduling_group` at the end of connection creation for an unauthenticated user, to make sure the service level is switched to `sl:default`. Fixes: scylladb/scylladb#26040	2025-09-18 09:29:37 +02:00
Andrzej Jackowski	1ad483749a	generic_server: transport: start using `sl:driver` for new connections Before this change, new connections were handled in a default scheduling group (`main`), because before the user is authenticated we do not know which service level should be used. With the new `sl:driver` service level, creation of new connections can be moved to `sl:driver`. We switch the service level as early as possible, in `do_accepts`. There is a possibility, that `sl:driver` will not exist yet, for instance, in specific upgrade cases, or if it was removed. Therefore, we also switch to `sl:driver` after a connection is accepted. Refs: scylladb/scylladb#24411	2025-09-18 09:29:29 +02:00
Andrzej Jackowski	e1b4a338ba	test: add test_desc_* for driver service level The driver service level is a special service level that is created automatically by the system. Therefore, it requires special handling in DESC SCHEMA WITH INTERNALS, and these tests verify that special behavior. Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	43a0eb7b0b	test: service_levels: add tests for sl:driver creation and removal Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	4af270a271	test: add reload_raft_topology_state() to ScyllaRESTAPIClient To encapsulate `/storage_service/raft_topology/reload` API call	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	6f678a2d1f	service_level_controller: automatically create `sl:driver` This commit: - Increases the number of allowed scheduling groups to allow the creation of `sl:driver`. - Adds the `DRIVER_SERVICE_LEVEL` feature, which prevents creating `sl:driver` until all nodes have increased the number of scheduling groups. - Starts using `get_create_driver_service_level_mutations` to unconditionally create `sl:driver` on `raft_initialize_discovery_leader`. The purpose of this code path is ensuring existence of `sl:driver` in new system and tests. - Starts using `migrate_to_driver_service_level` to create `sl:driver` if it is not already present. The creation of `sl:driver` is managed by `topology_coordinator`, similar to other system keyspace updates, such as the `view_builder` migration. The purpose of this code path is handling upgrades. - Modifies related tests to pass after `sl:driver` is added. Later in this patch series, `sl:driver` will be used by `transport/server` to handle selected traffic, such as the driver's schema and topology fetches. Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	6a911bff3f	service_level_controller: methods to create driver service level This commit implements `get_create_driver_service_level_mutations` and `migrate_to_driver_service_level` in service_level_controller. Both methods create `sl:driver` with shares=200 and store this fact in `system.scylla_local`. Both methods will be used later in this patch series for automatic creation of sl:driver. Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	5cb4577800	service_level_controller: handle special sl:driver in DESC output Later in this patch series, `sl:driver` will be added as a special service level created automatically by the system. It needs special handling in `DESC SCHEMA ...` to ensure that during backup restore: 1. CREATE SERVICE LEVEL does not fail if `sl:driver` already exists 2. If `sl:driver` exists, its configuration is fully restored (emit ALTER SERVICE LEVEL). 3. If `sl:driver` was removed, the information is retained (emit DROP SERVICE LEVEL instead of CREATE/ALTER). Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	09c8f67e69	topology_coordinator: add service_level_controller reference This adds a reference to sl_controller so that, later in this patch series, topology_coordinator can manage creating `sl:driver` once group0 is fully operational. Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	dd9b4c64d2	system_keyspace: add service_level_driver_created This commit extends the system.scylla_local table with an additional key/value pair that can be used later in this patch series to record that `sl:driver` was already created. The purpose of storing this information is to ensure that `sl:driver` is not recreated after being intentionally removed. A new mutation is included in `register_raft_pull_snapshot` to keep `service_level_driver_created` in the state machine snapshot, which is required for proper propagation of the value when a new node is added to the cluster. Refs: scylladb/scylladb#24411	2025-09-18 09:28:32 +02:00
Andrzej Jackowski	d30590c1d0	test: add MAX_USER_SERVICE_LEVELS Previously, tests used the hardcoded value 7 for the maximum number of user service levels. This commit introduces a named variable that can be shared across tests to avoid cases where this magic number goes out of sync.	2025-09-18 09:28:32 +02:00
Ernest Zaslavsky	c9c245c756	rest_client: set `version` on http::request to avoid invalid state Upcoming changes in Seastar cause `rest::simple_send` to move the `http::request` into `seastar::http::experimental::client::make_request` when called multiple times. This leaves the original request in an invalid state. Specifically, the `_version` field becomes empty, causing request validation to fail. This patch ensures `version` is explicitly set to prevent such failures. Fixes: https://github.com/scylladb/scylladb/issues/26018 Closes scylladb/scylladb#26066	2025-09-18 07:36:25 +03:00
Wojciech Mitros	f17beba834	load_balancer: include dead nodes when calculating rack load Load balancer aims to preserve a balance in rack loads when generating tablet migrations. However, this balance might get broken when dead nodes are present. Currently, these nodes aren't included in rack load calculations, even if they own tablet replicas. As a result, load balancer treats racks with dead nodes as racks with a lower load, so it generates migrations to these racks. This is incorrect, because a dead node might come back alive, which would result in having multiple tablet replicas on the same rack. It's also inefficient even if we know that the node won't come back - when it's being replaced or removed. In that case we know we are going to rebuild the lost tablet replicas so migrating tablets to this rack just doubles the work. Allowing such migrations to happen would also require adjustments in the materialized view pairing code because we'd temporarily allow having multiple tablet replicas on the same rack. So in this patch we include dead nodes when calculating rack loads in the load balancer. The dead nodes still aren't treated as potential migration sources or destinations. We also add a test which verifies that no migrations are performed by doing a node replace with a mv workload in parallel. Before the patch, we'd get pairing errors and after the patch, no pairing errors are detected. Fixes https://github.com/scylladb/scylladb/issues/24485 Closes scylladb/scylladb#26028	2025-09-17 20:49:18 +02:00
Avi Kivity	3acfc577d8	Merge 'tools/scylla-sstable: extract json mutation stream parser into own hh,cc' from Botond Dénes tools/scylla-sstable.cc has 3.5k SLOC, out of which this class alone is 1K. Extract into own hh and cc. Since this class was already using pimpl, the header remains nice and small. Code cleanup, no backport needed. Closes scylladb/scylladb#26064 * github.com:scylladb/scylladb: tools: extract json_mtuation_stream_parser to its own hh,cc files tools/scylla-sstable: fix indentation tools/scylla-sstable: prepare for extracting json_mutation_stream_parser	2025-09-17 18:30:30 +03:00
Ernest Zaslavsky	54aa552af7	treewide: Move type related files to a `type` directory As requested in #22110, moved the files and fixed other includes and build system. Moved files: - duration.hh - duration.cc - concrete_types.hh Fixes: #22110 This is a cleanup, no need to backport Closes scylladb/scylladb#25088	2025-09-17 17:32:19 +03:00
Ernest Zaslavsky	a1f18a8883	treewide: Move schema related files to a `schema` directory As requested in #22111, moved the files and fixed other includes and build system. Moved files: - frozen_schema.hh - frozen_schema.cc - schema_mutations.hh - schema_mutations.cc - column_computation.hh Fixes: #22111 Closes scylladb/scylladb#25089	2025-09-17 17:31:05 +03:00
Botond Dénes	bde7d8ddbd	Merge 'service: pass current session_id to repair rpc' from Aleksandra Martyniuk Currently, in repair_tablet we retrieve session_id from tablet map (and throw if it isn't specified). In case of topology coordinator failover, we may end up in a situation where a node runs outdated repair, treating session of a different operation as the repair's session: - topology coordinator starts repair transition (A); - topology coordinator sends tablet repair rpc to node1; - topology coordinator is separated from the cluster; - new topology coordinator is elected; - new topology coordinator sees waiting repair request (A_2) and executes it; - new repair of the same tablet is requested (B); - new topology coordinator starts repair transition (B); - new topology coordinator sends tablet repair rpc to node2; - node2 starts repair (B) as repair master; - node1 starts repair (A), checks the current session (B), proceeds with repair (B) as repair master. Send current session_id in repair_tablet rpc. If this session_id and session id got from tablet map don't match, an exception is thrown. Fixes: https://github.com/scylladb/scylladb/issues/23318. No backport; changes in rpc signatures Closes scylladb/scylladb#25369 * github.com:scylladb/scylladb: test: check that repair with outdated session_id fails service: pass current session_id to repair rpc	2025-09-17 17:28:35 +03:00
Botond Dénes	cc5153ef8c	Merge 'db: cache: consider preempting after each partition' from Aleksandra Martyniuk Currently, during cache invalidation we check if we need to preempt only after the partition gets invalidated. This may lead to stalls if we have a chain of filtered out partitions. Check for preemption even if the partition does not get invalidated. Refs: https://github.com/scylladb/scylladb/issues/9136. Optimization; no backport Closes scylladb/scylladb#26053 * github.com:scylladb/scylladb: db: fix indentation db: cache: consider preempting after each partition	2025-09-17 17:26:29 +03:00
Botond Dénes	a8d22a66fa	Merge 'Improve Encryption at Rest documentation' from Nikos Dragazis This PR introduces a major rewrite of the EaR document. The initial motivation for this PR was to fully cover all our supported key providers with working examples, and to add instructions for key rotation. However, many other improvements were made along the way. Main changes in this PR: * Add a high-level description for every key provider. Mention limitations. * Better organize existing provider-specific instructions by placing them into clearly separated, tabbed sections. * Add instructions for the replicated key provider. Mention explicitly that it cannot be used as default option for user or system encryption, and that it does not support key rotation. * Add more examples for KMS and GCP to cover all credential types. * Document missing configuration options. * Add a new section for key rotation. Notes: * Some of the patches in this series have been cherry-picked from Laszlo's wip branch. * This PR is expected to conflict with the Azure Key Vault PR, which should be merged first. (https://github.com/scylladb/scylladb/pull/23920/) * Support for KMIP system keys in the Replicated Key Provider is currently broken. (https://github.com/scylladb/scylladb/issues/24443) Fixes scylladb/scylla-enterprise#3535. Refs scylladb/scylla-enterprise#3183. Only doc changes. No backport is needed. 
Closes scylladb/scylladb#24558 * github.com:scylladb/scylladb: encryption-at-rest.rst: add "Rotate Encryption Keys" section encryption-at-rest.rst: rewrite "Encrypt System Resources" section encryption-at-rest.rst: rewrite "Update Encryption Properties of Existing Tables" section encryption-at-rest.rst: rewrite "Encrypt a Single Table" section encryption-at-rest.rst: rewrite "Encrypt Tables" section encryption-at-rest.rst: update "Set the Azure Host" section encryption-at-rest.rst: update "Set the GCP Host" section encryption-at-rest.rst: update "Set the KMS Host" section encryption-at-rest.rst: update "Set the KMIP Host" section encryption-at-rest.rst: rewrite "Create Encryption Keys" section encryption-at-rest.rst: rewrite "Key Providers" section encryption-at-rest.rst: hoist and update "Cipher Algorithm Descriptors" encryption-at-rest.rst: rewrite/replace section "Encryption Key Types" encryption-at-rest.rst: About: describe high-level operation more precisely encryption-at-rest.rst: improve wording / formatting in About intro encryption-at-rest.rst: users (plural) typo fix encryption-at-rest.rst: rewrap encryption-at-rest.rst: strip trailing whitespace	2025-09-17 17:25:25 +03:00
Nadav Har'El	d63fdd1e8b	test/cqlpy: fix run-cassandra to run with Java 21 The script test/cqlpy/run-cassandra aims to make it easy to run any version of Cassandra using whatever version of Java the user has installed. Sadly, the fact that Java keeps changing and the Cassandra developers are very slow to adapt to new Javas makes doing this non-trivial. This patch makes it possible for run-cassandra to run Cassandra 5 on the Java 21 that is now the default on Fedora 42. Fedora 42 no longer carries antique versions of Java (like Java 8 or 11), not even as an optional package. Sadly, even with this patch it is not possible to run older versions of Cassandra (4 and 3) with Java 21, because the new Java is missing features such as Netty that the older Cassandra versions require. But at least it restores the ability to run our cqlpy tests against Cassandra 5. Also, this patch adds to test/cqlpy/README.md simple instructions on how to install Java 11 (in addition to the system's default Java 21) on Fedora 42. Doing this is very easy and highly recommended because it restores the ability to run Cassandra 3 and 4, not just Cassandra 5. Fixes #25822. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#25825	2025-09-17 17:24:47 +03:00
Botond Dénes	85f6eeda30	Merge 'compaction/scrub: register sstables for compaction before validation' from Lakshmi Narayanan Sreethar compaction/scrub: register sstables for compaction before validation When `scrub --validate` runs, it collects all candidate sstables at the start and validates them one by one in separate compaction tasks. However, scrub in validate mode does not register these sstables for compaction, which allows regular compaction to pick them up and potentially compact them away before validation begins. This leads to scrub failures because the sstables can no longer be found. This patch fixes the issue by first disabling compaction, collecting the sstables, and then registering them for compaction before starting validation. This ensures that the enqueued sstables remain available for the entire duration of the scrub validation task. Fixes #23363 This reported scrub failure occurs on all versions that have the checksum/digest validation feature for uncompressed sstables. So, backport it to older versions. Closes scylladb/scylladb#26034 * github.com:scylladb/scylladb: compaction/scrub: register sstables for compaction before validation compaction/scrub: handle exceptions when moving invalid sstables to quarantine	2025-09-17 17:22:00 +03:00
Piotr Smaron	bdb90ee15c	set ssl_* columns in system.clients Depends on https://github.com/scylladb/seastar/pull/2651 Missing columns have been present since probably forever - they were added to the schema but never assigned any value: ``` cqlsh> select * from system.clients; ------------------+------------------------ ... ssl_cipher_suite \| null ssl_enabled \| null ssl_protocol \| null ... ``` This patch sets values of these columns: - with a TLS connection, the 3 TLS-related fields are filled in, - without TLS, `ssl_enabled` is set to `false` and other columns are `null`, - if there's an error while inspecting TLS values, the connection is dropped. We want to save the TLS info of a connection just after accepting it, but without waiting for a TLS handshake to complete, so once the connection is accepted, we're inspecting it in the background for the server to be able to accept next connections immediately. Later, when we construct system.clients virtual table, the previously saved data can be instantaneously assigned to client_data, which is a struct representing a row in system.clients table. This way we don't slow down constructing this table by more than necessary, which is relevant for cases with plenty of connections. Fixes: #9216 Closes scylladb/scylladb#22961	2025-09-17 16:29:55 +03:00
Nadav Har'El	3c0032deb4	alternator: fix bug in combination of AttributeUpdates + ReturnValues In test/alternator/test_returnvalues.py we had tests for the ReturnValues feature on UpdateItem requests - but we only tested UpdateItem requests with the "modern" UpdateExpression, and forgot to test the combination of ReturnValues with the old AttributeUpdates API. It turns out this combination is buggy: when both ReturnValues=ALL_OLD and AttributeUpdates need the previous value of the item, we may wrongly std::move() the value out, and the operation will fail with a strange error: An error occurred (ValidationException) when calling the UpdateItem operation: JSON assert failed on condition 'IsObject()' The fix in this patch is trivial - just move the std::move() to the correct place, after both UpdateExpression and AttributeUpdates handling is done. This patch also includes a reproducing test, which fails before this patch and passes with it - and of course passes on DynamoDB. This test reproduces two cases where the bug happened, as well as one case where it didn't (to make sure we don't regress in what already worked). Fixes #25894 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#25900	2025-09-17 16:04:01 +03:00
Piotr Dulikowski	6a90a1fd29	Merge 'db/view/view_building_worker: split batch's data preparation and execution' from Michał Jadwiszczak The view building batch lives on shard0 but it might be doing work on shard which owns the tablet replica. Until now the batch data was accessed from multiple shards (shard0 and where the batch was executed). This patch fixes this by splitting tasks execution into: - preparation which is always happening on shard0 - actual execution of the tasks on relevant shard, but all necessary data is copied to the shard and batch object isn't accessed. Fixes https://github.com/scylladb/scylladb/issues/25804 View building coordinator hasn't been released yet, so no backport needed. Closes scylladb/scylladb#26058 * github.com:scylladb/scylladb: db/view/view_building_worker: move try-catch outside `invoke_on()` db/view/view_building_worker: split batch's data preparation and execution	2025-09-17 14:17:25 +02:00
Botond Dénes	30a3f61fa0	Merge 'compaction: handle exception in expected_total_workload' from Aleksandra Martyniuk expected_total_workload methods of scrub compaction tasks create a vector of table_info based on table names. If any table was already dropped, then the exception is thrown. It leaves table_info in corrupted state and node crashes with `free(): invalid size`. Return std::nullopt if an exception was thrown to indicate that total workload cannot be found. Fixes: #25941. No release branches affected Closes scylladb/scylladb#25944 * github.com:scylladb/scylladb: tasks: get progress of failed task based on children compaction: handle exception in expected_total_workload	2025-09-17 15:10:19 +03:00
Nadav Har'El	e322902506	Merge 'index, metrics: add per-index metrics' from Michał Hudobski This patch adds the possibility to track metrics per secondary index. Currently, only a histogram of query latencies is tracked, but more metrics can be added in the future. To add a new metric, it needs to be added to the index_metrics struct in index/secondary_index_manager.hh and then initialized in index/secondary_index_manager.cc in the constructor of the index_metrics struct. The metrics are created when the index is created and removed when the index is dropped. First lines of the new metric: \# HELP scylla_index_query_latencies Index query latencies \# TYPE scylla_index_query_latencies histogram scylla_index_query_latencies_sum{idx="test_i_idx",ks="test"} 640 scylla_index_query_latencies_count{idx="test_i_idx",ks="test"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="640.000000"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="768.000000"} 1 Fixes: https://github.com/scylladb/scylladb/issues/25970 Closes scylladb/scylladb#25995 * github.com:scylladb/scylladb: test: verify that the index metric is added index, metrics: add per-index metrics	2025-09-17 14:54:12 +03:00
Szymon Malewski	776f90e2f8	alternator/expressions.g: Fix antlr3 missing token leak This patch overrides the antlr3 function that allocates the missing tokens that would eventually leak. The override stores these tokens in a vector, ensuring memory is freed whenever the parser is destroyed. The solution is copied from the CQL implementation. A unit test reproducing the issue is added: when run in debug mode, the test itself passes, but ASAN reports the leak when the test file exits. Fixes #25878 Closes scylladb/scylladb#25930	2025-09-17 13:05:24 +03:00
Botond Dénes	2fa0f82910	tools: extract json_mtuation_stream_parser to its own hh,cc files tools/scylla-sstable.cc has 3.5k SLOC, out of which this class alone is 1K. Extract into own hh and cc, just a copy-paste after the preparation commit.	2025-09-17 12:18:07 +03:00
Botond Dénes	ffe8918522	tools/scylla-sstable: fix indentation Left broken by previous patch.	2025-09-17 12:16:22 +03:00
Botond Dénes	8c36a983cc	tools/scylla-sstable: prepare for extracting json_mutation_stream_parser Make methods out-of-line, so class declaration stands on its own, without definition of impl. Move auxiliary structures, used only by impl, out of the class scope. Move parser to tools namespace, and auxiliaries to anonymous namespace within the tools one. Pass down logger ref to parser impl and below, to prepare for sst_log not being available in scope. Add comment to parser class explaining what it does.	2025-09-17 12:16:21 +03:00
Benny Halevy	3a6208b319	utils: stall_free: clear_gently: release wrapped objects As discussed in https://github.com/scylladb/scylladb/pull/24606#discussion_r2281870939 clear_gently of shared pointers should release the wrapped object reference and when the object's use_count reaches 1, the object itself would be cleared_gently, before it's destroyed. This behavior is similar to the way we clear gently containers like arrays or vectors, and so it is extended in this patch to smart pointers like unique_ptr and foreign_ptr. The unit tests are adjusted respectively to expect the smart pointers to be reset after clear_gently, plus the use of `reset()` for `foreign_ptr<shared_ptr<>>` was replaced by `clear_gently().get()` which now ensures the reference to a shared object is released, and awaited for, if it happens on a foreign owner shard, unlike reset of a foreign_ptr that kicks off destroy of that shared object in the background on the owner shard - causing flakiness. Fixes #25723 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#25759	2025-09-17 11:44:26 +03:00
Patryk Jędrzejczak	454eb08cb4	Merge 'group0: remove obsolete "stop_before_becoming_raft_voter" error injection' from Emil Maskovsky The Raft topology workflow was changed by the limited voters feature: nodes no longer request votership themselves. As a result, the "stop_before_becoming_raft_voter" error injection is now obsolete and has been removed. Fixes: scylladb/scylladb#23418 No backport: This re-enables a test, only needed for master. Closes scylladb/scylladb#26042 * https://github.com/scylladb/scylladb: group0: remove obsolete "stop_before_becoming_raft_voter" error injection test/random_failures: preserve test repeatability when removing error injections	2025-09-17 10:38:32 +02:00
Michał Jadwiszczak	d98237b33c	db/view/view_building_worker: move try-catch outside `invoke_on()` It's just a stylistic change; to me, doing `invoke_on()` in a try-catch block looks better than the other way around.	2025-09-16 23:15:44 +02:00
Michał Jadwiszczak	9458ceff8f	db/view/view_building_worker: split batch's data preparation and execution The view building batch lives on shard0 but it might be doing work on shard which owns the tablet replica. Until now the batch data was accessed from multiple shards (shard0 and where the batch was executed). This patch fixes this by splitting tasks execution into: - preparation which is always happening on shard0 - actual execution of the tasks on relevant shard, but all necessary data is copied to the shard and batch object isn't accessed. Fixes scylladb/scylladb#25804	2025-09-16 23:13:36 +02:00
Patryk Jędrzejczak	368d70ee15	Merge 'LWT: implement fencing' from Petr Gusev This PR consists of three parts: * Small refactoring of the fencing APIs in storage_proxy (renames + comments + some functions were extracted) * Implement the fencing for LWT verbs itself. This includes checking the fencing token before and after local replica data accesses. * Two new `test.py` tests in `test_fencing.py`, which check the fencing in some real-world scenarios. Backport: no need -- fencing for LWT requests is needed primarily for LWT over tablets, which is not released yet. Fixes scylladb/scylladb#22332 Closes scylladb/scylladb#25550 * https://github.com/scylladb/scylladb: test_tablets_lwt: eliminate redundant disable_tablet_balancing test_fencing: add test_lwt_fencing_upgrade pylib: extract upgrade helpers from test_sstable_compression_dictionaries_upgrade.py test_fencing: add test_fenced_out_on_tablet_migration_while_handling_paxos_verb test_fencing: test_fence_lwt_during_bootstap pylib/rest_client.py: encode injection name storage_proxy_stats: add fenced_out_requests metric storage_proxy: add fencing to Paxos verbs storage_proxy::apply_fence: add overload that throws on failure storage_proxy: extract apply_fence_result sp::apply_fence: rename to apply_fence_on_ready sp::apply_fence: rename to check_fence sp::apply_fence: make non-generic	2025-09-16 23:40:48 +03:00
Ernest Zaslavsky	d624413ddd	treewide: Move query related files to a new `query` directory As requested in #22120, moved the files and fixed other includes and build system. Moved files: - query.cc - query-request.hh - query-result.hh - query-result-reader.hh - query-result-set.cc - query-result-set.hh - query-result-writer.hh - query_id.hh - query_result_merger.hh Fixes: #22120 This is a cleanup, no need to backport Closes scylladb/scylladb#25105	2025-09-16 23:40:47 +03:00
Pavel Emelyanov	6fb66b796a	s3: Add metrics to show S3 prefetch bytes The chunked download source sends large GET requests and then consumes data as it arrives. Sometimes it can stop reading from socket early and drop the in-flight data. The existing read-bytes metrics show only the number of consumed bytes, but we also want to know the number of requested bytes. Refs #25770 (accounting of read-bytes) Fixes #25876 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#25877	2025-09-16 23:40:47 +03:00
Sergey Zolotukhin	2640b288c2	raft: disable caching for raft log. This change disables caching for raft log table due to the following reasons: * Immediate reason is a deficiency in handling emerging range tombstones in the cache, which causes stalls. * Long-term reason is that sequential reads from the raft log do not benefit from the cache, making it better to bypass it to free up space and avoid stalls. Fixes scylladb/scylladb#26027 Closes scylladb/scylladb#26031	2025-09-16 23:40:47 +03:00
Pavel Emelyanov	d69a51f42a	compaction: Use function when filtering compaction tasks for stopping The compaction_manager::stop_compaction() method internally walks the list of tasks and compares each task's compacting_table (which is compaction group view pointer) with the given one. In case this stop_compaction() method is called via API for a specific table, the method walks the list of tasks for every compaction group from the table, thus resulting in nr_groups * nr_tasks complexity. Not terrible, but not nice either. The proposal is to pass filtering function into the inner do_stop_ongoing_compactions() method. Some users will pass a simple "return true" lambda, but those that need to stop compactions for a specific table (e.g. -- the API handler) will effectively walk the list of tasks once comparing the given compaction group's schema with the target table one (spoiler: eventually this place will also be simplified not to mess with replica::table at all). One ugliness with the change is the way "scope" for logging message is collected. If all tasks belong to the same table, then "for table ..." is printed in logs. With the change the scope is no longer known instantly and is evaluated dynamically while walking the list of tasks. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#25846	2025-09-16 23:40:47 +03:00
Michał Chojnowski	68e6141211	scylla-gdb: add `scylla prepared-statements` Add a helper which prints all prepared statements currently present in the query processor. Example output: ``` (gdb) scylla prepared-statements (cql3::cql_statement*)(0x600003d71050): SELECT * FROM ks.ks WHERE pk = ? (cql3::cql_statement*)(0x600003972b50): SELECT pk FROM ks.ks WHERE pk = ? ``` Closes scylladb/scylladb#26007	2025-09-16 23:40:47 +03:00
Botond Dénes	0cf6a648bb	Merge 'Default create keyspace syntax' from Dario Mirovic Allow for the following CQL syntax: ``` CREATE KEYSPACE [IF NOT EXISTS] <name>; ``` for example: ``` CREATE KEYSPACE test_keyspace; ``` With this syntax all the keyspace's parameters would be defaulted to: replication strategy = `NetworkTopologyStrategy`, replication factor = number of racks , but excluding racks that only have arbiter nodes storage options, durable writes = defaults we normally would use, tablets enabled if they are enabled in the db configuration, e.g. scylla.yaml or db/config.cc by default. Options besides `replication` already have defaults. `replication` had to be specified, but it could be an empty set, where defaults for sub-options (replication strategy and replication factor) would be used - `replication = {}`. Now there is no need for specifying an empty set - omitting `replication = {}` has the same effect as `replication = {}`. Since all the options now have defaults, `WITH` is optional for `CREATE KEYSPACE` statement. Fixes #25145 This is an improvement, no backport needed. Closes scylladb/scylladb#25872 * github.com:scylladb/scylladb: docs: cql: default create keyspace syntax test: cqlpy: add test for create keyspace with no options specified cql: default `CREATE KEYSPACE` syntax	2025-09-16 23:40:47 +03:00
Emil Maskovsky	943af1ef1c	topology_coordinator: consistently rethrow `raft::request_aborted` for direct/global commands Ensure all direct and global topology commands rethrow the `raft::request_aborted` exception when aborted, typically due to leadership changes. This makes abortion explicit to callers, enabling proper handling such as retries or workflow termination. This change completes the work started in PR scylladb/scylladb#23962, covering all remaining cases where the exception was not rethrown. Fixes: scylladb/scylladb#23589 No backport: No related issues observed in previous versions; backport not required. Closes scylladb/scylladb#26021	2025-09-16 23:40:47 +03:00
Emil Maskovsky	87bd328873	group0: remove obsolete "stop_before_becoming_raft_voter" error injection The Raft topology workflow was changed by the limited voters feature: nodes no longer request votership themselves. As a result, the "stop_before_becoming_raft_voter" error injection is now obsolete and has been removed. Fixes: scylladb/scylladb#23418	2025-09-16 18:24:27 +02:00
Emil Maskovsky	0453052d66	test/random_failures: preserve test repeatability when removing error injections The order of entries in the ERROR_INJECTIONS list determines test repeatability for a given random seed. To allow removing error injections without affecting the order of the remaining ones, removed injections are now renamed with a "REMOVED_" prefix instead of being deleted. This ensures they are ignored by the tests, while the sequence of active injections—and thus test reproducibility—remains unchanged.	2025-09-16 18:22:45 +02:00
Michał Hudobski	3364cc96f5	test: verify that the index metric is added This commit adds a test that performs a sanity check that the implemented metric is actually being added to Scylla's metrics and has the correct value.	2025-09-16 18:10:01 +02:00
Aleksandra Martyniuk	3324f08e9c	tasks: get progress of failed task based on children Currently, for failed tasks task_manager::task::impl::get_progress attempts to find expected_total_workload. However, if the task finished a long time ago, the state might have totally changed, e.g. some tables might have been dropped or have changed their sizes. Due to that, the result of expected_total_workload might be irrelevant. Count the progress of a finished task based on children only, regardless of whether the task has succeeded or failed.	2025-09-16 17:15:01 +02:00
Aleksandra Martyniuk	17e9ec11d7	db: fix indentation	2025-09-16 14:49:54 +02:00
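
The scheduling-group selection described in c02535635e, 43472e8633 and 1ad483749a can be modeled as a small decision function. This is a minimal Python sketch, not the C++ code in `transport/server`: `ClientState`, its fields, `pick_scheduling_group` and the `sl:oltp` service level are hypothetical names, and it assumes that a connection with no authentication step is treated like an authenticated user with no service level assigned.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ClientState:
    """Hypothetical per-connection state; not the real client_state class."""
    authenticated: bool = False       # authentication finished (or not required)
    user_sl: Optional[str] = None     # service level attached to the user, if any
    control_connection: bool = False  # set once a REGISTER request was handled


def pick_scheduling_group(state: ClientState, driver_sl_exists: bool) -> str:
    # Control connections stay on sl:driver permanently, if it exists.
    if driver_sl_exists and state.control_connection:
        return "sl:driver"
    # After authentication: the user's SL, or sl:default when none is assigned.
    if state.authenticated:
        return state.user_sl or "sl:default"
    # New connection: sl:driver when present, otherwise the old `main` group.
    return "sl:driver" if driver_sl_exists else "main"


conn = ClientState()
print(pick_scheduling_group(conn, driver_sl_exists=True))   # sl:driver
conn.authenticated, conn.user_sl = True, "sl:oltp"
print(pick_scheduling_group(conn, driver_sl_exists=True))   # sl:oltp
conn.control_connection = True
print(pick_scheduling_group(conn, driver_sl_exists=True))   # sl:driver
```

Checking the control-connection flag first mirrors the "permanently use `sl:driver`" wording: once set, later authentication state no longer changes the result. When `sl:driver` is absent, the function falls back to the pre-existing behavior.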

49469 Commits