scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 22:43:15 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	39a85df231	Merge '[Backport 2025.2] test: test_mv_backlog: fix to consider internal writes' from Scylladb[bot] The PR fixes a test flakiness issue in test_mv_backlog related to reading metrics. The first commit fixes a more general issue in the ScyllaMetrics helper class where it doesn't return the value of all matching lines when a specific shard is requested, but it breaks after the first match. The second commit fixes a test issue where it expects exactly one write to be throttled, not taking into account other internal writes that may be executed during this time. Fixes https://github.com/scylladb/scylladb/issues/23139 backport to improve CI stability - test only change - (cherry picked from commit `5c28cffdb4`) - (cherry picked from commit `276a09ac6e`) Parent PR: #25279 Closes scylladb/scylladb#25474 * github.com:scylladb/scylladb: test: test_mv_backlog: fix to consider internal writes test/pylib/rest_client: fix ScyllaMetrics filtering	2025-08-19 17:10:39 +03:00
Dawid Mędrek	0f4965b8ae	db/commitlog: Extend error messages for corrupted data We're providing additional information in error messages when throwing an exception related to data corruption: when a segment is truncated and when it's content is invalid. That might prove helpful when debugging. Closes scylladb/scylladb#25190 (cherry picked from commit `408b45fa7e`) Closes scylladb/scylladb#25460	2025-08-19 17:10:17 +03:00
Ferenc Szili	a0a346496e	test: remove test_tombstone_gc_disabled_on_pending_replica The test test_tombstone_gc_disabled_on_pending_replica was added when we fixed (#20788) the potential problem with data resurrection during file based streaming. The issue was occurring only in Enterprise, but we added the fix in OSS to limit code divergence. This test was added together with the fix in OSS with the idea to guard this change in OSS. The real reproducer and test for this fix was added later, after the fix was ported into Enterprise. It is in: test/cluster/test_resurrection.py Since Enterprise has been merged into OSS, there is no more need to keep the test test_tombstone_gc_disabled_on_pending_replica. Also, it is flaky with very low probability of failure, making it difficult to investigate the cause of failure. Fixes: #22182 Refs: scylladb/scylladb#25448 Closes scylladb/scylladb#25134 (cherry picked from commit `7ce96345bf`) Closes scylladb/scylladb#25572	2025-08-19 16:02:42 +03:00
Patryk Jędrzejczak	3113968380	test: test_maintenance_socket: use cluster_con for driver sessions The test creates all driver sessions by itself. As a consequence, all sessions use the default request timeout of 10s. This can be too low for the debug mode, as observed in scylladb/scylla-enterprise#5601. In this commit, we change the test to use `cluster_con`, so that the sessions have the request timeout set to 200s from now on. Fixes scylladb/scylla-enterprise#5601 This commit changes only the test and is a CI stability improvement, so it should be backported all the way to 2024.2. 2024.1 doesn't have this test. Closes scylladb/scylladb#25510 (cherry picked from commit `03cc34e3a0`) Closes scylladb/scylladb#25546	2025-08-18 16:43:31 +02:00
Abhinav Jha	a0161ef67a	raft: replication test: change rpc_propose_conf_change test to SEASTAR_THREAD_TEST_CASE RAFT_TEST_CASE macro creates 2 test cases, one with random 20% packet loss named name_drops. The framework makes hard coded assumptions about leader which doesn't hold well in case of packet losses. This short term fix disables the packet drop variant of the specified test. It should be safe to re-enable it once the whole framework is re-worked to remove these hard coded assumptions. This PR fixes a bug. Hence we need to backport it. Fixes: scylladb/scylladb#23816 Closes scylladb/scylladb#25489 (cherry picked from commit `a0ee5e4b85`) Closes scylladb/scylladb#25527	2025-08-15 13:27:52 +03:00
Wojciech Przytuła	a95ad052df	Fix link to ScyllaDB manual The link would point to outdated OS docs. I fixed it to point to up-to-date Enterprise docs. Closes scylladb/scylladb#25328 (cherry picked from commit `7600ccfb20`) Closes scylladb/scylladb#25484	2025-08-15 13:27:08 +03:00
Jenkins Promoter	5b5fbff2e6	Update pgo profiles - aarch64	2025-08-15 04:40:12 +03:00
Jenkins Promoter	981eeb275d	Update pgo profiles - x86_64	2025-08-15 04:06:14 +03:00
Anna Stuchlik	1abb9106cf	doc: add the patch release upgrade procedure for version 2025.2 Fixes https://github.com/scylladb/scylladb/issues/25322 Closes scylladb/scylladb#25343	2025-08-14 10:43:16 +02:00
Michael Litvak	a114e95798	test: test_mv_backlog: fix to consider internal writes The test executes a single write, fetching metrics before and after the write, and expects the total throttled writes count to be increased exactly by one. However, other internal writes (compaction for example) may be executed during this time and be throttled, causing the metrics to be increased by more than expected. To address this, we filter the metrics by the scheduling group label of the user write, to filter out the compaction writes that run in the compaction scheduling group. Fixes scylladb/scylladb#23139 (cherry picked from commit `276a09ac6e`)	2025-08-12 20:53:16 +03:00
Michael Litvak	5161342910	test/pylib/rest_client: fix ScyllaMetrics filtering In the ScyllaMetrics `get` function, when requesting the value for a specific shard, it is expected to return the sum of all values of metrics for that shard that match the labels. However, it would return the value of the first matching line it finds instead of summing all matching lines. For example, if we have two lines for one shard like: some_metric{scheduling_group_name="compaction",shard="0"} 1 some_metric{scheduling_group_name="sl:default",shard="0"} 2 The result of this call would be 1 instead of 3: get('some_metric', shard="0") We fix this to sum all matching lines. The filtering of lines by labels is fixed to allow specifying only some of the labels. Previously, for the line to match the filter, either the filter needs to be empty, or all the labels in the metric line had to be specified in the filter parameter and match its value, which is unexpected, and breaks when more labels are added. We also simplify the function signature and the implementation - instead of having the shard as a separate parameter, it can be specified as a label, like any other label. (cherry picked from commit `5c28cffdb4`)	2025-08-12 20:53:16 +03:00
Patryk Jędrzejczak	0a3b93a4e4	docs: Raft recovery procedure: recommend verifying participation in Raft recovery This instruction adds additional safety. The faster we notice that a node didn't restart properly, the better. The old gossip-based recovery procedure had a similar recommendation to verify that each restarting node entered `RECOVERY` mode. Fixes #25375 This is a documentation improvement. We should backport it to all branches with the new recovery procedure, so 2025.2 and 2025.3. Closes scylladb/scylladb#25376 (cherry picked from commit `7b77c6cc4a`) Closes scylladb/scylladb#25439	2025-08-11 15:52:41 +02:00
Botond Dénes	96ed160bd9	Merge '[Backport 2025.2] GCP Key Provider: Fix authentication issues' from Scylladb[bot] * Fix discovery of application default credentials by using fully expanded pathnames (no tildes). * Fix grant type in token request with user credentials. Fixes #25345. - (cherry picked from commit `77cc6a7bad`) - (cherry picked from commit `b1d5a67018`) Parent PR: #25351 Closes scylladb/scylladb#25406 * github.com:scylladb/scylladb: encryption: gcp: Fix the grant type for user credentials encryption: gcp: Expand tilde in pathnames for credentials file	2025-08-11 07:00:03 +03:00
Szymon Malewski	3791a6a4c5	test/alternator: enable more relevant logs in CI. This patch sets, for alternator test suite, all 'alternator-*' loggers and 'paxos' logger to trace level. This should significantly ease debugging of failed tests, while it has no effect on test time and increases log size only by 7%. This affects running alternator tests only with `test.py`, not with `test/alternator/run`. Closes #24645 Closes scylladb/scylladb#25327 (cherry picked from commit `eb11485969`) Closes scylladb/scylladb#25382	2025-08-11 06:59:34 +03:00
Taras Veretilnyk	3d31b2118f	docs: fix typo in command name enbleautocompaction -> enableautocompaction Renamed the file and updated all references from 'enbleautocompaction' to the correct 'enableautocompaction'. Fixes scylladb/scylladb#25172 Closes scylladb/scylladb#25175 (cherry picked from commit `6b6622e07a`) Closes scylladb/scylladb#25217	2025-08-11 06:58:08 +03:00
Benny Halevy	da0f9608d8	scylla-sstable: print_query_results_json: continue loop if row is disengaged Otherwise it is accessed right when exiting the if block. Add a unit test reproducing the issue and validating the fix. Fixes #25325 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#25326 (cherry picked from commit `5e5e63af10`) Closes scylladb/scylladb#25378	2025-08-10 18:54:29 +03:00
Botond Dénes	d845de84aa	Merge '[Backport 2025.2] truncate: change check for write during truncate into a log warning' from Scylladb[bot] TRUNCATE TABLE performs a memtable flush and then discards the sstables of the table being truncated. It collects the highest replay position for both of these. When the highest replay position of the discarded sstables is higher than the highest replay position of the flushed memtable, that means that we have had writes during truncate which have been flushed to disk independently of the truncate process. We check for this and trigger an on_internal_error() which throws an exception, informing the user that writing data concurrently with TRUNCATE TABLE is not advised. The problem with this is that truncate is also called from DROP KEYSPACE and DROP TABLE. These are raft operations and exceptions thrown by them are caught by the (...) exception handler in the raft applier fiber, which then exits leaving the node without the ability to execute subsequent raft commands. This commit changes the on_internal_error() into a warning log entry. It also outputs to keyspace/table names, and the offending replay positions which caused the check to fail. This PR also adds a test which validates that TRUNCATE works correctly with concurrent writes. More specifically, it checks that: - all data written before TRUNCATE starts is deleted - none of the data after TRUNCATE completes is deleted Fixes: #25173 Fixes: #25013 Backport is needed in versions which check for truncate with concurrent writes using `on_internal_error()`: 2025.3 2025.2 2025.1 - (cherry picked from commit `268ec72dc9`) - (cherry picked from commit `33488ba943`) Parent PR: #25174 Closes scylladb/scylladb#25349 * github.com:scylladb/scylladb: truncate: add test for truncate with concurrent writes truncate: change check for write during truncate into a log warning scylla-2025.2.2-candidate-20250808072156 scylla-2025.2.2	2025-08-08 11:44:41 +03:00
Nikos Dragazis	0e4b1196ee	encryption: gcp: Fix the grant type for user credentials Exchanging a refresh token for an access token requires the "refresh_token" grant type [1]. [1] https://datatracker.ietf.org/doc/html/rfc6749#section-6 Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> (cherry picked from commit `b1d5a67018`)	2025-08-07 21:45:48 +00:00
Nikos Dragazis	ebac51202f	encryption: gcp: Expand tilde in pathnames for credentials file The GCP host searches for application default credentials in known locations within the user's home directory using `seastar::file_exists()`. However, this function does not perform tilde expansion in pathnames. Replace tildes with the home directory from the HOME environment variable. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> (cherry picked from commit `77cc6a7bad`)	2025-08-07 21:45:48 +00:00
Taras Veretilnyk	edb10b8f4d	docs: Sort commands list in nodetool.rst Fixes scylladb/scylladb#25330 Closes scylladb/scylladb#25331 (cherry picked from commit `bcb90c42e4`) Closes scylladb/scylladb#25371	2025-08-07 13:13:59 +03:00
Botond Dénes	cccf726b54	Merge '[Backport 2025.2] test: introduce upgrade tests to test.py, add a SSTable dict compression upgrade test' from Michał Chojnowski This PR adds an upgrade test for SSTable compression with shared dictionaries, and adds some bits to pylib and test.py to support that. In the series, we: 1. Mount $XDG_CACHE_DIR into dbuild. 2. Add a pylib function which downloads and installs a released ScyllaDB package into a subdirectory of $XDG_CACHE_DIR/scylladb/test.py, and returns the path to bin/scylla. 3. Add new methods and params to the cluster manager, which let the test start nodes with historical Scylla executables, and switch executables during the test. 4. Add a test which uses the above to run an upgrade test between the released package and the current build. 5. Add --run-internet-dependent-tests to test.py which lets the user of test.py skip this test (and potentially other internet-dependent tests in the future). (The patch modifying wait_for_cql_and_get_hosts is a part of the new test — the new test needs it to test how particular nodes in a mixed-version cluster react to some CQL queries.) This is a follow-up to https://github.com/scylladb/scylladb/pull/23025, split into a separate PR because the potential addition of upgrade tests to test.py deserved a separate thread. Needs backport to 2025.2, because that's where the tested feature is introduced. Fixes https://github.com/scylladb/scylladb/issues/24110 - (cherry picked from commit `63218bb094`) - (cherry picked from commit `cc7432888e`) - (cherry picked from commit `34098fbd1f`) - (cherry picked from commit `2ef0db0a6b`) - (cherry picked from commit `1ff7e09edc`) - (cherry picked from commit `5da19ff6a6`) - (cherry picked from commit `d3cb873532`) - (cherry picked from commit `dd878505ca`) Parent PR: https://github.com/scylladb/scylladb/pull/23538 Closes scylladb/scylladb#25158 * github.com:scylladb/scylladb: test: add test_sstable_compression_dictionaries_upgrade.py test.py: add --run-internet-dependent-tests pylib/manager_client: add server_switch_executable test/pylib: in add_server, give a way to specify the executable and version-specific config pylib: pass scylla_env environment variables to the topology suite test/pylib: add get_scylla_2025_1_executable() pylib/scylla_cluster: give a way to pass executable-specific options to nodes dbuild: mount "$XDG_CACHE_HOME/scylladb"	2025-08-07 06:26:25 +03:00
Nikos Dragazis	5ae00a3dab	test: kmip: Fix segfault from premature destruction of port_promise `kmip_test_helper()` is a utility function to spawn a dedicated PyKMIP server for a particular Boost test case. The function runs the server as an external process and uses a thread to parse the port from the server's logs. The thread communicates the port to the main thread via a promise. The current implementation has a bug where the thread may set a value to the promise after its destruction, causing a segfault. This happens when the server does not start within 20 seconds, in which case the port future throws and the stack unwinding machinery destroys the port promise before the thread that writes to it. Fix the bug by declaring the promise before the cleanup action. The bug has been encountered in CI runs on slow machines, where the PyKMIP server takes too long to create its internal tables (due to slow fdatasync calls from SQLite). This patch does not improve CI stability - it only ensures that the error condition is properly reflected in the test output. This patch is not a backport. The same bug has been fixed in master as part of a larger rewrite of the `kmip_test_helper()` (see `722e2bce96`). Refs #24747, #24842. Fixes #24574. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#25029	2025-08-06 11:58:56 +03:00
Raphael S. Carvalho	be94db3ace	replica: Fix take_storage_snapshot() running concurrently to merge completion Some background: When merge happens, a background fiber wakes up to merge compaction groups of sibling tablets into main one. It cannot happen when rebuilding the storage group list, since token metadata update is not preemptable. So a storage group, post merge, has the main compaction group and two other groups to be merged into the main. When the merge happens, those two groups are empty and will be freed. Consider this scenario: 1) merge happens, from 2 to 1 tablet 2) produces a single storage group, containing main and two other compaction groups to be merged into main. 3) take_storage_snapshot(), triggered by migration post merge, gets a list of pointer to all compaction groups. 4) t__s__s() iterates first on main group, yields. 5) background fiber wakes up, moves the data into main and frees the two groups 6) t__s__s() advances to other groups that are now freed, since step 5. 7) segmentation fault In addition to memory corruption, there's also a potential for data to escape the iteration in take_storage_snapshot(), since data can be moved across compaction groups in background, all belonging to the same storage group. That could result in data loss. Readers should all operate on storage group level since it can provide a view on all the data owned by a tablet replica. The movement of sstable from group A to B is atomic, but iteration first on A, then later on B, might miss data that was moved from B to A, before the iteration reached B. By switching to storage group in the interface that retrieves groups by token range, we guarantee that all data of a given replica can be found regardless of which compaction group they sit on. Fixes #23162. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#24058 (cherry picked from commit `28056344ba`) Closes scylladb/scylladb#25338	2025-08-06 09:41:56 +03:00
Botond Dénes	22942c0a85	Merge '[Backport 2025.2] Raft-based recovery procedure: simplify rolling restart with recovery_leader' from Scylladb[bot] The following steps are performed in sequence as part of the Raft-based recovery procedure: - set `recovery_leader` to the host ID of the recovery leader in `scylla.yaml` on all live nodes, - send the `SIGHUP` signal to all Scylla processes to reload the config, - perform a rolling restart (with the recovery leader being restarted first). These steps are not intuitive and more complicated than they could be. In this PR, we simplify these steps. From now on, we will be able to simply set `recovery_leader` on each node just before restarting it. Apart from making necessary changes in the code, we also update all tests of the Raft-based recovery procedure and the user-facing documentation. Fixes scylladb/scylladb#25015 The Raft-based procedure was added in 2025.2. This PR makes the procedure simpler and less error-prone, so it should be backported to 2025.2 and 2025.3. - (cherry picked from commit `ec69028907`) - (cherry picked from commit `445a15ff45`) - (cherry picked from commit `23f59483b6`) - (cherry picked from commit `ba5b5c7d2f`) - (cherry picked from commit `9e45e1159b`) - (cherry picked from commit `f408d1fa4f`) Parent PR: #25032 Closes scylladb/scylladb#25334 * github.com:scylladb/scylladb: docs: document the option to set recovery_leader later test: delay setting recovery_leader in the recovery procedure tests gossip: add recovery_leader to gossip_digest_syn db: system_keyspace: peers_table_read_fixup: remove rows with null host_id db/config, gms/gossiper: change recovery_leader to UUID db/config, utils: allow using UUID as a config option	2025-08-06 09:41:17 +03:00
Michał Jadwiszczak	b58543dab7	storage_service, group0_state_machine: move SL cache update from `topology_state_load()` to `load_snapshot()` Currently the service levels cache is unnecessarily updated in every call of `topology_state_load()`. But it is enough to reload it only when a snapshot is loaded. (The cache is also already updated when there is a change to one of `service_levels_v2`, `role_members`, `role_attributes` tables.) Fixes scylladb/scylladb#25114 Fixes scylladb/scylladb#23065 Closes scylladb/scylladb#25116 (cherry picked from commit `10214e13bd`) Closes scylladb/scylladb#25304	2025-08-06 09:39:55 +03:00
Aleksandra Martyniuk	a468138716	api: storage_service: do not log the exception that is passed to user The exceptions that are thrown by the tasks started with API are propagated to users. Hence, there is no need to log it. Remove the logs about exception in user started tasks. Fixes: https://github.com/scylladb/scylladb/issues/16732. Closes scylladb/scylladb#25153 (cherry picked from commit `e607ef10cd`) Closes scylladb/scylladb#25296	2025-08-06 09:36:07 +03:00
Dawid Mędrek	8e96968fb7	test: Enable RF-rack-valid keyspaces in all Python suites We're enabling the configuration option `rf_rack_valid_keyspaces` in all Python test suites. All relevant tests have been adjusted to work with it enabled. That encompasses the following suites: * alternator, * broadcast_tables, * cluster (already enabled in scylladb/scylladb@ee96f8dcfc), * cql, * cqlpy (already enabled in scylladb/scylladb@be0877ce69), * nodetool, * rest_api. Two remaining suites that use tests written in Python, redis and scylla_gdb, are not affected, at least not directly. The redis suite requires creating an instance of Scylla manually, and the tests don't do anything that could violate the restriction. The scylla_gdb suite focuses on testing the capabilities of scylla-gdb.py, but even then it reuses the `run` file from the cqlpy suite. Fixes scylladb/scylladb#25126 Closes scylladb/scylladb#24617 (cherry picked from commit `b41151ff1a`) Closes scylladb/scylladb#25230	2025-08-06 09:35:34 +03:00
Ferenc Szili	932223414b	truncate: add test for truncate with concurrent writes test_validate_truncate_with_concurrent_writes checks if truncate deletes all the data written before the truncate starts, and does not delete any data after truncate completes. (cherry picked from commit `33488ba943`)	2025-08-06 00:51:43 +00:00
Ferenc Szili	b50272d663	truncate: change check for write during truncate into a log warning TRUNCATE TABLE performs a memtable flush and then discards the sstables of the table being truncated. It collects the highest replay position for both of these. When the highest replay position of the discarded sstables is higher than the highest replay position of the flushed memtable, that means that we have had writes during truncate which have been flushed to disk independently of the truncate process. We check for this and trigger an on_internal_error() which throws an exception, informing the user that writing data concurrently with TRUNCATE TABLE is not advised. The problem with this is that truncate is also called from DROP KEYSPACE and DROP TABLE. These are raft operations and exceptions thrown by them are caught by the (...) exception handler in the raft applier fiber, which then exits leaving the node without the ability to execute subsequent raft commands. This commit changes the on_internal_error() into a warning log entry. It also outputs to keyspace/table names, the truncated_at timepoint, the offending replay positions which caused the check to fail. Fixes: #25173 Fixes: #25013 (cherry picked from commit `268ec72dc9`)	2025-08-06 00:51:43 +00:00
Patryk Jędrzejczak	762b5e3ae8	docs: document the option to set recovery_leader later In one of the previous commits, we made it possible to set `recovery_leader` on each node just before restarting it. Here, we update the corresponding documentation. (cherry picked from commit `f408d1fa4f`)	2025-08-05 10:59:07 +00:00
Patryk Jędrzejczak	beb28a6155	test: delay setting recovery_leader in the recovery procedure tests In the previous commit, we made it possible to set `recovery_leader` on each node just before restarting it. Here, we change all the tests of the Raft-based recovery procedure to use and test this option. (cherry picked from commit `9e45e1159b`)	2025-08-05 10:59:06 +00:00
Patryk Jędrzejczak	b79902baf8	gossip: add recovery_leader to gossip_digest_syn In the new Raft-based recovery procedure, live nodes join the new group 0 one by one during a rolling restart. There is a time window when some of them are in the old group 0, while others are in the new group 0. This causes a group 0 mismatch in `gossiper::handle_syn_msg`. The current solution for this problem is to ignore group 0 mismatches if `recovery_leader` is set on the local node and to ask the administrator to perform the rolling restart in the following way: - set `recovery_leader` in `scylla.yaml` on all live nodes, - send the `SIGHUP` signal to all Scylla processes to reload the config, - proceed with the rolling restart. This commit makes `gossiper::handle_syn_msg` ignore group 0 mismatches when exactly one of the two gossiping nodes has `recovery_leader` set. We achieve this by adding `recovery_leader` to `gossip_digest_syn`. This change makes setting `recovery_leader` earlier on all nodes and reloading the config unnecessary. From now on, the administrator can simply restart each node with `recovery_leader` set. However, note that nodes that join group 0 must have `recovery_leader` set until all nodes join the new group 0. For example, assume that we are in the middle of the rolling restart and one of the nodes in the new group 0 crashes. It must be restarted with `recovery_leader` set, or else it would reject `gossip_digest_syn` messages from nodes in the old group 0. To avoid problems in such cases, we will continue to recommend setting `recovery_leader` in `scylla.yaml` instead of passing it as a command line argument. (cherry picked from commit `ba5b5c7d2f`)	2025-08-05 10:59:06 +00:00
Patryk Jędrzejczak	1738f244d2	db: system_keyspace: peers_table_read_fixup: remove rows with null host_id Currently, `peers_table_read_fixup` removes rows with no `host_id`, but not with null `host_id`. Null host IDs are known to appear in system tables, for example in `system.cluster_status` after a failed bootstrap. We better make sure we handle them properly if they ever appear in `system.peers`. This commit guarantees that null UUID cannot belong to `loaded_endpoints` in `storage_service::join_cluster`, which in particular ensures that we throw a runtime error when a user sets `recovery_leader` to null UUID during the recovery procedure. This is handled by the code verifying that `recovery_leader` belongs to `loaded_endpoints`. (cherry picked from commit `23f59483b6`)	2025-08-05 10:59:06 +00:00
Patryk Jędrzejczak	98e3b5e9b5	db/config, gms/gossiper: change recovery_leader to UUID We change the type of the `recovery_leader` config parameter and `gossip_config::recovery_leader` from sstring to UUID. `recovery_leader` is supposed to store host ID, so UUID is a natural choice. After changing the type to UUID, if the user provides an incorrect UUID, parsing `recovery_leader` will fail early, but the start-up will continue. Outside the recovery procedure, `recovery_leader` will then be ignored. In the recovery procedure, the start-up will fail on: ``` throw std::runtime_error( "Cannot start - Raft-based topology has been enabled but persistent group 0 ID is not present. " "If you are trying to run the Raft-based recovery procedure, you must set recovery_leader."); ``` (cherry picked from commit `445a15ff45`)	2025-08-05 10:59:06 +00:00
Patryk Jędrzejczak	1434afa588	db/config, utils: allow using UUID as a config option We change the `recovery_leader` option to UUID in the following commit. (cherry picked from commit `ec69028907`)	2025-08-05 10:59:06 +00:00
Nikos Dragazis	f53588813b	test: Use in-memory SQLite for PyKMIP server The PyKMIP server uses an SQLite database to store artifacts such as encryption keys. By default, SQLite performs a full journal and data flush to disk on every CREATE TABLE operation. Each operation triggers three fdatasync(2) calls. If we multiply this by 16, that is the number of tables created by the server, we get a significant number of file syncs, which can last for several seconds on slow machines. This behavior has led to CI stability issues from KMIP unit tests where the server failed to complete its schema creation within the 20-second timeout (observed on spider9 and spider11). Fix this by configuring the server to use an in-memory SQLite. Fixes #24842. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#24995 (cherry picked from commit `2656fca504`) Closes scylladb/scylladb#25299	2025-08-02 17:13:13 +03:00
Tomasz Grabiec	1dfb9d23ea	topology_coordinator: Trigger load stats refresh after replace Otherwise, tablet rebuilt will be delayed for up to 60s, as the tablet scheduler needs load stats for the new node (replacing) to make decisisons. Fixes #25163 Closes scylladb/scylladb#25181 (cherry picked from commit `55116ee660`) Closes scylladb/scylladb#25214	2025-08-02 01:26:59 +02:00
Piotr Dulikowski	c4b4db62e3	Merge '[Backport 2025.2] qos: don't populate effective service level cache until auth is migrated to raft' from Scylladb[bot] Right now, service levels are migrated in one group0 command and auth is migrated in the next one. This has a bad effect on the group0 state reload logic - modifying service levels in group0 causes the effective service levels cache to be recalculated, and to do so we need to fetch information about all roles. If the reload happens after SL upgrade and before auth upgrade, the query for roles will be directed to the legacy auth tables in system_auth - and the query, being a potentially remote query, has a timeout. If the query times out, it will throw an exception which will break the group0 apply fiber and the node will need to be restarted to bring it back to work. In order to solve this issue, make sure that the service level module does not start populating and using the service level cache until both service levels and auth are migrated to raft. This is achieved by adding the check both to the cache population logic and the effective service level getter - they now look at service level's accessor new method, `can_use_effective_service_level_cache` which takes a look at the auth version. Fixes: scylladb/scylladb#24963 Should be backported to all versions which support upgrade to topology over raft - the issue described here may put the cluster into a state which is difficult to get out of (group0 apply fiber can break on multiple nodes, which necessitates their restart). - (cherry picked from commit `2bb800c004`) - (cherry picked from commit `3a082d314c`) Parent PR: #25188 Closes scylladb/scylladb#25284 * github.com:scylladb/scylladb: test: sl: verify that legacy auth is not queried in sl to raft upgrade qos: don't populate effective service level cache until auth is migrated to raft	2025-08-01 17:15:25 +02:00
Jenkins Promoter	4ff03275b5	Update ScyllaDB version to: 2025.2.2	2025-08-01 16:35:40 +03:00
Jenkins Promoter	4e6c0aaab5	Update pgo profiles - aarch64	2025-08-01 04:41:23 +03:00
Jenkins Promoter	dd95ea454a	Update pgo profiles - x86_64	2025-08-01 04:29:06 +03:00
Piotr Dulikowski	d9075e6160	test: sl: verify that legacy auth is not queried in sl to raft upgrade Adjust `test_service_levels_upgrade`: right before upgrade to topology on raft, enable an error injection which triggers when the standard role manager is about to query the legacy auth tables in the system_auth keyspace. The preceding commit which fixes scylladb/scylladb#24963 makes sure that the legacy tables are not queried during upgrade to topology on raft, so the error injection does not trigger and does not cause a problem; without that commit, the test fails. (cherry picked from commit `3a082d314c`)	2025-07-31 15:13:24 +00:00
Piotr Dulikowski	618459125d	qos: don't populate effective service level cache until auth is migrated to raft Right now, service levels are migrated in one group0 command and auth is migrated in the next one. This has a bad effect on the group0 state reload logic - modifying service levels in group0 causes the effective service levels cache to be recalculated, and to do so we need to fetch information about all roles. If the reload happens after SL upgrade and before auth upgrade, the query for roles will be directed to the legacy auth tables in system_auth - and the query, being a potentially remote query, has a timeout. If the query times out, it will throw an exception which will break the group0 apply fiber and the node will need to be restarted to bring it back to work. In order to solve this issue, make sure that the service level module does not start populating and using the service level cache until both service levels and auth are migrated to raft. This is achieved by adding the check both to the cache population logic and the effective service level getter - they now look at service level's accessor new method, `can_use_effective_service_level_cache` which takes a look at the auth version. Fixes: scylladb/scylladb#24963 (cherry picked from commit `2bb800c004`)	2025-07-31 15:13:23 +00:00
Jakub Smolar	4a1de6725a	gdb: handle zero-size reads in managed_bytes Fixes: https://github.com/scylladb/scylladb/issues/25048 Closes scylladb/scylladb#25050 (cherry picked from commit `6e0a063ce3`) Closes scylladb/scylladb#25141	2025-07-31 13:06:45 +03:00
Pavel Emelyanov	45101e072e	Merge '[Backport 2025.2] transport: remove throwing protocol_exception on connection start' from Dario Mirovic Note: The simplest approach to resolving `process_request_one` merge issues, since it has been refactored, was to include the three commits from before, and then the commits that are actually being backported. `protocol_exception` is thrown in several places. This has become a performance issue, especially when starting/restarting a server. To alleviate this issue, throwing the exception has to be replaced with returning it as a result or an exceptional future. This PR replaces throws in the `transport/server` module. This is achieved by using result_with_exception, and in some places, where suitable, just by creating and returning an exceptional future. There are four commits in this PR. The first commit introduces tests in `test/cqlpy`. The second commit refactors transport server `handle_error` to not rethrow exceptions. The third commit refactors reusable buffer writer callbacks. The fourth commit replaces throwing `protocol_exception` to returning it. Based on the comments on an issue linked in https://github.com/scylladb/scylladb/issues/24567, the main culprit from the side of protocol exceptions is the invalid protocol version one, so I tested that exception for performance. In order to see if there is a measurable difference, a modified version of `test_protocol_version_mismatch` Python is used, with 100'000 runs across 10 processes (not threads, to avoid Python GIL). One test run consisted of 1 warm-up run and 5 measured runs. First test run has been executed on the current code, with throwing protocol exceptions. Second test urn has been executed on the new code, with returning protocol exceptions. The performance report is in https://github.com/scylladb/scylladb/pull/24738#issuecomment-3051611069. It shows ~10% gains in real, user, and sys time for this test. Testing Build: `release` Test file: `test/cqlpy/test_protocol_exceptions.py` Test name: `test_protocol_version_mismatch` (modified for mass connection requests) Test arguments: ``` max_attempts=100'000 num_parallel=10 ``` Throwing `protocol_exception` results: ``` real=1:26.97 user=10:00.27 sys=2:34.55 cpu=867% real=1:26.95 user=9:57.10 sys=2:32.50 cpu=862% real=1:26.93 user=9:56.54 sys=2:35.59 cpu=865% real=1:26.96 user=9:54.95 sys=2:32.33 cpu=859% real=1:26.96 user=9:53.39 sys=2:33.58 cpu=859% real=1:26.95 user=9:56.85 sys=2:34.11 cpu=862% # average ``` Returning `protocol_exception` as `result_with_exception` or an exceptional future: ``` real=1:18.46 user=9:12.21 sys=2:19.08 cpu=881% real=1:18.44 user=9:04.03 sys=2:17.91 cpu=869% real=1:18.47 user=9:12.94 sys=2:19.68 cpu=882% real=1:18.49 user=9:13.60 sys=2:19.88 cpu=883% real=1:18.48 user=9:11.76 sys=2:17.32 cpu=878% real=1:18.47 user=9:10.91 sys=2:18.77 cpu=879% # average ``` This PR replaced `transport/server` throws of `protocol_exception` with returns. There are a few other places where protocol exceptions are thrown, and there are many places where `invalid_request_exception` is thrown. That is out of scope of this single PR, so the PR just refs, and does not resolve issue #24567. Refs: #24567 This PR improves performance in cases when protocol exceptions happen, for example during connection storms. It will require backporting. * (cherry picked from commit `7aaeed012e`) * (cherry picked from commit `30d424e0d3`) * (cherry picked from commit `9f4344a435`) * (cherry picked from commit `5390f92afc`) * (cherry picked from commit `4a6f71df68`) Parent PR: #24738 Closes scylladb/scylladb#25239 * github.com:scylladb/scylladb: test/cqlpy: add cpp exception metric test conditions transport/server: replace protocol_exception throws with returns utils/reusable_buffer: accept non-throwing writer callbacks via result_with_exception transport/server: avoid exception-throw overhead in handle_error test/cqlpy: add protocol_exception tests transport: remove redundant references in process_request_one transport: fix the indentation in process_request_one transport: add futures in CQL server exception handling	2025-07-31 12:18:50 +03:00
Anna Stuchlik	96a01082bb	doc: add tablets support information to the Drivers table This commit: - Extends the Drivers support table with information on which driver supports tablets and since which version. - Adds the driver support policy to the Drivers page. - Reorganizes the Drivers page to accommodate the updates. In addition: - The CPP-over-Rust driver is added to the table. - The information about Serverless (which we don't support) is removed and replaced with tablets to correctly describe the contents of the table. Fixes https://github.com/scylladb/scylladb/issues/19471 Refs https://github.com/scylladb/scylladb-docs-homepage/issues/69 Closes scylladb/scylladb#24635 (cherry picked from commit `18b4d4a77c`) Closes scylladb/scylladb#25250	2025-07-31 12:18:33 +03:00
Aleksandra Martyniuk	782fb029d6	streaming: close sink when exception is thrown If an exception is thrown in result_handling_cont in streaming, then the sink does not get closed. This leads to a node crash. Close sink in exception handler. Fixes: https://github.com/scylladb/scylladb/issues/25165. Closes scylladb/scylladb#25238 (cherry picked from commit `99ff08ae78`) Closes scylladb/scylladb#25267	2025-07-31 12:18:17 +03:00
Dario Mirovic	9708d9c4d7	test/cqlpy: add cpp exception metric test conditions Tested code paths should not throw exceptions. `scylla_reactor_cpp_exceptions` metric is used. This is a global metric. To address potential test flakiness, each test runs multiple times: - `run_count = 100` - `cpp_exception_threshold = 10` If a change in the code introduced an exception, expectation is that the number of registered exceptions will be > `cpp_exception_threshold` in `run_count` runs. In which case the test fails. Fixes: #25272 (cherry picked from commit `4a6f71df68`)	2025-07-30 21:54:47 +02:00
Dario Mirovic	dc819ebda1	transport/server: replace protocol_exception throws with returns Replace throwing protocol_exception with returning it as a result or an exceptional future in the transport server module. This improves performance, for example during connection storms and server restarts, where protocol exceptions are more frequent. In functions already returning a future, protocol exceptions are propagated using an exceptional future. In functions not already returning a future, result_with_exception is used. Notable change is checking v.failed() before calling v.get() in process_request function, to avoid throwing in case of an exceptional future. Refs: #24567 Fixes: #25272 (cherry picked from commit `5390f92afc`)	2025-07-30 21:54:45 +02:00
Dario Mirovic	a8d38882ff	utils/reusable_buffer: accept non-throwing writer callbacks via result_with_exception Make make_bytes_ostream and make_fragmented_temporary_buffer accept writer callbacks that return utils::result_with_exception instead of forcing them to throw on error. This lets callers propagate failures by returning an error result rather than throwing an exception. Introduce buffer_writer_for, bytes_ostream_writer, and fragmented_buffer_writer concepts to simplify and document the template requirements on writer callbacks. This patch does not modify the actual callbacks passed, except for the syntax changes needed for successful compilation, without changing the logic. Refs: #24567 Fixes: #25272 (cherry picked from commit `9f4344a435`)	2025-07-30 21:54:41 +02:00

1 2 3 4 5 ...

48002 Commits