scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 19:46:48 +00:00

Author	SHA1	Message	Date
Kefu Chai	2480decbc7	doc: import the new pub keys used to sign the package before this change, when user follows the instruction, they'd get ```console $ sudo apt-get update Hit:1 http://us-east-1.ec2.archive.ubuntu.com/ubuntu noble InRelease Hit:2 http://us-east-1.ec2.archive.ubuntu.com/ubuntu noble-updates InRelease Hit:3 http://us-east-1.ec2.archive.ubuntu.com/ubuntu noble-backports InRelease Hit:4 http://security.ubuntu.com/ubuntu noble-security InRelease Get:5 https://downloads.scylladb.com/downloads/scylla/deb/debian-ubuntu/scylladb-6.2 stable InRelease [7550 B] Err:5 https://downloads.scylladb.com/downloads/scylla/deb/debian-ubuntu/scylladb-6.2 stable InRelease The following signatures couldn't be verified because the public key is not available: NO_PUBKEY A43E06657BAC99E3 Reading package lists... Done W: GPG error: https://downloads.scylladb.com/downloads/scylla/deb/debian-ubuntu/scylladb-6.2 stable InRelease: The following signatures couldn't be verified because the public key is not av ailable: NO_PUBKEY A43E06657BAC99E3 E: The repository 'https://downloads.scylladb.com/downloads/scylla/deb/debian-ubuntu/scylladb-6.2 stable InRelease' is not signed. N: Updating from such a repository can't be done securely, and is therefore disabled by default. N: See apt-secure(8) manpage for repository creation and user configuration details. ``` because the packages were signed with a different keyring. in this change, we import the new pubkey, so that the pacakge manager can verify the new packages (2024.2+ and 6.2+) signed with the new key. see also https://github.com/scylladb/scylla-ansible-roles/issues/399 and https://forum.scylladb.com/t/release-scylla-manager-3-3-1/2516 for the annonucement on using the new key. Fixes scylladb/scylladb#21557 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21524 (cherry picked from commit `1cedc45c35`) Closes scylladb/scylladb#21588	2024-11-15 10:36:44 +02:00
Botond Dénes	687a18db38	Merge 'scylla_raid_setup: fix failure on SELinux package installation' from Takuya ASADA After merged `5a470b2bfb`, we found that scylla_raid_setup fails on offline mode installation. This is because pkg_install() just print error and exit script on offline mode, instead of installing packages since offline mode not supposed able to connect internet. Seems like it occur because of missing "policycoreutils-python-utils" package, which is the package for "semange" command. So we need to implement the relabeling patch without using the command. Fixes https://github.com/scylladb/scylladb/issues/21441 Also, since Amazon Linux 2 has different package name for semange, we need to adjust package name. Fixes https://github.com/scylladb/scylladb/issues/21351 Closes scylladb/scylladb#21474 * github.com:scylladb/scylladb: scylla_raid_setup: support installing semanage on Amazon Linux 2 scylla_raid_setup: fix failure on SELinux package installation (cherry picked from commit `1c212df62d`) Closes scylladb/scylladb#21547	2024-11-14 15:51:06 +02:00
Botond Dénes	548170fb68	Merge '[Backport 6.2] compaction_manager: stop_tasks, stop_ongoing_compactions: ignore errors' from ScyllaDB stop() methods, like destructors must always succeed, and returning errors from them is futile as there is nothing else we can do with them by continue with shutdown. stop_ongoing_compactions, in particular, currently returns the status of stopped compaction tasks from `stop_tasks`, but still all tasks must be stopped after it, even if they failed, so assert that and ignore the errors. Fixes scylladb/scylladb#21159 * Needs backport to 6.2 and 6.1, as commit `8cc99973eb` causes handles storage that might cause compaction tasks to fail and eventually terminate on shudown when the exceptions are thrown in noexcept context in the deferred stop destructor body (cherry picked from commit `e942c074f2`) (cherry picked from commit `d8500472b3`) (cherry picked from commit `c08ba8af68`) (cherry picked from commit `a7a55298ea`) (cherry picked from commit `6cce67bec8`) Refs #21299 Closes scylladb/scylladb#21434 * github.com:scylladb/scylladb: compaction_manager: stop: await _stop_future if engaged compaction_manager: really_do_stop: assert that no tasks are left behind compaction_manager: stop_tasks, stop_ongoing_compactions: ignore errors compaction/compaction_manager: stop_tasks(): unlink stopped tasks compaction/compaction_manager: make _tasks an intrusive list	2024-11-14 06:59:52 +02:00
Jenkins Promoter	75b79a30da	Update ScyllaDB version to: 6.2.2	2024-11-13 23:22:52 +02:00
Benny Halevy	bdf31d7f54	compaction_manager: stop: await _stop_future if engaged The current condition that consults the compaction manager state for awaiting `_stop_future` works since _stop_future is assigned after the state is set to `stopped`, but it is incidental. What matters is that `_stop_future` is engaged. While at it, exchange _stop_future with a ready future so that stop() can be safely called multiple times. And dropped the superfluous co_return. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `6cce67bec8`)	2024-11-13 10:00:47 +02:00
Benny Halevy	3d915cd091	compaction_manager: really_do_stop: assert that no tasks are left behind stop_ongoing_compactions now ignores any errors returned by tasks, and it should leave no task left behind. Assert that here, before the compaction_manager is destroyed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `a7a55298ea`)	2024-11-13 09:59:57 +02:00
Benny Halevy	abb26ff913	compaction_manager: stop_tasks, stop_ongoing_compactions: ignore errors stop() methods, like destructors must always succeed, and returning errors from them is futile as there is nothing else we can do with them but continue with shutdown. Leaked errors on the stop path may cause termination on shutdown, when called in a deferred action destructor. Fixes scylladb/scylladb#21298 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `c08ba8af68`)	2024-11-13 09:56:42 +02:00
Botond Dénes	3f821b7f4f	compaction/compaction_manager: stop_tasks(): unlink stopped tasks Stopped tasks currently linger in _tasks until the fiber that created the task is scheduled again and unlinks the task. This window between stop and remove prevents reliable checks for empty _tasks list after all tasks are stopped. Unlink the task early so really_do_stop() can safely check for an empty _tasks list (next patch). (cherry picked from commit `d8500472b3`)	2024-11-13 09:56:21 +02:00
Botond Dénes	cab3b86240	compaction/compaction_manager: make _tasks an intrusive list _tasks is currently std::list<shared_ptr<compaction_task_executor>>, but it has no role in keeping the instances alive, this is done by the fibers which create the task (and pin a shared ptr instance). This lends itself to an intrusive list, avoiding that extra allocation upon push_back(). Using an intrusive list also makes it simpler and much cheaper (O(1) vs. O(N)) to remove tasks from the _tasks list. This will be made use of in the next patch. Code using _task has to be updated because the value_type changes from shared_ptr<compaction_task_executor> to compaction_task_executor&. (cherry picked from commit `e942c074f2`)	2024-11-13 09:48:00 +02:00
Piotr Dulikowski	2fa4f3a9fc	Merge 'main,cql_test_env: start group0_service before view_builder' from Michał Jadwiszczak In scylladb/scylladb#19745, view_builder was migrated to group0 and since then it is dependant on group0_service. Because of this, group0_service should be initialized/destroyed before/after view_builder. This patch also adds error injection to `raft_server_with_timeouts::read_barrier`, which does 1s sleep before doing the read barrier. There is a new test which reproduces the use after free bug using the error injection. Fixes scylladb/scylladb#20772 scylladb/scylladb#19745 is present in 6.2, so this fix should be backported to it. Closes scylladb/scylladb#21471 * github.com:scylladb/scylladb: test/boost/secondary_index_test: add test for use after free api/raft: use `get_server_with_timeouts().read_barrier()` in coroutines main,cql_test_env: start group0_service before view_builder (cherry picked from commit `7021efd6b0`) Closes scylladb/scylladb#21506	2024-11-12 14:36:06 +01:00
Yaron Kaikov	a3e69cc8fb	./github/workflows/add-label-when-promoted.yaml: Run auto-backport only on default branch In https://github.com/scylladb/scylladb/pull/21496#event-15221789614 ``` scylladbbot force-pushed the backport/21459/to-6.1 branch from 414691c to `59a4ccd` Compare 2 days ago ``` Backport automation triggered by `push` but also should either start from `master` branch (or `enterprise` branch from Enterprise), we need to verify it by checking also the default branch. Fixes: https://github.com/scylladb/scylladb/issues/21514 Closes scylladb/scylladb#21515 (cherry picked from commit `2596d1577b`) Closes scylladb/scylladb#21531	2024-11-11 17:43:54 +02:00
Michał Chojnowski	876017efee	mvcc_test: fix a benign failure of test_apply_to_incomplete_respects_continuity For performance reasons, mutation_partition_v2::maybe_drop(), and by extension also mutation_partition_v2::apply_monotonically(mutation_partition_v2&&) can evict empty row entries, and hence change the continuity of the merged entry. For checking that apply_to_incomplete respects continuity, test_apply_to_incomplete_respects_continuity obtains the continuity of the partition entry before and after apply_to_incomplete by calling e.squashed().get_continuity(). But squashed() uses apply_monotonically(), so in some circumstances the result of squashed() can have smaller continuity than the argument of squashed(), which messes with the thing that the test is trying to check, and causes spurious failures. This patch changes the method of calculating the continuity set, so that it matches the entry exactly, fixing the test failures. Fixes scylladb/scylladb#13757 Closes scylladb/scylladb#21459 (cherry picked from commit `35921eb67e`) Closes scylladb/scylladb#21497	2024-11-08 15:32:24 +01:00
Yaron Kaikov	9eed1d1cbd	.github/scripts/auto-backport.py: update method to get closed prs `commit.get_pulls()` in PyGithub returns pull requests that are directly associated with the given commit Since in closed PR. the relevant commit is an event type, the backport automation didn't get the PR info for backporting Ref: https://github.com/scylladb/scylladb/issues/18973 Closes scylladb/scylladb#21468 (cherry picked from commit `ef104b7b96`) Closes scylladb/scylladb#21483	2024-11-08 10:26:10 +02:00
Yaron Kaikov	d33538bdd4	.github/script/auto-backport.py: push backport PR to `scylladbbot` fork Since Scylla is a public repo, when we create a fork, it doesn't fork the team and permissions (unlike private repos where it does). When we have a backport PR with conflicts, the developers need to be able to update the branch to fix the conflicts. To do so, we modified the logic of the backport automation as follows: - Every backport PR (with and without conflicts) will be open directly on the `scylladbbot` fork repo - When there are conflicts, an email will be sent to the original PR author with an invitation to become a contributor in the `scylladbbot` fork with `push` permissions. This will happen only once if Auther is not a contributor. - Together with sending the invite, all backport labels will be removed and a comment will be added to the original PR with instructions - The PR author must add the backport labels after the invitation is accepted Fixes: https://github.com/scylladb/scylladb/issues/18973 Closes scylladb/scylladb#21401 (cherry picked from commit `77604b4ac7`) Closes scylladb/scylladb#21466	2024-11-07 12:38:56 +02:00
Yaron Kaikov	073c9cbaa1	github: add script for backports automation instead of Mergify Adding an auto-backport.py script to handle backport automation instead of Mergify. The rules of backport are as follows: * Merged or Closed PRs with any backport/x.y label (one or more) and promoted-to-master label * Backport PR will be automatically assigned to the original PR author * In case of conflicts the backport PR will be open in the original autoor fork in draft mode. This will give the PR owner the option to resolve conflicts and push those changes to the PR branch (Today in Scylla when we have conflicts, the developers are forced to open another PR and manually close the backport PR opened by Mergify) * Fixing cherry-pick the wrong commit SHA. With the new script, we always take the SHA from the stable branch * Support backport for enterprise releases (from Enterprise branch) Fixes: https://github.com/scylladb/scylladb/issues/18973 (cherry picked from commit `f9e171c7af`) Closes scylladb/scylladb#21469	2024-11-07 06:57:05 +02:00
Tomasz Grabiec	a3a0ffbcd0	Merge 'tablet: Fix single-sstable split when attaching new unsplit sstables' from Raphael "Raph" Carvalho To fix a race between split and repair here `c1de4859d8`, a new sstable generated during streaming can be split before being attached to the sstable set. That's to prevent an unsplit sstable from reaching the set after the tablet map is resized. So we can think this split is an extension of the sstable writer. A failure during split means the new sstable won't be added. Also, the duration of split is also adding to the time erm is held. For example, repair writer will only release its erm once the split sstable is added into the set. This single-sstable split is going through run_custom_job(), which serializes with other maintenance tasks. That was a terrible decision, since the split may have to wait for ongoing maintenance task to finish, which means holding erm for longer. Additionally, if split monitor decides to run split on the entire compaction group, it can cause single-sstable split to be aborted since the former wants to select all sstables, propagating a failure to the streaming writer. That results in new sstable being leaked and may cause problems on restart, since the underlying tablet may have moved elsewhere or multiple splits may have happened. We have some fragility today in cleaning up leaked sstables on streaming failure, but this single-sstable split made it worse since the failure can happen during normal operation, when there's e.g. no I/O error. It makes sense to kill run_custom_job() usage, since the single-sstable split is offline and an extension of sstable writing, therefore it makes no sense to serialize with maintenance tasks. It must also inherit the sched group of the process writing the new sstable. The inheritance happens today, but is fragile. Fixes #20626. Closes scylladb/scylladb#20737 * github.com:scylladb/scylladb: tablet: Fix single-sstable split when attaching new unsplit sstables replica: Fix tablet split execute after restart (cherry picked from commit `bca8258150`) Ref scylladb/scylladb#21415 scylla-6.2.1 scylla-6.2.1-candidate-20241106103631	2024-11-06 15:01:35 +02:00
Botond Dénes	8bf76d6be7	Merge '[Backport 6.2] replica: Fix tombstone GC during tablet split preparation' from Raphael Raph Carvalho During split prepare phase, there will be more than 1 compaction group with overlapping token range for a given replica. Assume tablet 1 has sstable A containing deleted data, and sstable B containing a tombstone that shadows data in A. Then split starts: sstable B is split first, and moved from main (unsplit) group to a split-ready group now compaction runs in split-ready group before sstable A is split tombstone GC logic today only looks at underlying group, so compaction is step 2 will discard the deleted data in A, since it belongs to another group (the unsplit one), and so the tombstone can be purged incorrectly. To fix it, compaction will now work with all uncompacting sstables that belong to the same replica, since tombstone GC requires all sstables that possibly contain shadowed data to be available for correct decision to be made. Fixes https://github.com/scylladb/scylladb/issues/20044. Please replace this line with justification for the backport/* labels added to this PR Branches 6.0, 6.1 and 6.2 are vulnerable, so backport is needed. (cherry picked from commit `bcd358595f`) (cherry picked from commit `93815e0649`) Refs https://github.com/scylladb/scylladb/pull/20939 Closes scylladb/scylladb#21206 * github.com:scylladb/scylladb: replica: Fix tombstone GC during tablet split preparation service: Improve error handling for split	2024-11-06 09:55:47 +02:00
Raphael S. Carvalho	1e51ed88c6	replica: Fix tombstone GC during tablet split preparation During split prepare phase, there will be more than 1 compaction group with overlapping token range for a given replica. Assume tablet 1 has sstable A containing deleted data, and sstable B containing a tombstone that shadows data in A. Then split starts: 1) sstable B is split first, and moved from main (unsplit) group to a split-ready group 2) now compaction runs in split-ready group before sstable A is split tombstone GC logic today only looks at underlying group, so compaction is step 2 will discard the deleted data in A, since it belongs to another group (the unsplit one), and so the tombstone can be purged incorrectly. To fix it, compaction will now work with all uncompacting sstables that belong to the same replica, since tombstone GC requires all sstables that possibly contain shadowed data to be available for correct decision to be made. Fixes #20044. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `93815e0649`)	2024-11-04 14:24:18 -03:00
Raphael S. Carvalho	ca5f938ed4	service: Improve error handling for split Retry wasn't really happening since the loop was broken and sleep part was skipped on error. Also, we were treating abort of split during shutdown as if it were an actual error and that confused longevity tests that parse for logs with error level. The fix is about demoting the level of logs when we know the exception comes from shutdown. Fixes #20890. (cherry picked from commit `bcd358595f`)	2024-11-04 14:22:08 -03:00
Botond Dénes	fb20ea7de1	Merge '[Backport 6.2] tasks: fix virtual tasks children' from ScyllaDB Fix how regular tasks that have a virtual parent are created in task_manager::module::make_task: set sequence number of a task and subscribe to module's abort source. Fixes: #21278. Needs backport to 6.2 (cherry picked from commit `1eb47b0bbf`) (cherry picked from commit `910a6fc032`) Refs #21280 Closes scylladb/scylladb#21332 * github.com:scylladb/scylladb: tasks: fix sequence number assignment tasks: fix abort source subscription of virtual task's child	2024-11-04 18:18:35 +02:00
Tzach Livyatan	d5eb12c25d	Update os-support-info.rst - add CentOS ScyllaDB support RHEL 9 and derivatives, including CentOS 9. Fix https://github.com/scylladb/scylladb/issues/21309 (cherry picked from commit `1878af9399`) Closes scylladb/scylladb#21331	2024-11-04 18:17:46 +02:00
Aleksandra Martyniuk	291f568585	test: repair: drop log checks from test_repair_succeeds_with_unitialized_bm Currently, test_repair_succeeds_with_unitialized_bm checks whether repair finishes successfully and the error is properly handled if batchlog_manager isn't initialized. Error handling depends on logs, making the test fragile to external conditions and flaky. Drop the error handling check, successful repair is a sufficient passing condition. Fixes: #21167. (cherry picked from commit `85d9565158`) Closes scylladb/scylladb#21330	2024-11-04 18:16:55 +02:00
Botond Dénes	d5475fbc07	Merge '[Backport 6.2] repair: Fix finished ranges metrics for removenode' from ScyllaDB The skipped ranges should be multiplied by the number of tables Otherwise the finished ranges ratio will not reach 100%. Fixes #21174 (cherry picked from commit `cffe3dc49f`) (cherry picked from commit `1392a6068d`) (cherry picked from commit `9868ccbac0`) Refs #21252 Closes scylladb/scylladb#21313 * github.com:scylladb/scylladb: test: Add test_node_ops_metrics.py repair: Make the ranges more consistent in the log repair: Fix finished ranges metrics for removenode	2024-11-04 18:16:21 +02:00
Anna Stuchlik	6916dbe822	doc: remove the Cassandra references from notedool This PR removes the reference to Cassandra from the nodetool index, as the native nodetool is no longer a fork. In addition, it removes the Apache copyright. Fixes https://github.com/scylladb/scylladb/issues/21238 (cherry picked from commit `ef4bcf8b3f`) Closes scylladb/scylladb#21307	2024-11-04 18:15:36 +02:00
Michał Jadwiszczak	f51a8ed541	test/auth_cluster/test_raft_service_levels: match enterprise SL limit Despite OSS doesn't limit number of created service levels, match the enterprise limit to decrease divergence in the test between OSS and enterprise. Fixes scylladb/scylladb#21044 (cherry picked from commit `846d94134f`) Closes scylladb/scylladb#21282	2024-11-04 18:14:38 +02:00
Calle Wilund	127606f788	cql_test_env/gossip: Prevent double shutdown call crash Fixes #21159 When an exception is thrown in sstable write etc such that storage_manager::isolate is initiated, we start a shutdown chain for message service, gossip etc. These are synced (properly) in storage_manager::stop, but if we somehow call gossiper::shutdown outside the normal service::stop cycle, we can end up running the method simultaneously, intertwined (missing the guard because of the state change between check and set). We then end up co_awaiting an invalid future (_failure_detector_loop_done) - a second wait. Fixed by a.) Remove superfluous gossiper::shutdown in cql_test_env. This was added in `20496ed`, ages ago. However, it should not be needed nowadays. b.) Ensure _failure_detector_loop_done is always waitable. Just to be sure. (cherry picked from commit `c28a5173d9`) Closes scylladb/scylladb#21393	2024-11-04 16:52:42 +01:00
Benny Halevy	56a0fa922d	storage_service: on_change: update_peer_info only if peer info changed Return an optional peer_info from get_peer_info_for_update when the `app_state_map` arg does not change peer_info, so that we can skip calling update_peer_info, if it didn't change. Fixes scylladb/scylladb#20991 Refs scylladb/scylladb#16376 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#21152 (cherry picked from commit `04d741bcbb`)	2024-11-04 11:20:32 +02:00
Benny Halevy	c841a4a851	compaction_manager: compaction_disabled: return true if not in compaction_state When a compaction_group is removed via `compaction_manager::remove`, it is erase from `_compaction_state`, and therefore compaction is definitely not enabled on it. This triggers an internal error if tablets are cleaned up during drop/truncate, which checks that compaction is disabled in all compaction groups. Note that the callers of `compaction_disabled` aren't really interested in compaction being actively disabled on the compaction_group, but rather if it's enabled or not. A follow-up patch can be consider to reverse the logic and expose `compaction_enabled` rather than `compaction_disabled`. Fixes scylladb/scylladb#20060 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `1c55747637`) Closes scylladb/scylladb#21404	2024-11-03 16:05:05 +02:00
Gleb Natapov	1a9721e93e	topology coordinator: take a copy of a replication state in raft_topology_cmd_handler Current code takes a reference and holds it past preemption points. And while the state itself is not suppose to change the reference may become stale because the state is re-created on each raft topology command. Fix it by taking a copy instead. This is a slow path anyway. Fixes: scylladb/scylladb#21220 (cherry picked from commit `fb38bfa35d`) Closes scylladb/scylladb#21361	2024-10-30 14:11:17 +01:00
Kamil Braun	1dded7e52f	Merge '[Backport 6.2] fix nodetool status to show zero-token nodes' from ScyllaDB In the current scenario, the nodetool status doesn’t display information regarding zero token nodes. For example, if 5 nodes are spun by the administrator, out of which, 2 nodes are zero token nodes, then nodetool status only shows information regarding the 3 non-zero token nodes. This commit intends to fix this issue by leveraging the “/storage_service/host_id ” API and adding appropriate logic in scylla-nodetool.cc to support zero token nodes. A test is also added in nodetool/test_status.py to verify this logic. This test fails without this commit’s zero token node support logic, hence verifying the behavior. This PR fixes a bug. Hence we need to backport it. Backporting needs to be done only to 6.2 version, since earlier versions don't support zero token nodes. Fixes: scylladb/scylladb#19849 Fixes: scylladb/scylladb#17857 (cherry picked from commit `72f3c95a63`) (cherry picked from commit `39dfd2d7ac`) (cherry picked from commit `c00d40b239`) Refs scylladb/scylladb#20909 Closes scylladb/scylladb#21334 * github.com:scylladb/scylladb: fix nodetool status to show zero-token nodes test: move `wait_for_first_completed` to pylib/util.py token_metadata: rename endpoint_to_host_id_map getter and add support for joining nodes	2024-10-29 10:50:35 +01:00
Abhinav	9082d66d8a	fix nodetool status to show zero-token nodes In the current scenario, the nodetool status doesn’t display information regarding zero token nodes. For example, if 5 nodes are spun by the administrator, out of which, 2 nodes are zero token nodes, then nodetool status only shows information regarding the 3 non-zero token nodes. This commit intends to fix this issue by leveraging the “/storage_service/host_id ” API and adding appropriate logic in scylla-nodetool.cc to support zero token nodes. Robust topology tests are added, which spins up scylla nodes and confirm nodetool status output for various cases, providing good coverage. A test is also added in nodetool/test_status.py to verify this logic. These tests fail without this commit’s zero token node support logic, hence verifying the behavior. The test `test_status_keyspace_joining_node` has been removed. This test is based on case where host_id=None, which is impossible. Since we now use host_id_map for node discovery in nodetool, the nodes with "host_id=None" go undetected. Since this case is anyway impossible, we can get rid of this. This PR fixes a bug. Hence we need to backport it. Backporting needs to be done only to 6.2 version, since earlier versions dont support zero token nodes. Fixes: scylladb/scylladb#19849 (cherry picked from commit `c00d40b239`)	2024-10-28 21:33:55 +00:00
Abhinav	c7a0876a73	test: move `wait_for_first_completed` to pylib/util.py This function is needed in a new test added in the next commit and this refactoring avoids code duplication. (cherry picked from commit `39dfd2d7ac`)	2024-10-28 21:33:55 +00:00
Abhinav	917d40e600	token_metadata: rename endpoint_to_host_id_map getter and add support for joining nodes Rename host_id map getter, 'get_endpoint_to_host_id_map_for_reading' to 'get_endpoint_to_host_id_map_' Also modify the getter to return information regarding joining nodes as well. This getter will later be used for retrieving the nodes in nodetool status, hence it needs to show all nodes, including joining ones. The function name suffix `_for_reading` suggests that the function was used in some other places in the past, and indeed if we need endpoints "for reading" then we cannot show joining endpoints. But it was confirmed that this function is currently only used by "/storage_service/host_id" endpoint, hence it can be modified as required. Fixes: scylladb/scylladb#17857 (cherry picked from commit `72f3c95a63`)	2024-10-28 21:33:54 +00:00
Aleksandra Martyniuk	1fd60424d9	tasks: fix sequence number assignment Currently, children of virtual tasks do not have sequence number assigned. Fix it. (cherry picked from commit `910a6fc032`)	2024-10-28 21:32:49 +00:00
Aleksandra Martyniuk	af6ddebc7f	tasks: fix abort source subscription of virtual task's child Currently, if a regular task does not have a parent or its parent is a virtual tasks then it subscribes to module's abort source in task_manager::task::impl constructor. However, at this point the kind of the task's parent isn't set. Due to that, children of virtual tasks aren't aborted on shutdown. Subscribe to module's abort source in task::impl::set_virtual_parent. (cherry picked from commit `1eb47b0bbf`)	2024-10-28 21:32:49 +00:00
Tomasz Grabiec	fa71b82da4	node-exporter: Disable hwmon collector This collector reads nvme temperature sensor, which was observed to cause bad performance on Azure cloud following the reading of the sensor for ~6 seconds. During the event, we can see elevated system time (up to 30%) and softirq time. CPU utilization is high, with nvm_queue_rq taking several orders of magnitude more time than normally. There are signs of contention, we can see __pv_queued_spin_lock_slowpath in the perf profile, called. This manifests as latency spikes and potentially also throughput drop due to reduced CPU capacity. By default, the monitoring stack queries it once every 60s. (cherry picked from commit `93777fa907`) Closes scylladb/scylladb#21304	2024-10-28 15:05:06 +01:00
Asias He	1a5a6a0758	test: Add test_node_ops_metrics.py It tests the node_ops_metrics_done metric reaches 100% when a node ops is done. Refs: #21174 (cherry picked from commit `9868ccbac0`)	2024-10-28 09:54:30 +00:00
Asias He	6ae5481de4	repair: Make the ranges more consistent in the log Consider the number of tables for the number of ranges logging. Make it more consistent with the log when the ops starts. (cherry picked from commit `1392a6068d`)	2024-10-28 09:54:30 +00:00
Asias He	0bc22db3a9	repair: Fix finished ranges metrics for removenode The skipped ranges should be multiplied by the number of tables. Otherwise the finished ranges ratio will not reach 100%. Fixes #21174 (cherry picked from commit `cffe3dc49f`)	2024-10-28 09:54:30 +00:00
Botond Dénes	b78675270e	streaming: stream-session: switch to tracking permit The stream-session is the receiving end of streaming, it reads the mutation fragment stream from an RPC stream and writes it onto the disk. As such, this part does no disk IO and therefore, using a permit with count resources is superfluous. Furthermore, after `d98708013c`, the count resources on this permit can cause a deadlock on the receiver end, via the `db::view::check_view_update_path()`, which wants to read the content of a system table and therefore has to obtain a permit of its own. Switch to a tracking-only permit, primarily to resolve the deadlock, but also because admission is not necessary for a read which does no IO. Refs: scylladb/scylladb#20885 (partial fix, solves only one of the deadlocks) Fixes: scylladb/scylladb#21264 (cherry picked from commit `dbb26da2aa`) Closes scylladb/scylladb#21303	2024-10-28 08:07:05 +02:00
Jenkins Promoter	ea6fe4bfa1	Update ScyllaDB version to: 6.2.1	2024-10-27 12:06:35 +02:00
Botond Dénes	30a2ed7488	Merge '[Backport 6.2] cql/tablets: fix retrying ALTER tablets KEYSPACE' from Marcin Maliszkiewicz ALTER tablets-enabled KEYSPACES (KS) may fail due to group0_concurrent_modification, in which case it's repeated by a for loop surrounding the code. But because raft's add_entry consumes the raft's guard (by std::move'ing the guard object), retries of ALTER KS will use a moved-from guard object, which is UB, potentially a crash. The fix is to remove the before mentioned for loop altogether and rethrow the exception, as the rf_change event will be repeated by the topology state machine if it receives the concurrent modification exception, because the event will remain present in the global requests queue, hence it's going to be executed as the very next event. Note: refactor is implemented in the follow-up commit. Fixes: https://github.com/scylladb/scylladb/issues/21102 Should be backported to every 6.x branch, as it may lead to a crash. (cherry picked from commit `de511f56ac`) (cherry picked from commit `3f4c8a30e3`) (cherry picked from commit `522bede8ec`) Refs https://github.com/scylladb/scylladb/pull/21121 Closes scylladb/scylladb#21256 * github.com:scylladb/scylladb: test: topology: add disable_schema_agreement_wait utility function test: add UT to test retrying ALTER tablets KEYSPACE cql/tablets: fix indentation in `rf_change` event handler cql/tablets: fix retrying ALTER tablets KEYSPACE	2024-10-25 10:57:36 +03:00
Botond Dénes	dcddb1ff4a	Merge '[Backport 6.2] multishard reader: make it safe to create with admitted permits' from ScyllaDB Passing an admitted permit -- i.e. one with count resources on it -- to the multishard reader, will possibly result in a deadlock, because the permit of the multishard reader is destroyed after the permits of its child readers. Therefore its semaphore resources won't be automatically released until children acquire their own resources. This creates a dependency (an edge in the "resource allocation graph"), where the semaphore used by the multishard reader depends on the semaphores used by children. When such dependencies create a cycle, and permits are acquired by different reads in just the right order, a deadlock will happen. Users of the multishard reader have to be aware of this gotcha -- and of course they aren't. This is small wonder, considering that not even the documentation on the multishard reader mentions this problem. To work around this, the user has to call `reader_permit::release_base_resources()` on the permit, before passing it to the multishard reader. On multiple occasions, developers (including the very author of the multishard reader), forgot or didn't know about this and this resulted in deadlocks down the line. This is a design-flaw of the multishard reader, which is addressed in this PR, after which, it is safe to pass admitted or not admitted permits to the multishard reader, it will handle the call to `release_base_resources()` if needed. After fixing the problem in the multishard reader, the existing calls to `release_base_resources()` on permits passed to multishard readers are removed. A test is added which reproduces the problem and ensures we don't regress. Refs: https://github.com/scylladb/scylladb/issues/20885 (partial fix, there is another deadlock in that issue, which this PR doesn't fix) Fixes: https://github.com/scylladb/scylladb/issues/21263 This fixes (indirectly) a regression introduced by `d98708013c` so it has to be backported to 6.2 (cherry picked from commit `e1d8cddd09`) Refs scylladb/scylladb#21058 Closes scylladb/scylladb#21178 * github.com:scylladb/scylladb: test/boost/mutation_test: add test for multishard permit safety test/lib/reader_lifecycle_policy: add semaphore factory to constructor test/lib/reader_lifecycle_policy: rename factory_function repair/row_level: drop now unneeded release_base_resource() calls readers/multishard: make multishard reader safe to create with admitted permits	2024-10-25 09:32:03 +03:00
Piotr Dulikowski	4ca0e31415	test/test_view_build_status: properly wait for v2 in migration test The test_view_build_status_migration_to_v2 test case creates a new view (vt2) after peforming the view_build_status -> view_build_status_v2 migration and waits until it is built by `wait_for_view_v2` function. It works by waiting until a SELECT from view_build_status_v2 will return the expected number of rows for a given view. However, if the host parameter is unspecified, it will query only one node on each attempt. Because `view_build_status_v2` is managed via raft, queries always return data from the queried node only. It might happen that `wait_for_view_v2` fetches expected results from one node while a different node might be lagging behind the group0 coordinator and might not have all data yet. In case of test_view_build_status_migration_to_v2 this is a problem - it first uses `wait_for_view_v2` to wait for view, later it queries `view_build_status_v2` on a random node and asserts its state - and might fail because that node didn't have the newest state yet. Fix the issue by issuing `wait_for_view_v2` in parallel for all nodes in the cluster and waiting until all nodes have the most recent state. Fixes: scylladb/scylladb#21060 (cherry picked from commit `a380a2efd9`) Closes scylladb/scylladb#21129	2024-10-24 16:42:53 +03:00
Raphael S. Carvalho	363bc7424e	locator: Always preserve balancing_enabled in tablet_metadata::copy() When there are zero tablets, tablet_metadata::_balancing_enabled is ignored in the copy. The property not being preserved can result in balancer not respecting user's wish to disable balancing when a replica is created later on. Fixes #21175. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> (cherry picked from commit `dfc217f99a`) Closes scylladb/scylladb#21190	2024-10-24 16:37:41 +03:00
Botond Dénes	a5b11a3189	test/boost/mutation_test: add test for multishard permit safety Add a test checking that the multishard reader will not deadlock, when created with an admitted permit, on a semaphore with a single count resource. (cherry picked from commit `e1d8cddd09`)	2024-10-24 09:18:11 -04:00
Botond Dénes	c0eba659f6	test/lib/reader_lifecycle_policy: add semaphore factory to constructor Allowing callers to specify how the semaphore is created and stopped, instead of doing so via boolean flags like it is done currently. This method doesn't scale, so use a factory instead. (cherry picked from commit `5a3fd69374`)	2024-10-24 09:18:11 -04:00
Botond Dénes	dbb1dc872d	test/lib/reader_lifecycle_policy: rename factory_function To reader_factor_function. We are about to add a new factory function parameters, so the current factory_function has to be renamed to something more specific. (cherry picked from commit `c8598e21e8`)	2024-10-24 09:18:11 -04:00
Botond Dénes	07b288b7d7	repair/row_level: drop now unneeded release_base_resource() calls The multishard reader now does this itself, no need to do it here. (cherry picked from commit `76a5ba2342`)	2024-10-24 09:18:11 -04:00
Botond Dénes	41a44ddc12	readers/multishard: make multishard reader safe to create with admitted permits Passing an admitted permit -- i.e. one with count resources on it -- to the multishard reader, will possibly result in a deadlock, because the permit of the multishard reader is destroyed after the permits of its child readers. Therefore its semaphore resources won't be automatically released until children acquire their own resources. This creates a dependency (an edge in the "resource allocation graph"), where the semaphore used by the multishard reader depends on the semaphores used by children. When such dependencies create a cycle, and permits are acquired by different reads in just the right order, a deadlock will happen. Users of the multishard reader have to be aware of this gotcha -- and of course they aren't. This is small wonder, considering that not even the documentation on the multishard reader mentions this problem. To work around this, the user has to call `reader_permit::release_base_resources()` on the permit, before passing it to the multishard reader. On multiple occasions, developers (including the very author of the multishard reader), forgot or didn't know about this and this resulted in deadlocks down the line. This is a design-flaw of the multishard reader, which is addressed in this patch, after which, it is safe to pass admitted or not admitted permits to the multishard reader, it will handle the call to `release_base_resources()` if needed. (cherry picked from commit `218ea449a5`)	2024-10-24 09:18:11 -04:00

1 2 3 4 5 ...

44646 Commits