scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-29 04:37:00 +00:00

Author	SHA1	Message	Date
Benny Halevy	ce0c6e7595	test/topology/util: CREATE KEYSPACE IF NOT EXISTS Workaround spurious keyspace creation errors due to retries caused by https://github.com/scylladb/python-driver/issues/317. This is safe since the function uses a unique_name for the keyspace so it should never exist by mistake. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `5d448f721e`)	2025-05-12 13:58:17 +03:00
Benny Halevy	6d88937dcd	test/topology/util: new_test_keyspace: accept ManagerClient Following patch will convert topology tests to use new_test_keyspace and friends. Some tests restart server and reset the driver connection so we cannot use the original cql Session for dropping the created keyspace in the `finally` block. Pass the ManagerClient instead to get a new cql session for dropping the keyspace. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `50ce0aaf1c`) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-05-12 13:58:16 +03:00
David Garcia	65d57819c9	docs: fix md redirections for multiversion support This change resolves an issue where selecting a version from the multiversion dropdown on Markdown pages (e.g. https://docs.scylladb.com/manual/stable/alternator/getting-started.html) incorrectly redirected users to the main page instead of the corresponding versioned page. The underlying cause was that the `multiversion` extension relies on `source_suffix` to identify available pages for URL mapping. Without this configuration, proper redirection fails for `.md` files. This fix should be backported to `2025.1` to ensure correct behavior. Otherwise, the fix will only take effect in future releases. Testing locally is non-trivial: clone the repository, apply the changes to each relevant branch, set `smv_remote_whitelist` to "", then run `make multiversionpreview`. Afterward, switch between versions in the dropdown to verify behavior. I've tested it locally, so the best next step is to merge and confirm that it works as expected in the live environment. Closes scylladb/scylladb#23957 (cherry picked from commit `4ba7182515`) Closes scylladb/scylladb#24010	2025-05-08 11:18:56 +03:00
Botond Dénes	c341668371	Merge '[Backport 2025.1] replica: skip flush of dropped table' from Scylladb[bot] Currently, flush throws no_such_column_family if a table is dropped. Skip the flush of dropped table instead. Fixes: #16095. Needs backport to 2025.1 and 6.2 as they contain the bug - (cherry picked from commit `91b57e79f3`) - (cherry picked from commit `c1618c7de5`) Parent PR: #23876 Closes scylladb/scylladb#23905 * github.com:scylladb/scylladb: test: test table drop during flush replica: skip flush of dropped table	2025-05-08 11:18:17 +03:00
Aleksandra Martyniuk	880f11fdba	streaming: skip dropped tables Currently, stream_session::prepare throws when a table in requests or summaries is dropped. However, we do not want to fail streaming if the table is dropped. Delete table checks from stream_session::prepare. Further streaming steps can handle the dropped table and finish the streaming successfully. Fixes: #15257. Closes scylladb/scylladb#23915 (cherry picked from commit `20c2d6210e`) Closes scylladb/scylladb#24052	2025-05-08 11:17:18 +03:00
Pavel Emelyanov	d19d02b543	sstable_directory: Print ks.cf when moving unshared remove sstables When an sstable is identified by sstable_directory as remote-unshared, it will at some point be moved to the target shard. When it happens a log-message appears: sstable_directory - Moving 1 unshared SSTables to shard 1 Processing of tables by sstable_directory often happens in parallel, and messages from sstable_directory are intermixed. Having a message like above is not very informative, as it tells nothing about sstables that are being moved. Equip the message with ks:cf pair to make it more informative. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#23912 (cherry picked from commit `d40d6801b0`) Closes scylladb/scylladb#24015	2025-05-08 11:14:40 +03:00
Anna Stuchlik	db7de7f434	doc: add a link to the previous Enterprise documentation This commit adds a link to the docs for previous Enterprise versions at https://enterprise.docs.scylladb.com/ to the left menu. As we still support versions 2024.1 and 2024.2, we need to ensure easier access to those docs sets. Fixes https://github.com/scylladb/scylladb/issues/23870 Closes scylladb/scylladb#23945 (cherry picked from commit `851a433663`) Closes scylladb/scylladb#24017	2025-05-08 10:58:05 +03:00
Botond Dénes	6201531b39	replica/database: memtable_list: save ref to memtable_table_shared_data This is passed by reference to the constructor, but a copy is saved into the _table_shared_data member. A reference to this member is passed down to all memtable readers. Because of the copy, the memtable readers save a reference to the memtable_list's member, which goes away together with the memtable_list when the storage_group is destroyed. This causes use-after-free when a storage group is destroyed while a memtable read is still ongoing. The memtable reader keeps the memtable alive, but its reference to the memtable_table_shared_data becomes stale. Fix by saving a reference in the memtable_list too, so memtable readers receive a reference pointing to the original replica::table member, which is stable accross tablet migrations and merges. The copy was introduced by `2a76065e3d`. There was a copy even before this commit, but in the previous vnode-only world this was fine -- there was one memtable_list per table and it was around until the table itself was. In the tablet world, this is no longer given, but the above commit didn't account for this. A test is included, which reproduces the use-after-free on memtable migration. The test is somewhat artificial in that the use-after-free would be prevented by holding on to an ERM, but this is done intentionaly to keep the test simple. Migration -- unlike merge where this use-after-free was originally observed -- is easy to trigger from unit tests. Fixes: #23762 Closes scylladb/scylladb#23984 (cherry picked from commit `0a9ca52cfd`) Closes scylladb/scylladb#24033	2025-05-07 19:21:03 +03:00
Piotr Dulikowski	7591e131a2	test: mv: skip test_view_building_scheduling_group in debug The test populates a table with 50k rows, creates a view on that table and then compares the time spent in streaming vs. gossip scheduling groups. It only takes 10s in dev mode on my machine, but is much slower in debug mode in CI - building the view doesn't finish within 2 minutes. The bigger the view to build, the more accurrate the measurement; moreover, the test scenario isn't interesting enough to be worth running it in debug mode as this should be covered by other tests. Therefore, just skip this test in debug mode. Fixes: scylladb/scylladb#23862 Closes scylladb/scylladb#23866 (cherry picked from commit `3d73c79a72`) Closes scylladb/scylladb#23881	2025-05-06 10:16:57 +02:00
Aleksandra Martyniuk	bfdf7c944b	test: test table drop during flush (cherry picked from commit `c1618c7de5`)	2025-05-06 09:52:42 +02:00
Aleksandra Martyniuk	238cf27471	replica: skip flush of dropped table (cherry picked from commit `91b57e79f3`)	2025-05-06 09:52:30 +02:00
Benny Halevy	fc21a0f8a1	loading_cache_test: test_loading_cache_reload_during_eviction: use manual_clock Rather than lowres_clock, as since `32b7cab917`, loading_cache_for_test uses manual_clock for timing and relying on lowres_clock to time the test might run out of memory on fast test machines. Fixes #23497 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#23498 (cherry picked from commit `5f2ce0b022`) Closes scylladb/scylladb#23983	2025-05-01 14:07:25 +03:00
Piotr Dulikowski	b77db75280	utils::loading_cache: gracefully skip timer if gate closed The loading_cache has a periodic timer which acquires the _timer_reads_gate. The stop() method first closes the gate and then cancels the timer - this order is necessary because the timer is re-armed under the gate. However, the timer callback does not check whether the gate was closed but tries to acquire it, which might result in unhandled exception which is logged with ERROR severity. Fix the timer callback by acquiring access to the gate at the beginning and gracefully returning if the gate is closed. Even though the gate used to be entered in the middle of the callback, it does not make sense to execute the timer's logic at all if the cache is being stopped. Fixes: scylladb/scylladb#23951 Closes scylladb/scylladb#23952 (cherry picked from commit `8ffe4b0308`) Closes scylladb/scylladb#23981	2025-05-01 08:28:46 +03:00
Aleksandra Martyniuk	aaaeb6dcee	test_tablet_tasks: use injection to revoke resize Currently, test_tablet_resize_revoked tries to trigger split revoke by deleting some rows. This method isn't deterministic and so a test is flaky. Use error injection to trigger resize revoke. Fixes: #22570. Closes scylladb/scylladb#23966 (cherry picked from commit `1f4edd8683`) Closes scylladb/scylladb#23974	2025-05-01 08:28:09 +03:00
Botond Dénes	f76cf6118a	Merge '[Backport 2025.1] topology coordinator: do not proceed further on invalid boostrap tokens' from Scylladb[bot] In case when dht::boot_strapper::get_boostrap_tokens fail to parse the tokens, the topology coordinator handles the exception and schedules a rollback. However, the current code tries to continue with the topology coordinator logic even if an exception occurs, leaving boostrap_tokens empty. This does not make sense and can actually cause issues, specifically in prepare_and_broadcast_cdc_generation_data which implicitly expect that the bootstrap_tokens of the first node in the cluster will not be empty. Fix this by adding the missing break. Fixes: scylladb/scylladb#23897 From the code inspection alone it looks like 2025.1 and 6.2 have this problem, so marking for backport to both of them. - (cherry picked from commit `66acaa1bf8`) - (cherry picked from commit `845cedea7f`) - (cherry picked from commit `670a69007e`) Parent PR: #23914 Closes scylladb/scylladb#23949 * github.com:scylladb/scylladb: test: cluster: add test_bad_initial_token topology coordinator: do not proceed further on invalid boostrap tokens cdc: add sanity check for generating an empty generation	2025-05-01 08:23:06 +03:00
Botond Dénes	b8a8f288fb	Merge '[Backport 2025.1] tasks: check whether a node is alive before rpc' from Scylladb[bot] Check whether a node is alive before making an rpc that gathers children infos from the whole cluster in virtual_task::impl::get_children. Fixes: https://github.com/scylladb/scylladb/issues/22514. Needs backport to 2025.1 and 6.2 as they contain the bug. - (cherry picked from commit `53e0f79947`) - (cherry picked from commit `e178bd7847`) Parent PR: #23787 Closes scylladb/scylladb#23943 * github.com:scylladb/scylladb: test: add test for getting tasks children tasks: check whether a node is alive before rpc	2025-05-01 08:21:49 +03:00
Botond Dénes	503748b1ae	Merge '[Backport 2025.1] topology_coordinator: stop: await all background_action_holder:s' from Scylladb[bot] Add missing awaits for the rebuild_repair and repair background actions. Although the background actions hold the _async_gate which is closed in topology_coordinator::run(), stop() still needs to await all background action futures and handle any errors they may have left behind. Fixes #23755 * The issue exists since 6.2 - (cherry picked from commit `d624795fda`) - (cherry picked from commit `6de79d0dd3`) - (cherry picked from commit `7a0f5e0a54`) Parent PR: #17712 Closes scylladb/scylladb#23799 * github.com:scylladb/scylladb: topology_coordinator: stop: await all background_action_holder:s topology_coordinator: stop: improve error messages topology_coordinator: stop: define stop_background_action helper	2025-05-01 08:20:03 +03:00
Jenkins Promoter	15c88b6453	Update pgo profiles - aarch64	2025-05-01 04:20:37 +03:00
Jenkins Promoter	0bdd7bd755	Update pgo profiles - x86_64	2025-05-01 04:06:29 +03:00
Aleksandra Martyniuk	6d9bf63e06	test: add test for getting tasks children Add test that checks whether the children of a virtual task will be properly gathered if a node is down. (cherry picked from commit `e178bd7847`)	2025-04-30 10:25:09 +02:00
Aleksandra Martyniuk	9385dc1d4e	tasks: check whether a node is alive before rpc Check whether a node is alive before making an rpc that gathers children infos from the whole cluster in virtual_task::impl::get_children. (cherry picked from commit `53e0f79947`)	2025-04-30 10:14:58 +02:00
Patryk Jędrzejczak	bf688890ad	Merge '[Backport 2025.1] Ensure raft group0 RPCs use the gossip scheduling group.' from Scylladb[bot] Scylla operations use concurrency semaphores to limit the number of concurrent operations and prevent resource exhaustion. The semaphore is selected based on the current scheduling group. For RAFT group operations, it is essential to use a system semaphore to avoid queuing behind user operations. This patch ensures that RAFT operations use the `gossip` scheduling group to leverage the system semaphore. Fixes scylladb/scylladb#21637 Backport: 6.2 and 6.1 - (cherry picked from commit `60f1053087`) - (cherry picked from commit `e05c082002`) Parent PR: #22779 Closes scylladb/scylladb#23844 * https://github.com/scylladb/scylladb: Ensure raft group0 RPCs use the gossip scheduling group Move RAFT operations verbs to GOSSIP group.	2025-04-29 16:02:47 +02:00
Piotr Dulikowski	791b4a9fe0	test: cluster: add test_bad_initial_token Adds a test which checks that rollback works properly in case when a bad value of the initial_token function is provided. (cherry picked from commit `670a69007e`)	2025-04-28 17:07:43 +00:00
Piotr Dulikowski	d25dd24a7e	topology coordinator: do not proceed further on invalid boostrap tokens In case when dht::boot_strapper::get_boostrap_tokens fail to parse the tokens, the topology coordinator handles the exception and schedules a rollback. However, the current code tries to continue with the topology coordinator logic even if an exception occurs, leaving boostrap_tokens empty. This does not make sense and can actually cause issues, specifically in prepare_and_broadcast_cdc_generation_data which implicitly expect that the bootstrap_tokens of the first node in the cluster will not be empty. Fix this by adding the missing break. Fixes: scylladb/scylladb#23897 (cherry picked from commit `845cedea7f`)	2025-04-28 17:07:43 +00:00
Piotr Dulikowski	d8380e5d5e	cdc: add sanity check for generating an empty generation It doesn't make sense to create an empty CDC generation because it does not make sense to have a cluster with no tokens. Add a sanity check to cdc::make_new_generation_description which fails if somebody attempts to do that (i.e. when the set of current tokens + optionally bootstrapping node's tokens is empty). The function does not work correctly if it is misused, as we saw in scylladb/scylladb#23897. While the function should not be misused in the first place, it's better to throw an exception rather than crash - especially that this crash could happen on the topology coordinator. (cherry picked from commit `66acaa1bf8`)	2025-04-28 17:07:43 +00:00
Jenkins Promoter	964468d901	Update ScyllaDB version to: 2025.1.3	2025-04-28 16:10:06 +03:00
Tomasz Grabiec	962fa5a369	Merge '[Backport 2025.1] tablets: Equalize per-table balance when allocating tablets for a new table' from Scylladb[bot] Fixes the following scenario: 1. Scale out adds new nodes to each rack 2. Table is created - all tablets are allocated to new nodes because they have low load 3. Rebalancing moves tablets from old nodes to new nodes - table balance for the new table is not fixed We're wrong to try to equalize global load when allocating tablets, and we should equalize per-table load instead, and let background load balancing fix it in a fair way. It will add to the allocated storage imbalance, but: 1. The table is initially empty, so doesn't impact actual storage imbalance. 2. It's more important to avoid overloading CPU on the nodes - imbalance hurts this aspect immediately. 3. If the table was created before imbalance was formed, we would end up in the same situation as in the problematic scenario after the patch. 4. It's the job of the load balancing to keep up with storage growing, and if it's not, scale out should kick in. Before we have CPU-aware tablet allocation, and thus can prove we have CPU capacity on the small nodes, we should respect per-table balance as this is the way in which we achieve full CPU utilization. Fixes #23631 Backport to 2025.1 because load imbalance is a serious problem in production. - (cherry picked from commit `d493a8d736`) - (cherry picked from commit `2597a7e980`) - (cherry picked from commit `1e407ab4d2`) Parent PR: #23708 Closes scylladb/scylladb#23873 * github.com:scylladb/scylladb: tablets: Equalize per-table balance when allocating tablets for a new table load_sketch: Tolerate missing tablet_map when selecting for a given table tests: tablets: Simplify tests by moving common code to topology_builder	2025-04-28 13:23:25 +02:00
Tomasz Grabiec	885924067f	Merge '[Backport 2025.1] Simplify loading_cache_test and use manual_clock' from Scylladb[bot] This series exposes a Clock template parameter for loading_cache so that the test could use the manual_clock rather than the lowres_clock, since relying on the latter is flaky. In addition, the test load function is simplified to sleep some small random time and co_return the expected string, rather than reading it from a real file, since the latter's timing might also be flaky, and it out-of-scope for this test. Fixes #20322 * The test was flaky forever, so backport is required for all live versions. - (cherry picked from commit `b509644972`) - (cherry picked from commit `934a9d3fd6`) - (cherry picked from commit `d68829243f`) - (cherry picked from commit `b258f8cc69`) - (cherry picked from commit `0841483d68`) - (cherry picked from commit `32b7cab917`) Parent PR: #22064 Closes scylladb/scylladb#23926 * github.com:scylladb/scylladb: tests: loading_cache_test: use manual_clock utils: loading_cache: make clock_type a template parameter test: loading_cache_test: use function-scope loader test: loading_cache_test: simlute loader using sleep test: lib: eventually: add sleep function param test: lib: eventually: make *EVENTUALLY_EQUAL inline functions	2025-04-28 10:55:14 +02:00
Benny Halevy	bf3b09313c	tests: loading_cache_test: use manual_clock Relying on a real-time clock like lowres_clock can be flaky (in particular in debug mode). Use manual_clock instead to harden the test against timing issues. Fixes #20322 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `32b7cab917`)	2025-04-27 08:49:05 +00:00
Benny Halevy	49bc81415b	utils: loading_cache: make clock_type a template parameter So the unit test can use manual_clock rather than lowres_clock which can be flaky (in particular in debug mode). Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `0841483d68`)	2025-04-27 08:49:05 +00:00
Benny Halevy	325cdcbd58	test: loading_cache_test: use function-scope loader Rather than a global function, accessing a thread-local `load_count`. The thread-local load_count cannot be used when multiple test cases run in parallel. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `b258f8cc69`)	2025-04-27 08:49:04 +00:00
Benny Halevy	a4584e205c	test: loading_cache_test: simlute loader using sleep This test isn't about reading values from file, but rather it's about the loading_cache. Reading from the file can sometimes take longer than the expected refresh times, causing flakiness (see #20322). Rather than reading a string from a real file, just sleep a random, short time, and co_return the string. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `d68829243f`)	2025-04-27 08:49:04 +00:00
Benny Halevy	52c82527f9	test: lib: eventually: add sleep function param To allow support for manual_clock instead of seastar::sleep. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `934a9d3fd6`)	2025-04-27 08:49:04 +00:00
Benny Halevy	3792810030	test: lib: eventually: make *EVENTUALLY_EQUAL inline functions rather then macros. This is a first cleanup step before adding a sleep function parameter to support also manual_clock. Also, add a call to BOOST_REQUIRE_EQUAL/BOOST_CHECK_EQUAL, respectively, to make an error more visible in the test log since those entry points print the offending values when not equal. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `b509644972`)	2025-04-27 08:49:04 +00:00
Tomasz Grabiec	ba3d53be55	tablets: Equalize per-table balance when allocating tablets for a new table Fixes the following scenario: 1. Scale out adds new nodes to each rack 2. Table is created - all tablets are allocated to new nodes because they have low load 3. Rebalancing moves tablets from old nodes to new nodes - table balance for the new table is not fixed We're wrong to try to equalize global load when allocating tablets, and we should equalize per-table load instead, and let background load balancing fix it in a fair way. It will add to the allocated storage imbalance, but: 1. The table is initially empty, so doesn't impact actual storage imbalance. 2. It's more important to avoid overloading CPU on the nodes - imbalance hurts this aspect immediately. 3. If the table was created before imbalance was formed, we would end up in the same situation in the problematic scenario after the patch. 4. It's the job of the load balancing to keep up with storage growing, and if it's not, scale out should kick in. Before we have CPU-aware tablet allocation, and thus can prove we have CPU capacity on the small nodes, we should respect per-table balance as this is the way in which we achieve full CPU utilization. Fixes #23631 (cherry picked from commit `1e407ab4d2`)	2025-04-25 18:29:36 +02:00
Tomasz Grabiec	e73954da80	load_sketch: Tolerate missing tablet_map when selecting for a given table To simplify future usage in network_topology_strategy::add_tablets_in_dc() which invokes populate() for a given table, which may be both new and preexisitng. (cherry picked from commit `2597a7e980`)	2025-04-25 18:29:36 +02:00
Tomasz Grabiec	93dab31007	tests: tablets: Simplify tests by moving common code to topology_builder Reduces code duplication. (cherry picked from commit `d493a8d736`)	2025-04-25 18:29:36 +02:00
Benny Halevy	923944bf21	test_tablets_cql: test_alter_dropped_tablets_keyspace: extend expected error The query may fail also on a no_such_keyspace exception, which generates the following cql error: ``` Error from server: code=2200 [Invalid query] message="Can\'t find a keyspace test_1745198244144_qoohq" ``` Extend the pytest.raises match expression to include this error as well. Fixes #23812 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#23875 (cherry picked from commit `f279625f59`) Closes scylladb/scylladb#23887	2025-04-25 11:34:27 +02:00
Michael Litvak	f71eff37bb	test: test_mv_topology_change: increase timeout for remove_node The test `test_mv_write_to_dead_node` currently uses a timeout of 60 seconds for remove_node, after it was increased from 30 seconds to fix scylladb/scylladb#22953. Apparently it is still too low, and it was observed to fail in debug mode. Normally remove_node uses a default timeout of TOPOLOGY_TIMEOUT = 1000 seconds, but the test requires a timeout which is shorter than 5 minutes, because it is a regression test for an issue where MV updates hold topology changes for more than 5 minutes, and we want to verify in the test that the topology change completes in less than 5 minutes. To resolve the issue, we set the test to skip in debug mode, because the remove node operation is unpredictably slow, and we increase the timeout to 180 seconds which is hopefully enough time for remove_node in non-debug modes, and still sufficient to satisfy the test requirements. Fixes scylladb/scylladb#22530 Closes scylladb/scylladb#23833 (cherry picked from commit `5c1d24f983`) Closes scylladb/scylladb#23874	2025-04-24 17:43:16 +02:00
Sergey Zolotukhin	d34061818c	Ensure raft group0 RPCs use the gossip scheduling group Scylla operations use concurrency semaphores to limit the number of concurrent operations and prevent resource exhaustion. The semaphore is selected based on the current scheduling group. For Raft group operations, it is essential to use a system semaphore to avoid queuing behind user operations. This commit adds a check to ensure that the raft group0 RPCs are executed with the `gossiper` scheduling group. (cherry picked from commit `e05c082002`)	2025-04-22 07:59:01 +00:00
Sergey Zolotukhin	b0fe705c80	Move RAFT operations verbs to GOSSIP group. In order for RAFT operations to use the gossip system semaphore, moving RAFT verbs to the gossip group in `do_get_rpc_client_idx`, messaging_service. Fixes scylladb/scylladb21637 (cherry picked from commit `60f1053087`)	2025-04-22 07:59:01 +00:00
Yaron Kaikov	502c62d91d	install-dependencies.sh: update node_exporter to 1.9.0 Update node_exporter to 1.9.0 to resolve the following CVE's https://github.com/advisories/GHSA-49gw-vxvf-fc2g https://github.com/advisories/GHSA-8xfx-rj4p-23jm https://github.com/advisories/GHSA-crqm-pwhx-j97f https://github.com/advisories/GHSA-j7vj-rw65-4v26 Fixes: https://github.com/scylladb/scylladb/issues/22884 regenerate frozen toolchain with optimized clang from * https://devpkg.scylladb.com/clang/clang-18.1.8-Fedora-40-aarch64.tar.gz * https://devpkg.scylladb.com/clang/clang-18.1.8-Fedora-40-x86_64.tar.gz Closes scylladb/scylladb#22987 (cherry picked from commit `e6227f9a25`) Closes scylladb/scylladb#23021 scylla-2025.1.2-candidate-20250423025213 scylla-2025.1.2	2025-04-21 23:12:28 +03:00
Avi Kivity	8ea6f5824a	Update seastar submodule * seastar 6d8fccf14c...ed31c1ce82 (1): > Merge 'Share IO queues between mountpoints' from Pavel Emelyanov Changes in io-queue call for scylla-gdb update as well -- now the reactor map of device to io-queue uses seastar::shared_ptr, not std::unique_ptr. Closes scylladb/scylladb#23733 Ref `70ac5828a8` Fixes #23820	2025-04-20 13:31:17 +03:00
Benny Halevy	e9b57a149d	topology_coordinator: stop: await all background_action_holder:s Add missing awaits for the rebuild_repair and repair background actions. Although the background actions hold the _async_gate which is closed in topology_coordinator::run(), stop() still needs to await all background action futures and handle any errors they may have left behind. Fixes #23755 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `7a0f5e0a54`)	2025-04-20 07:46:23 +00:00
Benny Halevy	7bdb93aa1c	topology_coordinator: stop: improve error messages "when cleanup" is ill-formed. Use "when XYZ" to "during XYZ" instead. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `6de79d0dd3`)	2025-04-20 07:46:22 +00:00
Benny Halevy	985492018b	topology_coordinator: stop: define stop_background_action helper Refactor the code to use a helper to await background_action_holder and handle any errors by printing a warning. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `d624795fda`)	2025-04-20 07:46:22 +00:00
Avi Kivity	6a8b033510	Merge '[Backport 2025.1] managed_bytes: in the copy constructor, respect the target preferred allocation size' from Scylladb[bot] Commit `14bf09f447` added a single-chunk layout to `managed_bytes`, which makes the overhead of `managed_bytes` smaller in the common case of a small buffer. But there was a bug in it. In the copy constructor of `managed_bytes`, a copy of a single-chunk `managed_bytes` is made single-chunk too. But this is wrong, because the source of the copy and the target of the copy might have different preferred max contiguous allocation sizes. In particular, if a `managed_bytes` of size between 13 kiB and 128 kiB is copied from the standard allocator into LSA, the resulting `managed_bytes` is a single chunk which violates LSA's preferred allocation size. (And therefore is placed by LSA in the standard allocator). In other words, since Scylla 6.0, cache and memtable cells between 13 kiB and 128 kiB are getting allocated in the standard allocator rather than inside LSA segments. Consequences of the bug: 1. Effective memory consumption of an affected cell is rounded up to the nearest power of 2. 2. With a pathological-enough allocation pattern (for example, one which somehow ends up placing a single 16 kiB memtable-owned allocation in every aligned 128 kiB span), memtable flushing could theoretically deadlock, because the allocator might be too fragmented to let the memtable grow by another 128 kiB segment, while keeping the sum of all allocations small enough to avoid triggering a flush. (Such an allocation pattern probably wouldn't happen in practice though). 3. It triggers a bug in reclaim which results in spurious allocation failures despite ample evictable memory. There is a path in the reclaimer procedure where we check whether reclamation succeeded by checking that the number of free LSA segments grew. But in the presence of evictable non-LSA allocations, this is wrong because the reclaim might have met its target by evicting the non-LSA allocations, in which case memory is returned directly to the standard allocator, rather than to the pool of free segments. If that happens, the reclaimer wrongly returns `reclaimed_nothing` to Seastar, which fails the allocation. Refs (possibly fixes) https://github.com/scylladb/scylladb/issues/21072 Fixes https://github.com/scylladb/scylladb/issues/22941 Fixes https://github.com/scylladb/scylladb/issues/22389 Fixes https://github.com/scylladb/scylladb/issues/23781 This is a regression fix, should be backported to all affected releases. - (cherry picked from commit `4e2f62143b`) - (cherry picked from commit `6c1889f65c`) Parent PR: #23782 Closes scylladb/scylladb#23810 * github.com:scylladb/scylladb: managed_bytes_test: add a reproducer for #23781 managed_bytes: in the copy constructor, respect the target preferred allocation size	2025-04-19 18:42:45 +03:00
Botond Dénes	e19ab12f5e	Merge '[Backport 2025.1] service/storage_proxy: schedule_repair(): materialize the range into a vector' from Scylladb[bot] Said method passes down its diff input to mutate_internal(), after some std::ranges massaging. Said massaging is destructive -- it moves items from the diff. If the output range is iterated-over multiple times, only the first time will see the actual output, further iterations will get an empty range. When trace-level logging is enabled, this is exactly what happens: mutate_internal() iterates over the range multiple times, first to log its content, then to pass it down the stack. This ends up resulting in an empty range being pased down and write handlers being created with nullopt optionals. Fixes: scylladb/scylladb#21907 Fixes: scylladb/scylladb#21714 A follow-up stability fix for the test is also included. Fixes: https://github.com/scylladb/scylladb/issues/23513 Fixes: https://github.com/scylladb/scylladb/issues/23512 Based on code-inspection, all versions are vulnerable, although <=6.2 use boost::ranges, not std::ranges. - (cherry picked from commit `7150442f6a`) Parent PR: #21910 Closes scylladb/scylladb#23791 * github.com:scylladb/scylladb: test/cluster/test_read_repair.py: increase read request timeout service/storage_proxy: schedule_repair(): materialize the range into a vector	2025-04-18 14:04:23 +03:00
Anna Stuchlik	f8615b8c53	doc: add info about Scylla Doctor Automation to the docs Fixes https://github.com/scylladb/scylladb/issues/23642 Closes scylladb/scylladb#23745 (cherry picked from commit `0b4740f3d7`) Closes scylladb/scylladb#23776	2025-04-18 14:03:54 +03:00
Botond Dénes	34ea9af232	Merge '[Backport 2025.1] tablets: rebuild: use repair for tablet rebuild' from Scylladb[bot] Currently, when we rebuild a tablet, we stream data from all replicas. This creates a lot of redundancy, wastes bandwidth and CPU resources. In this series, we split the streaming stage of tablet rebuild into two phases: first we stream tablet's data from only one replica and then repair the tablet. Fixes: https://github.com/scylladb/scylladb/issues/17174. Needs backport to 2025.1 to prevent out of space during streaming - (cherry picked from commit `b80e957a40`) - (cherry picked from commit `ed7b8bb787`) - (cherry picked from commit `5d6041617b`) - (cherry picked from commit `4a847df55c`) - (cherry picked from commit `eb17af6143`) - (cherry picked from commit `acd32b24d3`) - (cherry picked from commit `372b562f5e`) Parent PR: #23187 Closes scylladb/scylladb#23682 * github.com:scylladb/scylladb: test: add test for rebuild with repair locator: service: move to rebuild_v2 transition if cluster is upgraded locator: service: add transition to rebuild_repair stage for rebuild_v2 locator: service: add rebuild_repair tablet transition stage locator: add maybe_get_primary_replica locator: service: add rebuild_v2 tablet transition kind gms: add REPAIR_BASED_TABLET_REBUILD cluster feature	2025-04-18 14:03:25 +03:00

1 2 3 4 5 ...

46623 Commits