scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 20:27:03 +00:00

Author	SHA1	Message	Date
Michael Litvak	338ff18dfe	cdc: throw error if column doesn't exist In the CDC log transformer, when creating a CDC mutation based on some base table mutation, for each value of a base column we set the value in the CDC column with the same name. When looking up the column in the CDC schema by name, we may get a null pointer if a column by that name is not found. This shouldn't happen normally because the base schema and CDC schema should be compatible, and for each base column there should be a CDC column with the same name. However, there are scenarios where the base schema and CDC schema are incompatible for a short period of time while they are being altered. When a base column is being added or dropped, we could get a base mutation with this column set, and then the CDC transformer picks up the latest CDC schema which doesn't have this column. If this happens, we now throw an exception instead of crashing on a null pointer dereference. Currently we don't have a safer approach to handle this, but this might be changed in the future. The other alternative is dropping that data silently, which we prefer not to do. Throwing an error is acceptable because this scenario most likely indicates one of these behaviors by the user: * The user adds a new column, and starts writing values to the column before the ALTER is complete. or, * The user drops a column, and continues writing values to the column while it's being dropped. Both cases might as well fail with an error because the column is not found in the base table. Fixes scylladb/scylladb#24952 (cherry picked from commit `b336f282ae`)	2025-07-18 10:36:44 +00:00
Piotr Dulikowski	e49b312be9	auth: fix crash when migration code runs parallel with raft upgrade The functions password_authenticator::start and standard_role_manager::start have a similar structure: they spawn a fiber which invokes a callback that performs some migration until that migration succeeds. Both handlers set a shared promise called _superuser_created_promise (those are actually two promises, one for the password authenticator and the other for the role manager). The handlers are similar in both cases. They check if auth is in legacy mode, and behave differently depending on that. If in legacy mode, the promise is set (if it was not set before), and some legacy migration actions follow. In auth-on-raft mode, an attempt is made to create the superuser, and if it succeeds then the promise is _unconditionally_ set. While it makes sense at a glance to set the promise unconditionally, there is a non-obvious corner case during upgrade to topology on raft. During the upgrade, auth switches from the legacy mode to auth-on-raft mode. Thus, if the callback didn't succeed in legacy mode and then tries to run in auth-on-raft mode and succeeds, it will unconditionally set a promise that was already set - this is a bug and triggers an assertion in seastar. Fix the issue by surrounding the `shared_promise::set_value` call with an `if` - like it is already done for the legacy case. Fixes: scylladb/scylladb#24975 Closes scylladb/scylladb#24976 (cherry picked from commit `a14b7f71fe`) Closes scylladb/scylladb#25019	2025-07-17 13:32:35 +02:00
Patryk Jędrzejczak	7caacf958b	test: test_zero_token_nodes_multidc: properly handle reads with CL=ONE The test could fail with RF={DC1: 2, DC2: 0} and CL=ONE when: - both writes succeeded with the same replica responding first, - one of the following reads succeeded with the other replica responding before it applied mutations from any of the writes. We fix the test by not expecting reads with CL=ONE to return a row. We also harden the test by inserting different rows for every pair (CL, coordinator), where one of the two coordinators is a normal node from DC1, and the other one is a zero-token node from DC2. This change makes sure that, for example, every write really inserts a row. Fixes scylladb/scylladb#22967 The fix addresses CI flakiness and only changes the test, so it should be backported. Closes scylladb/scylladb#23518 (cherry picked from commit `21edec1ace`) Closes scylladb/scylladb#24985	2025-07-15 15:47:43 +02:00
Botond Dénes	489e4fdb4e	Merge '[Backport 2025.3] S3 chunked download source bug fixes' from Scylladb[bot] - Fix missing negation in the `if` in the background downloading fiber - Add test to catch this case - Improve the s3 proxy to inject errors if the same resource is requested more than once - Suppress client retry since retrying the same request when each produces multiple buffers may lead to the same data appearing more than once in the buffer deque - Inject exception from the test to simulate response callback failure in the middle No need to backport anything since this class is not used yet - (cherry picked from commit `f1d0690194`) - (cherry picked from commit `e73b83e039`) - (cherry picked from commit `6d9cec558a`) - (cherry picked from commit `ec59fcd5e4`) - (cherry picked from commit `c75acd274c`) - (cherry picked from commit `d2d69cbc8c`) - (cherry picked from commit `e50f247bf1`) - (cherry picked from commit `49e8c14a86`) - (cherry picked from commit `a5246bbe53`) - (cherry picked from commit `acf15eba8e`) Parent PR: #24657 Closes scylladb/scylladb#24943 * github.com:scylladb/scylladb: s3_test: Add s3_client test for non-retryable error handling s3_test: Add trace logging for default_retry_strategy s3_client: Fix edge case when the range is exhausted s3_client: Fix indentation in try..catch block s3_client: Stop retries in chunked download source s3_client: Enhance test coverage for retry logic s3_client: Add test for Content-Range fix s3_client: Fix missing negation s3_client: Refine logging s3_client: Improve logging placement for current_range output	2025-07-15 15:28:48 +03:00
Michael Litvak	26738588db	tablets: stop storage group on deallocation When a tablet transitions to a post-cleanup stage on the leaving replica we deallocate its storage group. Before the storage can be deallocated and destroyed, we must make sure it's cleaned up and stopped properly. Normally this happens during the tablet cleanup stage, when table::cleanup_table is called, so by the time we transition to the next stage the storage group is already stopped. However, it's possible that tablet cleanup did not run in some scenario: 1. The topology coordinator runs tablet cleanup on the leaving replica. 2. The leaving replica is restarted. 3. When the leaving replica starts, still in `cleanup` stage, it allocates a storage group for the tablet. 4. The topology coordinator moves to the next stage. 5. The leaving replica deallocates the storage group, but it was not stopped. To address this scenario, we always stop the storage group when deallocating it. Usually it will be already stopped and complete immediately, and otherwise it will be stopped in the background. Fixes scylladb/scylladb#24857 Fixes scylladb/scylladb#24828 Closes scylladb/scylladb#24896 (cherry picked from commit `fa24fd7cc3`) Closes scylladb/scylladb#24909	2025-07-15 13:14:35 +03:00
Aleksandra Martyniuk	f69f59afbd	repair: Reduce max row buf size when small table optimization is on If small_table_optimization is on, a repair works on a whole table simultaneously. It may be distributed across the whole cluster and all nodes might participate in repair. On a repair master, row buffer is copied for each repair peer. This means that the memory scales with the number of peers. In large clusters, repair with small_table_optimization leads to OOM. Divide the max_row_buf_size by the number of repair peers if small_table_optimization is on. Use max_row_buf_size to calculate number of units taken from mem_sem. Fixes: https://github.com/scylladb/scylladb/issues/22244. Closes scylladb/scylladb#24868 (cherry picked from commit `17272c2f3b`) Closes scylladb/scylladb#24907	2025-07-15 13:13:49 +03:00
Łukasz Paszkowski	e1e0c721e7	test.py: Fix test_compactionhistory_rows_merged_time_window_compaction_strategy The test has two major problems 1. Wrongly computed time windows. Data was not spread across two 1-minute windows causing the test to generate even three sstables instead of two 2. Timestamp was not propagated to the prepared CQL statements. So in fact, a current time was used implicitly 3. Because of the incorrect timestamp issue, the remaining tests testing purged tombstones were affected as well. Fixes https://github.com/scylladb/scylladb/issues/24532 Closes scylladb/scylladb#24609 (cherry picked from commit `a22d1034af`) Closes scylladb/scylladb#24791	2025-07-15 13:12:39 +03:00
Yaron Kaikov	05a6d4da23	dist/common/scripts/scylla_sysconfig_setup: fix `SyntaxWarning: invalid escape sequence` There are invalid escape sequence warnings where raw strings should be used for the regex patterns Fixes: https://github.com/scylladb/scylladb/issues/24915 Closes scylladb/scylladb#24916 (cherry picked from commit `fdcaa9a7e7`) Closes scylladb/scylladb#24970	2025-07-15 11:01:28 +02:00
Yaron Kaikov	1e1aeed3cd	auto-backport.py: Avoid bot push to existing backport branches Changed the backport logic so that the bot only pushes the backport branch if it does not already exist in the remote fork. If the branch exists, the bot skips the push, allowing only users to update (force-push) the branch after the backport PR is open. Fixes: https://github.com/scylladb/scylladb/issues/24953 Closes scylladb/scylladb#24954 (cherry picked from commit `ed7c7784e4`) Closes scylladb/scylladb#24969	2025-07-15 10:25:30 +02:00
Jenkins Promoter	af10d6f03b	Update pgo profiles - aarch64	2025-07-15 05:21:25 +03:00
Jenkins Promoter	0d3742227d	Update pgo profiles - x86_64	2025-07-15 04:58:36 +03:00
Yaron Kaikov	c6987e3fed	packaging: add `ps` command to dependencies The ScyllaDB container image doesn't have the ps command installed, while this command is used by the perftune.py script shipped within the same image. This breaks node and container tuning in Scylla Operator. Fixes: #24827 Closes scylladb/scylladb#24830 (cherry picked from commit `66ff6ab6f9`) Closes scylladb/scylladb#24956	2025-07-14 14:19:17 +03:00
Ernest Zaslavsky	873c8503cd	s3_test: Add s3_client test for non-retryable error handling Introduce a test that injects a non-retryable error and verifies that the chunked download source throws an exception as expected. (cherry picked from commit `acf15eba8e`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	dbf4bd162e	s3_test: Add trace logging for default_retry_strategy Introduce trace-level logging for `default_retry_strategy` in `s3_test` to improve visibility into retry logic during test execution. (cherry picked from commit `a5246bbe53`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	7f303bfda3	s3_client: Fix edge case when the range is exhausted Handle case where the download loop exits after consuming all data, but before receiving an empty buffer signaling EOF. Without this, the next request is sent with a non-zero offset and zero length, resulting in "Range request cannot be satisfied" errors. Now, an empty buffer is pushed to indicate completion and exit the fiber properly. (cherry picked from commit `49e8c14a86`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	22739df69f	s3_client: Fix indentation in try..catch block Correct indentation in the `try..catch` block to improve code readability and maintain consistent formatting. (cherry picked from commit `e50f247bf1`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	54db6ca088	s3_client: Stop retries in chunked download source Disable retries for S3 requests in the chunked download source to prevent duplicate chunks from corrupting the buffer queue. The response handler now throws an exception to bypass the retry strategy, allowing the next range to be attempted cleanly. This exception is only triggered for retryable errors; unretryable ones immediately halt further requests. (cherry picked from commit `d2d69cbc8c`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	c841ffe398	s3_client: Enhance test coverage for retry logic Extend the S3 proxy to support error injection when the client makes multiple requests to the same resource—useful for testing retry behavior and failure handling. (cherry picked from commit `c75acd274c`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	c748a97170	s3_client: Add test for Content-Range fix Introduce a test that accurately verifies the Content-Range behavior, ensuring the previous fix is properly validated. (cherry picked from commit `ec59fcd5e4`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	00f10e7f1d	s3_client: Fix missing negation Restore a missing `not` in a conditional check that caused incorrect behavior during S3 client execution. (cherry picked from commit `6d9cec558a`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	4cd1792528	s3_client: Refine logging Fix typo in log message to improve clarity and accuracy during S3 operations. (cherry picked from commit `e73b83e039`)	2025-07-13 13:17:14 +00:00
Ernest Zaslavsky	115e8c85e4	s3_client: Improve logging placement for current_range output Relocated logging to occur after determining the `current_range`, ensuring more relevant output during S3 client operations. (cherry picked from commit `f1d0690194`)	2025-07-13 13:17:14 +00:00
Gleb Natapov	087d3bb957	api: unregister raft_topology_get_cmd_status on shutdown In `c8ce9d1c60` we introduced raft_topology_get_cmd_status REST api but the commit forgot to unregister the handler during shutdown. Fixes #24910 Closes scylladb/scylladb#24911 (cherry picked from commit `89f2edf308`) Closes scylladb/scylladb#24923	2025-07-13 15:15:52 +03:00
Avi Kivity	f3297824e3	Revert "config: decrease default large allocation warning threshold to 128k" This reverts commit `04fb2c026d`. 2025.3 got the reduced threshold, but won't get many of the fixes the warning will generate, leaving it very noisy. Better to avoid the noise for this release. Fixes #24384. scylla-2025.3.0-rc1-candidate-20250713045928 scylla-2025.3.0-rc1	2025-07-10 14:12:14 +03:00
Avi Kivity	4eb220d3ab	service: tablet_allocator: avoid large contiguous vector in make_repair_plan() make_repair_plan() allocates a temporary vector which can grow larger than our 128k basic allocation unit. Use a chunked vector to avoid stalls due to large allocations. Fixes #24713. Closes scylladb/scylladb#24801 (cherry picked from commit `0138afa63b`) Closes scylladb/scylladb#24902	2025-07-10 12:41:35 +03:00
Patryk Jędrzejczak	c9de7d68f2	Merge '[Backport 2025.3] Make it easier to debug stuck raft topology operation.' from Scylladb[bot] The series adds more logging and provides a new REST api around topology command rpc execution to allow easier debugging of stuck topology operations. Backport since we want to have it in production as quickly as possible. Fixes #24860 - (cherry picked from commit `c8ce9d1c60`) - (cherry picked from commit `4e6369f35b`) Parent PR: #24799 Closes scylladb/scylladb#24881 * https://github.com/scylladb/scylladb: topology coordinator: log a start and an end of topology coordinator command execution at info level topology coordinator: add REST endpoint to query the status of ongoing topology cmd rpc	2025-07-09 12:55:48 +02:00
Piotr Dulikowski	b535f44db2	Merge '[Backport 2025.3] batchlog_manager: abort replay of a failed batch on shutdown or node down' from Scylladb[bot] When replaying a failed batch and sending the mutation to all replicas, make the write response handler cancellable and abort it on shutdown or if some target is marked down. Also set a reasonable timeout so it gets aborted if it's stuck for some other unexpected reason. Previously, the write response handler was not cancellable and had no timeout. This could cause a scenario where some write operation by the batchlog manager is stuck indefinitely, and node shutdown gets stuck as well because it waits for the batchlog manager to complete, without aborting the operation. Backport to relevant versions since the issue can cause node shutdown to hang. Fixes scylladb/scylladb#24599 - (cherry picked from commit `8d48b27062`) - (cherry picked from commit `fc5ba4a1ea`) - (cherry picked from commit `7150632cf2`) - (cherry picked from commit `74a3fa9671`) - (cherry picked from commit `a9b476e057`) - (cherry picked from commit `d7af26a437`) Parent PR: #24595 Closes scylladb/scylladb#24882 * github.com:scylladb/scylladb: test: test_batchlog_manager: batchlog replay includes cdc test: test_batchlog_manager: test batch replay when a node is down batchlog_manager: set timeout on writes batchlog_manager: abort writes on shutdown batchlog_manager: create cancellable write response handler storage_proxy: add write type parameter to mutate_internal	2025-07-08 12:35:55 +02:00
Michael Litvak	ec1dd1bf31	test: test_batchlog_manager: batchlog replay includes cdc Add a new test that verifies that when replaying batch mutations from the batchlog, the mutations include cdc augmentation if needed. This is done in order to verify that it works currently as expected and doesn't break in the future. (cherry picked from commit `d7af26a437`)	2025-07-08 06:25:36 +00:00
Michael Litvak	7b30f487dd	test: test_batchlog_manager: test batch replay when a node is down Add a test of the batchlog manager replay loop applying failed batches while some replica is down. The test reproduces an issue where the batchlog manager tries to replay a failed batch, doesn't get a response from some replica, and becomes stuck. It verifies that the batchlog manager can eventually recover from this situation and continue applying failed batches. (cherry picked from commit `a9b476e057`)	2025-07-08 06:25:36 +00:00
Michael Litvak	c3c489d3d4	batchlog_manager: set timeout on writes Set a timeout on writes of replayed batches by the batchlog manager. We want to avoid having infinite timeout for the writes in case it gets stuck for some unexpected reason. The timeout is set to be high enough to allow any reasonable write to complete. (cherry picked from commit `74a3fa9671`)	2025-07-08 06:25:36 +00:00
Michael Litvak	6fb6bb8dc7	batchlog_manager: abort writes on shutdown On shutdown of batchlog manager, abort all writes of replayed batches by the batchlog manager. To achieve this we set the appropriate write_type to BATCH, and on shutdown cancel all write handlers with this type. (cherry picked from commit `7150632cf2`)	2025-07-08 06:25:36 +00:00
Michael Litvak	02c038efa8	batchlog_manager: create cancellable write response handler When replaying a batch mutation from the batchlog manager and sending it to all replicas, create the write response handler as cancellable. To achieve this we define a new wrapper type for batchlog mutations - batchlog_replay_mutation, and this allows us to overload create_write_response_handler for this type. This is similar to how it's done with hint_wrapper and read_repair_mutation. (cherry picked from commit `fc5ba4a1ea`)	2025-07-08 06:25:36 +00:00
Michael Litvak	d3175671b7	storage_proxy: add write type parameter to mutate_internal Currently mutate_internal has a boolean parameter `counter_write` that indicates whether the write is of counter type or not. We replace it with a more general parameter that indicates the write type. It is compatible with the previous behavior - for a counter write, the type COUNTER is passed, and otherwise a default value will be used as before. (cherry picked from commit `8d48b27062`)	2025-07-08 06:25:36 +00:00
Gleb Natapov	4651c44747	topology coordinator: log a start and an end of topology coordinator command execution at info level Those calls are relatively rare and the output may help to analyze issues in production. (cherry picked from commit `4e6369f35b`)	2025-07-08 06:24:22 +00:00
Gleb Natapov	0e67f6f6c2	topology coordinator: add REST endpoint to query the status of ongoing topology cmd rpc The topology coordinator executes several topology cmd rpc against some nodes during a topology change. A topology operation will not proceed unless rpc completes (successfully or not), but sometimes it appears that it hangs and it is hard to tell on which nodes it did not complete yet. Introduce new REST endpoint that can help with debugging such cases. If executed on the topology coordinator it returns currently running topology rpc (if any) and a list of nodes that did not reply yet. (cherry picked from commit `c8ce9d1c60`)	2025-07-08 06:24:21 +00:00
Avi Kivity	859d9dd3b1	Merge '[Backport 2025.3] Improve background disposal of tablet_metadata' from Scylladb[bot] As seen in #23284, when the tablet_metadata contains many tables, even empty ones, we're seeing a long queue of seastar tasks coming from the individual destruction of `tablet_map_ptr = foreign_ptr<lw_shared_ptr<const tablet_map>>`. This change improves `tablet_metadata::clear_gently` to destroy the `tablet_map_ptr` objects on their owner shard by sorting them into vectors, per owner shard. Also, background call to clear_gently was added to `~token_metadata`, as it is destroyed arbitrarily when automatic token_metadata_ptr variables go out of scope, so that the contained tablet_metadata would be cleared gently. Finally, a unit test was added to reproduce the `Too long queue accumulated for gossip` symptom and verify that it is gone with this change. Fixes #24814 Refs #23284 This change is not marked as fixing the issue since we still need to verify that there is no impact on query performance, reactor stalls, or large allocations, with a large number of tablet-based tables. * Since the issue exists in 2025.1, requesting backport to 2025.1 and upwards - (cherry picked from commit `3acca0aa63`) - (cherry picked from commit `493a2303da`) - (cherry picked from commit `e0a19b981a`) - (cherry picked from commit `2b2cfaba6e`) - (cherry picked from commit `2c0bafb934`) - (cherry picked from commit `4a3d14a031`) - (cherry picked from commit `6e4803a750`) Parent PR: #24618 Closes scylladb/scylladb#24864 * github.com:scylladb/scylladb: token_metadata_impl: clear_gently: release version tracker early test: cluster: test_tablets_merge: add test_tablet_split_merge_with_many_tables token_metadata: clear_and_destroy_impl when destroyed token_metadata: keep a reference to shared_token_metadata token_metadata: move make_token_metadata_ptr into shared_token_metadata class replica: database: get and expose a mutable locator::shared_token_metadata locator: tablets: tablet_metadata: clear_gently: optimize foreign ptr destruction	2025-07-07 14:02:19 +03:00
Gleb Natapov	a25bd068bf	topology coordinator: do not set request_type field for truncation command if topology_global_request_queue feature is not enabled yet Old nodes do not expect global topology request names to be in request_type field, so set it only if a cluster is fully upgraded already. Closes scylladb/scylladb#24731 (cherry picked from commit `ca7837550d`) Closes scylladb/scylladb#24833	2025-07-07 11:50:55 +02:00
Benny Halevy	9bc487e79e	token_metadata_impl: clear_gently: release version tracker early No need to wait for all members to be cleared gently. We can release the version earlier since the held version may be awaited for in barriers. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `6e4803a750`)	2025-07-07 09:42:29 +03:00
Benny Halevy	41dc86ffa8	test: cluster: test_tablets_merge: add test_tablet_split_merge_with_many_tables Reproduces #23284 Currently skipped in release mode since it requires the `short_tablet_stats_refresh_interval` interval. Ref #24641 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `4a3d14a031`)	2025-07-07 09:42:26 +03:00
Benny Halevy	f78a352a29	token_metadata: clear_and_destroy_impl when destroyed We have a lot of places in the code where a token_metadata_ptr is kept in an automatic variable and destroyed when it leaves the scope. Since it's a reference-counted lw_shared_ptr, the token_metadata object is rarely destroyed in those cases, but when it is, it doesn't go through clear_gently, and in particular its tablet_metadata is not cleared gently, leading to inefficient destruction of potentially many foreign_ptr:s. This patch calls clear_and_destroy_impl that gently clears and destroys the impl object in the background using the shared_token_metadata. Fixes #13381 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `2c0bafb934`)	2025-07-07 09:38:17 +03:00
Benny Halevy	b647dbd547	token_metadata: keep a reference to shared_token_metadata To be used by a following patch to gently clean and destroy the token_data_impl in the background. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `2b2cfaba6e`)	2025-07-07 09:34:10 +03:00
Benny Halevy	0e7d3b4eb9	token_metadata: move make_token_metadata_ptr into shared_token_metadata class So we can use the local shared_token_metadata instance for safe background destroy of token_metadata_impl:s. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `e0a19b981a`)	2025-07-07 09:30:01 +03:00
Benny Halevy	c8043e05c1	replica: database: get and expose a mutable locator::shared_token_metadata Prepare for the next patch, which will use this shared_token_metadata to make mutable_token_metadata_ptr:s Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `493a2303da`)	2025-07-07 09:27:06 +03:00
Benny Halevy	54fb9ed03b	locator: tablets: tablet_metadata: clear_gently: optimize foreign ptr destruction Sort all tablet_map_ptr:s by shard_id and then destroy them on each shard to prevent long cross-shard task queues for foreign_ptr destructions. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> (cherry picked from commit `3acca0aa63`)	2025-07-07 09:27:01 +03:00
Avi Kivity	f60c54df77	storage_proxy: avoid large allocation when storing batch in system.batchlog Currently, when computing the mutation to be stored in system.batchlog, we go through data_value. In turn this goes through `bytes` type (#24810), so it causes a large contiguous allocation if the batch is large. Fix by going through the more primitive, but less contiguous, atomic_cell API. Fixes #24809. Closes scylladb/scylladb#24811 (cherry picked from commit `60f407bff4`) Closes scylladb/scylladb#24846	2025-07-05 00:37:09 +03:00
Patryk Jędrzejczak	f1ec51133e	docs: handling-node-failures: fix typo Replacing "from" is incorrect. The typo comes from recently merged #24583. Fixes #24732 Requires backport to 2025.2 since #24583 has been backported to 2025.2. Closes scylladb/scylladb#24733 (cherry picked from commit `fa982f5579`) Closes scylladb/scylladb#24832	2025-07-04 19:35:00 +02:00
Jenkins Promoter	648fe6a4e8	Update ScyllaDB version to: 2025.3.0-rc1	2025-07-03 11:35:01 +03:00
Michał Chojnowski	1bd536a228	utils/alien_worker: fix a data race in submit() We move a `seastar::promise` on the external worker thread, after the matching `seastar::future` was returned to the shard. That's illegal. If the `promise` move occurs concurrently with some operation (move, await) on the `future`, it becomes a data race which could cause various kinds of corruption. This patch fixes that by keeping the promise at a stable address on the shard (inside a coroutine frame) and only passing through the worker. Fixes #24751 Closes scylladb/scylladb#24752 (cherry picked from commit `a29724479a`) Closes scylladb/scylladb#24780	2025-07-03 10:45:51 +03:00
Avi Kivity	d5b11098e8	repair: row_level: unstall to_repair_rows_on_wire() destroying its input to_repair_rows_on_wire() moves the contents of its input std::list and is careful to yield after each element, but the final destruction of the input list still deals with all of the list elements without yielding. This is expensive as not all contents of repair_row are moved (_dk_with_hash is of type lw_shared_ptr<const decorated_key_with_hash>). To fix, destroy each row element as we move along. This is safe as we own the input and don't reference row_list other than for the iteration. Fixes #24725. Closes scylladb/scylladb#24726 (cherry picked from commit `6aa71205d8`) Closes scylladb/scylladb#24771	2025-07-03 10:44:58 +03:00
Tomasz Grabiec	775916132e	Merge '[Backport 2025.3] repair: postpone repair until topology is not busy ' from Scylladb[bot] Currently, repair_service::repair_tablets starts repair if there are no ongoing tablet operations. The check does not consider global topology operations, like tablet resize finalization. Hence, if: - topology is in the tablet_resize_finalization state; - repair starts (as there are no tablet transitions) and holds the erm; - resize finalization finishes; then the repair sees a topology state different from the actual one - it does not see that the storage groups were already split. Repair code does not handle this case and it results in on_internal_error. Start repair when topology is not busy. The check isn't atomic, as it's done on shard 0. Thus, we compare the topology versions to ensure that the busyness check is valid. Fixes: https://github.com/scylladb/scylladb/issues/24195. Needs backport to all branches since they are affected - (cherry picked from commit `df152d9824`) - (cherry picked from commit `83c9af9670`) Parent PR: #24202 Closes scylladb/scylladb#24783 * github.com:scylladb/scylladb: test: add test for repair and resize finalization repair: postpone repair until topology is not busy	2025-07-02 13:17:08 +02:00
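
The `SyntaxWarning: invalid escape sequence` fix in `05a6d4da23` says raw strings should be used for the regex patterns. A minimal Python sketch of that idea follows. The pattern, the helper name `parse_sysconfig_line`, and the sample inputs are hypothetical illustrations and are not taken from `scylla_sysconfig_setup`.

```python
import re

# Hypothetical pattern, not the one used in scylla_sysconfig_setup.
# Written as a plain literal, '^(\w+)=...' would contain the invalid string
# escape '\w'; Python 3.12 and later report that as a SyntaxWarning.
# The r'' prefix keeps the backslash intact for the regex engine.
KEY_VALUE = re.compile(r'^(\w+)="?([^"\n]*)"?$')


def parse_sysconfig_line(line):
    """Return (key, value) for a KEY=value line, or None if it does not match."""
    match = KEY_VALUE.match(line.strip())
    if match is None:
        return None
    return match.group(1), match.group(2)
```

Doubling every backslash in a non-raw literal would silence the warning as well; a raw string is usually easier to read for regex patterns.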


48311 Commits