scylladb

Author	SHA1	Message	Date
Pavel Emelyanov	1796997ace	Merge 'restore: Enable direct download of fully contained SSTables' from Ernest Zaslavsky This PR refactors the streaming subsystem to support direct download of fully contained sstables. Instead of streaming these files, they are downloaded and attached directly to their corresponding tables. This approach reduces overhead, simplifies logic, and improves efficiency. Expected node scope restore performance improvement: ~4 times faster in best case scenario when all sstables are fully contained. 1. Add storage options field to sstable Introduce a data member to store storage options, enabling distinction between local and object storage types. 2. Add method to create component source Extend the storage interface with a public method to create a data_source for any sstable component. 3. Inline streamer instance creation Remove make_sstable_streamer and inline its usage to allow different sets of arguments at call sites. 4. Skip streaming empty sstable sets Avoid unnecessary streaming calls when the sstable set is empty. 5. Enable direct download of contained sstables Replace streaming of fully contained sstables with direct download, attaching them to their corresponding table. Fixes: https://scylladb.atlassian.net/browse/SCYLLADB-200 Refs: https://github.com/scylladb/scylladb/issues/23908 No need to backport as this code targets 2026.2 release (for tablet-aware restore) Closes scylladb/scylladb#26834 * github.com:scylladb/scylladb: tests: reuse test_backup_broken_streaming streaming: enable direct download of contained sstables storage: add method to create component source streaming: keep sharded database reference on tablet_sstable_streamer streaming: skip streaming empty sstable sets streaming: inline streamer instance creation tests: fix incorrect backup/restore test flow	2026-01-26 10:22:34 +03:00
Andrei Chekun	cc5ac75d73	test.py: remove deprecated skip_mode decorator Finishing the deprecation of the skip_mode function in favor of pytest.mark.skip_mode. This PR is only cleaning and migrating leftover tests that are still used and old way of skip_mode. Closes scylladb/scylladb#28299	2026-01-25 18:17:27 +02:00
Ernest Zaslavsky	70f5bc1a50	tests: reuse test_backup_broken_streaming reuse the `test_backup_broken_streaming` test to check for direct sstable download	2026-01-25 13:27:44 +02:00
Ernest Zaslavsky	13fb605edb	streaming: enable direct download of contained sstables Instead of streaming fully contained sstables, download them directly and attach them to their corresponding table. This simplifies the process and avoids unnecessary streaming overhead.	2026-01-25 13:27:44 +02:00
Ernest Zaslavsky	32173ccfe1	tests: fix incorrect backup/restore test flow When working directly with sstable components, the provided name should be only the file name without path prefixes. Any prefixing tokens belong in the 'prefix' argument, as the name suggests.	2026-01-21 16:40:12 +02:00
Robert Bindar	ea8a661119	reduce test_backup.py and test_refresh.py datasets backup and restore tests. This made the testing times explode with both cluster/object_store/test_backup.py and cluster/test_refresh.py taking more than an hour each to complete under test.py and around 14min under pytest directly. This was painful especially in CI because it runs tests under test.py which suffers from the issue of not being able to run test cases from within the same file in parallel (a fix is attempted in 27618). This patch reduces the dataset of these tests to the minimum and gets rid of one of the tested topology as it was redundant. The test times are reduced to 2min under pytest and 14 mins under test.py. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com> Closes scylladb/scylladb#28280	2026-01-21 10:47:36 +02:00
Pavel Emelyanov	8ecd4d73ac	test: Update cluster/object_store/ tests to use new S3 config format Currently the suite generates config in old format, and only a single test validates that using new format "works". This change updates the suite (mainly the MinioServer::create_conf() method) to generate endpoint confit in new format. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#28113	2026-01-20 10:53:34 +02:00
Patryk Jędrzejczak	eb7be9010d	Merge 'topology_coordinator: Refresh load stats after table is created or altered' from Tomasz Grabiec We switched to the size-based load balancing, which now has more strict requirements for load stats. We no longer need only per-node stats, but also per-tablet stats. Bootstrapping a node triggers stats refresh, but allocating tablets on table creation didn't. So after creating a table, load balancer couldn't make progress for up to 60s (stats refresh period). This makes tests take longer, and can even cause failures if tests are using a low-enough timeout. Fixes https://github.com/scylladb/scylladb/issues/27921 No backport becuse only master is vulnerable (size-based load balancing). Closes scylladb/scylladb#27926 * https://github.com/scylladb/scylladb: test: cluster: Add reproducer for missed notification in topology coordinator topology_coordinator: Wake up the state machine after stats refresh topology_coordinator: Move tablet_load_stats_refresh_before_rebalancing injection earlier topology_coordinator: Fix potential missed notification topology_coordinator: Refresh load stats after table is created or altered tablets: Do a group0 read barrier on tablet load stats refresh topology_coordinator: Ensure stats are refreshed in the gossip scheduling group test: Use ManagerClient.{disable,enable}_tablet_balancing() test: Add missing calls to disable_tablet_balancing() in tests which use move_tablet() API test: pylib: Introduce ManagerClient.{disable,enable}_tablet_balancing()	2026-01-16 11:34:57 +01:00
Tomasz Grabiec	eef798d84f	Merge 'Distribute data evenly among primary replicas during restore' from Robert Bindar Most likely `817fdad` uncovered the fact that our choice of primary replica was resonating with tablet allocation and we were ending up picking the same replica as primary within a scope instead of rotating primaryship among all replicas in the scope. This created situations where for instance, restoring into a 9 nodes with primary_replica_only=true would put all data into 3 nodes, leaving the other 6 unused. The balancing of the dataset was performed by the subsequent repair step. This PR fixes this by changing the formula for picking up the primary replica out of a set of eligible replicas from within the passed scope. The PR also extends the testing scenarios in `test_backup.py` so we get to run restore for a set of topologies, for all combinations of scope, primary_replica_only and min_tablet_counts. Most of the work was done by @bhalevy [here](https://github.com/scylladb/scylladb/compare/master...bhalevy:scylla:load-balance-primary-replica), this PR just splitted it and did touchups here and there. Fixes #27281 Closes scylladb/scylladb#27397 * github.com:scylladb/scylladb: test: reduce dataset and number of test cases or debug builds test: bump repair timeout up, it's sometimes not enough in CI test: refactor test_refresh.py to match test_restore_with_streaming_scopes. test: extend test_restore_with_streaming_scopes test: Adjust test_restore_primary_replica_different_dc_scope_all test: Refactor restoring code in test_backup to match SM pattern test: add check_mutation_replicas calls after fresh creation of dataset test: extend create_dataset to accept consistency_level test: refactor check_mutation_replicas so it's more readable test: make create_dataset async and refactor so it's configurable test: use defaultdict in collect_mutations test: add log marks to facilitate reusing server for restore locator: tablets: Distribute data evenly among primary replicas during restore	2026-01-14 18:57:55 +01:00
Avi Kivity	bd08b6e5b2	Merge 'Unify configuration of object storage endpoints (take 2)' from Pavel Emelyanov To configure S3 storage, one needs to do ``` object_storage_endpoints: - name: s3.us-east-1.amazonaws.com port: 443 https: true aws_region: us-east-1 ``` and for GCS it's ``` object_storage_endpoints: - name: https://storage.googleapis.com:433 type: gs credentials_file: <gcp account credentials json file> ``` This PR updates the S3 part to look like ``` object_storage_endpoints: - name: https://s3.us-east-1.amazonaws.com:443 aws_region: us-east-1 ``` fixes: #26570 This is 2nd attempt, previous one (#27360) was reverted because it reported endpoint configs in new format via API and CQL always, even if the endpoint was configured in the old way. This "broke" scylla manager and some dtests. This version has this bug fixed, and endpoints are reported in the same format as they were configured with. About correctness of the changes. No modifications to existing tests are made here, so old format is respected correctly (as far as it's covered by tests). To prove the new format works the the test_get_object_store_endpoints is extended to validate both options. Some preparations to this test to make this happen come on their own with the PR #28111 to show that they are valid and pass before changing the core code. Enhancing the way configuration is made, likely no need to backport. Closes scylladb/scylladb#28112 * github.com:scylladb/scylladb: test: Validate S3 endpoints new format works docs: Update docs according to new endpoints config option format object_storage: Create s3 client with "extended" endpoint name s3/storage: Tune config updating sstable: Shuffle args for s3_client_wrapper test: Rename badconf variable into objconf test: Split the object_store/test_get_object_store_endpoints test	2026-01-14 18:29:03 +02:00
Pavel Emelyanov	9ffd22491f	test: Validate S3 endpoints new format works Extend the test_get_object_store_endpoints() test to configure S3 endpoints in full-url format and check that they are rendered properly via API/CQL. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:24:18 +03:00
Pavel Emelyanov	83e88d206c	test: Rename badconf variable into objconf It's not actually a "bad" config, it's just some config the test works with. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:23:20 +03:00
Pavel Emelyanov	9c627bc44a	test: Split the object_store/test_get_object_store_endpoints test It tests two things -- the way object storage config is represented via API and CQL (from sytem.config) and that updating config affects CREATE KEYSPACE CQL (with keyspace storage options) It's better to split the test, as its former part is going to be extented to validate old/new config formats (see #26570) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-13 13:23:03 +03:00
Robert Bindar	dfcabb5fa4	test: reduce dataset and number of test cases or debug builds Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:51 +02:00
Robert Bindar	ca3c57e821	test: bump repair timeout up, it's sometimes not enough in CI Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:49 +02:00
Robert Bindar	6f5e58e718	test: refactor test_refresh.py to match test_restore_with_streaming_scopes. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:48 +02:00
Robert Bindar	6e636a4231	test: extend test_restore_with_streaming_scopes to test restoring with a different min_tablet_count than the schema was originally created with. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:46 +02:00
Robert Bindar	92cd1ddec3	test: Adjust test_restore_primary_replica_different_dc_scope_all to match the new topology arhitecture Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:44 +02:00
Robert Bindar	db13ece9a0	test: Refactor restoring code in test_backup to match SM pattern This patch refactors the restoring code in cluster/test_backup.py so it matches better the way SM works. The patch also refactors test_restore_with_streaming_scopes so to facilitate running restore scenarios under all supported scopes with or w/o primary_replica_only enabled by reusing the servers and backups for a topology. This allows us to test a lot more scenarios without making the test impossibly slow. split from bhalevy/load-balance-primary-replica Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:43 +02:00
Robert Bindar	ba01589f53	test: add check_mutation_replicas calls after fresh creation of dataset to validate that mutation assertions are sane split from bhalevy/load-balance-primary-replica Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:41 +02:00
Robert Bindar	b835d32cb0	test: extend create_dataset to accept consistency_level Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:35 +02:00
Robert Bindar	e7d44356d9	test: refactor check_mutation_replicas so it's more readable split from bhalevy/load-balance-primary-replica Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:31 +02:00
Robert Bindar	733b4dbbb7	test: make create_dataset async and refactor so it's configurable with num_keys and min_tablet_count split from bhalevy/load-balance-primary-replica Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:46:20 +02:00
Robert Bindar	f2c8949e4a	test: use defaultdict in collect_mutations split from bhalevy/load-balance-primary-replica Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:45:03 +02:00
Robert Bindar	45faeba97d	test: add log marks to facilitate reusing server for restore split from bhalevy/load-balance-primary-replica Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2026-01-13 11:44:48 +02:00
Botond Dénes	d2db84714e	test/cluster/object_store/conftest.py: add missing call to parent constructor Replace manual init of parent fields. Found by CodeQL: "Missing call to superclass `__init__` during object initialization". The secret_key is not initialized to server.secret_key, instead of server.access_key. This probably fixes a (benign) bug.	2026-01-13 08:33:17 +02:00
Tomasz Grabiec	5e6935f276	test: Use ManagerClient.{disable,enable}_tablet_balancing()	2026-01-13 00:38:00 +01:00
Botond Dénes	9b4a7f1d14	Merge 'test: cluster: object_store: test_backup: modernize do_abort_restore' from Benny Halevy Currently the function uses a regular expression to check the system log for a specific message. This is tangential to the ability to cleanly abort the restore task, plus the regular expression has a syntax error: ``` test/cluster/object_store/test_backup.py:534 /home/bhalevy/dev/scylla/test/cluster/object_store/test_backup.py:534: SyntaxWarning: "\(" is an invalid escape sequence. Such sequences will not work in the future. Did you mean "\\("? A raw string is also an option. await wait_for_first_completed([l.wait_for("Failed to handle STREAM_MUTATION_FRAGMENTS \(receive and distribute phase\) for .+: Streaming aborted", timeout=10) for l in logs]) ``` Thsi change modernizes the implementation by: - using auto_dc_rack for manager.servers_add - using new_test_keyspace to generate and auto delete the keyspace - using async gatherio and a prepared statement to insert the data - simplifing the keys and values by NOT using os.urandom (that is notoriously slow) - inserting fewer keys in debug mode - removing the log check With that, the test can be reenabled in all modes. * No backport needed since the test was disabled Closes scylladb/scylladb#27892 * github.com:scylladb/scylladb: test_backup: do_abort_restore: reduce data footprint test_backup: do_abort_restore: use error injection test_backup: do_abort_restore: use asyncio for cql test_backup: do_abort_restore: use new_test_keyspace test_backup: do_abort_restore: use logger rather than print test_backup: do_abort_restore: pass auto_rack_dc to servers_add	2026-01-08 21:55:18 +02:00
Andrei Chekun	c950c2e582	test.py: convert skip_mode function to pytest.mark Function skip_mode works only on function and only in cluster test. This if OK when we need to skip one test, but it's not possible to use it with pytestmark to automatically mark all tests in the file. The goal of this PR is to migrate skip_mode to be dynamic pytest.mark that can be used as ordinary mark. Closes scylladb/scylladb#27853 [avi: apply to test/cluster/test_tablets.py::test_table_creation_wakes_up_balancer]	2026-01-08 21:55:16 +02:00
Avi Kivity	0df85c8ae8	Revert "Merge 'Unify configuration of object storage endpoints' from Pavel Emelyanov" This reverts commit `1bb897c7ca`, reversing changes made to `954f2cbd2f`. It makes incompatible changes to the object storage configuration format, breaking tests [1]. It's likely that it doesn't break any production configuration, but we can't be sure. Fixes #27966 Closes scylladb/scylladb#27969	2026-01-05 08:53:41 +02:00
Benny Halevy	a8114f9bcc	test_backup: do_abort_restore: reduce data footprint To make the test fast, in particular in debug mode insert fewer keys and do not rely on os.urandom which is notoriously slow Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2026-01-05 08:04:08 +02:00
Benny Halevy	c0dd662144	test_backup: do_abort_restore: use error injection Currently the test depends on timing and enough inserted data to abort the restore tasks at exactly the right time. This is flaky in nature, so instead, use error injection to synchronize the abort with mutation streaming. Note that with that we no longer get the STREAM_MUTATION_FRAGMENTS log message, so waiting for it is dropped from the test. The most imporant thing is that some restore tasks must fail. (We cannot guarantee all would fail unfortunately) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2026-01-05 08:03:53 +02:00
Benny Halevy	16dd07c7d4	test_backup: do_abort_restore: use asyncio for cql Use the more modern asyncio facility to run cql queries and a prepared statement to insert data into the table. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2026-01-05 08:03:53 +02:00
Benny Halevy	f1a583c39c	test_backup: do_abort_restore: use new_test_keyspace For creating a keyspace with a unique name and auto-deleting it on exit. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2026-01-05 08:03:53 +02:00
Benny Halevy	3e8431a3d9	test_backup: do_abort_restore: use logger rather than print Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2026-01-05 08:03:53 +02:00
Benny Halevy	acb2c9b045	test_backup: do_abort_restore: pass auto_rack_dc to servers_add To generate multi-rack cluster, otherwise we get the following error: ``` E cassandra.protocol.ConfigurationException: <Error from server: code=2300 [Query invalid because of configuration issue] message="Replication factor 3 exceeds the number of racks (1) in dc datacenter1"> ``` Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2026-01-05 08:03:53 +02:00
Nadav Har'El	e28df9b3d0	test: fix Python warnings in regular expressions Like C, Python supports some escape sequences in strings such as the familiar "\n" that converts to a newline character. Originally, when backslash was used before a random character, for example, "\.", Python used to just use these literal characters backslash and dot, in the string - and not make a fuss about it. This made it ok to use a string like "hi\.there" as a regular expression. We have a few instances of this in our Python tests. But recent releases of Python started to produce ugly warnings about these cases. The error message looks like: SyntaxWarning: "\." is an invalid escape sequence. Such sequences will not work in the future. Did you mean "\\."? A raw string is also an option. Indeed in most cases the easiest solution is to use a "raw string", a string literal preceded with r. For example, r"hi\.there". In such strings Python doesn't replace escape sequences like \n in the string, and also leaves the \. unchanged for the regular expression to see. So in this patch we use raw strings in all places in test/ where Python warns have this problem. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27856	2025-12-31 20:44:01 +02:00
Botond Dénes	95d4c73eb1	Merge 'Make object storage config truly updateable' from Pavel Emelyanov The db::config::object_storage_endpoints parameter is live-updateable, but when the update really happens, the new endpoints may fail to propagate to non-zero shards because of the way db::config sharding is implemented. Refs: #7316 Fixes: #26509 Backport to 2025.3 and 2025.4, AFAIK there are set ups with object storage configs for native backup Closes scylladb/scylladb#27689 * github.com:scylladb/scylladb: sstables/storage_manager: Fix configured endpoints observer test/object_store: Add test to validate how endpoint config update works	2025-12-24 13:42:44 +02:00
Botond Dénes	1bb897c7ca	Merge 'Unify configuration of object storage endpoints' from Pavel Emelyanov To configure S3 storage, one needs to do ``` object_storage_endpoints: - name: s3.us-east-1.amazonaws.com port: 443 https: true aws_region: us-east-1 ``` and for GCS it's ``` object_storage_endpoints: - name: https://storage.googleapis.com:433 type: gs credentials_file: <gcp account credentials json file> ``` This PR updates the S3 part to look like ``` object_storage_endpoints: - name: https://s3.us-east-1.amazonaws.com:443 aws_region: us-east-1 ``` fixes: #26570 Not-yet released feature, no need to backport. Old configs are not accepted any longer. If it's needed, then this decision needs to be revised. Closes scylladb/scylladb#27360 * github.com:scylladb/scylladb: object_storage: Temporarily handle pure endpoint addresses as endpoints code: Remove dangling mentions of s3::endpoint_config docs: Update docs according to new endpoints config option format object_storage: Create s3 client with "extended" endpoint name test: Add named constants for test_get_object_store_endpoints endpoint names s3/storage: Tune config updating sstable: Shuffle args for s3_client_wrapper	2025-12-24 06:59:02 +02:00
Pavel Emelyanov	132aa753da	sstables/storage_manager: Fix configured endpoints observer On start the manager creates observer for object_storage_endpoints config parameter. The goal is to refresh the maintained set of endpoint parameters and client upon config change. The observer is created on shard 0 only, and when kicked it calls manager.invoke-on-all to update manager on all shards. However, there's a race here. The thing is that db::config values are implicitly "sharded" under the hood with the help of plain array. When any code tries to read a value from db::config::something, the reading code secretly gets the value from this inner array indexed by the current shard id. Next, when the config is updated, it first assigns new values to [0] element of the hidden array, then calls broadcast_to_all_shards() helper that copies the valaues from zeroth slot to all the others. But the manager's observer is triggered when the new value is assigned on zero index, and if the invoke-on-all lambda (mentioned above) happens to be faster than broadcast_to_all_shards, the non-zero shards will read old values from db::config's inner array. The fix is to instantiate observer on all shards and update only local shard, whenever this update is triggered. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-12-23 12:43:11 +03:00
Pavel Emelyanov	f902eb1632	test/object_store: Add test to validate how endpoint config update works There's a test for backup with non-existing endpoint/bucket/snapshot. It checks that API call to backup sstables properly fails in that case. This patch adds similar test for "unconfigured endpoint", but it adds the endpoint configuration on-the-fly and expects that backup will proceed after config update. Currently the test fails, as config update only affect the config itself, the storage_manager, that's in charge of maintaining endpoint clients, is not really updated. Next patch will fix it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-12-23 12:41:38 +03:00
Pavel Emelyanov	cd2568ad00	test: Merge and parametrize test_backup_to_non_existent_something tests There are three tests in cluster/object_store suite that check how backup fails in case either of its parameters doesn't really exists. All three greatly duplicate each other, it makes sense to merge them into one larger parametrized test. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#27695	2025-12-23 07:02:18 +02:00
Pavel Emelyanov	a3ca4fccef	object_storage: Create s3 client with "extended" endpoint name For this, add the s3::client::make(endpoint, ...) overload that accepts endpoint in proto://host:port format. Then it parses the provided url and calls the legacy one, that accepts raw host string and config with port, https bit, etc. The generic object_storage_endpoint_param no longer needs to carry the internal s3::endpoint_config, the config option parsing changes respectively. Tests, that generate the config files, and docs are updated. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-12-10 15:33:47 +03:00
Pavel Emelyanov	5953a89822	test: Add named constants for test_get_object_store_endpoints endpoint names Instead of hard-coded 'a' and 'b' here and there Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-12-10 15:33:46 +03:00
Pavel Emelyanov	6ae72ed134	test: Reuse S3 fixtures facilities in cqlpy/test_tools.py Creating endpoint conf can be made with the s3_server method Getting boto3 resource from s3_server itself is also possible Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#27380	2025-12-03 16:32:54 +02:00
Botond Dénes	357f91de52	Revert "Merge 'db/config: enable `ms` sstable format by default' from Michał Chojnowski" This reverts commit `b0643f8959`, reversing changes made to `e8b0f8faa9`. The change forgot to update sstables_manager::get_highest_supported_format(), which results in /system/highest_supported_sstable_version still returning me, confusing and breaking tests. Fixes: scylladb/scylla-dtest#6435 Closes scylladb/scylladb#27379	2025-12-02 14:38:56 +02:00
Michał Chojnowski	da51a30780	db/config: enable `ms` sstable format by default Trie-based sstable indexes are supposed to be (hopefully) a better default than the old BIG indexes. Make them the new default. If we change our mind, this change can be reverted later.	2025-11-21 12:39:46 +01:00
Ernest Zaslavsky	dedc8bdf71	streaming: fix loop break condition in tablet_sstable_streamer::stream Correct the loop termination logic that previously caused certain SSTables to be prematurely excluded, resulting in lost mutations. This change ensures all relevant SSTables are properly streamed and their mutations preserved.	2025-11-19 17:32:49 +02:00
Ernest Zaslavsky	656ce27e7f	streaming: add pytest case to reproduce mutation loss issue Introduce a test that demonstrates mutation loss caused by premature loop termination in tablet_sstable_streamer::stream. The code broke out of the SSTable iteration when encountering a non-overlapping range, which skipped subsequent SSTables that should have been partially contained. This test showcases the problem only. Example: Tablet range: [4, 5] SSTable ranges: [0,5] [0, 3] <--- is considered exhausted, and causes skip to next tablet [2, 5] <--- is missed for range [4, 5]	2025-11-18 09:34:41 +02:00
Robert Bindar	a04ebb829c	Add cluster tests for checking scoped primary_replica_only streaming This commits adds a tests checking various scenarios of restoring via load and stream with primary_replica_only and a scope specified. The tests check that in a few topologies, a mutation is replicated a correct amount of times given primary_replica_only and that streaming happens according to the scope rule passed. Signed-off-by: Robert Bindar <robert.bindar@scylladb.com>	2025-11-11 09:18:01 +02:00

1 2

77 Commits