scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-25 01:02:20 +00:00

Author	SHA1	Message	Date
copilot-swe-agent[bot]	c06760cf15	Fix multiple issues in test_out_of_space_prevention.py - Fix variable name error: host[0] → hosts[0] on line 98 - Add missing await keywords for async operations on lines 209 and 385 - Rename class random_content_file to RandomContentFile (PascalCase) - Fix function name typo: test_autotoogle_compaction → test_autotoggle_compaction Co-authored-by: mykaul <4655593+mykaul@users.noreply.github.com>	2025-12-23 09:25:16 +00:00
Pavel Emelyanov	cd2568ad00	test: Merge and parametrize test_backup_to_non_existent_something tests There are three tests in cluster/object_store suite that check how backup fails in case either of its parameters doesn't really exists. All three greatly duplicate each other, it makes sense to merge them into one larger parametrized test. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#27695	2025-12-23 07:02:18 +02:00
Avi Kivity	7586c5ccbd	Merge 'system.clients: add `client_options` map column' from Vladislav Zolotarov This pull request introduces a new caching mechanism for client options in the Alternator and transport layers, refactors how client metadata is stored and accessed, and extends the `system.clients` virtual table to surface richer client information. The changes improve efficiency by deduplicating commonly used strings (like driver names/versions and client options), and ensure that client data is handled in a way that's safe for cross-shard access. Additionally, the test suite and virtual table schema are updated to reflect the new client options data. Caching and client metadata refactoring: * The largest and most repeatable items in the connection state before this PR were a `driver_name` and a `driver_version` which were stored as an `sstring` object which means that the corresponding memory consumption was 16 bytes per each such value at least (the smallest size of the `seastar`'s `sstring` object) per-connection. In reality the driver name is usually longer than 15 characters, e.g. "ScyllaDB Python Driver" is 23 characters and this is not the longest driver name there is. In such cases the actual memory usage of a corresponding `sstring` object jumps to 8 + 4 + 1 + (string length, 23 in our example) + 1. So, for "ScyllaDB Python Driver" it would be 37 bytes (in reality it would be a bit more due to natural alignment of other allocations since the `contents` size is not well aligned (13 bytes), but let's ignore this for now). * These bytes add up quickly as there are more connections and, sometimes we are talking about millions of connections per-shard. * Using a smart pointer (`lw_shared_ptr`) referencing a corresponding cached value will effectively reduce the per-connection memory usage to be 8 bytes (a size of a pointer on 64-bit CPU platform) for each such value. While storing a corresponding `sstring` value only once. * This will would reduce the "variable" (per-connection) memory usage by at least 50%. And in case of "ScyllaDB Python Driver" driver version - by 78%! * And all this for a price of a single `loading_shared_values` object per-shard (implements a hash table) and a minor overhead for each value stored in it. * Introduced a new cache type (`client_options_cache_type`) for deduplicating and sharing client option strings, and refactored `client_data`, `client_state`, and related classes to use `foreign_ptr<std::unique_ptr<client_data>>` and cached entry types for fields like driver name, driver version, and client options. (`client_data.hh`, `service/client_state.hh`, `alternator/server.hh`, `alternator/controller.hh`, `transport/controller.hh`, `transport/protocol_server.hh`) [[1]](diffhunk://#diff-664a3b19e905481bdf8eb3843fc4d34691067bb97ab11cfd6e652e74aac51d9fR33-R36) [[2]](diffhunk://#diff-664a3b19e905481bdf8eb3843fc4d34691067bb97ab11cfd6e652e74aac51d9fL40-R56) [[3]](diffhunk://#diff-daadce1a2de3667511e59558f3a8f077b5ee30a14bcc6a99d588db90d0fcd2bdL105-R107) [[4]](diffhunk://#diff-daadce1a2de3667511e59558f3a8f077b5ee30a14bcc6a99d588db90d0fcd2bdL154-R182) [[5]](diffhunk://#diff-5fce246edf5abffb2351bd02e2eb1e9850880f7a00607ccaa90c3eee7ef57c6bL91-R92) [[6]](diffhunk://#diff-5fce246edf5abffb2351bd02e2eb1e9850880f7a00607ccaa90c3eee7ef57c6bL110-R111) [[7]](diffhunk://#diff-31730ba8e7374f784a88dc27c1512291cf73b7f24e08768f7466a3c8cfcc7a1aL96-R96) [[8]](diffhunk://#diff-19a97c0247cc08155ee49b277e43859ca32d6ef8cbff0ed7368ec5fa19e0a11eL172-R172) [[9]](diffhunk://#diff-eea7e2db5d799a25e717a72ac8ce5842bd4adb72b694d38d8f47166d9cd926faL356-R356) [[10]](diffhunk://#diff-d0b4ec3a144bbc5dc993866cf0b940850a457ff6156064f7e2b4b10ad0a95fefL80-R80) [[11]](diffhunk://#diff-4293b94c444d9bd5ecd17ce7eda8c00685d35ecf6e07f844efc91a91bbe85be1L46-R48) * Updated the methods for setting and getting driver name, driver version, and client options in `client_state` to be asynchronous and use the new cache. (`service/client_state.hh`, `service/client_state.cc`) [[1]](diffhunk://#diff-daadce1a2de3667511e59558f3a8f077b5ee30a14bcc6a99d588db90d0fcd2bdL154-R182) [[2]](diffhunk://#diff-99634aae22e2573f38b4e2f050ed2ac4f8173ff27f0ae8b3609d1f0cc1aeb775R347-R362) Virtual table and API enhancements: * Extended the `system.clients` virtual table schema and implementation to include a new `client_options` column (a map of option key/value pairs), and updated the table population logic to use the new cached types and foreign pointers. (`db/virtual_tables.cc`) [[1]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1R752) [[2]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1L769-R770) [[3]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1L809-R816) [[4]](diffhunk://#diff-05f7bff3edb39fb8759c90b445e860189f2f30e04717ed58bae42716082af3d1L828-R879) API and interface changes: * Changed the signatures of `get_client_data` methods throughout the codebase to return vectors of `foreign_ptr<std::unique_ptr<client_data>>` instead of plain `client_data` objects, to ensure safe cross-shard access. (`alternator/controller.hh`, `alternator/controller.cc`, `alternator/server.hh`, `alternator/server.cc`, `transport/controller.hh`, `transport/protocol_server.hh`) [[1]](diffhunk://#diff-31730ba8e7374f784a88dc27c1512291cf73b7f24e08768f7466a3c8cfcc7a1aL96-R96) [[2]](diffhunk://#diff-19a97c0247cc08155ee49b277e43859ca32d6ef8cbff0ed7368ec5fa19e0a11eL172-R172) [[3]](diffhunk://#diff-5fce246edf5abffb2351bd02e2eb1e9850880f7a00607ccaa90c3eee7ef57c6bL110-R111) [[4]](diffhunk://#diff-a7e2cda866c03a75afcf3b087de1c1dcd2e7aa996214db67f9a11ed6451e596dL988-R995) [[5]](diffhunk://#diff-eea7e2db5d799a25e717a72ac8ce5842bd4adb72b694d38d8f47166d9cd926faL356-R356) [[6]](diffhunk://#diff-d0b4ec3a144bbc5dc993866cf0b940850a457ff6156064f7e2b4b10ad0a95fefL80-R80) [[7]](diffhunk://#diff-4293b94c444d9bd5ecd17ce7eda8c00685d35ecf6e07f844efc91a91bbe85be1L46-R48) Testing and validation: * Updated the Python test for the `system.clients` table to verify the new `client_options` column and its contents, ensuring that driver name and version are present in the options map. (`test/cqlpy/test_virtual_tables.py`) [[1]](diffhunk://#diff-6dd8bd4a6a82cd642252a29dc70726f89a46ceefb991c3e63fc67e283f323f03R79) [[2]](diffhunk://#diff-6dd8bd4a6a82cd642252a29dc70726f89a46ceefb991c3e63fc67e283f323f03R88-R90) Closes scylladb/scylladb#25746 * github.com:scylladb/scylladb: transport/server: declare a new "CLIENT_OPTIONS" option as supported service/client_state and alternator/server: use cached values for driver_name and driver_version fields system.clients: add a client_options column controller: update get_client_data to use foreign_ptr for client_data	2025-12-22 20:02:40 +02:00
Emil Maskovsky	d60b908a8e	test/raft: improve reporting in the randomized_nemesis_test digest functions The Boost ASSERTs in the digest functions of the randomized_nemesis_test were not working well inside the state machine digest functions, leading to unhelpful boost::execution_exception errors that terminated the apply fiber, and didn't provide any helpful information. Replaced by explicit checks with on_fatal_internal_error calls that provide more context about the failure. Also added validation of the digest value after appending or removing an element, which allows to determine which operation resulted in causing the wrong value. This effectively reverts the changes done in https://github.com/scylladb/scylladb/pull/19282, but adds improved error reporting. Refs: scylladb/scylladb#27307 Refs: scylladb/scylladb#17030 Closes scylladb/scylladb#27791	2025-12-22 20:02:40 +02:00
Andrei Chekun	6ffdada0ea	test.py: modify JUnit report for easier rerun on CI This will allow to add custom XML attribute to the JUnit report. In this case there will be path to the function that can be used to run with pytest command. Parametrized tests will have path to the function excluding parameter. Closes scylladb/scylladb#27707	2025-12-22 20:02:40 +02:00
Botond Dénes	af5e73def9	Merge 'test/cqlpy: remove unused variables' from Nadav Har'El These patches fix a bunch of variables defined in test/cqlpy tests, but not used. Besides wasting a few bytes on disk, these unused variables can add confusion for readers who see them and might think they have some use which they are missing. All these unused variables were found by Copilot's "code quality" scanner, but I considered each of them, and fixed them manually. Closes scylladb/scylladb#27667 * github.com:scylladb/scylladb: test/cqlpy: remove unused variables test/cqlpy: use unique partition in test	2025-12-22 20:02:39 +02:00
Yaniv Michael Kaul	c1da552fa4	test/pylib/scylla_cluster.py:get_scylla_2025_1_executable() - retry curl download of 2025.1 For some reason, we might fail. Retry 10 times, and fail with an error code instead of 404 or whatnot. Benign, I hope - no need to backport. Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Closes scylladb/scylladb#27746	2025-12-22 14:45:06 +02:00
Karol Nowacki	addac8b3f7	vector_search: test: Fix flaky DNS resolution test The `vector_store_client_test_dns_resolving_repeated` test had race conditions causing it to be flaky. Two main issues were identified: 1. Race between initial refresh and manual trigger: The test assumes a specific resolution sequence, but timing variations between the initial DNS refresh (on client creation) and the first manual trigger (in the test loop) can cause unexpected delayed scheduling. 2. Extra triggers from resolve_hostname fiber: During the client refresh phase, the background DNS fiber clears the client list. If resolve_hostname executes in the window after clearing but before the update completes, pending triggers are processed, incrementing the resolution count unexpectedly. At count 6, the mock resolver returns a valid address (count % 3 == 0), causing the test to fail. The fix relaxes test assertions to verify retry behavior and client clearing on DNS address loss, rather than enforcing exact resolution counts. Fixes: #27074 Closes scylladb/scylladb#27685	2025-12-21 20:02:16 +02:00
Vlad Zolotarov	85adf6bdb1	system.clients: add a client_options column This new column is going to contain all OPTIONS sent in the STARTUP frame of the corresponding CQL session. The new column has a `frozen<map<text, text>>` type, and we are also optimizing the amount of required memory for storing corresponding keys and values by caching them on each shard level. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2025-12-20 12:26:15 -05:00
Botond Dénes	df2ac0f257	Merge 'test: dtest: schema_management_test.py: migrate from dtest' from Dario Mirovic This PR migrates schema management tests from dtest to this repository. One reason is that there is an ongoing effort to migrate tests from dtest to here. Test `TestLargePartitionAlterSchema.test_large_partition_with_drop_column` failed with timeout error once. The main suspect so far are infra related problems, like infra congestion. The [logs from the test execution](https://jenkins.scylladb.com/job/scylla-master/job/dtest-release/1062/testReport/junit/schema_management_test/TestLargePartitionAlterSchema/Run_Dtest_Parallel_Cloud_Machines___Dtest___full_split001___test_large_partition_with_drop_column/), linked in the issue [test_large_partition_with_drop_column failed on TimeoutError #26932](https://github.com/scylladb/scylladb/issues/26932) show the following: - `populate` works as intended - it starts, then during populate/insert drop column happened, then an exception is raised and intentionally ignored in the test, so no `Finish populate DB` for 50 x 1490 records - expected - drop column works as intended - interrupts `populate` and proceeds to flush - flush probably works as intended - logs are consistent with what we expect and what I got in local test runs - `read` is the only thing that visibly got stuck, all the way until timeout happened, 5 minutes after the start Migrating the test to this repo will also give us test start and end times on CI machines, in the sql report database. It has start and end timestamp for each test executed. We will be able to see how long does it usually take when the test is successful. It can not be seen from the logs, because logs are not kept for successful tests. Another thing this PR does is adding a log message at the end of `database::flush_all_tables`. This will let us know if a thread got stuck inside or finished successfully. This addresses the probably part of the flush analysis step described above. If the issue reoccurs, we will have more information. The test `test_large_partition_with_add_column` has not been executing for ~5 years. It was never migrated to pytest. The name was left as `large_partition_with_add_column_test`, and was skipped. Now it is enabled and updated. Both `test_large_partition_with_add_column` and `test_large_partition_with_drop_column` are improved. Small performance improvements: - Regex compilation extracted from the stress function to the module level, to avoid recompilation. - Do not materialize list in `stress_object` for loop. Use a generator expression. The tests in `TestLargePartitionAlterSchema` are `test_large_partition_with_add_column` and `test_large_partition_with_drop_column`. These tests need to replicate the following conditions that led to a bug before a fix from around 5 years ago. The scenario in which the problem could have happened has to involve: - a large partition with many rows, large enough for preemption (every 0.5ms) to happen during the scan of the partition. - appending writes to the partition (not overwrites) - scans of the partition - schema alter of that table. The issue is exposed only by adding or dropping a column, such that the added/dropped column lands in the middle (in alphabetical order) of the old column set. The way the test is set up is: - fixed number of writes per populate call - fixed number of reads This has the following implications: - if the machine executing the test is fast, all the writes are done before the 10 seconds sleep - there are too many reads - most of them get executed after the test logic is done This patch solves these issues in the following way: - populate lazily generates write data, and stops when instructed by `stop_populating` event - read, which is done sequentially, stops when instructed by `stop_reading` event - number of max operations is increased significantly, but the operations are stopped 1 second after node flush; this makes sure there are enough operations during the test, but also that the test does not take unnecessary time Test execution time has been reduced severalfold. On dev machine the time the tests take is reduced from 110 seconds to 34 seconds. scylla-dtest PR that removes migrated tests: [schema_management_test.py: remove tests already ported to scylladb repo #6427](https://github.com/scylladb/scylla-dtest/pull/6427) Fixes #26932 This is a migration of existing tests to this repository. No need for backport. Closes scylladb/scylladb#27106 * github.com:scylladb/scylladb: test: dtest: schema_management_test.py: speed up `TestLargePartitionAlterSchema` tests test: dtest: schema_management_test.py: fix large partition add column test test: dtest: schema_management_test.py: add `TestSchemaManagement.prepare` test: dtest: schema_management_test.py: test enhancements test: dtest: schema_management_test.py: make the tests work test: dtest: migrate setup and tools from dtest test: dtest: copy unmodified schema_management_test.py replica: database: flush_all_tables log on completion	2025-12-19 12:30:00 +02:00
Botond Dénes	093e97a539	Merge 'test: increase num of requests in driver_service_level tests' from Andrzej Jackowski `_verify_tasks_processed_metrics()` is used to check that the correct service level is used to process requests. It takes two service levels as arguments and executes numerous requests. After that, the number of tasks processed by one of the service levels is expected to rise by at least the number of executed requests. In contrast, the second service level is expected to process fewer tasks than the number of requests. Unfortunately, background noise may cause some tasks to be executed on the service level that is not supposed to process requests. This patch increases the number of executed requests to eliminate the chance of noise causing test failures. Additionally, this commit extends logging to make future investigation easier. Fixes: https://github.com/scylladb/scylladb/issues/27715 No backport, fix for test on master. Closes scylladb/scylladb#27735 * github.com:scylladb/scylladb: test: remove unused `get_processed_tasks_for_group` test: increase num of requests in driver_service_level tests	2025-12-19 10:54:14 +02:00
Emil Maskovsky	fa6e5d0754	test/random_failures: fix handling of banned notification After `39cec4a` node join may fail with either "init - Startup failed" notification or occasionally because it was banned, depending on timing. The change updates the test to handle both cases. Fixes: scylladb/scylladb#27697 No backport: This failure is only present in master. Closes scylladb/scylladb#27768	2025-12-19 09:55:31 +02:00
Emil Maskovsky	08518b2c12	test/raft: fix `test_joining_old_node_fails` flakiness When a node without the required feature attempts to join a Raft-based cluster with the feature enabled, there is a race between the join rejection response ("Feature check failed") and the ban notification ("received notification of being banned"). Depending on timing, either message may appear in the joining node's log. This starts to happen after `39cec4a` (which introduced informing the nodes about being banned). Updated the test to accept both error messages as valid, making the test robust against this race condition, which is more likely in debug mode or under slow execution. Fixes: scylladb/scylladb#27603 No backport: This failure is only present in master. Closes scylladb/scylladb#27760	2025-12-19 09:44:09 +02:00
Emil Maskovsky	2a75b1374e	test/raft: fix race condition in failure_detector_test The test had a sporadic failure due to a broken promise exception. The issue was in `test_pinger::ping()` which captured the promise by move into the subscription lambda, causing the promise to be destroyed when the lambda was destroyed during coroutine unwinding. Simplify `test_pinger::ping()` by replacing manual abort_source/promise logic with `seastar::sleep_abortable()`. This removes the risk of promise lifetime/race issues and makes the code simpler and more robust. Fixes: scylladb/scylladb#27136 Backport to active branches: This fixes a CI test issue, so it is beneficial to backport the fix. As this is a test-only fix, it is a low risk change. Closes scylladb/scylladb#27737	2025-12-19 09:42:19 +02:00
Łukasz Paszkowski	2cb9bb8f3a	test_user_writes_rejection: Disable speculative retries This test starts a 3-node cluster and creates a large blob file so that one node reaches critical disk utilization, triggering write rejections on that node. The test then writes data with CL=QUORUM and validates that the data: - did not reach the critically utilized node - did reach the remaining two nodes By default, tables use speculative retries to determine when coordinators may query additional replicas. Since the validation uses CL=ONE, it is possible that an additional request is sent to satisfy the consistency level. As a result: - the first check may fail if the additional request is sent to a node that already contains data, making it appear as if data reached the critically utilized node - the second check may fail if the additional request is sent to the critically utilized node, making it appear as if data did not reach the healthy node The patch fixes the flakiness by disabling the speculative retries. Fixes https://github.com/scylladb/scylladb/issues/27212 Closes scylladb/scylladb#27488	2025-12-19 09:39:09 +02:00
Dario Mirovic	f1d63d014c	test: dtest: schema_management_test.py: speed up `TestLargePartitionAlterSchema` tests The tests in `TestLargePartitionAlterSchema` are `test_large_partition_with_add_column` and `test_large_partition_with_drop_column`. These tests need to replicate the following conditions that led to a bug before a fix from around 5 years ago. The scenario in which the problem could have happened has to involve: - a large partition with many rows, large enough for preemption (every 0.5ms) to happen during the scan of the partition. - appending writes to the partition (not overwrites) - scans of the partition - schema alter of that table. The issue is exposed only by adding or dropping a column, such that the added/dropped column lands in the middle (in alphabetical order) of the old column set. The way the test is set up is: - fixed number of writes per populate call - fixed number of reads This has the following implications: - if the machine executing the test is fast, all the writes are done before the 10 seconds sleep - there are too many reads - most of them get executed after the test logic is done This patch solves these issues in the following way: - populate lazily generates write data, and stops when instructed by `stop_populating` event - read, which is done sequentially, stops when instructed by `stop_reading` event - number of max operations is increased significantly, but the operations are stopped 1 second after node flush; this makes sure there are enough operations during the test, but also that the test does not take unnecessary time Test execution time has been reduced severalfold. On dev machine the time the tests take is reduced from 110 seconds to 34 seconds. The patch also introduces a few small improvements: - `cs_run` renamed to `run_stress` for clarity - Stopped checking if cluster is `ScyllaCluster`, since it is the only one we use - `case_map` removed from `test_alter_table_in_parallel_to_read_and_write`, used `mixed` param directly - Added explanation comment on why we do `data[i].append(None)` - Replaced `alter_table` inner function with its body, for simplicity - Removed unnecessary `ck_rows` variable in `populate` - Removed unnecessary `isinstance(self.cluster. ScyllaCluster)` - Adjusted `ThreadPoolExecutor` size in several places where 5 workers are not needed - Replaced functional programming style expressions for `new_versions` and `columns_list` with comprehension/generator statement python style code, improving readability Refs #26932 fix	2025-12-18 17:07:27 +01:00
Dario Mirovic	f831ca5ab5	test: dtest: schema_management_test.py: fix large partition add column test `large_partition_with_add_column_test` and `large_partition_with_drop_column_test` were added on August 17th, 2020 in scylladb/scylla-dtest#1569. Only `large_partition_with_drop_column_test` was migrated to pytest, and renamed to `test_large_partition_with_drop_column` on March 31st, 2021 in scylladb/scylla-dtest#2051. Since then this test has not been running. This patch fixes it - the test is updated and renamed and the testing environment now properly picks it up. Refs #26932	2025-12-18 12:54:43 +01:00
Dario Mirovic	1fe0509a9b	test: dtest: schema_management_test.py: add `TestSchemaManagement.prepare` Extract repeated cluster initialization code in `TestSchemaManagement` into a separate `prepare` method. It holds all the common code for cluster preparation, with just the necessary parameters. Refs #26932	2025-12-18 12:54:43 +01:00
Dario Mirovic	e7d76fd8f3	test: dtest: schema_management_test.py: test enhancements Extract regex compilation from the stress functions to the module level, to avoid unnecessary regex compilation repetition. Add descriptions to the stress functions. Do not materialize list in `stress_object` for loop. Use a generator expression. Make `_set_stress_val` an object method. Refs #26932	2025-12-18 12:54:43 +01:00
Dario Mirovic	700853740d	test: dtest: schema_management_test.py: make the tests work Remove unused function markers. Add wait_other_notice=True to cluster start method in TestSchemaHistory.prepare function to make the test stable. Enable the test in suite.yaml for dev and debug modes. Fixes #26932	2025-12-18 12:54:43 +01:00
Dario Mirovic	3c5dd5e5ae	test: dtest: migrate setup and tools from dtest Migrate several functionalities from dtest. These will be used by the schema_management_test.py tests when they are enabled. Refs #26932	2025-12-18 12:54:43 +01:00
Dario Mirovic	5971b2ad97	test: dtest: copy unmodified schema_management_test.py Copy schema_management_test.py from scylla-dtest to test/cluster/dtest/schema_management_test.py. Add license header. Disable it for debug, dev, and release mode. Refs #26932	2025-12-18 12:54:42 +01:00
Dario Mirovic	f89315d02f	replica: database: flush_all_tables log on completion In database::flush_all_tables add log on completion. This slightly improves the readability of logs when debugging an issue. Refs #26932	2025-12-18 12:54:42 +01:00
Patryk Jędrzejczak	d5c205194b	Merge 'topology: Make removenode use left_token_ring state for global barrier' from Emil Maskovsky Make the removenode operation go through the `left_token_ring` state, similar to decommission. This ensures that when removenode completes, all nodes in the cluster are aware of the topology change through a global token metadata barrier. Previously, removenode would skip the `left_token_ring` state and go directly from `write_both_read_new` to `left` state. This meant that when the operation completed, some nodes might not yet know about the topology change, potentially causing issues with subsequent data plane requests. Key changes: - Both decommission and removenode now transition to `left_token_ring` state in the `write_both_read_new` handler - In `left_token_ring` state, only decommissioning nodes receive the shutdown RPC (removed nodes are already dead) - Updated documentation to reflect that both operations use this state This change improves consistency guarantees for removenode operations by ensuring cluster-wide awareness before completion. The change is protected by "REMOVENODE_WITH_LEFT_TOKEN_RING" feature flag to also support mixed clusters during e.g. upgrade. Fixes: scylladb/scylladb#25530 No backport: This fixes and issue found in tests. It can theoretically happen in production too, but wasn't reported in any customer issue, so a backport is not needed. Closes scylladb/scylladb#26931 * https://github.com/scylladb/scylladb: topology: make removenode use left_token_ring state for global barrier topology: allow removing nodes not having tokens features: add feature flag for removenode via left token ring	2025-12-18 09:34:38 +01:00
Andrzej Jackowski	6ad10b141a	test: remove unused `get_processed_tasks_for_group` The function `get_processed_tasks_for_group` was defined twice in `test_raft_service_levels.py`. This change removes the unused definition to avoid confusion and clean up the code.	2025-12-17 20:45:53 +01:00
Andrzej Jackowski	8cf8e6c87d	test: increase num of requests in driver_service_level tests `_verify_tasks_processed_metrics()` is used to check that the correct service level is used to process requests. It takes two service levels as arguments and executes numerous requests. After that, the number of tasks processed by one of the service levels is expected to rise by at least the number of executed requests. In contrast, the second service level is expected to process fewer tasks than the number of requests. Unfortunately, background noise may cause some tasks to be executed on the service level that is not supposed to process requests. This patch increases the number of executed requests to eliminate the chance of noise causing test failures. Additionally, this commit extends logging to make future investigation easier. Fixes: scylladb/scylladb#27715	2025-12-17 20:45:48 +01:00
Michael Litvak	3a06c32749	schema_registry: fix learning a schema with cdc schema When learning a schema that has a linked cdc schema, we need to learn also the cdc schema, and at the end the schema should point to the learned cdc schema. This is needed because the linked cdc schema is used for generating cdc mutations, and when we process the mutations later it is assumed in some places that the mutation's schema has a schema registry entry. We fix a scenario where we could end up with a schema that points to a cdc schema that doesn't have a schema registry entry. This could happen for example if the schema is loaded before it is learned, so when we learn it we see that it already has an entry. In that case, we need to set the cdc schema to the learned cdc schema as well, because it could have been loaded previously with a cdc schema that was not learned. Fixes scylladb/scylladb#27610 Closes scylladb/scylladb#27704	2025-12-17 20:01:00 +02:00
Michał Jadwiszczak	74ab5addd3	test/cluster/test_view_building_coordinator: fix flakiness in test_file_streaming The test generates a staging sstable on a node and verifies whether the view is correctly populated. However view updates generated by a staging sstable (`view_update_generator::generate_and_propagate_view_updates()`) aren't awaited by sstable consumer. It's possible that the view building coordinator may see the task as finished (so the staging sstable was processed) but not all view updates were writted yet. This patch fixes the flakiness by waiting until `scylla_database_view_update_backlog` drops down to 0 on all shards. Fixes scylladb/scylladb#26683 Closes scylladb/scylladb#27389	2025-12-17 17:29:15 +01:00
Michael Litvak	55f4a2b754	migration_listener: fix deadlock in nested notifications When calling a migration notification from the context of a notification callback, this could lead to a deadlock with unregistering a listener: A: the parent notification is called. it calls thread_for_each, where it acquires a read lock on the vector of listeners, and calls the callback function for each listener while holding the lock. B: a listener is unregistered. it calls `remove` and tries to acquire a write lock on the vector of listeners. it waits because the lock is held. A: the callback function calls another notification and calls thread_for_each which tries to acquire the read lock again. but it waits since there is a waiter. Currently we have such concrete scenario when creating a table, where the callback of `before_create_column_family` in the tablet allocator calls `before_allocate_tablet_map`, and this could deadlock with node shutdown where we unregister listeners. Fix this by not acquiring the read lock again in the nested notification. There is no need because the read lock is already held by the parent notification while the child notification is running. We add a function `thread_for_each_nested` that is similar to `thread_for_each` except it assumes the read lock is already held and doesn't acquire it, and it should be used for nested notifications instead of `thread_for_each`. Fixes scylladb/scylladb#27364 Closes scylladb/scylladb#27637	2025-12-17 14:00:28 +01:00
Emil Maskovsky	9431826c52	topology: allow removing nodes not having tokens For the changes to go through the left_token_ring state when REMOVENODE_WITH_LEFT_TOKEN_RING feature is enabled, we need to allow removing nodes to not have any tokens (similarly to decommissioning nodes, which use the same sequence of states). This means the tests also need to change to allow for this new behavior - it can temporarily happen that a removing node has no tokens but is still part of Raft group 0 (so there may be a temporary mismatch between the token ring and group 0 membership). Therefore, the `check_token_ring_and_group0_consistency` function is replaced by `wait_for_token_ring_and_group0_consistency`, which waits up to 30 seconds for consistency to be reached.	2025-12-17 13:31:11 +01:00
Tomasz Grabiec	c077283352	Merge 'service: support conversion of tablet keyspaces to rack-list using ALTER KEYSPACE' from Aleksandra Martyniuk If a keyspace has a numeric replication factor in a DC and rf < #racks, then the replicas of tablets in this keyspace can be distributed among all racks in the DC (different for each tablet). With rack list, we need all tablet replicas to be placed on the same racks. Hence, the conversion requires tablet co-location. After this series, the conversion can be done using ALTER KEYSPACE statement. The statement that does this conversion in any DC is not allowed to change a rf in any DC. So, if we have dc1 and dc2 with 3 racks each and a keyspace ks then with a single ALTER KEYSPACE we can do: - {dc1 : 2} -> {dc1 : [r1, r2]}; - {dc1 : 2, dc2: 2} -> {dc1 : [r1, r2], dc2: [r2,r3]}; - {dc1 : 2, dc2: 2} -> {dc1 : [r1, r2], dc2: 2} - {dc1 : 2} -> {dc1 : 2, dc2 : [r1]} But we cannot do: - {dc1 : 2} -> {dc1 : [r1, r2, r3]}; - {dc1 : 1, dc2 : [r1, r2] → dc1: [r1], dc2: [r1]. In order to do the co-locations rf change request is paused. Tablet load balancer examines the paused rf change requests and schedules necessary tablet migrations. During the process of co-location, no other cross-rack migration is allowed. Load balancer checks whether any paused rf change request is ready to be resumed. If so, it puts the request back to global topology request queue. While an rf change request for a keyspace is running, any other rf change of this keyspace will fail. Fixes: #26398. New feature, no backport Closes scylladb/scylladb#27279 * github.com:scylladb/scylladb: test: add est_rack_list_conversion_with_two_replicas_in_rack test: test creating tablet_rack_list_colocation_plan test: add test_numeric_rf_to_rack_list_conversion test tasks: service: add global_topology_request_virtual_task cql3: statements: allow altering from numeric rf to rack list service: topology_coordinator: pause keyspace_rf_change request service: implement make_rack_list_colocation_plan service: add tablet_rack_list_colocation_plan cql3: reject concurrent alter of the same keyspace test: check paused rf change requests persistence db: service: add paused_rf_change_requests to system.topology service: pass topology and system_keyspace to load_balancer ctor service: tablet_allocator: extract load updates service: tablet_allocator: extract ensure_node tasks, system_keyspace: Introduce get_topology_request_entry_opt() node_ops: Drop get_pending_ids() node_ops: Drop redundant get_status_helper()	2025-12-17 10:05:06 +01:00
Łukasz Paszkowski	a61c221902	test/pylib/suite/python.py: Handle extra_cmdline_options correctly runner.py defines a command-line option `--extra-scylla-cmdline-options` with the default type=str. However, the function `merge_cmdline_options`, which consumes this value to merge command-line options from multiple sources, expects a list of strings. This mismatch results in the following exception: ``` raise ValueError(f'invalid argument name {name}, all args {args}') ValueError: invalid argument name o, all args --logger-log-level repair=debug --default-log-level=error ``` when a test is run with pytest using: `--extra-scylla-cmdline-options='--logger-log-level repair=debug --default-log-level=error'` Fix this by handling the option consistently and calling `.split()`. Also change the default value from an empty list to an empty string to avoid confusion both in runner.py and test.py. Closes scylladb/scylladb#27523	2025-12-16 20:14:43 +03:00
Pavel Emelyanov	c4496dd63c	Merge 'test/cqlpy: rename tests with duplicate name' from Nadav Har'El When translating Cassandra's unit tests, in a couple of places I accidentally used the same name for two tests, resulting in the first of each pair to never running. Let's fix the name of the second of the each pair to be the real name it had in the original Cassandra test. Closes scylladb/scylladb#27644 * github.com:scylladb/scylladb: test/cqlpy: rename test with duplicate name test/cqlpy: rename test with duplicate name	2025-12-16 19:32:20 +03:00
Nadav Har'El	84df5cfaf8	test/alternator: delete unnecessary "pass" Fixing something that never bothered anyone but our automated "code quality" tool: there's an unnecessary call to "pass" in one of our tests. Just remove it. Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Closes scylladb/scylladb#27645	2025-12-16 19:29:23 +03:00
Botond Dénes	f06db096bd	test/boost/reader_concurrency_semaphore_test: un-flake memory limit engages test This test was observed to fail multiple times recently in promotion, because there were successful reads. The failure only reproduces on arm64, it doesn't reproduce on x86. The suspected reason is that the data set is too close to the edge, where all reads fail due to too high memory consumption. Reduce the number of sstables used by this test to 54 (from 64). Fixes: #27248 Closes scylladb/scylladb#27650	2025-12-16 19:24:49 +03:00
Pavel Emelyanov	31f90c089c	Merge 'test/alternator: remove unused variable assignments and statements' from Nadav Har'El Copilot found in test/alternator a bunch of places where we unnecessarily assign a variable that we don't use, or had a duplicated statement which doesn't do anything. This patch fixes all of them. AI still doesn't know how to prepare a patch that looks anything close to reasonable, so I did this part manually, and also carefully investigated each and every change (this took a lot of human time). These patches don't change anything in the functionality of any of the tests. It's all cosmetic. Closes scylladb/scylladb#27655 * github.com:scylladb/scylladb: test/alternator: remove unnecessary duplicate statement test/alternator: remove unused variable assignments	2025-12-16 19:23:34 +03:00
Nadav Har'El	c58739de6a	Fix for Unused local variable Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Closes scylladb/scylladb#27665	2025-12-16 19:20:53 +03:00
Piotr Dulikowski	7900aa5319	Merge 'server: fix scheduling group update timing in system.clients' from Alex Dathskovsky Previously, the scheduling_group column was updated during the switch_tenant function, which meant the update occurred only after the tenant change operation completed—updating rows one by one. With this change, the scheduling_group column is now updated before the switch_tenant logic runs, ensuring that the table reflects the correct scheduling groups for all rows as early as possible. fixes: #26060 fixes: #27295 backport: not required this is a minor bug fix. Internal logic worked but the user couldnt see the change in the table if they would read the system.clients table Closes scylladb/scylladb#26404 * github.com:scylladb/scylladb: test: cqlpy: Remove test_switch_tenants and add test in cluster testing. The test needs to run twice, in two separate Scylla runs, using two different modes: gossip and raft. The cluster framework supports this setup, while cqlpy only runs against Scylla instances in raft mode. Therefore, the test was moved from cqlpy to the cluster-based framework. This commit both adds the test in cluster/ and removes the old version in cqlpy/. server: Refactor update_control_connection_scheduling_group functionality This refactoring moves the logic that retrieves the scheduling group for driver_service_level_name out of switch_tenant. This change is possible because the scheduling group for the driver is retrieved from a map (LOOKUP). The lookup function is fully synchronized, non-coroutine, and returns immediately. For that reason, it’s better to perform this lookup outside of the switch_tenant function. server: Refactor scheduling group update functionality. This change generalizes the scheduling-group update functionality and removes some copy-paste code, improving overall readability and maintainability. To achieve this, capturing lambdas were introduced. As a result, self-deducing this was added to those lambdas to avoid coroutine-related issues (“coroutine fiasco”). server: Fix switch_tenant problem, When running on a V2 server, service-level data comes from service level cache. Because of this, we can use synchronized function to get the schedualing group. Since we are transitioning to a Raft-based architecture where all servers will be V2, we can safely implement this fix specifically for that case. This change adds get_cached_user_scheduling_group functionality and moves its usage out of switch_tenant function in update_scheduling_group_v2 usage. server: Add update_service_level_scheduling_group_v1 functions to create placehholder for functionality that will introduce v2 implementation. The new functionality will allow usage of service level cache	2025-12-16 15:39:49 +01:00
Aleksandra Martyniuk	9d20f0a3d2	test: add est_rack_list_conversion_with_two_replicas_in_rack	2025-12-16 13:31:24 +01:00
Aleksandra Martyniuk	0476e8d272	test: test creating tablet_rack_list_colocation_plan	2025-12-16 13:31:24 +01:00
Aleksandra Martyniuk	e48789cf6c	test: add test_numeric_rf_to_rack_list_conversion test	2025-12-16 13:31:24 +01:00
Aleksandra Martyniuk	1884e655d6	cql3: statements: allow altering from numeric rf to rack list Allow altering from numeric replication factor to rack list. Ensure that a single ALTER KEYSPACE statement doesn't try to both convert to rack list and change rf.	2025-12-16 13:29:08 +01:00
Aleksandra Martyniuk	b3a0e4c2dc	test: check paused rf change requests persistence	2025-12-16 13:25:38 +01:00
Aleksandra Martyniuk	d66a36058b	service: pass topology and system_keyspace to load_balancer ctor Pass a pointer to service::topology and db::system_keyspace to load balancer. It will be used in the following patches to create rack_list_colocation plan.	2025-12-16 13:25:38 +01:00
Patryk Jędrzejczak	73db5c94de	Merge 'db: api: service: introduce system.client_routes table and related API endpoints' from Andrzej Jackowski `system.client_routes` is a system table that sets the target address and ports for each `host_id`, for one or more connection (e.g., Private Link) represented by `connection_id`. Cloud will write the table via REST, and drivers will read it via CQL to override values obtained from `system.local` and `system.peers`. This patch series contains: - Introduction of `CLIENT_ROUTES` feature flag. - Implementation of raft-based `system.client_routes` table - Implementation of `v2/client-routes` POST/DELETE/GET endpoints - Implementation of new `CLIENT_ROUTES_CHANGE` event that is sent to drivers when `system.client_routes` is changed - New tests that verifies the aforementioned features Ref: scylladb/scylla-enterprise#5699 For now, no automatic backport. However, the changes are planned to be release on `2025.4` either as a backport or a private build. Closes scylladb/scylladb#27323 * https://github.com/scylladb/scylladb: docs: describe CLIENT_ROUTES_CHANGE extension test: add test for CLIENT_ROUTES event service: transport: add CLIENT_ROUTES_CHANGE event test: add cluster tests for client routes test: add API tests for client_routes endpoints test: add `timeout` parameter to `delete` in RESTClient test: allow json_body in send api: implement client_routes endpoints api: add client_routes.json service: main: add client_routes_service db: add system.client_routes table gms: add CLIENT_ROUTES feature	2025-12-16 10:38:27 +01:00
Botond Dénes	85f05fbe1b	Revert "Merge 'Add digests for all sstable components in scylla metadata' from Taras Veretilnyk" This reverts commit `866c96f536`, reversing changes made to `367633270a`. This change caused all longevities to fail, with a crash in parsing scylla-metadata. The investigation is still ongoing, with no quick fix in sight yet. Fixes: #27496 Closes scylladb/scylladb#27518	2025-12-16 11:34:40 +02:00
Botond Dénes	dace39fd6c	Merge 'Make commitlog replay handle files with corrupt file header (non-zero) as data loss, not startup failure' from Calle Wilund Fixes #26744 If a segment to replay is broken such that the main header is not zero, but still broken, we throw header_checksum_error. This was not handled in replayer, which grouped this into the "user error/fundamental problem" category. However, assuming we allow for "real" disk corruption, this should really be treated same as data corruption, i.e. reported data loss, not failure to start up. The `test_one_big_mutation_corrupted_on_startup` test accidentally sometimes provoked this issue, by doing random file wrecking, which on rare occasions provoked this, and thus failed test due to scylla not starting up, instead of losing data as expected. Closes scylladb/scylladb#27556 * github.com:scylladb/scylladb: test::cluster::dtest::tools::files: Remove file commitlog_replay: Handle fully corrupt files same as partial corruption. test::pylib::suite::base: Split options.name test specifier only once	2025-12-16 06:55:42 +02:00
Michał Chojnowski	df93ea626b	test/scylla_gdb: use `gcore` instead of `signal SIGSEGV` to generate a coredump on failure The test fails in CI sometimes, and we want a coredump from a failure to debug that. We made the test send a `signal SIGSEGV` to Scylla on failure, but apparently that doesn't work as intended on our CI hosts. (The CI runner seemingly can't find any coredump afterwards). We can use gdb's `gcore` command to produce a coredump in a more predictable way. Refs scylladb/scylladb#22501 Closes scylladb/scylladb#27498	2025-12-16 06:53:43 +02:00
Botond Dénes	74347625f9	Merge 'test/alternator: add reproducers for more issues' from Nadav Har'El This series adds an xfailing reproducers for two issue: #8070 and #27037: 27037 is about where even with alternator_streams_increased_compatibility set to true, if an attribute is set to the same value it had but using a different JSON representation - a Alternator Streams event is unduly produced. 8070 is about the ability to write malformed values into the database and then fail during read - instead of failing, as expected, during the write. This issue was known for years, but we never really had a reproducer for it - it's not possible to reproduce it using clean boto3 code and we need to build a request manually. The first two patches are two small cleanups (including fixes #27372) that I did while preparing the real tests - which are in the final two patches. Closes scylladb/scylladb#27376 * github.com:scylladb/scylladb: test/alternator: add reproducer for bug with storing invalid values test/alternator: reproducer for issue 27375 utils/rjson: fix error messages from rjson::parse() test/alternator: extract get_signed_request() to util.py	2025-12-16 06:53:14 +02:00
Andrzej Jackowski	61bbea51ad	test: add test for CLIENT_ROUTES event Ref: scylladb/scylla-enterprise#5699	2025-12-15 18:19:37 +01:00

1 2 3 4 5 ...

10340 Commits