scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-25 01:02:20 +00:00

Author	SHA1	Message	Date
Nadav Har'El	d32fe72252	Merge 'alternator: check concurrency limit before memory acquisition' from Łukasz Paszkowski Fix the ordering of the concurrency limit check in the Alternator HTTP server so it happens before memory acquisition, and reduce test pressure to avoid LSA exhaustion on the memory-constrained test node. The patch moves the concurrency check to right after the content-length early-out, before any memory acquisition or I/O. The check was originally placed before memory acquisition but was inadvertently moved after it during a refactoring. This allowed unlimited requests to pile up consuming memory, reading bodies, verifying signatures, and decompressing — all before being rejected. Restores the original ordering and mirrors the CQL transport (`transport/server.cc`). Lowers `concurrent_requests_limit` from 5 to 3 and the thread multiplier from 5 to 2 (6 threads instead of 25). This is still sufficient to reliably trigger RequestLimitExceeded, while keeping flush pressure within what 512MB per shard can sustain. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-1248 Fixes https://scylladb.atlassian.net/browse/SCYLLADB-1181 The test started to fail quite recently. It affects master only. No backport is needed. We might want to consider backporting a commit moving the concurrency check earlier. Closes scylladb/scylladb#29272 * github.com:scylladb/scylladb: test: reduce concurrent-request-limit test pressure to avoid LSA exhaustion alternator: check concurrency limit before memory acquisition	2026-03-29 11:08:28 +03:00
Łukasz Paszkowski	b8e3ef0c64	test: reduce concurrent-request-limit test pressure to avoid LSA exhaustion The test_limit_concurrent_requests dtest uses concurrent CreateTable requests to verify Alternator's concurrency limiting. Each admitted CreateTable triggers Raft consensus, schema mutations, and memtable flushes—all of which consume LSA memory. On the 1 GB test node (2 SMP × 512 MB), the original settings (limit=5, 25 threads) created enough flush pressure to exhaust the LSA emergency reserve, producing logalloc::bad_alloc errors in the node log. The test was always marginal under these settings and became flaky as new system tables increased baseline LSA usage over time. Lower concurrent_requests_limit from 5 to 3 and the thread multiplier from 5 to 2 (6 threads total). This is still well above the limit and sufficient to reliably trigger RequestLimitExceeded, while keeping flush pressure within what 512 MB per shard can sustain.	2026-03-28 20:40:33 +01:00
Łukasz Paszkowski	a86928caa1	alternator: check concurrency limit before memory acquisition The concurrency limit check in the Alternator server was positioned after memory acquisition (get_units), request body reading (read_entire_stream), signature verification, and decompression. This allowed unlimited requests to pile up consuming memory before being rejected, exhausting LSA memory and causing logalloc::bad_alloc errors that cascade into Raft applier and topology coordinator failures, breaking subsequent operations. Without this fix, test_limit_concurrent_requests on a 1GB node produces 50 logalloc::bad_alloc errors and cascading failures: reads from system.scylla_local fail, the Raft applier fiber stops, the topology coordinator stops, and all subsequent CreateTable operations fail with InternalServerError (500). With this fix, the cascade is eliminated -- admitted requests may still cause LSA pressure on a memory-constrained node, but the server remains functional. Move the concurrency check to right after the content-length early-out, before any memory acquisition or I/O. This mirrors the CQL transport which correctly checks concurrency before memory acquisition (transport/server.cc). The concurrency check was originally added in `1b8c946ad7` (Sep 2020) before memory acquisition, which at the time lived inside with_gate (after the concurrency gate). The ordering was inverted by `f41dac2a3a` (Mar 2021, "avoid large contiguous allocation for request body"), which moved get_units() earlier in the function to reserve memory before reading the newly-introduced content stream -- but inadvertently also moved it before the concurrency check. `c3593462a4` (Mar 2025) further worsened the situation by adding a 16MB fallback reservation for requests without Content-Length and ungzip/deflate decompression steps -- all before the concurrency check -- greatly increasing the memory consumed by requests that would ultimately be rejected.	2026-03-28 20:40:33 +01:00
Aleksandra Martyniuk	166b293d06	test: add test_failed_tablet_rebuild_is_retried_on_alter Test if alter keyspace statement with the current rf values will fix the state of replicas.	2026-03-27 17:29:31 +01:00
Aleksandra Martyniuk	9ec54a8207	test: add a test to ensure that failed rebuilds are retried	2026-03-27 17:29:31 +01:00
Aleksandra Martyniuk	200dc084c5	service: fail ALTER KEYSPACE if replicas do not satisfy the replication RF change of tablet keyspace starts tablet rebuilds. Even if any of the rebuilds is rolled back (because pending replica was excluded), rf change request finishes successfully. Yet, we are left with not enough replicas. Then, a next new rf change request handler would generate a rebuild of two replicas of the same tablet. Such a transition would not be applied, as we don't allow many pending replicas. An exception would be thrown and the request would be retried infinitely, blocking the topology coordinator. Throw and fail rf change request if there is not enough replicas. The request should be retried later, after the issue is fixed by the mechanism introduced in previous changes.	2026-03-27 17:29:26 +01:00
Aleksandra Martyniuk	7951f92270	service: retry failed tablet rebuilds RF change of tablet keyspace starts tablet rebuilds. Even if any of the rebuilds is rolled back (because pending replica was excluded), rf change request finishes successfully. In this case we end up with the state of the replicas that isn't compatible with the expected keyspace replication. After this change, if topology_coordinator has nothing to do, it proceeds to check if the state of replicas reflects the keyspace replication. If there are any mismatches, the tablet rebuilds are scheduled. All required rebuilds of a single keyspace are scheduled together without respecting the node's load (just as it happens in case of keyspace rf change).	2026-03-27 17:26:45 +01:00
Aleksandra Martyniuk	6f1bba8faf	service: maybe_start_tablet_migration returns std::optional<group0_guard> maybe_start_tablet_migration takes an ownership of group0_guard and does not give it back, even if no work was done. In the following patches, we will proceed with different operations, if there are no migrations to be started. Thus, the guard would be needed. Return group0_guard from maybe_start_tablet_migration is no work was done.	2026-03-27 17:26:45 +01:00
Emil Maskovsky	9dad68e58d	raft: abort stale snapshot transfers when term changes The Bug Assertion failure: `SCYLLA_ASSERT(res.second)` in `raft/server.cc` when creating a snapshot transfer for a destination that already had a stale in-flight transfer. Root Cause If a node loses leadership and later becomes leader again before the next `io_fiber` iteration, the old transfer from the previous term can remain in `_snapshot_transfers` while `become_leader()` resets progress state. When the new term emits `install_snapshot(dst)`, `send_snapshot(dst)` tries to create a new entry for the same destination and can hit the assertion. The Fix Abort all in-flight snapshot transfers in `process_fsm_output()` when `term_and_vote` is persisted. A term/vote change marks existing transfers as stale, so we clean them up before dispatching messages from that batch and before any new snapshot transfer is started. With cross-term cleanup moved to the term-change path, `send_snapshot()` now asserts the within-term invariant that there is at most one in-flight transfer per destination. Fixes: SCYLLADB-862 Backport: The issue is reproducible in master, but is present in all active branches. Closes scylladb/scylladb#29092	2026-03-27 10:00:15 +01:00
Andrzej Jackowski	181ad9f476	Revert "audit: disable DDL by default" This reverts commit `c30607d80b`. With the default configuration, enabling DDL has no effect because no `audit_keyspaces` or `audit_tables` are specified. Including DDL in the default categories can be misleading for some customers, and ideally we would like to avoid it. However, DDL has been one of the default audit categories for years, and removing it risks silently breaking existing deployments that depend on it. Therefore, the recent change to disable DDL by default is reverted. Fixes: SCYLLADB-1155 Closes scylladb/scylladb#29169	2026-03-27 09:55:11 +01:00
Botond Dénes	854c374ebf	test/encryption: wait for topology convergence after abrupt restart test_reboot uses a custom restart function that SIGKILLs and restarts nodes sequentially. After all nodes are back up, the test proceeded directly to reads after wait_for_cql_and_get_hosts(), which only confirms CQL reachability. While a node is restarted, other nodes might execute global token metadata barriers, which advance the topology fence version. The restarted node has to learn about the new version before it can send reads/writes to the other nodes. The test issues reads as soon as the CQL port is opened, which might happen before the last restarted node learns of the latest topology version. If this node acts as a coordinator for reads/write before this happens, these will fail as the other nodes will reject the ops with the outdated topology fence version. Fix this by replacing wait_for_cql_and_get_hosts() on the abrupt-restart path with the more robus get_ready_cql(), which makes sure servers see each other before refreshing the cql connection. This should ensure that nodes have exchanged gossip and converged on topology state before any reads are executed. The rolling_restart() path is unaffected as it handles this internally. Fixes: SCYLLADB-557 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> Closes scylladb/scylladb#29211	2026-03-27 09:52:27 +01:00
Avi Kivity	b708e5d7c9	Merge 'test: fix race condition in test_crashed_node_substitution' from Sergey Zolotukhin `test_crashed_node_substitution` intermittently failed: ```python assert len(gossiper_eps) == (len(server_eps) + 1) ``` The test crashed the node right after a single ACK2 handshake (`finished do_send_ack2_msg`), assuming the node state was visible to all peers. However, since gossip is eventually consistent, the update may not have propagated yet, so some nodes did not see the failed node. This change: Wait until the gossiper state is visible on peers before continuing the test and asserting. Fixes: [SCYLLADB-1256](https://scylladb.atlassian.net/browse/SCYLLADB-1256). backport: this issue may affect CI for all branches, so should be backported to all versions. [SCYLLADB-1256]: https://scylladb.atlassian.net/browse/SCYLLADB-1256?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ Closes scylladb/scylladb#29254 * github.com:scylladb/scylladb: test: test_crashed_node_substitution: add docstring and fix whitespace test: fix race condition in test_crashed_node_substitution	2026-03-26 21:40:33 +02:00
Petr Gusev	c38e312321	test_lwt_fencing_upgrade: fix quorum failure due to gossip lag If lwt_workload() sends an update immediately after a rolling restart, the coordinator might still see a replica as down due to gossip lagging behind. Concurrently restarting another node leaves only one available replica, failing the LOCAL_QUORUM requirement for learn or eventually consistent sp::query() in sp::cas() and resulting in a mutation_write_failure_exception. We fix this problem by waiting for the restarted server to see 2 other peers. The server_change_version doesn't do that by default -- it passes wait_others=0 to server_start(). Fixes SCYLLADB-1136 Closes scylladb/scylladb#29234	2026-03-26 21:25:53 +02:00
bitpathfinder	627a8294ed	test: test_crashed_node_substitution: add docstring and fix whitespace Add a description of the test's intent and scenario; remove extra blanks.	2026-03-26 18:40:17 +01:00
bitpathfinder	5a086ae9b7	test: fix race condition in test_crashed_node_substitution `test_crashed_node_substitution` intermittently failed: ``` assert len(gossiper_eps) == (len(server_eps) + 1) ``` The test crashed the node right after a single ACK2 handshake ("finished do_send_ack2_msg"), assuming the node state was visible to all peers. However, since gossip is eventually consistent, the update may not have propagated yet, so some nodes did not see the failed node. This change: Wait until the gossiper state is visible on peers before continuing the test and asserting. Fixes: SCYLLADB-1256.	2026-03-26 18:25:05 +01:00
Robert Bindar	c575bbf1e8	test_refresh_deletes_uploaded_sstables should wait for sstables to get deleted SSTable unlinking is async, so in some cases it may happen that the upload dir is not empty immediately after refresh is done. This patch adjusts test_refresh_deletes_uploaded_sstables so it waits with a timeout till the upload dir becomes empty instead of just assuming the API will sync on sstables being gone. Fixes SCYLLADB-1190 Signed-off-by: Robert Bindar <robert.bindar@scylladb.com> Closes scylladb/scylladb#29215	2026-03-26 08:43:14 +03:00
Nikos Dragazis	8789c95a85	test: cluster: Add test for migration of multiple keyspaces Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:29 +02:00
Nikos Dragazis	25af8bdc24	test: cluster: Add test for error conditions Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:29 +02:00
Nikos Dragazis	01a51817c4	test: cluster: Add vnodes->tablets migration test (rollback) Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:29 +02:00
Nikos Dragazis	56ec33d3e0	test: cluster: Add vnodes->tablets migration test (1 table, 3 nodes) Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:29 +02:00
Nikos Dragazis	58e930c490	test: cluster: Add vnodes->tablets migration test (1 table, 1 node) This test runs the vnodes-to-tablets migration for a single table on a single-node cluster. The node has multiple shards and multiple power-of-two aligned vnodes, so resharding is triggered. More details in the docstring. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:29 +02:00
Nikos Dragazis	8837dac2f9	scylla-nodetool: Add migrate-to-tablets subcommand The vnodes-to-tablets migration is a manual procedure, so orchestration must be done via nodetool. This patch adds the following new commands: * nodetool migrate-to-tablets start {ks} * nodetool migrate-to-tablets upgrade * nodetool migrate-to-tablets downgrade * nodetool migrate-to-tablets status {ks} * nodetool migrate-to-tablets finalize {ks} The commands are just wrappers over the REST API. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:29 +02:00
Nikos Dragazis	2a5e6b832a	api: Add REST endpoint for vnode-to-tablet migration status If the keyspace is migrating, it reports the intended and actual storage mode for each node. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-25 19:11:24 +02:00
Marcin Maliszkiewicz	7fdd650009	Merge 'test: audit: clean up test helper class naming' from Dario Mirovic Remove unused `pytest.mark.single_node` marker from `TestCQLAudit`. Rename `TestCQLAudit` to `CQLAuditTester` to reflect that it is a test helper, not a test class. This avoids accidental pytest collection and subsequent warning about `__init__`. Logs before the fixes: ``` test/cluster/test_audit.py:514: 14 warnings /home/dario/dev/scylladb/test/cluster/test_audit.py:514: PytestCollectionWarning: cannot collect test class 'TestCQLAudit' because it has a __init__ constructor (from: cluster/test_audit.py) @pytest.mark.single_node ``` Fixes SCYLLADB-1237 This is an addition to the latest master code. No backport needed. Closes scylladb/scylladb#29237 * github.com:scylladb/scylladb: test: audit: rename TestCQLAudit to CQLAuditTester test: audit: remove unused pytest.mark.single_node	2026-03-25 15:30:16 +01:00
Radosław Cybulski	1dc20cc8f9	alternator/test: explain why 'always' write isolation mode is used in tests Improve test comments for test_streams_batchwrite_into_the_same_partition_deletes_existing_items and test_streams_batchwrite_into_the_same_partition_will_report_wrong_stream_data to explain why 'always' write isolation mode is required: in always_use_lwt mode all items in a batch get the same CDC timestamp, which triggers the squashing bug. In other modes each item gets a separate timestamp so the bug doesn't manifest. Also fix the example in the second test comment to use cleaner key values and correct event type (INSERT, not MODIFY, since items are inserted into an empty table), and fix the issue reference from #28452 (the PR) to #28439 (the issue).	2026-03-25 15:15:20 +01:00
Dario Mirovic	552a2d0995	test: audit: rename TestCQLAudit to CQLAuditTester pytest tries to collect tests for execution in several ways. One is to pick all classes that start with 'Test'. Those classes must not have custom '__init__' constructor. TestCQLAudit does. TestCQLAudit after migration from test/cluster/dtest is not a test class anymore, but rather a helper class. There are two ways to fix this: 1. Add __init__ = False to the TestCQLAudit class 2. Rename it to not start with 'Test' Option 2 feels better because the new name itself does not convey the wrong message about its role. Fixes SCYLLADB-1237	2026-03-25 13:21:08 +01:00
Dario Mirovic	73de865ca3	test: audit: remove unused pytest.mark.single_node Remove unused pytest.mark.single_node in TestCQLAudit class. This is a leftover from audit tests migration from test/cluster/dtest to test/cluster. Refs SCYLLADB-1237	2026-03-25 13:18:37 +01:00
Radosław Cybulski	ded62b2c5e	alternator/test: add scylla_only to always write isolation fixture Add scylla_only fixture dependency to the test_table_ss_new_and_old_images_write_isolation_always fixture. This ensures all tests using the 'always' write isolation mode are skipped when running against DynamoDB (--aws), since the system:write_isolation tag is a Scylla-only feature.	2026-03-25 12:38:09 +01:00
Radosław Cybulski	7d404cdd51	alternator: fix BatchWriteItem squashed Streams entries BatchWriteItem with items for the same partition (and write isolation set to always) will trigger LWT and run different cdc code path, which will result in wrong Streams data being returned to the user - changes will be randomly squashed together. For example batch write: batch.put_item(Item={'p': 'p', 'c': 'c0'}) batch.put_item(Item={'p': 'p', 'c': 'c1'}) batch.put_item(Item={'p': 'p', 'c': 'c2'}) instead of producing 3 modify / insert events will produce one: type=INSERT, key={'c': {'S': 'c0'}, 'p': {'S': 'p'}}, old_image=None, new_image={'c': {'S': 'c2'}, 'p': {'S': 'p'}} with `new_image` having different `c` key from `key` field. This happens because BatchWriteItem (when using LWT) emits it's changes to cdc under the same timestamp. This results in in all log entries being put in single cdc "bucket" (under the same cdc$timestamp key). Previous parsing algorithm would interpret those changes as a change to a single item and squash them together. The patch rewrites algorithm to use `std::unordered_map` for records based on value of clustering key, that is added to every cdc log entry. This allows rebuilding all item modifications. Fixes #28439 Fixes: SCYLLADB-540	2026-03-25 11:40:53 +01:00
Radosław Cybulski	85da03c88d	alternator: add BatchWriteItem test (failing) Add additional BatchWriteItem tests (some failing): - `test_streams_batchwrite_no_clustering_deletes_non_existing_items` `test_streams_batchwrite_no_clustering_deletes_existing_items` - those tests pass, we add it here for completness, as non clustering tables trigger different paths. - `test_streams_batchwrite_into_the_same_partition_deletes_existing_items` - failing test, that checks combinations of puts and deletes in a single batch write (so for example 3 items, 2 puts and 1 delete). - `test_streams_batchwrite_into_the_same_partition_will_report_wrong_stream_data` - failing simple test. Tests fail, because current implementation, when writing cdc log entries will squash all changes done to the same partition together. The data is still there, but when GetRecords is called and we parse cdc log entries, we don't correctly recover it (see issue #28439 for more details).	2026-03-25 11:40:53 +01:00
Marcin Maliszkiewicz	f988ec18cb	test/lib: fix port in-use detection in start_docker_service Previously, the result of when_all was discarded. when_all stores exceptions in the returned futures rather than throwing, so the outer catch(in_use&) could never trigger. Now we capture the when_all result and inspect each future individually to properly detect in_use from either stream. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-1216 Closes scylladb/scylladb#29219	2026-03-25 11:45:53 +02:00
Artsiom Mishuta	cd1679934c	test/pylib: use exponential backoff in wait_for() Change wait_for() defaults from period=1s/no backoff to period=0.1s with 1.5x backoff capped at 1.0s. This catches fast conditions in 100ms instead of 1000ms, benefiting ~100 call sites automatically. Add completion logging with elapsed time and iteration count. Tested local with test/cluster/test_fencing.py::test_fence_hints (dev mode), log output: wait_for(at_least_one_hint_failed) completed in 0.83s (4 iterations) wait_for(exactly_one_hint_sent) completed in 1.34s (5 iterations) Fixes SCYLLADB-738 Closes scylladb/scylladb#29173	2026-03-24 23:49:49 +02:00
Botond Dénes	d52fbf7ada	Merge 'test: cluster: Deflake test_startup_with_keyspaces_violating_rf_rack_valid_keyspaces' from Dawid Mędrek The test was flaky. The scenario looked like this: 1. Stop server 1. 2. Set its rf_rack_valid_keyspaces configuration option to true. 3. Create an RF-rack-invalid keyspace. 4. Start server 1 and expect a failure during start-up. It was wrong. We cannot predict when the Raft mutation corresponding to the newly created keyspace will arrive at the node or when it will be processed. If the check of the RF-rack-valid keyspaces we perform at start-up was done before that, it won't include the keyspace. This will lead to a test failure. Unfortunately, it's not feasible to perform a read barrier during start-up. What's more, although it would help the test, it wouldn't be useful otherwise. Because of that, we simply fix the test, at least for now. The new scenario looks like this: 1. Disable the rf_rack_valid_keyspaces configuration option on server 1. 2. Start the server. 3. Create an RF-rack-invalid keyspace. 4. Perform a read barrier on server 1. This will ensure that it has observed all Raft mutations, and we won't run into the same problem. 5. Stop the node. 6. Set its rf_rack_valid_keyspaces configuration option to true. 7. Try to start the node and observe a failure. This will make the test perform consistently. --- I ran the test (in dev mode, on my local machine) three times before these changes, and three times with them. I include the time results below. Before: ``` real 0m47.570s user 0m41.631s sys 0m8.634s real 0m50.495s user 0m42.499s sys 0m8.607s real 0m50.375s user 0m41.832s sys 0m8.789s ``` After: ``` real 0m50.509s user 0m43.535s sys 0m9.715s real 0m50.857s user 0m44.185s sys 0m9.811s real 0m50.873s user 0m44.289s sys 0m9.737s ``` Fixes SCYLLADB-1137 Backport: The test is present on all supported branches, and so we should backport these changes to them. Closes scylladb/scylladb#29218 * github.com:scylladb/scylladb: test: cluster: Deflake test_startup_with_keyspaces_violating_rf_rack_valid_keyspaces test: cluster: Mark test with @pytest.mark.asyncio in test_multidc.py	2026-03-24 21:09:19 +02:00
Patryk Jędrzejczak	141aa2d696	Merge 'test/cluster/test_incremental_repair.py: fix typo + enable compaction DEBUG logs' from Botond Dénes This PR contains two small improvements to `test_incremental_repair.py` motivated by the sporadic failure of `test_tablet_incremental_repair_and_scrubsstables_abort`. The test fails with `assert 3 == 2` on `len(sst_add)` in the second repair round. The extra SSTable has `repaired_at=0`, meaning scrub unexpectedly produced more unrepaired SSTables than anticipated. Since scrub (and compaction in general) logs at DEBUG level and the test did not enable debug logging, the existing logs do not contain enough information to determine the root cause. Commit 1 fixes a long-standing typo in the helper function name (`preapre` -> `prepare`). Commit 2 enables `compaction=debug` for the Scylla nodes started by `do_tablet_incremental_repair_and_ops`, which covers all `test_tablet_incremental_repair_and_` variants. This will capture full compaction/scrub activity on the next reproduction, making the failure diagnosable. Refs: SCYLLADB-1086 Backport: test improvement, no backport Closes scylladb/scylladb#29175 https://github.com/scylladb/scylladb: test/cluster/test_incremental_repair.py: enable compaction DEBUG logs in do_tablet_incremental_repair_and_ops test/cluster/test_incremental_repair.py: fix typo preapre -> prepare	2026-03-24 16:27:01 +01:00
Pavel Emelyanov	2d8540f1ee	transport: fix process_startup cert-auth path missing connection-ready setup When authenticate() returns a user directly (certificate-based auth, introduced in `20e9619bb1`), process_startup was missing the same post-authentication bookkeeping that the no-auth and SASL paths perform: - update_scheduling_group(): without it, the connection runs under the default scheduling group instead of the one mapped to the user's service level. - _authenticating = false / _ready = true: without them, system.clients reports connection_stage = AUTHENTICATING forever instead of READY. - on_connection_ready(): without it, the connection never releases its slot in the uninitialized-connections concurrency semaphore (acquired at connection creation), leaking one unit per cert-authenticated connection for the lifetime of the connection. The omission was introduced when on_connection_ready() was added to the else and SASL branches in `474e84199c` but the cert-auth branch was missed. Fixes: `20e9619bb1` ("auth: support certificate-based authentication") Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-03-24 18:02:46 +03:00
Pavel Emelyanov	da6fe14035	transport: test that connection_stage is READY after auth via all process_startup paths The cert-auth path in process_startup (introduced in `20e9619bb1`) was missing _ready = true, _authenticating = false, update_scheduling_group() and on_connection_ready(). The result is that connections authenticated via certificate show connection_stage = AUTHENTICATING in system.clients forever, run under the wrong service-level scheduling group, and hold the uninitialized-connections semaphore slot for the lifetime of the connection. Add a parametrized cluster test that verifies all three process_startup branches result in connection_stage = READY: - allow_all: AllowAllAuthenticator (no-auth path) - password: PasswordAuthenticator (SASL/process_auth_response path) - cert_bypass: CertificateAuthenticator with transport_early_auth_bypass error injection (cert-auth path -- the buggy one) The injection is added to certificate_authenticator::authenticate() so tests can bypass actual TLS certificate parsing while still exercising the cert-auth code path in process_startup. The cert_bypass case is marked xfail until the bug is fixed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-03-24 18:01:28 +03:00
Benny Halevy	1a7b013377	test: add test_sstable_clone_preserves_staging_state	2026-03-24 16:48:01 +02:00
Benny Halevy	22f2010477	test: derive sstable state from directory in test_env::make_sstable Instead of always passing sstable_state::normal, infer the state from the last component of the directory path by comparing against the known state subdirectory constants (staging_dir, upload_dir, quarantine_dir). Any unrecognized path component (the common case for normal-state sstables) maps to sstable_state::normal. When a non-normal state is detected, strip the state subdirectory from dir so that the base table directory is passed to storage.	2026-03-24 16:48:01 +02:00
Ernest Zaslavsky	c670183be8	cmake: fix precompiled header (PCH) creation Two issues prevented the precompiled header from compiling successfully when using CMake directly (rather than the configure.py + ninja build system): a) Propagate build flags to Rust binding targets reusing the PCH. The wasmtime_bindings and inc targets reuse the PCH from scylla-precompiled-header, which is compiled with Seastar's flags (including sanitizer flags in Debug/Sanitize modes). Without matching compile options, the compiler rejects the PCH due to flag mismatch (e.g., -fsanitize=address). Link these targets against Seastar::seastar to inherit the required compile options. Closes scylladb/scylladb#28941	2026-03-24 15:53:40 +02:00
Dawid Mędrek	e639dcda0b	test: cluster: Deflake test_startup_with_keyspaces_violating_rf_rack_valid_keyspaces The test was flaky. The scenario looked like this: 1. Stop server 1. 2. Set its rf_rack_valid_keyspaces configuration option to true. 3. Create an RF-rack-invalid keyspace. 4. Start server 1 and expect a failure during start-up. It was wrong. We cannot predict when the Raft mutation corresponding to the newly created keyspace will arrive at the node or when it will be processed. If the check of the RF-rack-valid keyspaces we perform at start-up was done before that, it won't include the keyspace. This will lead to a test failure. Unfortunately, it's not feasible to perform a read barrier during start-up. What's more, although it would help the test, it wouldn't be useful otherwise. Because of that, we simply fix the test, at least for now. The new scenario looks like this: 1. Disable the rf_rack_valid_keyspaces configuration option on server 1. 2. Start the server. 3. Create an RF-rack-invalid keyspace. 4. Perform a read barrier on server 1. This will ensure that it has observed all Raft mutations, and we won't run into the same problem. 5. Stop the node. 6. Set its rf_rack_valid_keyspaces configuration option to true. 7. Try to start the node and observe a failure. This will make the test perform consistently. --- I ran the test (in dev mode, on my local machine) three times before these changes, and three times with them. I include the time results below. Before: ``` real 0m47.570s user 0m41.631s sys 0m8.634s real 0m50.495s user 0m42.499s sys 0m8.607s real 0m50.375s user 0m41.832s sys 0m8.789s ``` After: ``` real 0m50.509s user 0m43.535s sys 0m9.715s real 0m50.857s user 0m44.185s sys 0m9.811s real 0m50.873s user 0m44.289s sys 0m9.737s ``` Fixes SCYLLADB-1137	2026-03-24 14:27:36 +01:00
Patryk Jędrzejczak	503a6e2d7e	locator: everywhere_replication_strategy: fix sanity_check_read_replicas when read_new is true ERMs created in `calculate_vnode_effective_replication_map` have RF computed based on the old token metadata during a topology change. The reading replicas, however, are computed based on the new token metadata (`target_token_metadata`) when `read_new` is true. That can create a mismatch for EverywhereStrategy during some topology changes - RF can be equal to the number of reading replicas +-1. During bootstrap, this can cause the `everywhere_replication_strategy::sanity_check_read_replicas` check to fail in debug mode. We fix the check in this commit by allowing one more reading replica when `read_new` is true. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-1147 Closes scylladb/scylladb#29150	2026-03-24 13:43:39 +01:00
Jenkins Promoter	0f02c0d6fa	Update pgo profiles - x86_64	2026-03-24 14:11:38 +02:00
Dawid Mędrek	4fead4baae	test: cluster: Mark test with @pytest.mark.asyncio in test_multidc.py One of the tests, test_startup_with_keyspaces_violating_rf_rack_valid_keyspaces, didn't have the marker. Let's add it now.	2026-03-24 12:52:00 +01:00
Botond Dénes	ffd58ca1f0	Merge 'test: cluster: Deflake test_write_cl_any_to_dead_node_generates_hints' from Dawid Mędrek Before these changes, we would send mutations to the node and immediately query the metrics to see how many hints had been written. However, that could lead to random failures of the test: even if the mutations have finished executing, hints are stored asynchronously, so we don't have a guarantee they have already been processed. To prevent such failures, we rewrite the check: we will perform multiple checks against the metrics until we have confirmed that the hints have indeed been written or we hit the timeout. We're generous with the timeout: we give the test 60 seconds. That should be enough time to avoid flakiness even on super slow machines, and if the test does fail, we will know something is really wrong. As a bonus, we improve the test in general too. We explicitly express the preconditions we rely on, as well as bump the log level. If the test fails in the future, it might be very difficult do debug it without this additional information. Fixes SCYLLADB-1133 Backport: The test is present on all supported branches. To avoid running into more failures, we should backport these changes to them. Closes scylladb/scylladb#29191 * github.com:scylladb/scylladb: test: cluster: Increase log level in test_write_cl_any_to_dead_node_generates_hints test: cluster: Await all mutations concurrently in test_write_cl_any_to_dead_node_generates_hints test: cluster: Specify min_tablet_count in test_write_cl_any_to_dead_node_generates_hints test: cluster: Use new_test_table in test_write_cl_any_to_dead_node_generates_hints test: cluster: Introduce auxiliary function keyspace_has_tablets test: cluster: Deflake test_write_cl_any_to_dead_node_generates_hints	2026-03-24 13:39:56 +02:00
Calle Wilund	f1b3bff4a5	dockerized_service: Convert log reader to pipes and push to test log Refs: SCYLLADB-1106 Ensures any stderr logs from mock services will echo to the test log regardless of the log file we write. To help debug failed CI.	2026-03-24 12:35:42 +01:00
Calle Wilund	38aaed1ed4	test::cluster::conftest::GSServer: Fix unpublish for when publish was not called Use checked dict access to check the set vars. Fixes: SCYLLADB-1106	2026-03-24 12:33:56 +01:00
Calle Wilund	b382f3593c	scylla_cluster: Use thread safe future signalling	2026-03-24 12:33:56 +01:00
Nikos Dragazis	d09196068c	api: Add REST endpoint for migration finalization The endpoint is the following: POST /storage_service/vnode_tablet_migrations/keyspaces/{keyspace}/finalization When called, it issues a `finalize_migration` topology request and waits for its completion. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-24 13:21:12 +02:00
Nikos Dragazis	c88ddecfca	topology_coordinator: Add `finalize_migration` request Vnodes-to-tablets migration needs a finalization step to finish or rollback the migration. Finishing the migration involves switching the keyspace schema to tablets and clearing the `intended_storage_mode` from system.topology. Rolling back the migration involves deleting the tablet maps and clearing the `intended_storage_mode`. The finalization needs to be done as a topology request to exclude with other operations such as repair and TRUNCATE. This patch introduces the `finalize_migration` global topology request for this purpose. The request takes a keyspace name as an argument. The direction of the finalization (i.e., forward path vs rollback) is inferred from the `intended_storage_mode` of all nodes (not ideal, should be made explicit). Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-24 13:20:39 +02:00
Nikos Dragazis	0e1e6ebdc5	database: Construct migrating tables with tablet ERMs Extend `database::add_column_family()` with a `storage_mode` argument. If the table is under vnodes-to-tablets migration and the storage mode is "tablets", create a tablet ERM. Make the distributed loader determine the storage mode from topology (`intended_storage_mode` column in system.topology). Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-03-24 13:20:39 +02:00

... 20 21 22 23 24 ...

53948 Commits