scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-23 08:12:08 +00:00

Author	SHA1	Message	Date
Avi Kivity	eec0b20dbc	cql3: statement_restrictions: prepare statement_restrictions for capturing `this` Prevent copying/moving, that can change the address, and instead enforce using shared_ptr. Most of the code is already using shared_ptr, so the changes aren't very large. To forbid non-shared_ptr construction, the constructors are annotated with a private_tag tag class.	2026-04-19 20:57:03 +03:00
Avi Kivity	374be94faa	test: statement_restrictions: add index_selection regression test In preparation for refactoring statement_restrictions, add a simple and an exhaustive regression test, encoding the index selection algorithm into the test. We cannot change the index selection algorithm because then mixed-node clusters will alter the sorting key mid-query (if paging takes place). Because the exhaustive space has such a large stack frame, and because Address Santizer bloats the stack frame, increase it for debug builds.	2026-04-19 20:57:01 +03:00
Artsiom Mishuta	dce0c24a02	test/alternator: replace bare pytest.skip() with typed skip helpers	2026-04-19 17:34:41 +02:00
Artsiom Mishuta	b078cd1e72	test: migrate new bare skips introduced by upstream after rebase Migrate 3 bare skip sites that appeared in upstream/master after the initial migration: - test/cluster/test_strong_consistency.py: 2 @pytest.mark.skip → @pytest.mark.skip_bug (SCYLLADB-1056) - test/cqlpy/conftest.py: pytest.skip() → skip_env() in skip_on_scylla_vnodes fixture	2026-04-19 17:34:41 +02:00
Artsiom Mishuta	9c4d3ce097	test/pylib: reject bare pytest.mark.skip and add codebase guards Harden the skip_reason_plugin to reject bare @pytest.mark.skip at collection time with pytest.UsageError instead of warnings.warn(). Add test/pylib_test/test_no_bare_skips.py with three guard tests: - AST scan for bare pytest.skip() runtime calls - Real pytest --collect-only against all Python test directories	2026-04-19 17:34:31 +02:00
Avi Kivity	9fb67e3e96	Revert "alternator: optional stripping of http response headers" This reverts commit `73f0deef6d`. It prevents `2943d30b0c`, which causes high flakiness, from being reverted.	2026-04-19 15:14:48 +03:00
Artsiom Mishuta	0b6b380b80	test: update comments referencing pytest.skip() to skip_env() Update 7 comments/docstrings across 5 files that still referenced pytest.skip() to reference the typed skip_env() wrapper for consistency with the migrated code.	2026-04-19 11:14:03 +02:00
Artsiom Mishuta	b10028e556	test: migrate runtime pytest.skip() to typed skip_bug() Migrate 2 runtime pytest.skip() calls referencing known bugs to use the typed skip_bug() wrapper from test.pylib.skip_types: - test/alternator/test_ttl.py: Streams on tablets (#23838) - test/scylla_gdb/test_task_commands.py: coroutine task not found (#22501)	2026-04-19 11:10:42 +02:00
Artsiom Mishuta	8a80e2c3be	test: migrate runtime pytest.skip() to typed skip_env() Migrate runtime pytest.skip() calls across 34 files to use the typed skip_env() wrapper from test.pylib.skip_types. These sites skip at runtime because a required feature, config option, library version, build mode, or runtime topology is not available. Also fixes 'raise pytest.skip(...)' in test_audit.py — skip_env() already raises internally, so the explicit raise was incorrect. Each file gains one new import: from test.pylib.skip_types import skip_env	2026-04-19 11:09:29 +02:00
Artsiom Mishuta	fb0974a329	test: migrate bare @pytest.mark.skip to skip_not_implemented Migrate 2 bare @pytest.mark.skip decorators (no reason string) to @pytest.mark.skip_not_implemented with an explicit reason referencing issue #3882 (COMPACT STORAGE not implemented).	2026-04-19 11:06:30 +02:00
Artsiom Mishuta	a39fb9d29a	test: migrate @pytest.mark.skip to @pytest.mark.skip_slow Migrate 4 @pytest.mark.skip decorator sites to @pytest.mark.skip_slow across 3 test files where the skip reason indicates a slow test.	2026-04-19 11:06:30 +02:00
Artsiom Mishuta	638efedc3c	test: migrate @pytest.mark.skip to @pytest.mark.skip_not_implemented Migrate 10 @pytest.mark.skip decorator sites to @pytest.mark.skip_not_implemented across 5 test files where the skip reason indicates a feature not yet implemented.	2026-04-19 11:06:30 +02:00
Artsiom Mishuta	465636bc53	test: migrate @pytest.mark.skip to @pytest.mark.skip_bug for known bugs Migrate 24 @pytest.mark.skip decorator sites to @pytest.mark.skip_bug across 16 test files where the reason references a known bug or issue.	2026-04-19 11:06:30 +02:00
Szymon Malewski	73f0deef6d	alternator: optional stripping of http response headers In Alternator's HTTP API, response headers can dominate bandwidth for small payloads. The Server, Date, and Content-Type headers were sent on every response but many clients never use them. This patch introduces three Alternator config options: - alternator_http_response_server_header, - alternator_http_response_disable_date_header, - alternator_http_response_disable_content_type_header, which allow customizing or suppressing the respective HTTP response headers. All three options support live update (no restart needed). The Server header is no longer sent by default; the Date and Content-Type defaults preserve the existing behavior. The Server and Date header suppression uses Seastar's set_server_header() and set_generate_date_header() APIs added in https://github.com/scylladb/seastar/pull/3217. This patch also fixes deprecation warnings from older Seastar HTTP APIs. Tests are in test/alternator/test_http_headers.py. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-70 Closes scylladb/scylladb#28288	2026-04-19 09:22:04 +03:00
Nadav Har'El	f83270df12	Merge 'alternator/streams: Block tablet merges for Alternator Streams on tablet tables' from Piotr Szymaniak DynamoDB Streams API can only convey a single parent per stream shard. Tablet merges produce two parents, making them incompatible with Alternator Streams. This series blocks tablet merges when streams are active on a tablet table. For CreateTable, a freshly created table has no pending merges, so streams are enabled immediately with tablet merges blocked. For UpdateTable on an existing table, stream enablement is deferred: the user's intent is stored via `enable_requested`, tablet merges are blocked (new merge decisions are suppressed and any active merge decision is revoked), and the topology coordinator finalizes enablement once no in-flight merges remain. The topology coordinator is woken promptly on error injection release and tablet split completion, reducing finalization latency from ~60s to seconds. `test_parent_children_merge` is marked xfail (merges are now blocked), and downward (merge) steps are removed from `test_parent_filtering` and `test_get_records_with_alternating_tablets_count`. Not addressed here: using a topology request to preempt long-running operations like repair (tracked in SCYLLADB-1304). Refs SCYLLADB-461 Closes scylladb/scylladb#29224 * github.com:scylladb/scylladb: topology: Wake coordinator promptly for stream enablement lifecycle test/cluster: Test deferred stream enablement on tablet tables alternator/streams: Block tablet merges when Alternator Streams are enabled	2026-04-19 09:15:13 +03:00
Nadav Har'El	0d05e3b4a4	alternator: fix ListStreams paging if table is deleted during paging Currently, ListStreams paging works by looking in the list of tables for ExclusiveStartStreamArn and starting there. But it's possible that during the paging process, one of the tables got deleted and ExclusiveStartStreamArn no longer points to an existing table. In the current implementation this caused the paging to stop (think it reached the end). The solution is simple: ListStreams will now sort the list of tables by name (it anyway needs to be sorted by something to be consistent across pages), and will look with std::upper_bound for the first table after the ExclusiveStartStreamArn - we don't need to find that table name itself. The patch also includes a test reproducing this bug. As usual, the test passes on DynamoDB, fails on Alternator before this patch, and passes with the patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-19 09:12:02 +03:00
Nadav Har'El	930fb4c330	test/alternator: test DescribeStream on non-existent table We already had a test for DescribeStream being called on a bogus ARN returns a ValidationException. But if the stream is more legitimate- looking but refers to a non-existent table (e.g., an ARN taken in the past from a table that no longer exists), we should return ResourceNotFoundException. In this patch we add a test that verifies we indeed do this correctly. Moreover, Alternator's current stream ARNs include both a keyspace name and a table name, and either one being incorrect should lead to ResourceNotFoundException, and indeed the new test validates that it works as expected - there is no bug here (AI guessed we have a bug in the missing keyspace case, but this guess was wrong).	2026-04-19 09:12:02 +03:00
Nadav Har'El	02d474fca8	alternator: ListStreams: on last page, avoid LastEvaluatedStreamArn When ListStreams is on its last page and ran out streams to list, it shouldn't return a paging cookie (LastEvaluatedStreamArn) at all. Before this patch it does, and forces the user to make another call just to get another empty page, which is silly. This patch includes a fix and a reproducer test (that, as usual, passes on DynamoDB and fails on Alternator before the patch and succeeds after). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-19 09:12:02 +03:00
Nadav Har'El	1ac910c2ab	alternator: fix ListStreams to return real ARN as LastEvaluatedStreamArn Alternator Streams' "ListStreams" does paging by returning a "cookie" LastEvaluatedStreamArn from one request, that the user passes to the next request as ExclusiveStartStreamArn. In the past, Alternator's stream ARNs were UUIDs, but we recently changed them to match DynamoDB's ARN format which the KCL library requires. However, we didn't change ListStream's cookie format, and it remained UUIDs. This, however, goes against the documentation of DynamoDB, which states that LastEvaluatedStreamArn should be "the stream ARN of the item where the operation stopped". It shouldn't be some weird opaque cookie. So in this patch we add a test that confirms that indeed, in DynamoDB the LastEvaluatedStreamARN is really the last returned ARN and not an opaque cookie. The new test passes on DynamoDB, and fails on Alternator before the simple fix that this patch then does. Fixes SCYLLADB-539.	2026-04-19 09:12:01 +03:00
Piotr Szymaniak	a5d35d2b4c	test/cluster: Test deferred stream enablement on tablet tables Async cluster test exercising the deferred enablement lifecycle: ENABLING -> ENABLED -> disabled, verifying tablet merge blocking and unblocking at each stage. Uses delay_cdc_stream_finalization error injection and CQL ALTER TABLE with tablet count constraints. Also adds tablet scheduler config to test_config.yaml (fast refresh interval, scale factor 1) for reliable tablet count changes.	2026-04-19 03:54:33 +02:00
Piotr Szymaniak	4b6937b570	alternator/streams: Block tablet merges when Alternator Streams are enabled DynamoDB Streams API can only convey a single parent per stream shard. Tablet merges produce 2 parents, which is incompatible. When streams are requested on a tablet table, block tablet merges via tablet_merge_blocked (the allocator suppresses new merge decisions and revokes any active merge decision). add_stream_options() sets tablet_merge_blocked=true alongside enabled=true, so CreateTable needs no special handling — the flag is inert on vnode tables and immediately effective on tablet tables. For UpdateTable, CDC enablement is deferred: store the user's intent via enable_requested, and let the topology coordinator finalize enablement once no in-progress merges remain. A new helper, defer_enabling_streams_block_tablet_merges(), amends the CDC options to this deferred state. Disabling streams clears all flags, immediately re-allowing merges. The tablet allocator accesses the merge-blocked flag through a schema::tablet_merges_forbidden() accessor rather than reaching into CDC options directly. Mark test_parent_children_merge as xfail and remove downward (merge) steps from tablet_multipliers in test_parent_filtering and test_get_records_with_alternating_tablets_count.	2026-04-19 03:54:33 +02:00
Avi Kivity	f5886b4fdd	Merge 'Add virtual task for vnodes-to-tablets migrations' from Nikos Dragazis This PR exposes vnodes-to-tablets migrations through the task manager API via a virtual task. This allows users to list, query status, and wait on ongoing migrations through a standard interface, consistent with other global operations such as tablet operations and topology requests are already exposed. The virtual task exposes all migrations that are currently in progress. Each migrating keyspace appears as a separate task, identified by a deterministic name-based (v3) UUID derived from the keyspace name. Progress is reported as the number of nodes that have switched to tablets vs. the total. The number increases on the forward path and decreases on rollback. The task is not abortable - rolling back a migration requires a manual procedure. The `wait` API blocks until the migration either completes (returning `done`) or is rolled back (returning `suspended`). Example output: ``` $ scylla nodetool tasks list vnodes_to_tablets_migration task_id type kind scope state sequence_number keyspace table entity shard start_time end_time 1747b573-6cd6-312d-abb1-9b66c1c2d81f vnodes_to_tablets_migration cluster keyspace running 0 ks 0 $ scylla nodetool tasks status 1747b573-6cd6-312d-abb1-9b66c1c2d81f id: 1747b573-6cd6-312d-abb1-9b66c1c2d81f type: vnodes_to_tablets_migration kind: cluster scope: keyspace state: running is_abortable: false start_time: end_time: error: parent_id: none sequence_number: 0 shard: 0 keyspace: ks table: entity: progress_units: nodes progress_total: 3 progress_completed: 0 ``` Fixes SCYLLADB-1150. New feature, no backport needed. Closes scylladb/scylladb#29256 * github.com:scylladb/scylladb: test: cluster: Verify vnodes-to-tablets migration virtual task distributed_loader: Link resharding tasks to migration virtual task distributed_loader: Make table_populator aware of migration rollbacks service: Add virtual task for vnodes-to-tablets migrations storage_service: Guard migration status against uninitialized group0 compaction: Add parent_id to table_resharding_compaction_task_impl storage_service: Add keyspace-level migration status function storage_service: Replace migration status string with enum utils: Add UUID::is_name_based()	2026-04-19 00:56:33 +03:00
Nadav Har'El	31e0315710	Merge 'alternator: fix unnecesary cdc log entries' from Radosław Cybulski Fix cdc writing unnecesary entries to it's log, like for example when Alternator deletes an item which in reality doesn't exist. Originally @wps0 tackled this issue. This patch is an extension of his work. His work involved adding `should_skip` function to cdc, which would process a `mutation` object and decide, wherever changes in the object should be added to cdc log or not. The issue with his approach is that `mutation` object might contain changes for more than one row. If - for example - the `mutation` object contains two changes, delete of non-existing row and create of non-existing row, `should_skip` function will detect changes in second item and allow whole `mutation` (BOTH items) to be added. For example (using python's boto3) running this on empty table: ``` with table.batch_writer() as batch: batch.put_item({'p': 'p', 'c': 'c0'}) batch.delete_item(Key={'p': 'p', 'c': 'c1'}) ``` will emit two events ("put" event and "delete" event), even though the item with `c` set to `c1` does not exist (thus can't be deleted). Note, that both entries in batch write must use the same partition key, otherwise upper layer with split them into separate `mutation` objects and the issue will not happen. The solution is to do similar processing, but consider each change separated from others. This is tricky to implement due to a way cdc works. When cdc processes `mutation` object (containing X changes), it emits cdc entries in phases. Phase 1 - emit `preimage` (old state) for each change (if requested). Phase 2 - for each change emit actual "diff" (update / delete and so on). Phase 3 - emit `postimage` (new state). We will know if change needs to be skipped during phase 2. By that time phase 1 is completed and preimage for the change is emited. At that moment we set a flag that the change (identified by clustering key value) needs to be skipped - we add a clustering key to a `ignore-rows` set (`_alternator_clustering_keys_to_ignore` variable) and continue normally. Once all phases finish we add a `postprocess` phase (`clean_up_noop_rows` function). It will go through generated cdc mutations and skip all modifications, for which clustering key is in `ignore-rows` set. After skipping we need to do a "cleanup" operation - each generated cdc mutation contain index (incremented by one), if we skipped some parts, the index is not consecutive anymore, so we reindex final changes. There's a special case worth mentioning - Alternator tables without clustering keys. At that point `mutation` object passed to cdc can contain exactly one change (since different partition keys are splitted by upper layers and Alternator will never emit `mutation` object containing two (or more) changes with the same primary key. Here, when we decide the change is to be skipped we add empty `bytes` object to `ignore-rows` set. When checking `ignore-rows` set, we check if it's empty or not (we don't check for presence of empty `bytes` object). Note: there might be some confusion between this patch and #28452 patch. Both started from the same error observation and use similar tests for validation, as both are easily triggered by BatchWrite commands (both needs `mutation` object passed to cdc to contain more than one single change). This issue tho is about wrong data written in cdc log and is fixed at cdc, where #28452 is about wrong way of parsing correct cdc data and is fixed at Alternator side of things. Note, that we need #28452 to truly verify (otherwise we will emit correct cdc entries, but Alternator will incorrectly parse them). Note: to benefit / notice this patch you need `alternator_streams_increased_compatibility` flag turned on. Note: rework is quite "broad" and covers a lot of ground - every operation, that might result in a no-change to the database state should be tested. An additional test was added - trying to remove a column from non-existing item, as well as trying to remove non-existing column from existing item. Fixes: #28368 Fixes: SCYLLADB-1528 Fixes: SCYLLADB-538 Closes scylladb/scylladb#28544 * github.com:scylladb/scylladb: alternator: remove unnecesary code alternator: fix Alternator writing unnecesary cdc entries alternator: add failing tests for Streams	2026-04-18 00:07:51 +03:00
Nadav Har'El	32060d73df	Merge 'alternator: Add stream support for tablets' from Radosław Cybulski Implements neccesary changes for Streams to work with tablet based tables. - add utility functions to `system_keyspace` that helps reading cdc content from cdc log tables for tablet based base tables (similar api to ones for vnodes) - remove antitablet `if` checks, update tests that fail / skip if tablets are selected - add two tests to extensively test tablet based version, especially while manipulating stream count Fixes #23838 Fixes SCYLLADB-463 Closes scylladb/scylladb#28500 * github.com:scylladb/scylladb: alternator: add streams with tablets tests alternator: remove antitablet guards when using Streams alternator: implement streams for tablets treewide: add cdc helper functions to system_keyspace alternator: add system_keyspace reference	2026-04-17 23:48:31 +03:00
Nikos Dragazis	d361a0dd83	test: cluster: Verify vnodes-to-tablets migration virtual task Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-04-17 21:13:52 +03:00
Nikos Dragazis	a00056381f	utils: Add UUID::is_name_based() The UUID class already provides `is_timestamp()` for identifying time-based (version 1) UUIDs. Add the analogous `is_name_based()` predicate for version 3 (name-based) UUIDs, along with a test. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-04-17 20:58:39 +03:00
Radosław Cybulski	9a6aed721b	alternator: add streams with tablets tests Add tests for Streams, when table uses tablets underneath. One test verifies filtering using CHILD_SHARDS feature. Other one makes sure we get read all data while the table undergoes tablet count change. Add `--tablet-load-stats-refresh-interval-in-seconds=1` to `alternator/run` script, as otherwise newly added tests will fail. The setting changes how often scylla refreshes tablet metadata. This can't be done using `scylla_config_temporary`, as 1) default is 60 seconds 2) scylla will wait full timeout (60s) to read configuration variable again.	2026-04-17 18:58:27 +02:00
Radosław Cybulski	6be16cf224	alternator: remove antitablet guards when using Streams Remove `if` condition, that prevented tables with tablets working with Streams. Remove a test, that verifies, that Alternator will reject tables with tablets underneath working with Streams feature enabled on them. Update few tests, that were expected to fail on tablets to enable their normal execution.	2026-04-17 18:58:26 +02:00
Radosław Cybulski	6e5aaa85b6	alternator: fix Alternator writing unnecesary cdc entries Work in this patch is a result of two bugs - spurious MODIFY event, when remove column is used in `update_item` on non-existing item and spurious events, when batch write item mixed noop operations with operations involving actual changes (the former would still emit cdc log entries). The latter issue required rework of Piotr Wieczorek's algorithm, which fixed former issue as well. Piotr Wieczorek previously wrote checks, that should prevent unnecesary cdc events from being written. His implementation missed the fact, that a single `mutation` object passed to cdc code to be analysed for cdc log entries can contain modifications for multiple rows (with the same timestamp - for example as a result to BatchWriteItem call). His code tries to skip whole `mutation`, which in such case is not possible, because BatchWriteItem might have one item that does nothing and second item that does modification (this is the reason for the second bug). His algorithm was extended and moved. Originally it was working as follows - user would sent a `mutation` object with some changes to be "augmented". The cdc would process those changes and built a set of cdc log changes based on them, that would be added to cdc log table. Piotr added a `should_skip` function, which processes user changes and tried to determine if they all should be dropped or not. New version, instead of trying to skip adding rows to cdc log `mutation` object, builds a rows-to-ignore set. After whole cdc log `mutation` object is completed, it processes it and go through it row by row. Any row that was previously added to a `rows_to_ignore` set will now be removed. Remaining rows are written to new cdc log `mutation` with new clustering key (`cdc$batch_seq_no` index value should probably be consecutive - we just want to be safe here) and returns new `mutation` object to be sent to cdc log table. The first bug is fixed as a side effect of new algorithm, which contains more precise checks detecting, if given mutation actually made a difference. Fixes: #28368 Fixes: SCYLLADB-538 Fixes: SCYLLADB-1528 Refs: #28452	2026-04-17 18:00:25 +02:00
Botond Dénes	6ce0968960	compaction: release GC'ed sstables incrementally during compaction Garbage collected sstables created during incremental compaction are deleted only at the end of the compaction, which increases the memory footprint. This is inefficient, especially considering that the related input sstables are released regularly during compaction. This commit implements incremental release of GC sstables after each output sstable is sealed. Unlike regular input sstables, GC sstables use a different exhaustion predicate: a GC sstable is only released when its token range no longer overlaps with any remaining input sstable. This is because GC sstables hold tombstones that may shadow data in still-alive overlapping input sstables; releasing them prematurely would cause data resurrection. Fixes #5563 Closes scylladb/scylladb#28984	2026-04-17 18:20:47 +03:00
Radosław Cybulski	2894542e57	alternator: add failing tests for Streams Add failing tests for Streams functionality. Trying to remove column from non-existing item is producing a MODIFY event (while it should none). Doing batch write with operations working on the same partition, where one operation is without side effects and second with will produce events for both operations, even though first changes nothing. First test has two versions - with and without clustering key. Second has only with clustering key, as we can't produce batch write with two items for the same partition - batch write can't use primary key more than once in single call. We also add a test for batch write, where one of three operations has no observable side effects and should not show up in Streams output, but in current scylla's version it does show.	2026-04-17 16:28:14 +02:00
Piotr Smaron	218f8adc8f	transport: add per-service-level cql_requests_serving metric Add a per-scheduling-group gauge that tracks the number of in-flight CQL requests for each service level. The existing scylla_transport_requests_serving metric is a single global per-shard counter; the new metric breaks it down by scheduling group so operators can see which service level contributes the most in-flight requests when debugging latency. The metric is named cql_requests_serving (exposed as scylla_transport_cql_requests_serving) following the cql_ prefix convention used by all other per-scheduling-group transport metrics (cql_requests_count, cql_request_bytes, cql_response_bytes, cql_pending_response_memory). Using a cql_ prefix avoids Prometheus confusion with the global requests_serving metric, which lacks the scheduling_group_name label. The counter is incremented when a request enters process_request() and decremented in the same 'leave' defer block as the global requests_serving, ensuring the request is counted as in-flight until the response is sent.	2026-04-17 15:07:14 +02:00
Andrzej Jackowski	e256d9f69d	test: retry get_coordinator_host() after topology coordinator stop After stopping the topology coordinator, a new topology coordinator may not yet be started when get_coordinator_host() is called. Make the function always retry via wait_for so that every caller is protected against this race. Fixes SCYLLADB-1553 Closes scylladb/scylladb#29489	2026-04-17 12:08:26 +02:00
Botond Dénes	fbcfe3f88f	test: use uuid4 for DockerizedServer container names to avoid collisions Container names were generated as {name}-{pid}-{counter}, where the counter is a per-process itertools.count. This scheme breaks across CI runs on the same host: if a prior job was killed abruptly (SIGKILL, cancellation) its containers are left running since --rm only removes containers on exit. A subsequent run whose worker inherits the same PID (common in containerized CI with small PID namespaces) and reaches the same counter value will collide with the orphaned container. Replace pid+counter with uuid.uuid4(), which generates a random UUID, making names unique across processes, hosts, and time without any shared state or leaking host identifiers. Fixes: SCYLLADB-1540 Closes scylladb/scylladb#29509	2026-04-17 11:56:51 +02:00
Avi Kivity	cad3c0de94	test: write minio log to testlog dir for Jenkins artifact collection Write the MinIO server log directly to tempdir_base (testlog/<arch>/) instead of the per-server temp directory that gets destroyed on shutdown. This preserves the log for Jenkins artifact collection, helping debug S3-related flaky test failures like the stcs_reshape_overlapping_s3_test hang (SCYLLADB-1481). Closes scylladb/scylladb#29458	2026-04-17 12:51:55 +03:00
Botond Dénes	facb50cbf9	Merge 'test.py: refactor test.py' from Andrei Chekun With the latest changes, there are a lot of code that is redundant in the test.py. This PR just cleans this code. Also, it narrows using dynamic scope for fixtures to test/alternator and test/cqlpy. All the rest by default will have module scope. test.py will be a wrapper for pytest mostly for CI use. As for now test.py have important part of calculating the number of threads to start pytest with. This is not possible to do in pytest itself. No backport needed, framework enhancement only. Fixes: https://scylladb.atlassian.net/browse/SCYLLADB-666 Closes scylladb/scylladb#28852 * github.com:scylladb/scylladb: test.py: remove testpy_test_fixture_scope test.py: add logger for 3rd party service test.py: delete dead code in test.py	2026-04-17 12:51:14 +03:00
Pawel Pery	7883f161bb	vector-store: fix creating local vector search indexes with a part of the partition key Users ought to have possibility to create the local index for Vector Search based only on a part of the partition key. This commits provides this by removing requirements of 'full partition key only' for custom local index. The commit updates docs to explain that local vector index can use only a part of the partition key. The commit implements cqlpy test to check fixed functionality. Fixes: SCYLLADB-953 Needs to be backported to 2026.1 as it is a fix for local vector indexes. Closes scylladb/scylladb#28931	2026-04-17 11:44:15 +02:00
Karol Nowacki	c643f321af	vector_search: decrease default connection timeout to 3s Decrease the default connection timeout to 3s to better align with the default CQL query timeout of 10s. The previous timeout allowed only one failover request in high availability scenario before hitting the CQL query timeout. By decreasing the timeout to 3s, we can perform up to three failover requests within the CQL query timeout, which significantly improves the chances of successfully completing the query in high availability scenarios. Fixes: SCYLLADB-95	2026-04-17 12:26:39 +03:00
Karol Nowacki	9269ca9cf7	vector_search: add unreachable node detection time config Add option `vector_store_unreachable_node_detection_time_in_ms` to control parameters related to detecting unreachable vector store nodes. This parameter is used to set the TCP connect timeout, keepalive parameters, and TCP_USER_TIMEOUT. By configuring these parameters, we can detect unreachable vector store nodes faster and trigger failover mechanisms in a timely manner.	2026-04-17 12:26:38 +03:00
Piotr Dulikowski	37fc1507f0	Merge 'Alternator: Add vector search support' from Nadav Har'El This series adds support for vector search in Alternator based on the existing implementation in CQL. The series adds APIs for `CreateTable` and `UpdateTable` to add or remove vector indexes to Alternator tables, `DescribeTable` to list them and check the indexing status, and `Query` to perform a vector search - which contacts the vector store for the actual ANN (approximate nearest neighbor) search. Correct functionality of these features depend on some features of the the vector store, that were already done (see https://github.com/scylladb/vector-store/pull/394). This initial implementation is fully functional, and can already be useful, but we do not yet support all the features we hope to eventually support. Here are things that we have not done yet, and plan to do later in follow-up pull requests: 1. Support a new optimized vector type ("V") - in addition to the "list of numbers" type supported in this version. 2. Allow choosing a different similarity function when creating an index, by SimilarityFunction in VectorIndex definition. 3. Allow choosing quantization (f32/f16/bf16/i8/b1) to ask the vector index to compress stored vectors. 4. Support oversampling and rescoring, defined per-index and per-query. 5. Support HNSW tuning parameters — maximum_node_connections, construction_beam_width, search_beam_width. 6. Support pre-filtering over key columns, which are available at the vector store, by sending the filter to the vector store (translated from DynamoDB filter syntax to the vector's store's filter syntax). A decision still need to be made if this will use KeyConditionExpression or FilterExpression. This version supports only post-filtering (with `FilterExpression`). 7. Support projecting non-key attributes into the index (Projection=INCLUDE and Projection=ALL), and then 1. pre-filtering using these attributes, and 2. efficiently return these attributes (using Select=ALL_PROJECTED_ATTRIBUTES, which today returns just the key columns). 8. Optimize the performance of `Query`, which today is inefficient for Select=ALL_ATTRIBUTES because it serially retrieves the matching items one at a time. 9. Returning the similarity scores with the items (the design proposes ReturnVectorSearchSimilarity). 10. Add more vector-search-specific metrics, beyond the metric we already have counting Query requests. For example separate latency and request-count metrics for vector-search Queries (distinct from GSI/LSI queries), and a metric accumulating the total Limit (K) across all vector search queries. 11. Consider how (and if at all) we want to run the tests in test/alternator/test_vector.py that need the vector store in the CI. Currently they are skipped in CI and only run manually (with `test/alternator/run --vs test_vector`). 12. UpdateTable 'Update' operation to modify index parameters. Only some can be modified, e.g., Oversampling. 13. Support for "local index" (separate index for each partition). 14. Make sure that vector search and Streams can be enabled concurrently on the same table - both need CDC but we need to verify that one doesn't confuse the other or disables options that the other needs. We can only do this after we have Alternator Streams running on tablets (since vector store requires tablets). Testing the new Alternator vector search end-to-end requires running both Scylla and the vector store together. We will have such end-to-end tests in the vector store repository (see https://github.com/scylladb/vector-store/pull/392), but we also add in this pull request many end-to-end tests written in Python, that can be run with the command "test/alternator/run --vs test_vector.py". The "--vs" option tells the run script to run both Scylla and the vector store (currently assumed to be in `.../vector-store/target/release/vector-store`). About 65% of the tests in this pull request check supported syntax and error paths so can run without the vector store, while about 35% of the tests do perform actual Query operations and require the vector store to be running. Currently, the tests that do require the vector store will not get run by CI, but can be easily re-run manually with `test/alternator/run --vs test_vector.py`. In total, this series includes 78 functional tests in 2200 lines of Python code. This series also includes documentation for the new Alternator feature and the new APIs introduced. You can see a more detailed design document here: https://docs.google.com/document/d/1cxLI7n-AgV5hhH1DTyU_Es8_f-t8Acql-1f58eQjZLY/edit Two patches in this series split the huge alternator/executor.cc, after this series continued to grow it and it reached a whoppng 7,000 lines. These patches are just reorganization of code, no functional changes. But it's time that we finally do this (Refs #5783), we can't just continue to grow executor.cc with no end... Closes scylladb/scylladb#29046 * github.com:scylladb/scylladb: test/alternator: add option to "run" script to run with vector search alternator: document vector search test/alternator: fix retries in new_dynamodb_session test/alternator: test for allowed characters in attribute names test/alternator: tests for vector index support alternator, vector: add validation of non-finite numbers in Query alternator: Query: improve error message when VectorSearch is missing alternator: add per-table metrics for vector query alternator: clean up duplicated code alternator: fix default Select of Query alternator: split executor.cc even more alternator: split alternator/executor.cc alternator: validate vector index attribute values on write alternator: DescribeTable for vector index: add IndexStatus and Backfilling alternator: implement Query with a vector index alternator: fix bug in describe_multi_item() alternator: prevent adding GSI conflicting with a vector index alternator: implement UpdateTable with a vector index alternator: implement DescribeTable with a vector index alternator: implement CreateTable with a vector index alternator: reject empty attribute names cdc: fix on_pre_create_column_families to create CDC log for vector search	2026-04-17 10:25:45 +02:00
Aleksandra Martyniuk	2c0de7d9b3	test: test multi RF changes	2026-04-17 09:58:08 +02:00
Aleksandra Martyniuk	38bad5f316	cql3: allow changing RF by more than one when adding or removing a DC rf_rack_valid_keyspaces relies on the fact that replicas of base table and mv are streamed concurrently. This is no longer true for newly introduced method of adding a DC. Disable rf_rack_valid_keyspaces in test_mv_first_replica_in_dc to force the old method.	2026-04-17 09:58:08 +02:00
Aleksandra Martyniuk	bcdab2e012	service: extend tablet_migration_info to handle rebuilds Make tablet_migration_info::{src,dst} optional, so that it can be reused by rebuild, for respectively leaving and pending replica.	2026-04-17 09:58:07 +02:00
Aleksandra Martyniuk	72bb3113ac	db: add columns to system_schema.keyspaces Add a new next_replication column to system_schema.keyspaces table. While there is an ongoing RF change: - next_replication keeps the target RF values; - existing replication_v2 column keeps initial RF values - the ones we started the RF change with. DESCRIBE KEYSPACE statement shows replication_v2. When there is no ongoing RF change for this keyspace, its next_replication is empty. In this commit no data is kept in the new column.	2026-04-17 09:58:07 +02:00
Łukasz Paszkowski	4657d9e32c	streaming: reject mutation fragments on critical disk utilization The stream_mutation_fragments RPC handler did not check is_in_critical_disk_utilization_mode before accepting incoming mutation fragments. This meant load-and-stream (nodetool refresh --load-and-stream) could push data onto a node at critical disk utilization, potentially filling the disk completely. Add a critical disk utilization check in the get_next_mutation_fragment lambda, throwing critical_disk_utilization_exception when the node is in critical mode. This mirrors the existing protection in stream_blob.cc. Also remove the xfail marker from the corresponding test added in the previous commit.	2026-04-17 09:31:26 +02:00
Avi Kivity	04b54f363b	Merge 'Enable vnodes-to-tablets migrations with arbitrary tokens' from Nikos Dragazis This PR removes the power-of-two token constraint from vnodes-to-tablets migrations, allowing clusters with randomly generated tokens to migrate without manual token reassignment. Previously, migrations required vnode tokens to be a power of two and aligned. In practice, these conditions are not met with Scylla's default random token assignment, so the constraint is a blocker for real-world use. With the introduction of arbitrary tablet boundaries in PR #28459, the tablet layer can now support arbitrary tablet boundaries. This PR builds on that capability to allow arbitrary vnode tokens during migration. When the highest vnode token does not coincide with the end of the token ring, the vnode wraps around, but tablets do not support that. This is handled by splitting it into two tablets: one covering the tail end of the ring and one covering the beginning. Testing has been updated accordingly: existing cluster tests now use randomly generated tokens instead of precomputed power-of-two values, and a new Boost test validates the wrap-around tablet boundary logic. Fixes SCYLLADB-724. New feature, no backport is needed. Closes scylladb/scylladb#29319 * github.com:scylladb/scylladb: test: Use arbitrary tokens in vnodes->tablets migration tests test: boost: Add test for wrap-around vnodes storage_service: Support vnodes->tablets migrations w/ arbitrary tokens storage_service: Hoist migration precondition	2026-04-17 00:46:35 +03:00
Andrei Chekun	745debe9ec	test.py: remove testpy_test_fixture_scope With migration to pyest this fixture is useless. Removing and setting the session to the module for the most of the tests. Add dynamic_scope function to support running alternator fixtures in session scope, while Test and TestSuite are not deleted. This is for migration period, later on this function should be deleted.	2026-04-16 22:08:33 +02:00
Andrei Chekun	21addb2173	test.py: add logger for 3rd party service With migration of preparation environment and starting 3rd party services to the pytest, they're output the logs to the terminal. So this PR binds them their own log file to avoid polluting the terminal.	2026-04-16 22:08:33 +02:00
Andrei Chekun	13770ab394	test.py: delete dead code in test.py With the latest changes, there are a lot of code that is redundant in the test.py. This PR just cleans this code. Changes in other files are related to cleaning code from the test.py, especially with redundant parameter --test-py-init and moving prepare_environment to pytest itself.	2026-04-16 22:08:31 +02:00
Avi Kivity	999e108139	Merge 'test: lib: fix broken retry in start_docker_service' from Dario Mirovic The retry loop in `start_docker_service` passes the parse callbacks via `std::move` into `create_handler` on each iteration. After the first iteration, the moved-from `std::function` objects are empty. All subsequent retries skip output parsing entirely and immediately treat the service as successfully started. This defeats the entire purpose of the retry mechanism. Fix by passing the callbacks by copy instead of move, so the original callbacks remain valid across retries. Fixes SCYLLADB-1542 This is a CI stability issue and should be backported. Closes scylladb/scylladb#29504 * github.com:scylladb/scylladb: test/lib: fix typos in proc_utils, gcs_fixture, and dockerized_service test: gcs_fixture: rename container from "local-kms" to "fake-gcs-server" test: fix proc_utils.cc formatting from previous commit test: lib: use unique container name per retry attempt test: lib: fix broken retry in start_docker_service	2026-04-16 21:48:25 +03:00

... 4 5 6 7 8 ...

11801 Commits