scylladb

Author	SHA1	Message	Date
Botond Dénes	34473302b0	Merge 'docs: document existing guardrails' from Andrzej Jackowski This patch series introduces a new documentation for exiting guardrails. Moreover: - Warning / failure messages of recently added write CL guardrails (SCYLLADB-259) are rephrased, so all guardrails have similar messages. - Some new tests are added, to help verify the correctness of the documentation and avoid situations where the documentation and implementation diverge. Fixes: [SCYLLADB-257](https://scylladb.atlassian.net/browse/SCYLLADB-257) No backport, just new docs and tests. [SCYLLADB-257]: https://scylladb.atlassian.net/browse/SCYLLADB-257?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ Closes scylladb/scylladb#29011 * github.com:scylladb/scylladb: test: add new guardrail tests matching documentation scenarios test: add metric assertions to guardrail replication strategy tests test: use regex matching in guardrail replication strategy tests test: extract ks_opts helper in test_guardrail_replication_strategy docs: document CQL guardrails cql: improve write consistency level guardrail messages	2026-03-20 08:56:00 +02:00
Botond Dénes	f9adbc7548	test/cqlpy/test_tombstone_limit.py: disable tombstone-gc for test table Since `7564a56dc8`, all tables default to repair-mode tombstone-gc, which is identical to immediate-mode for RF=1 tables. Consequently the tombstones written by the tests in this test file are immediately collectible and with some unlucky timing, some of them can be collected before the end of the test, failing the empty-page prefix check because the empty pages prefix will be smaller than expected based on the number of tombstones written. Disable tombstone-gc to remove this source of flakyness. Fixes: SCYLLADB-1062 Closes scylladb/scylladb#29077	2026-03-20 09:14:29 +03:00
Avi Kivity	062751fcec	Merge 'db/config: enable ms sstable format by default' from Łukasz Paszkowski Trie-based sstable indexes are supposed to be (hopefully) a better default than the old BIG indexes. Make the new format a new default for new clusters by naming ms in the default scylla.yaml. New functionality. No backport needed. This PR is basically Michał's one https://github.com/scylladb/scylladb/pull/26377, Jakub's https://github.com/scylladb/scylladb/pull/27332 fixing `sstables_manager::get_highest_supported_format()` and one test fix. Closes scylladb/scylladb#28960 * github.com:scylladb/scylladb: db/config: announce ms format as highest supported db/config: enable `ms` sstable format by default cluster/dtest/bypass_cache_test: switch from highest_supported_sstable_format to chosen_sstable_format api/system: add /system/chosen_sstable_version test/cluster/dtest: reduce num_tokens to 16	2026-03-19 18:19:01 +02:00
Andrzej Jackowski	4deeb7ebfc	test: add new guardrail tests matching documentation scenarios Add tests for RF guardrails (min/max warn/fail, RF=0 bypass, threshold=-1 disable, ALTER KEYSPACE) and write consistency level guardrails to cover all scenarios described in guardrails.rst. Test runtime (dev): test_guardrail_replication_strategy - 6s test_guardrail_write_consistency_level - 5s Refs: SCYLLADB-257	2026-03-19 15:07:03 +01:00
Andrzej Jackowski	2a03c634c0	test: add metric assertions to guardrail replication strategy tests Verify that guardrail violations increment the corresponding metrics. Refs: SCYLLADB-257	2026-03-19 15:07:03 +01:00
Andrzej Jackowski	81c4e717e2	test: use regex matching in guardrail replication strategy tests Replace loose substring assertions with regex-based matching against the exact server message formats. Add regex constants for all guardrail messages and rewrite create_ks_and_assert_warnings_and_errors() to verify count and content of warnings and failures. Refs: SCYLLADB-257	2026-03-19 15:07:03 +01:00
Andrzej Jackowski	517bb8655d	test: extract ks_opts helper in test_guardrail_replication_strategy Factor out ks_opts() to build keyspace options with tablets handling and use it across all existing replication strategy guardrail tests. No behavioral changes. This facilitates further modification of the tests later in this patch series. Refs: SCYLLADB-257	2026-03-19 12:49:41 +01:00
Botond Dénes	fc8cebd671	Merge 'Verify components digests during component load and scrub in validate mode' from Taras Veretilnyk This PR adds integrity verification for SSTable component files during loading. When component digests are present in Scylla metadata, the loader now validates each component's CRC32 digest against the stored expected value, catching silent corruption of component files. Index, Rows and Partitions components digests are also validated duriung scrub in validate mode Added corruption tests that write an SSTable, flip a bit in a specific component file, then verify that reloading the SSTable detects the corruption and throws the expected exception. Depends on https://github.com/scylladb/scylladb/pull/28338 Backport is not required, this is new feature Fixes https://github.com/scylladb/scylladb/issues/20103 Closes scylladb/scylladb#28761 * github.com:scylladb/scylladb: test/cqlpy: test --ignore-component-digest-mismatch flag in scylla sstable upgrade docs: document --ignore-component-digest-mismatch flag for scylla sstable upgrade sstables: propagate ignore_component_digest_mismatch config to all load sites sstables: add option to ignore component digest mismatches sstable_compaction_test: Add scrub validate test for corrupted index sstables: add tests for component digest validation on corrupted SSTables sstables: validate index components digests during SSTable scrub in validate mode sstables: verify component digests on SSTable load sstables: add digest_file_random_access_reader for CRC32 digest computation	2026-03-13 09:55:55 +02:00
Szymon Malewski	3116db6c2d	test: fix `testJsonOrdering` The `test/cqlpy/cassandra_tests/validation/entities/json_test.py::testJsonOrdering` was failing because of differences between Cassandra and Scylla in printing JSON floating point values - e.g. Cassandra prints 30.0, where Scylla prints 30. Both are valid, so in this patch, instead of comparing strings, we compare parsed JSON using `EquivalentJson`. Fixes #28467 Closes scylladb/scylladb#28924	2026-03-12 09:07:08 +01:00
Nadav Har'El	401dc1894c	test/alternator,cqlpy: avoid xfail_strict against DynamoDB/Cassandra Recently, in commit `7b30a39`, we added to pytest.ini the option xfail_strict. This option causes every time a test XPASSes, i.e., an XFAIL test actually passes, to be considered an error and fail the test. While this has some benefits, it's a big problem when running tests against a reference implementation like DynamoDB or Cassandra: We typically mark a test "xfail" if the test shows a known bug - i.e., if the test fails on Scylla but passes on the reference system (DynamoDB or Cassandra). This means that when running "test/cqlpy/run-cassandra" or "test/alternator/run --aws", we expect to see many tests XPASS, and now this will cause these runs to "fail". So in this patch we add the xfail_strict=false to cqlpy/run-cassandra and alternator/run --aws. This option is not added to cqlpy/run or to alternator/run without --aws, and also doesn't affect test.py or Jenkins. P.S. This is another nail in the coffin of doing "cd test/alternator; pytest --aws". You should get used to running Alternator tests through test/alternator/run, even if you don't need to run Scylla (the "--aws" option doesn't run Scylla). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28973	2026-03-11 09:29:30 +02:00
Taras Veretilnyk	579269b3c5	test/cqlpy: test --ignore-component-digest-mismatch flag in scylla sstable upgrade Verify that scylla sstable upgrade fails when an sstable has a corrupted Statistics component digest, and succeeds when the --ignore-component-digest-mismatch flag is provided.	2026-03-10 19:24:05 +01:00
Andrei Chekun	c36df5ecf4	test.py: eliminite drivers exception There is a race condition in driver that raises the RuntimeException. This pollutes the output, so this PR is just silencing this exception. Fixes: SCYLLADB-900 Closes scylladb/scylladb#28957	2026-03-10 14:31:36 +02:00
Nadav Har'El	d78ea3d498	test/cqlpy: mark test_unbuilt_index_not_used not strictly xfail A few days ago, in commit `7b30a3981b` we added to pytest.ini the option xfail_strict. This option causes every time a test XPASSes, i.e., an xfail test actually passes - to be considered an error and fail the test. But some tests demonstrate a timing-related bug and do not reproduce the bug every single time. An example we noticed in one CI run is: test/cqlpy/test_secondary_index.py::test_unbuilt_index_not_used This test reproduces a timing-related bug (if you read from a secondary index "too quickly" you can get wrong results), but only about 90% of the time, not 100% of the time. The solution is to add "strict=False" for the xfail marker on this specific test. This undoes the xfail_strict for this specific test, accepting that this specific test can either pass or fail. Note that this does NOT make this test worthless - we still see this test failing most of the time, and when a developer finally fixes this issue, the test will begin to pass all the time. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-956 (we'll probably need to follow up this fix with the same fix for other xfail tests that can sometime pass). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28942	2026-03-09 22:48:20 +02:00
Michał Chojnowski	949fc85217	db/config: enable `ms` sstable format by default Trie-based sstable indexes are supposed to be (hopefully) a better default than the old BIG indexes. Make them the new default. If we change our mind, this change can be reverted later.	2026-03-09 17:12:09 +01:00
Szymon Malewski	f9d213547f	cql3: selection: fix `add_column_for_post_processing` for ORDER BY The purpose of `add_column_for_post_processing` is to add columns that are required for processing of a query, but are not part of SELECT clause and shouldn't be returned. They are added to the final result set, but later are not serialized. Mainly it is used for filtering and grouping columns, with a special case of `WHERE primary_key IN ... ORDER BY ...` when the whole result set needs additional final sorting, and ordering columns must be added as well. There was a bug that manifested in #9435, #8100 and was actually identified in #22061. In case of selection with processing (e.g functions involved), result set row is formed in two stages. Initially it is a list of columns fetched from replicas - on which filtering and grouping is performed. After that the actual selection is resolved and the final number of columns can change. Ordering is performed on this final shape, but the ordering column index returned by `add_column_for_post_processing` refereed to initial shape. If selection refereed to the same column twice (e.g. `v, TTL(v)` as in #9435) final row was longer than initial and ordering refereed to incorrect column. If a function in selection refereed to multiple columns (e.g. as_json(.., ..) which #8100 effectively uses) the final row was shorter and ordering tried to use a non-existing column. This patch fixes the problem by making sure that column index of the final result set is used for ordering. The previously crashing test `cassandra_tests/validation/entities/json_test.py::testJsonOrdering` doesn't have to be skipped, but now it is failing on issue #28467. Fixes #9435 Fixes #8100 Fixes #22061 Closes scylladb/scylladb#28472	2026-03-05 19:22:34 +02:00
Dawid Mędrek	7564a56dc8	Merge 'tombstone_gc: allow using repair-mode tombstone gc with RF=1 tables' from Botond Dénes Currently, repair-mode tombstone-gc cannot be used on tables with RF=1. We want to make repair-mode the default for all tablet tables (and more, see https://github.com/scylladb/scylladb/issues/22814), but currently a keyspace created with RF=1 and later altered to RF>1 will end up using timeout-mode tombstone gc. This is because the repair-mode tombstone-gc code relies on repair history to determine the gc-before time for keys/ranges. RF=1 tables cannot run repairs so they will have empty repair history and consequently won't be able to purge tombstones. This PR solves this by keeping a registry of RF=1 tables and consulting this registry when creating `tombstone_gc_state` objects. If the table is RF=1, tombstone-gc will work as if the table used immediate-mode tombstone-gc. The registry is updated on each replication update. As soon as the table is not RF=1 anymore, the tombstone-gc reverts to the natural repair-mode behaviour. After this PR, tombstone-gc defaults to repair-mode for all tables, regardless of RF and tablets/vnodes. Fixes: SCYLLADB-106. New feature, no backport required. Closes scylladb/scylladb#22945 * github.com:scylladb/scylladb: test/{boost,cluster}: add test for tombstone gc mode=repair with RF=1 tombstone_gc: allow use of repair-mode for RF=1 tables replica/table: update rf=1 table registry in shared tombstone-gc state tombstone_gc: tombstone_gc_before_getter: consider RF when getting gc before time tombstone_gc: unpack per_table_history_maps tombstone_gc: extract _group0_gc_time from per_table_history_map tombstone_gc: drop tombstone_gc_state(nullptr) ctor and operator bool() test/lib/random_schema: use timeout-mode tombstone_gc tombstone_gc_options: add C++ friendly constructor test: move away from tombstone_gc_state(nullptr) ctor treewide: move away from tombstone_gc_state(nullptr) ctor sstable: move away from tombstone_gc_mode::operator bool() replica/table: add get_tombstone_gc_state() compaction: use tombstone_gc_state with value semantics db/row_cache: use tombstone_gc_state with value semantics tombstone_gc: introduce tombstone_gc_state::for_tests()	2026-03-05 11:50:31 +01:00
Marcin Maliszkiewicz	c3f59e4fa1	Merge 'cql3: implement write_consistency_levels guardrails' from Andrzej Jackowski This patch series implements `write_consistency_levels_warned` and `write_consistency_levels_disallowed` guardrails, allowing the configuration of which consistency levels are unwanted for writes. The motivation for these guardrails is to forbid writing with consistency levels that don't provide high durability guarantees (like CL=ANY, ONE, or LOCAL_ONE). Neither guardrail is enabled by default, so as not to disrupt clusters that are currently using any of the CLs for writes. The warning guardrail may seem harmless, as it only adds a warning to the CQL response; however, enabling it can significantly increase network traffic (as a warning message is added to each response) and also decrease throughput due to additional allocations required to prepare the warning. Therefore, both guardrails should be enabled with care. The newly added `writes_per_consistency_level` metric, which is incremented unconditionally, can help decide whether a guardrail can be safely enabled in an existing cluster. This commit adds additional `if` instructions on the critical path. However, based on the `perf_simple_query` benchmark for writes, the difference is marginal (~40 additional instructions, which is a relative difference smaller than 0.001). BEFORE: ``` 291443.35 tps ( 53.3 allocs/op, 16.0 logallocs/op, 14.2 tasks/op, 48067 insns/op, 18885 cycles/op, 0 errors) throughput: mean= 289743.07 standard-deviation=6075.60 median= 291424.69 median-absolute-deviation=1702.56 maximum=292498.27 minimum=261920.06 instructions_per_op: mean= 48072.30 standard-deviation=21.15 median= 48074.49 median-absolute-deviation=12.07 maximum=48119.87 minimum=48019.89 cpu_cycles_per_op: mean= 18884.09 standard-deviation=56.43 median= 18877.33 median-absolute-deviation=14.71 maximum=19155.48 minimum=18821.57 ``` AFTER: ``` 290108.83 tps ( 53.3 allocs/op, 16.0 logallocs/op, 14.2 tasks/op, 48121 insns/op, 18988 cycles/op, 0 errors) throughput: mean= 289105.08 standard-deviation=3626.58 median= 290018.90 median-absolute-deviation=1072.25 maximum=291110.44 minimum=274669.98 instructions_per_op: mean= 48117.57 standard-deviation=18.58 median= 48114.51 median-absolute-deviation=12.08 maximum=48162.18 minimum=48087.18 cpu_cycles_per_op: mean= 18953.43 standard-deviation=28.76 median= 18945.82 median-absolute-deviation=20.84 maximum=19023.93 minimum=18916.46 ``` Fixes: SCYLLADB-259 Refs: SCYLLADB-739 No backport, it's a new feature Closes scylladb/scylladb#28570 * github.com:scylladb/scylladb: scylla.yaml: add write CL guardrails to scylla.yaml scylla.yaml: reorganize guardrails config to be in one place test: add cluster tests for write CL guardrails test: implement test_guardrail_write_consistency_level cql3: start using write CL guardrails cql3/query_processor: implement metrics to track CL of writes db: cql3/query_processor: add write_consistency_levels enum_sets config: add write_consistency_levels_* guardrails configuration	2026-03-05 09:55:38 +01:00
Nadav Har'El	af07718fff	test/cqlpy: fix "run --release" for versions 5.4 or older Recently we started to rely on the options "--auth-superuser-name" and "--auth-superuser-salted-password" to ensure that a cassandra/cassandra user exists for tests - without those options a default superuser no longer exists. This broke "test/cqlpy/run --release" for old releases, earlier than 5.4 (in the enterprise stream, 2024.1 or earlier), because those old release didn't have this option. So in this patch we fix the "--release" logic that removes these options from the command line when running these old versions. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28894	2026-03-05 09:59:46 +03:00
Szymon Malewski	4c4673e8f9	test: vector_similarity: Fix similarity value checks `isclose` function checks if returned similarity floats are close enough to expected value, but it doesn't `assert` by itself. Several tests missed that `assert`, effectively always passing. With this patch similarity values checks are wrapped in helper function `assert_similarity` with predefined tolerance. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-877 Closes scylladb/scylladb#28748	2026-03-04 09:53:32 +01:00
Botond Dénes	5998a859f7	tombstone_gc: allow use of repair-mode for RF=1 tables Modify the methods which calculate the default gc mode as well as that which validates whether repair-mode can be used at all, so both accepts use of repair-mode on RF=1 tables. This de-facto changes the default tombstone-gc to repair-mode for all tables. Documentation is updated accordingly. Some tests need adjusting: * cqlpy/test_select_from_mutation_fragments.py: disable GC for some test cases because this patch makes tombstones they write subject to GC when using defaults. * test/cluster/test_mv.py::test_mv_tombstone_gc_not_inherited used repair-mode as a non-default for the base table and expected the MV to revert to default. Another mode has to be used as the non-default (immediate). * test/cqlpy/test_tools.py::test_scylla_sstable_dump_schema: don't compare tombstone_gc schema extension when comparing dumped schema vs. original. The tool's schema loader doesn't have access to the keyspace definition so it will come up with different defaults for tombstone-gc. * test/boost/row_cache_test.cc::test_populating_cache_with_expired_and_nonexpired_tombstones sets tombstone expiry assuming the tombstone-gc timeout-mode default. Change the CREATE TABLE statement to set the expected mode.	2026-03-04 09:44:24 +02:00
Andrzej Jackowski	446539f12f	test: implement test_guardrail_write_consistency_level Implement basic tests for write consistency level guardrails, verifying that they work for each type of write request (inserts, updates, deletes, logged batches, unlogged batches, conditional batches, and counter operations). All tests are marked as Scylla-only because they currently don't pass with Cassandra due to differences in handling superusers (see: SCYLLADB-882). Tests execution time: - Dev: 3s - Debug: 14s Refs: SCYLLADB-259 Refs: SCYLLADB-882	2026-03-04 08:00:13 +01:00
Dario Mirovic	fd17dcbec8	auth: do not create default 'cassandra:cassandra' superuser Changes the behavior of default superuser creation. Previously, without configuration 'cassandra:cassandra' credentials were used. Now default superuser creation is skipped if not configured. The two ways to create default superuser are: - Config file - auth_superuser_name and auth_superuser_salted_password fields - Maintenance socket - connect over maintenance socket and CREATE/ALTER ROLE ... Behavior changes: Old behavior: - No config - 'cassandra:cassandra' created - auth_superuser_name only - <name>:cassandra created - auth_superuser_salted_password only - 'cassandra:<password>' created - Both specified - '<name>:<password>' created New behavior: - No config - no default superuser - Requires maintenance socket setup - auth_superuser_name only - '<name>:' created WITHOUT password - Requires maintenance socket setup - auth_superuser_salted_password only - no default superuser - Both specified - '<name>:<password>' created Fixes SCYLLADB-409	2026-03-03 23:42:25 +01:00
Dimitrios Symonidis	80b74d7df2	tablet options: Add max_tablet_count tablet option to enforce tablet count upper bounds Introduced a new max_tablet_count tablet option that caps the maximum number of tablets a table can have. This feature is designed primarily for backup and restore workflows. During backup, when load balancing is disabled for snapshot consistency, the current tablet count is recorded in the backup manifest. During restore, max_tablet_count is set to this recorded value, ensuring the restored table's tablet count never exceeds the original snapshot's tablet distribution. This guarantee enables efficient file-based SSTable streaming during restore, as each SSTable remains fully contained within a single tablet boundary. Closes scylladb/scylladb#28450	2026-03-03 11:19:24 +03:00
Botond Dénes	ab532882db	tools/scylla-sstable: introduce scylla sstable split Split input sstable(s) into multiple output sstables based on the provided token boundaries. The input sstable(s) are divided according to the specified split tokens, creating one output sstable per token range. Fixes: SCYLLADB-10 Closes scylladb/scylladb#28741	2026-03-02 15:19:17 +01:00
Marcin Maliszkiewicz	6bf706ef1b	Merge 'scylla-sstable: query: handle nested UDTs' from Botond Dénes The query (and in certain modes the write) operations uses virtual table facility inside `cql_test_env`. The schema of the sstable is created as a table in `cql_test_env`. This involves registering all UDTs with the keyspace, so they are available for lookups. This was done with a flat loop over all column types, but this is not enough. UDTs might be nested in other types, like collections. One has to do a traversal of the type tree and register every UDT on the way. This PR changes the flat loop to a recursive traversal of the type tree. The query operation now works with UDTs, no matter how deeply nested they are. Backport: Implements missing functionality of a tool, no backport. Closes scylladb/scylladb#28798 * github.com:scylladb/scylladb: tools/scylla-sstable: create_table_in_cql_env(): register UDTs recursively tools/scylla-sstable: generalize dump_if_user_type tools/scylla-sstable: move dump_if_user_type() definition	2026-03-02 14:14:43 +01:00
Marcin Maliszkiewicz	8c2da76fde	test/cqlpy: remove xfail from test_constant_function_parameter The issue was fixed by commit `cc03f5c89d` ("cql3: support literals and bind variables in selectors"), so the xfail marker is no longer needed. Closes scylladb/scylladb#28776	2026-03-01 20:03:42 +02:00
Marcin Maliszkiewicz	a03ebe1a29	Merge 'cql: implement a new per-row TTL feature' from Nadav Har'El This series implements a new per-row TTL feature for CQL. The per-row TTL feature was requested in issue #13000. It is a feature that does not exist in Cassandra, and was inspired by DynamoDB's TTL feature - and under the hood uses the same implementation that we used in Alternator to implement this DynamoDB feature. The new per-row TTL feature is completely separate from CQL's existing per-write (and per-cell) TTL, and both will be available to users. In the per-row TTL feature, one column in the table is designated as the "TTL" column, and its value for a row is the expiration time for that row. The TTL column can be designated at table creation time, e.g.: ```cql CREATE TABLE tab ( id int PRIMARY KEY, t text, expiration timestamp TTL ); ``` Or after the table already exists with: ```cql ALTER TABLE tab TTL expiration ``` Expiration can also be disabled, with: ```cql ALTER TABLE tab TTL NULL ``` The new per-row TTL feature has two features that users have been asking for: 1. A user can change the value of just the TTL column - without rewriting the entire row - to change the expiration time of the entire row. 2. When an expired row is finally deleted, a CDC event about this deletion appears in the CDC log (if CDC is enabled), including - if a preimage is enabled - the content of the deleted row. To achieve the second goal (CDC events), a row is not guaranteed to disappear at exactly its expiration time (as CQL's original TTL feature guarantees). Rather, the row is deleted some time later, depending on `alternator_ttl_period_in_seconds`; Until the actual deletion, the row is still readable (and even writable). But we are guaranteed that when the row is finally deleted, the CDC event will come too. The implementation uses the same background thread used by Alternator to periodically scan for expired items and delete them. The expiration thread keeps the same metrics as it did for Alternator: * `scylla_expiration_scan_passes` * `scylla_expiration_scan_table` * `scylla_expiration_items_deleted` * `scylla_expiration_secondary_ranges_scanned` The series begins with a few small preparation patches, followed by the main part of the feature (which isn't big, since we are just enabling the pre-existing Alternator expiration machinary for CQL) and finally 30 tests (single-node and multi-node tests) and documentation. This series is a new feature, so traditionally would not be backported. However, I wouldn't be surprised if we will be requested to backport it so that customers will not need to wait for a new major release. Fixes #13000 Closes scylladb/scylladb#28320 * github.com:scylladb/scylladb: test/cqlpy: verify that a column can't be both STATIC and PRIMARY KEY docs/cql: document the new CQL per-row TTL feature test/cluster: tests for the new CQL per-row TTL feature test/cqlpy: tests for the new CQL per-row TTL feature test: set low alternator_ttl_period_in_seconds in CQL tests cql ttl: fix ALTER TABLE to disable TTL if column is dropped cql ttl: add setting/unsetting of TTL column to ALTER TABLE cql ttl: add TTL column support to CREATE TABLE and DESC TABLE ttl: add CQL support to Alternator's TTL expiration service alternator ttl: move TTL_TAG_KEY to a header file alternator ttl: remove unnecessary check of feature flag cql: add "cql_row_ttl" cluster feature alternator: fix error message if UpdateTimeToLive is not supported	2026-02-26 15:29:12 +01:00
Yaniv Michael Kaul	ead9961783	cql: vector: fix vector dimension type Switch vector dimension handling to fixed-width `uint32_t` type, update parsing/validation, and add boundary tests. The dimension is parsed as `unsigned long` at first which is guaranteed to be at least 32-bit long, which is safe to downcast to `uint32_t`. Move `MAX_VECTOR_DIMENSION` from `cql3_type::raw_vector` to `cql3_type` to ensure public visibility for checks outside the class. Add tests to verify the type boundaries. Fixes: https://scylladb.atlassian.net/browse/SCYLLADB-223 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Co-authored-by: Dawid Pawlik <dawid.pawlik@scylladb.com> Closes scylladb/scylladb#28762	2026-02-26 14:46:53 +02:00
Michael Litvak	4a60ee28a2	test/cqlpy/test_materialized_view.py: increase view build timeout The test test_build_view_with_large_row creates a materialized view and expects the view to be built with a timeout of 5 seconds. It was observed to fail because the timeout is too short on slow machines. Increase the timeout to 60 seconds to make the test less flaky on slow machines. Similarly for the other tests in the file that have a timeout for view build, increase the timeout to 60 seconds to be consistent and safer. Fixes SCYLLADB-769 Closes scylladb/scylladb#28817	2026-02-26 11:27:51 +02:00
Nadav Har'El	1d265e7d6d	test/cqlpy: verify that a column can't be both STATIC and PRIMARY KEY While adding the new syntax "TTL" to CREATE TABLE, I noticed that the parser actually allows a column to be defined as "STATIC PRIMARY KEY". So I add here a small test to verify that this is not really allowed: The syntax "c int STATIC PRIMARY KEY" is accepted, but then rejected by a later check. The syntax "c int PRIMARY KEY STATIC" is rejected as a syntax error. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-25 14:59:45 +02:00
Nadav Har'El	7a1351c6cf	test/cqlpy: tests for the new CQL per-row TTL feature This patch contains 27 functional tests (in the test/cqlpy framework) for the new CQL per-row TTL feature. The tests cover the TTL column configuration statements (CREATE TABLE, ALTER TABLE) as well as the actual item expiration or non-expiration depending on the value of the expiration-time column - and also CDC events generated on expiration and the metrics generated by the expiration process. These tests were written together with the code, as in "test-driven development", so they aim to cover every corner case considered during the development, and they reproduce every bug and misstep seen during the development process. As a result, they hopefully achieve very high code coverage - but since we don't have a working code-coverage tool, I can't report any specific code coverage numbers. These tests check everything which we can check on single-node cluster. The next patch will add additional multi-node tests for things we can't check here with a single node - such as the scheduling group used by the distributed work, the effect of dead nodes on the TTL functionality, and the process of rolling upgrade. The tests in this patch do NOT try to stress the background expiration scanning threads, or to check how they handle topology changes, large amounts of data or clusters spanning multiple DCs. These tests also don't test the performance impact of these scanning threads. Because the expiration scanning thread is identical to the one already used by Alternator TTL, we assume that many of these aspects were already tested for Alternator TTL and did not change when the same implementation is used for the new CQL feature. All new tests pass on ScyllaDB. Because the per-row TTL feature is a new ScyllaDB feature that does not exist on Cassandra, all these tests are skipped on Cassandra. Because some of these tests involve waiting for expiration, they can't be very quick. Still, because we set alternator_ttl_period_in_seconds to 0.5 seconds in the test framework, all 27 tests running sequentially finish in roughly 6 seconds total, which we consider acceptable. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-25 14:59:44 +02:00
Nadav Har'El	154cecda71	test: set low alternator_ttl_period_in_seconds in CQL tests In test/alternator/run we set alternator_ttl_period_in_seconds to a very low number (0.5 seconds) to allow TTL tests to expire items very quickly and finish quickly. Until now, we didn't need to do this for CQL tests, because they weren't using this Alternator-only feature. Now that CQL uses the same expiration feature with its original configuration parameter, we need to set it in CQL tests too. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-25 14:59:43 +02:00
Botond Dénes	8dbcd8a0b3	tools/scylla-sstable: create_table_in_cql_env(): register UDTs recursively It is not enough to go over all column types and register the UDTs. UDTs might be nested in other types, like collections. One has to do a traversal of the type tree and register every UDT on the way. That is what this patch does. This function is used by the query and write operations, which should now both work with nested UDTs. Add a test which fails before and passes after this patch.	2026-02-25 08:51:25 +02:00
Andrei Chekun	4a7d8cd99d	test.py: add explicit default values to pytest options Add explicit default values to pytest command line options to prevent issues when running tests with pytest's parallel execution where options are not present on upper conftest, so they're just not set at all.	2026-02-24 09:48:38 +01:00
Marcin Maliszkiewicz	c5dc086baf	Merge 'vector_search: return NaN for similarity_cosine with all-zero vectors' from Dawid Pawlik The ANN vector queries with all-zero vectors are allowed even on vector indexes with similarity function set to cosine. When enabling the rescoring option, those queries would fail as the rescoring calls `similarity_cosine` function underneath, causing an `InvalidRequest` exception as all-zero vectors were not allowed matching Cassandra's behaviour. To eliminate the discrepancy we want the all-zero vector `similarity_cosine` calls to pass, but return the NaN as the cosine similarity for zero vectors is mathematically incorrect. We decided not to use arbitrary values contrary to USearch, for which the distance (not to be confused with similarity) is defined as cos(0, 0) = 0, cos(0, x) = 1 while supporting the range of values [0, 2]. If we wanted to convert that to similarity, that would mean sim_cos(0, x) = 0.5, which does not support mathematical reasoning why that would be more similar than for example vectors marking obtuse angles. It's safe to assume that all-zero vectors for cosine similarity shouldn't make any impact, therefore we return NaN and eliminate them from best results. Adjusted the tests accordingly to check both proper Cassandra and Scylla's behaviour. Fixes: SCYLLADB-456 Backport to 2026.1 needed, as it fixes the bug for ANN vector queries using rescoring introduced there. Closes scylladb/scylladb#28609 * github.com:scylladb/scylladb: test/vector_search: add reproducer for rescoring with zero vectors vector_search: return NaN for similarity_cosine with all-zero vectors	2026-02-23 13:10:44 +01:00
Nadav Har'El	d01915131a	test/cqlpy: make test_indexing_paging_and_aggregation much faster Currently, test_secondary_index.py::test_indexing_paging_and_aggregation is very slow, and the slowest test in the test/cqlpy framework: It takes around 13 seconds on dev build, and because it is CPU-bound (doesn't sleep), it is much slower on debug builds. The reason for this slowness is that it needs to set up and read over 10,000 rows which is the default select_internal_page_size. But after the patches in pull request (#25368), we can configure select_internal_page_size, so in this patch we change the test to temporarily reduce this option to just 50, and then the test can reach the same code paths with just 142 rows instead of 20120 rows before this patch. As a result, the test should now be 140 times faster than it was before. In practice, because of some fixed overheads (the test creates several tables and indexes), in dev build mode the test run speedup is "only" 26-fold (to around half a second). I verified that removing the code added in `bb08af7` indeed makes the new shorter test fail - and this is the only test in test_secondary_index.py that starts to fail besides test_index_paging_group_by which is also related (so my revert didn't just break secondary indexing completely). So the shorter test is still a good regression test. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28268	2026-02-20 15:44:53 +02:00
Nadav Har'El	a1475dbeb9	test/cqlpy: make test testMapWithLargePartition faster Right now the slowest test in the test/cqlpy directory is cassandra_tests/validation/entities/collections_test.py:: testMapWithLargePartition This test (translated from Cassandra's unit test), just wants to verify that we can write and flush a partition with a single large map - with 200 items totalling around 2MB in size. 200 items totalling 2MB is large, but not huge, and is not the reason why this test was so so slow (around 9 seconds). It turns out that most of the test time was spent in Python code, preparing a 2MB random string the slowest possible way. But there is no need for this string to be random at all - we only care about the large size of the value, not the specific characters in it! Making the characters written in this text constant instead of random made it 20 times fast - it now takes less than half a second. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28271	2026-02-18 10:12:16 +03:00
Andrei Chekun	8c5c1096c2	test: ensure that that table used it cqlpy/test_tools have at least 3 pk One of the tests check that amount of the PK should be more than 2, but the method that creates it can return table with less keys. This leads to flakiness and to avoid it, this PR ensures that table will have at least 3 PK Closes scylladb/scylladb#28636	2026-02-16 09:50:58 +02:00
Dawid Pawlik	af0889d194	vector_search: return NaN for similarity_cosine with all-zero vectors The ANN vector queries with all-zero vectors are allowed even on vector indexes with similarity function set to cosine. When enabling the rescoring option, those queries would fail as the rescoring calls `similarity_cosine` function underneath, causing an `InvalidRequest` exception as all-zero vectors were not allowed matching Cassandra's behaviour. To eliminate the discrepancy we want the all-zero vector `similarity_cosine` calls to pass, but return the NaN as the cosine similarity for zero vectors is mathematically incorrect. We decided not to use arbitrary values contrary to USearch, for which the distance (not to be confused with similarity) is defined as cos(0, 0) = 0, cos(0, x) = 1 while supporting the range of values [0, 2]. If we wanted to convert that to similarity, that would mean sim_cos(0, x) = 0.5, which does not support mathematical reasoning why that would be more similar than for example vectors marking obtuse angles. It's safe to assume that all-zero vectors for cosine similarity shouldn't make any impact, therefore we return NaN and eliminate them from best results. Adjusted the tests accordingly to check both proper Cassandra and Scylla's behaviour. Fixes: SCYLLADB-456	2026-02-11 12:31:47 +01:00
Marcin Maliszkiewicz	0753d9fae5	Merge 'test: remove xfail marker from a few passing tests' from Nadav Har'El This patch fixes the few remaining cases of XPASS in test/cqlpy and test/alternator. These are tests which, when written, reproduced a bug and therefore were marked "xfail", but some time later the bug was fixed and we either did not notice it was ever fixed, or just forgot to remove the xfail marker. Removing the no-longer-needed xfail markers is good for test hygiene, but more importantly is needed to avoid regressions in those already-fixed areas (if a test is already marked xfail, it can start to fail in a new way and we wouldn't notice). Backport not needed, xpass doesn't bother anyone. Closes scylladb/scylladb#28441 * github.com:scylladb/scylladb: test/cqlpy: remove xfail from tests for fixed issue 7972 test/cqlpy: remove xfail from tests for fixed issue 10358 test/cqlpy: remove xfail from passing test testInvalidNonFrozenUDTRelation test/alternator: remove xfail from passing test_update_item_increases_metrics_for_new_item_size_only	2026-02-05 10:10:43 +01:00
Nadav Har'El	a63ad48b0f	test/cqlpy: remove xfail from tests for fixed issue 7972 The test test_to_json_double used to fail due to #7972, but this issue was already fixed in Scylla 5.1 and we didn't notice. So remove the xfail marker from this test, and also update another test which still xfails but no longer due to this issue. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-02 23:49:32 +02:00
Nadav Har'El	10b81c1e97	test/cqlpy: remove xfail from tests for fixed issue 10358 The tests testWithUnsetValues and testFilteringWithoutIndices used to fail due to #10358, but this issue was already fixed three years ago, when the UNSET-checking code was cleaned up, and the test is now passing. So remove the xfail marker from these tests. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-02 23:49:31 +02:00
Nadav Har'El	508bb97089	test/cqlpy: remove xfail from passing test testInvalidNonFrozenUDTRelation The test testInvalidNonFrozenUDTRelation used to fail due to #10632 (an incorrectly-printed column name in an error message) and was marked "xfail". But this issue has already been fixed two years ago, and the test is now passing. So remove the xfail marker.	2026-02-02 23:49:31 +02:00
Nadav Har'El	df69dbec2a	Merge ' cql3/statements/describe_statement: hide paxos state tables ' from Michał Jadwiszczak Paxos state tables are internal tables fully managed by Scylla and they shouldn't be exposed to the user nor they shouldn't be backed up. This commit hides those kind of tables from all listings and if such table is directly described with `DESC ks."tbl$paxos"`, the description is generated withing a comment and a note for the user is added. Fixes https://github.com/scylladb/scylladb/issues/28183 LWT on tablets and paxos state tables are present in 2025.4, so the patch should be backported to this version. Closes scylladb/scylladb#28230 * github.com:scylladb/scylladb: test/cqlpy: add reproducer for hidden Paxos table being shown by DESC cql3/statements/describe_statement: hide paxos state tables	2026-02-02 21:22:59 +02:00
Avi Kivity	cc03f5c89d	cql3: support literals and bind variables in selectors Add support for literals in the SELECT clause. This allows SELECT fn(column, 4) or SELECT fn(column, ?). Note, "SELECT 7 FROM tab" becomes valid in the grammar, but is still not accepted because of failed type inference - we cannot infer the type of 7, and don't have a favored type for literals (like C favors int). We might relax this later. In the WHERE clause, and Cassandra in the SELECT clause, type hints can also resolve type ambiguity: (bigint)7 or (text)?. But this is deferred to a later patch. A few changes to the grammar are needed on top of adding a `value` alternative to `unaliasedSelector`: - vectorSimilarityArg gained access to `value` via `unaliasedSelector`, so it loses that alternate to avoid ambiguity. We may drop `vectorSimilarityArg` later. - COUNT(1) became ambiguous via the general function path (since function arguments can now be literals), so we remove this case from the COUNT special cases, remaining with count(*). - SELECT JSON and SELECT DISTINCT became "ambiguous enough" for ANTLR to complain, though as far as I can tell `value` does not add real ambiguity. The solution is to commit early (via "=>") to a parsing path. Due to the loss of count(1) recognition in the parser, we have to special-case it in prepare. We may relax it to count any expression later, like modern Cassandra and SQL. Testing is awkward because of the type inference problem in top-level. We test via the set_intersection() function and via lua functions. Example: ``` cqlsh> CREATE FUNCTION ks.sum(a int, b int) RETURNS NULL ON NULL INPUT RETURNS int LANGUAGE LUA AS 'return a + b'; cqlsh> SELECT ks.sum(1, 2) FROM system.local; ks.sum(1, 2) -------------- 3 (1 rows) cqlsh> ``` (There are no suitable system functions!) Fixes https://scylladb.atlassian.net/browse/SCYLLADB-296 Closes scylladb/scylladb#28256	2026-02-02 00:06:13 +02:00
Pawel Pery	f49c9e896a	vector_search: allow full secondary indexes syntax while creating the vector index Vector Search feature needs to support creating vector indexes with additional filtering column. There will be two types of indexes: global which indexes vectors per table, and local which indexes vectors per partition key. The new syntaxes are based on ScyllaDB's Global Secondary Index and Local Secondary Index. Vector indexes don't use secondary indexes functionalities in any way - all indexing, filtering and processing data will be done on Vector Store side. This patch allows creating vector indexes using this CQL syntax: ``` CREATE TABLE IF NOT EXISTS cycling.comments_vs ( commenter text, comment text, comment_vector VECTOR <FLOAT, 5>, created_at timestamp, discussion_board_id int, country text, lang text, PRIMARY KEY ((commenter, discussion_board_id), created_at) ); CREATE CUSTOM INDEX IF NOT EXISTS global_ann_index ON cycling.comments_vs(comment_vector, country, lang) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'DOT_PRODUCT' }; CREATE CUSTOM INDEX IF NOT EXISTS local_ann_index ON cycling.comments_vs((commenter, discussion_board_id), comment_vector, country, lang) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'DOT_PRODUCT' }; ``` Currently, if we run these queries to create indexes we will receive such errors: ``` InvalidRequest: Error from server: code=2200 [Invalid query] message="Vector index can only be created on a single column" InvalidRequest: Error from server: code=2200 [Invalid query] message="Local index definition must contain full partition key only. Redundant column: XYZ" ``` This commit refactors `vector_index::check_target` to correctly validate columns building the index. Vector-store currently support filtering by native types, so the type of columns is checked. The first column from the list must be a vector (to build index based on these vectors), so it is also checked. Allowed types for columns are native types without counter (it is not possible to create a table with counter and vector) and without duration (it is not possible to correctly compare durations, this type is even not allowed in secondary indexes). This commits adds cqlpy test to check errors while creating indexes. Fixes: SCYLLADB-298 This needs to be backported to version 2026.1 as this is a fix for filtering support. Closes scylladb/scylladb#28366	2026-01-30 01:14:31 +02:00
Avi Kivity	3d1558be7e	test: remove xfail markers from SELECT JSON count(*) tests These were marked xfail due to #8077 (the column name was wrong), but it was fixed long ago for 5.4 (exact commit not known). Remove the xfail markers to prevent regressions. Closes scylladb/scylladb#28432	2026-01-29 21:56:00 +02:00
Nadav Har'El	1454228a05	test/cqlpy: fix "assertion rewriting" in translated Cassandra tests One of the best features of the pytest framework is "assertion rewriting": If your test does for example "assert a + 1 == b", the assertion is "rewritten" so that if it fails it tells you not only that "a+1" and "b" are not equal, what the non-equal values are, how they are not equal (e.g., find different elements of arrays) and how each side of the equality was calculated. But pytest can only "rewrite" assertion that it sees. If you call a utility function checksomething() from another module and that utility function calls assert, it will not be able to rewrite it, and you'll get ugly, hard-to-debug, assertion failures. This problem is especially noticable in tests we translated from Cassandra, in test/cqlpy/cassandra_tests. Those tests use a bunch of assertion-performing utility functions like assertRows() et al. Those utility functions are defined in a separate source file, porting.py, so by default do not get their assertions rewritten. We had a solution for this: test/cqlpy/cassandra_test/__init__.py had: pytest.register_assert_rewrite("cassandra_tests.porting") This tells pytest to rewrite assertions in porting.py the first time that it is imported. It used to work well, but recently it stopped working. This is because we change the module paths recently, and it should be written as test.cqlpy.cassandra_tests.porting. I verified by editing one of the cassandra_tests to make a bad check that indeed this statement stopped working, and fixing the module path in this way solves it, and makes assertion rewriting work again. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28411	2026-01-28 18:34:57 +02:00
Yaron Kaikov	7c49711906	test/cqlpy: Remove redundant pytest.register_assert_rewrite call During test.py run, noticed this warning: ``` 10:38:22 test/cqlpy/cassandra_tests/validation/operations/insert_update_if_condition_test.py:14: 32 warnings 10:38:22 /jenkins/workspace/releng-testing/scylla-ci/scylla/test/cqlpy/cassandra_tests/validation/operations/insert_update_if_condition_test.py:14: PytestAssertRewriteWarning: Module already imported so cannot be rewritten: test.cqlpy.cassandra_tests.porting 10:38:22 pytest.register_assert_rewrite('test.cqlpy.cassandra_tests.porting') ``` The insert_update_if_condition_test.py was calling pytest.register_assert_rewrite() for the porting module, but this registration is already handled by cassandra_tests/__init__.py which is automatically loaded before any test runs. Closes scylladb/scylladb#28409	2026-01-28 13:17:05 +02:00
Gleb Natapov	9daa109d2c	test: get rid of consistent_cluster_management usage in test consistent_cluster_management is deprecated since scylla-5.2 and no longer used by Scylladb, so it should not be used by test either. Closes scylladb/scylladb#28340	2026-01-27 11:31:30 +01:00

1 2 3 4 5 ...

347 Commits