scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-20 16:40:35 +00:00

Author	SHA1	Message	Date
Avi Kivity	59ec93b86b	Merge 'Allow arbitrary tablet boundaries and count' from Tomasz Grabiec There are several reasons we want to do that. One is that it will give us more flexibility in distributing the load. We can subdivide tablets at any token, and achieve more evenly-sized tablets. In particular, we can isolate large partitions into separate tablets. We can also split and merge incrementally individual tablets. Currently, we do it for the whole table or nothing, which makes splits and merges take longer and cause wide swings of the count. This is not implemented in this PR yet, we still split/merge the whole table. Another reason is vnode to tablets migration. We now could construct a tablet map which matches exactly the vnode boundaries, so migration can happen transparently from CQL-coordinator point of view. Tablet count is still a power-of-two by default for newly created tables. It may be different if tablet map is created by non-standard means, or if per-table tablet option "pow2_count" is set to "false". build/release/scylla perf-tablets: Memory footprint for 131k tablets increased from 56 MiB to 58.1 MiB (+3.5%) Before: ``` Generating tablet metadata Total tablet count: 131072 Size of tablet_metadata in memory: 57456 KiB Copied in 0.014346 [ms] Cleared in 0.002698 [ms] Saved in 1234.685303 [ms] Read in 445.577881 [ms] Read mutations in 299.596313 [ms] 128 mutations Read required hosts in 247.482742 [ms] Size of canonical mutations: 33.945053 [MiB] Disk space used by system.tablets: 1.456761 [MiB] Tablet metadata reload: full 407.69ms partial 2.65ms ``` After: ``` Generating tablet metadata Total tablet count: 131072 Size of tablet_metadata in memory: 59504 KiB Copied in 0.032475 [ms] Cleared in 0.002965 [ms] Saved in 1093.877441 [ms] Read in 387.027100 [ms] Read mutations in 255.752121 [ms] 128 mutations Read required hosts in 211.202805 [ms] Size of canonical mutations: 33.954453 [MiB] Disk space used by system.tablets: 1.450162 [MiB] Tablet metadata reload: full 354.50ms partial 2.19ms ``` Closes scylladb/scylladb#28459 * github.com:scylladb/scylladb: test: boost: tablets: Add test for merge with arbitrary tablet count tablets, database: Advertise 'arbitrary' layout in snapshot manifest tablets: Introduce pow2_count per-table tablet option tablets: Prepare for non-power-of-two tablet count tablets: Implement merged tablet_map constructor on top of for_each_sibling_tablets() tablets: Prepare resize_decision to hold data in decisions tablets: table: Make storage_group handle arbitrary merge boundaries tablets: Make stats update post-merge work with arbitrary merge boundaries locator: tablets: Support arbitrary tablet boundaries locator: tablets: Introduce tablet_map::get_split_token() dht: Introduce get_uniform_tokens()	2026-04-15 18:57:22 +03:00
Nadav Har'El	1eb8d170dd	Merge 'vector_index: allow recreating vector indexes on the same column' from Dawid Pawlik This series allows creating multiple vector indexes on the same column so users can rebuild an index without losing query availability. The intended flow is: 1. Create a new vector index on a column that already has one. 2. Keep serving ANN queries from the old index while the new one is being built. 3. Verify the new index is ready. 4. Automatically switch to the remaining index. 5. Drop the old index. To make that deterministic, `index_version` is changed from the base table schema version to a real creation timeuuid. When multiple vector indexes exist on the same column, ANN query planning now picks the index according to the routing implemented in Vector Store (newest serving index). This keeps queries on the old index until it the new one is up and ready. This patch also removes the create-time restriction that rejected a second vector index on the same column. Name collisions are still rejected as before. Test coverage is updated accordingly: - Scylla now verifies that two vector indexes can coexist on the same column. - Cassandra/SAI behavior is still covered and is still expected to reject duplicate indexes on the same column. Fixes: VECTOR-610 Closes scylladb/scylladb#29407 * github.com:scylladb/scylladb: docs: document vector index metadata and duplicate handling test/cqlpy: cover vector index duplicate creation rules vector_index: allow multiple named indexes on one column vector_index: store `index_version` as creation timeuuid	2026-04-15 14:40:15 +03:00
Botond Dénes	5891efc2ca	Merge 'service: add missing replicas if tablet rebuild was rolled back' from Aleksandra Martyniuk RF change of tablet keyspace starts tablet rebuilds. Even if any of the rebuilds is rolled back (because pending replica was excluded), rf change request finishes successfully. In this case we end up with the state of the replicas that isn't compatible with the expected keyspace replication. Modify topology coordinator so that if it were to be idle, it starts checking if there are any missing replicas. It moves to transition_state::tablet_migration and run required rebuilds. If a new RF change request encounters invalid state of replicas it fails. The state will be fixed later and the analogical ALTER KEYSPACE statement will be allowed. Fixes: SCYLLADB-109. Requires backport to all versions with tablet keyspace rf change. Closes scylladb/scylladb#28709 * github.com:scylladb/scylladb: test: add test_failed_tablet_rebuild_is_retried_on_alter test: add a test to ensure that failed rebuilds are retried service: fail ALTER KEYSPACE if replicas do not satisfy the replication service: retry failed tablet rebuilds service: maybe_start_tablet_migration returns std::optional<group0_guard>	2026-04-15 14:40:15 +03:00
Tomasz Grabiec	50fbac6ea6	tablets: Introduce pow2_count per-table tablet option By default it's true, in which case tablet count of the table is rounded up to a power of two. This option allows lifting this, in which case the count can be arbitrary. This will allow testing the logic of arbitrary tablet count.	2026-04-15 10:40:56 +02:00
Dawid Pawlik	f40ab83d02	docs: document vector index metadata and duplicate handling Document the new vector index behavior in the user-facing and developer docs. Describe `index_version` as a creation timeuuid stored in `system_schema.indexes`, clarify that recreating an index changes it while ALTER TABLE does not, and document that Scylla allows multiple named vector indexes on the same column while still rejecting unnamed duplicates.	2026-04-14 12:21:38 +02:00
Nadav Har'El	bb2fb810bb	cql3: document WRITETIME() and TTL() for elements of map, set or UDT Add to the SELECT documentation (docs/cql/dml/select.rst) documentation of the new ability to select WRITETIME() and TTL() of a single element of map, set or UDT. Also in the TTL documentation (docs/cql/time-to-live.rst), which already had a section on "TTL for a collection", add a mention of the ability to read a single element's TTL(), and an example. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-12 14:28:01 +03:00
Szymon Wasik	573def7cd8	cql: accept source_model option and show options in DESCRIBE Accept the Cassandra SAI 'source_model' option for vector indexes. This option is used by Cassandra libraries (e.g., CassIO, LangChain) to tag vector indexes with the name of the embedding model that produced the vectors. ScyllaDB does not use the source_model value but stores it and includes it in the DESCRIBE INDEX output for Cassandra compatibility. Additionally, extend vector_index::describe() to emit a WITH OPTIONS = {...} clause containing all user-provided index options (filtering out system keys: target, class_name, index_version). This makes options like similarity_function, source_model, etc. visible in DESCRIBE output.	2026-04-09 17:20:03 +02:00
Szymon Wasik	80a2e4a0ab	cql: add Cassandra SAI (StorageAttachedIndex) compatibility Libraries such as CassIO, LangChain, and LlamaIndex create vector indexes using Cassandra's StorageAttachedIndex (SAI) class name. This commit lets ScyllaDB accept these statements without modification. When a CREATE CUSTOM INDEX statement specifies an SAI class name on a vector column, ScyllaDB automatically rewrites it to the native vector_index implementation. Accepted class names (case-insensitive): - org.apache.cassandra.index.sai.StorageAttachedIndex - StorageAttachedIndex - sai SAI on non-vector columns is rejected with a clear error directing users to a secondary index instead. The SAI detection and rewriting logic is extracted into a dedicated static function (maybe_rewrite_sai_to_vector_index) to keep the already-long validate_while_executing method manageable. Multi-column (local index) targets and nonexistent columns are skipped with continue — the former are treated as filtering columns by vector_index::check_target(), and the latter are caught later by vector_index::validate(). Tests that exercise features common to both backends (basic creation, similarity_function, IF NOT EXISTS, bad options, etc.) now use the SAI class name with the skip_on_scylla_vnodes fixture so they run against both ScyllaDB and Cassandra. ScyllaDB-specific tests continue to use USING 'vector_index' with scylla_only.	2026-04-09 17:20:03 +02:00
Aleksandra Martyniuk	200dc084c5	service: fail ALTER KEYSPACE if replicas do not satisfy the replication RF change of tablet keyspace starts tablet rebuilds. Even if any of the rebuilds is rolled back (because pending replica was excluded), rf change request finishes successfully. Yet, we are left with not enough replicas. Then, a next new rf change request handler would generate a rebuild of two replicas of the same tablet. Such a transition would not be applied, as we don't allow many pending replicas. An exception would be thrown and the request would be retried infinitely, blocking the topology coordinator. Throw and fail rf change request if there is not enough replicas. The request should be retried later, after the issue is fixed by the mechanism introduced in previous changes.	2026-03-27 17:29:26 +01:00
Botond Dénes	34473302b0	Merge 'docs: document existing guardrails' from Andrzej Jackowski This patch series introduces a new documentation for exiting guardrails. Moreover: - Warning / failure messages of recently added write CL guardrails (SCYLLADB-259) are rephrased, so all guardrails have similar messages. - Some new tests are added, to help verify the correctness of the documentation and avoid situations where the documentation and implementation diverge. Fixes: [SCYLLADB-257](https://scylladb.atlassian.net/browse/SCYLLADB-257) No backport, just new docs and tests. [SCYLLADB-257]: https://scylladb.atlassian.net/browse/SCYLLADB-257?atlOrigin=eyJpIjoiNWRkNTljNzYxNjVmNDY3MDlhMDU5Y2ZhYzA5YTRkZjUiLCJwIjoiZ2l0aHViLWNvbS1KU1cifQ Closes scylladb/scylladb#29011 * github.com:scylladb/scylladb: test: add new guardrail tests matching documentation scenarios test: add metric assertions to guardrail replication strategy tests test: use regex matching in guardrail replication strategy tests test: extract ks_opts helper in test_guardrail_replication_strategy docs: document CQL guardrails cql: improve write consistency level guardrail messages	2026-03-20 08:56:00 +02:00
Andrzej Jackowski	9b24d9ee7d	docs: document CQL guardrails Add docs/cql/guardrails.rst covering replication factor, replication strategy, write consistency level, and compact storage guardrails. Fixes: SCYLLADB-257	2026-03-19 12:49:41 +01:00
Piotr Smaron	a2ad57062f	docs/cql: clarify WHERE clause boolean limitations Document that `SELECT ... WHERE` clause currently accepts only conjunctions of relations joined by `AND` (`OR` is not supported), and that parentheses cannot be used to group boolean subexpressions. Add an unsupported query example and point readers to equivalent `IN` rewrites when applicable. This problem has been raised by one of our users in https://forum.scylladb.com/t/error-parsing-query-or-unsupported-statement/5299, and while one could infer answer to user's question by looking at the syntax of the `SELECT ... WHERE`, it's not immediately obvious to non-advanced users, so clarifying these concepts is justified. Fixes: SCYLLADB-1116 Closes scylladb/scylladb#29100	2026-03-19 09:47:22 +02:00
Michał Hudobski	40d180a7ef	docs: update vector search filtering to reflect primary key support only Remove outdated references to filtering on columns provided in the index definition, and remove the note about equal relations (= and IN) being the only supported operations. Vector search filtering currently supports WHERE clauses on primary key columns only. Closes scylladb/scylladb#28949	2026-03-16 17:16:16 +02:00
Dawid Mędrek	7564a56dc8	Merge 'tombstone_gc: allow using repair-mode tombstone gc with RF=1 tables' from Botond Dénes Currently, repair-mode tombstone-gc cannot be used on tables with RF=1. We want to make repair-mode the default for all tablet tables (and more, see https://github.com/scylladb/scylladb/issues/22814), but currently a keyspace created with RF=1 and later altered to RF>1 will end up using timeout-mode tombstone gc. This is because the repair-mode tombstone-gc code relies on repair history to determine the gc-before time for keys/ranges. RF=1 tables cannot run repairs so they will have empty repair history and consequently won't be able to purge tombstones. This PR solves this by keeping a registry of RF=1 tables and consulting this registry when creating `tombstone_gc_state` objects. If the table is RF=1, tombstone-gc will work as if the table used immediate-mode tombstone-gc. The registry is updated on each replication update. As soon as the table is not RF=1 anymore, the tombstone-gc reverts to the natural repair-mode behaviour. After this PR, tombstone-gc defaults to repair-mode for all tables, regardless of RF and tablets/vnodes. Fixes: SCYLLADB-106. New feature, no backport required. Closes scylladb/scylladb#22945 * github.com:scylladb/scylladb: test/{boost,cluster}: add test for tombstone gc mode=repair with RF=1 tombstone_gc: allow use of repair-mode for RF=1 tables replica/table: update rf=1 table registry in shared tombstone-gc state tombstone_gc: tombstone_gc_before_getter: consider RF when getting gc before time tombstone_gc: unpack per_table_history_maps tombstone_gc: extract _group0_gc_time from per_table_history_map tombstone_gc: drop tombstone_gc_state(nullptr) ctor and operator bool() test/lib/random_schema: use timeout-mode tombstone_gc tombstone_gc_options: add C++ friendly constructor test: move away from tombstone_gc_state(nullptr) ctor treewide: move away from tombstone_gc_state(nullptr) ctor sstable: move away from tombstone_gc_mode::operator bool() replica/table: add get_tombstone_gc_state() compaction: use tombstone_gc_state with value semantics db/row_cache: use tombstone_gc_state with value semantics tombstone_gc: introduce tombstone_gc_state::for_tests()	2026-03-05 11:50:31 +01:00
Botond Dénes	5998a859f7	tombstone_gc: allow use of repair-mode for RF=1 tables Modify the methods which calculate the default gc mode as well as that which validates whether repair-mode can be used at all, so both accepts use of repair-mode on RF=1 tables. This de-facto changes the default tombstone-gc to repair-mode for all tables. Documentation is updated accordingly. Some tests need adjusting: * cqlpy/test_select_from_mutation_fragments.py: disable GC for some test cases because this patch makes tombstones they write subject to GC when using defaults. * test/cluster/test_mv.py::test_mv_tombstone_gc_not_inherited used repair-mode as a non-default for the base table and expected the MV to revert to default. Another mode has to be used as the non-default (immediate). * test/cqlpy/test_tools.py::test_scylla_sstable_dump_schema: don't compare tombstone_gc schema extension when comparing dumped schema vs. original. The tool's schema loader doesn't have access to the keyspace definition so it will come up with different defaults for tombstone-gc. * test/boost/row_cache_test.cc::test_populating_cache_with_expired_and_nonexpired_tombstones sets tombstone expiry assuming the tombstone-gc timeout-mode default. Change the CREATE TABLE statement to set the expected mode.	2026-03-04 09:44:24 +02:00
Dimitrios Symonidis	80b74d7df2	tablet options: Add max_tablet_count tablet option to enforce tablet count upper bounds Introduced a new max_tablet_count tablet option that caps the maximum number of tablets a table can have. This feature is designed primarily for backup and restore workflows. During backup, when load balancing is disabled for snapshot consistency, the current tablet count is recorded in the backup manifest. During restore, max_tablet_count is set to this recorded value, ensuring the restored table's tablet count never exceeds the original snapshot's tablet distribution. This guarantee enables efficient file-based SSTable streaming during restore, as each SSTable remains fully contained within a single tablet boundary. Closes scylladb/scylladb#28450	2026-03-03 11:19:24 +03:00
Sergey Zolotukhin	33923578eb	Update DROP INDEX statement documentation Clarify behavior of DROP INDEX during ongoing builds. Closes scylladb/scylladb#28659	2026-03-02 17:31:23 +02:00
Nadav Har'El	34c0c64d9d	docs/cql: document the new CQL per-row TTL feature Add user-facing documentation for the new CQL per-row TTL feature, in docs/cql/cql-extensions.md. Also mention (and link) the new alternative TTL feature in a few relevant documents about the old (per-write) TTL, about CDC, and about the CREATE TABLE and ALTER TABLE commands. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-02-25 14:59:44 +02:00
Aleksandra Martyniuk	eefe66b2b2	docs: describe conversion to rack-list RF Fixes: SCYLLADB-398	2026-02-23 12:41:33 +01:00
Avi Kivity	cc03f5c89d	cql3: support literals and bind variables in selectors Add support for literals in the SELECT clause. This allows SELECT fn(column, 4) or SELECT fn(column, ?). Note, "SELECT 7 FROM tab" becomes valid in the grammar, but is still not accepted because of failed type inference - we cannot infer the type of 7, and don't have a favored type for literals (like C favors int). We might relax this later. In the WHERE clause, and Cassandra in the SELECT clause, type hints can also resolve type ambiguity: (bigint)7 or (text)?. But this is deferred to a later patch. A few changes to the grammar are needed on top of adding a `value` alternative to `unaliasedSelector`: - vectorSimilarityArg gained access to `value` via `unaliasedSelector`, so it loses that alternate to avoid ambiguity. We may drop `vectorSimilarityArg` later. - COUNT(1) became ambiguous via the general function path (since function arguments can now be literals), so we remove this case from the COUNT special cases, remaining with count(*). - SELECT JSON and SELECT DISTINCT became "ambiguous enough" for ANTLR to complain, though as far as I can tell `value` does not add real ambiguity. The solution is to commit early (via "=>") to a parsing path. Due to the loss of count(1) recognition in the parser, we have to special-case it in prepare. We may relax it to count any expression later, like modern Cassandra and SQL. Testing is awkward because of the type inference problem in top-level. We test via the set_intersection() function and via lua functions. Example: ``` cqlsh> CREATE FUNCTION ks.sum(a int, b int) RETURNS NULL ON NULL INPUT RETURNS int LANGUAGE LUA AS 'return a + b'; cqlsh> SELECT ks.sum(1, 2) FROM system.local; ks.sum(1, 2) -------------- 3 (1 rows) cqlsh> ``` (There are no suitable system functions!) Fixes https://scylladb.atlassian.net/browse/SCYLLADB-296 Closes scylladb/scylladb#28256	2026-02-02 00:06:13 +02:00
Pawel Pery	f49c9e896a	vector_search: allow full secondary indexes syntax while creating the vector index Vector Search feature needs to support creating vector indexes with additional filtering column. There will be two types of indexes: global which indexes vectors per table, and local which indexes vectors per partition key. The new syntaxes are based on ScyllaDB's Global Secondary Index and Local Secondary Index. Vector indexes don't use secondary indexes functionalities in any way - all indexing, filtering and processing data will be done on Vector Store side. This patch allows creating vector indexes using this CQL syntax: ``` CREATE TABLE IF NOT EXISTS cycling.comments_vs ( commenter text, comment text, comment_vector VECTOR <FLOAT, 5>, created_at timestamp, discussion_board_id int, country text, lang text, PRIMARY KEY ((commenter, discussion_board_id), created_at) ); CREATE CUSTOM INDEX IF NOT EXISTS global_ann_index ON cycling.comments_vs(comment_vector, country, lang) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'DOT_PRODUCT' }; CREATE CUSTOM INDEX IF NOT EXISTS local_ann_index ON cycling.comments_vs((commenter, discussion_board_id), comment_vector, country, lang) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'DOT_PRODUCT' }; ``` Currently, if we run these queries to create indexes we will receive such errors: ``` InvalidRequest: Error from server: code=2200 [Invalid query] message="Vector index can only be created on a single column" InvalidRequest: Error from server: code=2200 [Invalid query] message="Local index definition must contain full partition key only. Redundant column: XYZ" ``` This commit refactors `vector_index::check_target` to correctly validate columns building the index. Vector-store currently support filtering by native types, so the type of columns is checked. The first column from the list must be a vector (to build index based on these vectors), so it is also checked. Allowed types for columns are native types without counter (it is not possible to create a table with counter and vector) and without duration (it is not possible to correctly compare durations, this type is even not allowed in secondary indexes). This commits adds cqlpy test to check errors while creating indexes. Fixes: SCYLLADB-298 This needs to be backported to version 2026.1 as this is a fix for filtering support. Closes scylladb/scylladb#28366	2026-01-30 01:14:31 +02:00
Botond Dénes	7ac32097da	docs/cql/ddl.rst: Tombstones GC: explain propagation delay This parameter was not mentioned at all anywhere in the documentation. Add an explanation of this parameter: why we need it, what is the default and how it can be changed. Closes scylladb/scylladb#28132	2026-01-27 16:05:52 +01:00
Szymon Wasik	927aebef37	Add vector search documentation links to CQL docs This patch adds links to the Vector Search documentation that is hosted together with Scylla Cloud docs to the CQL documentation. It also make the note about supported capabilities consistent and removes the experimental label as the feature is GAed. Fixes: SCYLLADB-371 Closes scylladb/scylladb#28312	2026-01-22 16:46:38 +01:00
Botond Dénes	7d2e6c0170	Merge 'config: add enforce_rack_list option' from Aleksandra Martyniuk Add enforce_rack_list option. When the option is set to true, all tablet keyspaces have rack list replication factor. When the option is on: - CREATE STATEMENT always auto-extends rf to rack lists; - ALTER STATEMENT fails when there is numeric rf in any DC. The flag is set to false by default and a node needs to be restarted in order to change its value. Starting a node with enforce_rack_list option will fail, if there are any tablet keyspaces with numeric rf in any DC. enforce_rack_list is a per-node option and a user needs to ensure that no tablet keyspace is altered or created while nodes in the cluster don't have the consistent value. Mark rf_rack_valid_keyspaces as deprecated. Fixes: https://github.com/scylladb/scylladb/issues/26399. New feature; no backport needed Closes scylladb/scylladb#28084 * github.com:scylladb/scylladb: test: add test for enforce_rack_list option db: mark rf_rack_valid_keyspaces as deprecated config: add enforce_rack_list option Revert "alternator: require rf_rack_valid_keyspaces when creating index"	2026-01-22 10:27:35 +02:00
Karol Nowacki	41dcf12463	vector_search: doc: Document new index option Adds documentation for the rescoring option for vector search indexes.	2026-01-20 21:01:45 +01:00
Aleksandra Martyniuk	7dc371f312	db: mark rf_rack_valid_keyspaces as deprecated Mark rf_rack_valid_keyspaces option as deprecated. User should use enforce_rack_list option instead. The option can still be used and it does not change it's behavior. Docs is updated accordingly.	2026-01-20 09:58:57 +01:00
Karol Nowacki	324b829263	vector_search: doc: Document new index options Adds documentation for the `quantization` and `oversampling` options for vector search indexes.	2026-01-19 10:28:46 +01:00
Nadav Har'El	34d28475d9	Merge 'Implement Vector Search filtering API' from Dawid Pawlik Since Vector Store service filtering API has been implemented (scylladb/vector-store#334), there is a need for the implementation of Scylla side part. This patch should implement a `statement_restrictions` parsing into Vector Store filtering API compatible JSON objects. Those objects should be added to ANN query vector POST requests as `filter` object. After this patch, the subset of all operations ([Vector Search Filtering Milestone 1](https://scylladb.atlassian.net/wiki/spaces/RND/pages/156729450/Vector+Search+Filtering+Design+Document#Milestone-1)) happy path should be completed, allowing users to filter on primary key columns with single column `=` and `IN` or multiple column `()=()` and `() IN ()`. The restrictions for other operations should be implemented in a PR on Vector Store service side. --- This PR implements parsing the `statement_restrictions` into Vector Store filtering API compatible JSON objects. The JSON objects are created and used in ANN vector queries with filtering. It closes the Scylla side implementation of Vector Search filtering milestone 1. Unit tests for `statement_restrictions` parsing are added. Integration tests will be added on Vector Store service side PR. --- Fixes: SCYLLADB-249 New feature, should land into 2026.1 Closes scylladb/scylladb#28109 * github.com:scylladb/scylladb: docs: update documentation on filtering with vector queries test/vector_search: add test for filtered ANN with VS mock test/vector_search: add restriction to JSON conversion unit tests vector_search: cql: construct and use filter in ANN vector queries select_statement: do not require post query ordering for vector queries vector_search: add `statement_restrictions` to JSON parsing	2026-01-18 16:11:29 +02:00
Dawid Pawlik	383f9e6e56	docs: update documentation on filtering with vector queries Add a description of available filtering options with ANN vector queries. Provide an example of such query and a reference to `WHERE` clause restrictions.	2026-01-16 11:18:23 +01:00
Botond Dénes	122b7847e5	Merge 'index: Accept view properties in CREATE INDEX' from Dawid Mędrek Problem ------- Secondary indexes are implemented via materialized views under the hood. The way an index behaves is determined by the configuration of the view. Currently, it can be modified by performing the CQL statement `ALTER MATERIALIZED VIEW` on it. However, that raises some concerns. Consider, for instance, the following scenario: 1. The user creates a secondary index on a table. 2. In parallel, the user performs writes to the base table. 3. The user modifies the underlying materialized view, e.g. by setting the `synchronous_updates` to `true` [1]. Some of the writes that happened before step 3 used the default value of the property (which is `false`). That had an actual consequence on what happened later on: the view updates were performed asynchronously. Only after step 3 had finished did it change. Unfortunately, as of now, there is no way to avoid a situation like that. Whenever the user wants to configure a secondary index they're creating, they need to do it in another schema change. Since it's not always possible to control how the database is manipulated in the meantime, it leads to problems like the one described. That's not all, though. The fact that it's not possible to configure secondary indexes is inconsistent with other schema entities. When it comes to tables or materialized views, the user always have a means to set some or even all of the properties during their creation. Solution -------- The solution to this problem is extending the `CREATE INDEX` CQL statement by view properties. The syntax is of form: ``` > CREATE INDEX <index name> > .. ON <keyspace>.<table> (<columns>) > .. WITH <properties> ``` where `<properties>` corresponds to both index-specific and view properties [2, 3]. View properties can only be used with indexes implemented with materialized views; for example, it will be impossible to create a vector index when specifying any view property (see examples below). When a view property is provided, it will be applied when creating the underlying materialized view. The behavior should be similar to how other CQL statements responsible for creating schema entities work. High-level implementation strategy ---------------------------------- 1. Make auxiliary changes. 2. Introduce data structures representing the new set of index properties: both index-specific and those corresponding to the underlying view. 3. Extend `CREATE INDEX` to accept view properties. 4. Extend `DESCRIBE INDEX` and other `DESCRIBE` statements to include view properties in their output. User documentation is also updated at the steps to reflect the corresponding changes. Implementation considerations ----------------------------- There are a number of schema properties that are now obsolete. They're accepted by other CQL statements, but they have no effect. They include: * `index_interval` * `replicate_on_write` * `populate_io_cache_on_flush` * `read_repair_chance` * `dclocal_read_repair_chance` If the user tries to create a secondary index specifying any of those keywords, the statement will fail with an appropriate error (see examples below). Unlike materialized views, we forbid specifying the clustering order when creating a secondary index [4]. This limitation may be lifted later on, but it's a detail that may or may not prove troublesome. It's better to postpone covering it to when we have a better perspective on the consequences it would bring. Examples -------- Good examples ``` > CREATE INDEX idx ON ks.t (v); > CREATE INDEX idx ON ks.t (v) WITH comment = 'ok view property'; > CREATE INDEX idx ON ks.t (v) .. WITH comment = 'multiple view properties are ok' .. AND synchronous_updates = true; > CREATE INDEX idx ON ks.t (v) .. WITH comment = 'default value ok' .. AND synchronous_updates = false; ``` Bad examples ``` > CREATE INDEX idx ON ks.t (v) WITH replicate_on_write = true; SyntaxException: Unknown property 'replicate_on_write' > CREATE INDEX idx ON ks.t (v) .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="Cannot specify options for a non-CUSTOM index" > CREATE CUSTOM INDEX idx ON ks.t (v) .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="CUSTOM index requires specifying the index class" > CREATE CUSTOM INDEX idx ON ks.t (v) .. USING 'vector_index' .. WITH OPTIONS = {'option1': 'value1'} .. AND comment = 'some text'; InvalidRequest: Error from server: code=2200 [Invalid query] message="You cannot use view properties with a vector index" > CREATE INDEX idx ON ks.t (v) WITH CLUSTERING ORDER BY (v ASC); InvalidRequest: Error from server: code=2200 [Invalid query] message="Indexes do not allow for specifying the clustering order" ``` and so on. For more examples, see the relevant tests. References: [1] https://docs.scylladb.com/manual/branch-2025.4/cql/cql-extensions.html#synchronous-materialized-views [2] https://docs.scylladb.com/manual/branch-2025.4/cql/secondary-indexes.html#create-index [3] https://docs.scylladb.com/manual/branch-2025.4/cql/mv.html#mv-options [4] https://docs.scylladb.com/manual/branch-2025.4/cql/dml/select.html#ordering-clause Fixes scylladb/scylladb#16454 Backport: not needed. This is an enhancement. Closes scylladb/scylladb#24977 * github.com:scylladb/scylladb: cql3: Extend DESC INDEX by view properties cql3: Forbid using CLUSTERING ORDER BY when creating index cql3: Extend CREATE INDEX by MV properties cql3/statements/create_index_statement: Allow for view options cql3/statements/create_index_statement: Rename member cql3/statements/index_prop_defs: Re-introduce index_prop_defs cql3/statements/property_definitions: Add extract_property() cql3/statements/index_prop_defs.cc: Add namespace cql3/statements/index_prop_defs.hh: Rename type cql3/statements/view_prop_defs.cc: Move validation logic into file cql3/statements: Introduce view_prop_defs.{hh,cc} cql3/statements/create_view_statement.cc: Move validation of ID schema/schema.hh: Do not include index_prop_defs.hh	2026-01-14 09:54:27 +02:00
tomek7667	19313d67e3	docs/cql/ddl.rst: fix formatting of deprecated initial sub-option Closes scylladb/scylladb#26852	2026-01-13 08:55:24 +02:00
Botond Dénes	60570d7114	Merge 'topology coordinator: restrict node join/remove to preserve RF-rack validity' from Michael Litvak Allow creating materialized views and secondary indexes in a tablets keyspace only if it's RF-rack-valid, and enforce RF-rack-validity while the keyspace has views by restricting some operations: * Altering a keyspace's RF if it would make the keyspace RF-rack-invalid * Adding a node in a new rack * Removing / Decommissioning the last node in a rack Previously the config option `rf_rack_valid_keyspaces` was required for creating views. We now remove this restriction - it's not needed because we always maintain RF-rack-validity for keyspaces with views. The restrictions are relevant only for keyspaces with numerical RF. Keyspace with rack-list-based RF are always RF-rack-valid. Fixes scylladb/scylladb#23345 Fixes https://github.com/scylladb/scylladb/issues/26820 backport to relevant versions for materialized views with tablets since it depends on rf-rack validity Closes scylladb/scylladb#26354 * github.com:scylladb/scylladb: docs: update RF-rack restrictions cql3: don't apply RF-rack restrictions on vector indexes cql3: add warning when creating mv/index with tablets about rf-rack service/tablet_allocator: always allow tablet merge of tables with views locator: extend rf-rack validation for rack lists test: test rf-rack validity when creating keyspace during node ops locator: fix rf-rack validation during node join/remove test: test topology restrictions for views with tablets test: add test_topology_ops_with_rf_rack_valid topology coordinator: restrict node join/remove to preserve RF-rack validity topology coordinator: add validation to node remove locator: extend rf-rack validation functions view: change validate_view_keyspace to allow MVs if RF=Racks db: enforce rf-rack-validity for keyspaces with views replica/db: add enforce_rf_rack_validity_for_keyspace helper db: remove enforce parameter from check_rf_rack_validity test: adjust test to not break rf-rack validity	2026-01-09 10:01:23 +02:00
Nadav Har'El	384e394ff0	Merge 'Add similarity functions to calculate similarity of given vectors' from Dawid Pawlik It should be possible to return the similarity of vectors in CQL statements following the [Cassandra compatible syntax](https://cassandra.apache.org/doc/latest/cassandra/getting-started/vector-search-quickstart.html#query-vector-data-with-cql): ``` SELECT comment, similarity_cosine(comment_vector, [0.1, 0.15, 0.3, 0.12, 0.05]) FROM cycling.comments_vs; ``` Although the calculations are slow, and we already have calculated results returned via Vector Store API, we need the functionality as it allows us to calculate similarity of vectors not stored in vector indexes. It will be needed for [quantization and rescoring](https://scylladb.atlassian.net/wiki/spaces/RND/pages/195985800/Quantization+and+Rescoring). The feature is also a nice-to-have in testing as requested many times by testing and CX teams. The optimized version utilizing already calculated distances from Vector Store without a need of rescoring will be coming soon after via https://github.com/scylladb/scylladb/pull/27991. --- The patch adds functions: - `similarity_cosine(<vector>, <vector>)`, - `similarity_euclidean(<vector>, <vector>)`, - `similarity_dot_product(<vector>, <vector>)` Where `<vector>` is either a column of type `VECTOR<FLOAT, N>` or a vector of floats literal. These functions can be called with every `SELECT` query, not only ANN vector queries as opposed to https://github.com/scylladb/scylladb/pull/25993. The similarity calculations are implemented inspired by [USearch's implementation]( `a2f1759910/include/usearch/index_plugins.hpp (L1304-L1385)`) and made compatible with [Cassandra's documentation](https://cassandra.apache.org/doc/5.0/cassandra/developing/cql/functions.html#vector-similarity-functions). That would guarantee the results in ScyllaDB are calculated using the exact same algorithms as used in Vector Store indexes. --- Fixes: SCYLLADB-88 Fixes: SCYLLADB-89 New feature, should land into 2026.1 Closes scylladb/scylladb#27524 * github.com:scylladb/scylladb: docs: add vector similarity functions documentation test/cqlpy: add similarity functions correctness tests test/cqlpy: add similarity functions invalid call tests cql3: introduce similarity functions syntax vector_similarity_fcts: introduce similarity functions vector_similarity_fcts: retrieve similarity function argument types vector_similarity_fcts: add calculating similarity between vectors	2026-01-05 18:28:10 +02:00
Dawid Pawlik	c0b06a7fc6	docs: add vector similarity functions documentation Add documentation in `functions.rst` as the CQL reference for a vector similarity functions. This includes the syntax, example usage, and prerequisites for the parameters.	2026-01-02 13:02:59 +01:00
Nikos Dragazis	20ff2fcc18	docs: Amend limitations for keyspace RF changes The doc about DDL statements claims that an `ALTER KEYSPACE` will fail in the presence of an ongoing global topology operation. This limitation was specifically referring to RF changes, which Scylla implements as global topology requests (`keyspace_rf_change`), and it was true when it was first introduced (`1b913dd880`) because there was no global topology request queue at that time, so only one ongoing global request was allowed in the cluster. This limitation was lifted with the introduction of the global topology request queue (`6489308ebc`), and it was re-introduced again very recently (`2e7ba1f8ce`) in a slightly different form; it now applies only to RF changes (not to any request type) and only those that affect the same keyspace. None of these two changes were ever reflected in the doc. Synchronize the doc with the current state. Fixes #27776. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#27786	2025-12-22 20:02:40 +02:00
Michael Litvak	9f8aea21e3	docs: update RF-rack restrictions Update the documentation about restrictions to tablets keyspaces related to RF-rack. * MV/SI require the keyspace to be RF-rack-valid * topology operations are restricted if a keyspace has views to preserve RF-rack-validity	2025-12-22 09:21:07 +01:00
Michael Litvak	33f7bc28da	docs: document restrictions of colocated tables Currently some things are not supported for colocated tables: it's not possible to repair a colocated table, and due to this it's also not possible to use the tombstone_gc=repair mode on a colocated table. Extend the documentation to explain what colocated tables are and document these restrictions. Fixes scylladb/scylladb#27261 Closes scylladb/scylladb#27516	2025-12-18 15:38:29 +01:00
Dawid Mędrek	d0a42852c0	cql3: Forbid using CLUSTERING ORDER BY when creating index This is a temporary solution as handling this property may require a bit more attention or at least a bit more focus. For now, let's forbid using it so it's clear it won't get applied. A simple test is provided to cover it. We document the restriction.	2025-12-16 11:43:38 +01:00
Dawid Mędrek	6541362a0b	cql3: Extend CREATE INDEX by MV properties After the previous patch that extended the grammar and provided basic functionalities to accommodate properties of materialized views in indexes, this commit takes another step and actually applies them to the underlying view when it's being created. We're providing validation tests for each property, with the single exception of CLUSTERING ORDER BY. That one will be handled separately in an upcoming commit. We also update the user documentation.	2025-12-16 11:43:38 +01:00
Botond Dénes	83f46fa7f5	doc: add video link for TTL Closes: #26210	2025-12-16 10:43:03 +02:00
Botond Dénes	9d2f7c3f52	Merge 'mv: allow setting concurrency in PRUNE MATERIALIZED VIEW' from Wojciech Mitros The PRUNE MATERALIZED VIEW statement is performed as follows: 1. Perform a range scan of the view table from the view replicas based on the ranges specified in the statement. 2. While reading the paged scan above, for each view row perform a read from all base replicas at the corresponding primary key. If a discrepancy is detected, delete the row in the view table. When reading multiple rows, this is very slow because for each view row we need to performe a single row query on multiple replicas. In this patch we add an option to speed this up by performing many of the single base row reads concurrently, at the concurrency specified in the USING CONCURRENCY clause. Aside from the unit test, I checked manually on a 3-node cluster with 10M rows, using vnodes. There were actually no ghost rows in the test, but we still had to iterate over all view rows and read the corresponding base rows. And actual ghost rows, if there are any, should be a tiny fraction of all rows. I compared concurrencies 1,2,10,100 and the results were: * Pruning with concurrency 1 took total 1416 seconds * Pruning with concurrency 2 took total 731 seconds * Pruning with concurrency 10 took total 234 seconds * Pruning with concurrency 100 took total 171 seconds So after a concurrency of 10 or so we're hitting diminishing returns (at least in this setup). At that point we may be no longer bottlenecked by the reads, but by CPU on the shard that's handling the PRUNE Fixes https://github.com/scylladb/scylladb/issues/27070 Closes scylladb/scylladb#27097 * github.com:scylladb/scylladb: mv: allow setting concurrency in PRUNE MATERIALIZED VIEW cql: add CONCURRENCY to the USING clause	2025-12-04 11:47:41 +02:00
Wojciech Mitros	323e5cd171	mv: allow setting concurrency in PRUNE MATERIALIZED VIEW The PRUNE MATERALIZED VIEW statement is performed as follows: 1. Perform a range scan of the view table from the view replicas based on the ranges specified in the statement. 2. While reading the paged scan above, for each view row perform a read from all base replicas at the corresponding primary key. If a discrepancy is detected, delete the row in the view table. When reading multiple rows, this is very slow because for each view row we need to performe a single row query on multiple replicas. In this patch we add an option to speed this up by performing many of the single base row reads concurrently, at the concurrency specified in the USING CONCURRENCY clause. Fixes https://github.com/scylladb/scylladb/issues/27070	2025-11-27 00:02:28 +01:00
tomek7667	9bbdd487b4	docs: insert.rst: Update insert example by removing 'year' column Closes scylladb/scylladb#26862	2025-11-26 06:55:28 +02:00
tomek7667	2138ab6b0e	docs: insert.rst: fix INSERT statement for NerdMovies example Closes scylladb/scylladb#26863	2025-11-26 06:53:45 +02:00
tomek7667	90a6aa8057	docs: ddl.rst: Fix formatting of null value note Closes scylladb/scylladb#26853	2025-11-26 06:52:18 +02:00
Anna Stuchlik	724dc1e582	doc: fix the info about object storage This commit fixes the information about object storage: - Object storage configuration is no longer marked as experimental. - Redundant information has been removed from the description. - Information related to object storage for SStabels has been removed as the feature is not working. Fixes https://github.com/scylladb/scylladb/issues/26985 Closes scylladb/scylladb#26987	2025-11-24 17:16:33 +03:00
Szymon Wasik	f714876eaf	Add documentation about lack of returning similarity distances This patch adds the missing warning about the lack of possibility to return the similarity distance. This will be added in the next iteration. Fixes #27086 It has to be backported to 2025.4 as this is the limitation in 2025.4. Closes scylladb/scylladb#27096	2025-11-18 13:50:36 +01:00
Piotr Dulikowski	7f482c39eb	Merge '[schema] Speculative retry rounding fix' from Dario Mirovic This patch series re-enables support for speculative retry values `0` and `100`. These values have been supported some time ago, before [schema: fix issue 21825: add validation for PERCENTILE values in speculative_retry configuration. #21879 ](https://github.com/scylladb/scylladb/pull/21879). When that PR prevented using invalid `101PERCENTILE` values, valid `100PERCENTILE` and `0PERCENTILE` value were prevented too. Reproduction steps from [[Bug]: drop schema and all tables after apply speculative_retry = '99.99PERCENTILE' #26369](https://github.com/scylladb/scylladb/issues/26369) are unable to reproduce the issue after the fix. A test is added to make sure the inclusive border values `0` and `100` are supported. Documentation is updated to give more information to the users. It now states that these border values are inclusive, and also that the precision, with automatic rounding, is 1 decimal digit. Fixes #26369 This is a bug fix. If at any time a client tries to use value >= 99.5 and < 100, the raft error will happen. Backport is needed. The code which introduced inconsistency is introduced in 2025.2, so no backporting to 2025.1. Closes scylladb/scylladb#26909 * github.com:scylladb/scylladb: test: cqlpy: add test case for non-numeric PERCENTILE value schema: speculative_retry: update exception type for sstring ops docs: cql: ddl.rst: update speculative-retry-options test: cqlpy: add test for valid speculative_retry values schema: speculative_retry: allow 0 and 100 PERCENTILE values	2025-11-13 15:27:45 +01:00
Dario Mirovic	aba4c006ba	docs: cql: ddl.rst: update speculative-retry-options Clarify how the value of `XPERCENTILE` is handled: - Values 0 and 100 are supported - The percentile value is rounded to the nearest 0.1 (1 decimal place) Refs #26369	2025-11-09 13:23:29 +01:00
Tomasz Grabiec	28f6bdc99b	cql3: ks_prop_defs: Expand numeric RF to rack list Auto-exands numeric RF in CREATE/ALTER KEYSPACE statements for new DCs specified in the statement. Doesn't auto-expand existing options, as the rack choice may not be in line with current replica placement. This requires co-locating tablet replicas, and tracking of co-location state, which is not implemented yet. Signed-off-by: Tomasz Grabiec <tgrabiec@scylladb.com>	2025-10-29 23:32:59 +01:00

1 2 3 4

185 Commits