scylladb

Author	SHA1	Message	Date
Nadav Har'El	c87d6407ed	cql3: Replace SCYLLA_ASSERT by throwing_assert In this patch we replace every single use of SCYLLA_ASSERT() in the cql3/ directory by throwing_assert(). The problem with SCYLLA_ASSERT() is that when it fails, it crashes Scylla. This is almost always a bad idea (see #7871 discussing why), but it's even riskier in front-end code like cql3/: In front-end code, there is a risk that due to a bug in our code, a specific user request can cause Scylla to crash. A malicious user can send this query to all nodes and crash the entire cluster. When the user is not malicious, it causes a small problem (a failing request) to become a much worse crash - and worse, the user has no idea which request is causing this crash and the crash will repeat if the same request is tried again. All of this is solved by using the new throwing_assert(), which is the same as SCYLLA_ASSERT() but throws an exception (using on_internal_error()) instead of crashing. The exception will prevent the code path with the invalid assumption from continuing, but will result in only the current user request being aborted, with a clear error message reporting the internal server error due to an assertion failure. I reviewed all the changes that I did in this patch to check that (to the best of my understanding) none of the assertions in cql3/ involve the sort of serious corruption that might require crashing the Scylla node entirely. throwing_assert() also improves logging of assertion failures compared to the original SCYLLA_ASSERT() - SCYLLA_ASSERT() printed a message to stderr which in many installations is lost, whereas throwing_assert() uses Scylla's standard logger, and also includes a backtrace in the log message. Fixes #13970 (Exorcise assertions from CQL code paths) Refs #7871 (Exorcise assertions from Scylla) Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-03-11 09:41:20 +02:00
Szymon Malewski	f9d213547f	cql3: selection: fix `add_column_for_post_processing` for ORDER BY The purpose of `add_column_for_post_processing` is to add columns that are required for processing of a query, but are not part of SELECT clause and shouldn't be returned. They are added to the final result set, but later are not serialized. Mainly it is used for filtering and grouping columns, with a special case of `WHERE primary_key IN ... ORDER BY ...` when the whole result set needs additional final sorting, and ordering columns must be added as well. There was a bug that manifested in #9435, #8100 and was actually identified in #22061. In case of selection with processing (e.g functions involved), result set row is formed in two stages. Initially it is a list of columns fetched from replicas - on which filtering and grouping is performed. After that the actual selection is resolved and the final number of columns can change. Ordering is performed on this final shape, but the ordering column index returned by `add_column_for_post_processing` refereed to initial shape. If selection refereed to the same column twice (e.g. `v, TTL(v)` as in #9435) final row was longer than initial and ordering refereed to incorrect column. If a function in selection refereed to multiple columns (e.g. as_json(.., ..) which #8100 effectively uses) the final row was shorter and ordering tried to use a non-existing column. This patch fixes the problem by making sure that column index of the final result set is used for ordering. The previously crashing test `cassandra_tests/validation/entities/json_test.py::testJsonOrdering` doesn't have to be skipped, but now it is failing on issue #28467. Fixes #9435 Fixes #8100 Fixes #22061 Closes scylladb/scylladb#28472	2026-03-05 19:22:34 +02:00
Karol Nowacki	30487e8854	index: fix vector index with filtering target column The secondary index mechanism is currently used to determine the target column. This mechanism works incorrectly for vector indexes with filtering because it returns the last specified column as the target (vectors) column. However, the syntax for a vector index requires the first column to be the target: ``` CREATE CUSTOM INDEX ON t(vectors, users) USING 'vector_index'; ``` This discrepancy eventually leads to the following exception when performing an ANN search on a vector index with filtering columns: ```` ANN ordering by vector requires the column to be indexed using 'vector_index' ```` This commit fixes the issue by introducing dedicated logic for vector indexes to correctly identify the target(vectors) column. Fixes: SCYLLADB-635 Closes scylladb/scylladb#28740	2026-03-02 18:47:58 +02:00
Marcin Maliszkiewicz	e18b519692	cql3: remove find_schema call from select check_access Schema is already a member of select statement, avoiding the call saves around 400 cpu instructions on a select request hot path. Closes scylladb/scylladb#28328	2026-01-30 11:49:09 +01:00
Piotr Dulikowski	3ec4f67407	Merge 'vector_index: Implement rescoring' from Szymon Malewski This series implements rescoring algorithm. Index options allowing to enable this functionality were introduced in earlier PR https://github.com/scylladb/scylladb/pull/28165. When Vector Index has enabled quantization, Vector Store uses reduced vector representation to save memory, but it may degrade correctness of ANN queries. For quantized index we can enable rescoring algorithm, which recalculates similarity score from full vector representation stored in Scylla and reorder returned result set. It works also with oversampling - we fetch more candidates from Vector Store, rescore them at Scylla and return only requested number of results. Example: Creating a Vector Index with Rescoring ```sql -- Create a table with a vector column CREATE TABLE ks.products ( id int PRIMARY KEY, embedding vector<float, 128> ); -- Create a vector index with rescoring enabled CREATE INDEX products_embedding_idx ON ks.products (embedding) USING 'vector_index' WITH OPTIONS = { 'similarity_function': 'cosine', 'quantization': 'i8', 'oversampling': '2.0', 'rescoring': 'true' }; ``` 1. Quantization (`i8`) compresses vectors in the index, reducing memory usage but introducing precision loss in distance calculations 2. Oversampling (`2.0`) retrieves 2× more candidates than requested from the vector store (e.g., `LIMIT 10` fetches 20 candidates) 3. Rescoring (`true`) recalculates similarity scores using full-precision (`f32`) vectors from the base table and re-ranks results Query example: ```sql -- Find 10 most similar products SELECT id, similarity_cosine(embedding, [0.1, 0.2, ...]) AS score FROM ks.products ORDER BY embedding ANN OF [0.1, 0.2, ...] LIMIT 10; ``` With rescoring enabled, the query: 1. Fetches 20 candidates from the quantized index (due to oversampling=2.0) 2. Reads full-precision embeddings from the base table 3. Recalculates similarity scores with full precision 4. Re-ranks and returns the top 10 results In this implementation we use CQL similarity function implementation to calculate new score values and use them in post query ordering. We add that column manually to selection, but it has to be removed from the final response. Follow-up https://github.com/scylladb/scylladb/pull/28165 Fixes https://scylladb.atlassian.net/browse/SCYLLADB-83 New feature - doesn't need backport. Closes scylladb/scylladb#27769 * github.com:scylladb/scylladb: vector_index: rescoring: Fetch oversampled rows vector_index: rescoring: Sort by similarity column select_statement: Modify `needs_post_query_ordering` condition vector_index: rescoring: Add hidden similarity score column vector_index: Refactor extracting ANN query information	2026-01-23 15:20:10 +01:00
Patryk Jędrzejczak	4e984139b2	Merge 'strongly consistent tables: basic implementation' from Petr Gusev In this PR we add a basic implementation of the strongly-consistent tables: * generate raft group id when a strongly-consistent table is created * persist it into system.tables table * start raft groups on replicas when a strongly-consistent tablet_map reaches them * add strongly-consistent version of the storage_proxy, with the `query` and `mutate` methods * the `mutate` method submits a command to the tablets raft group, the query method reads the data with `raft.read_barrier()` * strongly-consistent versions of the `select_statement` and `modification_statement` are added * a basic `test_strong_consistency.py/test_basic_write_read` is added which to check that we can write and read data in a strongly consistent fashion. Limitations: * for now the strongly consistent tables can have tablets only on shard zero. This is because we (ab/re) use the existing raft system tables which live only on shard0. In the next PRs we'll create separate tables for the new tablets raft groups. * No Scylla-side proxying - the test has to figure out who is the leader and submit the command to the right node. This will be fixed separately. * No tablet balancing -- migration/split/merges require separate complicated code. The new behavior is hidden behind `STRONGLY_CONSISTENT_TABLES` feature, which is enabled when the `STRONGLY_CONSISTENT_TABLES` experimental feature flag is set. Requirements, specs and general overview of the feature can be found [here](https://scylladb.atlassian.net/wiki/spaces/RND/pages/91422722/Strong+Consistency). Short term implementation plan is [here](https://docs.google.com/document/d/1afKeeHaCkKxER7IThHkaAQlh2JWpbqhFLIQ3CzmiXhI/edit?tab=t.0#heading=h.thkorgfek290) One can check the strongly consistent writes and reads locally via cqlsh: scylla.yaml: ``` experimental_features: - strongly-consistent-tables ``` cqlsh: ``` CREATE KEYSPACE IF NOT EXISTS my_ks WITH replication = {'class': 'NetworkTopologyStrategy', 'replication_factor': 1} AND tablets = {'initial': 1} AND consistency = 'local'; CREATE TABLE my_ks.test (pk int PRIMARY KEY, c int); INSERT INTO my_ks.test (pk, c) VALUES (10, 20); SELECT * FROM my_ks.test WHERE pk = 10; ``` Fixes SCYLLADB-34 Fixes SCYLLADB-32 Fixes SCYLLADB-31 Fixes SCYLLADB-33 Fixes SCYLLADB-56 backport: no need Closes scylladb/scylladb#27614 * https://github.com/scylladb/scylladb: test_encryption: capture stderr test/cluster: add test_strong_consistency.py raft_group_registry: disable metrics for non-0 groups strong consistency: implement select_statement::do_execute() cql: add select_statement.cc strong consistency: implement coordinator::query() cql: add modification_statement cql: add statement_helpers strong consistency: implement coordinator::mutate() raft.hh: make server::wait_for_leader() public strong_consistency: add coordinator modification_statement: make get_timeout public strong_consistency: add groups_manager strong_consistency: add state_machine and raft_command table: add get_max_timestamp_for_tablet tablets: generate raft group_id-s for new table tablet_replication_strategy: add consistency field tablets: add raft_group_id modification_statement: remove virtual where it's not needed modification_statement: inline prepare_statement() system_keyspace: disable tablet_balancing for strongly_consistent_tables cql: rename strongly_consistent statements to broadcast statements	2026-01-23 09:52:33 +01:00
Szymon Malewski	29d090845a	vector_index: rescoring: Fetch oversampled rows So far with oversampling the extended set of keys was returned from VS, but query to the base table was still limited by the query `limit`. Now for rescoring we want to fetch rows for all the keys returned from VS. However later we need to restore the command limit, to trim result_set accordingly. For non-rescoring scenarios we trim directly keys set returned from VS if it happens to exceed query limit. With this change rescoring validation tests (except `no_nulls_in_rescored_results`) pass fully. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-83	2026-01-22 15:38:44 +01:00
Szymon Malewski	0bc95bcf87	vector_index: rescoring: Sort by similarity column This patch implements second part of rescoring - ordering results by similarity column added in earlier patch. For this purpose in this patch we define `_ordering_comparator`, which enables pre-existing post-query ordering functionality. However, none additional test passes yet, as they include ovesampling, which will be the subject of following patches.	2026-01-22 15:38:44 +01:00
Szymon Malewski	57e7a4fa4f	select_statement: Modify `needs_post_query_ordering` condition Our plan for rescoring is to use the existing post-query ordering mechanism to sort (and trim) result_set by similarity column. For general SELECT case this ordering is permitted only for queries with IN on the partition key and an ORDER BY, which is checked in `needs_post_query_ordering`. Recently this check was overriden for ANN queries in https://github.com/scylladb/scylladb/pull/28109 to enable IN queries handled by VS without excessive post-processing. In this patch we revert that change - ANN case will be handled by general check. However we change the condition - we will enable post processing anytime `_ordering_comparator` is set. In current implementation `_ordering_comparator` is created only in `select_statement::prepare` with `get_ordering_comparator`, only for the same conditions as were checked in `needs_post_query_ordering`, so this change should be transparent for general SELECT. For ANN query it is also not set (yet), so it will not influence ANN filtering, but we confirm that this functionality still works by adding filtering test: `test/vector_search/filter_test.cc::vector_store_client_test_filtering_ann_cql`. Rescoring ordering for ANN queries will be enabled when we add `_ordering_comparator` in following patch.	2026-01-22 15:38:44 +01:00
Szymon Malewski	c89957b725	vector_index: rescoring: Add hidden similarity score column Rescoring consist of recalculating similarity score and reordering results based on it. In this patch we add calculation of similarity score as a hidden (non-serialized) column and following patch will add reordering. Normal ordering uses `add_column_for_post_processing`, however this works only for regular columns, not function. So we create it together with user requested columns (this also forces the use of `selection_with_processing`) and hide the column later. This also requires special handling for 'SELECT *' case - we need to manually add all columns before adding similarity column. In case user already asks for similarity score in the SELECT clause, this value will be calculated twice - is should be optimized in future patches.	2026-01-22 15:38:40 +01:00
Szymon Malewski	e0cc6ca7e6	vector_index: Refactor extracting ANN query information For the purpose of rescoring we will need information if the query is an ANN query and the access to index option earlier in the `select_statement::prepare` than it happened before. This patch refactors extracting this information to new helper structure `ann_ordering_info` and is consistently using it.	2026-01-22 10:00:47 +01:00
Petr Gusev	a5d611866e	cql: add select_statement.cc	2026-01-21 14:56:01 +01:00
Petr Gusev	6b0d757f28	cql: rename strongly_consistent statements to broadcast statements In preparation for upcoming work on strongly consistent queries in Scylla, this commit renames the existing `strongly_consistent` statements to `broadcast_statements` to avoid confusion. The old code paths are kept temporarily, as they may be useful for reference or for copying parts during the implementation of the new strongly consistent statements.	2026-01-21 14:56:00 +01:00
Dawid Pawlik	e62cb29b7d	vector_search: cache restrictions JSON at prepare time Add `prepared_filter` class which handles the preparation, construction and caching of Vector Search filtering compatible JSON object. If no bind markers found in SELECT statement, the JSON object will be built once at prepare time and cached for use during execution calls. Adjust tests accordingly to use prepared filters. Follow-up: #28109 Fixes: SCYLLADB-299	2026-01-20 17:15:52 +01:00
Dawid Pawlik	f54a4010c0	refactor: vector_search: move filter logic to vector_search namespace Move Vector Search filter functions from `cql3::restrictions` to `vector_search` namespace as it's a better place according to it's purpose. The effective name has now changed from `cql3::restrictions::to_json` to `vector_search::to_json` which clearly mentions that the JSON object will be used for Vector Search. Rename the auxilary functions to use `to_json` suffix instead of variety of verbs as those functions logic focuses on building JSON object from different structures. The late naming emphasized too much distinction between those functions, while they do pretty much the same thing. Follow-up: #28109	2026-01-20 13:13:43 +01:00
Szymon Malewski	b8e91ee6ae	vector_index: introduce `quantization` and `oversampling` options This patch adds vector index options allowing to enable quantization and oversampling. Specific quantization value will be used internally by vector store. In the current implementation, `get_oversampling` allows us to decide how many times more candidates to retrieve from vector store - final response is still trimmed to the given limit. It is a first step to allow rescoring - recalculation of similarity metric and re-ranking. Without rescoring oversampling will be also further optimized to happen internally in vector store. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-82 Ref https://scylladb.atlassian.net/browse/SCYLLADB-83	2026-01-19 10:21:43 +01:00
Dawid Pawlik	2a38794b8e	vector_search: cql: construct and use filter in ANN vector queries Add `filter` option in `ann()` function to write the filter JSON object as the POST request in ANN vector queries. Adjust existing `vector_store_client_test` tests accordingly.	2026-01-16 11:18:23 +01:00
Michał Hudobski	92c988514c	vector_search: allow all where clauses in vector search queries To prepare for implementation of filtering we skip validation of where clauses in vector search queries. All queries that would be blocked by the lack of ALLOW FILTERING now will pass through. Fixes: VECTOR-410 Closes scylladb/scylladb#27758	2026-01-11 12:56:44 +02:00
Michał Hudobski	e2e479f20d	auth: fix cdc vector search indexing permission bug VECTOR_SEARCH_INDEXING permission didn't work on cdc tables as we mistakenly checked for vector indexes on the cdc table insted of the base. This patch fixes that and adds a test that validates this behavior. Fixes: VECTOR-476 Closes scylladb/scylladb#28050	2026-01-08 21:55:19 +02:00
Michał Hudobski	12483d8c3c	vector_search: throw an error when we restrict primary in vector search We currently allow restrictions on single column primary key, but we ignore the restriction and return all results. This can confuse the users. We change it so such a restriction will throw an error and add a test to validate it. Fixes: VECTOR-331 Closes scylladb/scylladb#27143	2025-12-15 09:45:56 +02:00
Karol Nowacki	086c6992f5	vector_search: Fix ANN query abort on CQL timeout When a CQL vector search request timed out, the underlying ANN query was not aborted and continued to run. This happened because the abort source was not being signaled upon request expiration. This commit ensures the ANN query is aborted when the CQL request times out preventing unnecessary resource consumption.	2025-12-02 01:17:01 +01:00
Michał Hudobski	7646dde25b	select_statement: add a warning about unsupported paging for vs queries Currently we do not support paging for vector search queries. When we get such a query with paging enabled we ignore the paging and return the entire result. This behavior can be confusing for users, as there is no warning about paging not working with vector search. This patch fixes that by adding a warning to the result of ANN queries with paging enabled. Closes scylladb/scylladb#26384	2025-11-13 18:47:05 +02:00
Karol Nowacki	9f1fd7f5a0	cql3: Rename indexed_table_select_statement To align with `vector_indexed_table_select_statement`, this commit renames `indexed_table_select_statement` to `view_indexed_table_select_statement` to clarify its usage with materialized views.	2025-10-29 08:37:25 +01:00
Karol Nowacki	357c0a8218	cql3: Move vector search select to dedicated class The execution of SELECT statements with ANN ordering (vector search) was previously implemented within `indexed_table_select_statement`. This was not ideal, as vector search logic is independent of secondary index selects. This resulted in unnecessary complexity because vector search queries don't use features like aggregates or paging. More importantly, `indexed_table_select_statement` assumed a non-null `view_schema` pointer, which doesn't hold for vector indexes (where `view_ptr` is null). This caused null pointer dereferences during ANN ordered selects, leading to crashes (VECTOR-179). Other parts of the class still dereference `view_schema` without null checks. Moving the vector search select logic out of `indexed_table_select_statement` simplifies the code and prevents these null pointer dereferences.	2025-10-29 08:37:21 +01:00
Michał Hudobski	541b52cdbf	cql: fail with a better error when null vector is passed to ann query Currently when a null vector is passed to an ANN query we fail with a quite confusing error ("NoHostAvailable: ('Unable to complete the operation against any hosts', {<Host: 127.0.0.1:9042 datacenter1>: <Error from server: code=0000 [Server error] message="to_bytes() called on raw value that is null">})"). This patch fixes that by throwing an InvalidRequestException with an appropriate message instead. We also add a test case that validates this behavior. Fixes: VECTOR-257 Closes scylladb/scylladb#26510	2025-10-27 16:09:08 +02:00
Michał Hudobski	5c957e83cb	vector_search: remove dependence on cql3 This patch removes the dependence of vector search module on the cql3 module by moving the contents of cql3/type_json.hh to types/json_utils.hh and removing the usage of cql3 primary_key object in vector_store_client. We also make the needed adjustments to files that were previously using the afformentioned type_json.hh file. This fixes the circular dependency cql3 <-> vector_search. Closes scylladb/scylladb#26482	2025-10-21 17:41:55 +03:00
Nadav Har'El	eb06ace944	Merge 'auth: implement vector store authorization' from Michał Hudobski This patch implements the changes required by the Vector Store authorization, as described in https://scylladb.atlassian.net/wiki/spaces/RND/pages/107085899/Vector+Store+Authentication+And+Authorization+To+ScyllaDB, that is: - adding a new permission VECTOR_SEARCH_INDEXING, grantable only on ALL KEYSPACES - allowing users with that permission to perform SELECT queries, but only on tables with a vector index - increasing the number of scheduling groups by one to allow users to create a service level for a vector store user - adjusting the tests and documentation These changes are needed, as the vector indexes are managed by the external service, Vector Store, which needs to read the tables to create the indexes in its memory. We would like to limit the privileges of that service to a minimum to maintain the principle of least privilege, therefore a new permission, one that allows the SELECTs conditional on the existence of a vector_index on the table. Fixes: VECTOR-201 Backport reasoning: Backport to 2025.4 required as this can make upgrading clusters more difficult if we add it in 2026.1. As for now Scylla Cloud requires version 2025.4 to enable vector search and permission is set by orchestrator so there is no chance that someone will try to add this permission during upgrade. In 2026.1 it will be more difficult. Closes scylladb/scylladb#25976 * github.com:scylladb/scylladb: docs: adjust docs for VS auth changes test: add tests for VECTOR_SEARCH_INDEXING permission cql: allow VECTOR_SEARCH_INDEXING users to select auth: add possibilty to check for any permission in set auth: add a new permission VECTOR_SEARCH_INDEXING	2025-10-20 17:32:00 +03:00
Nadav Har'El	921d07a26b	cql: make SELECT's "internal page size" configurable In some uses of SELECT, such as aggregation (sum() et al.), GROUP BY or secondary index, it needs to perform internal scans. It uses an "internal page size" which before this patch was always DEFAULT_COUNT_PAGE_SIZE = 10000. There was an ad-hoc and undocumented way to override this default in C++ tests, using functions in test/lib/select_statement_utils.hh, but it was so non-obvious that the test that most needed to override this default - the very slow test test_indexing_paging_and_aggregation which would have been must faster with a lower setting - never used it. So in this patch we replace the ad-hoc configuration functions by a bona-fide Scylla configuration option named "select_internal_page_size". The few C++ tests that used the old configuration functions were modified to use the new configuration parameters. The slow test test_indexing_paging_and_aggregation still doesn't use the new configuration to become faster - we'll do this in the next patch. Another benefit of having this "internal page size" as a configuration option is that one day a user might realize that the default choice 10,000 is bad for some reason (which I can't envision right now), so having it configurable might come it handy. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-10-15 18:42:09 +03:00
Michał Hudobski	6a69bd770a	cql: allow VECTOR_SEARCH_INDEXING users to select This patch allows users with the VECTOR_SEARCH_INDEXING permission to perform SELECT queries on tables that have a vector index. This is needed for the Vector Store service, which reads the vector-indexed tables, but does not require the full SELECT permission.	2025-10-03 16:55:57 +02:00
Szymon Wasik	ccfe80ab97	cql3: Update error messages to be in line with documentation. ANN (aproximate nearest neighborhood) is just the name of the type of algorithm used to perform vector search. For this reason the error messages should refer to vector queries rather than ANN queries.	2025-09-26 17:01:10 +02:00
Karol Nowacki	eae71d3e91	vector_store_client: Move to vector_search module Vector search related implementation moved to a new module vector_search. As the vector search functionality is going to be extended, it is better to keep it in a separate module.	2025-09-22 08:01:47 +02:00
Michał Hudobski	1690e5265a	vector search: correct column name formatting This patch corrects the column name formatting whenever an "Undefined column name" exception is thrown. Until now we used the `name()` function which returns a bytes object. This resulted in a message with a garbled ascii bytes column name instead of a proper string. We switch to the `text()` function that returns a sstring instead, making the message readable. Tests are adjusted to confirm this behavior. Fixes: VECTOR-228 Closes scylladb/scylladb#26120	2025-09-20 07:02:53 +02:00
Nadav Har'El	e322902506	Merge 'index, metrics: add per-index metrics' from Michał Hudobski This patch adds the possibility to track metrics per secondary index. Currently, only a histogram of query latencies is tracked, but more metrics can be added in the future. To add a new metric, it needs to be added to the index_metrics struct in index/secondary_index_manager.hh and then initialized in index/secondary_index_manager.cc in the constructor of the index_metrics struct. The metrics are created when the index is created and removed when the index is dropped. First lines of the new metric: \# HELP scylla_index_query_latencies Index query latencies \# TYPE scylla_index_query_latencies histogram scylla_index_query_latencies_sum{idx="test_i_idx",ks="test"} 640 scylla_index_query_latencies_count{idx="test_i_idx",ks="test"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="640.000000"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="768.000000"} 1 Fixes: https://github.com/scylladb/scylladb/issues/25970 Closes scylladb/scylladb#25995 * github.com:scylladb/scylladb: test: verify that the index metric is added index, metrics: add per-index metrics	2025-09-17 14:54:12 +03:00
Ernest Zaslavsky	d624413ddd	treewide: Move query related files to a new `query` directory As requested in #22120, moved the files and fixed other includes and build system. Moved files: - query.cc - query-request.hh - query-result.hh - query-result-reader.hh - query-result-set.cc - query-result-set.hh - query-result-writer.hh - query_id.hh - query_result_merger.hh Fixes: #22120 This is a cleanup, no need to backport Closes scylladb/scylladb#25105	2025-09-16 23:40:47 +03:00
Michał Hudobski	b09d1f0a98	index, metrics: add per-index metrics This patch adds the possibility to track metrics per secondary index. Currently, only a histogram of query latencies is tracked, but more metrics can be added in the future. To add a new metric, it needs to be added to the index_metrics struct in index/secondary_index_manager.hh and then initialized in index/secondary_index_manager.cc in the constructor of the index_metrics struct. The metrics are created when the index is created and removed when the index is dropped. First lines of the new metric: \# HELP scylla_index_query_latencies Index query latencies \# TYPE scylla_index_query_latencies histogram scylla_index_query_latencies_sum{idx="test_i_idx",ks="test"} 640 scylla_index_query_latencies_count{idx="test_i_idx",ks="test"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="640.000000"} 1 scylla_index_query_latencies_bucket{idx="test_i_idx",ks="test",le="768.000000"} 1	2025-09-16 14:03:43 +02:00
Nadav Har'El	5307d1b9a8	Merge 'vector_index: add version to index options' from Dawid Pawlik Since creating the vector index does not lead to creation of a view table [#24438] (whose version info had been logged in `system_schema.scylla_tables`) we lacked the information about the version of the index. The solution we arrived at is to add the version as a field in options column of `system_schema.indexes`. It requires few changes and seems unintruitive for existing infrastructure. This patch implements the solution described above. Refs: VECTOR-142 Closes scylladb/scylladb#25614 * github.com:scylladb/scylladb: cqlpy/test_vector_index: add vector index version test vector_index, index_prop_defs: add version to index options create_index_statement: rename `validator` to `custom_index_factory` custom index: rename `custom_index_option_name` vector_index: rename `supported_options` to `vector_index_options`	2025-09-14 15:35:53 +03:00
Dawid Pawlik	be54346846	select_statement: check for access to CDC base table Before the patch, user with CREATE access could create a table with CDC or alter the table enabling CDC, but could not query a SELECT on the CDC table they created. It was due to the fact, the SELECT permission was checked on the CDC log, and later it's "parent" - the keyspace, but not thebase table, on which the user had SELECT permission automatically granted on CREATE. This patch matches the behaviour of querying the CDC log to the one implemented for Materialized Views: 1. No new permissions are granted on CREATE. 2. When querying SELECT, the permissions on base table SELECT are checked. Fixes: #19798	2025-09-03 13:20:39 +02:00
Karol Nowacki	3086d15999	cql3: Fix crash on ANN OF query when TRACING ON is enabled Executing a vector search (SELECT with ANN OF ordering) query with `TRACING ON` enabled caused a node to crash due to a null pointer dereference. This occurred because a vector index does not have an associated view table, making its `_view_schema` member null. The implementation attempted to enable tracing on this null view schema, leading to the crash. The fix adds a null check for `_view_schema` before attempting to enable tracing on the view (index) table. A regression test is included to prevent this from happening again. Fixes: VECTOR-179 Closes scylladb/scylladb#25500	2025-09-01 17:26:54 +03:00
Dawid Pawlik	873d7dba5c	custom index: rename `custom_index_option_name` Renamed `custom_index_option_name` to `custom_class_option_name` as the late was a bit misleading since we refactored our model of custom indexes to be index class reliant.	2025-08-29 10:49:15 +02:00
Nadav Har'El	0a990d2a48	config: split tri_mode_restriction to a separate header Today, any source file or header file that wants to use the tri_mode_restriction type needs to include db/config.hh, which is a large and frequently-changing header file. In this patch we split this type into a separate header file, db/tri_mode_restriction.hh, and avoid a few unnecessary inclusions of db/config.hh. However, a few source files now need to explicitly include db/config.hh, after its transitive inclusion is gone. Note that the overwhelmingly common inclusion of db/config.hh continues to be a problem after this patch - 128 source files include it directly. So this patch is just the first step in long journey. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#25692	2025-08-27 13:47:04 +03:00
Avi Kivity	8164f72f6e	Merge 'Separate local_effective_replication_map from vnode_effective_replication_map' from Benny Halevy Derive both vnode_effective_replication_map and local_effective_replication_map from static_effective_replication_map as both are static and per-keyspace. However, local_effective_replication_map does not need vnodes for the mapping of all tokens to the local node. Refs #22733 * No backport required Closes scylladb/scylladb#25222 * github.com:scylladb/scylladb: locator: abstract_replication_strategy: implement local_replication_strategy locator: vnode_effective_replication_map: convert clone_data_gently to clone_gently locator: abstract_replication_map: rename make_effective_replication_map locator: abstract_replication_map: rename calculate_effective_replication_map replica: database: keyspace: rename {create,update}_effective_replication_map locator: effective_replication_map_factory: rename create_effective_replication_map locator: abstract_replication_strategy: rename vnode_effective_replication_map_ptr et. al locator: abstract_replication_strategy: rename global_vnode_effective_replication_map keyspace: rename get_vnode_effective_replication_map dht: range_streamer: use naked e_r_m pointers storage_service: use naked e_r_m pointers alternator: ttl: use naked e_r_m pointers locator: abstract_replication_strategy: define is_local	2025-08-07 12:51:43 +03:00
Benny Halevy	ec85678de1	locator: abstract_replication_strategy: define is_local Prefer for specializing the local replication strategy, local effective replication map, et. al byt defining an is_local() predicate, similar to uses_tablets(). Note that is_vnode_based() still applies to local replication strategy. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-08-06 13:34:23 +03:00
Jan Łakomy	447c66f4ec	cql3/statements: implement external `ANN OF` queries Implement execution of `ANN OF` queries using the vector_store service. Throw invalid_request_exception with specific message using the ann_error_visitor when ANN request returns no result. Co-authored-by: Dawid Pawlik <dawid.pawlik@scylladb.com> Co-authored-by: Michał Hudobski <michal.hudobski@scylladb.com>	2025-08-05 12:34:48 +02:00
Jan Łakomy	5fecad0ec8	cql3/statements: add `ANN OF` queries support to select statements Add parsing of `ANN OF` queries to the `select_statement` and `indexed_table_select_statement` classes. Add a placeholder for the implementation of external ANN queries. Rename `should_create_view` to `view_should_exist` as it is used not only to check if the view should be created but also if the view has been created. Co-authored-by: Dawid Pawlik <dawid.pawlik@scylladb.com>	2025-08-01 12:08:50 +02:00
Petr Gusev	ff1caa9798	query_options: add node_local_only mode We want to access the paxos state table only on the local node and shard (or shards in case of intranode_migration). In this commit we add a node_local_only flag to query_options, which allows to do that. This flag can be set for a query via make_internal_options. We handle this flag on the statements layer by forwarding it to either coordinator_query_options or coordinator_mutate_options.	2025-07-24 19:48:08 +02:00
Pawel Pery	5bfce5290e	cql3: refactor primary_key as a top-level class This patch is a part of vector_store_client sharded service implementation for a communication with vector-store service. There is a need for forward declaration of primary_key class. This patch moves a nested definition of select_statement::primary_key (from a cql3::statements namespace) into a standalone class in a cql3::statements namespace. Reference: VS-47	2025-07-09 11:54:51 +02:00
Petr Gusev	3d262d2be8	LWT: create cas_shard in select_statement In this commit we create cas_shard in select_statement and pass it to the sp::query_result function.	2025-06-30 10:37:33 +02:00
Avi Kivity	f195c05b0d	untyped_result_set: mark get_blob() as returning unfragmented data Blobs can be large, and unfragmented blobs can easily exceed 128k (as seen in #23903). Rename get_blob() to get_blob_unfragmented() to warn users. Note that most uses are fine as the blobs are really short strings. Closes scylladb/scylladb#24102	2025-05-26 09:40:34 +02:00
Marcin Maliszkiewicz	e3f2ebd4fb	cql3: remove not needed cmd copy in indexed_table_select_statement It's not used variable. There should be a tiny perf increase as it saves allocation. Closes scylladb/scylladb#23473	2025-03-31 09:34:32 +03:00
Botond Dénes	39bcf99f8e	Merge 'Apply hard limit to partition range vectors in secondary index queries' from Nikos Dragazis Secondary index queries fetch partition keys from the index view and store them in an `std::vector`. The vector size is currently limited by the user's page size and the page memory limit (1MiB). These are not enough to prevent large contiguous allocations (which can lead to stalls). This series introduces a hard limit to the vector size to ensure it does not exceed the allocator's preferred max contiguous allocation size (128KiB). With the size of each element being 120 bytes, this allows for 1092 partition keys. The limit was set to 1000. Any partitions above this limit are discarded. Discarding partitions breaks the querier cache on the replicas, causing a performance regression, as can be seen from the following measurements: ``` * Cluster: 3 nodes (local Docker containers), 1 vCPU, 4GB memory, dev mode * Schema: CREATE KEYSPACE ks WITH replication = {'class': 'org.apache.cassandra.locator.NetworkTopologyStrategy', 'datacenter1': '3'} AND durable_writes = true AND tablets = {'enabled': false}; CREATE TABLE ks.t1 (pk1 int, pk2 int, ck int, value int, PRIMARY KEY ((pk1, pk2), ck)); CREATE INDEX t1_pk2_idx ON ks.t1(pk2); * Query: CONSISTENCY LOCAL_QUORUM; SELECT * FROM ks.t1 where pk2 = 1; +------------+-------------------+-------------------+ \| Page Size \| Master \| Vector Limit \| +============+===================+===================+ \| \| Latency (sec) \| Latency (sec) \| +------------+-------------------+-------------------+ \| 100 \| 5.80 ± 0.13 \| 5.64 ± 0.10 \| +------------+-------------------+-------------------+ \| 1000 \| 4.77 ± 0.07 \| 4.62 ± 0.06 \| +------------+-------------------+-------------------+ \| 2000 \| 4.67 ± 0.07 \| 5.13 ± 0.03 \| +------------+-------------------+-------------------+ \| 5000 \| 4.82 ± 0.09 \| 6.25 ± 0.06 \| +------------+-------------------+-------------------+ \| 10000 \| 4.89 ± 0.36 \| 7.52 ± 0.13 \| +------------+-------------------+-------------------+ \| -1 \| 4.90 ± 0.67 \| 4.79 ± 0.33 \| +------------+-------------------+-------------------+ ``` We expect this to be fixed with adaptive paging in a future PR. Until then, users can avoid regressions by adjusting their page size. Additionally, this series changes the `untyped_result_set` to store rows in a `chunked_vector` instead of an `std::vector`, similarly to the `result_set`. Secondary index queries use an `untyped_result_set` to store the raw result from the index view before processing. With 1MiB results, the `std::vector` would cause a large allocation of this magnitude. Finally, a unit test is added to reproduce the bug. Fixes #18536. The PR fixes stalls of up to 100ms, but there is an easy workaround: adjust the page size. No need to backport. Closes scylladb/scylladb#22682 * github.com:scylladb/scylladb: cql3: secondary index: Limit page size for single-row partitions cql3: secondary index: Limit the size of partition range vectors cql3: untyped_result_set: Store rows in chunked_vector test: Reproduce bug with large allocations from secondary index	2025-03-14 15:06:07 +02:00

1 2 3 4 5 ...

550 Commits