scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 02:20:37 +00:00

Author	SHA1	Message	Date
Nadav Har'El	fe5a5a813f	alternator, vector: add validation of non-finite numbers in Query Non-finite numbers (Inf, NaN) don't make sense in vector search, and also not allowed in the DynamoDB API as numbers. But the parsing code in Query's QueryVector accepted "Inf" and "NaN" and then failed to send the request to the vector store, resulting in a strange error message. Let's fix it in the parsing code. We have a test (test_query_vectorsearch_queryvector_bad_number_string) that verifies this fix. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-16 14:30:17 +03:00
Nadav Har'El	aa070fae5b	alternator: Query: improve error message when VectorSearch is missing Before this patch, if we attempt a Query with IndexName is a vector index but forget a "VectorSearch" parameter, the error is misleading: The code expects a GSI or LSI, and when it can't find a GSI or LSI with that name, it reports that the index is missing. But this is not helpful. So in this patch we produce a more helpful message: That the index does exist, and is a vector index, so a "VectorSearch" parameter is mandatory and is missing.	2026-04-16 14:30:16 +03:00
Nadav Har'El	f932f94422	alternator: add per-table metrics for vector query The per-table metrics for Query were not incremented for the vector variant of the Query operations, only the global metrics were incremented. This patch fixes this oversight, and add a test that reproduces it (the new test fails before this patch, and passes after).	2026-04-16 14:30:16 +03:00
Nadav Har'El	8cf510e06c	alternator: clean up duplicated code De-duplicate some code introduced in earlier patches, such a two nearly-identical loops over the indexes (one to check if there is a vector index, the second to get its dimensions), and two nearly- identical chunks of code to get the item contents when there is or there isn't a clustering key. There should be no functional changes in this patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-16 14:30:16 +03:00
Nadav Har'El	f15c6634a7	alternator: fix default Select of Query In earlier patches, when Query'ing a vector index, we set the default Select to ALL_ATTRIBUTES. However, according to the DynamoDB documentation for Query, "If neither Select nor ProjectionExpression are specified, DynamoDB defaults to ALL_ATTRIBUTES when accessing a table, and ALL_PROJECTED_ATTRIBUTES when accessing an index." This default should also apply to vector index, so this patch fixes this. The new behavior is not only more compatible with DynamoDB, it is also much more efficient by default, as ALL_PROJECTED_ATTRIBUTES does not need to read from the base table - it returns the results that the vector store returned. Of course, if the user needs the more efficient ALL_ATTRIBUTES this option is still available - it's just no longer the default. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-16 14:30:16 +03:00
Nadav Har'El	2e274bbdba	alternator: split executor.cc even more This patch continues the effort to split the huge executor.cc (5000 lines before this patch) even more. In this patch we introduce a new source file, executor_util.cc, for various utility functions that are used for many different operations and therefore are useful to have in a header file. These utility functions will now be in executor_util.cc and executor_util.hh - instead of executor.cc and executor.hh. Various source files, including executor.cc, the executor_read.cc introduced in the previous patch, as well as older source files like as streams.cc, ttl.cc and serialization.cc, use the new header file. This patch removes over 700 lines of code from executor.cc, and also removes a large amount of utility functions declerations from executor.hh. Originally, executor.hh was meant to be about the interface that the Alternator server needs to execute the different DynamoDB API operations - and after this patch it returns closer to this original goal. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-16 14:30:16 +03:00
Nadav Har'El	751da00692	alternator: split alternator/executor.cc Already six years ago, in #5783, we noticed that alternator/executor.cc has grown too large. The previous patches added hundreds of more lines to it to implement vector search, and it reached a whopping 7,000 lines of code. This is too much. This patch splits from executor.cc two major chunks: 1. The implementation of read requests - GetItem, BatchGetItem, Query (base table, GSI/LSI, and vector-search), and Scan - was moved to a new source file alternator/executor_read.cc. The new file has 2,000 lines. 2. Moved 250 lines of template functions dealing with attribute paths and maps of them to a new header file, attribute_path.hh. These utilities are used for many different operations - various read operations use them for ProjectionExpression, and UpdateItem uses them for modifications to nested attributes, so we need the new header file from both executor.cc and executor_read.cc The remaining executor.cc is still pretty big, 5,000 lines, and contains write operations (PutItem, UpdateItem, DeleteItem, BatchWriteItem) as well as various table and other operations, and also many utility functions used by many types of operations, so we can later continue this refactoring effort. Refs #5783 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-04-16 14:30:10 +03:00

7 Commits