scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-20 16:40:35 +00:00

Author	SHA1	Message	Date
Piotr Dulikowski	9fc2c65d18	Merge 'cql3: implement WRITETIME() and TTL() of individual elements of map, set, and UDT' from Nadav Har'El In commit `727f68e0f5` we added the ability to SELECT: * Individual elements of a map: `SELECT map_col[key]`. * Individual elements of a set: `SELECT set_col[key]` returns key if the key exists in the set, or null if it doesn't, which allows checking whether the element exists in the set. * Individual pieces of a UDT: `SELECT udt_col.field`. But at the time, we didn't provide any way to retrieve the meta-data for this value, namely its timestamp and TTL. We did not support `SELECT TIMESTAMP(collection[key])`, or `SELECT TIMESTAMP(udt.field)`. Users requested to support such SELECTs in the past (see issue #15427), and Cassandra 5.0 added support for this feature - for both maps and sets and udts - so we also need this feature for compatibility. This feature was also requested recently by vector-search developers, who wanted to read Alternator columns - stored as map elements, not individual columns - with their WRITETIME information. The first four patches in this series add the feature (in four smaller patches instead of one big one), the fifth and sixth patches add tests (cqlpy and boost tests, respectively). The seventh patch adds documentation. All the new tests pass on Cassandra 5, failed on Scylla before the present fix, and pass with it. The fix was surprisingly difficult. Our existing implementation (from `727f68e0f5` building on earlier machinery) doesn't just "read" `map_col[key]` and allow us to return just its timestamp. Rather, the implementation reads the entire map, serializes it in some temporary format that does not include the timestamps and ttls, and then takes the subscript key, at which point we no longer have the timestamp or ttl of the element. So the fix had to cross all these layers of the implementation. 
While adding support for UDT fields in a pre-existing grammar nonterminal "subscriptExpr", we unintentionally added support for UDT fields also in LWT expressions (which used this nonterminal). LWT missing support for UDT fields was a long-time known compatibility issue (#13624) so we unintentionally fixed it :-) Actually, to completely fix it we needed another small change in the expression implementation, so the eighth patch in this series does this. Fixes #15427 Fixes #13624 Closes scylladb/scylladb#29134 * github.com:scylladb/scylladb: cql3: support UDT fields in LWT expressions cql3: document WRITETIME() and TTL() for elements of map, set or UDT test/boost: test WRITETIME() and TTL() on map collection elements test/cqlpy: test WRITETIME() and TTL() on element of map, set or UDT cql3: prepare and evaluate WRITETIME/TTL on collection elements and UDT fields cql3: parse per-element timestamps/TTLs in the selection layer cql3: add extended wire format for per-element timestamps and TTLs cql3: extend WRITETIME/TTL grammar to accept collection and UDT elements	2026-04-14 12:35:46 +02:00
Avi Kivity	0ae22a09d4	LICENSE: Update to version 1.1 Updated terms of non-commercial use (must be a never-customer).	2026-04-12 19:46:33 +03:00
Nadav Har'El	33dbb63aef	cql3: support UDT fields in LWT expressions In an earlier patch, we used the CQL grammar's "subscriptExpr" in the rule for WRITETIME() and TTL(). But since we also wanted these to support UDT fields (x.a), not just collection subscripts (x[3]), we expanded subscriptExpr to also support the field syntax. But LWT expressions already used this subscriptExpr, which meant that LWT expressions unintentionally gained support for UDT fields. Missing support for UDT fields in LWT is a long-standing known Cassandra-compatibility bug (#13624), and now our grammar finally supports the missing syntax. But supporting the syntax is not enough for correct implementation of this feature - we also need to fix the expression handling: Two bugs prevented expressions like `v.a = 0` from working in LWT IF clauses, where `v` is a column of user-defined type. The first bug was in get_lhs_receiver() in prepare_expr.cc: it lacked a handler for field_selection nodes, causing an "unexpected expression" internal error when preparing a condition like `IF v.a = 0`. The fix adds a handler that returns a column_specification whose type is taken from the prepared field_selection's type field. The second bug was in search_and_replace() in expression.cc: when recursing into a field_selection node it reconstructed it with only `structure` and `field`, silently dropping the `field_idx` and `type` fields that are set during preparation. As a result, any transformation that uses search_and_replace() on a prepared expression containing a field_selection — such as adjust_for_collection_as_maps() called from column_condition_prepare() — would zero out those fields. At evaluation time, type_of() on the field_selection returned a null data_type pointer, causing a segmentation fault when the comparison operator tried to call ->equal() through it. The fix preserves field_idx and type when reconstructing the node. Fixes #13624.	2026-04-12 14:28:01 +03:00
Nadav Har'El	35e807a36c	cql3: prepare and evaluate WRITETIME/TTL on collection elements and UDT fields Complete the implementation of SELECT WRITETIME(col[key])/TTL(col[key]) and WRITETIME(col.field)/TTL(col.field), building on the grammar (commit 1), wire format (commit 2), and selection-layer (commit 3) changes in the preceding patches. * prepare_column_mutation_attribute() (prepare_expr.cc) now handles the subscript and field_selection nodes that the grammar produces: - For subscripts, it validates that the inner column is a non-frozen map or set and checks the 'writetime_ttl_individual_element' feature flag so the feature is rejected during rolling upgrades. - For field selections, it validates that the inner column is a non-frozen UDT, with the same feature-flag check. * do_evaluate(column_mutation_attribute) (expression.cc) handles the same two cases. For a field selection it serializes the field index as a key and looks it up in collection_element_metadata; for a subscript it evaluates the subscript key and looks it up in the same map. A missing key (element not found or expired) returns NULL, matching Cassandra behavior. Together with the preceding three patches, this finally fixes #15427. The next three patches will add tests and documentation for the new feature, and the final eighth patch will fix the implementation of UDT fields in LWT expressions - which the first patch made the grammar allow but is still not implemented correctly.	2026-04-12 13:28:28 +03:00
Nadav Har'El	34eec020b3	cql3: replace abort() by throwing_assert() After the previous patch replaced all SCYLLA_ASSERT() calls by throwing_assert(), this patch also replaces all calls to abort(). All these abort() calls are supposedly cases that can never happen, but if they ever do happen because of a bug, in none of these places do we absolutely need to crash - an exception that aborts the current operation should be enough. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-03-11 09:43:11 +02:00
Karol Nowacki	77da4517d2	cql3: Make abstract_type explicitly noncopyable The polymorphic abstract_type class serves as an interface and should not be copied. To prevent accidental and unsafe copies, make it explicitly uncopyable.	2025-11-12 09:11:56 +01:00
Karol Nowacki	960fe3da60	cql3: Fix std::bad_cast when deserializing vectors of collections When deserializing a vector whose elements are collections (e.g., set, list), the operation raises a `std::bad_cast` exception. This was caused by type slicing due to an incorrect assignment of a polymorphic type by value instead of by reference. This resulted in a failed `dynamic_cast` even when the underlying type was correct.	2025-11-12 09:11:56 +01:00
Tomasz Grabiec	66755db062	locator, cql3: Support rack lists in replication options Allows per-DC replication factor to be either a string, holding a numerical value, or a list of strings, holding a list of rack names. The rack list is not respected yet by the tablet allocator, this is achieved in subsequent commit. This changes the format of options stored in the flattened map in system_schema.keyspaces#replication. Values which are rack lists, are converted into multiple entries, with the list index appended to the key with ':' as the separator: For example, this extended map: { 'dc1': '3', 'dc2': ['rack1', 'rack2'] } is stored as a flattened map: { 'dc1': '3', 'dc2:0': 'rack1', 'dc2:1': 'rack2' } Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Signed-off-by: Tomasz Grabiec <tgrabiec@scylladb.com>	2025-10-02 19:42:39 +02:00
Tomasz Grabiec	11b4a1ab58	cql3: Extract convert_property_map() out of Cql.g So that complex code is in a .cc file for better IDE assistance.	2025-10-01 16:06:52 +02:00
Ernest Zaslavsky	d624413ddd	treewide: Move query related files to a new `query` directory As requested in #22120, moved the files and fixed other includes and build system. Moved files: - query.cc - query-request.hh - query-result.hh - query-result-reader.hh - query-result-set.cc - query-result-set.hh - query-result-writer.hh - query_id.hh - query_result_merger.hh Fixes: #22120 This is a cleanup, no need to backport Closes scylladb/scylladb#25105	2025-09-16 23:40:47 +03:00
Dawid Pawlik	ed49093a01	expression: adjust collection constructor list style As mentioned in the previous commit, this change introduces usage of the vector style type and adjusts the functions using the list style type to distinguish vectors from lists. Rename collection constructor style list to list_or_vector.	2025-01-28 21:14:49 +01:00
Dawid Pawlik	69c754f0d4	expression: add vector style type The motivation for this change is to provide a distinguishable interface for vector type expressions. The square bracket literal is ambiguous between lists and vectors, so we need to make the distinction without relying on the CQL layer. At first we should use the collection constructor to manage both lists and vectors (although a vector is not a collection). Later, during preparation of expressions, we should be able to learn the exact type from the given receiver (column specification). Knowing the type of the expression, we may use its respective style type (in this case the vector style type being introduced), which would make the implementation more precise and allow us to evaluate the expressions properly. This commit introduces the vector style type and functions making use of it. However, the vector style type is not yet used anywhere; the next commit should adjust the collection constructor and make use of the new vector style type and its features.	2025-01-28 21:14:49 +01:00
Jan Łakomy	9561ae5fc8	types: implement vector_type_impl The vector is a fixed-length array of non-null specified type elements. Implement serialization, deserialization, comparison, JSON and Lua support, and other functionalities. Co-authored-by: Dawid Pawlik <501149991dp@gmail.com>	2025-01-26 19:36:41 +01:00
Michael Litvak	2701b5d50d	cql3: allow set subscript This allows to use subscript on a set column, in addition to map/list which was possible until now. The behavior is compatible with Cassandra - a subscript with a specific value returns the value if it's found in the set, and null otherwise.	2024-12-30 09:50:31 +02:00
Avi Kivity	f8ce49ebe9	cql3: implement NOT IN Where the grammar supports IN, we add NOT IN. This includes the WHERE clause and LWT IF clause. Evaluation of NOT IN follows from IN. In statement_restrictions analysis, they are different, as NOT IN doesn't enable any clever query plan and must filter. Some tests are added. An error message was changed ('in' changed to 'IN'), so some tests are adjusted. Closes scylladb/scylladb#21992	2024-12-22 15:15:23 +02:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Avi Kivity	494561c4f3	cql3: expr: drop boost usage Replace boost usage with <ranges>, modernizing the code a little and reducing dependencies on a redundant library. Closes scylladb/scylladb#20919	2024-10-03 15:39:40 +03:00
Benny Halevy	5a0f3889e0	treewide: use std::ranges sort functions rather than boost Using the standard library is preferred over boost. In cql3/expr/expression.cc to_sorted_vector got more of a face-lift and was modernized to also use std::unique and, while at it, to move its input range into the uniquely sorted result vector. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-10-01 14:19:05 +03:00
Avi Kivity	657848dcbb	cql3: statement_restrictions, expr: move restrictions-related expression utilities out of expression.cc Move all of the blatantly restriction-related expression utilities to statement_restrictions.cc. Some are so blatant as to include the word "restriction" in their name. Others are just so specialized that they cannot be used for anything else. The motivation is that further refactoring will be simplified if it can happen within the same module, as there will not be a need to prove it has no effect elsewhere. Most of the declarations are made non-public (in .cc file) to limit proliferation. A few are needed for tests or in select_statement.cc and so are kept public. Other than that, the only changes are namespace qualifications and removal of a now-duplicate definition ("inclusive"). Closes scylladb/scylladb#20732	2024-09-22 11:00:51 +03:00
Kefu Chai	3e84d43f93	treewide: use seastar::format() or fmt::format() explicitly before this change, we rely on `using namespace seastar` to use `seastar::format()` without qualifying the `format()` with its namespace. this works fine until we changed the parameter type of format string `seastar::format()` from `const char*` to `fmt::format_string<...>`. this change practically invited `seastar::format()` to the club of `std::format()` and `fmt::format()`, where all members accept a templated parameter as its `fmt` parameter. and `seastar::format()` is not the best candidate anymore. although argument-dependent lookup (ADL for short) favors the function which is in the same namespace as its parameter, `using namespace` makes `seastar::format()` more competitive, so both `std::format()` and `seastar::format()` are considered as the candidates. that is what is happening in scylladb in quite a few caller sites of `format()`, hence ADL is not able to tell which function is the winner in the name lookup: ``` /__w/scylladb/scylladb/mutation/mutation_fragment_stream_validator.cc:265:12: error: call to 'format' is ambiguous 265 \| return format("{} ({}.{} {})", _name_view, s.ks_name(), s.cf_name(), s.id()); \| ^~~~~~ /usr/bin/../lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/format:4290:5: note: candidate function [with _Args = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 4290 \| format(format_string<_Args...> __fmt, _Args&&... __args) \| ^ /__w/scylladb/scylladb/seastar/include/seastar/core/print.hh:143:1: note: candidate function [with A = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 143 \| format(fmt::format_string<A...> fmt, A&&... a) { \| ^ ``` in this change, we change all `format()` to either `fmt::format()` or `seastar::format()` with following rules: - if the caller expects an `sstring` or `std::string_view`, change to `seastar::format()` - if the caller expects an `std::string`, change to `fmt::format()`. because, `sstring::operator std::basic_string` would incur a deep copy. we will need another change to enable scylladb to compile with the latest seastar. namely, to pass the format string as a templated parameter down to helper functions which format their parameters. to minimize the scope of this change, let's include that change when bumping up the seastar submodule. as that change will depend on the seastar change. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-09-11 23:21:40 +03:00
Kefu Chai	6556cd684e	cql3: remove unused operator<< as these operators are not used anymore. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19288	2024-06-14 09:45:35 +03:00
Pavel Emelyanov	46bbfc0c53	expression: Shorten making raw_value from FragmentedView The read_field is std::optional<View>. The raw_value::make_value() accepts managed_bytes_opt which is std::optional<managed_bytes>. Finally, there's std::optional<T>::optional(std::optional<U>&&) move constructor (and its copy-constructor peer). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#18128	2024-04-01 16:52:18 +03:00
Kefu Chai	3d8ac06ee8	cql3: add fmt::formatter for expression::printer before this change, we already have a `fmt::formatter` specialized for `expression::printer`. but the formatter was implemented by 1. formatting the `printer` instance to an `ostringstream`, and 2. extracting a `std::string` from this `ostringstream` 3. formatting the `std::string` instance to the fmt context this is convoluted and is not an optimal implementation. so, in this change, it is reimplemented by formatting directly to the context. its operator<< is also dropped in this change. please note, to avoid adding the large chunk of code into the .hh file, the implementation is put in the .cc file. but in order to preserve the usage of `transformed(fmt::to_string<expression::printer>)`, the `format()` function is defined as a template, and instantiated explicitly for two use cases: 1. to format to `fmt::context` 2. to format using `fmt::to_string()` Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-05 14:00:13 +08:00
Kefu Chai	fc774361e8	cql3: add fmt::formatter for raw_value{,_view} before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * raw_value * raw_value_view `raw_value_view` 's operator<< is still being used by the generic homebrew printer for vector<>, so it is preserved. `raw_value` 's operator<< is still being used by the generic homebrew printer for optional<>, so it's preserved as well. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-05 14:00:13 +08:00
Avi Kivity	51df8b9173	interval: rename nonwrapping_interval to interval Our interval template started life as `range`, and supported wrapping to follow Cassandra's convention of wrapping around the maximum token. We later recognized that an interval type should usually be non-wrapping and split it into wrapping_range and nonwrapping_range, with `range` aliasing wrapping_range to preserve compatibility. Even later, we realized the name was already taken by C++ ranges and so renamed it to `interval`. Given that intervals are usually non-wrapping, the default `interval` type is non-wrapping. We can now simplify it further, recognizing that everyone assumes that an interval is non-wrapping and so doesn't need the nonwrapping_interval_designation. We just rename nonwrapping_interval to `interval` and remove the type alias.	2024-02-21 19:43:17 +02:00
Avi Kivity	605bf6e221	range.hh: retire range.hh was deprecated in `bd794629f9` (2020) since its names conflict with the C++ library concept of an iterator range. The name ::range also mapped to the dangerous wrapping_interval rather than nonwrapping_interval. Complete the deprecation by removing range.hh and replacing all the aliases by the names they point to from the interval library. Note this now exposes uses of wrapping intervals as they are now explicit. The unit tests are renamed and range.hh is deleted. Closes scylladb/scylladb#17428	2024-02-21 00:24:25 +02:00
Kefu Chai	2dbf044b91	cql3: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16791	2024-01-16 16:43:17 +02:00
Patryk Wrobel	f4e311e871	cql3: add formatter for cql3::expr::oper_t This change introduces a specialization of fmt::formatter for cql3::expr::oper_t. This enables the usage of this type with FMTv10, which dropped the default generated formatter. Usage of cql3::expr::oper_t without the defined formatter resulted in compilation error when compiled with FMTv10. Refs: #13245 Signed-off-by: Patryk Wrobel <patryk.wrobel@scylladb.com> Closes scylladb/scylladb#16719	2024-01-11 08:33:35 +02:00
Yaniv Kaul	c658bdb150	Typos: fix typos in comments Fixes some typos as found by codespell run on the code. In this commit, I was hoping to fix only comments, not user-visible alerts, output, etc. Follow-up commits will take care of them. Refs: https://github.com/scylladb/scylladb/issues/16255 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com>	2023-12-02 22:37:22 +02:00
Kefu Chai	15bfa09454	treewide: do not mark return value const if this has no effect this change is a cleanup. to mark a return value without value semantics has no effect. these `const` specifiers are useless. so let's drop them. and, if we compile the tree with `-Wignored-qualifiers`, the compiler would warn like: ``` /home/kefu/dev/scylladb/schema/schema.hh:245:5: error: 'const' type qualifier on return type has no effect [-Werror,-Wignored-qualifiers] 245 \| const index_metadata_kind kind() const; \| ^~~~~ ``` so this change also silences the above warnings. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-11-17 17:46:19 +08:00
Alexander Turetskiy	024ba84637	cql3: SELECT CAST column names should match Cassandra's When doing a SELECT CAST(b AS int), Cassandra returns a column named cast(b as int). Currently, Scylla uses a different name - system.castasint(b). For Cassandra compatibility, we should switch to the same name. fixes #14508 Closes scylladb/scylladb#14800	2023-09-26 17:26:14 +03:00
Avi Kivity	503d21b570	cql3: expr: avoid separating column_mutation_attribute from its column_value when levellizing aggregation depth Since `ec77172b4b` (" Merge 'cql3: convert the SELECT clause evaluation phase to expressions' from Avi Kivity"), we rewrite non-aggregating selectors to include an aggregation, in order to have the rest of the code either deal with no aggregation, or all selectors aggregating, with nothing in between. This is done by wrapping column selectors with "first" function calls: col -> first(col). This broke non-aggregating selectors that included the ttl() or writetime() pseudo functions. This is because we rewrote them as writetime(first(col)), and writetime() isn't a function that operates on any values; it operates on mutations and so must have access to a column, not an expression. Fix by detecting this scenario and rewriting the expression as first(writetime(col)). Unit and integration tests are added. Fixes #14715. Closes #14716	2023-07-19 11:35:01 +03:00
Avi Kivity	4fc870a31a	cql3: expr: avoid redoing prepare work when evaluating field_selection prepare_expression() already validates the types and computes the index of the field; no need to redo that work when evaluating the expression. The tests are adjusted to also prepare the expression. Closes #14562	2023-07-16 14:29:19 +03:00
Kefu Chai	7a334c53af	cql3: expression: correct format string fmtlib uses `{}` as the placeholder for the formatted argument, not `{}}`. so let's correct it. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14586	2023-07-09 22:26:29 +03:00
Avi Kivity	0021f77e30	cql3: expression: fix field_selection::type interpretation by evaluate() field_selection::type refers to the type of the selection operation, not the type of the structure being selected. This is what prepare_expression() generates and how all other expression elements work, but evaluate() for field_selection thinks it's the type of the structure, and so fails when it gets an expression from prepare_expression(). Fix that, and adjust the tests.	2023-07-03 19:45:17 +03:00
Avi Kivity	a26516ef65	cql3: expression: add helper to split expressions with aggregate functions Aggregate functions cannot be evaluated directly, since they implicitly refer to state (the accumulator). To allow for evaluation, we split the expression into two: an inner expression that is evaluated over the input vector (once per element). The inner expression calls the aggregation function, with an extra input parameter (the accumulator). The outer expression is evaluated once per input vector; it calls the final function, and its input is just the accumulator. The outer expression also contains any expressions that operate on the result of the aggregate function. The accumulator is stored in a temporary. Simple example: sum(x) is transformed into an inner expression: t1 = (t1 + x) // really sum.aggregation_function and an outer expression: result = t1 // really sum.state_to_result_function Complicated example: scalar_func(agg1(x, f1(y)), agg2(x, f2(y))) is transformed into two inner expressions: t1 = agg1.aggregation_function(t1, x, f1(y)) t2 = agg2.aggregation_function(t2, x, f2(y)) and an outer expression output = scalar_func(agg1.state_to_result_function(t1), agg2.state_to_result_function(t2)) There's a small wart: automatically parallelized queries can generate "reducible" aggregates that have no state_to_result function, since we want to pass the state back to the coordinator. Detect that and short circuit evaluation to pass the accumulator directly.	2023-07-03 19:45:17 +03:00
Avi Kivity	ecdded90cd	cql3: selection: skip first_function when collecting metadata We plan to rewrite aggregation queries that have a non-aggregating selector using the first function, so that all selectors are aggregates (or none are). Prevent the first function from affecting metadata (the auto-generated column names), by skipping over the first function if detected. The input and output types are unchanged so this only affects the name.	2023-07-03 19:45:17 +03:00
Avi Kivity	778ae2b461	cql3: expression: introduce temporaries Temporaries are similar to bind variables - they are values provided from outside the expression. While bind variables are provided by the user, temporaries are generated internally. The intended use is for aggregate accumulator storage. Currently aggregates store the accumulator in aggregate_function_selector::_accumulator, which means the entire selector hierarchy must be cloned for every query. With expressions, we can have a single expression object reused for many computations, but we need a way to inject the accumulator into an aggregation, which this new expression element provides.	2023-07-03 19:45:17 +03:00
Avi Kivity	7aee322a6c	cql3: expressions: add "metadata mode" formatter for expressions When returning a result set (and when preparing a statement), we return metadata about the result set columns. Part of that is the column names, which are derived from the expressions used as selectors. Currently, they are computed via selector::column_name(), but as we're dismantling that hierarchy we need a different way to obtain those names. It turns out that the expression formatter is close enough to what we need. To avoid disturbing the current :user mode, add a new :metadata mode and apply the adjustments needed to bring it in line with what column metadata looks like today. Note that column metadata is visible to applications and they can depend on it; e.g. the Python driver allows choosing columns based on their names rather than ordinal position.	2023-07-03 19:45:17 +03:00
Avi Kivity	99fe0ee772	cql3: expression: reimplement verify_no_aggregate_functions() Most clauses in a CQL statement don't tolerate aggregate functions, and so they call verify_no_aggregate_functions(). It can now be reimplemented in terms of aggregation_depth(), removing some code.	2023-07-03 19:45:17 +03:00
Avi Kivity	b1b4a18ad8	cql3: expression: add helpers to manage an expression's aggregation depth We define the "aggregation depth" of an expression by how many nested aggregation functions are applied. In CQL/SQL, legal values are 0 and 1, but for generality we deal with any aggregation depth. The first helper measures the maximum aggregation depth along any path in the expression graph. If it's 2 or greater, we have something like max(max(x)) and we should reject it (though these helpers don't). If we get 1 it's a simple aggregation. If it's zero then we're not aggregating (though CQL may decide to aggregate anyway if GROUP BY is used). The second helper edits an expression to make sure the aggregation depth along any path that reaches a column is the same. Logically, `SELECT x, max(y)` does not make sense, as one is a vector of values and the other is a scalar. CQL resolves the problem by defining x as "the first value seen". We apply this resolution by converting the query to `SELECT first(x), max(y)` (where `first()` is an internal aggregate function), so both selectors refer to scalars that consume vectors. When a scalar is consumed by an aggregate function (for example, `SELECT max(x), min(17)`) we don't have to bother, since a scalar is implicitly promoted to a vector by evaluating it every row. There is some ambiguity if the scalar is a non-pure function (e.g. `SELECT max(x), min(random())`), but it's not worth following. A small unit test is added.	2023-07-03 19:45:16 +03:00
Avi Kivity	faf0ea0f68	cql3: expression: improve printing of prepared function calls Currently, a prepared function_call expression is printed as an "anonymous function", but it's not really anonymous - the name is available. Print it out. This helps in a unit test later on (and is worthwhile by itself).	2023-07-03 19:02:33 +03:00
Avi Kivity	b858a4669d	cql3: expr: break up expression.hh header Adding a function declaration to expression.hh causes many recompilations. Reduce that by: - moving some restrictions-related definitions to the existing expr/restrictions.hh - moving evaluation related names to a new header expr/evaluate.hh - move utilities to a new header expr/expr-utilities.hh expression.hh contains only expression definitions and the most basic and common helpers, like printing.	2023-06-22 14:21:03 +03:00
Kamil Braun	23a60df92d	Merge 'cql3: expr: simplify evaluate()' from Avi Kivity Make evaluate()'s body more regular, then exploit it by replacing the long list of branches with a lambda template. Closes #14306 * github.com:scylladb/scylladb: cql3: expr: simplify evaluate() cql3: expr: standardize evaluate() branches to call do_evaluate() cql3: expr: rename evaluate(ExpressionElement) to do_evaluate()	2023-06-22 12:18:36 +02:00
Avi Kivity	32b27d6a08	cql3: expr: change evaluation_input vector components to take spans Spans are slightly cleaner, slightly faster (as they avoid an indirection), and allow for replacing some of the arguments with small_vector:s. Closes #14313	2023-06-22 11:28:01 +02:00
Avi Kivity	453bbc1115	cql3: expr: improve error message when rejecting aggregation functions in illegal contexts Fix a small grammatical error, and capitalize WHERE in accordance with SQL tradition. Closes #14288	2023-06-20 17:52:53 +03:00
Avi Kivity	792c46c0f8	cql3: expr: simplify evaluate() Now that all branches in the visitor are uniform and consist of a single call to do_evaluate() overloads, we can simplify by calling a lambda template that does just that.	2023-06-20 02:33:10 +03:00
Avi Kivity	66e0326385	cql3: expr: standardize evaluate() branches to call do_evaluate() Extract the various snippets into do_evaluate() overloads. We'll exploit this in the next patch.	2023-06-20 02:19:33 +03:00
Avi Kivity	b64eeefa35	cql3: expr: rename evaluate(ExpressionElement) to do_evaluate() evaluate(expression) calls the various evaluate(ExpressionElement) overloads to perform its work. However, if we add an ExpressionElement and forget to implement its evaluate() overload, we'll end up with infinite recursion. It will be caught immediately, but better to avoid it. Also sprinkle static:s on do_evaluate() where missing.	2023-06-20 02:10:18 +03:00
Avi Kivity	7090f4c43b	cql3: expr: evaluate() column_mutation_attribute Enhance evaluation_inputs with timestamps and ttls, and use them to evaluate writetime/ttl. The data structure is compatible with the current way of doing things (see result_set_builder::_timestamps, result_set_builder::_ttls). We use std::span<> instead of std::vector<> as it is more general and a tiny bit faster. The algorithm is taken from writetime_or_ttl_selector::add_input().	2023-06-18 22:41:09 +03:00
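
The flattened replication-options format described in commit 66755db062 (a rack list stored as multiple entries, with the list index appended to the key after a ':' separator) can be sketched as follows. This is a minimal illustration only: the type aliases and the `flatten` function are hypothetical names chosen for this example, not ScyllaDB's actual declarations.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <variant>
#include <vector>

// Hypothetical types for illustration; not ScyllaDB's real declarations.
using rack_list = std::vector<std::string>;
using replication_value = std::variant<std::string, rack_list>;
using extended_options = std::map<std::string, replication_value>;
using flat_options = std::map<std::string, std::string>;

// A plain string value is stored under its key unchanged.
// A rack list becomes one entry per rack, keyed "<dc>:<index>".
flat_options flatten(const extended_options& opts) {
    flat_options out;
    for (const auto& [key, value] : opts) {
        if (const auto* s = std::get_if<std::string>(&value)) {
            out[key] = *s;
        } else {
            const auto& racks = std::get<rack_list>(value);
            for (std::size_t i = 0; i < racks.size(); ++i) {
                out[key + ":" + std::to_string(i)] = racks[i];
            }
        }
    }
    return out;
}
```

With the extended map from the commit message, `{ 'dc1': '3', 'dc2': ['rack1', 'rack2'] }`, this sketch produces the three flattened entries `dc1`, `dc2:0` and `dc2:1`.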


285 Commits
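
The "aggregation depth" measure described in commit b1b4a18ad8 (the maximum number of nested aggregate calls along any path in the expression) can be sketched on a toy expression tree. The `expr` struct and the `column`/`call` helpers below are hypothetical simplifications for illustration, not the cql3 expression types, and the sketch covers only the first helper (measuring depth), not the rewrite to `first()`.

```cpp
#include <algorithm>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified expression node (assumption for this sketch).
struct expr {
    std::string name;
    bool is_aggregate = false;
    std::vector<expr> args;  // empty for a column or a literal
};

// Maximum number of nested aggregate calls along any path from this node
// down to a leaf.
unsigned aggregation_depth(const expr& e) {
    unsigned deepest_child = 0;
    for (const auto& a : e.args) {
        deepest_child = std::max(deepest_child, aggregation_depth(a));
    }
    return deepest_child + (e.is_aggregate ? 1u : 0u);
}

// Convenience constructors for building test expressions.
expr column(std::string n) {
    return expr{std::move(n), false, {}};
}

expr call(std::string n, bool is_aggregate, std::vector<expr> args) {
    return expr{std::move(n), is_aggregate, std::move(args)};
}
```

Under these assumptions a bare column has depth 0, `max(x)` has depth 1, and `max(max(x))` has depth 2, which is the case the commit message says should be rejected by the caller.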