scylladb

Author	SHA1	Message	Date
Pavel Emelyanov	46bbfc0c53	expression: Shorten making raw_value from FragmetedView The read_field is std::optional<View>. The raw_value::make_value() accepts managed_bytes_opt which is std::optional<manager_bytes>. Finally, there's std::optional<T>::optional(std::optional<U>&&) move constructor (and its copy-constructor peer). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#18128	2024-04-01 16:52:18 +03:00
Kefu Chai	3d8ac06ee8	cql3: add fmt::formatter for expression::printer before this change, we already have a `fmt::formatter` specialized for `expression::printer`. but the formatter was implemented by 1. formatting the `printer` instance to an `ostringstream`, and 2. extracting a `std::string` from this `ostringstream` 3. formatting the `std::string` instance to the fmt context this is convoluted and is not an optimal implementation. so, in this change, it is reimplemented by formatting directly to the context. its operator<< is also dropped in this change. please note, to avoid adding the large chunk of code into the .hh file, the implementation is put in the .cc file. but in order to preserve the usage of `transformed(fmt::to_string<expression::printer>)`, the `format()` function is defined as a template, and instantiated explicitly for two use cases: 1. to format to `fmt::context` 2. to format using `fmt::to_string()` Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-05 14:00:13 +08:00
Kefu Chai	fc774361e8	cql3: add fmt::formatter for raw_value{,_view} before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * raw_value * raw_value_view `raw_value_view` 's operator<< is still being used by the generic homebrew printer for vector<>, so it is preserved. `raw_value` 's operator<< is still being used by the generic homebrew printer for optional<>, so it's preserved as well. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-05 14:00:13 +08:00
Avi Kivity	51df8b9173	interval: rename nonwrapping_interval to interval Our interval template started life as `range`, and was supported wrapping to follow Cassandra's convention of wrapping around the maximum token. We later recognized that an interval type should usually be non-wrapping and split it into wrapping_range and nonwrapping_range, with `range` aliasing wrapping_range to preserve compatibility. Even later, we realized the name was already taken by C++ ranges and so renamed it to `interval`. Given that intervals are usually non-wrapping, the default `interval` type is non-wrapping. We can now simplify it further, recognizing that everyone assumes that an interval is non-wrapping and so doesn't need the nonwrapping_interval_designation. We just rename nonwrapping_interval to `interval` and remove the type alias.	2024-02-21 19:43:17 +02:00
Avi Kivity	605bf6e221	range.hh: retire range.hh was deprecated in `bd794629f9` (2020) since its names conflict with the C++ library concept of an iterator range. The name ::range also mapped to the dangerous wrapping_interval rather than nonwrapping_interval. Complete the deprecation by removing range.hh and replacing all the aliases by the names they point to from the interval library. Note this now exposes uses of wrapping intervals as they are now explicit. The unit tests are renamed and range.hh is deleted. Closes scylladb/scylladb#17428	2024-02-21 00:24:25 +02:00
Kefu Chai	2dbf044b91	cql3: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16791	2024-01-16 16:43:17 +02:00
Patryk Wrobel	f4e311e871	cql3: add formatter for cql3::expr::oper_t This change introduces a specialization of fmt::formatter for cql3::expr::oper_t. This enables the usage of this type with FMTv10, which dropped the default generated formatter. Usage of cql3::expr::oper_t without the defined formatter resulted in compilation error when compiled with FMTv10. Refs: #13245 Signed-off-by: Patryk Wrobel <patryk.wrobel@scylladb.com> Closes scylladb/scylladb#16719	2024-01-11 08:33:35 +02:00
Yaniv Kaul	c658bdb150	Typos: fix typos in comments Fixes some typos as found by codespell run on the code. In this commit, I was hoping to fix only comments, not user-visible alerts, output, etc. Follow-up commits will take care of them. Refs: https://github.com/scylladb/scylladb/issues/16255 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com>	2023-12-02 22:37:22 +02:00
Kefu Chai	15bfa09454	treewide: do not mark return value const if this has no effect this change is a cleanup. to mark a return value without value semantics has no effect. these `const` specifier useless. so let's drop them. and, if we compile the tree with `-Wignore-qualifiers`, the compiler would warn like: ``` /home/kefu/dev/scylladb/schema/schema.hh:245:5: error: 'const' type qualifier on return type has no effect [-Werror,-Wignored-qualifiers] 245 \| const index_metadata_kind kind() const; \| ^~~~~ ``` so this change also silences the above warnings. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-11-17 17:46:19 +08:00
Alexander Turetskiy	024ba84637	cql3: SELECT CAST column names should match Cassandra's When doing a SELECT CAST(b AS int), Cassandra returns a column named cast(b as int). Currently, Scylla uses a different name - system.castasint(b). For Cassandra compatibility, we should switch to the same name. fixes #14508 Closes scylladb/scylladb#14800	2023-09-26 17:26:14 +03:00
Avi Kivity	503d21b570	cql3: expr: avoid separating column_mutation_attribute from its column_value when levellizing aggregation depth Since `ec77172b4b` (" Merge 'cql3: convert the SELECT clause evaluation phase to expressions' from Avi Kivity"), we rewrite non-aggregating selectors to include an aggregation, in order to have the rest of the code either deal with no aggregation, or all selectors aggregating, with nothing in between. This is done by wrapping column selectors with "first" function calls: col -> first(col). This broke non-aggregating selectors that included the ttl() or writetime() pseudo functions. This is because we rewrote them as writetime(first(col)), and writetime() isn't a function that operates on any values; it operates on mutations and so must have access to a column, not an expression. Fix by detecting this scenario and rewriting the expression as first(writetime(col)). Unit and integration tests are added. Fixes #14715. Closes #14716	2023-07-19 11:35:01 +03:00
Avi Kivity	4fc870a31a	cql3: expr: avoid redoing prepare work when evaluating field_selection prepare_expression() already validates the types and computes the index of the field; no need to redo that work when evaluating the expression. The tests are adjusted to also prepare the expression. Closes #14562	2023-07-16 14:29:19 +03:00
Kefu Chai	7a334c53af	cql3: expression: correct format string fmtlib uses `{}` as the placeholder for the formatted argument, not `{}}`. so let's correct it. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14586	2023-07-09 22:26:29 +03:00
Avi Kivity	0021f77e30	cql3: expression: fix field_selection::type interpretation by evaluate() field_selection::type refers to the type of the selection operation, not the type of the structure being selected. This is what prepare_expression() generates and how all other expression elements work, but evaluate() for field_selection thinks it's the type of the structure, and so fails when it gets an expression from prepare_expression(). Fix that, and adjust the tests.	2023-07-03 19:45:17 +03:00
Avi Kivity	a26516ef65	cql3: expression: add helper to split expressions with aggregate functions Aggregate functions cannot be evaluated directly, since they implicitly refer to state (the accumulator). To allow for evaluation, we split the expression into two: an inner expression that is evaluated over the input vector (once per element). The inner expression calls the aggregation function, with an extra input parameter (the accumulator). The outer expression is evaluated once per input vector; it calls the final function, and its input is just the accumulator. The outer expression also contains any expressions that operate on the result of the aggregate function. The acculator is stored in a temporary. Simple example: sum(x) is transformed into an inner expression: t1 = (t1 + x) // really sum.aggregation_function and an outer expression: result = t1 // really sum.state_to_result_function Complicated example: scalar_func(agg1(x, f1(y)), agg2(x, f2(y))) is transformed into two inner expressions: t1 = agg1.aggregation_function(t1, x, f1(y)) t2 = agg2.aggregation_function(t2, x, f2(y)) and an outer expression output = scalar_func(agg1.state_to_result_function(t1), agg2.state_to_result_function(t2)) There's a small wart: automatically parallelized queries can generate "reducible" aggregates that have no state_to_result function, since we want to pass the state back to the coordinator. Detect that and short circuit evaluation to pass the accumulator directly.	2023-07-03 19:45:17 +03:00
Avi Kivity	ecdded90cd	cql3: selection: skip first_function when collecting metadata We plan to rewrite aggregation queries that have a non-aggregating selector using the first function, so that all selectors are aggregates (or none are). Prevent the first function from affecting metadata (the auto-generated column names), by skipping over the first function if detected. They input and output types are unchanged so this only affects the name.	2023-07-03 19:45:17 +03:00
Avi Kivity	778ae2b461	cql3: expression: introduce temporaries Temporaries are similar to bind variables - they are values provided from outside the expression. While bind variables are provided by the user, temporaries are generated internally. The intended use is for aggregate accumulator storage. Currently aggregates store the accumulator in aggregate_function_selector::_accumulator, which means the entire selector hierarchy must be cloned for every query. With expressions, we can have a single expression object reused for many computations, but we need a way to inject the accumulator into an aggregation, which this new expression element provides.	2023-07-03 19:45:17 +03:00
Avi Kivity	7aee322a6c	cql3: expressions: add "metadata mode" formatter for expressions When returning a result set (and when preparing a statement), we return metadata about the result set columns. Part of that is the column names, which are derived from the expressions used as selectors. Currently, they are computed via selector::column_name(), but as we're dismantling that hierarchy we need a different way to obtain those names. It turns out that the expression formatter is close enough to what we need. To avoid disturbing the current :user mode, add a new :metadata mode and apply the adjustments needed to bring it in line with what column metadata looks like today. Note that column metadata is visible to applications and they can depend on it; e.g. the Python driver allows choosing columns based on their names rather than ordinal position.	2023-07-03 19:45:17 +03:00
Avi Kivity	99fe0ee772	cql3: expression: reimplement verify_no_aggregate_functions() Most clauses in a CQL statement don't tolerate aggregate functions, and so they call verify_no_aggregate_functions(). It can now be reimplemented in terms of aggregation_depth(), removing some code.	2023-07-03 19:45:17 +03:00
Avi Kivity	b1b4a18ad8	cql3: expression: add helpers to manage an expression's aggregation depth We define the "aggregation depth" of an expression by how many nested aggregation functions are applied. In CQL/SQL, legal values are 0 and 1, but for generality we deal with any aggregation depth. The first helper measures the maximum aggregation depth along any path in the expression graph. If it's 2 or greater, we have something like max(max(x)) and we should reject it (though these helpers don't). If we get 1 it's a simple aggregation. If it's zero then we're not aggregating (though CQL may decide to aggregate anyway if GROUP BY is used). The second helper edits an expression to make sure the aggregation depth along any path that reaches a column is the same. Logically, `SELECT x, max(y)` does not make sense, as one is a vector of values and the other is a scalar. CQL resolves the problem by defining x as "the first value seen". We apply this resolution by converting the query to `SELECT first(x), max(y)` (where `first()` is an internal aggregate function), so both selectors refer to scalars that consume vectors. When a scalar is consumed by an aggregate function (for example, `SELECT max(x), min(17)` we don't have to bother, since a scalar is implicity promoted to a vector by evaluating it every row. There is some ambiguity if the scalar is a non-pure function (e.g. `SELECT max(x), min(random())`, but it's not worth following. A small unit test is added.	2023-07-03 19:45:16 +03:00
Avi Kivity	faf0ea0f68	cql3: expression: improve printing of prepared function calls Currently, a prepared function_call expression is printed as an "anonymous function", but it's not really anonymous - the name is available. Print it out. This helps in a unit test later on (and is worthwhile by itself).	2023-07-03 19:02:33 +03:00
Avi Kivity	b858a4669d	cql3: expr: break up expression.hh header Adding a function declaration to expression.hh causes many recompilations. Reduce that by: - moving some restrictions-related definitions to the existing expr/restrictions.hh - moving evaluation related names to a new header expr/evaluate.hh - move utilities to a new header expr/expr-utilities.hh expression.hh contains only expression definitions and the most basic and common helpers, like printing.	2023-06-22 14:21:03 +03:00
Kamil Braun	23a60df92d	Merge 'cql3: expr: simplify evaluate()' from Avi Kivity Make evaluate()'s body more regular, then exploit it by replacing the long list of branches with a lambda template. Closes #14306 * github.com:scylladb/scylladb: cql3: expr: simplify evaluate() cql3: expr: standardize evaluate() branches to call do_evaluate() cql3: expr: rename evaluate(ExpressionElement) to do_evaluate()	2023-06-22 12:18:36 +02:00
Avi Kivity	32b27d6a08	cql3: expr: change evaluation_input vector components to take spans Spans are slightly cleaner, slightly faster (as they avoid an indirection), and allow for replacing some of the arguments with small_vector:s. Closes #14313	2023-06-22 11:28:01 +02:00
Avi Kivity	453bbc1115	cql3: expr: improve error message when rejecting aggregation functions in illegal contexts Fix a small grammatical error, and capitalize WHERE in accordance with SQL tradition. Closes #14288	2023-06-20 17:52:53 +03:00
Avi Kivity	792c46c0f8	cql3: expr: simplify evaluate() Now that all branches in the visitor are uniform and consist of a single call to do_evaluate() overloads, we can simplify by calling a lambda template that does just that.	2023-06-20 02:33:10 +03:00
Avi Kivity	66e0326385	cql3: expr: standardize evaluate() branches to call do_evaluate() Extract the various snippets into do_evaluate() overloads. We'll exploit this in the next patch.	2023-06-20 02:19:33 +03:00
Avi Kivity	b64eeefa35	cql3: expr: rename evaluate(ExpressionElement) to do_evaluate() evaluate(expression) calls the various evaluate(ExpressionElement) overloads to perform its work. However, if we add an ExpressionElement and forget to implement its evaluate() overload, we'll end up in with infinite recursion. It will be caught immediately, but better to avoid it. Also sprinkle static:s on do_evaluate() where missing.	2023-06-20 02:10:18 +03:00
Avi Kivity	7090f4c43b	cql3: expr: evaluate() column_mutation_attribute Enhance evaluation_inputs with timestamps and ttls, and use them to evaluate writetime/ttl. The data structure is compatible with the current way of doing things (see result_set_builder::_timestamps, result_set_build::_ttls). We use std::span<> instead of std::vector<> as it is more general and a tiny bit faster. The algorithm is taken from writetime_or_ttl_selector::add_input().	2023-06-18 22:41:09 +03:00
Nadav Har'El	97d444bbf7	Merge 'cql3/expression: implement evaluate(field_selection) ' from Jan Ciołek Implement `expr:valuate()` for `expr::field_selection`. `field_selection` is used to represent access to a struct field. For example, with a UDT value: ``` CREATE TYPE my_type (a int, b int); ``` The expression `my_type_value.a` would be represented as a `field_selection`, which selects the field `a`. Evaluating such an expression consists of finding the right element's value in a serialized UDT value and returning it. Note that it's still not possible to use `field_selection` inside the `WHERE` clause. Enabling it would require changes to the grammar, as well as query planning, Current `statement_restrictions` just reacts with `on_internal_error` when it encounters a `field_selection`. Nonetheless it's a step towards relaxing the grammar, and now it's finally possible to evaluate all kinds of prepared expressions (#12906) Fixes: https://github.com/scylladb/scylladb/issues/12906 Closes #14235 * github.com:scylladb/scylladb: boost/expr_test: test evaluate(field_selection) cql3/expr: fix printing of field_selection cql3/expression: implement evaluate(field_selection) types/user: modify idx_of_field to use bytes_view column_identifer: add column_identifier_raw::text() types: add read_nth_user_type_field() types: add read_nth_tuple_element()	2023-06-18 11:08:25 +03:00
Jan Ciolek	ee660f2d61	cql3/expr: fix printing of field_selection expression printing has two modes: debug and user. The user mode should output standard CQL that can be parsed back to an expression. In debug mode there can be some additional information that helps with debugging stuff. The code for printing `field_selection` didn't distinguish between user mode and debug mode. It just always printed in debug mode, with extra parenthesis around the field selection. Let's change it so that it emits valid CQL in user mdoe. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-06-16 01:21:02 +02:00
Jan Ciolek	f79f3ea3ae	cql3/expression: implement evaluate(field_selection) Implement expr:valuate() for expr::field_selection. `field_selection` is used to represent access to a struct field. For example, with a UDT value: ``` CREATE TYPE my_type (a int, b int); ``` The expression `my_type_value.a` would be represented as a field_selection, which selects the field 'a'. Evaluating such an expression consists of finding the right element's value in a serialized UDT value and returning it. Fixes: https://github.com/scylladb/scylladb/issues/12906 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-06-16 01:21:00 +02:00
Avi Kivity	ecfe4ad53a	cql3: expr: make the two styles of cast expressions explicit CQL supports two cast styles: - C-style: (type) expr, used for casts between binary-compatible types and for type hinting of bind variables - SQL-tyle: (expr AS type), used for real type convertions Currently, the expression system differentiates them by the cast::type field, which is a data_type for SQL-style casts and a cql3_type::raw for C-style casts, but that won't work after the prepare phase is applied to SQL-style casts when the type field will be prepared into a data_type. Prepare for this by adding a separate enum to distinguish between the two styles.	2023-06-13 21:04:49 +03:00
Avi Kivity	6db916e5b6	cql3: expr: add verify_no_aggregate_functions() helper Aggregate functions are only allowed in certain contexts (the SELECT clause and the HAVING clause, which we don't yet have). prepare_expr() currently rejects aggregate functions, but that means we cannot use it to prepare selectors. To prepare for the use of prepare_expr() in selectors, we'll have to move the check out of prepare_expr(). This helper is the beginning of that change. I considered adding a parameter to prepare_expr(), but that is even more noisy than adding a call to the helper.	2023-06-13 21:04:49 +03:00
Avi Kivity	54f3050225	cql3: expr: extract column_mutation_attribute_type column_mutation_attribute_type() returns int32_type or long_type depending on whether TTL or WRITETIME is requested. Will be used later when we prepare column_mutation_attribute expressions.	2023-06-13 21:04:49 +03:00
Avi Kivity	d2f4bd8b85	cql3: expr: add fmt formatter for column_mutation_attribute_kind It's easier to use for logging.	2023-06-13 21:04:49 +03:00
Jan Ciolek	1bcb4c024c	cql3/expr: print expressions in user-friendly way by default When a CQL expression is printed, it can be done using either the `debug` mode, or the `user` mode. `user` mode is basically how you would expect the CQL to be printed, it can be printed and then parsed back. `debug` mode is more detailed, for example in `debug` mode a column name can be displayed as `unresolved_identifier(my_column)`, which can't be parsed back to CQL. The default way of printing is the `debug` mode, but this requires us to remember to enable the `user` mode each time we're printing a user-facing message, for example for an invalid_request_exception. It's cumbersome and people forget about it, so let's change the default to `user`. There issues about expressions being printed in a `strange` way, this fixes them. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes #13916	2023-05-18 20:57:00 +03:00
Jan Ciolek	be8ef63bf5	cql3: remove expr::token Let's remove expr::token and replace all of its functionality with expr::function_call. expr::token is a struct whose job is to represent a partition key token. The idea is that when the user types in `token(p1, p2) < 1234`, this will be internally represented as an expression which uses expr::token to represent the `token(p1, p2)` part. The situation with expr::token is a bit complicated. On one hand side it's supposed to represent the partition token, but sometimes it's also assumed that it can represent a generic call to the token() function, for example `token(1, 2, 3)` could be a function_call, but it could also be expr::token. The query planning code assumes that each occurence of expr::token represents the partition token without checking the arguments. Because of this allowing `token(1, 2, 3)` to be represented as expr::token is dangerous - the query planning might think that it is `token(p1, p2, p3)` and plan the query based on this, which would be wrong. Currently expr::token is created only in one specific case. When the parser detects that the user typed in a restriction which has a call to `token` on the LHS it generates expr::token. In all other cases it generates an `expr::function_call`. Even when the `function_call` represents a valid partition token, it stays a `function_call`. During preparation there is no check to see if a `function_call` to `token` could be turned into `expr::token`. This is a bit inconsistent - sometimes `token(p1, p2, p3)` is represented as `expr::token` and the query planner handles that, but sometimes it might be represented as `function_call`, which the query planner doesn't handle. There is also a problem because there's a lot of duplication between a `function_call` and `expr::token`. All of the evaluation and preparation is the same for `expr::token` as it's for a `function_call` to the token function. Currently it's impossible to evaluate `expr::token` and preparation has some flaws, but implementing it would basically consist of copy-pasting the corresponding code from token `function_call`. One more aspect is multi-table queries. With `expr::token` we turn a call to the `token()` function into a struct that is schema-specific. What happens when a single expression is used to make queries to multiple tables? The schema is different, so something that is representad as `expr::token` for one schema would be represented as `function_call` in the context of a different schema. Translating expressions to different tables would require careful manipulation to convert `expr::token` to `function_call` and vice versa. This could cause trouble for index queries. Overall I think it would be best to remove expr::token. Although having a clear marker for the partition token is sometimes nice for query planning, in my opinion the pros are outweighted by the cons. I'm a big fan of having a single way to represent things, having two separate representations of the same thing without clear boundaries between them causes trouble. Instead of having expr::token and function_call we can just have the function_call and check if it represents a partition token when needed. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:11:31 +02:00
Jan Ciolek	d3a958490e	cql3/expr: properly print token function_call Printing for function_call is a bit strange. When printing an unprepared function it prints the name and then the arguments. For prepared function it prints <anonymous function> as the name and then the arguments. Prepared functions have a name() method, but printing doesn't use it, maybe not all functions have a valid name(?). The token() function will soon be represent as a function_call and it should be printable in a user-readable way. Let's add an if which prints `token(arg1, arg2)` instead of `<anonymous function>(arg1, arg2)` when printing a call to the token function. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:53 +02:00
Jan Ciolek	096efc2f38	cql3/expr: split possible_lhs_values into column and token variants The possible_lhs_values takes an expression and a column and finds all possible values for the column that make the expression true. Apart from finding column values it's also capable of finding all matching values for the partition key token. When a nullptr column is passed, possible_lhs_values switches into token values mode and finds all values for the token. This interface isn't ideal. It's confusing to pass a nullptr column when one wants to find values for the token. It would be better to have a flag, or just have a separate function. Additionally in the future expr::token will be removed and we will use expr::is_partition_token_for_schema to find all occurences of the partition token. expr::is_partition_token_for_schema takes a schema as an argument, which possible_lhs_values doesn't have, so it would have to be extended to get the schema from somewhere. To fix these two problems let's split possible_lhs_values into two functions - one that finds possible values for a column, which doesn't require a schema, and one that finds possible values for the partition token and requires a schema: value_set possible_column_values(const column_definition* col, const expression& e, const query_options& options); value_set possible_partition_token_values(const expression& e, const query_options& options, const schema& table_schema); This will make the interface cleaner and enable smooth transition once expr::token is removed. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:53 +02:00
Jan Ciolek	f2e5f654f2	cql3/expr: fix error message in possible_lhs_values In possible_lhs_values there was a message talking about is_satisifed_by. It looks like a badly copy-pasted message. Change it to possibel_lhs_values as it should be. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:52 +02:00
Avi Kivity	dc3c28516d	cql3: expr: reimplement is_satisfied_by() in terms of evaluate() It calls evaluate() internally anyway. There's a scary if () in there talking about tokens, but everything appears to work.	2023-04-29 13:04:52 +02:00
Jan Ciolek	ad5c931102	cql3/expr: add a schema argument to expr::replace_token Just like has_token, replace_token will use expr::is_partition_token_for_schema to find all instance of the partition token to replace. Let's prepare for this change by adding a schema argument to the function before making the big change. It's unsued at the moment, but having a separate commit should make it easier to review. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:52 +02:00
Jan Ciolek	7af010095e	cql3/expr: add expr::is_partition_token_for_schema Add a function to check whether the expression represents a partition token - that is a call to the token function with consecutive partition key columns as the arguments. For example for `token(p1, p2, p3)` this function would return `true`, but for `token(1, 2, 3)` or `token(p3, p2, p1)` the result would be `false`. The function has a schema argument because a schema is required to get the list of partition columns that should be passed as arguments to token(). Maybe it would be possible to infer the schema from the information given earlier during prepare_expression, but it would be complicated and a bit dangerous to do this. Sometimes we operate on multiple tables and the schema is needed to differentiate between them - a token() call can represent the base table's partition token, but for an index table this is just a normal function call, not the partition token. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:51 +02:00
Jan Ciolek	694d9298aa	cql3/expr: add expr::is_token_function Add a function that can be used to check whether a given expression represents a call to the token() function. Note that a call to token() doesn't mean that the expression represents a partition token - it could be something like token(1, 2, 3), just a normal function_call. The code for checking has been taken from functions::get. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:51 +02:00
Kefu Chai	f5b05cf981	treewide: use defaulted operator!=() and operator==() in C++20, compiler generate operator!=() if the corresponding operator==() is already defined, the language now understands that the comparison is symmetric in the new standard. fortunately, our operator!=() is always equivalent to `! operator==()`, this matches the behavior of the default generated operator!=(). so, in this change, all `operator!=` are removed. in addition to the defaulted operator!=, C++20 also brings to us the defaulted operator==() -- it is able to generated the operator==() if the member-wise lexicographical comparison. under some circumstances, this is exactly what we need. so, in this change, if the operator==() is also implemented as a lexicographical comparison of all memeber variables of the class/struct in question, it is implemented using the default generated one by removing its body and mark the function as `default`. moreover, if the class happen to have other comparison operators which are implemented using lexicographical comparison, the default generated `operator<=>` is used in place of the defaulted `operator==`. sometimes, we fail to mark the operator== with the `const` specifier, in this change, to fulfil the need of C++ standard, and to be more correct, the `const` specifier is added. also, to generate the defaulted operator==, the operand should be `const class_name&`, but it is not always the case, in the class of `version`, we use `version` as the parameter type, to fulfill the need of the C++ standard, the parameter type is changed to `const version&` instead. this does not change the semantic of the comparison operator. and is a more idiomatic way to pass non-trivial struct as function parameters. please note, because in C++20, both operator= and operator<=> are symmetric, some of the operators in `multiprecision` are removed. they are the symmetric form of the another variant. if they were not removed, compiler would, for instance, find ambiguous overloaded operator '=='. this change is a cleanup to modernize the code base with C++20 features. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13687	2023-04-27 10:24:46 +03:00
Avi Kivity	3e0aacc8b5	db, cql3: functions: pass function parameters as a span instead of a vector Spans are more flexible and can be constructed from any contiguous container (such as small_vector), or a subrange of such a container. This can save allocations, so change the signature to accept a span. Spans cannot be constructed from std::initializer_list, so one such call site is changed to use construct a span directly from the single argument.	2023-04-19 20:38:55 +03:00
Avi Kivity	41a2856f78	cql3: expr: fix serialize_listlike() reference-to-temporary with gcc serialize_listlike() is called with a range of either managed_bytes or managed_bytes_opt. If the former, then iterating and assigning to a loop induction variable of type managed_byted_opt& will bind the reference to a temporary managed_bytes_opt, which gcc dislikes. Fix by performing the binding in a separate statement, which allows for lifetime extension.	2023-03-21 13:42:49 +02:00
Jan Ciolek	aa604bd935	cql3: preserve binary_operator.order in search_and_replace There was a bug in `expr::search_and_replace`. It doesn't preserve the `order` field of binary_operator. `order` field is used to mark relations created using the SCYLLA_CLUSTERING_BOUND. It is a CQL feature used for internal queries inside Scylla. It means that we should handle the restriction as a raw clustering bound, not as an expression in the CQL language. Losing the SCYLLA_CLUSTERING_BOUND marker could cause issues, the database could end up selecting the wrong clustering ranges. Fixes: #13055 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes #13056	2023-03-06 16:28:06 +02:00
Avi Kivity	568c1a5a36	cql3: expr: generalize evaluation of subscript expressions Currently, evaluation of a subscript expression x[y] requires that x be a column_value, but that's completely artificial. Generalize it to allow any expression. This is needed after we transform a LWT IF condition from "a[x] = y" to "func(a)[x] = y", where func casts a from a map represention of a list back to a list; but it's also generally useful.	2023-02-12 17:25:46 +02:00

1 2 3 4 5 ...

264 Commits