scylladb

Author	SHA1	Message	Date
Tomasz Grabiec	66755db062	locator, cql3: Support rack lists in replication options Allows per-DC replication factor to be either a string, holding a numerical value, or a list of strings, holding a list of rack names. The rack list is not respected yet by the tablet allocator, this is achieved in subsequent commit. This changes the format of options stored in the flattened map in system_schema.keyspaces#replication. Values which are rack lists, are converted into multiple entries, with the list index appended to the key with ':' as the separator: For example, this extended map: { 'dc1': '3', 'dc2': ['rack1', 'rack2'] } is stored as a flattened map: { 'dc1': '3', 'dc2:0': 'rack1', 'dc2:1': 'rack2' } Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Signed-off-by: Tomasz Grabiec <tgrabiec@scylladb.com>	2025-10-02 19:42:39 +02:00
Tomasz Grabiec	11b4a1ab58	cql3: Extract convert_property_map() out of Cql.g So that complex code is in a .cc file for better IDE assistance.	2025-10-01 16:06:52 +02:00
Dawid Pawlik	ed49093a01	expression: adjust collection constructor list style Like mentioned in the previous commit, this changes introduce usage of vector style type and adjusts the functions using list style type to distinguish vectors from lists. Rename collection constructor style list to list_or_vector.	2025-01-28 21:14:49 +01:00
Dawid Pawlik	69c754f0d4	expression: add vector style type Motivation for this changes is to provide a distinguishable interface for vector type expressions. The square bracket literal is ambigious for lists and vectors, so that we need to perform a distinction not using CQL layer. At first we should use the collection constructor to manage both lists and vectors (although a vector is not a collection). Later during preparation of expressions we should be able to get to know the exact type using given receiver (column specification). Knowing the type of expression we may use their respective style type (in this case the vector style type being introduced), which would make the implementation more precise and allow us to evaluate the expressions properly. This commit introduces vector style type and functions making use of it. However vector style type is not yet used anywhere, the next commit should adjust collection constructor and make use of the new vector style type and it's features.	2025-01-28 21:14:49 +01:00
Avi Kivity	f8ce49ebe9	cql3: implement NOT IN Where the grammar supports IN, we add NOT IN. This includes the WHERE clause and LWT IF clause. Evaluation of NOT IN follows from IN. In statement_restrictions analysis, they are different, as NOT IN doesn't enable any clever query plan and must filter. Some tests are added. An error message was changed ('in' changed to 'IN'), so some tests are adjusted. Closes scylladb/scylladb#21992	2024-12-22 15:15:23 +02:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Kefu Chai	6556cd684e	cql3: remove unused operator<< as these operators are not used anymore. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19288	2024-06-14 09:45:35 +03:00
Kefu Chai	168ade72f8	treewide: replace formatter<std::string_view> with formatter<string_view> in in {fmt} before v10, it provides the specialization of `fmt::formatter<..>` for `std::string_view` as well as the specialization of `fmt::formatter<..>` for `fmt::string_view` which is an implementation builtin in {fmt} for compatibility of pre-C++17. and this type is used even if the code is compiled with C++ stadandard greater or equal to C++17. also, before v10, the `fmt::formatter<std::string_view>::format()` is defined so it accepts `std::string_view`. after v10, `fmt::formatter<std::string_view>` still exists, but it is now defined using `format_as()` machinery, so it's `format()` method does not actually accept `std::string_view`, it accepts `fmt::string_view`, as the former can be converted to `fmt::string_view`. this is why we can inherit from `fmt::formatter<std::string_view>` and use `formatter<std::string_view>::format(foo, ctx);` to implement the `format()` method with {fmt} v9, but we cannot do this with {fmt} v10, and we would have following compilation failure: ``` FAILED: service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o /home/kefu/.local/bin/clang++ -DFMT_DEPRECATED_OSTREAM -DFMT_SHARED -DSCYLLA_BUILD_MODE=release -DSEASTAR_API_LEVEL=7 -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SSTRING -DXXH_PRIVATE_API -DCMAKE_INTDIR=\"RelWithDebInfo\" -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/build/gen -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/seastar/gen/include -I/home/kefu/dev/scylladb/build/seastar/gen/src -ffunction-sections -fdata-sections -O3 -g -gz -std=gnu++20 -fvisibility=hidden -Wall -Werror -Wextra -Wno-error=deprecated-declarations -Wimplicit-fallthrough -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags -Wno-missing-field-initializers -Wno-overloaded-virtual -Wno-unsupported-friend -Wno-enum-constexpr-conversion -Wno-unused-parameter -ffile-prefix-map=/home/kefu/dev/scylladb=. -march=westmere -mllvm -inline-threshold=2500 -fno-slp-vectorize -U_FORTIFY_SOURCE -Werror=unused-result -MD -MT service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o -MF service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o.d -o service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o -c /home/kefu/dev/scylladb/service/topology_state_machine.cc /home/kefu/dev/scylladb/service/topology_state_machine.cc:254:41: error: no matching member function for call to 'format' 254 \| return formatter<std::string_view>::format(it->second, ctx); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~ /usr/include/fmt/core.h:2759:22: note: candidate function template not viable: no known conversion from 'seastar::basic_sstring<char, unsigned int, 15>' to 'const fmt::basic_string_view<char>' for 1st argument 2759 \| FMT_CONSTEXPR auto format(const T& val, FormatContext& ctx) const \| ^ ~~~~~~~~~~~~ ``` because the inherited `format()` method actually comes from `fmt::formatter<fmt::string_view>`. to reduce the confusion, in this change, we just inherit from `fmt::format<string_view>`, where `string_view` is actually `fmt::string_view`. this follows the document at https://fmt.dev/latest/api.html#formatting-user-defined-types, and since there is less indirection under the hood -- we do not use the specialization created by `FMT_FORMAT_AS` which inherit from `formatter<fmt::string_view>`, hopefully this can improve the compilation speed a little bit. also, this change addresses the build failure with {fmt} v10. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18299	2024-04-19 07:44:07 +03:00
Kefu Chai	3d8ac06ee8	cql3: add fmt::formatter for expression::printer before this change, we already have a `fmt::formatter` specialized for `expression::printer`. but the formatter was implemented by 1. formatting the `printer` instance to an `ostringstream`, and 2. extracting a `std::string` from this `ostringstream` 3. formatting the `std::string` instance to the fmt context this is convoluted and is not an optimal implementation. so, in this change, it is reimplemented by formatting directly to the context. its operator<< is also dropped in this change. please note, to avoid adding the large chunk of code into the .hh file, the implementation is put in the .cc file. but in order to preserve the usage of `transformed(fmt::to_string<expression::printer>)`, the `format()` function is defined as a template, and instantiated explicitly for two use cases: 1. to format to `fmt::context` 2. to format using `fmt::to_string()` Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-05 14:00:13 +08:00
Kefu Chai	2dbf044b91	cql3: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16791	2024-01-16 16:43:17 +02:00
Patryk Wrobel	f4e311e871	cql3: add formatter for cql3::expr::oper_t This change introduces a specialization of fmt::formatter for cql3::expr::oper_t. This enables the usage of this type with FMTv10, which dropped the default generated formatter. Usage of cql3::expr::oper_t without the defined formatter resulted in compilation error when compiled with FMTv10. Refs: #13245 Signed-off-by: Patryk Wrobel <patryk.wrobel@scylladb.com> Closes scylladb/scylladb#16719	2024-01-11 08:33:35 +02:00
Jan Ciolek	c256cca6f1	cql3/expr: add more comments in expression.hh `expression` is a std::variant with 16 different variants that represent different types of AST nodes. Let's add documentation that explains what each of these 16 types represents. For people who are not familiar with expression code it might not be clear what each of them does, so let's add clear descriptions for all of them. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes scylladb/scylladb#15767	2023-10-19 10:56:38 +03:00
Kefu Chai	484d02da14	cql3: expr: do not use multi-line comment do not use muti-line comment. this silences the warning from GCC: ``` In file included from ./cql3/prepare_context.hh:19, from ./cql3/statements/raw/parsed_statement.hh:14, from build/debug/gen/cql3/CqlParser.hpp:62, from build/debug/gen/cql3/CqlParser.cpp:44: ./cql3/expr/expression.hh:490:1: error: multi-line comment [-Werror=comment] 490 \| /// Custom formatter for an expression. Supports multiple modes:\ \| ^ ``` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#15471	2023-09-19 12:00:09 +03:00
Avi Kivity	b54265034d	cql3: expr: make expression non-default-constructible There is no obvious default expression, so better not to allow default construction of expressions to prevent unintended values from leaking in. Resolves a FIXME.	2023-07-14 18:35:59 +03:00
Avi Kivity	778ae2b461	cql3: expression: introduce temporaries Temporaries are similar to bind variables - they are values provided from outside the expression. While bind variables are provided by the user, temporaries are generated internally. The intended use is for aggregate accumulator storage. Currently aggregates store the accumulator in aggregate_function_selector::_accumulator, which means the entire selector hierarchy must be cloned for every query. With expressions, we can have a single expression object reused for many computations, but we need a way to inject the accumulator into an aggregation, which this new expression element provides.	2023-07-03 19:45:17 +03:00
Avi Kivity	7aee322a6c	cql3: expressions: add "metadata mode" formatter for expressions When returning a result set (and when preparing a statement), we return metadata about the result set columns. Part of that is the column names, which are derived from the expressions used as selectors. Currently, they are computed via selector::column_name(), but as we're dismantling that hierarchy we need a different way to obtain those names. It turns out that the expression formatter is close enough to what we need. To avoid disturbing the current :user mode, add a new :metadata mode and apply the adjustments needed to bring it in line with what column metadata looks like today. Note that column metadata is visible to applications and they can depend on it; e.g. the Python driver allows choosing columns based on their names rather than ordinal position.	2023-07-03 19:45:17 +03:00
Avi Kivity	b858a4669d	cql3: expr: break up expression.hh header Adding a function declaration to expression.hh causes many recompilations. Reduce that by: - moving some restrictions-related definitions to the existing expr/restrictions.hh - moving evaluation related names to a new header expr/evaluate.hh - move utilities to a new header expr/expr-utilities.hh expression.hh contains only expression definitions and the most basic and common helpers, like printing.	2023-06-22 14:21:03 +03:00
Avi Kivity	32b27d6a08	cql3: expr: change evaluation_input vector components to take spans Spans are slightly cleaner, slightly faster (as they avoid an indirection), and allow for replacing some of the arguments with small_vector:s. Closes #14313	2023-06-22 11:28:01 +02:00
Avi Kivity	7090f4c43b	cql3: expr: evaluate() column_mutation_attribute Enhance evaluation_inputs with timestamps and ttls, and use them to evaluate writetime/ttl. The data structure is compatible with the current way of doing things (see result_set_builder::_timestamps, result_set_build::_ttls). We use std::span<> instead of std::vector<> as it is more general and a tiny bit faster. The algorithm is taken from writetime_or_ttl_selector::add_input().	2023-06-18 22:41:09 +03:00
Avi Kivity	8d3d8eeedb	cql3: add optional type annotation to assignment_testable Before this series, function overload resolution peeked at function arguments to see if they happened to be selectors, and if so grabbed their type. If they did not happen to be selectors, we woudln't know their type, but as it happened all generic functions are aggregates, and aggregates are only legal in the SELECT clause, so that didn't matter. In a previous patch, we changed assignment_testable to carry an optional type and wired it to selector, so we wouldn't need to dynamic_cast<selector>. Now, we wire the optional type to assignment_testable_expression, so overload resolution of generic functions can happen during expression preparation. The code that bridges the function argument expressions to assignment_testable is extracted into a function, since it's too complicated to be written as a transform.	2023-06-13 21:04:49 +03:00
Avi Kivity	521a128a2a	cql3: expr: support preparing field_selection expressions The field_selection structure is augmented with the field index so that does not need to be done at evaluation time, similar to the current with_field_selection selectable.	2023-06-13 21:04:49 +03:00
Avi Kivity	ecfe4ad53a	cql3: expr: make the two styles of cast expressions explicit CQL supports two cast styles: - C-style: (type) expr, used for casts between binary-compatible types and for type hinting of bind variables - SQL-tyle: (expr AS type), used for real type convertions Currently, the expression system differentiates them by the cast::type field, which is a data_type for SQL-style casts and a cql3_type::raw for C-style casts, but that won't work after the prepare phase is applied to SQL-style casts when the type field will be prepared into a data_type. Prepare for this by adding a separate enum to distinguish between the two styles.	2023-06-13 21:04:49 +03:00
Avi Kivity	5983e9e7b2	cql3: test_assignment: pass optional schema everywhere test_assignment() and related functions check for type compatibility between a right-hand-side and a left-hand-side. It started its life with a limited functionality for INSERT and UPDATE, but now it's about to be used for cast expression in selectors, which can cast a column_value. A column_value is still an unresolved_identifier during the prepare phase, and cannot be resolved without a schema. To prepare for this, pass an optional schema everywhere. Ultimately, test_assignment likely needs to be folded into prepare_expr(), but before that prepare_expr() has to be used everywhere.	2023-06-13 21:04:49 +03:00
Avi Kivity	6db916e5b6	cql3: expr: add verify_no_aggregate_functions() helper Aggregate functions are only allowed in certain contexts (the SELECT clause and the HAVING clause, which we don't yet have). prepare_expr() currently rejects aggregate functions, but that means we cannot use it to prepare selectors. To prepare for the use of prepare_expr() in selectors, we'll have to move the check out of prepare_expr(). This helper is the beginning of that change. I considered adding a parameter to prepare_expr(), but that is even more noisy than adding a call to the helper.	2023-06-13 21:04:49 +03:00
Avi Kivity	54f3050225	cql3: expr: extract column_mutation_attribute_type column_mutation_attribute_type() returns int32_type or long_type depending on whether TTL or WRITETIME is requested. Will be used later when we prepare column_mutation_attribute expressions.	2023-06-13 21:04:49 +03:00
Avi Kivity	d2f4bd8b85	cql3: expr: add fmt formatter for column_mutation_attribute_kind It's easier to use for logging.	2023-06-13 21:04:49 +03:00
Jan Ciolek	1bcb4c024c	cql3/expr: print expressions in user-friendly way by default When a CQL expression is printed, it can be done using either the `debug` mode, or the `user` mode. `user` mode is basically how you would expect the CQL to be printed, it can be printed and then parsed back. `debug` mode is more detailed, for example in `debug` mode a column name can be displayed as `unresolved_identifier(my_column)`, which can't be parsed back to CQL. The default way of printing is the `debug` mode, but this requires us to remember to enable the `user` mode each time we're printing a user-facing message, for example for an invalid_request_exception. It's cumbersome and people forget about it, so let's change the default to `user`. There issues about expressions being printed in a `strange` way, this fixes them. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes #13916	2023-05-18 20:57:00 +03:00
Jan Ciolek	be8ef63bf5	cql3: remove expr::token Let's remove expr::token and replace all of its functionality with expr::function_call. expr::token is a struct whose job is to represent a partition key token. The idea is that when the user types in `token(p1, p2) < 1234`, this will be internally represented as an expression which uses expr::token to represent the `token(p1, p2)` part. The situation with expr::token is a bit complicated. On one hand side it's supposed to represent the partition token, but sometimes it's also assumed that it can represent a generic call to the token() function, for example `token(1, 2, 3)` could be a function_call, but it could also be expr::token. The query planning code assumes that each occurence of expr::token represents the partition token without checking the arguments. Because of this allowing `token(1, 2, 3)` to be represented as expr::token is dangerous - the query planning might think that it is `token(p1, p2, p3)` and plan the query based on this, which would be wrong. Currently expr::token is created only in one specific case. When the parser detects that the user typed in a restriction which has a call to `token` on the LHS it generates expr::token. In all other cases it generates an `expr::function_call`. Even when the `function_call` represents a valid partition token, it stays a `function_call`. During preparation there is no check to see if a `function_call` to `token` could be turned into `expr::token`. This is a bit inconsistent - sometimes `token(p1, p2, p3)` is represented as `expr::token` and the query planner handles that, but sometimes it might be represented as `function_call`, which the query planner doesn't handle. There is also a problem because there's a lot of duplication between a `function_call` and `expr::token`. All of the evaluation and preparation is the same for `expr::token` as it's for a `function_call` to the token function. Currently it's impossible to evaluate `expr::token` and preparation has some flaws, but implementing it would basically consist of copy-pasting the corresponding code from token `function_call`. One more aspect is multi-table queries. With `expr::token` we turn a call to the `token()` function into a struct that is schema-specific. What happens when a single expression is used to make queries to multiple tables? The schema is different, so something that is representad as `expr::token` for one schema would be represented as `function_call` in the context of a different schema. Translating expressions to different tables would require careful manipulation to convert `expr::token` to `function_call` and vice versa. This could cause trouble for index queries. Overall I think it would be best to remove expr::token. Although having a clear marker for the partition token is sometimes nice for query planning, in my opinion the pros are outweighted by the cons. I'm a big fan of having a single way to represent things, having two separate representations of the same thing without clear boundaries between them causes trouble. Instead of having expr::token and function_call we can just have the function_call and check if it represents a partition token when needed. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:11:31 +02:00
Jan Ciolek	096efc2f38	cql3/expr: split possible_lhs_values into column and token variants The possible_lhs_values takes an expression and a column and finds all possible values for the column that make the expression true. Apart from finding column values it's also capable of finding all matching values for the partition key token. When a nullptr column is passed, possible_lhs_values switches into token values mode and finds all values for the token. This interface isn't ideal. It's confusing to pass a nullptr column when one wants to find values for the token. It would be better to have a flag, or just have a separate function. Additionally in the future expr::token will be removed and we will use expr::is_partition_token_for_schema to find all occurences of the partition token. expr::is_partition_token_for_schema takes a schema as an argument, which possible_lhs_values doesn't have, so it would have to be extended to get the schema from somewhere. To fix these two problems let's split possible_lhs_values into two functions - one that finds possible values for a column, which doesn't require a schema, and one that finds possible values for the partition token and requires a schema: value_set possible_column_values(const column_definition* col, const expression& e, const query_options& options); value_set possible_partition_token_values(const expression& e, const query_options& options, const schema& table_schema); This will make the interface cleaner and enable smooth transition once expr::token is removed. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:53 +02:00
Jan Ciolek	ad5c931102	cql3/expr: add a schema argument to expr::replace_token Just like has_token, replace_token will use expr::is_partition_token_for_schema to find all instance of the partition token to replace. Let's prepare for this change by adding a schema argument to the function before making the big change. It's unsued at the moment, but having a separate commit should make it easier to review. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:52 +02:00
Jan Ciolek	d50db32d14	cql3/expr: add a comment for expr::has_partition_token Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:52 +02:00
Jan Ciolek	18879aad6f	cql3/expr: add a schema argument to expr::has_token In the future expr::token will be removed and checking whether there is a partition token inside an expression will be done using expr::is_partition_token_for_schema. This function takes a schema as an argument, so all functions that will call it also need to get the schema from somewhere. Right now it's an unused argument, but in the future it will be used. Adding it in a separate commit makes it easier to review. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:52 +02:00
Jan Ciolek	7af010095e	cql3/expr: add expr::is_partition_token_for_schema Add a function to check whether the expression represents a partition token - that is a call to the token function with consecutive partition key columns as the arguments. For example for `token(p1, p2, p3)` this function would return `true`, but for `token(1, 2, 3)` or `token(p3, p2, p1)` the result would be `false`. The function has a schema argument because a schema is required to get the list of partition columns that should be passed as arguments to token(). Maybe it would be possible to infer the schema from the information given earlier during prepare_expression, but it would be complicated and a bit dangerous to do this. Sometimes we operate on multiple tables and the schema is needed to differentiate between them - a token() call can represent the base table's partition token, but for an index table this is just a normal function call, not the partition token. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:51 +02:00
Jan Ciolek	694d9298aa	cql3/expr: add expr::is_token_function Add a function that can be used to check whether a given expression represents a call to the token() function. Note that a call to token() doesn't mean that the expression represents a partition token - it could be something like token(1, 2, 3), just a normal function_call. The code for checking has been taken from functions::get. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-04-29 13:04:51 +02:00
Avi Kivity	6de4032baf	cql3: expr: introduce adjust_for_collection_as_maps() LWT and some list operations represent lists using a form like their mutations, so that the mutation list keys can be recovered and used to update the list. But the evaluation machinery knows nothing about that, and will return the map-form even though the type system thinks it is a list. To handle that, add a utility to rewrite the expression so that the value is re-serialized into the expected list form. The rewrite is implemented as a scalar function taking the map form and returning the list form.	2023-02-12 17:25:46 +02:00
Avi Kivity	c8d77c204f	cql3: expr: extract extract_column_value() from evaluation machinery Expression evaluation works with the evaluation_input structure to compute values. As we move LWT column_condition towards expressions, we'll start using evaluation_input, so provide this helper to ease the transition.	2023-02-12 17:17:01 +02:00
Avi Kivity	db2fa44a9a	cql3: expr: add optimizer for LIKE with constant pattern Compiling a pattern is expensive and so we should try to do it at prepare time, if the pattern is a constant. Add an optimizer that looks for such cases and replaces them with a unary function that embeds the compiled pattern. This isn't integrated yet with prepare_expr(), since the filtering code isn't ready for generic expressions. Its first user will be LWT, which contains the optimization already (filtering had it as well, but lost it sometime during the expression rewrite). A unit test is added.	2023-02-12 17:16:58 +02:00
Avi Kivity	ecdd49317a	cql3: expr: add LWT IF clause variants of binary operators LWT IF clause interprets equality differently from SQL (and the rest of CQL): it thinks NULL equals NULL. Currently, it implements binary operators all by itself so the fact that oper_t::EQ (and friends) means something else in the rest of the code doesn't bother it. However, we can't unify the code (in column_condition.cc) with the rest of expression evaluation if the meaning changes in different places. To prepare for this, introduce a null_handling_style field to binary_operator that defaults to `sql` but can be changed to `lwt_nulls` to indicate this special semantic. A few unit tests are added. LWT itself still isn't modified.	2023-02-12 17:03:03 +02:00
Avi Kivity	0f15ff740d	cql3: expr: simplify user/debug formatting We have a cql3::expr::expression::printer wrapper that annotates an expression with a debug_mode boolean prior to formatting. The fmt library, however, provides a much simpler alterantive: a custom format specifier. With this, we can write format("{:user}", expr) for user-oriented prints, or format("{:debug}", expr) for debug-oriented prints (if nothing is specified, the default remains debug). This is done by implementing fmt::formatter::parse() for the expression type, can using expression::printer internally. Since sometimes we pass expression element types rather than the expression variant, we also provide a custom formatter for all ExpressionElement Types. Uses for expression::printer are updated to use the nicer syntax. In one place we eliminate a temporary that is no longer needed since ExpressionElement:s can be formatted directly. Closes #12702	2023-02-08 12:24:58 +02:00
Kefu Chai	ccc03dd1ec	cql3, locator: call fmt::format_to() explicitly since format_to() is defined included by both fmt and std namepaces, without specifying which one to use, we'd fail to build with the standard library which implements std::format_to(). yes, we are `using namespace std` somewhere. this change should address the FTBFS with GCC-13. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-01-30 21:50:11 +08:00
Nadav Har'El	9433108158	Merge 'Allow transient list values to contain NULLs' from Avi Kivity The CQL protocol and specification call for lists with NULLs in some places. For example, the statement: ```cql UPDATE tab SET x = 3 IF y IN (1, 2, NULL) WHERE pk = 4 ``` has a list `(1, 2, NULL)` that contains NULL. Although the syntax is tuple-like, the value is a list; consider the same statement as a prepared statement: ```cql UPDATE tab SET x = :x IF y IN :y_values WHERE pk = :pk ``` `:y_values` must have a list type, since the number of elements is unknown. Currently, this is done with special paths inside LWT that bypass normal evaluation, but if we want to unify those paths, we must allow NULLs in lists (except in storage). This series does that. Closes #12411 * github.com:scylladb/scylladb: test: materialized view: add test exercising synthetic empty-type columns cql3: expr: relax evaluate_list() to allow allow NULL elements types: allow lists with NULL test: relax NULL check test predicate cql3, types: validate listlike collections (sets, lists) for storage types: make empty type deserialize to non-null value	2023-01-19 15:15:16 +02:00
Jan Ciolek	9a0c5789a2	cql3: expr: take reference to schema in prepare_binary_operator prepare_binary_operator takes a schema_ptr, but it would be useful to take a reference to schema instead. Every schema_ptr can be easily converted to a reference so there is no loss of functionality. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:40 +01:00
Avi Kivity	00145f9ada	test: relax NULL check test predicate When we start allowing NULL in lists in some contexts, the exact location where an error is raised (when it's disallowed) will change. To prepare for that, relax the exception check to just ensure the word NULL is there, without caring about the exact wording.	2023-01-18 10:38:24 +02:00
Avi Kivity	0b418fa7cf	cql3, transport, tests: remove "unset" from value type system The CQL binary protocol introduced "unset" values in version 4 of the protocol. Unset values can be bound to variables, which cause certain CQL fragments to be skipped. For example, the fragment `SET a = :var` will not change the value of `a` if `:var` is bound to an unset value. Unsets, however, are very limited in where they can appear. They can only appear at the top-level of an expression, and any computation done with them is invalid. For example, `SET list_column = [3, :var]` is invalid if `:var` is bound to unset. This causes the code to be littered with checks for unset, and there are plenty of tests dedicated to catching unsets. However, a simpler way is possible - prevent the infiltration of unsets at the point of entry (when evaluating a bind variable expression), and introduce guards to check for the few cases where unsets are allowed. This is what this long patch does. It performs the following: (general) 1. unset is removed from the possible values of cql3::raw_value and cql3::raw_value_view. (external->cql3) 2. query_options is fortified with a vector of booleans, unset_bind_variable_vector, where each boolean corresponds to a bind variable index and is true when it is unset. 3. To avoid churn, two compatiblity structs are introduced: cql3::raw_value{,_view}_vector_with_unset, which can be constructed from a std::vector<raw_value{,_view/}>, which is what most callers have. They can also be constructed with explicit unset vectors, for the few cases they are needed. (cql3->variables) 4. query_options::get_value_at() now throws if the requested bind variable is unset. This replaces all the throwing checks in expression evaluation and statement execution, which are removed. 5. A new query_options::is_unset() is added for the users that can tolerate unset; though it is not used directly. 6. A new cql3::unset_operation_guard class guards against unsets. It accepts an expression, and can be queried whether an unset is present. Two conditions are checked: the expression must be a singleton bind variable, and at runtime it must be bound to an unset value. 7. The modification_statement operations are split into two, via two new subclasses of cql3::operation. cql3::operation_no_unset_support ignores unsets completely. cql3::operation_skip_if_unset checks if an operand is unset (luckily all operations have at most one operand that tolerates unset) and applies unset_operation_guard to it. 8. The various sites that accept expressions or operations are modified to check for should_skip_operation(). This are the loops around operations in update_statement and delete_statement, and the checks for unset in attributes (LIMIT and PER PARTITION LIMIT) (tests) 9. Many unset tests are removed. It's now impossible to enter an unset value into the expression evaluation machinery (there's just no unset value), so it's impossible to test for it. 10. Other unset tests now have to be invoked via bind variables, since there's no way to create an unset cql3::expr::constant. 11. Many tests have their exception message match strings relaxed. Since unsets are now checked very early, we don't know the context where they happen. It would be possible to reintroduce it (by adding a format string parameter to cql3::unset_operation_guard), but it seems not to be worth the effort. Usage of unsets is rare, and it is explicit (at least with the Python driver, an unset cannot be introduced by ommission). I tried as an alternative to wrap cql3::raw_value{,_view} (that doesn't recognize unsets) with cql3::maybe_unset_value (that does), but that caused huge amounts of churn, so I abandoned that in favor of the current approach. Closes #12517	2023-01-16 21:10:56 +02:00
Avi Kivity	2739ac66ed	treewide: drop cql_serialization_format Now that we don't accept cql protocol version 1 or 2, we can drop cql_serialization format everywhere, except when in the IDL (since it's part of the inter-node protocol). A few functions had duplicate versions, one with and one without a cql_serialization_format parameter. They are deduplicated. Care is taken that `partition_slice`, which communicates the cql_serialization_format across nodes, still presents a valid cql_serialization_format to other nodes when transmitting itself and rejects protocol 1 and 2 serialization\ format when receiving. The IDL is unchanged. One test checking the 16-bit serialization format is removed.	2023-01-03 19:54:13 +02:00
Avi Kivity	ea901fdb9d	cql3: expr: fold `null` into untyped_constant/constant Our `null` expression, after the prepare stage, is redundant with a `constant` expression containing the value NULL. Remove it. Its role in the unprepared stage is taken over by untyped_constant, which gains a new type_class enumeration to represent it. Some subtleties: - Usually, handling of null and untyped_constant, or null and constant was the same, so they are just folded into each other - LWT "like" operator now has to discriminate between a literal string and a literal NULL - prepare and test_assignment were folded into the corresponing untyped_constant functions. Some care had to be taken to preserve error messages. Closes #12118	2022-11-29 11:02:18 +02:00
Avi Kivity	9765b2e3bc	cql3: expr: drop remnants of `bool` component from expression In `ad3d2ee47d`, we replaced `bool` as an expression element (representing a boolean constant) with `constant`. But a comment and a concept continue to mention it. Remove the comment and the concept fragment. Closes #12119	2022-11-28 23:18:26 +02:00
Jan Ciolek	ac152af88c	expression: Add for_each_boolean factor boolean_factors is a function that takes an expression and extracts all children of the top level conjunction. The problem is that it returns a vector<expression>, which is inefficent. Sometimes we would like to iterate over all boolean factors without allocations. for_each_boolean_factor is implemented for this purpose. boolean_factors() can be implemented using for_each_boolean_factor, so it's done to reduce code duplication. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-09-25 16:34:22 +03:00
Avi Kivity	8085b9f57a	cql3: expr: add boolean_factors() function to factorize an expression When analyzing a WHERE clause, we want to separate individual factors (usually relations), and later partition them into partition key, clustering key, and regular column relations. The first step is separation, for which this helper is added. Currently, it is not required since the grammar supplies the expression in separated form, but this will not work once it is relaxed to allow any expression in the WHERE clause. A unit test is added.	2022-07-22 20:14:48 +03:00
Avi Kivity	1efb2fecbe	cql3: expression: define operator==() for expressions This is useful for tests, to check that expression manipulations yield the expected results.	2022-07-22 20:14:48 +03:00

1 2 3 4

189 Commits