scylladb

Author	SHA1	Message	Date
Avi Kivity	db2fa44a9a	cql3: expr: add optimizer for LIKE with constant pattern Compiling a pattern is expensive and so we should try to do it at prepare time, if the pattern is a constant. Add an optimizer that looks for such cases and replaces them with a unary function that embeds the compiled pattern. This isn't integrated yet with prepare_expr(), since the filtering code isn't ready for generic expressions. Its first user will be LWT, which contains the optimization already (filtering had it as well, but lost it sometime during the expression rewrite). A unit test is added.	2023-02-12 17:16:58 +02:00
Avi Kivity	ecdd49317a	cql3: expr: add LWT IF clause variants of binary operators LWT IF clause interprets equality differently from SQL (and the rest of CQL): it thinks NULL equals NULL. Currently, it implements binary operators all by itself so the fact that oper_t::EQ (and friends) means something else in the rest of the code doesn't bother it. However, we can't unify the code (in column_condition.cc) with the rest of expression evaluation if the meaning changes in different places. To prepare for this, introduce a null_handling_style field to binary_operator that defaults to `sql` but can be changed to `lwt_nulls` to indicate this special semantic. A few unit tests are added. LWT itself still isn't modified.	2023-02-12 17:03:03 +02:00
Avi Kivity	0f15ff740d	cql3: expr: simplify user/debug formatting We have a cql3::expr::expression::printer wrapper that annotates an expression with a debug_mode boolean prior to formatting. The fmt library, however, provides a much simpler alterantive: a custom format specifier. With this, we can write format("{:user}", expr) for user-oriented prints, or format("{:debug}", expr) for debug-oriented prints (if nothing is specified, the default remains debug). This is done by implementing fmt::formatter::parse() for the expression type, can using expression::printer internally. Since sometimes we pass expression element types rather than the expression variant, we also provide a custom formatter for all ExpressionElement Types. Uses for expression::printer are updated to use the nicer syntax. In one place we eliminate a temporary that is no longer needed since ExpressionElement:s can be formatted directly. Closes #12702	2023-02-08 12:24:58 +02:00
Jan Ciolek	9eb6746a67	test/expr_test: test <int_value> IN (123, ?, 456) Add tests which test evaluating the IN restriction with a list which contains a bind variable. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-02-01 16:29:32 +01:00
Nadav Har'El	9433108158	Merge 'Allow transient list values to contain NULLs' from Avi Kivity The CQL protocol and specification call for lists with NULLs in some places. For example, the statement: ```cql UPDATE tab SET x = 3 IF y IN (1, 2, NULL) WHERE pk = 4 ``` has a list `(1, 2, NULL)` that contains NULL. Although the syntax is tuple-like, the value is a list; consider the same statement as a prepared statement: ```cql UPDATE tab SET x = :x IF y IN :y_values WHERE pk = :pk ``` `:y_values` must have a list type, since the number of elements is unknown. Currently, this is done with special paths inside LWT that bypass normal evaluation, but if we want to unify those paths, we must allow NULLs in lists (except in storage). This series does that. Closes #12411 * github.com:scylladb/scylladb: test: materialized view: add test exercising synthetic empty-type columns cql3: expr: relax evaluate_list() to allow allow NULL elements types: allow lists with NULL test: relax NULL check test predicate cql3, types: validate listlike collections (sets, lists) for storage types: make empty type deserialize to non-null value	2023-01-19 15:15:16 +02:00
Jan Ciolek	ae0e955b90	expr_test: test preparing binary_operator with NULL RHS Make sure that preparing binary_operator works properly when the RHS is NULL. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:46 +01:00
Jan Ciolek	65b8a09409	expr_test: test preparing IS NOT NULL binary_operator Add unit test which check that preparing binary_operators which represent IS NOT NULL works as expected Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:46 +01:00
Jan Ciolek	5b3e6769f1	expr_test: test preparing binary_operator with LIKE Add unit test which check that preparing binary_operators with the LIKE operation works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com	2023-01-18 12:04:45 +01:00
Jan Ciolek	e876496f7f	expr_test: test preparing binary_operator with CONTAINS KEY Add unit test which check that preparing binary_operators with the CONTAINS KEY operation works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:45 +01:00
Jan Ciolek	c6d2e1a03e	expr_test: test preparing binary_operator with CONTAINS Add unit test which check that preparing binary_operators with the CONTAINS operation works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:45 +01:00
Jan Ciolek	6b147ecaea	expr_test: test preparing binary_operator with IN Add unit test which check that preparing binary_operators with the IN operation works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:45 +01:00
Jan Ciolek	669d791250	expr_test: test preparing binary_operator with =, !=, <, <=, >, >= Add unit test which check that preparing binary_operators with basic comparison operations works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:44 +01:00
Jan Ciolek	60803d12a9	expr_test: use make_*_untyped function in existing tests Use the newly introduced convenience methods that create untyped_constant in existing tests. This will make the code more readable by removing visual clutter that came with the previous overly verbose code. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2023-01-18 12:04:44 +01:00
Avi Kivity	04925a7b29	cql3: expr: relax evaluate_list() to allow allow NULL elements Tests are similarly relaxed. A test is added in lwt_test to show that insertion of a list with NULL is still rejected, though we allow NULLs in IF conditions. One test is changed from a list of longs to a list of ints, to prevent churn in the test helper library.	2023-01-18 10:38:24 +02:00
Avi Kivity	390a0ca47b	types: allow lists with NULL Allow transient lists that contain NULL throughout the evaluation machinery. This makes is possible to evalute things like `IF col IN (1, 2, NULL)` without hacks, once LWT conditions are converted to expressions. A few tests are relaxed to accommodate the new behavior: - cql_query_test's test_null_and_unset_in_collections is relaxed to allow `WHERE col IN ?`, with the variable bound to a list containing NULL; now it's explicitly allowed - expr_test's evaluate_bind_variable_validates_no_null_in_list was checking generic lists for NULLs, and was similary relaxed (and renamed) - expr_Test's evaluate_bind_variable_validates_null_in_lists_recursively was similarly relaxed to allow NULLs.	2023-01-18 10:38:24 +02:00
Avi Kivity	0b418fa7cf	cql3, transport, tests: remove "unset" from value type system The CQL binary protocol introduced "unset" values in version 4 of the protocol. Unset values can be bound to variables, which cause certain CQL fragments to be skipped. For example, the fragment `SET a = :var` will not change the value of `a` if `:var` is bound to an unset value. Unsets, however, are very limited in where they can appear. They can only appear at the top-level of an expression, and any computation done with them is invalid. For example, `SET list_column = [3, :var]` is invalid if `:var` is bound to unset. This causes the code to be littered with checks for unset, and there are plenty of tests dedicated to catching unsets. However, a simpler way is possible - prevent the infiltration of unsets at the point of entry (when evaluating a bind variable expression), and introduce guards to check for the few cases where unsets are allowed. This is what this long patch does. It performs the following: (general) 1. unset is removed from the possible values of cql3::raw_value and cql3::raw_value_view. (external->cql3) 2. query_options is fortified with a vector of booleans, unset_bind_variable_vector, where each boolean corresponds to a bind variable index and is true when it is unset. 3. To avoid churn, two compatiblity structs are introduced: cql3::raw_value{,_view}_vector_with_unset, which can be constructed from a std::vector<raw_value{,_view/}>, which is what most callers have. They can also be constructed with explicit unset vectors, for the few cases they are needed. (cql3->variables) 4. query_options::get_value_at() now throws if the requested bind variable is unset. This replaces all the throwing checks in expression evaluation and statement execution, which are removed. 5. A new query_options::is_unset() is added for the users that can tolerate unset; though it is not used directly. 6. A new cql3::unset_operation_guard class guards against unsets. It accepts an expression, and can be queried whether an unset is present. Two conditions are checked: the expression must be a singleton bind variable, and at runtime it must be bound to an unset value. 7. The modification_statement operations are split into two, via two new subclasses of cql3::operation. cql3::operation_no_unset_support ignores unsets completely. cql3::operation_skip_if_unset checks if an operand is unset (luckily all operations have at most one operand that tolerates unset) and applies unset_operation_guard to it. 8. The various sites that accept expressions or operations are modified to check for should_skip_operation(). This are the loops around operations in update_statement and delete_statement, and the checks for unset in attributes (LIMIT and PER PARTITION LIMIT) (tests) 9. Many unset tests are removed. It's now impossible to enter an unset value into the expression evaluation machinery (there's just no unset value), so it's impossible to test for it. 10. Other unset tests now have to be invoked via bind variables, since there's no way to create an unset cql3::expr::constant. 11. Many tests have their exception message match strings relaxed. Since unsets are now checked very early, we don't know the context where they happen. It would be possible to reintroduce it (by adding a format string parameter to cql3::unset_operation_guard), but it seems not to be worth the effort. Usage of unsets is rare, and it is explicit (at least with the Python driver, an unset cannot be introduced by ommission). I tried as an alternative to wrap cql3::raw_value{,_view} (that doesn't recognize unsets) with cql3::maybe_unset_value (that does), but that caused huge amounts of churn, so I abandoned that in favor of the current approach. Closes #12517	2023-01-16 21:10:56 +02:00
Jan Ciolek	9afa9f0e50	expr_test: add unit tests for prepare_expression(conjunction) Add unit tests which ensure that preparing conjunctions works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-12-13 20:23:17 +01:00
Jan Ciolek	5f5b1c4701	expr_test: add tests for evaluate(conjunction) Add unit tests which ensure that evaluating a conjunction behaves as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-12-13 20:23:17 +01:00
Avi Kivity	ea901fdb9d	cql3: expr: fold `null` into untyped_constant/constant Our `null` expression, after the prepare stage, is redundant with a `constant` expression containing the value NULL. Remove it. Its role in the unprepared stage is taken over by untyped_constant, which gains a new type_class enumeration to represent it. Some subtleties: - Usually, handling of null and untyped_constant, or null and constant was the same, so they are just folded into each other - LWT "like" operator now has to discriminate between a literal string and a literal NULL - prepare and test_assignment were folded into the corresponing untyped_constant functions. Some care had to be taken to preserve error messages. Closes #12118	2022-11-29 11:02:18 +02:00
Jan Ciolek	b6cf6e6777	expr_test: test evaluating LIKE binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:29 +01:00
Jan Ciolek	6774272fd6	expr_test: test evaluating IS_NOT binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:29 +01:00
Jan Ciolek	e6c78bb6c2	expr_test: test evaluating CONTAINS_KEY binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:29 +01:00
Jan Ciolek	4f250609ab	expr_test: test evaluating CONTAINS binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:29 +01:00
Jan Ciolek	3ca04cfcc2	expr_test: test evaluating IN binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:28 +01:00
Jan Ciolek	41f452b73f	expr_test: test evaluating GTE binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:28 +01:00
Jan Ciolek	1fe9a9ce2a	expr_test: test evaluating GT binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:28 +01:00
Jan Ciolek	ef2a77a3e0	expr_test: test evaluating LTE binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:28 +01:00
Jan Ciolek	3cbb2d44e8	expr_test: test evaluating LT binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:27 +01:00
Jan Ciolek	9feee70710	expr_test: test evaluating NEQ binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:27 +01:00
Jan Ciolek	e77dba0b0b	expr_test: test evaluating EQ binary_operator Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:27 +01:00
Jan Ciolek	77d68153f1	test preparing expr::usertype_constructor Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:41:10 +01:00
Jan Ciolek	eb92fb4289	expr_test: test that prepare_expression checks style_type of collection_constructor Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:41:10 +01:00
Jan Ciolek	77c63a6b92	expr_test: test preparing expr::collection_constructor for map Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:41:09 +01:00
Jan Ciolek	a656fdfe9a	expr_test: test preparing expr::collection_constructor for set Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:22:37 +01:00
Jan Ciolek	76f587cfe7	expr_test: test preparing expr::collection_constructor for list Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:22:37 +01:00
Jan Ciolek	44b55e6caf	expr_test: test preparing expr::tuple_constructor Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:22:37 +01:00
Jan Ciolek	265100a638	expr_test: test preparing expr::untyped_constant Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:22:37 +01:00
Jan Ciolek	76b6161386	expr_test: test preparing expr::bind_variable Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 20:22:36 +01:00
Jan Ciolek	42e01cc67f	expr_test: test preparing expr::null Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:05 +01:00
Jan Ciolek	45b3fca71c	expr_test: test preparing expr::cast Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:05 +01:00
Jan Ciolek	498c9bfa0d	expr_test_utils: add make_receiver Add a convenience function which creates receivers. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:04 +01:00
Jan Ciolek	488056acb7	expr_test: test preparing expr::token Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:04 +01:00
Jan Ciolek	7958f77a40	expr_test: test preparing expr::subscript Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:04 +01:00
Jan Ciolek	569bd61c6c	expr_test: test preparing expr::column_value Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:04 +01:00
Jan Ciolek	26174e29c6	expr_test: test preparing expr::unresolved_identifier It's interesting that prepare_expression for column identifiers doesn't require a receiver. I hope this won't break validation in the future. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-17 17:30:04 +01:00
Jan Ciolek	4c4ed8e6df	test/boost: move expr_test_utils.hh to .hh and .cc in test/lib expr_test_utils.hh was a header file with helper methods for expression tests. All functions were inline, because I didn't know how to create and link a .cc file in test/boost. Now the header is split into expr_test_utils.hh and expr_test_utils.cc and moved to test/lib, which is designed to keep this kind of files. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-20 17:31:37 +02:00
Jan Ciolek	75b27cb61c	cql3: expr: Add unit tests for bind_variable validation of collections evaluating a bind variable should validate collection values. Test that bound collection values are validated, even in case of a nested collection. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-20 12:12:03 +02:00
Jan Ciolek	c4651e897f	cql3: expr: Add test for subscripted list and map Test that subscripting lists and maps works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-20 12:12:03 +02:00
Jan Ciolek	5a00c3dd76	cql3: expr: Add test for usertype_constructor Test that evaluate(usertype_constructor) works as expected. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-20 12:12:03 +02:00
Jan Ciolek	8f6309bd66	cql3: expr: Add test for tuple_constructor Test that evaluate(tuple_constructor) works as expected. It was necessary to implement a custom function for serializing tuples, because some tests require the tuple to contain unset_value or an empty value, which is impossible to express using the exisiting code. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-20 12:12:03 +02:00

1 2

62 Commits