scylladb

Author	SHA1	Message	Date
Avi Kivity	ea901fdb9d	cql3: expr: fold `null` into untyped_constant/constant Our `null` expression, after the prepare stage, is redundant with a `constant` expression containing the value NULL. Remove it. Its role in the unprepared stage is taken over by untyped_constant, which gains a new type_class enumeration to represent it. Some subtleties: - Usually, handling of null and untyped_constant, or null and constant was the same, so they are just folded into each other - LWT "like" operator now has to discriminate between a literal string and a literal NULL - prepare and test_assignment were folded into the corresponing untyped_constant functions. Some care had to be taken to preserve error messages. Closes #12118	2022-11-29 11:02:18 +02:00
Jan Ciolek	08f40a116d	cql3: expr: change unset value error messages to lowercase The messages used to contain UNSET_VALUE in capital letters, but the tests expect messages with 'unset value'. Change the message so that it can match the expected error text in tests. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-24 17:02:44 +01:00
Jan Ciolek	338af848a8	cql3: expr: remove needless braces around switch cases Originally put braces around the cases because there were local variables that I didn't want to be shadowed. Now there are no variables so the braces can be removed without any problems. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:30 +01:00
Jan Ciolek	e8a46d34c2	cql3: move evaluation IS_NOT NULL to a separate function When evaluating a binary operation with operations like EQUAL, LESS_THAN, IN the logic of the operation is put in a separate function to keep things clean. IS_NOT NULL is the only exception, it has its evaluate implementation right in the evaluate(binary_operator) function. It would be cleaner to have it in a separate dedicated function, so it's moved to one. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:30 +01:00
Jan Ciolek	63a89776a1	cql3: expr properly handle null in is_one_of() Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:27 +01:00
Jan Ciolek	214dab9c77	cql3: expr properly handle null in like() Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:26 +01:00
Jan Ciolek	2ce9c95a9d	cql3: expr properly handle null in contains_key() Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:26 +01:00
Jan Ciolek	336ad61aa3	cql3: expr properly handle null in contains() Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:26 +01:00
Jan Ciolek	e2223be1ec	cql3: expr: properly handle null in limits() Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:26 +01:00
Jan Ciolek	d1abf2e168	cql3: expr: remove unneeded overload of limits() There is a more general version of limits() which takes expressions as both the lhs and rhs arguments. There is no need for a specialized overload. This specialized overload takes a tuple_constructor as lhs, but we call evaluate() on both sides of a binary operator before checking equality, so this won't be useful at all. Having multiple functions increases the risk that one of them has a bug, while giving dubious benfit. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:25 +01:00
Jan Ciolek	0609a425e6	cql3: expr: properly handle null in equality operators Expressions like: 123 = NULL NULL = 123 NULL = NULL NULL != 123 should be tolerated, but evaluate to NULL. The current code assumes that a binary operator can only evaluate to a boolean - true or false. Now a binary operator can also evaluate to NULL. This should happen in cases when one of the operator's sides is NULL. A special class is introduced to represent a value that can be one of three things: true, false or null. It's better than using std::optional<bool>, because optional has implicit conversions to bool that could cause confusion and bugs. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-23 12:44:22 +01:00
Jan Ciolek	6be142e3a0	cql3: expr: remove unneeded overload of equal() There is a more general version of equal() which takes expressions as both the lhs and rhs arguments. There is no need for a specialized overload. This specialized overload takes a tuple_constructor as lhs, but we call evaluate() on both sides of a binary operator before checking equality, so this won't be useful at all. Having multiple functions increases the risk that one of them has a bug, while giving dubious benfit. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-22 14:28:10 +01:00
Jan Ciolek	a1407ef576	cql3: expr: use evaluate(binary_operator) in is_satisfied_by is_satisfied_by has to check if a binary_operator is satisfied by some values. It used to be impossible to evaluate a binary_operator, so is_satisfied had code to check if its satisfied for a limited number of cases occuring when filtering queries. Now evaluate(binary_operator) has been implemented and is_satisfied_by can use it to check if a binary_operator evaluates to true. This is cleaner and reduces code duplication. Additionally cql tests will test the new evalute() implementation. There is one special case with token(). When is_satisfied_by sees a restriction on token it assumes that it's satisfied because it's sure that these token restrictions were used to generate partition ranges. I had to leave this special case in because it's impossible to evaluate(token). Once this is implemented I will remove the special case because it's risky and prone to cause bugs. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-21 20:40:06 +01:00
Jan Ciolek	9c4889ecc3	cql3: expr: handle IS NOT NULL when evaluating binary_operator The code to evaluate binary operators was copied from is_satisfied_by. is_satisfied_by wasn't able to evaluate IS NOT NULL restrictions, so when such restriction is encountered it throws an exception. Implement proper handling for IS NOT NULL binary operators. The switch ensures that all variants of oper_t are handled, otherwise there would be a compilation error. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-21 20:40:00 +01:00
Jan Ciolek	b4cc92216b	cql3: expr: make it possible to evaluate binary_operator evaluate() takes an expression and evaluates it to a constant value. It wasn't possible to evalute binary operators before, so it's added. The code is based on is_satisfied_by, which is currently used to check whether a binary operator evaluates to true or false. It looks like is_satisfied_by and evalate() do pretty much the same thing, one could be implemented using the other. In the future they might get merged into a single function. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-21 17:48:23 +01:00
Jan Ciolek	8d81eaa68f	cql3: expr: accept expression as lhs argument to like() like() used to only accept column_value as the lhs to evaluate. Changed it to accept any generic expression. This will allow to evaluate a more diverse set of binary operators. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-21 16:33:18 +01:00
Jan Ciolek	b1a12686dc	cql3: expr: accept expression as lhs in contains_key contains_key() used to only accept column_value as the lhs to evaluate. Changed it to accept any generic expression. This will allow to evaluate a more diverse set of binary operators. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-21 16:33:02 +01:00
Jan Ciolek	79cd9cd956	cql3: expr: accept expression as lhs argument to contains() contains() used to only accept column_value as the lhs to evaluate. Changed it to accept any generic expression. This will allow to evaluate a more diverse set of binary operators. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-21 16:32:44 +01:00
Jan Ciolek	52bbc1065c	cql3: allow lists of IN elements to be NULL Requests like `col IN NULL` used to cause an error - Invalid null value for colum col. We would like to allow NULLs everywhere. When a NULL occurs on either side of a binary operator, the whole operation should just evaluate to NULL. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes #11775	2022-10-13 15:11:32 +02:00
Jan Ciolek	a2c359a741	cql3: Make CONTAINS KEY NULL return false A binary operator like this: {1: 2, 3: 4} CONTAINS KEY NULL used to evaluate to `true`. This is wrong, any operation involving null on either side of the operator should evaluate to NULL, which is interpreted as false. This change is not backwards compatible. Some existing code might break. partially fixes: #10359 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-05 18:15:44 +02:00
Jan Ciolek	bbfef4b510	cql3: Make CONTAINS NULL return false A binary operator like this: [1, 2, 3] CONTAINS NULL used to evaluate to `true`. This is wrong, any operation involving null on either side of the operator should evaluate to NULL, which is interpreted as false. This change is not backwards compatible. Some existing code might break. partially fixes: #10359 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-10-05 18:15:15 +02:00
Jan Ciolek	ac152af88c	expression: Add for_each_boolean factor boolean_factors is a function that takes an expression and extracts all children of the top level conjunction. The problem is that it returns a vector<expression>, which is inefficent. Sometimes we would like to iterate over all boolean factors without allocations. for_each_boolean_factor is implemented for this purpose. boolean_factors() can be implemented using for_each_boolean_factor, so it's done to reduce code duplication. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-09-25 16:34:22 +03:00
Michał Radwański	10e241988e	cql/expr/expression, index/secondary_index_manager: needs_filtering and index_supports_expression rewrite to accomodate for indexes over collections	2022-08-14 10:29:52 +03:00
Karol Baryła	ac97086855	cql3, index: Use entries() indexes on collections for queries Previous commit added the ability to use GSI over non-frozen collections in queries, but only the keys() and values() indexes. This commit adds support for the missing index type - entries() index. Signed-off-by: Karol Baryła <karol.baryla@scylladb.com> Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-08-14 10:29:52 +03:00
Karol Baryła	7966841d37	cql3, index: Use keys() and values() indexes on collections for queries. Previous commits added the possibility of creating GSI on non-frozen collections. This (and next) commit allow those indexes to actually be used by queries. This commit enables both keys() and values() indexes, as they are pretty similar.	2022-08-14 10:29:52 +03:00
Avi Kivity	8085b9f57a	cql3: expr: add boolean_factors() function to factorize an expression When analyzing a WHERE clause, we want to separate individual factors (usually relations), and later partition them into partition key, clustering key, and regular column relations. The first step is separation, for which this helper is added. Currently, it is not required since the grammar supplies the expression in separated form, but this will not work once it is relaxed to allow any expression in the WHERE clause. A unit test is added.	2022-07-22 20:14:48 +03:00
Avi Kivity	1efb2fecbe	cql3: expression: define operator==() for expressions This is useful for tests, to check that expression manipulations yield the expected results.	2022-07-22 20:14:48 +03:00
Jan Ciolek	c7495fa59e	cql3: Add expr::index_supports_some_column Add a function that checks if there is an index which supports one of the columns present in the given expression. This functionality will soon be needed for clustering and nonprimary columns so it's good to separate into a reusable function. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-19 15:38:20 +02:00
Jan Ciolek	6cf0981aa6	cql3: expr: Add has_only_eq_binops function Add a function which checks that an expression contains only binary operators with '='. Right now this check is done only in a single place, but soon the same check will have to be done for clustering columns as well, so the code is moved to a separate function to prevent duplication. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-18 17:45:06 +02:00
cvybhu	80dda2bb97	cql3: expr: Fix handling reversed types in limits() There was a bug which caused incorrect results of limits() for columns with reversed clustering order. Such columns have reversed_type as their type and this needs to be taken into account when comparing them. It was introduced in `6d943e6cd0`. This commit replaced uses of get_value_comparator with type_of. The difference between them is that get_value_comparator applied ->without_reversed() on the result type. Because the type was reversed, comparisons like 1 < 2 evaluated to false. This caused the test testIndexOnKeyWithReverseClustering to fail, but sadly it wasn't caught by CI because the CI itself has a bug that makes it skip some tests. The test passes now, although it has to be run manually to check that. Fixes: #10918 Signed-off-by: cvybhu <jan.ciolek@scylladb.com> Closes #10994	2022-07-10 09:24:06 +03:00
Jan Ciolek	83f27fc8c1	cql3: expr: Add contains_multi_column_restriction Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 16:29:11 +02:00
Jan Ciolek	fd0798c8a2	cql3: Add expr::value_for value_for is a method from the restriction class which finds the value for a given column. Under the hood it makes use of possible_lhs_values. It will be needed to implement some functionality that was implemented using restrictions before. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 16:29:11 +02:00
Jan Ciolek	24b0a61d51	cql3: Add get_columns_in_commons Add a function that finds common columns between two expressions. It's used in error messages in the original restrictions code so it must be included in the new code as well for compatibility. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 16:29:09 +02:00
Jan Ciolek	177ba9b9db	cql3: expr: Add is_empty_restriction Add a function to check whether an expression restricts anything at all. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 16:29:09 +02:00
Jan Ciolek	228b344d9c	cql3: Replicate column sorting functionality using expressions Restrictions code keeps restrictions for each column in a map sorted by their position in the schema. Then there are methods that allow to access the restricted column in the correct order. To replicate this in upcoming code we need functions that implement this functionality. The original comparator can be found in: cql3/restrictions/single_column_restrictions.hh For primary key columns this comparator compares their positions in the schema. For non-primary columns the position is assumed to be clustering_key_size(), which seems pretty random. To avoid passing the schema to the comparator for nonprimary columns I just assume the position is u32::max(). This seems to be as good of a choice as clustering_key_size(). Orignally Cassandra used -1: `bc8a260471/src/java/org/apache/cassandra/config/ColumnDefinition.java (L79-L86)` We never end up comparing columns of different kind using this comparator anyway. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 16:28:41 +02:00
Jan Ciolek	afc482f0a5	cql3: expr: Add get_the_only_column Add a function that gets the only column from a single column restriction expression. The code would be very similiar to is_single_column_restriction, so a new function is introducted to reduce duplication. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 15:50:48 +02:00
Jan Ciolek	9c3b0299a1	cql3: expr: Add is_single_column_restriction Add a function that checks whether an expression contains restrictions on exactly one column. This a "single_column_restriction" in the same way that instances of "class single_column_restriction" are. It will be used later to distinguish cases later once this class is removed Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 15:49:37 +02:00
Avi Kivity	4d587e0c3d	cql3: raw_value: deduplicate view() and to_view() Commit `e739f2b779` ("cql3: expr: make evaluate() return a cql3::raw_value rather than an expr::constant") introduced raw_value::view() as a synonym to raw_value::to_view() to reduce churn. To fix this duplication, we now remove raw_value::to_view(). raw_value::to_view() was picked for removal because is has fewer call sites, reducing churn again. Closes #10819	2022-06-17 09:32:58 +02:00
Avi Kivity	e739f2b779	cql3: expr: make evaluate() return a cql3::raw_value rather than an expr::constant An expr::constant is an expression that happens to represent a constant, so it's too heavyweight to be used for evaluation. Right now the extra weight is just a type (which causes extra work by having to maintain the shared_ptr reference count), but it will grow in the future to include source location (for error reporting) and maybe other things. Prior to `e9b6171b5` ("Merge 'cql3: expr: unify left-hand-side and right-hand-side of binary_operator prepares' from Avi Kivity"), we had to use expr::constant since there was not enough type infomation in expressions. But now every expression carries its type (in programming language terms, expressions are now statically typed), so carrying types in values is not needed. So change evaluate() to return cql3::raw_value. The majority of the patch just changes that. The rest deals with some fallout: - cql3::raw_value gains a view() helper to convert to a raw_value_view, and is_null_or_unset() to match with expr::constant and reduce further churn. - some helpers that worked on expr::constant and now receive a raw_value now need the type passed via an additional argument. The type is computed from the expression by the caller. - many type checks during expression evaluation were dropped. This is a consequence of static typing - we must trust the expression prepare phase to perform full type checking since values no longer carry type information. Closes #10797	2022-06-15 08:47:24 +02:00
Avi Kivity	6d943e6cd0	cql3: expr: drop column_maybe_subscripted column_maybe_subscripted is a variant<column_value, subscript> that existed for two reasons: 1. evaluation of subscripts and of columns took different paths. 2. calculation of the type of column or column[sub] took different paths. Now that all evaluations go through evaluate(), and the types are present in the expression itself, there is no need for column_maybe_subscripted and it is replaced with plain expressions.	2022-06-12 19:21:28 +03:00
Avi Kivity	2aa9199e9a	cql3: expr: possible_lhs_values(): open-code get_value_comparator() get_value_comparator() is going away soon, so open-code it here. It's not doing much anyway.	2022-06-12 19:14:50 +03:00
Avi Kivity	b1c12073b1	cql3: expr: rationalize lhs/rhs argument order Some functions accept the right-hand-side as the first argument and the left-hand-side as the second argument. This is now confusing, but at least safe-ish, as the arguments have different types. It's going to become dangerous when we switch to expressions for both sides, so let's rationalize it by always starting with lhs. Some parameters were annotated with _lhs/_rhs when it was not clear.	2022-06-12 18:55:24 +03:00
Avi Kivity	9beac1df53	cql3: expr: don't rely on grammar when comparing tuples The grammar only allows comparing tuples of clustering columns, which are non-null, but let's not rely on that deep in expression evaluation as it can be relaxed.	2022-06-12 18:41:03 +03:00
Avi Kivity	9a4f2a8cc3	cql3: expr: wire column_value and subscript to evaluate() With everything standardized on evaluation_inputs(), it's a matter of calling get_value().	2022-06-12 18:21:04 +03:00
Avi Kivity	30721fdc4a	cql3: get_value(subscript): remove gratuitous pointer While extracting get_value(subscript) we inherited a pointer due to the calling convention, we can now remove it.	2022-06-12 18:18:59 +03:00
Avi Kivity	dd2fec9cb1	cql3: expr: reindent get_value(subscript) Whitespace only change.	2022-06-12 18:04:12 +03:00
Avi Kivity	31b9e2a565	cql3: expr: extract get_value(subscript) from get_value(column_maybe_subscripted) We wish to wire get_value(subscript) into evaluate (and get rid of column_maybe_subscripted).	2022-06-12 18:03:03 +03:00
Avi Kivity	b5287db8ea	cql3: expr: drop internal 'column_value_eval_bag' is_satisfied_by() used an internal column_value_eval_bag type that was more awkwardly named (and more awkward to use due to more nesting) than evaluation_inputs. Drop it and use evaluation_inputs throughout. The thunk is_satisified_by(evaluation_inputs) that just called is_satisified_by(column_value_eval_bag) is dropped.	2022-06-12 17:12:41 +03:00
Avi Kivity	55085906ca	cql3: expr: change evalute() to accept evaluation_inputs Currently, evaluate() accepts only query_options, which makes it not useful to evaluate columns. As a result some callers (column_condition) have to call it directly on the right-hand-side of binary expressions instead of evaluating the binary expression itself. Change it to accept evaluation_input as a parameter, but keep the old signature too, since it is called from many places that don't have rows.	2022-06-12 16:51:42 +03:00
Avi Kivity	2ecdb219fb	cql3: expr: make evaluate(<expression subtype>) static They aren't called from anywhere outside expression.cc, and we're playing with the signatures, so hide them to avoid rebuilds.	2022-06-12 16:13:20 +03:00

1 2 3 4

200 Commits