scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 04:56:58 +00:00

Author	SHA1	Message	Date
Avi Kivity	f9b3c6ddbd	cql3: expr: drop restrictions on list subscripts Restriction validation forbids lists (somewhat oddly, it talks about indexes; validation should make a soft check about indexes (since it can fall back to filtering) and a hard check about supported filtering expressions), and enforces a map in another place. Remove the first restriction and relax the second to allow lists as well as maps as subscript operands. Some validation messages are adjusted to reflect that lists are supported.	2022-05-30 13:29:49 +03:00
Avi Kivity	35e0474410	cql3: expr: prepare_expr: support subscripted lists Infer the type of a list index as int32_type. The error message when a non-subscriptable type is provided is changed, so the corresponding test is changed too.	2022-05-30 13:29:49 +03:00
Avi Kivity	8d667e374b	cql3: expressions: reindent get_value() Whitespace-only change.	2022-05-30 13:29:49 +03:00
Avi Kivity	05388f7a2a	cql3: expression: evaluate() support subscripting lists We already support subscripting maps (for filtering WHERE m[3] = 6), so adding list subscript support is easy. Most of the code is shared. Differences are: - internal list representation is a vector of values, not of key/values - key type is int32_type, not defined by map - need to check index bounds	2022-05-30 13:29:49 +03:00
Jan Ciolek	f9b1fc0b69	cql: Forbid null in lists of IN values We used to allow nulls in lists of IN values, i.e. a query like this would be valid: SELECT * FROM tab WHERE pk IN (1, null, 2); This is an old feature that isn't really used and is already forbidden in Cassandra. Additionally the current implementation doesn't allow for nulls inside the list if it's sent as a bound value. So something like: SELECT * FROM tab WHERE pk IN ?; would throw an error if ? was (1, null, 2). This is inconsistent. Allowing it made writing code cumbersome because this was the only case where having a null inside of a collection was allowed. Because of it there needed to be separate code paths to handle regular lists and lists of NULL values. Forbidding it makes the code nicer and consistent at the cost of a feature that isn't really important. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-05-24 00:17:41 +02:00
cvybhu	21453ac9a4	cql3: Remove scalar from bind_variable_scalar_prepare_expression There is now only one function to prepare bind_variable, so we can remove 'scalar' from its name. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
cvybhu	9b49d27a8d	cql3: expr: Remove shape_type from bind_variable shape_type was used in prepare_expression to differentiate between a few cases and create the correct receivers. This was used by the relation class. Now creating the correct receiver has been delegated to the caller of prepare_expression and all bind_variables can be handled in the same simple way. shape_type is not needed anymore. Not having it is better because it simplifies things. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
cvybhu	c0fc82d4be	cql3: Remove prepare_expression_multi_column This function was used by multi_column_relation.hh, but now it isn't needed anymore. The only way to prepare a bind_variable is now the standard prepare_expression. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
Michał Sala	f6bdc4d694	cql3: expr: add printer for expression expression::printer is used to print CQL expressions in a pretty way that allows them to be parsed back to the same representation. There is a bunch of things that need to be changed when compared to the current implementation of opreatorr<<(expression) to output something parsable. column names should be printed without 'unresolved_identifier()' and sometimes they need to be quoted to perserve case sensitivity. I needed to write new code for printing constant values because the current one did debug printing (e.g. a set was printed as '1; 2; 3'). A list of IN values should be printed inside () intead of [], but because it is internally represented as a list it is by default printed with []. To fix this a temporary tuple_constructor is created and printed. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
cvybhu	c4f846dbc8	cql3: expr: expr::to_restriction: Handle token relations Implement converting token relations to expressions. The code is mostly tekken from functions in token_relation.hh, because we are replicating functionliaty of the functions called token_relation::new_XX_restrictions. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
cvybhu	5fc5012f9b	cql3: expr: expr::to_restriction: Handle multi column relations Implement converting multi column relations to expressions. The code is mostly taken from functions in multi_column_relation.hh, because we are replicating functionality of the functions called multi_column_relation::new_XX_restriction. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:58 +02:00
cvybhu	89950e02b5	cql3: expr: Add expr::to_restriction for single column relations Add a function that will be used to convert expressions received from the parser to restrictions. Currently parser creates relations with expressions inside and then those relations are converted to restrictions. Once this function is implemented we will be able to skip creating relations altogether and convert straight from expression to restriction. This will allow us to remove the relation class. Further functionality will be implemented in the following commits. This commit implements converting single column relations to expressions. The code is mostly taken from functions in single_column_relation.hh, because we are replicating functionality of the functions called single_column_relation::new_XX_restriction. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:17:57 +02:00
cvybhu	3e5e5c4a17	cql3: expr: Add prepare_binary_operator Add a function that allows to prepare a binary_operator received from the parser. It resolves columns on the LHS, calculates type of LHS, and prepares RHS with the correct type. It will be used by expr::to_restriction. Some basic type checks are performed, but more throughout checks will be required in expr::to_restriction to fully validate a relation. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:15:37 +02:00
cvybhu	5dee55d433	cql3: expr: Change how prepare_expression handles bind_variable The situation with preparing bind_variable is a bit strange, there are four shapes of bind variables and receiver behaviour is not in line with other types. To prepare a bind_variable for a list of IN values for an int column the current code requires us to pass a receiver of type int. This is counterintuitive, to prepare a string we pass a receiver with string type, so to prepare list<int> we should pass a receiver of type list<int>, not just int. This commit changes the behaviour in two ways: - Shape of bind_variable doesn't matter anymore - The bind_variable gets the receiver passed to prepare_expression, no more list<receiver> magic. Other variants of bind_variable_x_prepare_expression are not removed yet because they are needed by prepare_expression_mutlti_column. They will be removed later, along with bind_variable::shape_type. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:15:36 +02:00
cvybhu	be6e741b6c	clq3: expr: Add columns to expr::token struct The expr::token struct is created when something like token(p1, p2) occurs in the WHERE clause. Currently expr::token doesn't keep columns passed as arguemnts to the token function. They weren't needed because token() validation was done inside token_relation. Now that we want to use only expressions we need to have columns inside the token struct and validate that those are the correct columns. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:03:11 +02:00
cvybhu	b99aae7d41	cql3: expr: Modify list_prepare_expression to handle lists of IN values The standard CQL list type doesn't allow for nulls inside the collection. However lists of IN values are the exception where bind nullsare allowed, for example in restrictions like: p IN (1, 2, null) To be able to use list_prepare_expression with lists of IN values a flag is added to specify whether nulls should be allowed. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:03:11 +02:00
cvybhu	2b5818697a	cql3: expr: Add expr::as_if for non-const expressions expr::as_if is our wrapper for std::get_if. There was a version for const expression, but there weren't one for mutable expression. Add the mutable version, it will be needed in the following commits. Signed-off-by: cvybhu <jan.ciolek@scylladb.com>	2022-05-16 18:03:11 +02:00
Michał Sala	69bc60ef24	cql3: expr: do not allow unset values inside collections Semantic of unset values inside collections is undefined. Previous behavior of transforming list with unset value into unset value was removed, because I couldn't find a reason for its existence.	2022-04-28 19:33:09 +02:00
Michał Sala	4766e25d6e	cql3: expr: prepare_expr: allow bind markers in collection literals It's easier to allow them then not to do so.	2022-04-28 19:31:09 +02:00
Nadav Har'El	fbb2a41246	expressions: don't dereference invalid map subscript in filter If we have the filter expression "WHERE m[?] = 2", the existing code simply assumed that the subscript is an object of the right type. However, while it should indeed be the right type (we already have code that verifies that), there are two more options: It can also be a NULL, or an UNSET_VALUE. Either of these cases causes the existing code to dereference a non-object as an object, leading to bizarre errors (as in issue #10361) or even crashes (as in issue #10399). Cassandra returns a invalid request error in these cases: "Unsupported unset map key for column m" or "Unsupported null map key for column m". We decided to do things differently: * For NULL, we consider m[NULL] to result in NULL - instead of an error. This behavior is more consistent with other expressions that contain null - for example NULL[2] and NULL<2 both result in NULL as well. Moreover, if in the future we allow more complex expressions, such as m[a] (where a is a column), we can find the subscript to be null for some rows and non-null for other rows - and throwing an "invalid query" in the middle of the filtering doesn't make sense. * For UNSET_VALUE, we do consider this an error like Cassandra, and use the same error message as Cassandra. However, the current implementation checks for this error only when the expression is evaluated - not before. It means that if the scan is empty before the filtering, the error will not be reported and we'll silently return an empty result set. We currently consider this ok, but we can also change this in the future by binding the expression only once (today we do it on every evaluation) and validating it once after this binding. Fixes #10361 Fixes #10399 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-04-24 16:05:34 +03:00
Nadav Har'El	808a93d29b	expressions: fix invalid dereference in map subscript evaluation When we have an filter such as "WHERE m[2] = 3" (where m is a map column), if a row had a null value for m, our expression evaluation code incorrectly dereferences an unset optional, and continued processing the result of this dereference which resulted in undefined behavior - sometimes we were lucky enough to get "marshaling error" but other times Scylla crashed. The fix is trivial - just check before dereferencing the optional value of the map. We return null in that case, which means that we consider the result of null[2] to be null. I think this is a reasonable approach and fits our overall approach of making null dominate expressions (e.g., the value of "null < 2" is also null). The test test_filtering.py::test_filtering_null_map_with_subscript, which used to frequently fail with marshaling errors or crashes, now passes every time so its "xfail" mark is removed. Fixes #10417 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-04-24 14:58:56 +03:00
cvybhu	5c199cad45	cql3: expr: possible_lhs_values: Handle subscript This commit makes subscript an invalid argument to possible_lhs_values. Previously this function simply ignored subscripts and behaved as if it was called on the subscripted column without a subscript. This behaviour is unexpected and potentially dangerous so it would be better to forbid passing subscript to possible_lhs_values entirely. Trying to handle subscript correctly is impossible without refactoring the whole function. The first argument is a column for which we would like to know the possible values. What are possible values of a subscripted column c where c[0] = 1? All lists that have 1 on 0th position? If we wanted to handle this nicely we would have to change the arguments. Such refectoring is best left until the time when this functionality is actually needed, right now it's hard to predict what interface will be needed then. Signed-off-by: cvybhu <jan.ciolek@scylladb.com> Closes #10228	2022-04-11 19:05:09 +03:00
Jan Ciolek	2e7009f427	cql3: expr: is_supported_by: Return false for subscripted values is_supported_by checks whether a given restriction can be supported by some index. Currently when a subscripted value, e.g `m[1]` is encountered, we ignore the fact that there is a subscript and ask whether an index can support the `m` itself. This looks like unintentional behaviour leftover from the times when column_value had a sub field, which could be easily forgotten about. Scylla doesn't support indexes on collection elements at all, so simply returning false there seems like a good idea. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com> Closes #10227	2022-03-15 20:19:33 +02:00
Jan Ciolek	e086201420	cql3: expr: Remove sub from column_value column_value::sub has been replaced by the subscript struct everywhere, so we can finally remove it. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 22:02:39 +01:00
Jan Ciolek	b80f9e6cf8	cql3: Create a subscript in single_column_relation When `val[sub]` is parsed, it used to be the case that column_value with a sub field was created. Now this has been changed to creating a subscript struct. This is the only place where a subscripted value can be created. All the code regarding subscripts now operates using only the subscript struct, so we will be able to remove column_value::sub soon. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 22:02:39 +01:00
Jan Ciolek	cf6e81e731	cql3: expr: Add subscript to expression All handlers for subscript have finally been implemented and subscript can now be added to expression without any trouble. All the commented out code that waited for this moment can now be uncommented. Every such piece of code had a `TODO(subscript)` note and by grepping this phrase we can make sure that we didn't forget any of them. Right now there is two ways to express a subscripted column - either by a column_value with a sub field or by using a subscript struct. The grammar still uses the old column_value way, but column_value.sub will be removed soon and everything will move to the subscript struct. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 22:02:29 +01:00
Jan Ciolek	ec6f93d0c7	cql3: expr: Handle subscript in test_assignment test_assignment can't be passed a column_value, so a subscript won't work as well. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	ab89fc316b	cql3: expr: Handle subscript in prepare_expression column_value can't be prepared, so subscript can't be prepared as well. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	c39498537c	cql3: expr: Handle subscript in fill_prepare_context fill_prepare_context collects useful information about the expression involved in query restrictions. We should collect this information from subscript as well, just like we do from column_value and its sub. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	811685ad6a	cql3: expr: Handle subscript in evaluate A column_value can't be evaluated, so a subscripted column can't evaluated be as well. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	db8990436a	cql3: expr: Handle subscript in extract_single_column_restrictions_for_column extract_single_column_restrictions_for_column finds all restrictions for a column and puts them in a vector. In case we encounter col[sub] we treat it as a restriction on col and add it to the result. This seems to make some sense and is in line with the current behaviour which doesn't check whether a column is subscripted at all. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	2eaa39e1c8	cql3: expr: Handle subscript in search_and_replace Prepare a handler for subscript in search_and_replace. Some of the code must be commented out for now because subscript hasn't been added to expression yet. It will uncommented later. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	e2d983f659	cql3: expr: Handle subscript in recurse_until Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	2d4174dc46	cql3: expr: Implement operator<< for subscript expression can be printed using operator<<. We need to handle subscript there. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	02c3b78e25	cql3: expr: Handle subscript in possible_lhs_values possible_lhs_values returns set of possible values for a column given some restrictions. Current behaviour in case of a subscripted column is to just ignore the subscript and treat the restriction as if it were on just the column. This seems wrong, or at least confusing, but I won't change it in this patch to preserve the existing behaviour. Trying to change this to something more reasonable breaks other code which assumes that possible_lhs_values returns a list of values. (See partition_ranges_from_EQs() in cql3/restrictions/statement_restrictions.cc) Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	07fbf74a97	cql3: expr: Handle subscript in is_supported_by is_supported_by checks whether the given expression is supported by some index. The current behaviour seems wrong, but I kept it to avoid making changes in a refactor PR. Scylla doesn't have indexes on map entries yet, so for a subscript the answer is always no. I think we should just return false there. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	fb59f488df	cql3: expr: Handle subscript in is_satisifed_by For the most part subscript can be handled in the same way as column_value. column_value has a sub argument and all called functions evaluate lhs value using get_value() which is prepared to handle subscripted columns. These functions now take column_maybe_subscripted so we can pass &subscript to them without a problem. The difference is in CONTAINS, CONTAINS_KEY and LIKE. contains() and contains_key() throw an exception when the passed column has a subscript, so now we just throw an exception immediately. like() doesn't have a check for subscripted value, but from reading its code it's clear that it's not ready to handle such values, so an exception is now thrown as well. It shouldn't break any tests because when one tries to perform a query like: `select * from t where m[0] like '%' allow filtering;` an exception is throw somewhere earlier in the code. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	1edaa3ef0d	cql3: expr: Remove unused attribute Functions that were previously marked as unused to make the code compile are now used and we can remove the markings. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	cf839807ac	cql3: expr: Use column_maybe_subscripted in is_one_of() is_one_of() used to take column_value which could be subscripted as an argument. column_value.sub will be removed so this function needs to take column_maybe_subscripted now. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	75c8b2ec6c	cql3: expr: Use column_maybe_subscripted in limits() limits() used to take column_value which could be subscripted as an argument. column_value.sub will be removed so this function needs to take column_maybe_subscripted now. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	bc8c298be3	cql3: expr: Use column_maybe_subscripted in equal() equal() used to take column_value which could be subscripted as an argument. column_value.sub will be removed so this function needs to take column_maybe_subscripted now. To get lhs value the code uses get_value() which is ready to handle subscripted columns. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	ca423a455e	cql3: expr: add get_subscripted_column(column_maybe_subscripted) Add a function that extracts the column_value from column_maybe_subscripted. There were already overloads for expression and subscript, but this one will be needed as well. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	6d42ff580d	cql3: expr: Add as_column_maybe_subscripted Add a convenience function that allows to convert a reference to expression to column_maybe_subscripted. It will be useful in a moment. For now part of it must be commented out because subscript is not in the expression variant yet. It will be uncommented once subscript is finally added to expression. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	d577af0f0c	cql3: expr: Make get_value_comparator work with column_maybe_subscripted There is get_value_comparator(column_value) but soon we will also need get_value_comparator(column_maybe_subscripted). Implement it by copying code from get_value_comparator(column_value). Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	a8287a158a	cql3: expr: Make get_value work with column_maybe_subscripted There is a get_value(column_value), but soon we will also need get_value(column_maybe_subscripted). Implement get_value(column_maybe_subscripted) by checking whether the argument is a column_value or subscript and calling the right code. Code for handling the subscript case is copied from get_value(column_value) where sub has value. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	feee6e4ffb	cql3: expr: Add column_maybe_subscripted column_maybe_subscripted is a variant that can be either a column_value or a subscript. It will be used as an argument to functions which used to take column_value. Right now column_value has a sub field, but this will be removed soon once the subscript struct takes over. Changing the argument type is a smaller change than rewriting all these functions, although if they were rewritten the resulting code would probably be nicer. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:41 +01:00
Jan Ciolek	4d7438d30a	cql3: expr: Add get_subscripted_column Even though the new subscript allows for subscripting anything, the only thing that is really allowed to be subscripted is a column. Add a utility function that extracts the column_value from an expression with is a column_value or subscript. It will came in handy in the following commits. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 21:56:30 +01:00
Jan Ciolek	a5bcd4f7f2	cql3: expr: Add subscript struct Add a struct called subscript, which will be used in expression variant to represent subscripted values e.g col[x], val[sub]. It will replace the sub field of column_value. Having a separate struct in AST for this purpose is cleaner and allows to express subscripting values other than column_value. It is not added to the expression variant yet, because that would require immediately implementing all visitors. The following commits will implement individual visitors and then subscript will finally be added to expression. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-27 01:32:59 +01:00
Jan Ciolek	7234cc851c	cql3: expr: add std::forward in expr::visit expr::visit was missing std::forward on the visitor. In cases where the visitor was passed as an rvalue it wouldn't be properly forwarded to std::visit. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-18 14:19:49 +01:00
Jan Ciolek	8676f60724	cql3: expr: Fix expr::visit so that it works with references expr::visit had a bug where if we wanted to return a reference in the visitor, the reference would be to a temporary stack location instead of the passed argument. So trying to do something like this: ``` const bind_variable& ref = visit(overloaded_functor { [](const bind_variable& bv) -> const bind_variable& { return bv; }, [](const auto&) -> const bind_variable& { ... } }, e); std::cout << ref << std::endl; ``` Would actually print a random location on stack instead of valid value inside of e. Additionally trying to return a non-const reference doesn't even compile. The problem was that the return type of expr::visit was defined as `auto`, which can be `int`, but not `int&`. This has been changed to `decltype(auto)` which can be both `int` and `int&` New version of `expr::visit` works for `const expression&` and `expression&` no matter what the visitor returns. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-02-17 17:29:28 +01:00

1 2 3 4 5

215 Commits