scylladb

Author	SHA1	Message	Date
Piotr Sarna	83ec505fab	cql3: add tracing indexed aggregate queries Commit `1c99ed6ced` added tracing logs about the index chosen for the query, but aggregate queries have a separate code path, which wasn't taken into account. After this patch, tracing for aggregate queries also includes this additional information. Closes #10195	2022-03-11 15:27:03 +02:00
Eliran Sinvani	bf50dbd35b	cql3 statements: Change dependency test API to express better it's purpose Cql statements used to have two API functions, depends_on_keyspace and depends_on_column_family. The former, took as a parameter only a table name, which makes no sense. There could be multiple tables with the same name each in a different keyspace and it doesn't make sense to generalize the test - i.e to ask "Does a statement depend on any table named XXX?" In this change we unify the two calls to one - depends on that takes a keyspace name and optionally also a table name, that way every logical dependency tests that makes sense is supported by a single API call.	2022-02-27 11:48:03 +02:00
Piotr Dulikowski	ddf049738d	indexed_table_select_statement: return some exceptions as exception messages Adjusts the indexed_table_select_statement so that it uses the result-aware methods in storage_proxy and propagates failed results as result_message::exception.	2022-02-22 16:25:21 +01:00
Piotr Dulikowski	c5bcfee28f	select_statement: return exceptions as errors in execute_without_checking_exception_message Modifies the remaining logic of execute_without... (apart from the do_execute call) so that the result-aware versions of storage_proxy's methods are called and failed results are converted to result_message::exception.	2022-02-22 16:25:21 +01:00
Piotr Dulikowski	5106c60cd0	select_statement: return exceptions without throwing in do_execute Modifies do_execute so that it uses the result-aware versions of the query_pager's methods and returns them as result_message::exception.	2022-02-22 16:25:21 +01:00
Piotr Dulikowski	3a4d3f3175	select_statement: implement execute_without_checking_exception_message The select_statement will be able to propagate coordinator failures without throwing, so it's important to override the default implementations of execute and excecute_without... so that the first calls the latter and not the other way around.	2022-02-22 16:25:21 +01:00
Piotr Dulikowski	df7668797b	select_statement: introduce helpers for working with failed results Adds: - Includes for result-related helper methods (to be used in later commits), - Alias for coordinator_result, - The wrap_result_to_error_message function - a bit similar to utils::result_wrap. Adapts a callable T -> shared_ptr<result_message> to take result<T> -> shared_ptr<result_message>. If the result is failed, it converts it into result_message::exception and returns.	2022-02-22 16:25:21 +01:00
Michał Sala	b439d6e710	db: config: add a flag to disable new parallelized aggregation algorithm Just in case the new algorithm turns out to be buggy, add a flag to fall-back to the old algorithm.	2022-02-01 21:26:25 +01:00
Michał Sala	aec96be553	forward_service: add tracing	2022-02-01 21:14:41 +01:00
Michał Sala	f344bd0aaa	cql3: statements: introduce parallelized_select_statement Detect whether a statement is a count() query in prepare time. If so, instantiate a new `select_statement` subclass - `parallelized_select_statement`. This subclass has a different execution logic, that enables it to distribute count() queries across a cluster. Also, a new counter was added - `select_parallelized` that counts the number of parallelized aggregation SELECT query executions.	2022-02-01 21:14:41 +01:00
Michał Sala	0fe59082ec	storage_proxy: extract query_ranges_to_vnodes_generator to a separate file Such separation allows using query_ranges_to_vnodes_generator by other services without needing a storage_proxy dependency.	2022-02-01 21:14:41 +01:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Pavel Emelyanov	00de5f4876	validation: Make validate_column_family use data_dictionary::database And instantly convert the validate_keyspace() as it's not called from anywhere but the validate_column_family(). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-01-14 13:00:53 +03:00
Pavel Emelyanov	b6bc7a9b29	client_state: Make has_column_family_access use data_dictionary::database Straightforward replacement. Internals of the has_column_family_access() temporarily get .real_database(), but it will be changed soon. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-01-14 12:55:15 +03:00
Pavel Emelyanov	095d93eaf8	pager: Keep shared pointer to proxy onboard Pagers are created by alternator and select statement, both have the proxy reference at hands. Next, the pager's unique_ptr is put on the lambda of its fetch_page() continuation and thus it survives the fetch_page execution and then gets destroyed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-01-10 07:58:57 +03:00
Pavel Emelyanov	d98dd0ff80	cql3: Generalize bounce-to-shard result creation The main intention is actually to free the qp.proxy() from the need to provide the get_stats() method. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 11:28:44 +03:00
Pavel Emelyanov	d32de22ee8	cql3: Get data dictionary directly from query_processor After previous patches there's a whole bunch of places that do qp.proxy().data_dictionary() while the data_dictionary is present on the query processor itself and there's a public method to get one. So use it everywhere. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 11:28:44 +03:00
Pavel Emelyanov	da4c29105d	select_statement: Replace all proxy-s with query_processor This is the largest user of proxy argument. Fix them all and their callers (all sit in the same .cc file). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 10:54:28 +03:00
Pavel Emelyanov	bce2ed9c6c	cql3: Make execution stages carry query_processor over The batch_ , modification_ and select_ statements get proxy from query processor just to push it through execution stage. Simplify that by pushing the query processor itself. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 10:53:44 +03:00
Pavel Emelyanov	b990ca5550	cql3: Make .validate() and .check_access() accept query_processor This is mostly a sed script that replaces methods' first argument plus fixes of compiler-generated errors. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 10:53:44 +03:00
Avi Kivity	d768e9fac5	cql3, related: switch to data_dictionary Stop using database (and including database.hh) for schema related purposes and use data_dictionary instead. data_dictionary::database::real_database() is called from several places, for these reasons: - calling yet-to-be-converted code - callers with a legitimate need to access data (e.g. system_keyspace) but with the ::database accessor removed from query_processor. We'll need to find another way to supply system_keyspace with data access. - to gain access to the wasm engine for testing whether used defined functions compile. We'll have to find another way to do this as well. The change is a straightforward replacement. One case in modification_statement had to change a capture, but everything else was just a search-and-replace. Some files that lost "database.hh" gained "mutation.hh", which they previously had access to through "database.hh".	2021-12-15 13:54:23 +02:00
Pavel Emelyanov	b0a8c153f7	select_statement: Remove unused proxy args and captures The generate_view_paging_state_from_base_query_results() has unused proxy argument that's carried over quite a long stack for nothing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20211210175203.26197-1-xemul@scylladb.com>	2021-12-10 20:39:55 +02:00
Nadav Har'El	c6f2afb93d	Merge 'cql3: Allow to skip EQ restricted columns in ORDER BY' from Jan Ciołek In queries like: ```cql SELECT * FROM t WHERE p = 0 AND c1 = 0 ORDER BY (c1 ASC, c2 ASC) ``` we can skip the requirement to specify ordering for `c1` column. The `c1` column is restricted by an `EQ` restriction, so it can have at most one value anyway, there is no need to sort. This commit makes it possible to write just: ```cql SELECT * FROM t WHERE p = 0 AND c1 = 0 ORDER BY (c2 ASC) ``` I reorganized the ordering code, I feel that it's now clearer and easier to understand. It's possible to only introduce a small change to the existing code, but I feel like it becomes a bit too messy. I tried it out on the [`orderby_disorder_small`](https://github.com/cvybhu/scylla/commits/orderby_disorder_small) branch. The diff is a bit messy because I moved all ordering functions to one place, it's better to read [select_statement.cc](https://github.com/cvybhu/scylla/blob/orderby_disorder/cql3/statements/select_statement.cc#L1495-L1658) lines 1495-1658 directly. In the new code it would also be trivial to allow specifying columns in any order, we would just have to sort them. For now I commented out the code needed to do that, because the point of this PR was to fix #2247. Allowing this would require some more work changing the existing tests. Fixes: #2247 Closes #9518 * github.com:scylladb/scylla: cql-pytest: Enable test for skipping eq restricted columns in order by cql3: Allow to skip EQ restricted columns in ORDER BY cql3: Add has_eq_restriction_on_column function cql3: Reorganize orderings code	2021-12-09 21:11:56 +03:00
Jan Ciolek	a548c2dac4	cql3: Allow to skip EQ restricted columns in ORDER BY In queries like: SELECT * FROM t WHERE p = 0 AND c1 = 0 ORDER BY (c1 ASC, c2 ASC) we can skip the requirement to specify ordering for c1 column. The c1 column is restricted by an EQ restriction, so it can have only one value anyway, there is no need to sort. This commit makes it possible to write just: SELECT * FROM t WHERE p = 0 AND c1 = 0 ORDER BY (c2 ASC) Fixes: #2247 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-12-09 12:07:02 +01:00
Jan Ciolek	f76a1cd4bf	cql3: Reorganize orderings code Reorganized the code that handles column ordering (ASC or DESC). I feel that it's now clearer and easier to understand. Added an enum that describes column ordering. It has two possible values: ascending or descending. It used to be a bool that was sometimes called 'reversed', which could mean multiple things. Instead of column.type->is_reversed() != <ordering bool> there is now a function called are_column_select_results_reversed. Split checking if ordering is reversed and verifying whether it's correct into two functions. Before all of this was done by is_reversed() This is a preparation to later allow skipping ORDER BY restrictions on some columns. Adding this to the existing code caused it to get quite complex, but this new version is better suited for the task. The diff is a bit messy because I moved all ordering functions to one place, it's better to read select_statement.cc lines 1495-1651 directly. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-12-09 12:06:42 +01:00
Raphael S. Carvalho	648c921af2	cql3: statements: Fix UB when getting memory consumption limit for unpaged query get_max_result_size() is called on slice moved in previous argument. This results in use-after-move with clang, which evaluation order is left-to-right. For paged queries, max_result_size is later overriden by query_pager, but for unpaged and/or reversed queries it can happen that max result size incorrectly contains the 1MB limit for paged, non-reversed queries. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211207145133.69764-1-raphaelsc@scylladb.com>	2021-12-07 16:57:01 +02:00
Jan Ciolek	22c3e00c44	select_statement: Use precomputed value of _restrictions->need_filtering() Instead of calculating _restrictions->need_filtering() each time, we can now use the value that has been already calculated. This used to happen during query execution, so we get an increase in performance. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-12-03 17:03:53 +01:00
Jan Ciolek	075b3a45fd	select_statement: Store whether restrictions need filtering in a variable Instead of calculating _restrictions->need_filtering() we can calculate it only once and then use this computed variable. It turns out that _restrictions->need_filtering() is called during execution of prepared statements and it has to scan through the whole AST, so doing it only once gives us a performance gain. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-12-03 17:01:09 +01:00
Avi Kivity	595cc328b1	Merge 'cql3: Remove term, replace with expression' from Jan Ciołek This PR finally removes the `term` class and replaces it with `expression`. * There was some trouble with `lwt_cache_id` in `expr::function_call`. The current code works the following way: * for each `function_call` inside a `term` that describes a pk restriction, `prepare_context::add_pk_function_call` is called. * `add_pk_function_call` takes a `::shared_ptr<cql3::functions::function_call>`, sets its `cache_id` and pushes this shared pointer onto a vector of all collected function calls * Later when some condiition is met we want to clear cache ids of all those collected function calls. To do this we iterate through shared pointers collected in `prepare_context` and clear cache id for each of them. This doesn't work with `expr::function_call` because it isn't kept inside a shared pointer. To solve this I put the `lwt_cache_id` inside a shared pointer and then `prepare_context` collects these shared pointers to cache ids. I also experimented with doing this without any shared pointers, maybe we could just walk through the expression and clear the cache ids ourselves. But the problem is that expressions are copied all the time, we could clear the cache in one place, but forget about a copy. Doing it using shared pointers more closely matches the original behaviour. The experiment is on the [term2-pr3-backup-altcache](https://github.com/cvybhu/scylla/tree/term2-pr3-backup-altcache) branch * `shared_ptr<term>` being `nullptr` could mean: * It represents a cql value `null` * That there is no value, like `std::nullopt` (for example in `attributes.hh`) * That it's a mistake, it shouldn't be possible A good way to distinguish between optional and mistake is to look for `my_term->bind_and_get()`, we then know that it's not an optional value. * On the other hand `raw_value` cased to bool means: * `false` - null or unset * `true` - some value, maybe empty I ran a simple benchmark on my laptop to see how performance is affected: ``` build/release/test/perf/perf_simple_query --smp 1 -m 1G --operations-per-shard 1000000 --task-quota-ms 10 ``` * On master (`a21b1fbb2f`) I get: ``` 176506.60 tps ( 77.0 allocs/op, 12.0 tasks/op, 45831 insns/op) median 176506.60 tps ( 77.0 allocs/op, 12.0 tasks/op, 45831 insns/op) median absolute deviation: 0.00 maximum: 176506.60 minimum: 176506.60 ``` * On this branch I get: ``` 172225.30 tps ( 75.1 allocs/op, 12.1 tasks/op, 46106 insns/op) median 172225.30 tps ( 75.1 allocs/op, 12.1 tasks/op, 46106 insns/op) median absolute deviation: 0.00 maximum: 172225.30 minimum: 172225.30 ``` Closes #9481 * github.com:scylladb/scylla: cql3: Remove remaining mentions of term cql3: Remove term cql3: Rename prepare_term to prepare_expression cql3: Make prepare_term return an expression instead of term cql3: expr: Add size check to evaluate_set cql3: expr: Add expr::contains_bind_marker cql3: expr: Rename find_atom to find_binop cql3: expr: Add find_in_expression cql3: Remove term in operations cql3: Remove term in relations cql3: Remove term in multi_column_restrictions cql3: Remove term in term_slice, rename to bounds_slice cql3: expr: Remove term in expression cql3: expr: Add evaluate_IN_list(expression, options) cql3: Remove term in column_condition cql3: Remove term in select_statement cql3: Remove term in update_statement cql3: Use internal cql format in insert_prepared_json_statement cache types: Add map_type_impl::serialize(range of <bytes, bytes>) cql3: Remove term in cql3/attributes cql3: expr: Add constant::view() method cql3: expr: Implement fill_prepare_context(expression) cql3: expr: add expr::visit that takes a mutable expression cql3: expr: Add receiver to expr::bind_variable	2021-11-30 16:39:39 +02:00
Tomasz Grabiec	1e4da2dcce	cql: Fix missing data in indexed queries with base table short reads Indexed queries are using paging over the materialized view table. Results of the view read are then used to issue reads of the base table. If base table reads are short reads, the page is returned to the user and paging state is adjusted accordingly so that when paging is resumed it will query the view starting from the row corresponding to the next row in the base which was not yet returned. However, paging state's "remaining" count was not reset, so if the view read was exhausted the reading will stop even though the base table read was short. Fix by restoring the "remaining" count when adjusting the paging state on short read. Tests: - index_with_paging_test - secondary_index_test Fixes #9198 Message-Id: <20210818131840.1160267-1-tgrabiec@scylladb.com>	2021-11-22 17:42:49 +02:00
Jan Ciolek	dcd3199037	cql3: Rename prepare_term to prepare_expression prepare_term now takes an expression and returns a prepared expression. It should be renamed to prepare_expression. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Jan Ciolek	219f1a4359	cql3: Make prepare_term return an expression instead of term prepare_term is now the only function that uses terms. Change it so that it returns expression instead of term and remove all occurences of expr::to_expression(prepare_term(...)) Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-11-04 15:56:45 +01:00
Jan Ciolek	a24d06c195	cql3: Remove term in select_statement Replace all uses of term with expression in cql3/statements/select_statement Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-10-28 20:55:09 +02:00
Piotr Sarna	f4cb8191fa	cql3: include system distributed tables in system stats Some time ago we started gathering stats for system tables in a separate class in order to be able to distinguish which queries come from the user - e.g. if the unpaged queries are internal or not. Originally, only local system tables were moved into this class, i.e. system and system_schema. It would make sense, however, to also include other internal keyspaces in this separate class - which includes system_distributed, system_traces, etc. Fixes #9380 Closes #9490	2021-10-28 08:58:25 +03:00
Jan Ciolek	221ed38e94	cql3: Replace all uses of bind_and_get with evaluate_to_raw_view Start using evaluate_to_raw_value instead of bind_and_get. This is a step towards using only evaluate. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-09-21 16:20:30 +02:00
Avi Kivity	8c0f2f9e3d	Revert "Merge 'cql3: Add expr::constant to replace terminal' from Jan Ciołek" This reverts commit `e9343fd382`, reversing changes made to `27138b215b`. It causes a regression in v2 serialization_format support: collection_serialization_with_protocol_v2_test fails with: marshaling error: read_simple_bytes - not enough bytes (requested 1627390306, got 3) Fixes #9360	2021-09-20 15:15:09 +03:00
Jan Ciolek	c3fb2f2b57	cql3: Replace all uses of bind_and_get with evaluate_to_raw_view Start using evaluate_to_raw_value instead of bind_and_get. This is a step towards using only evaluate. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-09-13 17:44:06 +02:00
Avi Kivity	8b59e3a0b1	Merge ' cql3: Demand ALLOW FILTERING for unlimited, sliced partitions ' from Dejan Mircevski Return the pre- `6773563d3` behavior of demanding ALLOW FILTERING when partition slice is requested but on potentially unlimited number of partitions. Put it on a flag defaulting to "off" for now. Fixes #7608; see comments there for justification. Tests: unit (debug, dev), dtest (cql_additional_test, paging_test) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #9126 * github.com:scylladb/scylla: cql3: Demand ALLOW FILTERING for unlimited, sliced partitions cql3: Track warnings in prepared_statement test: Use ALLOW FILTERING more strictly cql3: Add statement_restrictions::to_string	2021-08-31 18:05:26 +03:00
Dejan Mircevski	2f28f68e84	cql3: Demand ALLOW FILTERING for unlimited, sliced partitions When a query requests a partition slice but doesn't limit the number of partitions, require that it also says ALLOW FILTERING. Although do_filter() isn't invoked for such queries, the performance can still be unexpectedly slow, and we want to signal that to the user by demanding they explicitly say ALLOW FILTERING. Because we now reject queries that worked fine before, existing applications can break. Therefore, the behavior is controlled by a flag currently defaulting to off. We will default to "on" in the next Scylla version. Fixes #7608; see comments there for justification. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2021-08-31 10:45:41 -04:00
Avi Kivity	b11ec1aeda	cql3: select_statement: convert term::raw to expression Straightforward substitution; using std::optional<> since those expressions are indeed optional.	2021-08-26 15:41:14 +03:00
Jan Ciolek	da54c9e2fb	cql3: Use expressions to calculate the local-index clustering ranges Removes old code used to calculate local-index clustering range and replaces it with new based on the expression variant. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-08-04 17:12:40 +02:00
Avi Kivity	0876248c2b	Merge "cql3: cache function calls evaluation for non-deterministic functions" from Pavel S " `function_call` AST nodes are created for each function with side effects in a CQL query, i.e. non-deterministic functions (`uuid()`, `now()` and some others timeuuid-related). These nodes are evaluated either when a query itself is executed or query restrictions are computed (e.g. partition/clustering key ranges for LWT requests). We need to cache the calls since otherwise when handling a `bounce_to_shard` request for an LWT query, we can possibly enter an infinite bouncing loop (in case a function is used to calculate partition key ranges for a query), since the results can be different each time. Furthermore, we don't support bouncing more than one time. Returning `bounce_to_shard` message more than one time will result in a crash. Caching works only for LWT statements and only for the function calls that affect partition key range computation for the query. `variable_specifications` class is renamed to `prepare_context` and generalized to record information about each `function_call` AST node and modify them, as needed: * Check whether a given function call is a part of partition key statement restriction. * Assign ids for caching if above is true and the call is a part of an LWT statement. There is no need to include any kind of statement identifier in the cache key since `query_options` (which holds the cache) is limited to a single statement, anyway. Function calls are indexed by the order in which they appear within a statement while parsing. There is no need to include any kind of statement identifier to the cache key since `query_options` (which holds the cache) is limited to a single statement, anyway. Note that `function_call::raw` AST nodes are not created for selection clauses of a SELECT statement hence they can only accept only one of the following things as parameters: * Other function calls. * Literal values. * Parameter markers. In other words, only parameters that can be immediately reduced to a byte buffer are allowed and we don't need to handle database inputs to non-pure functions separately since they are not possible in this context. Anyhow, we don't even have a single non-pure function that accepts arguments, so precautions are not needed at the moment. Add a test written in `cql-pytest` framework to verify that both prepared and unprepared lwt statements handle `bounce_to_shard` messages correctly in such scenario. Fixes: #8604 Tests: unit(dev, debug) NOTE: the patchset uses `query_options` as a container for cached values. This doesn't look clean and `service::query_state` seems to be a better place to store them. But it's not forwarded to most of the CQL code and would mean that a huge number of places would have to be amended. The series presents a trade-off to avoid forwarding `query_state` everywhere (but maybe it's the thing that needs to be done, nonetheless). " * 'lwt_bounce_to_shard_cached_fn_v6' of https://github.com/ManManson/scylla: cql-pytest: add a test for non-pure CQL functions cql3: cache function calls evaluation for non-deterministic functions cql3: rename `variable_specifications` to `prepare_context`	2021-07-30 14:21:11 +03:00
Piotr Sarna	60072045db	Merge 'cql3: replace cql3::selection::selectable::raw ... hierarchy with expressions' from Avi Kivity Currently, the grammar has two parallel hierarchies. One hierarchy is used in the WHERE clause, and is based on a combination of `term` and expressions. The other is used in the SELECT clause, and is using the cql3::selection::selectable hierarchy. There is some overlap between the hierarchies: both can name columns. Logically, however, they overlap completely - in SQL anything you can select you can filter on, and vice versa. So merging the two hierarchies is important if we want to enrich CQL. This series does that, partially (see below), converting the SELECT clause to expressions. There is another hierarchy split: between the "raw", pre-prepare object hierarchy, and post-prepare non-raw. This series limits itself to converting the raw hierarchy and leaves the non-raw hierarchy alone. An important design choice is not to have this raw/non-raw split in expressions. Note that most of the hierarchy is completely parallel: addition is addition both before prepare and after prepare (but see [1]). The main difference is around identifiers - before preparation they are unresolved, and after preparation they become `column_definition` objects. We resolve that by having two separate types: `unresolved_identifier` for the pre-prepare phase, and the existing `column_value` for post-prepare phase. Alternative choices would be to keep a separate expression::raw variant, or to template the expression variant on whether it is raw or not. I think it would cause undue bloat and confusion. Note the series introduces many on_internal_error() calls. This is because there is not a lot of overlap in the hierarchies today; you can't have a cast in the WHERE clause, for example. These on_internal_error() calls cannot be triggered since the grammar does not yet allow such expressions to be expressed. As we expand the grammar, they will have to be replaced with working implementations. Lastly, field selection is expressible in both hierarchies. This series does not yet merge the two representations (`column_value.sub` vs `field_selection`), but it should be easy to do so later. [1] the `+` operator can also be translated to list concatenation, which we may choose to represent by yet another type. Test: unit(dev) Closes #9087 * github.com:scylladb/scylla: cql3: expression: update find_atom, count_if for function_call, cast, field_selection cql3: expressions: fix printing of nested expressions cql3: selection: replace selectable::raw with expression cql3: expression: convert selectable::with_field_selection::raw to expression cql3: expression: convert selectable::with_cast::raw to expression cql3: expression: convert selectable::with_anonymous_function::raw to expression cql3: expression: convert selectable::with_function_call::raw to expressions cql3: selectable: make selectable::raw forward-declarable cql3: expressions: convert writetime_or_ttl::raw to expression cql3: expression: add convenience constructor from expression element to nested expression utils: introduce variant_element.hh cql3: expression: use nested_expression in binary_operator cql3: expression: introduce nested_expression class Convert column_identifier_raw's use as selectable to expressions make column_identifier::raw forward declarable cql3: introduce selectable::with_expression::raw	2021-07-30 09:57:39 +02:00
Pavel Solodovnikov	3b6adf3a62	cql3: cache function calls evaluation for non-deterministic functions And reuse these values when handling `bounce_to_shard` messages. Otherwise such a function (e.g. `uuid()`) can yield a different value when a statement re-executed on the other shard. It can lead to an infinite number of `bounce_to_shard` messages sent in case the function value is used to calculate partition key ranges for the query. Which, in turn, will cause crashes since we don't support bouncing more than one time and the second hop will result in a crash. Caching works only for LWT statements and only for the function calls that affect partition key range computation for the query. `variable_specifications` class is renamed to `prepare_context` and generalized to record information about each `function_call` AST node and modify them, as needed: * Check whether a given function call is a part of partition key statement restriction. * Assign ids for caching if above is true and the call is a part of an LWT statement. There is no need to include any kind of statement identifier in the cache key since `query_options` (which holds the cache) is limited to a single statement, anyway. Note that `function_call::raw` AST nodes are not created for selection clauses of a SELECT statement hence they can only accept only one of the following things as parameters: * Other function calls. * Literal values. * Parameter markers. In other words, only parameters that can be immediately reduced to a byte buffer are allowed and we don't need to handle database inputs to non-pure functions separately since they are not possible in this context. Anyhow, we don't even have a single non-pure function that accepts arguments, so precautions are not needed at the moment. Tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-07-30 01:22:39 +03:00
Avi Kivity	e52ebe2da5	types: convert abstract_type::compare and related to std::strong_ordering Change comparators around types to std::strong_ordering. Ref #1449.	2021-07-28 13:19:24 +03:00
Avi Kivity	98c4f0dfb3	cql3: selection: replace selectable::raw with expression Now that all selectable::raw subclasses have been converted to cql3::selectable::with_expression::raw, the class structure is just a wrapper around expressions. Peel it, converting the virtual member functions to free functions, and replacing object instances with expression or nested_expression as the case allows.	2021-07-27 20:16:15 +03:00
Avi Kivity	5adae5837e	cql3: expression: convert selectable::with_anonymous_function::raw to expression Rather than creating a new variant element in expression, we extend function_call to handle both named and anonymous functions, since most of the processing is the same.	2021-07-27 20:13:55 +03:00
Avi Kivity	8a518e9c78	Convert column_identifier_raw's use as selectable to expressions Introduce unresolved_identifer as an unprepared counterpart to column_value. column_identifier_raw no longer inherits from selectable::raw, but methods for now to reduce churn.	2021-07-27 20:08:15 +03:00
Pavel Solodovnikov	49ddd269ea	cql3: rename `variable_specifications` to `prepare_context` The class is repurposed to be more generic and also be able to hold additional metadata related to function calls within a CQL statement. Rename all methods appropriately. Visitor functions in AST nodes (`collect_marker_specification`) are also renamed to a more generic `fill_prepare_context`. The name `prepare_context` designates that this metadata structure is a byproduct of `stmt::raw::prepare()` call and is needed only for "prepare" step of query execution. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-07-24 14:33:33 +03:00
Jan Ciolek	694d62a567	secondary_index: Fix TOKEN() restrictions in indexed SELECTs When using an index, restrictions like token(p) <= x were ignored. Because of this a query like this would select all rows where r = 0: SELECT * FROM tab WHERE r = 0 and token(p) > 0; Adds proper handling of token restrictions to queries that use indexes. Old indexes represented token as a blob, which complicates clustering bounds. Special code is included, which translates token clustering bounds to blob clustering bounds. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-07-21 16:12:49 +02:00

1 2 3 4 5 ...

347 Commits