scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 09:30:45 +00:00

Author	SHA1	Message	Date
Nadav Har'El	1aea2136c8	cql: fix regression in SELECT * GROUP BY Recently, the expression-rewrite effort changed the way that GROUP BY is implemented. Usually GROUP BY involves an aggregation function (e.g., if you want a separate SUM per partition). But there's also a query like SELECT p, c1, c2, v FROM tbl GROUP BY p This query is supposed to return one row - the first row in clustering order - per group (in this case, partition). The expression rewrite re-implemented this feature by introducing a new internal aggregator, first(), which returns the first aggregated value. The above query is rewritten into: SELECT first(p), first(c1), first(c2), first(v) FROM tbl GROUP BY p This case works correctly, and we even have a regression test for it. But unfortunately the rewrite broke the following query: SELECT * FROM tbl GROUP BY p Note the "" instead of the explicit list of columns. In our implementation, a selection of "" is looks like an empty selection, and it didn't get the "first()" treatment and it remained a "SELECT " - and wrongly returned all rows instead of just the first one in each partition. This was a regression - it worked correctly in Scylla 5.2 (and also in Cassandra) - see the next patch for a regression test. In this patch we fix this regression. When there is a GROUP BY, the "" is rewritten to the appropriate list of all visible columns and then gets the first() treatment, so it will return only the first row as expected. The next patch will be a test that confirms the bug and its fix. Fixes #16531 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2023-12-25 17:52:57 +02:00
Avi Kivity	66c47d40e6	cql3: selection: drop selector_factories, selectables, and selectors The whole class hierarchy is no longer used by anything and we can just delete it.	2023-07-03 19:45:17 +03:00
Avi Kivity	039472ffb9	cql3: selection: don't create selector_factories any more We no longer use selector_factories for anything, so we can drop them.	2023-07-03 19:45:17 +03:00
Avi Kivity	e521557ce5	cql3: selection: collect column_definitions using expressions The replica needs to know which columns we're interested in. Iterate and recurse into all selector expressions to collect all mentioned columns. We use the same algorithm that create_factories_and_collect_column_definitions() uses, even though it is quadratic, to avoid causing surprises.	2023-07-03 19:45:17 +03:00
Avi Kivity	7bd317ace4	cql3: selection: reimplement selection::is_aggregate() We can get rid of the last use of selector_factories by reimplementing is_aggregate(). It's simple - if we have an inner loop, we're aggregating.	2023-07-03 19:45:17 +03:00
Avi Kivity	91cdaa72bd	cql3: selection: evaluate aggregation queries via expr::evaluate() When constructing a selection_with_processing, split the selectors into an inner loop and an outer loop with split_aggregation(). We can then reimplement add_input_row() and get_output_row() as follows: - add_input_row(): evaluate the inner loop expressions and store the results in temporaries - get_output_row(): evaluate the outer loop expressions, pulling in values from those temporaries. reset(), which is called between groups, simply copies the initial values rathered by split_aggregation() into the temporaries. The only complexity comes from add_column_for_post_query_processing(), which essentially re-does the work of split_aggregation(). It would be much better if we added the column before split_aggregation() was called, but some refactoring has to take place before that happens.	2023-07-03 19:45:17 +03:00
Avi Kivity	6bf1bd7130	cql3: selection: evaluate non-aggregating complex selections using expr::evaluate() Now that everything is in place, implement the fast-path transform_input_row() for selection_with_processing. It's a straightforward call to evaluate() in a loop. We adjust add_column_for_post_processing() to also update _selectors, otherwise ORDER BY clauses that require an additional column will not see that column. Since every sub-class implements transform_input_row(), mark the base class declaration as pure virtual.	2023-07-03 19:45:17 +03:00
Avi Kivity	aed01018a3	cql3: selection: make result_set_builder::current non-optional<> Previously, we used the engagedness of result_set_builder::optional as a flag, but the previous patch eliminated that and it's always engaged. Remove the optional wrapper to reduce noise.	2023-07-03 19:45:17 +03:00
Avi Kivity	44c8507075	cql3: selection: simplify row/group processing Processing a result set relies on calling result_set_builder::new_row(). This function is quite complex as it has several roles: - complete processing of the previously computed row, if any - determine if GROUP BY grouping has changed, and flush the previous group if so - flush the last group if that's the case This works now, but won't work with expr::evaluate. The reason is that new_row() is called after the partition key and clustering key of the new row have been evaluated, so processing of the previous row will see incorrect data. It works today because we copy the partition key and clustering key into result_set_builder::current, but expr::evaluate uses the exploded partition key and clustering key, which have been clobbered. The solution is to separate the roles. Instead of new_row() that's responsible for completing the previous row and starting a new one, we have start_new_row() that's responsible for what its name says, and complete_row() that's responsible for completing the row and checking for group change. The responsibity for flushing the final group is moved to result_set_builder::build(). This removes the awkward "more_rows_coming" parameter that makes everything more complicated. result_set_builder::current is still optional, but it's always engaged. The next patch will clean that up.	2023-07-03 19:45:17 +03:00
Avi Kivity	877f4f86d2	cql3: selection: convert requires_thread to expressions If any function requires a thread to execute (due to running in Lua or wasm), then the entire selection needs to run in a thread.	2023-07-03 19:45:17 +03:00
Avi Kivity	cbd68abde8	cql: selection: convert used_functions() to expressions used_functions() is used to check whether prepared statements need to be invalidated when user-defined functions change. We need to skip over empty scalar components of aggregates, since these can be defined by users (with the same meaning as if the identity function was used).	2023-07-03 19:45:17 +03:00
Avi Kivity	bfb1acc6d3	cql3: selection: convert is_reducible/get_reductions to expressions The current version of automatic query parallelization works when all selectors are reducible (e.g. have a state_reduction_function member), and all the inputs to the aggregates are direct column selectors without further transformation. The actual column names and reductions need to be packed up for forward_service to be used. Convert is_reducible()/get_reductions() to the expression world. The conversion is fairly straightforward.	2023-07-03 19:45:17 +03:00
Avi Kivity	d99fc29e2d	cql3: selection: convert is_count() to expressions Early versions of automatic query parallelization only supported `SELECT count(*)` with one selector. Convert the check to expressions.	2023-07-03 19:45:17 +03:00
Avi Kivity	d36eb8cea6	cql3: selection convert contains_ttl/contains_writetime to work on expressions contains_ttl/contains_writetime are two attributes of a selection. If a selection contains them, we must ask the replica to send them over; otherwise we don't have data to process. Not sending ttl/writetime saves some effort. The implementation is a straightforward recursive descent using expr::find_in_expression.	2023-07-03 19:45:17 +03:00
Avi Kivity	6c2bb5e1ed	cql3: selection: make simple_selectors stateless Now that we push all GROUP BY queries to selection_with_processing, we always process rows via transform_input_row() and there's no reason to keep any state in simple_selectors. Drop the state and raise an internal error if we're ever called for aggregation.	2023-07-03 19:45:17 +03:00
Avi Kivity	f48ecb5049	cql3: selection: short-circuit non-aggregations Currently, selector evaluation assumes the most complex case where we aggregate, so multiple input rows combine into one output row. In effect the query either specifies an outer loop (for the group) and an inner loop (for input rows), or it only specifies the inner loop; but we always perform the outer and inner loop. Prepare to have a separate path for the non-aggregation case by introducing transform_input_row().	2023-07-03 19:45:17 +03:00
Avi Kivity	7c3ceb6473	cql3: select_statement: use prepared selectors Change one more layer of processing to work on prepared rather than raw selectors. This moves the call to prepare the selectors early in select_statement processing. In turn this changes maybe_jsonize_select_clause() and forward_service's mock_selection() to work in the prepared realm as well. This moves us one step closer to using evaluate() to process the select clause, as the prepared selectors are now available in select_statement. We can't use them yet since we can't evaluate aggregations.	2023-07-03 19:45:17 +03:00
Avi Kivity	a338d0455d	cql3: selection: avoid selector_factories in collect_metadata() Generate the column headings in the result set metadata using the newly introduced result_set_metadata mode of the expression printer.	2023-07-03 19:45:17 +03:00
Avi Kivity	a1f4abb753	cql3: selection: convert collect_metadata() to the prepared expression domain Simplifies refactoring later on.	2023-07-03 19:45:17 +03:00
Avi Kivity	91b251f6b4	cql3: selection: convert processes_selection to work on prepared expressions processes_selection() checks whether a selector passes-through a column or applies some form of processing (like a case or function application). It's more sensible to do this in the prepared domain as we have more information about the expression. It doesn't really help here, but it does help the refactoring later in the series.	2023-07-03 19:45:17 +03:00
Avi Kivity	4fb797303f	cql3: selection: prepare selectors earlier Currently, each selector expression is individually prepared, then converted into a selector object that is later executed. This is done (on a vector of raw selectors) by cql3::selection::raw_selector::to_selectables(). Split that into two phases. The first phase converts raw_selector into a new struct prepared_selector (a better name would be plain 'selector', but it's taken for now). The second phase continues the process and converts prepared_selector into selectables. This gives us a full view of the prepared expressions while we're preparing the select clause of the select statement.	2023-07-03 19:45:17 +03:00
Avi Kivity	f6f974cdeb	cql3: selection: fix GROUP BY, empty groups, and aggregations A GROUP BY combined with aggregation should produce a single row per group, except for empty groups. This is in contrast to an aggregation without GROUP BY, which produces a single row no matter what. The existing code only considered the case of no grouping and forced a row into the result, but this caused an unwanted row if grouping was used. Fix by refining the check to also consider GROUP BY. XFAIL tests are relaxed. Fixes #12477. Note, forward_service requires that aggregation produce exactly one row, but since it can't work with grouping, it isn't affected. Closes #14399	2023-06-28 18:56:22 +03:00
Avi Kivity	b858a4669d	cql3: expr: break up expression.hh header Adding a function declaration to expression.hh causes many recompilations. Reduce that by: - moving some restrictions-related definitions to the existing expr/restrictions.hh - moving evaluation related names to a new header expr/evaluate.hh - move utilities to a new header expr/expr-utilities.hh expression.hh contains only expression definitions and the most basic and common helpers, like printing.	2023-06-22 14:21:03 +03:00
Avi Kivity	32b27d6a08	cql3: expr: change evaluation_input vector components to take spans Spans are slightly cleaner, slightly faster (as they avoid an indirection), and allow for replacing some of the arguments with small_vector:s. Closes #14313	2023-06-22 11:28:01 +02:00
Avi Kivity	1040589828	cql3: selection: prepare selector expressions Call prepare_expression() on selector expressions to resolve types. This leaves us with just one way to move from the unprepared domain to the prepared domain. The change is somewhat awkward since do_prepare_selectable() is re-doing work that is done by prepare_expression(), but somehow it all works. The next patch will tear down the unnecessary double-preparation.	2023-06-13 21:04:49 +03:00
Avi Kivity	42a1ced73b	cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt The expression system uses managed_bytes_opt for values, but result_set uses bytes_opt. This means that processing values from the result set in expressions requires a copy. Out of the two, managed_bytes_opt is the better choice, since it prevents large contiguous allocations for large blobs. So we switch result_set to use managed_bytes_opt. Users of the result_set API are adjusted. The db::function interface is not modified to limit churn; instead we convert the types on entry and exit. This will be adjusted in a following patch.	2023-05-07 17:17:36 +03:00
Piotr Sarna	4624934032	selection: add a getter for used functions The function allows extracting used function definitions from given selection. Thanks to that, it will be possible to verify if the callee has proper permissions to execute given functions.	2023-03-09 17:51:17 +01:00
Avi Kivity	721c05b7ec	cql3: selection: introduce selection_from_partition_slice Since expressions were introduced for SELECT statements, they work with `selection` object to represent which table columns they can work with. Probably a neutral representation would have been better, but that's what we have now. LWT works with partition_slice, so introduce a selection_from_partition_slice() helper to bridge the two worlds.	2023-02-12 17:17:01 +02:00
Avi Kivity	2739ac66ed	treewide: drop cql_serialization_format Now that we don't accept cql protocol version 1 or 2, we can drop cql_serialization format everywhere, except when in the IDL (since it's part of the inter-node protocol). A few functions had duplicate versions, one with and one without a cql_serialization_format parameter. They are deduplicated. Care is taken that `partition_slice`, which communicates the cql_serialization_format across nodes, still presents a valid cql_serialization_format to other nodes when transmitting itself and rejects protocol 1 and 2 serialization\ format when receiving. The IDL is unchanged. One test checking the 16-bit serialization format is removed.	2023-01-03 19:54:13 +02:00
Jan Ciolek	b974d4adfb	cql3: don't ignore other restrictions when a multi column restriction is present during filtering When filtering with multi column restriction present all other restrictions were ignored. So a query like: `SELECT * FROM WHERE pk = 0 AND (ck1, ck2) < (0, 0) AND regular_col = 0 ALLOW FILTERING;` would ignore the restriction `regular_col = 0`. This was caused by a bug in the filtering code: `2779a171fc/cql3/selection/selection.cc (L433-L449)` When multi column restrictions were detected, the code checked if they are satisfied and returned immediately. This is fixed by returning only when these restrictions are not satisfied. When they are satisfied the other restrictions are checked as well to ensure all of them are satisfied. This code was introduced back in 2019, when fixing #3574. Perhaps back then it was impossible to mix multi column and regular columns and this approach was correct. Fixes: #6200 Fixes: #12014 Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-11-18 15:14:16 +01:00
Avi Kivity	13a64d8ab2	Merge 'Remove all remaining restrictions classes' from Jan Ciołek This PR removes all code that used classes `restriction`, `restrictions` and their children. There were two fields in `statement_restrictions` that needed to be dealt with: `_clustering_columns_restrictions` and `_nonprimary_key_restrictions`. Each function was reimplemented to operate on the new expression representaiion and eventually these fields weren't needed anymore. After that the restriction classes weren't used anymore and could be deleted as well. Now all of the code responsible for analyzing WHERE clause and planning a query works on expressions. Closes #11069 * github.com:scylladb/scylla: cql3: Remove all remaining restrictions code cql3: Move a function from restrictions class to the test cql3: Remove initial_key_restrictions cql3: expr: Remove convert_to_restriction cql3: Remove _new from _new_nonprimary_key_restrictions cql3: Remove _nonprimary_key_restrictions field cql3: Reimplement uses of _nonprimary_key_restrictions using expression cql3: Keep a map of single column nonprimary key restrictions cql3: Remove _new from _new_clustering_columns_restrictions cql3: Remove _clustering_columns_restrictions from statement_restrictions cql3: Use a variable instead of dynamic cast cql3: Use the new map of single column clustering restrictions cql3: Keep a map of single column clustering key restrictions cql3: Return an expression in get_clustering_columns_restrctions() cql3: Reimplement _clustering_columns_restrictions->has_supporting_index() cql3: Don't create single element conjunction cql3: Add expr::index_supports_some_column cql3: Reimplement has_unrestricted_components() cql3: Reimplement _clustering_columns_restrictions->need_filtering() cql3: Reimplement num_prefix_columns_that_need_not_be_filtered cql3: Use the new clustering restrictions field instead of ->expression cql3: Reimplement _clustering_columns_restrictions->size() using expressions cql3: Reimplement _clustering_columns_restrictions->get_column_defs() using expressions cql3: Reimplement _clustering_columns_restrictions->is_all_eq() using expressions cql3: expr: Add has_only_eq_binops function cql3: Reimplement _clustering_columns_restrictions->empty() using expressions	2022-07-20 18:01:15 +03:00
Jan Ciolek	599bcd6ea7	cql3: Remove all remaining restrictions code The classes restriction, restrictions and its children aren't used anywhere now and can be safely removed. Some includes need to be modified for the code to compile. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-20 09:10:31 +02:00
Jan Ciolek	9d1ba07471	cql3: Reimplement uses of _nonprimary_key_restrictions using expression All parts of the code that use _nonprimary_key_restrictions are changed to use _new_nonprimary_key_restrictions instead. I decided not to split this into multiple commits, as there isn't a lot of changes and they are analogous to the ones done before for partition and clustering columns. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-20 09:10:30 +02:00
Jan Ciolek	9a03a09422	cql3: Keep a map of single column clustering key restrictions Having this map is useful in a bunch of places. To keep code simple it could be created from scratch each time, but it's also used in do_filter, so this could actually affect performance. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-19 16:02:01 +02:00
Jan Ciolek	2b7ffd57fb	cql3: Return an expression in get_clustering_columns_restrctions() get_clustering_columns_restrctions() used to return a shared pointer to the clustering_restrictions class. Now everything is being converted to expression, so it should return an expression as well. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-19 16:02:01 +02:00
Jadw1	6d977fcf88	cql3: selection: detect parallelize reduction type Detects type of reduction if it is possible. Separate case for `COUNT(*)` is left for compatibility reason. By now only single selection is supported.	2022-07-18 15:25:41 +02:00
Jan Ciolek	4026042cbc	cql3: use the new map in get_single_column_partition_key_restrictions Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 16:29:11 +02:00
Jan Ciolek	3e3d2f939c	cql3: Replace uses of single_column_restriction with restriction single_column_restriction is a class used to represent restrictions in a single column. The class is very simple - it's basically an expression with some additional information. As a step towards removing all restriction classes all uses of this class are replaced by uses of the generic restriction class. All functionality of this class has been implemented using free standing functions operating on expressions. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2022-07-01 15:52:10 +02:00
Pavel Emelyanov' via ScyllaDB development	a78af050fd	cql: Constify select_statement restrictions It is in fact immutable (both the pointer and the object it points to), so is the pointer copy returned by get_restrictions() method, so are those propagated to filtering stuff. tests: https://jenkins.scylladb.com/job/releng/job/Scylla-CI/1028 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20220624083351.24970-1-xemul@scylladb.com>	2022-06-24 12:27:36 +03:00
Avi Kivity	c80999fab4	cql3: expr: push is_satisfied_by regular and static column extraction to callers is_satisfied_by() rearranges the static and regular columns from query::result_row_view form (which is a use-once iterator) to std::vector<managed_bytes_opt> (which uses the standard value representation, and allows random access which expression evaluation needs). Doing it in is_saitisfied_by() means that it is done every time an expression is evaluated, which is wasteful. It's also done even if the expression doesn't need it at all. Push it out to callers, which already eliminates some calls. We still pass cql3::expr::selection, which is a layering violation, but that is left to another time. Note that in view.cc's check_if_matches(), we should have been able to move static_and_regular_columns calculation outside the loop. However, we get crashes if we do. This is likely due to a preexisting bug (which the zero iterations loop avoids). However, in selection.cc, we are able to avoid the computation when the code claims it is only handling partition keys or clustering keys.	2022-06-12 16:12:41 +03:00
Avi Kivity	4b715226fe	cql3: expr: convert is_satisfied_by() signature to evaluation_inputs Callers are converted, but the internals are kept using the old conventions until more APIs are converted. Although the new API allows passing no query_options, the view code keeps passing dummy query_options and improvement is left as a FIXME.	2022-06-12 12:53:44 +03:00
Avi Kivity	5937b1fa23	treewide: remove empty comments in top-of-files After `fcb8d040` ("treewide: use Software Package Data Exchange (SPDX) license identifiers"), many dual-licensed files were left with empty comments on top. Remove them to avoid visual noise. Closes #10562	2022-05-13 07:11:58 +02:00
Michał Sala	bb7edf3785	cql3: selection: detect if a selection represents count() The way that this detection works is a bit clunky, but it does its job given the simplest cases e.g. "SELECT COUNT() FROM ks.t". It fails when there are multiple selectors, or when there is a column name specified ("SELECT COUNT(column_name) FROM ks.t").	2022-02-01 21:14:41 +01:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	d768e9fac5	cql3, related: switch to data_dictionary Stop using database (and including database.hh) for schema related purposes and use data_dictionary instead. data_dictionary::database::real_database() is called from several places, for these reasons: - calling yet-to-be-converted code - callers with a legitimate need to access data (e.g. system_keyspace) but with the ::database accessor removed from query_processor. We'll need to find another way to supply system_keyspace with data access. - to gain access to the wasm engine for testing whether used defined functions compile. We'll have to find another way to do this as well. The change is a straightforward replacement. One case in modification_statement had to change a capture, but everything else was just a search-and-replace. Some files that lost "database.hh" gained "mutation.hh", which they previously had access to through "database.hh".	2021-12-15 13:54:23 +02:00
Avi Kivity	2d25705db0	cql3: deinline non-trivial methods in selection.hh This allows us to forward-declare raw_selector, which in turn reduces indirect inclusions of expression.hh from 147 to 58, reducing rebuilds when anything in that area changes. Includes that were lost due to the change are restored in individual translation units. Closes #9434	2021-10-05 12:58:55 +02:00
Avi Kivity	daf028210b	build: enable -Winconsistent-missing-override warning This warning can catch a virtual function that thinks it overrides another, but doesn't, because the two functions have different signatures. This isn't very likely since most of our virtual functions override pure virtuals, but it's still worth having. Enable the warning and fix numerous violations. Closes #9347	2021-09-15 12:55:54 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	fff7ef1fc2	treewide: reduce boost headers usage in scylla header files `dev-headers` target is also ensured to build successfully. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-05-20 01:33:18 +03:00
Avi Kivity	a36eb586ea	cql3: selection: don't use gcc extension "typeof" typeof is not recognized by clang. Use the modern equivalent "decltype" instead. Closes #7386	2020-10-11 17:21:15 +03:00

1 2 3

111 Commits