scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 05:53:13 +00:00

Author	SHA1	Message	Date
Gleb Natapov	6a4207f202	Pass service permit to storage_proxy Current cql transport code acquire a permit before processing a query and release it when the query gets a reply, but some quires leave work behind. If the work is allowed to accumulate without any limit a server may eventually run out of memory. To prevent that the permit system should account for the background work as well. The patch is a first step in this direction. It passes a permit down to storage proxy where it will be later hold by background work.	2019-08-12 10:20:43 +03:00
Piotr Sarna	bb08af7e68	cql3: add proper aggregation to paged indexing Aggregated and paged filtering needs to aggregate the results from all pages in order to avoid returning partial per-page results. It's a little bit more complicated than regular aggregation, because each paging state needs to be translated between the base table and the underlying view. The routine keeps fetching pages from the underlying view, which are then used to fetch base rows, which go straight to the result set builder. Fixes #4540	2019-06-24 13:21:32 +02:00
Piotr Sarna	7a8b243ce4	cql3: split execute_base_query implementation In order to handle aggregation queries correctly, the function that returns base query results is split into two, so it's possible to access raw query results, before they're converted into end-user CQL message.	2019-06-24 12:57:03 +02:00
Avi Kivity	a86fdeb02b	Merge "Implement GROUP BY" from Dejan " Cassandra has supported GROUP BY in SELECT statements since 2016 (v3.10), while ScyllaDB currently treats it as a syntax error. To achieve parity with Cassandra in this important bit of functionality, this patch adds full support for GROUP BY, from parsing to validation to implementation to testing. " * 'groupby-implPP' of https://github.com/dekimir/scylla: Implement grouping in selection processing Propagate GROUP BY indices to result_set_builder Process GROUP BY columns into select_statement Parse GROUP BY clause, store column identifiers	2019-05-08 18:35:12 +03:00
Dejan Mircevski	c3929aee3a	Propagate GROUP BY indices to result_set_builder Ensure that the indices recorded in select_statement are passed to result_set_builder when one is created for processing the cell values. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-08 10:10:10 -04:00
Dejan Mircevski	274a77f45e	Process GROUP BY columns into select_statement Validate raw GROUP BY identifiers and translate them into a select_statement member. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-08 10:10:10 -04:00
Dejan Mircevski	e1fb414805	Parse GROUP BY clause, store column identifiers Extend the grammar file with GROUP BY, collect the column identifiers, and store them in raw::select_statement. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2019-05-08 10:09:22 -04:00
Piotr Jastrzebski	f5f6367674	types: Add cql_serialization_format parameter to validate Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-04-09 16:58:22 +02:00
Piotr Sarna	87f6e37caa	cql3: move finding index restrictions to prepare stage Index restrictions that match a given index were recomputed during execution stage, which is redundant and prone to errors. Now, used index restrictions are cached in a prepare statement.	2019-03-20 10:20:22 +01:00
Piotr Sarna	2f173f7ed8	cql3: add handling paging state for local indexes When computing paging state for local indexes, the partition and clustering keys are different than with global ones: - partition key is the same as base's - clustering key starts with the indexed column	2019-03-20 10:20:02 +01:00
Piotr Sarna	75dd964751	cql3: add handling partition slices for local indexes For local indexes, a slice will consist of the indexed column followed by base clustering columns.	2019-03-20 10:20:01 +01:00
Piotr Sarna	b12162c8f5	cql3: add returning correct partition ranges for local indexes Local indexes always share the partition range with their base.	2019-03-20 09:51:46 +01:00
Piotr Sarna	da8e8f18b3	cql3: make read_posting_list a member function It already accepts several arguments that can be extracted from 'this', and more will be added in the future. New parameters include lambdas prepared during prepare stage that define how to extract partition/clustering key ranges depending on which index is used, so keeping it a static function will result in unbounded number of parameters with complex types, which will in turn make the function header almost illegible for a reader. Hence, read_posting_list becomes a member function with easy access to any data prepared during prepare stage.	2019-03-20 09:51:46 +01:00
Piotr Sarna	85017c5ad4	cql3: look for indexed column definition only once There's no need to look for the column definition inside a loop.	2019-03-20 09:51:46 +01:00
Eliran Sinvani	7df0c873aa	transport: sort bound ranges in read reques in order to conform to cql definitions According to the cql definitions, if no ORDER BY clause is present, records should be returned ordered by the clustering keys. Since the backend returns the ranges according to their order of appearance in the request, the bounds should be sorted before sending it to the backend. This kind of sorting is needed in queries that generates more than one bound to be read, examples to such queris are: 1. a SELECT query with an IN clause. 2. a SELECT query on a mixed order tupple of columns (see #2050). The assumption this commit makes is the correctness of the bounds list, that is, the bounds are non overlapping. If this wasn't true, multiple occurences of the same reccord could have returned for certain queries. Tests: 1. Unit tests release 2. All dtest that requires #2050 and #2029 Fixes #2029 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2019-03-05 13:51:17 +02:00
Piotr Sarna	c743617236	cql3: unify max value for row limit and per-partition limit Limits are stored as uint32_t everywhere, but in some places int32_t was used, which created inconsistencies when comparing the value to std::numeric_limits<Type>::max(). In order to solve inconsistencies, the types are unified to uint32_t, and instead of explicitly calling numeric limit max, an already existing constant value query::max_rows is utilized. Fixes #4253 Message-Id: <4234712ff61a0391821acaba63455a34844e489b.1550683120.git.sarna@scylladb.com>	2019-02-21 13:56:02 +02:00
Duarte Nunes	6e83457b1b	Merge 'Add PER PARTITION LIMIT' from Piotr " This series introduces PER PARTITION LIMIT to CQL. Protocol and storage is already capable of applying per-partition limits, so for nonpaged queries the changes are superficial - a variable is parsed and passed down. For paged queries and filtering the situation is a little bit more complicated due to corner cases: results for one partition can be split over 2 or more pages, filtering may drop rows, etc. To solve these, another variable is added to paging state - the number of rows already returned from last served partition. Note that "last" partition may be stretched over any number of pages, not just the last one, which is a case especially when considering filtering. As a result, per-partition-limiting queries are not eligible for page generator optimization, because they may need to have their results locally filtered for extraneous rows (e.g. when the next page asks for per-partition limit 5, but we already received 4 rows from the last partition, so need just 1 more from last partition key, but 5 from all next ones). Tests: unit (dev) Fixes #2202 " * 'add_per_partition_limit_3' of https://github.com/psarna/scylla: tests: remove superficial ignore_order from filtering tests tests: add filtering with per partition key limit test tests: publish extract_paging_state and count_rows_fetched tests: fix order of parameters in with_rows_ignore_order cql3,grammar: add PER PARTITION LIMIT idl,service: add persistent last partition row count cql3: prevent page generator usage for per-partition limit cql3: add checking for previous partition count to filtering pager: add adjusting per-partition row limit cql3: obey per partition limit for filtering cql3: clean up unneeded limit variables cql3: obey per partition limit for select statement cql3: add get_per_partition_limit cql3: add per_partition_limit to CQL statement	2019-02-18 14:47:11 +00:00
Piotr Sarna	3a2b004f02	cql3: prevent page generator usage for per-partition limit Paged queries that induce per-partition limits cannot use page generator optimization, as sometimes the results need to be filtered for extraneous rows on page breaks.	2019-02-18 11:06:44 +01:00
Piotr Sarna	1dadae212a	cql3: add checking for previous partition count to filtering Filtering now needs to take into account per partition limits as well, and for that it's essential to be able to compare partition keys and decide which rows should be dropped - if previous page(s) contained rows with the same partition key, these need to be taken into consideration too.	2019-02-18 11:06:43 +01:00
Piotr Sarna	b965c3778f	cql3: obey per partition limit for filtering Filtering queries now take into account the limit of rows per single partition provided by the user.	2019-02-18 10:29:34 +01:00
Piotr Sarna	b3aa939cde	cql3: clean up unneeded limit variables Some places extracted a `limit` variable to be captured by lambdas, but they were not used inside them.	2019-02-18 10:29:34 +01:00
Piotr Sarna	cfb6e9c79c	cql3: obey per partition limit for select statement Select statement now takes into account the limit of rows per single partition provided by the user.	2019-02-18 10:29:34 +01:00
Piotr Sarna	41b466246e	cql3: add get_per_partition_limit	2019-02-18 10:29:34 +01:00
Piotr Sarna	93786a9148	cql3: add per_partition_limit to CQL statement Select statements can now accept per_partition_limit variable.	2019-02-18 10:29:34 +01:00
Gleb Natapov	0cd9bbb71d	cql3/statements/select_statement: convert index query interface to new query_ranges_to_vnodes_generator interface	2019-02-11 14:45:43 +02:00
Piotr Sarna	9982587bea	cql3: alias single_column_primary_key_restrictions In preparation for detemplatizing this class, it's aliased with single_column_partition_key restrictions and single_column_clustering_key_restrictions accordingly.	2019-01-23 17:43:03 +02:00
Piotr Sarna	87c23372fb	cql3: fix filtering with LIMIT with regard to paging Previously the limit was erroneously applied per page instead of being accumulated, which might have caused returning too many rows. As of now, LIMIT is handled properly inside restrictions filter. Fixes #4100	2019-01-17 13:25:09 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	c3ef99f84f	schema_tables: remove #include of database.hh Distribute in source files (and one header - table_helper.hh) that need it.	2019-01-05 15:43:07 +02:00
Paweł Dziepak	9024187222	partition_slice: use small_vector for column_ids	2018-12-06 14:21:04 +00:00
Piotr Sarna	acf4eadf88	cql3: add proper handling of filtering with LIMIT Previously, limit was erroneously applied before filtering, which might have resulted in truncated results. Now, both paged and unpaged queries are filtered first, and only after that properly trimmed so only X rows are returned for LIMIT X. Fixes #3902	2018-11-29 14:53:30 +01:00
Avi Kivity	4676e07400	consistency_level: simplify validation API Remove unused parameters, replace refcounted pointers by references.	2018-11-27 13:41:49 +02:00
Avi Kivity	2c08bff8d5	Split consistency_level.hh header It has two unrelated users: cql for validation, and storage_proxy for complicated calculations. Split the simple stuff into a new header to reduce dependencies.	2018-11-27 13:32:10 +02:00
Avi Kivity	9201d22c06	cql: remove unneeded includes of consistency_level.hh Move the includes to .cc to reduce include pollution.	2018-11-27 13:18:33 +02:00
Avi Kivity	ecf3f92ec7	cql: add SELECT ... BYPASS CACHE clause The BYPASS CACHE clause instructs the database not to read from or populate the cache for this query. The new keywords (BYPASS and CACHE) are not reserved.	2018-11-26 11:37:49 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	cb7ee5c765	cql3: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Paweł Dziepak	c94d2b6aa6	cql3: restore original timeout behaviour for aggregate queries Commit `1d34ef38a8` "cql3: make pagers use time_point instead of duration" has unintentionally altered the timeout semantics for aggregate queries. Such requests fetch multiple pages before sending a response to the client. Originally, each of those fetches had a timeout-duration to finish, after the problematic commit the whole request needs to complete in a single timeout-duration. This, unsurprisingly, makes some queries that were successful before fail with a timeout. This patch restores the original behaviour. Fixes #3877. Message-Id: <20181022125318.4384-1-pdziepak@scylladb.com>	2018-10-23 12:52:42 +03:00
Eliran Sinvani	fd422c954e	cql3: ensure retrieval of columns for filtering When a query that needs filtering is executed, the columns that the coordinator is filtering by have to be retrieved.The columns should be retrieved even if they are not used for ordering or named in the actual select clause. If the columns are missing from the result set, then any filtering that restricts the missing column will not take place. Fixes #3803 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-21 08:41:46 +03:00
Eliran Sinvani	3e036e2c8c	cql3: refactor find_idx to be part of statement restrictions object find_idx calculates the index that will be used in the statement if indexes are to be used. In the static form it requires redundant information (the schema is already contained within the statement restrictions object). In addition find_idx will need to be used for filtering in order not to include redundant selectors in the selection objects. This change refactors find_idx to run under the statement restrictions object and changes it's scope from private to public. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-21 08:40:24 +03:00
Eliran Sinvani	ded3a03356	cql3: rename selection metadata manipulation functions In the past the addition of non serializable columns was being used only for post ordering of result sets.The newly added ALLOW FILTERING feature will need to use these functions to other post processing operations i.e filtering. The renaming accounts for the new and existing uses for the function. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-18 17:52:04 +03:00
Duarte Nunes	b839f551cf	cql3/statements/select_statement: Don't double count unpaged queries Unpaged queries are those for which the client didn't enable paging, and we already account for them in indexed_table_select_statement::do_execute(). Remove the second increment in read_posting_list(). Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181003121811.11750-1-duarte@scylladb.com>	2018-10-05 17:36:39 +02:00
Duarte Nunes	959559d568	cql3/statements/select_statement: Remove outdated comment Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181003193033.13862-1-duarte@scylladb.com>	2018-10-04 09:45:17 +03:00
Pekka Enberg	de48966abc	cql3: Move as_json_function class to separate file The as_json_function class is not registered as a function, but we can still keep it cql3/functions, as per its namespace, to reduce the size of select_statement.cc. Message-Id: <20181002132637.30233-1-penberg@scylladb.com>	2018-10-03 13:30:08 +01:00
Piotr Sarna	4a23297117	cql3: add asking for pk/ck in the base query Base query partition and clustering keys are used to generate paging state for an index query, so they always need to be present when a paged base query is processed. Message-Id: <f3bf69453a6fd2bc842c8bdbd602d62c91cf9218.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	50d3de0693	cql3: add checking for may_need_paging when executing base query It's not sufficient to check for positive page_size when preparing a base query for indexed select statement - may_need_paging() should be called as well. Message-Id: <d435820019e4082a64ca9807541f0c9ad334e6a8.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	11b8831c04	cql3: move base query command creation to a separate function Message-Id: <6b48b8cbd6312da4a17bfd3c85af628b4215e9f4.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	4b4f57747a	cql3: remove execute(primary_keys) from select statement Right now, with specialized execute() that takes primary keys for indexed_table_select_statement, the original execute() method implemented in select_statement is not used anywhere, so it's removed.	2018-09-27 15:29:28 +02:00
Piotr Sarna	9e0b3cad1e	cql3: add incremental base queries to index query Base queries that are part of index queries are allowed to be short, which can result in wasted work - e.g. when we query all replicas in parallel, but have to discard most of the result, since the first one (in token order) resulted in a short read. Thus, we start by quering 1 range, check if the read is short, and if not, continue by querying 2x more ranges than before. Refs #2960	2018-09-27 15:29:28 +02:00
Piotr Sarna	5b16aeb395	cql3: add base query handling function to indexed statement Handling a base query during the indexed statement execution may require updating its paging state.	2018-09-27 15:29:28 +02:00

1 2 3 4

189 Commits