scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-25 19:10:42 +00:00

Author	SHA1	Message	Date
Piotr Sarna	c743617236	cql3: unify max value for row limit and per-partition limit Limits are stored as uint32_t everywhere, but in some places int32_t was used, which created inconsistencies when comparing the value to std::numeric_limits<Type>::max(). In order to solve inconsistencies, the types are unified to uint32_t, and instead of explicitly calling numeric limit max, an already existing constant value query::max_rows is utilized. Fixes #4253 Message-Id: <4234712ff61a0391821acaba63455a34844e489b.1550683120.git.sarna@scylladb.com>	2019-02-21 13:56:02 +02:00
Duarte Nunes	6e83457b1b	Merge 'Add PER PARTITION LIMIT' from Piotr " This series introduces PER PARTITION LIMIT to CQL. Protocol and storage is already capable of applying per-partition limits, so for nonpaged queries the changes are superficial - a variable is parsed and passed down. For paged queries and filtering the situation is a little bit more complicated due to corner cases: results for one partition can be split over 2 or more pages, filtering may drop rows, etc. To solve these, another variable is added to paging state - the number of rows already returned from last served partition. Note that "last" partition may be stretched over any number of pages, not just the last one, which is a case especially when considering filtering. As a result, per-partition-limiting queries are not eligible for page generator optimization, because they may need to have their results locally filtered for extraneous rows (e.g. when the next page asks for per-partition limit 5, but we already received 4 rows from the last partition, so need just 1 more from last partition key, but 5 from all next ones). Tests: unit (dev) Fixes #2202 " * 'add_per_partition_limit_3' of https://github.com/psarna/scylla: tests: remove superficial ignore_order from filtering tests tests: add filtering with per partition key limit test tests: publish extract_paging_state and count_rows_fetched tests: fix order of parameters in with_rows_ignore_order cql3,grammar: add PER PARTITION LIMIT idl,service: add persistent last partition row count cql3: prevent page generator usage for per-partition limit cql3: add checking for previous partition count to filtering pager: add adjusting per-partition row limit cql3: obey per partition limit for filtering cql3: clean up unneeded limit variables cql3: obey per partition limit for select statement cql3: add get_per_partition_limit cql3: add per_partition_limit to CQL statement	2019-02-18 14:47:11 +00:00
Piotr Sarna	3a2b004f02	cql3: prevent page generator usage for per-partition limit Paged queries that induce per-partition limits cannot use page generator optimization, as sometimes the results need to be filtered for extraneous rows on page breaks.	2019-02-18 11:06:44 +01:00
Piotr Sarna	1dadae212a	cql3: add checking for previous partition count to filtering Filtering now needs to take into account per partition limits as well, and for that it's essential to be able to compare partition keys and decide which rows should be dropped - if previous page(s) contained rows with the same partition key, these need to be taken into consideration too.	2019-02-18 11:06:43 +01:00
Piotr Sarna	b965c3778f	cql3: obey per partition limit for filtering Filtering queries now take into account the limit of rows per single partition provided by the user.	2019-02-18 10:29:34 +01:00
Piotr Sarna	b3aa939cde	cql3: clean up unneeded limit variables Some places extracted a `limit` variable to be captured by lambdas, but they were not used inside them.	2019-02-18 10:29:34 +01:00
Piotr Sarna	cfb6e9c79c	cql3: obey per partition limit for select statement Select statement now takes into account the limit of rows per single partition provided by the user.	2019-02-18 10:29:34 +01:00
Piotr Sarna	41b466246e	cql3: add get_per_partition_limit	2019-02-18 10:29:34 +01:00
Piotr Sarna	93786a9148	cql3: add per_partition_limit to CQL statement Select statements can now accept per_partition_limit variable.	2019-02-18 10:29:34 +01:00
Gleb Natapov	0cd9bbb71d	cql3/statements/select_statement: convert index query interface to new query_ranges_to_vnodes_generator interface	2019-02-11 14:45:43 +02:00
Piotr Sarna	9982587bea	cql3: alias single_column_primary_key_restrictions In preparation for detemplatizing this class, it's aliased with single_column_partition_key restrictions and single_column_clustering_key_restrictions accordingly.	2019-01-23 17:43:03 +02:00
Piotr Sarna	87c23372fb	cql3: fix filtering with LIMIT with regard to paging Previously the limit was erroneously applied per page instead of being accumulated, which might have caused returning too many rows. As of now, LIMIT is handled properly inside restrictions filter. Fixes #4100	2019-01-17 13:25:09 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	c3ef99f84f	schema_tables: remove #include of database.hh Distribute in source files (and one header - table_helper.hh) that need it.	2019-01-05 15:43:07 +02:00
Paweł Dziepak	9024187222	partition_slice: use small_vector for column_ids	2018-12-06 14:21:04 +00:00
Piotr Sarna	acf4eadf88	cql3: add proper handling of filtering with LIMIT Previously, limit was erroneously applied before filtering, which might have resulted in truncated results. Now, both paged and unpaged queries are filtered first, and only after that properly trimmed so only X rows are returned for LIMIT X. Fixes #3902	2018-11-29 14:53:30 +01:00
Avi Kivity	4676e07400	consistency_level: simplify validation API Remove unused parameters, replace refcounted pointers by references.	2018-11-27 13:41:49 +02:00
Avi Kivity	2c08bff8d5	Split consistency_level.hh header It has two unrelated users: cql for validation, and storage_proxy for complicated calculations. Split the simple stuff into a new header to reduce dependencies.	2018-11-27 13:32:10 +02:00
Avi Kivity	9201d22c06	cql: remove unneeded includes of consistency_level.hh Move the includes to .cc to reduce include pollution.	2018-11-27 13:18:33 +02:00
Avi Kivity	ecf3f92ec7	cql: add SELECT ... BYPASS CACHE clause The BYPASS CACHE clause instructs the database not to read from or populate the cache for this query. The new keywords (BYPASS and CACHE) are not reserved.	2018-11-26 11:37:49 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Avi Kivity	cb7ee5c765	cql3: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Paweł Dziepak	c94d2b6aa6	cql3: restore original timeout behaviour for aggregate queries Commit `1d34ef38a8` "cql3: make pagers use time_point instead of duration" has unintentionally altered the timeout semantics for aggregate queries. Such requests fetch multiple pages before sending a response to the client. Originally, each of those fetches had a timeout-duration to finish, after the problematic commit the whole request needs to complete in a single timeout-duration. This, unsurprisingly, makes some queries that were successful before fail with a timeout. This patch restores the original behaviour. Fixes #3877. Message-Id: <20181022125318.4384-1-pdziepak@scylladb.com>	2018-10-23 12:52:42 +03:00
Eliran Sinvani	fd422c954e	cql3: ensure retrieval of columns for filtering When a query that needs filtering is executed, the columns that the coordinator is filtering by have to be retrieved. The columns should be retrieved even if they are not used for ordering or named in the actual select clause. If the columns are missing from the result set, then any filtering that restricts the missing column will not take place. Fixes #3803 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-21 08:41:46 +03:00
Eliran Sinvani	3e036e2c8c	cql3: refactor find_idx to be part of statement restrictions object find_idx calculates the index that will be used in the statement if indexes are to be used. In the static form it requires redundant information (the schema is already contained within the statement restrictions object). In addition find_idx will need to be used for filtering in order not to include redundant selectors in the selection objects. This change refactors find_idx to run under the statement restrictions object and changes its scope from private to public. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-21 08:40:24 +03:00
Eliran Sinvani	ded3a03356	cql3: rename selection metadata manipulation functions In the past the addition of non serializable columns was being used only for post ordering of result sets. The newly added ALLOW FILTERING feature will need to use these functions for other post-processing operations, i.e. filtering. The renaming accounts for the new and existing uses for the function. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-18 17:52:04 +03:00
Duarte Nunes	b839f551cf	cql3/statements/select_statement: Don't double count unpaged queries Unpaged queries are those for which the client didn't enable paging, and we already account for them in indexed_table_select_statement::do_execute(). Remove the second increment in read_posting_list(). Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181003121811.11750-1-duarte@scylladb.com>	2018-10-05 17:36:39 +02:00
Duarte Nunes	959559d568	cql3/statements/select_statement: Remove outdated comment Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181003193033.13862-1-duarte@scylladb.com>	2018-10-04 09:45:17 +03:00
Pekka Enberg	de48966abc	cql3: Move as_json_function class to separate file The as_json_function class is not registered as a function, but we can still keep it cql3/functions, as per its namespace, to reduce the size of select_statement.cc. Message-Id: <20181002132637.30233-1-penberg@scylladb.com>	2018-10-03 13:30:08 +01:00
Piotr Sarna	4a23297117	cql3: add asking for pk/ck in the base query Base query partition and clustering keys are used to generate paging state for an index query, so they always need to be present when a paged base query is processed. Message-Id: <f3bf69453a6fd2bc842c8bdbd602d62c91cf9218.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	50d3de0693	cql3: add checking for may_need_paging when executing base query It's not sufficient to check for positive page_size when preparing a base query for indexed select statement - may_need_paging() should be called as well. Message-Id: <d435820019e4082a64ca9807541f0c9ad334e6a8.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	11b8831c04	cql3: move base query command creation to a separate function Message-Id: <6b48b8cbd6312da4a17bfd3c85af628b4215e9f4.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	4b4f57747a	cql3: remove execute(primary_keys) from select statement Right now, with specialized execute() that takes primary keys for indexed_table_select_statement, the original execute() method implemented in select_statement is not used anywhere, so it's removed.	2018-09-27 15:29:28 +02:00
Piotr Sarna	9e0b3cad1e	cql3: add incremental base queries to index query Base queries that are part of index queries are allowed to be short, which can result in wasted work - e.g. when we query all replicas in parallel, but have to discard most of the result, since the first one (in token order) resulted in a short read. Thus, we start by querying 1 range, check if the read is short, and if not, continue by querying 2x more ranges than before. Refs #2960	2018-09-27 15:29:28 +02:00
Piotr Sarna	5b16aeb395	cql3: add base query handling function to indexed statement Handling a base query during the indexed statement execution may require updating its paging state.	2018-09-27 15:29:28 +02:00
Piotr Sarna	bce7232555	cql3: add generating base key from index keys A function that computes base partition/clustering key from index view primary key is provided.	2018-09-27 15:29:28 +02:00
Piotr Sarna	2f085848d8	cql3: add paging state generation function For indexed queries, the paging state needs to be updated based on the results of base query when the read was short.	2018-09-27 15:29:28 +02:00
Piotr Sarna	f21bcbefdf	cql3: move getting index view schema to prepare stage Searching for index view schema for an indexed statement can be done once in prepare stage, so it's moved to indexed_table_select_statement prepare method.	2018-09-27 15:29:28 +02:00
Piotr Sarna	744ac3bf7b	cql3: rename set_has_more_pages to set_paging_state This function's primary goal is to set the paging state passed as a parameter, so its name is changed to match the semantics better.	2018-09-27 15:29:28 +02:00
Piotr Sarna	7c1e4c2deb	cql3: add paging to read_posting_list Instead of a single query, paging is used in order to query an index.	2018-09-27 15:18:06 +02:00
Piotr Sarna	430a49f91a	cql3: make find_index_* functions return paging state In order to implement secondary index paging, intermediary query functions now also return paging state for the view query.	2018-09-27 15:18:06 +02:00
Piotr Sarna	c3dd1775c8	cql3: make read_posting_list return future<rows> Instead of returning a coordinator result and making a caller parse it later, read_posting_list now extracts rows by itself. This change is later needed when querying is replaced with a pager.	2018-09-27 15:18:06 +02:00
Piotr Sarna	1d34ef38a8	cql3: make pagers use time_point instead of duration A standard way for passing a timeout parameter is specifying a time_point, while pagers used to take a duration in order to compute time points on the fly. This patch adds a timeout parameter, which is a time_point, to fetch_page().	2018-09-27 15:18:06 +02:00
Paweł Dziepak	a3746d3b05	paging: make may_need_paging() more conservative There is a bad interaction between may_need_paging() and query result size limiter. The former is trying to avoid the complexity of paged queries when the number of returned rows is going to be smaller than the page size. The latter uses the fact that paged queries need not return all requested rows to limit the size of a query results. Since may_need_paging() may turn a paged query into non-paged one as a side effect it disables the oversized result protection. This patch limits the cases when may_need_paging() disables paging to the situations when we know for sure that query result size limiter won't be needed, i.e.: the result is not going to contain more than one row. If the client knows for sure that the paging is not needed and the performance impact is worthwhile it can disable paging on its side. Otherwise, let's default to the safer behaviour. Fixes #3620. Message-Id: <20180925134431.24329-1-pdziepak@scylladb.com>	2018-09-25 17:01:04 +03:00
Eliran Sinvani	d743ceae76	cql3: ignore LIMIT in select statement with aggregate LIMIT should restrict the output result and not the query whose result set is aggregated. When using an aggregate, the output is guaranteed to be only one row long. Since LIMIT accepts only non-negative numbers, it has no effect and can be ignored. Fixes #2028 Tests: The issue described Testcase, UnitTests. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <6c235376c81f052020e2ed23d0a3d071b36d4415.1534416997.git.eliransin@scylladb.com>	2018-08-16 19:31:56 +01:00
Duarte Nunes	1521dc56ae	Merge 'Pass query options to restrictions filter' from Piotr " This miniseries fixes ALLOW FILTERING support for prepared statements by passing correct query options to the filter instead of empty ones. " * 'pass_query_options_to_restrictions_filter' of https://github.com/psarna/scylla: tests: add testing prepared statements with ALLOW FILTERING cql3: pass query options to restrictions filter	2018-08-09 18:15:18 +01:00
Duarte Nunes	95677877c2	Merge 'JSON support fixes' from Piotr " This series addresses SELECT/INSERT JSON support issues, namely handling null values properly and parsing decimals from strings. It also comes with updated cql tests. Tests: unit (release) " * 'json_fixes_3' of https://github.com/psarna/scylla: cql3: remove superfluous null conversions in to_json_string tests: update JSON cql tests cql3: enable parsing decimal JSON values from string cql3: add missing return for dead cells cql3: simplify parsing optional JSON values cql3: add handling null value in to_json cql3: provide to_json_string for optional bytes argument	2018-08-09 18:05:34 +01:00
Piotr Sarna	cdbeed4e3b	cql3: simplify parsing optional JSON values With new to_json_string implementation that accepts bytes_opt, parsing optional values can be simplified to remove explicit branching.	2018-08-09 18:07:12 +02:00
Piotr Sarna	8c18aaa511	cql3: pass query options to restrictions filter Query options may contain bound values needed for checking filtering restrictions. Previously, empty query_options{} were used, which caused prepared statements to fail. Fixes #3677	2018-08-09 17:44:45 +02:00
Eliran Sinvani	3f2bb07599	cql3: Count unpaged select queries If the counter goes up this can be a possible reason for slowdown in queries (since it means that potentially a large amount of data will be sent to the client at once). Fixes #2478 Tests: cqlsh with PAGING OFF and ON and validating with a print. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <01253cee0b8c1110aaee3da41d1f434ca798b430.1533817568.git.eliransin@scylladb.com>	2018-08-09 13:53:44 +01:00
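
The paging rule described in the 'Add PER PARTITION LIMIT' merge (6e83457b1b) can be illustrated with a small sketch. This is not Scylla's implementation: `row`, `paging_state_sketch`, and `trim_page` are hypothetical names introduced only for illustration, and the sketch assumes that rows of the same partition arrive contiguously within a page. It shows how a counter of rows already returned from the last served partition, carried in the paging state, lets the next page drop extraneous rows from that partition while still allowing the full limit for the partitions that follow.

```cpp
#include <cstdint>
#include <optional>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration; not Scylla's actual types.
struct row {
    std::string partition_key;
    int value;
};

struct paging_state_sketch {
    // Partition served last on the previous page, if any.
    std::optional<std::string> last_partition_key;
    // Rows already returned to the client from that partition.
    uint32_t rows_from_last_partition = 0;
};

// Keeps at most per_partition_limit rows per partition, counting rows that
// earlier pages already returned for the partition named in the paging state.
// The state is updated so the following page can continue the count.
std::vector<row> trim_page(const std::vector<row>& page,
                           uint32_t per_partition_limit,
                           paging_state_sketch& state) {
    std::vector<row> out;
    for (const row& r : page) {
        // A new partition starts: reset the per-partition counter.
        if (!state.last_partition_key || *state.last_partition_key != r.partition_key) {
            state.last_partition_key = r.partition_key;
            state.rows_from_last_partition = 0;
        }
        if (state.rows_from_last_partition < per_partition_limit) {
            out.push_back(r);
            ++state.rows_from_last_partition;
        }
    }
    return out;
}
```

With a limit of 5 and a state recording 4 rows already served from partition "a", a page holding three more "a" rows followed by six "b" rows keeps one "a" row and five "b" rows, which matches the example given in the merge message.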


174 Commits