scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-20 16:40:35 +00:00

Author	SHA1	Message	Date
Avi Kivity	cb7ee5c765	cql3: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Paweł Dziepak	c94d2b6aa6	cql3: restore original timeout behaviour for aggregate queries Commit `1d34ef38a8` "cql3: make pagers use time_point instead of duration" has unintentionally altered the timeout semantics for aggregate queries. Such requests fetch multiple pages before sending a response to the client. Originally, each of those fetches had a timeout-duration to finish, after the problematic commit the whole request needs to complete in a single timeout-duration. This, unsurprisingly, makes some queries that were successful before fail with a timeout. This patch restores the original behaviour. Fixes #3877. Message-Id: <20181022125318.4384-1-pdziepak@scylladb.com>	2018-10-23 12:52:42 +03:00
Eliran Sinvani	fd422c954e	cql3: ensure retrieval of columns for filtering When a query that needs filtering is executed, the columns that the coordinator is filtering by have to be retrieved. The columns should be retrieved even if they are not used for ordering or named in the actual select clause. If the columns are missing from the result set, then any filtering that restricts the missing column will not take place. Fixes #3803 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-21 08:41:46 +03:00
Eliran Sinvani	3e036e2c8c	cql3: refactor find_idx to be part of statement restrictions object find_idx calculates the index that will be used in the statement if indexes are to be used. In the static form it requires redundant information (the schema is already contained within the statement restrictions object). In addition find_idx will need to be used for filtering in order not to include redundant selectors in the selection objects. This change refactors find_idx to run under the statement restrictions object and changes its scope from private to public. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-21 08:40:24 +03:00
Eliran Sinvani	ded3a03356	cql3: rename selection metadata manipulation functions In the past the addition of non serializable columns was being used only for post ordering of result sets. The newly added ALLOW FILTERING feature will need to use these functions for other post-processing operations, i.e. filtering. The renaming accounts for the new and existing uses for the function. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2018-10-18 17:52:04 +03:00
Duarte Nunes	b839f551cf	cql3/statements/select_statement: Don't double count unpaged queries Unpaged queries are those for which the client didn't enable paging, and we already account for them in indexed_table_select_statement::do_execute(). Remove the second increment in read_posting_list(). Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181003121811.11750-1-duarte@scylladb.com>	2018-10-05 17:36:39 +02:00
Duarte Nunes	959559d568	cql3/statements/select_statement: Remove outdated comment Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181003193033.13862-1-duarte@scylladb.com>	2018-10-04 09:45:17 +03:00
Pekka Enberg	de48966abc	cql3: Move as_json_function class to separate file The as_json_function class is not registered as a function, but we can still keep it cql3/functions, as per its namespace, to reduce the size of select_statement.cc. Message-Id: <20181002132637.30233-1-penberg@scylladb.com>	2018-10-03 13:30:08 +01:00
Piotr Sarna	4a23297117	cql3: add asking for pk/ck in the base query Base query partition and clustering keys are used to generate paging state for an index query, so they always need to be present when a paged base query is processed. Message-Id: <f3bf69453a6fd2bc842c8bdbd602d62c91cf9218.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	50d3de0693	cql3: add checking for may_need_paging when executing base query It's not sufficient to check for positive page_size when preparing a base query for indexed select statement - may_need_paging() should be called as well. Message-Id: <d435820019e4082a64ca9807541f0c9ad334e6a8.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	11b8831c04	cql3: move base query command creation to a separate function Message-Id: <6b48b8cbd6312da4a17bfd3c85af628b4215e9f4.1538568953.git.sarna@scylladb.com>	2018-10-03 13:26:51 +01:00
Piotr Sarna	4b4f57747a	cql3: remove execute(primary_keys) from select statement Right now, with specialized execute() that takes primary keys for indexed_table_select_statement, the original execute() method implemented in select_statement is not used anywhere, so it's removed.	2018-09-27 15:29:28 +02:00
Piotr Sarna	9e0b3cad1e	cql3: add incremental base queries to index query Base queries that are part of index queries are allowed to be short, which can result in wasted work - e.g. when we query all replicas in parallel, but have to discard most of the result, since the first one (in token order) resulted in a short read. Thus, we start by querying 1 range, check if the read is short, and if not, continue by querying 2x more ranges than before. Refs #2960	2018-09-27 15:29:28 +02:00
Piotr Sarna	5b16aeb395	cql3: add base query handling function to indexed statement Handling a base query during the indexed statement execution may require updating its paging state.	2018-09-27 15:29:28 +02:00
Piotr Sarna	bce7232555	cql3: add generating base key from index keys A function that computes base partition/clustering key from index view primary key is provided.	2018-09-27 15:29:28 +02:00
Piotr Sarna	2f085848d8	cql3: add paging state generation function For indexed queries, the paging state needs to be updated based on the results of base query when the read was short.	2018-09-27 15:29:28 +02:00
Piotr Sarna	f21bcbefdf	cql3: move getting index view schema to prepare stage Searching for index view schema for an indexed statement can be done once in prepare stage, so it's moved to indexed_table_select_statement prepare method.	2018-09-27 15:29:28 +02:00
Piotr Sarna	744ac3bf7b	cql3: rename set_has_more_pages to set_paging_state This function's primary goal is to set the paging state passed as a parameter, so its name is changed to match the semantics better.	2018-09-27 15:29:28 +02:00
Piotr Sarna	7c1e4c2deb	cql3: add paging to read_posting_list Instead of a single query, paging is used in order to query an index.	2018-09-27 15:18:06 +02:00
Piotr Sarna	430a49f91a	cql3: make find_index_* functions return paging state In order to implement secondary index paging, intermediary query functions now also return paging state for the view query.	2018-09-27 15:18:06 +02:00
Piotr Sarna	c3dd1775c8	cql3: make read_posting_list return future<rows> Instead of returning a coordinator result and making a caller parse it later, read_posting_list now extracts rows by itself. This change is later needed when querying is replaced with a pager.	2018-09-27 15:18:06 +02:00
Piotr Sarna	1d34ef38a8	cql3: make pagers use time_point instead of duration A standard way for passing a timeout parameter is specifying a time_point, while pagers used to take a duration in order to compute time points on the fly. This patch adds a timeout parameter, which is a time_point, to fetch_page().	2018-09-27 15:18:06 +02:00
Paweł Dziepak	a3746d3b05	paging: make may_need_paging() more conservative There is a bad interaction between may_need_paging() and query result size limiter. The former is trying to avoid the complexity of paged queries when the number of returned rows is going to be smaller than the page size. The latter uses the fact that paged queries need not return all requested rows to limit the size of a query results. Since may_need_paging() may turn a paged query into non-paged one as a side effect it disables the oversized result protection. This patch limits the cases when may_need_paging() disables paging to the situations when we know for sure that query result size limiter won't be needed, i.e.: the result is not going to contain more than one row. If the client knows for sure that the paging is not needed and the performance impact is worthwhile it can disable paging on its side. Otherwise, let's default to the safer behaviour. Fixes #3620. Message-Id: <20180925134431.24329-1-pdziepak@scylladb.com>	2018-09-25 17:01:04 +03:00
Eliran Sinvani	d743ceae76	cql3: ignore LIMIT in select statement with aggregate LIMIT should restrict the output result and not the query whose result set is aggregated. When using an aggregate, the output is guaranteed to be only one row long. Since LIMIT accepts only non-negative numbers, it has no effect and can be ignored. Fixes #2028 Tests: the test case described in the issue, unit tests. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <6c235376c81f052020e2ed23d0a3d071b36d4415.1534416997.git.eliransin@scylladb.com>	2018-08-16 19:31:56 +01:00
Duarte Nunes	1521dc56ae	Merge 'Pass query options to restrictions filter' from Piotr " This miniseries fixes ALLOW FILTERING support for prepared statements by passing correct query options to the filter instead of empty ones. " * 'pass_query_options_to_restrictions_filter' of https://github.com/psarna/scylla: tests: add testing prepared statements with ALLOW FILTERING cql3: pass query options to restrictions filter	2018-08-09 18:15:18 +01:00
Duarte Nunes	95677877c2	Merge 'JSON support fixes' from Piotr " This series addresses SELECT/INSERT JSON support issues, namely handling null values properly and parsing decimals from strings. It also comes with updated cql tests. Tests: unit (release) " * 'json_fixes_3' of https://github.com/psarna/scylla: cql3: remove superfluous null conversions in to_json_string tests: update JSON cql tests cql3: enable parsing decimal JSON values from string cql3: add missing return for dead cells cql3: simplify parsing optional JSON values cql3: add handling null value in to_json cql3: provide to_json_string for optional bytes argument	2018-08-09 18:05:34 +01:00
Piotr Sarna	cdbeed4e3b	cql3: simplify parsing optional JSON values With new to_json_string implementation that accepts bytes_opt, parsing optional values can be simplified to remove explicit branching.	2018-08-09 18:07:12 +02:00
Piotr Sarna	8c18aaa511	cql3: pass query options to restrictions filter Query options may contain bound values needed for checking filtering restrictions. Previously, empty query_options{} were used, which caused prepared statements to fail. Fixes #3677	2018-08-09 17:44:45 +02:00
Eliran Sinvani	3f2bb07599	cql3: Count unpaged select queries If the counter goes up this can be a possible reason for slowdown in queries (since it means that potentially a large amount of data will be sent to the client at once). Fixes #2478 Tests: cqlsh with PAGING OFF and ON and validating with a print. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Message-Id: <01253cee0b8c1110aaee3da41d1f434ca798b430.1533817568.git.eliransin@scylladb.com>	2018-08-09 13:53:44 +01:00
Rafi Einstein	123f2c2a1c	Add a counter for reverse queries Fixes #3492 Tests: dtest(cql_additional_tests.py) Message-Id: <20180729202615.22459-1-rafie@scylladb.com>	2018-07-30 12:34:43 +03:00
Paweł Dziepak	3e32245bb8	cql3: select statement: don't copy metadata if not needed	2018-07-26 12:37:20 +01:00
Piotr Sarna	6cc8ccc771	cql3: use clustering key prefix in index queries If an indexed query has partition+clustering key restrictions as well and at least some of these restrictions create a prefix, this prefix is used in the index query to narrow down the number of rows read. Refs #3611	2018-07-23 14:10:52 +02:00
Avi Kivity	761931659a	Merge "Do not linearise incoming CQL3 requests" from Paweł " This series changes the native CQL3 protocol layer so that it works with fragmented buffers instead of a single temporary_buffer per request. The main part is fragmented_temporary_buffer which represents a fragmented buffer consisting of multiple temporary_buffers. It provides helpers for reading a fragmented buffer from an input_stream, interpreting the data in the fragmented buffer, as well as views that satisfy the FragmentRange concept. There are still situations where a fragmented buffer is linearised. That includes decompressing client requests (this uses reusable buffers in a similar way to the code that sends compressed responses), CQL statement restrictions and values that are hard-coded in prepared statements (hopefully, the values in those cases will be small), value validation in some cases (blobs are not validated, irrelevant for many fixed-size small types, but may be a problem for large text cells) as well as operations on collections.
Tests: unit(release), dtests(cql_prepared_test.py, cql_tests.py, cql_additional_tests.py) " * tag 'fragmented-cql3-receive/v1' of https://github.com/pdziepak/scylla: (23 commits) types: bytes_view: override fragmented validate() cql3: value_view: switch to fragmented_temporary_buffer::view types: add validate that accepts fragmented_temporary_buffer::view cql3 query_options: add linearize() cql3: query_options: use bytes_ostream for temporaries cql3: operation: make make_cell accept fragmented_temporary_buffer::view atomic_cell: accept fragmented_temporary_buffer::view values cql3: avoid ambiguity in a call to update_parameters::make_cell() transport: switch to fragmented_temporary_buffer transport: extract compression buffers from response class tests/reusable_buffer: test fragmented_temporary_buffer support utils: reusable_buffer: support fragmented_temporary_buffer tests: add test for fragmented_temporary_buffer util fragment_range: add general linearisation functions utils: add fragmented_temporary_buffer tests: add basic test for transport requests and responses tests/random-utils: print seed tests/random-utils: generate sstrings cql3: add value_view printer and equality comparison transport: move response outside of cql_server class ...	2018-07-22 19:40:37 +03:00
Piotr Sarna	2542630a18	cql3: use primary key restrictions in filtering index queries If both index and partition key is used in a query, it should not require filtering, because indexed query can be narrowed down with partition key information. This commit appends partition key restrictions to index query.	2018-07-18 18:45:08 +02:00
Paweł Dziepak	0b9eed72f4	cql3: value_view: switch to fragmented_temporary_buffer::view	2018-07-18 12:28:06 +01:00
Piotr Sarna	7d9715db27	cql3: use single restriction value in index creation ALLOW FILTERING support caused index-related restrictions to possibly have more values. In order to remain correct, only those restrictions which match the indexed columns should be used.	2018-07-11 18:06:21 +02:00
Piotr Sarna	03f2f8633b	cql3: add updating ALLOW FILTERING metrics Metrics related to ALLOW FILTERING queries are now properly updated on read requests.	2018-07-06 12:00:29 +02:00
Piotr Sarna	27bf20aa3f	cql3: enable ALLOW FILTERING Enables 'ALLOW FILTERING' queries by transferring control to result_set_builder::filtering_visitor. Both regular and primary key columns are allowed, but some things are left unimplemented: - multi-column restrictions - CONTAINS queries Fixes #2025	2018-07-05 10:50:43 +02:00
Paweł Dziepak	2b1fcfe019	cql3: select_statement: use fetch_page_generator() if possible	2018-06-25 09:21:47 +01:00
Paweł Dziepak	fa5dea91e7	cql3: select_statement: use result_generator if possible	2018-06-25 09:21:47 +01:00
Paweł Dziepak	dca68afce6	cql3: add result class So far the only way of returning a result of a CQL query was to build a result_set. An alternative lazy result generator is going to be introduced for the simple cases when no transformations at CQL layer are needed. To do that we need to hide the fact that there are going to be multiple representations of a CQL result from the users.	2018-06-25 09:21:47 +01:00
Avi Kivity	fdfc347595	cql: make select_statement execution_stage scheduling aware Inherit scheduling from the caller, preventing a fall back into the main group.	2018-06-18 18:30:21 +03:00
Piotr Sarna	70ba8c8317	cql3: update token order comments Comments about token order were outdated with token column patches and they are now up to date. Fixes #3423	2018-06-06 09:02:37 +02:00
Avi Kivity	b70febe246	cql: cql_statement: remove execute_internal() With no callers, it can be safely removed.	2018-05-27 12:40:27 +03:00
Avi Kivity	eb19798f99	cql: select_statement: make execute() and execute_internal() equivalent execute_internal(), for some code paths, differs from execute by the following: 1. it uses CL_ONE unconditionally 2. it has no query timeout 3. it doesn't use execution stages for other code paths, it just calls execute. As preparation for getting rid of execute_internal(), unify the two code paths. Commit `4859b759b9` caused the consistency level and timeouts to be provided by the caller, so using the caller provided parameters instead of overriding them does not change behavior.	2018-05-27 12:36:02 +03:00
Nadav Har'El	1b29dd44f7	secondary index: fix wrong results returned in certain cases The current secondary-index search code, in indexed_table_select_statement::do_execute(), begins by fetching a list of partitions, and then the content of these partitions from the base table. However, in some cases, when the table has clustering columns and not searching on the first one of them, doing this work in partition granularity is wrong, and yields wrong results as demonstrated in issue #3405. So in this patch, we recognize the cases where we need to work in clustering row granularity, and in those cases use the new functions introduced in the previous patches - find_index_clustering_rows() and the execute() variant taking a list of primary-keys of rows. Fixes #3405. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:56:03 +03:00
Nadav Har'El	adf6d742be	secondary index: method for fetching list of rows from base table We add a new variant of select_statement::execute() which allows selecting an arbitrary list of clustering rows. The existing execute() variant can't do that - it can only take a list of partitions, and read the same clustering rows from all of them. The new select variant is not needed for regular CQL queries (which do not have a syntax allowing reading a list of rows with arbitrary primary keys), but we will need it for secondary index search, for solving issue #3405. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:54:36 +03:00
Nadav Har'El	a096a82adc	secondary index: method for fetching list of rows from index We already have a method find_index_partition_ranges(), to fetch a list of partition keys from the secondary index. However, as we shall see in the following patches (and see also issue #3405), getting a list of entire partitions is not always enough - the secondary index actually holds a list of primary keys, which includes clustering keys, and in some queries we can't just ignore them. So this patch provides a new method find_index_clustering_rows(), to query the secondary index and get a list of matching clustering keys. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:53:29 +03:00
Nadav Har'El	083b2ae573	select_statement.cc: refactor find_index_partition_ranges() The function find_index_partition_ranges() is used in secondary index searches for fetching a list of matching partitions. In a following patch, we want to add a similar function for getting a list of rows. To avoid duplicate code, in this patch we split parts of find_index_partition_ranges() into two new functions: 1. get_index_schema() returns a pointer to the index view's schema. 2. read_posting_list() reads from this view the posting list (i.e., list of keys) for the current searched value. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:50:45 +03:00
Nadav Har'El	7dc9b77682	select_statement.cc: fix variable lifetime errors do_with() provides code a reference to an object which will be kept alive. It is a mistake to make a copy of this object or of parts of it, because then the lifetime of this copy will have to be maintained as well. In particular, it is a mistake to do do_with(..., [] (auto x) { ... }) - note how "auto x" appears instead of the correct "auto& x". This causes the object to be copied, and its lifetime not maintained. This patch fixes several cases where this rule was broken in select_statement.cc. I could not reproduce actual crashes caused by these mistakes, but in theory they could have happened. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2018-05-24 15:46:12 +03:00
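
The rule described in commit 7dc9b77682 (take `auto& x`, not `auto x`, in a do_with() callback) can be illustrated with a small standalone sketch. It does not use Seastar: `keep_alive_and_call`, `tracked`, and the two helper functions are hypothetical names invented here, and the helper is a synchronous imitation of the idea only (keep an object alive and hand the callback a reference). The sketch only shows that a by-value parameter silently creates a second object, which is the copy whose lifetime nobody maintains.

```cpp
#include <cassert>
#include <memory>
#include <type_traits>
#include <utility>

// Counts copy constructions of `tracked` (illustration only).
static int copy_count = 0;

// Hypothetical value type that records when it is copied.
struct tracked {
    int value = 0;
    explicit tracked(int v) : value(v) {}
    tracked(const tracked& o) : value(o.value) { ++copy_count; }
    tracked(tracked&&) noexcept = default;
};

// Simplified, synchronous stand-in for the do_with() idea: move the object
// into heap storage, keep it alive for the call, and pass the callback a
// reference to it. This is NOT the Seastar implementation.
template <typename T, typename Func>
auto keep_alive_and_call(T&& obj, Func&& func) {
    auto holder = std::make_unique<std::decay_t<T>>(std::forward<T>(obj));
    return func(*holder);
}

// Callback takes its argument by value: the kept-alive object is copied.
inline int copies_made_by_value() {
    copy_count = 0;
    keep_alive_and_call(tracked{42}, [] (auto x) { return x.value; });
    return copy_count;
}

// Callback takes its argument by reference: no copy is made.
inline int copies_made_by_reference() {
    copy_count = 0;
    keep_alive_and_call(tracked{42}, [] (auto& x) { return x.value; });
    return copy_count;
}
```

In asynchronous code the by-value copy is the dangerous one: continuations that capture references into it may run after the copy has been destroyed, while the object kept alive by the helper goes unused.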
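
The incremental strategy described in commit 9e0b3cad1e (query 1 range, and if the read is not short, query twice as many ranges next time) can be sketched as a batching loop. This is a minimal illustration under assumptions, not the code from the commit: `plan_batches` and the `is_short_read` callback are hypothetical names, and the sketch assumes that a short read ends the loop because the page is already full.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Returns the sizes of the range batches that would be issued.
// `is_short_read(begin, end)` is a hypothetical callback reporting whether
// querying ranges [begin, end) ended in a short read.
std::vector<std::size_t> plan_batches(
        std::size_t total_ranges,
        const std::function<bool(std::size_t, std::size_t)>& is_short_read) {
    std::vector<std::size_t> batches;
    std::size_t next = 0;         // index of the first range not yet queried
    std::size_t concurrency = 1;  // start with a single range
    while (next < total_ranges) {
        const std::size_t count = std::min(concurrency, total_ranges - next);
        batches.push_back(count);
        if (is_short_read(next, next + count)) {
            break;  // assumed: a short read means the page is full, so stop
        }
        next += count;
        concurrency *= 2;  // not short: query 2x more ranges than before
    }
    return batches;
}
```

Starting small bounds the work that is thrown away when the first range already yields a short read, while doubling keeps the number of round trips logarithmic in the number of ranges when reads are not short.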


153 Commits