scylladb

Author	SHA1	Message	Date
Botond Dénes	8c0dd99f7c	query: result_merger::get() don't reset last-pos on short-reads and last pages When merging multiple query-results, we use the last-position of the last result in the combined one as the combined result's last position. This only works however if said last result was included fully. Otherwise we have to discard the last-position included with the result and the pager will use the position of the last row in the combined result as the last position. The commit introducing the above logic mistakenly discarded the last position when the result is a short read or a page is not full. This is not necessary and even harmful as it can result in an empty combined result being delivered to the pager, without a last-position.	2022-08-10 06:01:49 +03:00
Botond Dénes	014c5b56a3	query-result: move last_pos up to query::result query_result was the wrong place to put last position into. It is only included in data-responses, but not on digest-responses. If we want to support empty pages from replicas, both data and digest responses have to include the last position. So hoist up the last position to the parent structure: query::result. This is a breaking change inter-node ABI wise, but it is fine: the current code wasn't released yet. Closes #11072	2022-07-20 13:28:09 +03:00
Jadw1	29a0be75da	forward_service: support UDA and native aggregate parallelization Enables parallelization of UDA and native aggregates. The way the query is parallelized is the same as in #9209. Separate reduction type for `COUNT(*)` is left for compatibility reason.	2022-07-18 15:25:41 +02:00
Botond Dénes	fd5f8f2275	query: have replica provide the last position Use the recently introduced query-result facility to have the replica set the position where the query should continue from. For now this is the same as what the implicit position would have been previously (last row in result), but it opens up the possibility to stop the query at a dead row.	2022-06-23 13:36:24 +03:00
Botond Dénes	009d2fe2f7	idl/query: add last_position to query_result To be used to allow the replica to specify the last position in the stream, where the query was left off. Currently this is always the same as the implicit position -- the last row in the result-set -- but this requires only stopping the read on a live row, which is a requirement we want to lift: we want to be able to stop on a tombstone. As tombstones are not included in the query result, we have to allow the replica to overwrite the last seen position explicitly. This patch introduces the new field in the query-result IDL but it is not written to yet, nor is it read, that is left for the next patches.	2022-06-23 13:36:24 +03:00
Michał Sala	538cff651e	query: do not assert in `operator<<(ostream&, const forward_result::printer&)` Printing invalid forward_result should not cause Scylla to stop.	2022-03-09 14:58:11 +01:00
Michał Sala	51362e4e5e	query: transform asserts into on_internal_error in forward_result::merge It was done to show more context in case of forward_result::merge arguments size mismatch and also to prevent aborts caused by another nodes sending malformed data.	2022-03-09 14:58:11 +01:00
Michał Sala	fff454761a	messaging_service: add verb for count() request forwarding Except for the verb addition, this commit also defines forward_request and forward_result structures, used as an argument and result of the new rpc. forward_request is used to forward information about select statement that does count() (or other aggregating functions such as max, min, avg in the future). Due to the inability to serialize cql3::statements::select_statement, I chose to include query::read_command, dht::partition_range_vector and some configuration options in forward_request. They can be serialized and are sufficient enough to allow creation of service::pager::query_pagers::pager.	2022-02-01 21:14:41 +01:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Tomasz Grabiec	3226c5bf9d	Merge 'sstables: mx: enable position fast-forwarding in reverse mode' from Kamil Braun Most of the machinery was already implemented since it was used when jumping between clustering ranges of a query slice. We need only perform one additional thing when performing an index skip during fast-forwarding: reset the stored range tombstone in the consumer (which may only be stored in fast-forwarding mode, so it didn't matter that it wasn't reset earlier). Comments were added to explain the details. As a preparation for the change, we extend the sstable reversing reader random schema test with a fast-forwarding test and include some minor fixes. Fixes #9427. Closes #9484 * github.com:scylladb/scylla: query-request: add comment about clustering ranges with non-full prefix key bounds sstables: mx: enable position fast-forwarding in reverse mode test: sstable_conforms_to_mutation_source_test: extend `test_sstable_reversing_reader_random_schema` with fast-forwarding test: sstable_conforms_to_mutation_source_test: fix `vector::erase` call test: mutation_source_test: extract `forwardable_reader_to_mutation` function test: random_schema: fix clustering column printing in `random_schema::cql`	2021-11-29 16:01:53 +01:00
Kamil Braun	ea6310961c	test: sstable_conforms_to_mutation_source_test: extend `test_sstable_reversing_reader_random_schema` with fast-forwarding The test would check whether the forward and reverse readers returned consistent results when created in non-forwarding mode with slicing. Do the same but using fast-forwarding instead of slicing. To do this we require a vector of `position_range`s. We also need a vector of `clustering_range`s for the existing test. We modify the existing `random_ranges` function to return `position_range`s instead of `clustering_range`s since `position_range`s are easier to reason about, especially when we consider non-full clustering key prefixes. A function is introduced to convert a `position_range` to a `clustering_range` for the existing test.	2021-11-29 11:10:46 +01:00
Botond Dénes	15af80800a	partition_slice: init all fields in copy ctor _partition_row_limit_high_bits was left out for some reason, corrupting the per-partition row limit.	2021-11-23 14:21:50 +02:00
Botond Dénes	c372b9676d	partition_slice: operator<<: print the entire partition row limit Not just the low bits.	2021-11-23 14:21:50 +02:00
Michał Radwański	dac2509a7f	query: reverse clustering_range	2021-10-05 16:47:04 +02:00
Kamil Braun	4bd601c6fd	query-request: introduce `half_reverse_slice` A utility function for converting between forward and half-reversed (or 'legacy'-reversed) slices to be used in the next commit.	2021-09-28 17:03:57 +03:00
Botond Dénes	6b5936812f	reverse_slice(): toggle reversed bit instead of setting it reverse_slice() works in both direction: reverse <-> forward, so it cannot unconditionally set the reversed bit, instead it should toggle it.	2021-09-13 18:05:11 +03:00
Botond Dénes	5d33d76cfd	query: add slice reversing functions	2021-09-09 14:18:32 +03:00
Tomasz Grabiec	9fe3e86368	db: Print more fields of read_command Message-Id: <20210810143752.420988-1-tgrabiec@scylladb.com>	2021-08-17 12:24:40 +03:00
Tomasz Grabiec	e177cd382b	db: Remove superfluous } from read_command printout Message-Id: <20210810131429.407903-1-tgrabiec@scylladb.com>	2021-08-10 17:32:34 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Wojciech Mitros	45215746fe	increase the maximum size of query results to 2^64 Currently, we cannot select more than 2^32 rows from a table because we are limited by types of variables containing the numbers of rows. This patch changes these types and sets new limits. The new limits take effect while selecting all rows from a table - custom limits of rows in a result stay the same (2^32-1). In classes which are being serialized and used in messaging, in order to be able to process queries originating from older nodes, the top 32 bits of new integers are optional and stay at the end of the class - if they're absent we assume they equal 0. The backward compatibility was tested by querying an older node for a paged selection, using the received paging_state with the same select statement on an upgraded node, and comparing the returned rows with the result generated for the same query by the older node, additionally checking if the paging_state returned by the upgraded node contained new fields with correct values. Also verified if the older node simply ignores the top 32 bits of the remaining rows number when handling a query with a paging_state originating from an upgraded node by generating and sending such a query to an older node and checking the paging_state in the reply(using python driver). Fixes #5101.	2020-08-03 17:32:49 +02:00
Botond Dénes	c364c7c6a2	result_memory_limiter: add unlimited_result_size constant To be used as the max result size for internal queries.	2020-07-28 18:00:29 +03:00
Konstantin Osipov	191acec7ab	schema: rename column_mask to column_set Since it contains a precise set of columns, it's more accurate to call it a set, not a mask. Besides, the name column_mask is already used for column options on storage level.	2019-11-13 11:41:30 +03:00
Konstantin Osipov	c0f0ab5edd	lwt: introduce column mask Introduce a bitset container which can be used to compute all columns used in a query. Add a partition_slice constructor which uses the bitset.	2019-10-16 22:40:55 +03:00
Botond Dénes	4cb873abfe	query::trim_clustering_row_ranges_to(): fix handling of non-full prefix keys Non-full prefix keys are currently not handled correctly as all keys are treated as if they were full prefixes, and therefore they represent a point in the key space. Non-full prefixes however represent a sub-range of the key space and therefore require null extending before they can be treated as a point. As a quick reminder, `key` is used to trim the clustering ranges such that they only cover positions >= then key. Thus, `trim_clustering_row_ranges_to()` does the equivalent of intersecting each range with (key, inf). When `key` is a prefix, this would exclude all positions that are prefixed by key as well, which is not desired. Fixes: #4839 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190819134950.33406-1-bdenes@scylladb.com>	2019-08-20 00:24:51 +02:00
Botond Dénes	87973498a1	query: refactor trim_clustering_row_ranges_to() Allow expressing `pos` in term of a `position_in_partition_view`, which allows finer control of the exact position, allowing specifying position before, at or after a certain key. The previous overload is kept for backward compatibility, invoking the new overload behind the curtains.	2019-08-13 09:47:55 +03:00
Botond Dénes	181bf64858	query: add trim_clustering_row_ranges_to() This algorithm was already duplicated in two places (service/pager/query_pagers.cc and mutation_reader.cc). Soon it will be used in a third place. Instead of triplicating, move it into a function that everybody can use.	2019-02-08 16:30:17 +02:00
Paweł Dziepak	9024187222	partition_slice: use small_vector for column_ids	2018-12-06 14:21:04 +00:00
Avi Kivity	a71ab365e3	toplevel: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Duarte Nunes	baeec0935f	Replace query::full_slice with schema::full_slice() query::full_slice doesn't select any regular or static columns, which is at odds with the expectations of its users. This patch replaces it with the schema::full_slice() version. Refs #2885 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1507732800-9448-2-git-send-email-duarte@scylladb.com>	2017-10-17 11:25:53 +02:00
Duarte Nunes	3b9a9b7321	query-result: Send row and partition count over the wire To avoid calculating them on the coordinator side. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-08-14 10:29:06 +02:00
Duarte Nunes	d7bab684ea	query::result: Optimize calculate_counts() Now that range queries go through the normal digest path, we rely on query::result::calculate_counts() to count the amount of partitions and rows returned. This patch makes it a bit faster. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-08-14 10:28:29 +02:00
Tomasz Grabiec	0073df30aa	query: Introduce full_clustering_range	2017-02-23 18:50:53 +01:00
Duarte Nunes	21d1bbb527	view: Add may_be_affected_by function This patch adds the may_be_affected_by() function to the view class, which is responsible to determine whether an update to a base class affects one of its views. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-02-06 13:35:30 +01:00
Paweł Dziepak	a7d694654a	query: make result_memory_limiter constants available for linker	2016-12-22 13:35:04 +01:00
Paweł Dziepak	38ee69dee0	idl: allow writers to use any output stream Original IDL generated code was hardcoded to always use bytes_ostream. This patch makes the output stream a template parameter so that any valid output stream can be used. Unfortunately, making IDL writers generic requires updates in the code that uses them, this is fixed in C++17 which would be able to deduce the parameter in most cases.	2016-12-22 13:35:04 +01:00
Asias He	e5485f3ea6	Get rid of query::partition_range Use dht::partition_range instead	2016-12-19 08:09:25 +08:00
Duarte Nunes	fee0b7fa48	query_result_merger: Limit rows This patch makes the row limit enforced by the storage_proxy layer. It adds a row limit to the query_result_merger, useful when merging results for concurrent queries. More importantly, it provides guarantees that upper layers may be relying on implicitly (e.g., the paging code). Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 11:00:36 +00:00
Duarte Nunes	9572c19dc6	storage_proxy: Don't fetch superfluous partitions This patch ensures we keep track of how many partitions we've queried so we don't ask for more than the number we need. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 10:27:46 +00:00
Duarte Nunes	93be8d7cef	query::result: Add partition count This patch adds a partition count to query::result, filled by the query::result::builder. The partition count is present whenever the result carries data, being absent only for the case where the result contains only a digest. We also ensure that counts are present for an empty query::result. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 10:27:46 +00:00
Duarte Nunes	108011a839	query_result_merger: Limit partitions This patch adds a partition limit to the query_result_merger, useful when merging results for concurrent queries. This change also makes the partition limit enforced by the storage_proxy layer, no changes being needed by the upper layers, namely the Thrift interface. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-12-15 10:27:41 +00:00
Paweł Dziepak	ee89d80d5c	query: add result size limiter This patch introduces an infrastrucutre for limiting result size. There is a shard-local limit which makes sure that all results combined do not use more than 10% of the shard memory. There is also an invidual limit which restricts a result to 4 MB. In order In order to avoid sending tiny results there is minimum guaranteed size (4 kB), which the query needs to reserve before it starts producing the result. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-12-14 14:10:02 +00:00
Paweł Dziepak	da7ca85040	query: allow short reads When paging is used the cluster is allowed to return less rows than the client asked for. However, if such possibility is used we need a way of telling that to the coordinator and the paging implementation so that they can differentiate between short reads caused by the replica running out of data to sent and short reads caused by any other means. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-12-14 14:10:01 +00:00
Raphael S. Carvalho	768aced741	partition_slice: introduce key-independent function to get ranges That will be important for sstable code that will rule out a sstable if it doesn't cover a given clustering key range. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-09-02 10:50:56 -03:00
Paweł Dziepak	3fe5ed3cd9	query: use result_view::consume() where appropriate Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Duarte Nunes	aaa76d58ba	query: Move to_partition_range to dht namespace This patch moves to_partition_range, from the query namespace to the dht namespace, where it is a more natural fit. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1468498060-19251-1-git-send-email-duarte@scylladb.com>	2016-07-15 10:41:52 +02:00
Paweł Dziepak	3c08ffb275	query: add full_slice query::full_slice is a partiton slice which has full clustering row ranges for all partition keys and no per-partition row limit. Options and columns are not set. It is used as a helper object in cases when a reference to partition_slice is needed but the user code needs just all data there is (an example of such case would be sstable compaction). Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-30 11:37:54 +01:00
Duarte Nunes	69798df95e	query: Limit number of partitions returned This is required to implement a thrift verb. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-22 09:48:13 +02:00
Duarte Nunes	01b18063ea	query: Add per-partition row limit This patch as a per-partition row limit. It ensures both local queries and the reconciliation logic abide by this limit. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-22 09:46:51 +02:00
Gleb Natapov	db322d8f74	query: put live row count into query::result The patch calculates row count during result building and while merging. If one of results that are being merged does not have row count the merged result will not have one either.	2016-05-02 15:10:15 +03:00

1 2

83 Commits