scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-29 04:37:00 +00:00

Author	SHA1	Message	Date
Benny Halevy	e2340d0684	config: enable_sstables_md_format by default Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 19:19:32 +03:00
Benny Halevy	0d85ceaf37	test: cql_query_test: add test_clustering_filtering unit tests Add unit tests reproducing https://github.com/scylladb/scylla/issues/3552 with clustering-key filtering enabled. enable_sstables_md_format option is set to true as clustering-key filtering is enabled only for md-format sstables. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 19:19:32 +03:00
Benny Halevy	65239a6e50	config: add enable_sstables_md_format MD format is disabled by default at this point. The option extends enable_sstables_mc_format so that both are needed to be set for supporting the md format. The MD_FORMAT cluster feature will be added in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Benny Halevy	8e0e2c8a48	database: add set_format_by_config This is required for test applications that may select a sstable format different than the default mc format, like perf_fast_forward. These apps don't use the gossip-based sstables_format_selector to set the format based on the cluster feature and so they need to rely on the db config. Call set_format_by_config in single_node_cql_env::do_with. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Benny Halevy	d77ceba498	test: sstable_3_x_test: test both mc and md versions Run the test cases that write sstables using both the mc and md versions. Note that we can still compare the resulting Data, Index, Digest, and Filter components with the prepared mc sstables we have since these haven't changed in md. We take special consideration around validating min/max column names that are now calculated using a revised algorithm in the md format. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Pekka Enberg	3168be3483	test: Add support for the "md" format Test also the md format in all_sstable_versions. Add pre-computed md-sstable files generated using Cassandra version 3.11.7 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Benny Halevy	bd4383a842	sstables: mx/writer: update_min_max_components for partition tombstone Partition tombstones represent an implicit clustering range that is unbound on both sides, so reflect than in min/max column names metadata using empty clustering key prefixes. If we don't do that, when using the sstable for filtering, we have no other way of distinguishing range tombstones from partition tombstones given the sstable metadata and we would need to include any sstable with tombstones, even if those are range tombstone, for which we can do a better filtering job, using the sstable min/max column names metadata. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Benny Halevy	68acae5873	sstables: metadata_collector: support min_max_components for range tombstones We essentially treat min/max column names as range bounds with min as incl_start and max as incl_end. By generating a bound_view for min/max column names on the fly, we can correctly track and compare also short clustering key prefixes that may be used as bounds for range tombstones. Extend the sstable_tombstone_metadata_check unit test to cover these cases. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Benny Halevy	12393c5ec2	sstables: rename mc folder to mx Prepare for supporting the md format. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Pekka Enberg	a37eaaa022	sstables: Add support for the "md" format enum value Add the sstable_version_types::md enum value and logically extend sstable_version_types comparisons to cover also the > sstable_version_types::mc cases. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:04 +03:00
Benny Halevy	9f114d821a	sstables: keep whole clustering_key_prefix as min/max_column_names Currently we compare each min/max component independently. This may lead to suboptimal, inclusive clustering ranges that do not indicate any actual key we encountered. For example: ['a', 2], ['b', 1] will lead to min=['a', 1], max=['b', 2] instead of the keys themselves. This change keeps the min or max keys as a whole. It considers shorter clustering prefixes (that are possible with compact storage) as range tombstone bounds, so that a shorter key is considered less than the minimum if the latter has a common prefix, and greater than the maximum if the latter has a common prefix. Extend the min_max_clustering_key_test to test for this case. Previously {"a", "2"}, {"b", "1"} clustering keys would erronuously end up with min={"a", "1"} max={"b", "2"} while we want them to be min={"a", "2"} max={"b", "1"}. Adjust sstable_3_x_test to ignore original mc sstables that were previously computed with different min/max column names. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:53:03 +03:00
Benny Halevy	c9cade833c	sstables: metadata_collector: make only for write path make a metadata_collector only when writing the sstable, no need to make one when reading. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-10 18:51:12 +03:00
Nadav Har'El	f8291500cf	alternator test: test for ConditionExpression on key columns While answering a stackoverflow question on how to create an item but only if we don't already have an item with the same key, I realized that we never actually tested that ConditionExpressions works on key columns: all the tests we had in test_condition_expression.py had conditions on non-key attributes. So in this patch we add two tests with a condition on the key attribute. Most examples of conditions on the key attributes would be silly, but in these two tests we demonstrate how a test on key attributes can be useful to solve the above need of creating an item if no such item exists yet. We demonstrate two ways to do this using a condition on the key - using either the "<>" (not equal) operator, or the "attribute_not_exists()" function. These tests pass - we don't have a bug in this. But it's nice to have a test that confirms that we don't (and don't regress in that area). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200806200322.1568103-1-nyh@scylladb.com>	2020-08-07 08:05:48 +02:00
Avi Kivity	ba4d7e8523	Merge "Teach B+ to use AVX for keys search" from Pavel E " The current implementation of B+ benefits from using SIMD instruction in intra-nodes keys search. This set adds this functionality. The general idea behind the implementation is in "asking" the less comparator if it is the plain "<" and allows for key simplification to do this natural comparison. If it does, the search key is simplified to int64_t, the node's array of keys is casted to array of integers, then both are fed into avx-optimized searcher. The searcher should work on nodes that are not filled with keys. For performance the "unused" keys are set to int64_t minimum, the search loop compares them too (!) and adjusts the result index by node size. This needs some care in the maybe_key{} wrapper. fixes: #186 tests: unit(dev) " * 'br-bptree-avx-b' of https://github.com/xemul/scylla: utils: AVX searcher bptree: Special intra-node key search when possible bptree: Add lesses to maybe_key template token: Restrict TokenCarrier concept with noexcept	2020-08-06 19:14:46 +03:00
Rafael Ávila de Espíndola	b1315a2120	cql_test_env: Delay starting the compaction manager In case of an initialization failure after db.get_compaction_manager().enable(); But before stop_database, we would never stop the compaction manager and it would assert during destruction. I am trying to add a test for this using the memory failure injector, but that will require fixing other crashes first. Found while debugging #6831. Refs #6831. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200805181840.196064-1-espindola@scylladb.com>	2020-08-06 16:07:16 +03:00
Pavel Emelyanov	35a22ac48a	bptree: Special intra-node key search when possible If the key type is int64_t and the less-comparator is "natural" (i.e. it's literally 'a < b') we may use the SIMD instructions to search for the key on a node. Before doing so, the maybe_key and the searcher should be prepared for that, in particular: 1. maybe_key should set unused keys to the minimal value 2. the searcher for this case should call the gt() helper with primitive types -- int64_t search key and array of int64_t values To tell to B+ code that the key-less pair is such the less-er should define the simplify_key() method converting search keys to int64_t-s. This searcher is selected automatically, if any mismatch happens it silently falls back to default one. Thus also add a static assertion to the row-cache to mitigate this. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-06 15:41:31 +03:00
Nadav Har'El	8f12ef3628	alternator test: faster stream tests by reducing number of vnodes The single-node test Scylla run by test/alternator/run uses, as the default, 256 vnodes. When we have 256 vnodes and two shards, our CDC implementation produces 512 separate "streams" (called "shards" in DynamoDB lingo). This causes each of the tests in test_streams.py which need to read data from the stream to need to do 1024 (!) API requests (512 calls to GetShardIterator and 512 calls to GetRecords) which takes significant time - about a second per test. In this patch, we reduce the number of vnodes to 16. We still have a non-negligible number of stream "shards" (32) so this part of the CDC code is still exercised. Moreover, to ensure we still routinely test the paging feature of DescribeStream (whose default page size is 100), the patch changes the request to use a Limit of 10, so paging will still be used to retrieve the list of 32 shards. The time to run the 27 tests in test_streams.py, on my laptop: Before this patch: 26 seconds After this patch: 6 seconds. Fixes #6979 Message-Id: <20200805093418.1490305-1-nyh@scylladb.com>	2020-08-05 19:34:18 +03:00
Nadav Har'El	59dff3226b	Alternator tests: more tests for Alternator Streams This patch adds additional tests for Alternator Streams, which helped uncover 9 new issues. The stream tests are noticibly slower than most other Alternator tests - test_streams.py now has 27 tests taking a total of 20 seconds. Much of this slowness is attributed to Alternator Stream's 512 "shards" per stream in the single-node test setup with 256 vnodes, meaning that we need over 1000 API requests per test using GetRecords. These tests could be made significantly faster (as little as 4 seconds) by setting a lower number of vnodes. Issue #6979 is about doing this in the future. The tests in this patch have comments explaining clearly (I hope) what they test, and also pointing to issues I opened about the problems discovered through these tests. In particular, the tests reproduce the following bugs: Refs #6918 Refs #6926 Refs #6930 Refs #6933 Refs #6935 Refs #6939 Refs #6942 The tests also work around the following issues (and can be changed to be more strict and reproduce these issues): Refs #6918 Refs #6931 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200804154755.1461309-1-nyh@scylladb.com>	2020-08-05 19:34:18 +03:00
Avi Kivity	e994275f80	Merge "auth: Avoid more global variable initializations" from Rafael " This patch series converts a few more global variables from sstring to constexpr std::string_view. Doing that makes it impossible for them to be part of any initialization order problems. " * 'espindola/more-constexpr-v2' of https://github.com/espindola/scylla: auth: Turn DEFAULT_USER_NAME into a std::string_view variable auth: Turn SALTED_HASH into a std::string_view variable auth: Turn meta::role_members_table::qualified_name into a std::string_view variable auth: Turn meta::roles_table::qualified_name into a std::string_view variable auth: Turn password_authenticator_name into a std::string_view variable auth: Inline default_authorizer_name into only use auth: Turn allow_all_authorizer_name into a std::string_view variable auth: Turn allow_all_authenticator_name into a std::string_view variable	2020-08-05 10:54:13 +03:00
Rafael Ávila de Espíndola	27c2b3de30	auth: Turn password_authenticator_name into a std::string_view variable Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-08-04 16:40:00 -07:00
Rafael Ávila de Espíndola	0da6807f7e	auth: Turn allow_all_authenticator_name into a std::string_view variable There is no constexpr operator+ for std::string_view, so we have to concatenate the strings ourselves. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-08-04 16:38:27 -07:00
Nadav Har'El	936cf4cce0	merge: Increase row limits Merged pull request https://github.com/scylladb/scylla/pull/6910 by Wojciech Mitros: This patch enables selecting more than 2^32 rows from a table. The change becomes active after upgrading whole cluster - until then old limits are used. Tested reading 4.5*10^9 rows from a virtual table, manually upgrading a cluster with ccm and performing cql SELECT queries during the upgrade, ran unit tests in dev mode and cql and paging dtests. tests: add large paging state tests increase the maximum size of query results to 2^64	2020-08-04 19:52:30 +03:00
Wojciech Mitros	4863e8a11f	tests: add large paging state tests Add a unit test checking if the top 32 bits of the number of remaining rows in paging state is used correctly and a manual test checking if it's possible to select over 2^32 rows from a table and a virtual reader for this table.	2020-08-04 18:44:29 +02:00
Rafael Ávila de Espíndola	ef0bed7253	Drop duplicated 'if' in comment Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200730170109.5789-1-espindola@scylladb.com>	2020-08-04 07:53:34 +03:00
Wojciech Mitros	45215746fe	increase the maximum size of query results to 2^64 Currently, we cannot select more than 2^32 rows from a table because we are limited by types of variables containing the numbers of rows. This patch changes these types and sets new limits. The new limits take effect while selecting all rows from a table - custom limits of rows in a result stay the same (2^32-1). In classes which are being serialized and used in messaging, in order to be able to process queries originating from older nodes, the top 32 bits of new integers are optional and stay at the end of the class - if they're absent we assume they equal 0. The backward compatibility was tested by querying an older node for a paged selection, using the received paging_state with the same select statement on an upgraded node, and comparing the returned rows with the result generated for the same query by the older node, additionally checking if the paging_state returned by the upgraded node contained new fields with correct values. Also verified if the older node simply ignores the top 32 bits of the remaining rows number when handling a query with a paging_state originating from an upgraded node by generating and sending such a query to an older node and checking the paging_state in the reply(using python driver). Fixes #5101.	2020-08-03 17:32:49 +02:00
Calle Wilund	30a700c5b0	system_keyspace: Remove support for legacy truncation records Fixes #6341 Since scylla no longer supports upgrading from a version without the "new" (dedicated) truncation record table, we can remove support for these and the migtration thereof. Make sure the above holds whereever this is committed. Note that this does not remove the "truncated_at" field in system.local.	2020-08-03 17:16:26 +03:00
Nadav Har'El	2dcb6294da	merge: cdc: New delta modes: `off`, `keys`, `fulll` Merged pull request https://github.com/scylladb/scylla/pull/6914 by By Juliusz Stasiewicz: The goal is to have finer control over CDC "delta" rows, i.e.: disable them totally (mode off); record only base PK+CK columns (mode keys); make them behave as usual (mode full, default). The editing of log rows is performed at the stage of finishing CDC mutation. Fixes #6838 tests: Added CQL test for `delta mode` cdc: Implementations of `delta_mode::off/keys` cdc: Infrastructure for controlling `delta_mode`	2020-08-03 14:10:15 +03:00
Nadav Har'El	ae25661d9c	alternator test: set streams time window to zero Alternator Streams have a "alternator_streams_time_window_s" parameter which is used to allow for correct ordering in the stream in the face of clock differences between Scylla nodes and possibly network delays. This parameter currently defaults to 10 seconds, and there is a discussion on issue #6929 on whether it is perhaps too high. But in any case, for tests running on a single node there is no reason not to set this parameter to zero. Setting this parameter to zero greatly speeds up the Alternator Streams tests which use ReadRecords to read from the stream. Previously each such test took at least 10 seconds, because the data was only readable after a 10 second delay. With alternator_streams_time_window_s=0, these tests can finish in less than a second. Unfortunately they are still relatively slow because our Streams implementation has 512 shards, and thus we need over a thousand (!) API calls to read from the stream). Running "test/alternator/run test_streams.py" with 25 tests took before this patch 114 seconds, after this patch, it is down to 18 seconds. Refs #6929 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Reviewed-by: Calle Wilund <calle@scylladb.com> Message-Id: <20200728184612.1253178-1-nyh@scylladb.com>	2020-08-03 09:19:57 +03:00
Avi Kivity	257c17a87a	Merge "Don't depend on seastar::make_(lw_)?shared idiosyncrasies" from Rafael " While working on another patch I was getting odd compiler errors saying that a call to ::make_shared was ambiguous. The reason was that seastar has both: template <typename T, typename... A> shared_ptr<T> make_shared(A&&... a); template <typename T> shared_ptr<T> make_shared(T&& a); The second variant doesn't exist in std::make_shared. This series drops the dependency in scylla, so that a future change can make seastar::make_shared a bit more like std::make_shared. " * 'espindola/make_shared' of https://github.com/espindola/scylla: Everywhere: Explicitly instantiate make_lw_shared Everywhere: Add a make_shared_schema helper Everywhere: Explicitly instantiate make_shared cql3: Add a create_multi_column_relation helper main: Return a shared_ptr from defer_verbose_shutdown	2020-08-02 19:51:24 +03:00
Nadav Har'El	8b9da9c92a	alternator test: tests for combination of query filter and projection The tests in this patch, which pass on DynamoDB but fail on Alternator, reproduce a bug described in issue #6951. This bug makes it impossible for a Query (or Scan) to filter on an attribute if that attribute is not requested to be included in the output. This patch includes two xfailing tests of this type: One testing a combination of FilterExpression and ProjectionExpression, and the second testing a combination of QueryFilter and AttributesToGet; These two pairs are, respectively, DynamoDB's newer and older syntaxes to achieve the same thing. Additionally, we add two xfailing tests that demonstrates that combining old and new style syntax (e.g., FilterExpression with AttributesToGet) should not have been allowed (DynamoDB doesn't allow such combinations), but Alternator currently accepts these combinations. Refs #6951 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200729210346.1308461-1-nyh@scylladb.com>	2020-07-30 09:34:23 +02:00
Rafael Ávila de Espíndola	a548e5f5d1	test: Mark tmpdir::remove noexcept Also disable the allocation failure injection in it. Refs #6831. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200729200019.250908-2-espindola@scylladb.com>	2020-07-30 09:55:52 +03:00
Rafael Ávila de Espíndola	d8ba9678b4	test: Move tmpdir code to a .cc file This is not hot, so we can move it out of the header. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200729200019.250908-1-espindola@scylladb.com>	2020-07-30 09:55:52 +03:00
Juliusz Stasiewicz	7e42a42381	tests: Added CQL test for `delta mode` Tested scenario is just a single insert in every `delta_mode`. It is also checked that CDC cannot be enabled with all its subfeatures disabled.	2020-07-29 16:42:26 +02:00
Nadav Har'El	665b78253a	alternator test: reduce amount of Scylla logs saved The test/alternator/run script follows the pytest log with a full log of Scylla. This saved log can be useful in diagnosing problems, but most of it is filled with non-useful "INFO"-level messages. The two biggest offenders are compaction - which logs every single compaction happening, and the migration manager, which is just a second (and very long) message about schema change operations (e.g., table creations). Neither of these are interesting for Alternator's tests, which shouldn't care exactly when compaction of which sstable is happening. These two components alone are reponsible for 80% of the log lines, and 90% of the log bytes! In this patch we increase the log level of just these two components - compaction and migration_manager - to WARN, which reduces the log by the same percentages (80% by lines, 90% by bytes). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200728191420.1254961-1-nyh@scylladb.com>	2020-07-29 14:17:12 +03:00
Avi Kivity	fea5067dfa	Merge "Limit non-paged query memory consumption" from Botond " Non-paged queries completely ignore the query result size limiter mechanism. They consume all the memory they want. With sufficiently large datasets this can easily lead to a handful or even a single unpaged query producing an OOM. This series continues the work started by `134d5a5f7`, by introducing a configurable pair of soft/hard limit (default to 1MB/100MB) that is applied to otherwise unlimited queries, like reverse and unpaged ones. When an unlimited query reaches the soft limit a warning is logged. This should give users some heads-up to adjust their application. When the hard limit is reached the query is aborted. The idea is to not greet users with failing queries after an upgrade while at the same time protect the database from the really bad queries. The hard limit should be decreased from time to time gradually approaching the desired goal of 1MB. We don't want to limit internal queries, we trust ourselves to either use another form of memory usage control, or read only small datasets. So the limit is selected according to the query class. User reads use the `max_memory_for_unlimited_query_{soft,hard}_limit` configuration items, while internal reads are not limited. The limit is obtained by the coordinator, who passes it down to replicas using the existing `max_result_size` parameter (which is not a special type containing the two limits), which is now passed on every verb, instead of once per connection. This ensures that all replicas work with the same limits. For normal paged queries `max_result_size` is set to the usual `query::result_memory_limiter::maximum_result_size` For queries that can consume unlimited amount of memory -- unpaged and reverse queries -- this is set to the value of the aforementioned `max_memory_for_unlimited_query_{soft,hard}_limit` configuration item, but only for user reads, internal reads are not limited. This has the side-effect that reverse reads now send entire partitions in a single page, but this is not that bad. The data was already read, and its size was below the limit, the replica might as well send it all. Fixes: #5870 " * 'nonpaged-query-limit/v5' of https://github.com/denesb/scylla: (26 commits) test: database_test: add test for enforced max result limit mutation_partition: abort read when hard limit is exceeded for non-paged reads query-result.hh: move the definition of short_read to the top test: cql_test_env: set the max_memory_unlimited_query_{soft,hard}_limit test: set the allow_short_read slice option for paged queries partition_slice_builder: add with_option() result_memory_accounter: remove default constructor query_*(): use the coordinator specified memory limit for unlimited queries storage_proxy: use read_command::max_result_size to pass max result size around query: result_memory_limiter: use the new max_result_size type query: read_command: add max_result_size query: read_command: use tagged ints for limit ctor params query: read_command: add separate convenience constructor service: query_pager: set the allow_short_read flag result_memory_accounter: check(): use _maximum_result_size instead of hardcoded limit storage_proxy: add get_max_result_size() result_memory_limiter: add unlimited_result_size constant database: add get_statement_scheduling_group() database: query_mutations(): obtain the memory accounter inside query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field ...	2020-07-29 13:41:53 +03:00
Botond Dénes	3804dfcc0c	test: database_test: add test for enforced max result limit Two tests are added: one that works on the low-level database API, and another one that works on the CQL API.	2020-07-29 08:32:34 +03:00
Botond Dénes	f7a4d19fb1	mutation_partition: abort read when hard limit is exceeded for non-paged reads If the read is not paged (short read is not allowed) abort the query if the hard memory limit is reached. On reaching the soft memory limit a warning is logged. This should allow users to adjust their application code while at the same time protecting the database from the really bad queries. The enforcement happens inside the memory accounter and doesn't require cooperation from the result builders. This ensures memory limit set for the query is respected for all kind of reads. Previously non-paged reads simply ignored the memory accounter requesting the read to stop and consumed all the memory they wanted.	2020-07-29 08:32:31 +03:00
Botond Dénes	43c0da4b63	test: cql_test_env: set the max_memory_unlimited_query_{soft,hard}_limit To an unlimited value, in order to avoid aborting any unpaged queries executed by tests, that would exceed the default result limit of 1MB/100MB.	2020-07-28 18:00:29 +03:00
Botond Dénes	648ce473ab	test: set the allow_short_read slice option for paged queries Some tests use the lower level methods directly and meant to use paging but didn't and nobody noticed. This was revealed by the enforcement of max result size (introduced in a later patch), which caused these tests to fail due to exceeding the max result size. This patch fixes this by setting the `allow_short_reads` slice option.	2020-07-28 18:00:29 +03:00
Botond Dénes	6660a5df51	result_memory_accounter: remove default constructor If somebody wants to bypass proper memory accounting they should at the very least be forced to consider if that is indeed wise and think a second about the limit they want to apply.	2020-07-28 18:00:29 +03:00
Botond Dénes	9eab5bca27	query_*(): use the coordinator specified memory limit for unlimited queries It is important that all replicas participating in a read use the same memory limits to avoid artificial differences due to different amount of results. The coordinator now passes down its own memory limit for reads, in the form of max_result_size (or max_size). For unpaged or reverse queries this has to be used now instead of the locally set max_memory_unlimited_query configuration item. To avoid the replicas accidentally using the local limit contained in the `query_class_config` returned from `database::make_query_class_config()`, we refactor the latter into `database::get_reader_concurrency_semaphore()`. Most of its callers were only interested in the semaphore only anyway and those that were interested in the limit as well should get it from the coordinator instead, so this refactoring is a win-win.	2020-07-28 18:00:29 +03:00
Botond Dénes	159d37053d	storage_proxy: use read_command::max_result_size to pass max result size around Use the recently added `max_result_size` field of `query::read_command` to pass the max result size around, including passing it to remote nodes. This means that the max result size will be sent along each read, instead of once per connection. As we want to select the appropriate `max_result_size` based on the type of the query as well as based on the query class (user or internal) the previous method won't do anymore. If the remote doesn't fill this field, the old per-connection value is used.	2020-07-28 18:00:29 +03:00
Botond Dénes	fbbbc3e05c	query: result_memory_limiter: use the new max_result_size type	2020-07-28 18:00:29 +03:00
Botond Dénes	92a7b16cba	query: read_command: add max_result_size This field will replace max size which is currently passed once per established rpc connection via the CLIENT_ID verb and stored as an auxiliary value on the client_info. For now it is unused, but we update all sites creating a read command to pass the correct value to it. In the next patch we will phase out the old max size and use this field to pass max size on each verb instead.	2020-07-28 18:00:29 +03:00
Botond Dénes	8992bcd1f8	query: read_command: use tagged ints for limit ctor params The convenience constructor of read_command now has two integer parameter next to each other. In the next patch we intend to add another one. This is recipe for disaster, so to avoid mistakes this patch converts these parameters to tagged integers. This makes sure callers pass what they meant to pass. As a matter of fact, while fixing up call-sites, I already found several ones passing `query::max_partitions` to the `row_limit` parameter. No harm done yet, as `query::max_partitions` == `query::max_rows` but this shows just how easy it is to mix up parameters with the same type.	2020-07-28 18:00:29 +03:00
Botond Dénes	2ca118b2d5	query: read_command: add separate convenience constructor query::read_command currently has a single constructor, which serves both as an idl constructor (order of parameters is fixed) and a convenience one (most parameters have default values). This makes it very error prone to add new parameters, that everyone should fill. The new parameter has to be added as last, with a default value, as the previous ones have a default value as well. This means the compiler's help cannot be enlisted to make sure all usages are updated. This patch adds a separate convenience constructor to be used by normal code. The idl constructor looses all default parameters. New parameters can be added to any position in the convenience constructor (to force users to fill in a meaningful value) while the removed default parameters from the idl constructor means code cannot accidentally use it without noticing.	2020-07-28 18:00:29 +03:00
Botond Dénes	c364c7c6a2	result_memory_limiter: add unlimited_result_size constant To be used as the max result size for internal queries.	2020-07-28 18:00:29 +03:00
Botond Dénes	d5cc932a0b	database: query_mutations(): obtain the memory accounter inside Instead of requesting callers to do it and pass it as a parameter. This is in line with data_query().	2020-07-28 18:00:29 +03:00
Botond Dénes	92ce39f014	query: query_class_config: use max_result_size for the max_memory_for_unlimited_query field We want to switch from using a single limit to a dual soft/hard limit. As a first step we switch the limit field of `query_class_config` to use the recently introduced type for this. As this field has a single user at the moment -- reverse queries (and not a lot of propagation) -- we update it in this same patch to use the soft/hard limit: warn on reaching the soft limit and abort on the hard limit (the previous behaviour).	2020-07-28 18:00:29 +03:00
Botond Dénes	517a941feb	query_class_config: move into the query namespace It belongs there, its name even starts with "query".	2020-07-28 18:00:29 +03:00

1 2 3 4 5 ...

704 Commits