Adds a sync_point structure. A sync point is a (possibly incomplete)
mapping from hint queues to replay positions within them. Users will be
able to create sync points consisting of the last written positions of
some hint queues, and then wait until hint replay in all of those
queues reaches that point.
The sync point supports serialization: it is first serialized to a
binary form with the help of IDL, and then converted to a hexadecimal
string. Deserialization is also possible.
The base64 encoding/decoding functions will be used to serialize hint
sync point descriptions. The base64 format is not specific to
Alternator, so it can be moved to utils.
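As a sketch of the serialization scheme described above (helper names hypothetical, not the actual Scylla API): the IDL layer produces a byte buffer, which is then converted to a hexadecimal string, and back on deserialization:

```cpp
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helpers: convert a serialized sync point (a byte buffer
// produced by the IDL layer) to a hexadecimal string and back.
inline std::string bytes_to_hex(const std::vector<uint8_t>& in) {
    static const char digits[] = "0123456789abcdef";
    std::string out;
    out.reserve(in.size() * 2);
    for (uint8_t b : in) {
        out.push_back(digits[b >> 4]);   // high nibble
        out.push_back(digits[b & 0x0f]); // low nibble
    }
    return out;
}

inline std::vector<uint8_t> hex_to_bytes(const std::string& in) {
    if (in.size() % 2 != 0) {
        throw std::invalid_argument("odd-length hex string");
    }
    auto nibble = [](char c) -> uint8_t {
        if (c >= '0' && c <= '9') return c - '0';
        if (c >= 'a' && c <= 'f') return c - 'a' + 10;
        throw std::invalid_argument("not a hex digit");
    };
    std::vector<uint8_t> out;
    out.reserve(in.size() / 2);
    for (size_t i = 0; i < in.size(); i += 2) {
        out.push_back((nibble(in[i]) << 4) | nibble(in[i + 1]));
    }
    return out;
}
```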
Calculating clustering ranges on a local index has been rewritten to use the new `expression` variant.
This allows us to finally remove the old `bounds_ranges` function.
Closes #9080
* github.com:scylladb/scylla:
cql3: Remove unused functions like bounds_ranges
cql3: Use expressions to calculate the local-index clustering ranges
statement_restrictions_test: tests for extracting column restrictions
expression: add a function to extract restrictions for a column
With commit 1924e8d2b6, compaction code was moved into a
top-level dir, as compaction is layered on top of sstables.
Let's continue this work by moving all compaction unit tests
into their own test file. This also makes things much more
organized.
sstable_datafile_test, as its name implies, will now only contain
sstable data tests. Perhaps it should be renamed to
sstable_data_test, as the file also contains tests involving
other components, not only the data one.
BEFORE
$ cat test/boost/sstable_datafile_test.cc | grep TEST_CASE | wc -l
105
AFTER
$ cat test/boost/sstable_compaction_test.cc | grep TEST_CASE | wc -l
57
$ cat test/boost/sstable_datafile_test.cc | grep TEST_CASE | wc -l
48
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20210802192120.148583-1-raphaelsc@scylladb.com>
These include the following:
* `currenttimestamp()`
* `currenttime()`
* `currentdate()`
* `currenttimeuuid()`
Previously, they were incorrectly marked as pure, meaning
that the function was executed at the "prepare" step.
For functions that possibly depend on timing and a random seed,
this is clearly a bug. Cassandra doesn't have a notion of pure
functions, so they are lazily evaluated.
Make Scylla match Cassandra's behavior for these functions.
Add a unit test for the fix (excluding the `currentdate()` function,
because there is no way to use a synthetic clock with the query
processor, and sleeping for a whole day to demonstrate correct
behavior is clearly not an option).
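A minimal sketch of the intended behavior (all names hypothetical, not Scylla's actual types): only deterministic functions may be pre-evaluated at the "prepare" step; non-deterministic ones must be deferred to execution:

```cpp
#include <cstdint>
#include <functional>
#include <optional>
#include <string>

// Hypothetical model of a CQL function: a body plus a determinism flag.
struct cql_function {
    std::string name;
    bool deterministic; // currenttimestamp(), currenttimeuuid() etc. must be false
    std::function<int64_t()> body;
};

// At the "prepare" step we may only fold deterministic functions;
// everything else is evaluated lazily, at execution time.
inline std::optional<int64_t> try_fold_at_prepare(const cql_function& f) {
    if (!f.deterministic) {
        return std::nullopt; // defer to execution
    }
    return f.body();
}
```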
Tests: unit(dev, debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
"
`function_call` AST nodes are created for each function
with side effects in a CQL query, i.e. non-deterministic
functions (`uuid()`, `now()` and some other timeuuid-related
functions).
These nodes are evaluated either when a query itself is executed
or query restrictions are computed (e.g. partition/clustering
key ranges for LWT requests).
We need to cache the calls since otherwise when handling a
`bounce_to_shard` request for an LWT query, we can possibly
enter an infinite bouncing loop (in case a function is used
to calculate partition key ranges for a query), since the
results can be different each time.
Furthermore, we don't support bouncing more than once.
Returning a `bounce_to_shard` message more than once
will result in a crash.
Caching works only for LWT statements and only for the function
calls that affect partition key range computation for the query.
The `variable_specifications` class is renamed to `prepare_context`
and generalized to record information about each `function_call`
AST node and modify them as needed:
* Check whether a given function call is part of a partition key
restriction of the statement.
* Assign ids for caching if the above is true and the call is part
of an LWT statement.
Function calls are indexed by the order in which they appear
within a statement during parsing. There is no need to
include any kind of statement identifier in the cache key
since `query_options` (which holds the cache) is limited
to a single statement, anyway.
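The caching scheme can be sketched as follows (a simplified model, not the actual `query_options` implementation): each eligible call gets an ordinal id at prepare time, and a bounced re-execution reuses the value computed on the first execution:

```cpp
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical sketch of a query_options-style cache of evaluated
// function calls, indexed by the order in which the calls appear
// in the statement.
struct fn_call_cache {
    std::vector<std::optional<int64_t>> slots;

    // Evaluate the call with the given ordinal id, or reuse the cached
    // value if the statement is re-executed after a bounce_to_shard.
    template <typename Fn>
    int64_t evaluate(size_t id, Fn&& eval) {
        if (slots.size() <= id) {
            slots.resize(id + 1);
        }
        if (!slots[id]) {
            slots[id] = eval(); // first execution: compute and remember
        }
        return *slots[id];      // bounced execution: same value again
    }
};
```

Because the second evaluation returns the same value, the partition key ranges computed after the bounce match the first computation, and the infinite bouncing loop described above cannot occur.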
Note that `function_call::raw` AST nodes are not created
for selection clauses of a SELECT statement, hence they
can accept only one of the following as parameters:
* Other function calls.
* Literal values.
* Parameter markers.
In other words, only parameters that can be immediately reduced
to a byte buffer are allowed, so we don't need to handle
database inputs to non-pure functions separately, since they
are not possible in this context. In any case, we don't even have
a single non-pure function that accepts arguments, so such
precautions are not needed at the moment.
Add a test written in the `cql-pytest` framework to verify
that both prepared and unprepared LWT statements handle
`bounce_to_shard` messages correctly in such a scenario.
Fixes: #8604
Tests: unit(dev, debug)
NOTE: the patchset uses `query_options` as a container for
cached values. This doesn't look clean, and `service::query_state`
seems to be a better place to store them. But it's not
forwarded to most of the CQL code, and doing so would mean that a
huge number of places would have to be amended.
The series presents a trade-off to avoid forwarding `query_state`
everywhere (but maybe that's the thing that needs to be done, nonetheless).
"
* 'lwt_bounce_to_shard_cached_fn_v6' of https://github.com/ManManson/scylla:
cql-pytest: add a test for non-pure CQL functions
cql3: cache function calls evaluation for non-deterministic functions
cql3: rename `variable_specifications` to `prepare_context`
Convert all known tri-compares that return an int to return std::strong_ordering.
Returning an int is dangerous since the caller can treat it as a bool, and indeed
this series uncovered a minor bug (#9103).
Test: unit (dev)
Fixes #1449
Closes #9106
* github.com:scylladb/scylla:
treewide: remove redundant "x <=> 0" compares
test: mutation_test: convert internal tri-compare to std::strong_ordering
utils: int_range: change to std::strong_ordering
test: change some internal comparators to std::strong_ordering
utils: big_decimal: change to std::strong_ordering
utils: fragment_range: change to std::strong_ordering
atomic_cell: change compare_atomic_cell_for_merge() to std::strong_ordering
types: drop scaffolding erected around lexicographical_tri_compare
sstables: keys: change to std::strong_ordering internally
bytes: compare_unsigned(): change to std::strong_ordering
uuid: change comparators to std::strong_ordering
types: convert abstract_type::compare and related to std::strong_ordering
types: reduce boilerplate when comparing empty value
serialized_tri_compare: change to std::strong_ordering
compound_compat: change to std::strong-ordering
types: change lexicographical_tri_compare, prefix_equality_tri_compare to std::strong_ordering
"
There are a few places that call the global storage service, but all
are easily fixable without significant changes.
1. alternator -- needs token metadata, switch to using proxy
2. api -- calls methods from storage service, all handlers are
registered in main and can capture storage service from there
3. thrift -- calls methods from storage service, can carry the
reference via controller
4. view -- needs tokens, switch to using (global) proxy
5. storage_service -- (surprisingly) can use "this"
tests: unit(dev), dtest(simple_boot_shutdown, dev)
"
* 'br-unglobal-storage-service' of https://github.com/xemul/scylla:
storage_service: Make it local
storage_service: Remove (de)?init_storage_service()
storage_service: Use container() in run_with(out)_api_lock
storage_service: Unmark update_topology static
storage_service: Capture this when appropriate
view: Use proxy to get token metadata from
thrift: Use local storage service in handlers
thrift: Carry sharded<storage_service>& down to handler
api: Capture and use sharded<storage_service>& in handlers
api: Carry sharded<storage_service>& down to some handlers
alternator: Take token metadata from server's storage_proxy
alternator: Keep storage_proxy on server
"
This is the 3rd slowest test in the set. There are 3 cases out
there that are hard-coded to be sequential. However, splitting
them into boost test cases helps run this test faster in
--parallel-cases mode. Timings for debug mode:
Total before the patch: 25 min
Sequential after the patch: 25 min
Basic case: 5 min
Evict-paused-readers case: 5 min
Single-mutation-buffer case: 15 min
tests: unit.multishard_combining_reader_as_mutation_source(debug)
"
* 'br-parallel-mcr-test' of https://github.com/xemul/scylla:
test: Split test_multishard_combining_reader_as_mutation_source into 3
test: Fix indentation after previous patch
test: Move out internals of test_multishard_combining_reader_as_mutation_source
There are 3 places that can now declare a local instance:
- main
- cql_test_env
- boost gossiper test
The global pointer is saved in the debug namespace for debugging.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
"
This series changes the behavior of the system when executing reads
annotated with the "bypass cache" clause in CQL. Such reads will neither
use nor populate the sstable partition index cache and sstable index page cache.
"
* 'bypass-cache-in-sstable-index-reads' of github.com:tgrabiec/scylla:
sstables: Do not populate page cache when searching in promoted index for "bypass cache" reads
sstables: Do not populate partition index cache for "bypass cache" reads
If x is of type std::strong_ordering, then "x <=> 0" is equivalent to
x. These no-ops were inserted during the #1449 fixes, but are now unnecessary.
They have potential for harm, since they can hide an accidental conversion
of the type of x to an arithmetic type, so remove them.
Ref #1449.
gcc 10.3.1 spews the following error:
```
_test_generator::generate_scenario(std::mt19937&) const’:
test/boost/mutation_reader_test.cc:3731:28: error: comparison of integer expressions of different signedness: ‘int’ and ‘long unsigned int’ [-Werror=sign-compare]
3731 | for (auto i = 0; i < num_ranges; ++i) {
| ~~^~~~~~~~~~~~
```
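A sketch of the kind of fix the warning calls for (the loop here is a stand-in, not the actual test code): give the loop index the same unsigned type as the bound instead of comparing a signed `int` against a `size_t`:

```cpp
#include <cstddef>

// Sketch: make the loop index match the unsigned type of the bound,
// instead of comparing int against size_t (long unsigned int).
inline size_t count_iterations(size_t num_ranges) {
    size_t iterations = 0;
    for (size_t i = 0; i < num_ranges; ++i) { // was: auto i = 0 (deduces int)
        ++iterations;
    }
    return iterations;
}
```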
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20210728073538.2467040-1-bhalevy@scylladb.com>
Patch the .row_tombstones() to return the range_tombstone_list
wrapped into the immutable_collection<> so that callers are
guaranteed not to touch the collection itself, but still can
modify the tombstones.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Some callers of mutation_partition::row_tombstones() don't want
(and shouldn't) modify the list itself, while they may want to
modify the tombstones. This patch explicitly marks those that
need to modify the collection, because the next patch will
return an immutable collection for the others.
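The idea behind the wrapper can be sketched as follows (a simplified stand-in, not Scylla's actual immutable_collection<>): hide the container's mutating interface while still handing out mutable references to the elements:

```cpp
#include <list>

// Sketch of an immutable_collection<> adapter: the underlying container
// cannot be grown or shrunk through it, but elements stay mutable.
template <typename Container>
class immutable_collection {
    Container& _c;
public:
    explicit immutable_collection(Container& c) : _c(c) {}
    // Expose iteration over mutable elements...
    auto begin() { return _c.begin(); }
    auto end() { return _c.end(); }
    auto size() const { return _c.size(); }
    // ...but no insert()/erase()/clear(), so callers holding this
    // wrapper cannot modify the collection itself.
};
```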
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Patch the .clustered_rows() method to return the btree of rows
wrapped into the immutable_collection<> so that callers are
guaranteed not to touch the collection itself, but still can
modify the elements in it.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Next patches will mark btree::iterator methods that modify
the tree itself as private, so stop using them in tests.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
"
This series fixes two issues which cause very poor efficiency of reads
when there are a lot of range tombstones per live row in a partition.
The first issue is in the row_cache reader. Before the patch, all range
tombstones up to the next row were copied into a vector, and then put
into the buffer until it was full. This would get quadratic if there are
many more range tombstones than fit in a buffer.
The fix is to avoid the accumulation of all tombstones in the vector
and invoke the callback instead, which stops the iteration as soon as
the buffer is full.
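The shape of the fix can be sketched like this (simplified types, not the actual reader code): instead of first accumulating every range tombstone into a vector, a per-tombstone callback is invoked and iteration stops as soon as the consumer's buffer is full:

```cpp
#include <cstddef>
#include <vector>

enum class stop_iteration { no, yes };

// Sketch of incremental consumption: invoke a callback per tombstone
// and stop as soon as the consumer signals its buffer is full, rather
// than copying all tombstones up to the next row into a vector first.
template <typename Consumer>
size_t consume_tombstones(const std::vector<int>& tombstones, Consumer&& consume) {
    size_t consumed = 0;
    for (int rt : tombstones) {
        ++consumed;
        if (consume(rt) == stop_iteration::yes) {
            break; // buffer full: no quadratic re-accumulation
        }
    }
    return consumed;
}
```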
Fixes #2581.
The second, similar issue was in the memtable reader.
Tests:
- unit (dev)
- perf_row_cache_update (release)
"
* tag 'no-quadratic-rt-in-reads-v1' of github.com:tgrabiec/scylla:
test: perf_row_cache_update: Uncomment test case for lots of range tombstones
row_cache: Consume range tombstones incrementally
partition_snapshot_reader: Avoid quadratic behavior with lots of range tombstones
tests: mvcc: Relax monotonicity check
range_tombstone_stream: Introduce peek_next()
This patch follows #9002, further reducing the complexity of the sstable readers.
The split between row consumer interfaces and implementations was first added in 2015, and there is no reason to create new implementations anymore. By merging those classes, we achieve a sizeable reduction in sstable reader length and complexity.
Refs #7952
Tests: unit(dev)
Closes #9073
* github.com:scylladb/scylla:
sstables: merge row_consumer into mp_row_consumer_k_l
sstables: move kl row_consumer
sstables: merge consumer_m into mp_row_consumer_m
sstables: move mp_row_consumer_m
Consecutive range tombstones can have the same position. In one of
the test cases they will, once the range tombstone merger in
partition_snapshot_flat_reader no longer uses range_tombstone_list
(which deoverlaps) to merge data from multiple versions, but rather
merges the streams corresponding to each version, which interleaves
range tombstones from different versions.
The class is repurposed to be more generic and also be able
to hold additional metadata related to function calls within
a CQL statement. Rename all methods appropriately.
Visitor functions in AST nodes (`collect_marker_specification`)
are also renamed to a more generic `fill_prepare_context`.
The name `prepare_context` designates that this metadata
structure is a byproduct of the `stmt::raw::prepare()` call and
is needed only for the "prepare" step of query execution.
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
This is the 2nd PR in series with the goal to finish the hackathon project authored by @tgrabiec, @kostja, @amnonh and @mmatczuk (improved virtual tables + function call syntax in CQL). This one introduces a new implementation of the virtual tables, the streaming tables, which are suitable for large amounts of data.
This PR was created by @jul-stas and @StarostaGit
Closes #8961
* github.com:scylladb/scylla:
test/boost: run_mutation_source_tests on streaming virtual table
system_keyspace: Introduce describe_ring table as virtual_table
storage_service: Pass the reference down to system_keyspace
endpoint_details: store `_host` as `gms::inet_address`
queue_reader: implement next_partition()
virtual_tables: Introduce streaming_virtual_table
flat_mutation_reader: Add a new filtering reader factory method
"
The cql-server -> storage-service dependency comes from the server's
event_notifier, which (un)subscribes to the lifecycle events that come
from the storage service. To break this link, the same trick as with
the migration manager notifications is used -- the notification engine
is split out of the storage service and then pushed directly into
both the listeners (to (un)subscribe) and the storage service (to
notify).
tests: unit(dev), dtest(simple_boot_shutdown, dev)
manual({ start/stop,
with/without started transport,
nodetool enable-/disablebinary
} in various combinations, dev)
"
* 'br-remove-storage-service-from-transport' of https://github.com/xemul/scylla:
transport.controller: Brushup cql_server declarations
code: Remove storage-service header from irrelevant places
storage_service: Remove (unlifecycle) subscribe methods
transport: Use local notifier to (un)subscribe server
transport: Keep lifecycle notifier sharded reference
main: Use local lifecycle notifier to (un)subscribe listeners
main, tests: Push notifier through storage service
storage_service: Move notification core into dedicated class
storage_service: Split lifecycle notification code
transport, generic_server: Remove no longer used functionality
transport: (Un)Subscribe cql_server::event_notifier from controller
tests: Remove storage service from manual gossiper test
Now it's time to move the lifecycle notifier from the storage
service to main's scope. Next patches will remove the
lifecycle-subscriber -> storage_service dependency.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This is a rewrite of an old PR: #7582
`TOKEN()` restrictions don't work properly when a query uses an index.
For example this returns both rows:
```cql
CREATE TABLE t(pk int, ck int, v int, PRIMARY KEY(pk, ck));
CREATE INDEX ON t(v);
INSERT INTO t (pk, ck, v) VALUES (0, 0, 0);
INSERT INTO t (pk, ck, v) VALUES (1, 0, 0);
SELECT token(pk), pk, ck, v FROM t WHERE v = 0 AND token(pk) = token(0) ALLOW FILTERING;
```
This functionality is supported on both old and new indexes. In old
indexes the type of the token column was `blob`. This causes problems,
because `blob` representation of tokens is ordered differently. Tokens
represented as blobs are ordered like this:
```
0, 1, 2, 3, 4, 5, ..., bigint_max, bigint_min, ...., -5, -4, -3, -2, -1
```
Because of that, a clustering range for `token()` restrictions needs to
be translated into two clustering ranges on the `blob` column.
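The blob ordering above follows from how a bigint is serialized: big-endian two's-complement bytes, compared as an unsigned byte string, so negative tokens (high bit set) sort after all non-negative ones. A small demonstration:

```cpp
#include <array>
#include <cstdint>

// A bigint token serialized as a blob is its big-endian two's-complement
// byte string; blobs compare as unsigned bytes, so negative tokens
// (whose first byte has the high bit set) sort after non-negative ones.
inline std::array<uint8_t, 8> token_to_blob(int64_t token) {
    std::array<uint8_t, 8> out{};
    uint64_t u = static_cast<uint64_t>(token);
    for (int i = 7; i >= 0; --i) {
        out[i] = u & 0xff;
        u >>= 8;
    }
    return out;
}

inline bool blob_less(int64_t a, int64_t b) {
    return token_to_blob(a) < token_to_blob(b); // lexicographic, unsigned
}
```

This is why a signed token range that crosses zero has to be split into two clustering ranges on the `blob` column: one covering the non-negative part and one covering the negative part.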
To create old indexes, disable the feature called
`CORRECT_IDX_TOKEN_IN_SECONDARY_INDEX` or run a scylla version from the branch
[`cvybhu/si-token2-old-index`](https://github.com/cvybhu/scylla/commits/si-token2-old-index).
I'm not sure if it's possible to create automatic tests with old
indexes. I ran `dev-test` manually on the `si-token2-old-index` branch,
and the only tests that failed were the ones testing row ordering. Rows
should be ordered by `token`, but because in old indexes the token is
represented as a `blob` this ordering breaks. This is a known issue
(#7443), that has been fixed by introducing new indexes.
To sum up:
* `token()` restrictions are fixed on both new and old indexes.
* When using old indexes, the rows are not properly ordered by token.
* With new indexes the rows are properly ordered by token.
Fixes #7043
Closes #9067
* github.com:scylladb/scylla:
tests: add secondary index tests with TOKEN clause
secondary_index_test: extract test data
secondary_index: Fix TOKEN() restrictions in indexed SELECTs
expression: Add replace_token function
In preparation for the next patch combining row_consumer and
mp_row_consumer_k_l, move row_consumer next to mp_row_consumer_k_l.
Because row_consumer is going to be removed, we retire some
old tests for different implementations of the row_consumer
interface. As a result, we don't need to expose internal
types of the kl sstable reader for tests, so all classes from
reader_impl.hh are moved to reader.cc and the reader_impl.hh
file is deleted; the reader.cc file now has a structure
analogous to reader.cc in the sstables/mx directory.
Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>
Add tests of SELECTs with TOKEN clauses on tables with secondary
indexes (both global and local).
test_select_with_token_range_cases checks all possible token range
combinations (inclusive/exclusive/infinity start/end) on tables without
an index, with a local index, or with a global index.
test_select_with_token_range_filtering checks whether TOKEN restrictions
combined with column restrictions work properly. As different code paths
are taken if the index is created on a clustering key (first or non-first)
or a non-primary-key column, the test checks scenarios where the index is
created on different columns.
Extract test data into separate variables, allowing it to be easily
reused by other tests. The tokens are hard-coded, because calculating
their values brought too much complexity to this code.
Today, table relies on row_cache::invalidate() serialization for
concurrent sstable list updates to produce correct results.
That's very error prone because table is relying on an implementation
detail of invalidate() to get things right.
Instead, let's make table itself take care of serialization on
concurrent updates.
To achieve that, sstable_list_builder is introduced. Only one
builder can be alive for a given table, so serialization is guaranteed
as long as the builder is kept alive throughout the update procedure.
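The invariant can be sketched as follows (a simplified stand-in for the real sstable_list_builder; the real code presumably waits rather than throwing, but throwing makes the invariant visible here): at most one builder may be alive per table, so keeping the builder alive for the whole update procedure serializes concurrent updates by construction:

```cpp
#include <stdexcept>

// Sketch: at most one sstable_list_builder may be alive for a given
// table, so a builder kept alive throughout the update procedure
// serializes concurrent sstable-list updates by construction.
class table_sketch {
    bool _builder_alive = false;
public:
    class sstable_list_builder {
        table_sketch& _t;
    public:
        explicit sstable_list_builder(table_sketch& t) : _t(t) {
            if (_t._builder_alive) {
                throw std::runtime_error("sstable list update already in progress");
            }
            _t._builder_alive = true;
        }
        sstable_list_builder(const sstable_list_builder&) = delete;
        sstable_list_builder& operator=(const sstable_list_builder&) = delete;
        ~sstable_list_builder() { _t._builder_alive = false; }
    };
};
```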
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20210721001716.210281-1-raphaelsc@scylladb.com>
lsa_buffer allocations are aligned to 4K. If a smaller size is
requested, a whole 4K is used. However, only the requested size was used
when accounting segment occupancy. This can confuse the reclaimer, which
may think the segment is sparse while it is actually dense, so compacting
it will yield little or no gain. This can cause inefficient memory
reclamation or lack of progress.
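The accounting fix amounts to charging occupancy for the aligned allocation size rather than the requested size; a sketch (the constant and names are illustrative, not the actual LSA code):

```cpp
#include <cstddef>

// lsa_buffer allocations are rounded up to a 4K granularity; sketch of
// the accounting fix: charge segment occupancy for the aligned size,
// not the requested size, so the reclaimer sees the real density.
constexpr size_t buf_align = 4096;

constexpr size_t align_up(size_t size) {
    return (size + buf_align - 1) / buf_align * buf_align;
}

constexpr size_t accounted_occupancy(size_t requested) {
    return align_up(requested); // was: requested
}
```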
Refs #9038
Message-Id: <20210720104110.463812-1-tgrabiec@scylladb.com>
The new tests confirm that the regression, where
we didn't correctly skip bytes at the end of a
continuous_data_consumer range, is fixed.
Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>
Parallelism is achieved by listing the content of each (boost)
test and adding a test for each case found, appending the
'--run_test={case_name}' option.
Also, a few tests (logallog and memtable) have cases that depend on
each other (the former explicitly states this in its head comment),
so these are marked as "no_parallel_cases" in the suite.yaml file.
In dev mode the tests need 2m 5s to run by default. With parallelism
(and an updated long-running tests list) -- 1m 35s.
In debug mode there are 6 slow _cases_ that overrun 30 minutes.
They finish last and deserve some special (incremental) care. All
the other tests run ~1h by default vs ~25m in parallel.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Index cursor for reads which bypass cache will use a private temporary
instance of the partition index cache.
Promoted index scanner (ka/la format) will not go through the page cache.
This patch flips two "switches":
1) It switches admission to be up-front.
2) It changes the admission algorithm.
(1) by now all permits are obtained up-front, so this patch just yanks
out the restricted reader from all reader stacks and simultaneously
switches all `obtain_permit_nowait()` calls to `obtain_permit()`. By
doing this admission is now waited on when creating the permit.
(2) we switch to an admission algorithm that adds a new aspect to the
existing resource-availability check: the number of used/blocked reads.
Namely, it only admits new reads if, in addition to the necessary amount
of resources being available, all currently used readers are blocked. In
other words, we only admit new reads if all currently admitted reads
require something other than CPU to progress. They are either waiting
on I/O, a remote shard, or attention from their consumers (not used
currently).
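The admission rule described above can be sketched as a small predicate (names and the memory-only resource model are hypothetical simplifications of the reader concurrency semaphore):

```cpp
#include <cstddef>

// Sketch of the new admission rule: a read is admitted only if enough
// resources are available AND every currently admitted read is blocked
// on something other than the CPU (I/O, a remote shard, its consumer).
struct semaphore_state {
    size_t available_memory;
    size_t used_reads;    // currently admitted reads
    size_t blocked_reads; // of those, how many are waiting on non-CPU work
};

inline bool can_admit(const semaphore_state& s, size_t required_memory) {
    if (s.available_memory < required_memory) {
        return false; // not enough resources
    }
    // Admit only when no admitted read could use the CPU right now;
    // this keeps a cache-hit read free of CPU competition.
    return s.used_reads == s.blocked_reads;
}
```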
We flip these two switches at the same time because up-front admission
means cache reads now need to obtain a permit too. For cache reads the
optimal concurrency is 1. Anything above that just increases latency
(without increasing throughput). So we want to make sure that when a cache
read hits, it doesn't get any competition for the CPU and can run to
completion. We admit new reads only if the read misses and has to go to
disk.
Another change made to accommodate this switch is the replacement of the
replica-side read execution stages with the reader concurrency
semaphore acting as an execution stage. This replacement is needed
because with the introduction of up-front admission, reads are not
independent of each other anymore. One executed read can influence
whether later reads will be admitted or not, and execution stages
require independent operations to work well. By moving the execution
stage into the semaphore, we have an execution stage which is in control
of both admission and running the operations in batches, avoiding the
bad interaction between the two.