scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 07:23:15 +00:00

Author	SHA1	Message	Date
Łukasz Paszkowski	fbd324b5cd	mutation_query: Add reversed function to reverse reconcilable_result The reconcilable_result is reversed by reversing mutations for all paritions it holds. Reversing is asynchronous to avoid potential stall. Use for transitions between legacy and native formats and in order to support mixed-nodes clusters.	2024-08-13 10:03:46 +02:00
Łukasz Paszkowski	b91edbacf1	query-request: Add reversed function to reverse read_command The read_command is reversed by reversing the schema version it holds and transforming a slice from the legacy reversed format to the native reversed format. Use for trasition between format and to support mixed-nodes clusters	2024-08-13 10:03:46 +02:00
Łukasz Paszkowski	9690785112	features: add native_reverse_queries Enabled when all replicas support the native_reversed command slice and return the result in reverse order in this case.	2024-08-13 10:03:42 +02:00
Łukasz Paszkowski	7b201e9165	kl::reader::make_reader: Unify interface with mx::reader::make_reader Ensure both readers have the same interfaces to avoid mistakes as both readers are used in sstable::make_reader. Less error prone.	2024-08-13 10:02:43 +02:00
Łukasz Paszkowski	b270097f1f	config: drop reversed_reads_auto_bypass_cache Reverse reads have already been with us for a while, thus this back door option to bypass in-memory data cache for reversed queries can be retired.	2024-08-13 10:02:42 +02:00
Łukasz Paszkowski	80df313f49	config: drop enable_optimized_reversed_reads Reverse reads have already been with us for a while, thus this back door option to read entire paritions forward and reversing them after can be retired.	2024-08-13 10:02:42 +02:00
Laszlo Ersek	607abe96e8	test/sstable: merge test_using_reusable_sst*() All lambdas passed to test_using_reusable_sst() conform to the prototype void (test_env&, sstable_ptr) All lambdas passed to test_using_reusable_sst_returning() conform to the prototype NON_VOID (test_env&, sstable_ptr) The common parameter list of both prototypes can be expressed with the concept std::invocable<test_env&, sstable_ptr> Once a "Func" template parameter (i.e., function type) satisfying this concept is taken, then "Func"'s void or non-void return type can be commonly expressed with std::invoke_result_t<Func, test_env&, sstable_ptr> In turn, test_env::do_with_async_returning<...> can be instantiated with this return type, even if it happens to be "void". ([stmt.return] specifies, "[a] return statement with an operand of type void shall be used only in a function that has a cv void return type", meaning that return func(env) will do the right thing in the body of test_env::do_with_async_returning<void>().) Merge test_using_reusable_sst() and test_using_reusable_sst_returning() into one. Preserve the function name from the former, and the test_env::do_with_async_returning<...>() call from the latter. Suggested-by: Avi Kivity <avi@scylladb.com> Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com> Closes scylladb/scylladb#20090	2024-08-12 17:52:01 +03:00
Nikos Dragazis	90363ce802	test: Test the SSTable validation API against malformed SSTables Unit testing for the SSTable validation API happens in `sstable_validate_test`. Currently, this test checks the API against some invalid SSTables with out-of-order clustering rows and out-of-order partitions. However, both are types of content-level corruption that do not trigger `malformed_sstable_exception` errors. Extend the test to cover cases of file-level corruption as well, i.e., cases that would raise a `malformed_sstable_exception`. Construct an SSTable with an invalid checksum to trigger this. This is part of the effort to improve scrub to handle all kinds of corruption. Fixes scylladb/scylladb#19057 Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#20096	2024-08-12 15:09:58 +03:00
Botond Dénes	fec57c83e6	Merge 'cell_locker: maybe_rehash: ignore allocation failures' from Benny Halevy `maybe_rehash` is complimentary and is not strictly require to succeed. If it fails, it will retry on the next call, but there's no reason to throw an exception that will fail its caller, since `maybe_rehash` is called as the final step after the caller has already succeeded with its action. Minor enhancement for the error path, no backport required. Closes scylladb/scylladb#19910 * github.com:scylladb/scylladb: cell_locker: maybe_rehash: reindent cell_locker: maybe_rehash: ignore allocation failures	2024-08-12 10:54:56 +03:00
Kefu Chai	0ae04ee819	build: cmake: use $<CONFIG:cfgs> when appropriate per https://cmake.org/cmake/help/latest/manual/cmake-generator-expressions.7.html#genex:CONFIG, `cfgs` can be a comma-separated list. this is supported by CMake 3.19 and up, and our minimum required CMake version is 3.27. so let's switch over from the composition of `IN_LIST` and `CONFIG` generator expressions to a single one. simpler this way. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20110	2024-08-11 21:28:38 +03:00
Avi Kivity	318278ff92	Merge 'tablets: reload only changed metadata' from Botond Dénes Currently, each change to tablet metadata triggers a full metadata reload from disk. This is very wasteful, especially if the metadata change affects only a single row in the `system.tablets` table. This is the case when the tablet load balancer triggers a migration, this will affect a single row in the table, but today will trigger a full reload. We expect tablet count to potentially grow to thousands and beyond and the overhead of this full reload can become significant. This PR makes tablet metadata reload partial, instead of reloading all metadata on topology or schema changes, reload only the partitions that are affected by the change. Copy the rest from the in-memory state. This is done with two passes: first the change mutations are scanned and a hint is produced. This hint is then passed down to the reload code, which will use it to only reload parts (rows/partitions) of the metadata that has actually changed. The performance difference between full reload and partial reload is quite drastic: ``` INFO 2024-07-25 05:06:27,347 [shard 0:stat] testlog - Tablet metadata reload: full 616.39ms partial 0.18ms ``` This was measured with the modified (by this PR) `perf_tablets`, which creates 100 tables, each with 2K tablets. The test was modified to change a single tablet, then do a full and partial reload respectively, measuring the time it takes for reach. Fixes: #15294 New feature, no backport needed. Closes scylladb/scylladb#15541 * github.com:scylladb/scylladb: test/perf/perf_tablets: add tablet metadata reload perf measurement test/boost/tablets_test: add test for partial tablet metadata updates db/schema_tables: pass tablet hint to update_tablet_metadata() service/storage_service: load_tablet_metadata(): add hint parameter service/migration_listener: update_tablet_metadata(): add hint parameter service/raft/group0_state_machine: provide tablet change hint on topology change service/storage_service: topology_state_load(): allow providing change hint replica/tablets: add update_tablet_metadata() replica/tablets: fix indentation replica/tablets: extract tablet_metadata builder logic replica/tablets: add get_tablet_metadata_change_hint() and update_tablet_metadata_change_hint() locator/tablets: add tablet_map::clear_tablet_transition_info() locator/tablets: make tablet_metadata cheap to copy mutation/canonical_mutation: add key()	2024-08-11 21:27:18 +03:00
Botond Dénes	2b2db510b7	test/perf/perf_tablets: add tablet metadata reload perf measurement Measure reload perf of full reload vs. partial reload, after changing a single tablet. While at it, modify the `--tablets-per-table` parameter, so that it has a default parameter which works OOTB. The previous default was both too large (causing oversized commitlog entry errors) and not a power of two.	2024-08-11 09:53:19 -04:00
Botond Dénes	65eee200b2	test/boost/tablets_test: add test for partial tablet metadata updates	2024-08-11 09:53:19 -04:00
Botond Dénes	b886ed44a7	db/schema_tables: pass tablet hint to update_tablet_metadata() Replace the has_tablet_mutations in `merge_tables_and_views()` with a hint parameter, which is calculated in the caller, from the original schema change mutations. This hint is then forwarded to the notifier's `update_tablet_metadata()` so that subscribers can refresh only the tablet partitions that changed.	2024-08-11 09:53:19 -04:00
Botond Dénes	5bff422b54	service/storage_service: load_tablet_metadata(): add hint parameter Allowing for reloading only those parts of the tablet metadata that were actually changed.	2024-08-11 09:53:19 -04:00
Botond Dénes	2cec0d8dd1	service/migration_listener: update_tablet_metadata(): add hint parameter The hint contains information related to what exactly changed, allowing listeners to do partial updates, instead of reloading all metadata on each notification.	2024-08-11 09:53:19 -04:00
Botond Dénes	ca302d9e28	service/raft/group0_state_machine: provide tablet change hint on topology change So that when reloading tablet state metadata from the disk, only the changed parts are reloaded.	2024-08-11 09:53:19 -04:00
Botond Dénes	806ec3244a	service/storage_service: topology_state_load(): allow providing change hint So that when reloading state from disk, only changed parts are reloaded instead of all. For now, only tablets have hints implemented.	2024-08-11 09:53:18 -04:00
Botond Dénes	bb1e733fe0	replica/tablets: add update_tablet_metadata() Allows updateng tablet metadata in-place, according to the provided hint, reading and updating only the parts that actually changed.	2024-08-11 09:52:37 -04:00
Botond Dénes	66292b4baa	replica/tablets: fix indentation Left broken from the previous patch.	2024-08-11 09:52:37 -04:00
Botond Dénes	aa378c458e	replica/tablets: extract tablet_metadata builder logic So it can be reused in a new method. Indentation is left broken deliberately, to make the patch easier to read.	2024-08-11 09:52:37 -04:00
Botond Dénes	f5976aa87b	replica/tablets: add get_tablet_metadata_change_hint() and update_tablet_metadata_change_hint() Extract a hint of what a tablet mutation changed. The hint can be later used to selectively reload only the changed parts from disk. Two variants are added: * get_tablet_metadata_change_hint() - extracts a hint from a list of tablet mutations * update_tablet_metadata_change_hint() - updates an existing hint based on a single mutation, allowing for incremental hint extraction	2024-08-11 09:52:37 -04:00
Botond Dénes	54ea71f8a6	locator/tablets: add tablet_map::clear_tablet_transition_info()	2024-08-11 09:52:37 -04:00
Botond Dénes	0254cfc7d3	locator/tablets: make tablet_metadata cheap to copy Keep lw_shared_ptr<tablet_map> in the tablet map and use COW semantics. To prevent accidental changes to shared tablet_map instances, all modifications to a tablet_map have to go through a new `mutate_tablet_map()` method, which implements the copy-modify-swap idiom.	2024-08-11 09:52:37 -04:00
Botond Dénes	fb0ab3c1fb	mutation/canonical_mutation: add key() Extracts the partition key without deserializing the entire mutation.	2024-08-11 09:52:37 -04:00
Calle Wilund	e18a855abe	extensions: Add exception types for IO extensions and handle in memtable write path Fixes #19960 Write path for sstables/commitlog need to handle the fact that IO extensions can generate errors, some of which should be considered retry-able, and some that should, similar to system IO errors, cause the node to go into isolate mode. One option would of course be for extensions to simply generate std::system_errors, with system_category and appropriate codes. But this is probably a bad idea, since it makes it more muddy at which level an error happened, as well as limits the expressibility of the error. This adds three distinct types (sharing base) distinguishing permission, availabilty and configuration errors. These are treated akin to EACCESS, ENOENT and EINVAL in disk error handler and memtable write loop. Tests updated to use and verify behaviour. Closes scylladb/scylladb#19961	2024-08-11 13:52:35 +03:00
Raphael S. Carvalho	75829d75ec	replica: Fix race between split compaction and migration After removal of rwlock (`53a6ec05ed`), the race was introduced because the order that compaction groups of a tablet are closed, is no longer deterministic. Some background first: Split compaction runs in main (unsplit) group, and adds sstable to left and right groups on completion. The race works as follow: 1) split compaction starts on main group of tablet X 2) tablet X reaches cleanup stage, so its compaction groups are closed in parallel 3) left or right group are closed before main (more likely when only main has flush work to do) 4) split compaction completes, and adds sstable to left and right 5) if e.g left is closed, adjusting backlog tracker will trigger an exception, and since that happens in row cache update's execute(), node crashes. The problem manifested as follow: [shard 0: gms] raft_topology - Initiating tablet cleanup of 5739b9b0-49d4-11ef-828f-770894013415:15 on 102a904a-0b15-4661-ba3f-f9085a5ad03c:0 ... [shard 0:strm] compaction - [Split keyspace1.standard1 009e2f80-49e5-11ef-85e3-7161200fb137] Splitting [/var/lib/scylla/data/keyspace1/...] ... [shard 0:strm] cache - Fatal error during cache update: std::out_of_range (Compaction state for table [0x600007772740] not found), at: ... -------- seastar::continuation<seastar::internal::promise_base_with_type<void>, row_cache::do_update(... -------- seastar::internal::do_with_state<std::tuple<row_cache::external_updater, std::function<seastar::future<void> ()> >, seastar::future<void> > -------- seastar::internal::coroutine_traits_base<void>::promise_type -------- seastar::internal::coroutine_traits_base<void>::promise_type -------- seastar::(anonymous namespace)::thread_wake_task -------- seastar::continuation<seastar::internal::promise_base_with_type<sstables::compaction_result>, seastar::async<sstables::compaction::run(... seastar::continuation<seastar::internal::promise_base_with_type<sstables::compaction_result>, seastar::future<sstables::compaction_resu... From the log above, it can be seen cache update failure happens under streaming sched group and during compaction completion, which was good evidence to the cause. Problem was reproduced locally with the help of tablet shuffling. Fixes: #19873. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#19987	2024-08-11 11:00:19 +03:00
Botond Dénes	1f4b9a5300	Merge 'compaction: drop compaction executors' possibility to bypass task manager' from Aleksandra Martyniuk If parent_info argument of compaction_manager::perform_compaction is std::nullopt, then created compaction executor isn't tracked by task manager. Currently, all compaction operations should by visible in task manager. Modify split methods to keep split executor in task manager. Get rid of the option to bypass task manager. Closes scylladb/scylladb#19995 * github.com:scylladb/scylladb: compaction: replace optional<task_info> with task_info param compaction: keep split executor in task manager	2024-08-11 10:26:43 +03:00
Botond Dénes	0bb1075a19	Merge 'tasks: fix task handler' from Aleksandra Martyniuk There are some bugs missed in task handler: - wait_for_task does not wait until virtual tasks are done, but returns the status immediately; - wait_for_task suffers from use after return; - get_status_recursively does not set the kind of task essentials. Fix the aforementioned. Closes scylladb/scylladb#19930 * github.com:scylladb/scylladb: test: add test to check that task handler is fixed tasks: fix task handler	2024-08-11 10:23:17 +03:00
Michał Jadwiszczak	3745d0a534	gms/feature_service: allow to suppress features This patch adds `suppress_features` error injection. It allows to revoke support for some features and it can be used to simulate upgrade process in test.py. Features to suppress are passed as injection's value, separated by `;`. Example: `PARALLELIZED_AGGREGATION;UDA_NATIVE_PARALLELIZED_AGGREGATION` Fixes scylladb/scylladb#20034 Closes scylladb/scylladb#20055	2024-08-09 19:15:19 +02:00
Kefu Chai	a78f46aad7	s3/client: customize options for input_stream before this change, we use the default options for performing read on the input. and the default options is like ```c++ struct file_input_stream_options { size_t buffer_size = 8192; ///< I/O buffer size unsigned read_ahead = 0; ///< Maximum number of extra read-ahead operations }; ``` which is not able to offer good throughput when reading from disk, when we stream to S3. so, in this change, we use options which allows better throughput. Refs `061def001d` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20074	2024-08-09 11:52:30 +03:00
Dawid Medrek	e5d01d4000	db/hints: Make commitlog use commitlog IO scheduling group Before these changes, we didn't specify which I/O scheduling group commitlog instances in hinted handoff should use. In this commit, we set it explicitly to the commitlog scheduling group. The rationale for this choice is the fact we don't want to cause a bottleneck on the write path -- if hints are written too slowly, new incoming mutations (NOT hints) might be rejected due to a too high number of hints currently being written to disk; see `storage_proxy::create_write_response_handler_helper()` for more context. Fixes scylladb/scylladb#18654 Closes scylladb/scylladb#19170	2024-08-08 16:14:07 +02:00
Piotr Dulikowski	b72906518f	Merge 'service levels: update connections parameters automatically' from Michał Jadwiszczak This patch makes all cql connections update theirs service level parameters automatically when: - any service level is created or changed - one role is granted to another - any service level is attached to/detached from a role First of all, the patch defines what a service level and an effective service level are `938aa10509`. No new type of service levels are introduced, the commit only clarifies definitions and names what an effective service level is. (Effective service level is created by merging all service levels which are attached to all roles granted to the user. It represents exact values of connection's parameters.) Previously, to find an effective service level of a user, it required O(n) internal queries: O(n) queries to recursively find all granted roles (`standard_role_manager::query_granted()`) and a query for each role to get its service level (`standard_role_manager::get_attribute()`, which sums to O(n) queries). Because we want to reload SL parameters for all opened cql connections, we don't want to do O(n) queries for every connection, every time we create or change any service level/grant one role to another/attach or detach a service level to/from a role. To speed it up, the patch adds another layer of service level controller cache, which stored `role_name -> effective_service_level` mapping. This way finding a effective service level for a role is only a lookup to a map. Building the new cache requires only 2 queries: one to obtain all role hierarchy one to get all roles' service level. Fixes scylladb/scylladb#12923 Closes scylladb/scylladb#19085 * github.com:scylladb/scylladb: test/auth_cluster/test_raft_service_levels: add test for automatic connection update api/cql_server_test: add CQL server testing API transport/cql_server: subscribe to sl effective cache reloaded transport/controller: coroutinize `subscribe_server` and `unsubscribe_server` transport/cql_server: add method to update service level params on all connections generic_server: use async function in `for_each_gently()` service/qos/sl_controller: use effective service levels cache service/qos/service_level_controller: notify subscribers on effective cache reloaded service/raft/group0_state_machine: update effective service levels cache service/topology_coordinator: migrate service levels before auth service/qos/service_level_controller: effective service levels cache utils/sorting: allow to pass any container as verticies service/qos/service_level_controller: replace shard check to assert service/qos: define effective service level service/qos/qos_common: use const reference in `init_effective_names()` service/qos/service_level_controller: remove unused field auth: return map of directly granted roles test/auth/test_auth_v2_migration: create sl1 in the test	2024-08-08 15:31:04 +02:00
Anna Stuchlik	a1b4357765	doc: update Raft info in 6.1 This commit updates the Raft information regarding the Raft verification procedure. In 6.1, the procedure is no longer related to the upgrade. Fixes https://github.com/scylladb/scylladb/issues/19932 Closes scylladb/scylladb#20040	2024-08-08 11:25:50 +02:00
PeterFlockhart	0f9c6d24cf	Update SELECT grammar to define group_by_clause explicitly Closes scylladb/scylladb#20046	2024-08-08 12:23:20 +03:00
Avi Kivity	12c68bcf75	Merge 'querier: include cell stats in page stats' from Botond Dénes We have two mechanism to give visibility into reads having to process many tombstones: * a warning in the logs, triggered if a read processed more the `tombstone_warn_threshold` dead rows/tombstones * a trace message, which includes stats of the amount of rows in the page, including the amount of live and dead rows as well as tombstones This series extends this to also include information on cells, so we have visibility into the case where a read has to process an excessive amount of cell tombstones (mainly because of collections). A log line is now also logged if the amount of dead cells/tombstones in the page exceeds `tombstone_warn_threshold`. The trace message is also extended to contain cell stats. The `tombstone_warn_threshold` log lines now receive a 10s rate-limit to avoid excessive log spamming. The rate-limit is separate for the row and cell logs. Example of the new log line (`tombstone_warn_threshold=10` ): ``` WARN 2024-05-30 07:56:44,979 [shard 0:stmt] querier - Read 98 live cells and 126 dead cells/tombstones for system_schema.scylla_tables <partition-range-scan> (-inf, +inf) (see tombstone_warn_threshold) ``` Example of the new tracing message: ``` Page stats: 1 partition(s), 0 static row(s) (0 live, 0 dead), 1 clustering row(s) (1 live, 0 dead), 0 range tombstone(s) and 13 cell(s) (1 live, 12 dead) [shard 0] \| 2024-05-30 08:13:19.690803 \| 127.0.0.1 \| 6114 \| 127.0.0.1 ``` Fixes: https://github.com/scylladb/scylladb/issues/18996 Improvement, not a backport candidate. Closes scylladb/scylladb#18997 * github.com:scylladb/scylladb: test/boost: mutation_test: add test for cell compaction stats mutation/compact_and_expire_result: drop operator bool() querier: consume_page(): add rate-limiting to tombstone warnings querier: consume_page(): add cell stats to page stats trace message querier: consume_page(): add tombstone warning for cell tombstones querier: consume_page(): extract code which logs tombstone warning mutation/mutation_compactor: collect and aggregate cell compaction stats mutation: row::compact_and_expire(): use compact_and_expire_result collection_mutation: compact_and_expire(): use compact_and_expire_result mutation: introduce compact_and_expire_result	2024-08-08 12:16:13 +03:00
Calle Wilund	d6742e9bce	distributed_loader: Remove load_prio_keyspaces Fixes #13334 All required code paths (see enterprise) now uses extensions::is_extension_internal_keyspace. The old mechanism can be removed. One less global var. Closes scylladb/scylladb#20047	2024-08-08 12:10:27 +03:00
Avi Kivity	db77b5bd03	Merge 'convert the rest of `test/boost/sstable_test.cc` to co-routines and seastar::thread' from Laszlo Ersek This is a followup to #19937, for #19803. See in particular [this comment](https://github.com/scylladb/scylladb/issues/19803#issuecomment-2258371923). The primary conversion target is coroutines. However, while coroutines are the most convenient style, they are only infrequently usable in this case, for the following reasons: - Wherever we have a `future::finally()` that calls a cleanup function that returns a future (which must be awaited), we cannot use `co_await`. We can only use `seastar::async()` with `deferred_close` or `defer()`. - The code passes lots of lambdas, and `co_await` cannot be used in lambdas. First, I tried, and the compiler rejects it; second, a capturing lambda that is a coroutine is a trap [[1]](https://devblogs.microsoft.com/oldnewthing/20211103-00/?p=105870) [[2]](https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#Rcoro-capture). In most cases, I didn't have to use naked `seastar::async()`; there were specialized wrappers in place already. Thus, most of the changes target `seastar::thread` context under existent `seastar::async()` wrappers, and only a few functions end up as coroutines. The last patch in the series (`test/sstable: remove useless variable from promoted_index_read()`) is an independent micro-cleanup, the opportunity for which I thought to have noticed while reading the code. The tail of `test/boost/sstable_test.cc` (the stuff following `promoted_index_read()`) is already written as `seastar::thread`. That's already better (for readability) than future chaining; but could have I perhaps further converted those functions to coroutines? My answer was "no": - Some of the candidate functions relied on deferred cleanups that might need to yield (all three variants of `count_rows()`). - Some had been implemented by passing lambdas to wrappers of `seastar::async()` (`sub_partition_read()`, `sub_partitions_read()`). - The test case `test_skipping_in_compressed_stream()` initially looked promising for co-routinization (from its starting point `seastar::async()`), because it seemed to employ no deferred cleanup (that might need to yield). However, the function uses three lambdas that must be able to yield internally, and one of those (`make_is()`) is even capturing. - The rest (`test_empty_key_view_comparison()`, `test_parse_path_good()`, `test_parse_path_bad()`) was synchronous code to begin with. ``` test/boost/sstable_test.cc \| 188 +++++++++----------- 1 file changed, 83 insertions(+), 105 deletions(-) ``` Refactoring; no backport needed. Closes scylladb/scylladb#20011 * github.com:scylladb/scylladb: test/sstable: remove useless variable from promoted_index_read() test/sstable: rewrite promoted_index_read() with async() test/sstable: unfuturize lambda invocation in test_using_reusable_sst() test/sstable: rewrite wrong_range() with async() test/sstable: simplify not_find_key_composite_bucket0() under test_using_reusable_sst() test/sstable: rewrite full_index_search() with async() test/sstable: simplify find_key(), all_in_place() under test_using_reusable_sst() test/sstable: rewrite (un)compressed_random_access_read() with async() test/sstable: simplify write_and_validate_sst() test/sstable: simplify check_toc_func() under async() test/sstable: simplify check_statistics_func() under async() test/sstable: simplify check_summary_func() under async() test/sstable: coroutinize check_component_integrity() test/sstable: rewrite write_sst_info() with async() test/sstable: simplify missing_summary_first_last_sane() test/sstable: coroutinize summary_query_fail() test/sstable: rewrite summary_query() with async() test/sstable: coroutinize (simple/composite)_index_read() test/sstable: rewrite index_read() with async() test/sstable: rewrite test_using_reusable_sst() with async() test/sstable: rewrite test_using_working_sst() with async()	2024-08-08 11:55:37 +03:00
Michał Jadwiszczak	b62a8b747a	test/auth_cluster/test_raft_service_levels: add test for automatic connection update	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	870bdaa6b1	api/cql_server_test: add CQL server testing API Add a CQL server testing API with and endpoint to dump service level parameters of all CQL connections. This endpoint will be later used to test functionality of automated updating CQL connections parameters.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	c3e8778ad4	transport/cql_server: subscribe to sl effective cache reloaded Make cql server (but not maintenance server) is subscribed to qos configuration change. Trigger update of connections' service level params on effective cache reloaded event. It's not done on maintenance server because it doesn't support role hierarchy nor attaching service levels.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	b2f2288292	transport/controller: coroutinize `subscribe_server` and `unsubscribe_server`	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	4af90726b6	transport/cql_server: add method to update service level params on all connections Trigger update of service level param on every cql connection. In enterprise, the method needs also to update connections' scheduling group.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	324b3c43c0	generic_server: use async function in `for_each_gently()` In the following patch, we will add a method to update service levels parameters for each cql connections. To support this, this patch allows to pass async function as a parameter to `for_each_gently()` method.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	93e6de0d04	service/qos/sl_controller: use effective service levels cache Use cache to quickly access effective service level of a role.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	664a1913c6	service/qos/service_level_controller: notify subscribers on effective cache reloaded Add event representing reload of effective service level cache and notify subscribers when the cache is reloaded.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	5f8132c13c	service/raft/group0_state_machine: update effective service levels cache Updates to `system.role_members` and `system.role_attributes` affect effective service levels cache, so applying mutations to those tables should reload the effective SL cache.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	7b28df9b4d	service/topology_coordinator: migrate service levels before auth Effective service level cache will be updated when mutations are applied to some of the auth tables. But the effective cache depends on first-level service levels cache, so service levels data should be migrated before auth data.	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	842573d0af	service/qos/service_level_controller: effective service levels cache Add a second layer of service_level_controller cache which contains role name -> effective service level mapping. To build the mapping, controller uses first cache layer (service level name -> service level) and 2 queries to auth tables (one to `roles` and one to `role_members`).	2024-08-08 10:42:09 +02:00
Michał Jadwiszczak	4922f87fed	utils/sorting: allow to pass any container as verticies The container containing all verticies doesn't have to be a vector. Allowing to pass any container that meet conditions, will make to function more flexible.	2024-08-08 10:42:09 +02:00

1 2 3 4 5 ...

43855 Commits