scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 19:35:12 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	66d72e010c	distributed_loader: Lock table via global table ptr The lock_table() method needs database, ks and cf to find the table on all shards. The same can be achieved with the help of global_table_ptr thing that all the core callers already have at hand. There's a test that doesn't have global table, but it can get one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20139	2024-08-14 20:53:21 +03:00
Pavel Emelyanov	7e3e5cfcad	sstable_directory: Simplify special-purpose local-only constructor Typically the sstable_directory is constructed out of a table object. Some code, namely tests and schema-loader, don't have table at hand and construct directory out of schema, sharder, path-to-sstables, etc. This code doesn't work with any storage options other than local ones, so there's no need (yet) to carry this argument over. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#20138	2024-08-14 20:22:50 +03:00
Łukasz Paszkowski	ba2f037af5	mutation_partition: drop reverse parameter in compact_for_query The reverse parameter is no longer used with native reverse reads. The row ranges are provided in native reverse order together with a reversed schema, thus the reverse parameter remains false all the time and can be dropped.	2024-08-13 10:07:12 +02:00
Łukasz Paszkowski	158b994676	query::trim_clustering_row_ranges_to: require reversed schema for native reversed ranges Simplify the implementation: for clustering key ranges in native reversed format, require a reversed table schema. Trimming native reversed clustering key ranges requires a reversed schema to be passed in. Thus, the reverse flag is no longer required as it would always be set to false.	2024-08-13 10:07:10 +02:00
Łukasz Paszkowski	da95f44adc	readers: Use reversed schema and native reversed slices The reconcilable_result is built as it would be constructed for forward read queries for tables with reversed order. Mutations constructed for reversed queries are consumed forward. Drop overloaded reversed functions that reverse read_command and reconcilable_result directly and keep only those requiring smart pointers. They are not used any more.	2024-08-13 10:03:46 +02:00
Laszlo Ersek	607abe96e8	test/sstable: merge test_using_reusable_sst*() All lambdas passed to test_using_reusable_sst() conform to the prototype void (test_env&, sstable_ptr) All lambdas passed to test_using_reusable_sst_returning() conform to the prototype NON_VOID (test_env&, sstable_ptr) The common parameter list of both prototypes can be expressed with the concept std::invocable<test_env&, sstable_ptr> Once a "Func" template parameter (i.e., function type) satisfying this concept is taken, then "Func"'s void or non-void return type can be commonly expressed with std::invoke_result_t<Func, test_env&, sstable_ptr> In turn, test_env::do_with_async_returning<...> can be instantiated with this return type, even if it happens to be "void". ([stmt.return] specifies, "[a] return statement with an operand of type void shall be used only in a function that has a cv void return type", meaning that return func(env) will do the right thing in the body of test_env::do_with_async_returning<void>().) Merge test_using_reusable_sst() and test_using_reusable_sst_returning() into one. Preserve the function name from the former, and the test_env::do_with_async_returning<...>() call from the latter. Suggested-by: Avi Kivity <avi@scylladb.com> Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com> Closes scylladb/scylladb#20090	2024-08-12 17:52:01 +03:00
Nikos Dragazis	90363ce802	test: Test the SSTable validation API against malformed SSTables Unit testing for the SSTable validation API happens in `sstable_validate_test`. Currently, this test checks the API against some invalid SSTables with out-of-order clustering rows and out-of-order partitions. However, both are types of content-level corruption that do not trigger `malformed_sstable_exception` errors. Extend the test to cover cases of file-level corruption as well, i.e., cases that would raise a `malformed_sstable_exception`. Construct an SSTable with an invalid checksum to trigger this. This is part of the effort to improve scrub to handle all kinds of corruption. Fixes scylladb/scylladb#19057 Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com> Closes scylladb/scylladb#20096	2024-08-12 15:09:58 +03:00
Avi Kivity	318278ff92	Merge 'tablets: reload only changed metadata' from Botond Dénes Currently, each change to tablet metadata triggers a full metadata reload from disk. This is very wasteful, especially if the metadata change affects only a single row in the `system.tablets` table. This is the case when the tablet load balancer triggers a migration: it affects a single row in the table, but today it triggers a full reload. We expect tablet count to potentially grow to thousands and beyond and the overhead of this full reload can become significant. This PR makes tablet metadata reload partial: instead of reloading all metadata on topology or schema changes, reload only the partitions that are affected by the change and copy the rest from the in-memory state. This is done with two passes: first the change mutations are scanned and a hint is produced. This hint is then passed down to the reload code, which uses it to reload only the parts (rows/partitions) of the metadata that have actually changed. The performance difference between full reload and partial reload is quite drastic: ``` INFO 2024-07-25 05:06:27,347 [shard 0:stat] testlog - Tablet metadata reload: full 616.39ms partial 0.18ms ``` This was measured with the modified (by this PR) `perf_tablets`, which creates 100 tables, each with 2K tablets. The test was modified to change a single tablet, then do a full and partial reload respectively, measuring the time it takes for each. Fixes: #15294 New feature, no backport needed. 
Closes scylladb/scylladb#15541 * github.com:scylladb/scylladb: test/perf/perf_tablets: add tablet metadata reload perf measurement test/boost/tablets_test: add test for partial tablet metadata updates db/schema_tables: pass tablet hint to update_tablet_metadata() service/storage_service: load_tablet_metadata(): add hint parameter service/migration_listener: update_tablet_metadata(): add hint parameter service/raft/group0_state_machine: provide tablet change hint on topology change service/storage_service: topology_state_load(): allow providing change hint replica/tablets: add update_tablet_metadata() replica/tablets: fix indentation replica/tablets: extract tablet_metadata builder logic replica/tablets: add get_tablet_metadata_change_hint() and update_tablet_metadata_change_hint() locator/tablets: add tablet_map::clear_tablet_transition_info() locator/tablets: make tablet_metadata cheap to copy mutation/canonical_mutation: add key()	2024-08-11 21:27:18 +03:00
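The two-pass idea in the merge above (produce a hint from the change, then reload only what the hint names and copy the rest) can be sketched in isolation. Everything below is a simplified assumption for illustration: `table_id`, `tablet_metadata`, `change_hint` and `partial_reload()` are hypothetical and do not mirror the real `replica/tablets` code.

```cpp
#include <map>
#include <set>
#include <string>

// Hypothetical types: a table id maps to an opaque per-table description.
using table_id = int;
using tablet_metadata = std::map<table_id, std::string>;

// Output of the first pass: which tables the change mutations touched.
struct change_hint {
    std::set<table_id> changed_tables;
};

// Second pass: start from a copy of the in-memory state and re-read only
// the hinted entries from the on-disk state.
tablet_metadata partial_reload(const tablet_metadata& in_memory,
                               const tablet_metadata& on_disk,
                               const change_hint& hint) {
    tablet_metadata result = in_memory;
    for (table_id id : hint.changed_tables) {
        auto it = on_disk.find(id);
        if (it != on_disk.end()) {
            result[id] = it->second;   // entry changed or was added
        } else {
            result.erase(id);          // entry no longer exists on disk
        }
    }
    return result;
}
```

The cost of this function scales with the size of the hint rather than with the total amount of metadata, which is the property the commit message measures.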
Botond Dénes	65eee200b2	test/boost/tablets_test: add test for partial tablet metadata updates	2024-08-11 09:53:19 -04:00
Botond Dénes	2cec0d8dd1	service/migration_listener: update_tablet_metadata(): add hint parameter The hint contains information related to what exactly changed, allowing listeners to do partial updates, instead of reloading all metadata on each notification.	2024-08-11 09:53:19 -04:00
Botond Dénes	0254cfc7d3	locator/tablets: make tablet_metadata cheap to copy Keep lw_shared_ptr<tablet_map> in the tablet map and use COW semantics. To prevent accidental changes to shared tablet_map instances, all modifications to a tablet_map have to go through a new `mutate_tablet_map()` method, which implements the copy-modify-swap idiom.	2024-08-11 09:52:37 -04:00
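The copy-on-write scheme in the entry above can be illustrated with standard-library types. This is a sketch under stated assumptions: `std::shared_ptr` stands in for Seastar's `lw_shared_ptr`, and the `tablet_map` and `tablet_metadata` shown here are drastically simplified, hypothetical versions of the real classes.

```cpp
#include <map>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical, simplified tablet_map.
struct tablet_map {
    std::vector<int> replicas;
};

class tablet_metadata {
    // Copying tablet_metadata copies only the pointers, so it is cheap.
    std::map<int, std::shared_ptr<const tablet_map>> _tablets;
public:
    void set(int table, tablet_map m) {
        _tablets[table] = std::make_shared<const tablet_map>(std::move(m));
    }
    const tablet_map& get(int table) const { return *_tablets.at(table); }

    // Copy-modify-swap: the shared instance is never modified in place, so
    // other tablet_metadata copies keep seeing the old tablet_map.
    template <typename Func>
    void mutate_tablet_map(int table, Func&& func) {
        auto copy = std::make_shared<tablet_map>(*_tablets.at(table));
        func(*copy);
        _tablets[table] = std::move(copy);
    }
};
```

Storing pointers to `const tablet_map` is what forces every modification through `mutate_tablet_map()` in this sketch.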
Calle Wilund	e18a855abe	extensions: Add exception types for IO extensions and handle in memtable write path Fixes #19960 The write path for sstables/commitlog needs to handle the fact that IO extensions can generate errors, some of which should be considered retry-able, and some that should, similar to system IO errors, cause the node to go into isolate mode. One option would of course be for extensions to simply generate std::system_errors, with system_category and appropriate codes. But this is probably a bad idea, since it makes it less clear at which level an error happened, and limits the expressiveness of the error. This adds three distinct types (sharing a base) distinguishing permission, availability and configuration errors. These are treated akin to EACCES, ENOENT and EINVAL in the disk error handler and memtable write loop. Tests updated to use and verify behaviour. Closes scylladb/scylladb#19961	2024-08-11 13:52:35 +03:00
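A small sketch of the shape described above: three exception types sharing a base, each handled like the errno value the commit message compares it to. The class names and `run_and_classify()` are hypothetical. The actual type names and the retry/isolate policy are not given in the commit message and are not reproduced here.

```cpp
#include <cerrno>
#include <stdexcept>
#include <string>

// Hypothetical names: a common base plus three distinct error kinds.
struct extension_storage_exception : std::runtime_error {
    explicit extension_storage_exception(const std::string& msg) : std::runtime_error(msg) {}
};
struct extension_permission_error : extension_storage_exception {
    explicit extension_permission_error(const std::string& msg) : extension_storage_exception(msg) {}
};
struct extension_availability_error : extension_storage_exception {
    explicit extension_availability_error(const std::string& msg) : extension_storage_exception(msg) {}
};
struct extension_configuration_error : extension_storage_exception {
    explicit extension_configuration_error(const std::string& msg) : extension_storage_exception(msg) {}
};

// Runs op() and reports which errno value the failure is treated like;
// returns 0 when op() succeeds. Other exceptions propagate unchanged.
template <typename Func>
int run_and_classify(Func&& op) {
    try {
        op();
        return 0;
    } catch (const extension_permission_error&) {
        return EACCES;
    } catch (const extension_availability_error&) {
        return ENOENT;
    } catch (const extension_configuration_error&) {
        return EINVAL;
    }
}
```

Keeping dedicated types, instead of throwing `std::system_error` from the extension, preserves the information that the error came from the extension layer.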
Botond Dénes	1f4b9a5300	Merge 'compaction: drop compaction executors' possibility to bypass task manager' from Aleksandra Martyniuk If parent_info argument of compaction_manager::perform_compaction is std::nullopt, then created compaction executor isn't tracked by task manager. Currently, all compaction operations should be visible in task manager. Modify split methods to keep split executor in task manager. Get rid of the option to bypass task manager. Closes scylladb/scylladb#19995 * github.com:scylladb/scylladb: compaction: replace optional<task_info> with task_info param compaction: keep split executor in task manager	2024-08-11 10:26:43 +03:00
Piotr Dulikowski	b72906518f	Merge 'service levels: update connections parameters automatically' from Michał Jadwiszczak This patch makes all cql connections update their service level parameters automatically when: - any service level is created or changed - one role is granted to another - any service level is attached to/detached from a role First of all, the patch defines what a service level and an effective service level are `938aa10509`. No new type of service levels are introduced, the commit only clarifies definitions and names what an effective service level is. (Effective service level is created by merging all service levels which are attached to all roles granted to the user. It represents exact values of connection's parameters.) Previously, finding the effective service level of a user required O(n) internal queries: O(n) queries to recursively find all granted roles (`standard_role_manager::query_granted()`) and a query for each role to get its service level (`standard_role_manager::get_attribute()`, which sums to O(n) queries). Because we want to reload SL parameters for all opened cql connections, we don't want to do O(n) queries for every connection, every time we create or change any service level/grant one role to another/attach or detach a service level to/from a role. To speed it up, the patch adds another layer of service level controller cache, which stores a `role_name -> effective_service_level` mapping. This way finding an effective service level for a role is only a lookup in a map. Building the new cache requires only 2 queries: one to obtain the whole role hierarchy and one to get all roles' service levels. 
Fixes scylladb/scylladb#12923 Closes scylladb/scylladb#19085 * github.com:scylladb/scylladb: test/auth_cluster/test_raft_service_levels: add test for automatic connection update api/cql_server_test: add CQL server testing API transport/cql_server: subscribe to sl effective cache reloaded transport/controller: coroutinize `subscribe_server` and `unsubscribe_server` transport/cql_server: add method to update service level params on all connections generic_server: use async function in `for_each_gently()` service/qos/sl_controller: use effective service levels cache service/qos/service_level_controller: notify subscribers on effective cache reloaded service/raft/group0_state_machine: update effective service levels cache service/topology_coordinator: migrate service levels before auth service/qos/service_level_controller: effective service levels cache utils/sorting: allow to pass any container as verticies service/qos/service_level_controller: replace shard check to assert service/qos: define effective service level service/qos/qos_common: use const reference in `init_effective_names()` service/qos/service_level_controller: remove unused field auth: return map of directly granted roles test/auth/test_auth_v2_migration: create sl1 in the test	2024-08-08 15:31:04 +02:00
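The cache described in the merge above can be sketched as a pure function over the results of the two queries: the role hierarchy and the per-role service levels. All names below are hypothetical, and the merge rule (smallest timeout wins) is an assumption made only so the example has something to merge. The commit message does not state the real rule.

```cpp
#include <algorithm>
#include <map>
#include <optional>
#include <set>
#include <string>
#include <vector>

// Hypothetical service level with a single parameter.
struct service_level {
    int timeout_ms;
};

using role_graph = std::map<std::string, std::vector<std::string>>; // role -> directly granted roles
using role_levels = std::map<std::string, service_level>;           // role -> attached service level

// ASSUMPTION: illustrative merge rule only, the smallest timeout wins.
service_level merge_levels(const service_level& a, const service_level& b) {
    return service_level{std::min(a.timeout_ms, b.timeout_ms)};
}

// Collects role and every role granted to it, directly or transitively.
void collect_granted(const std::string& role, const role_graph& graph, std::set<std::string>& seen) {
    if (!seen.insert(role).second) {
        return; // already visited, also guards against cycles
    }
    auto it = graph.find(role);
    if (it == graph.end()) {
        return;
    }
    for (const auto& granted : it->second) {
        collect_granted(granted, graph, seen);
    }
}

// Builds role_name -> effective_service_level from the two query results.
std::map<std::string, service_level> build_effective_cache(const role_graph& graph, const role_levels& levels) {
    std::set<std::string> roles;
    for (const auto& entry : graph) { roles.insert(entry.first); }
    for (const auto& entry : levels) { roles.insert(entry.first); }

    std::map<std::string, service_level> cache;
    for (const auto& role : roles) {
        std::set<std::string> reachable;
        collect_granted(role, graph, reachable);
        std::optional<service_level> effective;
        for (const auto& r : reachable) {
            auto it = levels.find(r);
            if (it == levels.end()) {
                continue;
            }
            effective = effective ? merge_levels(*effective, it->second) : it->second;
        }
        if (effective) {
            cache.emplace(role, *effective);
        }
    }
    return cache;
}
```

Once the map is built, resolving a connection's parameters is a single lookup by role name, with no further queries.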
Avi Kivity	12c68bcf75	Merge 'querier: include cell stats in page stats' from Botond Dénes We have two mechanisms to give visibility into reads having to process many tombstones: * a warning in the logs, triggered if a read processed more than `tombstone_warn_threshold` dead rows/tombstones * a trace message, which includes stats of the amount of rows in the page, including the amount of live and dead rows as well as tombstones This series extends this to also include information on cells, so we have visibility into the case where a read has to process an excessive amount of cell tombstones (mainly because of collections). A log line is now also logged if the amount of dead cells/tombstones in the page exceeds `tombstone_warn_threshold`. The trace message is also extended to contain cell stats. The `tombstone_warn_threshold` log lines now receive a 10s rate-limit to avoid excessive log spamming. The rate-limit is separate for the row and cell logs. Example of the new log line (`tombstone_warn_threshold=10` ): ``` WARN 2024-05-30 07:56:44,979 [shard 0:stmt] querier - Read 98 live cells and 126 dead cells/tombstones for system_schema.scylla_tables <partition-range-scan> (-inf, +inf) (see tombstone_warn_threshold) ``` Example of the new tracing message: ``` Page stats: 1 partition(s), 0 static row(s) (0 live, 0 dead), 1 clustering row(s) (1 live, 0 dead), 0 range tombstone(s) and 13 cell(s) (1 live, 12 dead) [shard 0] \| 2024-05-30 08:13:19.690803 \| 127.0.0.1 \| 6114 \| 127.0.0.1 ``` Fixes: https://github.com/scylladb/scylladb/issues/18996 Improvement, not a backport candidate. 
Closes scylladb/scylladb#18997 * github.com:scylladb/scylladb: test/boost: mutation_test: add test for cell compaction stats mutation/compact_and_expire_result: drop operator bool() querier: consume_page(): add rate-limiting to tombstone warnings querier: consume_page(): add cell stats to page stats trace message querier: consume_page(): add tombstone warning for cell tombstones querier: consume_page(): extract code which logs tombstone warning mutation/mutation_compactor: collect and aggregate cell compaction stats mutation: row::compact_and_expire(): use compact_and_expire_result collection_mutation: compact_and_expire(): use compact_and_expire_result mutation: introduce compact_and_expire_result	2024-08-08 12:16:13 +03:00
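The warning behaviour described in the merge above combines a threshold check with a rate limit that is kept separately for rows and cells. A minimal sketch, assuming a hypothetical `rate_limiter` class and `should_warn()` helper (the real code in `querier` is not shown in the log and may look different):

```cpp
#include <chrono>
#include <cstdint>
#include <optional>

// Hypothetical helper: allows at most one event per interval.
class rate_limiter {
public:
    using clock = std::chrono::steady_clock;

    explicit rate_limiter(clock::duration interval) : _interval(interval) {}

    bool allow(clock::time_point now) {
        if (_last && now - *_last < _interval) {
            return false;
        }
        _last = now;
        return true;
    }

private:
    clock::duration _interval;
    std::optional<clock::time_point> _last;
};

// Warn only if the dead count exceeds the threshold and the limiter for
// this kind of warning allows it. The limiter is consulted only after the
// threshold check, so pages below the threshold do not consume the budget.
bool should_warn(std::uint64_t dead, std::uint64_t threshold,
                 rate_limiter& limiter, rate_limiter::clock::time_point now) {
    return dead > threshold && limiter.allow(now);
}
```

Using one `rate_limiter` instance for row warnings and another for cell warnings gives the separate limits the commit message mentions.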
Avi Kivity	db77b5bd03	Merge 'convert the rest of `test/boost/sstable_test.cc` to co-routines and seastar::thread' from Laszlo Ersek This is a followup to #19937, for #19803. See in particular [this comment](https://github.com/scylladb/scylladb/issues/19803#issuecomment-2258371923). The primary conversion target is coroutines. However, while coroutines are the most convenient style, they are only infrequently usable in this case, for the following reasons: - Wherever we have a `future::finally()` that calls a cleanup function that returns a future (which must be awaited), we cannot use `co_await`. We can only use `seastar::async()` with `deferred_close` or `defer()`. - The code passes lots of lambdas, and `co_await` cannot be used in lambdas. First, I tried, and the compiler rejects it; second, a capturing lambda that is a coroutine is a trap [[1]](https://devblogs.microsoft.com/oldnewthing/20211103-00/?p=105870) [[2]](https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#Rcoro-capture). In most cases, I didn't have to use naked `seastar::async()`; there were specialized wrappers in place already. Thus, most of the changes target `seastar::thread` context under existing `seastar::async()` wrappers, and only a few functions end up as coroutines. The last patch in the series (`test/sstable: remove useless variable from promoted_index_read()`) is an independent micro-cleanup, the opportunity for which I thought to have noticed while reading the code. The tail of `test/boost/sstable_test.cc` (the stuff following `promoted_index_read()`) is already written as `seastar::thread`. That's already better (for readability) than future chaining; but could I perhaps have further converted those functions to coroutines? My answer was "no": - Some of the candidate functions relied on deferred cleanups that might need to yield (all three variants of `count_rows()`). 
- Some had been implemented by passing lambdas to wrappers of `seastar::async()` (`sub_partition_read()`, `sub_partitions_read()`). - The test case `test_skipping_in_compressed_stream()` initially looked promising for co-routinization (from its starting point `seastar::async()`), because it seemed to employ no deferred cleanup (that might need to yield). However, the function uses three lambdas that must be able to yield internally, and one of those (`make_is()`) is even capturing. - The rest (`test_empty_key_view_comparison()`, `test_parse_path_good()`, `test_parse_path_bad()`) was synchronous code to begin with. ``` test/boost/sstable_test.cc \| 188 +++++++++----------- 1 file changed, 83 insertions(+), 105 deletions(-) ``` Refactoring; no backport needed. Closes scylladb/scylladb#20011 * github.com:scylladb/scylladb: test/sstable: remove useless variable from promoted_index_read() test/sstable: rewrite promoted_index_read() with async() test/sstable: unfuturize lambda invocation in test_using_reusable_sst() test/sstable: rewrite wrong_range() with async() test/sstable: simplify not_find_key_composite_bucket0() under test_using_reusable_sst() test/sstable: rewrite full_index_search() with async() test/sstable: simplify find_key(), all_in_place() under test_using_reusable_sst() test/sstable: rewrite (un)compressed_random_access_read() with async() test/sstable: simplify write_and_validate_sst() test/sstable: simplify check_toc_func() under async() test/sstable: simplify check_statistics_func() under async() test/sstable: simplify check_summary_func() under async() test/sstable: coroutinize check_component_integrity() test/sstable: rewrite write_sst_info() with async() test/sstable: simplify missing_summary_first_last_sane() test/sstable: coroutinize summary_query_fail() test/sstable: rewrite summary_query() with async() test/sstable: coroutinize (simple/composite)_index_read() test/sstable: rewrite index_read() with async() test/sstable: rewrite 
test_using_reusable_sst() with async() test/sstable: rewrite test_using_working_sst() with async()	2024-08-08 11:55:37 +03:00
Michał Jadwiszczak	664a1913c6	service/qos/service_level_controller: notify subscribers on effective cache reloaded Add event representing reload of effective service level cache and notify subscribers when the cache is reloaded.	2024-08-08 10:42:09 +02:00
Tomasz Grabiec	1a4baa5f9e	tablets: Do not allocate tablets on nodes being decommissioned If a tablet-based table is created concurrently with a node being decommissioned, after tablets are already drained, the new table may be permanently left with replicas on a node which is no longer in the topology. That creates an immediate availability risk because we are running with one replica down. This also violates invariants about replica placement and this state cannot be fixed by topology operations. One effect is that this will lead to load balancer failure which will inhibit progress of any topology operations: load_balancer - Replica 154b0380-1dd2-11b2-9fdd-7156aa720e1a:0 of tablet 7e03dd40-537b-11ef-9fdd-7156aa720e1a:1 not found in topology, at: ... Fixes #20032 Closes scylladb/scylladb#20053	2024-08-07 18:52:58 +03:00
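The rule in the entry above amounts to excluding nodes that are leaving from the set of allocation candidates. A minimal sketch of that filter, with hypothetical `node_state`, `node` and `allocation_candidates()` names that are not taken from the real tablet allocator:

```cpp
#include <string>
#include <vector>

// Hypothetical node states and node record.
enum class node_state { normal, decommissioning, left };

struct node {
    std::string host_id;
    node_state state;
};

// Only nodes in the normal state may receive replicas of a new table;
// nodes that are being decommissioned or have left are skipped.
std::vector<std::string> allocation_candidates(const std::vector<node>& topology) {
    std::vector<std::string> candidates;
    for (const auto& n : topology) {
        if (n.state == node_state::normal) {
            candidates.push_back(n.host_id);
        }
    }
    return candidates;
}
```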
Botond Dénes	78206a3fad	test/boost: mutation_test: add test for cell compaction stats	2024-08-06 08:56:28 -04:00
Botond Dénes	259a59bd64	mutation/compact_and_expire_result: drop operator bool() Having an operator bool() on this struct is counter-intuitive, so this commit drops it and migrates any remaining users to bool is_live(). The purpose of this operator bool() was to help incrementally replace the previous bool return type with compact_and_expire_result in the compact_and_expire() call stack. Now that this is done, it has served its purpose.	2024-08-06 08:56:28 -04:00
Avi Kivity	aa1270a00c	treewide: change assert() to SCYLLA_ASSERT() assert() is traditionally disabled in release builds, but not in scylladb. This hasn't caused problems so far, but the latest abseil release includes a commit [1] that causes a 1000 insn/op regression when NDEBUG is not defined. Clearly, we must move towards a build system where NDEBUG is defined in release builds. But we can't just define it blindly without vetting all the assert() calls, as some were written with the expectation that they are enabled in release mode. To solve the conundrum, change all assert() calls to a new SCYLLA_ASSERT() macro in utils/assert.hh. This macro is always defined and is not conditional on NDEBUG, so we can later (after vetting Seastar) enable NDEBUG in release mode. [1] `66ef711d68` Closes scylladb/scylladb#20006	2024-08-05 08:23:35 +03:00
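An assertion macro that stays active regardless of `NDEBUG`, as described in the entry above, can be sketched as follows. `ALWAYS_ASSERT` and `checked_div()` are hypothetical names for illustration. The real `SCYLLA_ASSERT()` lives in `utils/assert.hh` and its definition is not shown in the log.

```cpp
#include <cstdio>
#include <cstdlib>

// Hypothetical always-on assertion. Unlike assert(), this definition never
// consults NDEBUG, so the condition is evaluated in every build mode.
#define ALWAYS_ASSERT(cond)                                              \
    do {                                                                 \
        if (!(cond)) {                                                   \
            std::fprintf(stderr, "%s:%d: assertion failed: %s\n",        \
                         __FILE__, __LINE__, #cond);                     \
            std::abort();                                                \
        }                                                                \
    } while (0)

// Example use: the check remains in place even when NDEBUG is defined.
int checked_div(int a, int b) {
    ALWAYS_ASSERT(b != 0);
    return a / b;
}
```

With such a macro in place, defining `NDEBUG` for release builds disables only the remaining plain `assert()` calls (for example, inside third-party headers), not the checks that were written with the expectation of always running.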
Laszlo Ersek	0933a52c0b	test/sstable: remove useless variable from promoted_index_read() The large_partition_schema() call returns a copy of the "schema_ptr" object that points to an effectively statically initialized thread_local "schema" object. The large_partition_schema() call has no bearing on whether, or when, the "schema" object is constructed, and has no side effects (other than copying an "lw_shared_ptr" object). Furthermore, the return value of large_partition_schema() is not used for anything in promoted_index_read(). This redundant call seems to date back to original commit `3dd079fb7a` ("tests: add test for reading parts of a large partition", 2016-08-07). Remove the call and the variable. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	bb58446258	test/sstable: rewrite promoted_index_read() with async() For better readability, replace future::then() chaining with future::get(). Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	1f565626d4	test/sstable: unfuturize lambda invocation in test_using_reusable_sst() All lambdas passed to test_using_reusable_sst() and test_using_reusable_sst_returning() have been converted to future::get() calls (according to the seastar::thread context that they are now executed in). None of the lambdas return futures anymore; they all directly return void or non-void. Therefore, drop futurize_invoke(...).get() around the lambda invocations in test_using_reusable_sst(). Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	8ea881ae04	test/sstable: rewrite wrong_range() with async() For better readability, replace the future::then() chaining (and the associated manual fiddling with object lifecycles) with future::get() (and rely on seastar::thread's stack). We're already in seastar::thread context. Similarly, replace the future::finally() underlying with_closeable() with deferred_close(); with the assumption that mutation_reader::close() never fails (and is therefore safe to call in the "deferred_close" destructor). This is actually guaranteed, as mutation_reader::close() is marked "noexcept". Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
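The entry above replaces `future::finally()` with cleanup that runs from a destructor, which is why the cleanup must not throw. The mechanism can be shown without Seastar using a small scope guard. `deferred_action` and `read_with_cleanup()` are hypothetical illustrations, not Seastar's `deferred_close` or `defer()`.

```cpp
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical minimal scope guard: runs the stored action on destruction.
template <typename Func>
class deferred_action {
    Func _func;
public:
    explicit deferred_action(Func func) : _func(std::move(func)) {}
    deferred_action(const deferred_action&) = delete;
    deferred_action& operator=(const deferred_action&) = delete;
    // Runs in a destructor, possibly during stack unwinding, so the
    // action is expected not to throw.
    ~deferred_action() { _func(); }
};

// Records the order of events with and without a failure in the body.
std::vector<std::string> read_with_cleanup(bool fail) {
    std::vector<std::string> log;
    try {
        deferred_action close_reader([&log]() noexcept { log.push_back("close"); });
        log.push_back("read");
        if (fail) {
            throw std::runtime_error("read failed");
        }
        log.push_back("done");
    } catch (const std::runtime_error&) {
        log.push_back("error");
    }
    return log;
}
```

In both paths the cleanup runs when the guard goes out of scope. On failure it runs during unwinding, before the handler is entered.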
Laszlo Ersek	e7e9a0a696	test/sstable: simplify not_find_key_composite_bucket0() under test_using_reusable_sst() According to the earlier patch "test/sstable: rewrite test_using_reusable_sst() with async" in this series, lambdas passed to test_using_reusable_sst() are invoked: (a) less importantly here, in seastar::thread context, (b) more importantly here, futurized (temporarily so). The test case not_find_key_composite_bucket0() doesn't chain futures; therefore it needs no conversion to future::get() for purpose (a); however, we can eliminate its empty future return. Fact (b) will cover for that, until all such lambdas are converted to direct "void" returns (at which point we can remove the futurization from test_using_reusable_sst()). Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	95cf16708d	test/sstable: rewrite full_index_search() with async() For better readability, replace future::then() chaining with future::get(). (We're already in seastar::thread context.) This patch is best viewed with "git show -b". Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	2a27d5b344	test/sstable: simplify find_key*(), all_in_place() under test_using_reusable_sst() According to the earlier patch "test/sstable: rewrite test_using_reusable_sst() with async" in this series, lambdas passed to test_using_reusable_sst() are invoked: (a) less importantly here, in seastar::thread context, (b) more importantly here, futurized (temporarily so). The test cases find_key_map(), find_key_set(), find_key_list(), find_key_composite(), all_in_place() don't chain futures; therefore they need no conversion to future::get() for purpose (a); however, we can eliminate their empty future returns. Fact (b) will cover for that, until all such lambdas are converted to direct "void" returns (at which point we can remove the futurization from test_using_reusable_sst()). Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	d22bd93abb	test/sstable: rewrite (un)compressed_random_access_read() with async() For better readability, replace future::then() chaining with future::get(). (We're already in seastar::thread context.) Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	6e35e584c8	test/sstable: simplify write_and_validate_sst() All three lambdas passed to write_and_validate_sst() now use future::get() rather than future::then() chaining; in other words, the future::get() calls inside all these seastar::thread contexts have been pushed down to the lambdas. Change all these lambdas' return types from future<> to void. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	8819b3f134	test/sstable: simplify check_toc_func() under async() The lambda passed to write_and_validate_sst() already runs in seastar::thread context; replace future::then() chaining with future::get() calls. We're going to eliminate the trailing "return make_ready_future<>()" later. This patch is best viewed with "git show -W -b". Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	de56883a17	test/sstable: simplify check_statistics_func() under async() The lambda passed to write_and_validate_sst() already runs in seastar::thread context; replace future::then() chaining with future::get() calls. We're going to eliminate the trailing "return make_ready_future<>()" later. This patch is best viewed with "git show -W -b". Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	1a85412f96	test/sstable: simplify check_summary_func() under async() The lambda passed to write_and_validate_sst() already runs in seastar::thread context; replace future::then() chaining with future::get() calls. We're going to eliminate the trailing "return make_ready_future<>()" later. This patch is best viewed with "git show -W -b". Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	7b21bce1ca	test/sstable: coroutinize check_component_integrity() check_component_integrity() does not rely on any deferred close or stop operations; turn it into a coroutine therefore, for best readability. This conversion demonstrates particularly well how much the stack eases coding. We no longer need to artificially extend the lifetime of "tmp" with a final .then([tmp] {}) future. Consequently, "tmp" no longer needs to be a shared pointer to an on-heap "tmpdir" object; "tmp" can just be a "tmpdir" object on the stack. While at it, eliminate the single-use local objects "s" and "gen", for movability's sake. (We could use std::move() on these variables, but it seems easier to just flatten the function calls that produce the corresponding rvalues into the write_sst_info() argument list.) Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	caca13fe28	test/sstable: rewrite write_sst_info() with async() For better readability, replace future::then() chaining with future::get(). Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	cfe92ee203	test/sstable: simplify missing_summary_first_last_sane() The lambda passed to test_using_reusable_sst() is now invoked -- futurized, transitorily -- in seastar::thread context; stop returning an explicit make_ready_future<>() from the lambda. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	10ebc0a2d2	test/sstable: coroutinize summary_query_fail() summary_query_fail() does not rely on any deferred close or stop operations; turn it into a coroutine therefore, for best readability. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	a403ad0703	test/sstable: rewrite summary_query() with async() For better readability, replace future::then() chaining with future::get(). (We're already in seastar::thread context.) Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	3a57a7cfea	test/sstable: coroutinize (simple/composite)_index_read() simple_index_read() and composite_index_read() do not rely on any deferred close or stop operations; turn them into coroutines therefore, for best readability. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	eeeab1110a	test/sstable: rewrite index_read() with async() For better readability, replace future::then() chaining with future::get(). (We're already in seastar::thread context.) Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	17d4fac669	test/sstable: rewrite test_using_reusable_sst() with async() Improve the readability of test_using_reusable_sst() by replacing future::then() chaining with test_env::do_with_async() and future::get(). Unlike seastar::async(), test_env::do_with_async() restricts its input lambda to returning "void". Because of this, introduce the variant test_using_reusable_sst_returning(), based on test_env::do_with_async_returning(), for lambdas returning non-void. Put the latter to use in index_read() at once. Subsequently, we'll gradually convert the lambdas passed to test_using_reusable_sst() and test_using_reusable_sst_returning() from returning futures to returning direct values. In order for test_using_reusable_sst() and test_using_reusable_sst_returning() to cope with both types of lambdas, wrap the lambdas into futurize_invoke().get(). In the seastar::thread context, future::get() will gracefully block on genuine futures, and return immediately on direct values that were futurized on the spot. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Laszlo Ersek	79a8a6c638	test/sstable: rewrite test_using_working_sst() with async() Make test_using_working_sst() easier to read by: (1) replacing test_env::do_with() with seastar::async(), seastar::defer(), and future::get(); (2) replacing seastar::async() and seastar::defer() with test_env::do_with_async(). Technically speaking, this change does not perfectly preserve exceptional behavior. Namely, test_env::do_with() uses future::finally() to link test_env::stop() to the chain of futures, and future::finally() permits test_env::stop() itself to throw an exception -- potentially leading to a seastar::nested_exception being thrown, which would carry both the original exception and the one thrown by test_env::stop(). Contrarily, the test_env::stop() deferred with seastar::defer() runs in a destructor, and therefore test_env::stop() had better not throw there. However, we will assume that test_env::stop() does not throw, albeit not marked "noexcept". Prior commits `8d704f2532` ("sstable_test_env: Coroutinize and move to .cc test_env::stop()", 2023-10-31) and `2c78b46c78` ("sstables::test_env: Carry compaction manager on board", 2023-10-31) show that we've considered individual actions in test_env::stop() not to throw before. The 128KB stack of seastar::thread (which underlies seastar::async()) should be a tolerable cost in a test case, in exchange for the improved readability. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-04 15:35:51 +02:00
Aleksandra Martyniuk	c456a43173	compaction: replace optional<task_info> with task_info param compaction_manager::perform_compaction does not create task manager task for compaction if parent_info is set to std::nullopt. Currently, we always want to create task manager task for compaction. Remove optional from task info parameters which start compaction. Track all compactions with task manager.	2024-08-02 14:38:46 +02:00
Laszlo Ersek	e67eb0ccc1	test/sstable: coroutinize do_write_sst() Make do_write_sst() easier to read by coroutinizing it. Closes #19803. Suggested-by: Benny Halevy <bhalevy@scylladb.com> Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com> Closes scylladb/scylladb#19937	2024-07-31 13:59:26 +03:00
Emil Maskovsky	2dbe9ef2f2	raft: use the abort source reference in raft group0 client interface Most callers of the raft group0 client interface are passing a real source instance, so we can use the abort source reference in the client interface. This change makes the code simpler and more consistent.	2024-07-31 09:18:54 +02:00
Benny Halevy	26abad23d9	sstable_directory: delete_atomically: allow sstables from multiple prefixes Currently, delete_atomically can be called with a list of sstables from mixed prefixes in two cases: 1. truncate: where we delete all the sstables in the table directory 2. tablet cleanup: similar to truncate but restricted to sstables in a single tablet replica In both cases, it is possible that sstables in staging (or quarantine) are mixed with sstables in the base directory. Until a more comprehensive fix is in place (see https://github.com/scylladb/scylladb/pull/19555), this change just lifts the ban on atomic deletion of sstables from different prefixes, and acknowledges that the implementation is not atomic across prefixes. This is better than crashing for now, and can be backported more easily to branches that support tablets so tablet migration can be done safely in the presence of repair of tables with views. Refs scylladb/scylladb#18862 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#19816	2024-07-28 17:26:31 +03:00
Lakshmi Narayanan Sreethar	27b305b9d1	boost/bloom_filter_test: wait for total memory reclaimed update The testcase `test_bloom_filter_reclaim_during_reload` checks the SSTable manager's `_total_memory_reclaimed` against an expected value to verify that a Bloom filter was reloaded. However, it does not wait for the manager to update the variable, causing the check to fail if the update has not occurred yet. Fix it by making the testcase wait until the variable is updated to the expected value. Fixes #19879 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com> Closes scylladb/scylladb#19883	2024-07-26 08:15:11 +03:00
Botond Dénes	6337372b9d	test/boost/reader_concurrency_semaphore_test: un-flake test admission The admission test has a section which tests admission when the semaphore has inactive reads. This section (and therefore the entire test) became flaky lately, after a seemingly unrelated seastar upgrade, which improved timers. The cause of the flakiness is the permit which is made inactive later: this permit is created with 0 timeout (times out immediately). For some time now, when the timeout timer of a permit fires, if the permit is inactive, it is evicted. This is what makes the test fail: the inactive read times out and ends up evicting this permit, which is not expected for the test. The reason this was not a problem before is that the test usually finishes very quickly, before the timer could even be polled by the reactor. The recent seastar changes changed this and now the timer sometimes gets polled and fires, failing the test. Fixes: #19801 Closes scylladb/scylladb#19859	2024-07-24 13:04:50 +03:00
Kefu Chai	061def001d	s3/client: add client::upload_file() this member function prepares for the backup feature, where the object to be stored in the object storage is already persisted as a file on local filesystem. this brings us two benefits: - with the file, we don't need to accumulate the payloads in memory and send them in batch, as we do in upload_sink and in upload_jumbo_sink. this puts less pressure on the memory subsystem. - with the file, we can read multiple parts in parallel if multipart upload applies to it, this helps to improve the throughput. so, this new helper is introduced to help upload an sstable from local filesystem to the object storage. Fixes #16287 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-07-23 14:39:30 +08:00
Botond Dénes	d3135db457	Merge 'commitlog: Add optional max lifetime parameter to cl instance' from Calle Wilund If set, any remaining segment that has data older than this threshold will request flushing, regardless of data pressure. I.e. even a system where nothing happens will after X seconds flush data to free up the commit log. Related to #15820 The functionality here is to prevent pathological/test cases where a silent system cannot fully process stuff like compaction, GC etc due to things like CL forcing smaller GC windows etc. Closes scylladb/scylladb#15971 * github.com:scylladb/scylladb: commitlog: Make max data lifetime runtime-configurable db::config: Expose commitlog_max_data_lifetime_in_s parameter commitlog: Add optional max lifetime parameter to cl instance	2024-07-22 17:21:33 +03:00

3327 Commits