scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 12:36:56 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	f2ed9fcd7e	schema_mutations, migration_manager: Ignore empty partitions in per-table digest Schema digest is calculated by querying for mutations of all schema tables, then compacting them so that all tombstones in them are dropped. However, even if the mutation becomes empty after compaction, we still feed its partition key. If the same mutations were compacted prior to the query, because the tombstones expire, we won't get any mutation at all and won't feed the partition key. So schema digest will change once an empty partition of some schema table is compacted away. Tombstones expire 7 days after schema change which introduces them. If one of the nodes is restarted after that, it will compute a different table schema digest on boot. This may cause performance problems. When sending a request from coordinator to replica, the replica needs schema_ptr of exact schema version request by the coordinator. If it doesn't know that version, it will request it from the coordinator and perform a full schema merge. This adds latency to every such request. Schema versions which are not referenced are currently kept in cache for only 1 second, so if request flow has low-enough rate, this situation results in perpetual schema pulls. After `ae8d2a550d`, it is more liekly to run into this situation, because table creation generates tombstones for all schema tables relevant to the table, even the ones which will be otherwise empty for the new table (e.g. computed_columns). This change inroduces a cluster feature which when enabled will change digest calculation to be insensitive to expiry by ignoring empty partitions in digest calculation. When the feature is enabled, schema_ptrs are reloaded so that the window of discrepancy during transition is short and no rolling restart is required. A similar problem was fixed for per-node digest calculation in 18f484cc753d17d1e3658bcb5c73ed8f319d32e8. Per-table digest calculation was not fixed at that time because we didn't persist enabled features and they were not enabled early-enough on boot for us to depend on them in digest calculation. Now they are enabled before non-system tables are loaded so digest calculation can rely on cluster features. Fixes #4485.	2023-07-03 23:06:55 +02:00
Tomasz Grabiec	0c86abab4d	migration_manager, schema_tables: Implement migration_manager::reload_schema() Will recreate schema_ptr's from schema tables like during table alter. Will be needed when digest calculation changes in reaction to cluster feature at run time.	2023-07-03 20:32:59 +02:00
Tomasz Grabiec	9bfe9f0b2f	schema_tables: Avoid crashing when table selector has only one kind of tables Currently not reachable, because selectors are always constructed with both kinds initailized. Will be triggered by the next patch.	2023-07-03 20:32:59 +02:00
Alejo Sanchez	520bd90008	test/boost/memtable_test: split test plain/reverse Split long running test test_memtable_with_many_versions_conforms_to_mutation_source to 2 tests for _plain and _reverse. Refs #13905 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #14447	2023-07-03 15:20:12 +03:00
Kefu Chai	04434c02b3	sstables: print generation without {:d} the formatter for sstables::generation_type does not support "d" specifier, so we should not use "{:d}" for printing it. this works before `d7c90b5239`, but after that change, generation_type is not an alias of int64_t anymore. and its formatter does not support "d", so we should either specialize fmt::formatter<generation_type> to support it or just drop the specifier. since seastar::format() is using ```c++ fmt::format_to(fmt::appender(out), fmt::runtime(fmt), std::forward<A>(a)...); ``` to print the arguments with given fmt string, we cannot identify these kind of error at compile time. at runtime, if we have issues like this, {fmt} would throw exception like: ``` terminate called after throwing an instance of 'fmt::v9::format_error' what(): invalid format specifier ``` when constructing the `std::runtime_error` instance. so, in this change, "d" is removed. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14427	2023-07-03 13:53:13 +03:00
Pavel Emelyanov	f18bd23ec5	Merge 'repair: coroutinize some functions' from Kefu Chai also, take this opportunity to let `handle_mutation_fragment()` return void. for better readability. Closes #14258 * github.com:scylladb/scylladb: repair: do not check retval of handle_mutation_fragment() repair: coroutinize move_row_buf_to_working_row_buf() repair: coroutinize read_rows_from_disk() repair: coroutinize get_sync_boundary()	2023-07-03 13:45:42 +03:00
Kefu Chai	1ab2bb69b8	keys: do not use zip_iterator for printing key components boost's the operator==() implementation of boost's zip_iterator returns true only if all elements in enclosed tuple of zip_iterator are equal. and the zip_iterator always advances all the iterators in the enclosed tuple. but in our case, some components might be missing. in other words, the size of the `components` might be smaller than that of the `types` range. so, when the zip_iterator advances past the end of the components, scylla starts reading out of bounds. because zip_iterator does not allow us to customize how it implements the equal operator. and we cannot deduce the size of components without reading all of them. so in this change, we partially revert `3738fcbe05`, instead of using fmt::join(), just iterate through the components manually. this should avoid the out-of-bound reading, and also preserve the original behavior. Branches: 5.3 Fixes #14435 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14457	2023-07-01 23:49:02 +03:00
Avi Kivity	d88dfa0ad2	tools: scylla-sstable: fix stack overflow due to multiple db::config placed on the stack db::config is pretty large (~32k) and there are four of them, blowing the stack. Fix by allocating them on the heap. It's not clear why this shows up on my system (clang 16) and not in the frozen toolchain. Perhaps clang 16 is less able to reuse stack space. Closes #14464	2023-07-01 09:21:05 +03:00
Piotr Dulikowski	ee9bfb583c	combined: mergers: remove recursion in operator()() In mutation_reader_merger and clustering_order_reader_merger, the operator()() is responsible for producing mutation fragments that will be merged and pushed to the combined reader's buffer. Sometimes, it might have to advance existing readers, open new and / or close some existing ones, which requires calling a helper method and then calling operator()() recursively. In some unlucky circumstances, a stack overflow can occur: - Readers have to be opened incrementally, - Most or all readers must not produce any fragments and need to report end of stream without preemption, - There has to be enough readers opened within the lifetime of the combined reader (~500), - All of the above needs to happen within a single task quota. In order to prevent such a situation, the code of both reader merger classes were modified not to perform recursion at all. Most of the code of the operator()() was moved to maybe_produce_batch which does not recur if it is not possible for it to produce a fragment, instead it returns std::nullopt and operator()() calls this method in a loop via seastar::repeat_until_value. A regression test is added. Fixes: scylladb/scylladb#14415 Closes #14452	2023-06-30 12:07:13 +03:00
Botond Dénes	5648bfb9a0	Merge 'Find task manager task progress' from Aleksandra Martyniuk Modify task_manager::task::impl::get_progress method so that, whenever relevant, progress is calculated based on children's progress. Otherwise progress indicates only whether the task is finished or not. The method may be overriden in inheriting classes. Closes #14381 * github.com:scylladb/scylladb: tasks: delete task_manager::task::impl::_progress as it's unused tasks: modify task_manager::task::impl::get_progress method tasks: add is_complete method	2023-06-30 09:47:07 +03:00
Botond Dénes	8a7261fd70	Merge 'doc: fix rollback in the 4.3-to-2021.1, 5.0-to-2022.1, and 5.1-to-2022.2 upgrade guides' from Anna Stuchlik This PR fixes the Restore System Tables section of the upgrade guides by adding a command to clean upgraded SStables during rollback or adding the entire section to restore system tables (which was missing from the older documents). This PR fixes is a bug and must be backported to branch-5.3, branch-5.2., and branch-5.1. Refs: https://github.com/scylladb/scylla-enterprise/issues/3046 - [x] 5.1-to-2022.2 - update command (backport to branch-5.3, branch-5.2, and branch-5.1) - [x] 5.0-to-2022.1 - add "Restore system tables" to rollback (backport to branch-5.3, branch-5.2, and branch-5.1) - [x] 4.3-to-2021.1 - add "Restore system tables" to rollback (backport to branch-5.3, branch-5.2, and branch-5.1) (see https://github.com/scylladb/scylla-enterprise/issues/3046#issuecomment-1604232864) Closes #14444 * github.com:scylladb/scylladb: doc: fix rollback in 4.3-to-2021.1 upgrade guide doc: fix rollback in 5.0-to-2022.1 upgrade guide doc: fix rollback in 5.1-to-2022.2 upgrade guide	2023-06-30 09:38:45 +03:00
Botond Dénes	d20ed2d4db	Merge 'doc: improve the Unified Installer page' from Anna Stuchlik Fixes https://github.com/scylladb/scylladb/issues/14033 This PR: - replaces the OUTDATED list of platforms supported by Unified Installer with a link to the "OS Support" page. In this way, the list of supported OSes will be documented in one place, preventing outdated documentation. - improves the language and syntax, including: - Improving the wording. - Replacing "Scylla" with "ScyllaDB" - Fixing language mistakes - Fixing heading underline so that the headings render correctly. Closes #14445 * github.com:scylladb/scylladb: doc: update the language - Unified Installer page doc: update Unified Installer support	2023-06-30 09:38:18 +03:00
Kamil Braun	ff386e7a44	service: raft: force initial snapshot transfer in new cluster When we upgrade a cluster to use Raft, or perform manual Raft recovery procedure (which also creates a fresh group 0 cluster, using the same algorithm as during upgrade), we start with a non-empty group 0 state machine; in particular, the schema tables are non-empty. In this case we need to ensure that nodes which join group 0 receive the group 0 state. Right now this is not the case. In previous releases, where group 0 consisted only of schema, and schema pulls were also done outside Raft, those nodes received schema through this outside mechanism. In `91f609d065` we disabled schema pulls outside Raft; we're also extending group 0 with other things, like topology-specific state. To solve this, we force snapshot transfers by setting the initial snapshot index on the first group 0 server to `1` instead of `0`. During replication, Raft will see that the joining servers are behind, triggering snapshot transfer and forcing them to pull group 0 state. It's unnecessary to do this for cluster which bootstraps with Raft enabled right away but it also doesn't hurt, so we keep the logic simple and don't introduce branches based on that. Extend Raft upgrade tests with a node bootstrap step at the end to prevent regressions (without this patch, the step would hang - node would never join, waiting for schema). Fixes: #14066 Closes #14336	2023-06-29 22:46:42 +02:00
Konstantin Osipov	3d81408a58	test.py: make `experimental: raft` the default for all tests Make sure all tests use the new centralized topology coordinator. This is a step forward towards maturing the coordinator implementation. Closes #14039	2023-06-29 14:44:00 +02:00
Tomasz Grabiec	a9282103ba	Merge 'Call storage_service notifications only after keyspace schema changes are applied on all shards' from Benny Halevy This series aims at hardening schema merges and preventing inconsistencies across shards by updating the database shards before calling the notification callback. As seen in #13137, we don't want to call the notifications on all shards in parallel while the database shards are in flux. In addition, any error to update the keyspace will cause abort so not to leave the database shards in an inconsistent state . Other changes optimize this path by: - updating shard 0 first, to seed the effective_replication_map. - executing `storage_service::keyspace_changed` only once, on shard 0 to prevent quadratic update of the token_metadata and e_r_m on every keyspace change. Fixes #13137 Closes #14158 * github.com:scylladb/scylladb: migration_manager: propagate listener notification exceptions storage_service: keyspace_changed: execute only on shard 0 database: modify_keyspace_on_all_shards: execute func first on shard 0 database: modify_keyspace_on_all_shards: call notifiers only after applying func on all shards database: add modify_keyspace_on_all_shards schema_tables: merge_keyspaces: extract_scylla_specific_keyspace_info for update_keyspace database: create_keyspace_on_all_shards database: update_keyspace_on_all_shards database: drop_keyspace_on_all_shards	2023-06-29 12:17:53 +02:00
Anna Stuchlik	d3aba00131	doc: update the language - Unified Installer page This commit improves the language and syntax on the Unified Installer page. The changes cover: - Improving the wording. - Replacing "Scylla" with "ScyllaDB" - Fixing language mistakes - Fixing heading underline so that the headings render correctly.	2023-06-29 12:11:22 +02:00
Anna Stuchlik	944ce5c5c2	doc: update Unified Installer support This commit replaces the OUTDATED list of platforms supported by Unified Installer with a link to the "OS Support" page. In this way, the list of supported OSes will be documented in one place, preventing outdated documentation.	2023-06-29 11:51:21 +02:00
Aleksandra Martyniuk	f63825151e	tasks: delete task_manager::task::impl::_progress as it's unused	2023-06-29 11:30:27 +02:00
Aleksandra Martyniuk	d624be4e6b	tasks: modify task_manager::task::impl::get_progress method Modify task_manager::task::impl::get_progress method so that, whenever relevant, progress is calculated based on children's progress. Otherwise progress indicates only whether the task is finished or not.	2023-06-29 11:30:26 +02:00
Botond Dénes	2a58b4a39a	Merge 'Compaction resharding tasks' from Aleksandra Martyniuk Task manager's tasks covering resharding compaction on table and shard level. Closes #14044 * github.com:scylladb/scylladb: test: extend test_compaction_task.py to test resharding compaction compaction: add shard_reshard_sstables_compaction_task_impl compaction: invoke resharding on sharded database compaction: move run_resharding_jobs into reshard_sstables_compaction_task_impl::run() replica: delete unused functions and struct compaction: add reshard_sstables_compaction_task_impl compaction: replica: copy struct and functions from distributed_loader.cc compaction: create resharding_compaction_task_impl	2023-06-29 12:10:54 +03:00
Aleksandra Martyniuk	0278b21e76	tasks: add is_complete method Add is_complete method to task_manager::task::impl and task_manager::task.	2023-06-29 11:02:14 +02:00
Anna Stuchlik	32cfde2f8b	doc: fix rollback in 4.3-to-2021.1 upgrade guide This commit fixes the Restore System Tables section in the 5.4.3-to-2021.1 upgrade guide by adding the command to restore system tables.	2023-06-29 10:36:47 +02:00
Anna Stuchlik	130ddc3d2b	doc: fix rollback in 5.0-to-2022.1 upgrade guide This commit fixes the Restore System Tables section in the 5.0-to-2022.1 upgrade guide by adding the command to restore system tables.	2023-06-29 10:29:23 +02:00
Anna Stuchlik	8b3153f9ef	doc: fix rollback in 5.1-to-2022.2 upgrade guide This commit fixes the Restore System Tables section in the 5.1-to-2022.2 upgrade guide by adding a command to clean upgraded SStables during rollback.	2023-06-29 10:17:06 +02:00
Nadav Har'El	dd63169077	Merge 'test/boost/index_with_paging_test: reduce running time' from Alecco Reduce test string value size, parallelize inserts, and use a prepared statement, The debug running time for this tests is reduced from 13:18 to 7:52. Refs #13905 Closes #14380 * github.com:scylladb/scylladb: test/boost/index_with_paging_test: parallel insert test/boost/index_with_paging_test: prepared statement test/boost/index_with_paging_test: reduce running time	2023-06-29 10:45:01 +03:00
Kefu Chai	2894bc4954	repair: do not check retval of handle_mutation_fragment() handle_mutation_fragment() does not return `stop_iteration::yes` anymore after `fbbc86e18c`, so let's stop checking its return value. and make it return void. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-06-29 10:35:50 +08:00
Kefu Chai	e16b5ceb48	repair: coroutinize move_row_buf_to_working_row_buf() for better readability Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-06-29 10:19:02 +08:00
Kefu Chai	a973b43b9f	repair: coroutinize read_rows_from_disk() for better readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-06-29 10:17:48 +08:00
Kefu Chai	0cacc1fd4e	repair: coroutinize get_sync_boundary() for better readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-06-29 10:17:48 +08:00
Tomasz Grabiec	50e8ec77c6	Merge 'Wait for other nodes to be UP and NORMAL on bootstrap right after enabling gossiping' from Kamil Braun `handle_state_normal` may drop connections to the handled node. This causes spurious failures if there's an ongoing concurrent operation. This problem was already solved twice in the past in different contexts: first in `53636167ca`, then in `79ee38181c`. Time to fix it for the third time. Now we do this right after enabling gossiping, so hopefully it's the last time. This time it's causing snapshot transfer failures in group 0. Although the transfer is retried and eventually succeeds, the failed transfer is wasted work and causes an annoying ERROR message in the log which dtests, SCT, and I don't like. The fix is done by moving the `wait_for_normal_state_handled_on_boot()` call before `setup_group0()`. But for the wait to work correctly we must first ensure that gossiper sees an alive node, so we precede it with `wait_for_live_node_to_show_up()` (before this commit, the call site of `wait_for_normal_state_handled_on_boot` was already after this wait). There is another problem: the bootstrap procedure is racing with gossiper marking nodes as UP, and waiting for other nodes to be NORMAL doesn't guarantee that they are also UP. If gossiper is quick enough, everything will be fine. If not, problems may arise such as streaming or repair failing due to nodes still being marked as DOWN, or the CDC generation write failing. In general, we need all NORMAL nodes to be up for bootstrap to proceed. One exception is replace where we ignore the replaced node. The `sync_nodes` set constructed for `wait_for_normal_state_handled_on_boot` takes this into account, so we also use it to wait for nodes to be UP. As explained in commit messages and comments, we only do these waits outside raft-based-topology mode. This should improve CI stability. Fixes: #12972 Refs: #14042 Closes #14354 * github.com:scylladb/scylladb: messaging_service: print which connections are dropped due to missing topology info storage_service: wait for nodes to be UP on bootstrap storage_service: wait for NORMAL state handler before `setup_group0()` storage_service: extract `gossiper::wait_for_live_nodes_to_show_up()`	2023-06-28 20:40:03 +02:00
Avi Kivity	f6f974cdeb	cql3: selection: fix GROUP BY, empty groups, and aggregations A GROUP BY combined with aggregation should produce a single row per group, except for empty groups. This is in contrast to an aggregation without GROUP BY, which produces a single row no matter what. The existing code only considered the case of no grouping and forced a row into the result, but this caused an unwanted row if grouping was used. Fix by refining the check to also consider GROUP BY. XFAIL tests are relaxed. Fixes #12477. Note, forward_service requires that aggregation produce exactly one row, but since it can't work with grouping, it isn't affected. Closes #14399	2023-06-28 18:56:22 +03:00
Kamil Braun	b912eeade5	Merge 'merge raft commands to group0 before applying them whenever possible' from Gleb Since most group0 commands are just mutations it is easy to combine them before passing them to a subsystem they destined to since it is more efficient. The logic that handles those mutations in a subsystem will run once for each batch of commands instead of for each individual command. This is especially useful when a node catches up to a leader and gets a lot of commands together. The patch here does exactly that. It combines commands into a single command if possible, but it preserves an order between commands, so each time it encounters a command to a different subsystem it flushes already combined batch and starts a new one. This extra safety assumes that there are dependencies between subsystems managed by group0, so the order matters. It may be not the case now, but we prefer to be on a safe side. Broadcast table commands are not mutations, so they are never combined. * 'raft-merge-cmds' of https://github.com/gleb-cloudius/scylla: test: add test for group0 raft command merging service: raft: respect max mutation size limit when persisting raft entries group0_state_machine: merge commands before applying them whenever possible	2023-06-28 17:21:07 +02:00
Kamil Braun	1fa9678c64	messaging_service: print which connections are dropped due to missing topology info This connection dropping caused us to spend a lot of time debugging. Those debugging sessions would be shorter if Scylla logs indicated that connections are being dropped and why. Connection drops for a given node are a one-time event - we only do it if we establish a connection to a node without topology info, which should only happen before we handle the node's NORMAL status for the first time. So it's a rare thing and we can log it on INFO level without worrying about log spam.	2023-06-28 16:20:29 +02:00
Kamil Braun	51cec2be86	storage_service: wait for nodes to be UP on bootstrap The bootstrap procedure is racing with gossiper marking nodes as UP. If gossiper is quick enough, everything will be fine. If not, problems may arise such as streaming or repair failing due to nodes still being marked as DOWN, or the CDC generation write failing. In general, we need all NORMAL nodes to be up for bootstrap to proceed. One exception is replace where we ignore the replaced node. The `sync_nodes` set constructed for `wait_for_normal_state_handled_on_boot` takes this into account, so we use it. Refs: #14042 This doesn't completely fix #14042 yet becasue it's specific to gossiper-based topology mode only. For Raft-based topology, the node joining procedure will be coordinated by the topology coordinator right from the start and it will be the coordinator who issues the 'wait for node to see other live nodes'.	2023-06-28 16:20:29 +02:00
Kamil Braun	5ec5c7704c	storage_service: wait for NORMAL state handler before `setup_group0()` `handle_state_normal` may drop connections to the handled node. This causes spurious failures if there's an ongoing concurrent operation. This problem was already solved twice in the past in different contexts: first in `53636167ca`, then in `79ee38181c`. Time to fix it for the third time. Now we do this right after enabling gossiping, so hopefully it's the last time. This time it's causing snapshot transfer failures in group 0. Although the transfer is retried and eventually succeeds, the failed transfer is wasted work and causes an annoying ERROR message in the log which dtests, SCT, and I don't like. The fix is done by moving the `wait_for_normal_state_handled_on_boot()` call before `setup_group0()`. But for the wait to work correctly we must first ensure that gossiper sees an alive node, so we precede it with `wait_for_live_node_to_show_up()` (before this commit, the call site of `wait_for_normal_state_handled_on_boot` was already after this wait). We do it only in non-raft-topology mode, because with Raft-based topology, node state changes are propagated to the cluster through explicit global barriers and we plan to remove node statuses from gossiper altogether. Fixes: #12972	2023-06-28 16:19:24 +02:00
Anna Stuchlik	f4ae2c095b	doc: fix rollback in 5.2-to-2023.1 upgrade guide This commit fixes the Restore System Tables section in the 5.2-to-2023.1 upgrade guide by adding a command to clean upgraded SStables during rollback. This is a bug (an incomplete command) and must be backported to branch-5.3 and branch-5.2. Refs: https://github.com/scylladb/scylla-enterprise/issues/3046 Closes #14373	2023-06-28 17:16:32 +03:00
Alejo Sanchez	d4697ed21e	test/boost/index_with_paging_test: parallel insert Parallelize inserts for long-running test_index_with_paging. Run time in debug mode reduced by 1 minute 48 seconds. Refs #13905 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2023-06-28 16:11:58 +02:00
Alejo Sanchez	70a3179888	test/boost/index_with_paging_test: prepared statement Prepare statement for insert. Run time in debug mode reduced by 9 seconds. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2023-06-28 14:49:21 +02:00
Michał Jadwiszczak	0a8fcead08	cql3: Specify arguments types in UDA creation errors Display not only function name but also expected arguments if `state_function` or `final_function` was not found. Fixes: #12088 Closes #14278	2023-06-28 15:27:49 +03:00
Alejo Sanchez	48d24269f1	test/boost/index_with_paging_test: reduce running time Reduce test string value size for test_index_with_paging from 4096 to 100. With 100 bytes it should make the base row significantly larger than the key so the test will exercise both types of paging in the scanning code. The debug running time for this tests is reduced from 9 minutes to 6 minutes. Refs #13905 Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2023-06-28 13:55:52 +02:00
Nadav Har'El	49c8c06b1b	Merge 'cql: fix crash on empty clustering range in LWT' from Jan Ciołek LWT queries with empty clustering range used to cause a crash. For example in: ```cql UPDATE tab SET r = 9000 WHERE p = 1 AND c = 2 AND c = 2000 IF r = 3 ``` The range of `c` is empty - there are no valid values. This caused a segfault when accessing the `first` range: ```c++ op.ranges.front() ``` Cassandra rejects such queries at the preparation stage. It doesn't allow two `EQ` restriction on the same clustering column when an IF is involved. We reject them during runtime, which is a worse solution. The user can prepare a query with `c = ? AND c = ?`, and then run it, but unexpectedly it will throw an `invalid_request_exception` when the two bound variables are different. We could ban such queries as well, we already ban the usage of `IN` in conditional statements. The problem is that this would be a breaking change. A better solution would be to allow empty ranges in `LWT` statements. When an empty range is detected we just wouldn't apply the change. This would be a larger change, for now let's just fix the crash. Fixes: https://github.com/scylladb/scylladb/issues/13129 Closes #14429 * github.com:scylladb/scylladb: modification_statement: reject conditional statements with empty clustering key statements/cas_request: fix crash on empty clustering range in LWT	2023-06-28 14:43:54 +03:00
Kamil Braun	64c302e777	storage_service: extract `gossiper::wait_for_live_nodes_to_show_up()` This piece of `storage_service::wait_for_ring_to_settle()` will be performed earlier in the boot procedure in follow-up commits. Make it more generic, to be able to wait for `n` nodes to show up. Here we wait for `2` nodes - ourselves and at least one other.	2023-06-28 12:36:06 +02:00
Aleksandra Martyniuk	bf3e0744c1	test: extend test_compaction_task.py to test resharding compaction	2023-06-28 11:43:12 +02:00
Aleksandra Martyniuk	87c8d63b7a	compaction: add shard_reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction on one shard.	2023-06-28 11:43:12 +02:00
Aleksandra Martyniuk	db6e4a356b	compaction: invoke resharding on sharded database In reshard_sstables_compaction_task_impl::run() we call sharded<sstables::sstable_directory>::invoke_on_all. In lambda passed to that method, we use both sharded sstable_directory service and its local instance. To make it straightforward that sharded and local instances are dependend, we call sharded<replica::database>::invoke_on_all instead and access local directory through the sharded one.	2023-06-28 11:43:12 +02:00
Aleksandra Martyniuk	1acaed026a	compaction: move run_resharding_jobs into reshard_sstables_compaction_task_impl::run()	2023-06-28 11:43:11 +02:00
Aleksandra Martyniuk	85cc85fc5a	replica: delete unused functions and struct	2023-06-28 11:41:43 +02:00
Aleksandra Martyniuk	837d77ba8c	compaction: add reshard_sstables_compaction_task_impl Add task manager's task covering resharding compaction.	2023-06-28 11:41:43 +02:00
Aleksandra Martyniuk	0d6dd3eeda	compaction: replica: copy struct and functions from distributed_loader.cc As a preparation for integrating resharding compaction with task manager a struct and some functions are copied from replica/distributed_loader.cc to compaction/task_manager_module.cc.	2023-06-28 11:41:42 +02:00
Aleksandra Martyniuk	2b4874bbf7	compaction: create resharding_compaction_task_impl resharding_compaction_task_impl serves as a base class of all concrete resharding compaction task classes.	2023-06-28 11:36:53 +02:00

1 2 3 4 5 ...

37647 Commits