scylladb

Author	SHA1	Message	Date
Sergey Zolotukhin	612a141660	raft: Fix race condition on override_snapshot_thresholds. When the server_impl::applier_fiber is paused by a co_await at line raft/server.cc:1375: ``` co_await override_snapshot_thresholds(); ``` a new snapshot may be applied, which updates the actual values of the log's last applied and snapshot indexes. As a result, the new snapshot index could become higher than the old value stored in _applied_idx at line raft/server.cc:1365, leading to an assertion failure in log::last_conf_for(). Since error injection is disabled in release builds, this issue does not affect production releases. This issue was introduced in the following commit `9dfa041fe1`, when error injection was added to override the log snapshot configuration parameters. How to reproduce: 1. Build debug version of randomized_nemesis_test ``` ninja-build build/debug/test/raft/randomized_nemesis_test ``` 2. Run ``` parallel --halt now,fail=1 -j20 'build/debug/test/raft/randomized_nemesis_test \ --run_test=test_frequent_snapshotting -- -c2 -m2G --overprovisioned --unsafe-bypass-fsync 1 \ --kernel-page-cache 1 --blocked-reactor-notify-ms 2000000 --default-log-level \ trace > tmp/logs/eraseme_{}.log 2>&1 && rm tmp/logs/eraseme_{}.log' ::: {1..1000} ``` Fixes scylladb/scylladb#20363 Closes scylladb/scylladb#20555	2024-09-12 16:19:27 +02:00
Kefu Chai	3e84d43f93	treewide: use seastar::format() or fmt::format() explicitly before this change, we rely on `using namespace seastar` to use `seastar::format()` without qualifying the `format()` with its namespace. this works fine until we changed the parameter type of format string `seastar::format()` from `const char*` to `fmt::format_string<...>`. this change practically invited `seastar::format()` to the club of `std::format()` and `fmt::format()`, where all members accept a templated parameter as its `fmt` parameter. and `seastar::format()` is not the best candidate anymore. despite that argument-dependent lookup (ADT for short) favors the function which is in the same namespace as its parameter, but `using namespace` makes `seastar::format()` more competitive, so both `std::format()` and `seastar::format()` are considered as the condidates. that is what is happening scylladb in quite a few caller sites of `format()`, hence ADT is not able to tell which function the winner in the name lookup: ``` /__w/scylladb/scylladb/mutation/mutation_fragment_stream_validator.cc:265:12: error: call to 'format' is ambiguous 265 \| return format("{} ({}.{} {})", _name_view, s.ks_name(), s.cf_name(), s.id()); \| ^~~~~~ /usr/bin/../lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/format:4290:5: note: candidate function [with _Args = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 4290 \| format(format_string<_Args...> __fmt, _Args&&... __args) \| ^ /__w/scylladb/scylladb/seastar/include/seastar/core/print.hh:143:1: note: candidate function [with A = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 143 \| format(fmt::format_string<A...> fmt, A&&... a) { \| ^ ``` in this change, we change all `format()` to either `fmt::format()` or `seastar::format()` with following rules: - if the caller expects an `sstring` or `std::string_view`, change to `seastar::format()` - if the caller expects an `std::string`, change to `fmt::format()`. because, `sstring::operator std::basic_string` would incur a deep copy. we will need another change to enable scylladb to compile with the latest seastar. namely, to pass the format string as a templated parameter down to helper functions which format their parameters. to miminize the scope of this change, let's include that change when bumping up the seastar submodule. as that change will depend on the seastar change. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-09-11 23:21:40 +03:00
Abhi	9b09439065	raft: Add descriptions for requested abort errors Fixes: scylladb/scylladb#18902 Closes scylladb/scylladb#20291	2024-09-10 17:56:29 +02:00
Sergey Zolotukhin	13b3d3a795	raft: Ensure const correctness in applier_fiber. Add 'const' to non mutable varibales in server_impl::applier_fiber() function.	2024-08-20 15:24:00 +02:00
Sergey Zolotukhin	c3e52ab942	raft: Invoke store_snapshot_descriptor with actually preserved items. - raft_sys_table_storage::store_snapshot_descriptor now receives a number of preserved items in the log, rather than _config.snapshot_trailing value; - Incorrect check for truncated number of items in store_snapshot_descriptor was removed. Fixes scylladb/scylladb#16817 Fixes scylladb/scylladb#20080	2024-08-20 15:22:49 +02:00
Sergey Zolotukhin	922e035629	raft: Use raft_server_set_snapshot_thresholds in tests. Replace raft_server_snapshot_reduce_threshold with raft_server_set_snapshot_thresholds in tests as raft_server_set_snapshot_thresholds fully covers the functionality of raft_server_snapshot_reduce_threshold.	2024-08-20 15:08:49 +02:00
Sergey Zolotukhin	00a1d3e305	raft: Fix indentation in server.cc	2024-08-20 15:08:45 +02:00
Sergey Zolotukhin	9dfa041fe1	raft: Add raft_server_set_snapshot_thresholds injection. Use error injection to allow overriding following snapshot threshold settings: - snapshot_threshold - snapshot_threshold_log_size - snapshot_trailing - snapshot_trailing_size	2024-08-20 14:15:50 +02:00
Laszlo Ersek	4f1f207be1	raft/server: clean up index_t usage With implicit conversion of tagged integers to untagged ones going away, explicitly tag (or untag, as necessary) the operands of the following operations, in "raft/server.cc": - addition of tagged and untagged (both should be tagged) - subscripting an array by tagged (should be untagged) - comparing a size-like threshold against tagged (should be untagged) - exposing tagged via gauges (should be untagged) Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-14 13:35:08 +02:00
Laszlo Ersek	1b134d52ac	raft/tracker: don't drop out of index_t space for subtraction Tagged integers support subtraction; use it. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-14 13:35:08 +02:00
Laszlo Ersek	b6233209d9	raft/fsm: clean up index_t and term_t usage With implicit conversion of tagged integers to untagged ones going away, explicitly tag (or untag, as necessary) the operands of the following operations, in "raft/fsm.cc": - addition of tagged and untagged (both should be tagged) - comparison (relop) between tagged an untagged (both should be tagged) - subscripting or sizing an array by tagged (should be untagged) Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-14 13:35:08 +02:00
Laszlo Ersek	5b9a4428c6	raft/log: clean up index_t usage With implicit conversion of tagged integers to untagged ones going away, explicitly tag (or untag, as necessary) the operands of the following operations, in raft/log.{cc,h}: - addition of tagged and untagged (both should be tagged) - comparison (relop) between tagged an untagged (both should be tagged) - subscripting an array, or offsetting an iterator, by tagged (should be untagged) - comparing an array bound against tagged (should be untagged) - subtracting tagged from an array bound (should be untagged) Note: these files mix uniform initialization syntax (index_t{...}) with constructor call syntax (index_t()), with the former being more frequent. Stick with the former here too, for consistency. Signed-off-by: Laszlo Ersek <laszlo.ersek@scylladb.com>	2024-08-14 13:35:08 +02:00
Avi Kivity	aa1270a00c	treewide: change assert() to SCYLLA_ASSERT() assert() is traditionally disabled in release builds, but not in scylladb. This hasn't caused problems so far, but the latest abseil release includes a commit [1] that causes a 1000 insn/op regression when NDEBUG is not defined. Clearly, we must move towards a build system where NDEBUG is defined in release builds. But we can't just define it blindly without vetting all the assert() calls, as some were written with the expectation that they are enabled in release mode. To solve the conundrum, change all assert() calls to a new SCYLLA_ASSERT() macro in utils/assert.hh. This macro is always defined and is not conditional on NDEBUG, so we can later (after vetting Seastar) enable NDEBUG in release mode. [1] `66ef711d68` Closes scylladb/scylladb#20006	2024-08-05 08:23:35 +03:00
Gleb Natapov	9ebdb23002	raft: add more raft metrics to make debug easier	2024-07-01 10:55:22 +02:00
Kamil Braun	a441d06d6c	raft: fsm: add details to on_internal_error_noexcept message If we receive a message in the same term but from a different leader than we expect, we print: ``` Got append request/install snapshot/read_quorum from an unexpected leader ``` For some reason the message did not include the details (who the leader was and who the sender was) which requires almost zero effort and might be useful for debugging. So let's include them. Ref: scylladb/scylla-enterprise#4276 Closes scylladb/scylladb#19238	2024-06-12 17:29:42 +03:00
Yaniv Michael Kaul	82875095e9	Raft: improve descriptions of metrics 1. Fixed a single typo (send -> sent) 2. Rephrase 'How many' to 'Number of' and use less passive tense. 3. Be more specific in the description of the different metrics insteda of the more generic descriptions. Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com> Closes scylladb/scylladb#19067	2024-06-06 15:18:47 +03:00
Kefu Chai	0b0e661a85	build: bring abseil submodule back because of https://bugzilla.redhat.com/show_bug.cgi?id=2278689, the rebuilt abseil package provided by fedora has different settings than the ones if the tree is built with the sanitizer enabled. this inconsistency leads to a crash. to address this problem, we have to reinstate the abseil submodule, so we can built it with the same compiler options with which we build the tree. in this change * Revert "build: drop abseil submodule, replace with distribution abseil" * update CMake building system with abseil header include settings * bump up the abseil submodule to the latest LTS branch of abseil: lts_2024_01_16 * update scylla-gdb.py to adapt to the new structure of flat_hash_map This reverts commit `8635d24424`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18511	2024-05-05 23:31:09 +03:00
Kefu Chai	168ade72f8	treewide: replace formatter<std::string_view> with formatter<string_view> in in {fmt} before v10, it provides the specialization of `fmt::formatter<..>` for `std::string_view` as well as the specialization of `fmt::formatter<..>` for `fmt::string_view` which is an implementation builtin in {fmt} for compatibility of pre-C++17. and this type is used even if the code is compiled with C++ stadandard greater or equal to C++17. also, before v10, the `fmt::formatter<std::string_view>::format()` is defined so it accepts `std::string_view`. after v10, `fmt::formatter<std::string_view>` still exists, but it is now defined using `format_as()` machinery, so it's `format()` method does not actually accept `std::string_view`, it accepts `fmt::string_view`, as the former can be converted to `fmt::string_view`. this is why we can inherit from `fmt::formatter<std::string_view>` and use `formatter<std::string_view>::format(foo, ctx);` to implement the `format()` method with {fmt} v9, but we cannot do this with {fmt} v10, and we would have following compilation failure: ``` FAILED: service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o /home/kefu/.local/bin/clang++ -DFMT_DEPRECATED_OSTREAM -DFMT_SHARED -DSCYLLA_BUILD_MODE=release -DSEASTAR_API_LEVEL=7 -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SSTRING -DXXH_PRIVATE_API -DCMAKE_INTDIR=\"RelWithDebInfo\" -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/build/gen -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/seastar/gen/include -I/home/kefu/dev/scylladb/build/seastar/gen/src -ffunction-sections -fdata-sections -O3 -g -gz -std=gnu++20 -fvisibility=hidden -Wall -Werror -Wextra -Wno-error=deprecated-declarations -Wimplicit-fallthrough -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags -Wno-missing-field-initializers -Wno-overloaded-virtual -Wno-unsupported-friend -Wno-enum-constexpr-conversion -Wno-unused-parameter -ffile-prefix-map=/home/kefu/dev/scylladb=. -march=westmere -mllvm -inline-threshold=2500 -fno-slp-vectorize -U_FORTIFY_SOURCE -Werror=unused-result -MD -MT service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o -MF service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o.d -o service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o -c /home/kefu/dev/scylladb/service/topology_state_machine.cc /home/kefu/dev/scylladb/service/topology_state_machine.cc:254:41: error: no matching member function for call to 'format' 254 \| return formatter<std::string_view>::format(it->second, ctx); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~ /usr/include/fmt/core.h:2759:22: note: candidate function template not viable: no known conversion from 'seastar::basic_sstring<char, unsigned int, 15>' to 'const fmt::basic_string_view<char>' for 1st argument 2759 \| FMT_CONSTEXPR auto format(const T& val, FormatContext& ctx) const \| ^ ~~~~~~~~~~~~ ``` because the inherited `format()` method actually comes from `fmt::formatter<fmt::string_view>`. to reduce the confusion, in this change, we just inherit from `fmt::format<string_view>`, where `string_view` is actually `fmt::string_view`. this follows the document at https://fmt.dev/latest/api.html#formatting-user-defined-types, and since there is less indirection under the hood -- we do not use the specialization created by `FMT_FORMAT_AS` which inherit from `formatter<fmt::string_view>`, hopefully this can improve the compilation speed a little bit. also, this change addresses the build failure with {fmt} v10. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18299	2024-04-19 07:44:07 +03:00
Kefu Chai	e97ae6b0de	raft: server: print pointee of `server_impl::_fsm` before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, instead of printing the `unique_ptr` instance, we print the pointee of it. since `server_impl` uses pimpl paradigm, `_fsm` is always valid after `server_impl::start()`, we can always deference it without checking for null. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17953	2024-03-25 11:20:34 +02:00
Kefu Chai	50637964ed	raft: add fmt::formatter for error classes before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatter for classes derived from `raft::error`. since {fmt} v10 defines the formatter for all classes derived from `std::exception`, the definition is provided only when the tree is compiled with {fmt} < 10. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-20 21:01:29 +08:00
Kefu Chai	079d70145e	raft: add fmt::formatter for raft tracker types before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * raft::election_tracker * raft::votes * raft::vote_result and drop their operator<<:s. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17670	2024-03-08 15:19:37 +02:00
Kefu Chai	57ede58a64	raft: add fmt::formatter for raft::fsm before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `raft::fsm`, and drop its operator<<. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17414	2024-02-20 09:02:02 +02:00
Kefu Chai	c555af3cd8	raft: add formatter for raft::log before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `raft::log`, and drop its operator<<. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17301	2024-02-13 17:17:57 +02:00
Botond Dénes	d202d32f81	Merge 'Add an API to trigger snapshot in Raft servers' from Kamil Braun This allows the user of `raft::server` to cause it to create a snapshot and truncate the Raft log (leaving no trailing entries; in the future we may extend the API to specify number of trailing entries left if needed). In a later commit we'll add a REST endpoint to Scylla to trigger group 0 snapshots. One use case for this API is to create group 0 snapshots in Scylla deployments which upgraded to Raft in version 5.2 and started with an empty Raft log with no snapshot at the beginning. This causes problems, e.g. when a new node bootstraps to the cluster, it will not receive a snapshot that would contain both schema and group 0 history, which would then lead to inconsistent schema state and trigger assertion failures as observed in scylladb/scylladb#16683. In 5.4 the logic of initial group 0 setup was changed to start the Raft log with a snapshot at index 1 (`ff386e7a44`) but a problem remains with these existing deployments coming from 5.2, we need a way to trigger a snapshot in them (other than performing 1000 arbitrary schema changes). Another potential use case in the future would be to trigger snapshots based on external memory pressure in tablet Raft groups (for strongly consistent tables). The PR adds the API to `raft::server` and a HTTP endpoint that uses it. In a follow-up PR, we plan to modify group 0 server startup logic to automatically call this API if it sees that no snapshot is present yet (to automatically fix the aforementioned 5.2 deployments once they upgrade.) Closes scylladb/scylladb#16816 * github.com:scylladb/scylladb: raft: remove `empty()` from `fsm_output` test: add test for manual triggering of Raft snapshots api: add HTTP endpoint to trigger Raft snapshots raft: server: add `trigger_snapshot` API raft: server: track last persisted snapshot descriptor index raft: server: framework for handling server requests raft: server: inline `poll_fsm_output` raft: server: fix indentation raft: server: move `io_fiber`'s processing of `batch` to a separate function raft: move `poll_output()` from `fsm` to `server` raft: move `_sm_events` from `fsm` to `server` raft: fsm: remove constructor used only in tests raft: fsm: move trace message from `poll_output` to `has_output` raft: fsm: extract `has_output()` raft: pass `max_trailing_entries` through `fsm_output` to `store_snapshot_descriptor` raft: server: pass `*_aborted` to `set_exception` call	2024-01-29 15:06:04 +02:00
Kefu Chai	abb12979f8	raft: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17011	2024-01-29 10:00:56 +02:00
Kamil Braun	1824c12975	raft: remove `empty()` from `fsm_output` Nobody remembered to keep this function up to date when adding stuff to `fsm_output`. Turns out that it's not being used by any Raft logic but only in some tests. That use case can now be replaced with `fsm::has_output()` which is also being used by `raft::server` code.	2024-01-23 16:48:28 +01:00
Kamil Braun	0eda7a2619	raft: server: add `trigger_snapshot` API This allows the user of `raft::server` to ask it to create a snapshot and truncate the Raft log. In a later commit we'll add a REST endpoint to Scylla to trigger group 0 snapshots. One use case for this API is to create group 0 snapshots in Scylla deployments which upgraded to Raft in version 5.2 and started with an empty Raft log with no snapshot at the beginning. This causes problems, e.g. when a new node bootstraps to the cluster, it will not receive a snapshot that would contain both schema and group 0 history, which would then lead to inconsistent schema state and trigger assertion failures as observed in scylladb/scylladb#16683. In 5.4 the logic of initial group 0 setup was changed to start the Raft log with a snapshot at index 1 (`ff386e7a44`) but a problem remains with these existing deployments coming from 5.2, we need a way to trigger a snapshot in them (other than performing 1000 arbitrary schema changes). Another potential use case in the future would be to trigger snapshots based on external memory pressure in tablet Raft groups (for strongly consistent tables).	2024-01-23 16:48:28 +01:00
Kamil Braun	3268be3860	raft: server: track last persisted snapshot descriptor index Also introduce a condition variable notified whenever this index is updated. Will be user in following commits.	2024-01-22 16:48:08 +01:00
Kamil Braun	1e786d9d64	raft: server: framework for handling server requests Add data structures and modify `io_fiber` code to prepare it for handling requests generated by the `server`, not just `fsm`. Used in later commits.	2024-01-22 16:47:34 +01:00
Kamil Braun	8d9b0a6538	raft: server: inline `poll_fsm_output`	2024-01-18 18:09:13 +01:00
Kamil Braun	754a7b54e4	raft: server: fix indentation	2024-01-18 18:09:11 +01:00
Kamil Braun	527780987b	raft: server: move `io_fiber`'s processing of `batch` to a separate function	2024-01-18 18:09:02 +01:00
Kamil Braun	3e6b4910a6	raft: move `poll_output()` from `fsm` to `server` `server` was the only user of this function and it can now be implemented using `fsm`'s public interface. In later commits we'll extend the logic of `io_fiber` to also subscribe to other events, triggered by `server` API calls, not only to outputs from `fsm`.	2024-01-18 18:07:52 +01:00
Kamil Braun	95b6a60428	raft: move `_sm_events` from `fsm` to `server` In later commits we will use it to wake up `io_fiber` directly from `raft::server` based on events generated by `raft::server` itself -- not only from events generated by `raft::fsm`. `raft::fsm` still obtains a reference to the condition variable so it can keep signaling it.	2024-01-18 18:07:44 +01:00
Kamil Braun	a83e04279e	raft: fsm: remove constructor used only in tests This constructor does not provide persisted commit index. It was only used in tests, so move it there, to the helper `fsm_debug` which inherits from `fsm`. Test cases which used `fsm` directly instead of `fsm_debug` were modified to use `fsm_debug` so they can access the constructor. `fsm_debug` doesn't change the behavior of `fsm`, only adds some helper members. This will be useful in following commits too.	2024-01-18 18:07:17 +01:00
Kamil Braun	689d59fccd	raft: fsm: move trace message from `poll_output` to `has_output` In a later commit we'll move `poll_output` out of `fsm` and it won't have access to internals logged by this message (`_log.stable_idx()`). Besides, having it in `has_output` gives a more detailed trace. In particular we can now see values such as `stable_idx` and `last_idx` from the moment of returning a new fsm output, not only when poll started waiting for it (a lot of time can pass between these two events).	2024-01-18 18:06:55 +01:00
Kamil Braun	f6d43779af	raft: fsm: extract `has_output()` Also use the more efficient coroutine-specific `condition_variable::when` instead of `wait`.	2024-01-18 18:06:27 +01:00
Kamil Braun	dccfd09d83	raft: pass `max_trailing_entries` through `fsm_output` to `store_snapshot_descriptor` This parameter says how many entries at most should be left trailing before the snapshot index. There are multiple places where this decision is made: - in `applier_fiber` when the server locally decides to take a snapshot due to log size pressure; this applies to the in-memory log - in `fsm::step` when the server received an `install_snapshot` message from the leader; this also applies to the in-memory log - and in `io_fiber` when calling `store_snapshot_descriptor`; this applies to the on-disk log. The logic of how many entries should be left trailing is calculated twice: - first, in `applier_fiber` or in `fsm::step` when truncating the in-memory log - and then again as the snapshot descriptor is being persisted. The logic is to take `_config.snapshot_trailing` for locally generated snapshots (coming from `applier_fiber`) and `0` for remote snapshots (from `fsm::step`). But there is already an error injection that changes the behavior of `applier_fiber` to leave `0` trailing entries. However, this doesn't affect the following `store_snapshot_descriptor` call which still uses `_config.snapshot_trailing`. So if the server got restarted, the entries which were truncated in-memory would get "revived" from disk. Fortunately, this is test-only code. However in future commits we'd like to change the logic of `applier_fiber` even further. So instead of having a separate calculation of trailing entries inside `io_fiber`, it's better for it to use the number that was already calculated once. This number is passed to `fsm::apply_snapshot` (by `applier_fiber` or `fsm::step`) and can then be received by `io_fiber` from `fsm_output` to use it inside `store_snapshot_descriptor`.	2024-01-18 18:05:45 +01:00
Kamil Braun	40cd91cff7	raft: server: pass `_aborted` to `set_exception` call This looks like a minor oversight, in `server_impl::abort` there are multiple calls to `set_exception` on the different promises, only one of them would not receive `_aborted`.	2024-01-18 18:05:18 +01:00
Patryk Jędrzejczak	df2034ebd7	server, raft_group0_client: remove the default nullptr values The previous commit has fixed 5 bugs of the same type - incorrectly passing the default nullptr to one of the changed functions. At least some of these bugs wouldn't appear if there was no default value. It's much harder to make this kind of a bug if you have to write "nullptr". It's also much easier to detect it in review. Moreover, these default values are rarely used outside tests. Keeping them is just not worth the time spent on debugging.	2024-01-05 18:45:50 +01:00
Botond Dénes	d2a88cd8de	Merge 'Typos: fix typos in code' from Yaniv Kaul Fixes some more typos as found by codespell run on the code. In this commit, there are more user-visible errors. Refs: https://github.com/scylladb/scylladb/issues/16255 Closes scylladb/scylladb#16289 * github.com:scylladb/scylladb: Update unified/build_unified.sh Update main.cc Update dist/common/scripts/scylla-housekeeping Typos: fix typos in code	2023-12-06 07:36:41 +02:00
Yaniv Kaul	ae2ab6000a	Typos: fix typos in code Fixes some more typos as found by codespell run on the code. In this commit, there are more user-visible errors. Refs: https://github.com/scylladb/scylladb/issues/16255	2023-12-05 15:18:11 +02:00
Kefu Chai	3a8a3100af	raft: add formatter for raft::logical_clock::time_point before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we * define a formatter for logical_clock::time_point, as fmt does not provide formatter for this time_point, as it is not a part of the standard library * remove operator<<() for logical_clock::time_point, as its soly purpose is to generate the corresponding fmt::formatter when FMT_DEPRECATED_OSTREAM is defined. * remove operator<<() for logical_clock::duration, as fmt provides a default implementation for formatting std::chrono::nanoseconds already, which uses `int64_t` as its rep template parameter as well. * include "fmt/chrono.h" so that the source files including this header can have access the formatter without including it by themselves, this preserve the existing behavior which we have before removal of "operator<<()". Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16263	2023-12-04 18:32:03 +02:00
Yaniv Kaul	7c4b742583	Update raft/tracker.hh	2023-12-03 10:07:55 +02:00
Yaniv Kaul	c658bdb150	Typos: fix typos in comments Fixes some typos as found by codespell run on the code. In this commit, I was hoping to fix only comments, not user-visible alerts, output, etc. Follow-up commits will take care of them. Refs: https://github.com/scylladb/scylladb/issues/16255 Signed-off-by: Yaniv Kaul <yaniv.kaul@scylladb.com>	2023-12-02 22:37:22 +02:00
Piotr Dulikowski	c58ff554d8	raft: rpc: introduce destination_not_alive_error Add a new destination_not_alive_error, thrown from two-way RPCs in case when the RPC is not issued because the destination is not reported as alive by the failure detector. In snapshot transfer code, lower the verbosity of the message printed in case it fails on the new error. This is done to prevent flakiness in the CI - in case of slow runs, nodes might get spuriously marked as dead if they are busy, and a message with the "error" verbosity can cause some tests to fail.	2023-11-23 11:14:28 +01:00
Piotr Dulikowski	a1ebfcf006	raft: add server::is_alive Add a method which reports whether given raft server is running. In following commits, the information about whether the local raft group 0 is running or not will be included in the response to the failure detector ping, and the is_alive method will be used there.	2023-11-23 00:34:22 +01:00
Kefu Chai	efd65aebb2	build: cmake: add check-header target to have feature parity with `configure.py`. we won't need this once we migrate to C++20 modules. but before that day comes, we need to stick with C++ headers. we generate a rule for each .hh files to create a corresponding .cc and then compile it, in order to verify the self-containness of that header. so the number of rule is quite large, to avoid the unnecessary overhead. the check-header target is enabled only if `Scylla_CHECK_HEADERS` option is enabled. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#15913	2023-11-13 10:27:06 +02:00
Gleb Natapov	9f6e93c144	raft: make sure that all operation forwarded to a leader are completed before destroying raft server Hold a gate around all operations that are forwarded to a leader to be able to wait for them during server::abort() otherwise the abort() may complete while those operations are still running which may cause use after free.	2023-10-25 13:29:36 +03:00
Piotr Dulikowski	64668e325e	raft: expose current_leader in raft::server The handler for join_node_request will need to know which node is considered the group 0 leader right now by the local node. If the topology coordinator crashes and a new node immediately wants to replace it with the same IP, the node that handles join_node_request will attempt to perform a read barrier. If this happens quickly enough, due to the IP reuse the RPC will be sent to the new node instead of the (now crashed) topology coordinator; the RPC will get an error and will fail the barrier. If we detect that the new node wants to replace the current topology coordinator, the upcoming join_node_request_handler will wait until there is a leader change.	2023-09-26 15:56:52 +02:00

1 2 3 4 5 ...

358 Commits