scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-21 00:50:35 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	04f9a150be	Merge 'raft: split `can_vote` field form `server_address` to separate struct' from Kamil Braun Whether a server can vote in a Raft configuration is not part of the address. `server_address` was used in many context where `can_vote` is irrelevant. Split the struct: `server_address` now contains only `id` and `server_info` as it did before `can_vote` was introduced. Instead we have a `config_member` struct that contains a `server_address` and the `can_vote` field. Also remove an "unsafe" constructor from `server_address` where `id` was provided but `server_info` was not. The constructor was used for tests where `server_info` is irrelevant, but it's important not to forget about the info in production code. Replace the constructor with helper functions which specify in comments that they are supposed to be used in tests or in contexts where `info` doesn't matter (e.g. when checking presence in an `unordered_set`, where the equality operator and hash operate only on the `id`). Closes #11047 * github.com:scylladb/scylla: raft: fsm: fix `entry_size` calculation for config entries raft: split `can_vote` field from `server_address` to separate struct serializer_impl: generalize (de)serialization of `unordered_set` to_string: generalize `operator<<` for `unordered_set`	2022-07-20 12:20:52 +02:00
Gleb Natapov	d40106d3a9	raft: remove unused code Message-Id: <YtUh8Hs+nQQ8+hLY@scylladb.com>	2022-07-18 21:21:45 +03:00
Kamil Braun	7c377cc457	raft: fsm: fix `entry_size` calculation for config entries We forgot about `can_vote`. Stumbled on this while separating `can_vote` to separate struct. Note that `entry_size` is still inaccurate (#11068) but the patch is an improvement. Refs: #11068	2022-07-18 18:24:50 +02:00
Kamil Braun	daf9c53bb8	raft: split `can_vote` field from `server_address` to separate struct Whether a server can vote in a Raft configuration is not part of the address. `server_address` was used in many context where `can_vote` is irrelevant. Split the struct: `server_address` now contains only `id` and `server_info` as it did before `can_vote` was introduced. Instead we have a `config_member` struct that contains a `server_address` and the `can_vote` field. Also remove an "unsafe" constructor from `server_address` where `id` was provided but `server_info` was not. The constructor was used for tests where `server_info` is irrelevant, but it's important not to forget about the info in production code. The constructor was used for two purposes: - Invoking set operations such as `contains`. To solve this we use C++20 transparent hash and comparator functions, which allow invoking `contains` and similar functions by providing a different key type (in this case `raft::server_id` in set of addresses, for example). - constructing addresses without `info`s in tests. For this we provide helper functions in the test helpers module and use them.	2022-07-18 18:22:10 +02:00
Petr Gusev	86299ad194	raft: server: fix comment for set_configuration follow-up to https://github.com/scylladb/scylla/pull/10905 as discussed in the comments. Closes #11035	2022-07-14 11:37:35 +02:00
Petr Gusev	6cdd5b9ff5	raft, set_configuration fix: don't use dummy entries Leader which ceases to be a leader as a result of a execute_modify_config cannot wait for a dummy record to be committed because io_fiber aborts current waiters as soon as it detects a lost of leadership. This commit excludes dummy entries from the configuration change procedure. A special promise is set on io_fiber when it gets a non-joint configuration, and set_configuration just waits for the corresponding future instead of a dummy record. Fixes: #10010 Closes #10905	2022-07-06 11:26:59 +02:00
Kamil Braun	d128f65354	raft: server: if `add_entry` with `wait_type::applied` successfully returns, ensure `state_machine::apply` is called for this entry Previously it could happen that `add_entry` returned successfully but `state_machine::apply` was never called by the server for this entry, even though `wait_type::applied` was used, if the server loaded a snapshot that contained this entry in just the right moment. Some clients may find this behavior surprising, even though we may argue that it's not technically incorrect. For example, the nemesis test assumed that if `add_entry` returned successfully (with `wait_type::applied`), the local state machine applied the entry; the test uses `apply` to obtain an output - the result of the command - from the state machine. It's not a problem to give a stronger guarantee, so we do it in this commit. In the scenario where a snapshot causes Raft to skip over the entry, `add_entry` will finish exceptionally with `commit_status_unknown`.	2022-05-27 12:06:18 +02:00
Kamil Braun	f31f73b1e8	raft: tracker: fix the definition of `voters()` The previous implementation was weird, and it's not even clear if the C++ standard defined what the result would be (because it used `std::unordered_set::insert(iterator, iterator)`, where the iterators pointed to a sequence of elements with elements that already had equivalent elements in the set; cppreference does not specify which elements end up in the set in this case). In any case, in testing, the resulting set did not give the desired result: if the configuration was joint, and a server was a voter in the previous config but a non-voter in the current one, it would not be a member of this set. This would cause the server to not vote for itself when it became a candidate, which could lead to cluster unavailability. The new definition is simple: a server belongs to `voters()` iff it is a voter in current or previous configuration. This fixes the problem described above. Fixes #10618.	2022-05-27 12:06:18 +02:00
Kamil Braun	6e5d1f4784	raft: when printing `raft::server_address`, include `can_vote` Makes debugging easier when configurations include non-voters.	2022-05-27 12:06:18 +02:00
Kamil Braun	c8237d405e	raft: server: in `abort()`, abort read barriers before waiting for rpc abort `rpc::abort` may need to wait until all read barriers finish, so abort read barrier before waiting for `rpc::abort` to finish to avoid a deadlock on shutdown. `rpc::abort` is still called before the read barriers are aborted, only waited for after. Calling it first prevents new read barriers from being started by `rpc` (see `rpc::abort` comment). Also prevent new read barriers from being started after abort starts directly on a leader by checking the `_aborted` flag at the beginning of `execute_read_barrier`. Finally, use the opportunity to remove some compiler-dependent code.	2022-05-25 14:56:32 +02:00
Kamil Braun	86c5036353	raft: server: handle aborts correctly in `read_barrier` The `wait_for_apply` function, called from `read_barrier`, didn't handle aborts. Fix that.	2022-05-25 14:56:32 +02:00
Kamil Braun	1eb849c3d7	raft: fsm: don't advance commit index further than match_idx during read_quorum It's not safe to advance the commit index further than match_idx since beyond that point the follower's log may be outdated. Fixes #10578.	2022-05-25 14:56:32 +02:00
Kamil Braun	4767b163ef	service: raft: rpc: don't call `execute...` functions after `abort()` The functions are called from RPC when a follower forwards a request to a leader (`add_entry`, `modify_config`, `read_barrier`). The call may be attempted during shutdown. The Raft shutdown code cleans up data structures created by those requests. Make sure that they are not updated concurrently with shutdown. This can lead to problems such as using the server object after it was aborted, or even destroyed. After this change, the RPC implementation may wait for a `execute_modify_config` call to finish before finishing abort. That call in turn may be stuck on `wait_for_entry`. Thus the waiter may prevent RPC from aborting. Fix this be moving the wait on the future returned from `_rpc->abort()` in `server::abort()` until after waiters were destroyed.	2022-05-24 11:36:51 +02:00
Kamil Braun	5e06d0ad6f	raft: server: fix bad_variant_access in `modify_config` `modify_config` would call `execute_modify_config` or `_rpc->send_modify_config`, which returned a reply of type `add_entry_reply`. This is a variant of 3 options: `entry_id`, `not_a_leader`, or `commit_status_unknown`. The code would check for the `entry_id` option and otherwise assume that it was `not_a_leader`. During nemesis testing however, the reply was sometimes `commit_status_unknown`, which caused a `bad_variant_access` exception during `std::get` call. Fix this. There is a similar piece of code in `add_entry`, but there it should be impossible to obtain `commit_status_unknown` even though the types don't enforce it. Make it more explicit with a comment and an assertion.	2022-05-24 11:36:51 +02:00
Gleb Natapov	7f26a8eef5	raft: actively search for a leader if it is not known for a tick duration For a follower to forward requests to a leader the leader must be known. But there may be a situation where a follower does not learn about a leader for a while. This may happen when a node becomes a follower while its log is up-to-date and there are no new entries submitted to raft. In such case the leader will send nothing to the follower and the only way to learn about the current leader is to get a message from it. Until a new entry is added to the raft's log a follower that does not know who the leader is will not be able to add entries. Kind of a deadlock. Note that the problem is specific to our implementation where failure detection is done by an outside module. In vanilla raft a leader sends messages to all followers periodically, so essentially it is never idle. The patch solves this by broadcasting specially crafted append reject to all nodes in the cluster on a tick in case a leader is not known. The leader responds to this message with an empty append request which will cause the node to learn about the leader. For optimisation purposes the patch sends the broadcast only in case there is actually an operation that waits for leader to be known. Fixes #10379	2022-04-25 14:51:22 +02:00
Kamil Braun	5308a7d7a3	raft: server: return immediately from `wait_for_leader` if leader is known `wait_for_leader` may be called when leader is known. There's nothing to wait for in this case.	2022-04-25 12:59:55 +02:00
Kamil Braun	ad3141d3e0	raft: server: translate abort_requested_exception to raft::request_aborted The `wait_for_leader` function would throw a low-level `abort_requested_aborted` exception from seastar::shared_promise. Translate it to the high-level raft::request_aborted so we can reduce the number of different exception types which cross the Raft API boundary. Also, add comments on Raft API functions about the exception thrown when requests are aborted.	2022-04-05 19:18:53 +02:00
Kamil Braun	7da586b912	raft: fsm: when stopping, become follower to reject new requests After enabling add_entry forwarding in randomized_nemesis_test, the test would sometimes hang on _rpc->abort() call due to add_entry messages from followers which waited on log_limiter_semaphore on the leader preventing _rpc from finishing the abort; the log_limter_semaphore would not get unblocked because the part of the server was already stopped. Prevent log_limiter_semaphore from being waited on when stopping the server by becoming a follower in fsm::stop.	2022-04-05 19:11:44 +02:00
Kamil Braun	0f0d75fd66	raft: server: translate semaphore_aborted to request_aborted	2022-03-29 15:10:29 +02:00
Gleb Natapov	a1604aa388	raft: make raft requests abortable This patch adds an ability to pass abort_source to raft request APIs ( add_entry, modify_config) to make them abortable. A request issuer not always want to wait for a request to complete. For instance because a client disconnected or because it no longer interested in waiting because of a timeout. After this patch it can now abort waiting for such requests through an abort source. Note that aborting a request only aborts the wait for it to complete, it does not mean that the request will not be eventually executed. Message-Id: <YjHivLfIB9Xj5F4g@scylladb.com>	2022-03-16 18:38:01 +01:00
Gleb Natapov	108e7fcc4e	raft: enter candidate state immediately when starting a singleton cluster When a node starts it does not immediately becomes a candidate since it waits to learn about already existing leader and randomize the time it becomes a candidate to prevent dueling candidates if several nodes are started simultaneously. If a cluster consist of only one node there is no point in waiting before becoming a candidate though because two cases above cannot happen. This patch checks that the node belongs to a singleton cluster where the node itself is the only voting member and becomes candidate immediately. This reduces the starting time of a single node cluster which are often used in testing. Message-Id: <YiCbQXx8LPlRQssC@scylladb.com>	2022-03-04 20:30:52 +01:00
Piotr Grabowski	e99f487d31	raft: add missing include Add missing include of "<experimental/source_location>" which caused compile errors on GCC: In file included from raft/fsm.hh:12, from raft/fsm.cc:8: raft/raft.hh:251:30: error: ‘std::experimental’ has not been declared 251 \| state_machine_error(std::experimental::source_location l = std::experimental::source_location::current()) \| ^~~~~~~~~~~~ raft/raft.hh:251:59: error: expected ‘)’ before ‘l’ 251 \| state_machine_error(std::experimental::source_location l = std::experimental::source_location::current()) \| ~ ^~ Note that there are some GCC compilation problems still left apart from this one. Closes #10155	2022-03-02 16:33:43 +01:00
Alejo Sanchez	627275945f	raft: modify_config: support voting state change Handle requests to change voting for servers already present in the current configuration. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2022-02-08 08:00:07 -04:00
Alejo Sanchez	a40417df08	raft: minor: fix log format string Fix format string for log line. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2022-02-08 08:00:07 -04:00
Tomasz Grabiec	00a9326ae7	Merge "raft: let `modify_config` finish on a follower that removes itself" from Kamil When forwarding a reconfiguration request from follower to a leader in `modify_config`, there is no reason to wait for the follower's commit index to be updated. The only useful information is that the leader committed the configuration change - so `modify_config` should return as soon as we know that. There is a reason not to wait for the follower's commit index to be updated: if the configuration change removes the follower, the follower will never learn about it, so a local waiter will never be resolved. `execute_modify_config` - the part of `modify_config` executed on the leader - is thus modified to finish when the configuration change is fully complete (including the dummy entry appended at the end), and `modify_config` - which does the forwarding - no longer creates a local waiter, but returns as soon as the RPC call to the leader confirms that the entry was committed on the leader. We still return an `entry_id` from `execute_modify_config` but that's just an artifact of the implementation. Fixes #9981. A regression test was also added in randomized_nemesis_test. * kbr/modify-config-finishes-v1: test: raft: randomized_nemesis_test: regression test for #9981 raft: server: don't create local waiter in `modify_config`	2022-01-31 20:14:50 +01:00
Tomasz Grabiec	b78bab7286	Merge "raft: fixes and improvements to the library and nemesis test" from Kamil Raft randomized nemesis test was improved by adding some more chaos: randomizing the network delay, server configuration, ticking speed of servers. This allowed to catch a serious bug, which is fixed in the first patch. The patchset also fixes bugs in the test itself and adds quality of life improvements such as better diagnostics when inconsistency is detected. * kbr/nemesis-random-v1: test: raft: randomized_nemesis_test: print state of each state machine when detecting inconsistency test: raft: randomized_nemesis_test: print details when detecting inconsistency test: raft: randomized_nemesis_test: print snapshot details when taking/loading snapshots in `impure_state_machine` test: raft: randomized_nemesis_test: keep server id in impure_state_machine test: raft: randomized_nemesis_test: frequent snapshotting configuration test: raft: randomized_nemesis_test: tick servers at different speeds in generator test test: raft: randomized_nemesis_test: simplify ticker test: raft: randomized_nemesis_test: randomize network delay test: raft: randomized_nemesis_test: fix use-after-free in `environment::crash()` test: raft: randomized_nemesis_test: fix use-after-free in two-way rpc functions test: raft: randomized_nemesis_test: rpc: don't propagate `gate_closed_exception` outside test: raft: randomized_nemesis_test: fix obsolete comment raft: fsm: print configuration entries appearing in the log raft: `operator<<(ostream&, ...)` implementation for `server_address` and `configuration` raft: server: abort snapshot applications before waiting for rpc abort raft: server: logging fix raft: fsm: don't advance commit index beyond matched entries	2022-01-31 13:25:27 +01:00
Kamil Braun	28b5792481	raft: server: don't create local waiter in `modify_config` When forwarding a reconfiguration request from follower to a leader in `modify_config`, there is no reason to wait for the follower's commit index to be updated. The only useful information is that the leader committed the configuration change - so `modify_config` should return as soon as we know that. There is a reason not to wait for the follower's commit index to be updated: if the configuration change removes the follower, the follower will never learn about it, so a local waiter will never be resolved. `execute_modify_config` - the part of `modify_config` executed on the leader - is thus modified to finish when the configuration change is fully complete (including the dummy entry appended at the end), and `modify_config` - which does the forwarding - no longer creates a local waiter, but returns as soon as the RPC call to the leader confirms that the entry was committed on the leader. We still return an `entry_id` from `execute_modify_config` but that's just an artifact of the implementation. Fixes #9981.	2022-01-27 17:49:40 +01:00
Kamil Braun	22092d110a	raft: fsm: print configuration entries appearing in the log When appending or committing configuration entries, print them (on TRACE level). Useful for debugging.	2022-01-26 16:09:41 +01:00
Kamil Braun	44a1a8a8b0	raft: `operator<<(ostream&, ...)` implementation for `server_address` and `configuration` Useful for debugging. Had to make `configuration` constructor explicit. Otherwise the `operator<<` implementation for `configuration` would implicitly convert the `server_address` to `configuration` when trying to output it, causing infinite recursion. Removed implicit uses of the constructor.	2022-01-26 16:09:41 +01:00
Kamil Braun	46f6a0cca5	raft: server: abort snapshot applications before waiting for rpc abort The implementation of `rpc` may wait for all snapshot applications to finish before it can finish aborting. This is what the randomized_nemesis_test implementation did. This caused rpc abort to hang in some scenarios. In this commit, the order of abort calls is modified a bit. Instead of waiting for rpc abort to finish and then aborting existing snapshot applications, we call `rpc::abort()` and keep the future, then abort snapshot applications, then wait on the future. Calling `rpc::abort()` first is supposed to prevent new snapshot applications from starting; a comment was added at the interface definition. The nemesis test implementation had this property, and `raft_rpc` in group registry was adjusted appropriately. Aborting the snapshot applications then allows `rpc::abort()` to finish.	2022-01-26 16:06:45 +01:00
Kamil Braun	5577ad6c34	raft: server: logging fix	2022-01-26 15:54:14 +01:00
Kamil Braun	1216f39977	raft: fsm: don't advance commit index beyond matched entries Otherwise it was possible to incorrectly mark obsolete entries from earlier terms as committed, leading to inconsistencies between state machine replicas. Fixes #9965.	2022-01-26 15:53:13 +01:00
Gleb Natapov	579dcf187a	raft: allow an option to persist commit index Raft does not need to persist the commit index since a restarted node will either learn it from an append message from a leader or (if entire cluster is restarted and hence there is no leader) new leader will figure it out after contacting a quorum. But some users may want to be able to bring their local state machine to a state as up-to-date as it was before restart as soon as possible without any external communication. For them this patch introduces new persistence API that allows saving and restoring last seen committed index. Message-Id: <YfFD53oS2j1My0p/@scylladb.com>	2022-01-26 14:06:39 +01:00
Gleb Natapov	e56e96ac5a	raft: do not add new wait entries after abort Abort signals stopped_error on all awaited entries, but if an entry is added after this it will be destroyed without signaling and will cause a waiter to get broken_promise. Fixes #9688 Message-Id: <Ye6xJjTDooKSuZ87@scylladb.com>	2022-01-25 09:52:30 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	4118f2d8be	treewide: replace deprecated seastar::later() with seastar::yield() seastar::later() was recently deprecated and replaced with two alternatives: a cheap seastar::yield() and an expensive (but more powerful) seastar::check_for_io_immediately(), that corresponds to the original later(). This patch replaces all later() calls with the weaker yield(). In all cases except one, it's unambiguously correct. In one case (test/perf scheduling_latency_measurer::stop()) it's not so ambiguous, since check_for_io_immediately() will additionally force a poll and so will cause more work to be done (but no additional tasks to be executed). However, I think that any measurement that relies on the measuring the work on the last tick to be inaccurate (you need thousands of ticks to get any amount of confidence in the measurement) that in the end it doesn't matter what we pick. Tests: unit (dev) Closes #9904	2022-01-12 12:19:19 +01:00
Kamil Braun	75bab2beec	raft: server: print the ID of aborted server	2021-12-07 11:23:34 +01:00
Kamil Braun	485c0b1819	raft: server: don't register metrics in `start()` Instead, expose `register_metrics()` at the `server` interface (previously it was a private method of `server_impl`). Metrics are global so `register_metrics()` cannot be called on two servers that have the same ID, which is useful e.g. in tests when we want to simulate server stops and restarts.	2021-12-07 11:23:33 +01:00
Konstantin Osipov	eea82f1262	raft: (server) improve tracing	2021-11-25 12:35:43 +03:00
Konstantin Osipov	0d830d4c11	raft: (metrics) fix spelling of waiters_awaken The usage of awake and awaken is quite messy, but awoken is more common for passive voice, so use waiters_awoken.	2021-11-25 12:35:43 +03:00
Konstantin Osipov	6d28927550	raft: make forwarding optional In absence of abort_source or timeouts in Raft API, automatic bouncing can create too much noise during testing, especially during network failures. Add an option to disable follower bouncing feature, since randomized_nemesis_test has its own bouncing which handles timeouts correctly. Optionally disable forwarding in basic_generator_test.	2021-11-25 12:35:43 +03:00
Konstantin Osipov	c22f945f11	raft: (service) manage Raft configuration during topology changes Operations of adding or removing a node to Raft configuration are made idempotent: they do nothing if already done, and they are safe to resume after a failure. However, since topology changes are not transactional, if a bootstrap or removal procedure fails midway, Raft group 0 configuration may go out of sync with topology state as seen by gossip. In future we must change gossip to avoid making any persistent changes to the cluster: all changes to persistent topology state will be done exclusively through Raft Group 0. Specifically, instead of persisting the tokens by advertising them through gossip, the bootstrap will commit a change to a system table using Raft group 0. nodetool will switch from looking at gossip-managed tables to consulting with Raft Group 0 configuration or Raft-managed tables. Once this transformation is done, naturally, adding a node to Raft configuration (perhaps as a non-voting member at first) will become the first persistent change to ring state applied when a node joins; removing a node from the Raft Group 0 configuration will become the last action when removing a node. Until this is done, do our best to avoid a cluster state when a removed node or a node which addition failed is stuck in Raft configuration, but the node is no longer present in gossip-managed system tables. In other words, keep the gossip the primary source of truth. For this purpose, carefully chose the timing when we join and leave Raft group 0: Join the Raft group 0 only after we've advertised our tokens, so the cluster is aware of this node, it's visible in nodetool status, but before node state jumps to "normal", i.e. before it accepts queries. Since the operation is idempotent, invoke it on each restart. Remove the node from Group 0 before its tokens are removed from gossip-managed system tables. This guarantees that if removal from Raft group 0 fails for whatever reason, the node stays in the ring, so nodetool removenode and friends are re-tried. Add tracing.	2021-11-25 12:35:42 +03:00
Konstantin Osipov	e3751068fe	raft: (server) allow adding entries/modify config on a follower Implement an RPC to forward add_entry calls from the follower to leader. Bounce & retry in case of not_a_leader. Do not retry in case of uncertainty - this can lead to adding duplicate entries. The feature is added to core Raft since it's needed by all current clients - both topology and schema changes. When forwarding an entry to a remote leader we may get back a term/index pair that conflicts (has the same index, but is with a higher term) with a local entry we're still waiting on. This can happen, e.g. because there was a leader change and the log was truncated, but we still haven't got the append_entries RPC from the new leader, still haven't truncated the log locally, still haven't aborted all the local waits for truncated entries. Only remove the offending entry from the wait list and abort it. There may be entries labeled with an older term to the right (with higher commit index) of the conflicting entry. However, finding them, would require a linear scan. If we allow it, we may end up doing this linear scan for every conflicting entry during the transition period, which brings us to N^2 complexity of this step. At the same time, as soon as append_entries that commits a higher-term entry with the same index reaches the follower, the waits for the respective truncated entry will be aborted anyway (see notify_waiters() which sets dropped_entry exception), so the scan is unnecessary. Similarly to being able to add entries, allow to modify Raft group configuration on a follower. The implementation works the same way as adding entries - forwards the command to the leader. Now that add_entry() or modify_config never throws not_a_leader, it's more likely to throw timed_out_error, e.g. in case the network is partitioned. Previously it was only possible due to a semaphore wait timeout, and this scenario was not tested. Handle timed_out_error on RPC level to let the existing tests (specifically the randomized nemesis test) pass.	2021-11-25 11:50:38 +03:00
Konstantin Osipov	8f303844df	raft: (server) fix a typo in exception message	2021-11-25 11:50:38 +03:00
Konstantin Osipov	9cde1cdf71	raft: (server) implement id() helper There is no easy way to get server id otherwise.	2021-11-25 11:50:38 +03:00
Konstantin Osipov	b9faf41513	raft: (server) remove apply_dummy_entry() It's currently unused, and going forward we'd like to make it work on the follower, which requires a new implementation.	2021-11-25 11:50:38 +03:00
Gleb Natapov	814dea3600	raft: handle non voting members correctly in stepdown procedure For leader stepdown purposes a non voting member is not different from a node outside of the config. The patch makes relevant code paths to check for both conditions.	2021-11-18 11:35:29 +02:00
Gleb Natapov	6a9b3cdb49	raft: exclude non voting nodes from the stable leader check If a node is a non voting member it cannot be a leader, so the stable leader rule should not be applied to it. This patch aligns non voting node behaviour with a node that was removed from the cluster. Both of them stepdown from leader position if they happen to be a leader when the state change occurred.	2021-11-18 11:18:13 +02:00
Gleb Natapov	8f64a6d2d2	raft: fix configuration::can_vote() to worth correctly with joint config Fix configuration::can_vote() to return true if a node is a voting member in any of the configs.	2021-11-17 11:06:42 +02:00
Gleb Natapov	7aac6c2086	raft: rename rpc_configuration to configuration in fsm output The filed is generic and used not only for rpc configuration now.	2021-11-09 15:16:57 +02:00

1 2 3 4 5

246 Commits