scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 10:30:38 +00:00

Author	SHA1	Message	Date
Gleb Natapov	022a825b33	raft: introduce not_a_member error and return it when non member tries to do add/modify_config Currently if a node that is outside of the config tries to add an entry or modify config transient error is returned and this causes the node to retry. But the error is not transient. If a node tries to do one of the operations above it means it was part of the cluster at some point, but since a node with the same id should not be added back to a cluster if it is not in the cluster now it will never be. Return a new error not_a_member to a caller instead. Message-Id: <Y42mTOx8bNNrHqpd@scylladb.com>	2022-12-05 17:11:04 +01:00
Avi Kivity	6a5d9ff261	treewide: use non-experimental std::source_location Now that we use libstdc++ 12, we can use the standardized source_location. Closes #12137	2022-11-30 11:06:43 +02:00
Avi Kivity	fb6804e7a4	raft: don't compare signed and unsigned types gcc warns it can lead to undefined behavior, though 2G entries in a list of mutations are unlikely. Use the correct type for iteration.	2022-11-28 21:58:30 +02:00
Konstantin Osipov	990c7a209f	raft: change the API of conf change notifications Pass a change diff into the notification callback, rather than add or remove servers one by one, so that if we need to persist the state, we can do it once per configuration change, not for every added or removed server. For now still pass added and removed entries in two separate calls per a single configuration change. This is done mainly to fulfill the library contract that it never sends messages to servers outside the current configuration. The group0 RPC implementation doesn't need the two calls, since it simply marks the removed servers as expired: they are not removed immediately anyway, and messages can still be delivered to them. However, there may be test/mock implementations of RPC which could benefit from this contract, so we decided to keep it.	2022-11-17 12:07:31 +03:00
Kamil Braun	0c9cb5c5bf	Merge 'raft: wait for the next tick before retrying' from Gusev Petr When `modify_config` or `add_entry` is forwarded to the leader, it may reach the node at "inappropriate" time and result in an exception. There are two reasons for it - the leader is changing and, in case of `modify_config`, other `modify_config` is currently in progress. In both cases the command is retried, but before this patch there was no delay before retrying, which could led to a tight loop. The patch adds a new exception type `transient_error`. When the client receives it, it is obliged to retry the request after some delay. Previously leader-side exceptions were converted to `not_a_leader`, which is strange, especially for `conf_change_in_progress`. Fixes: #11564 Closes #11769 * github.com:scylladb/scylladb: raft: rafactor: remove duplicate code on retries delays raft: use wait_for_next_tick in read_barrier raft: wait for the next tick before retrying	2022-11-16 18:20:54 +01:00
Petr Gusev	ae3e0e3627	raft: rafactor: remove duplicate code on retries delays Introduce a templated function do_on_leader_with_retries, use it in add_entries/modify_config/read_barrier. The function implements the basic logic of retries with aborts and leader changes handling, adds a delay between iterations to protect against tight loops.	2022-11-15 13:18:53 +04:00
Petr Gusev	15cc1667d0	raft: use wait_for_next_tick in read_barrier Replaced the yield on transport_error with wait_for_next_tick. Added delays for retries, similar to add_entry/modify_config: we postpone the next call attempt if we haven't received new information about the current leader.	2022-11-15 12:31:49 +04:00
Petr Gusev	5e15c3c9bd	raft: wait for the next tick before retrying When modify_config or add_entry is forwarded to the leader, it may reach the node at "inappropriate" time and result in an exception. There are two reasons for it - the leader is changing and, in case of modify_config, other modify_config is currently in progress. In both cases the command is retried, but before this patch there was no delay before retrying, which could led to a tight loop. The patch adds a new exception type transient_error. When the client node receives it, it is obliged to retry the request, possibly after some delay. Previously, leader-side exceptions were converted to not_a_leader exception, which is strange, especially for conf_change_in_progress. We add a delay before retrying in modify_config and add_entry if the client hasn't received any new information about the leader since the last attempt. This can happen if the server responds with a transient_error with an empty leader and the current node has not yet learned the new leader. We neglect an excessive delay if the newly elected leader is the same as the previous one, this supposed to be a rare. Fixes: #11564	2022-11-15 11:49:26 +04:00
Petr Gusev	d79fbab682	raft: convert raft::transport_error to raft::commit_status_unknown The add_entry and modify_config methods sometimes do an rpc to execute the request on the current leader. If the tcp connection was broken, a seastar::rpc::closed_error would be thrown to the client. This exception was not documented in the method comments and the client could have missed handling it. For example, this exception was not handled when calling modify_config in raft_group0, which sometimes broke the removenode command. An intermittent_connection_error exception was added earlier to solve a similar problem with the read_barrier method. In this patch it is renamed to transport_error, as it seems to better describe the situation, and an explicit specification for this exception was added - the rpc implementation can throw it if it is not known whether the call reached the target node and whether any actions were performed on it. In case of read_barrier it does not matter and we just retry. In case of add_entry and modify_config we cannot retry because the rpc calls are not idempotent, so we convert this exception to commit_status_unknown, which the client has to handle. Explicit comments have also been added to raft::server methods describing all possible exceptions.	2022-10-07 13:34:16 +04:00
Petr Gusev	cbfe033786	raft server, shrink_to_fit on log truncation We don't want to keep memory we don't use, shrink_to_fit guarantees that. In fact, boost::deque frees up memory when items are deleted, so this change has little effect at the moment, but it may pay off if we change the container in the future.	2022-09-27 12:02:36 +04:00
Petr Gusev	b34dfed307	raft server, release memory if add_entry throws We consume memory from semaphore in add_entry_on_leader, but never release it if add_entry throws.	2022-09-27 12:02:34 +04:00
Petr Gusev	27e60ecbf4	raft server, log size limit in bytes Before this patch we could get an OOM if we received several big commands. The number of commands was small, but their total size in bytes was large. snapshot_trailing_size is needed to guarantee progress. Without this limit the fsm could get stuck if the size of the next item is greater than max_log_size - (size of trailing entries).	2022-09-26 13:10:10 +04:00
Petr Gusev	210d9dd026	raft: fix snapshots leak applier_fiber could create multiple snapshots between io_fiber run. The fsm_output.snp variable was overwritten by applier_fiber and io_fiber didn't drop the previous snapshot. In this patch we introduce the variable fsm_output.snps_to_drop, store in it the current snapshot id before applying a new one, and then sequentially drop them in io_fiber after storing the last snapshot_descriptor. _sm_events.signal() is added to fsm::apply_snapshot, since this method mutates the _output and thus gives a reason to run io_fiber. The new test test_frequent_snapshotting demonstrates the problem by causing frequent snapshots and setting the applier queue size to one. Closes #11530	2022-09-21 12:46:26 +02:00
Petr Gusev	1b5fa4088e	raft server, abort group0 server on background errors	2022-09-12 10:16:43 +04:00
Petr Gusev	e92dc9c15b	raft server, provide a callback to handle background errors Fix: #11352	2022-09-12 10:16:43 +04:00
Petr Gusev	c57238d3d6	raft server, check aborted state on public server public api's Fix: #11352	2022-09-12 10:16:40 +04:00
Kamil Braun	6c16ae4868	Merge 'raft, limit for command size' from Gusev Petr Commitlog imposes a limit on the size of mutations and throws an exception if it's exceeded. In case of schema changes before raft this exception was delivered to the client. Now it happens while saving the raft command in io_fiber in persistence->store_log_entries and what the client gets is just a timeout exception, which doesn't say much about the cause of the problem. This patch introduces an explicit command size limit and provides a clear error message in this case. Closes #11318 * github.com:scylladb/scylladb: raft, use max_command_size to satisfy commitlog limit raft, limit for command size	2022-08-26 12:20:58 +02:00
Tomasz Grabiec	83850e247a	Merge 'raft: server: handle aborts when waiting for config entry to commit' from Kamil Braun Changing configuration involves two entries in the log: a 'joint configuration entry' and a 'non-joint configuration entry'. We use `wait_for_entry` to wait on the joint one. To wait on the non-joint one, we use a separate promise field in `server`. This promise wasn't connected to the `abort_source` passed into `set_configuration`. The call could get stuck if the server got removed from the configuration and lost leadership after committing the joint entry but before committing the non-joint one, waiting on the promise. Aborting wouldn't help. Fix this by subscribing to the `abort_source` in resolving the promise exceptionally. Furthermore, make sure that two `set_configuration` calls don't step on each other's toes by one setting the other's promise. To do that, reset the promise field at the end of `set_configuration` and check that it's not engaged at the beginning. Fixes #11288. Closes #11325 * github.com:scylladb/scylladb: test: raft: randomized_nemesis_test: additional logging raft: server: handle aborts when waiting for config entry to commit	2022-08-25 12:49:09 +02:00
Tomasz Grabiec	9c4e32d2e2	Merge 'raft: server: drop waiters in `applier_fiber` instead of `io_fiber`' from Kamil Braun When `io_fiber` fetched a batch with a configuration that does not contain this node, it would send the entries committed in this batch to `applier_fiber` and proceed by any remaining entry dropping waiters (if the node was no longer a leader). If there were waiters for entries committed in this batch, it could either happen that `applier_fiber` received and processed those entries first, notifying the waiters that the entries were committed and/or applied, or it could happen that `io_fiber` reaches the dropping waiters code first, causing the waiters to be resolved with `commit_status_unknown`. The second scenario is undesirable. For example, when a follower tries to remove the current leader from the configuration using `modify_config`, if the second scenario happens, the follower will get `commit_status_unknown` - this can happen even though there are no node or network failures. In particular, this caused `randomized_nemesis_test.remove_leader_with_forwarding_finishes` to fail from time to time. Fix it by serializing the notifying and dropping of waiters in a single fiber - `applier_fiber`. We decided to move all management of waiters into `applier_fiber`, because most of that management was already there (there was already one `drop_waiters` call, and two `notify_waiters` calls). Now, when `io_fiber` observes that we've been removed from the config and no longer a leader, instead of dropping waiters, it sends a message to `applier_fiber`. `applier_fiber` will drop waiters when receiving that message. Improve an existing test to reproduce this scenario more frequently. Fixes #11235. Closes #11308 * github.com:scylladb/scylladb: test: raft: randomized_nemesis_test: more chaos in `remove_leader_with_forwarding_finishes` raft: server: drop waiters in `applier_fiber` instead of `io_fiber` raft: server: use `visit` instead of `holds_alternative`+`get`	2022-08-23 17:19:44 +02:00
Kamil Braun	efad6fe9b4	raft: server: handle aborts when waiting for config entry to commit Changing configuration involves two entries in the log: a 'joint configuration entry' and a 'non-joint configuration entry'. We use `wait_for_entry` to wait on the joint one. To wait on the non-joint one, we use a separate promise field in `server`. This promise wasn't connected to the `abort_source` passed into `set_configuration`. The call could get stuck if the server got removed from the configuration and lost leadership after committing the joint entry but before committing the non-joint one, waiting on the promise. Aborting wouldn't help. Fix this by subscribing to the `abort_source` in resolving the promise exceptionally. Furthermore, make sure that two `set_configuration` calls don't step on each other's toes by one setting the other's promise. To do that, reset the promise field at the end of `set_configuration` and check that it's not engaged at the beginning. Fixes #11288.	2022-08-23 13:14:29 +02:00
Kamil Braun	db2a3deda1	raft: server: drop waiters in `applier_fiber` instead of `io_fiber` When `io_fiber` fetched a batch with a configuration that does not contain this node, it would send the entries committed in this batch to `applier_fiber` and proceed by any remaining entry dropping waiters (if the node was no longer a leader). If there were waiters for entries committed in this batch, it could either happen that `applier_fiber` received and processed those entries first, notifying the waiters that the entries were committed and/or applied, or it could happen that `io_fiber` reaches the dropping waiters code first, causing the waiters to be resolved with `commit_status_unknown`. The second scenario is undesirable. For example, when a follower tries to remove the current leader from the configuration using `modify_config`, if the second scenario happens, the follower will get `commit_status_unknown` - this can happen even though there are no node or network failures. In particular, this caused `randomized_nemesis_test.remove_leader_with_forwarding_finishes` to fail from time to time. Fix it by serializing the notifying and dropping of waiters in a single fiber - `applier_fiber`. We decided to move all management of waiters into `applier_fiber`, because most of that management was already there (there was already one `drop_waiters` call, and two `notify_waiters` calls). Now, when `io_fiber` observes that we've been removed from the config and no longer a leader, instead of dropping waiters, it sends a message to `applier_fiber`. `applier_fiber` will drop waiters when receiving that message. Fixes #11235.	2022-08-22 18:53:44 +02:00
Kamil Braun	5badf20c7a	raft: server: use `visit` instead of `holds_alternative`+`get` In `std::holds_alternative`+`std::get` version, the `get` performs a redundant check. Also `std::visit` gives a compile-time exhaustiveness check (whether we handled all possible cases of the `variant`).	2022-08-22 18:47:48 +02:00
Kamil Braun	b52429f724	Merge 'raft: relax some error severity' from Gleb Natapov Dtest fails if it sees an unknown errors in the logs. This series reduces severity of some errors (since they are actually expected during shutdown) and removes some others that duplicate already existing errors that dtest knows how to deal with. Also fix one case of unhandled exception in schema management code. * 'dtest-fixes-v1' of github.com:gleb-cloudius/scylla: raft: getting abort_requested_exception exception from a sm::apply is not a critical error schema_registry: fix abandoned feature warning service: raft: silence rpc::closed_errors in raft_rpc	2022-08-18 12:16:44 +02:00
Petr Gusev	eedfd7ad9b	raft, limit for command size Adds max_command_size to the raft configuration and restricts commands to this limit.	2022-08-18 13:35:49 +04:00
Gleb Natapov	e5157b27ad	raft: getting abort_requested_exception exception from a sm::apply is not a critical error During shutdown it is normal to get abort_requested_exception exception from a state machine "apply" method. Do not rethrow it as state_machine_error, just abort an applier loop with an info message.	2022-08-11 15:11:21 +03:00
Petr Gusev	4bc6611829	raft read_barrier, retry over intermittent rpc failures If the leader was unavailable during read_barrier, closed_error occurs, which was not handled in any way and eventually reached the client. This patch adds retries in this case. Fix: scylladb#11262 Refs: #11278 Closes #11263	2022-08-11 13:31:19 +03:00
Benny Halevy	6436c614d7	raft: migrate tagged_id definition to utils::tagged_uuid So it can be used for other types in the system outside of raft, like counter_id, table_id, table_schema_version, and more. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 08:02:27 +03:00
Tomasz Grabiec	04f9a150be	Merge 'raft: split `can_vote` field form `server_address` to separate struct' from Kamil Braun Whether a server can vote in a Raft configuration is not part of the address. `server_address` was used in many context where `can_vote` is irrelevant. Split the struct: `server_address` now contains only `id` and `server_info` as it did before `can_vote` was introduced. Instead we have a `config_member` struct that contains a `server_address` and the `can_vote` field. Also remove an "unsafe" constructor from `server_address` where `id` was provided but `server_info` was not. The constructor was used for tests where `server_info` is irrelevant, but it's important not to forget about the info in production code. Replace the constructor with helper functions which specify in comments that they are supposed to be used in tests or in contexts where `info` doesn't matter (e.g. when checking presence in an `unordered_set`, where the equality operator and hash operate only on the `id`). Closes #11047 * github.com:scylladb/scylla: raft: fsm: fix `entry_size` calculation for config entries raft: split `can_vote` field from `server_address` to separate struct serializer_impl: generalize (de)serialization of `unordered_set` to_string: generalize `operator<<` for `unordered_set`	2022-07-20 12:20:52 +02:00
Gleb Natapov	d40106d3a9	raft: remove unused code Message-Id: <YtUh8Hs+nQQ8+hLY@scylladb.com>	2022-07-18 21:21:45 +03:00
Kamil Braun	7c377cc457	raft: fsm: fix `entry_size` calculation for config entries We forgot about `can_vote`. Stumbled on this while separating `can_vote` to separate struct. Note that `entry_size` is still inaccurate (#11068) but the patch is an improvement. Refs: #11068	2022-07-18 18:24:50 +02:00
Kamil Braun	daf9c53bb8	raft: split `can_vote` field from `server_address` to separate struct Whether a server can vote in a Raft configuration is not part of the address. `server_address` was used in many context where `can_vote` is irrelevant. Split the struct: `server_address` now contains only `id` and `server_info` as it did before `can_vote` was introduced. Instead we have a `config_member` struct that contains a `server_address` and the `can_vote` field. Also remove an "unsafe" constructor from `server_address` where `id` was provided but `server_info` was not. The constructor was used for tests where `server_info` is irrelevant, but it's important not to forget about the info in production code. The constructor was used for two purposes: - Invoking set operations such as `contains`. To solve this we use C++20 transparent hash and comparator functions, which allow invoking `contains` and similar functions by providing a different key type (in this case `raft::server_id` in set of addresses, for example). - constructing addresses without `info`s in tests. For this we provide helper functions in the test helpers module and use them.	2022-07-18 18:22:10 +02:00
Petr Gusev	86299ad194	raft: server: fix comment for set_configuration follow-up to https://github.com/scylladb/scylla/pull/10905 as discussed in the comments. Closes #11035	2022-07-14 11:37:35 +02:00
Petr Gusev	6cdd5b9ff5	raft, set_configuration fix: don't use dummy entries Leader which ceases to be a leader as a result of a execute_modify_config cannot wait for a dummy record to be committed because io_fiber aborts current waiters as soon as it detects a lost of leadership. This commit excludes dummy entries from the configuration change procedure. A special promise is set on io_fiber when it gets a non-joint configuration, and set_configuration just waits for the corresponding future instead of a dummy record. Fixes: #10010 Closes #10905	2022-07-06 11:26:59 +02:00
Kamil Braun	d128f65354	raft: server: if `add_entry` with `wait_type::applied` successfully returns, ensure `state_machine::apply` is called for this entry Previously it could happen that `add_entry` returned successfully but `state_machine::apply` was never called by the server for this entry, even though `wait_type::applied` was used, if the server loaded a snapshot that contained this entry in just the right moment. Some clients may find this behavior surprising, even though we may argue that it's not technically incorrect. For example, the nemesis test assumed that if `add_entry` returned successfully (with `wait_type::applied`), the local state machine applied the entry; the test uses `apply` to obtain an output - the result of the command - from the state machine. It's not a problem to give a stronger guarantee, so we do it in this commit. In the scenario where a snapshot causes Raft to skip over the entry, `add_entry` will finish exceptionally with `commit_status_unknown`.	2022-05-27 12:06:18 +02:00
Kamil Braun	f31f73b1e8	raft: tracker: fix the definition of `voters()` The previous implementation was weird, and it's not even clear if the C++ standard defined what the result would be (because it used `std::unordered_set::insert(iterator, iterator)`, where the iterators pointed to a sequence of elements with elements that already had equivalent elements in the set; cppreference does not specify which elements end up in the set in this case). In any case, in testing, the resulting set did not give the desired result: if the configuration was joint, and a server was a voter in the previous config but a non-voter in the current one, it would not be a member of this set. This would cause the server to not vote for itself when it became a candidate, which could lead to cluster unavailability. The new definition is simple: a server belongs to `voters()` iff it is a voter in current or previous configuration. This fixes the problem described above. Fixes #10618.	2022-05-27 12:06:18 +02:00
Kamil Braun	6e5d1f4784	raft: when printing `raft::server_address`, include `can_vote` Makes debugging easier when configurations include non-voters.	2022-05-27 12:06:18 +02:00
Kamil Braun	c8237d405e	raft: server: in `abort()`, abort read barriers before waiting for rpc abort `rpc::abort` may need to wait until all read barriers finish, so abort read barrier before waiting for `rpc::abort` to finish to avoid a deadlock on shutdown. `rpc::abort` is still called before the read barriers are aborted, only waited for after. Calling it first prevents new read barriers from being started by `rpc` (see `rpc::abort` comment). Also prevent new read barriers from being started after abort starts directly on a leader by checking the `_aborted` flag at the beginning of `execute_read_barrier`. Finally, use the opportunity to remove some compiler-dependent code.	2022-05-25 14:56:32 +02:00
Kamil Braun	86c5036353	raft: server: handle aborts correctly in `read_barrier` The `wait_for_apply` function, called from `read_barrier`, didn't handle aborts. Fix that.	2022-05-25 14:56:32 +02:00
Kamil Braun	1eb849c3d7	raft: fsm: don't advance commit index further than match_idx during read_quorum It's not safe to advance the commit index further than match_idx since beyond that point the follower's log may be outdated. Fixes #10578.	2022-05-25 14:56:32 +02:00
Kamil Braun	4767b163ef	service: raft: rpc: don't call `execute...` functions after `abort()` The functions are called from RPC when a follower forwards a request to a leader (`add_entry`, `modify_config`, `read_barrier`). The call may be attempted during shutdown. The Raft shutdown code cleans up data structures created by those requests. Make sure that they are not updated concurrently with shutdown. This can lead to problems such as using the server object after it was aborted, or even destroyed. After this change, the RPC implementation may wait for a `execute_modify_config` call to finish before finishing abort. That call in turn may be stuck on `wait_for_entry`. Thus the waiter may prevent RPC from aborting. Fix this be moving the wait on the future returned from `_rpc->abort()` in `server::abort()` until after waiters were destroyed.	2022-05-24 11:36:51 +02:00
Kamil Braun	5e06d0ad6f	raft: server: fix bad_variant_access in `modify_config` `modify_config` would call `execute_modify_config` or `_rpc->send_modify_config`, which returned a reply of type `add_entry_reply`. This is a variant of 3 options: `entry_id`, `not_a_leader`, or `commit_status_unknown`. The code would check for the `entry_id` option and otherwise assume that it was `not_a_leader`. During nemesis testing however, the reply was sometimes `commit_status_unknown`, which caused a `bad_variant_access` exception during `std::get` call. Fix this. There is a similar piece of code in `add_entry`, but there it should be impossible to obtain `commit_status_unknown` even though the types don't enforce it. Make it more explicit with a comment and an assertion.	2022-05-24 11:36:51 +02:00
Gleb Natapov	7f26a8eef5	raft: actively search for a leader if it is not known for a tick duration For a follower to forward requests to a leader the leader must be known. But there may be a situation where a follower does not learn about a leader for a while. This may happen when a node becomes a follower while its log is up-to-date and there are no new entries submitted to raft. In such case the leader will send nothing to the follower and the only way to learn about the current leader is to get a message from it. Until a new entry is added to the raft's log a follower that does not know who the leader is will not be able to add entries. Kind of a deadlock. Note that the problem is specific to our implementation where failure detection is done by an outside module. In vanilla raft a leader sends messages to all followers periodically, so essentially it is never idle. The patch solves this by broadcasting specially crafted append reject to all nodes in the cluster on a tick in case a leader is not known. The leader responds to this message with an empty append request which will cause the node to learn about the leader. For optimisation purposes the patch sends the broadcast only in case there is actually an operation that waits for leader to be known. Fixes #10379	2022-04-25 14:51:22 +02:00
Kamil Braun	5308a7d7a3	raft: server: return immediately from `wait_for_leader` if leader is known `wait_for_leader` may be called when leader is known. There's nothing to wait for in this case.	2022-04-25 12:59:55 +02:00
Kamil Braun	ad3141d3e0	raft: server: translate abort_requested_exception to raft::request_aborted The `wait_for_leader` function would throw a low-level `abort_requested_aborted` exception from seastar::shared_promise. Translate it to the high-level raft::request_aborted so we can reduce the number of different exception types which cross the Raft API boundary. Also, add comments on Raft API functions about the exception thrown when requests are aborted.	2022-04-05 19:18:53 +02:00
Kamil Braun	7da586b912	raft: fsm: when stopping, become follower to reject new requests After enabling add_entry forwarding in randomized_nemesis_test, the test would sometimes hang on _rpc->abort() call due to add_entry messages from followers which waited on log_limiter_semaphore on the leader preventing _rpc from finishing the abort; the log_limter_semaphore would not get unblocked because the part of the server was already stopped. Prevent log_limiter_semaphore from being waited on when stopping the server by becoming a follower in fsm::stop.	2022-04-05 19:11:44 +02:00
Kamil Braun	0f0d75fd66	raft: server: translate semaphore_aborted to request_aborted	2022-03-29 15:10:29 +02:00
Gleb Natapov	a1604aa388	raft: make raft requests abortable This patch adds an ability to pass abort_source to raft request APIs ( add_entry, modify_config) to make them abortable. A request issuer not always want to wait for a request to complete. For instance because a client disconnected or because it no longer interested in waiting because of a timeout. After this patch it can now abort waiting for such requests through an abort source. Note that aborting a request only aborts the wait for it to complete, it does not mean that the request will not be eventually executed. Message-Id: <YjHivLfIB9Xj5F4g@scylladb.com>	2022-03-16 18:38:01 +01:00
Gleb Natapov	108e7fcc4e	raft: enter candidate state immediately when starting a singleton cluster When a node starts it does not immediately becomes a candidate since it waits to learn about already existing leader and randomize the time it becomes a candidate to prevent dueling candidates if several nodes are started simultaneously. If a cluster consist of only one node there is no point in waiting before becoming a candidate though because two cases above cannot happen. This patch checks that the node belongs to a singleton cluster where the node itself is the only voting member and becomes candidate immediately. This reduces the starting time of a single node cluster which are often used in testing. Message-Id: <YiCbQXx8LPlRQssC@scylladb.com>	2022-03-04 20:30:52 +01:00
Piotr Grabowski	e99f487d31	raft: add missing include Add missing include of "<experimental/source_location>" which caused compile errors on GCC: In file included from raft/fsm.hh:12, from raft/fsm.cc:8: raft/raft.hh:251:30: error: ‘std::experimental’ has not been declared 251 \| state_machine_error(std::experimental::source_location l = std::experimental::source_location::current()) \| ^~~~~~~~~~~~ raft/raft.hh:251:59: error: expected ‘)’ before ‘l’ 251 \| state_machine_error(std::experimental::source_location l = std::experimental::source_location::current()) \| ~ ^~ Note that there are some GCC compilation problems still left apart from this one. Closes #10155	2022-03-02 16:33:43 +01:00
Alejo Sanchez	627275945f	raft: modify_config: support voting state change Handle requests to change voting for servers already present in the current configuration. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2022-02-08 08:00:07 -04:00

1 2 3 4 5 ...

273 Commits