scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 18:40:38 +00:00

Author	SHA1	Message	Date
Patryk Jędrzejczak	ba5b5c7d2f	gossip: add recovery_leader to gossip_digest_syn In the new Raft-based recovery procedure, live nodes join the new group 0 one by one during a rolling restart. There is a time window when some of them are in the old group 0, while others are in the new group 0. This causes a group 0 mismatch in `gossiper::handle_syn_msg`. The current solution for this problem is to ignore group 0 mismatches if `recovery_leader` is set on the local node and to ask the administrator to perform the rolling restart in the following way: - set `recovery_leader` in `scylla.yaml` on all live nodes, - send the `SIGHUP` signal to all Scylla processes to reload the config, - proceed with the rolling restart. This commit makes `gossiper::handle_syn_msg` ignore group 0 mismatches when exactly one of the two gossiping nodes has `recovery_leader` set. We achieve this by adding `recovery_leader` to `gossip_digest_syn`. This change makes setting `recovery_leader` earlier on all nodes and reloading the config unnecessary. From now on, the administrator can simply restart each node with `recovery_leader` set. However, note that nodes that join group 0 must have `recovery_leader` set until all nodes join the new group 0. For example, assume that we are in the middle of the rolling restart and one of the nodes in the new group 0 crashes. It must be restarted with `recovery_leader` set, or else it would reject `gossip_digest_syn` messages from nodes in the old group 0. To avoid problems in such cases, we will continue to recommend setting `recovery_leader` in `scylla.yaml` instead of passing it as a command line argument.	2025-07-23 15:36:57 +02:00
Patryk Jędrzejczak	445a15ff45	db/config, gms/gossiper: change recovery_leader to UUID We change the type of the `recovery_leader` config parameter and `gossip_config::recovery_leader` from sstring to UUID. `recovery_leader` is supposed to store host ID, so UUID is a natural choice. After changing the type to UUID, if the user provides an incorrect UUID, parsing `recovery_leader` will fail early, but the start-up will continue. Outside the recovery procedure, `recovery_leader` will then be ignored. In the recovery procedure, the start-up will fail on: ``` throw std::runtime_error( "Cannot start - Raft-based topology has been enabled but persistent group 0 ID is not present. " "If you are trying to run the Raft-based recovery procedure, you must set recovery_leader."); ```	2025-07-23 15:36:56 +02:00
Gleb Natapov	a221b2bfde	gossiper: do not assume that id->ip mapping is available in failure_detector_loop_for_node failure_detector_loop_for_node may be started on a shard before id->ip mapping is available there. Currently the code treats missing mapping as an internal error, but it uses its result for debug output only, so lets relax the code to not assume the mapping is available. Fixes #23407 Closes scylladb/scylladb#24614	2025-07-01 11:33:20 +03:00
Benny Halevy	4bd0845fce	gossiper: make send_gossip_echo cancellable Currently send_gossip_echo has a 22 seconds timeout during which _abort_source is ignored. Mark the verb as cancellable so it can be canceled on shutdown / abort. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-04-30 11:46:10 +03:00
Benny Halevy	fa1c3e86a9	gossiper: add send_echo helper CAll send_gossip_echo using a centralized helper. A following patch will make it abortable. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-04-30 11:45:51 +03:00
Benny Halevy	e06d226d08	gossiper: failure_detector_loop_for_node: ignore abort_requested_exception Aborting the failure detector happens normally when the node shuts down. There's no need to log anything about it, as long as we abort the function cleanly. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-04-30 11:05:24 +03:00
Benny Halevy	83c69642f7	gossiper: failure_detector_loop_for_node: check if abort_requested in loop condition The same as the loop condition in the direct_failure_detector. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-04-30 11:05:24 +03:00
Benny Halevy	cecfb6dfd7	gms: gossiper: use named gate Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-04-12 11:28:48 +03:00
Gleb Natapov	a982db326e	gossiper: send newest entry in a digest message In cases where two entries have the same ip address send information only for the newest one. Now we send both which make the receiver use one of them at random and it may be outdated one (though it should only cause more data than needed to be requested).	2025-04-06 18:39:24 +03:00
Gleb Natapov	8d534ee68e	gossiper: change make_random_gossip_digest to return value instead of modifying passed parameter	2025-04-06 18:39:24 +03:00
Gleb Natapov	6f53611337	gossiper: move force_remove_endpoint to work on host id Since the gossiper works on host ids now it is incorrect to leave this function to work on ip. It makes it impossible to delete outdated entry since the "gossiper.get_host_id(endpoint) != id" check will always be false for such entries (get_host_id() always returns most up -to-date mapping.	2025-04-06 18:39:24 +03:00
Gleb Natapov	df6cd87bcc	gossiper: do not send outdated endpoint in gossiper round Now that the gossiper map is id based there can be a situation where two entries have the same ip, Shadow round should send the newest one in this cased. The patch makes it so. Fixes: #23553	2025-04-06 15:08:03 +03:00
Gleb Natapov	afdfde8300	gossiper: rename get_nodes_with_host_id to get_node_ip Also change it to return std::optional instead of std::set since now there can be only on ip mapped to an id.	2025-03-31 16:50:50 +03:00
Gleb Natapov	28fb84117d	treewide: drop id parameter from gossiper::for_each_endpoint_state We have it in endpoint_state anyway, so no need to pass both.	2025-03-31 16:50:50 +03:00
Gleb Natapov	4609bbbbb2	treewide: move gossiper to index nodes by host id This patch changes gossiper to index nodes by host ids instead of ips. The main data structure that changes is _endpoint_state_map, but this results in a lot of changes since everything that uses the map directly or indirectly has to be changed. The big victim of this outside of the gossiper itself is topology over gossiper code. It works on IPs and assumes the gossiper does the same and both need to be changed together. Changes to other subsystems are much smaller since they already mostly work on host ids anyway.	2025-03-31 16:50:50 +03:00
Gleb Natapov	19ac05b0ba	gossiper: drop ip from replicate function parameters We have it in endpoint_state now, so no need to pass both.	2025-03-31 16:50:50 +03:00
Gleb Natapov	c5b8429bec	gossiper: drop ip from apply_new_states parameters We have it in endpoint_state now, so no need to pass both.	2025-03-31 16:50:50 +03:00
Gleb Natapov	6da5f541a2	gossiper: drop address from handle_major_state_change parameter list We have it in endpoint_state now, so no need to pass both.	2025-03-31 16:50:50 +03:00
Gleb Natapov	5e06bf76e0	gossiper: pass rpc::client_info to gossiper_shutdown verb handler It will be needed later to obtain host id of the peer.	2025-03-31 16:50:50 +03:00
Gleb Natapov	704580b197	gossiper: add try_get_host_id function The function returns unengaged std::optional if id is not found instead of throwing like get_host_id does.	2025-03-31 16:50:45 +03:00
Gleb Natapov	6999b474a1	gossiper: add ip to endpoint_state Store endpoint's IP in the endpoint state. Currently it is stored as a key in gossiper's endpoint map, but we are going to change that. The new filed is not serialized when endpoint state is sent over rpc, so it is set by the rpc handler from the value in the map that is in the rpc message. This map will not be changed to be host id based to not break interoperability.	2025-03-31 15:42:08 +03:00
Gleb Natapov	e5cc3b75f8	gossiper: drop template from wait_alive_helper function Move ip to id translation to the caller.	2025-03-31 15:42:07 +03:00
Gleb Natapov	0dd86b4f1d	gossiper: move get_supported_features and its users to host id	2025-03-31 15:42:07 +03:00
Gleb Natapov	a581a99dbf	gossiper: move _pending_mark_alive_endpoints to host id Index _pending_mark_alive_endpoints map by host id instead of ip	2025-03-31 15:25:39 +03:00
Gleb Natapov	555149c153	gossiper: do not allow to assassinate endpoint in raft topology mode It does nothing but harm in raft topology mode.	2025-03-31 15:25:39 +03:00
Gleb Natapov	4cc1c10035	gossiper: fix indentation after previous patch	2025-03-31 15:25:39 +03:00
Gleb Natapov	e8b7aaa0d4	gossiper: do not allow to assassinate non existing endpoint We assume that all endpoint states have HOST_ID set or the host id is available locally, but the assassinate code injects a state without HOST_ID for not existing endpoint violating this assumption.	2025-03-31 15:25:39 +03:00
Patryk Jędrzejczak	9970c1fcc3	gossip: allow group 0 ID mismatch in the Raft-based recovery procedure This patch ensures that members of the new group 0 can gossip with members of the old group 0 during rolling restart in the Raft-based recovery procedure. Without this change, restarted nodes (members of the new group 0) wouldn't be marked as UP by other nodes (members of the old group 0), which would decrease availability.	2025-03-14 13:53:05 +01:00
Gleb Natapov	57f2b6d825	gossiper: drop unneeded code host_id is already available at this point.	2025-03-11 12:09:22 +02:00
Gleb Natapov	cca228265e	gossiper: move _expire_time_endpoint_map to host_id Index _expire_time_endpoint_map map by host id instead of ip	2025-03-11 12:09:22 +02:00
Gleb Natapov	c45b50bbe6	gossiper: move _just_removed_endpoints to host id Index _just_removed_endpoints map by host id instead of ip	2025-03-11 12:09:22 +02:00
Gleb Natapov	22739bb39a	gossiper: drop unused get_msg_addr function	2025-03-11 12:09:22 +02:00
Gleb Natapov	499eb4d17f	treewide: pass host id to endpoint state change subscribers	2025-03-11 12:09:22 +02:00
Gleb Natapov	eb59205caf	gossiper: drop deprecated unsafe_assassinate_endpoint operation It was always deprecated.	2025-03-11 12:09:21 +02:00
Gleb Natapov	7dcffda6bd	gossiper: drop ip address from handle_echo_msg and simplify code since host_id is now mandatory	2025-03-11 12:09:21 +02:00
Gleb Natapov	8425c26462	gossiper: start using host ids to send messages earlier Send digest ack and ack2 by host ids as well now since the id->ip mapping is available after receiving digest syn. It allows to convert more code to host id here.	2025-03-11 12:09:21 +02:00
Gleb Natapov	0e3dcb7954	treewide: move everyone to use host id based gossiper::is_alive and drop ip based one	2025-03-11 12:09:21 +02:00
Gleb Natapov	e47f251178	gossiper: move _live_endpoints and _unreachable_endpoints endpoint to host_id Index live and dead endpoints by host id. It also allows to simplify some code that does a translation.	2025-03-11 12:09:21 +02:00
Gleb Natapov	6f05608b5e	gossiper: chunk vector using std::views::chunk instead of explicitly code it	2025-03-11 12:09:21 +02:00
Gleb Natapov	3734afe8a5	gossiper: send shutdown notification by host id	2025-03-11 12:09:21 +02:00
Gleb Natapov	ee59baf6fc	gossiper: drop old shadow round code It is no longer used. It was replaced with explicit GOSSIP_GET_ENDPOINT_STATES verb in `cd7d64f588` which is in scylla-4.3.0	2025-03-11 12:09:20 +02:00
Gleb Natapov	f1a82c1d01	gossiper: drop unused get_endpoint_states function	2025-03-11 12:09:20 +02:00
Gleb Natapov	c4a0fbae16	gossiper: check id match inside force_remove_endpoint Before calling force_remove_endpoint (which works on ip) the code checks that the ip maps to the correct id (not not remove a new node that inherited this ip by mistake). Move the check to the function itself.	2025-03-11 12:09:20 +02:00
Gleb Natapov	4420ddaf86	gossiper: move is_gossip_only_member and its users to work on host id	2025-03-11 12:09:20 +02:00
Gleb Natapov	2746d391af	gossiper: do not ping outdated address A node may change its IP but some other node in the cluster may still try to ping it using an old IP because it may receive an outdated gossiper entry with the old IP. Do not send echo message to the old IP. It will cause a misusing UP message with old address to be printed.	2025-03-11 12:09:20 +02:00
Gleb Natapov	6952f62869	gossiper: drop unused field from loaded_endpoint_state	2025-03-11 12:09:20 +02:00
Amnon Heiman	1b64fa2283	gms/gossiper.cc: label metrics with basic_level The following metrics will be marked with basic_level label: scylla_gossip_heart_beat scylla_gossip_live scylla_gossip_unreachable Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2025-03-03 16:58:39 +02:00
Gleb Natapov	914c9f1711	treewide: include build_mode.hh for SCYLLA_BUILD_MODE_RELEASE where it is missing Fixes: #22914 Closes scylladb/scylladb#22915	2025-02-20 10:50:04 +03:00
Gleb Natapov	0ec9f7de64	gossiper: drop get_unreachable_token_owners functions It is used by truncate code only and even there it only check if the returned set is not empty. Check for dead token owners in the truncation code directly.	2025-01-16 16:37:07 +02:00
Gleb Natapov	36ccc897e8	gossiper: change get_live_members and all its users to work on host ids	2025-01-16 16:37:06 +02:00

1 2 3 4 5 ...

772 Commits