scylladb/service at 5acfcd8ef5deb47b69361bcb4005d5d0d8a7d1fd - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-29 11:10:40 +00:00

Files

History

Kamil Braun 5acfcd8ef5 Merge 'raft: send group0 RPCs only if the destination group0 server is seen as alive' from Piotr Dulikowski

In topology on raft mode, the events "new node starts its group0 server"
and "new node is added to group0 configuration" are not synchronized
with each other. Therefore it might happen that the cluster starts
sending commands to the new node before the node starts its server. This
might lead to harmless, but ugly messages like:

    INFO  2023-09-27 15:42:42,611 [shard 0:stat] rpc - client
    127.0.0.1:56352 msg_id 2:  exception "Raft group
    b8542540-5d3b-11ee-99b8-1052801f2975 not found" in no_wait handler
    ignored

In order to solve this, the failure detector verb is extended to report
information about whether group0 is alive. The raft rpc layer will drop
messages to nodes whose group0 is not seen as alive.

Tested by adding a delay before group0 is started on the joining node,
running all topology tests and grepping for the aforementioned log
messages.

Fixes: scylladb/scylladb#15853
Fixes: scylladb/scylladb#15167

Closes scylladb/scylladb#16071

* github.com:scylladb/scylladb:
  raft: rpc: introduce destination_not_alive_error
  raft: rpc: drop RPCs if the destination is not alive
  raft: pass raft::failure_detector to raft_rpc
  raft: transfer information about group0 liveness in direct_fd_ping
  raft: add server::is_alive

2023-11-24 10:34:05 +01:00

..

broadcast_tables/experimental

raft: add description argument to add_entry_unguarded

2023-07-07 13:11:44 +02:00

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

table: rename cache_truncation_record -> set_truncation_time

2023-10-03 17:11:35 +04:00

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

Merge 'raft: send group0 RPCs only if the destination group0 server is seen as alive' from Piotr Dulikowski

2023-11-24 10:34:05 +01:00

cache_hitrate_calculator.hh

Introduce schema/ module

2023-02-15 11:01:50 +02:00

client_state.cc

client_state: co-routinise has_column_family_access function

2023-06-22 15:26:20 +03:00

client_state.hh

cql3: move validation::validate_column_family from client_state::has_column_family_access

2023-06-22 13:57:36 +03:00

CMakeLists.txt

build: cmake: add check-header target

2023-11-13 10:27:06 +02:00

endpoint_lifecycle_subscriber.hh

…

forward_service.cc

forward_service: Remove .shutdown() method

2023-09-26 10:39:22 +03:00

forward_service.hh

forward_service: Remove .shutdown() method

2023-09-26 10:39:22 +03:00

load_broadcaster.hh

gms: pass endpoint_state_ptr to endpoint_state change subscribers

2023-08-31 09:35:15 +03:00

load_meter.hh

load-meter: Remove unused get_load_string

2023-05-15 09:21:08 +03:00

memory_limiter.hh

…

migration_listener.hh

migration_listener: add parameter to on_before_create_column_family

2023-10-31 12:08:03 +01:00

migration_manager.cc

service/migration_manager: only reload schema when enabling disabled features

2023-11-22 17:44:07 +02:00

migration_manager.hh

migration_manager: add new prepare_new_column_family_announcement

2023-10-31 12:08:03 +01:00

misc_services.cc

Merge 'Gossiper endpoint locking' from Benny Halevy

2023-08-02 13:50:08 +02:00

query_state.hh

treewide: reduce include of cql_statement.hh

2023-09-08 13:23:50 +03:00

storage_proxy_stats.hh

storage_proxy: Make split_stats resilient to being called from different scheduling group

2023-06-21 10:08:27 +03:00

storage_proxy.cc

gms,service: add a feature to protect the usage of allow_mutation_read_page_without_live_row

2023-11-20 13:03:55 +01:00

storage_proxy.hh

db: view: run local materialized view mutations on a separate smp service group

2023-10-29 18:30:32 +02:00

storage_service.cc

Merge 'raft topology: reject replace if the node being replaced is not dead' from Patryk Jędrzejczak

2023-11-23 10:31:59 +01:00

storage_service.hh

storage_service: Drop (un)init_messaging_service_part() pair

2023-11-20 13:59:08 +03:00

tablet_allocator.cc

tablet_allocator: update on_before_create_column_family

2023-10-31 12:08:03 +01:00

tablet_allocator.hh

tablets, raft topology: Add support for decommission with tablets

2023-09-14 13:05:49 +02:00

topology_state_machine.cc

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

topology_state_machine.hh

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

view_update_backlog_broker.hh

gms: pass endpoint_state_ptr to endpoint_state change subscribers

2023-08-31 09:35:15 +03:00