scylladb/service at f261b4594de8c9be42d578ffdd2c52f401ba8f05 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-23 16:22:15 +00:00

Files

History

Petr Gusev f261b4594d ip_address_updater: call raft_topology_update_ip even if ip hasn't changed

Previously, the prev_ip check caused problems for bootstrapping nodes.
Suppose a bootstrapping node A appears in the system.peers table of
some other node B. Its record has only ID and IP of the node A, due to
the special handling of bootstrapping nodes in raft_topology_update_ip.
Suppose node B gets temporarily isolated from the topology coordinator.
The topology coordinator fences out node B and succesfully finishes
bootstrapping of the node A. Later, when the connectivity is restored,
topology_state_load runs on the node B, node A is already in
normal state, but the gossiper on B might not yet have any state for
it yet. In this case, raft_topology_update_ip would not update
system.peers because the gossiper state is missing. Subsequently,
on_join/on_restart/on_alive events would skip updates because the IP
in gossiper matches the IP for that node in system.peers.

Removing the check avoids this issue, with negligible overhead:
* on_join/on_restart/on_alive happen only once in a
node’s lifetime
* topology_state_load already updates all nodes each time it runs.

This problem was found by a fencing test, which crashed a
node while another node was going through the bootstrapping
process. After restart the node saw that other node already
is in normal state, since the topology coordinator fenced out
this node and managed to finish the bootstrapping process
successfully. This test will be provided in a separate
fencing-for-paxos PR.

Closes scylladb/scylladb#25596

2025-08-21 10:02:06 +02:00

..

broadcast_tables/experimental

service: do not include unused headers

2025-03-20 11:18:16 +08:00

direct_failure_detector

Move direct_failure_detector from root to service/

2025-04-08 13:03:24 +03:00

Merge 'LWT: enable for tablet-based tables' from Petr Gusev

2025-07-28 13:19:23 +03:00

test_tablets_lwt: add test_error_message_for_timeout_due_to_uncertainty

2025-08-13 14:03:57 +02:00

storage_service, group0_state_machine: move SL cache update from topology_state_load() to load_snapshot()

2025-08-01 13:41:08 +02:00

Merge 'row_cache: add memtable overlap checks elision optimization for tombstone gc' from Botond Dénes

2025-08-11 23:54:59 +02:00

address_map.hh

service: do not include unused headers

2025-03-20 11:18:16 +08:00

cache_hitrate_calculator.hh

…

cas_shard.hh

storage_proxy: add cas_shard class

2025-06-30 10:33:17 +02:00

client_state.cc

check_internal_table_permissions: handle Paxos state tables

2025-07-24 19:48:08 +02:00

client_state.hh

check_internal_table_permissions: handle Paxos state tables

2025-07-24 19:48:08 +02:00

CMakeLists.txt

vector_store_client: implement initial vector_store_client service

2025-07-08 16:29:55 +02:00

endpoint_lifecycle_subscriber.hh

treewide: pass host id to endpoint_lifecycle_subscriber

2025-03-11 12:09:22 +02:00

load_broadcaster.hh

load_meter: move to host id

2025-03-11 12:09:22 +02:00

load_meter.hh

service: do not include unused headers

2025-03-20 11:18:16 +08:00

maintenance_mode.hh

…

mapreduce_service.cc

mapreduce: add shard_id_hint to mapreduce request

2025-06-25 19:23:07 +02:00

mapreduce_service.hh

mapreduce: add tablet-aware dispatching algorithm

2025-06-25 10:18:02 +02:00

memory_limiter.hh

…

migration_listener.hh

Merge 'Atomic in-memory schema changes application' from Marcin Maliszkiewicz

2025-07-13 20:47:55 +03:00

migration_manager.cc

migration manager: do not use group0 on non zero shard

2025-07-28 14:10:01 +02:00

migration_manager.hh

migration_manager: add timeout to start_group0_operation and announce

2025-07-24 16:39:50 +02:00

misc_services.cc

locator: abstract_replication_strategy: define is_local

2025-08-06 13:34:23 +03:00

query_state.hh

…

session.cc

service: do not include unused headers

2025-03-20 11:18:16 +08:00

session.hh

service: session: use named gate

2025-04-12 11:28:49 +03:00

state_id.hh

…

storage_proxy_fwd.hh

storage_proxy: introduce node_local_only flag

2025-07-24 19:48:08 +02:00

storage_proxy_stats.hh

service: do not include unused headers

2025-03-20 11:18:16 +08:00

storage_proxy.cc

Merge 'storage_proxy: node_local_only: always use my_host_id' from Petr Gusev

2025-08-20 12:11:44 +03:00

storage_proxy.hh

storage_proxy: node_local_only: always use my_host_id

2025-08-19 16:11:49 +02:00

storage_service.cc

ip_address_updater: call raft_topology_update_ip even if ip hasn't changed

2025-08-21 10:02:06 +02:00

storage_service.hh

storage_service: use naked e_r_m pointers

2025-08-06 13:34:23 +03:00

tablet_allocator_fwd.hh

…

tablet_allocator.cc

compaction: Add tablet incremental repair support

2025-08-18 11:01:21 +08:00

tablet_allocator.hh

service: tablets: Keep load_stats inside tablet_allocator

2025-04-09 20:21:51 +02:00

tablet_operation.hh

…

task_manager_module.cc

tablets: replace all_tables method

2025-07-01 13:20:18 +03:00

task_manager_module.hh

tasks: replace ip with host_id in task_identity

2025-02-05 10:11:52 +01:00

topology_coordinator.cc

topology_coordinator: Make rpc::remote_verb_error to warning level

2025-08-18 11:01:22 +08:00

topology_coordinator.hh

treewide: avoid including gms/feature_service.hh from headers

2025-08-20 10:30:27 +03:00

topology_guard.hh

service: do not include unused headers

2025-03-20 11:18:16 +08:00

topology_mutation.cc

topology coordinator: Implement global topology request queue

2025-06-11 11:29:33 +03:00

topology_mutation.hh

topology coordinator: Implement global topology request queue

2025-06-11 11:29:33 +03:00

topology_state_machine.cc

topology request: make it possible to hold global request types in request_type field

2025-06-09 13:38:49 +03:00

topology_state_machine.hh

topology coordinator: Implement global topology request queue

2025-06-11 11:29:33 +03:00

vector_store_client.cc

service/vector_store_client: Add live configuration update support

2025-08-12 08:12:53 +02:00

vector_store_client.hh

service/vector_store_client: Add live configuration update support

2025-08-12 08:12:53 +02:00

view_update_backlog_broker.hh

treewide: pass host id to endpoint state change subscribers

2025-03-11 12:09:22 +02:00