scylladb/locator at 8d1d206aff1e68cfa4bc5cd6dd52af4e034aebce - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 01:20:39 +00:00

Files

History

Kamil Braun 57e810c852 Merge 'Serialize repair with tablet migration' from Tomasz Grabiec

We want to exclude repair with tablet migrations to avoid races
between repair reads and writes with replica movement. Repair is not
prepared to handle topology transitions in the middle.

One reason why it's not safe is that repair may successfully write to
a leaving replica post streaming phase and consider all replicas to be
repaired, but in fact they are not, the new replica would not be
repaired.

Other kinds of races could result in repair failures. If repair writes
to a leaving replica which was already cleaned up, such writes will
fail, causing repair to fail.

Excluding works by keeping effective_replication_map_ptr in a version
which doesn't have table's tablets in transitions. That prevents later
transitions from starting because topology coordinator's barrier will
wait for that erm before moving to a stage later than
allow_write_both_read_old, so before any requests start using the new
topology. Also, if transitions are already running, repair waits for
them to finish.

A blocked tablet migration (e.g. due to down node) will block repair,
whereas before it would fail. Once admin resolves the cause of blocked migration,
repair will continue.

Fixes #17658.
Fixes #18561.

Closes scylladb/scylladb#18641

* github.com:scylladb/scylladb:
  test: pylib: Do not block async reactor while removing directories
  repair: Exclude tablet migrations with tablet repair
  repair_service: Propagate topology_state_machine to repair_service
  main, storage_service: Move topology_state_machine outside storage_service
  storage_srvice, toplogy: Extract topology_state_machine::await_quiesced()
  tablet_scheduler: Make disabling of balancing interrupt shuffle mode
  tablet_scheduler: Log whether balancing is considered as enabled

2024-06-06 11:27:03 +02:00

..

abstract_replication_strategy.cc

treewide: include fmt/ranges.h and/or fmt/std.h

2024-04-19 22:56:16 +08:00

abstract_replication_strategy.hh

replication_strategy: Remove unused factory_key::to_sstring() declaration

2024-05-30 18:03:51 +03:00

azure_snitch.cc

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

azure_snitch.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

CMakeLists.txt

build: cmake: add check-header target

2023-11-13 10:27:06 +02:00

ec2_multi_region_snitch.cc

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

ec2_multi_region_snitch.hh

endpoint_state subscriptions: batch on_change notification

2023-12-31 18:37:34 +02:00

ec2_snitch.cc

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

ec2_snitch.hh

locator::ec2_snitch: change retry logic to exponential backoff

2023-12-25 18:17:23 +02:00

everywhere_replication_strategy.cc

locator: Wrap replication_strategy_config_options into replication_strategy_params

2023-12-25 15:53:03 +03:00

everywhere_replication_strategy.hh

locator: Wrap replication_strategy_config_options into replication_strategy_params

2023-12-25 15:53:03 +03:00

gce_snitch.cc

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

gce_snitch.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

gossiping_property_file_snitch.cc

treewide: replace seastar::future::get0() with seastar::future::get()

2024-02-02 22:12:57 +08:00

gossiping_property_file_snitch.hh

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

host_id.hh

everywhere: define locator::host_id as a strong tagged_uuid type

2022-08-12 06:01:44 +03:00

load_sketch.hh

test: perf: Add test for tablet load balancer effectiveness

2024-06-02 14:23:00 +02:00

local_strategy.cc

locator: Wrap replication_strategy_config_options into replication_strategy_params

2023-12-25 15:53:03 +03:00

local_strategy.hh

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

network_topology_strategy.cc

treewide: include fmt/ranges.h and/or fmt/std.h

2024-04-19 22:56:16 +08:00

network_topology_strategy.hh

network_topology_strategy: reallocate_tablets: support deallocation via rf change

2024-03-27 12:06:24 +02:00

production_snitch_base.cc

snitch: Remove production_snitch_base::_prop_file_contents

2024-05-30 13:55:14 +03:00

production_snitch_base.hh

snitch: Remove production_snitch_base::_prop_file_contents

2024-05-30 13:55:14 +03:00

rack_inferring_snitch.cc

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

rack_inferring_snitch.hh

snitch: pass broadcast_address in snitch_config

2023-12-05 08:42:49 +02:00

simple_snitch.cc

snitch: Make config-based construction of all drivers

2022-04-11 14:38:34 +03:00

simple_snitch.hh

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

simple_strategy.cc

replication_strategy: Do not convert string RF into int twise

2024-02-02 14:38:17 +03:00

simple_strategy.hh

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

snitch_base.cc

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

snitch_base.hh

locator: do not include unused headers

2024-01-23 09:12:23 +02:00

tablet_metadata_guard.hh

token_metadata: drop the template

2023-12-12 23:19:54 +04:00

tablet_replication_strategy.hh

network_topology_strategy: allocate_tablets_for_new_table: do not rely on token ownership

2024-03-27 12:06:21 +02:00

tablet_sharder.hh

dht: Deprecate old sharder API: shard_of/next_shard/token_for_next_shard

2024-05-16 00:28:47 +02:00

tablets.cc

Merge 'tablets: Filter-out left nodes in get_natural_endpoints()' from Tomasz Grabiec

2024-06-06 11:23:27 +02:00

tablets.hh

repair: Exclude tablet migrations with tablet repair

2024-06-05 16:11:22 +02:00

token_metadata_fwd.hh

token_metadata: drop the template

2023-12-12 23:19:54 +04:00

token_metadata.cc

locator: host_id_or_endpoint: keep value as variant

2024-04-14 15:25:50 +03:00

token_metadata.hh

locator: host_id_or_endpoint: keep value as variant

2024-04-14 15:25:50 +03:00

token_range_splitter.hh

token_metadata: drop the template

2023-12-12 23:19:54 +04:00

topology.cc

locator: Remove unused lshift-operator for topology

2024-05-21 09:46:30 +03:00

topology.hh

locator: Remove unused lshift-operator for topology

2024-05-21 09:46:30 +03:00

types.hh

dc_rack_fn: make it non-template

2023-12-12 23:19:54 +04:00

util.cc

range.hh: retire

2024-02-21 00:24:25 +02:00

util.hh

storage_service, locator: extract describe_ring()

2022-12-10 12:51:05 +01:00