Before this patch, the load balancer equalized tablet count per shard, so it achieved balance only under two assumptions: 1) all tablets have the same size, 2) all shards have the same capacity. This can cause utilization imbalance when shards have different capacities, which happens in heterogeneous clusters with different instance types. One cause of capacity differences is that larger instances run with fewer shards, because some vCPUs are dedicated to IRQ handling; each remaining shard then has more disk capacity and more CPU power.

After this patch, the load balancer equalizes per-shard storage utilization, so it no longer assumes that shards have equal capacity. It still assumes that all tablets have equal size, so this is a middle step towards fully size-aware balancing.

One consequence is that, to be able to balance, the load balancer needs to know every node's capacity. This is collected with the same RPC that collects load_stats for the average tablet size. That is not a significant setback, because migrations cannot proceed anyway when nodes are down, due to barriers. We could make intra-node migration scheduling work without capacity information, but given the above it would be pointless, so it is not implemented.

Also, the per-shard tablet count goal is still the same for all nodes in the cluster, so nodes with less capacity will be below the limit and nodes with more capacity will be slightly above it. This shouldn't be a significant problem in practice; we could compensate by increasing the limit.
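To make the difference concrete, here is a minimal sketch (not ScyllaDB's actual implementation; all names and numbers are hypothetical) contrasting count-based balancing with utilization-based balancing on two shards of unequal disk capacity, under the patch's remaining assumption that every tablet has the same size:

```python
# Hypothetical illustration of capacity-aware balancing; not ScyllaDB code.

AVG_TABLET_SIZE = 1  # arbitrary units; the balancer still assumes equal tablet size


def utilization(tablet_count, capacity):
    """Fraction of a shard's disk capacity consumed by its tablets."""
    return tablet_count * AVG_TABLET_SIZE / capacity


def balance_by_utilization(total_tablets, capacities):
    """Distribute tablets proportionally to capacity, so every shard
    ends up with (approximately) equal storage utilization."""
    total_capacity = sum(capacities)
    counts = [round(total_tablets * c / total_capacity) for c in capacities]
    counts[-1] += total_tablets - sum(counts)  # fix rounding drift
    return counts


# Two shards; the second has twice the disk capacity (e.g. a node with
# fewer shards because vCPUs were dedicated to IRQ handling).
capacities = [100, 200]
total = 90

# Old behavior: equal tablet count per shard -> unequal utilization.
old = [total // 2, total // 2]                   # [45, 45]
# utilization: 45/100 = 0.45 vs 45/200 = 0.225

# New behavior: equalize utilization instead.
new = balance_by_utilization(total, capacities)  # [30, 60]
# utilization: 30/100 = 0.30 == 60/200 = 0.30
```

The proportional split is the idealized goal; the real balancer reaches it incrementally through tablet migrations rather than by recomputing a global assignment.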
Refs #23042
Closes scylladb/scylladb#23079

* github.com:scylladb/scylladb:
  tablets: Make load balancing capacity-aware
  topology_coordinator: Fix confusing log message
  topology_coordinator: Refresh load stats after adding a new node
  topology_coordinator: Allow capacity stats to be refreshed with some nodes down
  topology_coordinator: Refactor load status refreshing so that it can be triggered from multiple places
  test: boost: tablets_test: Always provide capacity in load_stats
  test: perf_load_balancing: Set node capacity
  test: perf_load_balancing: Convert to topology_builder
  config, disk_space_monitor: Allow overriding capacity via config
  storage_service, tablets: Collect per-node capacity in load_stats