scylladb

Author	SHA1	Message	Date
copilot-swe-agent[bot]	9e806cb3f7	Fix critical bugs and issues found in alternator code review Co-authored-by: nyh <584227+nyh@users.noreply.github.com>	2026-01-29 22:54:57 +00:00
copilot-swe-agent[bot]	f267af38bd	Initial plan	2026-01-29 22:49:31 +00:00
Piotr Dulikowski	f150629948	Merge 'auth: switch find_record to use cache' from Marcin Maliszkiewicz This series optimizes role lookup by moving find_record into standard_role_manager and switching it to use the auth cache. This allows reverting can_login to its original simpler form, ensuring hot paths are properly cached while maintaining consistency via group0_guard. Backport: no, it's not a bug fix. Closes scylladb/scylladb#28329 * github.com:scylladb/scylladb: auth: bring back previous version of standard_role_manager::can_login auth: switch find_record to use cache auth: make find_record and callers standard_role_manager members	2026-01-29 17:25:42 +01:00
Avi Kivity	7984925059	Merge 'Use coroutine::switch_to() in table::try_flush_memtable_to_sstable' from Pavel Emelyanov The method was coroutinized by `6df07f7ff7`. Back then thecoroutine::switch_to() wasn't available, and the code used with_scheduling_group() to call coroutinized lambdas. Those lambdas were implemented as on-stack variables to solve the capture list lifetime problems. As a result, the code looks like ``` auto flush = [] { ... // do the flushing auto post_flush = [] { ... // do the post-flushing } co_return co_await with_scheduling_group(group_b, post_flush); }; co_return co_await with_scheduling_group(group_a, flush); ``` which is a bit clumsy. Now we have switch_to() and can make the code flow of this method more readable, like this ``` co_await switch_to(group_a); ... // do the flushing co_await switch_to(group_b); ... // do the post-flushing ``` Code cleanup, not backporting Closes scylladb/scylladb#28430 * github.com:scylladb/scylladb: table: Fix indentation after previous patch table: Use coroutine::switch_to() in try_flush_memtable_to_sstable()	2026-01-29 18:12:35 +02:00
Nadav Har'El	a6fdda86b5	Merge 'test: test_alternator_proxy_protocol: fix race between node startup and test start' from Avi Kivity test_alternator_proxy_protocol starts a node and connects via the alternator ports. Starting a node, by default, waits until the CQL ports are up. This does not guarantee that the alternator ports are up (they will be up very soon after this), so there is a short window where a connection to the alternator ports will fail. Fix by adding a ServerUpState=SERVING mode, which waits for the node to report to its supervisor (systemd, which we are pretending to be) that its ports are open. The test is then adjusted to request this new ServerUpState. Fixes #28210 Fixes #28211 Flaky tests are only in master and branch-2026.1, so backporting there. Closes scylladb/scylladb#28291 * github.com:scylladb/scylladb: test: test_alternator_proxy_protocol: wait for the node to report itself as serving test: cluster_manager: add ability to wait for supervisor STATUS=serving	2026-01-29 16:18:26 +02:00
Pavel Emelyanov	56e212ea8d	table: Fix indentation after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-29 15:02:33 +03:00
Pavel Emelyanov	258a1a03e3	table: Use coroutine::switch_to() in try_flush_memtable_to_sstable() It allows dropping the local lambdas passed into with_scheduling_group() calls. Overall the code flow becomes more readable. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-29 15:01:27 +03:00
Botond Dénes	3158e9b017	doc: reorganize properties in config.cc and config.hh This commit moves the "Ungrouped properties" category to the end of the properties list. The properties are now published in the documentation, and it doesn't look good if the list starts with ungrouped properties. This patch was taken over from Anna Stuchlik <anna.stuchlik@scylladb.com>. Closes scylladb/scylladb#28343	2026-01-29 11:27:42 +03:00
Pavel Emelyanov	937d008d3c	Merge 'Clean up partition_snapshot_reader' from Botond Dénes Move to `replica/`, drop `flat` from name and drop unused usages as well as unused includes. Code cleanup, no backport Closes scylladb/scylladb#28353 * github.com:scylladb/scylladb: replica/partition_snapshot_reader: remove unused includes partition_snapshot_reader: remove "flat" from name mv partition_snapshot_reader.hh -> replica/	2026-01-29 11:22:15 +03:00
Botond Dénes	f6d7f606aa	memtable_test: disable flushing_rate_is_reduced_if_compaction_doesnt_keep_up for debug This test case was observed to take over 2 minutes to run on CI machines, contributing to already bloated CI run times. Disable this test in debug mode. This test checks for memtable flush being slowed down when compaction can't keep up. So this test needs to overwhelm the CPU by definition. On the other hand, this is not a correctness test, there are such tests for the memtable and compaction already, so it is not critical to run this in debug mode, it is not expected to catch any use-after-free and such. Closes scylladb/scylladb#28407	2026-01-29 11:13:22 +03:00
Jakub Smolar	e978cc2a80	scylla_gdb: use persistent GDB - decrease test execution time This commit replaces the previous approach of running pytest inside GDB’s Python interpreter. Instead, tests are executed by driving a persistent GDB process externally using pexpect. - pexpect: Python library for controlling interactive programs (used here to send commands to GDB and capture its output) - persistent GDB: keep one GDB session alive across multiple tests instead of starting a new process for each test Tests can now be executed via `./test.py gdb` or with `pytest test/scylla_gdb`. This improves performance and makes failures easier to debug since pytest no longer runs hidden inside GDB subprocesses. Closes scylladb/scylladb#24804	2026-01-29 10:01:39 +02:00
Avi Kivity	347c69b7e2	build: add clang-tools-extra (for clang-include-cleaner) to frozen toolchain clang-include-cleaner is used in the iwyu.yaml github workflow (include- what-you-use). Add it to the frozen toolchain so it can be made part of the regular build process. The corresponding install command is removed from iwyu.yaml. Regenerated frozen toolchain with optimized clang from https://devpkg.scylladb.com/clang/clang-21.1.8-Fedora-43-aarch64.tar.gz https://devpkg.scylladb.com/clang/clang-21.1.8-Fedora-43-x86_64.tar.gz Closes scylladb/scylladb#28413	2026-01-29 08:44:49 +02:00
Botond Dénes	482ffe06fd	Merge 'Improve load shedding on the replica side' from Łukasz Paszkowski When reads arrive, they have to wait for admission on the reader concurrency semaphore. If the node is overloaded, the reads will be queued. They can time out while in the queue, but will not time out once admitted. Once the shard is sufficiently loaded, it is possible that most queued reads will time out, because the average time it takes to for a queued read to be admitted is around that of the timeout. If a read times out, any work we already did, or are about to do on it is wasted effort. Therefore, the patch tries to prevent it by checking if an admitted read has a chance to complete in time and abort it if not. It uses the following criteria: if read's remaining time <= read's timeout when arrived to the semaphore * live updateable preemptive_abort_factor; the read is rejected and the next one from the wait list is considered. Fixes https://github.com/scylladb/scylladb/issues/14909 Fixes: SCYLLADB-353 Backport is not needed. Better to first observe its impact. Closes scylladb/scylladb#21649 * github.com:scylladb/scylladb: reader_concurrency_semaphore: Check during admission if read may timeout permit_reader::impl: Replace break with return after evicting inactive permit on timeout reader_concurrency_semaphore: Add preemptive_abort_factor to constructors config: Add parameters to control reads' preemptive_abort_factor permit_reader: Add a new state: preemptive_aborted reader_concurrency_semaphore: validate waiters counter when dequeueing a waiting permit reader_concurrency_semaphore: Remove cpu_concurrency's default value	2026-01-29 08:27:22 +02:00
Botond Dénes	a8767f36da	Merge 'Improve load balancer logging and other minor cleanups' from Tomasz Grabiec Contains various improvements to tablet load balancer. Batched together to save on the bill for CI. Most notably: - Make plan summary more concise, and print info only about present elements. - Print rack name in addition to DC name when making a per-rack plan - Print "Not possible to achieve balance" only when this is the final plan with no active migrations - Print per-node stats when "Not possible to achieve balance" is printed - amortize metrics lookup cost - avoid spamming logs with per-node "Node {} does not have complete tablet stats, ignoring" Backport to 2026.1: since the changes enhance debuggability and are relatively low risk Fixes #28423 Fixes #28422 Closes scylladb/scylladb#28337 * github.com:scylladb/scylladb: tablets: tablet_allocator.cc: Convert tabs to spaces tablets: load_balancer: Warn about incomplete stats once for all offending nodes tablets: load_balancer: Improve node stats printout tablets: load_balancer: Warn about imbalance only when there are no more active migrations tablets: load_balancer: Extract print_node_stats() tablet: load_balancer: Use empty() instead of size() where applicable tablets: Fix redundancy in migration_plan::empty() tablets: Cache pointer to stats during plan-making tablets: load_balancer: Print rack in addition to DC when giving context tablets: load_balancer: Make plan summary concise tablets: load_balancer: Move "tablet_migration_bypass" injection point to make_plan()	2026-01-29 08:25:17 +02:00
Piotr Dulikowski	ec6a2661de	Merge 'Keep view_builder background fiber in maintenance scheduling group' from Pavel Emelyanov In fact, it's partially there already. When view_builder::start() is called is first calls initialization code (the start_in_background() method), then kicks do_build_step() that runs a background fiber to perform build steps. The starting code inherits scheduling group from main(). And the step fiber code needs to run itself in a maintenance scheduling group, so it explicitly grabs one via database->db_config. This PR mainly gets rid of the call to database::get_streaming_scheduling_group() from do_build_step() as preparation to splitting the streaming scheduling group into parts (see SCYLLADB-351). To make it happen the do_build_step() is patched to inherit its scheduling group from view_builder::start() and the start() itself is called by main from maintenance scheduling group (like for other view building services). New feature (nested scheduling group), not backporting Closes scylladb/scylladb#28386 * github.com:scylladb/scylladb: view_builder: Start background in maintenance group view_builder: Wake-up step fiber with condition variable	2026-01-28 20:49:19 +01:00
Pavel Emelyanov	cb1d05d65a	streaming: Get streaming sched group from debug:: namespace In a lambda returned from make_streaming_consumer() there's a check for current scheudling group being streaming one. It came from #17090 where streaming code was launched in wrong sched group thus affecting user groups in a bad way. The check is nice and useful, but it abuses replica::database by getting unrelated information from it. To preserve the check and to stop using database as provider of configs, keep the streaming scheduling group handle in the debug namespace. This emphasises that this global variable is purely for debugging purposes. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#28410	2026-01-28 19:14:59 +02:00
Marcin Maliszkiewicz	5d4e2ec522	Merge 'docs: add documentation for automatic repair' from Botond Dénes Explain what automatic repair is and how to configure it. While at it, improve the existing repair documentation a bit. Fixes: SCYLLADB-130 This PR missed the 2026.1 branch date, so it needs backport to 2026.1, where the auto repair feature debuts. Closes scylladb/scylladb#28199 * github.com:scylladb/scylladb: docs: add feature page for automatic repair docs: inter-link incremental-repair and repair documents docs: incremental-repair: fix curl example	2026-01-28 17:46:53 +01:00
Nadav Har'El	1454228a05	test/cqlpy: fix "assertion rewriting" in translated Cassandra tests One of the best features of the pytest framework is "assertion rewriting": If your test does for example "assert a + 1 == b", the assertion is "rewritten" so that if it fails it tells you not only that "a+1" and "b" are not equal, what the non-equal values are, how they are not equal (e.g., find different elements of arrays) and how each side of the equality was calculated. But pytest can only "rewrite" assertion that it sees. If you call a utility function checksomething() from another module and that utility function calls assert, it will not be able to rewrite it, and you'll get ugly, hard-to-debug, assertion failures. This problem is especially noticable in tests we translated from Cassandra, in test/cqlpy/cassandra_tests. Those tests use a bunch of assertion-performing utility functions like assertRows() et al. Those utility functions are defined in a separate source file, porting.py, so by default do not get their assertions rewritten. We had a solution for this: test/cqlpy/cassandra_test/__init__.py had: pytest.register_assert_rewrite("cassandra_tests.porting") This tells pytest to rewrite assertions in porting.py the first time that it is imported. It used to work well, but recently it stopped working. This is because we change the module paths recently, and it should be written as test.cqlpy.cassandra_tests.porting. I verified by editing one of the cassandra_tests to make a bad check that indeed this statement stopped working, and fixing the module path in this way solves it, and makes assertion rewriting work again. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28411	2026-01-28 18:34:57 +02:00
Pavel Emelyanov	3ebd02513a	view_builder: Start background in maintenance group Currently view_builder::start() is called in default scheduling group. Once it initializes itself, it wakes up the step fiber that explicitly switches to maintenance scheduling group. This explicit switch made sence before previous patch, when the fiber was implemented as a serialized action. Now the fiber starts directly from .start() method and can inherit scheduling group from it. Said that, main code calls view_builder::start() in maintenance scheduling group killing two birds with one stone. First, the step fiber no longer needs borrow its scheduling group indirectly via database. Second, the start_in_background() code itself runs in a more suitable scheduling group. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-28 18:34:59 +03:00
Pavel Emelyanov	2439d27b60	view_builder: Wake-up step fiber with condition variable View builder runs a background fiber that perform build steps. To kick the fiber it uses serizlized action, but it's an overkill -- nobody waits for the action to finish, but on stop, when it's joined. This patch uses condition variable to kick the fiber, and starts it instantly, in the place where serialized action was first kicked. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-01-28 18:34:58 +03:00
Botond Dénes	1713d75c0d	docs: add feature page for automatic repair Explain what the feature is and how to confiture it. Inter-link all the repair related pages, so one can discover all about repair, regardless of which page they land on.	2026-01-28 16:45:57 +02:00
Łukasz Paszkowski	7e1bbbd937	reader_concurrency_semaphore: Check during admission if read may timeout When a shard on a replica is overloaded, it breaks down completely, throughput collapses, latencies go through the roof and the node/shard can even become completely unresponsive to new connection attempts. When reads arrive, they have to wait for admission on the reader concurrency semaphore. If the node is overloaded, the reads will be queued and thus they can time out while being in the queue or during the execution. In the latter case, the timeout does not always result in the read being aborted. Once the shard is sufficiently loaded, it is possible that most queued reads will time out, because the average time it takes for a queued read to be admitted is around that of the timeout. If a read times out, any work we already did, or are about to do on it is wasted effort. Therefore, the patch tries to prevent it by checking if an admitted read has a chance to complete in time and abort it if not. It uses the following cryteria: if read's remaining time <= read's timeout when arrived to the semaphore * preemptive factor; the read is rejected and the next one from the wait list is considered.	2026-01-28 14:24:45 +01:00
Łukasz Paszkowski	8a613960af	permit_reader::impl: Replace break with return after evicting inactive permit on timeout Evicting an inactive permit destroyes the permit object when the reader is closed, making any further member access invalid. Switch from break to an early return to prevent any possible use-after-free after evict() in the state::inactive timeout path.	2026-01-28 14:24:33 +01:00
Łukasz Paszkowski	fde09fd136	reader_concurrency_semaphore: Add preemptive_abort_factor to constructors The new parameter parametrizes the factor used to reject a read during admission. Its value shall be between 0.0 and 1.0 where + 0.0 means a read will never get rejected during admission + 1.0 means a read will immediatelly get rejected during admission Although passing values outside the interaval is possible, they will have the exact same effects as they were clamped to [0.0, 1.0].	2026-01-28 14:20:01 +01:00
Łukasz Paszkowski	21348050e8	config: Add parameters to control reads' preemptive_abort_factor	2026-01-28 14:20:01 +01:00
Łukasz Paszkowski	2d3a40e023	permit_reader: Add a new state: preemptive_aborted A permit gets into the preemptive_aborted state when: - times out; - gets rejected from execution due to high chance its execution would not finalize on time; Being in this state means a permit was removed from the wait list, its internal timer was canceled and semaphore's statistic `total_reads_shed_due_to_overload` increased.	2026-01-28 14:20:01 +01:00
Łukasz Paszkowski	5a7cea00d0	reader_concurrency_semaphore: validate waiters counter when dequeueing a waiting permit Add a defensive check in dequeue_permit() to avoid underflowing _stats.waiters and report an internal error if the stats are already inconsistent.	2026-01-28 14:19:53 +01:00
Tomasz Grabiec	df949dc506	Merge 'topology_coordinator: make cleanup reliable on barrier failures' from Łukasz Paszkowski Fix a subtle but damaging failure mode in the tablet migration state machine: when a barrier fails, the follow-up barrier is triggered asynchronously, and cleanup can get skipped for that iteration. On the next loop, the original failure may no longer be visible (because the failing node got excluded), so the tablet can incorrectly move forward instead of entering `cleanup_target`. To make cleanup reliable this PR: Adds an additional “fallback cleanup” stage - `write_both_read_old_fallback_cleanup` that does not modify read/write selectors. This stage is safe to enter immediately after a barrier failure, and it funnels the tablet into cleanup with the required barriers. Avoids changing both read and write selectors in a single step transitioning from `write_both_read_new` to `cleanup_target`. The fallback path updates selectors in a safe order: read first, then write. Allows a direct no-barrier transition from `allow_write_both_read_old` to `cleanup_target` after failure, because in that specific case `cleanup_target` doesn’t change selectors and the hop is safe. No need for backport. It's an improvement. Currently, tablets transition to `cleanup_target` eventually via failed streaming. Closes scylladb/scylladb#28169 * github.com:scylladb/scylladb: topology_coordinator: add write_both_read_old_fallback_cleanup state topology_coordinator: allow cleanup_target transition from streaming/rebuild_repair without barrier topology_coordinator: allow cleanup_target transition without barrier after failure in write_both_read_old topology_coordinator: allow cleanup_target transition without barrier after failure in allow_write_both_read_old	2026-01-28 13:33:39 +01:00
Botond Dénes	ee631f31a0	Merge 'Do not export system keyspace from raft_group0_client' from Pavel Emelyanov There are few places that use raft_group0_client as a way to get to system_keyspace. Mostly they can live without it -- either the needed reference is already at hand, or it's (ab)used to get to the database reference. The only place that really needs the system keyspace is the state merger code that needs last state ID. For that, the explicit helper method is added to group0_client. Refining API between components, not backporting Closes scylladb/scylladb#28387 * github.com:scylladb/scylladb: raft_group0_client: Dont export system keyspace raft_group0_client: Add and use get_last_group0_state_id() group0_state_machine: Call ensure_group0_sched() with data_dictionary view_building_worker: Use its own system_keyspace& reference	2026-01-28 13:24:32 +02:00
Yaron Kaikov	7c49711906	test/cqlpy: Remove redundant pytest.register_assert_rewrite call During test.py run, noticed this warning: ``` 10:38:22 test/cqlpy/cassandra_tests/validation/operations/insert_update_if_condition_test.py:14: 32 warnings 10:38:22 /jenkins/workspace/releng-testing/scylla-ci/scylla/test/cqlpy/cassandra_tests/validation/operations/insert_update_if_condition_test.py:14: PytestAssertRewriteWarning: Module already imported so cannot be rewritten: test.cqlpy.cassandra_tests.porting 10:38:22 pytest.register_assert_rewrite('test.cqlpy.cassandra_tests.porting') ``` The insert_update_if_condition_test.py was calling pytest.register_assert_rewrite() for the porting module, but this registration is already handled by cassandra_tests/__init__.py which is automatically loaded before any test runs. Closes scylladb/scylladb#28409	2026-01-28 13:17:05 +02:00
Avi Kivity	42fdea7410	github: fix iwyu workflow permissions The include-what-you-use workflow fails with ``` Invalid workflow file: .github/workflows/iwyu.yaml#L25 The workflow is not valid. .github/workflows/iwyu.yaml (Line: 25, Col: 3): Error calling workflow 'scylladb/scylladb/.github/workflows/read-toolchain.yaml@257054deffbef0bde95f0428dc01ad10d7b30093'. The nested job 'read-toolchain' is requesting 'contents: read', but is only allowed 'contents: none'. ``` Fix by adding the correct permissions. Closes scylladb/scylladb#28390	2026-01-28 12:38:54 +02:00
Jakub Smolar	e1f623dd69	skip_mode: Allow multiple build modes in pytest skip_mode marker Enhance the skip_mode marker to accept either a single mode string or a list of modes, allowing tests to be skipped across multiple build configurations with a single marker. Before: @pytest.mark.skip_mode("dev", reason="...") @pytest.mark.skip_mode("debug", reason="...") After: @pytest.mark.skip_mode(["dev", "debug"], reason="...") This reduces duplication when the same skip condition applies to multiple build modes. Closes scylladb/scylladb#28406	2026-01-28 12:27:41 +02:00
Patryk Jędrzejczak	a2c1569e04	test: test_gossiper_orphan_remover: get host ID of the bootstrapping node before it crashes The test is currently flaky. It tries to get the host ID of the bootstrapping node via the REST API after the node crashes. This can obviously fail. The test usually doesn't fail, though, as it relies on the host ID being saved in `ScyllaServer._host_id` at this point by `ScyllaServer.try_get_host_id()` repeatedly called in `ScyllaServer.start()`. However, with a very fast crash and unlucky timings, no such call may succeed. We deflake the test by getting the host ID before the crash. Note that at this point, the bootstrapping node must be serving the REST API requests because `await log.wait_for("finished do_send_ack2_msg")` above guarantees that the node has started the gossip shadow round, which happens after starting the REST API. Fixes #28385 Closes scylladb/scylladb#28388	2026-01-28 10:54:22 +02:00
Avi Kivity	8d2689d1b5	build: avoid sccache by default for Rust targets A bug[1] in sccache prevents correct distributed compilation of wasmtime. Disable it by default for now, but allow users to enable it. [1] https://github.com/mozilla/sccache/issues/2575 Closes scylladb/scylladb#28389	2026-01-28 10:36:49 +02:00
Pavel Emelyanov	2ffe5b7d80	tablet_allocator: Have its own explicit background scheduling group Currently, tablet_allocator switches to streaming scheduling group that it gets from database. It's not nice to use database as provider of configs/scheduling_groups. This patch adds a background scheduling group for tablet allocator configured via its config and sets it to streaming group in main.cc code. This will help splitting the streaming scheduling group into more elaborated groups under the maintenance supergroup: SCYLLADB-351 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#28356	2026-01-28 10:34:28 +02:00
Avi Kivity	47315c63dc	treewide: include Seastar headers with angle brackets Seastar is a "system" library from our point of view, so should be included with angle brackets. Closes scylladb/scylladb#28395	2026-01-28 10:33:06 +02:00
Botond Dénes	b7dccdbe93	Merge 'test/storage: speed up out-of-space prevention tests' from Łukasz Paszkowski This PR reduces the runtime of `test_out_of_space_prevention.py` by addressing two main sources of overhead: slow “critical utilization” setup and delayed tablet load stats propagation. Combined, these changes cut the module’s total execution time from 324s to 185s. Improvements. No backup is required. Closes scylladb/scylladb#28396 * github.com:scylladb/scylladb: test/storage: speed up out-of-space prevention tests by using smaller volumes test/storage: reduce tablet load stats refresh interval to speed up OOS prevention tests	2026-01-28 10:28:20 +02:00
Marcin Maliszkiewicz	931a38de6e	service: remove unused has_schema_access It became unused after we dropped support for thrift in `ad649be1bf` Closes scylladb/scylladb#28341	2026-01-28 10:18:26 +02:00
Pavel Emelyanov	834921251b	test: Replace memory_data_source with seastar::util::as_input_stream The existing test-only implementation is a simplified version of the generic one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#28339	2026-01-28 10:15:03 +02:00
Andrei Chekun	335e81cdf7	test.py: migrate nodetool to run by pytest As a next step of migration to the pytest runner, this PR moves responsibility for nodetool tests execution solely to the pytest. Closes scylladb/scylladb#28348	2026-01-28 09:49:59 +02:00
Tomasz Grabiec	8e831a7b6d	tablets: tablet_allocator.cc: Convert tabs to spaces	2026-01-28 01:32:01 +01:00
Tomasz Grabiec	9715965d0c	tablets: load_balancer: Warn about incomplete stats once for all offending nodes To reduce log spamming when all nodes are missing stats.	2026-01-28 01:32:01 +01:00
Tomasz Grabiec	ef0e9ad34a	tablets: load_balancer: Improve node stats printout Make it more concise: - reduce precision for load to 6 fractional digits - reduce precision for tablets/shard to 3 fractional digits - print "dc1/rack1" instead of "dc=dc1 rack=rack1", like in other places - print "rd=0 wr=0" instead of "stream_read=0 stream_write=0" Example: load_balancer - Node 477569c0-f937-11f0-ab6f-541ce4a00601: dc10/rack10c load=170.666667 tablets=1 shards=12 tablets/shard=0.083 state=normal cap=64424509440 stream: rd=0 wr=0 load_balancer - Node 47678711-f937-11f0-ab6f-541ce4a00601: dc10/rack10c load=0.000000 tablets=0 shards=12 tablets/shard=0.000 state=normal cap=64424509440 stream: rd=0 wr=0 load_balancer - Node 47832560-f937-11f0-ab6f-541ce4a00601: dc10/rack10c load=0.000000 tablets=0 shards=12 tablets/shard=0.000 state=normal cap=64424509440 stream: rd=0 wr=0	2026-01-28 01:32:01 +01:00
Tomasz Grabiec	4a161bff2d	tablets: load_balancer: Warn about imbalance only when there are no more active migrations Otherwise, it may be only a temporary situation due to lack of candidates, and may be unnecessarily alerting. Also, print node stats to allow assessing how bad the situation is on the spot. Those stats can hint to a cause of imbalance, if balancing is per-DC and racks have different capacity.	2026-01-28 01:32:00 +01:00
Tomasz Grabiec	7228bd1502	tablets: load_balancer: Extract print_node_stats()	2026-01-28 01:32:00 +01:00
Tomasz Grabiec	615b86e88b	tablet: load_balancer: Use empty() instead of size() where applicable	2026-01-28 01:32:00 +01:00
Tomasz Grabiec	12fdd205d6	tablets: Fix redundancy in migration_plan::empty()	2026-01-28 01:32:00 +01:00
Tomasz Grabiec	0d090aa47b	tablets: Cache pointer to stats during plan-making Saves on lookup cost, esp. for candidate evaluation. This showed up in perf profile in the past. Also, lays the ground for splitting stats per rack.	2026-01-28 01:32:00 +01:00
Tomasz Grabiec	f2b0146f0f	tablets: load_balancer: Print rack in addition to DC when giving context Load-balancing can be now per-rack instead of per-DC. So just printing "in DC" is confusing. If we're balancing a rack, we should print which rack is that.	2026-01-28 01:32:00 +01:00
Tomasz Grabiec	df32318f66	tablets: load_balancer: Make plan summary concise Before: load_balancer - Prepared 1 migration plans, out of which there were 1 tablet migration(s) and 0 resize decision(s) and 0 tablet repair(s) and 0 rack-list colocation(s) After: load_balancer - Prepared plan: migrations: 1 We print only stats about elements which are present.	2026-01-28 01:32:00 +01:00

1 2 3 4 5 ...

51816 Commits