scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 04:26:48 +00:00

Files

Patryk Jędrzejczak e1c3f666c9 Merge 'vnode cleanup: add missing barriers and fix race conditions' from Petr Gusev

Problems addressed by this PR

* Missing barrier before cleanup: If a node was bootstrapped before cleanup, some request coordinators could still be in `write_both_read_new` and send stale requests to replicas being cleaned up.
* Sessions not drained before cleanup: We lacked protection against stale streaming or repair operations.
* `sstable_vnodes_cleanup_fiber()` calling `flush_all_tables()` under group0 lock: This caused SCT test failures (see [this comment](https://github.com/scylladb/scylladb/issues/25333#issuecomment-3298859046) for details).
* Issues with `storage_proxy::start_write()` used by `sstable_vnodes_cleanup_fiber`:
  * The result of `start_write()` was not held during `abstract_write_response_handler::apply_locally`, so coordinator-local writes were not properly awaited.
  * Synchronization was racy — `start_write()` was not atomic with the fence check, allowing stale writes to sneak in if `fence_version` changed in between.
  * It waited for all writes, including local tables and tablet-based tables, which is redundant because `sstable_vnodes_cleanup_fiber` does not apply to them.
  * It also waited for writes with versions greater than the current `fence_version`, which is unnecessary.

Fixes scylladb/scylladb#26150

backport: this PR fixes several issues with the vnodes cleanup procedure, but it doesn't seem they are critical enough to deserve backporting

Closes scylladb/scylladb#26315

* https://github.com/scylladb/scylladb:
  test_automatic_cleanup: add test_cleanup_waits_for_stale_writes
  test_fencing: fix due to new version increment
  test_automatic_cleanup: clean it up
  storage_proxy: wait for closing sessions in sstable cleanup fiber
  storage_proxy: rename await_pending_writes -> await_stale_pending_writes
  storage_proxy: use run_fenceable_write
  storage_proxy: abstract_write_response_handler: apply_locally: extract post fence check
  storage_proxy: introduce run_fenceable_write
  storage_proxy: move update_fence_version from shared_token_metadata
  storage_proxy: fix start_write() operation scope in apply_locally
  storage_proxy: move post fence check into handle_write
  storage_proxy: move fencing into mutate_counter_on_leader_and_replicate
  storage_proxy::handle_read: add fence check before get_schema
  storage_service: rebrand cleanup_fiber to vnodes_cleanup_fiber
  sstable_cleanup_fiber: use coroutine::parallel_for_each
  storage_service: sstable_cleanup_fiber: move flush_all_tables out of the group0 lock
  topology_coordinator: barrier before cleanup
  topology_coordinator: small start_cleanup refactoring
  global_token_metadata_barrier: add fenced flag

2025-10-27 12:35:13 +01:00

alternator

test/alternator/test_tablets: add test for GSI backfill with tablets

2025-10-22 00:34:49 +02:00

boost

Merge 'replica/mutation_dump: include empty/dead partitions in the scan results' from Botond Dénes

2025-10-24 23:26:16 +03:00

broadcast_tables

…

cluster

Merge 'vnode cleanup: add missing barriers and fix race conditions' from Petr Gusev

2025-10-27 12:35:13 +01:00

cql

schema_tables: Keep "replication" column backwards-compatible by expanding rack lists to numeric RF

2025-10-21 09:11:25 +03:00

cqlpy

Merge 'scylla-sstable: add cql support to write operation' from Botond Dénes

2025-10-24 23:32:40 +03:00

ldap

auth: add query_state parameter to query functions

2025-09-25 16:46:50 +02:00

lib

Merge 'RFC: Initial GCP storage backend for scylla (sstables + backup)' from Calle Wilund

2025-10-20 13:14:53 +03:00

manual

message: move RPC compression from utils/ to message/

2025-09-30 17:03:09 +03:00

nodetool

Merge 'Add --drop-unfixable-sstables flag for scrub in segregate mode' from Taras Veretilnyk

2025-10-23 11:06:19 +03:00

perf

mutation/mutation_compactor: add tombstone_gc_state to query ctor

2025-10-12 17:48:15 +03:00

pylib

Merge 'vnode cleanup: add missing barriers and fix race conditions' from Petr Gusev

2025-10-27 12:35:13 +01:00

pylib_test

…

raft

raft: refactor can_vote logic and type

2025-09-24 13:55:05 +02:00

resource

test/resource: add ms sample sstable files for relevant tests

2025-09-29 22:15:25 +02:00

rest_api

rest_api/test_storage_service: add v2 natural_endpoints test for composite key with multiple components

2025-10-01 15:53:25 +02:00

scylla_gdb

scylla-gdb: add scylla prepared-statements

2025-09-16 23:40:47 +03:00

storage

database: Log message after critical_disk_utilization mode is set

2025-10-20 13:24:10 +03:00

unit

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

vector_search

vector_search: remove dependence on cql3

2025-10-21 17:41:55 +03:00

__init__.py

test.py: metrics: add host_id suffix to .db file

2025-08-19 11:33:11 +00:00

CMakeLists.txt

vector_store_client_test: Relocate to a dedicated directory

2025-09-25 14:04:28 +02:00

conftest.py

test.py: refactor: move framework-related code to test.pylib.runner

2025-08-17 12:32:35 +00:00

pytest.ini

tiering (test.py): introduce tiering labels

2025-08-04 15:38:16 +03:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.