scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 03:30:49 +00:00

Files

Botond Dénes 9dff9752b4 Merge 'Fix regression in Alternator TTL with tablets and node going down' from Nadav Har'El

Recently we suffered a regression on how Alternator TTL behaves when a node goes down when tablets are used.

Usually, expiration of data in a particular tablet are handled by this tablet's "primary replica". However, if that node is down, we want another node to perform these expiration until the primary replica goes back online. We created a function `tablet_map::get_secondary_replica()` to select that "other node". We don't care too much what the "secondary replica" means, but we do care that it's different from the primary replica - if it's the same the expiration of that tablet will never be done.

It turns out that recently, in commits 817fdad and d88036d, the implementation of get_primary_replica() changed without a corresponding change to get_secondary_replica(). After those changes, the two functions are mismatched, and sometimes return the same node for both primary and secondary replica.

Unfortunately, although we had a dtest for the handling of a dead node in Alternator TTL, it failed to reproduce this bug, so this regression was missed - nothing else besides Alternator TTL ever used the get_secondary_replica() function.

So this series, in addition to fixing the bug, we add two tests that reproduce this bug (fail before the fix, pass with the fix):

1. A unit test that checks that get_secondary_replica() always returns a different node from get_primary_replica()
2. A cluster test based on the original dtest, which does reproduce this bug in Alternator TTL where some of the data was never expired (but only failed in release build, for an unknown reason).

Fixes SCYLLADB-777.

Closes scylladb/scylladb#28771

* github.com:scylladb/scylladb:
  test: add unit test for tablet_map::get_secondary_replica()
  test, alternator: add test for TTL expiration with a node down
  locator: fix get_secondary_replica() to match get_primary_replica()

2026-02-25 10:13:55 +02:00

alternator

Merge 'test: remove xfail marker from a few passing tests' from Nadav Har'El

2026-02-05 10:10:43 +01:00

boost

test: add unit test for tablet_map::get_secondary_replica()

2026-02-23 16:19:43 +02:00

broadcast_tables

test.py: switch of execution of several test directories by test.py runner

2026-01-09 11:59:25 +01:00

cluster

Merge 'Fix regression in Alternator TTL with tablets and node going down' from Nadav Har'El

2026-02-25 10:13:55 +02:00

cql

test.py: switch of execution of several test directories by test.py runner

2026-01-09 11:59:25 +01:00

cqlpy

Merge 'vector_search: return NaN for similarity_cosine with all-zero vectors' from Dawid Pawlik

2026-02-23 13:10:44 +01:00

ldap

auth: ldap: add permissions reload to unified cache

2026-02-17 17:56:27 +01:00

lib

test: perf: fix prepared statements logic in perf-simple-query

2026-02-19 12:42:07 +02:00

manual

Populate all sl:* groups into dedicated top-level supergroup

2026-01-21 14:14:48 +02:00

nodetool

nodetool: fix handling of "--primary-replica-only" argument

2026-02-18 12:21:27 +02:00

perf

Merge 'Reapply "main: test: add future and abort_source to after_init_func"' from Marcin Maliszkiewicz

2026-02-19 19:12:46 +02:00

pylib

test.py: improve stdout output for boost test

2026-02-25 00:50:25 +01:00

pylib_test

…

raft

test: raft: Add test_aborting_wait_for_state_change

2026-02-19 14:21:01 +01:00

resource

build: apply sccache to rust builds too

2025-12-22 15:36:15 +02:00

rest_api

test: Keep test_gossiper_live_endpoints checks togethger

2026-01-23 16:53:48 +02:00

scylla_gdb

test/scylla_gdb: skip coroutine tests if coroutine frame is not found

2026-02-24 10:12:03 +01:00

unit

…

vector_search

Merge 'vector_search: return NaN for similarity_cosine with all-zero vectors' from Dawid Pawlik

2026-02-23 13:10:44 +01:00

__init__.py

test.py: introduce new environment variable TESTPY_PREPARED_ENVIRONMENT

2026-01-09 11:59:25 +01:00

CMakeLists.txt

Revert "Merge 'vector_search: add validator tests' from Pawel Pery"

2026-02-08 16:29:58 +02:00

conftest.py

…

pytest.ini

test.py: improve C++ fail summary in pytest

2026-02-17 14:25:28 +03:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.