scylladb

Files

Piotr Dulikowski 591a67c7e7 Merge 'view_builder: register view on all shards atomically' from Michael Litvak

When the view builder starts to build a new view, each shard registers
itself by writing the shard id and current token to the
scylla_views_builds_in_progress table.

Previously, this happened independently by each shard. We change it now
to register all shards "atomically" - when a shard registers itself, it
also registers all other shards with an empty status, if they aren't
registered yet. This ensures that we don't have a partial state in the
table where only some of the shards are registered, but we always have a
status for all shards.

The reason we want to register all shards atomically is that if it
happens that only some of the shards were registered, then we restart
and load the status from table, this doesn't work well for multiple
reasons.

One example is that to know how many shards we had previously, we take
the maximum shard id we see in the table. If it's different than the
current shard count, we will execute the reshard code. But of course, if
the last shard is missing from the table because it didn't register
itself, this calculation will be wrong, and we can't know the previous
number of shards.

This is a problem because suppose we have two shards, and shard 0
finished building the view but shard 1 didn't start. When we come up, we
will think that previously we had only a single shard and it completed
building everything, when in fact we built only half the view
approximately. The problem is that we don't have enough information in
the tables to know that.

There are additional problems related to reshard. In the reshard
function, whether it is executed because we actually do node reshard or
because we calculated the wrong number of previous shards, if the status
of some shard is missing then the calculation of new ranges will be
wrong. When some shard didn't make progress we should start building the
view from scratch. However, this doesn't happen if we don't have a
status for the shard, because the code looks only for shards that have a
status. In effect, this shard is considered complete even though it
didn't start. This could cause the view building to get stuck or
complete without building all tokens ranges.

By registering all shards atomically, this should solve the above
problems because we will always have statuses for all shards.

Fixes https://github.com/scylladb/scylladb/issues/22989

backport not needed - the issue is probably not common and there's a workaround

Closes scylladb/scylladb#25790

* github.com:scylladb/scylladb:
  test: mv: add a test for view build interrupt during registration
  view_builder: register view on all shards atomically

2025-09-22 08:03:44 +02:00

alternator

Merge 'alternator: Store LSI keys in :attrs for newly created tables' from Piotr Wieczorek

2025-09-18 21:48:43 +03:00

boost

Merge 'view_builder: register view on all shards atomically' from Michael Litvak

2025-09-22 08:03:44 +02:00

broadcast_tables

…

cluster

Merge 'view_builder: register view on all shards atomically' from Michael Litvak

2025-09-22 08:03:44 +02:00

cql

test/cql: enable cql cdc tests to run with tablets

2025-09-17 14:47:13 +02:00

cqlpy

vector search: correct column name formatting

2025-09-20 07:02:53 +02:00

ldap

auth: allow dropping roles in saslauthd_authenticator

2025-08-22 09:40:44 +03:00

lib

db: get rid of sstables-format-selector

2025-09-19 16:17:56 +03:00

manual

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

nodetool

nodetool: ignore repair request error of colocated tables

2025-09-18 09:35:53 +02:00

perf

test/perf/tablet_load_balancing.cc: Create nodes within one DC

2025-09-21 21:36:43 +02:00

pylib

test: add reload_raft_topology_state() to ScyllaRESTAPIClient

2025-09-18 09:28:32 +02:00

pylib_test

…

raft

test.py: pytest: support --mode/--repeat in a common way for all tests

2025-08-17 15:26:23 +00:00

resource

Merge 'types: add byte-comparable format support for collections' from Lakshmi Narayanan Sreethar

2025-08-31 15:53:27 +03:00

rest_api

test: add compaction task progress test

2025-08-28 12:10:13 +02:00

scylla_gdb

scylla-gdb: add scylla prepared-statements

2025-09-16 23:40:47 +03:00

storage

tests/cluster: Add new storage tests

2025-08-29 14:56:13 +02:00

unit

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

__init__.py

test.py: metrics: add host_id suffix to .db file

2025-08-19 11:33:11 +00:00

CMakeLists.txt

…

conftest.py

test.py: refactor: move framework-related code to test.pylib.runner

2025-08-17 12:32:35 +00:00

pytest.ini

tiering (test.py): introduce tiering labels

2025-08-04 15:38:16 +03:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.