scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 14:15:46 +00:00

Files

Patryk Jędrzejczak b0eef50b2e raft topology: make left_token_ring a transition state

A node can be in the `left_token_ring` state after:
- a finished decommission,
- a failed bootstrap,
- a failed replace.

When a node is in the `left_token_ring` state, we don't know how
it has ended up in this state. We cannot distinguish a node that
has finished decommissioning from a node that has failed bootstrap.

The main problem it causes is that we incorrectly send the
`barrier_and_drain` command to a node that has failed
bootstrapping or replacing. We must do it for a node that has
finished decommissioning because it could still coordinate
requests. However, since we cannot distinguish nodes in the
`left_token_ring` state, we must send the command to all of them.
This issue appeared in scylladb/scylladb#16797 and this patch is
a follow-up that fixes it.

The solution is changing `left_token_ring` from a node state
to a transition state.

Regarding implementation, most of the changes are simple
refactoring. The less obvious are:
- Before this patch, in `system_keyspace::left_topology_state`, we
had to keep the ignored nodes' IDs for replace to ensure that the
replacing node will have access to it after moving to the
`left_token_ring` state, which happens when replace fails. We
don't need this workaround anymore. When we enter the new
`left_token_ring` transition state, the new node will still be in
the `decommissioning` state, so it won't lose its request param.
- Before this patch, a decommissioning node lost its tokens
while moving to the `left_token_ring` state. After the patch, it
loses tokens while still being in the `decommissioning` state. We
ensure that all `decommissioning` handlers correctly handle a node
that lost its tokens.

Moving the `left_token_ring` handler from `handle_node_transition`
to `handle_topology_transition` created a large diff. There are
only three changes:
- adding `auto node = get_node_to_work_on(std::move(guard));`,
- adding `builder.del_transition_state()`,
- changing error logged when `global_token_metadata_barrier` fails.

2024-01-29 10:39:07 +01:00

alternator

test/alternator: add more tests for TagResource

2024-01-23 11:55:22 +02:00

auth_cluster

test.py: test_maintenance_socket: remove pytest.xfail

2024-01-19 14:54:15 +01:00

boost

Merge ' db: commitlog_replayer: ignore mutations affected by (tablet) cleanups ' from Michał Chojnowski

2024-01-25 20:51:03 +02:00

broadcast_tables

db: config: make consistent_cluster_management mandatory

2023-12-14 16:54:04 +01:00

cql

cql3:statement_restrictions.cc add more conditions to prevent "allow filtering" error to pop up in delete/update statements

2023-12-07 21:25:18 +02:00

cql-pytest

DROP TYPE IF EXISTS should work on non-existent keyspace

2024-01-25 14:28:43 +02:00

lib

main: Postpone start-up of hint manager

2024-01-26 12:49:40 +01:00

manual

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

nodetool

test/nodetool: only test "storage_service/cleanup_all" with scylla

2024-01-26 13:19:15 +02:00

object_store

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

perf

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

pylib

Merge 'Add maintenance mode' from Mikołaj Grzebieluch

2024-01-26 11:02:34 +01:00

pylib_test

test.py: support code coverage

2024-01-18 11:11:34 +02:00

raft

build: cmake: add "unit_test_list" target

2024-01-10 08:43:04 +02:00

redis

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

resource

rust: update dependencies

2023-12-17 13:20:25 +02:00

rest_api

compaction_manager: perform_task_on_all_files: return early when there are no sstables to compact

2024-01-17 11:53:39 +02:00

scylla-gdb

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

topology

raft topology: make left_token_ring a transition state

2024-01-29 10:39:07 +01:00

topology_custom

test.py: add test for maintenance mode

2024-01-25 15:27:53 +01:00

topology_experimental_raft

Merge ' db: commitlog_replayer: ignore mutations affected by (tablet) cleanups ' from Michał Chojnowski

2024-01-25 20:51:03 +02:00

unit

test/unit/bptree_validation: use "{}" for formatting test_data

2024-01-11 10:53:33 +02:00

__init__.py

…

CMakeLists.txt

build: cmake: add "unit_test_list" target

2024-01-10 08:43:04 +02:00

README.md

test: provide overview of the contents of test/ directory

2023-11-26 15:51:07 +02:00

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cql-pytest - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.