scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 03:20:37 +00:00

Files

Dawid Mędrek 0a6137218a db/hints: Cancel draining when stopping node

Draining hints may occur in one of the two scenarios:

* a node leaves the cluster and the local node drains all of the hints
  saved for that node,
* the local node is being decommissioned.

Draining may take some time and the hint manager won't stop until it
finishes. It's not a problem when decommissioning a node, especially
because we want the cluster to retain the data stored in the hints.
However, it may become a problem when the local node started draining
hints saved for another node and now it's being shut down.

There are two reasons for that:

* Generally, in situations like that, we'd like to be able to shut down
  nodes as fast as possible. The data stored in the hints won't
  disappear from the cluster yet since we can restart the local node.
* Draining hints may introduce flakiness in tests. Replaying hints doesn't
  have the highest priority and it's reflected in the scheduling groups we
  use as well as the explicitly enforced throughput. If there are a large
  number of hints to be replayed, it might affect our tests.
  It's already happened, see: scylladb/scylladb#21949.

To solve those problems, we change the semantics of draining. It will behave
as before when the local node is being decommissioned. However, when the
local node is only being stopped, we will immediately cancel all ongoing
draining processes and stop the hint manager. To amend for that, when we
start a node and it initializes a hint endpoint manager corresponding to
a node that's already left the cluster, we will begin the draining process
of that endpoint manager right away.

That should ensure all data is retained, while possibly speeding up
the shutdown process.

There's a small trade-off to it, though. If we stop a node, we can then
remove it. It won't have a chance to replay hints it might've before
these changes, but that's an edge case. We expect this commit to bring
more benefit than harm.

We also provide tests verifying that the implementation works as intended.

Fixes scylladb/scylladb#21949

Closes scylladb/scylladb#22811

2025-03-13 11:55:15 +02:00

alternator

alternator: Clean error handling on CreateTable without AttributeDefinitions

2025-02-26 14:24:57 +02:00

boost

Merge 'tablets: Make load balancing capacity-aware' from Tomasz Grabiec

2025-03-11 14:34:27 +02:00

broadcast_tables

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cluster

db/hints: Cancel draining when stopping node

2025-03-13 11:55:15 +02:00

cql

cql: restore validating replication strategies options

2025-02-04 12:27:33 +01:00

cqlpy

cql3/select_statement: reject aggregate functions when PER PARTITION LIMIT is present

2025-03-13 10:29:53 +02:00

ldap

test.py: Add possibility to run ldap tests from pytest

2025-02-07 21:40:28 +01:00

lib

Merge 'tablets: Make load balancing capacity-aware' from Tomasz Grabiec

2025-03-11 14:34:27 +02:00

manual

treewide: Reduce db/config.hh header fanout

2025-02-25 15:16:40 +01:00

nodetool

tools/scylla-nodetool: netstats: don't assume both senders and receivers

2025-02-15 20:32:22 +02:00

perf

test/perf/s3: Don't forget to stop sharded<tester> on error

2025-03-13 09:54:09 +02:00

pylib

aws_error: Enhance error handling for AWS HTTP client

2025-03-10 09:01:47 +02:00

pylib_test

…

raft

test: Add the possibility to run raft tests with pytest

2025-02-12 14:10:19 +02:00

redis

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

resource

build: cmake: use wasm32-wasip1 as an alternative of wasm32-wasi

2025-01-16 16:28:29 +03:00

rest_api

test: Add unit test for total/live sstable sizes

2025-03-04 19:52:33 +03:00

scylla_gdb

scylla-gdb.py: add scylla tablet-metadata command

2025-02-11 07:29:46 -05:00

unit

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

__init__.py

…

CMakeLists.txt

Introduce LDAP role manager & saslauthd authenticator

2025-01-12 14:50:29 +02:00

conftest.py

test.py: extract prepare dirs and S3 mock steps to test/conftest.py

2025-03-03 13:24:37 +03:00

pytest.ini

test.py: introduce prepare_3_nodes_cluster marker

2025-03-04 10:32:43 +01:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.