scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-22 01:20:39 +00:00

Files

Avi Kivity 6b415cfd4b Merge 'managed_bytes: in the copy constructor, respect the target preferred allocation size' from Michał Chojnowski

Commit 14bf09f447 added a single-chunk layout to `managed_bytes`, which makes the overhead of `managed_bytes` smaller in the common case of a small buffer.

But there was a bug in it. In the copy constructor of `managed_bytes`, a copy of a single-chunk `managed_bytes` is made single-chunk too.

But this is wrong, because the source of the copy and the target of the copy might have different preferred max contiguous allocation sizes.

In particular, if a `managed_bytes` of size between 13 kiB and 128 kiB is copied from the standard allocator into LSA, the resulting `managed_bytes` is a single chunk which violates LSA's preferred allocation size. (And therefore is placed by LSA in the standard allocator).

In other words, since Scylla 6.0, cache and memtable cells between 13 kiB and 128 kiB are getting allocated in the standard allocator rather than inside LSA segments.

Consequences of the bug:

1. Effective memory consumption of an affected cell is rounded up to the nearest power of 2.

2. With a pathological-enough allocation pattern (for example, one which somehow ends up placing a single 16 kiB memtable-owned allocation in every aligned 128 kiB span), memtable flushing could theoretically deadlock, because the allocator might be too fragmented to let the memtable grow by another 128 kiB segment, while keeping the sum of all allocations small enough to avoid triggering a flush. (Such an allocation pattern probably wouldn't happen in practice though).

3. It triggers a bug in reclaim which results in spurious allocation failures despite ample evictable memory.

   There is a path in the reclaimer procedure where we check whether reclamation succeeded by checking that the number of free LSA segments grew.

   But in the presence of evictable non-LSA allocations, this is wrong because the reclaim might have met its target by evicting the non-LSA allocations, in which case memory is returned directly to the standard allocator, rather than to the pool of free segments.

   If that happens, the reclaimer wrongly returns `reclaimed_nothing` to Seastar, which fails the allocation.

Refs (possibly fixes) https://github.com/scylladb/scylladb/issues/21072
Fixes https://github.com/scylladb/scylladb/issues/22941
Fixes https://github.com/scylladb/scylladb/issues/22389
Fixes https://github.com/scylladb/scylladb/issues/23781

This is a regression fix, should be backported to all affected releases.

Closes scylladb/scylladb#23782

* github.com:scylladb/scylladb:
  managed_bytes_test: add a reproducer for #23781
  managed_bytes: in the copy constructor, respect the target preferred allocation size

2025-04-17 21:14:10 +03:00

alternator

test_returnconsumedcapacity.py: test RCU for batch get item

2025-04-16 17:05:32 +03:00

boost

Merge 'managed_bytes: in the copy constructor, respect the target preferred allocation size' from Michał Chojnowski

2025-04-17 21:14:10 +03:00

broadcast_tables

…

cluster

wip

2025-04-17 03:01:17 -04:00

cql

…

cqlpy

test_tools: Manual merge of local key gen tool test from enterprise

2025-04-15 15:14:08 +03:00

ldap

test: ldap: avoid io_uring Seastar reactor backend

2025-03-28 07:45:53 +02:00

lib

Merge 'Use named gates' from Benny Halevy

2025-04-14 20:56:32 +03:00

manual

readers: mv forwardable_v2.hh forwardable.hh

2025-04-16 04:33:50 -04:00

nodetool

test: nodetool: add tests for cluster repair command

2025-04-08 09:13:14 +02:00

perf

Merge 'readers: strip "flat" and "v2" from names' from Botond Dénes

2025-04-16 20:21:51 +03:00

pylib

raft: make group0 Raft operation timeout configurable

2025-04-15 10:57:39 +03:00

pylib_test

test.py: remove pylib_test from test.py/CI run

2025-04-01 16:43:45 +03:00

raft

Move direct_failure_detector from root to service/

2025-04-08 13:03:24 +03:00

redis

…

resource

…

rest_api

test: Add unit test for total/live sstable sizes

2025-03-04 19:52:33 +03:00

scylla_gdb

test/scylla_gdb: generate a coredump when coro_task fails

2025-04-15 15:16:38 +03:00

unit

replica/memtable: s/make_flat_reader/make_mutation_reader/

2025-04-01 17:58:13 +03:00

__init__.py

test.py: refactor paths constants and options

2025-03-30 03:19:29 +00:00

CMakeLists.txt

…

conftest.py

test.py: Make the testpy log files in pytest follow the same format

2025-04-14 12:52:48 +03:00

pytest.ini

Merge 'test/pylib: servers_add: support list of property_files' from Benny Halevy

2025-04-01 09:14:20 +03:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.