scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 20:27:03 +00:00

Files

Botond Dénes fd6877c654 Merge 'alternator: avoid oversized allocation in Query/Scan' from Nadav Har'El

This series fixes one cause of oversized allocations - and therefore potentially stalls and increased tail latencies - in Alternator.

The first patch in the series is the main fix - the later patches are cleanups requested by reviewers but also involved other pre-existing code, so I did those cleanups as separate patches.

Alternator's Scan or Query operation return a page of results. When the number of items is not limited by a "Limit" parameter, the default is to return a 1 MB page. If items are short, a large number of them can fit in that 1MB. The test test_query.py::test_query_large_page_small_rows has 30,000 items returned in a single page.

In the response JSON, all these items are returned in a single array "Items". Before this patch, we build the full response as a RapidJSON object before sending it. The problem is that unfortunately, RapidJSON stores arrays as contiguous allocations. This results in large contiguous allocations in workloads that scan many small items, and large contiguous allocations can also cause stalls and high tail latencies. For example, before this patch, running

    test/alternator/run --runveryslow \
        test_query.py::test_query_large_page_small_rows

reports in the log:

    oversized allocation: 573440 bytes.

After this patch, this warning no longer appears.
The patch solves the problem by collecting the scanned items not in a RapidJSON array, but rather in a chunked_vector<rjson::value>, i.e, a chunked (non-contiguous) array of items (each a JSON value). After collecting this array separately from the response object, we need to print its content without actually inserting it into the object - we add a new function print_with_extra_array() to do that.

The new separate-chunked-vector technique is used when a large number (currently, >256) of items were scanned. When there is a smaller number of items in a page (this is typical when each item is longer), we just insert those items in the object and print it as before.

Beyond the original slow test that demonstrated the oversized allocation (which is now gone), this patch also includes a new test which exercises the new code with a scan of 700 (>256) items in a page - but this new test is fast enough to be permanently in our test suite and not a manual "veryslow" test as the other test.

Fixes #23535

The stalls caused by large allocations was seen by actual users, so it makes sense to backport this patch. On the other hand, the patch while not big is fairly intrusive (modifies the nomal Scan and Query path and also the later patches do some cleanup of additional code) so there is some small risk involved in the backport.

Closes scylladb/scylladb#24480

* github.com:scylladb/scylladb:
  alternator: clean up by co-routinizing
  alternator: avoid spamming the log when failing to write response
  alternator: clean up and simplify request_return_type
  alternator: avoid oversized allocation in Query/Scan

2025-07-17 11:30:40 +03:00

alternator

Merge 'alternator: avoid oversized allocation in Query/Scan' from Nadav Har'El

2025-07-17 11:30:40 +03:00

boost

Merge 'auth: move passwords::check call to alien thread' from Andrzej Jackowski

2025-07-16 13:15:54 +03:00

broadcast_tables

test.py: cql: run tests using bare pytest command

2025-06-03 07:54:51 +00:00

cluster

Merge 'streaming: Avoid deadlock by running view checks in a separate scheduling group' from Tomasz Grabiec

2025-07-17 10:24:41 +03:00

cql

test.py: cql: don't exit from pytest session on failed CQL

2025-06-03 07:54:51 +00:00

cqlpy

test: wait for 3 clients with given username in test_service_level_api

2025-07-15 23:28:39 +02:00

ldap

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

lib

tests::proc::process_fixture: Fix line handler adaptor buffering

2025-07-17 10:58:03 +03:00

manual

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

nodetool

tools/scylla-nodetool: backup: add --move-files parameter

2025-06-27 16:21:39 +03:00

perf

Merge 'Atomic in-memory schema changes application' from Marcin Maliszkiewicz

2025-07-13 20:47:55 +03:00

pylib

Merge 'test.py: add missed parameters that should be passed from test.py to pytest' from Andrei Chekun

2025-07-16 15:29:17 +02:00

pylib_test

…

raft

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

redis

…

resource

types/comparable_bytes: add testcase to verify compatibility with cassandra

2025-07-01 22:19:08 +05:30

rest_api

test.py: Fix test_compactionhistory_rows_merged_time_window_compaction_strategy

2025-07-01 15:01:21 +03:00

scylla_gdb

…

unit

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

__init__.py

test.py: allow cmake configuration and ./configure.py configuration to coexist

2025-06-03 16:46:41 +03:00

CMakeLists.txt

…

conftest.py

test.py: add bypassing x_log2_compaction_groups to boost tests

2025-07-11 12:30:09 +02:00

pytest.ini

test.py: dtest: add missed markers to pytest.ini

2025-06-30 10:06:32 +00:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.