scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 18:50:53 +00:00

Nadav Har'El c04b086929 alternator: avoid oversized allocation in Query/Scan

This patch fixes one cause of oversized allocations - and therefore
potentially stalls and increased tail latencies - in Alternator.

Alternator's Scan or Query operation returns a page of results. When the
number of items is not limited by a "Limit" parameter, the default is
to return a 1 MB page. If items are short, a large number of them can
fit in that 1MB. The test test_query.py::test_query_large_page_small_rows
has 30,000 items returned in a single page.

In the response JSON, all these items are returned in a single array
"Items". Before this patch, we built the full response as a RapidJSON
object before sending it. The problem is that unfortunately, RapidJSON
stores arrays as contiguous allocations. This results in large
contiguous allocations in workloads that scan many small items, and
large contiguous allocations can also cause stalls and high tail
latencies. For example, before this patch, running

    test/alternator/run --runveryslow \
        test_query.py::test_query_large_page_small_rows

reports in the log:

    oversized allocation: 573440 bytes.

After this patch, this warning no longer appears.
The patch solves the problem by collecting the scanned items not in a
RapidJSON array, but rather in a chunked_vector<rjson::value>, i.e.,
a chunked (non-contiguous) array of items (each a JSON value).
After collecting this array separately from the response object, we
need to print its content without actually inserting it into the object -
we add a new function print_with_extra_array() to do that.

The new separate-chunked-vector technique is used when a large number
(currently, >256) of items were scanned. When there is a smaller number
of items in a page (this is typical when each item is longer), we just
insert those items in the object and print it as before.
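
The idea described above can be illustrated with a small, self-contained Python sketch. This is not the actual C++ implementation: the `ChunkedList` class, the `CHUNK_SIZE` value, the `write_page` helper, and the Python signature given to `print_with_extra_array` are all hypothetical, and only the ">256" threshold and the "Items" array name come from the commit message. The sketch collects items in many small chunks and streams them into the output without ever attaching them to the response object.

```python
import io
import json

CHUNK_SIZE = 128             # hypothetical chunk size, chosen for illustration
EXTRA_ARRAY_THRESHOLD = 256  # the ">256" threshold mentioned in the commit message


class ChunkedList:
    """Append-only sequence stored as many small lists instead of one large one."""

    def __init__(self, chunk_size=CHUNK_SIZE):
        self._chunk_size = chunk_size
        self._chunks = []
        self._len = 0

    def append(self, item):
        # Start a new chunk once the last one is full.
        if not self._chunks or len(self._chunks[-1]) == self._chunk_size:
            self._chunks.append([])
        self._chunks[-1].append(item)
        self._len += 1

    def __len__(self):
        return self._len

    def __iter__(self):
        for chunk in self._chunks:
            yield from chunk


def print_with_extra_array(out, response, array_name, items):
    """Write `response` as a JSON object with one extra array member,
    serializing the items one at a time instead of inserting them
    into `response` first (hypothetical Python analogue)."""
    out.write("{")
    for key, value in response.items():
        out.write(json.dumps(key) + ":" + json.dumps(value) + ",")
    out.write(json.dumps(array_name) + ":[")
    for i, item in enumerate(items):
        if i:
            out.write(",")
        out.write(json.dumps(item))
    out.write("]}")


def write_page(out, items):
    """Pick the streaming path for large pages, the simple path otherwise."""
    response = {"Count": len(items)}
    if len(items) > EXTRA_ARRAY_THRESHOLD:
        print_with_extra_array(out, response, "Items", items)
    else:
        # Few items: insert them into the object and print it as before.
        response["Items"] = list(items)
        out.write(json.dumps(response))
```

Both paths produce the same JSON document when parsed; only the way the "Items" array is held in memory before printing differs.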

Beyond the original slow test that demonstrated the oversized allocation
(which is now gone), this patch also includes a new test which
exercises the new code with a scan of 700 (>256) items in a page -
but this new test is fast enough to be permanently in our test suite
rather than a manual "veryslow" test like the other one.

Fixes #23535

(cherry picked from commit 2385fba4b6)

Closes scylladb/scylladb#25654

2025-09-01 16:40:02 +03:00

alternator

alternator: avoid oversized allocation in Query/Scan

2025-09-01 16:40:02 +03:00

boost

commitlog: Ensure segment deletion is re-entrant

2025-08-30 18:51:35 +03:00

broadcast_tables

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cluster

main: Log RF-rack-invalid keyspaces at startup

2025-08-22 14:31:13 +00:00

cql

cql: restore validating replication strategies options

2025-02-04 12:27:33 +01:00

cqlpy

test: run mv tests depending on metrics on a standalone instance

2025-08-19 17:10:58 +03:00

ldap

test.py: move the readme file for LDAP tests to the correct location

2025-04-22 19:03:28 +02:00

lib

main: Log RF-rack-invalid keyspaces at startup

2025-08-22 14:31:13 +00:00

manual

gossip: add recovery_leader to gossip_digest_syn

2025-08-05 10:59:06 +00:00

nodetool

nodetool: repair: skip tablet keyspaces

2025-07-15 06:36:08 +03:00

perf

test: switch uses of make_sstable_compressor_factory() to a seastar::thread-dependent version

2025-05-12 09:12:05 +00:00

pylib

Merge '[Backport 2025.2] generic server: 2 step shutdown' from Scylladb[bot]

2025-08-19 17:11:22 +03:00

pylib_test

test.py: remove pylib_test from test.py/CI run

2025-04-01 16:43:45 +03:00

raft

raft: replication test: change rpc_propose_conf_change test to SEASTAR_THREAD_TEST_CASE

2025-08-15 13:27:52 +03:00

redis

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

resource

build: cmake: use wasm32-wasip1 as an alternative of wasm32-wasi

2025-01-16 16:28:29 +03:00

rest_api

keys: from_nodetool_style_string don't split single partition keys

2025-08-26 10:31:54 +03:00

scylla_gdb

test/scylla_gdb: better error message when running on dev build mode

2025-04-22 15:02:06 +03:00

unit

replica/memtable: s/make_flat_reader/make_mutation_reader/

2025-04-01 17:58:13 +03:00

__init__.py

test.py: move get_combined_tests to the correct facade

2025-04-24 14:05:49 +02:00

CMakeLists.txt

Introduce LDAP role manager & saslauthd authenticator

2025-01-12 14:50:29 +02:00

conftest.py

test.py: move setup cgroups to the generic method

2025-04-24 14:05:49 +02:00

pytest.ini

Merge 'test/pylib: servers_add: support list of property_files' from Benny Halevy

2025-04-01 09:14:20 +03:00

README.md

test: rename "cql-pytest" to "cqlpy"

2024-11-06 16:48:36 +02:00

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utilities and libraries are in lib/; shared Python code is in pylib/.

alternator - Python tests which connect to a single server and use the DynamoDB API
unit, boost, raft - unit tests in C++
cqlpy - Python tests which connect to a single server and use CQL
topology* - tests that set up clusters and add/remove nodes
cql - approval tests that use CQL and pre-recorded output
rest_api - tests for Scylla REST API Port 9000
scylla-gdb - tests for scylla-gdb.py helper script
nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories or subsystems, or when the test environment is significantly different from that of an existing suite, e.g. you plan to start scylladb with a different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.