mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Files

Botond Dénes fed2c6ba83 sstables/mx/reader: release column value buffer after consumed

data_consume_rows_context_m has a _column_value buffer it uses to read
key and column values into, preparing for parsing and consuming them.
This buffer is reset (released) in a few different cases:
* When using it for key - after consuming its content
* When using it for column value - when a column has no value

However, the buffer is not released when used for a column value and the
column is consumed. This means that if a large column is read from the
sstable, this buffer can potentially linger and keep consuming memory
until either one of the other release scenarios is hit, or the reader is
destroyed.
Add a third release scenario, releasing the buffer after the row end was
consumed. This allows the buffer to be re-used between columns of the
same row, at the same time ensuring that a large buffer will not linger.

This patch can almost halve the memory consumption of reads in certain
circumstances. Case in point: the test
test_reader_concurrency_semaphore_memory_limit_engages starts to fail
after this fix, because the read doesn't trigger the OOM limit anymore
and needs doubling of the concurrency to keep passing.

This issue was found in a dtest
(`test_ics_refresh_with_big_sstable_files`), which writes some large
cells of up to 7MiB. After reading the row containing this large cell,
the reader holds on to the 7MiB buffer causing the semaphore's OOM
protection to kick in down the line.

Fixes: https://github.com/scylladb/scylladb/issues/21160

Closes scylladb/scylladb#21132

2024-11-14 17:24:53 +01:00

aggregate_fcts_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

allocation_strategy_test.cc

…

alternator_unit_test.cc

treewide: use seastar::format() or fmt::format() explicitly

2024-09-11 23:21:40 +03:00

anchorless_list_test.cc

…

auth_passwords_test.cc

…

auth_resource_test.cc

test: use generic boost_test_print_type()

2024-05-20 12:56:20 +03:00

auth_test.cc

cql3: Rename SALTED HASH to HASHED PASSWORD

2024-10-30 14:07:58 +02:00

aws_error_injection_test.cc

aws_errors: Make error messages more verbose.

2024-11-07 21:01:25 +02:00

aws_errors_test.cc

aws_errors: Change aws_error::parse to return std::optional<>

2024-11-07 21:01:25 +02:00

batchlog_manager_test.cc

db/batchlog_manager: do_batch_log_replay(): add cleanup flag

2024-10-30 11:07:57 +08:00

big_decimal_test.cc

…

bloom_filter_test.cc

boost/bloom_filter_test: wait for total memory reclaimed update

2024-07-26 08:15:11 +03:00

bptree_test.cc

test: Move other collection-testing headers from unit to boost

2024-09-24 13:42:13 +03:00

bptree_validation.hh

test: include fmt/iostream.h and iostream when appropriate

2024-11-12 17:34:08 +02:00

broken_sstable_test.cc

test: Make tests use schema_builder instead of make_shared_schema

2024-09-05 19:31:30 +03:00

btree_test.cc

test: Move other collection-testing headers from unit to boost

2024-09-24 13:42:13 +03:00

btree_validation.hh

test: Move other collection-testing headers from unit to boost

2024-09-24 13:42:13 +03:00

bytes_ostream_test.cc

serialization: replace boost::type with std::type_identity

2024-11-05 00:43:27 +01:00

cache_algorithm_test.cc

auth: do not include unused headers

2024-06-17 17:33:55 +03:00

cache_mutation_reader_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

cached_file_test.cc

cached_file: Adapt page_view to ContiguousSharedBuffer

2024-09-27 01:25:15 +02:00

caching_options_test.cc

…

canonical_mutation_test.cc

canonical_mutation: add make_canonical_mutation_gently

2024-05-02 19:37:04 +03:00

cartesian_product_test.cc

…

castas_fcts_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

cdc_generation_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

cdc_test.cc

treewide: s/boost::adaptors::map_values/std::views::values/

2024-10-27 21:32:45 +02:00

cell_locker_test.cc

…

checksum_utils_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

chunked_managed_vector_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

chunked_vector_test.cc

utils: chunked_vector: add from_range_t constructor

2024-10-31 19:32:16 +02:00

clustering_ranges_walker_test.cc

…

CMakeLists.txt

build: cmake: correct some tests' KIND

2024-10-22 07:10:47 +03:00

collection_stress.hh

test: Move stress-collecton header from unit to boost

2024-09-24 13:42:13 +03:00

column_mapping_test.cc

test: use generic boost_test_print_type()

2024-05-20 12:56:20 +03:00

commitlog_cleanup_test.cc

raft_group0_client: uninclude "db/system_keyspace.hh"

2024-09-28 16:31:53 +03:00

commitlog_test.cc

treewide: move log.hh into utils/log.hh

2024-10-22 06:54:46 +03:00

compaction_group_test.cc

compaction::table_state: implement get_token_range_after_split() wrapper

2024-11-11 12:24:00 +05:30

compound_test.cc

compound_compat: replace use of boost ranges with std ranges

2024-10-30 19:58:07 +02:00

compress_test.cc

…

config_test.cc

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

continuous_data_consumer_test.cc

…

counter_test.cc

compound_compat: replace use of boost ranges with std ranges

2024-10-30 19:58:07 +02:00

cql_auth_query_test.cc

treewide: use seastar::format() or fmt::format() explicitly

2024-09-11 23:21:40 +03:00

cql_auth_syntax_test.cc

cql3: Rename SALTED HASH to HASHED PASSWORD

2024-10-30 14:07:58 +02:00

cql_functions_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

cql_query_group_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

cql_query_large_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

cql_query_like_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

cql_query_test.cc

test: move a materialized-view test from boost to cqlpy

2024-11-14 16:55:58 +03:00

crc_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

data_listeners_test.cc

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

database_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

dirty_memory_manager_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

double_decker_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

duration_test.cc

…

dynamic_bitset_test.cc

treewide: s/boost::algorithm::any_of/std::ranges::any_of/

2024-11-05 14:06:09 +08:00

enum_option_test.cc

…

enum_set_test.cc

…

error_injection_test.cc

error_injection: Remove unused inject(sleep, then invoke) overload

2024-11-05 09:56:08 +02:00

estimated_histogram_test.cc

test/estimated_histogram_test Add summary tests

2024-08-22 23:34:24 +03:00

exception_container_test.cc

…

exceptions_fallback_test.cc

…

exceptions_optimized_test.cc

…

exceptions_test.inc.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

expr_test.cc

cql3: introduce dialect infrastructure

2024-08-29 21:19:23 +03:00

extensions_test.cc

…

filtering_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

flush_queue_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

fragmented_temporary_buffer_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

frozen_mutation_test.cc

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

generic_server_test.cc

test/generic_server: add test case

2024-08-28 10:59:44 +02:00

gossiping_property_file_snitch_test.cc

test: use generic boost_test_print_type()

2024-05-20 12:56:20 +03:00

group0_cmd_merge_test.cc

raft_group0_client: uninclude "db/system_keyspace.hh"

2024-09-28 16:31:53 +03:00

group0_test.cc

raft: add the check for the group0 tables

2024-10-08 21:08:11 +02:00

hash_test.cc

…

hashers_test.cc

…

hint_test.cc

test/boost/hint_test.cc: Add missing parse() callback

2024-06-19 23:19:33 +02:00

idl_test.cc

serialization: replace boost::type with std::type_identity

2024-11-05 00:43:27 +01:00

index_reader_test.cc

sstables: bsearch_clustered_cursor: Narrow down range using "end" position of the block

2024-10-03 14:16:05 +02:00

index_with_paging_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

input_stream_test.cc

…

intrusive_array_test.cc

…

json_cql_query_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

json_test.cc

…

keys_test.cc

serialization: replace boost::type with std::type_identity

2024-11-05 00:43:27 +01:00

large_paging_state_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

like_matcher_test.cc

…

limiting_data_source_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

linearizing_input_stream_test.cc

…

lister_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

loading_cache_test.cc

…

locator_topology_test.cc

treewide: move log.hh into utils/log.hh

2024-10-22 06:54:46 +03:00

log_heap_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

logalloc_standard_allocator_segment_pool_backend_test.cc

…

logalloc_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

managed_bytes_test.cc

…

managed_vector_test.cc

…

map_difference_test.cc

…

memtable_test.cc

replica: implement memtable_flush_period_in_ms schema option

2024-10-17 13:41:15 +03:00

multishard_combining_reader_as_mutation_source_test.cc

treewide: s/boost::adaptors::map_values/std::views::values/

2024-10-27 21:32:45 +02:00

multishard_mutation_query_test.cc

serialization: replace boost::type with std::type_identity

2024-11-05 00:43:27 +01:00

murmur_hash_test.cc

treewide: include seastar/core/format.hh instead of seastar/core/print.hh

2024-11-14 17:45:07 +02:00

mutation_fragment_test.cc

schema_registry: stop including replica/database.hh

2024-11-04 13:16:27 +01:00

mutation_query_test.cc

readers: Use reversed schema and native reversed slices

2024-08-13 10:03:46 +02:00

mutation_reader_another_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

mutation_reader_test.cc

test/boost/mutation_reader_test: add test for multishard reader buffer hint

2024-11-07 02:47:54 -05:00

mutation_test.cc

treewide: Remove table::config::datadir

2024-09-19 13:06:39 +03:00

mutation_writer_test.cc

treewide: s/boost::adaptors::map_values/std::views::values/

2024-10-27 21:32:45 +02:00

mvcc_test.cc

mvcc_test: fix a benign failure of test_apply_to_incomplete_respects_continuity

2024-11-08 06:08:39 +01:00

network_topology_strategy_test.cc

treewide: move log.hh into utils/log.hh

2024-10-22 06:54:46 +03:00

nonwrapping_interval_test.cc

test/boost: include test/lib/test_utils.hh

2024-05-26 12:32:43 +08:00

observable_test.cc

…

partitioner_test.cc

treewide: s/boost::algorithm::all_of/std::ranges::all_of/

2024-11-05 14:05:24 +08:00

per_partition_rate_limit_test.cc

test: Avoid using deprecated sharded API

2024-05-16 00:28:47 +02:00

pretty_printers_test.cc

…

querier_cache_test.cc

treewide: use std::ranges sort functions rather than boost

2024-10-01 14:19:05 +03:00

query_processor_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

radix_tree_printer.hh

test: Move other collection-testing headers from unit to boost

2024-09-24 13:42:13 +03:00

radix_tree_test.cc

test: Move other collection-testing headers from unit to boost

2024-09-24 13:42:13 +03:00

range_assert.hh

…

range_tombstone_list_assertions.hh

…

range_tombstone_list_test.cc

…

rate_limiter_test.cc

…

reader_concurrency_semaphore_test.cc

sstables/mx/reader: release column value buffer after consumed

2024-11-14 17:24:53 +01:00

README.md

test: rename "cql-pytest" to "cqlpy"

2024-11-06 16:48:36 +02:00

recent_entries_map_test.cc

…

repair_test.cc

test/boost/repair_test: close reader after use

2024-09-13 06:52:26 -04:00

restrictions_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

result_utils_test.cc

…

reusable_buffer_test.cc

treewide: change assert() to SCYLLA_ASSERT()

2024-08-05 08:23:35 +03:00

role_manager_test.cc

cql3: auth: use mutation collector for alter role

2024-06-04 15:43:04 +02:00

row_cache_test.cc

treewide: s/boost::algorithm::any_of/std::ranges::any_of/

2024-11-05 14:06:09 +08:00

rust_test.cc

…

s3_test.cc

aws_errors: Make error messages more verbose.

2024-11-07 21:01:25 +02:00

schema_change_test.cc

compound_compat: replace use of boost ranges with std ranges

2024-10-30 19:58:07 +02:00

schema_changes_test.cc

treewide: rename flat_mutation_reader_v2 to mutation_reader

2024-06-21 07:12:06 +03:00

schema_loader_test.cc

test: Use sstables::test_env to make sstables for schema loader test

2024-09-09 14:22:58 +03:00

schema_registry_test.cc

db: move schema merging code into a separate unit

2024-09-23 12:01:36 +02:00

secondary_index_test.cc

test/boost/secondary_index_test: add test for use after free

2024-11-08 14:16:19 +01:00

serialization_test.cc

serialization: replace boost::type with std::type_identity

2024-11-05 00:43:27 +01:00

serialized_action_test.cc

…

service_level_controller_test.cc

service/qos/service_level_controller: notify subscribers on effective

2024-08-08 10:42:09 +02:00

sessions_test.cc

…

small_vector_test.cc

utils: small_vector: support from_range_t

2024-10-21 09:31:38 +03:00

snitch_reset_test.cc

treewide: include used headers

2024-05-27 17:34:38 +03:00

sorting_test.cc

test/boost: add test for topological sorting

2024-05-16 13:30:03 +02:00

sstable_3_x_test.cc

Merge 'compound: replace boost ranges with std ranges' from Avi Kivity

2024-10-30 11:02:51 +01:00

sstable_compaction_test.cc

treewide: s/boost::algorithm::any_of/std::ranges::any_of/

2024-11-05 14:06:09 +08:00

sstable_conforms_to_mutation_source_test.cc

readers: Use reversed schema and native reversed slices

2024-08-13 10:03:46 +02:00

sstable_datafile_test.cc

treewide: s/boost::algorithm::all_of/std::ranges::all_of/

2024-11-05 14:05:24 +08:00

sstable_directory_test.cc

test: Fix test_multiple_data_dirs

2024-10-07 12:04:23 +03:00

sstable_generation_test.cc

…

sstable_move_test.cc

treewide: use seastar::format() or fmt::format() explicitly

2024-09-11 23:21:40 +03:00

sstable_mutation_test.cc

treewide: use std::ranges sort functions rather than boost

2024-10-01 14:19:05 +03:00

sstable_partition_index_cache_test.cc

…

sstable_resharding_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

sstable_set_test.cc

boost/sstable_set_test: add testcase to test tablet_sstable_set copy constructor

2024-08-17 23:38:05 +05:30

sstable_test.cc

test: Squash test::change_generation_number() into test::store()

2024-10-24 11:29:17 +03:00

sstable_test.hh

test: Tune up indentation in uncompressed_schema()

2024-09-05 19:33:29 +03:00

stall_free_test.cc

utils/stall_free: introduce reserve_gently

2024-06-18 23:36:30 +05:30

statement_restrictions_test.cc

cql3: statement_restrictions, expr: move restrictions-related expression utilities out of expression.cc

2024-09-22 11:00:51 +03:00

storage_proxy_test.cc

treewide: use seastar::format() or fmt::format() explicitly

2024-09-11 23:21:40 +03:00

string_format_test.cc

test: string_format_test: disable test if {fmt} >= 10.0.0

2024-05-03 11:34:23 +03:00

suite.yaml

boost/bloom_filter_test: add testcase to verify unlinked sstables are not reloaded

2024-07-19 13:15:57 +05:30

summary_test.cc

…

tablets_test.cc

tablets_test: test enable/disable tablets when creating a new keyspace

2024-11-07 13:57:40 +02:00

tagged_integer_test.cc

…

token_metadata_test.cc

locator: topology: add_or_update_endpoint: use none as the default node state

2024-08-29 10:37:07 +02:00

top_k_test.cc

treewide: include used headers

2024-05-27 17:34:38 +03:00

total_order_check.hh

treewide: use seastar::format() or fmt::format() explicitly

2024-09-11 23:21:40 +03:00

tracing_test.cc

…

transport_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

tree_test_key.hh

test: Move other collection-testing headers from unit to boost

2024-09-24 13:42:13 +03:00

types_test.cc

treewide: remove dependency on boost asio address_v4

2024-10-01 14:00:50 +03:00

user_function_test.cc

test/boost: include test/lib/test_utils.hh

2024-05-26 12:32:43 +08:00

user_types_test.cc

treewide: replace boost::irange with std::views::iota where possible

2024-10-03 10:33:33 +03:00

utf8_test.cc

…

UUID_test.cc

treewide: use seastar::format() or fmt::format() explicitly

2024-09-11 23:21:40 +03:00

view_build_test.cc

test/boost: stop using ranges::to()

2024-10-19 13:21:20 +08:00

view_complex_test.cc

compaction: replace optional<task_info> with task_info param

2024-08-02 14:38:46 +02:00

view_schema_ckey_test.cc

test/lib: do not include unused headers

2024-05-05 23:31:48 +03:00

view_schema_pkey_test.cc

test/lib: do not include unused headers

2024-05-05 23:31:48 +03:00

view_schema_test.cc

node_update_backlog: divide adding and fetching backlogs

2024-06-06 10:45:13 +02:00

vint_serialization_test.cc

…

virtual_reader_test.cc

treewide: drop includes of <boost/range/adaptors.hpp>

2024-10-20 17:17:11 +03:00

virtual_table_mutation_source_test.cc

…

virtual_table_test.cc

…

wasm_alloc_test.cc

main, test: use seastar::handle_signal() instead

2024-09-19 18:10:07 +03:00

wasm_test.cc

…

wrapping_interval_test.cc

test/boost: include test/lib/test_utils.hh

2024-05-26 12:32:43 +08:00

README.md

Scylla unit tests using C++ and the Boost test framework

The source files in this directory are Scylla unit tests written in C++ using the Boost.Test framework. These unit tests come in three flavors:

Some simple tests that check stand-alone C++ functions or classes use Boost's BOOST_AUTO_TEST_CASE.
Some tests require Seastar features, and need to be declared with Seastar's extensions to Boost.Test, namely SEASTAR_TEST_CASE.
Even more elaborate tests require not just a functioning Seastar environment but also a complete (or partial) Scylla environment. Those tests use the do_with_cql_env() or do_with_cql_env_thread() function to set up a mostly-functioning environment behaving like a single-node Scylla, in which the test can run.

While we have many tests of the third flavor, writing new tests of this type should be reserved for white-box tests - tests where it is necessary to inspect or control Scylla internals that do not have user-facing APIs such as CQL. In contrast, black-box tests - tests that can be written using only user-facing APIs - should be written in one of the newer test frameworks that we offer, such as test/cqlpy or test/alternator (in Python, using the CQL or DynamoDB APIs respectively) or test/cql (using textual CQL commands), or - if more than one Scylla node is needed for a test - using the test/topology* framework.

Running tests

Because these are C++ tests, they need to be compiled before running. To compile a single test executable row_cache_test, use a command like

ninja build/dev/test/boost/row_cache_test

You can also use ninja dev-test to build all C++ tests, or use ninja dev-build to build the C++ tests and also the full Scylla executable (however, note that the full Scylla executable isn't needed to run Boost tests).

Replace "dev" by "debug" or "release" in the examples above and below to use the "debug" build mode (which, importantly, compiles the test with ASAN and UBSAN enabled and helps catch difficult-to-catch use-after-free bugs) or the "release" build mode (optimized for run speed).
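As a rough illustration of the mode substitution, the sketch below prints the ninja command for each of the three build modes named here. The helper name test_binary is hypothetical (it is not part of the repository), and the build/<mode>/test/boost/<name> layout is taken from the examples in this README.

```shell
# Hypothetical helper (not part of scylladb.git): print the path of a Boost
# test executable for a given build mode, following the layout used in the
# examples in this README: build/<mode>/test/boost/<name>.
test_binary() {
    mode="$1"   # e.g. dev, debug or release
    name="$2"   # e.g. row_cache_test
    printf 'build/%s/test/boost/%s\n' "$mode" "$name"
}

# Print (rather than run) the build command for each mode mentioned above.
for mode in dev debug release; do
    echo "ninja $(test_binary "$mode" row_cache_test)"
done
```

The loop only echoes the commands, so it is safe to run outside a Scylla checkout; drop the echo to actually build.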

To run an entire test file row_cache_test, including all its test functions, use a command like:

build/dev/test/boost/row_cache_test -- -c1 -m1G

To run a single test function test_reproduce_18045() from the longer test file, use a command like:

build/dev/test/boost/row_cache_test -t test_reproduce_18045 -- -c1 -m1G

In these command lines, the parameters before the -- are passed to Boost.Test, while the parameters after the -- are passed to the test code, and in particular to Seastar. In this example Seastar is asked to run on one CPU (-c1) and use 1G of memory (-m1G) instead of hogging the entire machine. The Boost.Test option -t test_reproduce_18045 asks it to run just this one test function instead of all the test functions in the executable.
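The split around the -- separator can be sketched as a small shell function that assembles the command line. The function name boost_test_command is an assumption made for illustration; the options it emits (-t, -c1, -m1G) are only the ones shown in the examples above.

```shell
# Hypothetical helper: print (not run) the command line for a Boost test.
#   $1 = path to the test executable
#   $2 = optional name of a single test function
boost_test_command() {
    exe="$1"
    case_name="${2:-}"
    boost_args=""
    if [ -n "$case_name" ]; then
        # Options placed before "--" are consumed by Boost.Test.
        boost_args=" -t $case_name"
    fi
    # Options placed after "--" go to the test code, in particular to Seastar:
    # one CPU (-c1) and 1G of memory (-m1G).
    printf '%s%s -- -c1 -m1G\n' "$exe" "$boost_args"
}

boost_test_command build/dev/test/boost/row_cache_test
boost_test_command build/dev/test/boost/row_cache_test test_reproduce_18045
```

Printing the command instead of executing it keeps the sketch harmless; piping the output to sh, or replacing printf with a direct invocation, would run the test.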

Unfortunately, interrupting a running test with control-C doesn't work. This is a known bug (#5696). Kill a test with SIGKILL (-9) if you need to kill it while it's running.
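A minimal sketch of the SIGKILL approach follows. A background sleep stands in for the running test executable, which is an assumption made so the snippet can run anywhere; with a real test you would take the process ID from your shell's job control or from ps.

```shell
# Stand-in for a running test executable (assumption for illustration only).
sleep 300 &
test_pid=$!

# SIGKILL cannot be caught or ignored by the process, so it terminates the
# test even when control-C has no effect.
kill -9 "$test_pid"

# Reap the process. For a child terminated by a signal, wait reports an exit
# status greater than 128.
status=0
wait "$test_pid" || status=$?
echo "exit status: $status"
```

Because SIGKILL gives the process no chance to clean up, temporary files created by the test may be left behind.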

Boost tests can also be run using test.py - which is a script that provides a uniform way to run all tests in scylladb.git - C++ tests, Python tests, etc.

Writing tests

Because of the large build time and build size of each separate test executable, it is recommended to put test functions into relatively large source files. But not too large - to keep compilation time of a single source file (during development) at reasonable levels.

When adding new source files in test/boost, don't forget to list the new source file in configure.py and also in CMakeLists.txt. The former is needed by our CI, but the latter is preferred by some developers.
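One way to catch a forgotten entry is a plain text search for the test name in both build files. The sketch below is a hypothetical check, not an existing script in the repository; it does a fixed-string match only and does not understand the syntax of either file.

```shell
# Hypothetical check: report whether a test name is mentioned in each of the
# given build files. Returns non-zero if any file does not mention it.
check_listed() {
    name="$1"
    shift
    missing=0
    for f in "$@"; do
        if grep -qF -- "$name" "$f"; then
            echo "$f: lists $name"
        else
            echo "$f: MISSING $name"
            missing=1
        fi
    done
    return "$missing"
}

# Example usage (the file paths are assumptions; adjust them to your checkout):
#   check_listed my_new_test configure.py test/boost/CMakeLists.txt
```

A substring match can produce false positives when one test name is a prefix of another, so treat a "lists" result as a hint rather than proof.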