mirror of https://github.com/scylladb/scylladb.git synced 2026-05-29 19:21:01 +00:00

Files

Pavel Emelyanov 3f7ee3ce5d Merge 'batchlog: make replay (flush) faster' from Botond Dénes

The batchlog table contains an entry for each logged batch that is processed by the local node as coordinator. These entries are typically very short lived, they are inserted when the batch is processed and deleted immediately after the batch is successfully applied.
When a table has `tombstone_gc = {'mode': 'repair'}` enabled, every repair has to flush all hints and batchlogs, so that we can be certain that there is no live data in any of these, older than the last repair. Since batches can contain member queries from any number of tables, the whole batchlog has to be flushed, even if repair-mode tombstone-gc is enabled for a single table.

Flushing the batchlog table happens by doing a batchlog replay. This involves reading the entire content of this table, and attempting to replay+delete any live entries (that are old enough to be replayed).  Under normal operating circumstances, 99%+ of the content of the batchlog table is partition tombstones.  Because of this, scanning the content of this table has to process thousands to millions of tombstones. This was observed to require up to 20 minutes to finish, causing repairs to slow down to a crawl, as the batchlog-flush has to be repeated at the end of the repair of each token-range.

When trying to address this problem, the first idea was that we should expedite the garbage-collection of these accumulated tombstones. This experiment failed, see https://github.com/scylladb/scylladb/pull/23752. The commitlog proved to be an impossible to bypass barrier, preventing quick garbage-collection of tombstones. So long as a single commit-log segment is alive, holding content from the batchlog table, all tombstones written after are blocked from GC.
The second approach, represented by this PR, is to not rely in tombstone GC to reduce the tombstone amount. Instead restructure the table such that a single higher-order tombstone can be used to shadow and allow for the eviction of the myriads of individual batchlog entry tombstones. This is realized by reorganizing the batchlog table such that individual batches are rows, not partitions.
This new schema is introduced by the new `system.batchlog_v2` table, introduced by this PR:

    CREATE TABLE system.batchlog_v2 (
        version int,
        stage int,
        shard int,
        written_at timestamp,
        id uuid,
        data blob,
        PRIMARY KEY ((version, stage, shard), written_at, id));

The new schema organization has the following goals:
1) Make post-replay batchlog cleanup possible with a simple range-tombstone. This allows dropping the individual dead batchlog entries, as they are shadowed by a higher level tombstone. This enables dropping tombstones without tombstone GC.
2) To make the above possible, introduce the stage key component: batchlog entries that fail the first replay attempt, are moved to the failed_replay stage, so the initial stage can be cleaned up safely.
3) Spread out the data among Scylla shards, via the batchlog shard column.
4) Make batchlog entries ordered by the batchlog create time (id). This allows for selecting batchlogs to replay, without post-filtering of batchlogs that are too young to be replayed.

Fixes: https://github.com/scylladb/scylladb/issues/23358

This is an improvement, normally not a backport-candidate. We might override this and backport to allow wider use of `tombstone_gc: {'mode': 'repair'}`.

Closes scylladb/scylladb#26671

* github.com:scylladb/scylladb:
  db/config: change batchlog_replay_cleanup_after_replays default to 1
  test/boost/batchlog_manager_test: add test for batchlog cleanup
  replica/mutation_dump: always set position weight for clustering positions
  service/storage_proxy: s/batch_replay_throw/storage_proxy_fail_replay_batch/
  test/lib: introduce error_injection.hh
  utils/error_injection: add debug log to disable() and disable_all()
  test/lib/cql_test_env: forward config to batchlog
  test/lib/cql_test_env: add batch type to execute_batch()
  test/lib/cql_assertions: add with_size(predicate) overload
  test/lib/cql_assertions: add source location to fail messages
  test/lib/cql_assertions: columns_assertions: add assert_for_columns_of_each_row()
  test/lib/cql_assertions: rows_assertions::assert_for_columns_of_row(): add index bound check
  test/lib/cql_assertions: columns_assertions: add T* with_typed_column() overload
  db/batchlog_manager: config: s/write_timeout/reply_timeot/
  db,service: switch to system.batchlog_v2
  db/system_keyspace: introduce system.batchlog_v2
  service,db: extract generation of batchlog delete mutation
  service,db: extract get_batchlog_mutation_for() from storage-proxy
  db/batchlog_manager: only consider propagation delay with tombstone-gc=repair
  db/batchlog_manager: don't drop entire batch if one mutations' table was dropped
  data_dictionary: table: add get_truncation_time()
  db/batchlog_manager: batch(): replace map_reduce() with simple loop
  db/batchlog_manager: finish coroutinizing replay_all_failed_batches
  db/batchlog_manager: improve replayAllFailedBatches logs

2025-12-15 15:05:19 +03:00

__init__.py

test.py: Add the possibility to run boost test from pytest

2025-02-07 21:40:25 +01:00

address_map_test.cc

address_map: Use barrier() to wait for replication

2025-11-19 15:21:02 +01:00

advanced_rpc_compressor_test.cc

message: move RPC compression from utils/ to message/

2025-09-30 17:03:09 +03:00

aggregate_fcts_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

allocation_strategy_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

alternator_unit_test.cc

alternator: adds expression cache implementation

2025-09-28 04:27:44 +02:00

anchorless_list_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

auth_passwords_test.cc

test/boost: add too_long_password to auth_passwords_test

2025-12-10 15:36:18 +01:00

auth_resource_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

auth_test.cc

service_level_controller: automatically create sl:driver

2025-10-08 08:24:43 +02:00

aws_error_injection_test.cc

treewide: move away from accessing httpd::request::query_parameters

2025-09-24 11:52:15 +03:00

aws_errors_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

batchlog_manager_test.cc

test/boost/batchlog_manager_test: add test for batchlog cleanup

2025-12-02 14:21:26 +02:00

big_decimal_test.cc

utils/big_decimal: fix scale overflow when parsing values with large exponents

2025-06-26 15:29:28 +03:00

bloom_filter_test.cc

test/boost/bloom_filter_test: add test_rebuild_from_temporary_hashes

2025-09-29 22:15:26 +02:00

bptree_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bptree_validation.hh

test/boost/bptree_validation.hh: add missing include <fmt/format.h>

2025-01-23 06:05:57 -05:00

broken_sstable_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bti_index_test.cc

sstables/trie: fix an assertion violation in bti_partition_index_writer_impl::write_last_key

2025-11-07 11:25:07 +02:00

bti_key_translation_test.cc

sstables/trie: BTI-translate the entire partition key at once

2025-09-29 04:10:40 +02:00

bti_node_sink_test.cc

sstables/trie: fix a special case in max_offset_from_child

2025-09-07 00:30:15 +02:00

btree_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

btree_validation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

bytes_ostream_test.cc

bytes_ostream: overload write() to support writing from FragmentedView

2025-07-01 22:19:07 +05:30

cache_algorithm_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

cache_mutation_reader_test.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

cached_file_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

caching_options_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

canonical_mutation_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cartesian_product_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

castas_fcts_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

cdc_generation_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cdc_test.cc

vector_search: Restrict vector index tests to tablets only

2025-11-25 09:26:16 +02:00

cell_locker_test.cc

treewide: Move replica related files to replica directory

2025-09-18 08:00:35 +03:00

checksum_utils_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

chunked_managed_vector_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

chunked_vector_test.cc

test: avoid #include <boost/test/included/...>

2025-09-22 15:26:06 +03:00

clustering_ranges_walker_test.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

CMakeLists.txt

test/boost: coroutinize auth_passwords_test

2025-12-10 15:36:18 +01:00

collection_stress.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

column_mapping_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

combined_tests.cc

test: combined_test: relicense

2024-12-25 13:53:54 +02:00

commitlog_cleanup_test.cc

db/config: add tablets_mode_for_new_keyspaces option

2025-03-24 14:54:45 +02:00

commitlog_test.cc

commitlog::read_log_file: Check for eof position on all data reads

2025-11-28 15:26:46 +03:00

compaction_group_test.cc

compaction: remove using namespace {compaction,sstables}

2025-09-25 15:03:57 +03:00

comparable_bytes_test.cc

vector_search: remove dependence on cql3

2025-10-21 17:41:55 +03:00

compound_test.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

compress_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

config_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

conftest.py

test.py: pytest: support --mode/--repeat in a common way for all tests

2025-08-17 15:26:23 +00:00

continuous_data_consumer_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

counter_test.cc

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

cql_auth_query_test.cc

vector_search: Restrict vector index tests to tablets only

2025-11-25 09:26:16 +02:00

cql_auth_syntax_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cql_functions_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

cql_query_group_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

cql_query_large_test.cc

treewide: Rename table_state to compaction_group_view

2025-08-08 06:51:28 +03:00

cql_query_like_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

cql_query_test.cc

Update seastar submodule

2025-11-30 12:38:47 +02:00

crc_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

data_listeners_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

database_test.cc

Revert "Merge 'Add option to use sstable identifier in snapshot' from Benny Halevy"

2025-12-12 03:55:13 +00:00

dict_trainer_test.cc

message: move RPC compression from utils/ to message/

2025-09-30 17:03:09 +03:00

dirty_memory_manager_test.cc

replica/memtable: move region_listener handlers from dirty_memory_manager to memtable

2025-06-20 11:42:30 +02:00

disk_space_monitor_test.cc

disk_space_monitor_test.cc: Start a monitor after fake space source function is registered

2025-09-18 15:03:34 +03:00

double_decker_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

duration_test.cc

treewide: Move type related files to a type directory As requested in #22110 , moved the files and fixed other includes and build system.

2025-09-17 17:32:19 +03:00

dynamic_bitset_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

encrypted_file_test.cc

encryption: add encrypted_data_source class

2025-07-06 09:18:39 +03:00

encryption_at_rest_test.cc

test::boost::encryption_at_rest: Remove redundant azure test indent

2025-11-05 10:22:23 +00:00

enum_option_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

enum_set_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

error_injection_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

estimated_histogram_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

exception_container_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

exceptions_fallback_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

exceptions_optimized_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

exceptions_test.inc.cc

exceptions: Add try_catch_nested to universally handle nested exceptions of the same type.

2025-03-26 11:15:13 +01:00

expr_test.cc

boost/expr_test: add vector expression tests

2025-01-28 21:14:49 +01:00

extensions_test.cc

sstables::file_io_extension: Make sstable argument to "wrap" const

2025-03-20 14:54:09 +00:00

file_stream_test.cc

file_stream_test: add sstable file streaming integrity verification test cases

2025-11-21 12:52:35 +01:00

filtering_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

flush_queue_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

fragmented_temporary_buffer_test.cc

transport: replace throwing protocol_exception with returns

2025-08-28 23:31:36 +02:00

frozen_mutation_test.cc

readers: mv from_mutations_v2.hh from_mutations.hh

2025-04-16 04:46:08 -04:00

gcp_object_storage_test.cc

utils::gcp::object_storage: Fix buffer alignment reordering trailing data

2025-11-21 09:36:13 +02:00

generic_server_test.cc

generic_server: transport: start using sl:driver for new connections

2025-10-08 08:25:12 +02:00

gossiping_property_file_snitch_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

group0_cmd_merge_test.cc

service/migration_manager: pass storage_proxy to prepare_keyspace_drop_announcement()

2025-08-27 08:55:47 +02:00

group0_test.cc

treewide: seastar module update

2025-11-27 12:34:22 +02:00

group0_voter_calculator_test.cc

raft: small fixes for voters code

2025-10-16 18:41:08 +02:00

hash_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

hashers_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

hint_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

idl_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

incremental_compaction_test.cc

compaction: remove using namespace {compaction,sstables}

2025-09-25 15:03:57 +03:00

index_reader_test.cc

test/boost/index_reader_test: prepare for ms sstables

2025-09-29 22:15:25 +02:00

index_with_paging_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

input_stream_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

intrusive_array_test.cc

utils: do not include unused headers

2025-01-14 07:56:39 -05:00

json_cql_query_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

json_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

keys_test.cc

keys: from_nodetool_style_string don't split single partition keys

2025-08-14 19:52:04 +03:00

large_paging_state_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

like_matcher_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

limiting_data_source_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

linearizing_input_stream_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

lister_test.cc

lister: Fix race between readdir and stat

2025-10-28 15:10:22 +02:00

loading_cache_test.cc

loading_cache_test: test_loading_cache_reload_during_eviction: use manual_clock

2025-03-31 14:53:06 +03:00

locator_topology_test.cc

tablets: prevent accidental copy of tablets_map

2025-07-22 15:07:26 +03:00

log_heap_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

logalloc_standard_allocator_segment_pool_backend_test.cc

…

logalloc_test.cc

logalloc_test: don't test performance in test background_reclaim

2025-05-06 18:59:18 +02:00

lru_string_map_test.cc

utils: add lru_string_map

2025-09-28 04:06:00 +02:00

managed_bytes_test.cc

managed_bytes: make empty managed_bytes constexpr friendly

2025-07-29 23:51:43 +03:00

managed_vector_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

map_difference_test.cc

treewide: Move misc files to utils directory

2025-07-21 11:56:40 +03:00

memtable_test.cc

tests: adjust for incremental repair

2025-08-08 06:49:17 +03:00

multishard_combining_reader_as_mutation_source_test.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

multishard_query_test.cc

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

murmur_hash_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_fragment_test.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

mutation_query_test.cc

mutation/mutation_compactor: add tombstone_gc_state to query ctor

2025-10-12 17:48:15 +03:00

mutation_reader_another_test.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

mutation_reader_test.cc

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

mutation_test.cc

mutation/mutation_compactor: add tombstone_gc_state to query ctor

2025-10-12 17:48:15 +03:00

mutation_writer_test.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

mvcc_test.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

network_topology_strategy_test.cc

mv: replace the simple/complex rack-aware pairing with exact rack matching

2025-12-02 10:52:36 +01:00

nonwrapping_interval_test.cc

test: nonwrapping_interval_test: verify an interval of tokens is trivial

2025-09-06 18:41:00 +03:00

object_storage_upload_test.cc

sstables::object_storage_client: Add multi-upload support for GS

2025-10-13 08:53:27 +00:00

observable_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partitioner_test.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

per_partition_rate_limit_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

pluggable_test.cc

utils: phased_barrier, pluggable: use named gate

2025-04-12 11:47:00 +03:00

pretty_printers_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

querier_cache_test.cc

mutation/mutation_compactor: add tombstone_gc_state to query ctor

2025-10-12 17:48:15 +03:00

query_processor_test.cc

test/lib/cql_test_env: add batch type to execute_batch()

2025-12-02 14:21:26 +02:00

radix_tree_printer.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

radix_tree_test.cc

test: Remove unused operator<<(radix_tree_test::test_data)

2025-10-15 11:57:56 +02:00

range_assert.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone_list_assertions.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone_list_test.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

rate_limiter_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

reader_concurrency_semaphore_test.cc

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

README.md

test.py: Add the possibility to run boost test from pytest

2025-02-07 21:40:25 +01:00

recent_entries_map_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

repair_test.cc

Revert "repair: Add tablet repair progress report support"

2025-12-11 12:18:11 +02:00

replicator_test.cc

utils: Introduce helper for replicated data structures

2025-11-19 15:21:02 +01:00

reservoir_sampling_test.cc

utils: introduce reservoir_sampling

2024-12-23 23:37:02 +01:00

rest_client_test.cc

rest_client: set version on http::request to avoid invalid state

2025-09-18 07:36:25 +03:00

restrictions_test.cc

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

result_utils_test.cc

test/result_utils: Do not assume map_reduce reducing order

2025-05-30 09:38:59 +02:00

reusable_buffer_test.cc

utils/reusable_buffer: accept non-throwing writer callbacks via result_with_exception

2025-07-17 16:40:02 +02:00

role_manager_test.cc

main: auth: add auth cache dependency to auth service

2025-11-26 12:01:31 +01:00

row_cache_test.cc

db: optimize cache invalidation following repair/streaming

2025-09-14 19:48:14 +03:00

rust_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

s3_test.cc

s3_client: simplify chunked download error handling using make_request

2025-10-23 15:58:11 +03:00

schema_change_test.cc

config: break out object_storage_endpoint_param preparing for multi storage

2025-10-13 08:53:24 +00:00

schema_changes_test.cc

sstables: add ms to all_sstable_versions

2025-09-29 22:15:25 +02:00

schema_loader_test.cc

test/boost/schema_loader_test.cc: Explicitly enable rf_rack_valid_keyspaces

2025-09-29 13:07:08 +02:00

schema_registry_test.cc

schema: add pointer to CDC schema

2025-10-21 14:13:43 +02:00

scoped_item_list_test.cc

utils: unit test for utils::scoped_item_list

2025-08-01 02:15:04 +03:00

secondary_index_test.cc

Merge 'mv: allow setting concurrency in PRUNE MATERIALIZED VIEW' from Wojciech Mitros

2025-12-04 11:47:41 +02:00

serialization_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

serialized_action_test.cc

utils: phased_barrier, pluggable: use named gate

2025-04-12 11:47:00 +03:00

service_level_controller_test.cc

qos: don't populate effective service level cache until auth is migrated to raft

2025-07-29 11:37:37 +02:00

sessions_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

small_vector_test.cc

test: avoid #include <boost/test/included/...>

2025-09-22 15:26:06 +03:00

snitch_reset_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sorting_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstable_3_x_test.cc

test/boost/sstable_3_x_test: add ms sstables to multi-version tests

2025-09-29 22:15:25 +02:00

sstable_compaction_test.cc

replica/table: keep track of total pre-compression file size

2025-11-13 00:49:57 +01:00

sstable_compression_config_test.cc

test/boost: Add tests for SSTable compression config options

2025-09-26 12:02:42 +03:00

sstable_compressor_factory_test.cc

test/boost/sstable_compressor_factory_test: fix thread-unsafe usage of Boost.Test

2025-10-12 17:16:51 +03:00

sstable_conforms_to_mutation_source_test.cc

sstables: add ms to all_sstable_versions

2025-09-29 22:15:25 +02:00

sstable_datafile_test.cc

test/lib/cql_assertions: columns_assertions: add T* with_typed_column() overload

2025-12-02 14:21:26 +02:00

sstable_directory_test.cc

test: Check file existence directly

2025-11-04 19:37:55 +01:00

sstable_generation_test.cc

test: ignore unused fmt::to_string() result

2025-03-24 10:19:09 +03:00

sstable_inexact_index_test.cc

test/boost/sstable_inexact_index_test: explicitly use a me sstable

2025-09-29 22:15:25 +02:00

sstable_move_test.cc

test: sstable_move_test: always use uuid sstable generation

2025-06-18 11:30:29 +03:00

sstable_mutation_test.cc

sstables: make sstable::estimated_keys_for_range asynchronous

2025-09-29 13:01:21 +02:00

sstable_partition_index_cache_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstable_resharding_test.cc

tests::boost: Add GS object storage cases to mirror S3 ones

2025-10-13 08:53:27 +00:00

sstable_set_test.cc

replica: Fix range reads spanning sibling tablets

2025-05-27 22:39:40 -03:00

sstable_tablet_streaming.cc

streaming: add get_sstables_by_tablet_range tests

2025-12-08 12:30:23 +02:00

sstable_test.cc

sstable_test: add verification testcases of SSTable components digests persistance

2025-12-04 21:09:01 +01:00

sstable_test.hh

compaction: move code to namespace compaction

2025-09-25 15:03:56 +03:00

stall_free_test.cc

utils: stall_free: add dispose_gently

2025-11-11 12:20:18 +02:00

statement_restrictions_test.cc

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

storage_proxy_test.cc

token_metadata: move make_token_metadata_ptr into shared_token_metadata class

2025-07-06 14:22:20 +03:00

stream_compressor_test.cc

message: move RPC compression from utils/ to message/

2025-09-30 17:03:09 +03:00

string_format_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

summary_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

symmetric_key_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

tablets_test.cc

tablet: scheduler: Do not emit conflicting migration in merge colocation

2025-11-28 11:17:12 +01:00

tagged_integer_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

test_config.yaml

test/boost/batchlog_manager_test: add test for batchlog cleanup

2025-12-02 14:21:26 +02:00

token_metadata_test.cc

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

top_k_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

total_order_check.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tracing_test.cc

test: compile unit tests into a single executable

2024-12-22 19:14:09 +02:00

transport_test.cc

transport: Don't use scattered_message

2025-10-17 10:17:08 +03:00

tree_test_key.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

trie_traversal_test.cc

sstables/trie: support reader_permit and trace_state properly

2025-09-17 12:22:40 +02:00

trie_writer_test.cc

sstables/trie/trie_writer: free nodes after they are flushed

2025-11-19 14:54:16 +02:00

types_test.cc

vector_search: remove dependence on cql3

2025-10-21 17:41:55 +03:00

unique_view_test.cc

utils: implement drop-in replacement for replacing boost::adaptors::uniqued

2025-01-21 16:24:45 +08:00

url_parse_test.cc

utils::http: Handle ipv6 numeric host part in URL:s

2025-12-04 11:38:41 +00:00

user_function_test.cc

treewide: include boost headers as "system" headers

2025-08-22 17:21:24 +03:00

user_types_test.cc

raft: make group0 Raft operation timeout configurable

2025-04-15 10:57:39 +03:00

utf8_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

UUID_test.cc

utils: fix get_random_time_UUID_from_micros to generate correct time uuid

2025-11-20 10:27:29 +02:00

view_build_test.cc

test/boost/view_build_test: increase number of retires

2025-12-08 23:14:01 +02:00

view_complex_test.cc

treewide: Rename table_state to compaction_group_view

2025-08-08 06:51:28 +03:00

view_schema_ckey_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

view_schema_pkey_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

view_schema_test.cc

test/boost/view_schema_test.cc: fix race in wait_until_built

2025-07-01 13:20:19 +03:00

vint_serialization_test.cc

test: avoid spaces when defining user-defined literal operator

2025-03-24 10:17:12 +03:00

virtual_reader_test.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

virtual_table_mutation_source_test.cc

everywhere: use utils::chunked_vector for list of mutations

2025-07-13 19:13:11 +03:00

virtual_table_test.cc

config: specialize config_from_string() for sstring

2025-01-26 15:53:12 +02:00

wasm_alloc_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

wasm_test.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

wrapping_interval_test.cc

treewide: include boost headers as "system" headers

2025-08-22 17:21:24 +03:00

README.md

Scylla unit tests using C++ and the Boost test framework

The source files in this directory are Scylla unit tests written in C++ using the Boost.Test framework. These unit tests come in three flavors:

Some simple tests that check stand-alone C++ functions or classes use Boost's BOOST_AUTO_TEST_CASE.
Some tests require Seastar features, and need to be declared with Seastar's extensions to Boost.Test, namely SEASTAR_TEST_CASE.
Even more elaborate tests require not just a functioning Seastar environment but also a complete (or partial) Scylla environment. Those tests use the do_with_cql_env() or do_with_cql_env_thread() function to set up a mostly-functioning environment behaving like a single-node Scylla, in which the test can run.

While we have many tests of the third flavor, writing new tests of this type should be reserved to white box tests - tests where it is necessary to inspect or control Scylla internals that do not have user-facing APIs such as CQL. In contrast, black-box tests - tests that can be written only using user-facing APIs, should be written in one of newer test frameworks that we offer - such as test/cqlpy or test/alternator (in Python, using the CQL or DynamoDB APIs respectively) or test/cql (using textual CQL commands), or - if more than one Scylla node is needed for a test - using the test/topology* framework.

Running tests

Because these are C++ tests, they need to be compiled before running. To compile a single test executable row_cache_test, use a command like

ninja build/dev/test/boost/row_cache_test

You can also use ninja dev-test to build all C++ tests, or use ninja deb-build to build the C++ tests and also the full Scylla executable (however, note that full Scylla executable isn't needed to run Boost tests).

Replace "dev" by "debug" or "release" in the examples above and below to use the "debug" build mode (which, importantly, compiles the test with ASAN and UBSAN enabling on and helps catch difficult-to-catch use-after-free bugs) or the "release" build mode (optimized for run speed).

To run an entire test file row_cache_test, including all its test functions, use a command like:

build/dev/test/boost/row_cache_test -- -c1 -m1G

to run a single test function test_reproduce_18045() from the longer test file, use a command like:

build/dev/test/boost/row_cache_test -t test_reproduce_18045 -- -c1 -m1G

In these command lines, the parameters before the -- are passed to Boost.Test, while the parameters after the -- are passed to the test code, and in particular to Seastar. In this example Seastar is asked to run on one CPU (-c1) and use 1G of memory (-m1G) instead of hogging the entire machine. The Boost.Test option -t test_reproduce_18045 asks it to run just this one test function instead of all the test functions in the executable.

Unfortunately, interrupting a running test with control-C while doesn't work. This is a known bug (#5696). Kill a test with SIGKILL (-9) if you need to kill it while it's running.

Boost tests can also be run using test.py - which is a script that provides a uniform way to run all tests in scylladb.git - C++ tests, Python tests, etc.

Execution with pytest

To run all tests with pytest execute

pytest test/boost

To execute all tests in one file, provide the path to the source filename as a parameter

pytest test/boost/aggregate_fcts_test.cc

Since it's a normal path, autocompletion works in the terminal out of the box.

To execute only one test function, provide the path to the source file and function name

pytest --mode dev test/boost/aggregate_fcts_test.cc::test_aggregate_avg

To provide a specific mode, use the next parameter --mode dev, if parameter isn't provided pytest tries to use ninja mode_list to find out the compiled modes.

Parallel execution is controlled by pytest-xdist and the parameter -n auto. This command starts tests with the number of workers equal to CPU cores. The useful command to discover the tests in the file or directory is

pytest --collect-only -q --mode dev test/boost/aggregate_fcts_test.cc

That will return all test functions in the file. To execute only one function from the test, you can invoke the output from the previous command. However, suffix for mode should be skipped. For example, output shows in the terminal something like this test/boost/aggregate_fcts_test.cc::test_aggregate_avg.dev. So to execute this specific test function, please use the next command

pytest --mode dev test/boost/aggregate_fcts_test.cc::test_aggregate_avg

Writing tests

Because of the large build time and build size of each separate test executable, it is recommended to put test functions into relatively large source files. But not too large - to keep compilation time of a single source file (during development) at reasonable levels.

When adding new source files in test/boost, don't forget to list the new source file in configure.py and also in CMakeLists.txt. The former is needed by our CI, but the latter is preferred by some developers.