scylladb/cql3 at d14eec8160d25e897dee4484412dff8ec404f774 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-04 22:13:19 +00:00

Avi Kivity c80dc57156 Merge 'batchlog replay: bypass tombstones generated by past replays' from Botond Dénes

The `system.batchlog` table has a partition for each batch that failed to complete. After finally applying the batch, the partition is deleted. Although the table has gc_grace_seconds = 0, tombstones can still accumulate in memory, because we don't purge partition tombstones from either the memtable or the cache. This can lead to the cache and memtable of this table accumulating many thousands or even millions of tombstones, making batchlog replay very slow. We didn't notice this before, because we would only replay all failed batches on unbootstrap, which is rare and a heavy and slow operation in its own right already.
With repair-based tombstone-gc however, we do a full batchlog replay at the beginning of each repair, and now this extra delay is noticeable.
Fix this by making sure batchlog replays don't have to scan through all the tombstones generated by previous replays:
* flush the `system.batchlog` memtable at the end of each batchlog replay, so it is cleared of tombstones
* bypass the cache
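
The effect described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not ScyllaDB code: `ToyBatchlog`, `replay()` and `scan_costs()` are hypothetical names, a tombstone is modelled as a `None` value, and "flush" is simplified to dropping tombstones from the in-memory structure. It shows why the number of entries a replay has to scan keeps growing when tombstones from earlier replays stay in memory, and stays flat when they are cleared after each replay.

```python
from dataclasses import dataclass, field


@dataclass
class ToyBatchlog:
    """Toy stand-in for an in-memory table; NOT ScyllaDB's memtable or cache."""

    # batch id -> payload, or None when the partition was deleted (a tombstone)
    entries: dict = field(default_factory=dict)

    def add(self, batch_id, payload):
        """Record a batch that failed to complete."""
        self.entries[batch_id] = payload

    def replay(self, flush_after=False):
        """Apply and delete every live batch; return how many entries were scanned."""
        scanned = 0
        for batch_id, payload in list(self.entries.items()):
            scanned += 1  # tombstones still have to be stepped over
            if payload is not None:
                self.entries[batch_id] = None  # the delete leaves a tombstone behind
        if flush_after:
            # Simplified "flush": the in-memory structure is cleared of tombstones.
            self.entries = {k: v for k, v in self.entries.items() if v is not None}
        return scanned


def scan_costs(rounds, batches_per_round, flush_after):
    """Run several replay rounds and return the scan cost of each one."""
    table = ToyBatchlog()
    costs = []
    next_id = 0
    for _ in range(rounds):
        for _ in range(batches_per_round):
            table.add(next_id, b"mutation")
            next_id += 1
        costs.append(table.replay(flush_after=flush_after))
    return costs


if __name__ == "__main__":
    print(scan_costs(3, 100, flush_after=False))  # [100, 200, 300]
    print(scan_costs(3, 100, flush_after=True))   # [100, 100, 100]
```

In this model the scan cost without a flush grows linearly with the number of past replays, which matches the slowdown described above once repair starts triggering a full replay regularly.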

Fixes: https://github.com/scylladb/scylladb/issues/19376

Although this is not a regression -- replay was like this since forever -- now that repair calls into batchlog replay, every release which uses repair-based tombstone-gc should get this fix.

Closes scylladb/scylladb#19377

* github.com:scylladb/scylladb:
  db/batchlog_manager: bypass cache when scanning batchlog table
  db/batchlog_manager: replace open-coded paging with internal one
  db/batchlog_manager: implement cleanup after all batchlog replay
  cql3/query_processor: for_each_cql_result(): move func to the coro frame

2024-06-25 16:11:01 +03:00

..

cql3: remove unused operator<<

2024-06-14 09:45:35 +03:00

functions: Do not crash when schema is missing

2024-05-15 17:20:40 +03:00

statement_restrictions: partition_ranges_from_singles: no need to default-initialize result

2024-06-25 12:11:28 +03:00

cql: fix regression in SELECT * GROUP BY

2023-12-25 17:52:57 +02:00

Merge 'cql3/statement/select_statement: do not parallelize single-partition aggregations' from Michał Jadwiszczak

2024-06-21 08:50:00 +02:00

assignment_testable.hh

cql3: remove unused operator<<

2024-06-14 09:45:35 +03:00

attributes.cc

types: do not include unused headers

2024-04-23 12:08:23 +03:00

attributes.hh

…

authorized_prepared_statements_cache.hh

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

cf_name.cc

…

cf_name.hh

cql3: Remove unused cf_name::operator<<

2024-03-08 15:14:52 +02:00

CMakeLists.txt

build: bring abseil submodule back

2024-05-05 23:31:09 +03:00

column_identifier.cc

cql3: add fmt::formatter for column_identifier{,_row}

2024-03-02 10:52:50 +08:00

column_identifier.hh

cql3: remove unused operator<<

2024-06-14 09:45:35 +03:00

column_specification.cc

…

column_specification.hh

…

constants.cc

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

constants.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

cql3_type.cc

cql3: remove unused operator<<

2024-06-14 09:45:35 +03:00

cql3_type.hh

cql3: remove unused operator<<

2024-06-14 09:45:35 +03:00

cql_config.hh

…

cql_statement.hh

cql3:statements: run service level statements on shard0 with raft guard

2024-03-21 23:14:57 +01:00

Cql.g

cql3/statements: extend ALTER TABLE ... DROP to allow specifying timestamp of column drop

2024-04-25 21:27:40 +02:00

error_collector.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

error_listener.hh

…

index_name.cc

…

index_name.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

keyspace_element_name.cc

…

keyspace_element_name.hh

…

lists.cc

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

lists.hh

treewide: do not define FMT_DEPRECATED_OSTREAM

2024-04-19 22:57:36 +08:00

maps.cc

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

maps.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

operation_impl.hh

…

operation.cc

treewide: use fmt::to_string() to transform a UUID to std::string

2024-03-26 13:38:37 +08:00

operation.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

prepare_context.cc

types: do not include unused headers

2024-04-23 12:08:23 +03:00

prepare_context.hh

cql3/prepare_context: fix generating pk_indexes for duplicate named bind variables

2023-09-25 17:18:53 +02:00

prepared_statements_cache.hh

treewide: drop thrift support

2024-06-07 06:44:59 +08:00

query_options_fwd.hh

…

query_options.cc

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

query_options.hh

…

query_processor.cc

cql3/query_processor: for_each_cql_result(): move func to the coro frame

2024-06-25 06:15:25 -04:00

query_processor.hh

cql3/query_processor: for_each_cql_result(): move func to the coro frame

2024-06-25 06:15:25 -04:00

result_generator.hh

cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt

2023-05-07 17:17:36 +03:00

result_set.cc

cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt

2023-05-07 17:17:36 +03:00

result_set.hh

result_set: introduce visit_gently()

2024-03-26 18:32:11 +02:00

role_name.cc

…

role_name.hh

…

role_options.hh

…

sets.cc

cql3: expr: break up expression.hh header

2023-06-22 14:21:03 +03:00

sets.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

stats.hh

guardrails: restrict replication strategy (RS)

2023-10-31 18:34:41 +03:00

type_json.cc

cql3: Fix invalid JSON parsing for JSON object with different key types

2024-05-05 15:42:43 +03:00

type_json.hh

…

untyped_result_set.cc

untyped_result_set: add missing_column_exception

2023-08-04 07:37:12 +02:00

untyped_result_set.hh

cql3: untyped_result_set: document performance characteristics

2023-05-10 15:03:12 +03:00

update_parameters.cc

cql3: expr: break up expression.hh header

2023-06-22 14:21:03 +03:00

update_parameters.hh

…

user_types.cc

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

user_types.hh

cql3: do not include unused headers

2024-01-16 16:43:17 +02:00

ut_name.cc

cql: maybe quote user type name in ut_name::to_string()

2023-03-23 01:41:58 +01:00

ut_name.hh

cql3: add formatter for cql3::ut_name

2024-01-21 16:53:05 +02:00

util.cc

cql3: expr: break up expression.hh header

2023-06-22 14:21:03 +03:00

util.hh

treewide: replace std::result_of_t with std::invoke_result_t

2024-05-26 16:45:42 +03:00

values.cc

cql3: add fmt::formatter for raw_value{,_view}

2024-03-05 14:00:13 +08:00

values.hh

cql3: remove unused operator<<

2024-06-14 09:45:35 +03:00