scylladb/db at 65daae0fbe9c19a5eba50bb1981406c9e5abe768 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 03:30:49 +00:00

Files

History

Botond Dénes f121720898 Merge '[Backport 5.4] batchlog replay: bypass tombstones generated by past replays' from ScyllaDB

The `system.batchlog` table has a partition for each batch that failed to complete. After finally applying the batch, the partition is deleted. Although the table has gc_grace_second = 0, tombstones can still accumulate in memory, because we don't purge partition tombstones from either the memtable or the cache. This can lead to the cache and memtable of this table to accumulate many thousands of even millions of tombstones, making batchlog replay very slow. We didn't notice this before, because we would only replay all failed batches on unbootstrap, which is rare and a heavy and slow operation on its own right already.
With repair-based tombstone-gc however, we do a full batchlog replay at the beginning of each repair, and now this extra delay is noticeable.
Fix this by making sure batchlog replays don't have to scan through all the tombstones generated by previous replays:
* flush the `system.batchlog` memtable at the end of each batchlog replay, so it is cleared of tombstones
* bypass the cache

Fixes: https://github.com/scylladb/scylladb/issues/19376

Although this is not a regression -- replay was like this since forever -- now that repair calls into batchlog replay, every release which uses repair-based tombstone-gc should get this fix

(cherry picked from commit 4e96e320b4)

(cherry picked from commit 2dd057c96d)

(cherry picked from commit 29f610d861)

(cherry picked from commit 31c0fa07d8)

Refs #19377

Closes scylladb/scylladb#19501

* github.com:scylladb/scylladb:
  db/batchlog_manager: bypass cache when scanning batchlog table
  db/batchlog_manager: replace open-coded paging with internal one
  db/batchlog_manager: implement cleanup after all batchlog replay
  cql3/query_processor: for_each_cql_result(): move func to the coro frame

2024-06-27 14:57:19 +03:00

..

Commitlog replayer: Range-check skip call

2024-01-05 09:19:28 +02:00

cql3, db: sstable: specialize fmt::formatter<function_name>

2023-04-21 10:07:28 +03:00

db/hints/manager: Reword comments about state

2023-10-06 13:25:30 +02:00

types: move types.{cc,hh} into types

2023-02-19 21:05:45 +02:00

migration_manager: announce: provide descriptions for all calls

2023-08-07 14:38:11 +02:00

view-builder: Print correct exception in built ste exception handler

2024-05-01 10:19:28 +03:00

batchlog_manager.cc

db/batchlog_manager: bypass cache when scanning batchlog table

2024-06-26 09:05:13 +00:00

batchlog_manager.hh

batchlog_manager: Remove start() method

2023-09-12 16:37:52 +03:00

cache_tracker.hh

sstables: partition_index_cache: deglobalize stats

2023-09-01 22:34:41 +02:00

chained_delegating_reader.hh

flat_mutation_reader_v2: drop forward_buffer_to

2023-02-28 23:00:02 +04:00

CMakeLists.txt

db/hints: Move the rebalancing logic to hint_storage

2023-09-15 03:46:15 +02:00

config.cc

db/config.cc: increment components_memory_reclaim_threshold config default

2024-06-04 07:11:43 +03:00

config.hh

repair: Introduce repair_partition_count_estimation_ratio config option

2024-05-27 16:32:56 +08:00

consistency_level_type.hh

…

consistency_level_validations.hh

…

consistency_level.cc

build: disable implicit fallthrough

2023-07-10 19:36:06 +02:00

consistency_level.hh

db: consistency_level: remove overload of filter_for_query

2023-06-14 11:41:36 +02:00

cql_type_parser.cc

Merge 'Allow setting permissions for user-defined functions' from Wojciech Mitros

2023-03-12 14:04:34 +02:00

cql_type_parser.hh

Merge 'Allow setting permissions for user-defined functions' from Wojciech Mitros

2023-03-12 14:04:34 +02:00

data_listeners.cc

keys: specialize fmt::formatter<partition_key> and friends

2023-04-14 13:21:30 +08:00

data_listeners.hh

db: data_listener: mark data_listener's dtor virtual

2023-03-19 15:16:02 +02:00

extensions.cc

db::extentions: Add "extensions internal" keyspace set

2023-03-27 15:12:31 +00:00

extensions.hh

db::extentions: Add "extensions internal" keyspace set

2023-03-27 15:12:31 +00:00

heat_load_balance.cc

…

heat_load_balance.hh

…

large_data_handler.cc

logging: Don't log PK/CK in large partition/row/cell warning

2024-04-05 16:02:22 +03:00

large_data_handler.hh

Introduce schema/ module

2023-02-15 11:01:50 +02:00

legacy_schema_migrator.cc

code: Pass sharded<db::system_keyspace>& to database::truncate()

2023-07-21 13:11:59 +03:00

legacy_schema_migrator.hh

db: Add sharded<system_keyspace>& to legacy_schema_migrator

2023-07-21 12:38:46 +03:00

operation_type.hh

…

paxos_grace_seconds_extension.hh

schema_extensions: Add an option to string method

2024-06-25 12:54:13 +00:00

per_partition_rate_limit_extension.hh

Introduce schema/ module

2023-02-15 11:01:50 +02:00

per_partition_rate_limit_info.hh

db: add rate_limiter

2022-06-22 20:16:48 +02:00

per_partition_rate_limit_options.cc

treewide: use fmt::join() when appropriate

2023-03-16 20:34:18 +08:00

per_partition_rate_limit_options.hh

per_partition_rate_limit_options: abort on illegal operation type

2022-11-28 21:58:30 +02:00

rate_limiter.cc

db: add rate_limiter

2022-06-22 20:16:48 +02:00

rate_limiter.hh

db: add rate_limiter

2022-06-22 20:16:48 +02:00

read_repair_decision.hh

…

schema_features.hh

Revert "Merge 'Don't calculate hashes for schema versions in Raft mode' from Kamil Braun"

2023-10-11 00:32:05 +03:00

schema_tables.cc

schema_tables: pass reload flag when calling merge_schema cross-shard

2024-04-02 12:53:58 +02:00

schema_tables.hh

Revert "Merge 'Don't calculate hashes for schema versions in Raft mode' from Kamil Braun"

2023-10-11 00:32:05 +03:00

size_estimates_virtual_reader.cc

dht: Rename dht::shard_of() to dht::static_shard_of()

2023-06-21 00:58:24 +02:00

size_estimates_virtual_reader.hh

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

snapshot-ctl.cc

snapshot: protect list operations against the lambda coroutine fiasco

2022-12-05 08:14:39 +02:00

snapshot-ctl.hh

database: automatically take snapshot of base table views

2022-09-26 11:02:54 +03:00

sstables-format-selector.cc

system_keyspace: scylla_local: use schema commitlog

2023-09-13 23:17:20 +04:00

sstables-format-selector.hh

sstables_format_selector: extract listener

2023-09-13 23:04:50 +04:00

system_distributed_keyspace.cc

cdc: use chunked_vector for topology_description entries

2023-09-18 23:17:01 +03:00

system_distributed_keyspace.hh

schema.hh: use schema_static_props for wait_for_sync_to_commitlog

2023-03-14 19:26:05 +04:00

system_keyspace_view_types.hh

…

system_keyspace.cc

system_keyspace: use system memory for system.raft table

2023-11-16 12:51:03 +01:00

system_keyspace.hh

Merge 'truncation records refactorings' from Petr Gusev

2023-10-17 10:55:30 +02:00

timeout_clock.hh

…

virtual_table.cc

dht: Rename dht::shard_of() to dht::static_shard_of()

2023-06-21 00:58:24 +02:00

virtual_table.hh

db/virtual_table: mark the dtor of base class virtual

2023-02-17 07:11:18 +02:00

virtual_tables.cc

system_keyspace: drop load phases

2023-09-13 23:17:20 +04:00

virtual_tables.hh

system_keyspace: move initialize_virtual_tables into virtual_tables.hh

2023-09-13 23:00:15 +04:00

write_type.hh

…