scylladb/db at d9853efa7c6a6d80540e1f4bbf9ccbd2831a4df4 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 18:40:38 +00:00

Files

History

Pavel Emelyanov d9853efa7c Merge '[Out-of-space prevention] db: backup: prioritize sstables that were deleted from the table' from Benny Halevy

The motivation behind this change to free up disk space as early as possible.
The reason is that snapshot locks the space of all SSTables in the snapshot,
and deleting form the table, for example, by compaction, or tablet migration,
won't free-up their capacity until they are uploaded to object storage and deleted from the snapshot.

This series adds prioritization of deleted sstables in two cases:
First, after the snapshot dir is processed, the list of SSTable generation is cross-referenced with the
list of SSTables presently in the table and any generation that is not in the table is prioritized to
be uploaded earlier.
In addition, a subscription mechanism was added to sstables_manager
and it is used in backup to prioritize SSTables that get deleted from the table directory
during backup.

This is particularly important when backup happens during high disk utilization (e.g. 90%).
Without it, even if the cluster is scaled up and tablets are migrated away from the full nodes
to new nodes, tablet cleanup might not free any space if all the tablet sstables are hardlinked to the
snapshot taken for backup.

* Enhancement, no backport needed

Closes scylladb/scylladb#23241

* github.com:scylladb/scylladb:
  db: snapshot: backup_task: prioritize sstables deleted during upload
  sstables_manager: add subscriptions
  db: snapshot: backup_task: limit concurrency
  sstables: directory_semaphore: expose get_units
  db: snapshot: backup_task: add sharded sstables_manager
  database: expose get_sstables_manager(schema)
  db: snapshot: backup_task: do_backup: prioritize sstables that are already deleted from the table
  db: snapshot-ctl: pass table_id to backup_task
  db: snapshot-ctl: expose sharded db() getter
  db: snapshot: backup_task: do_backup: organize components by sstable generation
  db: snapshot: coroutinize backup_task
  db: snapshot: backup_task: refactor backup_file out of uploads_worker
  db: snapshot: backup_task: refactor uploads_worker out of do_backup
  db: snapshot: backup_task: process_snapshot_dir: initialize total progress
  utils/s3: upload_progress: init members to 0
  db: snapshot: backup_task: do_backup: refactor process_snapshot_dir
  db: snapshot: backup_task: keep expection as member

2025-04-09 15:32:11 +03:00

..

commitlog: Serialize file deletion

2025-03-17 12:09:00 +00:00

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

Merge "convert some parts of the gossiper to host ids" from Gleb

2025-03-13 13:36:31 +02:00

type_parser: support vector type

2025-01-28 21:14:49 +01:00

db: snapshot: backup_task: prioritize sstables deleted during upload

2025-04-09 08:54:07 +03:00

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

db/view/view.cc: label metrics with basic_level

2025-03-03 16:58:39 +02:00

auth_version.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

batchlog_manager.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

batchlog_manager.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cache_mutation_reader.hh

db/row_cache: add overlap-check for cache tombstone garbage collection

2025-04-08 00:11:35 -04:00

cache_tracker.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

chained_delegating_reader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

CMakeLists.txt

build: cmake: remove trailing comma in db/CMakeLists.txt source list

2025-02-09 17:28:47 +02:00

config.cc

Merge 'Add tablet enforcing option' from Benny Halevy

2025-04-03 16:32:19 +03:00

config.hh

Merge 'Add tablet enforcing option' from Benny Halevy

2025-04-03 16:32:19 +03:00

consistency_level_type.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

consistency_level_validations.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

consistency_level.cc

consistency_level: drop templates since the same types of ranges are used by all the callers

2025-01-16 16:37:06 +02:00

consistency_level.hh

tree: remove unused "#include"s

2025-01-28 14:12:06 +03:00

cql_type_parser.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

cql_type_parser.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

data_listeners.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

data_listeners.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

extensions.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

extensions.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

heat_load_balance.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

heat_load_balance.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

large_data_handler.cc

database, compaction_manager, large_data_handler: use pluggable<system_keysapce>

2025-03-05 08:27:23 +02:00

large_data_handler.hh

database, compaction_manager, large_data_handler: use pluggable<system_keysapce>

2025-03-05 08:27:23 +02:00

legacy_schema_migrator.cc

database: Sanitize flush_tables_on_all_shards()

2025-03-10 13:13:10 +03:00

legacy_schema_migrator.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

operation_type.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_snapshot_row_cursor.hh

db: do not include unused headers

2025-02-06 13:38:19 +02:00

paxos_grace_seconds_extension.hh

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

per_partition_rate_limit_extension.hh

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

per_partition_rate_limit_info.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

per_partition_rate_limit_options.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

per_partition_rate_limit_options.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

rate_limiter.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

rate_limiter.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

read_context.hh

db/row_cache: add overlap-check for cache tombstone garbage collection

2025-04-08 00:11:35 -04:00

read_repair_decision.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

row_cache.cc

replica/memtable: add is_merging_to_cache()

2025-04-08 00:11:35 -04:00

row_cache.hh

db/row_cache: add overlap-check for cache tombstone garbage collection

2025-04-08 00:11:35 -04:00

schema_applier.cc

db: prevent accidental copies of result_set_row by making it move-only

2025-02-17 09:48:08 +02:00

schema_applier.hh

treewide: use angle brackets when including seastar headers

2024-12-20 16:16:28 +02:00

schema_features.hh

feature_service: add TABLET_OPTIONS cluster schema feature

2025-02-06 08:55:51 +02:00

schema_tables.cc

schema: deprecate schema_extension

2025-03-19 20:36:16 +02:00

schema_tables.hh

schema_tables: Remove all_table_names()

2025-03-10 13:12:56 +03:00

size_estimates_virtual_reader.cc

db: cql3: add comments regarding unsafe interval<clustering_key_prefix>

2025-02-26 12:01:28 +01:00

size_estimates_virtual_reader.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

snapshot-ctl.cc

db: snapshot-ctl: pass table_id to backup_task

2025-04-09 08:54:07 +03:00

snapshot-ctl.hh

db: snapshot-ctl: expose sharded db() getter

2025-04-09 08:54:07 +03:00

sstables-format-selector.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

sstables-format-selector.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

system_distributed_keyspace.cc

qos: use the shares field in service level reads/writes

2025-01-02 07:13:34 +01:00

system_distributed_keyspace.hh

db/system_distributed_keyspace: add shares column and upgrade code

2025-01-02 07:13:34 +01:00

system_keyspace_sstables_registry.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

system_keyspace_view_types.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

system_keyspace.cc

Merge "Convert gossiper's endpoint state map to be host id based" from Gleb

2025-04-02 12:30:00 +03:00

system_keyspace.hh

Merge "Convert gossiper's endpoint state map to be host id based" from Gleb

2025-04-02 12:30:00 +03:00

tablet_options.cc

schema: add per-table tablet options

2025-02-06 08:55:51 +02:00

tablet_options.hh

schema: add per-table tablet options

2025-02-06 08:55:51 +02:00

timeout_clock.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

virtual_table.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

virtual_table.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

virtual_tables.cc

treewide: drop id parameter from gossiper::for_each_endpoint_state

2025-03-31 16:50:50 +03:00

virtual_tables.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

write_type.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00