scylladb/mutation at 63b32cbdb47fe7fefe6f75aa191fdcfef62afa7a - scylladb - Anomalous Gitea


mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Tomasz Grabiec 4e9d95d78c Merge 'Compact data before streaming' from Botond Dénes

Currently, streaming and repair process and send data as-is. This is wasteful: streaming might send data that is expired or covered by tombstones, taking up valuable bandwidth and processing time. Repair could additionally be exposed to artificial differences, because different nodes are in different states of compactness.
This PR adds opt-in compaction to `make_streaming_reader()`, then opts in all users. The main difference between the users is how they choose the compaction time to use:
* Load'n'stream and streaming use the current time on the local node.
* Repair uses a centrally chosen compaction time, generated on the repair master and propagated to all repair followers. This ensures that all repair participants work with the exact same state of compactness.

Importantly, this compaction does *not* purge tombstones (tombstone GC is disabled completely).
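
To illustrate the idea, here is a minimal toy sketch of compacting a partition at a caller-supplied compaction time without purging tombstones. It is not ScyllaDB code: the `cell` and `partition` types and the `compact_for_streaming()` function are hypothetical names invented for this example, and the real logic lives in `mutation/mutation_compactor.hh`. The sketch assumes that a cell covered by the partition tombstone is dropped, and that an expired cell loses its value but is kept as a dead cell.

```cpp
#include <cstdint>
#include <optional>
#include <string>
#include <utility>
#include <vector>

// Toy model only: these types are hypothetical and are not ScyllaDB's.
struct cell {
    int64_t write_ts;                  // write timestamp
    std::optional<int64_t> expiry;     // absolute expiry time, set if the cell has a TTL
    std::optional<std::string> value;  // empty means the cell is dead (a cell tombstone)
};

struct partition {
    std::optional<int64_t> tombstone_ts;  // partition tombstone timestamp, if any
    std::vector<cell> cells;
};

// Compacts `in` as of `compaction_time`. Tombstones are never purged,
// mirroring "tombstone GC is disabled completely" in the description above.
partition compact_for_streaming(const partition& in, int64_t compaction_time) {
    partition out;
    out.tombstone_ts = in.tombstone_ts;  // the partition tombstone is always kept
    for (const cell& c : in.cells) {
        // Data covered by the partition tombstone does not need to be sent.
        if (in.tombstone_ts && c.write_ts <= *in.tombstone_ts) {
            continue;
        }
        cell kept = c;
        // An expired live cell drops its value but stays as a dead cell.
        if (kept.value && kept.expiry && *kept.expiry <= compaction_time) {
            kept.value.reset();
        }
        out.cells.push_back(std::move(kept));
    }
    return out;
}
```

In this model, two replicas that hold the same writes produce the same output only when they are given the same `compaction_time`, which is why repair picks one value on the master instead of letting each follower use its local clock.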

Fixes: https://github.com/scylladb/scylladb/issues/3561

Closes #14756

* github.com:scylladb/scylladb:
  replica: make_[multishard_]streaming_reader(): make compaction_time mandatory
  repair/row_level: opt in to compacting the stream
  streaming: opt-in to compacting the stream
  sstables_loader: opt-in for compacting the stream
  replica/table: add optional compacting to make_multishard_streaming_reader()
  replica/table: add optional compacting to make_streaming_reader()
  db/config: add config item for enabling compaction for streaming and repair
  repair: log the error which caused the repair to fail
  readers: compacting_reader: use compact_mutation_state::abandon_current_partition()
  mutation/mutation_compactor: allow user to abandon current partition

2023-07-28 16:42:13 +02:00

atomic_cell_hash.hh

types: move types.{cc,hh} into types

2023-02-19 21:05:45 +02:00

atomic_cell_or_collection.hh

…

atomic_cell.cc

mutation_partition: compare_row_marker_for_merge: consider ttl in case expiry is the same

2023-06-20 10:10:39 +03:00

atomic_cell.hh

…

canonical_mutation.cc

…

canonical_mutation.hh

…

CMakeLists.txt

readers,mutation: move mutation_fragment_stream_validator to mutation/

2023-05-09 07:55:13 -04:00

frozen_mutation.cc

…

frozen_mutation.hh

db, storage_proxy: Drop mutation/frozen_mutation ::shard_of()

2023-06-21 00:58:24 +02:00

json.hh

tools,mutation: extract the low-level json utilities into mutation/json.hh

2023-07-19 01:28:28 -04:00

mutation_cleaner.hh

mutation: mutation_cleaner: add pause()

2023-06-19 22:50:43 +02:00

mutation_compactor.hh

mutation/mutation_compactor: allow user to abandon current partition

2023-07-27 02:50:44 -04:00

mutation_consumer_concepts.hh

…

mutation_consumer.hh

…

mutation_fragment_fwd.hh

…

mutation_fragment_stream_validator.cc

mutation/mutation_fragment_stream_validator.cc: rename logger

2023-05-09 07:55:13 -04:00

mutation_fragment_stream_validator.hh

mutation: mutation_fragment_stream_validating_filter: add accessor to underlying validator

2023-07-20 08:48:50 -04:00

mutation_fragment_v2.hh

reader_permit: split resource_units::reset()

2023-04-26 07:41:57 -04:00

mutation_fragment.cc

Merge 'reader_permit: minor improvements to resource consume/release safety' from Botond Dénes

2023-05-14 14:14:23 +03:00

mutation_fragment.hh

Merge ' mvcc: make schema upgrades gentle' from Michał Chojnowski

2023-05-24 22:58:43 +02:00

mutation_partition_serializer.cc

…

mutation_partition_serializer.hh

…

mutation_partition_v2.cc

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

mutation_partition_v2.hh

mutation_partition_v2: change schema_ptr to schema& in mutation_partition_v2 constructor

2023-05-04 02:37:29 +02:00

mutation_partition_view.cc

…

mutation_partition_view.hh

…

mutation_partition_visitor.hh

…

mutation_partition.cc

Merge 'atomic_cell: compare value last' from Benny Halevy

2023-06-20 12:11:48 +02:00

mutation_partition.hh

treewide: remove #includes not use directly

2023-07-18 17:36:31 +08:00

mutation_rebuilder.hh

mutation/mutation_rebuilder: add comment about validity of returned mutation reference

2023-07-27 12:13:46 +03:00

mutation_source_metadata.hh

…

mutation.cc

mutation/mutation: add memory_usage()

2023-07-25 10:34:30 -04:00

mutation.hh

mutation/mutation: add memory_usage()

2023-07-25 10:34:30 -04:00

partition_version_list.hh

…

partition_version.cc

partition_version: make partition_entry::upgrade() gentle

2023-05-04 03:35:15 +02:00

partition_version.hh

partition_version: handle multi-schema snapshots in merge_partition_versions

2023-05-04 03:35:15 +02:00

position_in_partition.hh

mutation: drop operator<< for position_in_partition and friends

2023-03-31 19:03:14 +08:00

range_tombstone_assembler.hh

…

range_tombstone_change_generator.hh

range_tombstone_change_generator: fix an edge case in flush()

2023-05-16 17:54:08 +02:00

range_tombstone_list.cc

mutation: specialize fmt::formatter<range_tombstone_{entry,list}>

2023-04-26 09:00:25 +03:00

range_tombstone_list.hh

mutation: specialize fmt::formatter<range_tombstone_{entry,list}>

2023-04-26 09:00:25 +03:00

range_tombstone_splitter.hh

…

range_tombstone.cc

mutation: drop operator<<(ostream, const range_tombstone{_change,} &)

2023-03-21 11:37:07 +08:00

range_tombstone.hh

mutation: drop operator<<(ostream, const range_tombstone{_change,} &)

2023-03-21 11:37:07 +08:00

tombstone.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00