scylladb/mutation at b33dd2bd7dba311daff7c04db7bf16fdc4eecd01 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-22 15:52:13 +00:00

Avi Kivity b33dd2bd7d Merge 'sstables/mx/writer: handle non-full prefix row keys' from Botond Dénes

Although valid for compact tables, non-full (or empty) clustering key prefixes are not handled for row keys when writing sstables. Only the components that are present are written; consequently, an empty key is omitted entirely.
When parsing sstables, the parsing code unconditionally parses a full prefix.
This mismatch results in parsing failures: the parser reads part of the row content as a key, which produces a garbage key and causes the rest of the row content, and possibly even subsequent partitions, to be mis-parsed.
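
The writer/parser mismatch described above can be illustrated with a minimal sketch. It assumes a toy format in which every value is a single length byte followed by its bytes; this is not the real mx sstable layout, and `put`, `get`, `write_row` and `parse_row` are hypothetical names, not ScyllaDB functions.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Toy format (an assumption for illustration, NOT the real mx layout):
// every value is one length byte followed by that many bytes.
using buffer = std::vector<std::uint8_t>;

void put(buffer& out, const std::string& v) {
    out.push_back(static_cast<std::uint8_t>(v.size()));
    out.insert(out.end(), v.begin(), v.end());
}

// Reads one length-prefixed value; at() throws std::out_of_range
// when the buffer is exhausted.
std::string get(const buffer& in, std::size_t& pos) {
    const std::size_t len = in.at(pos++);
    std::string v;
    for (std::size_t i = 0; i < len; ++i) {
        v.push_back(static_cast<char>(in.at(pos++)));
    }
    return v;
}

// Writer: emits only the key components that are present, then the content.
buffer write_row(const std::vector<std::string>& key_prefix,
                 const std::string& content) {
    buffer out;
    for (const auto& component : key_prefix) {
        put(out, component);
    }
    put(out, content);
    return out;
}

struct parsed_row {
    std::vector<std::string> key;
    std::string content;
};

// Parser: unconditionally reads a full key of schema_key_size components,
// then the content.
parsed_row parse_row(const buffer& in, std::size_t schema_key_size) {
    std::size_t pos = 0;
    parsed_row r;
    for (std::size_t i = 0; i < schema_key_size; ++i) {
        r.key.push_back(get(in, pos));
    }
    r.content = get(in, pos);
    return r;
}
```

With a two-component schema, a row written with the full key `{"a", "b"}` round-trips, while a row written with the prefix `{"a"}` makes the parser take the content as the second key component and then run off the end of the buffer.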

Introduce a new system table, `system.corrupt_data`, and infrastructure similar to `large_data_handler`: `corrupt_data_handler`, which abstracts how corrupt data is handled. The sstable writer now passes rows with such corrupt keys to the corrupt data handler. This way, we avoid corrupting the sstables beyond parsing, and the rows are also kept around in `system.corrupt_data` for later inspection and possible recovery.
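
The routing idea can be sketched as follows. This is a toy model under stated assumptions, not the interface introduced by this series: `toy_corrupt_data_handler`, `storing_handler`, `toy_writer` and the `origin` string are hypothetical, and an in-memory vector stands in for the `system.corrupt_data` table.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

struct row {
    std::vector<std::string> key;  // clustering key components that are present
    std::string content;
};

// Hypothetical abstraction over "what to do with a corrupt row".
class toy_corrupt_data_handler {
public:
    virtual ~toy_corrupt_data_handler() = default;
    virtual void record(const row& r, const std::string& origin) = 0;
};

// Keeps rejected rows in memory; a stand-in for persisting them to a table.
class storing_handler final : public toy_corrupt_data_handler {
public:
    struct entry {
        row r;
        std::string origin;
    };
    std::vector<entry> entries;

    void record(const row& r, const std::string& origin) override {
        entries.push_back(entry{r, origin});
    }
};

// Toy writer: rows whose key is not a full prefix are diverted to the
// handler instead of being written.
class toy_writer {
    std::size_t _schema_key_size;
    toy_corrupt_data_handler& _handler;
    std::vector<row> _written;
public:
    toy_writer(std::size_t schema_key_size, toy_corrupt_data_handler& handler)
        : _schema_key_size(schema_key_size), _handler(handler) {}

    void consume(row r) {
        if (r.key.size() != _schema_key_size) {
            _handler.record(r, "sstable-write");
            return;
        }
        _written.push_back(std::move(r));
    }

    const std::vector<row>& written() const { return _written; }
};
```

Keeping the handler behind an interface lets the writer stay unaware of where diverted rows end up, so a test or a tool could plug in a different handler than the one used by a running node.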

Add a full-stack test which checks that rows with bad keys are correctly handled.

Fixes: https://github.com/scylladb/scylladb/issues/24489

The bug is present in all versions and has to be backported to all supported versions.

Closes scylladb/scylladb#24492

* github.com:scylladb/scylladb:
  test/boost/sstable_datafile_test: add test for corrupt data
  sstables/mx/writer: handler rows with empty keys
  test/lib/cql_assertions: introduce columns_assertions
  sstables: add corrupt_data_handler to sstables::sstables
  tools/scylla-sstable: make large_data_handler a local
  db: introduce corrupt_data_handler
  mutation: introduce frozen_mutation_fragment_v2
  mutation/mutation_partition_view: read_{clustering,static}_row(): return row type
  mutation/mutation_partition_view: extract de-ser of {clustering,static} row
  idl-compiler.py: generate skip() definition for enums serializers
  idl: extract full_position.idl from position_in_partition.idl
  db/system_keyspace: add apply_mutation()
  db/system_keyspace: introduce the corrupt_data table

2025-06-29 18:18:36 +03:00

async_utils.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

async_utils.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

atomic_cell_hash.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

atomic_cell_or_collection.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

atomic_cell.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

atomic_cell.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

canonical_mutation.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

canonical_mutation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

CMakeLists.txt

build: cmake: build async_utils.cc

2024-05-09 08:26:44 +03:00

compact_and_expire_result.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

frozen_mutation.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

frozen_mutation.hh

mutation: introduce frozen_mutation_fragment_v2

2025-06-24 11:05:31 +03:00

json.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_cleaner.hh

utils: do not include unused headers

2025-01-14 07:56:39 -05:00

mutation_compactor.hh

mutation/mutation_compactor: cache regular/shadowable max-purgable in separate members

2025-05-29 22:52:08 +03:00

mutation_consumer_concepts.hh

mutation: fold FragmentConsumer[V2] into FlattenedConsumer[V2]

2025-03-18 09:24:49 -04:00

mutation_consumer.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_fragment_fwd.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_fragment_stream_validator.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_fragment_stream_validator.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_fragment_v2.hh

Move mutation_fragment_v2::kind into data object

2025-05-26 11:06:53 +02:00

mutation_fragment.cc

Move mutation_fragment::kind into data object

2025-05-26 11:06:54 +02:00

mutation_fragment.hh

Move mutation_fragment::kind into data object

2025-05-26 11:06:54 +02:00

mutation_partition_serializer.cc

mutation: introduce frozen_mutation_fragment_v2

2025-06-24 11:05:31 +03:00

mutation_partition_serializer.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_partition_v2.cc

mutation: check key of inserted rows

2025-06-23 09:38:45 +03:00

mutation_partition_v2.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_partition_view.cc

mutation: introduce frozen_mutation_fragment_v2

2025-06-24 11:05:31 +03:00

mutation_partition_view.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_partition_visitor.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_partition.cc

mutation: check key of inserted rows

2025-06-23 09:38:45 +03:00

mutation_partition.hh

mutation: check key of inserted rows

2025-06-23 09:38:45 +03:00

mutation_rebuilder.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_source_metadata.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

mutation_tombstone_stats.hh

mutation_compactor: Collect tombstone purge attempts

2025-05-16 20:00:00 +02:00

mutation.cc

readers: mv from_mutations_v2.hh from_mutations.hh

2025-04-16 04:46:08 -04:00

mutation.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_version_list.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

partition_version.cc

moved cache files to db

2025-02-04 12:21:31 +03:00

partition_version.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

position_in_partition.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone_assembler.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone_change_generator.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone_list.cc

mutation,test: replace boost::equal with std::ranges::equal

2025-02-26 14:27:42 +03:00

range_tombstone_list.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone_splitter.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

range_tombstone.cc

mutation: replace boost::upper_bound with std::ranges::upper_bound

2025-03-04 10:36:57 +03:00

range_tombstone.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

tombstone.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00