scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 03:30:49 +00:00

Files

Avi Kivity 5e4941a74b Merge '[Backport 2025.2] sstables/mx/writer: handle non-full prefix row keys' from Scylladb[bot]

Although valid for compact tables, non-full (or empty) clustering key prefixes are not handled for row keys when writing sstables. Only the present components are written, consequently if the key is empty, it is omitted entirely.
When parsing sstables, the parsing code unconditionally parses a full prefix.
This mis-match results in parsing failures, as the parser parses part of the row content as a key resulting in a garbage key and subsequent mis-parsing of the row content and maybe even subsequent partitions.

Introduce a new system table: `system.corrupt_data` and infrastructure similar to `large_data_handler`: `corrupt_data_handler` which abstracts how corrupt data is handled. The sstable writer now passes rows such corrupt keys to the corrupt data handler. This way, we avoid corrupting the sstables beyond parsing and the rows are also kept around in system.corrupt_data for later inspection and possible recovery.

Add a full-stack test which checks that rows with bad keys are correctly handled.

Fixes: https://github.com/scylladb/scylladb/issues/24489

The bug is present in all versions, has to be backported to all supported versions.

- (cherry picked from commit 92b5fe8983)

- (cherry picked from commit 0753643606)

- (cherry picked from commit b0d5462440)

- (cherry picked from commit 093d4f8d69)

- (cherry picked from commit 678deece88)

- (cherry picked from commit 64f8500367)

- (cherry picked from commit b931145a26)

- (cherry picked from commit 3e1c50e9a7)

- (cherry picked from commit 46ff7f9c12)

- (cherry picked from commit ebd9420687)

- (cherry picked from commit aae212a87c)

- (cherry picked from commit 592ca789e2)

- (cherry picked from commit edc2906892)

Parent PR: #24492

Closes scylladb/scylladb#24744

* github.com:scylladb/scylladb:
  test/boost/sstable_datafile_test: add test for corrupt data
  sstables/mx/writer: handler rows with empty keys
  test/lib/cql_assertions: introduce columns_assertions
  sstables: add corrupt_data_handler to sstables::sstables
  tools/scylla-sstable: make large_data_handler a local
  db: introduce corrupt_data_handler
  mutation: introduce frozen_mutation_fragment_v2
  mutation/mutation_partition_view: read_{clustering,static}_row(): return row type
  mutation/mutation_partition_view: extract de-ser of {clustering,static} row
  idl-compiler.py: generate skip() definition for enums serializers
  idl: extract full_position.idl from position_in_partition.idl
  db/system_keyspace: add apply_mutation()
  db/system_keyspace: introduce the corrupt_data table

2025-07-01 12:27:01 +03:00

alternator

alternator: hide internal tags from users

2025-06-04 09:56:33 +03:00

boost

test/boost/sstable_datafile_test: add test for corrupt data

2025-06-30 12:44:29 +00:00

broadcast_tables

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cluster

group0: modify start_operation logic to account for synchronize phase race condition

2025-07-01 10:10:55 +02:00

cql

cql: restore validating replication strategies options

2025-02-04 12:27:33 +01:00

cqlpy

cql, schema: Extend name length limit from 48 to 192 bytes

2025-06-22 17:38:30 +00:00

ldap

test.py: move the readme file for LDAP tests to the correct location

2025-04-22 19:03:28 +02:00

lib

test/lib/cql_assertions: introduce columns_assertions

2025-06-30 12:44:29 +00:00

manual

readers: mv forwardable_v2.hh forwardable.hh

2025-04-16 04:33:50 -04:00

nodetool

Add support for nodetool refresh --skip-reshape

2025-06-13 14:06:19 +03:00

perf

test: switch uses of make_sstable_compressor_factory() to a seastar::thread-dependent version

2025-05-12 09:12:05 +00:00

pylib

test: tablets: add get_tablet_info helper

2025-06-17 13:59:10 +00:00

pylib_test

test.py: remove pylib_test from test.py/CI run

2025-04-01 16:43:45 +03:00

raft

Move direct_failure_detector from root to service/

2025-04-08 13:03:24 +03:00

redis

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

resource

build: cmake: use wasm32-wasip1 as an alternative of wasm32-wasi

2025-01-16 16:28:29 +03:00

rest_api

test: rest_api: fix test_repair_task_progress

2025-06-28 09:39:06 +03:00

scylla_gdb

test/scylla_gdb: better error message when running on dev build mode

2025-04-22 15:02:06 +03:00

unit

replica/memtable: s/make_flat_reader/make_mutation_reader/

2025-04-01 17:58:13 +03:00

__init__.py

test.py: move get_combined_tests to the correct facade

2025-04-24 14:05:49 +02:00

CMakeLists.txt

Introduce LDAP role manager & saslauthd authenticator

2025-01-12 14:50:29 +02:00

conftest.py

test.py: move setup cgroups to the generic method

2025-04-24 14:05:49 +02:00

pytest.ini

Merge 'test/pylib: servers_add: support list of property_files' from Benny Halevy

2025-04-01 09:14:20 +03:00

README.md

test: rename "cql-pytest" to "cqlpy"

2024-11-06 16:48:36 +02:00

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utils, libraries are in lib/, for Python - pylib/

alternator - Python tests which connect to a single server and use the DynamoDB API unit, boost, raft - unit tests in C++ cqlpy - Python tests which connect to a single server and use CQL topology* - tests that set up clusters and add/remove nodes cql - approval tests that use CQL and pre-recorded output rest_api - tests for Scylla REST API Port 9000 scylla-gdb - tests for scylla-gdb.py helper script nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from some existing suite, e.g. you plan to start scylladb with different configuration, and you intend to add many tests and would like them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.