scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-28 10:41:12 +00:00


Avi Kivity b33dd2bd7d Merge 'sstables/mx/writer: handle non-full prefix row keys' from Botond Dénes

Although valid for compact tables, non-full (or empty) clustering key prefixes are not handled for row keys when writing sstables. Only the present components are written; consequently, if the key is empty, it is omitted entirely.
When parsing sstables, the parsing code unconditionally parses a full prefix.
This mismatch results in parsing failures: the parser reads part of the row content as a key, which produces a garbage key and causes the rest of the row content, and possibly subsequent partitions, to be mis-parsed.
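The mismatch described above can be illustrated with a toy model. This is only a sketch: the length-prefixed layout, `write_row()` and `parse_row()` are hypothetical and are not the real mx sstable format or ScyllaDB code. The writer emits only the key components that are present, while the parser always reads a full prefix.

```python
import struct


def write_row(key_components, payload):
    """Toy writer: emit only the key components that are present, then the payload.

    Every field is a 2-byte big-endian length followed by the bytes (an assumed layout).
    """
    out = b""
    for comp in key_components:
        out += struct.pack(">H", len(comp)) + comp
    return out + struct.pack(">H", len(payload)) + payload


def parse_row(buf, pos, num_key_columns):
    """Toy parser: unconditionally read num_key_columns key fields, then the payload."""
    fields = []
    for _ in range(num_key_columns + 1):  # full key prefix + payload
        (n,) = struct.unpack_from(">H", buf, pos)
        pos += 2
        fields.append(buf[pos:pos + n])
        pos += n
    return fields[:-1], fields[-1], pos


# The first row has a non-full key (1 of 2 components); the second row is well formed.
stream = write_row([b"a"], b"v1") + write_row([b"b", b"c"], b"v2")
key, payload, pos = parse_row(stream, 0, 2)
print(key, payload)
```

In this model the parser takes the first row's payload as its second key component and then consumes the start of the next row as the payload, so the damage spreads beyond the row that had the short key.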

Introduce a new system table, `system.corrupt_data`, and infrastructure similar to `large_data_handler`: `corrupt_data_handler`, which abstracts how corrupt data is handled. The sstable writer now passes rows with such corrupt keys to the corrupt data handler. This way, we avoid corrupting the sstables beyond parsing, and the rows are also kept around in `system.corrupt_data` for later inspection and possible recovery.
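The idea of routing bad rows to a pluggable handler can be sketched as follows. All names here (`CorruptDataHandler`, `InMemoryCorruptDataHandler`, `write_rows()`) are hypothetical Python stand-ins chosen for illustration; they are not the C++ interfaces added by this series, and the check for a short key is an assumed simplification.

```python
from abc import ABC, abstractmethod


class CorruptDataHandler(ABC):
    """Abstracts what happens to a row that cannot be written safely."""

    @abstractmethod
    def record_corrupt_row(self, table, key_components, payload):
        ...


class NopCorruptDataHandler(CorruptDataHandler):
    """Drops corrupt rows without recording them."""

    def record_corrupt_row(self, table, key_components, payload):
        pass


class InMemoryCorruptDataHandler(CorruptDataHandler):
    """Keeps corrupt rows in a list, standing in for a table such as system.corrupt_data."""

    def __init__(self):
        self.entries = []

    def record_corrupt_row(self, table, key_components, payload):
        self.entries.append((table, list(key_components), payload))


def write_rows(rows, num_key_columns, handler, table="ks.tbl"):
    """Toy writer: rows with a non-full key go to the handler instead of the output."""
    written = []
    for key_components, payload in rows:
        if len(key_components) < num_key_columns:  # assumed definition of a bad key
            handler.record_corrupt_row(table, key_components, payload)
            continue
        written.append((key_components, payload))
    return written


handler = InMemoryCorruptDataHandler()
rows = [([b"a", b"b"], b"v1"), ([], b"v2"), ([b"c"], b"v3")]
written = write_rows(rows, 2, handler)
print(len(written), len(handler.entries))
```

Keeping the policy behind an interface lets a caller choose between recording the rows and ignoring them without changing the writer itself.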

Add a full-stack test which checks that rows with bad keys are correctly handled.

Fixes: https://github.com/scylladb/scylladb/issues/24489

The bug is present in all versions, so the fix has to be backported to all supported versions.

Closes scylladb/scylladb#24492

* github.com:scylladb/scylladb:
  test/boost/sstable_datafile_test: add test for corrupt data
  sstables/mx/writer: handler rows with empty keys
  test/lib/cql_assertions: introduce columns_assertions
  sstables: add corrupt_data_handler to sstables::sstables
  tools/scylla-sstable: make large_data_handler a local
  db: introduce corrupt_data_handler
  mutation: introduce frozen_mutation_fragment_v2
  mutation/mutation_partition_view: read_{clustering,static}_row(): return row type
  mutation/mutation_partition_view: extract de-ser of {clustering,static} row
  idl-compiler.py: generate skip() definition for enums serializers
  idl: extract full_position.idl from position_in_partition.idl
  db/system_keyspace: add apply_mutation()
  db/system_keyspace: introduce the corrupt_data table

2025-06-29 18:18:36 +03:00

alternator

Return correct creation date time in describe table

2025-06-10 15:25:57 +03:00

boost

Merge 'sstables/mx/writer: handle non-full prefix row keys' from Botond Dénes

2025-06-29 18:18:36 +03:00

broadcast_tables

test.py: cql: run tests using bare pytest command

2025-06-03 07:54:51 +00:00

cluster

Merge 'audit: introduce debug level logs on happy path' from Dario Mirovic

2025-06-27 20:10:54 +03:00

cql

test.py: cql: don't exit from pytest session on failed CQL

2025-06-03 07:54:51 +00:00

cqlpy

Merge 'cql, schema: Extend keyspace, table, views, indexes name length limit from 48 to 192 bytes' from Karol Nowacki

2025-06-22 17:41:10 +03:00

ldap

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

lib

Merge 'sstables/mx/writer: handle non-full prefix row keys' from Botond Dénes

2025-06-29 18:18:36 +03:00

manual

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

nodetool

tools/scylla-nodetool: backup: add --move-files parameter

2025-06-27 16:21:39 +03:00

perf

Revert "Merge 'Atomic in-memory schema changes application' from Marcin Maliszkiewicz"

2025-06-16 22:38:12 +03:00

pylib

Merge 'test.py: enhance allure reporting' from Andrei Chekun

2025-06-27 16:22:03 +03:00

pylib_test

…

raft

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

redis

…

resource

…

rest_api

test: rest_api: fix test_repair_task_progress

2025-06-25 09:08:06 +03:00

scylla_gdb

test/scylla_gdb: better error message when running on dev build mode

2025-04-22 15:02:06 +03:00

unit

test.py: clean code that isn't used anymore

2025-06-11 18:29:26 +02:00

__init__.py

test.py: allow cmake configuration and ./configure.py configuration to coexist

2025-06-03 16:46:41 +03:00

CMakeLists.txt

…

conftest.py

test.py: rework testpy_test fixture

2025-05-29 12:15:28 +00:00

pytest.ini

Merge 'test.py: python: run tests using bare pytest command' from Evgeniy Naydanov

2025-05-30 08:48:43 +03:00

README.md

…

README.md

Scylla in-source tests.

For details on how to run the tests, see docs/dev/testing.md

Shared C++ utilities and libraries are in lib/; shared Python code is in pylib/.

* alternator - Python tests which connect to a single server and use the DynamoDB API
* unit, boost, raft - unit tests in C++
* cqlpy - Python tests which connect to a single server and use CQL
* topology* - tests that set up clusters and add/remove nodes
* cql - approval tests that use CQL and pre-recorded output
* rest_api - tests for Scylla REST API Port 9000
* scylla-gdb - tests for scylla-gdb.py helper script
* nodetool - tests for C++ implementation of nodetool

If you can use an existing folder, consider adding your test to it. New folders should be used for new large categories/subsystems, or when the test environment is significantly different from that of an existing suite. For example, you may plan to start Scylla with a different configuration, intend to add many tests, and want them to reuse an existing Scylla cluster (clusters can be reused for tests within the same folder).

To add a new folder, create a new directory, and then copy & edit its suite.ini.