scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 16:03:20 +00:00

Files

Avi Kivity f0950e023d Merge 'Split CDC streams table partitions into clustered rows ' from Kamil Braun

Until now, the lists of streams in the `cdc_streams_descriptions` table
for a given generation were stored in a single collection. This solution
has multiple problems when dealing with large clusters (which produce
large lists of streams):
1. large allocations
2. reactor stalls
3. mutations too large to even fit in commitlog segments

This commit changes the schema of the table as described in issue #7993.
The streams are grouped according to token ranges, each token range
being represented by a separate clustering row. Rows are inserted in
reasonably large batches for efficiency.

The table is renamed to enable easy upgrade. On upgrade, the latest CDC
generation's list of streams will be (re-)inserted into the new table.

Yet another table is added: one that contains only the generation
timestamps clustered in a single partition. This makes it easy for CDC
clients to learn about new generations. It also enables an elegant
two-phase insertion procedure of the generation description: first we
insert the streams; only after ensuring that a quorum of replicas
contains them, we insert the timestamp. Thus, if any client observes a
timestamp in the timestamps table (even using a ONE query),
it means that a quorum of replicas must contain the list of streams.

---

Nodes automatically ensure that the latest CDC generation's list of
streams is present in the streams description table. When a new
generation appears, we only need to update the table for this
generation; old generations are already inserted.

However, we've changed the description table (from
`cdc_streams_descriptions` to `cdc_streams_descriptions_v2`). The
existing mechanism only ensures that the latest generation appears in
the new description table. We add an additional procedure that
rewrites the older generations as well, if we find that it is necessary
to do so (i.e. when some CDC log tables may contain data in these
generations).

Closes #8116

* github.com:scylladb/scylla:
  tests: add a simple CDC cql pytest
  cdc: add config option to disable streams rewriting
  cdc: rewrite streams to the new description table
  cql3: query_processor: improve internal paged query API
  cdc: introduce no_generation_data_exception exception type
  docs: cdc: mention system.cdc_local table
  cdc: coroutinize do_update_streams_description
  sys_dist_ks: split CDC streams table partitions into clustered rows
  cdc: use chunked_vector for streams in streams_version
  cdc: remove `streams_version::expired` field
  system_distributed_keyspace: use mutation API to insert CDC streams
  storage_service: don't use `sys_dist_ks` before it is started

2021-02-18 12:49:43 +02:00

aggregate_fcts_test.cc

cql3: Use correct comparator in timeuuid min/max

2021-01-13 11:07:29 +02:00

allocation_strategy_test.cc

…

alternator_base64_test.cc

build: Be consistent about system versus regular headers

2020-06-10 15:49:51 +03:00

anchorless_list_test.cc

…

auth_passwords_test.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

auth_resource_test.cc

…

auth_test.cc

auth: Permit ALTER options on system_auth tables

2020-11-16 22:32:32 -05:00

batchlog_manager_test.cc

storage_proxy: un-hardcode force sync flag for mutate_locally(mutation) overload

2020-07-16 16:38:48 +03:00

big_decimal_test.cc

big_decimal: Add a test for a corner case

2020-06-25 15:37:23 -07:00

bptree_test.cc

bptree: Special intra-node key search when possible

2020-08-06 15:41:31 +03:00

broken_sstable_test.cc

tree-wide: use sstables::make_reader() instead of the read_.*row.*_flat() methods

2021-01-27 17:38:17 +02:00

btree_test.cc

utils: Intrusive B-tree (with tests)

2021-02-02 09:30:29 +03:00

bytes_ostream_test.cc

test: stop using BOOST_TEST_MESSAGE() for logging

2020-03-05 11:38:11 +03:00

cache_flat_mutation_reader_test.cc

mutation_partition: Switch cache of rows onto B-tree

2021-02-02 09:30:30 +03:00

cached_file_test.cc

utils: Introduce cached_file

2020-06-16 16:15:23 +02:00

caching_options_test.cc

…

canonical_mutation_test.cc

…

cartesian_product_test.cc

…

castas_fcts_test.cc

test: avoid using literal suffix 'd'

2020-09-21 16:32:53 +03:00

cdc_generation_test.cc

cdc: Limit size of topology description

2021-02-17 13:24:40 +01:00

cdc_test.cc

sys_dist_ks: split CDC streams table partitions into clustered rows

2021-02-18 11:44:59 +01:00

cell_locker_test.cc

…

checksum_utils_test.cc

everywhere: Don't assume sstring::begin() and sstring::end() are pointers

2020-03-10 13:13:48 -07:00

chunked_vector_test.cc

utils/chunked_vector: add reserve_partial()

2020-11-02 18:02:01 +02:00

clustering_ranges_walker_test.cc

clustering_range_walker: fix false discontiguity detected after a static row

2021-02-01 19:32:07 +02:00

column_mapping_test.cc

lwt: add column_mapping history persistence tests

2020-10-15 19:25:24 +03:00

commitlog_test.cc

test: commitlog_test: test_allocation_failure: fill memory using smaller allocations

2021-02-03 12:21:20 +02:00

compound_test.cc

keys, compound: switch from bytes_view to managed_bytes_view

2021-01-08 14:16:08 +01:00

compress_test.cc

…

config_test.cc

alternator: guard streams with an experimental flag

2020-11-12 12:36:16 +01:00

continuous_data_consumer_test.cc

test: everywhere: use seastar::testing::local_random_engine

2021-01-13 11:07:29 +02:00

counter_test.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

cql_auth_query_test.cc

…

cql_auth_syntax_test.cc

cql3: return raw::parsed_statement as unique_ptr

2020-03-23 23:19:21 +03:00

cql_functions_test.cc

cql3: Use correct comparator in timeuuid min/max

2021-01-13 11:07:29 +02:00

cql_query_group_test.cc

cql3: Delete some newlines

2020-10-19 15:40:55 -04:00

cql_query_large_test.cc

test: Split cql_query_test

2020-03-16 20:27:45 +03:00

cql_query_like_test.cc

cql3/restrictions: Use free functions instead of methods

2020-07-07 23:08:09 +02:00

cql_query_test.cc

tests: Adjusted tests for DC checking in NTS

2021-02-09 08:29:35 +01:00

crc_test.cc

…

data_listeners_test.cc

code: Force formatting of pointer in .debug and .trace

2020-08-26 20:44:11 +03:00

database_test.cc

migration_manager: drop announce_locally flag

2021-01-03 13:58:09 +02:00

double_decker_test.cc

test: switch lsa-related tests (imr_test and double_decker_test) to seastar framework

2020-10-30 08:06:04 +02:00

duration_test.cc

…

dynamic_bitset_test.cc

tests: dynamic_bitset_test: don't exhaust random number entropy

2020-05-26 20:46:45 +03:00

enum_option_test.cc

codebase wide: replace count with contains

2020-08-15 20:26:02 +03:00

enum_set_test.cc

…

error_injection_test.cc

utils: add timeout error injection with lambda

2020-06-03 14:44:00 +02:00

estimated_histogram_test.cc

approx_exponential_histogram: Makes the implementation clearer

2020-06-18 14:18:21 +03:00

extensions_test.cc

Mark CDC as GA

2020-11-12 12:36:13 +01:00

filtering_test.cc

cql3: Drop superfluous ALLOW FILTERING

2020-10-19 15:38:11 -04:00

flat_mutation_reader_test.cc

flat_mutation_reader: return future from next_partition

2021-01-13 17:35:07 +02:00

flush_queue_test.cc

test: everywhere: use seastar::testing::local_random_engine

2021-01-13 11:07:29 +02:00

fragmented_temporary_buffer_test.cc

…

frozen_mutation_test.cc

mutation_fragment: add schema and permit

2020-09-28 11:27:23 +03:00

gossip_test.cc

main: start a shared_token_metadata

2020-11-11 14:20:23 +02:00

gossiping_property_file_snitch_test.cc

storage-service: Subscribe to snitch to update topology

2021-01-13 16:41:34 +03:00

hash_test.cc

…

hashers_test.cc

test: add hashers_test

2021-01-15 18:28:24 +01:00

idl_test.cc

idl: add unit-test for const specifiers feature

2020-12-15 16:03:18 +03:00

index_with_paging_test.cc

test: Split secondary_index test

2020-03-16 20:26:34 +03:00

input_stream_test.cc

test: everywhere: use seastar::testing::local_random_engine

2021-01-13 11:07:29 +02:00

intrusive_array_test.cc

intrusive-array: Array with trusted bounds

2020-07-14 16:29:49 +03:00

json_cql_query_test.cc

treewide: use utils::multiprecision_int for varint implementation

2020-03-04 13:28:16 +02:00

json_test.cc

test: move json tests from manual/ to boost/

2020-07-06 11:24:12 +03:00

keys_test.cc

keys, compound: switch from bytes_view to managed_bytes_view

2021-01-08 14:16:08 +01:00

large_paging_state_test.cc

cql3: Drop superfluous ALLOW FILTERING

2020-10-19 15:38:11 -04:00

like_matcher_test.cc

tests: like_matcher_test: adjust for C++20 char8_t

2020-05-13 09:37:39 +03:00

limiting_data_source_test.cc

…

linearizing_input_stream_test.cc

test: stop using BOOST_TEST_MESSAGE() for logging

2020-03-05 11:38:11 +03:00

loading_cache_test.cc

test: everywhere: use seastar::testing::local_random_engine

2021-01-13 11:07:29 +02:00

log_heap_test.cc

…

logalloc_test.cc

Merge 'managed_bytes: switch to explicit linearization' from Michał Chojnowski

2021-01-18 11:01:28 +02:00

managed_bytes_test.cc

test: add managed_bytes_test

2021-01-15 18:21:13 +01:00

managed_vector_test.cc

…

map_difference_test.cc

…

memtable_test.cc

mutation_fragment: s/as_mutable_clustering_row/mutate_as_clustering_row/

2020-09-28 10:53:56 +03:00

multishard_combining_reader_as_mutation_source_test.cc

multishard_combining_reader: add permit parameter

2020-10-12 15:56:56 +03:00

multishard_mutation_query_test.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

murmur_hash_test.cc

…

mutation_fragment_test.cc

keys, compound: switch from bytes_view to managed_bytes_view

2021-01-08 14:16:08 +01:00

mutation_query_test.cc

flat_mutation_reader: impl: add reader_permit parameter

2020-09-28 10:53:48 +03:00

mutation_reader_test.cc

evictable_reader: reset _range_override after fast-forwarding

2021-02-17 19:11:00 +02:00

mutation_test.cc

test: mutation_test: remove an obsolete assertion

2021-02-16 21:35:14 +01:00

mutation_writer_test.cc

mutation_writer: bucket_writer: add close

2021-01-19 19:03:58 +02:00

mvcc_test.cc

partition_version: Change range_tombstones() to return chunked_vector

2020-10-26 11:54:42 +02:00

network_topology_strategy_test.cc

test: everywhere: use seastar::testing::local_random_engine

2021-01-13 11:07:29 +02:00

nonwrapping_range_test.cc

test: check sizes before dereferencing the vector

2020-07-23 16:49:35 +03:00

observable_test.cc

…

partitioner_test.cc

dht: Add find_first_token_for_shard

2020-04-22 18:24:54 +02:00

querier_cache_test.cc

reader_concurrency_semaphore: separate set_notify_handler from register_inactive_reader

2021-02-08 22:31:01 +02:00

query_processor_test.cc

cql3: query_processor: improve internal paged query API

2021-02-18 11:44:59 +01:00

raft_etcd_test.cc

raft: tidy up follower_progress API

2021-02-16 23:15:16 +03:00

raft_fsm_test.cc

raft: add a unit test for voting

2021-02-16 23:15:16 +03:00

raft_sys_table_storage_test.cc

raft: joint consensus, use unordered_set for server_address list

2021-01-29 22:07:07 +03:00

range_assert.hh

…

range_test.cc

range_test: Add cases for singular intersection

2020-06-18 12:38:31 +03:00

range_tombstone_list_assertions.hh

…

range_tombstone_list_test.cc

test: Enhance test for range_tombstone_list de-overlapping

2020-12-28 18:26:48 +02:00

restrictions_test.cc

cql3: Fix range computation for p=1 AND p=1

2020-12-16 14:46:48 -05:00

reusable_buffer_test.cc

test: stop using BOOST_TEST_MESSAGE() for logging

2020-03-05 11:38:11 +03:00

role_manager_test.cc

…

row_cache_test.cc

treewide: explicitly use flat_mutation_reader_opt

2021-02-17 17:57:34 +02:00

schema_change_test.cc

raft: use null_sharder for raft tables

2021-02-01 18:52:04 +02:00

schema_changes_test.cc

tests: don't pass temporary ranges to readers

2021-01-27 17:38:17 +02:00

schema_registry_test.cc

test: move config to heap in schema_registry_test

2020-03-25 14:19:30 +01:00

secondary_index_test.cc

cql3: Fix value_for when restriction is impossible

2020-12-16 15:00:29 -05:00

serialization_test.cc

gms::inet_address: Fix sign extension error in custom address formatting

2020-04-12 17:48:44 +03:00

serialized_action_test.cc

test: serialized_action_test: add test_serialized_action_exception

2020-10-14 16:45:21 +03:00

small_vector_test.cc

…

snitch_reset_test.cc

storage-service: Subscribe to snitch to update topology

2021-01-13 16:41:34 +03:00

sstable_3_x_test.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

sstable_conforms_to_mutation_source_test.cc

test: sstable_conforms_to_mutation_source_test: remove references to test_sstables_manager

2020-09-23 20:55:12 +03:00

sstable_datafile_test.cc

imr: switch back to open-coded description of structures

2021-02-16 23:43:07 +01:00

sstable_directory_test.cc

sstables: sstable_writer_config: add origin member

2021-02-01 16:45:52 +02:00

sstable_move_test.cc

sstable: create_links: support for move

2020-11-09 19:57:40 +02:00

sstable_mutation_test.cc

tree-wide: use sstables::make_reader() instead of the read_.*row.*_flat() methods

2021-01-27 17:38:17 +02:00

sstable_resharding_test.cc

test: sstable_resharding_test: prepare for asynchronously closed sstables_manager

2020-09-23 20:55:13 +03:00

sstable_test.cc

tree-wide: use sstables::make_reader() instead of the read_.*row.*_flat() methods

2021-01-27 17:38:17 +02:00

sstable_test.hh

test: Add test for TWCS interposer on memtable flush

2021-01-04 16:55:06 -03:00

stall_free_test.cc

utils: Add merge_to_gently

2020-08-11 10:37:34 +08:00

storage_proxy_test.cc

token_metdata: futurize update_normal_tokens

2020-12-22 10:35:15 +02:00

suite.yaml

test: enable raft tests

2020-11-03 21:30:11 +03:00

test_table.cc

keys, compound: switch from bytes_view to managed_bytes_view

2021-01-08 14:16:08 +01:00

top_k_test.cc

…

total_order_check.hh

test: stop using BOOST_TEST_MESSAGE() for logging

2020-03-05 11:38:11 +03:00

transport_test.cc

…

types_test.cc

Validate ascii values when creating from CQL

2020-11-02 16:47:32 +02:00

user_function_test.cc

user_function: throw on_internal_error if executed outside a seastar thread

2021-02-02 13:03:39 +02:00

user_types_test.cc

test: Restore a case in user_types_test

2020-08-16 13:49:55 +03:00

utf8_test.cc

test: utf8: add fragmented buffer validation tests

2020-10-21 11:14:44 +03:00

UUID_test.cc

test: add tests for legacy uuid compare & msb monotonicity

2021-01-21 13:03:59 +03:00

view_build_test.cc

sstables: sstable_writer_config: add origin member

2021-02-01 16:45:52 +02:00

view_complex_test.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

view_schema_ckey_test.cc

test: Split view_schema_test

2020-03-16 20:27:45 +03:00

view_schema_pkey_test.cc

test: Split view_schema_test

2020-03-16 20:27:45 +03:00

view_schema_test.cc

tests: mv: Test dropping columns from base table

2020-08-20 14:53:07 +02:00

vint_serialization_test.cc

…

virtual_reader_test.cc

…