scylladb/db at 3a67423ac94de4c742ba80c8cebfdd054a407da1 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-23 08:12:08 +00:00

Files

History

Avi Kivity cdae15ced9 Merge '[Backport 6.0] db/view: drop view updates to replaced node marked as left' from ScyllaDB

When a node that is permanently down is replaced, it is marked as "left" but it still can be a replica of some tablets. We also don't keep IPs of nodes that have left and the `node` structure for such node returns an empty IP (all zeros) as the address.

This interacts badly with the view update logic. The base replica paired with the left node might decide to generate a view update. Because storage proxy still uses IPs and not host IDs, it needs to obtain the view replica's IP and tell the storage proxy to write a view update to that node - so, it chooses 0.0.0.0. Apparently, storage proxy decides to write a hint towards this address - hinted handoff on the other hand operates on host IDs and not IPs, so it attempts to translate the IP back, which triggers an assertion as there is no replica with IP 0.0.0.0.

As a quick workaround for this issue just drop view updates towards nodes which seem to have IPs that are all zeros. It would be more proper to keep the view updates as hints and replay them later to the new paired replica, but achieving this right now would require much more significant changes. For now, fixing a crash is more important than keeping views consistent with base replicas.

In addition to the fix, this PR also includes a regression test heavily based on the test that @kbr-scylla prepared during his investigation of the issue.

Fixes: scylladb/scylladb#19439

This issue can cause multiple nodes to crash at once and the fix is quite small, so I think this justifies backporting it to all affected versions. 6.0 and 6.1 are affected. No need to backport to 5.4 as this issue only happens with tablets, and tablets are experimental there.

(cherry picked from commit 6af7882c59)

(cherry picked from commit 5ec8c06561)

 Refs #19765

Closes scylladb/scylladb#19896

* github.com:scylladb/scylladb:
  test: regression test for MV crash with tablets during decommission
  db/view: drop view updates to replaced node marked as left

2024-08-14 22:32:07 +03:00

..

commitlog_replayer: Avoid deprecated sharder::shard_of()

2024-05-16 00:28:47 +02:00

treewide: replace formatter<std::string_view> with formatter<string_view>

2024-04-19 07:44:07 +03:00

db/hints: Make commitlog use commitlog IO scheduling group

2024-08-14 22:15:28 +03:00

…

everywhere: include seastar headers using angle brackets

2024-05-06 10:00:31 +03:00

db/view: drop view updates to replaced node marked as left

2024-07-26 14:02:51 +00:00

batchlog_manager.cc

db/batchlog_manager: bypass cache when scanning batchlog table

2024-06-26 09:05:14 +00:00

batchlog_manager.hh

batchlog_manager, test: initialize delay configuration

2024-05-13 07:57:35 +03:00

cache_tracker.hh

sstables: partition_index_cache: deglobalize stats

2023-09-01 22:34:41 +02:00

chained_delegating_reader.hh

…

CMakeLists.txt

db: auth: move auth tables to system keyspace

2024-06-02 21:41:14 +03:00

config.cc

db/config: introduce reader_concurrency_semahore_cpu_concurrency

2024-07-08 08:06:28 +03:00

config.hh

db/config: introduce reader_concurrency_semahore_cpu_concurrency

2024-07-08 08:06:28 +03:00

consistency_level_type.hh

db: add fmt::format for db::consistency_level

2024-01-12 10:49:00 +02:00

consistency_level_validations.hh

…

consistency_level.cc

treewide: include fmt/ranges.h and/or fmt/std.h

2024-04-19 22:56:16 +08:00

consistency_level.hh

db: consistency_level: remove overload of filter_for_query

2023-06-14 11:41:36 +02:00

cql_type_parser.cc

db/cql_type_parser: use generic topological sorting

2024-05-16 13:30:03 +02:00

cql_type_parser.hh

db/cql_type_parses: futurize raw_builder::build()

2024-05-16 13:30:03 +02:00

data_listeners.cc

db: do not include unused headers

2024-01-09 11:44:19 +02:00

data_listeners.hh

…

extensions.cc

db::extentions: Add "extensions internal" keyspace set

2023-03-27 15:12:31 +00:00

extensions.hh

db: do not include unused headers

2024-01-09 11:44:19 +02:00

heat_load_balance.cc

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

heat_load_balance.hh

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

large_data_handler.cc

sstable: added cluster feature for dead rows and range tombstones

2024-05-02 11:49:46 +02:00

large_data_handler.hh

sstable: added cluster feature for dead rows and range tombstones

2024-05-02 11:49:46 +02:00

legacy_schema_migrator.cc

treewide: remove {dclocal_,}read_repair_chance options

2024-04-25 17:15:27 +08:00

legacy_schema_migrator.hh

db: Add sharded<system_keyspace>& to legacy_schema_migrator

2023-07-21 12:38:46 +03:00

operation_type.hh

db: add formatter for db::operation_type

2024-01-19 10:16:41 +02:00

paxos_grace_seconds_extension.hh

schema_extensions: Add an option to string method

2024-06-18 14:13:51 +00:00

per_partition_rate_limit_extension.hh

…

per_partition_rate_limit_info.hh

…

per_partition_rate_limit_options.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

per_partition_rate_limit_options.hh

…

rate_limiter.cc

…

rate_limiter.hh

…

read_repair_decision.hh

db: add formatter for db::read_repair_decision

2024-01-29 15:43:51 +02:00

schema_features.hh

feature: grandfather PER_TABLE_PARTITIONERS

2024-05-18 00:15:07 +03:00

schema_tables.cc

schema_tables: remove unused code

2024-06-05 13:55:28 +00:00

schema_tables.hh

db/cql_type_parses: futurize raw_builder::build()

2024-05-16 13:30:03 +02:00

size_estimates_virtual_reader.cc

range.hh: retire

2024-02-21 00:24:25 +02:00

size_estimates_virtual_reader.hh

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

snapshot-ctl.cc

snapshot: Get per-table snapshot size under snapshot lock

2024-04-25 10:05:51 +03:00

snapshot-ctl.hh

snapshot: Get per-table snapshot size under snapshot lock

2024-04-25 10:05:51 +03:00

sstables-format-selector.cc

feature: grandfather ME_SSTABLE feature

2024-05-17 20:41:19 +03:00

sstables-format-selector.hh

feature: grandfather ME_SSTABLE feature

2024-05-17 20:41:19 +03:00

system_distributed_keyspace.cc

service:qos: extract common service levels' table functions

2024-03-21 23:14:57 +01:00

system_distributed_keyspace.hh

db: do not include unused headers

2024-01-09 11:44:19 +02:00

system_keyspace_sstables_registry.hh

sstables_manager: decouple from system_keyspace

2024-03-18 20:38:07 +03:00

system_keyspace_view_types.hh

…

system_keyspace.cc

config: Remove experimental TABLETS feature

2024-06-03 12:16:41 +03:00

system_keyspace.hh

db: auth: move auth tables to system keyspace

2024-06-02 21:41:14 +03:00

timeout_clock.hh

db: do not include unused headers

2024-01-09 11:44:19 +02:00

virtual_table.cc

db: do not include unused headers

2024-01-09 11:44:19 +02:00

virtual_table.hh

db: do not include unused headers

2024-01-09 11:44:19 +02:00

virtual_tables.cc

replica: get rid of fragile compaction group intrusive list

2024-08-13 12:26:11 -03:00

virtual_tables.hh

virtual_tables: scope virtual tables registry in system_keyspace

2023-12-21 16:19:42 +02:00

write_type.hh

db: add formatter for db::write_type

2024-02-01 10:22:45 +02:00