scylladb/service at 6601c778ddc4bf699b79be5dfc91c20c384ebc01 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 04:56:58 +00:00

Files

History

Kamil Braun 9212bdc6b1 migration_manager: more verbose logging for schema versions

We're observing nodes getting stuck during bootstrap inside
`storage_service::wait_for_ring_to_settle()`, which periodically checks
`migration_manager::have_schema_agreement()` until it becomes `true`:
scylladb/scylladb#15393.

There is no obvious reason why that happens -- according to the nodes'
logs, their latest in-memory schema version is the same.

So either the gossiped schema version is for some reason different
(perhaps there is a race in publishing `application_state::SCHEMA`) or
missing entirely.

Alternatively, `wait_for_ring_to_settle` is leaving the
`have_schema_agreement` loop and getting stuck in
`update_topology_change_info` trying to acquire a lock.

Modify logging inside `have_schema_agreement` so details about missing
schema or version mismatch are logged on INFO level, and an INFO level
message is printed before we return `true`. To prevent logs from getting
spammed, rate-limit the periodic messages to once every 5 seconds. This
will still show the reason in our tests which allow the node to hang for
many minutes before timing out. Also these schema agreement checks are
done on relatively rare occasions such as bootstrap, so the additional
logs should not be harmful.

Furthermore, when publishing schema version to gossip, log it on INFO
level. This is happening at most once per schema change so it's a rare
message. If there's a race in publishing schema versions, this should
allow us to observe it.

Ref: scylladb/scylladb#15393

Closes scylladb/scylladb#16021

2023-11-14 11:24:47 +02:00

..

broadcast_tables/experimental

raft: add description argument to add_entry_unguarded

2023-07-07 13:11:44 +02:00

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

table: rename cache_truncation_record -> set_truncation_time

2023-10-03 17:11:35 +04:00

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

raft topology: join: do not time out waiting for the node to be joined

2023-11-10 12:36:37 +01:00

cache_hitrate_calculator.hh

…

client_state.cc

client_state: co-routinise has_column_family_access function

2023-06-22 15:26:20 +03:00

client_state.hh

cql3: move validation::validate_column_family from client_state::has_column_family_access

2023-06-22 13:57:36 +03:00

CMakeLists.txt

build: cmake: add check-header target

2023-11-13 10:27:06 +02:00

endpoint_lifecycle_subscriber.hh

…

forward_service.cc

forward_service: Remove .shutdown() method

2023-09-26 10:39:22 +03:00

forward_service.hh

forward_service: Remove .shutdown() method

2023-09-26 10:39:22 +03:00

load_broadcaster.hh

gms: pass endpoint_state_ptr to endpoint_state change subscribers

2023-08-31 09:35:15 +03:00

load_meter.hh

load-meter: Remove unused get_load_string

2023-05-15 09:21:08 +03:00

memory_limiter.hh

…

migration_listener.hh

migration_listener: add parameter to on_before_create_column_family

2023-10-31 12:08:03 +01:00

migration_manager.cc

migration_manager: more verbose logging for schema versions

2023-11-14 11:24:47 +02:00

migration_manager.hh

migration_manager: add new prepare_new_column_family_announcement

2023-10-31 12:08:03 +01:00

misc_services.cc

Merge 'Gossiper endpoint locking' from Benny Halevy

2023-08-02 13:50:08 +02:00

query_state.hh

treewide: reduce include of cql_statement.hh

2023-09-08 13:23:50 +03:00

storage_proxy_stats.hh

storage_proxy: Make split_stats resilient to being called from different scheduling group

2023-06-21 10:08:27 +03:00

storage_proxy.cc

db: view: run local materialized view mutations on a separate smp service group

2023-10-29 18:30:32 +02:00

storage_proxy.hh

db: view: run local materialized view mutations on a separate smp service group

2023-10-29 18:30:32 +02:00

storage_service.cc

Merge 'raft topology: join: do not time out waiting for the node to be joined' from Patryk Jędrzejczak

2023-11-13 15:02:27 +01:00

storage_service.hh

Merge 'cleanup no longer used gossiper states' from Gleb

2023-11-07 11:48:04 +01:00

tablet_allocator.cc

tablet_allocator: update on_before_create_column_family

2023-10-31 12:08:03 +01:00

tablet_allocator.hh

tablets, raft topology: Add support for decommission with tablets

2023-09-14 13:05:49 +02:00

topology_state_machine.cc

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

topology_state_machine.hh

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

view_update_backlog_broker.hh

gms: pass endpoint_state_ptr to endpoint_state change subscribers

2023-08-31 09:35:15 +03:00