scylladb/service at 796205678ff0b355d4e962a576ebdd94c4bed583 - scylladb - Anomalous Gitea

Asias He da5cc13e97 repair: Fix deadlock when topology coordinator steps down in the middle

Consider this:

1) n1 is the topology coordinator
2) n1 schedules and executes a tablet repair with session id s1 for a
tablet on n3 and n4.
3) n3 and n4 take the lock and store it in _rs._repair_compaction_locks[s1]
4) n1 steps down before it executes
locator::tablet_transition_stage::end_repair
5) n2 becomes the new topology coordinator
6) n2 runs locator::tablet_transition_stage::repair again
7) n3 and n4 try to take the lock again and hang since the lock is
already taken.

To avoid the deadlock, we can throw an exception in step 7 so that n2 will
proceed to end_repair stage and release the lock. After that, the
scheduler could schedule the tablet repair request again.
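
The scenario and the fail-fast fix described above can be sketched roughly as follows. This is a simplified, single-threaded illustration using only the standard library, not the actual ScyllaDB or Seastar code: `repair_service_sketch`, `compaction_lock`, `take_lock`, `end_repair`, `holds_lock` and `replay_failover_scenario` are hypothetical names, and the map only mirrors the role of `_rs._repair_compaction_locks` from the commit message.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <unordered_map>

// Hypothetical stand-ins; the real ScyllaDB types are different.
using session_id = std::string;
struct compaction_lock {};  // placeholder for whatever the node actually holds

class repair_service_sketch {
    // Plays the role of _rs._repair_compaction_locks: one entry per repair session.
    std::unordered_map<session_id, compaction_lock> _repair_compaction_locks;
public:
    // Repair stage (steps 3 and 7): take the lock for a session.
    // If the session already holds it, throw instead of waiting forever.
    void take_lock(const session_id& s) {
        if (_repair_compaction_locks.count(s) != 0) {
            throw std::runtime_error("repair lock already held for session " + s);
        }
        _repair_compaction_locks.emplace(s, compaction_lock{});
    }

    // end_repair stage: release the lock; a no-op if it is not held.
    void end_repair(const session_id& s) {
        _repair_compaction_locks.erase(s);
    }

    bool holds_lock(const session_id& s) const {
        return _repair_compaction_locks.count(s) != 0;
    }
};

// Replays the coordinator failover on one replica (say n3).
// Returns true if the retried repair stage was rejected and the lock
// was released afterwards.
inline bool replay_failover_scenario() {
    repair_service_sketch n3;
    n3.take_lock("s1");           // n1 runs the repair stage, then steps down

    bool rejected = false;
    try {
        n3.take_lock("s1");       // n2 reruns the repair stage for the same session
    } catch (const std::runtime_error&) {
        rejected = true;          // fails fast instead of hanging
    }

    n3.end_repair("s1");          // n2 proceeds to end_repair and releases the lock
    return rejected && !n3.holds_lock("s1");
}
```

Calling `replay_failover_scenario()` walks through steps 1 to 7: the second `take_lock("s1")` throws rather than blocking, so the new coordinator can move on to the end_repair stage, after which the repair can be scheduled again.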

Fixes #26346

Closes scylladb/scylladb#27163

2025-11-28 15:14:39 +01:00

broadcast_tables/experimental

treewide: Move replica related files to replica directory

2025-09-18 08:00:35 +03:00

direct_failure_detector

…

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

paxos_state: get_replica_lock: remove shard check

2025-10-31 21:37:39 +01:00

service/qos: Do not crash Scylla if auth_integration absent

2025-11-10 19:21:36 +01:00

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

address_map.hh

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

cache_hitrate_calculator.hh

…

cas_shard.hh

storage_proxy: add cas_shard class

2025-06-30 10:33:17 +02:00

client_state.cc

cql: allow VECTOR_SEARCH_INDEXING users to select

2025-10-03 16:55:57 +02:00

client_state.hh

Merge 'auth: implement vector store authorization' from Michał Hudobski

2025-10-20 17:32:00 +03:00

CMakeLists.txt

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

endpoint_lifecycle_subscriber.hh

hinted_handoff: drain hints after the target node stops owning tokens

2025-09-24 07:11:59 +02:00

load_broadcaster.hh

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

load_meter.hh

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

maintenance_mode.hh

…

mapreduce_service.cc

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

mapreduce_service.hh

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

memory_limiter.hh

…

migration_listener.hh

migration_manager: pass timestamp to pre_create

2025-11-13 16:59:43 +01:00

migration_manager.cc

db/view/view_building_state: replace task's state with aborted flag

2025-11-25 12:14:04 +01:00

migration_manager.hh

service: attach storage_service to migration_manager using pluggabe

2025-11-14 08:50:19 +01:00

misc_services.cc

replica/table: keep track of total pre-compression file size

2025-11-13 00:49:57 +01:00

query_state.hh

…

session.cc

…

session.hh

…

state_id.hh

…

storage_proxy_fwd.hh

storage_proxy: introduce node_local_only flag

2025-07-24 19:48:08 +02:00

storage_proxy_stats.hh

storage_proxy_stats: add fenced_out_requests metric

2025-09-15 11:24:53 +02:00

storage_proxy.cc

Improve choice distribution for primary replica

2025-11-11 09:18:01 +02:00

storage_proxy.hh

storage_proxy: use gates to track write handlers destruction

2025-11-05 14:37:52 +01:00

storage_service.cc

Revert "storage service: add repair colocated tablets rpc"

2025-11-25 09:06:48 +01:00

storage_service.hh

Revert "storage service: add repair colocated tablets rpc"

2025-11-25 09:06:48 +01:00

tablet_allocator_fwd.hh

…

tablet_allocator.cc

tablet: scheduler: Do not emit conflicting migration in merge colocation

2025-11-28 11:17:12 +01:00

tablet_allocator.hh

…

tablet_operation.hh

…

task_manager_module.cc

api: tasks: task_manager: keep children identities in chunked_{array,vector}

2025-09-15 08:44:16 +03:00

task_manager_module.hh

…

topology_coordinator.cc

repair: Fix deadlock when topology coordinator steps down in the middle

2025-11-28 15:14:39 +01:00

topology_coordinator.hh

topology_coordinator: add service_level_controller reference

2025-10-08 08:24:28 +02:00

topology_guard.hh

…

topology_mutation.cc

topology_coordinator: small start_cleanup refactoring

2025-10-22 16:31:42 +02:00

topology_mutation.hh

topology_coordinator: small start_cleanup refactoring

2025-10-22 16:31:42 +02:00

topology_state_machine.cc

topology_state_machine: inline get_excluded_nodes

2025-11-13 14:18:46 +01:00

topology_state_machine.hh

topology_state_machine: inline get_excluded_nodes

2025-11-13 14:18:46 +01:00

view_update_backlog_broker.hh

…