scylladb/api at 9d95e0e6ba59294c29d234c3836e3070fac80ae3 - scylladb - Anomalous Gitea

mirrors/scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 04:06:59 +00:00

Marcin Maliszkiewicz 9d95e0e6ba Merge 'storage_service: fix REST API races during shutdown and cross-shard forwarding' from Piotr Smaron

REST route removal unregisters handlers but does not wait for requests
that have already entered storage_service.  A request can therefore
suspend inside an async operation while restart proceeds to tear the
service down; the coroutine then resumes against destroyed members such
as _topology_state_machine, _group0, or _sys_ks.  This is a
use-after-destruction bug that surfaces as UBSAN dynamic-type failures
(e.g. the crash seen from topology_state_load()).

Fix this by holding storage_service::_async_gate from the entry
boundary of every externally-triggered async operation so that stop()
drains them before teardown begins.  The gate is acquired in
run_with_api_lock, run_with_no_api_lock, and in individual REST
handlers that bypass those wrappers (reload_raft_topology_state,
mark_excluded, removenode, schema reload, topology-request
waits/abort, cleanup, ring/schema queries, SSTable dictionary
training/publish, and sampling).
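
The gate idea described above can be sketched in plain C++.  This is a
minimal single-threaded model, not the Seastar API and not the code from
this merge: simple_gate, holder, hold() and close() are hypothetical
names, and the callback passed to close() stands in for the future that
an asynchronous stop() would wait on.

```cpp
#include <cassert>
#include <functional>
#include <stdexcept>
#include <utility>

// Hypothetical, single-threaded stand-in for a Seastar-style gate.
// It counts in-flight operations and defers teardown until they finish.
class simple_gate {
    unsigned _count = 0;
    bool _closed = false;
    std::function<void()> _on_drained;

    // Run the teardown callback once the gate is closed and empty.
    void maybe_drained() {
        if (_closed && _count == 0 && _on_drained) {
            auto cb = std::move(_on_drained);
            _on_drained = nullptr;
            cb();
        }
    }

    void leave() {
        assert(_count > 0);
        --_count;
        maybe_drained();
    }

public:
    // RAII token: the operation counts as in flight while a holder is alive.
    class holder {
        simple_gate* _g;
    public:
        explicit holder(simple_gate& g) : _g(&g) {}
        holder(holder&& o) noexcept : _g(o._g) { o._g = nullptr; }
        holder(const holder&) = delete;
        holder& operator=(const holder&) = delete;
        ~holder() { if (_g) { _g->leave(); } }
    };

    // Called at the entry boundary of every externally triggered operation.
    holder hold() {
        if (_closed) {
            throw std::runtime_error("gate closed");
        }
        ++_count;
        return holder(*this);
    }

    // Stop admitting new work; invoke on_drained after the last holder is gone.
    void close(std::function<void()> on_drained) {
        _closed = true;
        _on_drained = std::move(on_drained);
        maybe_drained();
    }
};
```

In this model a handler takes a holder before touching any service
members, so a close() issued while the handler is suspended only runs
the teardown callback after that holder is destroyed, and any handler
that arrives later is rejected by hold().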

Additionally, fix get_ownership() and abort_topology_request(), which
forward work to shard 0 but still referenced the caller shard's
`this` pointer instead of the destination-shard instance, causing
silent cross-shard access to shard-local state.

Add a cluster regression test that repeatedly exercises the multi-shard
ownership REST path to cover the forwarding fix.
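
The forwarding mistake can be illustrated with a small model.  This is a
sketch under stated assumptions, not the actual storage_service code:
service, cluster, invoke_on, peers and the two get_ownership_* helpers
are hypothetical, and the per-shard instances are plain objects in a
vector rather than objects pinned to separate CPU cores.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical per-shard service instance.
struct service {
    unsigned shard_id;
    int ownership;                // shard-local state; shard 0 is authoritative here
    std::vector<service>* peers;  // hypothetical stand-in for the sharded container

    // Run fn against the instance that lives on the destination shard.
    template <typename F>
    auto invoke_on(std::size_t shard, F fn) {
        return fn(peers->at(shard));
    }

    // Buggy shape: work is forwarded to shard 0, but the lambda still reads
    // through the caller's `this`, so it sees the caller shard's state.
    int get_ownership_buggy() {
        return invoke_on(0, [this](service&) { return ownership; });
    }

    // Fixed shape: use only the destination instance handed to the lambda.
    int get_ownership_fixed() {
        return invoke_on(0, [](service& ss) { return ss.ownership; });
    }
};

// Owns all shard instances; non-copyable so the peers pointers stay valid.
struct cluster {
    std::vector<service> shards;

    explicit cluster(unsigned n) {
        for (unsigned i = 0; i < n; ++i) {
            shards.push_back(service{i, i == 0 ? 42 : -1, &shards});
        }
    }
    cluster(const cluster&) = delete;
    cluster& operator=(const cluster&) = delete;
};
```

Here the buggy variant merely returns the wrong shard's value.  In a real
shard-per-core runtime the same capture would touch another shard's
memory from the wrong core, which is why the commit message calls the
access silent.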

Fixes: SCYLLADB-1415

Should be backported to all branches; the code was introduced around the 2024.1 release.

Closes scylladb/scylladb#29373

* github.com:scylladb/scylladb:
  storage_service: fix shard-0 forwarding in REST helpers
  storage_service: gate REST-facing async operations during shutdown
  storage_service: prepare for async gate in REST handlers

(cherry picked from commit 4043d95810)

Closes scylladb/scylladb#29611

2026-04-27 13:57:32 +02:00

Revert "api: storage_service/tablets/repair: disable incremental repair by default"

2026-01-21 08:50:13 +02:00

api_init.hh

api: implement client_routes endpoints

2025-12-15 17:36:47 +01:00

api.cc

api: implement client_routes endpoints

2025-12-15 17:36:47 +01:00

api.hh

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

authorization_cache.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

authorization_cache.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

cache_service.cc

api: Capture and use db in cache_service handlers

2025-08-26 11:50:11 +03:00

cache_service.hh

api: Add sharded<database>& arg to set_cache_service()

2025-08-26 11:49:35 +03:00

client_routes.cc

db: api: service: Fix ClientConnectorError in test_client_routes

The bug was caused by capturing local variables by reference in lambdas passed to with_retry(), which is a coroutine. When the coroutine suspends, the lambda frame exits and the referenced locals are destroyed, leading to use-after-lifetime issues.

This change fixes the problem by ensuring safe ownership across suspension points and also refactors how route_keys and route_entries are passed from the caller. Previously they were passed as const lvalue references, which cannot be moved and therefore ended up being repeatedly copied across function calls and lambda invocations. The new approach avoids unnecessary copies and makes the lifetime semantics explicit and safe.
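
The ownership rule behind that fix can be sketched without a coroutine runtime. This is a simplified model, not the client_routes code: a std::function that is invoked after its creator has returned stands in for a coroutine that resumes after suspension, and deferred and make_task are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical deferred task: models work that runs after the frame that
// created it is gone, as a resumed coroutine does.
using deferred = std::function<std::size_t()>;

// Unsafe shape, shown only as a comment because calling it is undefined
// behaviour:
//
//   deferred make_task_by_ref() {
//       std::vector<std::string> keys{"a", "b"};
//       return [&keys] { return keys.size(); };  // keys dies on return
//   }

// Safe shape: take the argument by value and move it into the closure, so
// the task owns its data and no extra copy is made along the way.
deferred make_task(std::vector<std::string> route_keys) {
    return [keys = std::move(route_keys)] { return keys.size(); };
}
```

A caller that is finished with its vector can pass it with std::move, which hands the buffer over without copying, while a caller that still needs it passes an lvalue and pays for exactly one copy.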

2025-12-22 14:52:47 +02:00

client_routes.hh

api: implement client_routes endpoints

2025-12-15 17:36:47 +01:00

CMakeLists.txt

api: implement client_routes endpoints

2025-12-15 17:36:47 +01:00

collectd.cc

tree: Remove unused boost headers

2025-02-15 20:32:22 +02:00

collectd.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

column_family.cc

replica/table: keep track of total pre-compression file size

2025-11-13 00:49:57 +01:00

column_family.hh

api: Remove system_keyspace ref from column_family API block

2025-10-03 13:50:22 +03:00

commitlog.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

commitlog.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

compaction_manager.cc

build: switch to Seastar API_LEVEL 8 (noncopyable_function in json)

2025-09-29 08:33:49 +03:00

compaction_manager.hh

compaction: move code to namespace compaction

2025-09-25 15:03:56 +03:00

config.cc

api: Remove unused get_json_return_type() templates

2025-07-05 18:42:02 +03:00

config.hh

Merge 'Complete implementation of configuring IO bandwidth limits' from Pavel Emelyanov

2025-01-14 07:56:38 -05:00

cql_server_test.cc

treewide: include build_mode.hh for SCYLLA_BUILD_MODE_RELEASE where it is missing

2025-02-20 10:50:04 +03:00

cql_server_test.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

endpoint_snitch.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

endpoint_snitch.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

error_injection.cc

api: Switch to request content streaming

2025-10-06 16:43:26 +03:00

error_injection.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

failure_detector.cc

api/failure_detector.cc: stream endpoints

2025-06-25 11:28:37 +03:00

failure_detector.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

gossiper.cc

api: Use std::ranges to stringify collections

2025-06-02 20:09:56 +03:00

gossiper.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

hinted_handoff.cc

hints: move create_hint_sync_point function to host ids

2025-01-15 16:30:28 +02:00

hinted_handoff.hh

hints: move create_hint_sync_point function to host ids

2025-01-15 16:30:28 +02:00

lsa.cc

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

lsa.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

messaging_service.cc

messaging_service: pass host id to remove_rpc_client in down notification

2025-03-11 12:09:22 +02:00

messaging_service.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

raft.cc

treewide: move away from accessing httpd::request::query_parameters

2025-09-24 11:52:15 +03:00

raft.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

scrub_status.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

service_levels.cc

treewide: use angle brackets for including seastar headers

2025-03-17 10:03:06 +02:00

service_levels.hh

api: include "smaller" header

2025-01-06 13:04:33 +02:00

storage_proxy.cc

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

storage_proxy.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

storage_service.cc

Merge 'storage_service: fix REST API races during shutdown and cross-shard forwarding' from Piotr Smaron

2026-04-27 13:57:32 +02:00

storage_service.hh

compaction: move code to namespace compaction

2025-09-25 15:03:56 +03:00

stream_manager.cc

Merge 'Complete implementation of configuring IO bandwidth limits' from Pavel Emelyanov

2025-01-14 07:56:38 -05:00

stream_manager.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

system.cc

Revert "Merge 'db/config: enable ms sstable format by default' from Michał Chojnowski"

2025-12-02 14:38:56 +02:00

system.hh

db: get rid of sstables-format-selector

2025-09-19 16:17:56 +03:00

task_manager_test.cc

treewide: move away from accessing httpd::request::query_parameters

2025-09-24 11:52:15 +03:00

task_manager_test.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

task_manager.cc

treewide: use coroutine::maybe_yield in coroutines

2026-01-12 10:38:47 +01:00

task_manager.hh

api: task_manager: pass gossiper to api::set_task_manager

2025-02-05 10:10:29 +01:00

tasks.cc

db: snapshot_ctl: move skip_flush to struct snapshot_options

2026-01-22 09:12:56 +02:00

tasks.hh

treewide: relicense to ScyllaDB-Source-Available-1.0

2024-12-18 17:45:13 +02:00

token_metadata.cc

nodetool: status: Show excluded nodes as having status 'X'

2025-10-31 09:03:20 +01:00

token_metadata.hh

api: do not use token_metadata to retrieve ip to id mapping in token_metadata RESTful endpoints

2025-01-15 16:30:28 +02:00