scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-01 04:26:48 +00:00

Files

Duarte Nunes 0a561fc326 gms/gossiper: Synchronize endpoint state destruction

In gossiper::handle_major_state_change() we set the endpoint_state for
a particular endpoint and replicate the changes to other cores.

This is totally unsynchronized with the execution of
gossiper::evict_from_membership(), which can happen concurrently, and
can remove the very same endpoint from the map  (in all cores).

Replicating the changes to other cores in handle_major_state_change()
can interleave with replicating the changes to other cores in
evict_from_membership(), and result in an undefined final state.

Another issue happened in debug mode dtests, where a fiber executes
handle_major_state_change(), calls into the subscribers, of which
storage_service is one, and ultimately lands on
storage_service::update_peer_info(), which iterates over the
endpoint's application state with deferring points in between (to
update a system table). gossiper::evict_from_membership() was executed
concurrently by another fiber, which freed the state the first one is
iterating over.

Fixes #3299.

Signed-off-by: Duarte Nunes <duarte@scylladb.com>
Message-Id: <20180318123211.3366-1-duarte@scylladb.com>
(cherry picked from commit 810db425a5)

2018-03-18 14:54:54 +02:00

application_state.cc

gossip: Print SCHEMA_TABLES_VERSION correctly

2017-09-26 08:38:28 +02:00

application_state.hh

service: Advertise schema tables format version through gossip

2017-07-07 19:07:59 +02:00

endpoint_state.cc

gms/endpoint_state: Remove get_application_state()

2017-10-11 10:02:32 +01:00

endpoint_state.hh

gossiper: Replicate endpoint_state::is_alive()