scylladb/replica/table.cc at copilot/fix-task-api-docs-and-node-ops

Files

Raphael S. Carvalho 74ecedfb5c replica: Fail timed-out single-key read on cleaned up tablet replica

Consider the following:
1) single-key read starts, blocks on replica e.g. waiting for memory.
2) the same replica is migrated away
3) single-key read expires, coordinator abandons it, releases erm.
4) migration advances to cleanup stage, barrier doesn't wait on
   timed-out read
5) compaction group of the replica is deallocated on cleanup
6) that single-key resumes, but doesn't find sstable set (post cleanup)
7) with abort-on-internal-error turned on, node crashes

It's fine for abandoned (= timed out) reads to fail, since the
coordinator is gone.
For active reads (non timed out), the barrier will wait for them
since their coordinator holds erm.
This solution consists of failing reads which underlying tablet
replica has been cleaned up, by just converting internal error
to plain exception.

Fixes #26229.

Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>

Closes scylladb/scylladb#27078

2025-11-20 11:44:03 +02:00

210 KiB

Raw Permalink Blame History

View Raw

210 KiB Raw Permalink Blame History

210 KiB

Raw Permalink Blame History