scylladb

Author	SHA1	Message	Date
Asias He	6f04de3efd	streaming: Fail stream plan on stream_mutation_fragments handler in case of error The following is observed in pytest: 1) node1, stream master, tried to pull data from node3 2) node3, stream follower, found node1 restarted 3) node3 killed the rpc stream 4) node1 did not get the stream session failure message from node3. This failure message was supposed to kill the stream plan on node1. That's the reason node1 failed the stream session much later at "2024-08-19 21:07:45,539". Note, node3 failed the stream on its side, so it should have sent the stream session failure message. ``` $ cat node1.log |grep f890bea0-5e68-11ef-99ae-e5bca04385fc INFO 2024-08-19 20:24:01,162 [shard 0:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Executing streaming plan for Tablet migration-ks-index-0 with peers={127.0.34.3}, master ERROR 2024-08-19 20:24:01,190 [shard 1:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Failed to handle STREAM_MUTATION_FRAGMENTS (receive and distribute phase) for ks=ks, cf=cf, peer=127.0.34.3: seastar::nested_exception: seastar::rpc::stream_closed (rpc stream was closed by peer) (while cleaning up after seastar::rpc::stream_closed (rpc stream was closed by peer)) WARN 2024-08-19 21:07:45,539 [shard 0:main] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Streaming plan for Tablet migration-ks-index-0 failed, peers={127.0.34.3}, tx=0 KiB, 0.00 KiB/s, rx=484 KiB, 0.18 KiB/s $ cat node3.log |grep f890bea0-5e68-11ef-99ae-e5bca04385fc INFO 2024-08-19 20:24:01,163 [shard 0:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Executing streaming plan for Tablet migration-ks-index-0 with peers=127.0.34.1, slave INFO 2024-08-19 20:24:01,164 [shard 1:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Start sending ks=ks, cf=cf, estimated_partitions=2560, with new rpc streaming WARN 2024-08-19 20:24:01,187 [shard 0: gms] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Streaming plan for Tablet migration-ks-index-0 failed, peers={127.0.34.1}, tx=633 KiB, 26506.81 KiB/s, rx=0 KiB, 0.00 KiB/s WARN 2024-08-19 20:24:01,188 [shard 0:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] stream_transfer_task: Fail to send to 127.0.34.1:0: seastar::rpc::stream_closed (rpc stream was closed by peer) WARN 2024-08-19 20:24:01,189 [shard 0:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Failed to send: seastar::rpc::stream_closed (rpc stream was closed by peer) WARN 2024-08-19 20:24:01,189 [shard 0:strm] stream_session - [Stream #f890bea0-5e68-11ef-99ae-e5bca04385fc] Streaming error occurred, peer=127.0.34.1 ``` To be safe in case the stream fail message is not received, node1 could fail the stream plan as soon as the rpc stream is aborted in the stream_mutation_fragments handler. Fixes #20227 Closes scylladb/scylladb#21960	2025-02-10 16:32:12 +01:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Gleb Natapov	41a57ed2e8	streaming: move streaming code to use host ids instead of host ips The patch is rather large, but it is a straightforward conversion from one type to another.	2024-12-15 11:31:11 +02:00
Tomasz Grabiec	fd3c089ccc	service: range_streamer: Propagate topology_guard to receivers	2023-12-06 18:36:16 +01:00
Benny Halevy	12eb3d210f	streaming: stream_plan: transfer_ranges: move token ranges towards add_transfer_ranges Rather than copying the ranges vector. Note that add_transfer_ranges itself cannot simply move the ranges since it copies them for multiple tables. While at it, move also the keyspace and column_family strings. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-02-28 17:03:51 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Pavel Emelyanov	db33607eb2	stream_session: Keep stream_manager reference The manager is needed to get messaging service and database from. Actually, the database can be pushed though arguments in all the places, so effectively session only needs the messaging. However, the stream-task's need the manager badly and there's no other place to get it from other than the session. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	5b748a72de	stream_result_future: Keep stream_manager reference The stream_result_future needs manager to register on it and to unregister from it. Also the result-future is referenced from stream_session that also needs the manager (see next patches). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Piotr Sarna	bd2d48e99c	streaming: make stream_plan::abort noexcept Aborting a stream plan is used in deinitialization code ran in noexcept environment, so it should be noexcept itself. Tested on a not-merged-yet Seastar patch with hardened noexcept checks for abort_source. Message-Id: <6eada033bb394d725b83a7e0f92381cb792ef6a1.1596446857.git.sarna@scylladb.com>	2020-08-03 14:00:19 +03:00
Avi Kivity	fd513c42ad	streaming: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Asias He	7f826d3343	streaming: Expose reason for streaming On receiving a mutation_fragment or a mutation triggered by a streaming operation, we pass an enum stream_reason to notify the receiver what the streaming is used for. So the receiver can decide further operation, e.g., send view updates, beyond applying the streaming data on disk. Fixes #3276 Message-Id: <f15ebcdee25e87a033dcdd066770114a499881c0.1539498866.git.asias@scylladb.com>	2018-10-15 22:03:28 +01:00
Asias He	9c8da2cc56	streaming: Add abort to stream_plan It can be used by the user of stream_plan to abort the stream sessions. Repair will be the first user when aborting the repair.	2017-08-30 15:19:51 +08:00
Asias He	937f28d2f1	Convert to use dht::partition_range_vector and dht::token_range_vector	2016-12-19 14:08:50 +08:00
Asias He	d1178fa299	Convert to use dht::token_range	2016-12-19 08:04:29 +08:00
Avi Kivity	a35136533d	Convert ring_position and token ranges to be nonwrapping Wrapping ranges are a pain, so we are moving wrap handling to the edges. Since cql can't generate wrapping ranges, this means thrift and the ring maintenance code; also range->ring transformations need to merge the first and last ranges. Message-Id: <1478105905-31613-1-git-send-email-avi@scylladb.com>	2016-11-02 21:04:11 +02:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Asias He	28ccd866e2	streaming: Move ranges in stream_plan The ranges are not used afterwards. We can move instead of copy. Message-Id: <1458540564-34277-1-git-send-email-asias@scylladb.com>	2016-03-21 10:10:09 +01:00
Asias He	ed3da7b04c	streaming: Drop flush_tables option for add_transfer_ranges We do not stream sstable files. No need to flush it.	2016-01-29 16:31:07 +08:00
Asias He	face74a8f2	streaming: Rename stream_result_future::init to ::init_sending_side So we have: - init_sending_side called when the node initiates a stream_session - init_receiving_side called when the node is a receiver of a stream_session initiated by a peer	2016-01-25 11:38:13 +08:00
Asias He	dc94c5e42e	streaming: Rename get_or_create_next_session to get_or_create_session There is only one session for each peer in stream_coordinator.	2016-01-25 11:38:13 +08:00
Asias He	bdd6a69af7	streaming: Drop unused parameters - int connections_per_host Scylla does not create connections per stream_session, instead it uses rpc, thus connections_per_host is not relevant to scylla. - bool keep_ss_table_level - int repaired_at Scylla does not stream sstable files. They are not relevant to scylla.	2016-01-25 11:38:13 +08:00
Asias He	89b79d44de	streaming: Get rid of the _connecting_ parameter messaging_service will use private ip address automatically to connect a peer node if possible. There is no need for the upper level like streaming to worry about it. Drop it simplifies things a bit.	2015-12-31 11:25:08 +01:00
Asias He	d51227ad9c	streaming: Remove transfer_files It is never used.	2015-12-21 14:42:47 +08:00
Asias He	cadf8b1484	streaming: Handle stream_plan with no range added If no ranges for neither sending nor receiving are added for the stream plan, the stream plan is empty. Return a ready future immediately.	2015-11-10 15:39:34 +08:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Asias He	59cae82470	streaming: Make stream_plan::execute return a future Returns a ready future if the stream_plan completes successfully, or a failed future otherwise.	2015-07-31 16:27:55 +08:00
Asias He	d1720ffed1	streaming: Hold a shared_ptr inside stream_plan	2015-07-14 20:41:14 +08:00
Asias He	14ae9e66ae	streaming: Use shared_ptr to track back to stream_session I tried our lw_shared_ptr, the compiler complained endless usage of incomplete type stream_session. I can not include stream_session.hh everywhere due to circular dependency. For now, I'm using std::shared_ptr which works fine.	2015-07-14 20:41:14 +08:00
Asias He	ad3692f666	streaming: Implement stream_session::add_transfer_ranges Given keyspace names, ranges and column_families names, figure out mutation_readers to transfer.	2015-07-09 15:52:27 +08:00
Asias He	7ec2ee6b86	streaming: Add stream_plan::listeners	2015-06-30 16:55:30 +08:00
Asias He	6f0994349a	streaming: Add stream_plan::execute	2015-06-30 16:47:25 +08:00
Asias He	abf24b3bfa	streaming: Add flush_before_transfer to stream_plan	2015-06-30 15:38:18 +08:00
Asias He	794c65e58c	streaming: Complete transfer_ranges and request_ranges in stream_plan	2015-06-26 08:31:28 +08:00
Asias He	a4235ebc13	streaming: Implement transfer_files	2015-06-26 08:31:28 +08:00
Asias He	9e8512a783	streaming: Convert StreamPlan.java to C++	2015-06-18 23:02:31 +08:00

36 Commits