scylladb

Author	SHA1	Message	Date
Pavel Emelyanov	a1ea553fe1	code: Replace distributed<> with sharded<> The latter is recommended in seastar, and the former was left as compatibility alias. Latest seastar explicitly marks it as deprecated so once the submodule is updated, compilation logs will explode. Most of the patch is generated with for f in $(git grep -l '\<distributed<[A-Za-z0-9:_]>') ; do sed -e 's/\<distributed<$[A-Za-z0-9:_]$>/sharded<\1>/g' -i $f; done for f in $(git grep -l distributed.hh); do sed -e 's/distributed.hh/sharded.hh/' -i $f ; done and a small manual change in test/perf/perf.hh Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#26136	2025-09-19 12:22:51 +02:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Gleb Natapov	41a57ed2e8	streaming: move streaming code to use host ids instead of host ips The patch is rather large, but it is a straightforward conversion from one type to another.	2024-12-15 11:31:11 +02:00
Kefu Chai	f86a5ae87a	streaming: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16947	2024-01-23 19:38:30 +02:00
Tomasz Grabiec	fd3c089ccc	service: range_streamer: Propagate topology_guard to receivers	2023-12-06 18:36:16 +01:00
Benny Halevy	a1acf6854b	everywhere: reduce dependencies on i_partitioner.hh Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-11-05 20:47:44 +02:00
Benny Halevy	314e45d957	streaming: define plan_id as a strong tagged_uuid type Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-22 19:45:30 +03:00
Benny Halevy	257d74bb34	schema, everywhere: define and use table_id as a strong type Define table_id as a distinct utils::tagged_uuid modeled after raft tagged_id, so it can be differentiated from other uuid-class types, in particular from table_schema_version. Fixes #11207 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-08-08 08:09:41 +03:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	bbad8f4677	replica: move ::database, ::keyspace, and ::table to replica namespace Move replica-oriented classes to the replica namespace. The main classes moved are ::database, ::keyspace, and ::table, but a few ancillary classes are also moved. There are certainly classes that should be moved but aren't (like distributed_loader) but we have to start somewhere. References are adjusted treewide. In many cases, it is obvious that a call site should not access the replica (but the data_dictionary instead), but that is left for separate work. scylla-gdb.py is adjusted to look for both the new and old names.	2022-01-07 12:04:38 +02:00
Pavel Emelyanov	a3b4d4d3cf	stream_session: Use manager reference from result-future When the stream_session initializes it's being equipped with the shared-pointer on the stream_result_future very early. In all the places where stream_session needs the manager this pointer is alive and session get get manager from it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	db33607eb2	stream_session: Keep stream_manager reference The manager is needed to get messaging service and database from. Actually, the database can be pushed though arguments in all the places, so effectively session only needs the messaging. However, the stream-task's need the manager badly and there's no other place to get it from other than the session. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	f2ae080c63	stream_session: Remove unused default contructor Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	73e10c7aed	streaming: Move start/stop onto common rails In case of streaming this mostly means dropping the global init/uninit calls and replacing them with sharded<stream_manager> instance. It's still global, but it's being fixed atm. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	ba298bd5c6	streaming: Remove global dependency pointers Now they are not needed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	e448774588	streaming: Move rpc verbs reg/unreg into manager As a part of streaming start/stop unification. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Emelyanov	423d0baa65	streaming: Get migration_manager shared_ptr in messaging Same as in previous patch, but for streaming code. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-04-23 17:13:24 +03:00
Avi Kivity	82c76832df	treewide: don't include "db/system_distributed_keyspace.hh" from headers This just causes unneeded and slower recompliations. Instead replace with forward declarations, or includes of smaller headers that were incidentally brought in by the one removed. The .cc files that really need it gain the include, but they are few. Ref #1. Closes #8403	2021-04-04 14:00:26 +03:00
Pavel Emelyanov	a6888e3ce3	streaming: Keep reference on messaging Streaming uses messaging, init it with itw own reference. Nowadays the whole streaming subsystem uses global static references on the needed services. This is not nice, but still better than just using code-wide globals, so treat the messaging service here the same way. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:52 +03:00
Pavel Emelyanov	5060063cd6	messaging: Add missing per-service unregistering methods 5 services register handlers in messaging, but not all of them have clear unregistration methods. Summary: migration_manager: everything is in place, no changes gossiper: ditto proxy: some verbs unregistration is missing repair: no unregistration at all streaming: ditto This patch adds the needed unregistration methods. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:34:00 +03:00
Pavel Emelyanov	b11cf6e950	cql3/query_processor.hh: Debloat from other headers This gives ~30% less (251 jobs -> 181 jobs) recompile when touching it Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20200212225828.3374-1-xemul@scylladb.com>	2020-02-16 11:22:30 +02:00
Pavel Solodovnikov	2f442f28af	treewide: add const qualifiers throughout the code base	2019-11-26 02:24:49 +03:00
Asias He	49a73aa2fc	streaming: Move stream_mutation_fragments_cmd to a new file (#4812 ) Avoid including the lengthy stream_session.hh in messaging_service. More importantly, fix the build because currently messaging_service.cc and messaging_service.hh does not include stream_mutation_fragments_cmd. I am not sure why it builds on my machine. Spotted this when backporting the "streaming: Send error code from the sender to receiver" to 3.0 branch. Refs: #4789	2019-08-07 14:59:46 +02:00
Asias He	bac987e32a	streaming: Send error code from the sender to receiver In case of error on the sender side, the sender does not propagate the error to the receiver. The sender will close the stream. As a result, the receiver will get nullopt from the source in get_next_mutation_fragment and pass mutation_fragment_opt with no value to the generating_reader. In turn, the generating_reader generates end of stream. However, the last element that the generating_reader has generated can be any type of mutation_fragment. This makes the sstable that consumes the generating_reader violates the mutation_fragment stream rule. To fix, we need to propagate the error. However RPC streaming does not support propagate the error in the framework. User has to send an error code explicitly. Fixes: #4789	2019-08-06 16:54:56 +02:00
Asias He	b8158dd65d	streaming: Get rid of the keep alive timer in streaming There is no guarantee that rpc streaming makes progress in some time period. Remove the keep alive timer in streaming to avoid killing the session when the rpc streaming is just slow. The keep alive timer is used to close the session in the following case: n2 (the rpc streaming sender) streams to n1 (the rpc streaming receiver) kill -9 n2 We need this because we do not kill the session when gossip think a node is down, because we think the node down might only be temporary and it is a waste to drop the previous work that has done especially when the stream session takes long time. Since in range_streamer, we do not stream all data in a single stream session, we stream 10% of the data per time, and we have retry logic. I think it is fine to kill a stream session when gossip thinks a node is down. This patch changes to close all stream session with the node that gossip think it is down. Message-Id: <bdbb9486a533eee25fcaf4a23a946629ba946537.1551773823.git.asias@scylladb.com>	2019-03-12 12:20:28 +01:00
Piotr Sarna	0eb703dc80	all: rename view_update_from_staging_generator The new name, view_update_generator, is both more concise and correct, since we now generate from directories other than "/staging".	2019-01-15 17:31:47 +01:00
Avi Kivity	f02c64cadf	streaming: stream_session: remove include of db/view/view_update_from_staging_generator.hh This header, which is easily replaced with a forward declaration, introduces a dependency on database.hh everywhere. Remove it and scatter includes of database.hh in source files that really need it.	2019-01-05 17:33:25 +02:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Piotr Sarna	dc74887ff3	streaming: add system distributed keyspace ref to streaming Streaming code needs system distributed keyspace to check if streamed sstables should be staging, so a proper reference is added.	2018-11-13 15:01:53 +01:00
Piotr Sarna	7ef5e1b685	streaming: add view update generator reference to streaming Streaming code may need view update generator service to generate and send view updates, so a proper reference is added.	2018-11-13 15:01:53 +01:00
Asias He	7f826d3343	streaming: Expose reason for streaming On receiving a mutation_fragment or a mutation triggered by a streaming operation, we pass an enum stream_reason to notify the receiver what the streaming is used for. So the receiver can decide further operation, e.g., send view updates, beyond applying the streaming data on disk. Fixes #3276 Message-Id: <f15ebcdee25e87a033dcdd066770114a499881c0.1539498866.git.asias@scylladb.com>	2018-10-15 22:03:28 +01:00
Asias He	ad7b132188	Revert "streaming: Do not abort session too early in idle detection" This reverts commit `f792c78c96`. With the "Use range_streamer everywhere" (`7217b7ab36`) series, all the user of streaming now do streaming with relative small ranges and can retry streaming at higher level. Reduce the time-to-recover from 5 hours to 10 minutes per stream session. Even if the 10 minutes idle detection might cause higher false positive, it is fine, since we can retry the "small" stream session anyway. In the long term, we should replace the whole idle detection logic with whenever the stream initiator goes away, the stream slave goes away. Message-Id: <75f308baf25a520d42d884c7ef36f1aecb8a64b0.1520992219.git.asias@scylladb.com>	2018-03-14 10:11:00 +02:00
Asias He	774307b3a7	streaming: Do send failed message for uninitialized session The uninitialized session has no peer associated with it yet. There is no point sending the failed message when abort the session. Sending the failed message in this case will send to a peer with uninitialized dst_cpu_id which will casue the receiver to pass a bogus shard id to smp::submit_to which cases segfault. In addition, to be safe, initialize the dst_cpu_id to zero. So that uninitialized session will send message to shard zero instead of random bogus shard id. Fixes the segfault issue found by repair_additional_test.py:RepairAdditionalTest.repair_abort_test Fixes #3115 Message-Id: <9f0f7b44c7d6d8f5c60d6293ab2435dadc3496a9.1515380325.git.asias@scylladb.com>	2018-01-08 15:04:06 +02:00
Asias He	a9dab60b6c	streaming: One cf per time on sender In the case there are large number of column families, the sender will send all the column families in parallel. We allow 20% of shard memory for streaming on the receiver, so each column family will have 1/N, N is the number of in-flight column families, memory for memtable. Large N causes a lot of small sstables to be generated. It is possible there are multiple senders to a single receiver, e.g., when a new node joins the cluster, the maximum in-flight column families is number of peer node. The column families are sent in the order of cf_id. It is not guaranteed that all peers has the same speed so they are sending the same cf_id at the same time, though. We still have chance some of the peers are sending the same cf_id. Fixes #3065 Message-Id: <46961463c2a5e4f1faff232294dc485ac4f1a04e.1513159678.git.asias@scylladb.com>	2017-12-13 12:32:41 +02:00
Avi Kivity	85a6a2b3cb	streaming: remove unneeded includes	2017-09-12 10:43:39 +03:00
Asias He	fad34801bf	streaming: Introduce streaming::abort() It will be used soon by stream_plan::abort() to abort a stream session.	2017-08-30 15:19:50 +08:00
Asias He	eace5fc6e8	streaming: Introduce received_failed_complete_message It is the handler for the failed complete message. Add a flag to remember if we received a such message from peer, if so, do not send back the failed complete message back to the peer when running close_session with failed status.	2017-08-30 15:18:27 +08:00
Asias He	ca5248cd58	streaming: Introduce send_failed_complete_message Currently, send_complete_message is not used. We will use it shortly in case the local session is failed. Send a complete message with failed flag to notify peer node that the session is failed so that peer can close the session. This can speed up the closing of failed session. Also rename it to send_failed_complete_message.	2017-07-19 10:11:04 +08:00
Asias He	7599c1524d	streaming: Remove unused session_failed function It is never used. Get rid of it.	2017-07-18 11:22:09 +08:00
Asias He	f792c78c96	streaming: Do not abort session too early in idle detection Streaming ususally takes long time to complete. Abort it on false positive idle detection can be very wasteful. Increase the abort timeout from 10 minutes to a very large timeout, 300 minutes. The real idle session will be aborted eventually if other mechanisms, e.g., streaming manager has gossip callback for on_remove and on_restart event to abort, do not abort the session. Fixes #2197 Message-Id: <57f81bfebfdc6f42164de5a84733097c001b394e.1494552921.git.asias@scylladb.com>	2017-05-24 12:29:50 +03:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Asias He	937f28d2f1	Convert to use dht::partition_range_vector and dht::token_range_vector	2016-12-19 14:08:50 +08:00
Asias He	d1178fa299	Convert to use dht::token_range	2016-12-19 08:04:29 +08:00
Tomasz Grabiec	c1a7e2090e	Revert "database: change find_column_families signature so it returns a lw_shared_ptr" This reverts commit `f3528ede65`.	2016-11-04 10:48:21 +01:00
Glauber Costa	f3528ede65	database: change find_column_families signature so it returns a lw_shared_ptr There are places in which we need to use the column family object many times, with deferring points in between. Because the column family may have been destroyed in the deferring point, we need to go and find it again. If we use lw_shared_ptr, however, we'll be able to at least guarantee that the object will be alive. Some users will still need to check, if they want to guarantee that the column family wasn't removed. But others that only need to make sure we don't access an invalid object will be able to avoid the cost of re-finding it just fine. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <722bf49e158da77ff509372c2034e5707706e5bf.1478111467.git.glauber@scylladb.com>	2016-11-03 13:27:31 +01:00
Avi Kivity	a35136533d	Convert ring_position and token ranges to be nonwrapping Wrapping ranges are a pain, so we are moving wrap handling to the edges. Since cql can't generate wrapping ranges, this means thrift and the ring maintenance code; also range->ring transformations need to merge the first and last ranges. Message-Id: <1478105905-31613-1-git-send-email-avi@scylladb.com>	2016-11-02 21:04:11 +02:00
Avi Kivity	c94fb1bf12	build: reduce inclusions of messaging_service.hh Remove inclusions from header files (primary offender is fb_utilities.hh) and introduce new messaging_service_fwd.hh to reduce rebuilds when the messaging service changes. Message-Id: <1475584615-22836-1-git-send-email-avi@scylladb.com>	2016-10-05 11:46:49 +03:00
Paweł Dziepak	f2ae31711e	streaming: inform CF when streaming fails Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-07 12:18:35 +01:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00

1 2 3

131 Commits