scylladb

Author	SHA1	Message	Date
Rafael Ávila de Espíndola	625080b414	Rename large_partition_handler Now that it also handles large rows, rename it to large_data_handler. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-01-28 15:03:14 -08:00
Piotr Jastrzebski	1ac7283550	Fix cross shard cf usage in streaming Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-24 18:13:30 +01:00
Duarte Nunes	04a14b27e4	Merge 'Add handling staging sstables to /upload dir' from Piotr " This series adds generating view updates from sstables added through /upload directory if their tables have accompanying materialized views. Said sstables are left in /upload directory until updates are generated from them and are treated just like staging sstables from /staging dir. If there are no views for a given tables, sstables are simply moved from /upload dir to datadir without any changes. Tests: unit (release) " * 'add_handling_staging_sstables_to_upload_dir_5' of https://github.com/psarna/scylla: all: rename view_update_from_staging_generator distributed_loader: fix indentation service: add generating view updates from uploaded sstables init: pass view update generator to storage service sstables: treat sstables in upload dir as needing view build sstables,table: rename is_staging to requires_view_building distributed_loader: use proper directory for opening SSTable db,view: make throttling optional for view_update_generator	2019-01-15 18:19:27 +00:00
Piotr Sarna	0eb703dc80	all: rename view_update_from_staging_generator The new name, view_update_generator, is both more concise and correct, since we now generate from directories other than "/staging".	2019-01-15 17:31:47 +01:00
Piotr Sarna	7e61f02365	streaming: add phasing incoming streams Incoming streams are now phased, which can be leveraged later to wait for all ongoing streams to finish. Refs #4032	2019-01-15 10:28:15 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	f02c64cadf	streaming: stream_session: remove include of db/view/view_update_from_staging_generator.hh This header, which is easily replaced with a forward declaration, introduces a dependency on database.hh everywhere. Remove it and scatter includes of database.hh in source files that really need it.	2019-01-05 17:33:25 +02:00
Piotr Sarna	9d46715613	streaming,view: move view update checks to separate file Checking if view update path should be used for sstables is going to be reused in row level repair code, so relevant functions are moved to a separate header.	2019-01-03 08:31:40 +01:00
Duarte Nunes	bab7e6877b	streaming/stream_session: Only stage sstables for tables with views When streaming, sstables for which we need to generate view updates are placed in a special staging directory. However, we only need to do this for tables that actually have views. Refs #4021 Message-Id: <20181227215412.5632-1-duarte@scylladb.com>	2018-12-28 18:32:24 +02:00
Duarte Nunes	66e45469b2	streaming/stream_session: Don't use table reference across defer points When creating a sstable from which to generate view updates, we held on to a table reference across defer points. In case there's a concurrent schema drop, the table object might be destroyed and we will incur in a use-after-free. Solve this by holding on to a shared pointer and pinning the table object. Refs #4021 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20181227105921.3601-1-duarte@scylladb.com>	2018-12-27 13:05:46 +02:00
Asias He	0067d32b47	flat_mutation_reader: Add make_generating_reader Move generating_reader from stream_session.cc to flat_mutation_reader.cc. It will be used by repair code soon. Also introduce a helper make_generating_reader to hide the implementation of generating_reader.	2018-12-12 16:49:01 +08:00
Piotr Sarna	8e6021dfa1	streaming: don't check view building of system tables System tables will never need view building, and, what's more, are actually used in the process of view build checking. So, checking whether system tables need a view update path is simplified to returning 'false'.	2018-11-28 09:21:56 +01:00
Piotr Sarna	6ad2c39f88	streaming: remove unused sstable_is_staging bool class sstable_is_staging bool class is not used anywhere in the code anymore, so it's removed.	2018-11-28 09:21:56 +01:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Piotr Sarna	32c0fe8df2	streaming: stream tables with views through staging sstables While streaming to a table with paired views, staging sstables are used. After the table is written to disk, it's used to generate all required view updates. It's also resistant to restarts as it's stored on a hard drive in staging/ directory. Refs #3275	2018-11-13 15:04:42 +01:00
Piotr Sarna	dc74887ff3	streaming: add system distributed keyspace ref to streaming Streaming code needs system distributed keyspace to check if streamed sstables should be staging, so a proper reference is added.	2018-11-13 15:01:53 +01:00
Piotr Sarna	7ef5e1b685	streaming: add view update generator reference to streaming Streaming code may need view update generator service to generate and send view updates, so a proper reference is added.	2018-11-13 15:01:53 +01:00
Avi Kivity	fd513c42ad	streaming: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Asias He	7f826d3343	streaming: Expose reason for streaming On receiving a mutation_fragment or a mutation triggered by a streaming operation, we pass an enum stream_reason to notify the receiver what the streaming is used for. So the receiver can decide further operation, e.g., send view updates, beyond applying the streaming data on disk. Fixes #3276 Message-Id: <f15ebcdee25e87a033dcdd066770114a499881c0.1539498866.git.asias@scylladb.com>	2018-10-15 22:03:28 +01:00
Gleb Natapov	ceb361544a	stream_session: remove unused capture 'Consumer function' parameter for distribute_reader_and_consume_on_shards() captures schema_ptr (which is a seastar::shared_ptr), but the function is later copied on another shard at which point schema_ptr is also copied and its counter is incremented by the wrong shard. The capture is not even used, so lets just drop it. Fixes #3838 Message-Id: <20181011075500.GN14449@scylladb.com>	2018-10-11 11:10:58 +03:00
Asias He	de05df216f	streaming: Use rpc::source on the shard where it is created rpc::source can only work on the shard where it is created, thus we can not apply the load distribution optimization. Disable it and let the multishard_writer to forward the data to the correct shard. Fixes #3731. Message-Id: <0d1b4d3e7adcfdc4e392b83aeb2544b95f3f46dd.1537430162.git.asias@scylladb.com>	2018-09-20 12:29:24 +03:00
Asias He	d47d46e1a8	streaming: Use streaming_write_priority for the sstable writer Use the streaming io priority otherwise it uses the default io priority. Message-Id: <e1836a9a93e7204d4bc9bba9c841d57c8b24aff8.1533715438.git.asias@scylladb.com>	2018-08-08 11:08:06 +03:00
Asias He	deff5e7d60	streaming: Add rpc streaming support This patch changes scylla streaming to use the recently added rpc streaming feature provided by seastar to send mutation fragments for scylla streaming instead of the rpc verbs. It also changes the receiver to write to the sstable file directly, skipping writing to memtable.	2018-07-13 08:36:47 +08:00
Asias He	e20038eb84	streaming: Handle stream_mutation rpc handler on all shards In streaming, the sender sends the mutations on all the local shards in parallel, it is possible that the receiver handle more than one such connection on the same shard. It is determined by where the tcp connection goes. Current rpc ignores the dest shard id when sending the rpc message. For instance, say node1 has 2 shards, node2 has 2 shards. Currently, we can end up with like this: Node 1 shard 0 -> Node 2 shard 1 Node 1 shard 1 -> Node 2 shard 1 It is better if we do: Node 1 shard 0 -> Node 2 shard 0 Node 1 shard 1 -> Node 2 shard 1 This patch solves this problem by let the handler always handle on shard = src_cpu_id % smp::count. If sender and receiver have the same shard config, it is completely distributed the work evenly. If sender and receiver do not have the same shard config, it is unavoidable some of the shard will do more work than the others. Tests: dtest update_cluster_layout_tests.py Message-Id: <911827bcf67459a07ec92623a9ed4c4fbba195ca.1524622375.git.asias@scylladb.com>	2018-05-19 21:08:25 +03:00
Asias He	774307b3a7	streaming: Do send failed message for uninitialized session The uninitialized session has no peer associated with it yet. There is no point sending the failed message when abort the session. Sending the failed message in this case will send to a peer with uninitialized dst_cpu_id which will casue the receiver to pass a bogus shard id to smp::submit_to which cases segfault. In addition, to be safe, initialize the dst_cpu_id to zero. So that uninitialized session will send message to shard zero instead of random bogus shard id. Fixes the segfault issue found by repair_additional_test.py:RepairAdditionalTest.repair_abort_test Fixes #3115 Message-Id: <9f0f7b44c7d6d8f5c60d6293ab2435dadc3496a9.1515380325.git.asias@scylladb.com>	2018-01-08 15:04:06 +02:00
Raphael S. Carvalho	95d1995876	fix compilation of stream_session.cc stream_session.cc:417:62: error: cannot call member function ‘utils::UUID streaming::stream_session::plan_id()’ without object sslog.warn("[Stream #{}] Failed to send: {}", plan_id(), ep); Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20171214022621.19442-1-raphaelsc@scylladb.com>	2017-12-14 10:57:33 +01:00
Asias He	a9dab60b6c	streaming: One cf per time on sender In the case there are large number of column families, the sender will send all the column families in parallel. We allow 20% of shard memory for streaming on the receiver, so each column family will have 1/N, N is the number of in-flight column families, memory for memtable. Large N causes a lot of small sstables to be generated. It is possible there are multiple senders to a single receiver, e.g., when a new node joins the cluster, the maximum in-flight column families is number of peer node. The column families are sent in the order of cf_id. It is not guaranteed that all peers has the same speed so they are sending the same cf_id at the same time, though. We still have chance some of the peers are sending the same cf_id. Fixes #3065 Message-Id: <46961463c2a5e4f1faff232294dc485ac4f1a04e.1513159678.git.asias@scylladb.com>	2017-12-13 12:32:41 +02:00
Asias He	fad34801bf	streaming: Introduce streaming::abort() It will be used soon by stream_plan::abort() to abort a stream session.	2017-08-30 15:19:50 +08:00
Asias He	7fba7cca01	streaming: Make stream_manager and coordinator message debug level When we abort a session, it is possible that: node 1 abort the session by user request node 1 send the complete_message to node 2 node 2 abort the session upon receive of the complete_message node 1 sends one more stream message to node 2 and the stream_manager for the session can not be found. It is fine for node 2 to not able to find the stream_manager, make the log on node 2 less verbose to confuse user less.	2017-08-30 15:19:50 +08:00
Asias He	be573bcafb	streaming: Check if _stream_result is valid If on_error() was called before init() was executed, the _stream_result can be invalid.	2017-08-30 15:19:44 +08:00
Asias He	8a3f6acdd2	streaming: Log peer address in on_error	2017-08-30 15:18:27 +08:00
Asias He	eace5fc6e8	streaming: Introduce received_failed_complete_message It is the handler for the failed complete message. Add a flag to remember if we received a such message from peer, if so, do not send back the failed complete message back to the peer when running close_session with failed status.	2017-08-30 15:18:27 +08:00
Duarte Nunes	85e85ec72e	Don't catch polymorphic exceptions by value It makes gcc a very sad compiler. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20170726172053.5639-2-duarte@scylladb.com>	2017-07-27 09:39:58 +03:00
Asias He	aa87429e67	streaming: Send complete message with failed flag when session is failed To notify peer node the session is failed.	2017-07-19 10:11:05 +08:00
Asias He	03b838705c	streaming: Handle failed flag in complete message Fail the current session if the failed flag is on in the complete message handler.	2017-07-19 10:11:05 +08:00
Asias He	12d18cfab4	streaming: Do not fail the session when failed to send complete message Since the complete message is not mandatary, no point to fail the session in case failed to send the complete message.	2017-07-19 10:11:04 +08:00
Asias He	ca5248cd58	streaming: Introduce send_failed_complete_message Currently, send_complete_message is not used. We will use it shortly in case the local session is failed. Send a complete message with failed flag to notify peer node that the session is failed so that peer can close the session. This can speed up the closing of failed session. Also rename it to send_failed_complete_message.	2017-07-19 10:11:04 +08:00
Asias He	f21cb75cdb	streaming: Do not send complete message when session is successful The complete_message is not needed and the handler of this rpc message does nothing but returns a ready future. The patch to remove it did not make into the Scylla 1.0 release so it was left there.	2017-07-18 15:29:42 +08:00
Asias He	0ba4e73068	streaming: Introduce the failed parameter for complete message Use this flag to notify the peer that the session is failed so that the peer can close the failed session more quickly. The flag is used as a rpc::optional so it is compatible use old version of the verb.	2017-07-18 11:24:31 +08:00
Asias He	7599c1524d	streaming: Remove unused session_failed function It is never used. Get rid of it.	2017-07-18 11:22:09 +08:00
Asias He	caad7ced23	streaming: Less verbose in logging Now, we will have large number of small streaming. Make the not very important logging message debug level.	2017-07-18 11:17:09 +08:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Asias He	937f28d2f1	Convert to use dht::partition_range_vector and dht::token_range_vector	2016-12-19 14:08:50 +08:00
Asias He	e5485f3ea6	Get rid of query::partition_range Use dht::partition_range instead	2016-12-19 08:09:25 +08:00
Asias He	d1178fa299	Convert to use dht::token_range	2016-12-19 08:04:29 +08:00
Tomasz Grabiec	c1a7e2090e	Revert "database: change find_column_families signature so it returns a lw_shared_ptr" This reverts commit `f3528ede65`.	2016-11-04 10:48:21 +01:00
Glauber Costa	f3528ede65	database: change find_column_families signature so it returns a lw_shared_ptr There are places in which we need to use the column family object many times, with deferring points in between. Because the column family may have been destroyed in the deferring point, we need to go and find it again. If we use lw_shared_ptr, however, we'll be able to at least guarantee that the object will be alive. Some users will still need to check, if they want to guarantee that the column family wasn't removed. But others that only need to make sure we don't access an invalid object will be able to avoid the cost of re-finding it just fine. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <722bf49e158da77ff509372c2034e5707706e5bf.1478111467.git.glauber@scylladb.com>	2016-11-03 13:27:31 +01:00
Avi Kivity	a35136533d	Convert ring_position and token ranges to be nonwrapping Wrapping ranges are a pain, so we are moving wrap handling to the edges. Since cql can't generate wrapping ranges, this means thrift and the ring maintenance code; also range->ring transformations need to merge the first and last ranges. Message-Id: <1478105905-31613-1-git-send-email-avi@scylladb.com>	2016-11-02 21:04:11 +02:00
Asias He	a0020fdad2	stream_session: Allow adding ranges to a cf more than once Append the ranges to a stream_transfer_task if the cf is already added to _transfers in add_transfer_ranges.	2016-09-26 06:28:50 +08:00
Duarte Nunes	aaa76d58ba	query: Move to_partition_range to dht namespace This patch moves to_partition_range, from the query namespace to the dht namespace, where it is a more natural fit. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1468498060-19251-1-git-send-email-duarte@scylladb.com>	2016-07-15 10:41:52 +02:00

1 2 3 4 5

223 Commits