scylladb

Author	SHA1	Message	Date
Asias He	b89ced4635	streaming: Do not open rpc stream connection if reader has no data We can use the reader::peek() to check if the reader contains any data. If not, do not open the rpc stream connection. It helps to reduce the port usage. Refs: #4943	2019-10-08 10:31:02 +02:00
Botond Dénes	fddd9a88dd	treewide: silence discarded future warnings for legit discards This patch silences those future discard warnings where it is clear that discarding the future was actually the intent of the original author, and they did the necessary precautions (handling errors). The patch also adds some trivial error handling (logging the error) in some places, which were lacking this, but otherwise look ok. No functional changes.	2019-08-26 18:54:44 +03:00
Asias He	49a73aa2fc	streaming: Move stream_mutation_fragments_cmd to a new file (#4812 ) Avoid including the lengthy stream_session.hh in messaging_service. More importantly, fix the build because currently messaging_service.cc and messaging_service.hh does not include stream_mutation_fragments_cmd. I am not sure why it builds on my machine. Spotted this when backporting the "streaming: Send error code from the sender to receiver" to 3.0 branch. Refs: #4789	2019-08-07 14:59:46 +02:00
Asias He	288371ce75	streaming: Do not call rpc stream flush in send_mutation_fragments The stream close() guarantees the data sent will be flushed. No need to call the stream flush() since the stream is not reused. Follow up fix for commit `bac987e32a` (streaming: Send error code from the sender to receiver). Refs #4789	2019-08-07 14:31:17 +02:00
Asias He	bac987e32a	streaming: Send error code from the sender to receiver In case of error on the sender side, the sender does not propagate the error to the receiver. The sender will close the stream. As a result, the receiver will get nullopt from the source in get_next_mutation_fragment and pass mutation_fragment_opt with no value to the generating_reader. In turn, the generating_reader generates end of stream. However, the last element that the generating_reader has generated can be any type of mutation_fragment. This makes the sstable that consumes the generating_reader violates the mutation_fragment stream rule. To fix, we need to propagate the error. However RPC streaming does not support propagate the error in the framework. User has to send an error code explicitly. Fixes: #4789	2019-08-06 16:54:56 +02:00
Asias He	64a4c0ede2	streaming: Do not open rpc stream connection if ranges are not relevant to a shard Given a list of ranges to stream, stream_transfer_task will create an reader with the ranges and create a rpc stream connection on all the shards. When user provides ranges to repair with -st -et options, e.g., using scylla-manger, such ranges can belong to only one shard, repair will pass such ranges to streaming. As a result, only one shard will have data to send while the rpc stream connections are created on all the shards, which can cause the kernel run out of ports in some systems. To mitigate the problem, do not open the connection if the ranges do not belong to the shard at all. Refs: #4708	2019-07-18 18:31:21 +03:00
Asias He	b8158dd65d	streaming: Get rid of the keep alive timer in streaming There is no guarantee that rpc streaming makes progress in some time period. Remove the keep alive timer in streaming to avoid killing the session when the rpc streaming is just slow. The keep alive timer is used to close the session in the following case: n2 (the rpc streaming sender) streams to n1 (the rpc streaming receiver) kill -9 n2 We need this because we do not kill the session when gossip think a node is down, because we think the node down might only be temporary and it is a waste to drop the previous work that has done especially when the stream session takes long time. Since in range_streamer, we do not stream all data in a single stream session, we stream 10% of the data per time, and we have retry logic. I think it is fine to kill a stream session when gossip thinks a node is down. This patch changes to close all stream session with the node that gossip think it is down. Message-Id: <bdbb9486a533eee25fcaf4a23a946629ba946537.1551773823.git.asias@scylladb.com>	2019-03-12 12:20:28 +01:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Avi Kivity	f02c64cadf	streaming: stream_session: remove include of db/view/view_update_from_staging_generator.hh This header, which is easily replaced with a forward declaration, introduces a dependency on database.hh everywhere. Remove it and scatter includes of database.hh in source files that really need it.	2019-01-05 17:33:25 +02:00
Gleb Natapov	37b4043677	streaming: always read from rpc::source until end-of-stream during mutation sending rpc::source cannot be abandoned until EOS is reached, but current code does not obey it if error code is received, it throws exception instead that aborts the reading loop. Fix it by moving exception throwing out of the loop. Fixes: #4025 Message-Id: <20181227135051.GC29458@scylladb.com>	2018-12-27 16:50:53 +02:00
Gleb Natapov	393269d34b	streaming: hold to sink while close() is running and call close on error as well Currently if something throws while streaming in mutation sending loop sink is not closed. Also when close() is running the code does not hold onto sink object. close() is async, so sink should be kept alive until it completes. The patch uses do_with() to hold onto sink while close is running and run close() on error path too. Fixes #4004. Message-Id: <20181220155931.GL3075@scylladb.com>	2018-12-20 18:03:37 +02:00
Asias He	bcba6b4f4d	streaming: Futurize estimate_partitions The loop can take a long time if the number of sstables and/or ranges are large. To fix, futurize the loop. Fixes: #4005 Message-Id: <3b05cb84f3f57cc566702142c6365a04b075018e.1545290730.git.asias@scylladb.com>	2018-12-20 12:08:03 +02:00
Avi Kivity	fd513c42ad	streaming: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Asias He	7f826d3343	streaming: Expose reason for streaming On receiving a mutation_fragment or a mutation triggered by a streaming operation, we pass an enum stream_reason to notify the receiver what the streaming is used for. So the receiver can decide further operation, e.g., send view updates, beyond applying the streaming data on disk. Fixes #3276 Message-Id: <f15ebcdee25e87a033dcdd066770114a499881c0.1539498866.git.asias@scylladb.com>	2018-10-15 22:03:28 +01:00
Botond Dénes	eb357a385d	flat_mutation_reader: make timeout opt-out rather than opt-in Currently timeout is opt-in, that is, all methods that even have it default it to `db::no_timeout`. This means that ensuring timeout is used where it should be is completely up to the author and the reviewrs of the code. As humans are notoriously prone to mistakes this has resulted in a very inconsistent usage of timeout, many clients of `flat_mutation_reader` passing the timeout only to some members and only on certain call sites. This is small wonder considering that some core operations like `operator()()` only recently received a timeout parameter and others like `peek()` didn't even have one until this patch. Both of these methods call `fill_buffer()` which potentially talks to the lower layers and is supposed to propagate the timeout. All this makes the `flat_mutation_reader`'s timeout effectively useless. To make order in this chaos make the timeout parameter a mandatory one on all `flat_mutation_reader` methods that need it. This ensures that humans now get a reminder from the compiler when they forget to pass the timeout. Clients can still opt-out from passing a timeout by passing `db::no_timeout` (the previous default value) but this will be now explicit and developers should think before typing it. There were suprisingly few core call sites to fix up. Where a timeout was available nearby I propagated it to be able to pass it to the reader, where I couldn't I passed `db::no_timeout`. Authors of the latter kind of code (view, streaming and repair are some of the notable examples) should maybe consider propagating down a timeout if needed. In the test code (the wast majority of the changes) I just used `db::no_timeout` everywhere. Tests: unit(release, debug) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <1edc10802d5eb23de8af28c9f48b8d3be0f1a468.1536744563.git.bdenes@scylladb.com>	2018-09-20 11:31:24 +02:00
Asias He	deff5e7d60	streaming: Add rpc streaming support This patch changes scylla streaming to use the recently added rpc streaming feature provided by seastar to send mutation fragments for scylla streaming instead of the rpc verbs. It also changes the receiver to write to the sstable file directly, skipping writing to memtable.	2018-07-13 08:36:47 +08:00
Asias He	faa6769cdb	streaming: Add estimate_partitions to send_info The sender needs to estimate the number of partitions to send, because the receiver needs this to prepare the sstables.	2018-07-13 08:36:46 +08:00
Asias He	db8c3a7059	streaming: Do not use dht::split_ranges_to_shards There is no need to call dht::split_ranges_to_shards to split the token range into <shard> : <a lot of small ranges> mapping and create a flat mutation reader with a lot of small ranges. Because: 1) The flat mutation reader on each shard only returns data belongs to this local shard, there is no correctness issue if we do not split and feed the sub ranges only belongs to this local shard. 2) With murmur3_partitioner_ignore_msb_bits = 12, it is almost certain that given a token range, all the shards will have data for the range anyway. Even if we ask all the shards to work on the token range and some of the shards have no data for it, it is fine. We simply send no data from this shard. Tests: update_cluster_layout_tests.py Message-Id: <ac00cd21d6156c47b74451dd415d627481e48212.1526864222.git.asias@scylladb.com>	2018-05-21 10:42:45 +03:00
Asias He	a9dab60b6c	streaming: One cf per time on sender In the case there are large number of column families, the sender will send all the column families in parallel. We allow 20% of shard memory for streaming on the receiver, so each column family will have 1/N, N is the number of in-flight column families, memory for memtable. Large N causes a lot of small sstables to be generated. It is possible there are multiple senders to a single receiver, e.g., when a new node joins the cluster, the maximum in-flight column families is number of peer node. The column families are sent in the order of cf_id. It is not guaranteed that all peers has the same speed so they are sending the same cf_id at the same time, though. We still have chance some of the peers are sending the same cf_id. Fixes #3065 Message-Id: <46961463c2a5e4f1faff232294dc485ac4f1a04e.1513159678.git.asias@scylladb.com>	2017-12-13 12:32:41 +02:00
Paweł Dziepak	dca93bea23	db: convert make_streaming_reader() to flat_mutation_reader	2017-11-13 16:49:52 +00:00
Paweł Dziepak	6f1e0d3ed8	stream_transfer_task: switch to flat_mutation_reader	2017-11-13 16:49:52 +00:00
Paweł Dziepak	8bb672502d	fragment_and_freeze: allow callback to stop iteration There is a user of fragment_and_freeze() (streaming) that will need to be able to break the loop Right now, it does that between streamed_mutation, but that won't be possible after we switch to flat readers.	2017-11-13 16:44:33 +00:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Avi Kivity	ca69a04969	streaming: avoid auto in function argument declaration 'auto' in a non-lambda function argument is not legal C++, and is hard to read besides. Replace with the right type.	2017-04-17 23:03:15 +03:00
Asias He	937f28d2f1	Convert to use dht::partition_range_vector and dht::token_range_vector	2016-12-19 14:08:50 +08:00
Asias He	e5485f3ea6	Get rid of query::partition_range Use dht::partition_range instead	2016-12-19 08:09:25 +08:00
Asias He	d1178fa299	Convert to use dht::token_range	2016-12-19 08:04:29 +08:00
Asias He	ba54654af3	streaming: Use interval_set to sort and merge ranges So that the ranges are sorted and have no overlaps. We can have less ranges to deal with and it can help the mutation readers to optimize. Here is an exmaple of ranges generated by repair: Before: INFO 2016-12-07 17:44:21,185 [shard 0] stream_session - cf_id = dec9fa90-bc3b-11e6-af78-000000000001, before ranges = {(-3383928698815274642, -3376937163195039606], (-7260764223708720005, -7251657821052234309], (-4767213984179237293, -4747032371925842389], (-7645879646119667643, -7589962743703481776], (-2340199306656526861, -2320523117224780931], (-576028861239229331, -560973674020019962], (-4070378863644120252, -3987599893827407860], (-2551584407739673151, -2498779102482524711], (-5416061903556353312, -5354212455975869358], (37594980457713898, 67885601051654285], (3083778975065200884, 3091232478835418439], (3131345970514528877, 3187922544267434961], (5765437476661317163, 5778671293583720541], (5960610072466058818, 5972289771228014343], (7749618183851698485, 7758080813117351135], (-3987599893827407860, -3899198931034439776], (-7251657821052234309, -7131649010279865221], (-3576581915808403133, -3383928698815274642], (-417850207760366422, -327959672080599465], (-2671876682129336880, -2551584407739673151], (-1305178847032904465, -1137497074548854552], (8540448858050275827, 8610171849752115483], (-560973674020019962, -417850207760366422], (-2498779102482524711, -2340199306656526861], (2394447940525988167, 2523396860109747637], (-6703329224557608009, -6517757811218772762], (-3675103288021821677, -3576581915808403133], (-5622185785296846551, -5416061903556353312], (8610171849752115483, 8742605005068551458], (8068079250973315241, 8185655671734937642], (560264964510741191, 790641981923757238], (5581202487214475094, 5765437476661317163], (8742605005068551458, 8923908282731801645], (-6038176423022601107, -5622185785296846551], (5778671293583720541, 5960610072466058818], (-3899198931034439776, -3675103288021821677], (8356739976149429222, 8540448858050275827], (-6517757811218772762, -6038176423022601107], (-8052600134279395253, -7645879646119667643], (-327959672080599465, 37594980457713898], (7758080813117351135, 8019254284118543066], (4781565016737645510, 5067070718000527886], (2523396860109747637, 3083778975065200884], (-5354212455975869358, -4767213984179237293], (6784138025918878582, 7190719703944308372], (67885601051654285, 447405341661896387], (-2190610927722759275, -1305178847032904465], (-4747032371925842389, -4070378863644120252]}, size=48 After: INFO 2016-12-07 17:44:21,185 [shard 0] stream_session - cf_id = dec9fa90-bc3b-11e6-af78-000000000001, after ranges = {(-8052600134279395253, -7589962743703481776], (-7260764223708720005, -7131649010279865221], (-6703329224557608009, -3376937163195039606], (-2671876682129336880, -2320523117224780931], (-2190610927722759275, -1137497074548854552], (-576028861239229331, 447405341661896387], (560264964510741191, 790641981923757238], (2394447940525988167, 3091232478835418439], (3131345970514528877, 3187922544267434961], (4781565016737645510, 5067070718000527886], (5581202487214475094, 5972289771228014343], (6784138025918878582, 7190719703944308372], (7749618183851698485, 8019254284118543066], (8068079250973315241, 8185655671734937642], (8356739976149429222, 8923908282731801645]}, size=15	2016-12-12 11:09:26 +08:00
Asias He	1987264beb	streaming: Make streaming reader with ranges Now that we have the new interface to make readers with ranges, we can simplify the code a lot. 1) Less readers are needed before: number of ranges of readers after: smp::count readers at most 2) No foreign_ptr is needed There is no need to forward to a shard to make the foreign_ptr for send_info in the first phase and forward to that shard to execute the send_info in the second phase. 3) No do_with is needed in send_mutations since si now is a lw_shared_ptr 4) Fix possible user after free of 'si' in do_send_mutations We need to take a reference of 'si' when sending the mutation with send_stream_mutation rpc call, otherwise: msg1 got exception si->mutations_done.broken() si is freed msg2 got exception si is used again The issue is introduced in `dc50ce0ce5` (streaming: Make the mutation readers when streaming starts) which is master only, branch 1.5 is not affected.	2016-12-12 09:04:21 +08:00
Asias He	dc50ce0ce5	streaming: Make the mutation readers when streaming starts Currenlty we make the mutation readers for streaming at different time point, i.e., do_for_each(_ranges.begin(), _ranges.end(), [] (auto range) { make a mutation reader for this range read mutations from the reader and send }) If there are write workload in the background, we will stream extra data, since the later the reader is made the more data we need to send. Fix it by making all the readers before starting to stream. Fixes #1815 Message-Id: <1479341474-1364-2-git-send-email-asias@scylladb.com>	2016-11-17 12:41:53 +02:00
Tomasz Grabiec	c1a7e2090e	Revert "database: change find_column_families signature so it returns a lw_shared_ptr" This reverts commit `f3528ede65`.	2016-11-04 10:48:21 +01:00
Glauber Costa	f3528ede65	database: change find_column_families signature so it returns a lw_shared_ptr There are places in which we need to use the column family object many times, with deferring points in between. Because the column family may have been destroyed in the deferring point, we need to go and find it again. If we use lw_shared_ptr, however, we'll be able to at least guarantee that the object will be alive. Some users will still need to check, if they want to guarantee that the column family wasn't removed. But others that only need to make sure we don't access an invalid object will be able to avoid the cost of re-finding it just fine. Signed-off-by: Glauber Costa <glauber@scylladb.com> Message-Id: <722bf49e158da77ff509372c2034e5707706e5bf.1478111467.git.glauber@scylladb.com>	2016-11-03 13:27:31 +01:00
Avi Kivity	a35136533d	Convert ring_position and token ranges to be nonwrapping Wrapping ranges are a pain, so we are moving wrap handling to the edges. Since cql can't generate wrapping ranges, this means thrift and the ring maintenance code; also range->ring transformations need to merge the first and last ranges. Message-Id: <1478105905-31613-1-git-send-email-avi@scylladb.com>	2016-11-02 21:04:11 +02:00
Pekka Enberg	9ea24c9d2b	Merge "repair: less stream_plan and less streaming traffic" from Asias "This series improves repair by 1) using less streaming sessions 2) reducing unnecessary streaming traffic 3) fixing a hang during shutdown See commit log for "repair: Reduce stream_plan usage", "repair: Reduce unnecessary streaming traffic" and "streaming: Fail streaming sessions during shutdown" for details. Tested with repair_additional_test.py."	2016-09-28 09:54:15 +03:00
Asias He	ac6198208b	streaming: Switch to use make_streaming_reader Using make_streaming_reader for streaming on the sender side, it has the following advantages: - streaming, repair will not pollute the row cache on the sender side any more. Currently, we are risking evicting all the frequently-queried partitions from the cache when an operation like repair reads entire sstables and floods the row cache with swathes of cold data from they read from disk. - less data will be sent becasue the reader will only return existing data before the point of the reader is created, plus bounded amount of writes which arrive later. This helps reducing the streaming time in the case new data is being inserted all the time while streaming is in progress. E.g., adding a new node while there is a lot of cql write workload. Fixes #382 Fixes #1682	2016-09-26 16:12:56 +08:00
Asias He	576e15532f	streaming: Add append_ranges for stream_transfer_task Allow to append more ranges to transfer for a stream transfer task.	2016-09-26 06:28:50 +08:00
Piotr Jastrzebski	3607d99269	Remove clustering_key_filtering_context. Remove clustering_key_filter_factory and clustering_key_filtering_context. Use partition_slice directly with a static get_ranges method. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-08-30 20:31:55 +02:00
Duarte Nunes	aaa76d58ba	query: Move to_partition_range to dht namespace This patch moves to_partition_range, from the query namespace to the dht namespace, where it is a more natural fit. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1468498060-19251-1-git-send-email-duarte@scylladb.com>	2016-07-15 10:41:52 +02:00
Paweł Dziepak	e779e2f0c9	streaming: do not fragment mutations in mixed cluster The receiving side needs to handle fragmented mutations properly so that isolation guarantees are not broken. If the receiving node may be an old one do not fragment mutations. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-13 09:51:23 +01:00
Paweł Dziepak	d9eb4d8028	streaming: use fragment_and_freeze() to send mutations Commit `206955e4` "streaming: Reduce memory usage when sending mutations" moved streaming mutation limiter from do_send_mutations() to send_mutations(). The reason for that was that send_mutation() did full mutation copies. That's no longer the case and streaming limiter should be moved back to do_send_mutation() in order to provide back pressure to fragment_and_freeze(). Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-07 12:18:36 +01:00
Paweł Dziepak	32a5de7a1f	db: handle receiving fragmented mutations If mutations are fragmented during streaming a special care must be taken so that isolation guarantees are not broken. Mutations received with flag "fragmented" set are applied to a memtable that is used only by that particular streaming task and the sstables created by flushing such memtables are not made visible until the task is complte. Also, in case the streaming fails all data is dropped. This means that fragmented mutations cannot benefit from coalescing of writes from multiple streaming plans, hence separate way of handling them so that there is no loss of performance for small partitions. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-07 12:18:35 +01:00
Paweł Dziepak	737eb73499	mutation_reader: make readers return streamed_mutations Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Asias He	94c9211b0e	streaming: Switch log level to warn instead of error dtest takes error level log as serious error. It is not a serious error for streaming to fail to send a verb and fail a streaming session, for example, the peer node is gone or stopped. Switch to use log level warn instead of level error. Fixes repair_additional_test.py:RepairAdditionalTest.repair_kill_3_test Fixes: #1335 Message-Id: <0149d30044e6e4d80732f1a20cd20593de489fc8.1465979288.git.asias@scylladb.com>	2016-06-15 13:01:22 +03:00
Asias He	96463cc17c	streaming: Fix indention in do_send_mutations Message-Id: <bc8cfa7c7b29f08e70c0af6d2fb835124d0831ac.1464857352.git.asias@scylladb.com>	2016-06-02 11:56:03 +03:00
Asias He	206955e47c	streaming: Reduce memory usage when sending mutations Limit disk bandwidth to 5MB/s to emulate a slow disk: echo "8:0 5000000" > /cgroup/blkio/limit/blkio.throttle.write_bps_device echo "8:0 5000000" > /cgroup/blkio/limit/blkio.throttle.read_bps_device Start scylla node 1 with low memory: scylla -c 1 -m 128M --auto-bootstrap false Run c-s: taskset -c 7 cassandra-stress write duration=5m cl=ONE -schema 'replication(factor=1)' -pop seq=1..100000 -rate threads=20 limit=2000/s -node 127.0.0.1 Start scylla node 2 with low memory: scylla -c 1 -m 128M --auto-bootstrap true Without this patch, I saw std::bad_alloc during streaming ERROR 2016-06-01 14:31:00,196 [shard 0] storage_proxy - exception during mutation write to 127.0.0.1: std::bad_alloc (std::bad_alloc) ... ERROR 2016-06-01 14:31:10,172 [shard 0] database - failed to move memtable to cache: std::bad_alloc (std::bad_alloc) ... To fix: 1. Apply the streaming mutation limiter before we read the mutation into memory to avoid wasting memory holding the mutation which we can not send. 2. Reduce the parallelism of sending streaming mutations. Before we send each range in parallel, after we send each range one by one. before: nr_vnode * nr_shard * (send_info + cf.make_reader memory usage) after: nr_shard * (send_info + cf.make_reader memory usage) We can at least save memory usage by the factor of nr_vnode, 256 by default. In my setup, fix 1) alone is not enough, with both fix 1) and 2), I saw no std::bad_alloc. Also, I did not see streaming bandwidth dropped due to 2). In addition, I tested grow_cluster_test.py:GrowClusterTest.test_grow_3_to_4, as described: https://github.com/scylladb/scylla/issues/1270#issuecomment-222585375 With this patch, I saw no std::bad_alloc any more. Fixes: #1270 Message-Id: <7703cf7a9db40e53a87f0f7b5acbb03fff2daf43.1464785542.git.asias@scylladb.com>	2016-06-02 11:01:58 +03:00
Piotr Jastrzebski	dcba6f5c45	Pass clustering_row_ranges to mutation readers. This will allow readers to reduce the amount of data read. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 14:36:57 +02:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Glauber Costa	10c8ca6ace	priority manager: separate streaming reads from writes Streaming has currently one class, that can be used to contain the read operations being generated by the streaming process. Those reads come from two places: - checksums (if doing repair) - reading mutations to be sent over the wire. Depending on the amount of data we're dealing with, that can generate a significant chunk of data, with seconds worth of backlog, and if we need to have the incoming writes intertwined with those reads, those can take a long time. Even if one node is only acting as a receiver, it may still read a lot for the checksums - if we're talking about repairs, those are coming from the checksums. However, in more complicated failure scenarios, it is not hard to imagine a node that will be both sending and receiving a lot of data. The best way to guarantee progress on both fronts, is to put both kinds of operations into different classes. This patch introduces a new write class, and rename the old read class so it can have a more meaningful name. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-03-23 09:12:59 -04:00
Asias He	f747df2aff	streaming: Fix rethrow in stream_transfer_task Fix bootstrap_test.py:TestBootstrap.failed_bootstap_wiped_node_can_join_test Logs on node 1: INFO 2016-03-11 15:53:43,287 [shard 0] gossip - FatClient 127.0.0.2 has been silent for 30000ms, removing from gossip INFO 2016-03-11 15:53:43,287 [shard 0] stream_session - stream_manager: Close all stream_session with peer = 127.0.0.2 in on_remove WARN 2016-03-11 15:53:43,498 [shard 0] stream_session - [Stream #4e411ba0-e75e-11e5-81f8-000000000000] stream_transfer_task: Fail to send STREAM_MUTATION_DONE to 127.0.0.2:0: std::runtime_error ([Stream #4e411ba0-e75e-11e5-81f8-000000000000] GOT STREAM_ MUTATION_DONE 127.0.0.1: Can not find stream_manager) terminate called without an active exception Backtrace on node 1: #0 0x00007fb74723da98 in raise () from /lib64/libc.so.6 #1 0x00007fb74723f69a in abort () from /lib64/libc.so.6 #2 0x00007fb74ab84aed in __gnu_cxx::__verbose_terminate_handler() () from /lib64/libstdc++.so.6 #3 0x00007fb74ab82936 in ?? () from /lib64/libstdc++.so.6 #4 0x00007fb74ab82981 in std::terminate() () from /lib64/libstdc++.so.6 #5 0x00007fb74ab82be9 in __cxa_rethrow () from /lib64/libstdc++.so.6 #6 0x0000000000f3521e in streaming::stream_transfer_task::<lambda()>::<lambda(auto:44)>::operator()<std::__exception_ptr::exception_ptr> (ep=..., __closure=0x7ffce74d8630) at streaming/stream_transfer_task.cc:169 #7 do_void_futurize_apply<const streaming::stream_transfer_task::start()::<lambda()>::<lambda(auto:44)>&, std::__exception_ptr::exception_ptr> (func=...) at /home/asias/src/cloudius-systems/scylla/seastar/core/future.hh:1142 #8 futurize<void>::apply<const streaming::stream_transfer_task::start()::<lambda()>::<lambda(auto:44)>&, std::__exception_ptr::exception_ptr> (func=...) at /home/asias/src/cloudius-systems/scylla/seastar/core/future.hh:1190 #9 future<>::<lambda(auto:7&&)>::operator()<future<> > ( fut=fut@entry=<unknown type in /home/asias/src/cloudius-systems/scylla/build/release/scylla, CU 0xec84d00, DIE 0xee2561d>, __closure=__closure@entry=0x7ffce74d8630) at /home/asias/src/cloudius-systems/scylla/seastar/core/future.hh:1014 Message-Id: <1457684884-4776-2-git-send-email-asias@scylladb.com>	2016-03-11 11:14:05 +02:00
Asias He	a9ec752939	streaming: Reduce STREAM_MUTATION error logging There might be larger number of STREAM_MUTATION inflight. Log one error per column_family per range to avoid spam the log.	2016-03-10 10:56:48 +08:00

1 2

99 Commits