scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-23 08:12:08 +00:00

Author	SHA1	Message	Date
Asias He	953af38281	streaming: Allow drop table during streaming Currently, if a table is dropped during streaming, the streaming would fail with no_such_column_family error. Since the table is dropped anyway, it makes more sense to ignore the streaming result of the dropped table, whether it is successful or failed. This allows users to drop tables during node operations, e.g., bootstrap or decommission a node. This is especially useful for the cloud users where it is hard to coordinate between a node operation by admin and user cql change. This patch also fixes a possible user after free issue by not passing the table reference object around. Fixes #10395 Closes #10396	2022-04-24 17:43:20 +03:00
Botond Dénes	b029bd3db7	tree: remove mutation_reader.hh include In most files it was unused. We should move these to the patch which moved out the last interesting reader from mutation_reader.hh (and added the corresponding new header include) but its probably not worth the effort. Some other files still relied on mutation_reader.hh to provide reader concurrency semaphore and some other misc reader related definitions.	2022-03-30 15:42:51 +03:00
Botond Dénes	fcf15fda94	readers: generating_reader: use noncopyable_function<> std::function<> requires the functor it wraps to be copyable, which is an unnecessarily strict requirement. To relax this, we use noncopyable_function<> instead. Since the former seems to lack some disambiguation magic of the latter, we add `_v1` and `_v2` postfixes to manually disambiguate.	2022-03-17 06:53:44 +02:00
Botond Dénes	35bbd54946	readers: merge generating.hh into generating_v2.hh Both variants return a v2 reader and we are going to keep both for a time to come.	2022-03-17 06:52:28 +02:00
Botond Dénes	7844ff9912	readers/generating.hh: return v2 reader from make_generating_reader() For now, the (v1) reader is just upgraded to v2 behind the scenes.	2022-03-17 06:51:20 +02:00
Mikołaj Sielużycki	1d84a254c0	flat_mutation_reader: Split readers by file and remove unnecessary includes. The flat_mutation_reader files were conflated and contained multiple readers, which were not strictly necessary. Splitting optimizes both iterative compilation times, as touching rarely used readers doesn't recompile large chunks of codebase. Total compilation times are also improved, as the size of flat_mutation_reader.hh and flat_mutation_reader_v2.hh have been reduced and those files are included by many file in the codebase. With changes real 29m14.051s user 168m39.071s sys 5m13.443s Without changes real 30m36.203s user 175m43.354s sys 5m26.376s Closes #10194	2022-03-14 13:20:25 +02:00
Botond Dénes	06e6bb6ec9	streaming: migrate to v2 variant of sstable writer API	2022-03-10 09:16:33 +02:00
Botond Dénes	ad1b157452	streaming/consumer: convert to v2 At least on the API level, internally there are still conversions, but these are going to be sorted out in the next patches too.	2022-03-02 09:55:09 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	63d254a8d2	Merge 'gms, service: futurize and coroutinize gossiper-related code' from Pavel Solodovnikov This series greatly reduces gossipers' dependence on `seastar::async` (yet, not completely). `i_endpoint_state_change_subscriber` callbacks are converted to return futures (again, to get rid of `seastar::async` dependency), all users are adjusted appropriately (e.g. `storage_service`, `cdc::generation_service`, `streaming::stream_manager`, `view_update_backlog_broker` and `migration_manager`). This includes futurizing and coroutinizing the whole function call chain up to the `i_endpoint_state_change_subscriber` callback functions. To aid the conversion process, a non-`seastar::async` dependent variant of `utils::atomic_vector::for_each` is introduced (`for_each_futurized`). A different name is used to clearly distinguish converted and non-converted code, so that the last step (remove `seastar::async()` wrappers around callback-calling code in gossiper) is easier. This is left for a follow-up series, though. Tests: unit(dev) Closes #9844 * github.com:scylladb/scylla: service: storage_service: coroutinize `set_gossip_tokens` service: storage_service: coroutinize `leave_ring` service: storage_service: coroutinize `handle_state_left` service: storage_service: coroutinize `handle_state_leaving` service: storage_service: coroutinize `handle_state_removing` service: storage_service: coroutinize `do_drain` service: storage_service: coroutinize `shutdown_protocol_servers` service: storage_service: coroutinize `excise` service: storage_service: coroutinize `remove_endpoint` service: storage_service: coroutinize `handle_state_replacing` service: storage_service: coroutinize `handle_state_normal` service: storage_service: coroutinize `update_peer_info` service: storage_service: coroutinize `do_update_system_peers_table` service: storage_service: coroutinize `update_table` service: storage_service: coroutinize `handle_state_bootstrap` service: storage_service: futurize `notify_*` functions service: storage_service: coroutinize `handle_state_replacing_update_pending_ranges` repair: row_level_repair_gossip_helper: coroutinize `remove_row_level_repair` locator: reconnectable_snitch_helper: coroutinize `reconnect` gms: i_endpoint_state_change_subscriber: make callbacks to return futures utils: atomic_vector: introduce future-returning `for_each` function utils: atomic_vector: rename `for_each` to `thread_for_each` gms: gossiper: coroutinize `start_gossiping` gms: gossiper: coroutinize `force_remove_endpoint` gms: gossiper: coroutinize `do_status_check` gms: gossiper: coroutinize `remove_endpoint`	2022-01-13 23:09:02 +02:00
Avi Kivity	134601a15e	Merge "Convert input side of mutation compactor to v2" from Botond " With this series the mutation compactor can now consume a v2 stream. On the output side it still uses v1, so it can now act as an online v2->v1 converter. This allows us to push out v2->v1 conversion to as far as the compactor, usually the next to last component in a read pipeline, just before the final consumer. For reads this is as far as we can go, as the intra-node ABI and hence the result-sets built are v1. For compaction we could go further and eliminate conversion altogether, but this requires some further work on both the compactor and the sstable writer and so it is left to be done later. To summarize, this patchset enables a v2 input for the compactor and it updates compaction and single partition reads to use it. " * 'mutation-compactor-consume-v2/v1' of https://github.com/denesb/scylla: table: add make_reader_v2() querier: convert querier_cache and {data,mutation}_querier to v2 compaction: upgrade compaction::make_interposer_consumer() to v2 mutation_reader: remove unecessary stable_flattened_mutations_consumer compaction/compaction_strategy: convert make_interposer_consumer() to v2 mutation_writer: migrate timestamp_based_splitting_writer to v2 mutation_writer: migrate shard_based_splitting_writer to v2 mutation_writer: add v2 clone of feed_writer and bucket_writer flat_mutation_reader_v2: add reader_consumer_v2 typedef mutation_reader: add v2 clone of queue_reader compact_mutation: make start_new_page() independent of mutation_fragment version compact_mutation: add support for consuming a v2 stream compact_mutation: extract range tombstone consumption into own method range_tombstone_assembler: add get_range_tombstone_change() range_tombstone_assembler: add get_current_tombstone()	2022-01-12 14:37:19 +02:00
Michael Livshin	be5118a7c9	convert table::make_streaming_reader() to flat_mutation_reader_v2 All changes are mechanical. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-01-11 10:49:26 +02:00
Pavel Solodovnikov	5dcfb94d5a	gms: i_endpoint_state_change_subscriber: make callbacks to return futures Coroutinize a few simple callbacks in the process. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2022-01-11 09:29:12 +03:00
Botond Dénes	1ba19c2aa4	compaction/compaction_strategy: convert make_interposer_consumer() to v2 The underlying timestamp-based splitter is v2 already.	2022-01-07 13:51:59 +02:00
Avi Kivity	bbad8f4677	replica: move ::database, ::keyspace, and ::table to replica namespace Move replica-oriented classes to the replica namespace. The main classes moved are ::database, ::keyspace, and ::table, but a few ancillary classes are also moved. There are certainly classes that should be moved but aren't (like distributed_loader) but we have to start somewhere. References are adjusted treewide. In many cases, it is obvious that a call site should not access the replica (but the data_dictionary instead), but that is left for separate work. scylla-gdb.py is adjusted to look for both the new and old names.	2022-01-07 12:04:38 +02:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Raphael S. Carvalho	426450dc04	treewide: remove useless include of database.hh Wrote a script based on cpp-include to find places that needlessly included database.hh, which is expensive to process during build time. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20220104204359.168895-1-raphaelsc@scylladb.com>	2022-01-05 10:15:19 +02:00
Pavel Emelyanov	4a34226aa6	streaming, main: Remove global stream_manager Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	50e6d334a9	stream_transfer_task: Get manager from session (result-future) When the task starts it needs the stream_manager to get messaging service and database from. There's a session at hands and this session is properly initialized thus it has the result-future. Voila -- we have the manager! Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	95d26bc420	stream_transfer_task: Keep Updater fn onboard The helper function called send_mutation_fragments needs the manager to update stats about stream_transfer_task as it goes on. Carrying the manager over its stack is quite boring, but there's a helper send_info object that lives there. Equip the guy with the updating function and capture the manager by it early to kill one more usage of the global stream_manager call. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	9ee208de8d	stream_transfer_task: Remove unused database reference The send_info helper keeps it, but doesn't use. Remove. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	a3b4d4d3cf	stream_session: Use manager reference from result-future When the stream_session initializes it's being equipped with the shared-pointer on the stream_result_future very early. In all the places where stream_session needs the manager this pointer is alive and session get get manager from it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	56f5327450	stream_session: Capture container() in message handler The stream_mutation_fragments handler need to access the manager. Since the handler is registered by the manager itself, it can capture the local manager reference and use container() where appropriate. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	db33607eb2	stream_session: Keep stream_manager reference The manager is needed to get messaging service and database from. Actually, the database can be pushed though arguments in all the places, so effectively session only needs the messaging. However, the stream-task's need the manager badly and there's no other place to get it from other than the session. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	f2ae080c63	stream_session: Remove unused default contructor Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	307a2583ee	stream_result_future: Use local manager reference The reference is present in all the required places already. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	5b748a72de	stream_result_future: Keep stream_manager reference The stream_result_future needs manager to register on it and to unregister from it. Also the result-future is referenced from stream_session that also needs the manager (see next patches). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	3087422d4d	stream_plan: Keep stream_manager onboard The plan itself doesn't need it, but it creates some lower level objects that do. Next patches will use this reference. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:17:37 +03:00
Pavel Emelyanov	3a9eb6af28	stream_manager: Use container() in notification code Continuation of the previous patch -- some native stream_manager methods can enjoy using container() call. One nit -- the [] access to the map of statistics now runs in const context and cannot create elements, so switch this place into .at() method. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:59 +03:00
Pavel Emelyanov	8ab96a8362	streaming: Move get_session into stream_manager This makes the code a bit shorter and helps removing one more call for global stream manager. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:59 +03:00
Pavel Emelyanov	228b4520a6	streaming: Use container.invoke_on in rpc handlers This will help to reduce the usage of global manager instance. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:59 +03:00
Pavel Emelyanov	c2c676784a	streaming: Fix interaction with gossiper Streaming manager registers itself in gossiper, so it needs an explicit dependency reference. Also it forgets to unregister itself, so do it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:59 +03:00
Pavel Emelyanov	73e10c7aed	streaming: Move start/stop onto common rails In case of streaming this mostly means dropping the global init/uninit calls and replacing them with sharded<stream_manager> instance. It's still global, but it's being fixed atm. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	08818ffe75	streaming: Rename .stop() into .shutdown() The start/stop standard is becoming like sharded<foo> foo; foo.start(); defer([] { foo.stop() }); foo.invoke_on_all(&foo::start); ... defer([] { foo.shutdown() }); wait_for_stop_signal(); /* quit making the above defers self-unroll */ where .shutdown() for a service would mean "do whatever is appropriate to start stopping, the real synchronous .stop() will come some time later". According to that, rename .stop() as it's really the mentioned preparation, not real stopping. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	ba298bd5c6	streaming: Remove global dependency pointers Now they are not needed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	6d7eb76fad	streaming: Use get_stream_manager to get dependencies Currently streaming uses global pointers to save and get a dependency. Now all the dependencies live on the manager, this patch changes all the places in streaming/ to get the needed dependencies from it, not from global pointer (next patch will remove those globals). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	e448774588	streaming: Move rpc verbs reg/unreg into manager As a part of streaming start/stop unification. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Pavel Emelyanov	165971fb7f	streaming: Initialize stream manager with proper deps The stream manager is going to become central point of control for the streaming subsys. This patch makes its dependencies explicit and prepares the gound for further patching. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-11-24 12:15:58 +03:00
Avi Kivity	7abd105d79	streaming: replace seastar::sprint() with fmt::format() sprint() is obsolete.	2021-10-27 17:02:00 +03:00
Avi Kivity	acf8da2bce	Merge "flat_mutation_reader: keep timeout in permit" from Benny " This series moves the timeout parameter, that is passed to most f_m_r methods, into the reader_permit. This eliminates the need to pass the timeout around, as it's taken from the permit when needed. The permit timeout is updated in certain cases when the permit/reader is paused and retrieved later on for reuse. Following are perf_simple_query results showing ~1% reduction in insns/op and corresponding increase in tps. $ build/release/test/perf/perf_simple_query -c 1 --operations-per-shard 1000000 --task-quota-ms 10 Before: 102500.38 tps ( 75.1 allocs/op, 12.1 tasks/op, 45620 insns/op) After: 103957.53 tps ( 75.1 allocs/op, 12.1 tasks/op, 45372 insns/op) Test: unit(dev) DTest: repair_additional_test.py:RepairAdditionalTest.repair_abort_test (release) materialized_views_test.py:TestMaterializedViews.remove_node_during_mv_insert_3_nodes_test (release) materialized_views_test.py:InterruptBuildProcess.interrupt_build_process_with_resharding_half_to_max_test (release) migration_test.py:TTLWithMigrate.big_table_with_ttls_test (release) " * tag 'reader_permit-timeout-v6' of github.com:bhalevy/scylla: flat_mutation_reader: get rid of timeout parameter reader_concurrency_semaphore: use permit timeout for admission reader_concurrency_semaphore: adjust reactivated reader timeout multishard_mutation_query: create_reader: validate saved reader permit repair: row_level: read_mutation_fragment: set reader timeout flat_mutation_reader: maybe_timed_out: use permit timeout test: sstable_datafile_test: add sstable_reader_with_timeout reader_permit: add timeout member	2021-08-25 17:51:10 +03:00
Raphael S. Carvalho	a4053dbb72	repair: Postpone data segregation to off-strategy compaction With data segregation on repair, thousands of sstables are potentially added to maintenance set which causes high latency due to stalls. That's because N*M sstables are created by a repair, where N = # of ranges and M = # of segregations For TWCS, M = # of windows. Assuming N = 768 and M = 20, ~15k sstables end up in sstable set To fix this problem, let's avoid performing data segregation in repair, as offstrategy will already perform the segregation anyway. So from now on, only N non-overlapping sstables will be added to set. Read amplification isn't affected because a query will only touch one sstable in maintenance set. When offstrategy starts, it will pick all sstables from set and compact them in a single step while performing data segregation, so data is properly laid out before integrated into the main set. tests: - sstable_compaction_test.twcs_reshape_with_disjoint_set_test - mode(dev) - manual test using repair-based bootstrap Fixes #9199. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20210824185043.76475-1-raphaelsc@scylladb.com>	2021-08-25 15:31:38 +03:00
Benny Halevy	4476800493	flat_mutation_reader: get rid of timeout parameter Now that the timeout is taken from the reader_permit. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-24 16:30:51 +03:00
Avi Kivity	1b492396c1	stream_session.cc: trim unneeded includes stream_session.cc doesn't need storage_proxy, or sstables, or the system keyspace. Remove them. Closes #9230	2021-08-23 10:57:04 +03:00
Botond Dénes	5293bd21cf	streaming/stream_session: use database::obtain_reader_permit()	2021-07-14 16:48:43 +03:00
Botond Dénes	999169e535	database: make_streaming_reader(): require permit As a preparation for up-front admission, add a permit parameter to `make_streaming_reader()`, which will be the admitted permit once we switch to up-front admission. For now it has to be a non-admitted permit. A nice side-effect of this patch is that now permits will have a use-case specific description, instead of the generic "streaming" one.	2021-07-14 16:48:43 +03:00
Avi Kivity	a57d8eef49	Merge 'streaming: make_streaming_consumer: close reader on errors' from Benny Halevy Currently, if e.g. find_column_family throws an error, as seen in #8776 when the table was dropped during repair, the reader is not closed. Use a coroutine to simplify error handling and close the reader if an exception is caught. Also, catch an error inside the lambda passed to make_interposer_consumer when making the shared_sstable for streaming, and close the reader their and return an exceptional future early, since the reader will not be moved to sst->write_components, that assumes ownership over it and closes it in all cases. Fixes #8776 Test: unit(dev) DTest: repair_additional_test.py:RepairAdditionalTest.repair_while_table_is_dropped_test (dev, debug) w/ https://github.com/scylladb/scylla/pull/8635#issuecomment-856661138 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #8782 * github.com:scylladb/scylla: streaming: make_streaming_consumer: close reader on errors streaming: make_streaming_consumer: coroutinize returned function	2021-06-09 15:02:36 +03:00
Asias He	5c9816615f	streaming: Enable off-strategy compaction for bootstrap and replace The off-strategy compaction is now enabled for repair based node operations. It is not bound to repair based node operations though. It makes sense to enable it for streaming based node operations too. Fixes #8820 Closes #8821	2021-06-08 12:13:20 +03:00
Benny Halevy	2e93996473	streaming: make_streaming_consumer: close reader on errors Currently, if e.g. find_column_family throws an error, as seen in #8776 when the table was dropped during repair, the reader is not closed. Use a coroutine to simplify error handling and close the reader if an exception is caught. Also, catch an error inside the lambda passed to make_interposer_consumer when making the shared_sstable for streaming, and close the reader their and return an exceptional future early, since the reader will not be moved to sst->write_components, that assumes ownership over it and closes it in all cases. Fixes #8776 Test: unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-08 08:50:46 +03:00
Benny Halevy	42028c324c	streaming: make_streaming_consumer: coroutinize returned function To simplify error handling in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-08 08:48:33 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00

1 2 3 4 5 ...

585 Commits