Given that resharding is now a mandatory synchronous step performed before
the table is populated, snapshot() can now get rid of the code which takes
into account whether or not an sstable is shared.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Reviewed-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20210414121549.85858-1-raphaelsc@scylladb.com>
We want to migrate `database::mutation_query()` off `mutation_query()`
to use `table::mutation_query()` instead. The reason is the same as for
making `table::query()` standalone: the `mutation_query()`
implementation increasingly became specific to how tables are queried
and is about to become even more specific due to impending changes to
how permits are obtained. As no-one in the codebase is doing generic
mutation queries on generic mutation sources we can just make this a
member of table.
This patch just adds `table::mutation_query()`, no user exists yet.
`table::mutation_query()` is identical to `mutation_query()`, except
that it is a coroutine.
`data_query()` is now just a thin wrapper over
`data_querier::consume_page()`. Furthermore, contrary to the old data
query method, it is not a generic way of querying a mutation source; it
is now closely tied to how we query tables. It does a querier lookup and
save. In the future we plan on tying it even closer to the table in how
permits are obtained. For this reason it is better to just inline it
into the `query()` method which invokes it.
This method is very hard to read or modify in its current form due to
all the continuation-chain boilerplate. Make it a coroutine to
facilitate future changes, in the next patches and beyond.
Now, sstables created by bootstrap and replace will be added to the
maintenance set, and once the operation completes, off-strategy compaction
will be started.
We wait until the end of the operation to trigger off-strategy, as reshaping
can be more efficient if we wait for all sstables before deciding what
to compact. Also, waiting for completion is no longer an issue because
we're able to read from new sstables using partitioned_sstable_set and
their existence isn't accounted for by the compaction backlog tracker yet.
Refs #5226.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Off-strategy compaction is about incrementally reshaping the off-strategy
sstables in maintenance set, using our existing reshape mechanism, until
the set is ready for integration into the main sstable set.
The whole operation is done in maintenance mode, using the streaming
scheduling group.
We can do it this way because data in maintenance set is disjoint, so
effects on read amplification are avoided by using
partitioned_sstable_set, which is able to efficiently and incrementally
retrieve data from disjoint sstables.
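The disjointness argument can be sketched with a minimal, hypothetical stand-in for partitioned_sstable_set (fake_sstable, partitioned_set and select() are illustrative names, not the actual Scylla API): because the sstables cover non-overlapping token ranges, an ordered map keyed by the first token lets a point lookup touch at most one sstable, so read amplification stays bounded no matter how many sstables are in the set.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for an sstable covering a contiguous token
// range [first, last]; ranges in the set are assumed disjoint.
struct fake_sstable {
    long first;
    long last;
    std::string name;
};

class partitioned_set {
    // Interval map keyed by the first token of each sstable.
    std::map<long, fake_sstable> _by_first_token;
public:
    void insert(fake_sstable sst) {
        _by_first_token.emplace(sst.first, std::move(sst));
    }
    // Returns the names of sstables that may contain `token`.
    // Since ranges are disjoint, at most one candidate exists.
    std::vector<std::string> select(long token) const {
        std::vector<std::string> out;
        auto it = _by_first_token.upper_bound(token);
        if (it != _by_first_token.begin()) {
            --it; // the only range that can start at or before `token`
            if (token <= it->second.last) {
                out.push_back(it->second.name);
            }
        }
        return out;
    }
};
```

The real implementation works on ring positions rather than raw longs, but the lookup structure is the point: selection cost is logarithmic in the set size and the result is at most one sstable per disjoint sub-set.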
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
SSTables that are off-strategy should be excluded by this function as
it's used to select candidates for regular compaction.
So in addition to only returning candidates from the main set, let's
also rename it to precisely reflect its behavior.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
This new sstable set will hold sstables created by repair-based
operations. A repair-based op creates 1 sstable per vrange (256),
so sstables added to this new set are disjoint, therefore they
can be efficiently read from using partitioned_sstable_set.
Compound set is changed to include this new set, so sstables in
this new set are automatically included when creating readers,
computing statistics, and so on.
This new set is not backlog tracked, so changes were needed to
prevent an sstable in this set from being added to or removed from
the tracker.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
From now on, _sstables becomes the compound set, and _main_sstables refers
only to the main sstables of the table. In the near future, maintenance
set will be introduced and will also be managed by the compound set.
So add_sstable() and on_compaction_completion() are changed to
explicitly insert and remove sstables from the main set.
By storing the compound set in _sstables, functions which used _sstables for
creating readers, computing statistics, etc., will not have to be changed
when we introduce the maintenance set, so this approach greatly minimizes
the code changes.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Compound set will not be inserted or erased directly, so let's change
this function to build a new set from scratch instead.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
After compound set, discard_sstables() will have to prune each set
individually and later refresh the compound set. So let's change
the function to support multiple sstable sets, taking into account
that an sstable set may not want to be backlog tracked.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
The purpose is to allow the code to be eventually reused by the maintenance
sstable set, which will soon be introduced.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
users of sstable_set::all() rely on the set itself keeping a reference
to the returned list, so users can iterate through the list assuming
that it is alive all the way through.
this will change in the future though, because there will be a
compound set impl which will have to merge the all() of multiple
managed sets, and the result is a temporary value.
so even range-based loops on all() have to keep a ref to the returned
list, to prevent it from being prematurely destroyed.
so the following code
for (auto& sst : *sstable_set.all()) { ...}
becomes
for (auto sstables = sstable_set.all(); auto& sst : *sstables) { ... }
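The idiom can be demonstrated with a runnable stand-in (compound_set_stub is an illustrative name; std::shared_ptr stands in for whatever smart pointer the real all() returns). The init-statement form keeps the merged list alive for the whole loop, whereas dereferencing the temporary directly in the range expression would leave a dangling reference once all() stops returning a long-lived member.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <vector>

// Hypothetical future compound set: all() has to merge the contents of
// several managed sets, so it returns a freshly built temporary.
struct compound_set_stub {
    std::vector<std::string> _a{"sst1", "sst2"};
    std::vector<std::string> _b{"sst3"};
    std::shared_ptr<std::vector<std::string>> all() const {
        auto merged = std::make_shared<std::vector<std::string>>(_a);
        merged->insert(merged->end(), _b.begin(), _b.end());
        return merged; // temporary: only the caller keeps it alive
    }
};

inline std::size_t count_sstables(const compound_set_stub& set) {
    std::size_t n = 0;
    // The init-statement (C++20) pins the returned list for the whole
    // loop; `for (auto& sst : *set.all())` would destroy the shared_ptr
    // temporary before the first iteration (pre-C++23).
    for (auto sstables = set.all(); auto& sst : *sstables) {
        (void)sst;
        ++n;
    }
    return n;
}
```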
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Currently, the sstable_set in a table is copied before every change
to allow accessing the unchanged version by existing sstable readers.
This patch changes the sstable_set to a structure that keeps all its
versions that are referenced somewhere and provides a way of getting
a reference to an immutable version of the set.
Each sstable in the set is associated with the versions it is alive in,
and is removed when all such versions don't have references anymore.
To avoid copying, the object holding all sstables in the set version is
changed to a new structure, sstable_list, which was previously an alias
for std::unordered_set<shared_sstable>, and which implements most of the
methods of an unordered_set, but its iterator uses the actual set with
all sstables from all referenced versions and iterates over those
sstables that belong to the captured version.
The methods that modify the set's contents give the strong exception
guarantee by first inserting new sstables into its containers, and erasing
them in case an exception is caught.
To release shared_sstables as soon as possible (i.e. when all references
to versions that contain them die), each time a version is removed, all
sstables that were referenced exclusively by this version are erased. We
are able to find these sstables efficiently by storing, for each version,
all sstables that were added and erased in it, and, when a version is
removed, merging it with the next one. When a version that adds an sstable
gets merged with a version that removes it, this sstable is erased.
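The merge step can be sketched as follows (version and merge_versions are illustrative names; the real structure additionally tracks per-version references and lives inside the sstable_set). When a no-longer-referenced version is folded into its successor, an "add" meeting a matching "erase" cancels out, and that is exactly the moment the sstable can be released.

```cpp
#include <cassert>
#include <set>
#include <string>

// Simplified sketch: each version records the sstables added and
// erased in it relative to its predecessor.
struct version {
    std::set<std::string> added;
    std::set<std::string> erased;
};

// Merges `old_v` (a version being removed) into `next_v` (its
// successor) and returns the sstables that can now be released,
// because no referenced version contains them any more.
inline std::set<std::string> merge_versions(version old_v, version& next_v) {
    std::set<std::string> released;
    for (auto& sst : old_v.added) {
        if (next_v.erased.count(sst)) {
            // Added in the old version, erased in the next: cancel out.
            next_v.erased.erase(sst);
            released.insert(sst);
        } else {
            next_v.added.insert(sst);
        }
    }
    for (auto& sst : old_v.erased) {
        next_v.erased.insert(sst);
    }
    return released;
}
```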
Fixes #2622
Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>
Closes #8111
* github.com:scylladb/scylla:
sstables: add test for checking the latency of updating the sstable_set in a table
sstables: move column_family_test class from test/boost to test/lib
sstables: use fast copying of the sstable_set instead of rebuilding it
sstables: replace the sstable_set with a versioned structure
sstables: remove potential ub
sstables: make sstable_set constructor less error-prone
The schema used to create the sstable writer has to be the same as the
schema used by the reader, as the former is used to interpret mutation
fragments produced by the reader.
Commit 9124a70 introduced a deferring point between reader creation
and writer creation which can result in schema mismatch if there was a
concurrent alter.
This could lead to the sstable writer crashing, or to a corrupted
sstable being generated.
Fixes #7994
Message-Id: <20210222153149.289308-1-tgrabiec@scylladb.com>
The sstable_set enables copying without iterating over all its elements,
so it's faster to copy a set and modify it than copy all its elements
while filtering the ones that were erased.
The modifications are done on a temporary version of the set, so that
if an operation fails, the base version remains unchanged.
Signed-off-by: Wojciech Mitros <wojciech.mitros@scylladb.com>
Refs #6148
Commitlog disk limit was previously a "soft" limit, in that we allowed allocating new segments, even if we were over
disk usage max. This would also cause us sometimes to create new segments and delete old ones, if badly timed in
needing and releasing segments, in turn causing useless disk IO for pre-allocation/zeroing.
This patch set does:
* Make limit a hard limit. If we have disk usage > max, we wait for delete or recycle.
* Make flush threshold configurable. Default is ask for flush when over 50% usage. (We do not wait for results)
* Make flush "partial". We flush X% of the used space (used - thres/2), and make the rp limit accordingly. This means we will try to clear the N oldest segments, not all. I.e. "lighter" flush. Of course, if the CL is wholly dominated by a single CF, this will not really help much. But when > 1 cf is used, it means we can skip those not having unflushed data < req rp.
* Force more eager flush/recycle if we're out of segments
Note: flush threshold is not exposed in scylla config (yet), because I am unsure of the wording, and even of whether it should be.
Note: testing is sparse, esp. in regard to latency/timeouts added in high usage scenarios. While I can fairly easily provoke "stalls" (i.e. forced waiting for segments to free up) with simple C-S, it is hard to say exactly where in a more sane config (I set my limits looow) latencies will start accumulating.
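The partial-flush arithmetic from the list above can be sketched as follows (partial_flush_bytes is a hypothetical helper, not the actual commitlog API): with the threshold at 50% of the disk usage max, once usage crosses the threshold we ask to release `used - threshold/2` bytes worth of the oldest segments, rather than flushing everything.

```cpp
#include <cassert>
#include <cstdint>

// Hedged sketch of the "flush X% of the used space" rule described
// above. Returns how many bytes of the oldest segments to try to
// release; 0 means usage is still below the flush threshold.
inline uint64_t partial_flush_bytes(uint64_t used, uint64_t max_disk,
                                    double threshold_ratio = 0.5) {
    uint64_t threshold =
        static_cast<uint64_t>(max_disk * threshold_ratio);
    if (used <= threshold) {
        return 0; // below threshold: no flush requested
    }
    // Flush (used - threshold/2): clears the N oldest segments, not all.
    return used - threshold / 2;
}
```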
Closes #7879
* github.com:scylladb/scylla:
commitlog: Force earlier cycle/flush iff segment reserve is empty
commitlog: Make segment allocation wait iff disk usage > max
commitlog: Do partial (memtable) flushing based on threshold
commitlog: Make flush threshold configurable
table: Add a flush RP mark to table, and shortcut if not above
we're unconditionally using make_combined_mutation_source(), which causes extra
allocations, even if memtable was flushed into a single sstable, which is the
most common case. memtable will only be flushed into more than one sstable if
TWCS is used and memtable had old data written into it due to out-of-order
writes.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20210205182028.439948-1-raphaelsc@scylladb.com>
Add a string describing where the sstables originated
from (e.g. memtable, repair, streaming, compaction, etc.)
If configure_writer is called with a nullptr, the origin
will be equal to an empty string.
Introduce test_env_sstables_manager that provides an overload
of configure_writer with no parameters that calls the base-class'
configure_writer with "test" origin. This was to reduce the
code churn in this patch and to keep the tests simple.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
We want to unify the various sstable reader creation methods and this
method taking a ring position instead of a partition range like
everybody else stands in the way of that.
This in effect reverts 68663d0de.
storage_service: Introduce load_and_stream
=== Introduction ===
This feature extends the nodetool refresh to allow loading arbitrary sstables
that do not belong to a node into the cluster. It loads the sstables from disk
and calculates the owning nodes of the data and streams to the owners
automatically.
For example, say the old cluster has 6 nodes and the new cluster has 3 nodes.
We can copy the sstables from the old cluster to any of the new nodes and
trigger the load and stream process.
This can make restores and migrations much easier.
=== Performance ===
I managed to get 40MB/s per shard on my build machine.
CPU: AMD Ryzen 7 1800X Eight-Core Processor
DISK: Samsung SSD 970 PRO 512GB
Assume 1TB sstables per node, each shard can do 40MB/s, each node has 32
shards, we can finish the load and stream 1TB of data in 13 mins on each
node.
1 TB / (40 MB/s per shard * 32 shards) ≈ 780 s ≈ 13 mins
=== Tests ===
backup_restore_tests.py:TestBackupRestore.load_and_stream_to_new_cluster_test
which creates a cluster with 4 nodes and inserts data, then uses
load_and_stream to restore to a 2-node cluster.
=== Usage ===
curl -X POST "http://{ip}:10000/storage_service/sstables/{keyspace}?cf={table}&load_and_stream=true"
=== Notes ===
Btw, with the old nodetool refresh, the node will not pick up the data
that does not belong to this node, but it will not delete it either. One
has to run nodetool cleanup to remove that data manually, which is a
surprise to me and probably to users as well. With load and stream, the
process will delete the sstables once it finishes streaming, so no
nodetool cleanup is needed.
The name of this feature, load and stream, follows load and store in the CPU world.
Fixes #7831
Closes #7846
* github.com:scylladb/scylla:
storage_service: Introduce load_and_stream
distributed_loader: Add get_sstables_from_upload_dir
table: Add make_streaming_reader for given sstables set
Adds a second RP to table, marking where we flushed last.
If a new flush request comes in that is below this mark, we
can skip a second flush.
This is to (in future) support incremental CL flush.
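The shortcut can be sketched as follows (flush_rp_mark and its members are hypothetical names, not the actual table API): remember the highest replay position already made durable, and skip any flush request at or below that mark.

```cpp
#include <cassert>
#include <cstdint>

// Minimal sketch of the flush replay-position (RP) mark: a second RP
// on the table recording where we flushed last.
struct flush_rp_mark {
    uint64_t flushed_rp = 0; // highest RP already covered by a flush
    // A new flush request below the mark can be skipped entirely.
    bool needs_flush(uint64_t requested_rp) const {
        return requested_rp > flushed_rp;
    }
    void on_flush_done(uint64_t rp) {
        if (rp > flushed_rp) {
            flushed_rp = rp;
        }
    }
};
```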
From now on, memtable flush will use the strategy's interposer consumer
iff split_during_flush is enabled (disabled by default).
It has an effect only for TWCS users, as TWCS is the only strategy that
implements this interposer consumer, which consists of
segregating data according to the window configuration.
Fixes #4617.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
As a preparation for interposer on flush, let's allow database write monitor
to store a shared sstable write permit, which will be released as soon as
any of the sstable writers reach the sealing stage.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
This extension is needed for future work where a memtable will be segregated
during flush into one or more sstables. So now multiple sstables can be added
to the set after a memtable flush, and compaction is only triggered at the
end.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
On every compaction completion, sstable set is rebuilt from scratch.
With LCS and ~160G of data per shard, it means we'll have to create
a new sstable set with ~1000 entries whenever compaction completes,
which will likely result in reactor stalling for a significant
amount of time.
This is fixed by futurizing build_new_sstable_list(), so it will
yield whenever needed.
Fixes#7758.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
row cache now allows updater to first prepare the work, and then execute
the update atomically as the last step. let's do that when rebuilding
the set, so now the new set is created in the preparation phase, and the
new set replaces the old one in the execution phase, satisfying the
atomicity requirement of row cache.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
procedure is changed to return the new set, so the caller will be responsible
for replacing the old set with the new one. this will allow our future
work where building new set and enabling it will be decoupled.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
External updater may do some preparatory work like constructing a new sstable list,
and at the end atomically replace the old list by the new one.
Decoupling the preparation from execution will give us the following benefits:
- the preparation step can now yield if needed to avoid reactor stalls, as it's
been futurized.
- the execution step will now be able to provide strong exception guarantees, as
it's now decoupled from the preparation step which can be non-exception-safe.
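The prepare/execute split can be sketched as follows (set_updater, prepare and execute are illustrative names, not the actual row-cache updater API): the preparation step may allocate, throw or yield, while the execution step only swaps a pointer and is noexcept, which is what gives the strong exception guarantee.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <vector>

using sstable_list_t = std::vector<std::string>;

struct set_updater {
    std::shared_ptr<sstable_list_t> _prepared;
    // Prepare: build the new list from scratch. May throw; in a real
    // futurized implementation this is also where we would yield.
    void prepare(const sstable_list_t& old_list, const std::string& added) {
        auto next = std::make_shared<sstable_list_t>(old_list);
        next->push_back(added);
        _prepared = std::move(next);
    }
    // Execute: atomically publish the prepared list. Only a pointer
    // swap, so it is noexcept by design.
    void execute(std::shared_ptr<sstable_list_t>& current) noexcept {
        current = std::move(_prepared);
    }
};
```

If prepare() throws, execute() is never called and the published list is untouched, which is the atomicity the row cache update requires.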
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Fixes#7732
When truncating with auto_snapshot on, we try to verify the low rp mark
from the CF against the sstables discarded by the truncation timestamp.
However, in a scenario like:
Fill memtables
Flush
Truncate with snapshot A
Fill memtables some more
Truncate
Move snapshot A to upload + refresh (load old tables)
Truncate
The last op will assert, because while we have sstables loaded, which
will be discarded now, we did not in fact generate any _new_ ones
(since memtables are empty), and the RP we get back from discard is
one from an earlier generation set.
(Any permutation of events that create the situation "empty memtable" +
"non-empty sstables with only old tables" will generate the same error).
Add a check before flushing for whether we actually have any
data, and if not, do not uphold the RP relation assert.
Closes #7799
Pending flushes can participate in races when a table
with auto_snapshot==false is dropped. The race is as follows:
1. A flush of table T is initiated
2. The flush operation is preempted
3. Table T is dropped without flushing, because it has auto_snapshot off
4. The flush operation from (2.) wakes up and continues
working on table T, which is already dropped
5. Segfault/memory corruption
To prevent such races, a phaser for pending flushes is introduced.
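The fix can be sketched with a minimal phaser built on std::condition_variable (an illustration of the idea only, not Seastar's phaser API): the drop path waits until every flush that started before the drop has finished, closing the window between steps (2) and (4).

```cpp
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <thread>

// Minimal phaser: counts in-flight operations and lets a waiter block
// until they have all drained.
class flush_phaser {
    std::mutex _mtx;
    std::condition_variable _cv;
    int _pending = 0;
public:
    void start_op() {
        std::lock_guard<std::mutex> lk(_mtx);
        ++_pending;
    }
    void end_op() {
        std::lock_guard<std::mutex> lk(_mtx);
        --_pending;
        _cv.notify_all();
    }
    // Called on table drop: blocks until every pending flush finished.
    void await_pending() {
        std::unique_lock<std::mutex> lk(_mtx);
        _cv.wait(lk, [this] { return _pending == 0; });
    }
};

// Runs one flush concurrently with a drop; returns true iff the drop
// observed the flush completing before "freeing" the table.
inline bool drop_waits_for_flush() {
    flush_phaser phaser;
    bool flushed = false;
    phaser.start_op();                 // flush of table T is initiated
    std::thread flush([&] {
        flushed = true;                // flush touches table T
        phaser.end_op();
    });
    phaser.await_pending();            // drop waits instead of racing
    bool ok = flushed;                 // safe: flushes have drained
    flush.join();
    return ok;
}
```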
This reverts commit dc77d128e9. It was reverted
due to a strange and unexplained diff, which is now explained. The
HEAD on the working directory being pulled from was set back, so git
thought it was merging the intended commits, plus all the work that was
committed from HEAD to master. So it is safe to restore it.
This reverts commit 0aa1f7c70a, reversing
changes made to 72c59e8000. The diff is
strange, including unrelated commits. There is no understanding of the
cause, so to be safe, revert and try again.
instead of partition_range.
It would be best to pass `partition_key` or `decorated_key` here.
However, the implementation of this function needs a `partition_range`
to pass into `sstable_set::select`, and `partition_range` must be
constructed from `ring_position`s. We could create the `ring_position`
internally from the key but that would involve a copy which we want to
avoid.
Lower level functions such as `create_single_key_sstable_reader`
were made methods of `sstable_set`.
The motivation is that each concrete sstable_set
may decide to use a better sstable reading algorithm specific to the
data structures used by this sstable_set. For this it needs to access
the set's internals.
A nice side effect is that we moved some code out of table.cc
and database.hh which are huge files.
Commit e5be3352cf ("database, streaming, messaging: drop
streaming memtables") removed streaming memtables; this removes
the mechanisms to synchronize them: _streaming_flush_gate and
_streaming_flush_phaser. The memory manager for streaming is removed,
and its 10% reserve is evenly distributed between memtables and
general use (e.g. cache).
Note that _streaming_flush_phaser and _streaming_flush_gate are
no longer used to synchronize anything - the gate is only used
to protect the phaser, and the phaser isn't used for anything.
Closes #7454
Users can change `durable_writes` anytime with ALTER KEYSPACE.
Cassandra reads the value of `durable_writes` every time it applies
a mutation, so changes to that setting take effect immediately. That is,
mutations are added to the commitlog only when `durable_writes` is `true`
at the moment of their application.
Scylla reads the value of `durable_writes` only at `keyspace` construction time,
so changes to that setting take effect only after Scylla is restarted.
This patch fixes the inconsistency.
Fixes #3034
Closes #7533
Require a schema and an operation name to be given to each permit when
created. The schema is of the table the read is executed against, and
the operation name, which is some name identifying the operation the
permit is part of. Ideally this should be different for each site the
permit is created at, to be able to discern not only different kind of
reads, but different code paths the read took.
As not all reads can be associated with one schema, the schema is allowed
to be null.
The name will be used for debugging purposes, both for coredump
debugging and runtime logging of permit-related diagnostics.
max_concurrent_for_each was added to seastar for replacing
sstable_directory::parallel_for_each_restricted by using
more efficient concurrency control that doesn't create an
unlimited number of continuations.
The series replaces the use of sstable_directory::parallel_for_each_restricted
with max_concurrent_for_each and exposes the sstable_directory::do_for_each_sstable
via a static method.
This method is used here by table::snapshot to limit the concurrency
of snapshot operations, which suffer from the same unbounded
concurrency problem sstable_directory solved.
In addition, sstable_directory::_load_semaphore, which was used
across calls to do_for_each_sstable, was replaced by a static per-shard
semaphore that caps concurrency across all calls to `do_for_each_sstable`
on that shard. This makes sense since the disk is a shared resource.
In the future, we may want to have a load semaphore per device rather than
a single global one. We should experiment with that.
Test: unit(dev)
* tag 'max_concurrent_for_each-v5' of github.com:bhalevy/scylla:
table: snapshot: use max_concurrent_for_each
sstable_directory: use an external load_semaphore
test: sstable_directory_test: extract sstable_directory creation into with_sstable_directory
distributed_loader: process_upload_dir: use initial_sstable_loading_concurrency
sstables: sstable_directory: use max_concurrent_for_each