scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 11:30:36 +00:00

Author	SHA1	Message	Date
Botond Dénes	11105cbb78	reader_concurrency_semaphore: make inactive read handles unique across semaphores Currently inactive read handles are only unique within the same semaphore, allowing for an unregister against another semaphore to potentially succeed. This can lead to disasters ranging from crashes to data corruption. While a handle should never be used with another semaphore in the first place, we have recently seen a bug (#6613) causing exactly that, so in this patch we prevent such unregister operations from ever succeeding by making handles unique across all semaphores. This is achieved by adding a pointer to the semaphore to the handle.	2020-07-23 16:43:33 +03:00
Avi Kivity	4e79296090	tracked_file_impl: inherit disk and memory alignment from underlying file tracked_file_impl is a wrapper around another file, that tracks memory allocated for buffers in order to control memory consumption. However, it neglects to inherit the disk and memory alignment settings from the wrapped file, which can cause unnecessarily-large buffers to be read from disk, reducing throughput. Fix by copying the alignment parameters. Fixes #6290.	2020-06-11 17:43:50 +03:00
Botond Dénes	3cd2598ab3	reader_permit: forbid empty permits Remove `no_reader_permit()` and all ways to create empty (invalid) permits. All permits are guaranteed to be valid now and are only obtainable from a semaphore. `reader_permit::semaphore()` now returns a reference, as it is guaranteed to always have a valid semaphore reference.	2020-05-28 11:34:35 +03:00
Botond Dénes	f417b9a3ea	reader_concurrency_semaphore: remove wait_admission and consume_resources() Permits are now created with `make_permit()` and code is using the permit to do all resource consumption tracking and admission waiting, so we can remove these from the semaphore. This allows us to remove some now unused code from the permit as well, namely the `base_cost` which was used to track the resource amount the permit was created with. Now this amount is also tracked with a `resource_units` RAII object, returned from `reader_permit::wait_admission()`, so it can be removed. Curiously, this reduces the reader permit to be glorified semaphore pointer. Still, the permit abstraction is worth keeping, because it allows us to make changes to how the resource tracking part of the semaphore works, without having to change the huge amount of code sites passing around the permit.	2020-05-28 11:34:35 +03:00
Botond Dénes	bf4ade8917	reader_permit: resource_units: introduce add() Allows merging two resource_units into one.	2020-05-28 11:34:35 +03:00
Botond Dénes	4d7250d12b	reader_permit: add wait_admission We want to make `read_permit` the single interface through which reads interact with the concurrency limiting mechanism. So far it was only usable to track memory consumption. Add the missing `wait_admission()` and `consume_resources()` to the permit API. As opposed to `reader_concurrency_semaphore::` equivalents which returned a permit, the `reader_permit::` variants jut return `reader_permit::resource_units` which is an RAII holder for the acquired units. This also allows for the permit to be created earlier, before the reader is admitted, allowing for tracking pre-admission memory usage as well. In fact this is what we are going to do in the next patches. This patch also introduces a `broken()` method on the reader concurrency semaphore which resolves waiters with an exception. This method is also called internally from the semaphore's destructor. This is needed because the semaphore can now have external waiters, who has to be resolved before the semaphore itself is destroyed.	2020-05-28 11:34:35 +03:00
Botond Dénes	bd793d6e19	reader_permit: resource_units: work in terms of reader_resources Refactor resource_units semantically as well to work in terms of reader_resources, instead of just memory.	2020-05-28 11:34:35 +03:00
Botond Dénes	0f9c24631a	reader_permit: s/memory_units/resource_units/ We want to refactor reader_permit::memory_units to work in terms of reader_resources, as we are planning to use it for guarding count resources as well. This patch makes the first step: renames it from memory_units to resources_units. Since this is a very noisy change, we do it in a separate patch, the semantic change is in the next patch.	2020-05-28 11:34:35 +03:00
Avi Kivity	88ade3110f	treewide: replace calls to engine().some_api() with some_api() This removes the need to include reactor.hh, a source of compile time bloat. In some places, the call is qualified with seastar:: in order to resolve ambiguities with a local name. Includes are adjusted to make everything compile. We end up having 14 translation units including reactor.hh, primarily for deprecated things like reactor::at_exit(). Ref #1	2020-04-05 12:46:04 +03:00
Botond Dénes	05116ba963	reader_concurrency_semaphore: make signal() noexcept Currently reader_concurrency_semaphore::signal() can fail. This is dangerous in two ways: * It is called from constructors, so the exception can bring down the node. This will convert an `std::bad_alloc` to a crash. * Reads in the queue will be blocked until they either time-out, or another `signal()` succeeds. To solve this, wrap the `reader_permit` constructor, the only code that can throw, with try-catch and forward the exception to the reader admission promise. In practice this will result in the flushing of the reader queue, when we fail to admit a read. Fixes #5741 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200206154238.707031-1-bdenes@scylladb.com>	2020-02-06 17:51:03 +02:00
Botond Dénes	434d32befe	reader_permit: tidy up reader_permit::memory_units This patch is a bag of fixes/cleanups that were omitted from the reader memory tracking series due to contributor error. It contains the following changes: * Get rid of unused `increase()` and `decrease()` methods. * Make all constructors and assignment operators `noexcept`. * Make move assignment operator safe w.r.t. self assignment. * `reset()`: consume the new amount before releasing the old amount, to prevent a transient window where new readers might be admitted. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200206143007.633069-1-bdenes@scylladb.com>	2020-02-06 16:35:07 +02:00
Botond Dénes	92fffe51d5	reader_concurrency_semaphore: tracking_file_impl: consume memory speculatively Consume the memory before even submitting the I/O to the underlying `file` object. This is in line with the underlying `file` object allocating the buffer before it forwards the I/O request to the kernel. This extends the "visibility" over the memory consumed by I/O greatly, as it turns out buffers spend most time alive waiting for the I/O to complete and are parsed shortly afterwards.	2020-01-28 08:13:16 +02:00
Botond Dénes	4bb3c7b1f0	reader_concurrency_semaphore: bye reader_resource_tracker Replaced by `reader_permit`, of which it was a mere wrapper of in the first place.	2020-01-28 08:13:16 +02:00
Botond Dénes	dea24ca859	reader_permit: expose make_tracked_temporary_buffer() Previously `tracking_file_impl::make_tracked_buf()`. In the next patches we plan on using this outside `tracking_file_impl`, so make it public and templatize on the char type.	2020-01-28 08:13:16 +02:00
Botond Dénes	16cea36a94	reader_permit: introduce make_tracked_file() Free function equivalent of `reader_resource_tracker::track_file()`, using a `reader_permit` directly.	2020-01-28 08:13:16 +02:00
Botond Dénes	1859a03629	reader_permit: introduce memory_units Similar to `seastar::semaphore_units`, this allows consuming and releasing memory via an RAII object. In addition to that, it also allows tracking changing values. This feature was designed to be used for tracking the ever changing memory consumption of the buffers of `flat_mutation_reader`:s. This is now the only supported way of consuming memory from a permit.	2020-01-28 08:13:16 +02:00
Botond Dénes	c0f96db2d9	reader_concurrency_semaphore: mv reader_resources and reader_permit to reader_permit.hh In the next patches we will replace `reader_resource_tracker` and have code use the `reader_permit` directly. In subsequent patches, the `reader_permit` will get even more usages as we attempt to make the tracking of reader resource more accurate by tracking more parts of it. So the grand plan is that the current `reader_concurrency_semaphore.hh` is split into two headers: * `reader_concurrency_semaphore.hh` - containing the semaphore proper. * `reader_permit.hh` - a very lightweight header, to be used by components which only want to track various parts of the resource consumption of reads.	2020-01-28 08:13:16 +02:00
Botond Dénes	2005495857	reader_concurrency_semaphore: reader_permit: make it a value type Currently `reader_permit` is passed around as `lw_shared_ptr<reader_permit>`, which is clunky to write and use and is also an unnecessary leak of details on how permit ownership is managed. Make `reader_permit` a simple value type, making it a little bit easier and safer to use. In the next patches we will get rid of `reader_resource_tracker` and instead have code use the permit instance directly, so this small improvement in usability will go a long way towards preventing eye sore.	2020-01-28 08:13:16 +02:00
Botond Dénes	89c5fd0c25	reader_concurrency_semaphore::reader_permit: move methods out-of-line In preparation for making the reader_permit a top-level class, and moving it to another file. It is also good practice to define non-performance critical methods out-of-line to reduce header bloat.	2020-01-28 08:13:16 +02:00
Juliusz Stasiewicz	d043393f52	db+semaphores+tests: mandatory `name' param in reader_concurrency_semaphore Exception messages contain semaphore's name (provided in ctor). This affects the queue overflow exception as well as timeout exception. Also, custom throwing function in ctor was changed to `prethrow_action', i.e. metrics can still be updated there but now callers have no control over the type of the exception being thrown. This affected `restricted_reader_max_queue_length' test. `reader_concurrency_semaphore'-s docs are updated accordingly.	2019-12-03 15:41:34 +01:00
Botond Dénes	e56c26205f	reader_concurrency_semaphore: add counters for inactive reads Add counters that give insight into inactive read related events. Two counters are added: * permit_based_evictions * population	2019-01-07 16:45:49 +02:00
Botond Dénes	dfd649a6b4	reader_concurrency_semaphore: add consume_resources()	2018-12-17 13:18:08 +02:00
Botond Dénes	77dbc7d09a	querier: fix evict_one() and evict_all_for_table() Both of these have the same problem. They remove the to-be-evicted entries from `_entries` but they don't unregister the `entry` from the `read_concurrency_semaphore`. This results in the `reader_concurrency_semaphore` being left with a dangling pointer to the entries will trigger segfault when it tries to evict the associated inactive reads. Also add a unit test for `evict_all_for_table()` to check that it works properly (`evict_one()` is only used in tests, so no dedicated test for it). Fixes: #3962 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <57001857e3791c6385721b624d33b667ccda2e7d.1544010868.git.bdenes@scylladb.com>	2018-12-05 21:51:01 +02:00
Botond Dénes	37f0117747	reader_concurrency_semaphore: refactor eviction mechanism As we are about to add multiple sources of evictable readers, we need a more scalable solution than a single functor being passed that opaquely evicts a reader when called. Add a generic way to register and unregister evictable (inactive) readers to the semaphore. The readers are expected to be registered when they become evictable and are expected to be unregistered when they cease to become evictable. The semaphore might evict any reader that is registered to it, when it sees fit. This also solves the problem of notifying the semaphore when new readers become evictable. Previously there was no such mechanism, and the semaphore would only evict any such new readers when a new permit was requested from it.	2018-12-04 08:51:00 +02:00
Botond Dénes	0cb7c43fb5	reader_concurrency_semaphore: add dedicated .cc file As we are about to extend the functionality of the reader concurrency semaphore, adding more method implementations that need to go to a .cc file, it's time we create a dedicated file, instead of keep shoving them into unrelated .cc files (mutation_reader.cc).	2018-12-03 13:37:02 +02:00

25 Commits