scylladb

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Botond Dénes	85c42a5d76	querier: convert querier_cache and {data,mutation}_querier to v2 The shard_mutation_querier is left using a v1 reader in its API as the multishard query code is not ready yet. When saving this reader it is upgraded to v2 and on lookup it is downgraded to v1. This should cancel out thanks to upgrade/downgrade unwrapping.	2022-01-07 13:52:26 +02:00
Michael Livshin	a1b8ba23d2	reader_concurrency_semaphore: convert to flat_mutation_reader_v2 Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2021-12-21 11:26:17 +02:00
Tomasz Grabiec	cc56a971e8	database, treewide: Introduce partition_slice::is_reversed() Cleanup, reduces noise. Message-Id: <20211014093001.81479-1-tgrabiec@scylladb.com>	2021-10-14 12:39:16 +03:00
Benny Halevy	9b0b13c450	reader_concurrency_semaphore: adjust reactivated reader timeout Update the reader's timeout where needed after unregistering inactive_read. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-24 16:30:51 +03:00
Benny Halevy	e9aff2426e	everywhere: make deferred actions noexcept Prepare for updating seastar submodule to a change that requires deferred actions to be noexcept (and return void). Test: unit(dev, debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-22 21:11:52 +03:00
Botond Dénes	f37e26c73d	querier: remove now unused cache_context	2021-07-14 16:48:43 +03:00
Botond Dénes	6efb278ea3	querier_cache: insert(): close refused queriers The querier cache refuses to cache queriers that read in reverse. These queriers are also not closed, with the caller having no way to determine whether the querier it just moved into `insert()` needs a close afterwards or not, requiring a `close()` on the moved-from querier just to be sure. Avoid this by consistently closing all passed-in queriers, including those the cache refuses to save. For this, the internal `insert_querier()` methods has to be made a member to be able to use the closing gate.	2021-07-14 16:48:43 +03:00
Avi Kivity	9059514335	build, treewide: enable -Wpessimizing-move warning This warning prevents using std::move() where it can hurt - on an unnamed temporary or a named automatic variable being returned from a function. In both cases the value could be constructed directly in its final destination, but std::move() prevents it. Fix the handful of cases (all trivial), and enable the warning. Closes #8992	2021-07-08 17:52:34 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Benny Halevy	7c7569f0ad	querier_cache: implement stop Close the _closing_gate to wait on background close of dropped queries, and close all remaining queriers. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	07f34b4a32	querier_cache: lookup_querier: close the querier before dropping it Make sure to close the dropped querier before it's destroyed. The operation is moved to the background so not to penelize the common path. A following patch will add a querier_cache::close() method that will close _closing_gate to wait on the querier close (among other things it needs to wait on :)). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	4a0abc7b9c	querier_cache: lookup_querier: define as a private method In preparation to closing the querier in the background before dropping it. With that, stats need not be passed as a parameter, but rather the _stats member can be used directly. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	8b8c721431	querier: add close method Depening on the variant _reader contents, either close the reader or unregister the inactive reader and close it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	3f00c21481	querier_cache: evict: close evicted reader Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	0d8d56c36f	querier: coroutinize evict methods Instead of calling a lambda function for each index simply iterate over all indices and use co_await / co_return in the inner loop. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	2f9cf01aa7	querier_cache: futurize evict api Prepare for futurizing the lower-level inactive reads api. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Botond Dénes	594636ebbf	querier: insert(): account immediately evicted querier as resource based eviction `reader_concurrency_semaphore::register_inactive_read()` drops the registered inactive read immediately if there is a resource shortage. This is in effect a resource based eviction, so account it as such in `querier::insert()`.	2021-03-18 14:57:57 +02:00
Benny Halevy	d2b8b3041d	querier_cache: insert_querier: ignore errors to register inactive reader Since the reader may normally dropped upon registration, hitting an error is equivalent to having it evicted at any time, so just log the exception and ignore it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-02-08 22:31:01 +02:00
Benny Halevy	9bdb8190ce	querier_cache: insert_querier: handle errors Make insert_querier exception safe. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-02-08 22:31:01 +02:00
Benny Halevy	b8f935457a	querier_utils: mark functions noexcept They all are trivially noexcept. Mark them so to simplify error handing assumptions in the next patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-02-08 22:31:01 +02:00
Benny Halevy	46c2229b78	reader_concurrency_semaphore: separate set_notify_handler from register_inactive_reader Register the inactive reader first with no evict_notify_handler and ttl. Those can be set later, only if registration succeeded. Otherwise, as in the querier example, there is no need to to place the querier in the index and erase it on eviction. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-02-08 22:31:01 +02:00
Benny Halevy	4e8f29ef14	reader_concurrency_semaphore: inactive_read: keep a flat_mutation_reader There's no need to hold a unique_ptr<flat_mutation_reader> as flat_mutation_reader itself holds a unique_ptr<flat_mutation_reader::impl> and functions as a unique ptr via flat_mutation_reader_opt. With that, unregister_inactive_read was modified to return a flat_mutation_reader_opt rather than a std::unique_ptr<flat_mutation_reader>, keeping exactly the same semantics. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-02-08 20:32:40 +02:00
Avi Kivity	913d970c64	Merge "Unify inactive readers" from Botond " Currently inactive readers are stored in two different places: * reader concurrency semaphore * querier cache With the latter registering its inactive readers with the former. This is an unnecessarily complex (and possibly surprising) setup that we want to move away from. This series solves this by moving the responsibility if storing of inactive reads solely to the reader concurrency semaphore, including all supported eviction policies. The querier cache is now only responsible for indexing queriers and maintaining relevant stats. This makes the ownership of the inactive readers much more clear, hopefully making Benny's work on introducing close() and abort() a little bit easier. Tests: unit(release, debug:v1) " * 'unify-inactive-readers/v2' of https://github.com/denesb/scylla: reader_concurrency_semaphore: store inactive readers directly querier_cache: store readers in the reader concurrency semaphore directly querier_cache: retire memory based cache eviction querier_cache: delegate expiry to the reader_concurrency_semaphore reader_concurrency_semaphore: introduce ttl for inactive reads querier_cache: use new eviction notify mechanism to maintain stats reader_concurrency_semaphore: add eviction notification facility reader_concurrency_semaphore: extract evict code into method evict()	2021-02-03 10:59:04 +02:00
Botond Dénes	72b8a2d147	querier: move common stuff into querier_base The querier cache expects all querier objects it stores to have certain methods. To avoid accessing these via `std::visit()` (the querier object is stored in an `std::variant`), we move all the stuff that is common to all querier types into a base class. The querier cache now accesses the members via a reference to this common base. Additionally the variant is eliminated completely and the cache entry stores an `std::unique_ptr<querier_base>` instead. Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200603152544.83704-1-bdenes@scylladb.com>	2020-06-03 18:45:33 +03:00
Botond Dénes	e678f06a5e	querier_cache: get semaphore from querier Currently the `querier_cache` is passed a semaphore during its construction and it uses this semaphore to do all the inactive reader registering/unregistering. This is inaccurate as in theory cached reads could belong to different semaphores (although currently this is not yet the case). As all queriers store a valid permit now, use this permit to obtain the semaphore the querier is associated with, and register the inactive read with this semaphore.	2020-05-28 11:34:35 +03:00
Botond Dénes	e778b072b1	read_command: use bool_class for is_first_page parameter The constructor of `read_command` is used both by IDL and clients in the code. However, this constructor has a parameter that is not used by IDL: `read_timestamp`. This requires that this parameter is the very last in the list and that new parameters that are used by IDL are added before it. One such new parameter was `bool is_first_page`. Adding this parameter right before the read timestamp one created a situation where the last parameter (read_timestamp) implicitly converts to the one before it (is_first_page). This means that some call sites passing `read_timestamp` were now silently converting this to `is_first_page`, effectively dropping the timestamp. This patch aims to rectify this, while also avoiding similar accidents in the future, by making `is_first_page` a `bool_class` which doesn't have any implicit convertions defined. This change does not break the ABI as `bool_class` is also sent as a `bool` on the wire. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Tests: unit(dev) Message-Id: <20200422073657.87241-1-bdenes@scylladb.com>	2020-04-22 11:01:22 +03:00
Botond Dénes	091d80e8c3	flat_mutation_reader: expose reverse reader as a standalone reader Currently reverse reads just pass a flag to `flat_mutation_reader::consume()` to make the read happen in reverse. This is deceptively simple and streamlined -- while in fact behind the scenes a reversing reader is created to wrap the reader in question to reverse partitions, one-by-one. This patch makes this apparent by exposing the reversing reader via `make_reversing_reader()`. This now makes how reversing works more apparent. It also allows for more configuration to be passed to the reversing reader (in the next patches). This change is forward compatible, as in time we plan to add reversing support to the sstable layer, in which case the reversing reader will go.	2020-02-27 18:11:54 +02:00
Botond Dénes	00b432b61d	querier_cache: correctly account entries evicted on insertion in the population Currently, the population stat is not increased for entries that are evicted immediately on insert, however the code that does the eviction still decreases the population stat, leading to an imbalance and in some cases the underflow of the population stat. To fix, unconditionally increase the population stat upon inserting an entry, regardless of whether it is immediately evicted or not. Fixes: #5123 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20191001153215.82997-1-bdenes@scylladb.com>	2019-10-03 11:49:44 +03:00
Botond Dénes	d57ab83bc8	querier_cache: add `inserted` stat Recently we have seen a case where the population stat of the cache was corrupt, either due to misaccounting or some more serious corruption. When debugging something like that it would have been useful to know how many items have been inserted to the cache. I also believe that such a counter could be useful generally as well. Refs: #4918 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20190924083429.43038-1-bdenes@scylladb.com>	2019-09-24 10:52:49 +02:00
Botond Dénes	ab5d717052	reader_concurrency_semaphore::inactive_read_handle: fix handle semantics That is: * make it move only; * make moved-from handles null handles; * add (public) default constructor, which constructs a null handle;	2019-02-12 16:20:51 +02:00
Duarte Nunes	fa2b0384d2	Replace std::experimental types with C++17 std version. Replace stdx::optional and stdx::string_view with the C++ std counterparts. Some instances of boost::variant were also replaced with std::variant, namely those that called seastar::visit. Scylla now requires GCC 8 to compile. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20190108111141.5369-1-duarte@scylladb.com>	2019-01-08 13:16:36 +02:00
Botond Dénes	021feef513	querier_cache: simplify memory eviction use-after-free fix, add tests Simplify the fix for memory based eviction, introduced by `918d255` so there is no need to massage the counters. Also add a check to `test_memory_based_cache_eviction` which checks for the bug fixed. While at it also add a check to `test_time_based_cache_eviction` for the fix to time based eviction (`e5a0ea3`). Tests: tests/querier_cache:debug Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <c89e2788a88c2a701a2c39f377328e77ac01e3ef.1546515465.git.bdenes@scylladb.com>	2019-01-03 13:44:08 +02:00
Botond Dénes	e5a0ea390a	querier_cache: unregister queriers evicted due to expired TTL Currently queriers evicted due to their TTL expiring are not unregistered from the `reader_concurrency_semaphore`. This can cause a use-after-free when the semaphore tries to evict the same querier at some later point in time, as the querier entry it has a pointer to is now invalid. Fix by unregistering the querier from the semaphore before destroying the entry. Refs: #4018 Refs: #4031 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <4adfd09f5af8a12d73c29d59407a789324cd3d01.1546504034.git.bdenes@scylladb.com>	2019-01-03 10:29:26 +02:00
Avi Kivity	918d255168	querier_cache: unregister querier from reader_concurrency_semaphore during eviction In insert_querier(), we may evict older queriers to make room for the new one. However, we forgot to unregister the evicted queriers from reader_concurrency_semaphore. As a result, when reader_concurrency_semaphore eventually wanted to evict something, it saw an inactive_read_handle that was not connected to a querier_cache::entry, and crashed on use-after-free. Fix by evicting through the inactive_read_handle associated with the querier to be evicted. This removes traces of the querier from both reader_concurrency_semaphore and querier_cache. We also have to massage the statistics since querier_inactive_read::evict() updates different counters. Fixes #4018. Tests: unit(release) Reviewed-by: Botond Denes <bdenes@scylladb.com> Message-Id: <20190102175023.26093-1-avi@scylladb.com>	2019-01-03 09:15:07 +02:00
Botond Dénes	5780f2ce7a	querier_cache: check that the query wasn't evicted during registering The reader concurrency semaphore can evict the querier when it is registered as an inactive read. Make the `querier_cache` aware of this so that it doesn't continue to process the inserted querier when this happens. Also add a unit test for this.	2018-12-17 13:18:08 +02:00
Botond Dénes	77dbc7d09a	querier: fix evict_one() and evict_all_for_table() Both of these have the same problem. They remove the to-be-evicted entries from `_entries` but they don't unregister the `entry` from the `read_concurrency_semaphore`. This results in the `reader_concurrency_semaphore` being left with a dangling pointer to the entries will trigger segfault when it tries to evict the associated inactive reads. Also add a unit test for `evict_all_for_table()` to check that it works properly (`evict_one()` is only used in tests, so no dedicated test for it). Fixes: #3962 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <57001857e3791c6385721b624d33b667ccda2e7d.1544010868.git.bdenes@scylladb.com>	2018-12-05 21:51:01 +02:00
Botond Dénes	37f0117747	reader_concurrency_semaphore: refactor eviction mechanism As we are about to add multiple sources of evictable readers, we need a more scalable solution than a single functor being passed that opaquely evicts a reader when called. Add a generic way to register and unregister evictable (inactive) readers to the semaphore. The readers are expected to be registered when they become evictable and are expected to be unregistered when they cease to become evictable. The semaphore might evict any reader that is registered to it, when it sees fit. This also solves the problem of notifying the semaphore when new readers become evictable. Previously there was no such mechanism, and the semaphore would only evict any such new readers when a new permit was requested from it.	2018-12-04 08:51:00 +02:00
Botond Dénes	ecb1e79bcc	querier: add shard_mutation_querier The querier to be used for saving shard readers belonging to a multishard range scan. This querier doesn't provide a `consume_page` method as it doesn't support reading from it directly. It is more of a storage to allow caching the reader and any objects it depends on.	2018-09-03 10:31:44 +03:00
Botond Dénes	07cdf766c5	querier: prepare for multi-ranges In the next patch a querier will be added that reads multiple ranges as opposed to a single range that data and mutation queriers read. To keep `querier_cache` code seamless regarding this difference change all range-matching logic to work in terms of `dht::partition_ranges_view`. This allows for cheap and seamless way of having a single code-base for the insert/lookup logic. Code actually matching ranges is updated to be able to handle both singular and multi-ranges while maintaining backward compatibility.	2018-09-03 10:31:44 +03:00
Botond Dénes	c12008b8cb	querier: split querier into separate data and mutation querier types Instead of hiding what compaction method the querier uses (and only expose it via rejecting 'can_be_used_for_page()`) make it very explicit that these are really two different queriers. This allows using different indexes for the two queriers in `querier_cache` and eliminating the possibility of picking up a querier with the wrong compaction method (read kind). This also makes it possible to add new querier type(s) that suit the multishard-query's needs without making a confusing mess of `querier` by making it a union of all querying logic. Splitting the queriers this way changes what happens when a lookup finds a querier of the wrong kind (e.g. emit_only_live::yes for an emit_only_live::no command). As opposed to dropping the found (but wrong) querier the querier will now simply not be found by the lookup. This is a result of using separate search indexes for the different mutation kinds. This change should have no practical implications. Splitting is done by making querier templated on `emit_only_live_rows`. It doesn't make sense to duplicate the entire querier as the two share 99% of the code.	2018-09-03 10:31:44 +03:00
Botond Dénes	c53f17ddb8	querier: move all matching related logic into free functions So that they can be used for multiple querier classes easily, without inheritance. The functions are not visible from the header. Also update the comments on `querier` to w.r.t. the disappeared checking functions. Change the language to be more general. In practice these checks are never done by client code, instead they are done by the `querier_cache`.	2018-09-03 10:31:44 +03:00
Botond Dénes	43f464c52d	querier: inline querier::current_position() and make it public	2018-09-03 10:31:44 +03:00
Botond Dénes	86a61ded7d	querier: s/position/position_view/ Also treat it as a view, that is take it by value in functions, instead of reference.	2018-09-03 10:31:44 +03:00
Botond Dénes	6e4ec53679	querier: move position outside of querier In preparation for having multiple querier types that can share code without inheritance.	2018-09-03 10:31:44 +03:00
Botond Dénes	7bd955e993	querier_cache: move insert/lookup related logic into free functions In preparations for introducing support multiple entry types in the querier_cache move all insert/lookup related logic into free functions. Later these functions will be templated so they can handle multiple entry types with the same code.	2018-09-03 10:31:44 +03:00
Botond Dénes	cded477b94	querier: return std::optional<querier> instead of using create_fun() Requiring the caller of lookup() to pass in a `create_fun()` was not such a good idea in hindsight. It leads to awkward call sites and even more awkward code when trying to find out whether the lookup was successfull or not. Returning an optional gives calling code much more flexibility and makes the code cleaner.	2018-09-03 10:31:44 +03:00
Botond Dénes	5f726e9a89	querier: move all to query namespace To avoid name clashes.	2018-09-03 10:31:44 +03:00
Botond Dénes	2609a17a23	querier: find_querier(): return end() when no querier matches the range When none of the queriers found for the lookup key match the lookup range `_entries.end()` should be returned as the search failed. Instead the iterator returned from the failed `std::find_if()` is returned which, if the find failed, will be the end iterator returned by the previous call to `_entries.equal_range()`. This is incorrect because as long as `equal_range()`'s end iterator is not also `_entries.end()` the search will always return an iterator to a querier regardless of whether any of them actually matches the read range. Fix by returning `_entries.end()` when it is detected that no queriers match the range. Fixes: #3530	2018-06-19 13:20:43 +03:00
Botond Dénes	7ce7f3f0cc	querier_cache: restructure entries storage Currently querier_cache uses a `std::unordered_map<utils::UUID, querier>` to store cache entries and an `std::list<meta_entry>` to store meta information about the querier entries, like insertion order, expiry time, etc. All cache eviction algorithms use the meta-entry list to evict entries in reverse insertion order (LRU order). To make this possible meta-entries keep an iterator into the entry map so that given a meta-entry one can easily erase the querier entry. This however poses a problem as std::unordered_map can possibly invalidate all its iterators when new items are inserted. This is use-after-free waiting to happen. Another disadvantages of the current solution is that it requires the meta-entry to use a weak pointer to the querier entry so that in case that is removed (as a result of a successful lookup) it doesn't try to access it. This has an impact on all cache eviction algorithms as they have to be prepared to deal with stale meta-entries. Stale meta-entries also unnecesarily consume memory. To solve these problems redesign how querier_cache stores entries completely. Instead of storing the entries in an `std::unordered_map` and storing the meta-entries in an `std::list`, store the entries in an `std::list` and an intrusive-map (index) for lookups. This new design has severeal advantages over the old one: * The entries will now be in insert order, so eviction strategies can work on the entry list itself, no need to involve additional data structures for this. * All data related to an entry is stored in one place, no data duplication. * Removing an entry automatically removes it from the index as intrusive containers support auto unlink. This means there is no need to store iterators for long terms, risking use-after-free when the container invalidates it's iterators. Additional changes: * Modify eviction strategies so that they work with the `entry` interface rather than the stored value directly. Ref #3424	2018-06-19 13:20:40 +03:00

1 2

60 Commits