scylladb

Author	SHA1	Message	Date
Botond Dénes	62d82b8b0e	flat_mutation_reader: convert make_flat_mutation_reader_from_mutation() v2 Since this reader is also heavily used by tests to compose a specific workload for readers above it, we just add a v2 variant, instead of changing the existing v1 one. The v2 variant was written from scratch to have built-in support for reading in reverse. It is built-on `mutation::consume()` to avoid duplicating the logic of consuming the contents of the mutation. A v2 native unit test is also added.	2022-01-05 09:06:16 +02:00
Botond Dénes	2d1bb90c8e	flat_mutation_reader: extract mutation slicing into a function	2022-01-05 09:06:16 +02:00
Michael Livshin	9f656b96ac	short-circuit flat mutation reader upgrades and downgrades When asked to upgrade a reader that itself is a downgrade, try to return the original v2 reader instead, and likewise when downgrading upgraded v1 readers. This is desirable because version transformations can result from, say, entering/leaving a reader concurrency semaphore, and the amount of such transformations is practically unbounded. Such short-circuiting is only done if it is safe, that is: the transforming reader's buffer is empty and its internal range tombstone tracking state is discardable. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2021-12-21 11:26:17 +02:00
Botond Dénes	c193bbed82	flat_mutation_reader_v2: add v2 version of empty reader Convert the v1 implementation to v2, downgrade to v1 in the existing `make_empty_flat_reader()`.	2021-12-16 14:57:49 +02:00
Botond Dénes	39426b1aa3	flat_mutation_reader_v2: add make_flat_mutation_reader_from_fragments() The main difference compared to v1 (apart from having _v2 suffix at relevant places) is how slicing and reversing works. The v2 variant has native reverse support built-in because the reversing reader is not something we want to convert to v2. A native v2 mutation-source test is also added.	2021-12-10 15:48:49 +02:00
Botond Dénes	76ee3f029c	flat_mutation_reader_v2: add make_forwardable() Not a completely straightforward conversion as the v2 version has to make sure to emit the current range tombstone change after fast_forward_to() (if it changes compared to the current one before fast forwarding). Changes are around the two new members `_tombstone_to_emit` and `maybe_emit_tombstone()`.	2021-12-10 15:48:49 +02:00
Botond Dénes	7306f53be1	flat_mutation_reader: make_forwardable(): fix indentation	2021-12-10 15:19:18 +02:00
Botond Dénes	2468a0602b	flat_mutation_reader: make_forwardable(): coroutinize reader For improved readability and to facilitate further patching.	2021-12-10 15:19:18 +02:00
Botond Dénes	64bb48855c	flat_mutation_reader: revamp flat_mutation_reader_from_mutations() Add schema parameter so that: * Caller has better control over schema -- especially relevant for reverse reads where it is not possible to follow the convention of passing the query schema which is reversed compared to that of the mutations. * Now that we don't depend on the mutations for the schema, we can lift the restriction on mutations not being empty: this leads to safer code. When the mutations parameter is empty, an empty reader is created. Add "make_" prefix to follow convention of similar reader factory functions. Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20211115155614.363663-1-bdenes@scylladb.com>	2021-11-15 17:58:46 +02:00
Tomasz Grabiec	cc56a971e8	database, treewide: Introduce partition_slice::is_reversed() Cleanup, reduces noise. Message-Id: <20211014093001.81479-1-tgrabiec@scylladb.com>	2021-10-14 12:39:16 +03:00
Botond Dénes	f5ef88c0c5	flat_mutation_reader: make_reversing_reader(): add convenience stored slice This serves as a convenience slice storage for reads that have to store an edited slice somewhere. This is common for reads that work with a native-reversed slice and so have to convert the one used in the query -- which is in half-reversed format.	2021-09-28 17:03:57 +03:00
Botond Dénes	eeebe4ab63	flat_mutation_reader: make_flat_mutation_reader_from_fragments(): add reverse read support Implemented with the `make_reversing_reader()` adaptor.	2021-09-28 17:03:57 +03:00
Botond Dénes	cc222e5332	flat_mutation_reader: flat_mutation_reader_from_mutations(): add reverse read support Implemented with the `make_reversing_reader()` adaptor.	2021-09-28 17:03:57 +03:00
Botond Dénes	bf38e204af	flat_mutation_reader: make_reversing_reader(): implement fast_forward_to(partition_range)	2021-09-09 15:42:15 +03:00
Botond Dénes	350440b418	flat_mutation_reader: make_reversing_reader(): take ownership of the reader Makes for much simpler client code.	2021-09-09 15:42:15 +03:00
Botond Dénes	502a45ad58	treewide: switch to native reversed format for reverse reads We define the native reverse format as a reversed mutation fragment stream that is identical to one that would be emitted by a table with the same schema but with reversed clustering order. The main difference to the current format is how range tombstones are handled: instead of looking at their start or end bound depending on the order, we always use them as-usual and the reversing reader swaps their bounds to facilitate this. This allows us to treat reversed streams completely transparently: just pass along them a reversed schema and all the reader, compacting and result building code is happily ignorant about the fact that it is a reversed stream.	2021-09-09 15:42:15 +03:00
Pavel Emelyanov	8f061b9b1c	range_tombstone_list: Simplify (maybe) pop_front_and_lock() The method returns a pointer on the left-most range tombstone and expects the caller to "dispose" it. This is not very nice because the callers thus needs to mess with the relevant deleter. A nicer approach is the pop-like one (former pop_as<>-like) which is in returning the range tombstone by value. This value is move-constructed from the original object which is disposed by the container itself. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-03 19:34:45 +03:00
Pavel Emelyanov	2e1b21d72b	range_tombstone_list: De-templatize pop_as<> The method pops the range tombstone from the containing list and transparently "converts" it into some other type. Nowadays all callers of it need range tombstone as-is, so the template can be relaxed down to a plan call. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-03 19:34:45 +03:00
Michael Livshin	fbb5802229	mf-stream-validator: add previous partition key to error messages Only seems to make sense in mutation fragment validation where validation level is >= `partition_key`. Fixes #9269 Signed-off-by: Michael Livshin <michael.livshin@scylladb.com> Message-Id: <20210901165641.340185-1-michael.livshin@scylladb.com>	2021-09-02 11:05:33 +03:00
Benny Halevy	4476800493	flat_mutation_reader: get rid of timeout parameter Now that the timeout is taken from the reader_permit. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-24 16:30:51 +03:00
Benny Halevy	e9aff2426e	everywhere: make deferred actions noexcept Prepare for updating seastar submodule to a change that requires deferred actions to be noexcept (and return void). Test: unit(dev, debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-22 21:11:52 +03:00
Nadav Har'El	6c27000b98	Merge 'Propagate exceptions without throwing' from Piotr Sarna NOTE: this series depends on a Seastar submodule update, currently queued in next: 0ed35c6af052ab291a69af98b5c13e023470cba3 In order to avoid needless throwing, exceptions are passed directly wherever possible. Two mechanisms which help with that are: 1. `make_exception_future<>` for futures 2. `co_return coroutine::exception(...)` for coroutines which return `future<T>` (the mechanism does not work for `future<>` without parameters, unfortunately) Tests: unit(release) Closes #9079 * github.com:scylladb/scylla: system_keyspace: pass exceptions without throwing sstables: pass exceptions without throwing storage_proxy: pass exceptions without throwing multishard_mutation_query: pass exceptions without throwing client_state: pass exceptions without throwing flat_mutation_reader: pass exceptions without throwing table: pass exceptions without throwing commitlog: pass exceptions without throwing compaction: pass exceptions without throwing database: pass exceptions without throwing	2021-08-01 16:47:47 +03:00
Pavel Emelyanov	1bf643d4fd	mutation_partition: Pin mutable access to range tombstones Some callers of mutation_partition::row_tomstones() don't want (and shouldn't) modify the list itself, while they may want to modify the tombstones. This patch explicitly locates those that need to modify the collection, because the next patch will return immutable collection for the others. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Pavel Emelyanov	ad27bf40e6	mutation_partition: Pin mutable access to rows Some callers of mutation_partition::clustered_rows() don't want (and shouldn't) modify the tree of rows, while they may want to modify the rows themselves. This patch explicitly locates those that need to modify the collection, because the next patch will return immutable collection for the others. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Piotr Sarna	e5925d4980	flat_mutation_reader: pass exceptions without throwing In order to avoid needless throwing, exceptions are passed directly wherever possible. Two mechanisms which help with that are: 1. make_exception_future<> for futures 2. co_return coroutine::exception(...) for coroutines which return future<T> (the mechanism does not work for future<> without parameters, unfortunately)	2021-07-26 17:04:20 +02:00
Piotr Wojtczak	cb2a0ab858	flat_mutation_reader: Add a new filtering reader factory method Introduce a new function creating a filtering reader using query slice and partition range.	2021-07-20 14:00:47 +02:00
Michał Radwański	67d99e02a7	flat_mutation_reader: downgrade_to_v1 - reset state of rt_assembler The downgrade_to_v1 didn't reset the state of range tombstone assembler in case of the calls to next_partition or fast_forward_to, which caused a situation where the closing range tombstone change is cleared from the buffer before being emitted, without notifying the assembler. This patch fixes the behaviour in fast_forward_to as well. Fixes #9022	2021-07-19 15:54:26 +02:00
Tomasz Grabiec	1f255c420e	flat_mutation_reader_v2: Make is_end_of_stream() reflect consumer-side state of the stream Currently, flat_mutation_reader_v2::is_end_of_stream() returns flat_mutation_reader_v2::impl::_end_of_stream, which means the producer is done. The stream may be still not yet fully consumed even if producer is done, due to internal buffering. So consumers need to make a more elaborate check: rd.is_end_of_stream() && rd.is_buffer_empty() It would be cleaner if flat_mutation_reader_v2::is_end_of_stream() returned the state of the consumer-side of the stream, since it belongs to the consumer-side of the API. The consumption will be as simple as: while (!rd.is_end_of_stream()) { consume_fragment(rd()); } This patch makes the change on the v2 of the reader interface. v1 is not changed to avoid problems which could happen when backporting code which assumes new semantics into a version with the old semantics. v2 is not in any old branch yet so it doesn't have this problem and it's a good time to make the API change. Note that it's always safe to use the new semantics in the context which assumes the old semantics, so v1 users can be safely converted to v2 even if they are unware of the change. Fixes #3067 Message-Id: <20210715102833.146914-1-tgrabiec@scylladb.com>	2021-07-15 14:00:48 +03:00
Botond Dénes	7cf5b43bbc	mutation_fragment_stream_validator: add schema() accessor	2021-07-12 07:11:29 +03:00
Tomasz Grabiec	558d88ea17	flat_mutation_reader: Trim range tombstones in make_flat_mutation_reader_from_fragments() This is needed to change the guarantees of flat_mutation_reader v1 to produce only range tombstones trimmed to clustering restrictions. The reason for this is so that v2 has a canonical representation in which all fragments have position inside clustering restrictions. Conversion from v1 to v2 can guarantee that only if v1 trims range tombstones.	2021-06-16 00:23:49 +02:00
Tomasz Grabiec	1ef73abd82	flat_mutation_reader_v2: Implement read_mutation_from_flat_mutation_reader()	2021-06-15 13:14:45 +02:00
Tomasz Grabiec	9996b7ca18	flat_mutation_reader: Introduce adaptors between v1 and v2 of mutation fragment stream The transition to v2 will be incremental. To support that, we need adaptors between v1 and v2 which will be inserted at places which are boundaries of conversion. The v1 -> v2 converter needs to accumulate range tombstones, so has unbounded worst case memory footprint. The v2 -> v1 converter trims range tombstones around clustering rows, so generates more fragments than necessary. Because of that, adpators are a temporary solution and we should not release with them on the produciton code paths.	2021-06-15 13:10:47 +02:00
Tomasz Grabiec	e3309322c3	Clone flat_mutation_reader related classes into v2 variants To make review easier, first clone the classes without chaning the logic. Logic and API will change in subsequent commits.	2021-06-15 13:10:09 +02:00
Pavel Solodovnikov	76bea23174	treewide: reduce header interdependencies Use forward declarations wherever possible. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Closes #8813	2021-06-07 15:58:35 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Botond Dénes	104a47699c	mutation_fragment_stream_validator: add reset methods Allow resetting the validator to a given partition or mutation fragment. This allows a user which is able to fix corrupt streams to reset the validator to a partition or row which the validator normally wouldn't accept and hence it wouldn't advance its internal state to it.	2021-05-05 12:03:42 +03:00
Benny Halevy	b134640829	flat_mutation_reader: abort if not closed before destroyed The motivation to abort if the reader is not closed before its destroyed is mainly driven by: 1. Aborting will force us find and fix missing closes. Otherwise, log warnings can easily be lost in the noise. (ERRORs however are caught by dtests but won't be necessarily caught in SCT / production environments) 2. Following patches remove existing cleanup code in destructors that is not needed once close() is mandated. If we don't abort on missing close we'll have to keep maintaining both cleanup paths forever. 3. Not enforcing close exposes us to leaks and potential use-after-free from background tasks that are left behind. We want to stop guranteeing the safety of the background tasks post close(). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	5b22731f9a	flat_mutation_reader: require close Make flat_mutation_reader::impl::close pure virtual so that all implementations are required to implemnt it. With that, provide a trivial implementation to all implementations that currently use the default, trivial close implementation. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Benny Halevy	0da2eea211	flat_mutation_reader: flat_multi_range_mutation_reader: close underlying reader Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Benny Halevy	18268ab474	flat_mutation_reader: forwardable_empty_mutation_reader: close optional underlying reader Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Benny Halevy	e2e642b1b1	flat_mutation_reader: make_forwardable, make_nonforwardable: close underlying reader Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Benny Halevy	978501c336	flat_mutation_reader: partition_reversing_mutation_reader: implement no-op close We don't own _source therefore do not close it. That said, we still need to make sure that the reversing reader itself is closed to calm down the check when it's destroyed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Benny Halevy	f4dfaaa6c9	flat_mutation_reader: delegating_reader: close reader when moved to it The underlying reader is owned by the caller if it is moved to it, but not if it was constructed with a reference to the underlying reader. Close the underlying reader on close() only in the former case. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Benny Halevy	ca06d3c92a	flat_mutation_reader: log a warning if destroyed without closing We cannot close in the background since there are use cases that require the impl to be destroyed synchronously. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Benny Halevy	a471579bd7	flat_mutation_reader: introduce close Allow closing readers before destorying them. This way, outstanding background operations such as read-aheads can be gently canceled and be waited upon. Note that similar to destructors, close must not fail. There is nothing to do about errors after the f_m_r is done. Enforce that in flat_mutation_reader::close() so if the f_m_r implementation did return a failure, report it and abort as internal error. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:16:10 +03:00
Botond Dénes	727bc0f5d4	mutation_fragment_stream_validator: add token validation level In some cases the full-blown partition key validation and especially the associated key copy per partition might be deemed too costly. As a next best thing this patch adds a token only validation, which should cover 99% (number pulled out of my sleeve) of the cases. Let's hope no one gets unlucky.	2021-03-01 07:49:23 +02:00
Botond Dénes	694f8a4ec6	mutation_fragment_stream_validating_filter: make validation levels more fine-grained Currently key order validation for the mutation fragment stream validating filter is all or nothing. Either no keys (partition or clustering) are validated or all of them. As we suspect that clustering key order validation would add a significant overhead, this discourages turning key validation on, which means we miss out on partition key monotonicity validation which has a much more moderate cost. This patch makes this configurable in a more fine-grained fashion, providing separate levels for partition and clustering key monotonicity validation. As the choice for the default validation level is not as clear-cut as before, the default value for the validation level is removed in the validating filter's constructor.	2021-03-01 07:49:23 +02:00
Pavel Emelyanov	bfcd6a4bb7	flat_mutation_reader: Use clear() in destroy_current_mutation() Currently the code uses a look of unlink_leftmost_without_rebalance calls. B-tree does have it, but plain clearing of the tree is a bit faster with clear(). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-02-02 09:30:30 +03:00
Benny Halevy	29002e3b48	flat_mutation_reader: return future from next_partition To allow it to asynchronously close underlying readers on next_partition(). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-01-13 17:35:07 +02:00
Botond Dénes	8dae6152bf	mutation_fragment_stream_validator: make it easier to validate concrete fragment types The current API is tailored to the `mutation_fragment` type. In the next patch we will want to use the validator from a context where the mutation fragments are already decomposed into their respective concrete types, e.g. static_row, clustering_row, etc. To avoid having to reconstruct a mutation fragment type just to use the validator, add an API which allows validating these concrete types conveniently too.	2021-01-11 08:07:42 +02:00

1 2 3

131 Commits