scylladb

Author	SHA1	Message	Date
Ernest Zaslavsky	d624413ddd	treewide: Move query related files to a new `query` directory As requested in #22120, moved the files and fixed other includes and build system. Moved files: - query.cc - query-request.hh - query-result.hh - query-result-reader.hh - query-result-set.cc - query-result-set.hh - query-result-writer.hh - query_id.hh - query_result_merger.hh Fixes: #22120 This is a cleanup, no need to backport Closes scylladb/scylladb#25105	2025-09-16 23:40:47 +03:00
Ernest Zaslavsky	d2c5765a6b	treewide: Move keys related files to a new keys directory As requested in #22102, #22103 and #22105 moved the files and fixed other includes and build system. Moved files: - clustering_bounds_comparator.hh - keys.cc - keys.hh - clustering_interval_set.hh - clustering_key_filter.hh - clustering_ranges_walker.hh - compound_compat.hh - compound.hh - full_position.hh Fixes: #22102 Fixes: #22103 Fixes: #22105 Closes scylladb/scylladb#25082	2025-07-25 10:45:32 +03:00
Ran Regev	edd56a2c1c	moved cache files to db As requested in #22097, moved the files and fixed other includes and build system. Fixes: #22097 Signed-off-by: Ran Regev <ran.regev@scylladb.com> Closes scylladb/scylladb#22495	2025-02-04 12:21:31 +03:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Avi Kivity	6d110b530c	partition_snapshot_reader.hh: drop unused include boost/range/algorithm/heap_algorithm.hpp	2024-11-15 14:02:19 +02:00
Łukasz Paszkowski	da95f44adc	readers: Use reversed schema and native reversed slices The reconcilable_result is built as it would be constructed for forward read queries for tables with reversed order. Mutations constructed for reversed queries are consumed forward. Drop overloaded reversed functions that reverse read_command and reconcilable_result directly and keep only those requiring smart pointers. They are not used any more.	2024-08-13 10:03:46 +02:00
Avi Kivity	fdc1449392	treewide: rename flat_mutation_reader_v2 to mutation_reader flat_mutation_reader_v2 was introduced in a pair of commits in 2021: `e3309322c3` "Clone flat_mutation_reader related classes into v2 variants" `08b5773c12` "Adapt flat_mutation_reader_v2 to the new version of the API" as a replacement for flat_mutation_reader, using range_tombstone_change instead of range_tombstone to represent represent range tombstones. See those commits for more information. The transition was incremental; the last use of the original flat_mutation_reader was removed in 2022 in commit `026f8cc1e7` "db: Use mutation_partition_v2 in mvcc" In turn, flat_mutation_reader was introduced in 2017 in commit `748205ca75` "Introduce flat_mutation_reader" To transition from a mutation_reader that nested rows within a partition in a separate stream, to a flat reader that streamed partitions and rows in the same stream. Here, we reclaim the original name and rename the awkward flat_mutation_reader_v2 to mutation_reader. Note that mutation_fragment_v2 remains since we still use the original for compatibilty, sometimes. Some notes about the transition: - files were also renamed. In one case (flat_mutation_reader_test.cc), the rename target already existed, so we rename to mutation_reader_another_test.cc. - a namespace 'mutation_reader' with two definitions existed (in mutation_reader_fwd.hh). Its contents was folded into the mutation_reader class. As a result, a few #includes had to be adjusted. Closes scylladb/scylladb#19356	2024-06-21 07:12:06 +03:00
Avi Kivity	c5e4bf51bd	Introduce mutation/ module Move mutation-related files to a new mutation/ directory. The names are kept in the global namespace to reduce churn; the names are unambiguous in any case. mutation_reader remains in the readers/ module. mutation_partition_v2.cc was missing from CMakeLists.txt; it's added in this patch. This is a step forward towards librarization or modularization of the source base. Closes #12788	2023-02-14 11:19:03 +02:00
Tomasz Grabiec	026f8cc1e7	db: Use mutation_partition_v2 in mvcc This patch switches memtable and cache to use mutation_partition_v2, and all affected algorithms accordingly. The memtable reader was changed to use the same cursor implementation which cache uses, for improved code reuse and reducing risk of bugs due to discrepancy of algorithms which deal with MVCC. Range tombstone eviction in cache has now fine granularity, like with rows. Fixes #2578 Fixes #3288 Fixes #10587	2023-01-27 21:56:28 +01:00
Botond Dénes	f406151a86	utils/logalloc: allocating_section: don't use the global tracker Instead, get the tracker instance from the region. This requires adding a `region&` parameter to `with_reserve()`. This brings us one step closer to eliminating the global tracker.	2022-08-23 10:38:58 +03:00
Tomasz Grabiec	a6aef60b93	memtable: Fix missing range tombstones during reads under ceratin rare conditions There is a bug introduced in `e74c3c8` (4.6.0) which makes memtable reader skip one a range tombstone for a certain pattern of deletions and under certain sequence of events. _rt_stream contains the result of deoverlapping range tombstones which had the same position, which were sipped from all the versions. The result of deoverlapping may produce a range tombstone which starts later, at the same position as a more recent tombstone which has not been sipped from the partition version yet. If we consume the old range tombstone from _rt_stream and then refresh the iterators, the refresh will skip over the newer tombstone. The fix is to drop the logic which drains _rt_stream so that _rt_stream is always merged with partition versions. For the problem to trigger, there have to be multiple MVCC versions (at least 2) which contain deletions of the following form: [a, c] @ t0 [a, b) @ t1, [b, d] @ t2 c > b The proper sequence for such versions is (assuming d > c): [a, b) @ t1, [b, d] @ t2 Due to the bug, the reader will produce: [a, b) @ t1, [b, c] @ t0 The reader also needs to be preempted right before processing [b, d] @ t2 and iterators need to get invalidated so that lsa_partition_reader::do_refresh_state() is called and it skips over [b, d] @ t2. Otherwise, the reader will emit [b, d] @ t2 later. If it does emit the proper range tombstone, it's possible that it will violate fragment order in the stream if _rt_stream accumulated remainders (possible with 3 MVCC versions). The problem goes away once MVCC versions merge. Fixes #10913 Fixes #10830 Closes #10914	2022-06-29 19:02:23 +03:00
Michael Livshin	632b4e5a9a	fix "ninja dev-headers" Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2022-05-31 23:42:34 +03:00
Botond Dénes	4f77e74bd4	partition_snapshot_reader: convert implementation to native v2 The underlying mutation representation is still v1, so the implementation still has to do conversion. This happens right above the lsa reader component.	2022-04-28 14:12:12 +03:00
Avi Kivity	a9812166cd	replica, partition_snapshot_reader, keys: replace boost::any with std::any Reduce #include load by standardizing on std::any. In keys.cc, we just drop the unneeded include. One instance of boost::any remains in config_file, due to a tie-in with other boost components. Closes #10441	2022-04-28 07:18:53 +03:00
Mikołaj Sielużycki	1d84a254c0	flat_mutation_reader: Split readers by file and remove unnecessary includes. The flat_mutation_reader files were conflated and contained multiple readers, which were not strictly necessary. Splitting optimizes both iterative compilation times, as touching rarely used readers doesn't recompile large chunks of codebase. Total compilation times are also improved, as the size of flat_mutation_reader.hh and flat_mutation_reader_v2.hh have been reduced and those files are included by many file in the codebase. With changes real 29m14.051s user 168m39.071s sys 5m13.443s Without changes real 30m36.203s user 175m43.354s sys 5m26.376s Closes #10194	2022-03-14 13:20:25 +02:00
Mikołaj Sielużycki	f4c57cbe87	memtable: Convert partition_snapshot_flat_reader to v2. This is a facade change only, the make_partition_snapshot_flat_reader function calls upgrade_to_v2 internally. Closes #10152	2022-03-02 15:07:36 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Michał Radwański	ee601b7d87	partition_snapshot_reader: fix indentation in fill_buffer	2021-11-05 10:51:58 +01:00
Michał Radwański	94b263e356	partition_snapshot_reader: fix obtaining rt_slice, if Reversing and _last_rts was set If Reversing and _last_rts was set, the created rt_slice still contained range tombstones between _last_rts and (snapshot) clustering range end. This is wrong - the correct range is between (snapshot) clustering range begin and _last_rts.	2021-11-04 16:10:07 +01:00
Michał Radwański	9caf85f64a	partition_snapshot_reader: do not accidentally copy schema Functions `upper_bound` and `lower_bound` had signatures: ``` template<typename T, typename... Args> static rows_iter_type lower_bound(const T& t, Args... args); ``` This caused a dacay from `const schema&` to `schema` as one of the args, which in turn copied the schema in a fair number of the queries. Fix that by setting the parameter type to `Args&&`, which doesn't discard the reference. Fixes #9502 Closes #9507	2021-10-20 19:09:08 +03:00
Michał Radwański	c04dffbc01	partition_snapshot_reader: pop_range_tombstone returns reference (instead of value) when possible.	2021-10-10 20:38:18 +02:00
Michał Radwański	cc5ea66957	partition_snapshot_reader: reverse ck_range when needed by Reversing Previous commits made it possible to split the responsibility of two kinds of clustering key ranges in read_next and next_range_tombstone. Here, the actual reversal takes place and we start passing the actually reversed ck_range, if Reversing. This reversed ck_range is stored as a class member, so that the reversal happens just once for each range.	2021-10-10 20:38:18 +02:00
Michał Radwański	5449982a0b	memtable, partition_snapshot_reader: read from partition in reverse In this commit, I add the ability to read from partition snapshots in reverse order. Before these changes, a reverse read from memtable has been handled as follows: - A reader higher in the hierarchy of readers performs a read from memtable in the forward order, which is not aware of the intention to read in reverse. - Later, some reader reverses the received mutation fragments. Memtable decides based on options in `slice`, whether to read forward or in reverse. Note that previous commit creates a killswitch which clears the `reverse` option from slice before running the logic of whether to reverse or not. This is due to the fact, that this commit doesn't all the required code changes. The reversing partition snapshot reader maintains two schemas - one that is the reversed schema (called _query_schema) for the output, and the other one (forward one, called _snapshot_schema), which is used to access the memtable tree (which needs to be the same as the schema used to create memtable). The `partition_slice` provided by callers is provided in 'half-reversed' format for reversed queries, where the order of clustering ranges is reversed, but the ranges themselves are not.	2021-10-10 20:38:18 +02:00
Michał Radwański	6813c39927	partition_snapshot_reader: rows_position and rows_iter_type supporting reverse iteration Iterating in reverse is useful for native reverse memtable reader.	2021-10-10 20:38:18 +02:00
Michał Radwański	a672b8b86f	partition_snapshot_reader: split responsibility of ck_range Previously, next_range_tombstone took as an argument a clustering key range, which served two purposes. One was for accesing only specified key ranges from the partition, the other was for deciding in which order the mutation fragments should be emitted. This commits separates these responsibilities, since in the advent of native memtable reader, these two responsibilities are no longer common. The split is propagated to the rest of the partition_snapshot_reader.hh to avoid confusion.	2021-10-07 17:04:44 +02:00
Michał Radwański	fc51d2cc8c	partition_snapshot_reader: separate _schema into _query_schema and _partition_schema After memtable starts supporting reverse order queries, the schema provided to the readers will be reversed (reverse clustering order). Reading from memtable in reverse requires two schemas - one to access the memtable internal data structures (_partition_schema), and the other one (_query_schema), the schema imposing clustering order on returned mutation fragments. This commit prepares for introduction of native reverse queries for memtable, by separating these responsibilities. For now, they are still initialized with the schema passed from query.	2021-10-07 17:04:44 +02:00
Pavel Emelyanov	5515f7187d	range_tombstone, code: Add range_tombstone& getters Currently all the code operates on the range_tombstone class. and many of those places get the range tombstone in question from the range_tombstone_list. Next patches will make that list carry (and return) some new object called range_tombstone_entry, so all the code that expects to see the former one there will need to patched to get the range_tombstone from the _entry one. This patch prepares the ground for that by introdusing the range_tombstone& tombstone() { return *this; } getter on the range_tombstone itself and patching all future users of the _entry to call .tombstone() right now. Next patch will remove those getters together with adding the new range_tombstone_entry object thus automatically converting all the patched places into using the entry in a proper way. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-09-03 19:34:45 +03:00
Benny Halevy	4476800493	flat_mutation_reader: get rid of timeout parameter Now that the timeout is taken from the reader_permit. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-08-24 16:30:51 +03:00
Michael Livshin	69ade155be	partition_snapshot_reader: rename MemoryAccounter to just Accounter Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2021-08-01 19:41:11 +03:00
Michael Livshin	4d8f99df25	remove the newly-unused `partition_snapshot_reader_dummy_accounter` (along with the `make_partition_snapshot_flat_reader` overload that used it) Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2021-08-01 19:41:11 +03:00
Tomasz Grabiec	e74c3c885e	partition_snapshot_reader: Avoid quadratic behavior with lots of range tombstones next_range_tombstone() was populating _rt_stream on each invocation from the current iterator ranges in _range_tombstones. If there is a lot of range tombstones, all would be put into _rt_stream. One problem is that this can cause a reactor stall. Fix by more incremental approach where we populate _rt_stream with minimal amount on each invocation of next_range_tombstone(). Another problem is that this can get quadratic. The iterators in _range_tombstones are advanced, but if lsa invalidates them across calls they can revert back to the front since they go back to _last_rt, which is the last consumed range tombstone, and if the buffer fills up, not all tombstones from _rt_stream could be consumed. The new code doesn't have this problem because everything which is produced out of the iterators in _range_tombstones is produced only once. What we put into _rt_stream is consumed first before we try to feed the _rt_stream with more data.	2021-07-26 17:48:05 +02:00
Piotr Jastrzebski	27fe3c3aa0	partition_snapshot_flat_reader: Fix typo in next_range_rombstone Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <2fd48c08092d6ed1b452b6fe2e43e2273a78c8c2.1625500334.git.piotr@scylladb.com>	2021-07-05 19:39:08 +03:00
Tomasz Grabiec	4046dda844	partition_snapshot_flat_reader: Trim range tombstones to query ranges This is needed to change the guarantees of flat_mutation_reader v1 to produce only range tombstones trimmed to clustering restrictions. The reason for this is so that v2 has a canonical representation in which all fragments have position inside clustering restrictions. Conversion from v1 to v2 can guarantee that only if v1 trims range tombstones.	2021-06-16 00:23:49 +02:00
Pavel Solodovnikov	76bea23174	treewide: reduce header interdependencies Use forward declarations wherever possible. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Closes #8813	2021-06-07 15:58:35 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	fff7ef1fc2	treewide: reduce boost headers usage in scylla header files `dev-headers` target is also ensured to build successfully. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-05-20 01:33:18 +03:00
Benny Halevy	5b22731f9a	flat_mutation_reader: require close Make flat_mutation_reader::impl::close pure virtual so that all implementations are required to implemnt it. With that, provide a trivial implementation to all implementations that currently use the default, trivial close implementation. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-04-25 11:35:07 +03:00
Pavel Emelyanov	2e6255c499	partition_snapshot_reader: Emit range tombstones on demand Currently the reader gets all range tombstones from the given range and places them into a stream. When filling the buffer with fragments the range tombstones are extracted from the stream one by one. This is memory consuming, the reader's memory usage shouldn't depend on the number of inhabitants in the partition range. The patch implements the heap-based cursor for range tombstones almost like it's done for rows. The heap contains range_tombstone_list::iterator_ranges, the tombstones are popped from the heap when needed, are applied into the stream and then are emitted from it into the buffer. The refresh_state() is called on each new range to set up the iterators, and when lsa reports references invalidation to refresh the iterators. To let the refresh_state revalidate the iterators, the position at which the last range tombstone was emitted is maintained. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-16 12:08:18 +03:00
Pavel Emelyanov	ef61f84426	partition_snapshot_reader: Introduce maybe_refresh_state The existing refresh_state() is supposed to setup or revalidate iterators to rows inside partition versions if needed. It will be called in more than one place soon, so here's the helper. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-16 12:08:18 +03:00
Pavel Emelyanov	5e0a8130d4	partition_snapshot_reader: Move range tombstone stream member The lsa_partition_reader is the helper sub-class for partition_snapshot_reader that, among other things, is responsible for filling the stream of range tombstones, that's then used by the reader itself. Next patches will change the way range tombstones are emitted by the reader, so hide the stream inside the helper subclass in advance. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-16 12:08:18 +03:00
Pavel Emelyanov	755d993031	partition_snapshot_reader: Add reset_state method to helper class This method "notifies" the lsa_reader helper class when the owning reader moves to a new range. This method is now empty, but will be used by next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-16 12:07:20 +03:00
Pavel Emelyanov	a387fbd984	partition_snapshot_reader: Downgrade heap comparator Next patch will extend the comparator to manage heap of range tombstones. Not to add yet another comparator to it (and not to create another heap comparator class) just use the comparator that's common for both -- rows and range tombstones. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-16 12:06:19 +03:00
Pavel Emelyanov	2179014efa	partition_snapshot_reader: Use on-demand comparators There are already two raii-sh comparators on reader, next patch will need to add the third. This just bloats the reader, the comparators in question are state-less and can be created on demand for free. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-16 12:04:47 +03:00
Pavel Emelyanov	5c0f9a8180	mutation_partition: Switch cache of rows onto B-tree The switch is pretty straightforward, and consists of - change less-compare into tri-compare - rename insert/insert_check into insert_before_hint - use tree::key_grabber in mutation_partition::apply_monotonically to exception-safely transfer a row from one tree to another - explicitly erase the row from tree in rows_entry::on_evicted, there's a O(1) tree::iterator method for this - rewrite rows_entry -> cache_entry transofrmation in the on_evicted to fit the B-tree API - include the B-tree's external memory usage into stats That's it. The number of keys per node was is set to 12 with linear search and linear extention of 20 because - experimenting with tree shows that numbers 8 through 10 keys with linear search show the best performance on stress tests for insert/find-s of keys that are memcmp-able arrays of bytes (which is an approximation of current clustring key compare). More keys work slower, but still better than any bigger value with any type of search up to 64 keys per node - having 12 keys per nodes is the threshold at which the memory footprint for B-tree becomes smaller than for boost::intrusive::set for partitions with 32+ keys - 20 keys for linear root eats the first-split peak and still performs well in linear search As a result the footpring for B tree is bigger than the one for BST only for trees filled with 21...32 keys by 0.1...0.7 bytes per key. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-02-02 09:30:30 +03:00
Pavel Emelyanov	165255e2bd	partition_snapshot_reader: Rename cmp to less for explicity This is less comparator, cmp is used as a sign of tri-compare in this set. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-02-02 09:30:30 +03:00
Tomasz Grabiec	94749b01eb	Merge "futurize flat_mutation_reader::next_partition" from Benny The main motivation for this patchset is to prepare for adding a async close() method to flat_mutation_reader. In order to close the reader before destroying it in all paths we need to make next_partition asynchronous so it can asynchronously close a current reader before destoring it, e.g. by reassignment of flat_mutation_reader_opt, as done in scanning_reader::next_partition. Test: unit(release, debug) * git@github.com:bhalevy/scylla.git futurize-next-partition-v1: flat_mutation_reader: return future from next_partition multishard_mutation_query: read_context: save_reader: destroy reader_meta from the calling shard mutation_reader: filtering_reader: fill_buffer: futurize inner loop flat_mutation_reader::impl: consumer_adapter: futurize handle_result flat_mutation_reader: consume_pausable/in_thread: futurize_invoke consumer flat_mutation_reader: FlatMutationReaderConsumer: support also async consumer flat_mutation_reader:impl: get rid of _consume_done member	2021-01-19 10:19:03 +02:00
Avi Kivity	60f5ec3644	Merge 'managed_bytes: switch to explicit linearization' from Michał Chojnowski This is a revival of #7490. Quoting #7490: The managed_bytes class now uses implicit linearization: outside LSA, data is never fragmented, and within LSA, data is linearized on-demand, as long as the code is running within with_linearized_managed_bytes() scope. We would like to stop linearizing managed_bytes and keep it fragmented at all times, since linearization can require large contiguous chunks. Large contiguous allocations are hard to satisfy and cause latency spikes. As a first step towards that, we remove all implicitly linearizing accessors and replace them with an explicit linearization accessor, with_linearized(). Some of the linearization happens long before use, by creating a bytes_view of the managed_bytes object and passing it onwards, perhaps storing it for later use. This does not work with with_linearized(), which creates a temporary linearized view, and does not work towards the longer term goal of never linearizing. As a substitute a managed_bytes_view class is introduced that acts as a view for managed_bytes (for interoperability it can also be a view for bytes and is compatible with bytes_view). By the end of the series, all linearizations are temporary, within the scope of a with_linearized() call and can be converted to fragmented consumption of the data at leisure. This has limited practical value directly, as current uses of managed_bytes are limited to keys (which are limited to 64k). However, it enables converting the atomic_cell layer back to managed_bytes (so we can remove IMR) and the CQL layer to managed_bytes/managed_bytes_view, removing contiguous allocations from the coordinator. Closes #7820 * github.com:scylladb/scylla: test: add hashers_test memtable: fix accounting of managed_bytes in partition_snapshot_accounter test: add managed_bytes_test utils: fragment_range: add a fragment iterator for FragmentedView keys: update comments after changes and remove an unused method mutation_test: use the correct preferred_max_contiguous_allocation in measuring_allocator row_cache: more indentation fixes utils: remove unused linearization facilities in `managed_bytes` class misc: fix indentation treewide: remove remaining `with_linearized_managed_bytes` uses memtable, row_cache: remove `with_linearized_managed_bytes` uses utils: managed_bytes: remove linearizing accessors keys, compound: switch from bytes_view to managed_bytes_view sstables: writer: add write_* helpers for managed_bytes_view compound_compat: transition legacy_compound_view from bytes_view to managed_bytes_view types: change equal() to accept managed_bytes_view types: add parallel interfaces for managed_bytes_view types: add to_managed_bytes(const sstring&) serializer_impl: handle managed_bytes without linearizing utils: managed_bytes: add managed_bytes_view::operator[] utils: managed_bytes: introduce managed_bytes_view utils: fragment_range: add serialization helpers for FragmentedMutableView bytes: implement std::hash using appending_hash utils: mutable_view: add substr() utils: fragment_range: add compare_unsigned utils: managed_bytes: make the constructors from bytes and bytes_view explicit utils: managed_bytes: introduce with_linearized() utils: managed_bytes: constrain with_linearized_managed_bytes() utils: managed_bytes: avoid internal uses of managed_bytes::data() utils: managed_bytes: extract do_linearize_pure() thrift: do not depend on implicit conversion of keys to bytes_view clustering_bounds_comparator: do not depend on implicit conversion of keys to bytes_view cql3: expression: linearize get_value_from_mutation() eariler bytes: add to_bytes(bytes) cql3: expression: mark do_get_value() as static	2021-01-18 11:01:28 +02:00
Raphael S. Carvalho	97e076365e	Fix stalls on Memtable flush by preempting across fragment generation if needed Flush is facing stalls because partition_snapshot_flat_reader::fill_buffer() generates mutation fragment until buffer is full[1] without yielding. this is the code path: flush_reader::fill_buffer() <---------\| flat_mutation_reader::consume_pausable() <--------\| partition_snapshot_flat_reader::fill_buffer() -\| [1]: https://github.com/scylladb/scylla/blob/6cfc949e/partition_snapshot_reader.hh#L261 This is fixed by breaking the loop in do_fill_buffer() if preemption is needed, allowing do_until() to yield in sequence, and when it resumes, continue from where it left off, until buffer is full. Fixes #7885. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20210114141417.285175-1-raphaelsc@scylladb.com>	2021-01-14 16:30:55 +02:00
Benny Halevy	29002e3b48	flat_mutation_reader: return future from next_partition To allow it to asynchronously close underlying readers on next_partition(). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-01-13 17:35:07 +02:00
Pavel Solodovnikov	8709844566	misc: fix indentation The patch fixes indentation issues introduced in previous patches related to removing `with_linearized_managed_bytes` uses from the code tree. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-01-08 14:16:08 +01:00

1 2

87 Commits