scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 19:46:48 +00:00

Author	SHA1	Message	Date
Raphael S. Carvalho	818830715f	Fix potential infinite recursion when combining mutations for leveled compaction The issue is triggered by compaction of sstables of level higher than 0. The problem happens when the interval map of a partitioned sstable set stores intervals such as the following: [-9223362900961284625 : -3695961740249769322 ] (-3695961740249769322 : -3695961103022958562 ] When the selector is called for the first interval above, the exclusive lower bound of the second interval is returned as the next token, but the inclusiveness info is not returned. So reader_selector was reporting that there were new readers when the current token was -3695961740249769322 because it was stored in the selector position field as inclusive, but it's actually exclusive. This false positive was leading to infinite recursion in the combined reader because the sstable set's incremental selector itself knew that there were actually no new readers, and therefore no progress could be made. The fix is to use ring_position in reader_selector, such that inclusiveness is respected. So reader_selector::has_new_readers() won't return a false positive under the conditions described above. Fixes #2908. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2018-01-03 16:23:01 -02:00
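The false positive described in the commit above can be illustrated with a small Python sketch. This is not ScyllaDB's C++ code: `Position`, its `after` field, and both helper functions are hypothetical names, and the sketch only assumes that a position which records inclusiveness sorts an exclusive lower bound just after the token itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True, order=True)
class Position:
    """Hypothetical stand-in for a ring position: a token plus a bound kind."""
    token: int
    # 0 = at the token itself (inclusive bound)
    # 1 = just after the token (exclusive lower bound of the next interval)
    after: int = 0


def has_new_readers_token_only(current_token: int, next_token: int) -> bool:
    # Token-only comparison: the bound kind is lost, so an exclusive
    # lower bound is treated as if it were inclusive.
    return current_token >= next_token


def has_new_readers(current: Position, next_pos: Position) -> bool:
    # Comparison on (token, after) respects inclusiveness.
    return current >= next_pos


TOKEN = -3695961740249769322  # boundary token quoted in the commit message

# Token-only check claims new readers exist at the boundary token.
false_positive = has_new_readers_token_only(TOKEN, TOKEN)

# With the bound kind recorded, the boundary token itself is not yet
# inside the second interval, so no new readers are reported.
respects_bound = has_new_readers(Position(TOKEN), Position(TOKEN, after=1))
```

In this model the token-only check keeps answering "yes" while the selector has nothing new to offer, which is the no-progress situation the commit message describes.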
Avi Kivity	8795238869	Merge "Fix handling of range tombstones starting at same position" from Tomasz "When we get two range tombstones with the same lower bound from different data sources (e.g. two sstable), which need to be combined into a single stream, they need to be de-overlapped, because each mutation fragment in the stream must have a different position. If we have range tombstones [1, 10) and [1, 20), the result of that de-overlapping will be [1, 10) and [10, 20]. The problem is that if the stream corresponds to a clustering slice with upper bound greater than 1, but lower than 10, the second range tombstone would appear as being out of the query range. This is currently violating assumptions made by some consumers, like cache populator. One effect of this may be that a reader will miss rows which are in the range (1, 10) (after the start of the first range tombstone, and before the start of the second range tombstone), if the second range tombstone happens to be the last fragment which was read for a discontinuous range in cache and we stopped reading at that point because of a full buffer and cache was evicted before we resumed reading, so we went to reading from the sstable reader again. There could be more cases in which this violation may resurface. There is also a related bug in mutation_fragment_merger. If the reader is in forwarding mode, and the current range is [1, 5], the reader would still emit range_tombstone([10, 20]). If that reader is later fast forwarded to another range, say [6, 8], it may produce fragments with smaller positions which were emitted before, violating monotonicity of fragment positions in the stream. A similar bug was also present in partition_snapshot_flat_reader. 
Possible solutions: 1) relax the assumption (in cache) that streams contain only relevant range tombstones, and only require that they contain at least all relevant tombstones 2) allow subsequent range tombstones in a stream to share the same starting position (position is weakly monotonic), then we don't need to de-overlap the tombstones in readers. 3) teach combining readers about query restrictions so that they can drop fragments which fall outside the range 4) force leaf readers to trim all range tombstones to query restrictions This patch implements solution no 2. It simplifies combining readers, which don't need to accumulate and trim range tombstones. I don't like solution 3, because it makes combining readers more complicated, slower, and harder to properly construct (currently combining readers don't need to know restrictions of the leaf streams). Solution 4 is confined to implementations of leaf readers, but also has disadvantage of making those more complicated and slower. There is only one consumer which needs the tombstones with monotonic positions, and that is the sstable writer. Fixes #3093." 
* tag 'tgrabiec/fix-out-of-range-tombstones-v1' of github.com:scylladb/seastar-dev: tests: row_cache: Introduce test for concurrent read, population and eviction tests: sstables: Add test for writing combined stream with range tombstones at same position tests: memtable: Test that combined mutation source is a mutation source tests: memtable: Test that memtable with many versions is a mutation source tests: mutation_source: Add test for stream invariants with overlapping tombstones tests: mutation_reader: Test fast forwarding of combined reader with overlapping range tombstones tests: mutation_reader: Test combined reader slicing on random mutations tests: mutation_source_test: Extract random_mutation_generator::make_partition_keys() mutation_fragment: Introduce range() clustering_interval_set: Introduce overlaps() clustering_interval_set: Extract private make_interval() mutation_reader: Allow range tombstones with same position in the fragment stream sstables: Handle consecutive range_tombstone fragments with same position tests: streamed_mutation_assertions: Merge range_tombstones with the same position in produces_range_tombstone() streamed_mutation: Introduce peek() mutation_fragment: Extract mergeable_with() mutation_reader: Move definition of combining mutation reader to source file mutation_reader: Use make_combined_reader() to create combined reader	2018-01-02 18:32:09 +02:00
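The de-overlapping problem described in the merge message above can be illustrated with a small Python sketch. It is not ScyllaDB's C++ code: `deoverlap` and `starts_within` are hypothetical names, tombstones are modelled as `(start, end)` pairs of integers, and timestamps and bound inclusiveness are deliberately ignored.

```python
def deoverlap(a, b):
    """Split two tombstones that share a start so their starts differ.

    The shorter tombstone is kept as is; the longer one is trimmed to
    begin where the shorter one ends.
    """
    (start, end_a), (_, end_b) = sorted([a, b], key=lambda t: t[1])
    return [(start, end_a), (end_a, end_b)]


def starts_within(tombstones, slice_end):
    """For each tombstone, report whether it starts before the slice's upper bound."""
    return [start < slice_end for start, _ in tombstones]


first, second = (1, 10), (1, 20)
slice_end = 5  # query slice with an upper bound between 1 and 10

# Old behaviour: the second tombstone is moved to start at 10, which is
# outside a slice ending at 5 even though it covers part of that slice.
trimmed = deoverlap(first, second)
trimmed_in_range = starts_within(trimmed, slice_end)

# Solution 2 (weakly monotonic positions): both tombstones keep start 1,
# so both still start inside the slice.
shared_in_range = starts_within([first, second], slice_end)
```

Under these assumptions the trimmed stream yields one fragment whose start lies outside the queried slice, which is the invariant violation that consumers such as the cache populator trip over.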
Duarte Nunes	1374f898b9	Merge seastar upstream Class optimized_optional was moved into seastar, and its usage simplified so move_and_disengage() is replaced in favour of std::exchange(_, { }). * seastar adaca37...b0f5591 (9): > Merge "core: Introduce cancellation mechanism" from Duarte > Fix Seastar build that no longer builds with --enable-dpdk after the recent commit fd87ea2 > noncopyable_function: support function objects whose move constructors throw > Adding new hardware options to new config format, using new config format for dpdk device > Fix check for Boost version during pre-build configuration. > variant_utils: add variant_visitor constructor for C++17 mode > Merge "Allows json object to be stream to an" from Amnon > Merge 'Default to C++17' from Avi > Add const version of subscript operator to circular_buffer Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20171228112126.18142-1-duarte@scylladb.com>	2017-12-28 13:24:18 +02:00
Tomasz Grabiec	60ed5d29c0	mutation_reader: Move definition of combining mutation reader to source file So that the whole world doesn't recompile when it changes.	2017-12-21 21:24:11 +01:00
Tomasz Grabiec	52285a9e73	mutation_reader: Use make_combined_reader() to create combined reader So that we can hide the definition of combined_mutation_reader. It's also less verbose.	2017-12-21 21:24:11 +01:00
Piotr Jastrzebski	04ce7dfb84	Remove unused make_combined_reader overload. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	b3b6db4f50	Remove unused make_combined_reader overload. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	024e01ad9e	mutation_source: Add constructors for sources that ignore forwarding Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 16:59:57 +01:00
Piotr Jastrzebski	ff718d6573	Add default parameter values in make_combined_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	ac1d2f98e4	Fix build by removing semicolon after concept Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <4504cf47be0a451c58052476bc8cc4f9cba59472.1513248094.git.piotr@scylladb.com>	2017-12-14 10:46:13 +00:00
Paweł Dziepak	a0a13ceb46	filtering_reader: switch to flat mutation fragment streams	2017-12-13 12:01:03 +00:00
Paweł Dziepak	3bbb3b300d	filtering_reader: pass a const dht::decorated_key& to the callback All users of the filtering reader need only the decorated key of a partition, but currently the predicate is given a reference to streamed_mutations which are obsolete now.	2017-12-13 11:57:27 +00:00
Paweł Dziepak	d8dad04564	mutation_reader: drop make_restricted_reader() make_restricted_reader() has been replaced by make_restricted_flat_reader().	2017-12-13 11:57:22 +00:00
Paweł Dziepak	3839bc5d60	mutation_reader: convert restricted reader to flat streams	2017-12-13 10:46:41 +00:00
Botond Dénes	9661769313	combined_mutation_reader: fix fast-forwarding related row-skipping bug When fast forwarding is enabled and all readers positioned inside the current partition return EOS, return EOS from the combined-reader too, instead of skipping to the next partition if there are idle readers (positioned at some later partition) available. Skipping to the next partition causes rows to be skipped in some cases. The fix is to distinguish EOS'd readers that are only halted (waiting for a fast-forward) from those really out of data. To achieve this we track the last fragment-kind the reader emitted. If that was a partition-end then the reader is out of data, otherwise it might emit more fragments after a fast-forward. Without this additional information it is impossible to determine why a reader reached EOS and the code later may make the wrong decision about whether the combined-reader as a whole is at EOS or not. Also when fast-forwarding between partition-ranges or calling next_partition() we set the last fragment-kind of forwarded readers because they should emit a partition-start, otherwise they are out of data. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <6f0b21b1ec62e1197de6b46510d5508cdb4a6977.1512569218.git.bdenes@scylladb.com>	2017-12-06 16:09:05 +02:00
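The distinction drawn in the commit above, between a reader that is halted and one that is out of data, can be sketched in Python as follows. This is an illustration only, not the combined reader's actual implementation: `FragmentKind`, `reader_state`, and `may_skip_to_next_partition` are hypothetical names, and each reader is reduced to a pair of "at EOS" flag and last emitted fragment kind.

```python
from enum import Enum, auto


class FragmentKind(Enum):
    PARTITION_START = auto()
    CLUSTERING_ROW = auto()
    PARTITION_END = auto()


def reader_state(at_eos: bool, last_kind: FragmentKind) -> str:
    """Classify a reader using the last fragment kind it emitted."""
    if not at_eos:
        return "active"
    if last_kind is FragmentKind.PARTITION_END:
        # The partition was closed, so nothing more can come from it.
        return "out_of_data"
    # EOS in the middle of a partition: waiting for a fast-forward.
    return "halted"


def may_skip_to_next_partition(readers) -> bool:
    """Only move on once every reader in the current partition is truly done."""
    return all(reader_state(eos, kind) == "out_of_data" for eos, kind in readers)


# One reader finished the partition, the other stopped after a row and
# may still produce fragments once fast-forwarded.
mixed = [(True, FragmentKind.PARTITION_END), (True, FragmentKind.CLUSTERING_ROW)]
done = [(True, FragmentKind.PARTITION_END), (True, FragmentKind.PARTITION_END)]
```

With only the EOS flag available, `mixed` and `done` look identical, which is why the extra piece of state is needed.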
Botond Dénes	e7535f5e88	Add flat_mutation_reader overload of make_combined_reader	2017-12-04 07:57:43 +02:00
Botond Dénes	8731c1bc66	Flatten the implementation of combined_mutation_reader In fact flatten mutation_reader_merger and adjust combined_mutation_reader accordingly.	2017-12-04 07:57:43 +02:00
Botond Dénes	217740c608	Add mutation_fragment_merger This is the mutation fragment level equivalent of mutation_merger. It merges fragments produced by different sources. Mutation fragments are not as self-contained as streamed mutations, they have external context, e.g. the partition they belong to. To support this mutation_fragment_merger operates on a producer instead of a vector of fragments. Producer can have internal state and can do side-actions as fragments are consumed.	2017-12-04 07:57:43 +02:00
Botond Dénes	3f8110b5b6	Make combined_mutation_reader a flat_mutation_reader For now only the interface is converted; behind the scenes the previous implementation remains, and its output is simply converted by flat_mutation_reader_from_mutation_reader. The implementation will be converted in the following patches.	2017-12-04 07:57:43 +02:00
Botond Dénes	c011747c30	Move the mutation merging logic to combined_mutation_reader This is the second step in splitting the combined reader's logic into two parts as outlined in the previous patch.	2017-12-04 07:57:43 +02:00
Botond Dénes	3681e17555	Remove the unnecessary indirection of mutation_reader_merger::next()	2017-12-04 07:57:43 +02:00
Botond Dénes	c5e57e0961	Move the implementation of combined_mutation_reader into mutation_reader_merger This simple code-movement and patch lays the groundwork for splitting the logic in combined_mutation_reader into two blocks: * one that takes care of moving the readers in lockstep and emits their output as a non-decreasing stream of streamed_mutations and * one that takes care of merging the above stream into strictly-increasing stream of streamed_mutations. This in turn is preparation-work to the transformation of combined_mutation_reader into a flat_mutation_reader::impl.	2017-12-04 07:57:43 +02:00
Botond Dénes	85b5ded670	Remove unused mutation_and_reader::less_compare and operator<	2017-12-04 07:57:43 +02:00
Paweł Dziepak	6a1fe70a72	mutation_reader: drop consume_flattened_in_thread()	2017-11-23 18:14:31 +00:00
Tomasz Grabiec	aa8c2cbc16	Merge "Migrate sstables to flat_mutation_reader" from Piotr Introduce sstable::read_row_flat and sstable::read_range_rows_flat methods and use them in sstable::as_mutation_source. * https://github.com/scylladb/seastar-dev/tree/haaawk/flat_reader_sstables_v3: Introduce conversion from flat_mutation_reader to streamed_mutation Add sstables::read_rows_flat and sstables::read_range_rows_flat Turn sstable_mutation_reader into a flat_mutation_reader sstable: add getter for filter_tracker Move mp_row_consumer methods implementations to the bottom Remove unused sstable_mutation_reader constructor Replace "sm" with "partition" in get_next_sm and on_sm_finished Move advance_to_upper_bound above sstable_mutation_reader Store sstable_mutation_reader pointer in mp_row_consumer Stop using streamed_mutation in consumer and reader Stop using streamed_mutation in sstable_data_source Delete sstable_streamed_mutation Introduce sstable::read_row_flat Migrate sstable::as_mutation_source to flat_mutation_reader Remove single_partition_reader_adaptor Merge data_consume_context::impl into data_consume_context Create data_consume_context_opt. Merge on_partition_finished into mark_partition_finished Check _partition_finished instead of _current_partition_key Merge sstable_data_source into sstable_mutation_reader Remove sstable_data_source Remove get_next_partition and partition_header	2017-11-22 15:45:21 +01:00
Paweł Dziepak	5753e85c6b	mutation_reader: drop consume_flattened() consume_flattened() has been fully replaced by flat_mutation_reader::consume()	2017-11-21 11:37:04 +00:00
Piotr Jastrzebski	3f70dfc939	Introduce conversion from flat_mutation_reader to streamed_mutation Allows splitting migration into small steps. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-15 15:33:23 +01:00
Paweł Dziepak	97767963a0	mutation_reader: drop multi_range_reader	2017-11-13 16:49:52 +00:00
Paweł Dziepak	11e8866aee	flat_mutation_reader: add partition_range_forwarding flat_mutation_reader::partition_range_forwarding and mutation_reader::forwarding are aliases of the same type. The change was necessary in order to make mutation_reader::forwarding available in flat_mutation_reader.hh even though it is included by mutation_reader.hh	2017-11-13 16:49:52 +00:00
Piotr Jastrzebski	acfc6fef55	Simplify flat_mutation_reader wrappers If a wrapper takes a flat_mutation_reader in a constructor then it does not have to take schema_ptr because it can obtain it from the inner flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <88c3672df08d2ac465711e9138d426e43ae9c62b.1510331382.git.piotr@scylladb.com>	2017-11-13 08:53:34 +01:00
Piotr Jastrzebski	9233ee7309	Move FlattenedConsumer concept to flat_mutation_reader.hh This concept will be used both in flat_mutation_reader.hh and mutation_reader.hh. mutation_reader.hh includes flat_mutation_reader.hh so we have to move the concept to make it accessible in both files. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 14:14:51 +01:00
Piotr Jastrzebski	6efda10790	Add mutation_source::make_flat_mutation_reader This will be used as an intermediate state of migration from mutation_reader to flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 12:58:31 +01:00
Piotr Jastrzebski	93e8b43e7b	Add flat reader mutation source implementation This will be used by sources that are migrated to flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 12:41:12 +01:00
Piotr Jastrzebski	1a7936561e	Prepare mutation_source for more than one implementation There will be a second implementation that will be used by sources that are converted to flat_mutation_reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-11-08 12:41:12 +01:00
Duarte Nunes	baeec0935f	Replace query::full_slice with schema::full_slice() query::full_slice doesn't select any regular or static columns, which is at odds with the expectations of its users. This patch replaces it with the schema::full_slice() version. Refs #2885 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <1507732800-9448-2-git-send-email-duarte@scylladb.com>	2017-10-17 11:25:53 +02:00
Paweł Dziepak	8c3b7fea81	Merge "Introduce new API and converters from/to old mutation_reader" from Piotr "This changeset is the first step to flatten mutation_reader. Then it introduces new mutation_fragment types for partition header and end of partition. Using those a new flat_mutation_reader is defined. Finally it introduces converters between new flat_mutation_reader and old mutation_reader." * 'haaawk/flattened_mutation_reader_v12' of github.com:scylladb/seastar-dev: Add tests for flat_mutation_reader Introduce conversion from flat_mutation_reader to mutation_reader Introduce conversion from mutation_reader to flat_mutation_reader Introduce flat_mutation_reader Extract FlattenedConsumer concept using GCC6_CONCEPT Introduce partition_end mutation_fragment Introduce a position for end of partition Introduce partition_start mutation_fragment Introduce FragmentConsumer Introduce a position for partition start streamed_mutation: Extract concepts using GCC6_CONCEPT macro	2017-10-16 12:14:23 +01:00
Piotr Jastrzebski	31733a7eeb	Introduce conversion from flat_mutation_reader to mutation_reader This will be used in transition from mutation_reader to flat_mutation_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-10-13 16:08:59 +02:00
Piotr Jastrzebski	f325fef362	Extract FlattenedConsumer concept using GCC6_CONCEPT This concept will be used in flat_mutation_reader::consume Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-10-10 16:15:59 +02:00
Piotr Jastrzebski	2516b42752	Introduce partition_start mutation_fragment This type of mutation_fragment will be used in new mutation_reader to signal the beginning of the next partition. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-10-10 16:15:59 +02:00
Botond Dénes	a43901f842	row_consumer: de-virtualize io_priority() and resource_tracker() Fixes #2830 Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <448a1f739ab8c88a7a5562bce8dce5ae6efdf934.1507302530.git.bdenes@scylladb.com>	2017-10-06 18:50:12 +01:00
Botond Dénes	fea6214a0a	Update reader restriction related metrics Update description of existing reader count metrics, add memory consumption metrics. Use labels to distinguish between system, user and streaming reads related metrics.	2017-10-03 12:44:17 +03:00
Botond Dénes	47e07b787e	restricted_mutation_reader: restrict based on memory consumption Restrict readers based on their memory consumption, instead of the count of the top-level readers. To do this an interposer is installed at the input_stream level which tracks buffers emitted by the stream. This way we can have an accurate picture of the readers' actual memory consumption. New readers will consume 16k units from the semaphore up-front. This is to account for their own memory consumption, apart from the buffers they will allocate. Creating the reader will be deferred to when there are enough resources to create it. As before, only new readers will be blocked on an exhausted semaphore; existing readers can continue to work.	2017-10-03 12:44:12 +03:00
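The admission scheme described in the commit above can be sketched with a plain counter in Python. This is a simplified model, not the restricted reader's real code: `ReaderBudget` and its methods are hypothetical names, the "16k units" are assumed to mean 16 * 1024, and the real implementation uses a semaphore and futures rather than a synchronous queue.

```python
from collections import deque

READER_BASE_COST = 16 * 1024  # assumed reading of "16k units" per new reader


class ReaderBudget:
    """Counter-based model of memory-based reader admission."""

    def __init__(self, units: int):
        self.available = units
        self.waiting = deque()  # readers whose creation is deferred
        self.admitted = []      # readers that have been created

    def request_reader(self, name: str) -> None:
        # New readers queue up and are created only when the budget allows.
        self.waiting.append(name)
        self._admit_waiting()

    def track_buffer(self, size: int) -> None:
        # Existing readers are never blocked: their buffers are always
        # accounted for, even if that drives the balance below zero.
        self.available -= size

    def release(self, size: int) -> None:
        self.available += size
        self._admit_waiting()

    def _admit_waiting(self) -> None:
        while self.waiting and self.available >= READER_BASE_COST:
            self.available -= READER_BASE_COST
            self.admitted.append(self.waiting.popleft())


budget = ReaderBudget(32 * 1024)
budget.request_reader("a")       # admitted, 16k left
budget.track_buffer(8 * 1024)    # reader "a" allocates a buffer, 8k left
budget.request_reader("b")       # deferred: fewer than 16k units available
deferred = list(budget.waiting)
budget.release(8 * 1024)         # buffer freed, "b" can now be created
```

The point of the sketch is the asymmetry: `track_buffer` never waits, while `request_reader` may leave the reader in the queue until enough units are returned.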
Botond Dénes	0a07e9e7c7	mutation_reader.hh: Move restricted_reader related code In preparation of make_restricted_reader taking a mutation_source as its argument.	2017-10-03 12:39:22 +03:00
Avi Kivity	78eae8bf48	Revert "Merge "Make restricting_mutation_reader more accurate" from Botond" This reverts commit `c6e5dcc556`, reversing changes made to `19b21a0ab2`. Fails to build, plus author has more changes.	2017-10-03 11:58:59 +03:00
Botond Dénes	43dba8f173	Update reader restriction related metrics Update description of existing reader count metrics, add memory consumption metrics.	2017-09-20 11:16:21 +03:00
Botond Dénes	33e97e7457	restricted_mutation_reader: restrict based on memory consumption Restrict readers based on their memory consumption, instead of the count of the top-level readers. To do this an interposer is installed at the input_stream level which tracks buffers emitted by the stream. This way we can have an accurate picture of the readers' actual memory consumption. New readers will consume 16k units from the semaphore up-front. This is to account for their own memory consumption, apart from the buffers they will allocate. Creating the reader will be deferred to when there are enough resources to create it. As before, only new readers will be blocked on an exhausted semaphore; existing readers can continue to work.	2017-09-20 11:14:35 +03:00
Botond Dénes	e4a9e55e0d	mutation_reader.hh: Move restricted_reader related code In preparation of make_restricted_reader taking a mutation_source as its argument.	2017-09-20 11:12:57 +03:00
Tomasz Grabiec	8a9f0f86e7	mutation_source: Introduce mutation_source::make_partition_presence_checker() Every mutation source can have a presence checker. By default all answer "maybe contains". Having this on mutation_source level will be useful for simplifying cache update flow. The cache can ask the right snapshot for a presence checker rather than relying on database to know when and how to make the right one which preserves all invariants. This will be especially useful once all updates of the underlying mutation source of cache (e.g. sstable list) will have to go through cache for safety reasons.	2017-09-04 10:04:29 +02:00
Tomasz Grabiec	065feb1b7b	mutation_reader: Move definitions up in the header	2017-09-04 10:04:29 +02:00
Tomasz Grabiec	4e4839082b	mutation_reader: Use constructor delegation to reduce code duplication	2017-09-04 10:04:29 +02:00

116 Commits