scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 06:53:12 +00:00

Author	SHA1	Message	Date
Avi Kivity	375ed938b4	Merge "Fix potential infinite recursion in leveled compaction" from Raphael '"The issue is triggered by compaction of sstables of level higher than 0. The problem happens when interval map of partitioned sstable set stores intervals such as follow: [-9223362900961284625 : -3695961740249769322 ] (-3695961740249769322 : -3695961103022958562 ] When selector is called for first interval above, the exclusive lower bound of the second interval is returned as next token, but the inclusivess info is not returned. So reader_selector was returning that there were new readers when the current token was -3695961740249769322 because it was stored in selector position field as inclusive, but it's actually exclusive. This false positive was leading to infinite recursion in combined reader because sstable set's incremental selector itself knew that there were actually no new readers, and therefore no progress could be made." Fixes #2908.' * 'high_level_compaction_infinite_recursion_fix_v4' of github.com:raphaelsc/scylla: tests: test for infinite recursion bug when doing high-level compaction Fix potential infinite recursion when combining mutations for leveled compaction dht: make it easier to create ring_position_view from token dht: introduce is_min/max for ring_position	2018-01-07 13:22:17 +02:00
Raphael S. Carvalho	e641c0d333	tests: test for infinite recursion bug when doing high-level compaction Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2018-01-03 16:23:02 -02:00
Raphael S. Carvalho	818830715f	Fix potential infinite recursion when combining mutations for leveled compaction The issue is triggered by compaction of sstables of level higher than 0. The problem happens when interval map of partitioned sstable set stores intervals such as follow: [-9223362900961284625 : -3695961740249769322 ] (-3695961740249769322 : -3695961103022958562 ] When selector is called for first interval above, the exclusive lower bound of the second interval is returned as next token, but the inclusivess info is not returned. So reader_selector was returning that there were new readers when the current token was -3695961740249769322 because it was stored in selector position field as inclusive, but it's actually exclusive. This false positive was leading to infinite recursion in combined reader because sstable set's incremental selector itself knew that there were actually no new readers, and therefore no progress could be made. Fix is to use ring_position in reader_selector, such that inclusiveness would be respected. So reader_selector::has_new_readers() won't return false positive under the conditions described above. Fixes #2908. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2018-01-03 16:23:01 -02:00
Vlad Zolotarov	976f444813	tests: commitlog_test: fix the compilation and test errors introduced by the hinted_handoff series Use the default commitlog configuration with the hinted_handoff disabled in the tests. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <1514942938-3844-1-git-send-email-vladz@scylladb.com>	2018-01-03 12:20:34 +00:00
Raphael S. Carvalho	e29b598c5f	sstables: make compaction_descriptor's ctor explicit to avoid bad conversion perf sstable used old sstables::compact_sstables() interface and still compiled due to bad implicit conversion. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180103041900.21186-1-raphaelsc@scylladb.com>	2018-01-03 12:37:12 +02:00
Avi Kivity	8795238869	Merge "Fix handling of range tombstones starting at same position" from Tomasz "When we get two range tombstones with the same lower bound from different data sources (e.g. two sstable), which need to be combined into a single stream, they need to be de-overlapped, because each mutation fragment in the stream must have a different position. If we have range tombstones [1, 10) and [1, 20), the result of that de-overlapping will be [1, 10) and [10, 20]. The problem is that if the stream corresponds to a clustering slice with upper bound greater than 1, but lower than 10, the second range tombstone would appear as being out of the query range. This is currently violating assumptions made by some consumers, like cache populator. One effect of this may be that a reader will miss rows which are in the range (1, 10) (after the start of the first range tombstone, and before the start of the second range tombstone), if the second range tombstone happens to be the last fragment which was read for a discontinuous range in cache and we stopped reading at that point because of a full buffer and cache was evicted before we resumed reading, so we went to reading from the sstable reader again. There could be more cases in which this violation may resurface. There is also a related bug in mutation_fragment_merger. If the reader is in forwarding mode, and the current range is [1, 5], the reader would still emit range_tombstone([10, 20]). If that reader is later fast forwarded to another range, say [6, 8], it may produce fragments with smaller positions which were emitted before, violating monotonicity of fragment positions in the stream. A similar bug was also present in partition_snapshot_flat_reader. Possible solutions: 1) relax the assumption (in cache) that streams contain only relevant range tombstones, and only require that they contain at least all relevant tombstones 2) allow subsequent range tombstones in a stream to share the same starting position (position is weakly monotonic), then we don't need to de-overlap the tombstones in readers. 3) teach combining readers about query restrictions so that they can drop fragments which fall outside the range 4) force leaf readers to trim all range tombstones to query restrictions This patch implements solution no 2. It simplifies combining readers, which don't need to accumulate and trim range tombstones. I don't like solution 3, because it makes combining readers more complicated, slower, and harder to properly construct (currently combining readers don't need to know restrictions of the leaf streams). Solution 4 is confined to implementations of leaf readers, but also has disadvantage of making those more complicated and slower. There is only one consumer which needs the tombstones with monotonic positions, and that is the sstable writer. Fixes #3093." * tag 'tgrabiec/fix-out-of-range-tombstones-v1' of github.com:scylladb/seastar-dev: tests: row_cache: Introduce test for concurrent read, population and eviction tests: sstables: Add test for writing combined stream with range tombstones at same position tests: memtable: Test that combined mutation source is a mutation source tests: memtable: Test that memtable with many versions is a mutation source tests: mutation_source: Add test for stream invariants with overlapping tombstones tests: mutation_reader: Test fast forwarding of combined reader with overlapping range tombstones tests: mutation_reader: Test combined reader slicing on random mutations tests: mutation_source_test: Extract random_mutation_generator::make_partition_keys() mutation_fragment: Introduce range() clustering_interval_set: Introduce overlaps() clustering_interval_set: Extract private make_interval() mutation_reader: Allow range tombstones with same position in the fragment stream sstables: Handle consecutive range_tombstone fragments with same position tests: streamed_mutation_assertions: Merge range_tombstones with the same position in produces_range_tombstone() streamed_mutation: Introduce peek() mutation_fragment: Extract mergeable_with() mutation_reader: Move definition of combining mutation reader to source file mutation_reader: Use make_combined_reader() to create combined reader	2018-01-02 18:32:09 +02:00
Raphael S. Carvalho	2a7eaa4933	tests:perf: add compaction mode to perf_sstable Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20171209175759.7769-1-raphaelsc@scylladb.com>	2018-01-02 10:16:13 +01:00
Tomasz Grabiec	37ddc8bcfd	tests: row_cache: Introduce test for concurrent read, population and eviction	2017-12-22 11:58:17 +01:00
Tomasz Grabiec	42ec01661c	tests: sstables: Add test for writing combined stream with range tombstones at same position	2017-12-22 11:06:34 +01:00
Tomasz Grabiec	cb34420e1c	tests: memtable: Test that combined mutation source is a mutation source	2017-12-22 11:06:34 +01:00
Tomasz Grabiec	7ce02bc22e	tests: memtable: Test that memtable with many versions is a mutation source	2017-12-22 11:06:34 +01:00
Tomasz Grabiec	9cd35f4b90	tests: mutation_source: Add test for stream invariants with overlapping tombstones	2017-12-22 11:06:33 +01:00
Tomasz Grabiec	7ce52df88b	tests: mutation_reader: Test fast forwarding of combined reader with overlapping range tombstones	2017-12-22 11:06:33 +01:00
Tomasz Grabiec	ca6de9e78c	tests: mutation_reader: Test combined reader slicing on random mutations	2017-12-22 11:06:33 +01:00
Tomasz Grabiec	73a79372a4	tests: mutation_source_test: Extract random_mutation_generator::make_partition_keys()	2017-12-22 11:06:33 +01:00
Tomasz Grabiec	41ede08a1d	mutation_reader: Allow range tombstones with same position in the fragment stream When we get two range tombstones with the same lower bound from different data sources (e.g. two sstable), which need to be combined into a single stream, they need to be de-overlapped, because each mutation fragment in the stream must have a different position. If we have range tombstones [1, 10) and [1, 20), the result of that de-overlapping will be [1, 10) and [10, 20]. The problem is that if the stream corresponds to a clustering slice with upper bound greater than 1, but lower than 10, the second range tombstone would appear as being out of the query range. This is currently violating assumptions made by some consumers, like cache populator. One effect of this may be that a reader will miss rows which are in the range (1, 10) (after the start of the first range tombstone, and before the start of the second range tombstone), if the second range tombstone happens to be the last fragment which was read for a discontinuous range in cache and we stopped reading at that point because of a full buffer and cache was evicted before we resumed reading, so we went to reading from the sstable reader again. There could be more cases in which this violation may resurface. There is also a related bug in mutation_fragment_merger. If the reader is in forwarding mode, and the current range is [1, 5], the reader would still emit range_tombstone([10, 20]). If that reader is later fast forwarded to another range, say [6, 8], it may produce fragments with smaller positions which were emitted before, violating monotonicity of fragment positions in the stream. A similar bug was also present in partition_snapshot_flat_reader. Possible solutions: 1) relax the assumption (in cache) that streams contain only relevant range tombstones, and only require that they contain at least all relevant tombstones 2) allow subsequent range tombstones in a stream to share the same starting position (position is weakly monotonic), then we don't need to de-overlap the tombstones in readers. 3) teach combining readers about query restrictions so that they can drop fragments which fall outside the range 4) force leaf readers to trim all range tombstones to query restrictions This patch implements solution no 2. It simplifies combining readers, which don't need to accumulate and trim range tombstones. I don't like solution 3, because it makes combining readers more complicated, slower, and harder to properly construct (currently combining readers don't need to know restrictions of the leaf streams). Solution 4 is confined to implementations of leaf readers, but also has disadvantage of making those more complicated and slower. Fixes #3093.	2017-12-22 11:06:20 +01:00
Tomasz Grabiec	92b89d576d	tests: streamed_mutation_assertions: Merge range_tombstones with the same position in produces_range_tombstone() In preparation for allowing fragment stream to produce consecutive range tombstones with the same position.	2017-12-21 22:45:35 +01:00
Tomasz Grabiec	52285a9e73	mutation_reader: Use make_combined_reader() to create combined reader So that we can hide the definition of combined_mutation_reader. It's also less verbose.	2017-12-21 21:24:11 +01:00
Piotr Jastrzebski	759baa3a11	Migrate test_fast_forwarding_combining_reader to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	9e3da50ed1	Don't pass fwd to flat_mutation_reader_from_mutations if it's no Default value for fwd is no so there's no need to pass it explicitly. This is important because we will add additional parameter to flat_mutation_reader_from_mutations in next patch. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	202c562f68	Migrate test_combining_two_partially_overlapping_readers to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	6c62454076	Migrate test_combining_two_non_overlapping_readers to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	bef2cf8ed9	Migrate combined_mutation_reader_test to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	19d4bce624	Migrate test_sm_fast_forwarding_combining_reader to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	17e6f6b089	Migrate test_combining_one_empty_reader to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	1f77370d9e	Migrate test_combining_two_empty_readers to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	a702d0ec3f	Migrate test_combining_two_readers_with_one_reader_empty to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	9a5d6bd8af	Migrate test_combining_one_reader_with_many_partitions to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	13551e6f50	Migrate test_combining_two_readers_with_the_same_row to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 17:00:43 +01:00
Piotr Jastrzebski	024e01ad9e	mutation_source: Add constructors for sources that ignore forwarding Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 16:59:57 +01:00
Piotr Jastrzebski	85d2b24415	Stop using memtable::make_reader in row_cache_stress_test Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	129a282cbf	Stop using memtable::make_reader in row_cache_test Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	dc75df6353	Stop using memtable::make_reader in mutation_test Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	660086f2d6	Stop using memtable::make_reader in streamed_mutation_test Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	2a9cd5bffe	Stop using memtable::make_reader in memtable_snapshot_source.hh Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	b1676db658	Migrate test_virtual_dirty_accounting_on_flush to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	b90677272f	Migrate test_adding_a_column_during_reading_doesnt_affect_read_result to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	20e31e462e	Simplify flat_reader_assertions& produces(const mutation& m) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	ddecd385c1	Migrate test_partition_version_consistency_after_lsa_compaction_happens to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	b18c075470	Add next_mutation() to flat_mutation_reader_assertions Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	570703a169	read_mutation_from_flat_mutation_reader: don't take schema_ptr Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Piotr Jastrzebski	681dc26dd1	Migrate test_fast_forward_to_after_memtable_is_flushed to flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-21 11:47:07 +01:00
Tomasz Grabiec	71cc63dfa6	Merge "Fixes for multi_range_reader" from Paweł The following patches contain fixes for skipping to the next parititon in multi_range_reader and completelty dissable support for fast forwarding inside a single partition, which is not needed and would only add unnecessary complexity. * https://github.com/pdziepak/scylla.git fix-multi_range_reader/v1: flat_multi_range_mutation_reader: disallow streamed_mutation::forwarding flat_multi_range_mutation_reader: clear buffer on next_partition() tests/flat_multi_range_mutation_reader: test skipping to next partition	2017-12-21 11:06:57 +01:00
George Tavares	ceecd542cd	db/view: Consume updated rows regardless of static row Using Materialized Views, if the base table has static columns, and the update in base table mutates static and non static rows, the streamed_mutation is stopped before process non static row. The patch avoids stopping the stream_mutation and adds a test case. Message-Id: <20171220173434.25091-1-tavares.george@gmail.com>	2017-12-21 00:49:15 +01:00
Paweł Dziepak	da0655ab3c	tests/flat_multi_range_mutation_reader: test skipping to next partition	2017-12-20 16:08:09 +00:00
Tomasz Grabiec	dfe48bbbc7	range_tombstone_list: Fix insert_from() end_bound was not updated in one of the cases in which end and end_kind was changed, as a result later merging decision using end_bound were incorrect. end_bound was using the new key, but the old end_kind. Fixes #3083. Message-Id: <1513772083-5257-1-git-send-email-tgrabiec@scylladb.com>	2017-12-20 12:20:20 +00:00
Avi Kivity	2137d753b3	Merge "Serialize compaction of same size tier for different cfs" from Raphael "Currently, compaction manager will serialize compaction of same size tier (or weight) if they belong to the same column family. However, it fails to do so if the compaction jobs belong to different column families. That can lead to an ungodly amount of running compaction which gets worse the higher the number of shards and active column families. The problem is that it may affect overall system performance due to excessive resource usage. It's easy to trigger it during bootstraping after loading node with new sstables or repairing, or if lots of cfs are being actively written." Fixes #1295. * 'similar_sized_compaction_serialization_v4' of github.com:raphaelsc/scylla: sstables: remove column_family from compaction_weight_registration compaction_manager: serialize compaction of same size tier for different cfs sstables: introduces deregister() and weight() to compaction_weight_registration sstables: move compaction_weight_registration to its own header sstables: improve compact_sstables() interface	2017-12-19 16:32:27 +02:00
Tomasz Grabiec	37b19ae6ba	Merge "Migrate cache to use flat_mutation_reader" from Piotr	2017-12-18 17:53:20 +01:00
Piotr Jastrzebski	d756c49baf	Rename cache_streamed_mutation_test to cache_flat_mutation_reader_test Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	6e9b54cc77	Remove unused cache_streamed_mutation Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00

1 2 3 4 5 ...

1870 Commits