scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	44fdee3f2e	tests: mutation_assertions: Allow expecting fragments	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	1f23130b07	mutation_fragment: Implement equality check	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	116bcb8b30	tests: row_cache: Add test for population of random partitions	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	930a1415fe	tests: row_cache: Add test for partition tombstone population	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	9bfece6f82	tests: row_cache: Test reading randomly populated partition	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	0358334579	tests: row_cache: Add test_single_partition_update() [tgrabiec: Extracted from "row_cache: Introduce cache_streamed_mutation"]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	8bb76e2f12	tests: row_cache: Add test_scan_with_partial_partitions	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	896bf2e5de	Remove unused methods from MVCC Some apply methods where replaced by apply_to_incomplete(). Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	6f6575f456	row_cache: Enable partial partition population	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	5a0ae55f6d	Introduce schema_upgrader	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	1828e28bbb	database: Invalidate cache atomically with attaching streaming sstables Not doing so may cause reads to see partial writes, if another update+read happens in between.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	896196b841	database: Invalidate cache from seal_active_streaming_memtable_immediate() Cache must be synchronized atomically with changing the underlying mutation source, otherwise write atomicity may not hold.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	7ae40d7045	tests: Add test for update_invalidating()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	e792220c3a	row_cache: Introduce update_invalidating()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	c29878f49f	row_cache: Extract memtable walking logic from update() into do_update() So that it can be reused in update_invalidating().	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	6ebfb730ee	partition_entry: Introduce partition_tombstone() getter	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	fb62dfab02	tests: mvcc: Introduce test_schema_upgrade_preserves_continuity	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	164989a574	tests: mvcc: Add test for partition_entry::apply_to_incomplete()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	e433e68610	partition_entry: Make squashed() and upgrade() work with not fully continuous versions Those methods first create a neutral mutation_partition, and left-fold it with the versions. The problem is that there is no neutral element for static row continuity, the flag from the first addend always wins. We have to copy the flag from the first version to preserve the logical value.	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	b680de930c	partition_entry: Introduce apply_to_incomplete() Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> [tgrabiec: - extracted from a larger commit - fix heap comparator in apply_incomplete_target to order versions properly - extracted partition_version detaching into partition_entry::with_detached_versions() - dropped unnecessary rows_iterator::_version field - dropped unnecessary allocation of rows_entry and key copies in rows_iterator - dropped row_pointer - replaced apply_reversibly() with weaker and faster apply() - added handling of dummy entries at any position - fixed exception safety issue in apply_to_incomplete() which may result in data loss. We cannot move data out of applied versions into a new synthetic row and then apply it, because if exception happens in the middle, the data which was moved from the source will be lost. To fix that, row_iterator::consume_row() is introduced which allows in-place consumption of data without construction of temporary deletable_row. ]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	b6ce963200	partition_version: Introduce partition_entry::with_detached_versions()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	2d8f024e4d	partition_version: Document version merging rules on partition_entry	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	0770845a23	mutation_partition: Introduce r-value accepting deletable_row::apply()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	48a5b1d3ab	converting_mutation_partition_applier: Expose cell upgrade logic	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	04aebaa2cb	streamed_mutation: Introduce transform()	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	3755641c4b	row_cache: Introduce cache_streamed_mutation This streamed mutation populates cache with the rows requested by the read. It takes whatever it can find in the cache and fetches the remainings from underlying source. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> [tgrabiec: - fixed maybe_add_to_cache_and_update_continuity() leaking entries if the key already exists in the snapshot - fixed a problem where population race could result in a read missing some rows, because cache_streamed_mutation was advancing the cursor, then deferring, and then checking continuity. We should check continuity atomically with advancing. - fixed rows_handle.maybe_refresh() being accessed outside of update section in read_from_underlying() (undefined behavior) - fixed a problem in start_reading_from_underlying() where we would use incorrect start if lower_bound ended with a range tombstone starting before a key. - range tombstone trimming in add_to_buffer() could create a tombstone which has too low start bound if last_rt.end was a prefix and had inclusive end. invert_kind(end_kind) should be used instead of unconditional inc_start. - range tombstone trimming incorrectly assumed it is fine to trim the tombstone from underlying to the previous fragment's end and emit such tombstone. That would mean the stream can't emit any fragments which start before previous tombstone's end. Solve with range_tombstone_stream. - split add_to_buffer() into overloads for clustering_row, and range_tombstone. Better than wrapping into mutation_fragment before the call and having add_to_buffer() rediscover the information. - changed maybe_add_to_cache_and_update_continuity() to not set continuity to false for existing entries, it's not necessary - moved range tombstone trimming to range_tombstone class - moved range tombstone slicing code to range_tombstone_list and partition_snapshot - can_populate::can_use_cache was unused, dropped - dropped assumption that dummy entries are only at the end - renamed maybe_add_to_cache_and_update_continuity() to maybe_add_to_cache() - dropped no longer needed lower_bound class - extracted row_handle to a seaparate patch - made the copy-from-cache loop preemptable - split maybe_add_next_to_buffer_and_update_continuity(bool) - dropped cache_populator - replaced "underlying" class with use of read_context - replaced can_populate class with a function - simplified lsa_manager methods to avoid moves ]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	58a8022462	intrusive_set_external_comparator: Introduce insert_check()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	509a0d8a83	row_cache: Allow reading from underlying through read_context The interaction will be as follows: - Before creating cache_streamed_mutation for given partition, cache mutation reader sets up read_context for current partition (in one of two ways) so that the matching underlying streamed_mutation can be accessed at any time by cached_stream_mutation. - cache_streamed_mutation assumes that read_context is set up for current partition and invokes fast_forward_to() and get_next_fragment() to access the underlying streamed_mutation.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	69ea29131f	row_cache: Allow specifying desired snapshot in autoupdating_underlying_reader When reading from incomplete partition entry, we may discover we need to read something from the underlying mutation source. In such case we will fast forward this reader to that partition. But we must do it using a specific snapshot, the one we obtained when entering the partition, not the latest one.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	a1d3e0318c	row_cache: Store autoupdating_underlying_reader in read_context Will be reused for reading of incomplete partition entries.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	3f2320c377	row_cache: Store information whether query is a range query in read_context We will need to use this information later in yet another place, when creating a reader for incomplete cache entry. This refactors the code so that there is a single place which determines this fact.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	a2207ee9a6	row_cache: Move autoupdating_underlying_reader to read_context.hh	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	ca920bd0ef	row_cache: Keep only one streamed_mutation in scanning_and_populating_reader Currently scanning_and_populating_reader asks just_cache_scanning_reader for the next partition from cache, together with information if the range is continuous. If it's not, it saves the partition it got from it and moves on to reading from the underlying reader up to that partition. When that's done, it emits the stored partition. This approach won't work well with upcoming changes for storing partial partitions. We won't have whole partitions any more, so streamed_mutation returned for the entry needs to be prepared for reading from the underlying mutation source. We want to reuse the same underlying reader as much as possible, so all streamed_mutations for given read (read_context) will share the state of the underlying reader. Construction of a streamed_mutation will depend on the fact that the shared state is set up for it, so we cannot have two streamed_mutations prepared at the same time (one for entry from primary, and one for the earlier entry being populated). This change defers the creation of a streamed_mutation for the entry present in cache until the whole reader reaches it to avoid this problem. This will also have antoher potentially beneficial effect. Since we defer the decision about which snapshot to use until we reach the entry, there is a higher chance that the current snapshot of the entry will match the one used last by the populating read, and that we will be able to reuse the reader. It's implemented by utilizing a stable partition cursor which tracks its current position so that it's possible to revisit the cache entry (if it's still there) after population ends. The functionality of just_cache_scanning_reader was inlined into scanning_and_populating_reader.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	1b041298fe	range: Introduce trim_front()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	045888d5f3	row_cache: Introduce partition_range_cursor	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	e989d65539	dht: Make ring_position_view copyable dht::token needs to be stored as a pointer now and not a reference so that validity of old pointers is not impacted by in-place object construction which would occur in the copy-assignment operator. [1] says that old pointers can be used to access the new object only if the type "does not contain any non-static data member whose type is const-qualified or a reference type". [1] http://en.cppreference.com/w/cpp/language/lifetime#Storage_reuse	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	c3905bf235	row_cache: Print position instead of key of cache_entry	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	f0cc86e5db	row_cache: Introduce cache_entry::position()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	e3272526a1	row_cache: Allow comparing with ring_position views in row_cache::compare	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	5bfecaad99	row_cache: Switch invalidate_unwrapped() to use ring_position_view ranges It's needed before switching cache_entry ordering to rely solely on cache_entry::position() so that invalidate_unwrapped() never removes the dummy entry at the end. Currently if the range has upper bound like this: { ring_position::max(), inclusive=true } The code which selects entries for removal would include the dummy row at the end. It uses upper_bound() to get the end iterator, and the dummy entry has a position which is equal to the position in the bound. ring_position_view ranges are end-exclusive, so it's impossible to create a partition range which would include a dummy entry. The code is also simpler.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	64626b32b0	row_cache: Make printable	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	639af55a78	partition_version: Add versions() getter [tgrabiec: Use explicit return type]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	1d3fec43eb	partition_version: Make return type of versions() explicit	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	22a0e301f1	partition_version: Make is_referenced() const-qualified	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	a17fa5726f	Introduce streamed_mutation_from_forwarding_streamed_mutation This will allow conversion from streamed_mutation that supports fast forwarding to streamed_mutation that does not. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	11191b7aef	streamed_mutation: Introduce make_empty_streamed_mutation()	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	7c9569ec95	Introduce partition_snapshot_row_cursor	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	54b3da1910	row_cache: Introduce find_or_create() helper	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	f2d2c221d4	row_cache: Return cache_entry reference from do_find_or_create_entry Will be useful when additional action needs to be done on the entry after it was created or constructed.	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	0db6bdc916	row_cache: Introduce cache_entry constructor which constructs incomplete entry	2017-06-24 18:06:11 +02:00

1 2 3 4 5 ...

12364 Commits