scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-19 16:15:07 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	09c49b2db3	cache_streamed_mutation: Add trace-level logging to cache_streamed_mutation	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	bd7b68f877	cache_streamed_mutation: Make advancing to the next range exception-safe Changing _ck_ranges_curr and _lower_bound should be atomic, either both fail or both succeed. Currently it could happen that if position_in_partition::for_range_start() fails, _ck_ranges_curr would be advanced but _lower_bound not.	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	081deec731	cache_streamed_mutation: Make add_clustering_row_to_buffer() exception-safe We need to maintain the following invariants: (1) no fragment with position >= _lower_bound was pushed yet (2) If _lower_bound > mf.position(), mf was emitted Before this patch (1) could be violated if drain_tombstones() failed in the middle. (2) could be violated if push_mutation_fragment() failed.	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	d1b844737a	cache_streamed_mutation: Make drain_tombstones() exception-safe If push_mutation_fragment() failed, mfo which we got from get_next() would be lost. Fix by making sure push_mutation_fragment() won't fail.	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	875fc93956	cache_streamed_mutation: Return void from start_reading_from_underlying() The return value is no longer used.	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	5fb319bbb9	cache_streamed_mutation: Document invariants related to exception-safety	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	bbca83d4c0	cache: Make range tombstone merging exception-safe range_tombstone_list::apply() has no exception safety guarantees about the logical state. The target mutation_partition in cache should be assumed to be left in unspecified state. In particular, some of the preexisting overlapping tombstones may be removed and not reinserted, so the cache would be missing some of the range tombstone information in case the whole allocating section fails. Use apply_monotonically() which provides the needed guarantees. Fixes #2938.	2017-11-07 15:33:24 +01:00
Tomasz Grabiec	a76202df4f	cache: Document invariants of cache_streamed_mutation::_lower_bound (cherry picked from commit b52813279d30782270ac83856233f18787b28b7e)	2017-11-02 12:16:17 +01:00
Tomasz Grabiec	328faf695e	cache_streamed_mutation: Special-case population for singular ranges This is an optimization which avoids creating dummy entries around row entry when populating a singular range.	2017-11-02 12:16:09 +01:00
Tomasz Grabiec	0fd57cdff5	cache_streamed_mutation: Increment mispopulation counter when can't populate due to eviction	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	8c41a3eb43	cache_streamed_mutation: Override continuity of older versions when populating Fixes the case of continuity not being populated when the row which is the upper bound of the population range belongs to a non-latest version. In such case we wouldn't mark the range as continuous, because we can't modify rows of non-latest versions. To fix this, create an empty entry in latest version which will just override the continuity flag of the old entry.	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	65ed490e1c	cache_streamed_mutation: Mark whole query range as continuous Before this patch only ranges between returned row fragments were marked as continuous. In the extreme case, there could be no such fragments, in which case next read would miss as well. To avoid this, mark whole query range as continuous by inserting dummy entries when necessary. Refs #2579.	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	d4928eb1b7	cache_streamed_mutation: Populate continuity when range adjacent to non-latest version rows Current code will not mark the range as continuous if the previous entry does not come in the latest version. Fix that by switching to partition_snapshot_row_pointer, which is capable of checking in older versions as necessary. Also, we avoid the key comparison if we know that the iterator is still valid.	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	835d17ee37	cache_streamed_mutation: Avoid lookup in maybe_add_to_cache() in more cases	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	b4e3c0946a	cache_streamed_mutation: Avoid copy of decorated_key Message-Id: <1509060503-17483-1-git-send-email-tgrabiec@scylladb.com>	2017-10-26 16:51:27 -07:00
Tomasz Grabiec	44faaafc29	cache_streamed_mutation: Read static row with cache region locked _snp->static_row() allocates and needs reference stability. Message-Id: <1507555031-11567-1-git-send-email-tgrabiec@scylladb.com>	2017-10-09 15:55:53 +01:00
Tomasz Grabiec	e83cd508f6	cache_streamed_mutation: Add missing _next_row.maybe_refresh() call We were checking if the cursor is up_to_date(), but this is not enough to guarantee that the cursor is valid, merely that its iterators are valid. The cursor may be invalidated even if its iterators are valid if there was an insertion after cursor's position. Fixes #2834.	2017-09-25 11:21:58 +02:00
Tomasz Grabiec	09d99b0358	mvcc: partition_snapshot_row_cursor: Rename up_to_date() to iterators_valid()	2017-09-25 11:21:58 +02:00
Tomasz Grabiec	7b5b461067	cache_streamed_mutation: De-futurize cursor movement start_reading_from_underlying() doesn't return future<> any more, so we can simplify this.	2017-09-15 15:41:55 +02:00
Tomasz Grabiec	22019577cc	cache_streamed_mutation: Call fast_forward_to() outside allocating section On bad_alloc the section is retried. If the exception happened inside fast_forward_to() on the underlying reader, that call will be retried. However, the reader should not be used after exception is thrown, since it is in unspecified state. Also, calling fast_forward_to() with cache region locked increases the chances of it failing to allocate. We shouldn't call fast_forward_to() with the cache region locked. Fixes #2791.	2017-09-15 15:41:55 +02:00
Tomasz Grabiec	3b790a1e80	cache_streamed_mutation: Switch from flags to explicit state machine We're in one state at a time, so it's better to express it as a single variable rather than N independent flags. In preparation before adding more states.	2017-09-15 15:41:55 +02:00
Tomasz Grabiec	fa2c26342c	row_cache: Handle eviction in partition reader	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	cda86abdbc	mvcc: Encapsulate reference stability check in partition_snapshot	2017-09-13 17:38:08 +02:00
Paweł Dziepak	b4d1dea4a9	cache: use equality comparators instead of tri_compare Equality comparator may be much cheaper than the fully fledged trichotomic comparator, especially if the component types are byte order equal but not byte order comparable.	2017-07-26 14:38:27 +01:00
Paweł Dziepak	2780555968	cache: short-circut static row logic if there are no static columns	2017-07-26 14:38:27 +01:00
Paweł Dziepak	c2ec43f70b	cache_streamed_mutation: use consumer based read_context reader	2017-07-26 14:38:21 +01:00
Paweł Dziepak	9bc6038ff3	cache_streamed_mutation: avoid moving clustering_row clustering_row can stores quite a lot of data internally which makes its move constructor not exactly cheap. If possible it is better to move mutation_fragment around as it keeps everything externally. This also avoids some cases when clustering row would be extracted from mutation_fragment only to be made to create another mutation_fragment later.	2017-07-26 14:36:37 +01:00
Paweł Dziepak	6572f38450	cache: fix aborts if no clustering range is specified cache_streamed_mutation assumed that at least one clustering range was specified. That was wrong since the readers are allowed to query just for a static row (e.g. counter update that modifies only static columns). Fixes #2604. Message-Id: <20170725131220.17467-1-pdziepak@scylladb.com>	2017-07-25 15:27:48 +02:00
Tomasz Grabiec	60c2a86192	row_cache: Track mispopulations also at row level	2017-07-04 13:55:06 +02:00
Tomasz Grabiec	94547db620	row_cache: Track row insertions	2017-07-04 13:55:06 +02:00
Tomasz Grabiec	a58f2c8640	row_cache: Track row hits and misses	2017-07-04 13:55:06 +02:00
Piotr Jastrzebski	3755641c4b	row_cache: Introduce cache_streamed_mutation This streamed mutation populates cache with the rows requested by the read. It takes whatever it can find in the cache and fetches the remainings from underlying source. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> [tgrabiec: - fixed maybe_add_to_cache_and_update_continuity() leaking entries if the key already exists in the snapshot - fixed a problem where population race could result in a read missing some rows, because cache_streamed_mutation was advancing the cursor, then deferring, and then checking continuity. We should check continuity atomically with advancing. - fixed rows_handle.maybe_refresh() being accessed outside of update section in read_from_underlying() (undefined behavior) - fixed a problem in start_reading_from_underlying() where we would use incorrect start if lower_bound ended with a range tombstone starting before a key. - range tombstone trimming in add_to_buffer() could create a tombstone which has too low start bound if last_rt.end was a prefix and had inclusive end. invert_kind(end_kind) should be used instead of unconditional inc_start. - range tombstone trimming incorrectly assumed it is fine to trim the tombstone from underlying to the previous fragment's end and emit such tombstone. That would mean the stream can't emit any fragments which start before previous tombstone's end. Solve with range_tombstone_stream. - split add_to_buffer() into overloads for clustering_row, and range_tombstone. Better than wrapping into mutation_fragment before the call and having add_to_buffer() rediscover the information. - changed maybe_add_to_cache_and_update_continuity() to not set continuity to false for existing entries, it's not necessary - moved range tombstone trimming to range_tombstone class - moved range tombstone slicing code to range_tombstone_list and partition_snapshot - can_populate::can_use_cache was unused, dropped - dropped assumption that dummy entries are only at the end - renamed maybe_add_to_cache_and_update_continuity() to maybe_add_to_cache() - dropped no longer needed lower_bound class - extracted row_handle to a seaparate patch - made the copy-from-cache loop preemptable - split maybe_add_next_to_buffer_and_update_continuity(bool) - dropped cache_populator - replaced "underlying" class with use of read_context - replaced can_populate class with a function - simplified lsa_manager methods to avoid moves ]	2017-06-24 18:06:11 +02:00

32 Commits