scylladb

Author	SHA1	Message	Date
Tomasz Grabiec	3997421b2c	row_cache: Let the cleanup guard do invalidation of unmerged partitions	2016-02-26 16:57:31 +01:00
Tomasz Grabiec	aa15268249	row_cache: Delete the entry even if invalidation failed Otherwise we will leak it, and region destructor will fail: row_cache_test: utils/logalloc.cc:1211: virtual logalloc::region_impl::~region_impl(): Assertion `seg->is_empty()' failed. Fixes regression in row_cache_test.	2016-02-26 16:57:31 +01:00
Tomasz Grabiec	be24816c8a	row_cache: Clear partitions with region locked Since invalidate() may allocate, we need to take the region lock to keep m.partitions references valid around whole clear_and_dispose(), which relies on that.	2016-02-26 16:57:31 +01:00
Avi Kivity	fbe6961827	row_cache: run partition-touching operations of row_cache::update in a linearization context To avoid scattered keys (and values, though those are already protected) from being accessed, run the update procedure in a managed_bytes linearization context. Fixes #807.	2016-02-16 14:37:44 +02:00
Avi Kivity	ad58663c96	row_cache: reindent	2016-02-07 13:25:29 +02:00
Paweł Dziepak	490201fd1c	row_cache: protect against stale entries row_cache::update() does not explicitly invalidate the entries it failed to update in case of a failure. This could lead to inconsistency between row cache and sstables. In practice that's not a problem because before row_cache::update() fails it will cause all entries in the cache to be invalidated during memory reclaim, but it's better to be safe and explicitly remove entries that should be updated but it was not possible to do so. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1453829681-29239-1-git-send-email-pdziepak@scylladb.com>	2016-01-26 20:34:41 +01:00
Glauber Costa	f6cfb04d61	add a priority class to mutation readers SSTables already have a priority argument wired to their read path. However, most of our reads do not call that interface directly, but employ the services of a mutation reader instead. Some of those readers will be used to read through a mutation_source, and those have to be patched as well. Right now, whenever we need to pass a class, we pass Seastar's default priority class. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-01-25 15:20:38 -05:00
Tomasz Grabiec	d332fcaefc	row_cache: Restore indentation	2016-01-15 15:33:17 +01:00
Tomasz Grabiec	6b059fd828	row_cache: Guard against wrap-around range in make_reader()	2016-01-13 17:50:55 +01:00
Tomasz Grabiec	7fb0bc4e15	row_cache: Take the reclaim lock in invalidate() It's needed to keep the iterators valid in case eviction is triggered somewhere in between. It probably isn't because destructors should not allocate, but better be safe.	2016-01-13 17:50:55 +01:00
Tomasz Grabiec	50cc0c162e	row_cache: Make invalidate() handle wrap-around ranges Currently for wrap around the "begin" iterator would not meet with the "end" iterator, invoking undefined behavior in erase_and_dispose() which results in a crash. Fixes #785	2016-01-13 17:50:55 +01:00
Tomasz Grabiec	d81a46d7b5	column_family: Add schema setters There is one current schema for given column_family. Entries in memtables and cache can be at any of the previous schemas, but they're always upgraded to current schema on access.	2016-01-11 10:34:52 +01:00
Tomasz Grabiec	4e5a52d6fa	db: Make read interface schema version aware The intent is to make data returned by queries always conform to a single schema version, which is requested by the client. For CQL queries, for example, we want to use the same schema which was used to compile the query. The other node expects to receive data conforming to the requested schema. Interface on shard level accepts schema_ptr, across nodes we use table_schema_version UUID. To transfer schema_ptr across shards, we use global_schema_ptr. Because schema is identified with UUID across nodes, requestors must be prepared for being queried for the definition of the schema. They must hold a live schema_ptr around the request. This guarantees that schema_registry will always know about the requested version. This is not an issue because for queries the requestor needs to hold on to the schema anyway to be able to interpret the results. But care must be taken to always use the same schema version for making the request and parsing the results. Schema requesting across nodes is currently stubbed (throws runtime exception).	2016-01-11 10:34:52 +01:00
Tomasz Grabiec	036974e19b	Make mutation interfaces support multiple versions Schema is tracked in memtable and cache per-entry. Entries are upgraded lazily on access. Incoming mutations are upgraded to table's current schema on given shard. Mutating nodes need to keep schema_ptr alive in case schema version is requested by target node.	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	5184381a0b	memtable: Deconstify memtable in readers We want to upgrade entries on read and for that we need mutating permission.	2016-01-11 10:34:51 +01:00
Paweł Dziepak	59245e7913	row_cache: add functions for invalidating entries in cache Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-15 13:21:11 +01:00
Tomasz Grabiec	7c3e6c306b	row_cache: Wait for in-flight populations on update Before this change, populations could race with update from flushed memtable, which might result in cache being populated with older data. Populations started before the flush are not considering the memtable nor its sstable. The fix employed here is to make update wait for populations which were started before the flushed memtable's sstable was added to the underlying data source. All populations started after that are guaranteed to see the new data.	2015-11-29 16:25:21 +01:00
Paweł Dziepak	513ab87b47	row_cache: update hit and miss stats in scanning reader Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:25:02 +03:00
Paweł Dziepak	b1b830bcbb	row_cache: merge cache_entry::compare and ring_position_compare Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:25:02 +03:00
Paweł Dziepak	c2b53c5282	row_cache: add scanning_and_populating_reader This reader enables range queries on row cache. An underlying key_reader is used to obtain information about partitions that belong to the specified range and if any of them isn't in the cache an underlying mutation reader is used to read the missing data. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-20 20:27:53 +02:00
Paweł Dziepak	9abefbfa28	row_cache: add just_cache_scanning_reader This mutation reader returns mutations from cache that are in a given range. There may be other mutations in the system (e.g. in sstables) that won't be returned, so this reader on its own cannot really satisfy any query. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-20 20:27:53 +02:00
Paweł Dziepak	c765b38599	row_cache: add modification counter Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-20 20:27:53 +02:00
Paweł Dziepak	c1e95dd893	row_cache: pass underlying key_source to row_cache Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-20 20:27:53 +02:00
Pekka Enberg	ac4007153d	row_cache: Implement clear() helper We need to clear the row cache for column family truncate operation.	2015-09-30 09:09:42 +02:00
Tomasz Grabiec	4712af2c21	row_cache: Use allocating_section in row_cache::populate() Cache has a tendency to eat up all available memory. It is evicted on-demand, but this happens at certain points in time (during large allocation requests). Small allocations which are served from small object pools won't usually trigger this. Large allocations happen for example when LSA region needs a new segment, eg. when row cache is populated. If large allocations happen for certain period only inside row_cache::update(), then eviction will not be able to make forward progress because cache's LSA region is locked inside row_cache::update(). While it's locked, data can't be evicted from it. The solution is to use allocating_section. Fixes #376.	2015-09-21 13:25:13 +03:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Avi Kivity	9dbe8ca1b5	row_cache: reduce cpu impact of memtable flush Restrict the impact of flushing a memtable to row_cache to 20% of the cpu. This is accomplished by converting the code to a thread (with bad indentation to improve patch readability) and using a thread scheduling group.	2015-09-19 09:22:52 +03:00
Tomasz Grabiec	91e7dcfe10	row_cache: Don't count insertions and merges as hits and misses Currently cache update from a flushed memtable affects hits and misses, which may be confusing. Let's reserve hits and misses for reads. Cache update will affect counters called "insertions" and "merges".	2015-09-10 12:41:27 +03:00
Tomasz Grabiec	f64ac3a80e	row_cache: Extract scanning reader construction	2015-09-10 12:41:27 +03:00
Tomasz Grabiec	447e59eaf9	row_cache: Expose a metric for the number of cached partitions Fixes #193.	2015-09-10 12:41:12 +03:00
Tomasz Grabiec	433a298f60	row_cache: Extract comparator construction before the loop	2015-09-07 09:41:36 +02:00
Tomasz Grabiec	122bd8ea46	row_cache: Restore indentation	2015-09-06 21:25:44 +02:00
Tomasz Grabiec	d1f89b4eab	row_cache: Use allocation_section See #259. When transferring mutations between memtable and cache, lsa sometimes runs out of memory. This solves the first two points, keeping reserve filled up and adjusting the amount of reserve based on execution history.	2015-09-06 21:25:44 +02:00
Tomasz Grabiec	7efcde12aa	row_cache: Introduce row_cache::touch() Useful in tests for ensuring that certain entries survive eviction.	2015-09-06 21:25:44 +02:00
Tomasz Grabiec	24a5221280	row_cache: Avoid leaking of partitions when exception is thrown inside update()	2015-09-06 21:25:44 +02:00
Tomasz Grabiec	c82325a76c	lsa: Make region evictor signal forward progress In some cases region may be in a state where it is not empty and nothing could be evicted from it. For example when creating the first entry, reclaimer may get invoked during creation before it gets linked. We therefore can't rely on emptiness as a stop condition for reclamation, the eviction function shall signal us if it made forward progress.	2015-09-06 21:25:44 +02:00
Tomasz Grabiec	802a9db9b0	Fix spelling of 'definitely_doesnt_exist'	2015-09-06 21:24:58 +02:00
Tomasz Grabiec	870e9e5729	lsa: Replace compaction_lock with broader reclaim_lock Disabling compaction of a region is currently done in order to keep the references valid. But disabling only compaction is not enough, we also need to disable eviction, as it also invalidates references. Rather than introducing another type of lock, compaction and eviction are controlled together, generalized as "reclaiming" (hence the reclaim_lock).	2015-09-01 17:29:04 +03:00
Tomasz Grabiec	d20fae96a2	lsa: Make reclaimer run synchronously with allocations The goal is to make allocation less likely to fail. With async reclaimer there is an implicit bound on the amount of memory that can be allocated between deferring points. This bound is difficult to enforce though. Sync reclaimer lifts this limitation off. Also, allocations which could not be satisfied before because of fragmentation now will have higher chances of succeeding, although depending on how much memory is fragmented, that could involve evicting a lot of segments from cache, so we should still avoid them. Downside of sync reclaiming is that now references into regions may be invalidated not only across deferring points but at any allocation site. compaction_lock can be used to pin data, preferably just temporarily.	2015-08-31 21:50:18 +02:00
Tomasz Grabiec	295ec3bfae	row_cache: Add cache.bytes.total statistic	2015-08-31 21:50:17 +02:00
Avi Kivity	7090dffe91	mutation_reader: switch to a class based implementation Using a lambda for implementing a mutation_reader is nifty, but does not allow us to add methods. Switch to a class-based implementation in anticipation of adding a close() method.	2015-08-31 15:53:53 +03:00
Tomasz Grabiec	f2713561f5	row_cache: Avoid copy when moving whole entry from memtable to cache	2015-08-25 17:07:34 +03:00
Tomasz Grabiec	cb72d02c98	row_cache: Rename underlying_negative to presence_checker	2015-08-25 17:07:33 +03:00
Avi Kivity	4390be3956	Rename 'negative_mutation_reader' to 'partition_presence_checker' Suggested by Tomek.	2015-08-24 18:03:22 +03:00
Avi Kivity	0afbdf4aa7	Merge "Add row related methods to the cache_service API" from Amnon "This series exposes statistics from the row_cache in the cache_service API. After this series the following methods will be available: get_row_hits get_row_requests get_row_hit_rate get_row_size get_row_entries"	2015-08-23 15:46:07 +03:00
Avi Kivity	bcff75003e	row_cache: yield while moving data to cache If we don't yield, we can run out of memory while moving a memtable into cache. This reduces the chance that writing an sstable will fail because we could not transfer the memtable into the cache.	2015-08-19 19:36:41 +03:00
Amnon Heiman	0e1aa2e78b	Expose the cache tracker and the num_entries in row_cache This exposes the cache tracker and the num entries in the row cache so they can be used by the API. It also adds a const getter for the region. Both are const and are used for inspecting only. Signed-off-by: Amnon Heiman <amnon@cloudius-systems.com>	2015-08-17 19:42:23 +03:00
Avi Kivity	1016b21089	cache: improve preloading of flushed memtable mutations If a mutation definitely doesn't exist in all sstables, then we can certainly load it into the cache.	2015-08-09 22:46:08 +03:00
Tomasz Grabiec	ef549ae5a5	lsa: Reclaim space from evictable regions incrementally When LSA reclaimer cannot reclaim more space by compaction, it will reclaim data by evicting from evictable regions. Currently the only evictable region is the one owned by the row cache.	2015-08-08 09:59:24 +02:00
Tomasz Grabiec	7a8f1ef6c3	row_cache: Replace _lru_len counter with region occupancy _lru_len may get stale when row_cache instance goes out of scope purging all its partitions from cache. I'm assuming we're not really interested in the number of partitions here, but rather a measure of occupancy, so I applied a simple fix of using LSA region occupancy instead.	2015-08-08 09:59:24 +02:00

58 Commits