scylladb

Author	SHA1	Message	Date
Glauber Costa	5140aaea00	add a timeout to fast forward to In the last patch, we enabled per-request timeouts, we enable timeouts in fill_buffer. There are many places, though, in which we fast_forward_to before we fill_buffer, so in order to make that effective we need to propagate the timeouts to fast_forward_to as well. In the same way as fill_buffer, we make the argument optional wherever possible in the high level callers, making them mandatory in the implementations. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-01-12 07:43:19 -05:00
Glauber Costa	d965af42b0	add a timeout to fill_buffer As part of the work to enable per-request timeouts, we enable timeouts in fill_buffer. The argument is made optional at the main classes, but mandatory in all the ::impl versions. This way we'll make sure we didn't forget anything. At this point we're still mostly passing that information around and don't have any entity that will act on those timeouts. In the next patch we will wire that up. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-01-11 12:07:41 -05:00
Duarte Nunes	2618209c2d	Remove obsolete includes and fix build move.hh was deleted, but files weren't updated to reflect that. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-12-28 12:03:44 +00:00
Tomasz Grabiec	7b36c8423c	row_cache: Fix single_partition_populating_reader not waiting on create_underlying() to resolve Results in undefined behavior. Message-Id: <1513691679-27081-1-git-send-email-tgrabiec@scylladb.com>	2017-12-19 16:12:11 +02:00
Raphael S. Carvalho	928beae242	Fix compilation of db/hints/manager.cc and row_cache.cc compiler: gcc (GCC) 6.3.1 20161221 (Red Hat 6.3.1-1) Problems introduced in `f6a461c7a4` and `37b19ae6ba`, respectively. They both fail to compile due to use of method in lambda without explicit mention of this. Some of failure is fixed by not using auto in lambda parameter. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20171218222144.12297-1-raphaelsc@scylladb.com>	2017-12-19 11:15:45 +01:00
Piotr Jastrzebski	14d98aaa0b	Rename row_cache::create_underlying_flat_reader to create_underlying_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	49993e56a9	Remove unused row_cache::create_underlying_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	b976872c1a	Rename all _underlying_flat methods in read_context to _underlying. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	1457a3d771	Rename cache_entry::read_flat to cache_entry::read Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	8b796a884f	Rename read_context::enter_flat_partition to enter_partition Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	8d37b71843	Rename autoupdating_underlying_flat_reader to autoupdating_underlying_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:57 +01:00
Piotr Jastrzebski	893e434207	Stop using autoupdating_underlying_reader in read_context Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	6e9b54cc77	Remove unused cache_streamed_mutation Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	df17bad13b	Remove unused cache_entry::read and do_read Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	003670c3cd	Remove unused read_directly_from_underlying overload Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	9fab29be82	Rename _sm to _reader in scanning_and_populating_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	610fa7a2c2	Stop using streamed_mutation in scanning_and_populating_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	3153d5d2c2	Rename _sm to _reader in single_partition_populating_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	556edfab29	Stop using streamed mutation in single_partition_populating_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	fec4468669	Add read_directly_from_underlying that returns flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:37:56 +01:00
Piotr Jastrzebski	47eb609aeb	Change fill_buffer_from_streamed_mutation to fill_buffer_from that can handle both streamed_mutation and flat_mutation_reader as source. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 16:24:16 +01:00
Piotr Jastrzebski	880623e2e9	Use cache_entry::read_flat in make_flat_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:33 +01:00
Piotr Jastrzebski	a9b6551584	Add cache_entry::read_flat Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:33 +01:00
Piotr Jastrzebski	a322268416	Turn cache_flat_mutation_reader into a flat reader. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:33 +01:00
Piotr Jastrzebski	714868db2d	Use autoupdating_underlying_flat_reader in read_context and add read_context::enter_flat_partition. This will temporarily coexist with read_context::enter_partition but after everything in cache is migrated to flat reader the new method will replace old one. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:09 +01:00
Piotr Jastrzebski	bf4e1c0c54	Add row_cache::create_underlying_flat_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:09 +01:00
Piotr Jastrzebski	16a0d306fd	Turn scanning_and_populating_reader into flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:09 +01:00
Piotr Jastrzebski	656e8622e1	Turn single_partition_populating_reader into flat reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-18 13:28:09 +01:00
Piotr Jastrzebski	ceaf0dee99	Introduce row_cache::make_flat_reader Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2017-12-14 12:49:39 +01:00
Tomasz Grabiec	12704fd679	mvcc: Propagate region reference to partition_entry::apply_to_incomplete()	2017-12-08 17:50:48 +01:00
Tomasz Grabiec	1971332195	row_cache: Fix exception safety of cache_entry::read() When we fail, we need to return streamed_mutation back, so that the operation can be retried. Causes SIGSEGV on nullptr otherwise on bad_alloc.	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	11a195c403	row_cache: scanning_and_populating_reader: Fix exception unsafety causing read to skip data If assignment to _lower_bound in the "_secondary_in_progress = true;" case in do_read_from_primary() throws due to allocation failure, the update section will be retried and we will take the not_moved path, skipping the range which was discontinuous and was supposed to be read from underlying. Fix by redoing lookup using _lower_bound in case the section is retried. When we retry, _primary.valid() will be false. We need to ensure now that _lower_bound is always valid. Fixes #2944.	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	5dc1ee41e4	row_cache: partition_range_cursor: Extract valid() and advance_to() from refresh()	2017-11-13 20:55:14 +01:00
Tomasz Grabiec	09c49b2db3	cache_streamed_mutation: Add trace-level logging to cache_streamed_mutation	2017-11-13 20:55:14 +01:00
Glauber Costa	1d7617723d	row cache: pin real dirty during cache updates. Right now, once a region is moved to the cache is no longer visible to the dirty memory system. Not as real dirty nor virtual dirty. The problem is that until a particular partition is moved to the cache it is not evictable. As a result we can OOM the system if we have a lot of pending cache updates as the writes will not be throttled and memory won't be made available. This patch pins the memory used by the region as real dirty before the cache update starts, and unpins it when it is over. In the mean time it gradually releases memory of the partitions that are being moved to cache. I have verified in a couple of workloads that the amount of memory accounted through this is the same amount of memory accounted through the memtable flush procedure. Fixes #1942 Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 19:46:36 -05:00
Glauber Costa	b836005555	row_cache: modernize use of seastar threads For a while now we have an async() function, that simplifies the code by not needing to issue an explicit join. This patch converts the row cache to use async() as well, which most of our code already does. Doing so will make it easier to make changes to update_cache. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Tomasz Grabiec	083b9cddef	row_cache: Fix handling of concurrent partition population This fixes a regression introduced in `27a3b4bca9` (master only). partition_range_cursor assumes that as long as references are valid, _end is valid as well. But if new entries were inserted before _end, it may not, if the new entries fall after the query range. This may result in reads returning partitions from outside the query range. Message-Id: <1507815478-20269-1-git-send-email-tgrabiec@scylladb.com>	2017-10-12 15:55:20 +01:00
Piotr Jastrzebski	6069bab755	Cache single queries to non-existing partitions This way we don't need to query sstables again when the query is repeated. Fixes #1533 Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <8f8559ff19c534dbbb7c9ef6c28271cec607ba20.1506521461.git.piotr@scylladb.com>	2017-09-27 16:15:18 +02:00
Tomasz Grabiec	0911fbbdef	row_cache: Fix row_cache::update_invalidating() evict() doesn't guarantee that the whole partition is discontinuous. In particular, partition tombstone cannot be marked as discontinuous. The parts which are still continuous must be updated. Broken after `c78047fa5b`. Message-Id: <1505375684-28574-1-git-send-email-tgrabiec@scylladb.com>	2017-09-14 10:58:25 +03:00
Tomasz Grabiec	c78047fa5b	row_cache: Evict partition snapshots If snapshots are not evicted, they may pin unbouned amount of memory for a long time in cache, which may lead to OOM. Evict snapshots together with the entry. Fixes #2775. Fixes #2730.	2017-09-13 17:47:03 +02:00
Tomasz Grabiec	adb159d51b	row_cache: Reuse allocation_strategy::invalidate_references() Modification count in the tracker is redundant, we can rely on allocator's invalidation counter.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	27a3b4bca9	row_cache: Don't invalidate references on insertion modification_count is currently only used to detect invalidation of references, intended to be incremented on erasure. Insertion into intrusive set doesn't invalidate references, so no need to increment the counter.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	2df6f356b1	mvcc: Store LSA region reference in partition_snapshot Will be useful for improving encapsulation.	2017-09-13 17:38:08 +02:00
Tomasz Grabiec	d22fdf4261	row_cache: Improve safety of cache updates Cache imposes requirements on how updates to the on-disk mutation source are made: 1) each change to the on-disk muation source must be followed by cache synchronization reflecting that change 2) The two must be serialized with other synchronizations 3) must have strong failure guarantees (atomicity) Because of that, sstable list update and cache synchronization must be done under a lock, and cache synchronization cannot fail to synchronize. Normally cache synchronization achieves no-failure thing by wiping the cache (which is noexcept) in case failure is detect. There are some setup steps hoever which cannot be skipped, e.g. taking a lock followed by switching cache to use the new snapshot. That truly cannot fail. The lock inside cache synchronizers is redundant, since the user needs to take it anyway around the combined operation. In order to make ensuring strong exception guarantees easier, and making the cache interface easier to use correctly, this patch moves the control of the combined update into the cache. This is done by having cache::update() et al accept a callback (external_updater) which is supposed to perform modiciation of the underlying mutation source when invoked. This is in-line with the layering. Cache is layered on top of the on-disk mutation source (it wraps it) and reading has to go through cache. After the patch, modification also goes through cache. This way more of cache's requirements can be confined to its implementation. The failure semantics of update() and other synchronizers needed to change due to strong exception guaratnees. Now if it fails, it means the update was not performed, neither to the cache nor to the underlying mutation source. The database::_cache_update_sem goes away, serialization is done internally by the cache. The external_updater needs to have strong exception guarantees. This requirement is not new. It is however currently violated in some places. This patch marks those callbacks as noexcept and leaves a FIXME. Those should be fixed, but that's not in the scope of this patch. Aborting is still better than corrupting the state. Fixes #2754. Also fixes the following test failure: tests/row_cache_test.cc(949): fatal error: in "test_update_failure": critical check it->second.equal(*s, mopt->partition()) has failed which started to trigger after commit `318423d50b`. Thread stack allocation may fail, in which case we did not do the necessary invalidation.	2017-09-04 10:04:29 +02:00
Tomasz Grabiec	b0f3efa577	row_cache: Extract invalidate_sync()	2017-09-04 10:04:29 +02:00
Tomasz Grabiec	56e3ce05db	row_cache: Don't require presence checker to be supplied externally The API is simpler and safer this way.	2017-09-04 10:04:29 +02:00
Tomasz Grabiec	1a2f17d42c	row_cache: Make populate() preserve continuity	2017-09-04 10:04:29 +02:00
Tomasz Grabiec	bc3112a187	row_cache: Allow marking as fully continuous on construction Will be needed in tests.	2017-09-04 10:04:29 +02:00
Avi Kivity	48b9e47f7d	Revert "row_cache: Add missing handling for failures happening outside the updating thread" This reverts commit `f9feb310ab` (requested by author).	2017-08-29 19:26:02 +03:00
Tomasz Grabiec	f9feb310ab	row_cache: Add missing handling for failures happening outside the updating thread Thread stack allocation may fail, in which case we did not do the necessary invalidation. Fix by hoisting the scope of the cleanup function. Also fixes the following test failure: tests/row_cache_test.cc(949): fatal error: in "test_update_failure": critical check it->second.equal(*s, mopt->partition()) has failed which started to trigger after commit `318423d50b`. Message-Id: <1504023113-30374-2-git-send-email-tgrabiec@scylladb.com>	2017-08-29 19:17:22 +03:00

1 2 3 4 5

214 Commits