scylladb

Author	SHA1	Message	Date
Glauber Costa	d41fcd45d1	memtables: make memtable inherit from region The LSA memory pressure mechanism will let us know which region is the best candidate for eviction when under pressure. We need to somehow then translate region -> memtable -> column family. The easiest way to convert from region to memtable, is having memtable inherit from region. Despite the fact that this requires multiple inheritance, which always raise a flag a bit, the other class we inherit from is enable_shared_from_this, which has a very simple and well defined interface. So I think it is worthy for us to do it. Once we have the memtable, grabing the column family is easy provided we have a database object. We can grab it from the schema. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-07-05 15:05:29 -04:00
Paweł Dziepak	6871bd5fa0	memtable: fully support streamed_mutations Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:52 +01:00
Paweł Dziepak	2ab1a73efa	memtable: rename partition_entry to memtable_entry partition_entry is going to be a more general object used by both cache and memtable entries. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:51 +01:00
Paweł Dziepak	737eb73499	mutation_reader: make readers return streamed_mutations Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:50 +01:00
Piotr Jastrzebski	dcba6f5c45	Pass clustering_row_ranges to mutation readers. This will allow readers to reduce the amount of data read. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 14:36:57 +02:00
Piotr Jastrzebski	23c23abe53	Make memtable mutation_reader slice using clustering ranges. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 11:46:41 +02:00
Piotr Jastrzebski	484d2ecd0a	Slice data with clustering key range in sstable reader Add additional parameters to mp_row_consumer to be able to fetch only cells for given clustering key ranges This will be used in row_cache when it will work on clustering key level instead of partition key level. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 11:46:30 +02:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Glauber Costa	336babfcb8	database: add a priority class to a few SSTable readers Not all SSTable readers will end up getting the right tag for a priority class. In particular, the range reader, also used for the memtables complete ignores any priority class. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-02-24 18:00:34 -05:00
Glauber Costa	80ab41a715	memtable reader: also include a priority class There are situations when a memtable is already flushed but the memtable reader will continue to be in place, relaying reads to the underlying table. For that reason, the "memtables don't need a priority class" argument gets obviously broken. We need to pass a priority class for its reader as well. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-02-24 18:00:34 -05:00
Avi Kivity	d415167496	memtable: use managed_bytes linearization context when applying mutations Ensures that we don't access scattered keys when looking up stuff.	2016-02-16 14:37:46 +02:00
Glauber Costa	15336e7eb7	key_source: turn it into a class Its definition as a lambda function is inconvenient, because it does not allow us to use default values for parameters. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-01-25 15:20:38 -05:00
Glauber Costa	58fdae33bd	mutation_source: turn it into a class Its definition as a lambda function is inconvenient, because it does not allow us to use default values for parameters. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2016-01-25 15:20:38 -05:00
Tomasz Grabiec	d81a46d7b5	column_family: Add schema setters There is one current schema for given column_family. Entries in memtables and cache can be at any of the previous schemas, but they're always upgraded to current schema on access.	2016-01-11 10:34:52 +01:00
Tomasz Grabiec	4e5a52d6fa	db: Make read interface schema version aware The intent is to make data returned by queries always conform to a single schema version, which is requested by the client. For CQL queries, for example, we want to use the same schema which was used to compile the query. The other node expects to receive data conforming to the requested schema. Interface on shard level accepts schema_ptr, across nodes we use table_schema_version UUID. To transfer schema_ptr across shards, we use global_schema_ptr. Because schema is identified with UUID across nodes, requestors must be prepared for being queried for the definition of the schema. They must hold a live schema_ptr around the request. This guarantees that schema_registry will always know about the requested version. This is not an issue because for queries the requestor needs to hold on to the schema anyway to be able to interpret the results. But care must be taken to always use the same schema version for making the request and parsing the results. Schema requesting across nodes is currently stubbed (throws runtime exception).	2016-01-11 10:34:52 +01:00
Tomasz Grabiec	036974e19b	Make mutation interfaces support multiple versions Schema is tracked in memtable and cache per-entry. Entries are upgraded lazily on access. Incoming mutations are upgraded to table's current schema on given shard. Mutating nodes need to keep schema_ptr alive in case schema version is requested by target node.	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	8a05b61d68	memtable: Read under _read_section	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	5184381a0b	memtable: Deconstify memtable in readers We want to upgrade entries on read and for that we need mutating permission.	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	32ac2ccc4a	memtable: Introduce apply(memtable&)	2015-11-29 16:25:21 +01:00
Avi Kivity	a40a62d840	memtable: use allocating_section to guard allocations Without this, an allocation can fail, and we may not be able to reclaim memory.	2015-11-16 10:56:06 +02:00
Paweł Dziepak	b0edaa5bb7	memtable: add as_key_source() Needed only for tests. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-20 20:27:53 +02:00
Nadav Har'El	1a4c8db71a	scanning_reader: fix bug on still-being-written memtable scanning_reader has a bug in its range support when it iterates over a memtable which is still open, and thus might still be modified between calls to the read function. This caused, among other things, issue #368 - where repair was reading a memtable which was still open and being written to (by a stream from a a remote node). The problem is that scanning_reader has an optimization so it can avoid comparing the current partition with the range's end on every iteration: It finds, once, a pointer to the element past the end of the range (the so-called "upper bound"), and saves this pointer in _end. Then at every iteration, we can just compare pointers. But If partitions are added to the memtable, the _end we saved is no longer relevant: It still points to a valid partition, but this partition which was once the first partition after the range, may now be precedeed by many new partitions, which may be now returned despite being after the range's end. The fix is to re-calculate "_end" if partitions were added to the memtable. Moreover, we also need to re-calculate "_i" in this case - the current code calculates in one iteration a pointer, _i, to the element to be returned in the next iteration. If additional partitions were added in the meantime, we may need to return them. Because it's impossible to delete partitions from a memtable (just to add new ones or modify existing ones), we can trivially figure out if new partitions were added, using _memtable->partition_count(). Because boost::intrusive::set defaults to constant_time_size(true), using this count is efficient. Fixes #368. Signed-off-by: Nadav Har'El <nyh@cloudius-systems.com>	2015-09-20 15:08:08 +02:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Tomasz Grabiec	a0c180ef49	memtable: Fix flush in the middle of scanning bug Fixes #309. When scanning memtable readers detect is was flushed, which means that it started to be moved to cache, they fall back to reading from memtable's sstable. Eventually what we should do is to combine memtable and cache contents so that as long as data is not evicted we won't do IO. We do not support scanning in cache yet though, so there is no point in doing this now, and it is not trivial.	2015-09-09 10:17:35 +02:00
Tomasz Grabiec	920fe4278a	Cleanup leftovers after compaction_counter to reclaim_counter rename	2015-09-08 10:19:19 +02:00
Tomasz Grabiec	870e9e5729	lsa: Replace compaction_lock with broader reclaim_lock Disabling compaction of a region is currently done in order to keep the references valid. But disabling only compaction is not enough, we also need to disable eviction, as it also invalidates references. Rather than introducing another type of lock, compaction and eviction are controlled together, generalized as "reclaiming" (hence the reclaim_lock).	2015-09-01 17:29:04 +03:00
Tomasz Grabiec	d20fae96a2	lsa: Make reclaimer run synchronously with allocations The goal is to make allocation less likely to fail. With async reclaimer there is an implicit bound on the amount of memory that can be allocated between deferring points. This bound is difficult to enforce though. Sync reclaimer lifts this limitation off. Also, allocations which could not be satisfied before because of fragmentation now will have higher chances of succeeding, although depending on how much memory is fragmented, that could involve evicting a lot of segments from cache, so we should still avoid them. Downside of sync reclaiming is that now references into regions may be invalidated not only across deferring points but at any allocation site. compaction_lock can be used to pin data, preferably just temporarily.	2015-08-31 21:50:18 +02:00
Tomasz Grabiec	ff8c81b25f	memtable: Encapsulate unsafe accessors	2015-08-31 21:50:17 +02:00
Avi Kivity	7090dffe91	mutation_reader: switch to a class based implementation Using a lambda for implementing a mutation_reader is nifty, but does not allow us to add methods. Switch to a class-based implementation in anticipation of adding a close() method.	2015-08-31 15:53:53 +03:00
Tomasz Grabiec	f4038b1c04	memtable: scanning_reader: Avoid lookups when iterators not invalidated Fixes #230.	2015-08-31 13:58:42 +02:00
Tomasz Grabiec	2bfb138910	Fix typos	2015-08-25 17:07:35 +03:00
Avi Kivity	1579f86503	memtable: keep the lsa region alive while partitions are being destroyed Or we get a use-after-free. Reported by Pekka.	2015-08-20 15:32:30 +03:00
Avi Kivity	c175025bb6	db: place all memtables into a single region_group We can use this to track the amount of unevictable memory in the system.	2015-08-19 19:36:41 +03:00
Tomasz Grabiec	3b92ba2857	db: Add memtable flush logging	2015-08-06 14:05:16 +02:00
Tomasz Grabiec	cda31eccf7	db: Use LSA to allocate data inside memtable	2015-08-06 14:05:16 +02:00
Tomasz Grabiec	fe4c75dee6	memtable: Remove unused find_partition()	2015-08-06 14:05:16 +02:00
Tomasz Grabiec	1046ee6e80	memtable: Remove all_partitions() Preferred way to access the memtable is via reader.	2015-08-06 14:05:16 +02:00
Avi Kivity	98ec451d6a	Extract range<> into its own header It's not just for queries any more.	2015-08-02 16:07:42 +03:00
Avi Kivity	182d5ab798	memtable: fix memory leak Since memtable::partitions is now an intrusive_set, it must be cleared explicitly, or memory is leaked.	2015-07-26 20:01:50 +03:00
Tomasz Grabiec	f9da612581	memtable: Implement range queries	2015-07-22 13:14:33 +02:00
Tomasz Grabiec	9c4956c5dc	memtable: Use boost::intrusive_set<> to store partition entries So that we can use heterogenous comparators. For range queries we will need to compare keys with ring_position.	2015-07-22 13:14:33 +02:00
Tomasz Grabiec	0b0ea04958	range: Remove start_value() and end_value() It's easy to miss that they may be undefined. start() and end(), which return optional<bound> const&, make it clear.	2015-07-22 10:27:47 +02:00
Tomasz Grabiec	da937897cf	memtable: Introduce as_data_source()	2015-07-09 19:46:29 +02:00
Tomasz Grabiec	8a18d2b699	Extract memtable implementation to memtable.cc	2015-07-09 19:46:29 +02:00

44 Commits