scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 12:17:02 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	27fea7bf2c	mutation_partition: add non-cons rows and tombstones accessors Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-13 09:50:07 +01:00
Tomasz Grabiec	8c4b5e4283	db: Avoiding checking bloom filters during compaction Checking bloom filters of sstables to compute max purgeable timestamp for compaction is expensive in terms of CPU time. We can avoid calculating it if we're not about to GC any tombstone. This patch changes compacting functions to accept a function instead of ready value for max_purgeable. I verified that bloom filter operations no longer appear on flame graphs during compaction-heavy workload (without tombstones). Refs #1322.	2016-07-10 09:54:20 +02:00
Paweł Dziepak	23d0bfd065	mutation_partition: add row::memory_usage() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-07 12:17:25 +01:00
Paweł Dziepak	f95c5542dc	mutation_partition: allow slicing moved mutation_partition Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:51 +01:00
Paweł Dziepak	22160ae6d5	mutation_partition: make rows_type public Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:49 +01:00
Paweł Dziepak	847bf878ec	mutation_partition: add more row::apply() overloads Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:48 +01:00
Duarte Nunes	70083efee2	sstables: Read and write range tombstone bounds This patch uses the composite_marker to add inclusiveness information to the prefixes of a range tombstone. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:59 +02:00
Duarte Nunes	7628e403a3	sstables: Drop code for tombstone merging Since Scylla now supports proper range tombstones, the code for reading ranges from sstables and converting them to overlapping tombstones is no longer necessary, and is, in fact, wasteful as the internal representation converts overlapping tombstones back to ranges. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:59 +02:00
Duarte Nunes	91aac30f12	mutations: Row tombstones are now a set of ranges This patch changes the type of the mutation partition's row_tombstones to be a range_tombstone_list, so that they are now represented as a set of disjoint ranges. All of its usages are updated accordingly. Fixes #1155 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2016-06-02 16:21:59 +02:00
Piotr Jastrzebski	23c23abe53	Make memtable mutation_reader slice using clustering ranges. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2016-05-16 11:46:41 +02:00
Tomasz Grabiec	a1539fed95	mutation_partition: Fix reversed trim_rows() The first erase_and_dispose(), which removes rows between last position and beginning of the next range, can invalidate end() iterator of the range. Fix by looking up end after erasing. mutation_partition::range() was split into lower_bound() and upper_bound() to allow for that. This affects for example queries with descending order where the selected clustering range is empty and falls before all rows. Exposed by `f15c380a4f`, which is now calling do_compact() during query. Reproduced by dtest paging_test.py:TestPagingData.static_columns_paging_test	2016-04-08 20:53:33 +02:00
Avi Kivity	db03295c8a	Merge "Fix query digest mismatch" from Tomasz "Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165."	2016-04-08 12:13:29 +03:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	f15c380a4f	database: Compact mutations when executing data queries Currently data query digest includes cells and tombstones which may have expired or be covered by higher-level tombstones. This causes digest mismatch between replicas if some elements are compacted on one of the nodes and not on others. This mismatch triggers read-repair which doesn't resolve because mutations received by mutation queries are not differing, they are compacted already. The fix adds compacting step before writing and digesting query results by reusing the algorithm used by mutation query. This is not the most optimal way to fix this. The compaction step could be folded with the query writing, there is redundancy in both steps. However such change carries more risk, and thus was postponed. perf_simple_query test (cassandra-stress-like partitions) shows regression from 83k to 77k (7%) ops/s. Fixes #1165.	2016-04-07 19:56:58 +02:00
Tomasz Grabiec	a7966e9b71	mutation_partition: Fix friend declarations Missing "class" confuses CLion IDE.	2016-03-21 21:49:53 +01:00
Tomasz Grabiec	dc290f0af7	mutation_partition: Make apply() atomic even in case of exception We cannot leave partially applied mutation behind when the write fails. It may fail if memory allocation fails in the middle of apply(). This for example would violate write atomicity, readers should either see the whole write or none at all. This fix makes apply() revert partially applied data upon failure, by the means of ReversiblyMergeable concept. In a nut shell the idea is to store old state in the source mutation as we apply it and swap back in case of exception. At cell level this swapping is inexpensive, just rewiring pointers. For this to work, the source mutation needs to be brought into mutable form, so frozen mutations need to be unfrozen. In practice this doesn't increase amount of cell allocations in the memtable apply path because incoming data will usually be newer and we will have to copy it into LSA anyway. There are extra allocations though for the data structures which holds cells. I didn't see significant change in performance of: build/release/tests/perf/perf_simple_query -c1 -m1G --write --duration 13 The score fluctuates around ~77k ops/s. Fixes #283.	2016-03-21 21:49:52 +01:00
Tomasz Grabiec	e09d186c7c	mutation_partition: Make intrusive sets ReversiblyMergeable	2016-03-21 21:49:52 +01:00
Tomasz Grabiec	f1a4feb1fc	mutation_partition: Make row_tombstones_entry ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	e4a576a90f	mutation_partition: Make rows_entry ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	aadcd75d89	mutation_partition: Make row_marker ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	ea7c2dd085	mutation_partition: Make row ReversiblyMergeable	2016-03-21 19:26:24 +01:00
Tomasz Grabiec	9fc7f8a5ed	mutation_partition: row: Add empty()	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	d5e66a5b0d	mutation_partition: row: Allow storing empty cells internally Currently only "set" storage could store empty cells, but not the "vector" one because there empty cell has the meaning of being missing. To implement rolback, we need to be able to distinguish empty cells from missing ones. Solve by making vector storage use a bitmap for presence checking instead of emptiness. This adds 4 bytes to vector storage.	2016-03-21 18:41:27 +01:00
Tomasz Grabiec	8134992024	mutation_partition: Add cell_entry constructor which makes an empty cell	2016-03-18 22:30:04 +01:00
Tomasz Grabiec	c91eefa183	mutation_partition: Unmark cell_entry's copy constructor as noexcept It was a mistake, it certainly may throw because it copies cells.	2016-03-18 22:30:04 +01:00
Tomasz Grabiec	6cec131432	query: Switch to IDL-generated views and writers The query result footprint for cassandra-stress mutation as reported by tests/memory-footprint increased by 18% from 285 B to 337 B. perf_simple_query shows slight regression in throughput (-8%): build/release/tests/perf/perf_simple_query -c4 -m1G --partitions 100000 Before: ~433k tps After: ~400k tps	2016-02-26 12:26:13 +01:00
Tomasz Grabiec	4284715ddf	Relax includes	2016-02-26 12:26:13 +01:00
Avi Kivity	1f245e3bcb	mutation_partition: fix use of boost::intrusive::set<>::comp() Seems like boost::intrusive::set<>::comp() is not accessible on some versions of boost. Replace by the equivalent boost::intrusive::set<>::key_comp(). Fixes #858. Message-Id: <1454326483-29780-1-git-send-email-avi@scylladb.com>	2016-02-01 13:54:52 +01:00
Tomasz Grabiec	036974e19b	Make mutation interfaces support multiple versions Schema is tracked in memtable and cache per-entry. Entries are upgraded lazily on access. Incoming mutations are upgraded to table's current schema on given shard. Mutating nodes need to keep schema_ptr alive in case schema version is requested by target node.	2016-01-11 10:34:51 +01:00
Tomasz Grabiec	f59ec59abc	mutation: Implement upgrade() Converts mutation to a new schema.	2016-01-08 21:10:26 +01:00
Tomasz Grabiec	2cfdfe261d	Introduce converting_mutation_partition_applier	2016-01-08 21:10:26 +01:00
Tomasz Grabiec	a6084ee007	mutation: Make hashable The computed hash is independent of any internal representation thus can be used as a digest across nodes and versions.	2016-01-08 21:10:26 +01:00
Tomasz Grabiec	ade5cf1b4b	mutation_partition: Make visitable with mutation_partition_visitor	2016-01-08 21:10:25 +01:00
Tomasz Grabiec	bc9ee083dd	db: Move atomic_cell_or_collection to separate header To break future cyclic dependency: atomic_cell.hh -> schema.hh (new) -> types.hh -> atomic_cell.hh	2016-01-08 21:10:25 +01:00
Tomasz Grabiec	6f955e1290	mutation_partition: Make equal() work with different schemas	2016-01-08 21:10:25 +01:00
Tomasz Grabiec	ff3a2e1239	mutation_partition: Drop row tombstones in do_compact()	2016-01-08 21:10:25 +01:00
Raphael S. Carvalho	03eee06784	remove empty rows in mutation_partition::do_compact do_compact() wasn't removing an empty row that is covered by a tombstone. As a result, an empty partition could be written to a sstable. To solve this problem, let's make trim_rows remove a row that is considered to be empty. A row is empty if it has no tombstone, no marker and no cells. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2016-01-05 15:19:21 +01:00
Avi Kivity	fd14cb3743	mutation_partition: fix leak in move assignment operator The default move assignment operator calls boost::intrusive::set's move assignment operator, which leaks, because it does not believe it owns the data. Fix by providing a custom implementation.	2015-12-14 10:33:19 +01:00
Paweł Dziepak	5f1e9fd88f	mutation_partition: remove unused find_entry() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-12-10 05:46:26 +01:00
Tomasz Grabiec	f74c665671	mutation_partition: Add non-const-qualified version of range() and use it	2015-10-22 18:09:07 +03:00
Avi Kivity	0129e42b06	Merge "Mutation diff" from Paweł "This series add code for computing mutation_partition difference. For mutations A and B: diffA = A.difference(B); diffB = B.difference(A); AB = A.apply(B); diffA is the minimal mutation that when applied to B makes it equal to AB and diffB is the minimal mutation that applied to A results in AB. Fixes #430."	2015-10-22 16:38:25 +03:00
Paweł Dziepak	f78a80dfa3	mutation_partition: add method for computing difference Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Paweł Dziepak	85edc3de07	mutation_partition: compute row difference Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Paweł Dziepak	4440f9b85b	mutation_partition: add row_marker::is_live() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Paweł Dziepak	2aa96eb00f	mutation_partition: add insert_row() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Paweł Dziepak	a064181d7c	mutation_partition: add row::with_both_ranges() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Paweł Dziepak	1c05d7b927	mutation_partition: fix row_marker::apply() for equal timestamps Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Paweł Dziepak	7fab0ee867	mutation_partition: add compare_row_marker_for_merge() A compare_atomic_cell_for_merge() equivalent intended to be used with row markers. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2015-10-22 12:08:53 +02:00
Tomasz Grabiec	cc5cc7117d	mutation_query: Respect 'reversed' partition_slice option Fixes #480	2015-10-22 10:32:08 +02:00
Tomasz Grabiec	1b1cfd2cbf	tests: Introduce tests/memory_footprint_test	2015-09-23 21:27:44 -07:00

1 2 3

102 Commits