scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-20 16:40:35 +00:00

Author	SHA1	Message	Date
Paweł Dziepak	637b9a7b3b	atomic_cell_or_collection: make operator<< show cell content After the new in-memory representation of cells was introduced there was a regression in atomic_cell_or_collection::operator<< which stopped printing the content of the cell. This makes debugging more inconvenient and time-consuming. This patch fixes the problem. Schema is propagated to the atomic_cell_or_collection printer and the full content of the cell is printed. Fixes #3571. Message-Id: <20181024095413.10736-1-pdziepak@scylladb.com>	2018-10-24 13:29:51 +03:00
Tomasz Grabiec	024b3c9fd9	mutation_partition: Fix exception safety of row::apply_monotonically() When emplace_back() fails, value is already moved-from into a temporary, which breaks monotonicity expected from apply_monotonically(). As a result, writes to that cell will be lost. The fix is to avoid the temporary by in-place construction of cell_and_hash. To do that, appropriate cell_and_hash constructor was added. Found by mutation_test.cc::test_apply_monotonically_is_monotonic with some modifications to the random mutation generator. Introduced in `99a3e3a`. Fixes #3678. Message-Id: <1533816965-27328-1-git-send-email-tgrabiec@scylladb.com>	2018-08-09 15:29:10 +03:00
Tomasz Grabiec	6b1fe6cbe5	mutation_partition: Introduce set_continuity()	2018-07-17 16:30:01 +02:00
Tomasz Grabiec	4d3cc2867a	mutation_partition: Make merging preemptable	2018-06-27 12:48:30 +02:00
Paweł Dziepak	ec9d166a4f	treewide: require type to compute cell memory usage	2018-05-31 15:51:11 +01:00
Paweł Dziepak	27014a23d7	treewide: require type info for copying atomic_cell_or_collection	2018-05-31 15:51:11 +01:00
Tomasz Grabiec	81d231f35b	mvcc: Remove rows from tracker gently Some partitions may have a lot of rows. Better to iterate over them incrementally as part of clear_gently() to avoid stalls.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	40cc766cf2	database: Add API for incremental clearing of partition entries Partitions can get very large. Destroying them all at once can stall the reactor for significant amount of time. We want to avoid that by doing destruction incrementally, deferring in between. A new API is added for that at various levels: stop_iteration clear_gently() noexcept; It returns stop_iteration::yes when the object is fully cleared and can be now destroyed quickly. So a deferring destruction can look like this: return repeat([this] { return clear_gently(); }); The reason why clear_gently() doesn't return a future<> itself is that some contexts cannot defer, like memory reclamation.	2018-05-30 12:18:56 +02:00
Paweł Dziepak	33dffd5fb6	row: add clear_hash() Needed to measure the performance of hashing a cell.	2018-05-09 16:52:26 +01:00
Vladimir Krivopalov	ed62b9a667	Add mutation_partition::apply_insert() overload that accepts TTL and expiry for row marker. For #1969. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-04-26 13:27:42 -07:00
Duarte Nunes	c8baba4e3a	mutation_partition: Clarify comment about emptiness empty() doesn't distinguish between live and dead data, so clarify that in its comment. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:03 +01:00
Duarte Nunes	67dac67c46	mutation_partition: Regular base column in view determines row liveness When views contain a primary key column that is not part of the base table primary key, that column determines whether the row is live or not. We need to ensure that when that cell is dead, and thus the derived row marker, either by normal deletion or by TTL, so is the rest of the row. This patch introduces the idea of a shadowing row marker. We map the status of the regular base column in the view's PK to the view row's marker. If this marker is dead, so is that cell in the base table, and so should the view row become. To enforce that, a view row's dead marker shadows the whole row if that view includes a base regular column in its PK. Fixes #3360 Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Duarte Nunes	b0cb5480d5	mutation_fragment: Allow querying if row is live For clustering_row and static_row, allow querying whether they are live or not. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-04-23 09:32:02 +01:00
Tomasz Grabiec	381bf02f55	cache: Evict with row granularity Instead of evicting whole partitions, evicts whole rows. As part of this, invalidation of partition entries was changed to not evict from snapshots right away, but unlink them and let them be evicted by the reclaimer.	2018-03-06 11:50:29 +01:00
Tomasz Grabiec	5320705300	cache: Propagate cache_tracker to places manipulating evictable entries cache_tracker reference will be needed to link/unlink row entries. No change of behavior in this patch.	2018-03-06 11:50:27 +01:00
Tomasz Grabiec	3dc9000c51	mutation_partition: Introduce rows_entry::is_last_dummy() Will be needed by row evictor, which needs to treat last dummies specially (not evict them).	2018-03-06 11:50:26 +01:00
Tomasz Grabiec	d9a38c1c85	mutation_partition: Add API to walk from rows_entry to cache_entry Will be needed on row eviction, to unlink containers when they become fully evicted.	2018-03-06 11:50:26 +01:00
Tomasz Grabiec	9893e8e5f7	mvcc: Make each version have independent continuity This change is a preparation for introducing row-level eviction, such that entries can be evicted from older versions without having to touch other versions. Currently continuity flags on entries are interpreted relative to the combined view merged from all entries. For example: v2: <key=2, cont=1> v1: <key=1, cont=1> In v2, the flag on entry key=2 marks the range (1, 2) as continuous. This is problematic because if the old version is evicted, continuity will change in an incorrect way: v2: <key=2, cont=1> Here, the range (-inf, 1) would be marked as continuous, which is not true. To solve this problem, we change the rules for continuity interpretation in MVCC. Each version will have its own continuity, fully specified in that version, independent of continuity of other versions. Continuity of the snapshot will be a union of continuous ranges in each version. It is assumed that continuous intervals in different versions are non-overlapping, except for points corresponding to complete rows, in which case a later version may overlap with an older version (overwrite). We make use of this assumption to make calculation of the union of intervals on merging easier. I make use of the above assumption in mutation_partition::apply_monotonically(). MVCC population of incomplete entries already almost maintains the non-overlapping invariant, because population intervals correspond to intervals which are incomplete in the old snapshot. The only change needed is to ensure that both population bounds will have entries in the latest version. Population from memtables doesn't mark any intervals as continuous, so also conforms. The only change needed there is to not inherit continuity flags from the old snapshot, effectively making the new version internally discontinuous except for row points.
The example from the beginning will become: v2: <key=1, cont=0> <key=2, cont=1> v1: <key=1, cont=1> When marking a range as continuous with some rows present only in older versions, we need to insert entries in the latest version, so that we can mark the range as continuous. The easiest solution is to copy the entry from the old version. Another option would be to add support for incomplete rows and insert such instead. This way we would avoid duplicating row contents. This optimization is deferred.	2018-03-06 11:50:25 +01:00
Duarte Nunes	99a3e3aa76	mutation_partition: Allow caching cell hashes We add storage to a row to hold the cached hashes of each individual cell. We don't store the hash in each cell because that would a) change the cell equality function, and b) require us to change a cell in a potentially fragmented buffer. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-02-01 01:02:47 +00:00
Duarte Nunes	71ba99d53e	mutation_partition: Force vector_storage internal storage size This patch forces the size of vector_storage's internal storage to 5, meaning that the underlying managed_vector will ensure it doesn't need to externally allocate a buffer to hold the row, if only its first 5 cells are set. We define this size explicitly so we can change the vector's value type in upcoming patches without affecting the optimization. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-02-01 00:22:51 +00:00
Tomasz Grabiec	da0c48a987	mutation_partition: Add rows_entry::set_dummy()	2018-01-18 11:32:49 +01:00
Duarte Nunes	83e983d4d0	mutation_partition: Remove unused operator==() Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180115013546.67260-1-duarte@scylladb.com>	2018-01-15 11:16:35 +02:00
Duarte Nunes	9d1d9883ff	mutation_partition: Remove unused for_each_cell() overload Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180115013618.67351-1-duarte@scylladb.com>	2018-01-15 11:16:34 +02:00
Tomasz Grabiec	8e8ece5dec	mutation_partition: Introduce deletable_row::apply() from a clustering_row fragment	2017-12-08 17:50:47 +01:00
Tomasz Grabiec	b3709047b0	mutation_partition: Extract sliced() from mutation into mutation_partition So that we can call it on mutation_partition.	2017-12-08 17:50:47 +01:00
Tomasz Grabiec	bde050835f	mutation_partition: Make check_continuity() const-qualified	2017-12-08 12:01:27 +01:00
Tomasz Grabiec	f9257886cb	mutation_partition: Make check_continuity() public	2017-12-08 12:01:27 +01:00
Tomasz Grabiec	865bd8a594	mutation_partition: Introduce mutation_partition::get_continuity() Intended to be used in tests.	2017-12-08 12:01:27 +01:00
Tomasz Grabiec	22138554e6	mutation_partition: Leave moved-from row in an empty state Needed by apply_monotonically(). Fixes SIGSEGV in mutation_test_g.	2017-12-08 12:01:27 +01:00
Tomasz Grabiec	70e14f78a7	mutation_partition: Drop apply_reversibly()	2017-11-28 13:03:06 +01:00
Tomasz Grabiec	091e10fc70	mutation_partition: Relax exception guarantees of apply() The uses which needed strong or weak exception guarantees were switched to a solution involving apply_monotonically(). All remaining uses don't need any exception guarantees.	2017-11-28 13:03:06 +01:00
Tomasz Grabiec	988d3c67b4	mutation_partition: Introduce apply_weak() Intended to be used by code which doesn't need any exception guarantees. Currently just delegates to apply_monotonically().	2017-11-28 13:03:03 +01:00
Tomasz Grabiec	97ebf51d3a	mutation_partition: Introduce apply_monotonically() Has weaker exception guarantees than apply(), which allows for simpler implementation. Intended to replace the apply() with strong exception guarantees.	2017-11-28 12:28:51 +01:00
Tomasz Grabiec	978b874065	mutation_partition: Introduce row::consume_with()	2017-11-28 11:20:03 +01:00
Glauber Costa	d49ecae201	mutation_partition: estimate size of partition In the memtable flusher, we account for the size of a partition as we read them. However, there are other points in the architecture where we would like to calculate the size of a partition in a point in which we are not reading it. One such example is the cache update process. This patch enhances the mutation_partition adding a method that returns the total size for this partition. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2017-11-08 16:21:44 -05:00
Tomasz Grabiec	749f5770df	mutation: Introduce apply(mutation_fragment)	2017-11-02 12:16:17 +01:00
Tomasz Grabiec	72028bb048	mutation_partition: Allow creating rows_entry at any clustered position_in_partition In preparation for supporting setting continuity of arbitrary clustering range.	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	409adc045a	mutation_partition: Remove delegating_compare() It can't work with rows_entry at any position_in_partition, so we need to drop it.	2017-11-02 11:05:19 +01:00
Tomasz Grabiec	455a1b0d24	mutation_partition: Introduce range continuity checking methods	2017-09-13 17:47:04 +02:00
Tomasz Grabiec	abc489e99d	mutation_partition: Enable rows_entry::compare() on position_in_partition_views For full symmetry with existing overloads.	2017-09-13 17:47:04 +02:00
Tomasz Grabiec	b6ae5783cd	mvcc: Introduce partition_entry::evict() The operation frees as much memory as possible, marking affected mutation elements as discontinuous.	2017-09-13 17:47:03 +02:00
Paweł Dziepak	43cce6c2f4	rows_entry: make position() inlineable	2017-07-26 14:38:27 +01:00
Tomasz Grabiec	0770845a23	mutation_partition: Introduce r-value accepting deletable_row::apply()	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	efc75b0bc3	mutation_partition: Add rows_entry constructor which accepts full contents [tgrabiec: Extracted from different patch]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	dce293e11c	tests: row_cache: Apply only fully continuous mutations to underlying mutation source Cache currently assumes that mutations coming from outside are fully continuous.	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	05b56fcfb0	mutation_partition: Add support for specifying continuity This will allow expressing lack of information about certain ranges of rows (including the static row), which will be used in cache to determine if information in cache is complete or not. Continuity is represented internally using flags on row entries. The key range between two consecutive entries is continuous iff rows_entry::continuous() is true for the later entry. The range starting after the last entry is assumed to be continuous. The range corresponding to the key of the entry is continuous iff rows_entry::dummy() is false. [tgrabiec: - based on the following commits: 4a5bf75 - Piotr Jastrzebski : mutation_partition: introduce dummy rows_entry 773070e - Piotr Jastrzebski : mutation_partition: add continuity flag to rows_entry - documented that partition tombstone is always complete - require specifying the partition tombstone when creating an incomplete entry - replaced rows_entry(dummy_tag, ...) constructor with more general rows_entry(position_in_partition, ...) - documented continuity semantics on mutation_partition - fixed _static_row_cached being lost by mutation_partition copy constructors - fixed conversion to streamed_mutation to ignore dummy entries - fixed mutation_partition serializer to drop dummy entries - documented semantics of continuity on mutation_partition level - dropped assumptions that dummy entries can be only at the last position - changed equality to ignore continuity completely, rather than partially (it was not ignoring dummy entries, but ignoring continuity flag) - added printout of continuity information in mutation_partition - fixed handling of empty entries in apply_reversibly() with regards to continuity; we no longer can remove empty entries before merging, since that may affect continuity of the right-hand mutation. Added _erased flag. 
- fixed mutation_partition::clustered_row() with dummy==true to not ignore the key - fixed partition_builder to not ignore continuity - renamed dummy_tag_t to dummy_tag. _t suffix is reserved. - standardized all APIs on is_dummy and is_continuous bool_class:es - replaced add_dummy_entry() with ensure_last_dummy() with safer semantics - dropped unused remove_dummy_entry() - simplified and inlined cache_entry::add_dummy_entry() - fixed mutation_partition(incomplete_tag) constructor to mark all row ranges as discontinuous ]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	a77734952d	mutation_partition: Make rows_entry comparable with position_in_partition	2017-06-24 18:06:11 +02:00
Piotr Jastrzebski	65b3123516	mutation_partition: Use rows_entry::position() in comparators key() will not be valid for dummy entries, but position() is always valid. [tgrabiec: Extracted from other commits] [tgrabiec: Added missing change to range_tombstone_stream::get_next]	2017-06-24 18:06:11 +02:00
Tomasz Grabiec	660f3127a6	mutation_partition: Introduce rows_entry::position() In preparation for enabling dummy entries with position past all clustering rows.	2017-06-24 18:06:11 +02:00
Gleb Natapov	f5679e0416	database: remove remnants of no longer existing db::serializer. Message-Id: <20170604100552.GD8248@scylladb.com>	2017-06-04 13:07:17 +03:00
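
The incremental-clearing contract described in commit 40cc766cf2 above (clear_gently() does a bounded amount of work per call and returns stop_iteration::yes once the object is fully cleared) can be illustrated with a minimal sketch. Everything here is hypothetical and independent of Seastar: stop_iteration is modelled as a plain enum, and big_partition, batch, and clear_in_steps are invented names, not ScyllaDB code. The plain loop stands in for the repeat(...) call quoted in the commit message.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Stand-in for Seastar's stop_iteration (assumption: a plain enum is enough here).
enum class stop_iteration { no, yes };

// Hypothetical object that would be expensive to destroy in one go.
class big_partition {
    std::vector<int> _rows;
    static constexpr std::size_t batch = 128;  // arbitrary per-call work budget
public:
    explicit big_partition(std::size_t n) : _rows(n) {}
    std::size_t size() const noexcept { return _rows.size(); }

    // Removes at most `batch` rows per call.
    // Returns yes once nothing is left, so the object can be destroyed quickly.
    stop_iteration clear_gently() noexcept {
        for (std::size_t i = 0; i < batch && !_rows.empty(); ++i) {
            _rows.pop_back();
        }
        return _rows.empty() ? stop_iteration::yes : stop_iteration::no;
    }
};

// Synchronous driver: calls clear_gently() until it reports completion and
// returns the number of calls made. A deferring caller would yield to the
// scheduler between calls instead of looping straight through.
std::size_t clear_in_steps(big_partition& p) {
    std::size_t steps = 1;
    while (p.clear_gently() == stop_iteration::no) {
        ++steps;  // a real caller would defer here
    }
    assert(p.size() == 0);
    return steps;
}
```

Because clear_gently() itself never defers, the same method can be called from a context that cannot wait, which is the reason the commit message gives for not returning a future<>.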


173 Commits
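
The per-version continuity rule from commit 9893e8e5f7 above (each version carries its own continuity, and the snapshot's continuity is the union over versions) can be sketched as follows. This is an illustration under stated assumptions, not ScyllaDB code: entry, version, continuous_at, and snapshot_continuous_at are hypothetical names, keys are plain integers, and a query position is a number strictly between keys. One simplification is specific to this sketch: the gap after a version's last entry is reported as not continuous, whereas commit 05b56fcfb0 describes a different default for that trailing range.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical row entry: `continuous` describes the gap between the
// previous entry of the same version (or -inf) and this entry.
struct entry {
    int key;
    bool continuous;
};

// One MVCC version: entries sorted by key.
using version = std::vector<entry>;

// Is the gap containing `pos` continuous according to this version alone?
// `pos` is expected to lie strictly between keys (e.g. 0.5, 1.5).
bool continuous_at(const version& v, double pos) {
    assert(std::is_sorted(v.begin(), v.end(),
        [](const entry& a, const entry& b) { return a.key < b.key; }));
    auto it = std::find_if(v.begin(), v.end(),
        [pos](const entry& e) { return e.key > pos; });
    // Sketch-only simplification: no later entry means "not continuous".
    return it != v.end() && it->continuous;
}

// Snapshot continuity is the union of the versions' continuity.
bool snapshot_continuous_at(const std::vector<version>& versions, double pos) {
    return std::any_of(versions.begin(), versions.end(),
        [pos](const version& v) { return continuous_at(v, pos); });
}
```

With the layout from the commit message, v2 = {<key=1, cont=0>, <key=2, cont=1>} and v1 = {<key=1, cont=1>}, the gap below key 1 is continuous only through v1 and the gap between keys 1 and 2 only through v2. Dropping v1 therefore removes continuity below key 1 without touching v2's flags.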