scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 12:06:44 +00:00

Author	SHA1	Message	Date
Glauber Costa	6317bd45d7	LCS: implement backlog tracker for compaction controller This is the last missing tracker among the major strategies. After this, only DTCS is left. To calculate the backlog, we will define the point of zero-backlog as having all data in the last level. The backlog is then: Sum(L in levels) sizeof(L) * (max_levels - L) * fan_out, where: * the fan_out is the amount of SSTables we usually compact with the next level (usually 10). * max_levels is the number of levels currently populated * sizeof(L) is the total amount of data in a particular level. Care is taken for the backlog not to jump when a new level has been just recently created. Aside from that, SSTables that accumulate in L0 can be subject to STCS. We will then add a STCS backlog in those SSTables to represent that. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-06-03 18:14:09 -04:00
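The backlog formula in the commit message above can be illustrated with a minimal sketch. This is not ScyllaDB's tracker: `lcs_backlog_sketch` and its parameters are hypothetical names, and `max_levels` is assumed here to be the index of the highest level that holds data, so that the last level contributes zero as the zero-backlog definition requires. The smoothing applied when a new level appears and the extra STCS term for L0 are left out.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: level_sizes[l] is the total number of bytes in level l.
// Computes Sum(L in levels) sizeof(L) * (max_levels - L) * fan_out.
inline std::uint64_t lcs_backlog_sketch(const std::vector<std::uint64_t>& level_sizes,
                                        std::uint64_t fan_out = 10) {
    assert(fan_out > 0);
    // Assumption: max_level is the index of the highest level holding data.
    std::size_t max_level = 0;
    for (std::size_t l = 0; l < level_sizes.size(); ++l) {
        if (level_sizes[l] > 0) {
            max_level = l;
        }
    }
    std::uint64_t backlog = 0;
    for (std::size_t l = 0; l < level_sizes.size() && l <= max_level; ++l) {
        // Data in the last populated level contributes nothing.
        backlog += level_sizes[l] * (max_level - l) * fan_out;
    }
    return backlog;
}
```

Under these assumptions, 100 bytes in L0 with data already present in L2 yields 100 * 2 * 10 = 2000, while the same data sitting entirely in the last level yields zero.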
Glauber Costa	04546df55c	LCS: don't construct property in the body of constructor Right now we are constructing the _max_sstable_size_in_mb property in the body of the constructor, which makes it hard for us to use from other properties. We are doing that because we'd like to test for bounds of that value. So a cleaner way is to have a helper function for that. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-06-03 18:14:09 -04:00
Glauber Costa	28382cb25c	LCS: try harder to move SSTables to highest levels. Our current implementation of LCS can end up with situations in which just a bit of data is in the highest levels, with the majority in the lowest levels. That happens because we will only promote things to highest levels if the amount of data in the current level is higher than the maximum. This is a pre-existing problem in itself, but became even clearer when we started trying to define what is the backlog for LCS. We have discussed ways to fix this by redefining the criteria on when to move data to the next levels. That would require us to change the way things are today considerably, allowing parallel compactions, etc. There is significant risk that we'll increase write amplification and we would need to carefully validate that. For now I will propose a simpler change, that essentially solves the "inverted pyramid" problem of current LCS without major disruption: keep selecting compaction candidates with the same criteria that we do today, we should help make sure we are not compacting high levels for no reason; but if there is nothing to do, use the idle time to push data to higher levels. As an added benefit, old data that is in the higher level can also be compacted away faster. With this patch we see that in an idle, post-load system all data is eventually pushed to the last level. Systems under constant writes keep behaving the same way they did before. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-06-03 18:12:19 -04:00
Glauber Costa	e64b471e3d	leveled manifest: turn 10 into a constant We increase levels in powers of 10 but that is a parameter of the algorithm. At least make it into a constant so that we can reuse it somewhere else. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-06-03 16:55:58 -04:00
Glauber Costa	7e3093709a	backlog: add level to write progress monitor For SSTables being written, we don't know their level yet. Add that information to the write monitor. New SSTables will always be at L0. Compacted SSTables will have their level determined by the compaction process. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-05-31 21:09:38 -04:00
Amnon Heiman	bc7503feee	Scyllatop to use prometheus by default Scylla now exposes the prometheus API by default. This patch changes scyllatop to use the Prometheus API, the collectd API is still available. The main changes in the patch: * Move collectd specific logic inside collectd. * Add support for help information. * Add command line to configure prometheus end point and to enable collectd. * Add a prometheus class that collects information from prometheus. Fixes: #1541 Message-Id: <20180531124156.26336-1-amnon@scylladb.com>	2018-05-31 18:00:22 +03:00
Tomasz Grabiec	b5e42bc6a0	tests: row_cache: Do not hang when only one of the readers throws Message-Id: <20180531122729.3314-1-tgrabiec@scylladb.com>	2018-05-31 18:00:22 +03:00
Piotr Sarna	360326fdc5	cql3: add compatibility with libjsoncpp < 1.6.0 Only libjsoncpp >= 1.6.0 offers a safe name() method for value iterators. For older versions, deprecated memberName() is used instead. Note that memberName() was deprecated because of its inability to deal with embedded null characters. Fixes #3471 Message-Id: <e64a62bfc24ef06daee238d79d557fe6ec8979d3.1527758708.git.sarna@scylladb.com>	2018-05-31 18:00:22 +03:00
Duarte Nunes	f8626c7c93	tests/view_schema_test: Test view correctness under base schema changes Reproducer for #3443. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180530194536.51202-2-duarte@scylladb.com>	2018-05-31 12:10:50 +03:00
Duarte Nunes	c4f267bdfe	database: Refresh view dependent fields when altering base A view schema's view_info contains the id of the base regular column that view includes in its primary key. Since the column id of a particular column can potentially change with a new schema version, we need to refresh the stored column id. We weren't doing that when unselected base columns are added, and this patch fixes it by triggering an update of the view schema when base columns are added and the view contains a base regular column in its PK. Fixes #3443 Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180530194536.51202-1-duarte@scylladb.com>	2018-05-31 12:10:49 +03:00
Nadav Har'El	a1cbeeffcd	tests/view_complex_test.cc: fix and enable buggy test tests/view_complex_test.cc contained a #ifdef'ed-out test claiming to be a reproducer for issue #3362. Unfortunately, it is not - after earlier commits the only reason this test still fails is a mistake in the test, which expects 0 rows in a case where the real result is 1 row. Issue #3362 does not have to be fixed to fix this test. So this patch fixes the broken test, and enables it. It also adds comments explaining what this test is supposed to do, and why it works the way it does. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180530142214.29398-1-nyh@scylladb.com>	2018-05-30 15:39:25 +01:00
Avi Kivity	9999e0e6bc	Merge "Implement support for static rows in SSTable 3.0" from Piotr " Add handling for static rows and tests for it. " * 'haaawk/sstables3/read-static-v1' of ssh://github.com/scylladb/seastar-dev: sstable_3_x_test: Add test_uncompressed_compound_static_row_read sstable_3_x_test: add test_uncompressed_static_row_read flat_mutation_reader_assertions: improve static row assertions data_consume_rows_context_m: Implement support for static rows mp_row_consumer_m: Implement support for static rows mp_row_consumer_m: Extract fill_cells	2018-05-30 17:17:17 +03:00
Paweł Dziepak	62d0639fe9	Merge "Avoid reactor stalls in cache with large partitions" from Tomasz " We currently suffer from reactor stalls caused by non-preemptible processing of large partitions in the following places: (1) dropping partition entries from cache or memtables does not defer (2) dropping partition versions abandoned by detached snapshots does not defer (3) merging of partition versions when snapshots go away does not defer (4) cache update from memtable processes partition entries without deferring (#2578) (5) partition entries are upgraded to new schema atomically This series fixes problems (1), (2) and (4), but not (3) and (5). (1) and (2) are fixed by introducing mutation_cleaner objects which are containers for garbage partition versions which are delaying actual freeing. Freeing happens from memory reclaimers and is incremental. (3) and (5) are not solved yet. (4) is solved by having partition merging process partitions with row granularity and defer in the middle of partition. In order to preserve update atomicity on partition level as perceived by reads, when update starts we create a snapshot to the current version of partition and process memtable entry by inserting data into a separate partition version. This way if upgrade defers in the middle of partition reads can still go to the old version and not see partial writes. Snapshots are marked with phase numbers, and reads will use the previous phase until whole partition is upgraded. When partition is finally merged, the snapshots go away and the new version will eventually be merged to the old version. Due to (3) however, this merging may still add latency to the upgrade path. Remaining work: - Solving problem (3). I think the approach to take here would be to move the task of merging versions to the background, maybe into mutation_cleaner. - Merging range tombstones incrementally. 
Performance =========== Performance improvements were evaluated using tests/perf_row_cache_update -c1 -m1G, which measures time it takes to update cache from memtable for various workloads and schemas. For large partition with lots of small rows we see a significant reduction of scheduling latency from ~550ms to ~23ms. The cause of remaining latency is problem (3) stated above. The run time is reduced by 70%. For small partition case without clustering columns we see no degradation. For small partition case with clustering key, but only 3 small rows per partition, we see a 30% degradation in run time. For large partition with lots of range tombstones we see degradation of 15% in run time and scheduling latency. Below you can see full statistics for cache update run time: === Small partitions, no overwrites: Before: avg = 433.965155 stdev = 35.958024 min = 340.093201 max = 468.564514 After: avg = 436.929447 (+1%) stdev = 37.130237 min = 349.410339 max = 489.953400 === Small partition with a few rows: Before: avg = 315.379316 stdev = 30.059120 min = 240.340561 max = 342.408295 After: avg = 407.232691 (+30%) stdev = 53.918717 min = 269.514648 max = 444.846649 === Large partition, lots of small rows: Before: avg = 412.870689 stdev = 227.411317 min = 286.990631 max = 1263.417847 After: avg = 124.351705 (-70%) stdev = 4.705762 min = 110.063255 max = 129.643387 === Large partition, lots of range tombstones: Before: avg = 601.172644 stdev = 121.376866 min = 223.502136 max = 874.111572 After: avg = 695.627588 (+15%) stdev = 135.057004 min = 337.173950 max = 784.838745 " * tag 'tgrabiec/clear-gently-all-partitions-v3' of github.com:tgrabiec/scylla: mvcc: Use small_vector<> in partition_snapshot_row_cursor utils: Extract small_vector.hh mvcc: Erase rows gradually in apply_to_incomplete() mvcc: partition_snapshot_row_cursor: Avoid row copying in consume() when possible cache: real_dirty_memory_accounter: Move unpinning out of the hot path mvcc: partition_snapshot_row_cursor: 
Reduce lookups in ensure_entry_if_complete() mutation_partition: Reduce row lookups in apply_monotonically() cache: Release dirty memory with row granularity cache: Defer during partition merging mvcc: partition_snapshot_row_cursor: Introduce consume_row() mvcc: partition_snapshot_row_cursor: Introduce maybe_refresh_static() mvcc: Make apply_to_incomplete() work with attached versions cache: Propagate phase to apply_to_incomplete() cache: Prepare for incremental apply_to_incomplete() Introduce a coroutine wrapper tests: mvcc: Encapsulate memory management details tests: cache: Take into account that update() may defer cache: real_dirty_memory_accounter: Allow construction without memtable cache: Extract real_dirty_memory_accounter mvcc: Destroy memtable partition versions gently memtable: Destroy partitions incrementally from clear_gently() mvcc: Remove rows from tracker gently cache: Destroy partition versions incrementally Introduce mutation_cleaner mvcc: Introduce partition_version_list mvcc: Fix move constructor of partition_version_ref() not preserving _unique_owner database: Add API for incremental clearing of partition entries cache: Define trivial methods inline tests: Improve perf_row_cache_update mutation_reader: Make empty mutation source advertize no partitions	2018-05-30 14:12:29 +01:00
Tomasz Grabiec	4561e97efe	mvcc: Use small_vector<> in partition_snapshot_row_cursor I measured 8% improvement in cache update throughput for small partitions.	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	db36ff0643	utils: Extract small_vector.hh	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	5b59df3761	mvcc: Erase rows gradually in apply_to_incomplete() So that we avoid double-buffering partitions.	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	b7fdf4309f	mvcc: partition_snapshot_row_cursor: Avoid row copying in consume() when possible	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	8d66f6da58	cache: real_dirty_memory_accounter: Move unpinning out of the hot path Instead of calling into real dirty memory manager per row, call it per deferring point.	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	60000b98a4	mvcc: partition_snapshot_row_cursor: Reduce lookups in ensure_entry_if_complete() Leverage the fact that it is called with monotonically increasing positions, and avoid lookups in case the current target entry is the successor of desired position. Reduces cache update latency by 40% for large partition in a time-series workload.	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	82e8217ba0	mutation_partition: Reduce row lookups in apply_monotonically() This change speeds up merging of partition versions with many rows in case the merged version has many rows which fall between existing rows in the target version. This is often the case for time-series workloads, which insert rows at the front. Lookup can be avoided for all but the first row in the stride because we already have a reference to the successor in the target tree, we only need to check that the current entry in the target tree is still the successor. This change greatly reduces amount of lookups per row during version merging of large partitions in time-series workloads.	2018-05-30 14:41:41 +02:00
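The lookup-avoidance idea described in the commit message above can be sketched with a plain `std::map` standing in for the row tree. This is an illustration only, not the code in `apply_monotonically()`: `merge_sorted_rows`, the `int` keys and values, and the lookup counter are all hypothetical. It assumes the source rows arrive in strictly increasing key order, which is what lets the remembered successor stay valid for a whole stride.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

// Merges `source` (strictly increasing keys) into `target` and returns the
// number of full tree lookups that were needed.
inline std::size_t merge_sorted_rows(std::map<int, int>& target,
                                     const std::vector<std::pair<int, int>>& source) {
    std::size_t lookups = 0;
    auto successor = target.end();
    bool have_successor = false;
    for (std::size_t i = 0; i < source.size(); ++i) {
        const int key = source[i].first;
        assert(i == 0 || source[i - 1].first < key);  // monotonic input
        // The remembered entry is still the successor if the new key sorts
        // before it (or if we are already appending at the end of the tree).
        const bool still_successor = have_successor &&
            (successor == target.end() || key < successor->first);
        if (!still_successor) {
            successor = target.lower_bound(key);  // the only full lookup
            have_successor = true;
            ++lookups;
        }
        if (successor != target.end() && successor->first == key) {
            successor->second = source[i].second;  // key exists: overwrite
        } else {
            // Insert right before the successor, using it as the hint.
            target.emplace_hint(successor, key, source[i].second);
        }
    }
    return lookups;
}
```

With a target holding keys 0, 100 and 200, merging 10, 20, 30, 150, 160 takes two lookups in this sketch, one per stride, rather than one per row.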
Tomasz Grabiec	5bc201df10	cache: Release dirty memory with row granularity	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	70c72773be	cache: Defer during partition merging	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	051bb74583	mvcc: partition_snapshot_row_cursor: Introduce consume_row()	2018-05-30 14:41:41 +02:00
Tomasz Grabiec	518fd7083f	mvcc: partition_snapshot_row_cursor: Introduce maybe_refresh_static() A version of maybe_refresh() optimized for snapshots which are no longer populated. Will be used to implement cache update from memtable.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	c653137b2b	mvcc: Make apply_to_incomplete() work with attached versions Needed before making it preemptible. We cannot steal the entry since we may need to resume merging later.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	1792be3697	cache: Propagate phase to apply_to_incomplete() It will be needed to create snapshots with appropriate phase markers.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	494cb3f3da	cache: Prepare for incremental apply_to_incomplete() Incremental merging will be implemented by the means of resumable functions, which return stop_iteration::no when not yet finished. We're not using futures, so that the caller can do work around preemption points as well.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	a19c5cbc16	Introduce a coroutine wrapper Represents a deferring operation which defers cooperatively with the caller. The operation is started and resumed by calling run(), which returns with stop_iteration::no whenever the operation defers and is not completed yet. When the operation is finally complete, run() returns with stop_iteration::yes. This allows the caller to: 1) execute some post-defer and pre-resume actions atomically 2) have control over when the operation is resumed and in which context, in particular the caller can cancel the operation at deferring points. It will be used to implement deferring partition_version::apply_to_incomplete().	2018-05-30 14:41:40 +02:00
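The contract described in the commit message above (call `run()`, get `stop_iteration::no` on each deferral and `stop_iteration::yes` on completion) can be sketched as follows. This is a simplified stand-in, not the wrapper added by the commit: `resumable_operation` and `make_chunked_sum` are hypothetical names, and the local `stop_iteration` enum only mimics the Seastar type of the same name.

```cpp
#include <cassert>
#include <functional>
#include <utility>

enum class stop_iteration { no, yes };  // stand-in for Seastar's stop_iteration

// Hypothetical wrapper: holds a step function that does a bounded amount of
// work per call and reports whether the whole operation has finished.
class resumable_operation {
    std::function<stop_iteration()> _step;
    bool _done = false;
public:
    explicit resumable_operation(std::function<stop_iteration()> step)
        : _step(std::move(step)) {}

    // Starts or resumes the operation. The caller decides when to call again,
    // and may simply stop calling in order to cancel at a deferring point.
    stop_iteration run() {
        if (_done) {
            return stop_iteration::yes;
        }
        const stop_iteration result = _step();
        _done = (result == stop_iteration::yes);
        return result;
    }
};

// Example operation: adds 0..n-1 into `out`, deferring after every `chunk` items.
inline resumable_operation make_chunked_sum(int n, int chunk, long& out) {
    assert(chunk > 0);
    int next = 0;
    return resumable_operation([n, chunk, next, &out]() mutable {
        const int end = (next + chunk < n) ? next + chunk : n;
        for (; next < end; ++next) {
            out += next;
        }
        return next >= n ? stop_iteration::yes : stop_iteration::no;
    });
}
```

Because `run()` returns a plain value rather than a future, the caller can do its own work between resumptions, which matches the motivation stated in the commit message.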
Tomasz Grabiec	6bd1a04c10	tests: mvcc: Encapsulate memory management details Currently tests have a single LSA region lock around construction of managed objects, their manipulation, and access. This way we avoid the complexity of dealing with allocating sections. That will not be possible once apply_to_incomplete() is changed to enter an allocating section itself because this requires region to be unlocked at entry. The tests will have to take more fine-grained locks. That is somewhat tricky and would add a lot of noise to tests. This patch will make things easier by abstracting LSA management, among other things, inside mvcc_container and mvcc_partition classes.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	f6e21accc7	tests: cache: Take into account that update() may defer The test incorrectly assumed that once update() is started the cache will return only versions from last_generation. This will not hold once we start to defer during partition merging.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	c10d9e1607	cache: real_dirty_memory_accounter: Allow construction without memtable	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	6ecda1ccd7	cache: Extract real_dirty_memory_accounter	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	3f19f76c67	mvcc: Destroy memtable partition versions gently Now all snapshots will have a mutation_cleaner which they will use to gently destroy freed partition_version objects. Destruction of memtable entries during cache update is also using the gentle cleaner now. We need to have a separate cleaner for memtable objects even though they're owned by cache's region, because memtable versions must be cleared without a cache_tracker. Each memtable will have its own cleaner, which will be merged with the cache's cleaner when memtable is merged into cache. Fixes some sources of reactor stalls on cache update when there are large partition entries in memtables.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	c2d702622e	memtable: Destroy partitions incrementally from clear_gently() Destroying large partitions may stall the reactor for a long time. Avoid this by clearing incrementally.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	81d231f35b	mvcc: Remove rows from tracker gently Some partitions may have a lot of rows. Better to iterate over them incrementally as part of clear_gently() to avoid stalls.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	f0c1edd672	cache: Destroy partition versions incrementally Instead of destroying whole partition_versions at once, we will do that gently using mutation_cleaner to avoid reactor stalls. Large deletions could happen when large partition gets invalidated, upgraded to a new schema, or when it's abandoned by a detached snapshot. Refs #3289.	2018-05-30 14:41:40 +02:00
Tomasz Grabiec	e0803ff71e	Introduce mutation_cleaner Used for collecting unused partition_version objects and freeing them incrementally. Will be used for both cache and memtables.	2018-05-30 14:41:39 +02:00
Tomasz Grabiec	e5aa02efeb	mvcc: Introduce partition_version_list	2018-05-30 12:18:56 +02:00
Tomasz Grabiec	ca1ee93577	mvcc: Fix move constructor of partition_version_ref() not preserving _unique_owner We didn't rely on that yet, it seems, but will. (cherry picked from commit 21a744337de01f699d5c5c340483ad23cabab2ee)	2018-05-30 12:18:56 +02:00
Tomasz Grabiec	40cc766cf2	database: Add API for incremental clearing of partition entries Partitions can get very large. Destroying them all at once can stall the reactor for a significant amount of time. We want to avoid that by doing destruction incrementally, deferring in between. A new API is added for that at various levels: stop_iteration clear_gently() noexcept; It returns stop_iteration::yes when the object is fully cleared and can be now destroyed quickly. So a deferring destruction can look like this: return repeat([this] { return clear_gently(); }); The reason why clear_gently() doesn't return a future<> itself is that some contexts cannot defer, like memory reclamation.	2018-05-30 12:18:56 +02:00
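A minimal sketch of the `clear_gently()` pattern from the commit message above, assuming a toy container in place of a real partition entry. `big_partition`, its batch size of 128 and `repeat_until_done` are hypothetical; the last one is a synchronous stand-in for Seastar's `repeat()`, which in real code would yield to the reactor between calls. The local `stop_iteration` enum mimics the Seastar type.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

enum class stop_iteration { no, yes };  // stand-in for Seastar's stop_iteration

// Hypothetical container that frees a bounded number of rows per call.
class big_partition {
    std::vector<int> _rows;
public:
    explicit big_partition(std::size_t row_count) : _rows(row_count) {}

    std::size_t size() const noexcept { return _rows.size(); }

    // Frees at most one batch of rows. Returns stop_iteration::yes once the
    // object is empty and can be destroyed cheaply.
    stop_iteration clear_gently() noexcept {
        constexpr std::size_t batch = 128;  // assumed batch size
        const std::size_t n = _rows.size() < batch ? _rows.size() : batch;
        _rows.erase(_rows.end() - static_cast<std::ptrdiff_t>(n), _rows.end());
        return _rows.empty() ? stop_iteration::yes : stop_iteration::no;
    }
};

// Synchronous stand-in for repeat(): calls `step` until it reports completion
// and returns how many calls that took.
template <typename Step>
std::size_t repeat_until_done(Step&& step) {
    std::size_t calls = 1;
    while (step() == stop_iteration::no) {
        ++calls;
    }
    return calls;
}
```

Keeping `clear_gently()` free of futures means the same method can also be driven from a context that cannot defer, which is the reason the commit message gives for the return type.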
Tomasz Grabiec	2f75212ca4	cache: Define trivial methods inline They have users in a different compilation unit, in partition_version.cc	2018-05-30 12:18:56 +02:00
Tomasz Grabiec	25b3641d9e	tests: Improve perf_row_cache_update We now test more kinds of workloads: - small partitions with no clustering key - large partition with lots of small rows - large partition with lots of range tombstones We also collect statistics about scheduling latency induced by cache update. Example output: Small partitions, no overwrites: update: 356.809113 [ms], stall: {ticks: 396, min: 0.006867 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.358102 [ms]}, cache: 257/257 [MB] LSA: 257/257 [MB] std free: 83 [MB] update: 337.542999 [ms], stall: {ticks: 373, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.358102 [ms]}, cache: 514/514 [MB] LSA: 514/514 [MB] std free: 83 [MB] update: 383.485291 [ms], stall: {ticks: 425, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 771/788 [MB] LSA: 771/788 [MB] std free: 83 [MB] update: 574.968811 [ms], stall: {ticks: 634, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.629722 [ms], max: 1.955666 [ms]}, cache: 879/917 [MB] LSA: 879/917 [MB] std free: 83 [MB] update: 411.541138 [ms], stall: {ticks: 455, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.358102 [ms]}, cache: 787/835 [MB] LSA: 787/835 [MB] std free: 83 [MB] update: 368.491211 [ms], stall: {ticks: 408, min: 0.001332 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 750/790 [MB] LSA: 750/790 [MB] std free: 83 [MB] update: 343.671967 [ms], stall: {ticks: 380, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 734/769 [MB] LSA: 734/769 [MB] std free: 83 [MB] update: 320.277283 [ms], stall: {ticks: 357, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 724/753 [MB] LSA: 724/753 [MB] std free: 83 [MB] update: 310.583282 [ms], 
stall: {ticks: 344, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 714/740 [MB] LSA: 714/740 [MB] std free: 83 [MB] update: 303.627106 [ms], stall: {ticks: 338, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.955666 [ms]}, cache: 707/731 [MB] LSA: 707/731 [MB] std free: 83 [MB] update: 296.742523 [ms], stall: {ticks: 330, min: 0.001332 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 701/724 [MB] LSA: 701/724 [MB] std free: 83 [MB] update: 286.598541 [ms], stall: {ticks: 319, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 697/719 [MB] LSA: 697/719 [MB] std free: 83 [MB] update: 288.649323 [ms], stall: {ticks: 321, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 694/715 [MB] LSA: 694/715 [MB] std free: 83 [MB] update: 282.069916 [ms], stall: {ticks: 314, min: 0.001598 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 692/712 [MB] LSA: 692/712 [MB] std free: 83 [MB] update: 292.462036 [ms], stall: {ticks: 325, min: 0.001917 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 689/708 [MB] LSA: 689/708 [MB] std free: 83 [MB] update: 274.390442 [ms], stall: {ticks: 305, min: 0.001332 [ms], 50%: 1.131752 [ms], 90%: 1.131752 [ms], 99%: 1.131752 [ms], max: 1.131752 [ms]}, cache: 687/705 [MB] LSA: 687/705 [MB] std free: 83 [MB] invalidation: 172.617508 [ms] Large partition, lots of small rows: update: 262.132721 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.005722 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 268.650944 [ms]}, cache: 187/188 [MB] LSA: 187/188 [MB] std free: 82 [MB] update: 281.359467 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.017084 [ms], 99%: 0.017084 [ms], max: 322.381152 
[ms]}, cache: 375/376 [MB] LSA: 375/376 [MB] std free: 82 [MB] update: 287.229065 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.017084 [ms], 99%: 0.017084 [ms], max: 322.381152 [ms]}, cache: 563/564 [MB] LSA: 563/564 [MB] std free: 82 [MB] update: 1294.816284 [ms], stall: {ticks: 4, min: 0.001917 [ms], 50%: 0.005722 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 1386.179840 [ms]}, cache: 586/625 [MB] LSA: 586/625 [MB] std free: 82 [MB] update: 845.022461 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.005722 [ms], 90%: 0.017084 [ms], 99%: 0.017084 [ms], max: 962.624896 [ms]}, cache: 439/475 [MB] LSA: 439/475 [MB] std free: 82 [MB] update: 380.335938 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 386.857376 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 477.234680 [ms], stall: {ticks: 4, min: 0.002760 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 525.955017 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 548.003784 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.006866 [ms], 90%: 0.017084 [ms], 99%: 0.017084 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 528.697937 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 609.292603 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.005722 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 575.762451 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.017084 [ms], 99%: 0.017084 
[ms], max: 668.489536 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 530.801392 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 535.948364 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.017084 [ms], 99%: 0.017084 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 527.143555 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.020501 [ms], 99%: 0.020501 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] update: 521.869202 [ms], stall: {ticks: 4, min: 0.002760 [ms], 50%: 0.004768 [ms], 90%: 0.017084 [ms], 99%: 0.017084 [ms], max: 557.074624 [ms]}, cache: 599/600 [MB] LSA: 599/600 [MB] std free: 82 [MB] invalidation: 173.069733 [ms] Large partition, lots of range tombstones: update: 224.003220 [ms], stall: {ticks: 4, min: 0.001917 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 268.650944 [ms]}, cache: 52/52 [MB] LSA: 52/52 [MB] std free: 82 [MB] update: 570.882874 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 105/105 [MB] LSA: 105/105 [MB] std free: 82 [MB] update: 577.249878 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 158/158 [MB] LSA: 158/158 [MB] std free: 82 [MB] update: 580.239624 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 211/211 [MB] LSA: 211/211 [MB] std free: 82 [MB] update: 614.187134 [ms], stall: {ticks: 4, min: 0.001917 [ms], 50%: 0.004768 [ms], 90%: 0.011864 [ms], 99%: 0.011864 [ms], max: 668.489536 [ms]}, cache: 264/264 [MB] LSA: 264/264 [MB] std free: 82 [MB] update: 618.709229 [ms], 
stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.003973 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 317/317 [MB] LSA: 317/317 [MB] std free: 82 [MB] update: 626.943359 [ms], stall: {ticks: 4, min: 0.001598 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 369/370 [MB] LSA: 369/370 [MB] std free: 82 [MB] update: 602.873474 [ms], stall: {ticks: 4, min: 0.001917 [ms], 50%: 0.003973 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 422/423 [MB] LSA: 422/423 [MB] std free: 82 [MB] update: 617.522583 [ms], stall: {ticks: 4, min: 0.001598 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 475/475 [MB] LSA: 475/475 [MB] std free: 82 [MB] update: 627.291138 [ms], stall: {ticks: 4, min: 0.001598 [ms], 50%: 0.004768 [ms], 90%: 0.011864 [ms], 99%: 0.011864 [ms], max: 668.489536 [ms]}, cache: 528/528 [MB] LSA: 528/528 [MB] std free: 82 [MB] update: 623.720886 [ms], stall: {ticks: 4, min: 0.001598 [ms], 50%: 0.003973 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 581/581 [MB] LSA: 581/581 [MB] std free: 82 [MB] update: 630.735596 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 668.489536 [ms]}, cache: 634/634 [MB] LSA: 634/634 [MB] std free: 82 [MB] update: 2776.525635 [ms], stall: {ticks: 4, min: 0.002300 [ms], 50%: 0.004768 [ms], 90%: 0.014237 [ms], 99%: 0.014237 [ms], max: 2874.382592 [ms]}, cache: 687/687 [MB] LSA: 687/687 [MB] std free: 82 [MB]	2018-05-30 12:18:56 +02:00
Tomasz Grabiec	bb96518cc5	mutation_reader: Make empty mutation source advertize no partitions So that perf_row_cache_update will always populate cache.	2018-05-30 12:18:56 +02:00
Avi Kivity	dd26cf1490	Merge "db/view: Clarifications to range movement scenarios" from Duarte " This series provides reasoning and clarification for the current structure of mutate_MV(), and how we handle some scenarios related to range movements. " * 'materialized-views/clarifications/v3' of github.com:duarten/scylla: db/view: Remove ifdef'd Java code db/view: Ignore scenario where base replica hasn't joined the ring db/view: Handle case when base has no paired view replica	2018-05-29 18:51:06 +03:00
Avi Kivity	928af7701c	Merge "Implement reading clustering columns from SSTables 3.x" from Piotr " Add handling for clustering columns and tests for it. " * 'haaawk/sstables3/read-ck-v3' of ssh://github.com/scylladb/seastar-dev: Add test_uncompressed_compound_ck_read for SSTables 3.x Add test_uncompressed_simple_read for SSTables 3.x Implement reading clustering key from SSTables 3.x column_translation: cache fixed value lengths for ck data_consume_rows_context_m: use cached fixed column value lengths column_translation: store fix lengths of column values consume_row_start: change type of clustering key Rename ROW_BODY state to CLUSTERING_ROW	2018-05-29 18:49:26 +03:00
Piotr Jastrzebski	d2300bc5a9	sstable_3_x_test: Add test_uncompressed_compound_static_row_read Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-29 14:55:36 +02:00
Piotr Jastrzebski	6639ef8769	sstable_3_x_test: add test_uncompressed_static_row_read Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-29 14:55:11 +02:00
Piotr Jastrzebski	18cced2edc	flat_mutation_reader_assertions: improve static row assertions Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-29 14:52:55 +02:00
Piotr Jastrzebski	6ab660880d	data_consume_rows_context_m: Implement support for static rows Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-29 14:52:14 +02:00
Piotr Jastrzebski	c9c2fc8e4b	mp_row_consumer_m: Implement support for static rows Add consumer_m::consume_static_row_start Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2018-05-29 14:50:15 +02:00


15575 Commits