scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-25 19:10:42 +00:00

Files

Duarte Nunes 5e9cd128ad Merge 'Single key sstable reader optimization' from Botond

"When reading a single row it is possible that the read will be satisfied
by just reading from one of the data source candidates. To exploit this
an optimization is employed which sorts data source candidates by their
timestamp and reads mutations from the most recent to the oldest. When
all needed cells are present and their earliest timestamp is still
later than the latest one of the remaining data source the read can be
terminated early.
However this optimization also has the possibility to backfire as the
data sources are read sequentially, so if all of them has to be read
eventually then we will end up worse then without it.
Thus the optimization can be disabled up-front or enabled to only run
until its efficiency degrades below a certain threshold.
Also counters are added to column-families to make it possible to
observe how well it performs.

Benchmarking

Benchmarking was done with disabled cache and at a constant op rate of
4k (1/3 of the max op rate on my box), against 3 sstables containing the
same 10000 rows.

1) Optimization turned off (all sstables read paralelly)
latency mean              : 1.3 [simple:1.3]
latency median            : 1.0 [simple:1.0]
latency 95th percentile   : 2.4 [simple:2.4]
latency 99th percentile   : 2.9 [simple:2.9]
latency 99.9th percentile : 8.0 [simple:8.0]
latency max               : 13.5 [simple:13.5]

2) Optimization turned on, best case (1 of 3 sstables read)
latency mean              : 0.6 [simple:0.6]
latency median            : 0.6 [simple:0.6]
latency 95th percentile   : 1.0 [simple:1.0]
latency 99th percentile   : 1.2 [simple:1.2]
latency 99.9th percentile : 4.4 [simple:4.4]
latency max               : 13.4 [simple:13.4]

3) Optimization turned on, best case, IN query (1 of 3 sstables read)
latency mean              : 0.7 [simple_in:0.7]
latency median            : 0.6 [simple_in:0.6]
latency 95th percentile   : 1.1 [simple_in:1.1]
latency 99th percentile   : 1.4 [simple_in:1.4]
latency 99.9th percentile : 5.4 [simple_in:5.4]
latency max               : 16.8 [simple_in:16.8]

4) Optimization turned on, worst case (3 of 3 sstables read sequentally)
latency mean              : 2.8 [simple:2.8]
latency median            : 2.3 [simple:2.3]
latency 95th percentile   : 5.4 [simple:5.4]
latency 99th percentile   : 6.5 [simple:6.5]
latency 99.9th percentile : 13.5 [simple:13.5]
latency max               : 19.2 [simple:19.2]

5) Optimization turned on, mid case (2 of 3 sstables read sequentally)
latency mean              : 1.4 [simple:1.4]
latency median            : 1.1 [simple:1.1]
latency 95th percentile   : 2.7 [simple:2.7]
latency 99th percentile   : 3.2 [simple:3.2]
latency 99.9th percentile : 7.7 [simple:7.7]
latency max               : 15.1 [simple:15.1]"

Ref #324

* 'bdenes/optimize_single_row_read_v6' of github.com:denesb/scylla:
  Add unit tests for single_key_sstable_reader
  Add counters for the single-key reader optimization
  Add single_key_parallel_scan_threshold option
  single_key_sstable_reader: optimize single-row queries
  single_key_sstable_reader: move reading code into it's own method
  Add selects_only_full_rows() and selects_only_full_rows_with_atomic_columns()

2017-10-18 16:38:53 +01:00

perf

tests: Fix compile errors introduced in c468e5981

2017-10-18 16:38:18 +01:00

snitch_property_files

tests: added ec2_snitch_test

2015-10-08 20:57:20 +03:00

sstables

tests/sstables: add test for reading wrong-order counter cells

2017-09-05 10:32:48 +01:00

allocation_strategy_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

anchorless_list_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

auth_test.cc

tests: Fix compile errors introduced in c468e5981

2017-10-18 16:38:18 +01:00

batchlog_manager_test.cc

untyped_result_set: reduce dependencies

2017-09-18 15:15:15 +02:00

bytes_ostream_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

cache_streamed_mutation_test.cc

Replace query::full_slice with schema::full_slice()

2017-10-17 11:25:53 +02:00

canonical_mutation_test.cc

tests/canonical_mutation: don't try to upgrade incompatible schemas

2017-02-07 15:17:14 +00:00

cartesian_product_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

castas_fcts_test.cc

tests: Add test for CAST AS functions.

2017-10-07 21:05:53 +02:00

cell_locker_test.cc

cell_locker: add metrics for lock acquisition

2017-03-02 09:05:12 +00:00

chunked_vector_test.cc

tests: add test for chunked_vector

2017-08-26 16:44:47 +03:00

clustering_ranges_walker_test.cc

tests: Introduce clustering_ranges_walker_test

2017-08-28 21:08:55 +02:00

commitlog_test.cc

tests: commitlog: Check there are no segments left on disk after clean shutdown

2017-07-09 19:25:27 +03:00

compound_test.cc

sstables: avoid copying key components

2017-07-26 14:38:27 +01:00

compress_test.cc

segmented_offsets: use _current_bucket_segment_index consistently

2017-08-28 16:14:25 +03:00

config_test.cc

Fix compile errors in tests/config_test.cc introduced by c468e5981

2017-10-18 15:20:45 +01:00

counter_test.cc

tests/counter: verify counter_id ordering

2017-09-05 10:52:54 +01:00

cql_assertions.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

cql_assertions.hh

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

cql_query_test.cc

tests: Fix cql_query_test.cc::test_duration_restrictions

2017-09-06 15:49:03 +03:00

cql_test_env.cc

Merge "loading_shared_values and size limited and evicting prepared statements cache" from Vlad

2017-10-04 09:13:32 +01:00

cql_test_env.hh

cql3::query_processor: implement CQL and Thrift prepared statements caches using cql3::prepared_statements_cache

2017-09-15 22:19:15 -04:00

crc_test.cc

utils: Put crc32 under utils namespace

2016-12-05 11:48:29 +02:00

database_test.cc

storage_proxy: pass maximum result size to replicas

2016-12-22 17:16:23 +01:00

duration_test.cc

duration_test.cc: Add test for printing zero duration

2017-08-10 14:11:30 -04:00

dynamic_bitset_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

ec2_snitch_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

flat_mutation_reader_test.cc

Add tests for flat_mutation_reader

2017-10-13 16:08:59 +02:00

flush_queue_test.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

frozen_mutation_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

gossip_test.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

gossip.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

gossiping_property_file_snitch_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

hash_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

idl_test.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

input_stream_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

keys_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

loading_cache_test.cc

tests: loading_cache_test: initial commit

2017-09-15 22:19:15 -04:00

log_heap_test.cc

log_histogram: rename to log_heap

2017-09-18 12:44:05 +02:00

logalloc_test.cc

Remove use of std::random_shuffle()

2017-06-26 09:36:38 +02:00

lsa_async_eviction_test.cc

tests: lsa_async_eviction_test: Allocate objects under allocating section

2017-03-16 10:21:10 +01:00

lsa_sync_eviction_test.cc

Remove use of std::random_shuffle()

2017-06-26 09:36:38 +02:00

make_random_string.hh

tests: simple_schema: Add missing include

2017-08-28 21:00:06 +02:00

managed_vector_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

map_difference_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

memory_footprint.cc

treewide: use shared_sstable, make_sstable in place of lw_shared_ptr<sstable>

2017-09-12 10:43:05 +03:00

memtable_snapshot_source.hh

Replace query::full_slice with schema::full_slice()

2017-10-17 11:25:53 +02:00

memtable_test.cc

Add test to reproduce #2854

2017-09-29 15:17:53 +02:00

message.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

murmur_hash_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

mutation_assertions.hh

tests: streamed_mutation_assertions: Introduce produces(mutation&)

2017-09-13 17:47:03 +02:00

mutation_query_test.cc

Allow reading exactly desired byte ranges and fast_forward_to

2017-06-19 18:31:32 +03:00

mutation_reader_assertions.hh

tests: do not overload the meaning of empty clustering range

2017-07-25 15:28:12 +02:00

mutation_reader_test.cc

Add unit tests for single_key_sstable_reader

2017-10-18 17:24:03 +03:00

mutation_source_test.cc

Replace query::full_slice with schema::full_slice()

2017-10-17 11:25:53 +02:00

mutation_source_test.hh

tests: Add range generators to random_mutation_generator

2017-02-23 18:50:53 +01:00

mutation_test.cc

tests: Extract mvcc tests to separate file

2017-09-13 17:47:04 +02:00

mvcc_test.cc

tests: mvcc: Add test for partition_snapshot_row_cursor

2017-09-25 11:21:58 +02:00

network_topology_strategy_test.cc

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

nonwrapping_range_test.cc

Convert to use dht::partition_range

2016-12-19 08:04:30 +08:00

partitioner_test.cc

tests: Add test_selective_token_range_sharder

2017-07-04 18:46:19 +08:00

perf_row_cache_update.cc

row_cache: Improve safety of cache updates

2017-09-04 10:04:29 +02:00

query_processor_test.cc

untyped_result_set: reduce dependencies

2017-09-18 15:15:15 +02:00

range_assert.hh

Fix pre-ScyllaDB copyright statements

2016-04-08 08:12:47 +03:00

range_test.cc

tests: Add test case for nonwrapping_range::intersection()

2017-05-17 10:33:18 +02:00

range_tombstone_list_test.cc

mutation_partition_serializer: Assume range tombstone support

2017-06-15 09:54:05 +03:00

result_set_assertions.cc

Merge "Fix query digest mismatch" from Tomasz

2016-04-08 12:13:29 +03:00

result_set_assertions.hh

Merge "Fix query digest mismatch" from Tomasz

2016-04-08 12:13:29 +03:00

row_cache_alloc_stress.cc

row_cache: Improve safety of cache updates

2017-09-04 10:04:29 +02:00

row_cache_stress_test.cc

Replace query::full_slice with schema::full_slice()

2017-10-17 11:25:53 +02:00

row_cache_test.cc

Replace query::full_slice with schema::full_slice()

2017-10-17 11:25:53 +02:00

schema_change_test.cc

Merge "loading_shared_values and size limited and evicting prepared statements cache" from Vlad

2017-10-04 09:13:32 +01:00

schema_registry_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

serialized_action_test.cc

tests/serialized_action: add missing forced defers

2017-07-31 11:35:24 +01:00

simple_schema.hh

Add unit tests for single_key_sstable_reader

2017-10-18 17:24:03 +03:00

single_key_sstable_reader_test_cases.cc

Add unit tests for single_key_sstable_reader

2017-10-18 17:24:03 +03:00

snitch_reset_test.cc

build: support for linking statically with boost

2016-10-26 08:51:21 +03:00

sstable_assertions.hh

tests/sstable_mutation_test: Test promoted index blocks are monotonic

2017-07-27 18:23:58 +02:00

sstable_atomic_deletion_test.cc

sstables: add unit tests for atomic deletion

2016-11-04 15:48:43 +02:00

sstable_datafile_test.cc

Replace query::full_slice with schema::full_slice()

2017-10-17 11:25:53 +02:00

sstable_mutation_test.cc

treewide: use shared_sstable, make_sstable in place of lw_shared_ptr<sstable>

2017-09-12 10:43:05 +03:00

sstable_resharding_test.cc

treewide: use shared_sstable, make_sstable in place of lw_shared_ptr<sstable>

2017-09-12 10:43:05 +03:00

sstable_test.cc

sstables: Get rid of [[deprecated]] index_reader::get_index_entries()

2017-10-08 12:18:52 +03:00

sstable_test.hh

Add unit tests for single_key_sstable_reader

2017-10-18 17:24:03 +03:00

sstable_utils.cc

Add restricted_reader_test unit test

2017-10-03 12:44:17 +03:00

sstable_utils.hh

Add combined_mutation_reader_test unit test

2017-08-10 12:38:10 +03:00

storage_proxy_test.cc

Convert to use dht::partition_range_vector and dht::token_range_vector

2016-12-19 14:08:50 +08:00

streamed_mutation_test.cc

Introduce partition_start mutation_fragment

2017-10-10 16:15:59 +02:00

streaming_histogram_test.cc

tests: add streaming_histogram_test

2017-06-29 02:08:12 -03:00

test_services.hh

Merge seatar upstream (seastar namespace)

2017-05-21 12:26:15 +03:00

test-serialization.cc

utils::serialization: remove not used deserialization_xxx() functions

2017-05-26 19:26:20 +03:00

tmpdir.hh

tests: move tmpdir to /tmp

2017-07-16 11:55:08 +02:00

total_order_check.hh

utils: Extract to_boost_visitor() to a separate header

2017-05-22 19:30:02 +02:00

types_test.cc

Support duration CQL native type

2017-08-10 15:01:10 -04:00

UUID_test.cc

utils::UUID: operator< should behave as comparison of hex strings/bytes

2017-02-22 09:19:22 +00:00

view_schema_test.cc

tests: Fix compile errors introduced in c468e5981

2017-10-18 16:38:18 +01:00

vint_serialization_test.cc

CQL native protocol: Add support for vint serialization

2017-08-10 14:11:30 -04:00

virtual_reader_test.cc

untyped_result_set: reduce dependencies

2017-09-18 15:15:15 +02:00