scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 03:30:49 +00:00

Files

Avi Kivity aab6b0ee27 Merge "Introduce new in-memory representation for cells" from Paweł

"
This is the first part of the first step of switching Scylla. It covers
converting cells to the new serialisation format. The actual structure
of the cells doesn't differ much from the original one with a notable
exception of the fact that large values are now fragmented and
linearisation needs to be explicit. Counters and collections still
partially rely on their old, custom serialisation code and their
handling is not optimial (although not significantly worse than it used
to be).

The new in-memory representation allows objects to be of varying size
and makes it possible to provide deserialisation context so that we
don't need to keep in each instance of an IMR type all the information
needed to interpret it. The structure of IMR types is described in C++
using some metaprogramming with the hopes of making it much easier to
modify the serialisation format that it would be in case of open-coded
serialisation functions.

Moreover, IMR types can own memory thanks to a limited support for
destructors and movers (the latter are not exactly the same thing as C++
move constructors hence a different name). This makes it (relatively)
to ensure that there is an upper bound on the size of all allocations.

For now the only thing that is converted to the IMR are atomic_cells
and collections which means that the reduction in the memory footprint
is not as big as it can be, but introducing the IMR is a big step on its
own and also paves the way towards complete elimination of unbounded
memory allocations.

The first part of this patchset contains miscellaneous preparatory
changes to various parts of the Scylla codebase. They are followed by
introduction of the IMR infrastructure. Then structure of cells is
defined and all helper functions are implemented. Next are several
treewide patches that mostly deal with propagating type information to
the cell-related operations. Finally, atomic_cell and collections are
switched to used the new IMR-based cell implementation.

The IMR is described in much more detail in imr/IMR.md added in "imr:
add IMR documentation".

Refs #2031.
Refs #2409.

perf_simple_query -c4, medians of 30 results:

        ./perf_base  ./perf_imr   diff
 read     308790.08   309775.35   0.3%
 write    402127.32   417729.18   3.9%

The same with 1 byte values:
        ./perf_base1  ./perf_imr1   diff
 read      314107.26    314648.96   0.2%
 write     463801.40    433255.96  -6.6%

The memory footprint is reduced, but that is partially due to removal of
small buffer optimisation (whether it will be restored depends on the
exact mesurements of the performance impact). Generally, this series was
not expected to make a huge difference as this would require converting
whole rows to the IMR.

Memory footprint:
Before:
mutation footprint:
 - in cache: 1264
 - in memtable: 986

After:
mutation footprint:
 - in cache: 1104
 - in memtable: 866

Tests: unit (release, debug)
"

* tag 'imr-cells/v3' of https://github.com/pdziepak/scylla: (37 commits)
  tests/mutation: add test for changing column type
  atomic_cell: switch to new IMR-based cell reperesentation
  atomic_cell: explicitly state when atomic_cell is a collection member
  treewide: require type for creating collection_mutation_view
  treewide: require type for comparing cells
  atomic_cell: introduce fragmented buffer value interface
  treewide: require type to compute cell memory usage
  treewide: require type to copy atomic_cell
  treewide: require type info for copying atomic_cell_or_collection
  treewide: require type for creating atomic_cell
  atomic_cell: require column_definition for creating atomic_cell views
  tests: test imr representation of cells
  types: provide information for IMR
  data: introduce cell
  data: introduce type_info
  imr/utils: add imr object holder
  imr: introduce concepts
  imr: add helper for allocating objects
  imr: allow creating lsa migrators for IMR objects
  imr: introduce placeholders
  ...

2018-05-31 19:21:15 +03:00

perf

treewide: require type info for copying atomic_cell_or_collection

2018-05-31 15:51:11 +01:00

snitch_property_files

…

sstables

Merge "Implement support for static rows in SSTable 3.0" from Piotr

2018-05-30 17:17:17 +03:00

aggregate_fcts_test.cc

tests: Tests for min/max aggregate functions over date/timestamp and timeuuid.

2018-01-14 13:17:09 +01:00

allocation_strategy_test.cc

…

anchorless_list_test.cc

…

auth_resource_test.cc

auth: Add code to expand a resource family

2018-02-14 14:15:59 -05:00

auth_test.cc

query_processor: require clients to specify timeout configuration

2018-05-14 09:41:06 +03:00

batchlog_manager_test.cc

treewide: require type for creating atomic_cell

2018-05-31 15:51:11 +01:00

big_decimal_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

bytes_ostream_test.cc

…

cache_flat_mutation_reader_test.cc

mvcc: Destroy memtable partition versions gently

2018-05-30 14:41:40 +02:00

caching_options_test.cc

tests: Add unit tests for caching_options.

2017-12-04 17:42:23 -08:00

canonical_mutation_test.cc

tests: reduce dependencies in test_services.hh

2018-03-12 20:05:23 +02:00

cartesian_product_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

castas_fcts_test.cc

tests: Add test that decimal obtained as CAST from integer always contain one decimal place.

2018-01-21 19:09:03 +01:00

cell_locker_test.cc

treewide: require type info for copying atomic_cell_or_collection

2018-05-31 15:51:11 +01:00

chunked_vector_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

clustering_ranges_walker_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

commitlog_test.cc

Merge seastar upstream

2018-04-29 11:03:21 +03:00

compound_test.cc

tests: compound_test: fix the 'narrowing' compilation error on Power

2017-12-08 13:38:13 -05:00

compress_test.cc

sstables/compress: Fix race condition in segmented offset reading of shared sstable

2018-02-06 12:10:10 +02:00

config_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

continuous_data_consumer_test.cc

Merge seastar upstream

2018-04-29 11:03:21 +03:00

counter_test.cc

atomic_cell: introduce fragmented buffer value interface

2018-05-31 15:51:11 +01:00

cql_assertions.cc

tests: improve usability of cql_assertions.hh error messages

2018-05-07 09:19:45 +01:00

cql_assertions.hh

tests/cql_assertions: Assert result set is not empty

2018-03-27 01:20:11 +01:00

cql_auth_query_test.cc

auth: Grant all permissions to object creator

2018-03-14 01:54:31 -04:00

cql_auth_syntax_test.cc

cql: Elaborate error for quoted user names

2018-03-01 12:06:59 -05:00

cql_query_test.cc

tests: add test for dropping a table with secondary indexes

2018-05-22 21:10:51 +02:00

cql_test_env.cc

atomic_cell: introduce fragmented buffer value interface

2018-05-31 15:51:11 +01:00

cql_test_env.hh

tests/cql_test_env: Move eventually() to this file

2018-03-27 01:20:11 +01:00

crc_test.cc

utils::crc: introduce process_le/be(T) methods

2017-12-08 10:12:21 -05:00

database_test.cc

query-result: Introduce class result_options

2018-02-01 00:22:50 +00:00

duration_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

dynamic_bitset_test.cc

Merge seastar upstream

2018-04-29 11:03:21 +03:00

ec2_snitch_test.cc

locator: de-inline production_snitch_base

2018-03-11 18:22:49 +02:00

enum_set_test.cc

enum_set: Add iterator

2018-02-14 14:15:59 -05:00

extensions_test.cc

Extract sstable::component_type to separete header

2018-04-24 11:29:57 +02:00

failure_injecting_allocation_strategy.hh

lsa: add free() that does not require object size

2018-05-09 16:52:26 +01:00

flat_mutation_reader_assertions.hh

atomic_cell: introduce fragmented buffer value interface

2018-05-31 15:51:11 +01:00

flat_mutation_reader_test.cc

treewide: require type info for copying atomic_cell_or_collection

2018-05-31 15:51:11 +01:00

flush_queue_test.cc

tests/flush_queue_test: Don't assume continuations run immediately

2018-03-05 15:22:33 +02:00

frozen_mutation_test.cc

tests: add test for frozen_mutation_fragments

2018-05-25 10:15:10 +01:00

gce_snitch_test.cc

locator: de-inline production_snitch_base

2018-03-11 18:22:49 +02:00

gossip_test.cc

service/storage_service: Allow querying the view build status

2018-03-27 01:20:10 +01:00

gossip.cc

service/storage_service: Allow querying the view build status

2018-03-27 01:20:10 +01:00

gossiping_property_file_snitch_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

hash_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

hint_test.cc

tests: reduce dependencies in test_services.hh

2018-03-12 20:05:23 +02:00

idl_test.cc

tests/idl: test variant being the first member of a structure

2018-05-25 10:15:10 +01:00

imr_test.cc

tests/imr: add tests for destructor and mover methods

2018-05-31 10:09:01 +01:00

input_stream_test.cc

…

keys_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

limiting_data_source_test.cc

Add tests for make_limiting_data_source

2018-04-16 21:00:35 +02:00

loading_cache_test.cc

tests: loading_cache_test: add a tests for a loading_cache::remove(key)/remove(iterator)

2018-05-22 20:05:01 -04:00

log_heap_test.cc

log_histogram: rename to log_heap

2017-09-18 12:44:05 +02:00

logalloc_test.cc

Merge seastar upstream

2018-04-29 11:03:21 +03:00

lsa_async_eviction_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

lsa_sync_eviction_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

make_random_string.hh

tests: simple_schema: Add missing include

2017-08-28 21:00:06 +02:00

managed_vector_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

map_difference_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

memory_footprint.cc

treewide: require type for creating atomic_cell

2018-05-31 15:51:11 +01:00

memtable_snapshot_source.hh

tests: memtable_snapshot_source: Fix compact()

2018-05-22 15:08:07 +01:00

memtable_test.cc

treewide: require type info for copying atomic_cell_or_collection

2018-05-31 15:51:11 +01:00

message.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

meta_test.cc

tests: introduce tests for metaprogramming helpers

2018-05-31 10:09:01 +01:00

murmur_hash_test.cc

…

mutation_assertions.hh

Delete unused streamed_mutation_assertions

2018-01-24 20:56:48 +01:00

mutation_diff

tests: Introduce mutation_diff script

2017-12-01 10:52:37 +01:00

mutation_fragment_test.cc

tests: reduce dependencies in test_services.hh

2018-03-12 20:05:23 +02:00

mutation_query_test.cc

treewide: require type for creating atomic_cell

2018-05-31 15:51:11 +01:00

mutation_reader_test.cc

tests/mutation_reader: disambiguate freeze() overload

2018-05-25 10:15:10 +01:00

mutation_source_test.cc

treewide: require type for creating atomic_cell

2018-05-31 15:51:11 +01:00

mutation_source_test.hh

tests: mutation_source_test: Extract random_mutation_generator::make_partition_keys()

2017-12-22 11:06:33 +01:00

mutation_test.cc

tests/mutation: add test for changing column type

2018-05-31 15:51:11 +01:00

mvcc_test.cc

treewide: require type info for copying atomic_cell_or_collection

2018-05-31 15:51:11 +01:00

network_topology_strategy_test.cc

tests: network_topology_strategy_test: peel off redundant parentheses around token initializer

2018-04-21 13:53:29 +01:00

nonwrapping_range_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

partition_data_test.cc

tests: test imr representation of cells

2018-05-31 15:51:11 +01:00

partitioner_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

perf_row_cache_update.cc

tests: Improve perf_row_cache_update

2018-05-30 12:18:56 +02:00

querier_cache.cc

test_resources_based_cache_eviction: s/assert/BOOST_REQUIRE_*/

2018-04-11 10:55:21 +03:00

query_processor_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

random-utils.hh

tests: add helpers for generating random data

2018-05-31 10:09:01 +01:00

range_assert.hh

…

range_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

range_tombstone_list_assertions.hh

tests: Introduce range_tombstone_list assertions

2017-11-07 15:33:24 +01:00

range_tombstone_list_test.cc

tests: range_tombstone_list: Do not depend on argument evaluation order

2018-02-05 12:31:37 +00:00

result_set_assertions.cc

…

result_set_assertions.hh

…

role_manager_test.cc

auth: Remove unused "performer" argument

2018-02-14 14:15:58 -05:00

row_cache_alloc_stress.cc

tests: row_cache_alloc_stress: Avoid quadratic behavior

2018-03-07 16:52:59 +01:00

row_cache_stress_test.cc

row_cache: rename make_flat_reader to make_reader

2018-01-24 20:54:45 +01:00

row_cache_test.cc

Merge "Introduce new in-memory representation for cells" from Paweł

2018-05-31 19:21:15 +03:00

row_locker_test.cc

db: add row locking metrics

2018-05-22 16:52:58 +02:00

schema_change_test.cc

db: schema_tables: Treat drop of scylla_tables.version as an alter

2018-04-27 17:12:33 +03:00

schema_registry_test.cc

schema_tables: Require context object in schema load path

2018-02-07 10:11:46 +00:00

secondary_index_test.cc

secondary index: test multiple clustering column

2018-05-24 15:56:57 +03:00

serialized_action_test.cc

tests/serialized_action_test: Don't rely on task execution order

2018-02-19 13:08:58 +00:00

simple_schema.hh

atomic_cell: introduce fragmented buffer value interface

2018-05-31 15:51:11 +01:00

snitch_reset_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

sstable_3_x_test.cc

treewide: require type for creating atomic_cell

2018-05-31 15:51:11 +01:00

sstable_assertions.hh

Merge the pair of index_readers into just one tracking a range.

2018-03-29 15:23:31 +03:00

sstable_datafile_test.cc

atomic_cell: introduce fragmented buffer value interface

2018-05-31 15:51:11 +01:00

sstable_mutation_test.cc

treewide: require type to copy atomic_cell

2018-05-31 15:51:11 +01:00

sstable_resharding_test.cc

database, sstables, tests: add large_partition_handler

2018-05-04 14:38:13 +02:00

sstable_test.cc

treewide: require type for creating atomic_cell

2018-05-31 15:51:11 +01:00

sstable_test.hh

atomic_cell: introduce fragmented buffer value interface

2018-05-31 15:51:11 +01:00

sstable_utils.cc

database, sstables, tests: add large_partition_handler

2018-05-04 14:38:13 +02:00

sstable_utils.hh

database, sstables, tests: add large_partition_handler

2018-05-04 14:38:13 +02:00

storage_proxy_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

streaming_histogram_test.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

test_services.hh

tests: reduce dependencies in test_services.hh

2018-03-12 20:05:23 +02:00

test-serialization.cc

Move thread_local declarations out of main.cc

2017-11-27 20:27:42 +01:00

tmpdir.hh

…

total_order_check.hh

Remove to_boost_visitor heler.

2018-03-14 23:49:07 +00:00

types_test.cc

types: Make seastar::inet_address the "native" type for CQL inet.

2018-04-24 23:12:07 +01:00

UUID_test.cc

…

view_build_test.cc

tests/view_build_test: Add tests for view building

2018-03-27 01:20:11 +01:00

view_complex_test.cc

tests/view_complex_test.cc: fix and enable buggy test

2018-05-30 15:39:25 +01:00

view_schema_test.cc

tests/view_schema_test: Test view correctness under base schema changes

2018-05-31 12:10:50 +03:00

vint_serialization_test.cc

Cover serialized_size_from_first_byte in tests

2018-04-16 20:26:44 +02:00

virtual_reader_test.cc

tests: Add unit test for build_progress_virtual_reader

2018-03-27 01:20:10 +01:00