scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-29 19:21:01 +00:00

Files

Pavel Emelyanov 2f7c03d84c utils: Intrusive B-tree (with tests)

The design of the tree goes from the row-cache needs, which are

1. Insert/Remove do not invalidate iterators
2. Elements are LSA-manageable
3. Low key overhead
4. External tri-comparator
5. As little actions on insert/remove as possible

With the above the design is

Two types of nodes -- inner and leaf. Both types keep pointer on parent nodes
and N pointers on keys (not keys themselves). Two differences: inner nodes have
array of pointers on kids, leaf nodes keep pointer on the tree (to update left-
and rightmost tree pointers on node move).

Nodes do not keep pointers/references on trees, thus we have O(1) move of any
object, but O(logN) to get the tree size. Fortunately, with big keys-per-node
value this won't result in too many steps.

In turn, the tree has 3 pointers -- root, left- and rightmost leaves. The latter
is for constant-time begin() and end().

Keys are managed by user with the help of embeddable member_hook instance,
which is 1 pointer in size.

The code was copied from the B+ tree one, then heavily reworked, the internal
algorythms turned out to differ quite significantly.

For the sake of mutation_partition::apply_monotonically(), which needs to move
an element from one tree into another, there's a key_grabber helping wrapper
that allows doing this move respecting the exception-safety requirement.

As measured by the perf_collections test the B-tree with 8 keys is faster, than
the std::set, but slower than the B+tree:

            vs set        vs b+tree
   fill:     +13%           -6%
   find:     +23%          -35%

Another neat thing is that 1-key insertion-removal is ~40% faster than
for BST (the same number of allocations, but the key object is smaller,
less pointers to set-up and less instructions to execute when linking
node with root).

v4:
- equip insertion methods with on_alloc_point() calls to catch
  potential exception guarantees violations eariler

- add unlink_leftmost_without_rebalance. The method is borrowed from
  boost intrusive set, and is added to kill two birds -- provide it,
  as it turns out to be popular, and use a bit faster step-by-step
  tree destruction than plain begin+erase loop

v3:
- introduce "inline" root node that is embedded into tree object and in
  which the 1st key is inserted. This greatly improves the 1-key-tree
  performance, which is pretty common case for rows cache

v2:
- introduce "linear" root leaf that grows on demand

  This improves the memory consumption for small trees. This linear node may
  and should over-grow the NodeSize parameter. This comes from the fact that
  there are two big per-key memory spikes on small trees -- 1-key root leaf
  and the first split, when the tree becomes 1-key root with two half-filled
  leaves. If the linear extention goes above NodeSize it can flatten even the
  2nd peak

- mitigate the keys indirection a bit

  Prefetching the keys while doing the intra-node linear scan and the nodes
  while descending the tree gives ~+5% of fill and find

- generalize stress tests for B and B+ trees

- cosmetic changes

TODO:

- fix few inefficincies in the core code (walks the sub-tree twice sometimes)
- try to optimize the leaf nodes, that are not lef-/righmost not to carry
  unused tree pointer on board

Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>

2021-02-02 09:30:29 +03:00

arch/powerpc/crc32-vpmsum

…

utils: add missing include for ssize_t

2020-01-30 14:10:18 +02:00

allocation_strategy.hh

allocation_strategy: set preferred max contiguous allocation to 128k for standard allocations

2021-01-21 11:15:13 +02:00

anchorless_list.hh

…

array-search.cc

utils: array-search: deinline, working around clang bug

2020-10-11 10:29:24 +03:00

array-search.hh

utils/array-search: document restrictions

2020-10-11 15:19:54 +03:00

ascii.cc

utils: add fast ascii string validation

2018-12-24 09:58:08 +02:00

ascii.hh

utils: add fast ascii string validation

2018-12-24 09:58:08 +02:00

atomic_vector.hh

atomic_vetor: Don't pass references to callbacks

2020-04-23 16:06:37 +03:00

big_decimal.cc

utils: big_decimal: work around clang difficulty with boost::cpp_int(string_view) constructor

2020-10-11 22:09:19 +03:00

big_decimal.hh

big_decimal: Add a as_rational member function

2020-06-25 15:33:31 -07:00

bloom_calculations.cc

…

bloom_calculations.hh

utils: bloom_calculations: avoid gratuitous conversion to double

2020-09-22 17:24:33 +03:00

bloom_filter.cc

Update seastar submodule

2020-03-23 11:59:30 +02:00

bloom_filter.hh

…

bounded_stats_deque.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

bptree.hh

utils: bptree: remove redundant and possibly wrong friend declaration

2020-09-22 17:24:33 +03:00

buffer_input_stream.cc

…

buffer_input_stream.hh

…

build_id.cc

utils: build_id: fix ubsan false positive on pointer arithmetic

2020-10-11 17:23:40 +03:00

build_id.hh

build_id: mv sources to utils/

2020-08-03 15:55:16 +03:00

cached_file.hh

utils: cached_file: Fix compilation error

2020-06-18 14:08:29 +03:00

chunked_vector.hh

utils/chunked_vector: reserve_partial(): better explain how to properly use

2020-11-10 15:45:01 +02:00

class_registrator.hh

class_registry: Use std::string_view in (un)?qualified_name

2020-07-03 12:28:14 -07:00

clmul.hh

utils: Extract clmul() from crc.hh

2018-12-03 14:36:08 +01:00

collection-concepts.hh

intrusive-array: Array with trusted bounds

2020-07-14 16:29:49 +03:00

config_file_impl.hh

utils: config_src::add_command_line_options(): drop name and desc args

2020-07-28 18:00:29 +03:00

config_file.cc

utils: config_src::add_command_line_options(): drop name and desc args

2020-07-28 18:00:29 +03:00

config_file.hh

db/config_file.hh: named_value: remove unused members _name and _desc

2020-08-03 12:51:16 +03:00

coroutine.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

crc.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

data_input.hh

Replace std::experimental types with C++17 std version.

2019-01-08 13:16:36 +02:00

date.h

…

directories.cc

directories.cc: prepare for use outside main.cc

2020-11-17 10:15:47 +01:00

directories.hh

directories.cc: prepare for use outside main.cc

2020-11-17 10:15:47 +01:00

disk-error-handler.cc

utils: Move disk-error-handler into it

2020-02-09 17:26:52 +02:00

disk-error-handler.hh

utils: do_io_check: adjust indentation

2020-08-06 19:01:18 +03:00

div_ceil.hh

…

double-decker.hh

memtable: Switch onto B+ rails

2020-07-14 16:30:02 +03:00

dynamic_bitset.cc

…

dynamic_bitset.hh

…

enum_option.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

error_injection.cc

utils: error_injection: remove forward-declared function returning auto

2020-09-22 17:24:33 +03:00

error_injection.hh

utils: error_injection: remove forward-declared function returning auto

2020-09-22 17:24:33 +03:00

estimated_histogram.hh

approx_exponential_histogram: Makes the implementation clearer

2020-06-18 14:18:21 +03:00

exceptions.cc

utils: Use on_internal_error from seastar

2020-02-29 19:28:57 +02:00

exceptions.hh

utils: Use on_internal_error from seastar

2020-02-29 19:28:57 +02:00

exponential_backoff_retry.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

extremum_tracking.hh

utils/extremum_tracking: drop default constructor

2019-02-07 18:31:25 +02:00

fb_utilities.hh

fb_utilities.hh: mark methods noexcept

2020-11-01 16:46:18 +02:00

file_lock.cc

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

file_lock.hh

file_lock: Work with fs::path, not sstring

2019-12-12 17:32:10 +03:00

flush_queue.hh

flush_queue: call_helper: support no variadic futures

2020-10-11 14:40:32 +03:00

fragment_range.hh

utils: fragment_range: add a fragment iterator for FragmentedView

2021-01-15 14:05:44 +01:00

fragmented_temporary_buffer.hh

utils/fragment_temporary_buffer: don't push empty fragment if data size is fragment-aligned

2021-01-30 20:54:20 +02:00

generation-number.cc

storage_service: Move get_generation_number to util/

2020-06-01 09:08:40 +03:00

generation-number.hh

storage_service: Move get_generation_number to util/

2020-06-01 09:08:40 +03:00

hash.hh

database: Use a flat_hash_map for _ks_cf_to_uuid

2020-06-14 08:18:39 -07:00

histogram_metrics_helper.hh

approx_exponential_histogram: Makes the implementation clearer

2020-06-18 14:18:21 +03:00

histogram.hh

moving avarage rate: Keep computed rates in zero until they are

2020-11-04 11:13:59 +02:00

human_readable.cc

utils: add to_hr_size()

2020-10-13 12:32:14 +03:00

human_readable.hh

utils: add to_hr_size()

2020-10-13 12:32:14 +03:00

i_filter.cc

utils: convert sprint() to format()

2018-11-01 13:16:17 +00:00

i_filter.hh

…

in.hh

treewide: add const qualifiers throughout the code base

2019-11-26 02:24:49 +03:00

input_stream.hh

…

int_range.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

intrusive_btree.hh

utils: Intrusive B-tree (with tests)

2021-02-02 09:30:29 +03:00

intrusive-array.hh

intrusive-array: Array with trusted bounds

2020-07-14 16:29:49 +03:00

joinpoint.hh

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

large_bitset.cc

utils/large_bitset: use reserve_partial() to reserve _storage

2020-11-02 18:03:19 +02:00

large_bitset.hh

…

latency.hh

…

like_matcher.cc

cql3: Allow repeated LIKE on same column

2020-02-27 09:34:51 -05:00

like_matcher.hh

cql3: Allow repeated LIKE on same column

2020-02-27 09:34:51 -05:00

limiting_data_source.cc

…

limiting_data_source.hh

…

linearizing_input_stream.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

loading_cache.hh

Update seastar submodule

2020-08-04 17:54:45 +03:00

loading_shared_values.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

log_heap.hh

log_heap: Remove std::iterator from hist_iterator

2020-11-17 16:53:20 +01:00

logalloc.cc

Replace disable_failure_guard with scoped_critical_alloc_section

2020-11-17 16:01:25 +02:00

logalloc.hh

utils: logalloc: add lsa_global_occupancy_stats()

2020-11-17 15:13:21 +02:00

managed_bytes.cc

utils: remove unused linearization facilities in managed_bytes class

2021-01-08 14:16:08 +01:00

managed_bytes.hh

memtable: fix accounting of managed_bytes in partition_snapshot_accounter

2021-01-15 18:21:13 +01:00

managed_ref.hh

managed_ref: add external_memory_usage()

2019-10-15 15:41:42 +03:00

managed_vector.hh

…

maybe_yield.hh

locator: extract can_yield to utils/maybe_yield.hh

2020-11-24 12:23:56 +02:00

memory_data_sink.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

meta.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

multiprecision_int.cc

utils: introduce multiprecision_int

2020-03-04 12:42:57 +02:00

multiprecision_int.hh

utils: multiprecision_int: disambiguate operator templates by adding overloads

2020-09-22 17:24:33 +03:00

murmur_hash.cc

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

murmur_hash.hh

everywhere: Insert space after switch

2020-08-18 14:31:04 +03:00

mutable_view.hh

utils: mutable_view: add substr()

2021-01-08 13:17:46 +01:00

neat-object-id.hh

utils: B+ tree implementation

2020-07-14 16:29:43 +03:00

observable.hh

…

overloaded_functor.hh

…

phased_barrier.hh

utils: phased_barrier: add operations_in_progress()

2020-11-17 15:13:21 +02:00

preempt.hh

treewide: add missing headers and/or forward declarations

2020-03-23 09:29:45 +02:00

ranges.hh

utils: to_range(): relax constraint

2020-10-18 18:16:30 +03:00

rate_limiter.cc

Replace std::experimental types with C++17 std version.

2019-01-08 13:16:36 +02:00

rate_limiter.hh

treewide: replace calls to engine().some_api() with some_api()

2020-04-05 12:46:04 +03:00

reusable_buffer.hh

treewide: update concepts language from the Concepts TS to C++20

2020-06-02 09:12:21 +03:00

rjson.cc

rjson: Add templated get/set overloads and optional get<T>

2020-07-15 08:10:23 +00:00

rjson.hh

rjson: Add templated get/set overloads and optional get<T>

2020-07-15 08:10:23 +00:00

runtime.cc

…

runtime.hh

…

sequenced_set.hh

Update seastar submodule

2020-03-23 11:59:30 +02:00

serialization.hh

utils: fragment_range: add serialization helpers for FragmentedMutableView

2021-01-08 14:16:07 +01:00

serialized_action.hh

serialized_action: trigger: propagate action error

2020-10-14 16:45:21 +03:00

small_vector.hh

utils/small_vector: Add missing include

2020-03-03 21:23:40 +02:00

stall_free.hh

utils: Add clear_gently

2020-08-11 19:37:47 +08:00

streaming_histogram.hh

streaming_histogram: add missing include for uint64_t

2020-05-23 11:09:10 +03:00

top_k.hh

Merge "nodetool toppartitions" from Rafi & Avi

2018-12-28 16:31:24 +01:00

updateable_value.cc

utils: updateable_value: fix nullptr_t name

2020-09-22 17:24:33 +03:00

updateable_value.hh

utils: updateable_value: fix nullptr_t name

2020-09-22 17:24:33 +03:00

utf8.cc

utils: utf8: expose validate_partial() in a header

2020-10-21 11:14:44 +03:00

utf8.hh

utils: fragment_range: add a fragment iterator for FragmentedView

2021-01-15 14:05:44 +01:00

UUID_gen.cc

lists: use query timestamp for list cell values during append

2021-01-21 13:03:59 +03:00

UUID_gen.hh

test: add tests for legacy uuid compare & msb monotonicity

2021-01-21 13:03:59 +03:00

uuid.cc

Update seastar submodule

2018-11-21 00:01:44 +02:00

UUID.hh

uuid: add a comment warning against UUID::operator<

2021-01-21 13:03:59 +03:00