scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-30 11:36:54 +00:00

Author	SHA1	Message	Date
Konstantin Osipov	2b8ce83eea	lists: use query timestamp for list cell values during append Scylla list cells are represented internally as a map of timeuuid => value. To append a new value to a list the coordinator generates a timeuuid reflecting the current time as key and adds a value to the map using this key. Before this patch, Scylla always generated a timeuuid for a new value, even if the query had a user supplied or LWT timestamp. This could break LWT linearizability. User supplied timestamps were ignored. This is reported as https://github.com/scylladb/scylla/issues/7611 A statement which appended multiple values to a list or a BATCH generated an own microsecond-resolution timeuuid for each value: BEGIN BATCH UPDATE ... SET a = a + [3] UPDATE ... SET a = a + [4] APPLY BATCH UPDATE ... SET a = a + [3, 4] To fix the bug, it's necessary to preserve monotonicity of timeuuids within a batch or multi-value append, but make sure they all use the microsecond time, as is set by LWT or user. To explain the fix, it's first necessary to recall the structure of time-based UUIDs: 60 bits: time since start of GMT epoch, year 1582, represented in 100-nanosecond units 4 bits: version 14 bits: clock sequence, a random number to avoid duplicates in case system clock is adjusted 2 bits: type 48 bits: MAC address (or other hardware address) The purpose of clockseq bits is as defined in https://tools.ietf.org/html/rfc4122#section-4.1.5 is to reduce the probability of UUID collision in case clock goes back in time or node id changes. The implementation should reset it whenever one of these events may occur. Since LWT microsecond time is guaranteed to be unique by Paxos, the RFC provisioning for clockseq and MAC slots becomes excessive. The fix thus changes timeuuid slot content in the following way: - time component now contains the same microsecond time for all values of a statement or a batch. The time is unique and monotonic in case of LWT. Otherwise it's most always monotonic, but may not be unique if two timestamps are created on different coordinators. - clockseq component is used to store a sequence number which is unique and monotonic for all values within the statement/batch. - to protect against time back-adjustments and duplicates if time is auto-generated, MAC component contains a random (spoof) MAC address, re-created on each restart. The address is different at each shard. The change is made for all sources of time: user, generated, LWT. Conditioning the list key generation algorithm on the source of time would unnecessarily complicate the code while not increase quality (uniqueness) of created list keys. Since 14 bits of clockseq provide us with only 16383 distinct slots per statement or batch, 3 extra bits in nanosecond part of the time are used to extend the range to 131071 values per statement/batch. If the rang is exceeded beyond the limit, an exception is produced. A twist on the use of clockseq to extend timeuuid uniqueness is that Scylla, like Cassandra, uses int8 compare to compare lower bits of timeuuid for ordering. The patch takes this into account and sign-complements the clockseq value to make it monotonic according to the legacy compare function. Fixes #7611 test: unit (dev)	2021-01-21 13:03:59 +03:00
Konstantin Osipov	6d1781be36	uuid: fill in UUID node identifier part of UUID Before this patch, UUID generation code was not creating sufficiently unique IDs: the 6 byte node identifier was mostly empty, i.e. only containing shard id. This could lead to collisions between queries executed concurrently at different coordinators, and, since timeuuid is used as key in list append and prepend operations, lead to lost updates. To generate a unique node id, the patch uses a combination of hardware MAC address (or a random number if no hardware address is available) and the current shard id. The shard id is mixed into higher bits of MAC, to reduce the chances on NIC collision within the same network. With sufficiently unique timeuuids as list cell keys, such updates are no longer lost, but multi-value update can still be "merged" with another multi-value update. E.g. if node A executes SET l = l + [4, 5] and node B executes SET l = l + [6, 7], the list value could be any of [4, 5, 6, 7], [4, 6, 5, 7], [6, 4, 5, 7] and so on. At least we are now less likely to get any value lost. Fixes #6208. @todo: initialize UUID subsystem explicitly in main() and switch to using seastar::engine().net().network_interfaces() test: unit (dev)	2021-01-21 13:03:53 +03:00
Avi Kivity	a11ecfe231	Merge 'types: don't linearize in validate()' from Michał Chojnowski A sequel to #7692. This series gets rid of linearization when validating collections and tuple types. (Other types were already validated without linearizing). The necessary helpers for reading from fragmented buffers were introduced in #7692. All this series does is put them to use in `validate()`. Refs: #6138 Closes #7770 * github.com:scylladb/scylla: types: add single-fragment optimization in validate() utils: fragment_range: add with_simplified() cql3: statements: select_statement: remove unnecessary use of with_linearized cql3: maps: remove unnecessary use of with_linearized cql3: lists: remove unnecessary use of with_linearized cql3: tuples: remove unnecessary use of with_linearized cql3: sets: remove unnecessary use of with_linearized cql3: tuples: remove unnecessary use of with_linearized cql3: attributes: remove unnecessary uses of with_linearized types: validate lists without linearizing types: validate tuples without linearizing types: validate sets without linearizing types: validate maps without linearizing types: template abstract_type::validate on FragmentedView types: validate_visitor: transition from FragmentRange to FragmentedView utils: fragmented_temporary_buffer: add empty() to FragmentedView utils: fragmented_temporary_buffer: don't add to null pointer	2020-12-11 17:33:59 +02:00
Michał Chojnowski	150473f074	types: add single-fragment optimization in validate() Manipulating fragmented views is costlier that manipulating contiguous views, so let's detect the common situation when the fragmented view is actually contiguous underneath, and make use of that. Note: this optimization is only useful for big types. For trivial types, validation usually only checks the size of the view.	2020-12-11 09:53:07 +01:00
Michał Chojnowski	e2d17879fc	utils: fragment_range: add with_simplified() Reading from contiguous memory (bytes_view) is significantly simpler runtime-wise than reading from a fragmented view, due to less state and less branching, so we often want to convert a fragmented view to a simple view before processing it, if the fragmented view contains at most one fragment, which is common. with_simplified() does just that.	2020-12-11 09:53:07 +01:00
Michał Chojnowski	15dbe00e8a	types: validate_visitor: transition from FragmentRange to FragmentedView This will allow us to easily get rid of linearizations when validating collections and tuples, because the helpers used in validate_aux() already have FragmentedView overloads.	2020-12-11 09:53:07 +01:00
Michał Chojnowski	3647c0ba47	utils: fragmented_temporary_buffer: add empty() to FragmentedView It's redundant with size_bytes(), but sometimes empty() is more readable and reduces churn when replacing other types with FragmentedView.	2020-12-11 09:53:07 +01:00
Michał Chojnowski	b4dd5d3bdb	utils: fragmented_temporary_buffer: don't add to null pointer When fragmented_temporary_buffer::view is created from a bytes_view, _current is null. In that case, in remove_current(), null pointer offset happens, and ubsan complains. Fix that.	2020-12-11 09:53:07 +01:00
Michał Chojnowski	60a3cecfea	utils: fragment_range: use range-based for loop instead of boost::for_each We want to pass bytes_ostream to this loop in later commits. bytes_ostream does not conform to some boost concepts required by boost::for_each, so let's just use C++'s native loop.	2020-12-07 12:50:36 +01:00
Piotr Sarna	2015988373	Merge 'types: get rid of linearization in deserialize()' from Michał Chojnowski Citing #6138: > In the past few years we have converted most of our codebase to work in terms of fragmented buffers, instead of linearised ones, to help avoid large allocations that put large pressure on the memory allocator. > One prominent component that still works exclusively in terms of linearised buffers is the types hierarchy, more specifically the de/serialization code to/from CQL format. Note that for most types, this is the same as our internal format, notable exceptions are non-frozen collections and user types. > > Most types are expected to contain reasonably small values, but texts, blobs and especially collections can get very large. Since the entire hierarchy shares a common interface we can either transition all or none to work with fragmented buffers. This series gets rid of intermediate linearizations in deserialization. The next steps are removing linearizations from serialization, validation and comparison code. Series summary: - Fix a bug in `fragmented_temporary_buffer::view::remove_prefix`. (Discovered while testing. Since it wasn't discovered earlier, I guess it doesn't occur in any code path in master.) - Add a `FragmentedView` concept to allow uniform handling of various types of fragmented buffers (`bytes_view`, `temporary_fragmented_buffer::view`, `ser::buffer_view` and likely `managed_bytes_view` in the future). - Implement `FragmentedView` for relevant fragmented buffer types. - Add helper functions for reading from `FragmentedView`. - Switch `deserialize()` and all its helpers from `bytes_view` to `FragmentedView`. - Remove `with_linearized()` calls which just became unnecessary. - Add an optimization for single-fragment cases. The addition of `FragmentedView` might be controversial, because another concept meant for the same purpose - `FragmentRange` - is already used. Unfortunately, it lacks the functionality we need. The main (only?) thing we want to do with a fragmented buffer is to extract a prefix from it and `FragmentRange` gives us no way to do that, because it's immutable by design. We can work around that by wrapping it into a mutable view which will track the offset into the immutable `FragmentRange`, and that's exactly what `linearizing_input_stream` is. But it's wasteful. `linearizing_input_stream` is a heavy type, unsuitable for passing around as a view - it stores a pair of fragment iterators, a fragment view and a size (11 words) to conform to the iterator-based design of `FragmentRange`, when one fragment iterator (4 words) already contains all needed state, just hidden. I suggest we replace `FragmentRange` with `FragmentedView` (or something similar) altogether. Refs: #6138 Closes #7692 * github.com:scylladb/scylla: types: collection: add an optimization for single-fragment buffers in deserialize types: add an optimization for single-fragment buffers in deserialize cql3: tuples: don't linearize in in_value::from_serialized cql3: expr: expression: replace with_linearize with linearized cql3: constants: remove unneeded uses of with_linearized cql3: update_parameters: don't linearize in prefetch_data_builder::add_cell cql3: lists: remove unneeded use of with_linearized query-result-set: don't linearize in result_set_builder::deserialize types: remove unneeded collection deserialization overloads types: switch collection_type_impl::deserialize from bytes_view to FragmentedView cql3: sets: don't linearize in value::from_serialized cql3: lists: don't linearize in value::from_serialized cql3: maps: don't linearize in value::from_serialized types: remove unused deserialize_aux types: deserialize: don't linearize tuple elements types: deserialize: don't linearize collection elements types: switch deserialize from bytes_view to FragmentedView types: deserialize tuple types from FragmentedView types: deserialize set type from FragmentedView types: deserialize map type from FragmentedView types: deserialize list type from FragmentedView types: add FragmentedView versions of read_collection_size and read_collection_value types: deserialize varint type from FragmentedView types: deserialize floating point types from FragmentedView types: deserialize decimal type from FragmentedView types: deserialize duration type from FragmentedView types: deserialize IP address types from FragmentedView types: deserialize uuid types from FragmentedView types: deserialize timestamp type from FragmentedView types: deserialize simple date type from FragmentedView types: deserialize time type from FragmentedView types: deserialize boolean type from FragmentedView types: deserialize integer types from FragmentedView types: deserialize string types from FragmentedView types: remove unused read_simple_opt types: implement read_simple* versions for FragmentedView utils: fragmented_temporary_buffer: implement FragmentedView for view utils: fragment_range: add single_fragmented_view serializer: implement FragmentedView for buffer_view utils: fragment_range: add linearized and with_linearized for FragmentedView utils: fragment_range: add FragmentedView utils: fragmented_temporary_buffer: fix view::remove_prefix	2020-12-04 09:46:20 +01:00
Michał Chojnowski	fcb258cb01	utils: fragmented_temporary_buffer: implement FragmentedView for view fragmented_temporary_buffer::view is one of the types we want to directly deserialize from.	2020-11-27 15:26:13 +01:00
Michał Chojnowski	f6cc2b6a48	utils: fragment_range: add single_fragmented_view bytes_view is one of the types we want to deserialize from (at least for now), so we want to be able to pass it to deserialize() after it's transitioned to FragmentView. single_fragmented_view is a wrapper implementing FragmentedView for bytes_view. It's constructed from bytes_view explicitly, because it's typically used in context where we want to phase linearization (and by extension, bytes_view) out.	2020-11-27 15:26:13 +01:00
Michał Chojnowski	2008c0f62f	utils: fragment_range: add linearized and with_linearized for FragmentedView We would like those helpers to disappear one day but for now we still need them until everything can handle fragmented buffers.	2020-11-27 15:26:13 +01:00
Michał Chojnowski	fc90bd5190	utils: fragment_range: add FragmentedView This patch introduces FragmentedView - a concept intented as a general-purpose interface for fragmented buffers. Another concept made for this purpose, FragmentedRange, already exists in the codebase. However, it's unwieldy. The iterator-based design of FragmentRange is harder to implement and requires more code, but more importantly it makes FragmentRange immutable. Usually we want to read the beginning of the buffer and pass the rest of it elsewhere. This is impossible with FragmentRange. FragmentedView can do everything FragmentRange can do and more, except for playing nicely with iterator-based collection methods, but those are useless for fragmented buffers anyway.	2020-11-27 15:26:13 +01:00
Benny Halevy	157a964a63	locator: extract can_yield to utils/maybe_yield.hh Move the definition of bool_class can_yield to a standalone header file and define there a maybe_yield(can_yield) helper. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-24 12:23:56 +02:00
Michał Chojnowski	9bceaac44c	utils: fragmented_temporary_buffer: fix view::remove_prefix This piece of logic was wrong for two unrelated reasons: 1. When fragmented_temporary_buffer::view is constructed from bytes_view, _current is null. When remove_prefix was used on such view, null pointer dereference happened. 2. It only worked for the first remove_prefix call. A second call would put a wrong value in _current_position.	2020-11-24 03:05:13 +01:00
Avi Kivity	d612ca78f3	Merge 'Allow changing hinted handoff configuration in runtime' from Piotr Dulikowski This PR allows changing the hinted_handoff_enabled option in runtime, either by modifying and reloading YAML configuration, or through HTTP API. This PR also introduces an important change in semantics of hinted_handoff_enabled: - Previously, hinted_handoff_enabled controlled whether _both writing and sending_ hints is allowed at all, or to particular DCs, - Now, hinted_handoff_enabled only controls whether _writing hints_ is enabled. Sending hints from disk is now always enabled. Fixes: #5634 Tests: - unit(dev) for each commit of the PR - unit(debug) for the last commit of the PR Closes #6916 * github.com:scylladb/scylla: api: allow changing hinted handoff configuration storage_proxy: fix wrong return type in swagger hints_manager: implement change_host_filter storage_proxy: always create hints manager config: plug in hints::host_filter object into configuration db/hints: introduce host_filter hints/resource_manager: allow registering managers after start hints: introduce db::hints::directory_initializer directories.cc: prepare for use outside main.cc	2020-11-18 13:41:02 +02:00
Avi Kivity	13c6c90d8c	Merge 'Remove std::iterator usage' from Piotr Jastrzębski std::iterator is deprecated since C++17 so define all the required iterator_traits directly and stop using std::iterator at all. More context: https://www.fluentcpp.com/2018/05/08/std-iterator-deprecated Tests: unit(dev) Closes #7635 * github.com:scylladb/scylla: log_heap: Remove std::iterator from hist_iterator types: Remove std::iterator from tuple_deserializing_iterator types: Remove std::iterator from listlike_partial_deserializing_iterator sstables: remove std::iterator from const_iterator token_metadata: Remove std::iterator from tokens_iterator size_estimates_virtual_reader: Remove std::iterator token_metadata: Remove std::iterator from tokens_iterator_impl counters: Remove std::iterator from iterators compound_compat: Remove std::iterator from iterators compound: Remove std::iterator from iterator clustering_interval_set: Remove std::iterator from position_range_iterator cdc: Remove std::iterator from collection_iterator cartesian_product: Remove std::iterator from iterator bytes_ostream: Remove std::iterator from fragment_iterator	2020-11-17 19:22:17 +02:00
Piotr Jastrzebski	2fe9d879df	log_heap: Remove std::iterator from hist_iterator std::iterator is deprecated since C++17 so define all the required iterator_traits directly. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-11-17 16:53:20 +01:00
Piotr Jastrzebski	f2b98b0aad	Replace disable_failure_guard with scoped_critical_alloc_section scoped_critical_alloc_section was recently introduced to replace disable_failure_guard and made the old class deprecated. This patch replaces all occurences of disable_failure_guard with scoped_critical_alloc_section. Without this patch the build prints many warnings like: warning: 'disable_failure_guard' is deprecated: Use scoped_critical_section instead [-Wdeprecated-declarations] Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <ca2a91aaf48b0f6ed762a6aa687e6ac5e936355d.1605621284.git.piotr@scylladb.com>	2020-11-17 16:01:25 +02:00
Botond Dénes	7b56ed6057	utils: logalloc: add lsa_global_occupancy_stats() Allows querying the occupancy stats of all the lsa memory.	2020-11-17 15:13:21 +02:00
Botond Dénes	f69942424d	utils: phased_barrier: add operations_in_progress() Allows querying the number of operations in-flight in the current phase.	2020-11-17 15:13:21 +02:00
Piotr Dulikowski	81a568c57a	directories.cc: prepare for use outside main.cc Currently, the `directories` class is used exclusively during initialization, in the main() function. This commit refactors this class so that it is possible to use it to initialize directories much later after startup. The intent of this change is to make it possible for hints manager to create directories for hints lazily. Currently, when Scylla is booted with hinted handoff disabled, the `hints_directory` config parameter is ignored and directories for hints are neither created nor verified. Because we would like to preserve this behavior and introduce possibility to switch hinted handoff on in runtime, the hints directories will have to be created lazily the first time hinted handoff is enabled.	2020-11-17 10:15:47 +01:00
Botond Dénes	7f07b95dd3	utils/chunked_vector: reserve_partial(): better explain how to properly use Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20201110130953.435123-1-bdenes@scylladb.com>	2020-11-10 15:45:01 +02:00
Eliran Sinvani	4c434f3fa4	moving avarage rate: Keep computed rates in zero until they are meaningful When computing moving average rates too early after startup, the rate can be infinite, this is simply because the sample interval since the system started is too small to generate meaningful results. Here we check for this situation and keep the rate at 0 if it happens to signal that there are still no meaningful results. This incident is unlikely to happen since it can happen only during a very small time window after restart, so we add a hint to the compiler to optimize for that in order to have a minimum impact on the normal usecase. Fixes #4469	2020-11-04 11:13:59 +02:00
Avi Kivity	25e6a9e493	Merge "utils/large_bitset: reserve memory for _storage gently" from Botond " Introduce a gentle (yielding) implementation of reserve for chunked vector and use it when reserving the backing storage vector for large bitset. Large bitset is used by bloom filters, which can be quite large and have been observed to cause stalls when allocating memory for the storage. Fixes: #6974 Tests: unit(dev) " * 'gentle-reserve/v1' of https://github.com/denesb/scylla: utils/large_bitset: use reserve_partial() to reserve _storage utils/chunked_vector: add reserve_partial()	2020-11-03 13:42:54 +02:00
Botond Dénes	a08b640fa7	utils/large_bitset: use reserve_partial() to reserve _storage To avoid stalls when reserving memory for a large bloom filter. The filter creation already has a yielding loop for initialization, this patch extends it to reservation of memory too.	2020-11-02 18:03:19 +02:00
Botond Dénes	bb908b1750	utils/chunked_vector: add reserve_partial() A variant of reserve() which allows gentle reserving of memory. This variant will allocate just one chunk at a time. To drive it to completion, one should call it repeatedly with the return value of the previous call, until it returns 0. This variant will be used in the next patch by the large bitset creation code, to avoid stalls when allocating large bloom filters (which are backed by large bitset).	2020-11-02 18:02:01 +02:00
Benny Halevy	87c3fd9cd8	fb_utilities.hh: mark methods noexcept Now that gms::inet_address assignment is marked as noexcept. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-01 16:46:18 +02:00
Pavel Emelyanov	b2ce3b197e	allocation_strategy: Fix standard_migrator initialization This is the continuation of `30722b8c8e`, so let me re-cite Rafael: The constructors of these global variables can allocate memory. Since the variables are thread_local, they are initialized at first use. There is nothing we can do if these allocations fail, so use disable_failure_guard. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20201028140553.21709-1-xemul@scylladb.com>	2020-10-28 16:22:23 +02:00
Nadav Har'El	6740907f3d	Merge 'utf8: don't linearize cells for validation' from Avi Kivity Currently, we linearize large UTF8 cells in order to validate them. This can cause large latency spikes if the cell is large. This series changes UTF8 validation to work on fragmented buffers. This is somewhat tricky since the validation routines are optimized for single-instruction-multiple-data (SIMD) architectures. The unit tests are expanded to cover the new functionality. Fixes #7448. Closes #7449 * github.com:scylladb/scylla: types: don't linearize utf8 for validation test: utf8: add fragmented buffer validation tests utils: utf8: add function to validate fragmented buffers utils: utf8: expose validate_partial() in a header utils: utf8: introduce validate_partial() utils: utf8: extract a function to evaluate a single codepoint	2020-10-21 20:51:15 +03:00
Avi Kivity	91490827c1	utils: utf8: add function to validate fragmented buffers Add a function to validate fragmented buffers. We validate each buffer with SIMD-optimized validate_partial(), then collect the codepoint that spans buffer boundaries (if any) in a temporary buffer, validate that too, and continue.	2020-10-21 11:14:44 +03:00
Avi Kivity	3d1be9286f	utils: utf8: expose validate_partial() in a header Since fragmented buffers are templates, we'll need access to validate_partial() in a header. Move it there.	2020-10-21 11:14:44 +03:00
Avi Kivity	22a0c457e2	utils: utf8: introduce validate_partial() The current validators expect the buffer to contain a full UTF-8 string. This won't be the case for fragmented buffers, since a codepoint can straddle two (or more) buffers. To prepare for that, convert the existing validators to validate_partial(), which returns either an error, or success with an indication of the size of the tail that was not validated and now many bytes it is missing. This is natural since the SIMD validators already cannot process a tail in SIMD mode if it's smaller than the vector size, so only minor rearrangements are needed. In addition, we now have validate_partial() for non-SIMD architectures, since we'll need it for fragmented buffer validation.	2020-10-21 11:14:44 +03:00
Avi Kivity	900699f1b5	utils: utf8: extract a function to evaluate a single codepoint Our SIMD optimized validators cannot process a codepoint that spans multiple buffers, and adapting them to be able to will slow them down. So our strategy is to special-case any codepoint that spans two buffers. To do that, extract an evaluate_codepoint() function from the current validate_naive() function. It returns three values: - if a codepoint was successfully decoded from the buffer, how many bytes were consumed - if not enough bytes were in the buffer, how many more are needed - otherwise, an error happened, so return an indication The new function uses a table to calculate a codepoint's size from its first byte, similar to the SIMD variants. validate_naive() is now implemented in terms of evaluate_codepoint().	2020-10-21 11:14:43 +03:00
Avi Kivity	f9129fc1f9	utils: to_range(): relax constraint The input range to utils::to_range() should be indeed a range, but clang has trouble compiling <ranges> which causes it to fail. Relax the constraint until this is fixed.	2020-10-18 18:16:30 +03:00
Calle Wilund	83339f4bac	Alternator::streams: Make SequenceNumber monotinically growing Fixes #7424 AWS sdk (kinesis) assumes SequenceNumbers are monotonically growing bigints. Since we sort on and use timeuuids are these a "raw" bit representation of this will _not_ fulfill the requirement. However, we can "unwrap" the timestamp of uuid msb and give the value as timestamp<<64\|lsb, which will ensure sort order == bigint order.	2020-10-14 16:45:21 +03:00
Benny Halevy	f3fc81751f	serialized_action: trigger: propagate action error Currently, the serialized_action error is set to a shared_promise, but is not returned to the caller, unless there is an already outstanding action. Note that setting the exception to the promise when noone collected it via the shared_future caused 'Exceptional future ignored' warning to be issued, as seen in #7352. Fixes #7352 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-10-14 16:45:21 +03:00
Benny Halevy	81d2f60df9	serialized_action: trigger: include also semaphore status to promise Currently, if `with_semaphore` returns exceptional future, it is not propagated to the promise, and other waiters that got a shared future will not see that. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-10-14 16:45:21 +03:00
Botond Dénes	0994e8b5e2	utils: add to_hr_size() This utility function converts a potentially large number to a compact representation, composed of at most 4 digits and a letter appropriate to the power of two the number has to multiplied with to arrive to the original number (with some loss of precision). The different powers of two are the conventional 2 ** (N * 10) variants: * N=0: (B)ytes * N=1: (K)bytes * N=2: (M)bytes * N=3: (G)bytes * N=4: (T)bytes Examples: * 87665 will be converted to 87K * 1024 will be converted to 1K	2020-10-13 12:32:14 +03:00
Avi Kivity	dfffa4dc71	utils: big_decimal: work around clang difficulty with boost::cpp_int(string_view) constructor Clang has some difficulty with the boost::cpp_int constructor from string_view. In fact it is a mess of enable_if<>s so a human would have trouble too. Work around it by converting to std::string. This is bad for performance, but this constructor is not going to be fast in any case. Hopefully a fix will arrive in clang or boost. Closes #7389	2020-10-11 22:09:19 +03:00
Avi Kivity	af8fd8c8d8	utils: build_id: fix ubsan false positive on pointer arithmetic get_nt_build_id() constructs a pointer by adding a base and an offset, but if the base happens to be zero, that is undefined under C++ rules (altough legal ELF). Fix by performing the addition on integers, and only then casting to a pointer. Closes #7379	2020-10-11 17:23:40 +03:00
Avi Kivity	7d025b5cf4	utils: log_heap: relax check for clang's sanitizer `b1e78313fe` added a check for ubsan to squelch a false positive, but that check doesn't work with clang. Relax it to check for debug mode, so clang doesn't hit the same false positive as gcc did. Define a SANITIZE macro so we have a reliable way to detect if we're running with a sanitizer. Closes #7372	2020-10-11 16:07:16 +03:00
Avi Kivity	6bc6db8037	utils/array-search: document restrictions Our AVX2 implementation cannot load a partial vector, or mask unused elements (that can be done with AVX-512/SVE2), so it has some restrictions. Document them. Closes #7385	2020-10-11 15:19:54 +03:00
Avi Kivity	3e2707c2bf	utils: fragmented_temporary_buffer: don't add to potentially null pointers Offsetting a null pointer is undefined, and clang's ubsan complains. Rearrange the arithmetic so we never offset a null pointer. A function is introduced for the remaining contiguous bytes so it can cast the result to size_t, avoiding a compare-of-different-signedness warning from gcc. Closes #7373	2020-10-11 15:05:15 +03:00
Benny Halevy	064aae8ffa	flush_queue: call_helper: support no variadic futures Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201011105422.818623-1-bhalevy@scylladb.com>	2020-10-11 14:40:32 +03:00
Benny Halevy	a0b5529441	flush_queue: use futurator::invoke Attend to the following warning with Seastar_API_LEVEL 5+: ``` ./utils/flush_queue.hh:68:36: warning: ‘static seastar::futurize<T>::type seastar::futurize<T>::apply(Func&&, FuncArgs&& ...) [with Func = test_queue_ordering_random_ops::run_test_case()::<lambda(int)>::<lambda(int)>; FuncArgs = {int}; T = void; seastar::futurize<T>::type = seastar::future<>]’ is deprecated: Use invoke for varargs [-Wdeprecated-declarations] 68 \| return futurator::apply(std::forward<Func>(func), f.get()); ``` Test: flush_queue(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201007112130.474269-1-bhalevy@scylladb.com>	2020-10-11 12:14:17 +03:00
Avi Kivity	c41905e986	utils: array-search: deinline, working around clang bug Clang has a bug processing inline ifuncs with intrinsics[1]. Since ifuncs can't be inlined anyway (they are always dispatched via a function pointer that is determined based on the CPU features present), nothing is gained by inlining them. Deinlining therefore reduces compile time and works around the clang bug. [1] https://bugs.llvm.org/show_bug.cgi?id=47691 Closes #7358	2020-10-11 10:29:24 +03:00
Avi Kivity	31a5378a82	utils: utf8: avoid harmless integer overflow 240 doesn't fit in char without overflow, so cast it explicitly to avoid a clang warning.	2020-09-22 17:24:33 +03:00
Avi Kivity	e12c72ad55	utils: multiprecision_int: disambiguate operator templates by adding overloads We have templates for multiprecision_int for both sides of the operator, for example: template <typename T> bool operator==(const T& x) const and template <typename T> friend bool operator==(const T& x, const multiprecision_int& y) Clang considers them equally satisfying when both operands are multiprecision_int, so provide a disambiguating overload.	2020-09-22 17:24:33 +03:00

1 2 3 4 5 ...

864 Commits