scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-25 09:11:10 +00:00

Author	SHA1	Message	Date
Piotr Sarna	2015988373	Merge 'types: get rid of linearization in deserialize()' from Michał Chojnowski Citing #6138: > In the past few years we have converted most of our codebase to work in terms of fragmented buffers, instead of linearised ones, to help avoid large allocations that put large pressure on the memory allocator. > One prominent component that still works exclusively in terms of linearised buffers is the types hierarchy, more specifically the de/serialization code to/from CQL format. Note that for most types, this is the same as our internal format, notable exceptions are non-frozen collections and user types. > > Most types are expected to contain reasonably small values, but texts, blobs and especially collections can get very large. Since the entire hierarchy shares a common interface we can either transition all or none to work with fragmented buffers. This series gets rid of intermediate linearizations in deserialization. The next steps are removing linearizations from serialization, validation and comparison code. Series summary: - Fix a bug in `fragmented_temporary_buffer::view::remove_prefix`. (Discovered while testing. Since it wasn't discovered earlier, I guess it doesn't occur in any code path in master.) - Add a `FragmentedView` concept to allow uniform handling of various types of fragmented buffers (`bytes_view`, `temporary_fragmented_buffer::view`, `ser::buffer_view` and likely `managed_bytes_view` in the future). - Implement `FragmentedView` for relevant fragmented buffer types. - Add helper functions for reading from `FragmentedView`. - Switch `deserialize()` and all its helpers from `bytes_view` to `FragmentedView`. - Remove `with_linearized()` calls which just became unnecessary. - Add an optimization for single-fragment cases. The addition of `FragmentedView` might be controversial, because another concept meant for the same purpose - `FragmentRange` - is already used. Unfortunately, it lacks the functionality we need. 
The main (only?) thing we want to do with a fragmented buffer is to extract a prefix from it and `FragmentRange` gives us no way to do that, because it's immutable by design. We can work around that by wrapping it into a mutable view which will track the offset into the immutable `FragmentRange`, and that's exactly what `linearizing_input_stream` is. But it's wasteful. `linearizing_input_stream` is a heavy type, unsuitable for passing around as a view - it stores a pair of fragment iterators, a fragment view and a size (11 words) to conform to the iterator-based design of `FragmentRange`, when one fragment iterator (4 words) already contains all needed state, just hidden. I suggest we replace `FragmentRange` with `FragmentedView` (or something similar) altogether. Refs: #6138 Closes #7692 * github.com:scylladb/scylla: types: collection: add an optimization for single-fragment buffers in deserialize types: add an optimization for single-fragment buffers in deserialize cql3: tuples: don't linearize in in_value::from_serialized cql3: expr: expression: replace with_linearize with linearized cql3: constants: remove unneeded uses of with_linearized cql3: update_parameters: don't linearize in prefetch_data_builder::add_cell cql3: lists: remove unneeded use of with_linearized query-result-set: don't linearize in result_set_builder::deserialize types: remove unneeded collection deserialization overloads types: switch collection_type_impl::deserialize from bytes_view to FragmentedView cql3: sets: don't linearize in value::from_serialized cql3: lists: don't linearize in value::from_serialized cql3: maps: don't linearize in value::from_serialized types: remove unused deserialize_aux types: deserialize: don't linearize tuple elements types: deserialize: don't linearize collection elements types: switch deserialize from bytes_view to FragmentedView types: deserialize tuple types from FragmentedView types: deserialize set type from FragmentedView types: deserialize map type from 
FragmentedView types: deserialize list type from FragmentedView types: add FragmentedView versions of read_collection_size and read_collection_value types: deserialize varint type from FragmentedView types: deserialize floating point types from FragmentedView types: deserialize decimal type from FragmentedView types: deserialize duration type from FragmentedView types: deserialize IP address types from FragmentedView types: deserialize uuid types from FragmentedView types: deserialize timestamp type from FragmentedView types: deserialize simple date type from FragmentedView types: deserialize time type from FragmentedView types: deserialize boolean type from FragmentedView types: deserialize integer types from FragmentedView types: deserialize string types from FragmentedView types: remove unused read_simple_opt types: implement read_simple* versions for FragmentedView utils: fragmented_temporary_buffer: implement FragmentedView for view utils: fragment_range: add single_fragmented_view serializer: implement FragmentedView for buffer_view utils: fragment_range: add linearized and with_linearized for FragmentedView utils: fragment_range: add FragmentedView utils: fragmented_temporary_buffer: fix view::remove_prefix	2020-12-04 09:46:20 +01:00
Michał Chojnowski	fcb258cb01	utils: fragmented_temporary_buffer: implement FragmentedView for view fragmented_temporary_buffer::view is one of the types we want to directly deserialize from.	2020-11-27 15:26:13 +01:00
Michał Chojnowski	f6cc2b6a48	utils: fragment_range: add single_fragmented_view bytes_view is one of the types we want to deserialize from (at least for now), so we want to be able to pass it to deserialize() after it's transitioned to FragmentedView. single_fragmented_view is a wrapper implementing FragmentedView for bytes_view. It's constructed from bytes_view explicitly, because it's typically used in contexts where we want to phase linearization (and by extension, bytes_view) out.	2020-11-27 15:26:13 +01:00
Michał Chojnowski	2008c0f62f	utils: fragment_range: add linearized and with_linearized for FragmentedView We would like those helpers to disappear one day but for now we still need them until everything can handle fragmented buffers.	2020-11-27 15:26:13 +01:00
Michał Chojnowski	fc90bd5190	utils: fragment_range: add FragmentedView This patch introduces FragmentedView - a concept intended as a general-purpose interface for fragmented buffers. Another concept made for this purpose, FragmentRange, already exists in the codebase. However, it's unwieldy. The iterator-based design of FragmentRange is harder to implement and requires more code, but more importantly it makes FragmentRange immutable. Usually we want to read the beginning of the buffer and pass the rest of it elsewhere. This is impossible with FragmentRange. FragmentedView can do everything FragmentRange can do and more, except for playing nicely with iterator-based collection methods, but those are useless for fragmented buffers anyway.	2020-11-27 15:26:13 +01:00
Benny Halevy	157a964a63	locator: extract can_yield to utils/maybe_yield.hh Move the definition of bool_class can_yield to a standalone header file and define there a maybe_yield(can_yield) helper. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-24 12:23:56 +02:00
Michał Chojnowski	9bceaac44c	utils: fragmented_temporary_buffer: fix view::remove_prefix This piece of logic was wrong for two unrelated reasons: 1. When fragmented_temporary_buffer::view is constructed from bytes_view, _current is null. When remove_prefix was used on such view, null pointer dereference happened. 2. It only worked for the first remove_prefix call. A second call would put a wrong value in _current_position.	2020-11-24 03:05:13 +01:00
Avi Kivity	d612ca78f3	Merge 'Allow changing hinted handoff configuration in runtime' from Piotr Dulikowski This PR allows changing the hinted_handoff_enabled option in runtime, either by modifying and reloading YAML configuration, or through HTTP API. This PR also introduces an important change in semantics of hinted_handoff_enabled: - Previously, hinted_handoff_enabled controlled whether _both writing and sending_ hints is allowed at all, or to particular DCs, - Now, hinted_handoff_enabled only controls whether _writing hints_ is enabled. Sending hints from disk is now always enabled. Fixes: #5634 Tests: - unit(dev) for each commit of the PR - unit(debug) for the last commit of the PR Closes #6916 * github.com:scylladb/scylla: api: allow changing hinted handoff configuration storage_proxy: fix wrong return type in swagger hints_manager: implement change_host_filter storage_proxy: always create hints manager config: plug in hints::host_filter object into configuration db/hints: introduce host_filter hints/resource_manager: allow registering managers after start hints: introduce db::hints::directory_initializer directories.cc: prepare for use outside main.cc	2020-11-18 13:41:02 +02:00
Avi Kivity	13c6c90d8c	Merge 'Remove std::iterator usage' from Piotr Jastrzębski std::iterator is deprecated since C++17 so define all the required iterator_traits directly and stop using std::iterator at all. More context: https://www.fluentcpp.com/2018/05/08/std-iterator-deprecated Tests: unit(dev) Closes #7635 * github.com:scylladb/scylla: log_heap: Remove std::iterator from hist_iterator types: Remove std::iterator from tuple_deserializing_iterator types: Remove std::iterator from listlike_partial_deserializing_iterator sstables: remove std::iterator from const_iterator token_metadata: Remove std::iterator from tokens_iterator size_estimates_virtual_reader: Remove std::iterator token_metadata: Remove std::iterator from tokens_iterator_impl counters: Remove std::iterator from iterators compound_compat: Remove std::iterator from iterators compound: Remove std::iterator from iterator clustering_interval_set: Remove std::iterator from position_range_iterator cdc: Remove std::iterator from collection_iterator cartesian_product: Remove std::iterator from iterator bytes_ostream: Remove std::iterator from fragment_iterator	2020-11-17 19:22:17 +02:00
Piotr Jastrzebski	2fe9d879df	log_heap: Remove std::iterator from hist_iterator std::iterator is deprecated since C++17 so define all the required iterator_traits directly. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-11-17 16:53:20 +01:00
Piotr Jastrzebski	f2b98b0aad	Replace disable_failure_guard with scoped_critical_alloc_section scoped_critical_alloc_section was recently introduced to replace disable_failure_guard and made the old class deprecated. This patch replaces all occurrences of disable_failure_guard with scoped_critical_alloc_section. Without this patch the build prints many warnings like: warning: 'disable_failure_guard' is deprecated: Use scoped_critical_section instead [-Wdeprecated-declarations] Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <ca2a91aaf48b0f6ed762a6aa687e6ac5e936355d.1605621284.git.piotr@scylladb.com>	2020-11-17 16:01:25 +02:00
Botond Dénes	7b56ed6057	utils: logalloc: add lsa_global_occupancy_stats() Allows querying the occupancy stats of all the lsa memory.	2020-11-17 15:13:21 +02:00
Botond Dénes	f69942424d	utils: phased_barrier: add operations_in_progress() Allows querying the number of operations in-flight in the current phase.	2020-11-17 15:13:21 +02:00
Piotr Dulikowski	81a568c57a	directories.cc: prepare for use outside main.cc Currently, the `directories` class is used exclusively during initialization, in the main() function. This commit refactors this class so that it is possible to use it to initialize directories much later after startup. The intent of this change is to make it possible for hints manager to create directories for hints lazily. Currently, when Scylla is booted with hinted handoff disabled, the `hints_directory` config parameter is ignored and directories for hints are neither created nor verified. Because we would like to preserve this behavior and introduce possibility to switch hinted handoff on in runtime, the hints directories will have to be created lazily the first time hinted handoff is enabled.	2020-11-17 10:15:47 +01:00
Botond Dénes	7f07b95dd3	utils/chunked_vector: reserve_partial(): better explain how to properly use Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20201110130953.435123-1-bdenes@scylladb.com>	2020-11-10 15:45:01 +02:00
Eliran Sinvani	4c434f3fa4	moving average rate: Keep computed rates in zero until they are meaningful When computing moving average rates too early after startup, the rate can be infinite, simply because the sample interval since the system started is too small to generate meaningful results. Here we check for this situation and keep the rate at 0 if it happens, to signal that there are still no meaningful results. This incident is unlikely since it can happen only during a very small time window after restart, so we add a hint to the compiler to optimize for that in order to have a minimum impact on the normal usecase. Fixes #4469	2020-11-04 11:13:59 +02:00
Avi Kivity	25e6a9e493	Merge "utils/large_bitset: reserve memory for _storage gently" from Botond " Introduce a gentle (yielding) implementation of reserve for chunked vector and use it when reserving the backing storage vector for large bitset. Large bitset is used by bloom filters, which can be quite large and have been observed to cause stalls when allocating memory for the storage. Fixes: #6974 Tests: unit(dev) " * 'gentle-reserve/v1' of https://github.com/denesb/scylla: utils/large_bitset: use reserve_partial() to reserve _storage utils/chunked_vector: add reserve_partial()	2020-11-03 13:42:54 +02:00
Botond Dénes	a08b640fa7	utils/large_bitset: use reserve_partial() to reserve _storage To avoid stalls when reserving memory for a large bloom filter. The filter creation already has a yielding loop for initialization, this patch extends it to reservation of memory too.	2020-11-02 18:03:19 +02:00
Botond Dénes	bb908b1750	utils/chunked_vector: add reserve_partial() A variant of reserve() which allows gentle reserving of memory. This variant will allocate just one chunk at a time. To drive it to completion, one should call it repeatedly with the return value of the previous call, until it returns 0. This variant will be used in the next patch by the large bitset creation code, to avoid stalls when allocating large bloom filters (which are backed by large bitset).	2020-11-02 18:02:01 +02:00
Benny Halevy	87c3fd9cd8	fb_utilities.hh: mark methods noexcept Now that gms::inet_address assignment is marked as noexcept. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-11-01 16:46:18 +02:00
Pavel Emelyanov	b2ce3b197e	allocation_strategy: Fix standard_migrator initialization This is the continuation of `30722b8c8e`, so let me re-cite Rafael: The constructors of these global variables can allocate memory. Since the variables are thread_local, they are initialized at first use. There is nothing we can do if these allocations fail, so use disable_failure_guard. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20201028140553.21709-1-xemul@scylladb.com>	2020-10-28 16:22:23 +02:00
Nadav Har'El	6740907f3d	Merge 'utf8: don't linearize cells for validation' from Avi Kivity Currently, we linearize large UTF8 cells in order to validate them. This can cause large latency spikes if the cell is large. This series changes UTF8 validation to work on fragmented buffers. This is somewhat tricky since the validation routines are optimized for single-instruction-multiple-data (SIMD) architectures. The unit tests are expanded to cover the new functionality. Fixes #7448. Closes #7449 * github.com:scylladb/scylla: types: don't linearize utf8 for validation test: utf8: add fragmented buffer validation tests utils: utf8: add function to validate fragmented buffers utils: utf8: expose validate_partial() in a header utils: utf8: introduce validate_partial() utils: utf8: extract a function to evaluate a single codepoint	2020-10-21 20:51:15 +03:00
Avi Kivity	91490827c1	utils: utf8: add function to validate fragmented buffers Add a function to validate fragmented buffers. We validate each buffer with SIMD-optimized validate_partial(), then collect the codepoint that spans buffer boundaries (if any) in a temporary buffer, validate that too, and continue.	2020-10-21 11:14:44 +03:00
Avi Kivity	3d1be9286f	utils: utf8: expose validate_partial() in a header Since fragmented buffers are templates, we'll need access to validate_partial() in a header. Move it there.	2020-10-21 11:14:44 +03:00
Avi Kivity	22a0c457e2	utils: utf8: introduce validate_partial() The current validators expect the buffer to contain a full UTF-8 string. This won't be the case for fragmented buffers, since a codepoint can straddle two (or more) buffers. To prepare for that, convert the existing validators to validate_partial(), which returns either an error, or success with an indication of the size of the tail that was not validated and how many bytes it is missing. This is natural since the SIMD validators already cannot process a tail in SIMD mode if it's smaller than the vector size, so only minor rearrangements are needed. In addition, we now have validate_partial() for non-SIMD architectures, since we'll need it for fragmented buffer validation.	2020-10-21 11:14:44 +03:00
Avi Kivity	900699f1b5	utils: utf8: extract a function to evaluate a single codepoint Our SIMD optimized validators cannot process a codepoint that spans multiple buffers, and adapting them to be able to will slow them down. So our strategy is to special-case any codepoint that spans two buffers. To do that, extract an evaluate_codepoint() function from the current validate_naive() function. It returns three values: - if a codepoint was successfully decoded from the buffer, how many bytes were consumed - if not enough bytes were in the buffer, how many more are needed - otherwise, an error happened, so return an indication The new function uses a table to calculate a codepoint's size from its first byte, similar to the SIMD variants. validate_naive() is now implemented in terms of evaluate_codepoint().	2020-10-21 11:14:43 +03:00
Avi Kivity	f9129fc1f9	utils: to_range(): relax constraint The input range to utils::to_range() should be indeed a range, but clang has trouble compiling <ranges> which causes it to fail. Relax the constraint until this is fixed.	2020-10-18 18:16:30 +03:00
Calle Wilund	83339f4bac	Alternator::streams: Make SequenceNumber monotonically growing Fixes #7424 AWS sdk (kinesis) assumes SequenceNumbers are monotonically growing bigints. Since we sort on and use timeuuids as these, a "raw" bit representation of them will _not_ fulfill the requirement. However, we can "unwrap" the timestamp of uuid msb and give the value as timestamp<<64|lsb, which will ensure sort order == bigint order.	2020-10-14 16:45:21 +03:00
Benny Halevy	f3fc81751f	serialized_action: trigger: propagate action error Currently, the serialized_action error is set to a shared_promise, but is not returned to the caller, unless there is an already outstanding action. Note that setting the exception to the promise when no one collected it via the shared_future caused 'Exceptional future ignored' warning to be issued, as seen in #7352. Fixes #7352 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-10-14 16:45:21 +03:00
Benny Halevy	81d2f60df9	serialized_action: trigger: include also semaphore status to promise Currently, if `with_semaphore` returns exceptional future, it is not propagated to the promise, and other waiters that got a shared future will not see that. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-10-14 16:45:21 +03:00
Botond Dénes	0994e8b5e2	utils: add to_hr_size() This utility function converts a potentially large number to a compact representation, composed of at most 4 digits and a letter appropriate to the power of two the number has to be multiplied by to arrive at the original number (with some loss of precision). The different powers of two are the conventional 2 ** (N * 10) variants: * N=0: (B)ytes * N=1: (K)bytes * N=2: (M)bytes * N=3: (G)bytes * N=4: (T)bytes Examples: * 87665 will be converted to 87K * 1024 will be converted to 1K	2020-10-13 12:32:14 +03:00
Avi Kivity	dfffa4dc71	utils: big_decimal: work around clang difficulty with boost::cpp_int(string_view) constructor Clang has some difficulty with the boost::cpp_int constructor from string_view. In fact it is a mess of enable_if<>s so a human would have trouble too. Work around it by converting to std::string. This is bad for performance, but this constructor is not going to be fast in any case. Hopefully a fix will arrive in clang or boost. Closes #7389	2020-10-11 22:09:19 +03:00
Avi Kivity	af8fd8c8d8	utils: build_id: fix ubsan false positive on pointer arithmetic get_nt_build_id() constructs a pointer by adding a base and an offset, but if the base happens to be zero, that is undefined under C++ rules (although legal ELF). Fix by performing the addition on integers, and only then casting to a pointer. Closes #7379	2020-10-11 17:23:40 +03:00
Avi Kivity	7d025b5cf4	utils: log_heap: relax check for clang's sanitizer `b1e78313fe` added a check for ubsan to squelch a false positive, but that check doesn't work with clang. Relax it to check for debug mode, so clang doesn't hit the same false positive as gcc did. Define a SANITIZE macro so we have a reliable way to detect if we're running with a sanitizer. Closes #7372	2020-10-11 16:07:16 +03:00
Avi Kivity	6bc6db8037	utils/array-search: document restrictions Our AVX2 implementation cannot load a partial vector, or mask unused elements (that can be done with AVX-512/SVE2), so it has some restrictions. Document them. Closes #7385	2020-10-11 15:19:54 +03:00
Avi Kivity	3e2707c2bf	utils: fragmented_temporary_buffer: don't add to potentially null pointers Offsetting a null pointer is undefined, and clang's ubsan complains. Rearrange the arithmetic so we never offset a null pointer. A function is introduced for the remaining contiguous bytes so it can cast the result to size_t, avoiding a compare-of-different-signedness warning from gcc. Closes #7373	2020-10-11 15:05:15 +03:00
Benny Halevy	064aae8ffa	flush_queue: call_helper: support no variadic futures Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201011105422.818623-1-bhalevy@scylladb.com>	2020-10-11 14:40:32 +03:00
Benny Halevy	a0b5529441	flush_queue: use futurator::invoke Attend to the following warning with Seastar_API_LEVEL 5+: ``` ./utils/flush_queue.hh:68:36: warning: ‘static seastar::futurize<T>::type seastar::futurize<T>::apply(Func&&, FuncArgs&& ...) [with Func = test_queue_ordering_random_ops::run_test_case()::<lambda(int)>::<lambda(int)>; FuncArgs = {int}; T = void; seastar::futurize<T>::type = seastar::future<>]’ is deprecated: Use invoke for varargs [-Wdeprecated-declarations] 68 | return futurator::apply(std::forward<Func>(func), f.get()); ``` Test: flush_queue(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201007112130.474269-1-bhalevy@scylladb.com>	2020-10-11 12:14:17 +03:00
Avi Kivity	c41905e986	utils: array-search: deinline, working around clang bug Clang has a bug processing inline ifuncs with intrinsics[1]. Since ifuncs can't be inlined anyway (they are always dispatched via a function pointer that is determined based on the CPU features present), nothing is gained by inlining them. Deinlining therefore reduces compile time and works around the clang bug. [1] https://bugs.llvm.org/show_bug.cgi?id=47691 Closes #7358	2020-10-11 10:29:24 +03:00
Avi Kivity	31a5378a82	utils: utf8: avoid harmless integer overflow 240 doesn't fit in char without overflow, so cast it explicitly to avoid a clang warning.	2020-09-22 17:24:33 +03:00
Avi Kivity	e12c72ad55	utils: multiprecision_int: disambiguate operator templates by adding overloads We have templates for multiprecision_int for both sides of the operator, for example: template <typename T> bool operator==(const T& x) const and template <typename T> friend bool operator==(const T& x, const multiprecision_int& y) Clang considers them equally satisfying when both operands are multiprecision_int, so provide a disambiguating overload.	2020-09-22 17:24:33 +03:00
Avi Kivity	d1c049b202	utils: error_injection: remove forward-declared function returning auto Clang dislikes forward-declared functions returning auto, so declare the type up front. Functions returning auto are a readability problem anyway. To solve a circular dependency problem (get_local_injector() -> error_injection<> -> get_local_injector()), which is further compounded by problems in using template specializations before they are defined (which is forbidden), the storage for get_local_injector() was moved to error_injection<>, and get_local_injector() is just an accessor. After this, error_injection<> does not depend on get_local_injector().	2020-09-22 17:24:33 +03:00
Avi Kivity	765e632626	utils: bptree: remove redundant and possibly wrong friend declaration Clang complains about befriending a constructor. It's possibly correct. In any case it's redundant, so remove it.	2020-09-22 17:24:33 +03:00
Avi Kivity	c7105019b2	utils: bptree: add missing typename for clang Clang does not implement p0634r3, so we must add more typenames.	2020-09-22 17:24:33 +03:00
Avi Kivity	0d25ea5a67	utils: bloom_calculations: avoid gratuitous conversion to double The conversion to double evokes a complaint about precision loss from clang, and is unneeded anyway, so use integral types throughout.	2020-09-22 17:24:33 +03:00
Avi Kivity	4c93ec8351	utils: updateable_value: fix nullptr_t name nullptr_t's full name is std::nullptr_t. gcc somehow allows plain nullptr_t, but that's not correct. Clang rejects it. Use std::nullptr_t.	2020-09-22 17:24:33 +03:00
Avi Kivity	dcaf4ea4dd	Merge "Fix race in schema version recalculation leading to stale schema version in gossip" from Tomasz " Migration manager installs several cluster feature change listeners. The listeners will call update_schema_version_and_announce() when cluster features are enabled, which does this: return update_schema_version(proxy, features).then([] (utils::UUID uuid) { return announce_schema_version(uuid); }); It first updates the schema version and then publishes it via gossip in announce_schema_version(). It is possible that the announce_schema_version() part of the first schema change will be deferred and will execute after the other four calls to update_schema_version_and_announce(). It will install the old schema version in gossip instead of the more recent one. The fix is to serialize schema digest calculation and publishing. Refs #7200 This problem also brought my attention to initialization code, which could be prone to the same problem. The storage service computes gossiper states before it starts the gossiper. Among them, node's schema version. There are two problems with that. First is that computing the schema version and publishing it is not atomic, so is not safe against concurrent schema changes or schema version recalculations. It will not exclude with recalculate_schema_version() calls, and we could end up with the old (and incorrect) schema version being advertised in gossip. Second problem is that we should not allow the database layer to call into the gossiper layer before it is fully initialized, as this may produce undefined behavior. Maybe we're not doing concurrent schema changes/recalculations now, but it is easy to imagine that this could change for whatever reason in the future. The solution for both problems is to break the cyclic dependency between the database layer and the storage_service layer by having the database layer not use the gossiper at all. 
The database layer publishes schema version inside the database class and allows installing listeners on changes. The storage_service layer asks the database layer for the current version when it initializes, and only after that installs a listener which will update the gossiper. Tests: - unit (dev) - manual (3 node ccm) " * tag 'fix-schema-digest-calculation-race-v1' of github.com:tgrabiec/scylla: db, schema: Hide update_schema_version_and_announce() db, storage_service: Do not call into gossiper from the database layer db: Make schema version observable utils: updateable_value_source: Introduce as_observable() schema: Fix race in schema version recalculation leading to stale schema version in gossip	2020-09-14 12:37:46 +03:00
Tomasz Grabiec	fed89ee23e	utils: updateable_value_source: Introduce as_observable()	2020-09-11 14:42:41 +02:00
Avi Kivity	7ac59dcc98	lsa: decay reserves The log-structured allocator (LSA) reserves memory when performing operations, since its operations are performed with reclaiming disabled and if it runs out, it cannot evict cache to gain more. The amount of memory to reserve is remembered across calls so that it does not have to repeat the fail/increase-reserve/retry cycle for every operation. However, we currently lack decaying the amount to reserve. This means that if a single operation increased the reserve in the distant past, all current operations also require this large reserve. Large reserves are expensive since they can cause large amounts of cache to be evicted. This patch adds reserve decay. The time-to-decay is inversely proportional to reserve size: 10GB/reserve. This means that a 20MB reserve is halved after 500 operations (10GB/20MB) while a 20kB reserve is halved after 500,000 operations (10GB/20kB). So large, expensive reserves are decayed quickly while small, inexpensive reserves are decayed slowly to reduce the risk of allocation failures and exceptions. A unit test is added. Fixes #325.	2020-09-08 15:59:25 +03:00
Piotr Grabowski	ffd8c8c505	utf8: Print invalid UTF-8 character position Add new validate_with_error_position function which returns -1 if data is a valid UTF-8 string or otherwise a byte position of first invalid character. The position is added to exception messages of all UTF-8 parsing errors in Scylla. validate_with_error_position is done in two passes in order to preserve the same performance in common case when the string is valid.	2020-09-07 18:11:21 +03:00
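
The reserve-decay rule described in commit 7ac59dcc98 (a reserve is halved after 10GB/reserve operations) can be checked with a few lines of arithmetic. This is a minimal sketch, not Scylla's LSA code: the helper name `ops_until_halved` and the constant `decay_budget_bytes` are hypothetical, and decimal units (1 GB = 10^9 bytes) are assumed because they reproduce the figures quoted in the commit message.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical constant: the "10GB" budget from the commit message,
// in decimal units (an assumption, chosen to match the quoted figures).
constexpr std::uint64_t decay_budget_bytes = 10'000'000'000ULL;

// Hypothetical helper: number of operations after which a reserve of
// `reserve_bytes` is halved under the 10GB/reserve rule.
constexpr std::uint64_t ops_until_halved(std::uint64_t reserve_bytes) {
    assert(reserve_bytes > 0); // a zero reserve has nothing to decay
    return decay_budget_bytes / reserve_bytes;
}

// The two cases quoted in the commit message.
static_assert(ops_until_halved(20'000'000ULL) == 500, "20 MB reserve");
static_assert(ops_until_halved(20'000ULL) == 500'000, "20 kB reserve");
```

The inverse proportionality is the point of the design: a reserve a thousand times larger decays a thousand times faster, so expensive reserves do not linger.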


855 Commits
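
The calling pattern described for `reserve_partial()` in commit bb908b1750 (call it repeatedly with the previous return value until it returns 0) can be sketched as follows. Everything here is illustrative: `toy_chunked_storage` is a hypothetical stand-in rather than `utils::chunked_vector`, the chunk size is arbitrary, and the meaning given to the non-zero return value is an assumption. The `yield` callback marks where real Seastar code would give up the CPU between allocations.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical toy container, NOT utils::chunked_vector.
class toy_chunked_storage {
    static constexpr std::size_t chunk_size = 4; // elements per chunk (illustrative)
    std::vector<std::unique_ptr<int[]>> _chunks;
public:
    std::size_t capacity() const { return _chunks.size() * chunk_size; }

    // Allocates at most one chunk towards a capacity of `n` elements.
    // Returns `n` while more chunks are still needed and 0 once
    // capacity() >= n (assumed semantics for this sketch).
    std::size_t reserve_partial(std::size_t n) {
        if (capacity() < n) {
            _chunks.push_back(std::make_unique<int[]>(chunk_size));
        }
        return capacity() < n ? n : 0;
    }
};

// Drives reserve_partial() to completion, calling `yield` between steps.
// Returns the number of reserve_partial() calls that were made.
template <typename Yield>
std::size_t reserve_gently(toy_chunked_storage& s, std::size_t n, Yield yield) {
    std::size_t calls = 0;
    while (n) {
        n = s.reserve_partial(n);
        ++calls;
        if (n) {
            yield(); // more work remains: let other tasks run first
        }
    }
    return calls;
}
```

Because each call allocates a single chunk, the longest uninterrupted stretch of allocation work is bounded by one chunk regardless of the total size requested, which is what avoids the stalls the commit message mentions.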