scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	8fbea0b5b7	utils: cached_file: Introduce file wrapper It's an adaptor between seastar::file and cached_file. It gives a seastar::file which will serve reads using a given cached_file as a read-through cache.	2021-07-02 10:25:58 +02:00
Tomasz Grabiec	8e2118069b	sstables: cached_file: Account buffers returned by cached_file under read_permit We want buffers to be accounted only when they are used outside cached_file. Cached pages should not be accounted because they will stay around for longer than the read after subsequent commits.	2021-07-02 10:25:58 +02:00
Tomasz Grabiec	7fa4e10aa0	row_cache: Use generic LRU for eviction In preparation for tracking different kinds of objects, not just rows_entry, in the LRU, switch to the LRU implementation from utils/lru.hh which can hold arbitrary element type.	2021-07-02 10:25:58 +02:00
Tomasz Grabiec	6b59c8cfb1	utils: Introduce general-purpose LRU The LRU can link objects of different types, which is achieved by having a virtual base class called "evictable" from which the linked objects should inherit. When the object is removed from the LRU, evictable::on_evicted() is called. The container is non-owning.	2021-07-02 10:25:58 +02:00
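The design described in the commit above (a virtual `evictable` base class, an `on_evicted()` callback, a non-owning container) can be sketched roughly as follows. This is a minimal illustration only, not the code in utils/lru.hh: `simple_lru`, `row_entry`, `cached_page` and `lru_demo` are hypothetical names, and a `std::list` of pointers stands in for whatever linkage the real container uses.

```cpp
#include <cassert>
#include <list>
#include <string>
#include <vector>

// Base class for anything that can be linked into the LRU
// (the names evictable and on_evicted come from the commit message).
class evictable {
public:
    virtual ~evictable() = default;
    virtual void on_evicted() noexcept = 0;
};

// Hypothetical non-owning LRU: it stores pointers and never deletes elements.
class simple_lru {
    std::list<evictable*> _items; // front = least recently used
public:
    void add(evictable& e) { _items.push_back(&e); }

    // Mark an element as recently used by moving it to the back.
    void touch(evictable& e) {
        _items.remove(&e);
        _items.push_back(&e);
    }

    // Unlink the least recently used element and notify it.
    bool evict() {
        if (_items.empty()) {
            return false;
        }
        evictable* e = _items.front();
        _items.pop_front();
        e->on_evicted();
        return true;
    }
};

// Two unrelated element types sharing one LRU (both hypothetical).
struct row_entry : evictable {
    std::vector<std::string>& log;
    explicit row_entry(std::vector<std::string>& l) : log(l) {}
    void on_evicted() noexcept override { log.push_back("row"); }
};

struct cached_page : evictable {
    std::vector<std::string>& log;
    explicit cached_page(std::vector<std::string>& l) : log(l) {}
    void on_evicted() noexcept override { log.push_back("page"); }
};

// Links one object of each type, touches the row, then evicts everything.
std::vector<std::string> lru_demo() {
    std::vector<std::string> log;
    row_entry row(log);
    cached_page page(log);
    simple_lru lru;
    lru.add(row);
    lru.add(page);
    lru.touch(row);          // row becomes the most recently used
    while (lru.evict()) {}   // page is evicted first, then row
    return log;
}
```

Because the container is non-owning, the elements in `lru_demo()` live on the stack and outlive their eviction; what "evicted" means is left to each type's `on_evicted()`.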
Avi Kivity	6676ceabde	Merge 'Prevent reactor stall in utils::merge_to_gently' from Benny Halevy std::copy_if runs without yielding. See https://github.com/scylladb/scylla/issues/8897#issuecomment-867522480 Also, eliminate extraneous loop on merge first1 will point to the inserted value which is a copy of first2. Since list2 is sorted in ascending order, the next item from list2 will never be less than the one we've just inserted, so we waste an iteration to merely increment first1 again. Fixes #8897 Test: unit(dev), stall_free_test(debug) DTest: repair_additional_test.py:RepairAdditionalTest.{repair_same_row_diff_value_3nodes_diff_shard_count_test,repair_disjoint_row_3nodes_diff_shard_count_test} (dev) Closes #8925 github.com:scylladb/scylla: utils: merge_to_gently: eliminate extraneous loop on merge utils: merge_to_gently: prevent stall in std::copy_if	2021-06-27 13:56:32 +03:00
Benny Halevy	b96eeaefe4	utils: merge_to_gently: eliminate extraneous loop on merge first1 will point to the inserted value which is a copy of first2. Since list2 is sorted in ascending order, the next item from list2 will never be less than the one we've just inserted, so we waste an iteration to merely increment first1 again. Note that the standard states that no iterators or references are invalidated on insert so we can safely keep looking at `first1` after inserting a copy of `first2` before it. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-24 14:58:12 +03:00
Benny Halevy	453e7c8795	utils: merge_to_gently: prevent stall in std::copy_if std::copy_if runs without yielding. See https://github.com/scylladb/scylla/issues/8897#issuecomment-867522480 Note that the standard states that no iterators or references are invalidated on insert so we can keep inserting before last1 when merging the remainder of list2 at the tail of list1. Fixes #8897 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-06-24 14:47:25 +03:00
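The two merge_to_gently commits above rely on the fact that `std::list::insert` invalidates no iterators. A rough sketch of a merge loop built on that property follows; the function name `merge_sorted_into` is hypothetical, and the yielding that the real function performs between iterations (it runs under Seastar) is only marked by a comment here.

```cpp
#include <cassert>
#include <functional>
#include <list>

// Merge the sorted list2 into the sorted list1 in place.
// Sketch only: the real utility would yield to the scheduler on each
// iteration instead of running to completion.
template <typename T, typename Less>
void merge_sorted_into(std::list<T>& list1, const std::list<T>& list2, Less less) {
    auto first1 = list1.begin();
    auto first2 = list2.begin();
    const auto last1 = list1.end();
    const auto last2 = list2.end();
    while (first2 != last2) {
        // (a yield point would go here)
        if (first1 == last1 || less(*first2, *first1)) {
            // Insert a copy of *first2 before first1. The insert invalidates
            // no iterators, so first1 (or last1) stays usable and we keep
            // comparing against the same element of list1. The next item of
            // list2 cannot be less than the one just inserted.
            list1.insert(first1, *first2);
            ++first2;
        } else {
            ++first1;
        }
    }
}
```

For example, merging `{2, 3, 6, 7}` into `{1, 3, 5}` with `std::less<int>` leaves the first list holding `{1, 2, 3, 3, 5, 6, 7}`.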
Pavel Emelyanov	a61afe8421	btree: Improve unlink_leftmost_without_rebalance() The helper is used to walk the tree key-by-key destroying it in the meantime. Current implementation of this method just uses the "regular" erasing code which actually rebalances the tree despite the name. The biggest problem with removing the rebalancing is that at some point non-balanced tree may have the left-most key on an inner node, so to make 100% rebalance-less unlink every other method of the tree would have to be prepared for that. However, there's an option to make "light rebalance" (as it's called in this patch) that only maintains this crucial property of the tree -- the left-most key is on the leaf. Some more tech details. Current rebalancer starts when the node population falls below 1/2 of its capacity and tries to - grab a key from one of the siblings if it's balanced - merge two siblings together if they are small enough The light rebalance is lighter in two ways. First, it leaves the node unbalanced until it becomes empty. And then it goes ahead and replaces it with the next sibling. This change removes ~60% of the key movements on random test. Keys still move when the sibling replace happens because in this case the separation key needs to be placed at the right sibling 0 position which means shifting all its keys right. tests: unit(debug) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20210623083836.27491-1-xemul@scylladb.com>	2021-06-24 12:03:26 +03:00
Benny Halevy	02917c79b6	logalloc: get rid of unused _descendant_blocked_requests Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210620064204.1709957-1-bhalevy@scylladb.com>	2021-06-22 15:58:56 +02:00
Avi Kivity	f05ddf0967	Merge "Improve LSA descriptor encoding" from Pavel " The LSA small objects allocation latency is greatly affected by the way this allocator encodes the object descriptor in front of each allocated slot. Nowadays it's one of VLE variants implemented with the help of a loop. Re-implementing this piece with fewer instructions and without a loop allows greatly reducing the allocation latency. The speed-up mostly comes from loop-less code that doesn't confuse branch predictor. Also the express encoder seems to benefit from writing 8 bytes of the encoded value in one go, rather than byte-by-byte. Perf measurements: 1. (new) logalloc test shows ~40% smaller times 2. perf_mutation in release mode shows ~2% increase in tps 3. the encoder itself is 2 - 4 times faster on x86_64 and 1.05 - 3 times faster on aarch64. The speed-up depends on the 'encoded length', old encoder has linear time, the new one is constant tests: unit(dev), perf(release), just encoder on Aarch64 " * 'br-lsa-alloc-latency-4' of https://github.com/xemul/scylla: lsa: Use express encoder uleb64: Add express encoding lsa: Extract uleb64 code into header test: LSA allocation perf test	2021-06-16 18:07:13 +03:00
Pavel Emelyanov	8d0780fb92	lsa: Use express encoder To make it possible to use the express encoder, lsa needs to make sure that the value is below express supreme value and provide the size of the gap after the encoded value. Both requirements can be satisfied when encoding the migrator index on object allocation. On free the encoded value can be larger, so the extended express encoder will need more instructions and will not be that efficient, so the old encoder is used there. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-06-16 17:47:12 +03:00
Pavel Emelyanov	1782b0c6b9	uleb64: Add express encoding Standard encoding is compiled into a loop that puts values into memory byte-by-byte. This works slowly, but reliably. When allocating an object LSA uses uleb64 encoder with 2 features that allow optimizing the encoder: 1. the value is migrator.index() which is small enough to fit 2 bytes when encoded 2. After the descriptor there usually comes an object which is of 8+ bytes in size Feature #1 makes it possible to encode the value with just a few instructions. In O3 level clang makes it like mov %esi,%ecx and $0xfc0,%ecx and $0x3f,%esi lea (%rsi,%rcx,4),%ecx add $0x40,%ecx Next, the encoder needs to put the value into a gap whose size depends on the alignment of previous and current objects, so the classical algo loops through this size. Feature #2 makes it possible to put the encoded value and the needed amount of zeros by using 2 64-bit movs. In this case the encoded value gets off the needed size and overwrites some memory after. That's OK, as this overwritten memory is where the allocated object _will_ be, so the contents there are not of any interest. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-06-16 17:47:10 +03:00
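The five-instruction sequence quoted in the commit above computes `(v & 0x3f) + ((v & 0xfc0) << 2) + 0x40`. A direct transcription into C++ is sketched below. The function names are hypothetical, the reading of the `0x40` bit as a marker in the first byte is an assumption, and `express_decode` is only an illustrative inverse for values below 4096, not the decoder used by LSA.

```cpp
#include <cassert>
#include <cstdint>

// Loop-free encoding of a value below 4096 (12 bits) into two bytes:
//   byte 0: low 6 bits of v, plus the 0x40 bit (assumed to be a marker)
//   byte 1: next 6 bits of v
// This mirrors the and/and/lea/add sequence from the commit message.
constexpr uint32_t express_encode(uint32_t v) {
    return (v & 0x3f) + ((v & 0xfc0) << 2) + 0x40;
}

// Illustrative inverse: pick 6 payload bits out of each of the two bytes.
constexpr uint32_t express_decode(uint32_t e) {
    return (e & 0x3f) | (((e >> 8) & 0x3f) << 6);
}

static_assert(express_decode(express_encode(0x123)) == 0x123,
              "round trip for a 12-bit value");
```

Since the whole computation is a handful of masks, shifts and adds, its cost does not depend on the encoded length, which matches the "constant time" claim in the merge message.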
Pavel Emelyanov	d8dea48248	lsa: Extract uleb64 code into header The LSA code encodes an object descriptor before the object itself. The descriptor is 32-bit value and to put it in an efficient manner it's encoded into unsigned little-endian base-64 sequence. The encoding code is going to be optimized, so put it into a dedicated header in advance. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-06-16 17:46:44 +03:00
Tomasz Grabiec	605a6e0166	Merge "Remove int_or_strong_ordering concept" from Pavel The one was added to smoothly switch tri-comparing stuff from int to strong-ordering. As for today only tests still need it and the conversion is pretty simple, plus operator<<(ostream&) for the std::strong_ordering type. * xemul/br-remove-int-or-strong-ordering-2: util: Drop int_or_strong_ordering concept tests: Switch total-order-check onto strong_ordering to_string: Add formatter for strong_ordering tests: Return strong-ordering from tri-comparators	2021-06-16 09:34:49 +02:00
Nadav Har'El	afacffc556	utils/enum_option.hh: add implicit converter to the underlying enum Add an implicit converter of the enum_option to the underlying enum it is holding. This is needed for using switch() on an enum_option. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-06-13 13:18:49 +03:00
Pavel Emelyanov	4ad4208426	util: Drop int_or_strong_ordering concept Nobody uses it now. All tri-comparing stuff is strong_ordering now. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-06-08 11:40:55 +03:00
Pavel Solodovnikov	76bea23174	treewide: reduce header interdependencies Use forward declarations wherever possible. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Closes #8813	2021-06-07 15:58:35 +03:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Avi Kivity	e9e5663731	build, utils/bptree.hh: drop -Wno-gnu-designator warning Drop the warning about old-style GNU designated initializers and convert two violations in bptree.hh to the standard C++20 syntax. Closes #8743	2021-05-31 18:51:49 +03:00
Botond Dénes	cd6bbd37a4	utils/utf8.c: move includes outside of namespaces Including in the middle of a namespace is not a good practice. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20210528142502.962947-1-bdenes@scylladb.com>	2021-05-30 23:23:20 +03:00
Benny Halevy	d46958d3ce	phased_barrier: advance_and_await: abort on allocation failure Currently, advance_and_await() allocates a new gate, which might fail. Rather than returning this failure as an exceptional future, which would require its callers to handle that failure, keep the function as noexcept and let an exception from make_lw_shared<gate>() terminate the program. This makes the function "fail-free" to its callers, in particular, when called from the table::stop() path where we can't do much about these failures and we require close/stop functions to always succeed. The alternative of making the allocation of a new gate optional and recovering from it in start() is possible but was deemed not worth it, as it would add complexity and cost to start(), which is called on the common, hot path. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210525055336.1190029-1-bhalevy@scylladb.com>	2021-05-25 12:50:59 +02:00
Avi Kivity	948e2c0b36	utils: config_file: delete unneeded template instantiation of operator<<() config_file.cc instantiates std::istream& std::operator>>(std::istream&, std::unordered_map<seastar::sstring, seastar::sstring>&), but that instantiation is ignored since config_file_impl.hh specializes that signature. -Winstantiation-after-specialization warns about it, so re-enable it now that the code base is clean. Also remove the matching "extern template" declaration, which has no definition any more. Closes #8696	2021-05-24 18:34:45 +03:00
Avi Kivity	50f3bbc359	Merge "treewide: various header cleanups" from Pavel S " The patch set is an assorted collection of header cleanups, e.g: * Reduce number of boost includes in header files * Switch to forward declarations in some places A quick measurement was performed to see if these changes provide any improvement in build times (ccache cleaned and existing build products wiped out). The results are posted below (`/usr/bin/time -v ninja dev-build`) for 24 cores/48 threads CPU setup (AMD Threadripper 2970WX). Before: Command being timed: "ninja dev-build" User time (seconds): 28262.47 System time (seconds): 824.85 Percent of CPU this job got: 3979% Elapsed (wall clock) time (h:mm:ss or m:ss): 12:10.97 Average shared text size (kbytes): 0 Average unshared data size (kbytes): 0 Average stack size (kbytes): 0 Average total size (kbytes): 0 Maximum resident set size (kbytes): 2129888 Average resident set size (kbytes): 0 Major (requiring I/O) page faults: 1402838 Minor (reclaiming a frame) page faults: 124265412 Voluntary context switches: 1879279 Involuntary context switches: 1159999 Swaps: 0 File system inputs: 0 File system outputs: 11806272 Socket messages sent: 0 Socket messages received: 0 Signals delivered: 0 Page size (bytes): 4096 Exit status: 0 After: Command being timed: "ninja dev-build" User time (seconds): 26270.81 System time (seconds): 767.01 Percent of CPU this job got: 3905% Elapsed (wall clock) time (h:mm:ss or m:ss): 11:32.36 Average shared text size (kbytes): 0 Average unshared data size (kbytes): 0 Average stack size (kbytes): 0 Average total size (kbytes): 0 Maximum resident set size (kbytes): 2117608 Average resident set size (kbytes): 0 Major (requiring I/O) page faults: 1400189 Minor (reclaiming a frame) page faults: 117570335 Voluntary context switches: 1870631 Involuntary context switches: 1154535 Swaps: 0 File system inputs: 0 File system outputs: 11777280 Socket messages sent: 0 Socket messages received: 0 Signals delivered: 0 Page size (bytes): 4096 Exit status: 0 The observed improvement is about 5% of total wall clock time for `dev-build` target. Also, all commits make sure that headers stay self-sufficient, which would help to further improve the situation in the future. " * 'feature/header_cleanups_v1' of https://github.com/ManManson/scylla: transport: remove extraneous `qos/service_level_controller` includes from headers treewide: remove evidently unneeded storage_proxy includes from some places service_level_controller: remove extraneous `service/storage_service.hh` include sstables/writer: remove extraneous `service/storage_service.hh` include treewide: remove extraneous database.hh includes from headers treewide: reduce boost headers usage in scylla header files cql3: remove extraneous includes from some headers cql3: various forward declaration cleanups utils: add missing <limits> header in `extremum_tracking.hh`	2021-05-24 14:24:20 +03:00
Michał Chojnowski	03faf139c8	collection_mutation: don't linearize collection values Yet another patch preventing potentially large allocations. Currently, collection_mutation{_view,}_description linearize each collection value during deserialization. It's not unthinkable that a user adds a large element to a list or a map, so let's avoid that. This patch removes the dependency on linearizing_input_stream, which does not provide a way to read fragmented subbuffers, and replaces it with a new helper, which does. (Extending linearizing_input_stream is not viable without rewriting it completely). Only linearization of collection values is corrected in this patch. Collection keys are still linearized. Storing them in managed_bytes is likely to be more harmful than helpful, because large map keys are extremely unlikely, and UUIDs, which are used as keys in lists, do not fit into managed_bytes's small value optimization, so this would incur an extra allocation for every list element. Note: this patch leaves utils/linearizing_input_stream.hh unused. Refs: #8120 Closes #8690	2021-05-23 12:16:56 +03:00
Michał Chojnowski	d72b91053b	logalloc: fix quadratic behaviour of reclaim_from_evictable As an optimization for optimistic cases, reclaim_from_evictable first evicts the requested amount of memory before attempting to reclaim segments through compactions. However, due to an oversight, it does this before every compaction instead of once before all compactions. Usually reclaim_from_evictable is called with small targets, or is preemptible, and in those cases this issue is not visible. However, when the target is bigger than one segment and the reclaim is not preemptible, which is the case when it's called from allocating_section, this results in a quadratic explosion of evictions, which can evict several hundred MiB to reclaim a few MiB. Fix that by calculating the target of memory eviction only once, instead of recalculating it after every compaction. Fixes #8542. Closes #8611	2021-05-22 20:49:00 +02:00
Pavel Solodovnikov	fff7ef1fc2	treewide: reduce boost headers usage in scylla header files `dev-headers` target is also ensured to build successfully. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-05-20 01:33:18 +03:00
Piotr Sarna	223a59c09c	test: make rjson allocator test working in sanitize mode Following Nadav's advice, instead of ignoring the test in sanitize/debug modes, the allocator simply has a special path of failing sufficiently large allocation requests. With that, a problem with the address sanitizer is bypassed and other debug mode sanitizers can inspect and check if there are no more problems related to wrapping the original rapidjson allocator. Closes #8539	2021-05-20 00:42:47 +03:00
Pavel Solodovnikov	94b5c6333f	utils: add missing <limits> header in `extremum_tracking.hh` This makes all headers in scylla to be self-sufficient up to the moment. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-05-20 00:04:51 +03:00
Pavel Emelyanov	0c4ba56594	bptree: Require dynamic object for nodes reconstruct The B+ tree is not intrusive and supports both kinds of objects -- dynamic (in sense of previous patch) and fixed-size. Respectively, the nodes provide .storage_size() method and get the embedded object storage size themselves. Thus, if a dynamic object is used with the tree but it misses the .storage_size() itself this would come unnoticed. Fortunately, dynamic objects use the .reconstruct() method, so the method should be equipped with the DynamicObject concept. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-05-19 09:23:49 +03:00
Pavel Emelyanov	9216a5bc08	allocation_strategy, code: Conceptualize dynamic objects Usually lsa allocation is performed with the construct() helper that allocates a sizeof(T) slot and constructs it in place. Some rare objects have dynamic size, so they are created by alloc()ating a slot of some specific size and (!) must provide the correct overload of size_for_allocation_strategy that reports back the relevant storage size. This "must provide" is not enforced; if missed, a default sizer would be instantiated, but it won't work properly. This patch makes all users of alloc() conform to DynamicObject concept which requires the presence of .storage_size() method to tell that size. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-05-19 09:23:49 +03:00
Pavel Emelyanov	b8a4f32b48	allocation_strategy: Hide migrators After IMR rework the only lsa-migrating functionality is standard one that calls move constructors on lsa slots. Hide the whole thing inside allocation-strategy. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-05-19 09:23:49 +03:00
Pavel Emelyanov	28f01aadc9	allocation_strategy, code: Simplify alloc() Today's alloc() accepts migrate-fn, size and alignment. All the callers don't really need to provide anything special for the migrate-fn and are just happy with default alignof() for alignment. The simplification is in providing alloc() that only accepts size arg and does the rest itself. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-05-19 09:23:49 +03:00
Pavel Emelyanov	fdfcda97d7	allocation_strategy: Mark size_for_allocation_strategy noexcept Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-05-19 09:23:49 +03:00
Tomasz Grabiec	f8d7374400	Merge 'Add additional sstable stats' from Michael Livshin Refs #251. Closes #8630 * github.com:scylladb/scylla: statistics: add global bloom filter memory gauge statistics: add some sstable management metrics sstables: make the `_open` field more useful sstables: stats: noexcept all accessors	2021-05-12 14:35:13 +02:00
Michael Livshin	357ab759ee	statistics: add global bloom filter memory gauge Refs #251. Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>	2021-05-12 03:48:07 +03:00
Benny Halevy	c0dafa75d9	utils: phased_barrier: advance_and_await: make noexcept As a function returning a future, simplify its interface by handling any exceptions and returning an exceptional future instead of propagating the exception. In this specific case, throwing from advance_and_await() will propagate through table::await_pending_* calls short-circuiting a .finally clause in table::stop(). Also, mark as noexcept methods of class table calling advance_and_await and table::await_pending_ops that depends on them. Fixes #8636 A followup patch will convert advance_and_await to a coroutine. This is done separately to facilitate backporting of this patch. Test: unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210511161407.218402-1-bhalevy@scylladb.com>	2021-05-12 01:36:11 +02:00
Nadav Har'El	c7a814fd5c	utils/enum_option.hh: make it easier to compare the value The operator== of enum_option<> (which we use to hold multi-valued Scylla options) makes it easy to compare to another enum_option wrapper, but ugly to compare the actual value held. So this patch adds a nicer way to compare the value held. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210511120222.1167686-1-nyh@scylladb.com>	2021-05-11 18:39:10 +03:00
Benny Halevy	9ba960a388	utils: phased_barrier::operation do not leak gate entry when reassigned utils::phased_barrier holds a `lw_shared_ptr<gate>` that is typically `enter()`ed in `phased_barrier::start()`, and left when the operation is destroyed in `~operation`. Currently, the operation move-assign implementation is the default one that just moves the lw_shared gate ptr from the other operation into this one, without calling `_gate->leave()` first. This change first destroys *this when move-assigned (if not self) to call _gate->leave() if engaged, before reassigning the gate with the other operation::_gate. A unit test that reproduces the issue before this change and passes with the fix was added to serialized_action_test. Fixes #8613 Test: unit(dev), serialized_action_test(debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210510120703.1520328-1-bhalevy@scylladb.com>	2021-05-11 18:39:10 +03:00
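The move-assignment fix described in the commit above (leave the currently held gate before taking over the other operation's gate) can be illustrated with the sketch below. It is a simplification under stated assumptions: `counting_gate` is a hypothetical stand-in for the Seastar gate, `std::shared_ptr` replaces `lw_shared_ptr`, and the constructor enters the gate directly rather than going through `phased_barrier::start()`.

```cpp
#include <cassert>
#include <memory>
#include <new>
#include <utility>

// Hypothetical stand-in for a gate: just counts entries.
struct counting_gate {
    int count = 0;
    void enter() { ++count; }
    void leave() { --count; }
};

class operation {
    std::shared_ptr<counting_gate> _gate;
public:
    operation() = default;
    explicit operation(std::shared_ptr<counting_gate> g) : _gate(std::move(g)) {
        _gate->enter();
    }
    // Moving a shared_ptr leaves the source empty, so the moved-from
    // operation will not call leave() in its destructor.
    operation(operation&&) noexcept = default;

    // A defaulted move-assign would overwrite _gate without calling leave()
    // on the old gate. Destroying *this first releases the old gate.
    operation& operator=(operation&& o) noexcept {
        if (this != &o) {
            this->~operation();
            new (this) operation(std::move(o));
        }
        return *this;
    }

    ~operation() {
        if (_gate) {
            _gate->leave();
        }
    }
};
```

With this shape, reassigning an engaged operation drops the old gate's count back to zero while the new gate stays entered exactly once.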
Benny Halevy	2a168c3224	atomic_cell: get rid of is_value_fragments It isn't used. Along with it, get rid also of: managed_bytes::is_fragmented and managed_bytes_basic_view::is_fragmented Test: unit(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210506174115.171048-1-bhalevy@scylladb.com>	2021-05-09 11:08:53 +03:00
Avi Kivity	3114f09d76	utils: small_vector: add print operator for std::ostream In order to replace std::vector with utils::small_vector, it needs to support this feature too.	2021-05-05 12:10:59 +03:00
Pavel Solodovnikov	a7bd7dd122	utils: make basic UUID constructors constexpr Mark default and `UUID(most_sig_bits, least_sig_bits)` ctors as constexpr. This allows to construct constexpr constants using UUID type. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20210429170630.533596-2-pa.solodovnikov@scylladb.com>	2021-05-02 16:39:52 +03:00
Avi Kivity	ae660eeec4	logalloc: reduce minimum lsa reserve in allocating_section to 1 Many workloads have fairly constant and small request sizes, so we don't need large reserves for them. These workloads suffer needlessly from the current large reserve of 10 segments (1.2MB) when they really need a few hundred bytes. Reduce the reserve to a minimum of 1 segment. Note that due to #8542 this can make a large difference. Consider a workload that has a 1000-byte footprint in cache. If we've just consumed some free memory and reduced the reserve to zero, then we'll evict about 50,000 objects before proceeding to compact. With the reserve reduced to 1, we'll evict 128 objects. All this for 1000 bytes of memory. Of course, #8542 should be fixed, but reducing the reserve provides some quick relief and makes sense even with the larger fix. The reserve will quickly grow for workloads that handle bigger requests, so they won't see an impact from the reduction. Closes #8572	2021-05-02 15:22:04 +02:00
Avi Kivity	5801c93715	utils: rjson: convert enable_if to concept Simpler and easier to understand. Vague comment about enable_if removed. Closes #8405	2021-04-25 21:53:46 +03:00
Avi Kivity	c36549b22e	Merge 'rjson: Add throwing allocator' from Piotr Sarna This series adds a wrapper for the default rjson allocator which throws on allocation/reallocation failures. It's done to work around several rapidjson (the underlying JSON parsing library) bugs - in a few cases, malloc/realloc return value is not checked, which results in dereferencing a null pointer (or an arbitrary pointer computed as 0 + `size`, with the `size` parameter being provided by the user). The new allocator will throw an `rjson:error` if it fails to allocate or reallocate memory. This series comes with unit tests which checks the new allocator behavior and also validates that an internal rapidjson structure which we indirectly rely upon (Stack) is not left in invalid state after throwing. The last part is verified by the fact that its destructor ran without errors. Fixes #8521 Refs #8515 Tests: * unit(release) * YCSB: inserting data similar to the one mentioned in #8515 - 1.5MB objects clustered in partitions 30k objects in size - nothing crashed during various YCSB workloads, but nothing also crashed for me locally before this patch, so it's not 100% robust relevant YCSB workload config for using 1.5MB objects: ```yaml fieldcount=150 fieldlength=10000 ``` Closes #8529 * github.com:scylladb/scylla: test: add a test for rjson allocation test: rename alternator_base64_test to alternator_unit_test rjson: add a throwing allocator	2021-04-22 17:12:02 +03:00
Avi Kivity	350f79c8ce	Merge 'sstables: remove large allocations when parsing cells' from Wojciech Mitros sstable cells are parsed into temporary_buffers, which causes large contiguous allocations for some cells. This is fixed by storing fragments of the cell value in a fragmented_temporary_buffer instead. To achieve this, this patch also adds new methods to the fragmented_temporary_buffer(size(), ostream& operator<<()) and adds methods to the underlying parser(primitive_consumer) for parsing byte strings into fragmented buffers. Fixes #7457 Fixes #6376 Closes #8182 * github.com:scylladb/scylla: primitive_consumer: keep fragments of parsed buffer in a small_vector sstables: add parsing of cell values into fragmented buffers sstables: add non-contiguous parsing of byte strings to the primitive_consumer utils: add ostream operator<<() for fragmented_temporary_buffer::view compound_type: extend serialize_value for all FragmentedView types	2021-04-22 15:38:10 +02:00
Piotr Sarna	45d7144529	rjson: add a throwing allocator The default rapidjson allocator returns nullptr from a failed allocation or reallocation. It's not a bug by itself, but rapidjson internals usually don't check for these return values and happily use nullptr as a valid pointer, which leads to segmentation faults and memory corruptions. In order to prevent these bugs, the default allocator is wrapped with a class which simply throws once it fails to allocate or reallocate memory, thus preventing the use of nullptr in the code. One exception is Malloc/Realloc with size 0, which is expected to return nullptr by rapidjson code.	2021-04-21 14:26:38 +02:00
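The wrapping technique described in the commit above can be sketched generically, without depending on rapidjson itself. The `Malloc`/`Realloc`/`Free` member names follow rapidjson's allocator interface; everything else is an assumption for illustration: `crt_like_allocator` and `always_failing_allocator` are hypothetical base allocators, and `std::bad_alloc` stands in for whatever exception type the real wrapper throws.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <new>

// Hypothetical base allocator that reports failure by returning nullptr.
struct crt_like_allocator {
    void* Malloc(std::size_t size) { return size ? std::malloc(size) : nullptr; }
    void* Realloc(void* p, std::size_t /*old_size*/, std::size_t new_size) {
        if (new_size == 0) {
            std::free(p);
            return nullptr;
        }
        return std::realloc(p, new_size);
    }
    static void Free(void* p) { std::free(p); }
};

// Hypothetical base allocator that always fails, used to exercise the wrapper.
struct always_failing_allocator {
    void* Malloc(std::size_t) { return nullptr; }
    void* Realloc(void*, std::size_t, std::size_t) { return nullptr; }
    static void Free(void*) {}
};

// Wrapper that turns a nullptr result into an exception, except for
// zero-sized requests, where nullptr is the expected answer.
template <typename Base>
class throwing_allocator : private Base {
public:
    void* Malloc(std::size_t size) {
        void* p = Base::Malloc(size);
        if (size != 0 && p == nullptr) {
            throw std::bad_alloc();
        }
        return p;
    }
    void* Realloc(void* orig, std::size_t old_size, std::size_t new_size) {
        void* p = Base::Realloc(orig, old_size, new_size);
        if (new_size != 0 && p == nullptr) {
            throw std::bad_alloc();
        }
        return p;
    }
    static void Free(void* p) { Base::Free(p); }
};
```

Callers of such a wrapper never see a null pointer for a non-empty request, so code that forgets to check the return value fails with an exception instead of dereferencing nullptr.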
Piotr Sarna	2ad09d0bf8	Merge 'treewide: remove inclusions of storage_proxy.hh from headers' from Avi Kivity Reduce rebuilds and build time by removing unnecessary includes. Along the way, improve header sanity. Ref #1. Test: dev-headers, unit(dev). Closes #8524 * github.com:scylladb/scylla: treewide: remove inclusions of storage_proxy.hh from headers storage_proxy: unnest coordinator_query_result treewide: make headers self-sufficient utils: intrusive_btree: add missing #pragma once	2021-04-21 08:22:52 +02:00
Avi Kivity	14a4173f50	treewide: make headers self-sufficient In preparation for some large header changes, fix up any headers that aren't self-sufficient by adding needed includes or forward declarations.	2021-04-20 21:23:00 +03:00
Avi Kivity	6db1a71775	utils: intrusive_btree: add missing #pragma once Interferes with making headers self-sufficient, so add it now.	2021-04-20 21:23:00 +03:00
Piotr Sarna	ec750e5f49	rjson: make the max nested level configurable Back when rjson was only part of alternator, there was a hardcoded limit of nested levels - 78. The number was calculated as: - taking the DynamoDB limit (32) - adding 7 to it to make alternator support more cases - doubling it because rjson internals bump the level twice for each alternator object (because the alternator object is represented as a 2-level JSON object). Since rjson is no longer specific to alternator, this limit is now configurable, and the original default value is explained in a comment. Message-Id: <51952951a7cd17f2f06ab36211f74086e1b60d2d.1618916299.git.sarna@scylladb.com>	2021-04-20 14:05:03 +03:00
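The arithmetic behind the original limit of 78 can be written out as compile-time constants. The constant names below are hypothetical and chosen only to mirror the three steps listed in the commit message.

```cpp
#include <cassert>

// DynamoDB's nesting limit, as stated in the commit message.
constexpr int dynamodb_nesting_limit = 32;
// Extra levels added so that alternator supports more cases.
constexpr int extra_levels = 7;
// Each alternator object is represented as a 2-level JSON object.
constexpr int json_levels_per_object = 2;

constexpr int default_max_nested_level =
    (dynamodb_nesting_limit + extra_levels) * json_levels_per_object;

static_assert(default_max_nested_level == 78, "(32 + 7) * 2 == 78");
```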

977 Commits