scylladb

Author	SHA1	Message	Date
Dawid Mędrek	a151944fa6	treewide: Replace __builtin_expect with (un)likely C++20 introduced two new attributes--likely and unlikely--that function as a built-in replacement for __builtin_expect implemented in various compilers. Since it makes code easier to read and it's an integral part of the language, there's no reason to not use it instead. Closes scylladb/scylladb#24786	2025-07-03 13:34:04 +03:00
Lakshmi Narayanan Sreethar	e4c7cb7834	bytes_ostream: overload write() to support writing from FragmentedView Overloaded write() method to support writing a FragmentedView into bytes_ostream. Also added a testcase to verify the implementation. The new helper will be used by the byte_comparable implementation during the encode/decode process. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2025-07-01 22:19:07 +05:30
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Avi Kivity	baaa92c6f5	bytes_ostream: replace boost ranges with std ranges Have fragment_iterator support iterator_concept for compatibility with std ranges, and switch from boost iterator_range to std::ranges::subrange.	2024-11-05 14:50:38 +02:00
Avi Kivity	cb026c347e	bytes_ostream: extract fragment_iterator into namespace scope C++ concept evaluation rules clash with nested class definition rules with the result that evaluating concepts about the nested class within the enclosing class doesn't work. Extract bytes_ostream::fragment_iterator to avoid that.	2024-11-05 14:43:49 +02:00
Avi Kivity	aa1270a00c	treewide: change assert() to SCYLLA_ASSERT() assert() is traditionally disabled in release builds, but not in scylladb. This hasn't caused problems so far, but the latest abseil release includes a commit [1] that causes a 1000 insn/op regression when NDEBUG is not defined. Clearly, we must move towards a build system where NDEBUG is defined in release builds. But we can't just define it blindly without vetting all the assert() calls, as some were written with the expectation that they are enabled in release mode. To solve the conundrum, change all assert() calls to a new SCYLLA_ASSERT() macro in utils/assert.hh. This macro is always defined and is not conditional on NDEBUG, so we can later (after vetting Seastar) enable NDEBUG in release mode. [1] `66ef711d68` Closes scylladb/scylladb#20006	2024-08-05 08:23:35 +03:00
Kefu Chai	432c000dfa	./: not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17888	2024-03-20 09:16:46 +02:00
Michał Chojnowski	5a3e4a1cc0	utils: managed_bytes: optimize memory usage for small buffers managed_bytes is implemented as chain of blob_storage objects. Each blob_storage contains 24 bytes of metadata. But in the most common case -- when there is only a single element in the chain -- 16 bytes of this metadata is trivial/unused. This is regrettable waste because managed_bytes is used for every database cell in the memtables and cache. It means that every value of size >= 7 bytes (smaller ones fit in the inline storage of managed_bytes) receives 16 bytes of useless overhead. To correct that, this patch adds to managed_bytes an alternative storage layout -- used for buffers small enough to fit in one contiguous fragment -- which only stores the necessary minimum of metadata. (That is: a pointer to the parent, to facilitate moving the storage during memory defragmentation).	2024-02-09 20:56:20 +01:00
Avi Kivity	11d651b606	utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view The codebase evolved to have several different ways to hold a fragmented buffer: fragmented_temporary_buffer (for data received from the network; not relevant for this discussion); bytes_ostream (for fragmented data that is built incrementally; also used for a serialized result_set), and managed_bytes (used for lsa and serialized individual values in expression evaluation). One problem with this state of affairs is that using data in one fragmented form with functions that accept another fragmented form requires either a copy, or templating everything. The former is unpalatable for fast-path code, and the latter is undesirable for compile time and run-time code footprint. So we'd like to make the various forms compatible. In `53e0dc7530` ("bytes_ostream: base on managed_bytes") we changed bytes_ostream to have the same underlying data structure as managed_bytes, so all that remains is to add the right API. This is somewhat difficult as the data is hidden in multiple layers: ser::buffer_view<> is used to abstract a slice of bytes_ostream, and this is further abstracted by using iterators into bytes_ostream rather than directly using the internals. Likewise, it's impossible to construct a managed_bytes_view from the internals. Hack through all of these by adding extract_implementation() methods, and a build_managed_bytes_view_from_internals() helper. These are all used by new APIs buffer_view_to_managed_bytes_view() that extract the internals and put them back together again. Ideally we wouldn't need any of this, but unifying the type system in this area is quite an undertaking, so we need some shortcuts.	2023-05-07 17:17:34 +03:00
Kefu Chai	f5b05cf981	treewide: use defaulted operator!=() and operator==() in C++20, the compiler generates operator!=() if the corresponding operator==() is already defined, the language now understands that the comparison is symmetric in the new standard. fortunately, our operator!=() is always equivalent to `! operator==()`, this matches the behavior of the default generated operator!=(). so, in this change, all `operator!=` are removed. in addition to the defaulted operator!=, C++20 also brings to us the defaulted operator==() -- it is able to generate the operator==() as a member-wise lexicographical comparison. under some circumstances, this is exactly what we need. so, in this change, if the operator==() is also implemented as a lexicographical comparison of all member variables of the class/struct in question, it is implemented using the default generated one by removing its body and marking the function as `default`. moreover, if the class happens to have other comparison operators which are implemented using lexicographical comparison, the default generated `operator<=>` is used in place of the defaulted `operator==`. sometimes, we fail to mark the operator== with the `const` specifier, in this change, to fulfil the need of C++ standard, and to be more correct, the `const` specifier is added. also, to generate the defaulted operator==, the operand should be `const class_name&`, but it is not always the case, in the class of `version`, we use `version` as the parameter type, to fulfill the need of the C++ standard, the parameter type is changed to `const version&` instead. this does not change the semantic of the comparison operator. and is a more idiomatic way to pass non-trivial struct as function parameters. please note, because in C++20, both operator== and operator<=> are symmetric, some of the operators in `multiprecision` are removed. they are the symmetric form of another variant. if they were not removed, compiler would, for instance, find ambiguous overloaded operator '=='. this change is a cleanup to modernize the code base with C++20 features. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13687	2023-04-27 10:24:46 +03:00
Avi Kivity	e2f6e0b848	utils: move hashing related files to utils/ module Closes #12884	2023-02-17 07:19:52 +02:00
Avi Kivity	23b94ac391	bytes_ostream: don't take reference to packed variable bytes_ostream is packed, so its _begin member is packed as well. gcc (correctly) disallows taking a reference to an unaligned variable in an aligned reference, and complains. Make it happy by open-coding the exchange operation.	2022-11-28 21:40:18 +02:00
Raphael S. Carvalho	7d97e15c43	bytes_ostream: Avoid waste by rounding up allocation size to power-of-two - bytes_ostream has a default initial chunk size of 512. - let's say we call bytes_ostream::write() to write 500 bytes. - as next_alloc_size() takes into account space to hold chunk metadata (24 bytes) + chunk data, then 512 bytes is not enough, so it returns 500 + 24 instead to be allocated. - when allocating next chunk, next_alloc_size() will use the size of existing chunk, which is 500 bytes (without metadata) and multiply it by 2 (growth factor), so 1000 bytes is allocated for it. So allocations can be non power-of-two, resulting in memory waste. When seastar is allocating from small pools, the waste is not terrible (although accumulated small wastes can be problematic), but once allocations pass the large threshold (16k), then alignment is 4k (page size) and the waste is not negligible. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #11027	2022-07-13 16:51:13 +03:00
Avi Kivity	53e0dc7530	bytes_ostream: base on managed_bytes bytes_ostream is an incremental builder for a discontiguous byte container. managed_bytes is a non-incremental (size must be known up front) byte container, that is also compatible with LSA. So far, conversion between them involves copying. This is unfortunate, since query_result is generated as a bytes_ostream, but is later converted to managed_bytes (today, this is done in cql3::expr::get_non_pk_values() and compound_view_wrapper::explode(). If the two types could be made compatible, we could use managed_bytes_view instead of creating new objects and avoid a copy. It's also nicer to have one less vocabulary type. This patch makes bytes_ostream use managed_bytes' internal representation (blob_storage instead of bytes_ostream::chunk) and provides a conversion to managed_bytes. All bytes_ostream users are left in place, but the goal is to make bytes_ostream a write-only type with the only observer a conversion to managed_bytes. It turns out to be relatively simple. The internal representations were already similar. I made blob_storage::ref_type self-initializing to reduce churn (good practice anyway) and added a private constructor to managed_bytes for the conversion. Note that bytes_ostream can only be used to construct a non-LSA managed_bytes, but LSA uses of managed_bytes are very strictly controlled (the entry points to memtable and cache) so that's not a problem. A unit test is added. Closes #10986	2022-07-12 00:23:29 +03:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Benny Halevy	68bd748af2	repair: row_level: clear_gently: clear_gently each repair_row Rows might be large so free them gently by: - add bytes_ostream.clear_gently that may yield in the chunk freeing loop. - use that in frozen_mutation_fragment, contained in repair_row. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-07-01 19:16:11 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Avi Kivity	74df67776b	bytes_ostream: convert write_placeholder from enable_if to concepts Concepts are easier to read and result in better error messages. This change also tightens the constraint from "std::is_fundamental" to "std::integral". The differences are floating point values, nullptr_t, and void. The latter two are illegal/useless to write, and nobody uses floating point values for list lengths, so everything still compiles. Closes #8326	2021-03-22 12:00:07 +01:00
Michał Chojnowski	5c3385730b	treewide: get rid of unaligned_cast unaligned_cast violates strict aliasing rules. Replace it with safe equivalents.	2021-03-17 17:00:41 +01:00
Benny Halevy	ff5b42a0fa	bytes_ostream: max_chunk_size: account for chunk header Currently, if the data_size is greater than max_chunk_size - sizeof(chunk), we end up allocating up to max_chunk_size + sizeof(chunk) bytes, exceeding buf.max_chunk_size(). This may lead to allocation failures, as seen in https://github.com/scylladb/scylla/issues/7950, where we couldn't allocate 131088 (= 128K + 16) bytes. This change adjusted the expose max_chunk_size() to be max_alloc_size (128KB) - sizeof(chunk) so that the allocated chunks would normally be allocated in 128KB chunks in the write() path. Added a unit test - test_large_placeholder that stresses the chunk allocation path from the write_place_holder(size) entry point to make sure it handles large chunk allocations correctly. Refs #7950 Refs #8081 Test: unit(release), bytes_ostream_test(debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210303143413.902968-1-bhalevy@scylladb.com>	2021-03-10 19:54:12 +02:00
Piotr Jastrzebski	0605d9e8ed	bytes_ostream: Remove std::iterator from fragment_iterator std::iterator is deprecated since C++17 so define all the required iterator_traits directly. Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-11-17 16:53:20 +01:00
Botond Dénes	875314fc4b	bytes_ostream: make it a FragmentRange The presence of `const_iterator` seems to be a requirement as well although it is not part of the concept. But perhaps it is just an assumption made by code using it.	2019-12-02 10:10:31 +02:00
Botond Dénes	07007edab9	bytes_ostream: add output_iterator To allow it being used for serialization code, which works in terms of output iterators.	2019-12-02 10:10:31 +02:00
Tomasz Grabiec	f206ef0038	bytes_ostream: Optimize writing of fixed-size types Inlining write() allows the writing code to be optimized for fixed-size types. In particular, memcpy() calls and loops will be eliminated. Saw 4% improvement in throughput in perf_fast_forward for tiny rows.	2018-12-10 20:08:16 +01:00
Tomasz Grabiec	a1fb441df8	bytes_ostream: Implement clear()	2018-12-10 20:07:43 +01:00
Tomasz Grabiec	7cf5de3d9c	bytes_ostream: Make initial chunk size configurable	2018-12-10 20:07:43 +01:00
Paweł Dziepak	91793c0a43	bytes_ostream: drop appending_hash specialisation appending_hash is used for computing hashes that become part of the binary interface. They cannot change between Scylla versions and the same data needs to always result in the same hash. At the moment, appending_hash<bytes_ostream> doesn't fulfil those requirements since it leaks information about how the underlying buffer is fragmented. Fortunately, it has no users so it doesn't cause any compatibility issues. Moreover, bytes_ostream is usually used as an output of some serialisation routine (e.g. frozen_mutation_fragment or CQL response). Those serialisation formats do not guarantee that there is a single representation of a given data and therefore are not fit to be hashed by appending_hash. Removing appending_hash<bytes_ostream> may help prevent such incorrect uses. Message-Id: <20181122163823.12759-1-pdziepak@scylladb.com>	2018-11-22 23:53:54 +00:00
Avi Kivity	775b7e41f4	Update seastar submodule * seastar d59fcef...b924495 (2): > build: Fix protobuf generation rules > Merge "Restructure files" from Jesse Includes fixup patch from Jesse: " Update Seastar `#include`s to reflect restructure All Seastar header files are now prefixed with "seastar" and the configure script reflects the new locations of files. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <5d22d964a7735696fb6bb7606ed88f35dde31413.1542731639.git.jhaberku@scylladb.com> "	2018-11-21 00:01:44 +02:00
Paweł Dziepak	00a63663d6	bytes_ostream: increase max chunk size to 128 kB 128 kB is the size of the LSA segment and therefore the default size of any kind of chunks, fragments and buffers. Message-Id: <20180709155615.22500-1-pdziepak@scylladb.com>	2018-07-09 19:59:51 +03:00
Paweł Dziepak	fe8dc1fa5c	bytes_ostream: add remove_suffix()	2018-06-25 09:21:47 +01:00
Paweł Dziepak	a85197a7b5	bytes_ostream: make fragment_iterator default constructible	2018-06-25 09:21:47 +01:00
Avi Kivity	65c27ccf21	bytes_ostream: make max_chunk_size() an inline function Fixes debug build looking for a variable definition and not finding it.	2016-10-17 11:49:33 +03:00
Avi Kivity	c0a1ad0b77	bytes_ostream: use larger allocations A 1MB response will require 2000 allocations with the current 512-byte chunk size. Increase it exponentially to reduce allocation count for larger responses (still respecting the upper limit). Message-Id: <1476369152-1245-1-git-send-email-avi@scylladb.com>	2016-10-16 10:05:48 +01:00
Gleb Natapov	32989d1e66	Merge seastar upstream * seastar 2b55789...5b7252d (3): > Merge "rpc: serialize large messages into fragmented memory" from Gleb > Merge "Print backtrace on SIGSEGV and SIGABRT" from Tomasz > test_runner: avoid nested optionals Includes patch from Gleb to adapt to seastar changes.	2016-09-28 17:34:16 +03:00
Paweł Dziepak	387434a76c	bytes_ostream: add reduce_chunk_count() Deserialization code has now two variants. The faster one can be used only when the source buffer is not fragmented. reduce_chunk_count() aims to increase number of cases when the fast path can be used. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	7d4b7fd5fc	bytes_ostream: add equality operator Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Paweł Dziepak	222bde7e6f	bytes_ostream: introduce upper bound on chunk size This patch makes append() and write() limit the maximum size of a single allocation to bytes_ostream::max_chunk_size. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-08-22 09:31:33 +01:00
Avi Kivity	1d1f61ac65	bytes_ostream: fix assignment operators Protect against self-assignment, and provide the strong exception guarantees for copying assignment. Fixes #1499. Message-Id: <1469466156-11114-1-git-send-email-avi@scylladb.com>	2016-07-25 19:06:42 +02:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Tomasz Grabiec	2abd62b5cb	bytes_ostream: Drop methods which serialize integers This will make bytes_ostream completely agnostic to serialization format, which should be determined by layer above it. Message-Id: <1457004221-8345-2-git-send-email-tgrabiec@scylladb.com>	2016-03-03 13:27:27 +02:00
Tomasz Grabiec	0c8db777b1	bytes_ostream: Avoid recursion when freeing chunks When there is a lot of chunks we may get stack overflow. This seems to fix issue #906, a memory corruption during schema merge. I suspect that what causes corruption there is overflowing of the stack allocated for the seastar thread. Those stacks don't have red zones which would catch overflow. Message-Id: <1456056288-3983-1-git-send-email-tgrabiec@scylladb.com>	2016-02-21 14:18:49 +02:00
Amnon Heiman	ca72d637f9	bytes_ostream: Allow place holder return a stream Reader and writer can use the bytes_ostream as a raw bytes stream, handling the bytes encoding and streaming on their own. To fully support this functionality, place holder should support it as well. This patch adds a get_stream method that return a simple_output_stream writer can use it using their own serialization function. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2016-02-17 18:42:04 +02:00
Tomasz Grabiec	c971544e83	bytes_ostream: Adapt to Output concept used in serializer.hh Message-Id: <1453888242-2086-1-git-send-email-tgrabiec@scylladb.com>	2016-01-27 12:13:34 +02:00
Tomasz Grabiec	eb1b21eb4b	Introduce hashing helpers	2016-01-08 21:10:25 +01:00
Avi Kivity	098136f4ab	Merge "Convert serialization of query::result to use db::serializer<>" from Tomasz Reviewed-by: Nadav Har'El <nyh@scylladb.com>	2015-12-07 16:53:34 +02:00
Tomasz Grabiec	657841922a	Mark move constructors noexcept when possible	2015-12-07 09:50:27 +01:00
Tomasz Grabiec	d4d3a5b620	bytes_ostream: Make size_type and value_type public	2015-12-03 09:19:11 +01:00
Avi Kivity	d5cf0fb2b1	Add license notices	2015-09-20 10:43:39 +03:00
Pekka Enberg	5164932f1e	bytes_ostream: Fix current_space_left() We also allocate chunks larger than "usable_chunk_size" in alloc(). Fix up the calculation in current_space_left(). Spotted by ASan. Signed-off-by: Pekka Enberg <penberg@cloudius-systems.com>	2015-08-07 13:00:31 +03:00
Tomasz Grabiec	c742529e52	bytes_ostream: Introduce retract() Useful for optimistic writers, where it's easier/cheaper to retract later than to calculate the decision up front.	2015-07-09 19:55:00 +02:00


55 Commits