scylladb

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes were applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Asias He	4c1f8c2f83	compaction: Move compaction_garbage_collector.hh to compaction dir The top dir is a mess. Move compaction_garbage_collector.hh to the new home.	2021-08-07 08:07:09 +08:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Michał Chojnowski	03faf139c8	collection_mutation: don't linearize collection values Yet another patch preventing potentially large allocations. Currently, collection_mutation{_view,}_description linearize each collection value during deserialization. It's not unthinkable that a user adds a large element to a list or a map, so let's avoid that. This patch removes the dependency on linearizing_input_stream, which does not provide a way to read fragmented subbuffers, and replaces it with a new helper, which does. (Extending linearizing_input_stream is not viable without rewriting it completely). Only linearization of collection values is corrected in this patch. Collection keys are still linearized. Storing them in managed_bytes is likely to be more harmful than helpful, because large map keys are extremely unlikely, and UUIDs, which are used as keys in lists, do not fit into managed_bytes's small value optimization, so this would incur an extra allocation for every list element. Note: this patch leaves utils/linearizing_input_stream.hh unused. Refs: #8120 Closes #8690	2021-05-23 12:16:56 +03:00
Botond Dénes	ba7a9d2ac3	imr: switch back to open-coded description of structures Commit `aab6b0ee27` introduced the controversial new IMR format, which relied on a very template-heavy infrastructure to generate serialization and deserialization code via template meta-programming. The promise was that this new format, beyond solving the problems the previous open-coded representation had (working on linearized buffers), will speed up migrating other components to this IMR format, as the IMR infrastructure reduces code bloat, makes the code more readable via declarative type descriptions as well as safer. However, the results were almost the opposite. The template meta-programming used by the IMR infrastructure proved very hard to understand. Developers don't want to read or modify it. Maintainers don't want to see it being used anywhere else. In short, nobody wants to touch it. This commit does a conceptual revert of `aab6b0ee27`. A verbatim revert is not possible because related code evolved a lot since the merge. Also, going back to the previous code would mean we regress as we'd revert the move to fragmented buffers. So this revert is only conceptual, it changes the underlying infrastructure back to the previous open-coded one, but keeps the fragmented buffers, as well as the interface of the related components (to the extent possible). Fixes: #5578	2021-02-16 23:43:07 +01:00
Avi Kivity	a4c44cab88	treewide: update concepts language from the Concepts TS to C++20 Seastar recently lost support for the experimental Concepts Technical Specification (TS) and gained support for C++20 concepts. Re-enable concepts in Scylla by updating our use of concepts to the C++20 standard. This change: - peels off uses of the GCC6_CONCEPT macro - removes inclusions of <seastar/gcc6-concepts.hh> - replaces function-style concepts (no longer supported) with equation-style concepts - semicolons added and removed as needed - deprecated std::is_pod replaced by recommended replacement - updates return type constraints to use concepts instead of type names (either std::same_as or std::convertible_to, with std::same_as chosen when possible) No attempt is made to improve the concepts; this is a specification update only. Message-Id: <20200531110254.2555854-1-avi@scylladb.com>	2020-06-02 09:12:21 +03:00
Botond Dénes	0c52c2ba50	data: make cell::make_collection(): more consistent and safer `3ec889816` changed cell::make_collection() to take different code paths depending on whether its `data` argument is nothrow copyable/movable or not. In case it is not, it is wrapped in a view to make it so (see the above mentioned commit for a full explanation), relying on the method's pre-existing requirement for callers to keep `data` alive while the created writer is in use. On closer look however it turns out that this requirement is neither respected, nor enforced, at least not on the code level. The real requirement is that the underlying data represented by `data` is kept alive. If `data` is a view, it is not expected to be kept alive and callers don't, it is instead copied into `make_collection()`. Non-views however are expected to be kept alive. This makes the API error prone. To avoid any future errors due to this ambiguity, require all `data` arguments to be nothrow copyable and movable. Callers are now required to pass views of nonconforming objects. This patch is a usability improvement and is not fixing a bug. The current code works as-is because it happens to conform to the underlying requirements. Refs: #5575 Refs: #5341 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200115084520.206947-1-bdenes@scylladb.com>	2020-01-16 12:05:50 +02:00
Avi Kivity	75d9909b27	collection_mutation_view: add type-aware pretty printer Add a way for the user to associate a type with a collection_mutation_view and get a nice printout.	2020-01-07 12:06:29 +02:00
Botond Dénes	4c59487502	collection_mutation: don't linearize the buffer on deserialization Use `utils::linearizing_input_stream` for the deserialization of the collection. Allows for avoiding the linearization of the entire cell value, instead only linearizing individual values as they are deserialized from the buffer.	2019-12-02 10:10:31 +02:00
Botond Dénes	2f9307c973	collection_mutation: use a fragmented buffer for serialization For the serialization `bytes_ostream` is used.	2019-12-02 10:10:31 +02:00
Kamil Braun	d9baff0e4b	collection_mutation: generalize collection_mutation.cc:difference to UDTs.	2019-10-25 10:49:19 +02:00
Kamil Braun	a344019b25	collection_mutation: generalize collection_mutation_view::last_update to UDTs.	2019-10-25 10:49:19 +02:00
Kamil Braun	691f00408d	collection_mutation: generalize merge to UDTs.	2019-10-25 10:49:19 +02:00
Kamil Braun	7f5cd8e8ce	collection_mutation: generalize collection_mutation_view_description::materialize to UDTs.	2019-10-25 10:49:19 +02:00
Kamil Braun	20b42b1155	collection_mutation: generalize collection_mutation_view::is_any_live to UDTs.	2019-10-25 10:49:19 +02:00
Kamil Braun	323370e4ba	collection_mutation: generalize deserialize_collection_mutation to UDTs.	2019-10-25 10:49:19 +02:00
Kamil Braun	d83ebe1092	collection_mutation: move collection_type_impl::difference to collection_mutation.hh.	2019-10-25 10:42:58 +02:00
Kamil Braun	7e3bbe548c	collection_mutation: move collection_type_impl::merge to collection_mutation.hh.	2019-10-25 10:42:58 +02:00
Kamil Braun	a41277a7cd	collection_mutation: move collection_type_impl::last_update to collection_mutation_view	2019-10-25 10:42:58 +02:00
Kamil Braun	30802f5814	collection_mutation: move collection_type_impl::is_any_live to collection_mutation_view	2019-10-25 10:42:58 +02:00
Kamil Braun	e16ba76c2e	collection_mutation: move collection_type_impl::is_empty to collection_mutation_view.	2019-10-25 10:42:58 +02:00
Kamil Braun	bbdb438d89	collection_mutation: easier (de)serialization of collection_mutation(s). `collection_type_impl::serialize_mutation_form` became `collection_mutation(_view)_description::serialize`. Previously callers had to cast their data_type down to collection_type to use serialize_mutation_form. Now it's done inside `serialize`. In the future `serialize` will be generalized to handle UDTs. `collection_type_impl::deserialize_mutation_form` became a free-standing function `deserialize_collection_mutation` with similar benefits. Actually, no one needs to call this function manually because of the next paragraph. A common pattern consisting of linearizing data inside a `collection_mutation_view` followed by calling `deserialize_mutation_form` has been abstracted out as a `with_deserialized` method inside collection_mutation_view. serialize_mutation_form_only_live was removed, because it hadn't been used anywhere.	2019-10-25 10:42:58 +02:00
Kamil Braun	e4101679e4	collection_mutation: generalize constructor of collection_mutation to abstract_type. The constructor doesn't use anything specific to collection_type_impl. In the future it will also handle non-frozen user types.	2019-10-25 10:42:58 +02:00
Kamil Braun	b1d16c1601	types: move collection_type_impl::mutation(_view) out of collection_type_impl. collection_type_impl::mutation became collection_mutation_description. collection_type_impl::mutation_view became collection_mutation_view_description. These classes now reside inside collection_mutation.hh. Additional documentation has been written for these classes. Related function implementations were moved to collection_mutation.cc. This makes it easier to generalize these classes to non-frozen UDTs in future commits. The new names (together with documentation) better describe their purpose.	2019-10-25 10:19:45 +02:00
Kamil Braun	c0d3e6c773	atomic_cell: move collection_mutation(_view) to a new file. The classes 'collection_mutation' and 'collection_mutation_view' were moved to a separate header, collection_mutation.hh. Implementations of functions that operate on these classes, including some methods of collection_type_impl, were moved to a separate compilation unit, collection_mutation.cc. This makes it easier to modify these structures in future commits in order to generalize them for non-frozen User Defined Types. Some additional documentation has been written for collection_mutation.	2019-10-25 10:19:45 +02:00

25 Commits