scylladb

Author	SHA1	Message	Date
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes were applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Jan Ciolek	e5391f1eed	types: Add map_type_impl::serialize(range of <bytes, bytes>) Adds two functions that take a range over pairs of serialized values and return a serialized map value. There are 2 functions - one operating on bytes and one operating on managed_bytes. The version with managed_bytes is used in expression.cc, used to be a local static function. The bytes version will be used in type_json.cc in the next commit. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-10-28 15:14:52 +02:00
Jan Ciolek	e9f24edc9b	cql3: types: Optimize abstract_type::contains_collection contains_collection() and contains_set_or_map() used to be calculated on each call. Now the result is calculated only once during type creation. Signed-off-by: Jan Ciolek <jan.ciolek@scylladb.com>	2021-09-24 13:45:38 +02:00
Avi Kivity	e52ebe2da5	types: convert abstract_type::compare and related to std::strong_ordering Change comparators around types to std::strong_ordering. Ref #1449.	2021-07-28 13:19:24 +03:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Pavel Solodovnikov	e0749d6264	treewide: some random header cleanups Eliminate not used includes and replace some more includes with forward declarations where appropriate. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2021-06-06 19:18:49 +03:00
Michał Chojnowski	472f0eb932	types: collection: remove an unused version of pack_fragmented It was made unused by previous patches in this series.	2021-04-01 10:44:21 +02:00
Michał Chojnowski	0bb959e890	cql3: don't linearize elements of lists, tuples, and user types This patch switches the type used to store collection elements inside the intermediate form used in lists::value, tuples::value etc. from bytes to managed_bytes. After this patch, tuple and list elements are only linearized in from_serialized, which will be corrected soon. This commit introduces some additional copies in expression.cc, which will be dealt with in a future commit.	2021-04-01 10:44:21 +02:00
Michał Chojnowski	aab9509775	types: collection: add versions of pack for fragmented buffers We will need them to port the representation of collection types in cql3/ from bytes to managed_bytes. The version which takes an iterator of `bytes` as an argument will be removed after that transition is complete.	2021-04-01 10:44:21 +02:00
Botond Dénes	ba7a9d2ac3	imr: switch back to open-coded description of structures Commit `aab6b0ee27` introduced the controversial new IMR format, which relied on a very template-heavy infrastructure to generate serialization and deserialization code via template meta-programming. The promise was that this new format, beyond solving the problems the previous open-coded representation had (working on linearized buffers), will speed up migrating other components to this IMR format, as the IMR infrastructure reduces code bloat, makes the code more readable via declarative type descriptions as well as safer. However, the results were almost the opposite. The template meta-programming used by the IMR infrastructure proved very hard to understand. Developers don't want to read or modify it. Maintainers don't want to see it being used anywhere else. In short, nobody wants to touch it. This commit does a conceptual revert of `aab6b0ee27`. A verbatim revert is not possible because related code evolved a lot since the merge. Also, going back to the previous code would mean we regress as we'd revert the move to fragmented buffers. So this revert is only conceptual, it changes the underlying infrastructure back to the previous open-coded one, but keeps the fragmented buffers, as well as the interface of the related components (to the extent possible). Fixes: #5578	2021-02-16 23:43:07 +01:00
Michał Chojnowski	a1f7fabb3d	types: collection: add an optimization for single-fragment buffers in deserialize Helpers parametrized with single_fragmented_view should compile to better code, so let's use them when possible.	2020-12-04 09:21:05 +01:00
Michał Chojnowski	c08419e28d	types: switch collection_type_impl::deserialize from bytes_view to FragmentedView Devirtualizes collection_type_impl::deserialize (so it can be templated) and adds a FragmentedView overload. This will allow us to deserialize collections with explicit cql_serialization_format directly from fragmented buffers.	2020-12-04 09:19:37 +01:00
Pavel Solodovnikov	f6e765b70f	cql3: pass `column_specification` via lw_shared_ptr `column_specification` class is marked as "final": it's safe to use non-polymorphic pointer "lw_shared_ptr" instead of a more generic "shared_ptr". tests: unit(dev, debug) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200427084016.26068-1-pa.solodovnikov@scylladb.com>	2020-04-27 12:47:42 +03:00
Pavel Solodovnikov	abb3a7e218	cql3: minor sweeps through the cql layer code to reduce shared_ptrs count Convert some more helper functions to accept const reference to column_specification and column_identifier instead of shared_ptr. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-02-16 17:24:26 +03:00
Rafael Ávila de Espíndola	4b4efcf302	types: Remove collection_type_impl::serialize The rest of the serialize api has been devirtualized some time ago, but this auxiliary function stayed virtual. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200129203916.20460-1-espindola@scylladb.com>	2020-01-30 14:10:18 +02:00
Kamil Braun	4374982de0	types: collection_type_impl::to_value becomes serialize_for_cql. The purpose of collection_type_impl::to_value was to serialize a collection for sending over CQL. The corresponding function in origin is called serializeForNativeProtocol, but the name is a bit lengthy, so I settled for serialize_for_cql. The method is now a free-standing function, using the visit function to perform a dispatch on the collection type instead of a virtual call. This also makes it easier to generalize it to UDTs in future commits. Remove the old serialize_for_native_protocol with a FIXME: implement inside. It was already implemented (to_value), just called differently. Remove dead methods: enforce_limit and serialized_values. The corresponding methods in C* are auxiliary methods used inside serializeForNativeProtocol. In our case, the entire algorithm is wholly written in serialize_for_cql.	2019-10-25 10:49:19 +02:00
Kamil Braun	d83ebe1092	collection_mutation: move collection_type_impl::difference to collection_mutation.hh.	2019-10-25 10:42:58 +02:00
Kamil Braun	7e3bbe548c	collection_mutation: move collection_type_impl::merge to collection_mutation.hh.	2019-10-25 10:42:58 +02:00
Kamil Braun	a41277a7cd	collection_mutation: move collection_type_impl::last_update to collection_mutation_view	2019-10-25 10:42:58 +02:00
Kamil Braun	30802f5814	collection_mutation: move collection_type_impl::is_any_live to collection_mutation_view	2019-10-25 10:42:58 +02:00
Kamil Braun	e16ba76c2e	collection_mutation: move collection_type_impl::is_empty to collection_mutation_view.	2019-10-25 10:42:58 +02:00
Kamil Braun	bbdb438d89	collection_mutation: easier (de)serialization of collection_mutation(s). `collection_type_impl::serialize_mutation_form` became `collection_mutation(_view)_description::serialize`. Previously callers had to cast their data_type down to collection_type to use serialize_mutation_form. Now it's done inside `serialize`. In the future `serialize` will be generalized to handle UDTs. `collection_type_impl::deserialize_mutation_form` became a free-standing function `deserialize_collection_mutation` with similar benefits. Actually, no one needs to call this function manually because of the next paragraph. A common pattern consisting of linearizing data inside a `collection_mutation_view` followed by calling `deserialize_mutation_form` has been abstracted out as a `with_deserialized` method inside collection_mutation_view. serialize_mutation_form_only_live was removed, because it hadn't been used anywhere.	2019-10-25 10:42:58 +02:00
Kamil Braun	b1d16c1601	types: move collection_type_impl::mutation(_view) out of collection_type_impl. collection_type_impl::mutation became collection_mutation_description. collection_type_impl::mutation_view became collection_mutation_view_description. These classes now reside inside collection_mutation.hh. Additional documentation has been written for these classes. Related function implementations were moved to collection_mutation.cc. This makes it easier to generalize these classes to non-frozen UDTs in future commits. The new names (together with documentation) better describe their purpose.	2019-10-25 10:19:45 +02:00
Kamil Braun	c0d3e6c773	atomic_cell: move collection_mutation(_view) to a new file. The classes 'collection_mutation' and 'collection_mutation_view' were moved to a separate header, collection_mutation.hh. Implementations of functions that operate on these classes, including some methods of collection_type_impl, were moved to a separate compilation unit, collection_mutation.cc. This makes it easier to modify these structures in future commits in order to generalize them for non-frozen User Defined Types. Some additional documentation has been written for collection_mutation.	2019-10-25 10:19:45 +02:00
Konstantin Osipov	a30c08e04e	lwt: support for multi-cell set & list value serialization	2019-10-22 17:40:42 +03:00
Konstantin Osipov	605755e3f6	lwt: support for multi-cell map & list comparison with literal values Multi-cell lists and maps may be stored in different formats: as sorted vectors of pairs of values, when retrieved from storage, or as sorted vectors of values, when created from parser literals or supplied as parameter values. Implement a specialized compare for use when receiver and parameter representation don't match. Add helpers.	2019-10-22 17:07:33 +03:00
Rafael Ávila de Espíndola	e0065b414e	types: Avoid shared_ptr copies They are somewhat expensive (in code size at least) and not needed everywhere. Inside the getter the variables are 'const data_type&', so we can return that. Everything still works when a copy is needed, but in code that just wants to check a property we avoid the copy. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-09-03 07:43:35 -07:00
Rafael Ávila de Espíndola	f633f70616	types: Devirtualize abstract_type::is_value_compatible_with_internal It now is a static helper. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	19c9a033d9	types: Devirtualize abstract_type::is_compatible_with Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	a6b48bda03	types: Devirtualize abstract_type::is_native Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	ec09fb94cb	types: Devirtualize abstract_type::is_multi_cell Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	1581805a8d	types: Devirtualize abstract_type::is_collection Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	69d6fd21d2	types: Add a listlike_collection_type_impl class With this we can share code that wants to access the element type of set and list. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	a4837301a6	types: Move _is_multi_cell to collection_type_impl It was duplicated in each concrete collection type. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	de6d6c46a1	types: Remove collection_type_impl::kind All uses have been switched to abstract_type::kind. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Rafael Ávila de Espíndola	e5c7deaeb5	types: Add a kind to abstract_type The type hierarchy is closed, so we can give each leaf an enum value. This will be used to implement a visitor pattern and reduce code duplication. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-08-14 10:02:00 -07:00
Botond Dénes	307b48794d	collection_type_impl::mutation: compact_and_expire() add collector parameter The new collector parameter is a pointer to a `compaction_garbage_collector` implementation. This collector is passed all atoms that are expired and would be discarded. The body of `compact_and_expire()` was changed so that it checks cells' tombstone coverage before it checks their expiry, so that cells that are both covered by a tombstone and also expired are not passed to the collector. The collector param is optional and defaults to nullptr. To accommodate the collector, which needs to know the column id, a new `column_id` parameter was added as well.	2019-07-15 17:37:55 +03:00
Botond Dénes	572a738777	collection: use chunked_vector to store cells This is a quick fix to the immediate problem of large collections causing large allocations, triggering stalls or OOM. The proper fix is to use IMR for storing the cells, but that is a complex change that will require time, so let's not stall/OOM in the meantime.	2019-06-26 11:40:44 +03:00
Rafael Ávila de Espíndola	53ab298957	Turn cql3_type into a trivial wrapper over data_type Both cql3_type and abstract_type are normally used inside shared_ptr. This creates a problem when an abstract_type needs to refer to a cql3_type as that creates a cycle. To avoid warnings from asan, we were using a std::unordered_map to store one of the edges of the cycle. This avoids the warning, but wastes even more memory. Even before this patch cql3_type was a fairly light weight structure. This patch pushes in that direction and now cql3_type is a struct with a single member variable, a data_type. This avoids the reference cycle and is easier to understand IMHO. Tests: unit (dev) Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-03-20 14:10:28 -07:00
Piotr Jastrzebski	5a5201a50b	Move collection_type_impl out of types.hh to types/collection.hh Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2019-01-24 09:56:38 +01:00

40 Commits