scylladb

Author	SHA1	Message	Date
Kefu Chai	168ade72f8	treewide: replace formatter<std::string_view> with formatter<string_view> in in {fmt} before v10, it provides the specialization of `fmt::formatter<..>` for `std::string_view` as well as the specialization of `fmt::formatter<..>` for `fmt::string_view` which is an implementation builtin in {fmt} for compatibility of pre-C++17. and this type is used even if the code is compiled with C++ stadandard greater or equal to C++17. also, before v10, the `fmt::formatter<std::string_view>::format()` is defined so it accepts `std::string_view`. after v10, `fmt::formatter<std::string_view>` still exists, but it is now defined using `format_as()` machinery, so it's `format()` method does not actually accept `std::string_view`, it accepts `fmt::string_view`, as the former can be converted to `fmt::string_view`. this is why we can inherit from `fmt::formatter<std::string_view>` and use `formatter<std::string_view>::format(foo, ctx);` to implement the `format()` method with {fmt} v9, but we cannot do this with {fmt} v10, and we would have following compilation failure: ``` FAILED: service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o /home/kefu/.local/bin/clang++ -DFMT_DEPRECATED_OSTREAM -DFMT_SHARED -DSCYLLA_BUILD_MODE=release -DSEASTAR_API_LEVEL=7 -DSEASTAR_LOGGER_COMPILE_TIME_FMT -DSEASTAR_LOGGER_TYPE_STDOUT -DSEASTAR_SCHEDULING_GROUPS_COUNT=16 -DSEASTAR_SSTRING -DXXH_PRIVATE_API -DCMAKE_INTDIR=\"RelWithDebInfo\" -I/home/kefu/dev/scylladb -I/home/kefu/dev/scylladb/build/gen -I/home/kefu/dev/scylladb/seastar/include -I/home/kefu/dev/scylladb/build/seastar/gen/include -I/home/kefu/dev/scylladb/build/seastar/gen/src -ffunction-sections -fdata-sections -O3 -g -gz -std=gnu++20 -fvisibility=hidden -Wall -Werror -Wextra -Wno-error=deprecated-declarations -Wimplicit-fallthrough -Wno-c++11-narrowing -Wno-deprecated-copy -Wno-mismatched-tags -Wno-missing-field-initializers -Wno-overloaded-virtual -Wno-unsupported-friend -Wno-enum-constexpr-conversion -Wno-unused-parameter -ffile-prefix-map=/home/kefu/dev/scylladb=. -march=westmere -mllvm -inline-threshold=2500 -fno-slp-vectorize -U_FORTIFY_SOURCE -Werror=unused-result -MD -MT service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o -MF service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o.d -o service/CMakeFiles/service.dir/RelWithDebInfo/topology_state_machine.cc.o -c /home/kefu/dev/scylladb/service/topology_state_machine.cc /home/kefu/dev/scylladb/service/topology_state_machine.cc:254:41: error: no matching member function for call to 'format' 254 \| return formatter<std::string_view>::format(it->second, ctx); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~ /usr/include/fmt/core.h:2759:22: note: candidate function template not viable: no known conversion from 'seastar::basic_sstring<char, unsigned int, 15>' to 'const fmt::basic_string_view<char>' for 1st argument 2759 \| FMT_CONSTEXPR auto format(const T& val, FormatContext& ctx) const \| ^ ~~~~~~~~~~~~ ``` because the inherited `format()` method actually comes from `fmt::formatter<fmt::string_view>`. to reduce the confusion, in this change, we just inherit from `fmt::format<string_view>`, where `string_view` is actually `fmt::string_view`. this follows the document at https://fmt.dev/latest/api.html#formatting-user-defined-types, and since there is less indirection under the hood -- we do not use the specialization created by `FMT_FORMAT_AS` which inherit from `formatter<fmt::string_view>`, hopefully this can improve the compilation speed a little bit. also, this change addresses the build failure with {fmt} v10. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18299	2024-04-19 07:44:07 +03:00
Kefu Chai	1ab2bb69b8	keys: do not use zip_iterator for printing key components boost's the operator==() implementation of boost's zip_iterator returns true only if all elements in enclosed tuple of zip_iterator are equal. and the zip_iterator always advances all the iterators in the enclosed tuple. but in our case, some components might be missing. in other words, the size of the `components` might be smaller than that of the `types` range. so, when the zip_iterator advances past the end of the components, scylla starts reading out of bounds. because zip_iterator does not allow us to customize how it implements the equal operator. and we cannot deduce the size of components without reading all of them. so in this change, we partially revert `3738fcbe05`, instead of using fmt::join(), just iterate through the components manually. this should avoid the out-of-bound reading, and also preserve the original behavior. Branches: 5.3 Fixes #14435 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14457	2023-07-01 23:49:02 +03:00
Kefu Chai	a573a89128	keys: print "non-utf8-key" when clustering_key is not UTF-8 before this change we do not check if the clustering_key to be formatted is UTF-8 encoded before printing it. but we do perform the validation when printing paritition_keys. since the clustering_key is not different from partition_key when it comes to encoding, actually they are different parts of a parimary key. so let's validate the encoding of clustering_key as well, when formatting it. this change is a follow-up of `85b21ba049`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13641	2023-04-24 10:40:23 +03:00
Avi Kivity	9072763a52	keys: change from_optional_exploded to accept a span instead of a vector A span is more generic than a vector, and can be constructed from any contiguous container (like small_vector), or a subset of a container. To support this, helpers in compound.hh need to use make_iterator_range, since a span doesn't fit the container concept (since spans don't own their contents). This is needed to make a similar change to function evaluation, as the token function passes its parameters to from_optional_exploded().	2023-04-19 20:18:50 +03:00
Kefu Chai	85b21ba049	keys: consolidate the formatter for partition_keys since there are two places formatting `with_schema_wrapper`, it'd be desirable if we can consolidate them. so, in this change, the formatting code is extracted into a helper, so we only have a single place for formatting the `with_schema_wrapper`s. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-14 13:21:30 +08:00
Kefu Chai	3738fcbe05	keys: specialize fmt::formatter<partition_key> and friends this is a part of a series to migrating from `operator<<(ostream&, ..)` based formatting to fmtlib based formatting. the goal here is to enable fmtlib to print following classes without the help of `operator<<`. - partition_key_view - partition_key - partition_key::with_schema_wrapper - key_with_schema - clustering_key_prefix - clustering_key_prefix::with_schema_wrapper the corresponding `operator<<()` are dropped dropped in this change, as all its callers are now using fmtlib for formatting now. the helper of `print_key()` is removed, as its only caller is `operator<<(std::ostream&, const clustering_key_prefix::with_schema_wrapper&)`. the reason why all these operators are replaced in one go is that we have a template function of `key_to_str()` in `db/large_data_handler.cc`. this template function is actually the caller of operator<< of `partition_key::with_schema_wrapper` and `clustering_key_prefix::with_schema_wrapper`. so, in order to drop either of these two operator<<, we need to remove both of them, so that we can switch over to `fmt::to_string()` in this template function. Refs scylladb#13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-14 13:21:30 +08:00
Avi Kivity	9ced89a41c	keys: disambiguate construction from initializer_list<bytes> Some tests initialize via an initializer_list, but gcc finds other valid constructors via vector<managed_bytes>. Disambiguate by adding a constructor that accepts the initializer_list, and forward to the wanted constructor.	2023-03-21 13:42:49 +02:00
Kefu Chai	df63e2ba27	types: move types.{cc,hh} into types they are part of the CQL type system, and are "closer" to types. let's move them into "types" directory. the building systems are updated accordingly. the source files referencing `types.hh` were updated using following command: ``` find . -name "*.{cc,hh}" -exec sed -i 's/\"types.hh\"/\"types\/types.hh\"/' {} + ``` the source files under sstables include "types.hh", which is indeed the one located under "sstables", so include "sstables/types.hh" instea, so it's more explicit. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #12926	2023-02-19 21:05:45 +02:00
Avi Kivity	e2f6e0b848	utils: move hashing related files to utils/ module Closes #12884	2023-02-17 07:19:52 +02:00
Avi Kivity	69a385fd9d	Introduce schema/ module Schema related files are moved there. This excludes schema files that also interact with mutations, because the mutation module depends on the schema. Those files will have to go into a separate module. Closes #12858	2023-02-15 11:01:50 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Avi Kivity	ae3a360725	database: Move database, keyspace, table classes to replica/ directory The database, keyspace, and table classes represent the replica-only part of the objects after which they are named. Reading from a table doesn't give you the full data, just the replica's view, and it is not consistent since reconciliation is applied on the coordinator. As a first step in acknowledging this, move the related files to a replica/ subdirectory.	2022-01-06 17:07:30 +02:00
Avi Kivity	0909e3c17d	treewide: remove redundant "x <=> 0" compares If x is of type std::strong_ordering, then "x <=> 0" is equivalent to x. These no-ops were inserted during #1449 fixes, but are now unnecessary. They have potential for harm, since they can hide an accidental of the type of x to an arithmetic type, so remove them. Ref #1449.	2021-07-28 13:30:32 +03:00
Pavel Emelyanov	0f53e83a8e	range_tombstone_list, code: Mark external_memory_usage noexcept The range_tombstone_list's method is at the top of the stack of calls each not throwing anything, so do the deep-dive noexcept marking. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-07-27 20:06:53 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Michał Chojnowski	4b60e69e7c	keys, compound: take the argument to from_single_value() by reference Since serialize_value needs to copy the values to a bigger buffer anyway, there is no point in copying the argument higher in the call chain. This patch eliminates some pointless copies, for example in alternator/executor.cc Closes #8688	2021-05-24 11:20:24 +03:00
Michał Chojnowski	ffdb706984	keys, compound: eliminate some careless copies of shared pointers Using `auto` copies the shared pointers. We don't want that, so let's use `const auto&`. Closes #8686	2021-05-23 12:11:46 +03:00
Michał Chojnowski	23909e91a4	alternator: executor: eliminate some pointless reserializations There are places where abstract_type::deserialize is called just to pass the result to compound_wrapper::from_singular, which immediately serializes it again. Get rid of this ritual by adding a version of from_singular which takes a serialized argument. As a bonus, along the way we eliminate some pointless copies of lw_shared_ptr and std::shared_ptr caused by two careless uses of `auto`. Closes #8687	2021-05-23 09:42:09 +03:00
Michał Chojnowski	5a2b492f09	compound: add explode_fragmented We will use it in the next patches in this series.	2021-04-08 10:02:54 +02:00
Michał Chojnowski	979666075f	cql3: expression: use managed_bytes instead of bytes where possible	2021-04-01 10:44:21 +02:00
Michał Chojnowski	0bb959e890	cql3: don't linearize elements of lists, tuples, and user types This patch switches the type used to store collection elements inside the intermediate form used in lists::value, tuples::value etc. from bytes to managed_bytes. After this patch, tuple and list elements are only linearized in from_serialized, which will be corrected soon. This commit introduces some additional copies in expression.cc, which will be dealt with in a future commit.	2021-04-01 10:44:21 +02:00
Avi Kivity	58b7f225ab	keys: convert trichotomic comparators to return std::strong_ordering A trichotomic comparator returning an int an easily be mistaken for a less comparator as the return types are convertible. Use the new std::strong_ordering instead. A caller in cql3's update_parameters.hh is also converted, following the path of least resistance. Ref #1449. Test: unit (dev) Closes #8323	2021-03-21 09:30:43 +02:00
Michał Chojnowski	85048b349b	memtable: fix accounting of managed_bytes in partition_snapshot_accounter managed_bytes has a small overhead per each fragment. Due to that, managed_bytes containing the same data can have different total memory usage in different allocators. The smaller the preferred max allocation size setting is, the more fragments are needed and the greater total per-fragment overhead is. In particular, managed_bytes allocated in the LSA could grow in memory usage when copied to the standard allocator, if the standard allocator had a preferred max allocation setting smaller than the LSA. partition_snapshot_accounter calculates the amount of memory used by mutation fragments in the memtable (where they are allocated with LSA) based on the memory usage after they are copied to the standard allocator. This could result in an overestimation, as explained above. But partition_snapshot_accounter must not overestimate the amount of freed memory, as doing otherwise might result in OOM situations. This patch prevents the overaccounting by adding minimal_external_memory_usage(): a new version of external_memory_usage(), which ignores allocator-dependent overhead. In particular, it includes the per-fragment overhead in managed_bytes only once, no matter how many fragments there are.	2021-01-15 18:21:13 +01:00
Michał Chojnowski	2e38647a95	keys: update comments after changes and remove an unused method The comments were outdated after the latest changes (bytes_view vs managed_bytes_view). compound_view_wrapper::get_component() is unused, so we remove it.	2021-01-15 14:05:44 +01:00
Michał Chojnowski	dbcf987231	keys, compound: switch from bytes_view to managed_bytes_view The keys classes (partition_key et al) already use managed_bytes, but they assume the data is not fragmented and make liberal use of that by casting to bytes_view. The view classes use bytes_view. Change that to managed_bytes_view, and adjust return values to managed_bytes/managed_bytes_view. The callers are adjusted. In some places linearization (to_bytes()) is needed, but this isn't too bad as keys are always <= 64k and thus will not be fragmented when out of LSA. We can remove this linearization later. The serialize_value() template is called from a long chain, and can be reached with either bytes_view or managed_bytes_view. Rather than trace and adjust all the callers, we patch it now with constexpr if. operator bytes_view (in keys) is converted to operator managed_bytes_view, allowing callers to defer or avoid linearization.	2021-01-08 14:16:08 +01:00
Michał Chojnowski	2d28471a59	utils: managed_bytes: make the constructors from bytes and bytes_view explicit Conversions from views to owners have no business being implicit. Besides, they would also cause various ambiguity problems when adding managed_bytes_view.	2021-01-04 22:22:12 +01:00
Botond Dénes	84c47c4228	partition_key_view: add validate method We want to be able to pass `partition_key_view` to `validation::validate_cql_key()`. As the latter wants to call `validate()` on the key, replicate `partition_key::validate()` in `partition_key_view`.	2020-05-12 12:07:00 +03:00
Piotr Jastrzebski	9279a679da	keys.hh: make it independent from schema.hh This cuts build dependency keys.hh -> schema.hh Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com>	2020-01-20 14:25:17 +02:00
Tomasz Grabiec	c6274fdef3	keys: Avoid implicit conversion to partition_key in the hasher of partition_key_view Message-Id: <1556230107-13557-1-git-send-email-tgrabiec@scylladb.com>	2019-04-26 20:02:35 +03:00
Rafael Ávila de Espíndola	561285488b	keys: add schema-aware printing for clustering_key_prefix For reporting large rows we have to be able to print clustering keys in addition to partition keys. Refs #3988. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2019-01-28 13:01:54 -08:00
Tomasz Grabiec	75cde85349	Merge "Support reading range tombstones" from Piotr and Vladimir Implement and test support for reading range tombstones in SSTables 3. Does not yet support reads which are using slicing or fast forwarding. From github.com/scylladb/seastar-dev.git haaawk/sstables3/tombstones_v11: Piotr Jastrzebski (5): sstables: Add consumer_m::consume_range_tombstone sstables: Support null columns in ck sstables: Support reading range_tombstones sstables: Test reading range_tombstones sstables: Add test for RT with non-full key Vladimir Krivopalov (2): sstables: Add operator<< overload for bound_kind_m. keys: Add clustering_key_prefix::make_full helper.	2018-08-27 20:43:38 +02:00
Vladimir Krivopalov	8acf4ddb8e	keys: Add clustering_key_prefix::make_full helper. This method fills non-full clustering key with trailing empty values to make it full. This can be used for clustering keys of rows in a compact table as, unlike in regular tables, they can be non-full. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-08-22 12:13:23 +02:00
Duarte Nunes	ce461b06d7	keys: Add factory for an empty clustering_key_prefix_view Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-08-20 21:39:37 +01:00
Avi Kivity	bfd14b4123	keys: schema-aware printing of a partition_key Add a with_schema() helper to decorate a partition key with its schema for pretty-printing purposes, and matching operator<<. This is useful to print partition keys where the operator, who may not be familiar with the encoding, may see them.	2018-07-17 14:43:12 +03:00
Avi Kivity	2582f53b44	Merge "database and API: Add column_family::get_sstables_by_key" from Amnon " This is series is for nodetool getsstables. This patch is based on: `8daaf9833a` With some minor adjustments because of the code change in sstables. The idea is to allow searching for all the sstables that contains a given key. After this patch if there is a table t1 in keyspace k1 and it has a key called aa. curl -X GET "http://localhost:10000/column_family/sstables/by_key/k1%3At1?key=aa" Will return the list of sstables file names that contains that key. " * 'amnon/sstable_for_key_v4' of github.com:scylladb/seastar-dev: Add the API implementation to get_sstables_by_key api: column_family.json make the get_sstables_for_key doc clearer column_family: Add the get_sstables_by_partition_key method sstable test: add has_partition_key test sstable: Add has_partition_key method keys_test: add a test for nodetool_style string keys: Add from_nodetool_style_string factory method	2018-06-10 16:53:56 +03:00
Amnon Heiman	c517ee8353	keys: Add from_nodetool_style_string factory method Based on: `8daaf9833a` This patch adds a from_nodetool_style_string factory method to partition_key. The string format is follows the nodetool format, that column in the partition keys are split by ':'. For example, if a partition key has two column col1 and col2, to get the partition key that has col1 = val1 and col2 = val2: val1:val2	2018-05-28 18:09:51 +03:00
Nadav Har'El	433fc6c36e	keys.hh: simplify empty clustering-key check The exploded_clustering_prefix type has a convenient is_empty() method and an even more convenient "operator bool" shortcut. Unfortunately, the other clustering prefix types (clustering_key_prefix, clustering_key_prefix_view) have, for historic reasons, an is_empty method which takes a schema parameter. That also means they can't have an "operator bool" shortcut. But checking if a prefix doesn't really need the schema - all we need to check is whether the byte representation is empty. The result is simpler and more efficient code, and easier to use. It is also more consistent - all clustering-key-related types will have an "operator bool" instead of just some of them. To avoid massive code changes, we leave a is_empty(schema) variant, which simply calls is_empty(). There's already precedent for that - various methods which have a variant taking schema (and ignoring it) and one taking nothing. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180521174220.13262-1-nyh@scylladb.com>	2018-05-23 11:46:23 +02:00
Duarte Nunes	5f822e3928	db/view/view_builder: Actually build views This patch adds the missing view building code to the eponymous class. We consume from the reader associated with each base table until all its views are built. If the reader reaches the end and there are incomplete views, then a view was added while others were being built. In such cases, we restart the reader to the beginning of the current token, but not to the beginning of the token range, when the view is added. Then, when we exhaust the reader, we simply create a new one for the whole token range, and resume building the pending views. We aim to be resource-conscious. On a given shard, at any given moment, we consume at most from one reader. We also strive for fairness, in that each build step inserts entries for the views of a different base. Each build step reads and generates updates for batch_size rows. We lack a controller, which could potentially allow us to go faster (to execute multiple steps at the same time, or consume more rows per batch), and also which would apply backpressure, so we could, for example, delay executing a build step. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-03-27 01:20:11 +01:00
Duarte Nunes	12507fb9ce	keys: Replace feed_hash() member function with appending_hash Replace the feed_hash() member function of partition_key and clustering_key_prefix with the specialization of appending_hash, so that we can use the general feed_hash() function. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2018-02-01 00:22:50 +00:00
Paweł Dziepak	6031b7e587	keys: introduce compound_wrapper::from_exploded_view()	2017-07-26 14:38:27 +01:00
Duarte Nunes	257eaa0d05	compound_view_wrapper: Add tri_compare Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-17 10:33:18 +02:00
Duarte Nunes	9e88b60ef5	mutation: Set cell using clustering_key_prefix Change the clustering key argument in mutation::set_cell from exploded_clustering_prefix to clustering_key_prefix, which allows for some overall code simplification and fewer copies. This mostly affects the cql3 layer. Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-04 15:59:50 +02:00
Duarte Nunes	07e648251b	prefix_compound_view_wrapper: Add is_full and is_empty functions Signed-off-by: Duarte Nunes <duarte@scylladb.com>	2017-05-04 15:59:50 +02:00
Tomasz Grabiec	212a021fc6	keys: Introduce is_empty() for prefixes	2017-03-28 18:10:39 +02:00
Paweł Dziepak	711bd19f16	keys: add memory_usage() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-11-18 11:25:36 +00:00
Paweł Dziepak	ef57b9a26f	rename memory_usage() to external_memory_usage() where applicable Renaming the function to external_memory_usage() makes it clear that sizeof(T) is not included, something that was a source of confusion in the past. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-11-18 11:25:36 +00:00
Paweł Dziepak	eb59b4c4ab	keys: disable constructing from generic range stdx::optional<T> uses quite elaborate std::enable_if_t magic to decide whether the argument passed to its constructor should be used for a call T constructor or stdx::optional<T> constructor. Apparently, with GCC 6.2 having T constructor which accepts any type confuses that magic and we end up with compile errors. The solution is to have from_range() method that replaces that constructor from range. There is also constructor that creates a key from std::vector<bytes> so that code generated by IDL works as it did before. Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com> Message-Id: <1474550971-15309-1-git-send-email-pdziepak@scylladb.com>	2016-09-24 18:57:01 +03:00
Paweł Dziepak	d0ee750cec	keys: add memory_usage() Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-07-07 12:17:25 +01:00
Paweł Dziepak	7809adc6ce	keys: add compound_wrapper::tri_compare Signed-off-by: Paweł Dziepak <pdziepak@scylladb.com>	2016-06-20 21:29:48 +01:00
Tomasz Grabiec	57413618e8	Merge branch 'range-tombstone-v9' from https://github.com/duarten/scylla.git From Duarte: This patchset adds the range_tombstone_list data structure, used to hold a set of disjoint range tombstones, and changes the internal representation of row tombstones to use that data structure. Fixes #1155 [tgrabiec: Added compound_wrapper::make_empty(const schema&) overload to fix compilation failure in tracing code]	2016-06-02 22:17:17 +02:00

1 2 3

105 Commits