scylladb

Author	SHA1	Message	Date
Benny Halevy	15c9f0f0df	utils: to_string: generalize range helpers As seen in https://github.com/scylladb/scylladb/issues/13146 the current implementation is not general enough to provide print helpers for all kind of containers. Modernize the implementation using templates based on std::ranges::range and using fmt::join. Extend unit test for formatting different types of ranges, boost::transformed ranges, deque. Fixes #13146 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-02 10:48:46 +03:00
Benny Halevy	45153b58bd	utils: chunked_vector: add std::ranges::range ctor To be used in next patch for constructing chunked_vector from an initializer_list. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-02 10:48:46 +03:00
Kamil Braun	30cc07b40d	Merge 'Introduce tablets' from Tomasz Grabiec This PR introduces an experimental feature called "tablets". Tablets are a way to distribute data in the cluster, which is an alternative to the current vnode-based replication. Vnode-based replication strategy tries to evenly distribute the global token space shared by all tables among nodes and shards. With tablets, the aim is to start from a different side. Divide resources of replica-shard into tablets, with a goal of having a fixed target tablet size, and then assign those tablets to serve fragments of tables (also called tablets). This will allow us to balance the load in a more flexible manner, by moving individual tablets around. Also, unlike with vnode ranges, tablet replicas live on a particular shard on a given node, which will allow us to bind raft groups to tablets. Those goals are not yet achieved with this PR, but it lays the ground for this. Things achieved in this PR: - You can start a cluster and create a keyspace whose tables will use tablet-based replication. This is done by setting `initial_tablets` option: ``` CREATE KEYSPACE test WITH replication = {'class': 'NetworkTopologyStrategy', 'replication_factor': 3, 'initial_tablets': 8}; ``` All tables created in such a keyspace will be tablet-based. Tablet-based replication is a trait, not a separate replication strategy. Tablets don't change the spirit of replication strategy, it just alters the way in which data ownership is managed. In theory, we could use it for other strategies as well like EverywhereReplicationStrategy. Currently, only NetworkTopologyStrategy is augmented to support tablets. - You can create and drop tablet-based tables (no DDL language changes) - DML / DQL work with tablet-based tables Replicas for tablet-based tables are chosen from tablet metadata instead of token metadata Things which are not yet implemented: - handling of views, indexes, CDC created on tablet-based tables - sharding is done using the old method, it ignores the shard allocated in tablet metadata - node operations (topology changes, repair, rebuild) are not handling tablet-based tables - not integrated with compaction groups - tablet allocator piggy-backs on tokens to choose replicas. Eventually we want to allocate based on current load, not statically Closes #13387 * github.com:scylladb/scylladb: test: topology: Introduce test_tablets.py raft: Introduce 'raft_server_force_snapshot' error injection locator: network_topology_strategy: Support tablet replication service: Introduce tablet_allocator locator: Introduce tablet_aware_replication_strategy locator: Extract maybe_remove_node_being_replaced() dht: token_metadata: Introduce get_my_id() migration_manager: Send tablet metadata as part of schema pull storage_service: Load tablet metadata when reloading topology state storage_service: Load tablet metadata on boot and from group0 changes db, migration_manager: Notify about tablet metadata changes via migration_listener::on_update_tablet_metadata() migration_notifier: Introduce before_drop_keyspace() migration_manager: Make prepare_keyspace_drop_announcement() return a future<> test: perf: Introduce perf-tablets test: Introduce tablets_test test: lib: Do not override table id in create_table() utils, tablets: Introduce external_memory_usage() db: tablets: Add printers db: tablets: Add persistence layer dht: Use last_token_of_compaction_group() in split_token_range_msb() locator: Introduce tablet_metadata dht: Introduce first_token() dht: Introduce next_token() storage_proxy: Improve trace-level logging locator: token_metadata: Fix confusing comment on ring_range() dht, storage_proxy: Abstract token space splitting Revert "query_ranges_to_vnodes_generator: fix for exclusive boundaries" db: Exclude keyspace with per-table replication in get_non_local_strategy_keyspaces_erms() db: Introduce get_non_local_vnode_based_strategy_keyspaces() service: storage_proxy: Avoid copying keyspace name in write handler locator: Introduce per-table replication strategy treewide: Use replication_strategy_ptr as a shorter name for abstract_replication_strategy::ptr_type locator: Introduce effective_replication_map locator: Rename effective_replication_map to vnode_effective_replication_map locator: effective_replication_map: Abstract get_pending_endpoints() db: Propagate feature_service to abstract_replication_strategy::validate_options() db: config: Introduce experimental "TABLETS" feature db: Log replication strategy for debugging purposes db: Log full exception on error in do_parse_schema_tables() db: keyspace: Remove non-const replication strategy getter config: Reformat	2023-04-27 09:40:18 +02:00
Kefu Chai	f5b05cf981	treewide: use defaulted operator!=() and operator==() in C++20, compiler generate operator!=() if the corresponding operator==() is already defined, the language now understands that the comparison is symmetric in the new standard. fortunately, our operator!=() is always equivalent to `! operator==()`, this matches the behavior of the default generated operator!=(). so, in this change, all `operator!=` are removed. in addition to the defaulted operator!=, C++20 also brings to us the defaulted operator==() -- it is able to generated the operator==() if the member-wise lexicographical comparison. under some circumstances, this is exactly what we need. so, in this change, if the operator==() is also implemented as a lexicographical comparison of all memeber variables of the class/struct in question, it is implemented using the default generated one by removing its body and mark the function as `default`. moreover, if the class happen to have other comparison operators which are implemented using lexicographical comparison, the default generated `operator<=>` is used in place of the defaulted `operator==`. sometimes, we fail to mark the operator== with the `const` specifier, in this change, to fulfil the need of C++ standard, and to be more correct, the `const` specifier is added. also, to generate the defaulted operator==, the operand should be `const class_name&`, but it is not always the case, in the class of `version`, we use `version` as the parameter type, to fulfill the need of the C++ standard, the parameter type is changed to `const version&` instead. this does not change the semantic of the comparison operator. and is a more idiomatic way to pass non-trivial struct as function parameters. please note, because in C++20, both operator= and operator<=> are symmetric, some of the operators in `multiprecision` are removed. they are the symmetric form of the another variant. if they were not removed, compiler would, for instance, find ambiguous overloaded operator '=='. this change is a cleanup to modernize the code base with C++20 features. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13687	2023-04-27 10:24:46 +03:00
Tomasz Grabiec	5a24984147	utils, tablets: Introduce external_memory_usage()	2023-04-24 10:49:37 +02:00
Nadav Har'El	51fbc89df3	util/chunked_vector: more complete comment chunked_vector was headed by short comment which didn't really explain why it exists and how and why it really differs from std::dequeue. Moreover, it made the vague claim that it "limits" contiguous allocations, which it really doesn't (at least not in the asymptotic sense). In this patch I wrote a much longer comment, which I hope will clearly explain exactly what chunked_vector is, how it really differs in its contiguous allocations from std::deque, and what it guarantees and doesn't guarantee. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #10857	2022-06-23 10:33:35 +03:00
Tomasz Grabiec	01eeb33c6e	utils/chunked_vector: Fix sigsegv during reserve() Fixes the case of make_room() invoked with last_chunk_capacity_deficit but _size not in the last reserved chunk. Found during code review, no known user impact. Fixes #10363. Message-Id: <20220411222605.641614-1-tgrabiec@scylladb.com>	2022-04-12 16:35:17 +03:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Benny Halevy	f7b8b809d0	sstables: parse chunked_vector<std::integral Members>: maximize chunk size Currently this parse function reads only 100KB worth of members in eac hiteration. Since the default max_chunk_capacity is 128KB, 100KB underutilize the chunk capacity, and it could be safely increased to the max to reduce the number of allocations and corresponding calls to read_exactly for large arrays. Expose utils::chunked_vector::max_chunk_capacity so that the caler wouldn't have to guess this number and use it in parse(). Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211222103126.1819289-2-bhalevy@scylladb.com>	2021-12-22 15:47:37 +02:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Botond Dénes	7f07b95dd3	utils/chunked_vector: reserve_partial(): better explain how to properly use Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20201110130953.435123-1-bdenes@scylladb.com>	2020-11-10 15:45:01 +02:00
Botond Dénes	bb908b1750	utils/chunked_vector: add reserve_partial() A variant of reserve() which allows gentle reserving of memory. This variant will allocate just one chunk at a time. To drive it to completion, one should call it repeatedly with the return value of the previous call, until it returns 0. This variant will be used in the next patch by the large bitset creation code, to avoid stalls when allocating large bloom filters (which are backed by large bitset).	2020-11-02 18:02:01 +02:00
Avi Kivity	eaa9a5b0d7	utils::chunked_vector: add rbegin() and related iterators Needed as an std::vector replacement.	2019-08-01 18:39:47 +03:00
Avi Kivity	df6faae980	utils: chunked_vector: make begin()/end() const correct begin() of a const vector should return a const_iterator, to avoid giving the caller the ability to mutate it. This slipped through since iterator's constructor does a const_cast. Noticed by code inspection.	2019-08-01 18:38:53 +03:00
Tomasz Grabiec	b0f5df10d2	utils: chunked_vector: Do not require T to be default-constructible for clear() resize(), used by clear(), requires T to be default-constructible in case the vector is expanded. It's not actually needed for clearing, and there will be users which use clear() with non-default-constructible T, so implement clear() without using resize().	2018-07-11 16:55:20 +02:00
Tomasz Grabiec	03832dab97	utils: chunked_vector: Implement front() std::vector<> has it, so should this, for easy migration.	2018-07-11 16:55:20 +02:00
Tomasz Grabiec	db36ff0643	utils: Extract small_vector.hh	2018-05-30 14:41:41 +02:00
Glauber Costa	7190bb4f95	chunked_vector: exports its current memory usage There are times in which we would like to estimate how much memory a chunked_vector is using. We have two strategies to do it: 1) multiply the size by the size of the elements. That is wrong, because the chunked_vector can allocate larger chunks in anticipation of more elements to come. 2) multiply the number of chunks by 128kB. That is also wrong, because the chunk_vector will not always allocate the entire chunk if there are only a few elements in it. The best way to deal with it is to allow the chunked_vector to exports its current memory usage. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-05-15 11:22:21 -04:00
Glauber Costa	00d04b49a0	chunked_vector: do not iterate to destruct trivially destructible types Signed-off-by: Glauber Costa <glauber@scylladb.com>	2018-03-14 09:16:54 -04:00
Avi Kivity	d9ee2ad9f0	chunked_vector: avoid boost::small_vector with old boost versions Apparently older boost versions have a bug resulting in a double-free in boost::container::small_vector. Use std::vector instead. Fixes #2748. Tested-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <20170903170207.21635-1-avi@scylladb.com>	2017-09-07 09:32:51 +03:00
Avi Kivity	3ba2c0652d	utils: add a new container type chunked_vector We currently use std::deque<> for when we need large random-access containers, but deque<> requires nr_items * sizeof(T) / 64 bytes of contiguous memory, which can exceed our 256k fragmentation unit with large sstables. The new container, which is a cross between deque and vector, has much lower limitations. Like deque, we allocate chunks of contiguous items, but they are 128k in size instead of 512. The last chunk can be smaller to avoid allocating 128k for a really small vector.	2017-08-26 16:44:45 +03:00

21 Commits