scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-26 19:35:12 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	6de342ed3e	locator: network_topology_strategy: Respect rack list when reallocating tablets	2025-10-02 19:42:39 +02:00
Tomasz Grabiec	029505b179	tablets: load_balancer: Move hints processing to tablet scheduler Hints have common meaning for all strategies, so the logic belongs more to make_sizing_plan(). As a side effect, we can reuse shard capacity computation across tables, which reduces computational complexity from O(tables * nodes) to O(tables * DCs + nodes)	2025-02-19 16:29:07 +01:00
Tomasz Grabiec	f1bda8d4c1	tablets: load_balancer: Scale down tablet count to respect per-shard tablet count goal The limit is enforced by controlling average per-shard tablet replica count in a given DC, which is controlled by per-table tablet count. This is effective in respecting the limit on individual shards as long as tablet replicas are distributed evenly between shards. There is no attempt to move tablets around in order to enforce limits on individual shards in case of imbalance between shards. If the average per-shard tablet count exceeds the limit, all tables which contribute to it (have replicas in the DC) are scaled down by the same factor. Due to rounding up to the nearest power of 2, we may overshoot the per-shard goal by at most a factor of 2. If different DCs want different scale factors of a given table, the lowest scale factor is chosen for a given table. The limit is configurable. It's a global per-cluster config which controls how many tablet replicas per shard in total we consider to be still ok. It controls tablet allocator behavior, when choosing initial tablet count. Even though it's a per-node config, we don't support different limits per node. All nodes must have the same value of that config. It's similar in that regard to other scheduler config items like tablets_initial_scale_factor and target_tablet_size_in_bytes.	2025-02-19 16:29:07 +01:00
Tomasz Grabiec	94b5165ac7	tablets: Use scheduler's make_sizing_plan() to decide about tablet count of a new table This makes decisions made by the scheduler consistent with decisions made on table creation, with regard to tablet count. We want to avoid over-allocation of tablets when table is created, which would then be reduced by the scheduler's scaling logic. Not just to avoid wasteful migrations post table creation, but to respect the per-shard goal. To respect the per-shard goal, the algorithm will no longer be as simple as looking at hints, and we want to share the algorithm between the scheduler and initial tablet allocator. So invoke the scheduler to get the tablet count when table is created.	2025-02-19 14:40:07 +01:00
Tomasz Grabiec	8eedb551b5	tablets: network_topology_strategy: Coroutinize calculate_initial_tablets_from_topology() To insert preemption points later.	2025-02-19 14:38:49 +01:00
Benny Halevy	32c2f7579f	network_topology_strategy: allocate_tablets_for_new_table: consider tablet options Use the keyspace initial_tablets for min_tablet_count if the latter isn't set, then take the maximum of the option-based tablet counts: min_tablet_count, expected_data_size_in_gb / target_tablet_size, and min_per_shard_tablet_count (via calculate_initial_tablets_from_topology). If none of the hints produce a positive tablet_count, fall back to calculate_initial_tablets_from_topology * initial_scale. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2025-02-06 08:59:32 +02:00
Piotr Smaron	3848293a43	cql: clean redundant code validating replication strategy options Most of the code from `recognized_options` is either incorrect or lacks any implementation, for example: - comments for Everywhere and Local strategies are contradictory, first says to allow all options, second says that the strategy doesn't accept any options, even though both functions have the same implementation, - for Local & Everywhere strategies the same logic is repeated in `validate_options` member functions, i.e. this function does nothing, - for NetworkTopology this function returns DC names and tablet options, but tablet options are empty; OTOH this strategy also accepts 'replication_factor' tag, which was omitted, - for SimpleStrategy this function returns `replication_factor`, but this is also validated in `validate_options` function called just before the removed function. All of it makes `validate_replication_strategy` work incorrectly. That being said, 3 tests fail because of this logic's removal, so it did something after all. The failing tests are commented out, so that the CI passes, and will be restored in the next commit(s).	2025-01-27 12:01:59 +01:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Benny Halevy	898cd1d404	network_topology_strategy: allocate_tablets_for_new_table: do not rely on token ownership Base initial tablets allocation for new table on the dc/rack topology, rather than on the token ring, to remove the dependency on token ownership. We keep the rack ordinal order in each dc to facilitate in-rack pairing of base/view replicas, and we apply load-balancing principles by sorting the nodes in each rack by their load (number of tablets allocated to the node), and attempting to fill least-loaded nodes first. This method is more efficient than circling the token ring and attempting to insert the endpoints to the natural_endpoint_tracker until the replication factor per dc is fulfilled, and it allows an easier way to incrementally allocate more replicas after rf is increased. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-03-27 12:06:21 +02:00
Pavel Emelyanov	3abdb3c7ee	tablets: Remove tablet_aware_replication_strategy::parse_initial_tablets It's now unused, the string with initial tablets is parsed elsewhere Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#17010	2024-01-29 10:03:38 +02:00
Botond Dénes	26d814d8be	Merge 'Configure initial tablets count scaling' from Pavel Emelyanov There are currently two options how to "request" the number of initial tablets for a table 1. specify it explicitly when creating a keyspace 2. let scylla calculate it on its own Both are not very nice. The former doesn't take cluster layout into consideration. The latter does, but starts with one tablet per shard, which can be too low if the amount of data grows rapidly. Here's a (maybe temporary) proposal to facilitate at least perf tests -- the --tablets-initial-scale-factor option that enhances the option number two above by multiplying the calculated number of tablets by the configured number. This is what we currently do to run perf tests by patching scylla, with the option it is going to be more convenient. Closes scylladb/scylladb#16919 * github.com:scylladb/scylladb: config: Add --tablets-initial-scale-factor tablet_allocator: Add initial tablets scale to config tablet_allocator: Add config	2024-01-23 13:25:12 +02:00
Kefu Chai	76b9e4f4f4	locator: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16914	2024-01-23 09:12:23 +02:00
Pavel Emelyanov	eb3b237e05	tablet_allocator: Add initial tablets scale to config When allocating tablets for a table for the first time, their initial count is calculated so that each shard in a cluster gets one tablet. It may happen that more than one initial tablet per shard is better, e.g. perf tests typically rely on that. It's possible to specify the initial tablets count when creating a keyspace, but this number doesn't take the cluster topology into consideration and may also be not very nice. As a temporary solution (e.g. for perf tests) we may add a configurable that scales the initial number of calculated tablets by some factor Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-01-22 19:14:45 +03:00
Pavel Emelyanov	45f4276de6	locator: Pass abstract_replication_strategy& into validate_tablet_options() It will need to check if the r.s. in question had been marked as per-table one in next patches. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-12-25 15:56:49 +03:00
Pavel Emelyanov	bf824d79d9	locator: Carry r.s. params into process_tablet_options() The latter method is the one that will need extended params in next patches. It's called from network_topology_strategy() constructor which already has params at hand. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-12-25 15:56:02 +03:00
Tomasz Grabiec	6d4d3d8bbd	locator: Introduce tablet_aware_replication_strategy tablet_aware_replication_strategy is a trait class meant to be inherited by replication strategies which want to work with tablets. The trait produces per-table effective_replication_map which looks at tablet metadata to determine replicas. No replication strategy is changed to use tablets yet in this patch.	2023-04-24 10:49:37 +02:00

16 Commits
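
The scale-down rule described in commit f1bda8d4c1 can be illustrated with a small sketch. This is not the code from the repository: the type table_load and the functions avg_replicas_per_shard, dc_scale_factor, table_scale_factor, round_up_pow2 and scaled_tablet_count are hypothetical names, and the exact rounding step (ceil, then round up to a power of 2) is an assumption made for illustration. It only mirrors what the commit message states: the average per-shard replica count in a DC is compared with the goal, all tables in that DC are scaled by the same factor, the lowest factor across DCs wins for a given table, and counts are rounded up to a power of 2.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical types and names for illustration; not taken from the ScyllaDB sources.
struct table_load {
    uint64_t tablet_count; // current tablet count of the table
    unsigned rf_in_dc;     // replication factor of the table in this DC
};

// Average number of tablet replicas per shard in one DC.
double avg_replicas_per_shard(const std::vector<table_load>& tables, uint64_t shards_in_dc) {
    assert(shards_in_dc > 0);
    uint64_t replicas = 0;
    for (const auto& t : tables) {
        replicas += t.tablet_count * t.rf_in_dc;
    }
    return static_cast<double>(replicas) / static_cast<double>(shards_in_dc);
}

// Factor applied to every table with replicas in the DC; 1.0 means no scaling.
double dc_scale_factor(double avg_per_shard, double per_shard_goal) {
    assert(per_shard_goal > 0);
    if (avg_per_shard <= per_shard_goal) {
        return 1.0;
    }
    return per_shard_goal / avg_per_shard;
}

// If DCs disagree, the lowest factor is chosen for the table.
double table_scale_factor(const std::vector<double>& dc_factors) {
    if (dc_factors.empty()) {
        return 1.0;
    }
    return *std::min_element(dc_factors.begin(), dc_factors.end());
}

// Smallest power of 2 that is >= n (returns 1 for n == 0).
uint64_t round_up_pow2(uint64_t n) {
    uint64_t p = 1;
    while (p < n) {
        p <<= 1;
    }
    return p;
}

// Assumed rounding: scale, take the ceiling, then round up to a power of 2.
uint64_t scaled_tablet_count(uint64_t tablet_count, double factor) {
    auto target = static_cast<uint64_t>(std::ceil(static_cast<double>(tablet_count) * factor));
    return round_up_pow2(target);
}
```

For example, a DC with 16 shards and two RF=3 tables of 256 and 128 tablets averages 72 replicas per shard. With a goal of 30 the factor is 30/72, so the tables are scaled to 128 and 64 tablets, which gives an average of 36 replicas per shard: above the goal, but within the factor-of-2 overshoot the commit message allows for.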