scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 03:56:42 +00:00

Author	SHA1	Message	Date
Kefu Chai	c429a8d8ae	sstables: use "me" sstable format by default in `7952200c`, we changed the `selected_format` from `mc` to `me`, but to be backward compatible the cluster starts with "md", so when the nodes in cluster agree on the "ME_SSTABLE_FORMAT" feature, the format selector believes that the node is already using "ME", which is specified by `_selected_format`. even it is actually still using "md", which is specified by `sstable_manager::_format`, as changed by `54d49c04`. as explained above, it was specified to "md" in hope to be backward compatible when upgrading from an existign installation which might be still using "md". but after a second thought, since we are able to read sstables persisted with older formats, this concern is not valid. in other words, `7952200c` introduced a regression which changed the "default" sstable format from `me` to `md`. to address this, we just change `sstable_manager::_format` to "me", so that all sstables are created using "me" format. a test is added accordingly. Fixes #18995 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19293	2024-06-21 12:56:01 +03:00
Yaron Kaikov	57428d373b	[actions] fix sync label from PR to linked issue in `b8c705bc54` i modified the even name to `pull_request_target`, This caused skipping sync process when PR label was added/removed Fixing it Closes scylladb/scylladb#19408	2024-06-21 11:39:44 +03:00
Kamil Braun	627d566811	Merge 'join_token_ring, gossip topology: recalculate sync nodes in wait_alive' from Patryk Jędrzejczak The node booting in gossip topology waits until all NORMAL nodes are UP. If we removed a different node just before, the booting node could still see it as NORMAL and wait for it to be UP, which would time out and fail the bootstrap. This issue caused scylladb/scylladb#17526. Fix it by recalculating the nodes to wait for in every step of the of the `wait_alive` loop. Although the issue fixed by this PR caused only test flakiness, it could also manifest in real clusters. It's best to backport this PR to 5.4 and 6.0. Fixes scylladb/scylladb#17526 Closes scylladb/scylladb#19387 * github.com:scylladb/scylladb: join_token_ring, gossip topology: update obsolete comment join_token_ring, gossip topology: fix indendation after previous patch join_token_ring, gossip topology: recalculate sync nodes in wait_alive	2024-06-21 10:22:32 +02:00
Piotr Dulikowski	c3536015e4	Merge 'cql3/statement/select_statement: do not parallelize single-partition aggregations' from Michał Jadwiszczak This patch adds a check if aggregation query is doing single-partition read and if so, makes the query to not use forward_service and do not parallelize the request. Fixes scylladb/scylladb#19349 Closes scylladb/scylladb#19350 * github.com:scylladb/scylladb: test/boost/cql_query_test: add test for single-partition aggregation cql3/select_statement: do not parallelize single-partition aggregations	2024-06-21 08:50:00 +02:00
Avi Kivity	fdc1449392	treewide: rename flat_mutation_reader_v2 to mutation_reader flat_mutation_reader_v2 was introduced in a pair of commits in 2021: `e3309322c3` "Clone flat_mutation_reader related classes into v2 variants" `08b5773c12` "Adapt flat_mutation_reader_v2 to the new version of the API" as a replacement for flat_mutation_reader, using range_tombstone_change instead of range_tombstone to represent represent range tombstones. See those commits for more information. The transition was incremental; the last use of the original flat_mutation_reader was removed in 2022 in commit `026f8cc1e7` "db: Use mutation_partition_v2 in mvcc" In turn, flat_mutation_reader was introduced in 2017 in commit `748205ca75` "Introduce flat_mutation_reader" To transition from a mutation_reader that nested rows within a partition in a separate stream, to a flat reader that streamed partitions and rows in the same stream. Here, we reclaim the original name and rename the awkward flat_mutation_reader_v2 to mutation_reader. Note that mutation_fragment_v2 remains since we still use the original for compatibilty, sometimes. Some notes about the transition: - files were also renamed. In one case (flat_mutation_reader_test.cc), the rename target already existed, so we rename to mutation_reader_another_test.cc. - a namespace 'mutation_reader' with two definitions existed (in mutation_reader_fwd.hh). Its contents was folded into the mutation_reader class. As a result, a few #includes had to be adjusted. Closes scylladb/scylladb#19356	2024-06-21 07:12:06 +03:00
Avi Kivity	185338c8cf	Merge 'Reduce TWCS off-strategy space overhead' from Raphael "Raph" Carvalho Normally, the space overhead for TWCS is 1/N, where is number of windows. But during off-strategy, the overhead is 100% because input sstables cannot be released earlier. Reshaping a TWCS table that takes ~50% of available space can result in system running out of space. That's fixed by restricting every TWCS off-strategy job to 10% of free space in disk. Tables that aren't big will not be penalized with increased write amplification, as all input (disjoint) sstables can still be compacted in a single round. Fixes #16514. Closes scylladb/scylladb#18137 * github.com:scylladb/scylladb: compaction: Reduce twcs off-strategy space overhead to 10% of free space compaction: wire storage free space into reshape procedure sstables: Allow to get free space from underlying storage replica: don't expose compaction_group to reshape task	2024-06-20 18:51:25 +03:00
Kefu Chai	42b9784650	build: cmake: mark wasm "ALL" so that "wasm" target is built. "wasm" generates the text format of wasm code. and these wasm applications are used by the test_wasm tests. the rules generated by `configure.py` adds these .wat files as a dependency of `{mode}-build`, which is in turn a dependency of `{mode}`. in this change, let's mirror this behavior by making `wasm` ALL, so it is built by the default target. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19391	2024-06-20 18:45:31 +03:00
Kefu Chai	caf1149f11	cql-pytest/test_sstable: do not import unused modules Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19389	2024-06-20 17:14:28 +03:00
Avi Kivity	02cf17f4dc	Merge 'Sanitize load_meter API handlers management' from Pavel Emelyanov The service in question is pretty small one, but it has its API endpoint that lives in /storage_service group. Currently when a service starts and has any endpoints that depend on it, the endpoint registration should follow it (#2737). Here's the PR that does it for load meter. Another goal of this change is that http context now has one less dependency onboard. Closes scylladb/scylladb#19390 * github.com:scylladb/scylladb: api: Remove ctx->load_meter dependency api: Use local load_meter reference in handlers api: Fix indentation after previous patch api: Coroutinize load_meter::get_load_map handler api: Move load meter handlers api: Add set/unset methods for load_meter	2024-06-20 17:07:19 +03:00
Anna Stuchlik	027cf3f47d	doc: remove the link to Scylladb Google group The group is no longer active and should be removed from resources. Closes scylladb/scylladb#19379	2024-06-20 15:31:03 +02:00
Yaron Kaikov	f2705b3887	[action] add github context info for better debugging It seems that we skip the sync label process between PR and linked Issues Adding those debug prints will allow us to understand why Closes scylladb/scylladb#19393	2024-06-20 16:17:04 +03:00
Pavel Emelyanov	de80094815	Merge 'treewide: remove unused operator<<' from Kefu Chai since we've switched almost all callers of the operator<< to {fmt}, let's drop the unused operator<<:s. there are more occurrences of unused operator<< in the tree, but let's do the cleanup piecemeal. --- this is a cleanup, so no need to backport Closes scylladb/scylladb#19346 * github.com:scylladb/scylladb: types: remove unused operator<< node_ops: remove unused operator<< lang: remove unused operator<< gms: remove unused operator<< dht: remove unused operator<< test: do not use operator<< for std::optional	2024-06-20 13:18:59 +03:00
Pavel Emelyanov	873d76c02b	api: Remove ctx->load_meter dependency Now the API uses captured reference and the explicit dependency is not needed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:38:28 +03:00
Pavel Emelyanov	d85e70ef98	api: Use local load_meter reference in handlers Now it uses ctx.lm dependency, but the idiomatic way for API is to use the argument one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:37:48 +03:00
Pavel Emelyanov	bc5e360066	api: Fix indentation after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:37:39 +03:00
Pavel Emelyanov	e54f651beb	api: Coroutinize load_meter::get_load_map handler Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:37:18 +03:00
Pavel Emelyanov	40c178bee2	api: Move load meter handlers Now they are in storage service set/unset helper, but there's the dedicated set/unset pair for meter's enpoints. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:36:38 +03:00
Pavel Emelyanov	724d62aa87	api: Add set/unset methods for load_meter The meter is pretty small sevice and its API is also tiny. Still, it's a standalone top-level service, and its API should come next to it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-06-20 12:35:58 +03:00
Botond Dénes	b09196ac49	Merge 'tasks: fix tasks abort' from Aleksandra Martyniuk Currently if task_manager::task::impl::abort preempts before children are recursively aborted and then the task gets unregistered, we hit use after free since abort uses children vector which is no longer alive. Modify abort method so that it goes over all tasks in task manager and aborts those with the given parent. Fixes: #19304. Requires backport to all versions containing task manager Closes scylladb/scylladb#19305 * github.com:scylladb/scylladb: test: add test for abort while a task is being unregistered tasks: fix tasks abort	2024-06-20 12:09:30 +03:00
Kefu Chai	1a724f22f9	mutation: silence false alarm from clang-tidy before this change, because it seems that we move away from `p2` in each iteration, so the succeeding iterations are moving from an empty `p2`, clang-tidy warns at seeing this. but we only move from `p2._static_row` in the first iteration when the dest `mutation_partition` instance's static row is empty. and in the succeeding iterations, the dest `mutation_partition` instance's static row is not empty anymore if it is set. so, this is a false alarm. in this change, we silence this warning. another option is to extract the single-shot mutation out of the loop, and pass the `std::move(p2)` only for the single-shot mutation, but that'd be a much more intrusive change. we can revisit this later. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19331	2024-06-20 12:05:20 +03:00
Kefu Chai	9f0b60c7a0	rust: disable incremental build for release build so that the release build is reproducible. a reproduciable helps developers to perform postmortem debugging. Fixes #19225 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19374	2024-06-20 12:01:14 +03:00
Patryk Jędrzejczak	bcc0a352b7	join_token_ring, gossip topology: update obsolete comment The code mentioned in the comment has already been added. We change the comment to prevent confusion.	2024-06-20 10:59:50 +02:00
Patryk Jędrzejczak	7735bd539b	join_token_ring, gossip topology: fix indendation after previous patch	2024-06-20 10:59:50 +02:00
Patryk Jędrzejczak	017134fd38	join_token_ring, gossip topology: recalculate sync nodes in wait_alive Before this patch, if we booted a node just after removing a different node, the booting node may still see the removed node as NORMAL and wait for it to be UP, which would time out and fail the bootstrap. This issue caused scylladb/scylladb#17526. Fix it by recalculating the nodes to wait for in every step of the of the `wait_alive` loop.	2024-06-20 10:59:49 +02:00
Anna Stuchlik	680405b465	doc: separate Entrprise- from OSS-only content This commit adds files that contain Open Source-specific information and includes these files with the .. scylladb_include_flag:: directive. The files include a) a link and b) Table of Contents. The purpose of this update is to enable adding Open Source/Enterprise-specific information in the Reference section. Closes scylladb/scylladb#19362	2024-06-20 11:58:32 +03:00
Piotr Dulikowski	75441ee120	Merge 'mv: fix value of the gossiped view update backlog' from Wojciech Mitros Currently, when calculating the view update backlog for gossip, we start with `db::view::update_backlog()` and compare it to backlogs from all shards. However, this backlog can't be compared to other backlogs - it has size 0 and we compare the fraction current/size when comparing backlogs, causing us to compare with `NaN`. This patch fixes it by starting the comparisons with an empty backlog. The patch introducing this issue (`f70f774e40`) wasn't backported, so this one doesn't need to be either Closes scylladb/scylladb#19247 * github.com:scylladb/scylladb: mv: make the view update backlog unmofidiable mv: fix value of the gossiped view update backlog	2024-06-20 06:27:11 +02:00
Piotr Dulikowski	78a40dbe2c	Merge 'cql: remove global_req_id from schema_altering_statement' from Marcin Maliszkiewicz Such field is no longer needed as the information comes directly from group0_batch. Fixes scylladb/scylladb#19365 Backport: no, we don't backport code cleanups Closes scylladb/scylladb#19366 * github.com:scylladb/scylladb: cql: remove global_req_id from schema_altering_statement cql: switch alter keyspace prepare_schema_mutations to use group0_batch	2024-06-20 06:21:48 +02:00
Dawid Medrek	c56de90a26	test/boost/hint_test.cc: Add missing parse() callback Before these changes, compilation was failing with the following error: In file included from test/boost/hint_test.cc:12: /usr/include/fmt/ranges.h:298:7: error: no member named 'parse' in 'fmt::formatter<db::hints::sync_point::host_id_or_addr>' 298 \| f.parse(ctx); \| ~ ^ We add the missing callback. Closes scylladb/scylladb#19375	2024-06-19 23:19:33 +02:00
Wojciech Mitros	cde14a5788	mv: make the view update backlog unmofidiable Currently, a view update backlog may reach an invalid state, when its max is 0 and its relative_size() is NaN as a result. This can be achieved either by constructing the backlog with a 0 max or by modifying the max of an existing backlog. In particular, this happens when creating the backlog using the default constructor. In this patch the the default constructor is deleted and a check is added to make sure that the max is different than 0 is added to its constructor - if the check fails, we construct an empty backlog instead, to handle the possibility of getting an invalid backlog sent from a node with a version that's missing this check. Additionally, we make the backlogs members private, exposing them only through const getters.	2024-06-19 19:44:57 +02:00
Pavel Emelyanov	5fe4290f66	gitattributes: Mark swagger .js files as binary The goal is the same as in `29768a2d02` (gitattributes: Mark *.svg as binary) -- prevent grep from searching patterns in those files. Despite those files are, in fact, javascript code, the way they are formatted is not suitable for human reading, so it's unlikely that anyone would be interested in grep-ing patters in it. At the same time, those files consist of of very long lines, so if a grep finds a pattern in one of those, the output is spoiled. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#19357	2024-06-19 15:07:56 +03:00
Botond Dénes	9d1fa828be	Merge 'utils/large_bitset: replace reserve_partial with utils::reserve_gently' from Lakshmi Narayanan Sreethar Replace the reserve_partial loop in large_bitset constructor with a new function - reserve_gently() that can reserve memory without stalling by repeatedly calling reserve_partial() method of the passed container. Closes scylladb/scylladb#19361 * github.com:scylladb/scylladb: utils/large_bitset: replace reserve_partial with utils::reserve_gently utils/stall_free: introduce reserve_gently	2024-06-19 14:31:59 +03:00
Michał Jadwiszczak	8eb5ca8202	test/boost/cql_query_test: add test for single-partition aggregation	2024-06-19 09:24:17 +02:00
Piotr Dulikowski	7567b87e72	Merge 'auth: reuse roles select query during cache population' from Marcin Maliszkiewicz With big number of shards in the cluster (e.g. 500+) due to cache periodic refresh we experience high load on role_permissions table (e.g. 1k op/s). The load on roles table is amplified because to populate single entry in the cache we do several selects on roles table. Some of this can't be avoided because roles are arranged in a tree-like structure where permissions can be inherited. This patch tries to reuse queries which are simply duplicated. It should reduce the load on roles table by up to 50%. Fixes scylladb/scylladb#19299 Closes scylladb/scylladb#19300 * github.com:scylladb/scylladb: auth: reuse roles select query during cache population auth: coroutinize service::get_uncached_permissions auth: coroutinize service::has_superuser	2024-06-19 07:53:47 +02:00
Marcin Maliszkiewicz	56707e2965	cql: remove global_req_id from schema_altering_statement Such field is no longer needed as the information comes directly from group0_batch. Fixes scylladb/scylladb#19365	2024-06-18 20:26:09 +02:00
Lakshmi Narayanan Sreethar	9ad800cfb9	utils/large_bitset: replace reserve_partial with utils::reserve_gently Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2024-06-18 23:36:30 +05:30
Lakshmi Narayanan Sreethar	31414f54c6	utils/stall_free: introduce reserve_gently Add reserve_gently() that can reserve memory without stalling by repeatedly calling reserve_partial() method of the passed container. Update the comments of existing reserve_partial() methods to mention this newly introduced reserve_gently() wrapper. Also, add test to verify the functionality. Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>	2024-06-18 23:36:30 +05:30
Marcin Maliszkiewicz	685aecde61	cql: switch alter keyspace prepare_schema_mutations to use group0_batch This is needed to simplify the code in the following commit.	2024-06-18 19:54:55 +02:00
Michał Jadwiszczak	e9ace7c203	cql3/select_statement: do not parallelize single-partition aggregations Currently reads with WHERE clause which limits them to be single-partition reads, are unnecessarily parallelized. This commit checks this condition and the query doesn't use forward_service in single-partition reads.	2024-06-18 19:21:32 +02:00
Pavel Emelyanov	f7d5d4877c	Merge '[test.py] Fix several issues in log gathering' from Andrei Chekun Related: https://github.com/scylladb/scylladb/issues/17851 Fix the issue that test logs were not deleted Fix the issue that the URL to the failed test directory was incorrectly shown even when artifacts_dir_url option was not provided Fix the issue that there were no node logs when it failed to join the cluster Closes scylladb/scylladb#19115 * github.com:scylladb/scylladb: [test.py] Fix logs had multiplication of lines [test.py] Fix log not deleted [test.py] Fix log for failed node was nod added to failed directory [test.py] Fix URl for failed logs directory in CI	2024-06-18 15:37:29 +03:00
Aleksandra Martyniuk	50cb797d95	test: add test for abort while a task is being unregistered	2024-06-18 13:41:51 +02:00
Aleksandra Martyniuk	3463f495b1	tasks: fix tasks abort Currently if task_manager::task::impl::abort preempts before children are recursively aborted and then the task gets unregistered, we hit use after free since abort uses children vector which is no longer alive. Modify abort method so that it goes over all tasks in task manager and aborts those with the given parent. Fixes: #19304.	2024-06-18 13:39:29 +02:00
Botond Dénes	2123b22526	Merge 'doc: add 6.x.y to 6.x.z and remove 5.x.y to 5.x.z upgrade guide' from Anna Stuchlik This PR removes the 5.x.y to 5.x.z upgrade guide and adds the 6.x.y to 6.x.z upgrade guide. The previous maintenance upgrade guides, such as from 5.x.y to 5.x.z, consisted of several documents - separate for each platform. The new 6.x.y to 6.x.z upgrade guide is one document - there are tabs to include platform-specific information (we've already done it for other upgrade guides as one generic document is more convenient to use and maintain). I did not modify the procedures. At some point, they have been reviewed for previous upgrade guides. Fixes https://github.com/scylladb/scylladb/issues/19322 - This PR must be backported to branch-6.0, as it adds 6.x specific content. Closes scylladb/scylladb#19340 * github.com:scylladb/scylladb: doc: remove the 5.x.y to 5.x.z upgrade guide doc: add the 6.x.y to 6.x.z upgrade guide-6	2024-06-18 14:24:38 +03:00
Wojciech Mitros	1de5566cfa	mv: fix value of the gossiped view update backlog Currently, when calculating the view update backlog for gossip, we start with `db::view::update_backlog()` and compare it to backlogs from all shards. However, this backlog can't be compared to other backlogs - it has size 0 and we compare the fraction current/size when comparing backlogs, causing us to compare with `NaN`. This patch fixes it by starting the comparisons with an empty backlog.	2024-06-18 13:15:18 +02:00
Kefu Chai	87247c6542	.github: add workflow to build with latest seastar so we can be awared that if scylla builds with seastar master HEAD, and to be prepared if a build failure is found. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19135	2024-06-18 13:34:43 +03:00
Andrei Chekun	6a4b441bf2	[test.py] Fix logs had multiplication of lines Since the test name was not unique across the run and when we were using a --repeat option, there were several handlers for the same file. With this change test name and accordingly, the log name will be different for the same test but different repeat case. Remove mode from the test name since it's already in mode directory.	2024-06-18 11:14:07 +02:00
Andrei Chekun	b01a5f9bd9	[test.py] Fix log not deleted One of the created log files was not deleted at all, because there was no delete command. Unlink moved on later stage explicitly after removing the handler that writing to this file to avoid the possibility that something will be added after removing the file.	2024-06-18 11:14:01 +02:00
Kefu Chai	0a74d45425	build: cmake: add commitlog_cleanup_test in `94cdfcaa94`, we added commitlog_cleanup_test to `configure.py`, but didn't add it to the CMake building system. in this change, let's add it to the CMake building system. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19314	2024-06-18 12:12:28 +03:00
Kefu Chai	68ef7dda79	config: correct the comment on printable_to_json() seastar::format() does not use operator<< under the hood, it uses {fmt}, so update the comment accordingly. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#19315	2024-06-18 12:08:59 +03:00
Nadav Har'El	2ec1e0f0d5	test/cql-pytest: tests verifying UUID sort order In issue #15561 some doubts were raised regarding the way ScyllaDB sorts UUID values. This patch adds a heavily-commented cql-pytest test that helps understand - and verify that understanding - of the way Scylla sorts UUIDs, and shows there is some reason in the madness (in particular, Version 1 UUIDs (time uuids) are sorted like timeuuids, and not as byte arrays. The new tests check the different cases (see the comments in the test), and as usual for cql-pytest tests - they passes also on Cassandra, which allows us to confirm that the sort order we used is identical to the one used by Cassandra and not something that Scylla mis-implemented. Having this test in our suite will also ensure that the UUID ordering never changes accidentally in the future. If it ever changes, it can break access to existing tables that use UUID clustering keys, so it shouldn't change. Fixes #15561 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#19343	2024-06-18 12:05:30 +03:00
Pavel Emelyanov	147552c34a	Merge 'configurable maintenance (streaming) semaphore count resource limit' from Botond Dénes Making the count resources on the maintenance (streaming) semaphore live update via config. This will allow us to improve repair speed on mixed-shard clusters, where we suspect that reader trashing -- due to the combination of high number of readers on each shard and very conservative reader count limit (10) -- is the main cause of the slowness. Making this count limit confgurable allows us to start experimenting with this fix, without committing to a count limit increase (or removal), addressing the pain in the field. Refs: #18269 No OSS backport needed. Closes scylladb/scylladb#19248 * github.com:scylladb/scylladb: replica/database: wire in maintenance_reader_concurrency_semaphore_count_limit db/config: introduce maintenance_reader_concurrency_semaphore_count_limit reader_concurrency_semaphore: make count parameter live-update	2024-06-18 12:02:24 +03:00

1 2 3 4 5 ...

43273 Commits