scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-06 23:13:15 +00:00

Author	SHA1	Message	Date
Patryk Jędrzejczak	3a100cd16c	test: test_raft_recovery_stuck: ensure raft upgrade procedure failed We have log browsing in test.py now, so we can fix this TODO easily. Closes scylladb/scylladb#18425	2024-04-26 10:16:49 +02:00
Asias He	62a9ecff51	repair: Cleanup repair history status entry for tablet The entry in the repair history map that is used to track repair status internally for each repair job should be removed after the repair job is done. We do the same for vnode repairs. This patch adds the missing automatic history cleanup code which is missed in the initial tablet repair support in commit `54239514af`, which does not support repair history update back then. Refs #17046 Closes scylladb/scylladb#18434	2024-04-26 10:56:45 +03:00
Botond Dénes	044fd7a3ec	Merge 'Move some view updating methods from table to view_update_generator' from Pavel Emelyanov The populate_views() and generate_and_propagate_view_updates() both naturally belong to view_update_generator -- they don't need anything special from table itself, but rather depend on some internals of the v.u.generator itself. Moving them there lets removing the view concurrency semaphore from keyspace and table, thus reducing the cross-components dependencies. Closes scylladb/scylladb#18421 * github.com:scylladb/scylladb: replica: Do not carry view concurrency semaphore pointer around view: Get concurrency semaphore via database, not table view_update_generator: Mark mutate_MV() private view: Move view_update_generator methods' code view: Move table::generate_and_propagate_view_updates into view code view: Move table::populate_views() into view_update_generator class	2024-04-26 10:55:38 +03:00
Botond Dénes	d566eec89a	Merge 'treewide: remove {dclocal_,}read_repair_chance options' from Kefu Chai dclocal_read_repair_chance and read_repair_chance have been removed in Cassandra 3.11 and 4.x, see https://issues.apache.org/jira/browse/CASSANDRA-13910. if we expose these properties via DDL, Cassandra would fail to consume the CQL statement creating the table when performing migration from Scylla to Cassandra 4.x, as the latter does not understand these properties anymore. currently the default values of `dc_local_read_repair_chance` and `read_repair_chance` are both "0". so they are practically disabled, unless user deliberately set them to a value greater than 0. also, as a side effect, Cassandra 4.x has better support of Python3. the cqlsh shipped along with Cassandra 3.11.16 only supports python2.7, see https://github.com/apache/cassandra/blob/cassandra-3.11.16/bin/cqlsh.py it errors out if the system only provides python3 with the error of ``` No appropriate python interpreter found. ``` but modern linux systems do not provide python2 anymore. so, in this change, we deprecate these two options. Fixes #3502 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18087 * github.com:scylladb/scylladb: docs: drop documents related to {,dclocal_}read_repair_chance treewide: remove {dclocal_,}read_repair_chance options	2024-04-26 10:48:47 +03:00
Michał Chojnowski	c1146314a1	docs: clarify that `DELETE` can be used with `USING TIMEOUT` The current text seems to suggest that `USING TIMEOUT` doesn't work with `DELETE` and `BATCH`. But that's wrong. Closes scylladb/scylladb#18424	2024-04-26 10:48:17 +03:00
Pavel Emelyanov	4ac30e5337	view-builder: Print correct exception in built ste exception handler Inside .handle_exception() continuation std::current_exception() doesn't work, there's std::exception ex argument to handler's lambda instead fixes #18423 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#18349	2024-04-26 09:58:45 +03:00
Botond Dénes	7cbe5c78b4	install.sh: use the native nodetool directly * tools/java b810e8b00e...4ee15fd9ea (1): > install.sh: don't install nodetool into /usr/bin Add a bin/nodetool and install it to bin/ in install.sh. This script simply forwards to scylla nodetool and it is the replacement for the Java nodetool, which is dropped from the java-tools's install.sh, in the submodule update also included in this patch. With this change, we now hardwire the usage of the native nodetool, as the nodetool, with the intermediary nodetool wrapper script removed from the picture. Bash completion was copied from the java tools repository and it is now installed by the scylla package, together with nodetool. The Java nodetool is still available as as a fall-back, in case the native nodetool has problems, at the path of /opt/scylladb/share/cassandra/bin/nodetool. Testing I tested upgrades on a DEB and RPM distro: Ubuntu and Fedora. First I installed scylla-5.4, then I installed the packages for this PR. On Ubuntu, I had to use dpkg -i --auto-deconfigure, otherwise, dpkg would refuse to install the new packages because they break the old ones. No extra flags were required on Fedora. In both cases, /usr/bin/nodetool was changed from a thunk calling the Java nodetool (from 5.4) to the native launcher script from this PR. /opt/scylladb/share/cassandra/bin/nodetool remained in place and still works after the upgrade. I also verified that --nonroot installs also work. Nodetool works both when called with an absolute path, or when ~/scylladb/bin is added to $PATH. Fixes: #18226 Fixes: #17412 Closes scylladb/scylladb#18255 [avi: reset submodule to actual hash we ended up with]	2024-04-25 22:52:00 +03:00
Avi Kivity	c2b8ca7d71	Merge 'cql3: statements: change default tombstone_gc mode for tablets' from Aleksandra Martyniuk Repair may miss some tablets that migrated across nodes. So if tombstones expire after some timeout, then we can have data resurrection. Set default tombstone_gc mode to "repair" for tables which use tablets (if repair is required). Fixes: #16627. Closes scylladb/scylladb#18013 * github.com:scylladb/scylladb: test: check default value of tombstone_gc test: topology: move some functions to util.py cql3: statements: change default tombstone_gc mode for tablets	2024-04-25 19:18:37 +03:00
Lakshmi Narayanan Sreethar	6af2659b57	sstables: reclaim_memory_from_components: do not update _recognised_components When reclaiming memory from bloom filters, do not remove them from _recognised_components, as that leads to the on-disk filter component being left back on disk when the SSTable is deleted. Fixes #18398 Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com> Closes scylladb/scylladb#18400	2024-04-25 19:15:59 +03:00
Raphael S. Carvalho	4a5fdc5814	table: Remove outdated FIXME about sstable spanning multiple tablets The FIXME was added back then because we thought the interface of compaction_group_for_sstable might have to be adjusted if a sstable were allowed to temporarily span multiple tablets until it's split, but we have gone a different path. If a sstable's key range incorrectly spans more than one tablet, that will be considered a bug and an exception is thrown. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#18410	2024-04-25 17:21:11 +03:00
Marcin Maliszkiewicz	7085339f72	cql3: test: include get_mutations_internal log in test.py We have a concurrent modification conflict in tests and suspect duplicated requests but since we don't log successful requests we have no way to verify if that's the case. get_mutations_internal log will help to tell wchich nodes are trying to push auth or service levels mutations into raft. Refs scylladb/scylladb#18319 Closes scylladb/scylladb#18413	2024-04-25 17:17:53 +03:00
Botond Dénes	0234b4542a	Merge '[github] add PR template and action to verify PR tasks was completed' from Yaron Kaikov Today with the backport automation, the developer added the relevant backport label, but without any explanation of why Adding the PR template with a placeholder for the developer to add his decision about backport yes or no The placeholder is marked as a task, so once the explanation is added, the task must be checked as completed Also adding another check to the PR summary will make it clear to the maintainer/reviewer if the developer explained about backport Closes scylladb/scylladb#18275 * github.com:scylladb/scylladb: [github] add action to verify PR tasks was completed [github] add PR template	2024-04-25 17:14:50 +03:00
Pavel Emelyanov	18cc2cfa31	replica: Generalize snapshot details for single table/snapshot dir There are two places that get total:live stats for a table snapshot -- database::get_snapshot_details() and table::get_snapshot_details(). Both do pretty similar thing -- walk the table/snapshots/ directory, then each of the found sub-directory and accumulate the found files' sizes into snapshot details structure. Both try to tell total from live sizes by checking whether an sstable component found in snapshots is present in the table datadir. The database code does it in a more correct way -- not just checks the file presense, but also compares if it's a hardlink on the snapshot file, while the table code just checks if the file of the same name exists. This patch does both -- makes both database and table call the same helper method for a single snapshot details, and makes the generalized version use more elaborated collision check, thus fixing the per-table details getting behavior. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#18347	2024-04-25 17:12:42 +03:00
Asias He	1ca779d287	streaming: Fix use after move in fire_stream_event The event is used in a loop. Found by clang-tidy: ``` streaming/stream_result_future.cc:80:49: warning: 'event' used after it was moved [bugprone-use-after-move] listener->handle_stream_event(std::move(event)); ^ streaming/stream_result_future.cc:80:39: note: move occurred here listener->handle_stream_event(std::move(event)); ^ streaming/stream_result_future.cc:80:49: note: the use happens in a later loop iteration than the move listener->handle_stream_event(std::move(event)); ^ ``` Fixes #18332 Closes scylladb/scylladb#18333	2024-04-25 16:48:54 +03:00
Botond Dénes	2c8bd99cd4	Merge 'Coroutinize view_builder::stop()' from Pavel Emelyanov It's pretty straightforward, but prior to that, exception handling needs some care Closes scylladb/scylladb#18378 * github.com:scylladb/scylladb: view-builder: Coroutinize stop() view_builder: Do not try to handle step join exceptions on stop	2024-04-25 16:48:25 +03:00
Kefu Chai	014a069ed2	build: cmake: require {fmt} >= 9.0.0 we are using `fmt::ostream_formatter` which was introduced in {fmt} v9.0.0, see https://github.com/fmtlib/fmt/releases/tag/9.0.0 . before this change, we depend on Seastar to find {fmt}. but the minimal version of {fmt} required by Seastar is 5.0.0, which cannot fulfill the needs to build scylladb. in this change, we find {fmt} package in scylla, and specify the minimal required version of 9.0.0, so the build can fail at the configuration time. {fmt} v8 could be still being used by users. for instance, ubuntu:jammy comes with libfmt-dev 8.1.1. and ubuntu:jammy is EOL in Apr 2027, see https://ubuntu.com/about/release-cycle . Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18386	2024-04-25 16:35:08 +03:00
Amnon Heiman	dfea50a7e9	db/config.cc add metric family config from file Metric family config lets a user configure the metric family aggregate labels. This patch modifies the existing relable-config from file to accept metric family config. Similar to the existing relable_config, it adds a metric_family_configs section. For example, the following configuration demonstrates changing aggregate labels by name and regular expression. ``` metric_family_configs: - name: storage_service aggregate_labels: [shard] - regex: (storage_proxy.*) aggregate_labels: [shard, scheduling_group_name] ``` Signed-off-by: Amnon Heiman <amnon@scylladb.com> Closes scylladb/scylladb#18339	2024-04-25 16:03:39 +03:00
Kefu Chai	e9b31cb4c1	test: locator_topology: s/get0()/get()/ this change addresses the leftover of `9e8805bb49` Refs `9e8805bb49` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18390	2024-04-25 16:03:01 +03:00
Yaron Kaikov	5e63f74984	[github] add action to verify PR tasks was completed Adding another check to the PR summary will make it clear to the maintainer/reviewer if the developer explained about backport	2024-04-25 15:24:22 +03:00
Botond Dénes	aaa76d4c0e	Merge 'Getting per-table snapshot size is racy wrt creating new snapshots' from Pavel Emelyanov The API endpoint in question calls table::get_snapshot_detail() which just walks table/snapshots/ directory. This can clash with creating a new snapshot. Database-wide walk is guarded with snapshot-ctl's locking, so should the per-table API do Closes scylladb/scylladb#18414 * github.com:scylladb/scylladb: snapshot: Get per-table snapshot size under snapshot lock snapshot: Move per-table snap API to other snapshot endpoints	2024-04-25 14:57:52 +03:00
Kefu Chai	e5b30ae2ad	partition_version: do not rereference moved variable in `partition_entry::apply_to_incomplete()`, we pass `dst_snp` and `std::move(dst_snp)` to build the capture variable list of a lambda, but the order of evaluation of these variables are unspecified. fortunately, we haven't run into any issues at this moment. but this is not future-proof. so, let's avoid this by storing a reference of the dereferenced smart pointer, and use it later on. this issue is identified by clang-tidy: ``` /home/kefu/dev/scylladb/mutation/partition_version.cc:500:53: warning: 'dst_snp' used after it was moved [bugprone-use-after-move] 500 \| cur = partition_snapshot_row_cursor(s, dst_snp), \| ^ /home/kefu/dev/scylladb/mutation/partition_version.cc:502:23: note: move occurred here 502 \| dst_snp = std::move(dst_snp), \| ^ /home/kefu/dev/scylladb/mutation/partition_version.cc:500:53: note: the use and move are unsequenced, i.e. there is no guarantee about the order in which they are evaluated 500 \| cur = partition_snapshot_row_cursor(s, dst_snp), \| ^ /home/kefu/dev/scylladb/mutation/partition_version.cc:501:57: warning: 'src_snp' used after it was moved [bugprone-use-after-move] 501 \| src_cur = partition_snapshot_row_cursor(s, src_snp, can_move), \| ^ /home/kefu/dev/scylladb/mutation/partition_version.cc:504:23: note: move occurred here 504 \| src_snp = std::move(src_snp), \| ^ /home/kefu/dev/scylladb/mutation/partition_version.cc:501:57: note: the use and move are unsequenced, i.e. there is no guarantee about the order in which they are evaluated 501 \| src_cur = partition_snapshot_row_cursor(s, *src_snp, can_move), \| ^ ``` Fixes #18360 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18361	2024-04-25 14:57:52 +03:00
Pavel Emelyanov	8aaa09ee97	replica: Do not carry view concurrency semaphore pointer around Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 14:27:43 +03:00
Pavel Emelyanov	2ee7c41139	view: Get concurrency semaphore via database, not table The _view_update_concurrency_sem field on database propagates itself via keyspace config down to table config and view_update_generator then grabs one via table:: helper. That's an overkil, view_update_generator has direct reference on the database and can get this semaphore from there. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 14:25:57 +03:00
Pavel Emelyanov	3d8b572d96	view_update_generator: Mark mutate_MV() private Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 14:25:40 +03:00
Pavel Emelyanov	bc4552740f	view: Move view_update_generator methods' code Now when the two methods belong to another class, move the code itself to db/view , where the class itself resides. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 14:24:20 +03:00
Pavel Emelyanov	c2bf6b43b2	view: Move table::generate_and_propagate_view_updates into view code Similarly to populate_views() method, this one also naturally belongs to view_update_generator class. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 14:20:06 +03:00
Pavel Emelyanov	670c7c925c	view: Move table::populate_views() into view_update_generator class The method in question has little to do with table, effectively it only needs stats and consurrency semaphore. And the semaphore in question is obtained from table indirectly, it really resides on database. On the other hand, the method carries lots of bits from db::view, e.g. the view_update_builder class, memory_usage_of() helper and a bit more. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 14:17:20 +03:00
Kefu Chai	e5bcea6718	docs: drop documents related to {,dclocal_}read_repair_chance since "read_repair_chance" and "dclocal_read_repair_chance" are removed, and not supported anymore. let's stop documenting them. Refs #3502 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-04-25 17:15:27 +08:00
Kefu Chai	c323c93fa4	treewide: remove {dclocal_,}read_repair_chance options dclocal_read_repair_chance and read_repair_chance have been removed in Cassandra 3.11 and 4.x, see https://issues.apache.org/jira/browse/CASSANDRA-13910. if we expose the properties via DDL, Cassandra would fails to consume the CQL statement to creating the table when performing migration from Scylla to Cassandra 4.x, as the latter does not understand these properties anymore. currently the default values of `dc_local_read_repair_chance` and `read_repair_chance` are both "0". so this is practically disabled, unless user deliberately set them to a value greater than 0. also, as a side effect, Cassandra 4.x has better support of Python3. the cqlsh shipped along with Cassandra 3.11.16 only supports python2.7, see https://github.com/apache/cassandra/blob/cassandra-3.11.16/bin/cqlsh.py it errors out if the system only provides python3 with the error of ``` No appropriate python interpreter found. ``` but modern linux systems do not provide python2 anymore. so, in this change, we deprecate these two options. Fixes #3502 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-04-25 17:15:27 +08:00
Botond Dénes	ca26899c36	Merge 'sstable: large data handler needs to count range tombstones as rows' from Ferenc Szili When issuing warnings about partitions with the number of rows above a configured threshold, the large partitions handler does not take into consideration the number of range tombstone markers in the total rows count. This fix adds the number of range tombstone markers to the total number of rows and saves this total in system.large_partitions.rows (if it is above the threshold). It also adds a new column range_tombstones to the system.large_partitions table which only contains the number of range tombstone markers for the given partition. This PR fixes the first part of issue #13968 It does not cover distinguishing between live and dead rows. A subsequent PR will handle that. Closes scylladb/scylladb#18346 * github.com:scylladb/scylladb: sstables: add docs changes for system.large_partitions sstable: large data handler needs to count range tombstones as rows	2024-04-25 11:38:30 +03:00
Pavel Emelyanov	e97abfc473	tablets: Fix indentation after flat-hash-map patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#18364	2024-04-25 11:36:37 +03:00
Kefu Chai	0b5a861961	build: cmake: reference build_mode with ${scylla_build_mode_${CMAKE_BUILD_TYPE}} before this change, if we generate the building system with plain `Ninja`, instead of `Ninja Multi-Config` using cmake, the build fails, because `${scylla_build_mode_${CMAKE_BUILD_TYPE}}` is not defined. so the profile used for building the rust library would be "rust-", which does not match any of the profiles defined by `Cargo.toml`. in this change, we use `$CMAKE_BUILD_TYPE` instead of "$config". as the former is defined for non-multi generator. while the latter is. see https://cmake.org/cmake/help/latest/generator/Ninja%20Multi-Config.html with this change, we are able to generate the building system properly with the "Ninja" generator. if we just want to run some static analyzer against the source tree or just want to build scylladb with a single configuration, the "Ninja" generator is a good fit. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#18353	2024-04-25 10:51:54 +03:00
Pavel Emelyanov	ae4c1c44ec	snapshot: Get per-table snapshot size under snapshot lock Walking per-table snapshot directory without lock is racy. There's snapshot-ctl locking that's used to get db-wide snapshot details, it should be used to get per-table snapshot details too Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 10:05:51 +03:00
Pavel Emelyanov	186b36165e	snapshot: Move per-table snap API to other snapshot endpoints So that they are collected in one place and to facilitate next patch that's going to use snapshot-ctl for per-table API too Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-25 10:05:01 +03:00
Anna Stuchlik	b5d256a991	doc: add Scylla Doctor to the docs This commit adds the description and usage instructions of Scylla Doctor to the "How to Report a ScyllaDB Problem" page. Scylla Doctor replaces Health Check Report, so the description of and references to the latter are removed with this commit. Fixes https://github.com/scylladb/scylladb/issues/16276 Closes scylladb/scylladb#17617	2024-04-25 09:50:38 +03:00
Asias He	037bba0ca1	repair: Turn on off_strategy_updater for tablet repair The off_strategy_updater is used during repair to update the automatic off strategy timer so off_strategy compaction starts automatically only after repair finishes. We still use off_strategy for tablets. So we should still turn on the updater. The update logic is used for vnode tables. We can share the code with vnode table instead of copying, but since there is a possibility we could disable off_strategy for tablets. We'd better postpone the code sharing as follow ups. If later, we decide to disable off_strategy for tablets, we can remove the updater for tablet. Fixes #18196 Closes scylladb/scylladb#18266	2024-04-25 09:03:07 +03:00
Kamil Braun	3363f6e1e8	Merge 'Fix write failures during node replace with same IP with topology over raft' from Gleb Currently a new node is marked as alive too late, after it is already reported as a pending node. The patch series changes replace procedure to be the same as what node_ops do: first stop reporting the IP of the node that is being replaced as a natural replica for writes, then mark the IP is alive, and only after that report the IP as a pending endpoint. Fixes: scylladb/scylladb#17421 * 'gleb/17421-fix-v2' of github.com:scylladb/scylla-dev: test_replace_reuse_ip: add data plane load sync_raft_topology_nodes: make replace procedure similar to nodeops one storage_service: topology_coordinator: fix indentation after previous patch storage_service: topology coordinator: drop ring check in node_state::replacing state	2024-04-24 17:09:01 +02:00
Petr Gusev	bc98774f83	test_replace_reuse_ip: add data plane load In this commit we enhance test_replace_reuse_ip to reproduce #17421. We create a test table and run insert queries on it while the first node is being replaced. In this form the test fails without the fix from the previous commit. Some insert requests fail with [Unavailable exception] "Cannot achieve consistency level for cl QUORUM...".	2024-04-24 16:59:24 +03:00
Gleb Natapov	4614fedd22	sync_raft_topology_nodes: make replace procedure similar to nodeops one In replace-with-same-ip a new node calls gossiper.start_gossiping from join_token_ring with the 'advertise' parameter set to false. This means that this node will fail echo RPC-s from other nodes, making it appear as not alive to them. The node changes this only in storage_service::join_node_response_handler, when the topology coordinator notifies it that it's actually allowed to join the cluster. The node calls _gossiper.advertise_to_nodes({}), and only from this moment other nodes can see it as alive. The problem is that topology coordinator sends this notification in topology::transition_state::join_group0 state. In this state nodes of the cluster already see the new node as pending, they react with calling tmpr->add_replacing_endpoint and update_topology_change_info when they process the corresponding raft notification in sync_raft_topology_nodes. When the new token_metadata is published, assure_sufficient_live_nodes sees the new node in pending_endpoints. All of this happen before the new node handled successful join notification, so it's not alive yet. Suppose we had a cluster with three nodes and we're replacing on them with a fourth node. For cl=qurum assure_sufficient_live_nodes throws if live < need + pending, which in our case becomes 2 < 2 + 1. The end effect is that during replace-with-same-ip data plane requests can fail with unavailable_exception, breaking availability. The patch makes boot procedure more similar to node ops one. It splits the marking of a node as "being replaced" and adding it to pending set in to different steps and marks it as alive in the middle. So when the node is in topology::transition_state::join_group0 state it marked as "being replaced" which means it will no longer be used for reads and writes. Then, in the next state, new node is marked as alive and is added to pending list. fixes scylladb/scylladb#17421	2024-04-24 16:59:22 +03:00
Kamil Braun	1297b9a322	mutation: mutation_by_size_splitter: skip last mutation if it's empty Currently, the last mutation emitted by split_mutation could be empty. It can happen as follows: - consume range tombstone change at pos `1` with some timestamp - consume clustering row at pos `2` - flush: this will create mutation with range tombstone (1, 2) and clustering row at 2 - consume range tombstone change at pos `2` with no timestamp (i.e. closing rtc) - end of partition since the closing rtc has the same position as the clustering row, no additional range tombstone will be emitted -- the only necessary range tombstone was already emitted in the previous mutation. On the other hand, `test_split_mutations` expects all emitted mutations to be non-empty, which is a sane expectation for this function. The test catched a case like this with random-seed=629157129. Fix this by skipping the last mutation if it turns out to be empty. Fixes: scylladb/scylladb#18042 Closes scylladb/scylladb#18375	2024-04-24 16:25:31 +03:00
Raphael S. Carvalho	71682aebdd	storage_service: Fix use-after-move in storage_service::node_ops_cmd_handler ``` service/storage_service.cc:4288:62: warning: 'req' used after it was moved [bugprone-use-after-move] node_ops_insert(ops_uuid, coordinator, std::move(req.ignore_nodes), [this, coordinator, req = std::move(req)] () mutable { ^ service/storage_service.cc:4288:107: note: move occurred here node_ops_insert(ops_uuid, coordinator, std::move(req.ignore_nodes), [this, coordinator, req = std::move(req)] () mutable { ^ service/storage_service.cc:4288:62: note: the use and move are unsequenced, i.e. there is no guarantee about the order in which they are evaluated node_ops_insert(ops_uuid, coordinator, std::move(req.ignore_nodes), [this, coordinator, req = std::move(req)] () mutable { ^ ``` if evaluation order is right-to-left (GCC), req is moved first, and req.ignore_nodes will be empty, so nodes that should be ignored will still be considered, potentially resulting in a failure during replace. https://godbolt.org/z/jPcM6GEx1 courtesy of clang-tidy. Fixes #18324. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#18366	2024-04-24 15:36:28 +03:00
Aleksandra Martyniuk	06f6aaf2cf	test: check default value of tombstone_gc Add a test which checks whether default tombstone_gc value is properly set and if it does not override previous setting.	2024-04-24 10:57:51 +02:00
Aleksandra Martyniuk	e0d498716a	test: topology: move some functions to util.py Move functions marked with asynccontextmanager from test/topology/test_mv.py to test/topology/util.py so that they can be used in other tests.	2024-04-24 10:57:51 +02:00
Aleksandra Martyniuk	58f72f9019	cql3: statements: change default tombstone_gc mode for tablets Currently, if tombstone_gc mode isn't specified for a table, then "timeout" is used by default. With tablets, running "nodetool repair -pr" may miss a tablet if it migrated across the nodes. Then, if we expire tombstones for ranges that weren't repaired, we may get data resurrection. Set default tombstone_gc mode value for DDLs that don't specify it. It's set to "repair" for tables which use tablets unless they use local replication strategy or rf = 1. Otherwise it's set to "timeout".	2024-04-24 10:42:10 +02:00
Kamil Braun	8876b9b0ef	test/pylib: random_tables: use IF NOT EXISTS when creating keyspace Due to Python driver's unexpected behavior, "CREATE KEYSPACE" statement may sometimes get executed twice (scylladb/python-driver#317), leading to "Keyspace ... already exists" error in our tests (scylladb/scylladb#17654). Work around this by using "IF NOT EXISTS". Fixes: scylladb/scylladb#17654 Closes scylladb/scylladb#18368	2024-04-24 10:09:26 +03:00
Pavel Emelyanov	1b1b86809d	view-builder: Coroutinize stop() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-23 20:43:42 +03:00
Pavel Emelyanov	eaf78fca04	view_builder: Do not try to handle step join exceptions on stop Commit `23c891923e` (main: make sure view_builder doesn't propagate semaphore errors) ignored some exceptions that could pop up from the _build_step/do_build_step() serialized action, since they are "benign" on stop. Later there came `b56b10a4bb` (view_builder: do_build_step: handle unexpected exceptions) that plugged any exception from the action in question, regardless of they happen on stop or run-time. Apparently, the latter commit supersedes the former. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-04-23 20:26:14 +03:00
Anna Stuchlik	c0e4f3e646	doc: include OSS-specific info as separate files This commit excludes OSS-specific links and content added in https://github.com/scylladb/scylladb/pull/17624 to separate files and adds the include directive `.. scylladb_include_flag::` to include these files in the doc source files. Reason: Adding the link to the Open Source upgrade guide (/upgrade/upgrade-opensource/upgrade-guide-from-5.4-to-6.0/enable-consistent-topology) breaks the Enterprise documentation because the Enterprise docs don't contain that upgrade guide. We must add separate files for OSS and Enterprise to prevent failing the Enterprise build and breaking the links. Closes scylladb/scylladb#18372	2024-04-23 16:59:05 +02:00
Raphael S. Carvalho	fa2dc5aefa	sstables: Fix use-after-move in an error path of FS-based sstable writer ``` sstables/storage.cc:152:21: warning: 'file_path' used after it was moved [bugprone-use-after-move] remove_file(file_path).get(); ^ sstables/storage.cc:145:64: note: move occurred here auto w = file_writer(output_stream<char>(std::move(sink)), std::move(file_path)); ``` It's a regression when TOC is found for a new sstable, and we try to delete temporary TOC. courtesy of clang-tidy. Fixes #18323. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#18367	2024-04-23 17:19:55 +03:00
Pavel Emelyanov	f5f57dc817	table: No need to open directory in snapshot_exists() In order to check if a snapshot of a certain name exists the checking method opens directory. It can be made with more lightweight call. Also, though not critical, is that it fogets to close it. Coroutinuze the method while at it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#18365	2024-04-23 17:19:24 +03:00

1 2 3 4 5 ...

42348 Commits