scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 22:43:15 +00:00

Author	SHA1	Message	Date
Kefu Chai	8dacec589d	cql3: add fmt::formatter for cql3_type and cql3_type::raw before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, `fmt::formatter<>` is added for following classes: * `cql3::cql3_type` * `cql3::cql3_type::raw` Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17945	2024-03-21 14:08:50 +02:00
Nadav Har'El	fdeb14b468	Merge 'scylla-nodetool: make command-line parsing fully compatible with the legacy nodetool' from Botond Dénes There was two more things missing: * Allow global options to be positioned before the operation/command option (https://github.com/scylladb/scylladb/issues/16695) * Ignore JVM args (https://github.com/scylladb/scylladb/issues/16696) This PR fixes both. With this, hopefully we are fully compatible with nodetool as far as command line parsing is concerned. After this PR goes in, we will need another fix to tools/java/bin/nodetool-wrapper, to allow user to benefit from this fix. Namely, after this PR, we can just try to invoke scylla-nodetool first with all the command-line args as-is. If it returns with exit-code 100, we fall back to nodetool. We will not need the current trick with `--help $1`. In fact, this trick doesn't work currently, because `$1` is not guaranteed to be the command in the first place. In addition to the above, this PR also introduces a new option, to help us in the switching process. This is `--rest-api-port`, which can also be provided as `-Dcom.scylladb.apiPort`. When provided, this option takes precedence over `--port\|-p`. This is intended as a bridge for `scylla-ccm`, which currently provides the JMX port as `--port`. With this change, it can also provided the REST API port as `-Dcom.scylladb.apiPort`. The legacy nodetool will ignore this, while the native nodetool will use it to connect to the correct REST API address. After the switch we can ditch these options. Fixes: https://github.com/scylladb/scylladb/issues/16695 Fixes: https://github.com/scylladb/scylladb/issues/16696 Refs: https://github.com/scylladb/scylladb/issues/16679 Refs: https://github.com/scylladb/scylladb/issues/15588 Closes scylladb/scylladb#17168 * github.com:scylladb/scylladb: tools/scylla-nodetool: add --rest-api-port option tools/scylla-nodetool: ignore JVM args tools/utils: make finding the operation command line option more flexible tools/utils: get_selected_operation(): remove alias param tools: add constant with current help command-line arguments	2024-03-21 14:06:45 +02:00
Pavel Emelyanov	c8fc43d169	test: Update topology_custom/suite::run_first list The recently added test_tablets_migration dominates with it run-time (10 minutes). Also update other tests, e.g. test_read_repair is not in top-7 for any mode, test_replace and test_raft_recovery_majority_loss are both not notably slower than most of other tests (~40 sec both). On the other hand, the test_raft_recovery_basic and test_group0_schema_versioning are both 1+ minute Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#17927	2024-03-21 12:48:50 +01:00
Andrei Chekun	a5455460d8	test: fix flakiness of the multi_dc tests The initial version used a redundant method, and it did not cover all cases. So that leads to the flakiness of the test that used this method. Switching to the cluster_con() method removes flakiness since it's written more robustly. Fixes scylladb/scylladb#17914 Closes scylladb/scylladb#17932	2024-03-21 11:17:22 +01:00
Asias He	9587352f13	repair: Invoke group0 read barrier in repair_tablets This allows the repair master to see all previous metadata changes. Refs #17658 Closes scylladb/scylladb#17942	2024-03-21 10:54:40 +01:00
Kamil Braun	4dfb7e3051	Merge 'storage_service::merge_topology_snapshot: handle big mutations' from Petr Gusev The group0 state machine calls `merge_topology_snapshot` from `transfer_snapshot`. It feeds it with `raft_topology_snapshot` returned from `raft_pull_topology_snapshot`. This snapshot includes the entire `system.cdc_generations_v3` table. It can be huge and break the commitlog `max_record_size` limit. The `system.cdc_generations_v3` is a single-partition table, so all the data is contained in one mutation object. To fit the commitlog limit we split this mutation into many smaller ones and apply them in separate `database::apply` calls. That means we give up the atomicity guarantee, but we actually don't need it for `system.cdc_generations_v3` and `system.topology_requests`. This PR fixes the dtest `update_cluster_layout_tests.py::TestLargeScaleCluster::test_add_many_nodes_under_load` Fixes scylladb/scylladb#17545 Closes scylladb/scylladb#17632 * github.com:scylladb/scylladb: test_cdc_generation_data: test snapshot transfer storage_service::merge_topology_snapshot: handle big cdc_generations_v3 mutations mutation: add split_mutation function storage_service::merge_topology_snapshot: fix indentation	2024-03-21 10:50:03 +01:00
Avi Kivity	628017c810	test: sstables::test_env: mock sstables_registry sstables::test_env is intended for sstable unit tests, but to satisfy its dependency of an sstables_registry we instantiate an entire database. Remove the dependency by having a mock implementation of sstables_registry and using that instead. Closes scylladb/scylladb#17895	2024-03-21 10:19:46 +01:00
Tomasz Grabiec	baf12b0b2f	test: tablets: Avoid infinite loop in rebalance_tablets() If there is a bug in the tablet scheduler which makes it never converge for a given state of topology, rebalance_tablets() will never complete and will generate a huge amounts of logs. This patch adds a sanity limit so that we fail earlier. This was observed in one of the test_load_balancing_with_random_load runs in CI. Fixes scylladb/scylladb#17894. Closes scylladb/scylladb#17916	2024-03-21 10:19:46 +01:00
Kamil Braun	bc42a5a092	Merge 'make sure that address map entry is not dropped between join request placement and the request handling' from Gleb The series marks nodes to be non expiring in the address map earlier, when they are placed in the topology. Fixes: scylladb/scylladb#16849 * 'gleb/16849-fix-v2' of github.com:scylladb/scylla-dev: test: add test to check that address cannot expire between join request placemen and its processing topology_coordinator: set address map entry to nonexpiring when a node is added to the topology raft_group0: add modifiable_address_map() function	2024-03-21 10:19:46 +01:00
Kamil Braun	676af581d8	Merge 'cdc: should_propose_first_generation: get my_host_id from caller' from Benny Halevy There is no need to map this node's inet_address to host_id. The storage_service can easily just pass the local host_id. While at it, get the other node's host_id directly from their endpoint_state instead of looking it up yet again in the gossiper, using the nodes' address. Refs #12283 Closes scylladb/scylladb#17919 * github.com:scylladb/scylladb: cdc: should_propose_first_generation: get my_host_id from caller storage_service: add my_host_id	2024-03-21 10:19:46 +01:00
Avi Kivity	43bcaeb87f	Merge 'test: randomized_nemesis_test: add fmt::formatter for some types' from Kefu Chai before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * raft_call * raft_read * network_majority_grudge * reconfiguration * stop_crash * operation::thread_id * append_seq * AppendReg::append * AppendReg::ret * operation::either_of<Ops...> * operation::exceptional_result<Op> * operation::completion<Op> * operation::invocable<Op> and drop their operator<<:s. in which, * `operator<<` for append_entry is never used. so it is removed. * `operator<<` for `std::monostate` and `std::variant` are dropped. as we are now using their counterparts in {fmt}. * stop_crash::result_type 's `fmt::formatter` is not added, as we cannot define a partial specialization of `fmt::formatter` for a nested class for a template class. we will tackle this struct in another change. Refs #13245 Closes scylladb/scylladb#17884 * github.com:scylladb/scylladb: test: raft: generator: add fmt::formatter:s test: randomized_nemesis_test: add fmt::formatter for some types test: randomized_nemesis_test: add fmt::formatter for seastar::timed_out_error raft: add fmt::formatter for error classes	2024-03-21 10:19:46 +01:00
Petr Gusev	740b240e9d	test_cdc_generation_data: test snapshot transfer The test only looked at the initial cdc_generation generation. It made the changes bigger to go past the raft max_command_size limit. It then made sure this large mutation set is saved in several raft commands. In this commit we enhance the test to check that the mutations are properly handled during snapshot transfer. The problem is that the entire system.cdc_generations_v3 table is read into the topology_snapshot and it's total size can exceed the commitlog max_record_size limit. We need a separate injection since the compaction could nullify the effects of the previous injection. The test fails without the fix from the previous commit.	2024-03-20 22:40:03 +04:00
Petr Gusev	276d58114d	storage_service::merge_topology_snapshot: handle big cdc_generations_v3 mutations The group0 state machine calls merge_topology_snapshot from transfer_snapshot. It feeds it with raft_topology_snapshot returned from raft_pull_topology_snapshot. This snapshot includes the entire system.cdc_generations_v3 table. It can be huge and break the commitlog max_record_size limit. The system.cdc_generations_v3 is a single-partition table, so all the data is contained in one mutation object. To fit the commitlog limit we split this mutation into several smaller ones and apply them in separate database::apply calls. That means we give up the atomicity guarantee, but we actually don't need it for system.cdc_generations_v3. The cdc_generations_v3 data is not used in any way until it's referenced from the topology table. By applying the cdc_generations_v3 mutations before topology mutations we ensure that the lack of atomicity isn't a problem here. The database::apply method takes frozen_mutation parameter by const reference, so we need to keep them alive until all the futures are complete. fixes #17545	2024-03-20 22:40:03 +04:00
Petr Gusev	db1afa0aba	mutation: add split_mutation function The function splits the source mutation into multiple mutations so that their size does not exceed the max_size limit. The size of a mutation is calculated as the sum of the memory_usage() of its constituent mutation_fragments. The implementation is taken from view_updating_consumer. We use mutation_rebuilder_v2 to reconstruct mutations from a stream of mutation fragments and recreate the output mutation whenever we reach the limit. We'll need this function in the next commit.	2024-03-20 22:39:51 +04:00
Petr Gusev	d07e0efdd8	storage_service::merge_topology_snapshot: fix indentation It was three spaces, should be four.	2024-03-20 22:30:48 +04:00
Kefu Chai	61424b615c	test: raft: generator: add fmt::formatter:s before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * operation::either_of<Ops...> * operation::exceptional_result<Op> * operation::completion<Op> * operation::invocable<Op> and drop their operator<<:s. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-20 21:01:29 +08:00
Kefu Chai	72899f573e	test: randomized_nemesis_test: add fmt::formatter for some types before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * raft_call * raft_read * network_majority_grudge * reconfiguration * stop_crash * operation::thread_id * append_seq * append_entry * AppendReg::append * AppendReg::ret and drop their operator<<:s. in which, * `operator<<` for `std::monostate` and `std::variant` are dropped. as we are now using their counterparts in {fmt}. * stop_crash::result_type 's `fmt::formatter` is not added, as we cannot define a partial specialization of `fmt::formatter` for a nested class for a template class. we will tackle this struct in another change. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-20 21:01:29 +08:00
Kefu Chai	97b203b1af	test: randomized_nemesis_test: add fmt::formatter for seastar::timed_out_error before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatter for `seastar::timed_out_error`, which will be used by the `fmt::formatter` for `std::variant<...>`. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-20 21:01:29 +08:00
Kefu Chai	50637964ed	raft: add fmt::formatter for error classes before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatter for classes derived from `raft::error`. since {fmt} v10 defines the formatter for all classes derived from `std::exception`, the definition is provided only when the tree is compiled with {fmt} < 10. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-20 21:01:29 +08:00
Pavel Emelyanov	21a5911e60	Merge 'db/virtual_tables: make token_ring_table tablet aware' from Botond Dénes The token ring table is a virtual table (`system.token_ring`), which contains the ring information for all keyspaces in the system. This is essentially an alternative to `nodetool describering`, but since it is a virtual table, it allows for all the usual filtering/aggregation/etc. that CQL supports. Up until now, this table only supported keyspaces which use vnodes. This PR adds support for tablet keyspaces. To accommodate these keyspaces a new `table_name` column is added, which is set to `ALL` for vnodes keyspaces. For tablet keyspaces, this contains the name of the table. Simple sanity tests are added for this virtual table (it had none). Fixes: #16850 Closes scylladb/scylladb#17351 * github.com:scylladb/scylladb: test/cql-pytest: test_virtual_tables: add test for token_ring table db/virtual_tables: token_ring_table: add tablet support db/virtual_tables: token_ring_table: add table_name column db/virtual_tables: token_ring_table: extract ring emit service/storage_service: describe_ring_for_table(): use topology to map hostid to ip	2024-03-20 14:05:49 +03:00
Benny Halevy	fceb1183d3	cdc: should_propose_first_generation: get my_host_id from caller There is no need to map this node's inet_address to host_id. The storage_service can easily just pass the local host_id. While at it, get the other node's host_id directly from their endpoint_state instead of looking it up yet again in the gossiper, using the nodes' address. Refs #12283 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-03-20 12:53:49 +02:00
Benny Halevy	37adcd3ecf	storage_service: add my_host_id Shorthand for getting this node's host_id from token_metadata.topology, similar to the `get_broadcast_address` helper. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-03-20 12:53:49 +02:00
Mikołaj Grzebieluch	b4144d14c6	test.py: adjust the test for topology upgrade to write to and read from CDC tables In topology on raft, management of CDC generations is moved to the topology coordinator. We need to verify that the CDC keeps working correctly during the upgrade for topology on the raft. A similar change will be made in the topology recovery test. It will reuse the `start_writes_to_cdc_table` function. Ref #17409 Closes scylladb/scylladb#17828	2024-03-20 11:15:02 +01:00
Yaron Kaikov	d859067486	[action sync labels] improve pr search when labeling an issue This PR contains few fixes and improvment seen during https://github.com/scylladb/scylladb/issues/15902 label addtion When we add a label to an issue, we go through all PR. 1) Setting PR base to `master` (release PR are not relevant) 2) Since for each Issue we have only one PR, ending the search after a match was found 3) Make sure to skip PR with empty body (mainly debug one) 4) Set backport label prefix to `backport/` Closes scylladb/scylladb#17912	2024-03-20 12:14:42 +02:00
David Garcia	559dc9bb27	docs: Implement relative link support for configuration properties Introduces relative link support for individual properties listed on the configuration properties page. For instance, to link to a property from a different document, use the syntax :ref:`memtable_flush_static_shares <confprop_memtable_flush_static_shares>`. Additionally, it also adds support for linking groups. For example, :ref:`Ungrouped properties <confgroup_ungrouped_properties>`. Closes scylladb/scylladb#17753	2024-03-20 11:39:30 +02:00
Gleb Natapov	2b11842cb4	test: add test to check that address cannot expire between join request placemen and its processing	2024-03-20 11:05:31 +02:00
Kefu Chai	2479328e3b	Update seastar submodule > Revert "build: do not provide zlib as an ingredient" > Fix reference to sstring type in tutorial about concurrency in coroutines > Merge 'Adding a Metrics tester app' from Amnon Heiman > cooking.sh: do not quote backtick in here document Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17887	2024-03-20 09:18:35 +02:00
Kefu Chai	432c000dfa	./: not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17888	2024-03-20 09:16:46 +02:00
Raphael S. Carvalho	6115c113fe	sstables_loader: Don't discard sstable that is not fully exhausted Affects load-and-stream for tablets only. The intention is that only this loop is responsible for detecting exhausted sstables and then discarding them for next iterations: while (sstable_it != _sstables.rend() && exhausted(*sstable_it)) { sstable_it++; } But the loop which consumes non exhausted sstables, on behalf of each tablet, was incorrectly advancing the iterator, despite the sstable wasn't considered exhausted. Fixes #17733. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#17899	2024-03-20 09:11:59 +02:00
Yaron Kaikov	0cbe5f1aa8	[action] add Fixes validation in backport PR When we open a backport PR we should make sure the patch contains a ref to the issue it suppose to fix in order to make sure we have more accurate backport information This action will only be triggered when base branch is `branch-*` If `Fixes` are missing, this action will fail and notify the author. Ref: https://github.com/scylladb/scylla-pkg/issues/3539 Closes scylladb/scylladb#17897	2024-03-20 08:55:36 +02:00
Nadav Har'El	8df2ea3f95	cql: don't crash when creating a view during a truncate The test dtest materialized_views_test.py::TestMaterializedViews:: test_mv_populating_from_existing_data_during_truncate reproduces an assertion failure, and crash, while doing a CREATE MATERIALIZED VIEW during a TRUNCATE operation. This patch fixes the crash by removing the assert() call for a view (replacing it by a warning message) - we'll explain below why this is fine. Also for base tables change we change the assertion to an on_internal_error (Refs #7871). This makes the test stop crashing Scylla, but it still fails due to issue #17635. Let's explain the crash, and the fix: The test starts TRUNCATE on table that doesn't yet have a view. truncate_table_on_all_shards() begins by disabling compaction on the table and all its views (of which there are none, at this point). At this point, the test creates a new view is on this table. The new view has, by default, compaction enabled. Later, TRUNCATE calls discard_sstables() on this new view, asserts that it has compaction disabled - and this assertion fails. The fix in this patch is to not do the assert() for views. In other words, we acknowledge that in this use case, the view will have compactions enabled while being truncated. I claim that this is "good enough", if we remember why we disable compaction in the first place: It's important to disable compaction while truncating because truncating during compaction can lead us to data resurection when the old sstable is deleted during truncation but the result of the compaction is written back. True, this can now happen in a new view (a view created DURING the truncation). But I claim that worse things can happen for this new view: Notably, we may truncate a view and then the ongoing view building (which happens in a new view) might copy data from the base to the view and only then truncate the base - ending up with an empty base and non-empty view. This problem - issue #17635 - is more likely, and more serious, than the compaction problem, so will need to be solved in a separate patch. Fixes #17543. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#17634	2024-03-20 08:54:39 +02:00
Raphael S. Carvalho	d5a5005afa	sstables: Fix clone semantics for runs in partitioned_sstable_set When a sstable set is cloned, we don't want a change in cloned set propagating to the former one. It happens today with partitioned_sstable_set::_all_runs, because sets are sharing ownership of runs, which is wrong. Let's not violate clone semantics by copying all_runs when cloning. Doesn't affect data correctness as readers work directly with sstables, which are properly cloned. Can result in a crash in ICS when it is estimating pending tasks, but should be very rare in practice. Fixes #17878. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes scylladb/scylladb#17879	2024-03-20 08:41:32 +02:00
Botond Dénes	c2425ca135	tools/scylla-nodetool: add --rest-api-port option This option is an alternative to --port\|-p and takes precedence over it. This is meant to aid the switch from the legacy nodetool to the native one. Users of the legacy nodetool pass the port of JMX to --port. We need a way to provide both the JMX port (via --port) and also the REST API port, which only the native nodetool will interpret. So we add this new --rest-api-port, which when provided, overwrites the --port\|-p option. To ensure the legacy nodeotol doesn't try to interpret this, this option can also be provided as -Dcom.scylladb.apiPort (which is substituted to --rest-api-port behind the scenes).	2024-03-20 02:11:47 -04:00
Botond Dénes	a85ec6fc60	tools/scylla-nodetool: ignore JVM args Legacy scripts and tests for nodetool, might pass JVM args like -Dcom.sun.jndi.rmiURLParsing=legacy. Ignore these, by dropping anything that starts with -D from the command line args.	2024-03-20 02:11:47 -04:00
Botond Dénes	12516b0861	tools/utils: make finding the operation command line option more flexible Currently all scylla-tools assume that the operation/command is in argv[1]. This is not very flexible, because most programs allow global options (that are not dependent on the current operation/command) to be passed before the operation name on the command line. Notably C*'s nodetool is one such program and indeed scripts and tests using nodetool do utilize this. This patch makes this more flexible. Instead of looking at argv[1], do an initial option parsing with boost::program_options to locate the operation parameter. This initial parser knows about the global options, and the operation positional argument. It allows for unrecognized positional and non-positional arguments, but only after the command. With this, any combination of global options + operation is allowed, in any order.	2024-03-20 02:11:47 -04:00
Botond Dénes	7ae98c586a	tools/utils: get_selected_operation(): remove alias param This method has a single caller, who always passes "operation". Just hard-code this into the method, no need to keep a param for it.	2024-03-20 02:11:47 -04:00
Botond Dénes	28e7eecf0b	tools: add constant with current help command-line arguments Unfortunately, we have code in scylla-nodetool.cc which needs to know what are the current help options available. Soon, there will be more code like this in tools/utils.cc, so centralize this list in a const static tool_app_template member.	2024-03-20 02:11:47 -04:00
Andrei Chekun	b52f79b1ce	Fix leaking file descriptors in test.py Fixes #17569 Tests are not closing file descriptor after it finishes. This leads to inability to continue tests since the default value for opened files in Linux is 1024. Issue easy to reproduce with the next command: ``` $ ./test.py --mode debug test_native_transport --repeat 1500 ``` After fix applied all tests are passed with a next command: ``` $ ./test.py --mode debug test_native_transport --repeat 10000 ``` Closes scylladb/scylladb#17798	2024-03-19 14:59:14 +01:00
Piotr Dulikowski	70cb1dc8fe	doc: describe upgrade and recovery for raft topology Document the manual upgrade procedure that is required to enable consistent cluster management in clusters that were upgraded from an older version to ScyllaDB Open Source 6.0. This instruction is placed in previously placeholder "Enable Raft-based Topology" page which is a part of the upgrade instructions to ScyllaDB Open Source 6.0. Add references to the new description in the "Raft Consensus Algorithm in ScyllaDB" document in relevant places. Extend the "Handling Node Failures" document so that it mentions steps required during recovery of a ScyllaDB cluster running version 6.0. Fixes: scylladb/scylladb#17341 Closes scylladb/scylladb#17624	2024-03-19 14:59:14 +01:00
Gleb Natapov	fde3068530	topology_coordinator: set address map entry to nonexpiring when a node is added to the topology Currently a node's address is set to nonexpiring in the address map when the node is added to group0, but the node is added to the topology earlier (during the join request) and the cluster must be able to communicate with it (potentially) much later when the request will be processed. The patch marks nodes that are in the topology, but no yet in group0 as non expiring, so they will not be dropped from address map until their join request is processed. Fixes: scylladb/scylladb#16849	2024-03-19 13:35:19 +02:00
Gleb Natapov	9651ae875f	raft_group0: add modifiable_address_map() function Provide access to non const address_map. We will need it later.	2024-03-19 13:34:41 +02:00
Yaron Kaikov	ad76f0325e	[action] Sync labels from an Issue to linked PR After merging https://github.com/scylladb/scylladb/pull/17365, all backport labels should be added to PR (before we used to add backport labels to the issues). Adding a GitHub action which will be triggered in the following conditions only: 1) The base branch is `master` or `next` 2) Pull request events: - opened: For every new PR that someone opens, we will sync all labels from the linked issue (if available) - labeled: This role only applies to labels with the `backport/` prefix. When we add a new label for the backport we will update the relevant issue or PR to get them both to sync - unlabeled: Same as `labeled` only applies to the `backport/` prefix. When we remove a label for backport we will update the relevant issue or pr Closes scylladb/scylladb#17715	2024-03-19 09:17:07 +02:00
Avi Kivity	e48eb76f61	sstables_manager: decouple from system_keyspace sstables_manager now depends on system_keyspace for access to the system.sstables table, needed by object storage. This violates modularity, since sstables_manager is a relatively low-level leaf module while system_keyspace integrates large parts of the system (including, indirectly, sstables_manager). One area where this is grating is sstables::test_env, which has to include the much higher level cql_test_env to accommodate it. Fix this by having sstables_manager expose its dependency on system_keyspace as an interface, sstables_registry, and have system_keyspace implement the glue logic in system_keyspace_sstables_manager. Closes scylladb/scylladb#17868	2024-03-18 20:38:07 +03:00
Anna Stuchlik	a13694daea	doc: fix the image upgrade page This commit updates the Upgrade ScyllaDB Image page. - It removes the incorrect information that updating underlying OS packages is mandatory. - It adds information about the extended procedure for non-official images. Closes scylladb/scylladb#17867	2024-03-18 18:27:46 +02:00
Gleb Natapov	af218d0063	raft_group0_client: assert that hold_read_apply_mutex is called on shard 0 group0 operations a valid on shard 0 only. Assert that. We already do that in the version of the function that gets abort source. Message-ID: <ZeCti70vrd7UFNim@scylladb.com>	2024-03-18 16:20:41 +01:00
Pavel Emelyanov	a8f48e0f6b	test/boost/tablets: Use verbose BOOST_REQUIRE checkers Lot's of BOOST_REQUIRES in this test require some integers to be in some eq/gt/le relations to each other. And one place that compares rack names as strings. Using more verbose boost checkers is preferred in such cases Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#17866	2024-03-18 17:09:02 +02:00
Botond Dénes	270d01f16a	Merge 'build: cmake: put server deb packages under build/dist/$<CONFIG>/debian' from Kefu Chai this change is a follow up of `ca7f7bf8e2`, which changed the output path to build/$<CONFIG>/debian. but what dist/docker/debian/build_docker.sh expects is `build/dist/$config/debian/.deb`, where `$config` is the normalized mode, when the debian packages are built using CMake generated rules, `$mode` is CMake configuration name, i.e., `$<CONFIG>`. so, `ca7f7bf8e2` made a mistake, as it does not match the expectation of `build_docker.sh`. in this change, this issue is addressed. so we use the same path in both `dist/CMakeLists.txt` and `dist/docker/debian/build_docker.sh`. Closes scylladb/scylladb#17848 github.com:scylladb/scylladb: build: cmake: add dist-* targets to the default build target build: cmake: put server deb packages under build/dist/$<CONFIG>/debian	2024-03-18 16:18:35 +02:00
Avi Kivity	72bbe75d5b	Merge 'Fix node replace with tablets for RF=N' from Tomasz Grabiec This PR fixes a problem with replacing a node with tablets when RF=N. Currently, this will fail because tablet replica allocation for rebuild will not be able to find a viable destination, as the replacing node is not considered to be a candidate. It cannot be a candidate because replace rolls back on failure and we cannot roll back after tablets were migrated. The solution taken here is to not drain tablet replicas from replaced node during topology request but leave it to happen later after the replaced node is in left state and replacing node is in normal state. The replacing node waits for this draining to be complete on boot before the node is considered booted. Fixes https://github.com/scylladb/scylladb/issues/17025 Nodes in the left state will be kept in tablet replica sets for a while after node replace is done, until the new replica is rebuilt. So we need to know about those node's location (dc, rack) for two reasons: 1) algorithms which work with replica sets filter nodes based on their location. For example materialized views code which pairs base replicas with view replicas filters by datacenter first. 2) tablet scheduler needs to identify each node's location in order to make decisions about new replica placement. It's ok to not know the IP, and we don't keep it. Those nodes will not be present in the IP-based replica sets, e.g. those returned by get_natural_endpoints(), only in host_id-based replica sets. storage_proxy request coordination is not affected. Nodes in the left state are still not present in token ring, and not considered to be members of the ring (datacanter endpoints excludes them). In the future we could make the change even more transparent by only loading locator::node* for those nodes and keeping node* in tablet replica sets. Currently left nodes are never removed from topology, so will accumulate in memory. We could garbage-collect them from topology coordinator if a left node is absent in any replica set. That means we need a new state - left_for_real. Closes scylladb/scylladb#17388 * github.com:scylladb/scylladb: test: py: Add test for view replica pairing after replace raft, api: Add RESTful API to query current leader of a raft group test: test_tablets_removenode: Verify replacing when there is no spare node doc: topology-on-raft: Document replace behavior with tablets tablets, raft topology: Rebuild tablets after replacing node is normal tablets: load_balancer: Access node attributes via node struct tablets: load_balancer: Extract ensure_node() mv: Switch to using host_id-based replica set effective_replication_map: Introduce host_id-based get_replicas() raft topology: Keep nodes in the left state to topology tablets: Introduce read_required_hosts()	2024-03-18 16:16:08 +02:00
Kefu Chai	d1c35f943d	test: unit: add fmt::formatter for test_data in tests before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * test_data in two different tests * row_cache_stress_test::reader_id and drop its operator<<. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17861	2024-03-18 15:35:28 +02:00
Kefu Chai	de6803de92	build: cmake: use --ld-path for specifying linker for clang Clang > 12 starts to complain like ``` warning: '-fuse-ld=' taking a path is deprecated; use '--ld-path=' instead [-Wfuse-ld-path]' ``` this option is not supported by GCC yet. also instead of using the generic driver's name, use the specific name. otherwise ld fails like ``` lld is a generic driver. Invoke ld.lld (Unix), ld64.lld (macOS), lld-link (Windows), wasm-ld (WebAssembly) instead ``` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17825	2024-03-18 14:49:11 +02:00

1 2 3 4 5 ...

41919 Commits