scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 06:05:53 +00:00

Author	SHA1	Message	Date
Kefu Chai	39ee8593cb	repair: add fmt::formatter for repair_hash before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for repair_hash. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-09 23:41:58 +08:00
Botond Dénes	9f97d21339	Merge 'Enhance perf-simple-query test' from Pavel Emelyanov While measuring #17149 with this test some changes were applied, here they are - keep initial_tablets number in output json's parameters section - disable auto compaction - add control over the amount of sstables generated for --bypass-cache case Closes scylladb/scylladb#17473 * github.com:scylladb/scylladb: perf_simple_query: Add --memtable-partitions option perf_simple_query: Disable auto compaction perf_simple_query: Keep number of initial tablets in output json	2024-03-08 15:21:04 +02:00
Kefu Chai	079d70145e	raft: add fmt::formatter for raft tracker types before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * raft::election_tracker * raft::votes * raft::vote_result and drop their operator<<:s. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17670	2024-03-08 15:19:37 +02:00
Piotr Smaroń	44bbf2e57b	test.py: improve readability of failures resulting in empty XML Before the change, when a test failed because of some error in the `cql_test_env.cc`, we were getting: ``` error: boost/virtual_table_test: failed to parse XML output '/home/piotrs/src/scylla2/testlog/debug/xml/boost.virtual_table_test.test_system_config_table_read.1.xunit.xml': no element found: line 1, column 0 ``` After the change we're getting: ``` error: boost/virtual_table_test: Empty testcase XML output, possibly caused by a crash in the cql_test_env.cc, details: '/home/piotrs/src/scylla2/testlog/debug/xml/boost.virtual_table_test.test_system_config_table_read.1.xunit.xml': no element found: line 1, column 0 ``` Closes scylladb/scylladb#17679	2024-03-08 15:17:12 +02:00
Kefu Chai	362a8a777c	partition_snapshot_row_cursor: add fmt::format to this class before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `partition_snapshot_row_cursor`, and drop its operator<<. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17669	2024-03-08 15:15:43 +02:00
Botond Dénes	630be97d2f	Merge 'tools/scylla-nodetool: print hostname if --resolve-ip is passed to "ring"' from Kefu Chai before this change, "ring" subcommand has two issues: 1. `--resolve-ip` option accepts a boolean argument, but this option should be a switch, which does not accept any argument at all 2. it always prints the endpoint no matter if `--resolve-ip` is specified or not. but it should print the resolved name, instead of an IP address if `--resolve-ip` is specified. in this change, both issues are addressed. and the test is updated accordingly to exercise the case where `--resolve-ip` is used. Closes scylladb/scylladb#17553 * github.com:scylladb/scylladb: tools/scylla-nodetool: print hostname if --resolve-ip is passed to "ring" test/nodetool: calc max_width from all_hosts test/nodetool: keep tokens as Host's member test/nodetool: remove unused import	2024-03-08 15:15:19 +02:00
Pavel Emelyanov	fc9fb03b90	cql3: Remove unused cf_name::operator<< Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#17686	2024-03-08 15:14:52 +02:00
Nadav Har'El	ba585905e5	Update tools/java submodule * tools/java 5e11ed17...e4878ae7 (2): > nodetool: fix a typo in error message > bin/cassandra-stress: Add extended version info Closes scylladb/scylladb#17680	2024-03-08 15:14:21 +02:00
Kefu Chai	f5f5ff1d51	clustering_interval_set: add fmt::formatter for clustering_interval_set before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `clustering_interval_set` their operator<<:s are dropped Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17593	2024-03-08 15:13:14 +02:00
Kefu Chai	9b5ec53355	tombstone_gc_options: add fmt::formatter for tombstone_gc_mode before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `tombstone_gc_mode`, and drop its operator<<. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17673	2024-03-08 15:12:00 +02:00
Kefu Chai	8ca672a02c	test/pylib: return better error if self.create_server() raises in `ScyllaServer::add_server()`, `self.create_server()` is called to create a server, but if it raises, we would reference a local variable of `server` which is not bound to any value, as `server` is not assigned at that moment. if `ScyllaServer` is used by `ScyllaClusterManager`, we would not be able to see the real exception apart from the error like ``` cannot access local variable 'server' where it is not associated with a value ``` which is but the error from Python runtime. in this change, `server` is always initialized, and we check for None, before dereference it. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17693	2024-03-08 15:10:27 +02:00
Kefu Chai	70ef7e63b5	tools: toolchain: prepare: do not bail out when checking for command before this change, if `buildah` is not available in $PATH, this script fails like: ```console $ tools/toolchain/prepare --help tools/toolchain/prepare: line 3: buildah: command not found ``` the error message never gets a chance to show up. as `set -e` in the shebang line just let bash quit. after this change, we check for the existence of buildah, and bail out if it is not available. so, on a machine without buildah around, we now have: ```console $ tools/toolchain/prepare --help install buildah 1.19.3 or later ``` the same applies to "reg". Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17697	2024-03-08 15:09:21 +02:00
Botond Dénes	05307d0be9	Merge 'service: add fmt::formatter for service types' from Kefu Chai before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * service::fencing_token * service::topology::transition_state * service::node_state * service::topology_request * service::global_topology_request * service::raft_topology_cmd::command * service::paxos::proposal * service::paxos::promise Refs https://github.com/scylladb/scylladb/issues/13245 Closes scylladb/scylladb#17692 * github.com:scylladb/scylladb: service/paxos: add fmt::formatter for paxos::promise service/paxos: add fmt::formatter for paxos::proposal service: add fmt::formatter for topology_state_machine types	2024-03-08 15:06:07 +02:00
Botond Dénes	505f137cc9	Merge 'Make object_store suite use ManagerClient' from Pavel Emelyanov The test cases in this suite need to start scylla with custom config options, restart it and call API on it. By the time the suite was created all this wasn't possible with any library facility, so the suite carries its version of managed_cluster class that piggy-backs cql-pytest scylla starting. Now test.py has pretty flexible manager that provides all the scylla cluster management object_store suite needs. This PR makes the suite use the manager client instead of the home-brew managed_cluster thing refs: #16006 fixes: #16268 Closes scylladb/scylladb#17292 * github.com:scylladb/scylladb: test/object_store: Remove unused managed_cluster (and other stuff) test/object_store: Use tmpdir fixture in flush-retry case test/object_store: Turn flush-retry case to use ManagerClient test/object_store: Turn "misconfigured" case to use ManagerClient test/object_store: Turn garbage-collect case to use ManagerClient test/object_store: Turn basic case to use ManagerClient test/object_store: Prepare to work with ManagerClient	2024-03-08 15:04:46 +02:00
Tomasz Grabiec	85ae10f632	Merge 'Make it possible to run individual pytest cases with test.py' from Pavel Emelyanov Today's test.py allows filtering tests to run with the `test.py --options name` syntax. The "name" argument is then considered to be some prefix, and when iterating tests only those whose name starts with that prefix are collected and executed. This has two troubles. Minor: since it is prefix filtering, running e.g. topology_custom/test_tablets will run test_tablets _and_ test_tablets_migration from it. There's no way to exclude the latter from this selection. It's not common, but careful file names selection is welcome for better ~~user~~ testing experience. Major: most of test files in topology and python suites contain many cases, some are extremely long. When the intent is to run a single, potentially fast, test case one needs to either wait or patch the test .py file by hand to somehow exclude unwanted test cases. This PR adds the ability to run individual test case with test.py. The new syntax is `test.py --options name::case`. If the "::case" part is present two changes apply. First, the test file selection is done by name match, not by prefix match. So running topology_custom/test_tablets will _not_ select test_tablets_migration from it. Second, the "::case" part is appended to the pytest execution so that it collects and runs only the specified test case. Closes scylladb/scylladb#17481 * github.com:scylladb/scylladb: test.py: Add test-case splitting in 'name' selection test.py: Add casename argument to PythonTest	2024-03-08 12:56:39 +01:00
Kamil Braun	ae954fb2ec	test: unflake test_tablets_removenode These tests are inserting data into RF=3 tables, but used the default consistency level which is taken from the default execution profile which is set to LOCAL_QUORUM. The tests would then read with CL=ONE, so we cannot give a guarantee that some of the data won't be missed. Fix this by inserting the data with CL=ALL. (Do it for all RF cases for simplicity.) Fixes scylladb/scylladb#17695 Closes scylladb/scylladb#17700	2024-03-08 12:47:47 +01:00
Benny Halevy	8456967012	tablets: read_tablet_mutations: unfreeze_gently Use co_await unfreeze_gently in the loop body unfreezing each partition mutation to prevent reactor stalls when building group0 snapshot with lots of tablets. Fixes #15303 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes scylladb/scylladb#17688	2024-03-08 10:52:39 +01:00
Yaron Kaikov	ad842e5ad7	[mergify] Fix wrong label and base branch for backport pr This PR contains 2 fixes for mergify config file: 1) When opening a backport PR base branch should be `branch-x.y` 2) Once a commit is promoted, we should add the label `promoted-to-master`, in 5.4 configuration we were using the wrong label. fixing it Closes scylladb/scylladb#17698	2024-03-08 10:08:09 +01:00
Kamil Braun	76fb902858	test: unflake test_topology_remove_garbage_group0 The test is booting nodes, and then immediately starts shutting down nodes and removing them from the cluster. The shutting down and removing may happen before driver manages to connect to all nodes in the cluster. In particular, the driver didn't yet connect to the last bootstrapped node. Or it can even happen that the driver has connected, but the control connection is established to the first node, and the driver fetched topology from the first node when the first node didn't yet consider the last node to be normal. So the driver decides to close connection to the last node like this: ``` 22:34:03.159 DEBUG> [control connection] Removing host not found in peers metadata: <Host: 127.42.90.14:9042 datacenter1> ``` Eventually, at the end of the test, only the last node remains, all other nodes have been removed or stopped. But the driver does not have a connection to that last node. Fix this problem by ensuring that: - all nodes see each other as NORMAL, - the driver has connected to all nodes at the beginning of the test, before we start shutting down and removing nodes. Fixes scylladb/scylladb#16373 Closes scylladb/scylladb#17676	2024-03-08 10:08:09 +01:00
Mikołaj Grzebieluch	a0915115c3	maintenance_socket: change log message to differentiate from regular CQL ports Scylla-ccm uses function `wait_for_binary_interface` that waits for scylla logs to print "Starting listening for CQL clients". If this log is printed far before the regular cql_controller is initialized, scylla-ccm assumes too early that node is initialized. It can result in timeouts that throw errors, for example in the function `watch_rest_for_alive`. Closes scylladb/scylladb#17496	2024-03-08 10:08:09 +01:00
Nadav Har'El	ea53db379f	Merge 'tools/scylla-nodetool: listsnapshot: make it compatible with origin' from Botond Dénes The following incompatibilities were identified by `listsnapshots_test.py` in dtests: * Command doesn't bail out when there are no snapshots, instead it prints meaningless empty report * Formatting is incompatible Both are fixed in this mini-series. Closes scylladb/scylladb#17541 * github.com:scylladb/scylladb: tools/scylla-nodetool: listsnapshots: make the formatting compatible with origin's tools/scylla-nodetool: listsnapshots: bail out if there are no snapshots	2024-03-08 10:08:09 +01:00
Kefu Chai	185b503b73	service/paxos: add fmt::formatter for paxos::promise before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `service::paxos::promise`, and drop its operator<<. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-08 14:26:58 +08:00
Kefu Chai	cb6c7bb9bf	service/paxos: add fmt::formatter for paxos::proposal before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for `service::paxos::proposal`, but its operator<< is preserved, as it is still used by our generic formatter for std::tuple<> which uses operator<< for printing the elements in it, so operator<< of this class is indirectly used. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-08 14:26:58 +08:00
Kefu Chai	14cb48eb0a	service: add fmt::formatter for topology_state_machine types before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * service::fencing_token * service::topology::transition_state * service::node_state * service::topology_request * service::global_topology_request * service::raft_topology_cmd::command Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-08 14:05:45 +08:00
Kefu Chai	de276901f2	tools/scylla-nodetool: print hostname if --resolve-ip is passed to "ring" before this change, "ring" subcommand has two issues: 1. `--resolve-ip` option accepts a boolean argument, but this option should be a switch, which does not accept any argument at all 2. it always prints the endpoint no matter if `--resolve-ip` is specified or not. but it should print the resolved name, instead of an IP address if `--resolve-ip` is specified. in this change, both issues are addressed. and the test is updated accordingly to exercise the case where `--resolve-ip` is used. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-07 22:29:31 +08:00
Kefu Chai	d927ee8d8f	test/nodetool: calc max_width from all_hosts for better readability. as `token_to_endpoint` is but a derived variable from `all_hosts`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-07 22:28:54 +08:00
Kefu Chai	4a748c7fb0	test/nodetool: keep tokens as Host's member to be more consistent with the test_status.py. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-07 22:28:54 +08:00
Kefu Chai	aefc385786	test/nodetool: remove unused import and add two empty lines in between global functions Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-03-07 22:28:54 +08:00
Botond Dénes	b69ee6bc27	Merge 'Fix load-and-stream for tablets' from Raphael "Raph" Carvalho It might happen that multiple tablets co-habit the same shard, so we want load-and-stream to jump into a new streaming session for every tablet, such that the receiver will have the data properly segregated. That's a similar treatment we gave to repair. Today, load-and-stream fails due to sstables spanning more than 1 tablet in the receiver. Synchronization with migration is done by taking replication map, so migrations cannot advance while streaming new data. A bug was fixed too, where data must be streamed to pending replicas too, to handle case where migration is ongoing and new data must reach both old and new replica set. A test was added stressing this synchronization path. Another bug was fixed in sstable loading, which expected sharder to not be invalidated throughout the operation, but that breaks during migrations. Fixes #17315. Closes scylladb/scylladb#17449 * github.com:scylladb/scylladb: test: test_tablets: Add load-and-stream test sstables_loader: Stream to pending tablet replica if needed sstables_loader: Implement tablet based load-and-stream sstables_loader: Virtualize sstable_streamer for tablet sstables_loader: Avoid reallocations in vector sstable_loader: Decouple sstable streaming from selection sstables_loader: Introduce sstable_streamer Fix online SSTable loading with concurrent tablet migration	2024-03-07 14:18:30 +02:00
Nadav Har'El	19bcea6216	materialized views: fix rare failure caused by empty update This one-line patch fixes a failure in the dtest lwt_schema_modification_test.py::TestLWTSchemaModification::test_table_alter_delete where an update sometimes failed due to an internal server error, and the log had the mysterious warning message: "std::logic_error (Empty materialized view updated)" We've also seen this log-message in the past in another user's log, and never understood what it meant. It turns out that the error message was generated (and warning printed) while building view updates for a base-table mutation, and noticing that the base mutation contains an empty row - a row with no cells or tombstone or anything whatsoever. This case was deemed (8 years ago, in `d5a61a8c48`) unexpected and nonsensical, and we threw an exception. But this case actually can happen - here is how it happened in test_table_alter_delete - which is a test involving a strange combination of materialized views, LWT and schema changes: 1. A table has a materialized view, and also a regular column "int_col". 2. A background thread repeatedly drops and re-creates this column int_col. 3. Another thread deletes rows with LWT ("IF EXISTS"). 4. These LWT operations each reads the existing row, and because of repeated drop-and-recreate of the "int_col" column, sometimes this read notices that one node has a value for int_col and the other doesn't, and creates a read-repair mutation setting int_col (the difference between the two reads includes just in this column). 5. The node missing "int_col" receives this mutation which sets only int_col. It upgrade()s this mutation to its most recent schema, which doesn't have int_col, so it removes this column from the mutation row - and is left with a completely empty mutation row. This completely empty row is not useful, but upgrade() doesn't remove it. 6. The view-update generation code sees this empty base-mutation row and fails it with this std::logic_error. 7. The node which sent the read-repair mutation sees that the read repair failed, so it fails the read and therefore fails the LWT delete operation. It is this LWT operation which failed in the test, and caused the whole test to fail. The fix is trivial: an empty base-table row mutation should simply be ignored when generating view updates - it shouldn't cause any error. Before this patch, test_table_alter_delete used to fail in roughly 20% of the runs on my laptop. After this patch, I ran it 100 times without a single failure. Fixes #15228 Fixes #17549 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#17607	2024-03-07 12:00:43 +02:00
Botond Dénes	09068d20ea	tools/scylla-nodetool: scrub: make keyspace parameter optional When no keyspace is provided, request all keyspaces from the server, then scrub all of them. This is what the legacy nodetool does, for some reason this was missed when re-implementing scrub. Closes scylladb/scylladb#17495	2024-03-07 11:15:46 +02:00
Tomasz Grabiec	ec6ed18b5c	Merge 'Handle tablet migration failure in barrier stages' from Pavel Emelyanov There are 4 barrier-only stages when migrating a tablet and the test needs to fail pending/leaving replica that handles it in order to validate how coordinator handles dead node. Failing the barrier is done by suspending it with injection code and stopping the node without waking it up. The main difficulty here is how to tell one barrier RPC call from another, because they don't have anything onboard that could tell which stage the barrier is run for. This PR suggests that barrier injection code looks directly into the system.tablets table for the transition stage, the stage is already there by the time barrier is about to ack itself over RPC. refs: #16527 Closes scylladb/scylladb#17450 * github.com:scylladb/scylladb: topology.tablets_migration: Handle failed use_new topology.tablets_migration: Handle failed write_both_read_new topology.tablets_migration: Handle failed write_both_read_old topology.tablets_migration: Handle failed allow_write_both_read_old test/tablets_migration: Add conditional break-point into barrier handler replica: Add helper to read tablet transition stage topology_coordinator: Add action_failed() helper	2024-03-07 09:56:13 +01:00
Botond Dénes	5dfaa69bde	tools/scylla-nodetool: listsnapshots: make the formatting compatible with origin's The author (me) tried to be clever and fix the formatting, but then he realized this just means a lot of unnecessary fighting with tests. So this patch makes the formatting compatible with that of the legacy nodetool: * Use compatible rounding and precision formatting * Use incorrect unit (KB instead of KiB) * Align numbers to the left * Add trailing white-space to "Snapshot Details: "	2024-03-07 03:54:54 -05:00
Botond Dénes	80483ba732	tools/scylla-nodetool: listsnapshots: bail out if there are no snapshots Print a message and exit, don't continue to output the snapshot table. This is what the legacy nodetool does too.	2024-03-07 03:54:54 -05:00
Botond Dénes	ac15e4c109	tools/scylla-nodetool: repair: accept and ignore -full/--full and -j/--job-threads These two parameters are not used by the native nodetool, because ScyllaDB itself doesn't support them. These should be just ignored and indeed there was a unit test checking that this is the case. However, due to a mistake in the unit test, this was not actually tested and nodetool complained when seeing these params. This patch fixes both the test and the native nodetool. Closes scylladb/scylladb#17477	2024-03-07 11:53:50 +03:00
Nadav Har'El	a36c8b28dd	Merge 'scylla-gdb.py: fixes warnings raised by flake8' from Kefu Chai this changeset addresses some warnings raised by flake8 in hope to improve the readability of this script in general. Closes scylladb/scylladb#17668 * github.com:scylladb/scylladb: scylla-gdb: s/if not foo is None/if foo is not None/ scylla-gdb.py: add space after keyword scylla-gdb.py: remove extraneous spaces scylla-gdb.py: use 2 empty lines between top-level funcs/classes scylla-gdb.py: replace <tab> with 4 spaces scylla-gdb: fix the indent	2024-03-07 10:41:15 +02:00
Botond Dénes	28639e6a59	Merge 'docs: trigger the docs-pages workflow on release branches' from Beni Peled Currently, the github docs-pages workflow is triggered only when changes are merged to the master/enterprise branches, which means that in the case of changes to a release branch, for example, a fix to branch-5.4, or a branch-5.4>branch-2024.1 merge, the docs-pages workflow is not triggered and therefore the documentation is not updated with the new change. In this change, I added the `branch-*` pattern, so changes to release branches will trigger the workflow Closes scylladb/scylladb#17281 * github.com:scylladb/scylladb: docs: always build from the default branch docs: trigger the docs-pages workflow on release branches	2024-03-07 10:01:50 +02:00
Botond Dénes	75fe2f5c3a	Merge 'test: rest_api: fix tests to work with tablets' from Aleksandra Martyniuk Fix test_compaction_task.py, test_repair_task.py and test_storage_service.py to work with tablets. Fixes: #17338. Closes scylladb/scylladb#17474 * github.com:scylladb/scylladb: test: rest_api: enable tablets by default test: fix indentation and delete unused this_dc param test: rest_api: fix test_storage_service.py test: rest_api: fix test_repair_task.py test: rest_api: fix test_compaction_task.py test: rest_api: use skip_without_tablets fixture test: rest_api: add some tablet related fixtures	2024-03-07 10:00:09 +02:00
Asias He	83a28342ea	service: Drop unused table param from session_topology_guard The table param is not used. Dropping it so it can be used in places where the table object is not available. Closes scylladb/scylladb#17628	2024-03-07 09:34:40 +02:00
Israel Fruchter	6eb0509ff9	Update tools/cqlsh submodule * tools/cqlsh b8d86b76...e5f5eafd (2): > dist/debian: fix the trailer line format > `COPY TO STDOUT` shouldn't put None where a function is expected Fixes: scylladb/scylladb#17451 Closes scylladb/scylladb#17447	2024-03-07 09:33:36 +02:00
Michał Chojnowski	f9e97fa632	sstables: fix a use-after-free in key_view::explode() key_view::explode() contains a blatant use-after-free: unless the input is already linearized, it returns a view to a local temporary buffer. This is rare, because partition keys are usually not large enough to be fragmented. But for a sufficiently large key, this bug causes a corrupted partition_key down the line. Fixes #17625 Closes scylladb/scylladb#17626	2024-03-07 09:07:07 +02:00
Kefu Chai	7631605892	query-request: use default-generated operator== instead of using the hand-crafted operator==, use the default-generated one, which is equivalent to the former. regarding the difference between global operator== and member operator==, the default-generated operator in C++20 is now symmetric. so we don't need to worry about the problem of `max_result_size` being lhs or rhs. but neither do we need to worry about the implicit conversion, because all constructors of `max_result_size` are marked explicit. so we don't gain any advantage by making the operator== global instead of a member operator. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17536	2024-03-07 09:02:42 +03:00
Kefu Chai	64e14d21db	locator/tablets: add fmt::formatter for tablet_* before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * tablet_id * tablet_replica * tablet_metadata * tablet_map their operator<<:s are dropped Refs scylladb/scylladb#13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17504	2024-03-07 09:00:49 +03:00
Kefu Chai	6ef507e842	build: cmake: add table_check.cc to repair/CMakeLists.txt in `5202bb9d`, we introduced repair/table_check.cc, but we didn't update repair/CMakeLists.txt accordingly. but the symbols defined by this compilation unit is referenced by other source files when building scylla. so, in this change, we add this table_check.cc to the "repair" target. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17517	2024-03-07 08:59:02 +03:00
Pavel Emelyanov	52a1b2c413	Merge 'mutation: add fmt::formatter for mutation types' from Kefu Chai before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define formatters for * position_range * mutation_fragment * range_tombstone_stream * mutation_fragment_v2::printer Refs #13245 Closes scylladb/scylladb#17521 * github.com:scylladb/scylladb: mutation: add fmt::formatter for position_range mutation: add fmt::formatter for mutation_fragment and range_tombstone_stream mutation: add fmt::formatter for mutation_fragment_v2::printer	2024-03-07 08:56:21 +03:00
Pavel Emelyanov	df6048adec	topology.tablets_migration: Handle failed use_new This stage doesn't need any special treatment, because we cannot revert to old replicas and should proceed normally. The barrier itself won't get stuck, because it already handles excluded/ignored nodes. Just make the test validate it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-03-07 08:47:26 +03:00
Pavel Emelyanov	fb7428c560	topology.tablets_migration: Handle failed write_both_read_new Two options here -- go revert to old replicas by jumping into cleanup_target stage or proceed normally. The choice depends on which replica set has fewer dead nodes. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-03-07 08:47:26 +03:00
Pavel Emelyanov	324eaaf873	topology.tablets_migration: Handle failed write_both_read_old At this stage it can happen that target replica got some writes, so its tablet needs to be cleaned up, so jump to cleanup_target stage. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-03-07 08:47:26 +03:00
Pavel Emelyanov	f81e0b2e88	topology.tablets_migration: Handle failed allow_write_both_read_old This is early stage, just proceed to existing revert_migration Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-03-07 08:47:26 +03:00
Pavel Emelyanov	5bb1597a30	test/tablets_migration: Add conditional break-point into barrier handler There are several transition stages that are executed by the topology coordinator with the help of barrier-and-drain raft commands. For the test to stop and remove a node while handling this stage it must inject a break-point into barrier handler, wait for it to happen and then stop the node without resuming the break-point. Then removenode from the cluster. The break-point suspends barrier handling when a specific tablet is in specific transition stage. Tablet ID and desired stage are configured via injector parameters. With today's error-injection facilities the way to suspend code execution is with injecting a lambda that waits for a message from the injection engine. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-03-07 08:47:26 +03:00
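
The failure mode described in 8ca672a02c (test/pylib: return better error if self.create_server() raises) can be reproduced in isolation. The following is a minimal sketch of that pattern, not the actual test/pylib code: `FakeServer`, `create_server`, `add_server_buggy` and `add_server_fixed` are hypothetical names introduced only for illustration.

```python
class FakeServer:
    """Hypothetical stand-in for a server object with a cleanup hook."""

    def __init__(self):
        self.stopped = False

    def stop(self):
        self.stopped = True


def create_server(fail):
    """Hypothetical factory that may raise before returning anything."""
    if fail:
        raise RuntimeError("cannot create server")
    return FakeServer()


def add_server_buggy(fail):
    try:
        server = create_server(fail)
        return server
    except Exception:
        # If create_server() raised, `server` was never bound, so this line
        # raises UnboundLocalError and the original exception is obscured.
        server.stop()
        raise


def add_server_fixed(fail):
    server = None  # always bound, even if create_server() raises
    try:
        server = create_server(fail)
        return server
    except Exception:
        # Only clean up when there is something to clean up.
        if server is not None:
            server.stop()
        raise  # re-raises the original exception from create_server()
```

With `fail=True`, the buggy variant surfaces the Python runtime error about the unbound local variable, while the fixed variant propagates the `RuntimeError` raised by the factory, which is the behaviour the commit message asks for.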


41690 Commits
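
The selection rule described in 85ae10f632 (Make it possible to run individual pytest cases with test.py) can be sketched as follows. This is an illustration under stated assumptions, not the test.py implementation: `select_tests`, the shape of its return value, and the case name `test_one` are hypothetical; only the prefix-match versus exact-match behaviour and the `name::case` syntax come from the commit message.

```python
def select_tests(available, pattern):
    """Return (test_name, case_or_None) pairs selected by `pattern`.

    Hypothetical helper: `available` is a list of test names and
    `pattern` is either "name" or "name::case".
    """
    name, sep, case = pattern.partition("::")
    if sep:
        # "name::case": the file is chosen by exact name match, and the
        # case is kept so it can be appended to the pytest invocation.
        return [(t, case) for t in available if t == name]
    # Plain "name": prefix match, all cases of every matching file.
    return [(t, None) for t in available if t.startswith(name)]


tests = [
    "topology_custom/test_tablets",
    "topology_custom/test_tablets_migration",
]

# Prefix match selects both files.
prefix_selected = select_tests(tests, "topology_custom/test_tablets")

# Exact match plus a case selects only test_tablets.
case_selected = select_tests(tests, "topology_custom/test_tablets::test_one")
```

Here `prefix_selected` contains both test files, which is the "minor" trouble the commit message mentions, while `case_selected` contains only `topology_custom/test_tablets` paired with the requested case.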