"
Scylla suffers from aggressive compaction after a repair-based operation has been initiated. That translates into bad latency and slowness for the operation itself.
This aggressiveness comes from the fact that:
1) new sstables are immediately added to the compaction backlog, thus reducing the bandwidth available for the operation.
2) new sstables are in bad shape when integrated into the main sstable set, not conforming to the strategy invariant.
To solve this problem, new sstables will be incrementally reshaped, off the compaction strategy, until finally integrated into the main set.
The solution takes advantage of the fact that there's only one sstable per vnode range, meaning sstables generated by repair-based operations are disjoint.
NOTE: off-strategy for repair-based decommission and removenode will follow this series and require little work as the infrastructure is introduced in this series.
Refs #5226.
"
* 'offstrategy_v7' of github.com:raphaelsc/scylla:
tests: Add unit test for off-strategy sstable compaction
table: Wire up off-strategy compaction on repair-based bootstrap and replace
table: extend add_sstable_and_update_cache() for off-strategy
sstables/compaction_manager: Add function to submit off-strategy work
table: Introduce off-strategy compaction on maintenance sstable set
table: change build_new_sstable_list() to accept other sstable sets
table: change non_staging_sstables() to filter out off-strategy sstables
table: Introduce maintenance sstable set
table: Wire compound sstable set
table: prepare make_reader_excluding_sstables() to work with compound sstable set
table: prepare discard_sstables() to work with compound sstable set
table: extract add_sstable() common code into a function
sstable_set: Introduce compound sstable set
reshape: STCS: preserve token contiguity when reshaping disjoint sstables
Now, sstables created by bootstrap and replace will be added to the
maintenance set, and once the operation completes, off-strategy compaction
will be started.
We wait until the end of the operation to trigger off-strategy, as reshaping
can be more efficient if we wait for all sstables before deciding what
to compact. Also, waiting for completion is no longer an issue because
we're able to read from new sstables using partitioned_sstable_set and
their existence isn't accounted for by the compaction backlog tracker yet.
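A minimal sketch of this flow, with illustrative names only (the maintenance set handling, add_maintenance_sstable and start_offstrategy_compaction below are placeholders, not the actual Scylla interfaces):

```cpp
#include <memory>
#include <utility>
#include <vector>

// Illustrative stand-ins; the real code uses sstables::shared_sstable, the
// compound/partitioned sstable sets and the compaction manager.
struct sstable {};
using sstable_ptr = std::shared_ptr<sstable>;

class table_sketch {
    std::vector<sstable_ptr> _main_set;        // owned by the compaction strategy + backlog tracker
    std::vector<sstable_ptr> _maintenance_set; // repair/bootstrap output, not in the backlog yet
public:
    // While the operation runs: new sstables go only to the maintenance set, so
    // they are readable (via a compound set) but add nothing to the compaction
    // backlog and cannot violate the strategy invariant of the main set.
    void add_maintenance_sstable(sstable_ptr sst) {
        _maintenance_set.push_back(std::move(sst));
    }

    // When the operation completes: reshape the maintenance sstables off-strategy.
    // Repair emits one sstable per vnode range, so the inputs are disjoint and can
    // be merged into strategy-conforming sstables before joining the main set.
    void start_offstrategy_compaction() {
        // Placeholder for the reshape work submitted to the compaction manager:
        // reshape(_maintenance_set) -> strategy-conforming sstables -> _main_set.
        _main_set.insert(_main_set.end(), _maintenance_set.begin(), _maintenance_set.end());
        _maintenance_set.clear();
    }
};
```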
Refs #5226.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
As #1449 notes, trichotomic comparators returning int are dangerous as they
can be mistaken for less comparators. This series converts dht::ring_position
and dht::decorated_key, as well as a few closely related downstream types, to
return std::strong_ordering.
Closes #8225
* github.com:scylladb/scylla:
dht: ring_position, decorated_key: convert tri_comparators to std::strong_ordering
pager: rephrase misleading comparison check
test: total_order_checks: prepare for std::strong_ordering
test: mutation_test: prepare merge_container for std::strong_ordering
intrusive_array: prepare for std::strong_ordering
utils: collection-concepts: prepare for std::strong_ordering
Convert tri_comparators to return std::strong_ordering rather than int,
to prevent confusion with less comparators. Downstream users are either
also converted, or adjust the return type back to int, whichever happens
to be simpler; in all cases the change is trivial.
- 3 nodes in the cluster with rf = 3
- run repair on node1 with ignore_nodes to ignore node2 and node3
- node1 has no followers to repair with
However, currently node1 will walk through the repair procedure to read
data from disk and calculate hashes, which is unnecessary.
This patch fixes this issue, so that in case there are no followers, we
skip the range and avoid the unnecessary work.
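The fix boils down to a guard of roughly this shape (a sketch with illustrative names; the real code lives in the repair range-processing loop):

```cpp
#include <vector>

// Illustrative stand-ins for the repair types.
struct token_range {};
struct node_address { int id; };

// Repair one range, or skip it entirely when ignore_nodes left us with no
// followers to compare against.
void repair_range_or_skip(const token_range& range, const std::vector<node_address>& neighbors) {
    if (neighbors.empty()) {
        // No followers to repair with: do not read rows from disk or compute
        // hashes, just mark the range as done.
        return;
    }
    // ... read local rows, hash them, exchange hashes/rows with `neighbors` ...
    (void)range;
}
```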
Before:
$ curl -X POST http://127.0.0.1:10000/storage_service/repair_async/myks3?ignore_nodes="127.0.0.2,127.0.0.3"
repair - repair id [id=1, uuid=ff39151b-2ce9-4885-b7e9-89158b14b5c2] on shard 0 stats:
repair_reason=repair, keyspace=myks3, tables={standard1},
ranges_nr=769, sub_ranges_nr=769, round_nr=1456,
round_nr_fast_path_already_synced=1456,
round_nr_fast_path_same_combined_hashes=0,
round_nr_slow_path=0, rpc_call_nr=0, tx_hashes_nr=0, rx_hashes_nr=0, duration=0.19 seconds,
tx_row_nr=0, rx_row_nr=0, tx_row_bytes=0, rx_row_bytes=0,
row_from_disk_bytes={{127.0.0.1, 2822972}},
row_from_disk_nr={{127.0.0.1, 6218}},
row_from_disk_bytes_per_sec={{127.0.0.1, 14.1695}} MiB/s,
row_from_disk_rows_per_sec={{127.0.0.1, 32726.3}} Rows/s,
tx_row_nr_peer={}, rx_row_nr_peer={}
Data was read from disk.
After:
$ curl -X POST http://127.0.0.1:10000/storage_service/repair_async/myks3?ignore_nodes="127.0.0.2,127.0.0.3"
repair - repair id [id=1, uuid=c6df8b23-bd3b-4ebc-8d4c-a11d1ebcca39] on shard 0 stats:
repair_reason=repair, keyspace=myks3, tables={standard1}, ranges_nr=769,
sub_ranges_nr=0, round_nr=0, round_nr_fast_path_already_synced=0,
round_nr_fast_path_same_combined_hashes=0, round_nr_slow_path=0,
rpc_call_nr=0, tx_hashes_nr=0, rx_hashes_nr=0, duration=0.0 seconds,
tx_row_nr=0, rx_row_nr=0, tx_row_bytes=0, rx_row_bytes=0,
row_from_disk_bytes={},
row_from_disk_nr={},
row_from_disk_bytes_per_sec={} MiB/s,
row_from_disk_rows_per_sec={} Rows/s,
tx_row_nr_peer={}, rx_row_nr_peer={}
No data was read from disk.
Fixes #8256
Closes #8257
In some cases, the user may want to repair the cluster while ignoring a node
that is down. For example, run repair before running the removenode operation
to remove a dead node.
Currently, repair will ignore the dead node and keep running without it,
but it reports the repair as partial and failed. It is hard to tell
whether the repair failed only because the dead node is not present or
because of some other error.
In order to exclude the dead node, one can use the hosts option. But it
is hard to understand and use, because one needs to list all the "good"
hosts including the node itself. It will be much simpler if one can
just specify the node to exclude explicitly.
In addition, we support the ignore_nodes option in other node operations
like removenode. This change makes the interface for explicitly ignoring
a node more consistent.
Refs: #7806
Closes #8233
Commit aab6b0ee27 introduced the
controversial new IMR format, which relied on a very template-heavy
infrastructure to generate serialization and deserialization code via
template meta-programming. The promise was that this new format, beyond
solving the problems the previous open-coded representation had (working
on linearized buffers), would speed up migrating other components to this
IMR format, as the IMR infrastructure reduces code bloat and makes the code
more readable, via declarative type descriptions, as well as safer.
However, the results were almost the opposite. The template
meta-programming used by the IMR infrastructure proved very hard to
understand. Developers don't want to read or modify it. Maintainers
don't want to see it being used anywhere else. In short, nobody wants to
touch it.
This commit does a conceptual revert of
aab6b0ee27. A verbatim revert is not
possible because related code evolved a lot since the merge. Also, going
back to the previous code would mean we regress as we'd revert the move
to fragmented buffers. So this revert is only conceptual, it changes the
underlying infrastructure back to the previous open-coded one, but keeps
the fragmented buffers, as well as the interface of the related
components (to the extent possible).
Fixes: #5578
repair_writer::do_write() already has a partition comparison for each
mutation fragment written, to determine whether the fragment belongs to
another partition or not. This equality comparison can be converted to a
tri_compare at no extra cost, allowing for detection of out-of-order
partitions, in which case `on_internal_error()` is called.
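A sketch of the check described above (stand-in types; the real code uses dht::decorated_key and the table's tri-compare):

```cpp
#include <stdexcept>
#include <string>

// Stand-ins for dht::decorated_key and the ring-position tri-compare.
struct decorated_key { long token; };
inline int tri_compare(const decorated_key& a, const decorated_key& b) {
    return a.token < b.token ? -1 : (a.token > b.token ? 1 : 0);
}
inline void on_internal_error(const std::string& msg) { throw std::runtime_error(msg); }

// The same comparison that used to be an equality check, now done as a
// tri-compare so an out-of-order partition is reported instead of silently
// being treated as a "new" partition.
void check_partition_order(const decorated_key& current, const decorated_key& incoming) {
    auto r = tri_compare(incoming, current);
    if (r < 0) {
        on_internal_error("repair_writer: partition keys written out of order");
    }
    // r > 0: a new partition legitimately starts; r == 0: same partition.
}
```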
Refs: #7623
Refs: #7552
Test: dtest(RepairAdditionalTest.repair_disjoint_row_3nodes_diff_shard_count_test:debug)
Signed-off-by: Botond Dénes <bdenes@scylladb.com>
Message-Id: <20210216074523.318217-1-bdenes@scylladb.com>
Add a string describing where the sstables originated
from (e.g. memtable, repair, streaming, compaction, etc.)
If configure_writer is called with a nullptr, the origin
will be an empty string.
Introduce test_env_sstables_manager that provides an overload
of configure_writer with no parameters that calls the base class's
configure_writer with "test" origin. This was to reduce the
code churn in this patch and to keep the tests simple.
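A hedged sketch of the described overloads (class and member names are illustrative, not the actual sstables_manager interface):

```cpp
#include <string>

// Illustrative stand-in for the writer configuration carrying the new field.
struct sstable_writer_config {
    std::string origin;   // e.g. "memtable", "repair", "streaming", "compaction"
};

class sstables_manager_sketch {
public:
    virtual ~sstables_manager_sketch() = default;
    sstable_writer_config configure_writer(const char* origin) const {
        sstable_writer_config cfg;
        cfg.origin = origin ? origin : "";   // nullptr origin -> empty string
        return cfg;
    }
};

// Test-only manager: defaults the origin to "test" so unit tests do not have
// to pass an origin at every call site.
class test_env_sstables_manager_sketch : public sstables_manager_sketch {
public:
    sstable_writer_config configure_writer() const {
        return sstables_manager_sketch::configure_writer("test");
    }
};
```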
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
clear_gently gently clears the token_metadata members.
It uses continuations to allow yielding if needed
to prevent reactor stalls.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
The function complexity is O(#tokens) in the worst case,
as for each endpoint token it traverses _token_to_endpoint_map
linearly to erase the endpoint mapping if it exists.
This change renames the current implementation of
update_normal_tokens to update_normal_tokens_sync
and clones the code as a coroutine that returns a future
and may yield if needed.
Eventually we should futurize the whole token_metadata
and abstract_replication_strategy interface and get rid
of the synchronous functions. Until then the sync
version is still required from call sites that
are neither returning a future nor running in a seastar thread.
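A sketch of the coroutine variant (stand-in types; whether you yield via seastar::coroutine::maybe_yield(), seastar::later() or seastar::yield() depends on the Seastar version):

```cpp
#include <seastar/core/coroutine.hh>
#include <seastar/core/future.hh>
#include <seastar/coroutine/maybe_yield.hh>
#include <iterator>
#include <map>
#include <unordered_set>

// Illustrative stand-ins for the token_metadata internals.
using token = long;
using inet_address = int;

struct token_metadata_sketch {
    std::map<token, inet_address> _token_to_endpoint_map;

    // Async flavor: may yield between map operations so a large token map does
    // not stall the reactor; update_normal_tokens_sync keeps the old behavior.
    seastar::future<> update_normal_tokens(std::unordered_set<token> tokens, inet_address ep) {
        // Drop stale mappings for this endpoint, yielding as we go.
        for (auto it = _token_to_endpoint_map.begin(); it != _token_to_endpoint_map.end();) {
            it = (it->second == ep) ? _token_to_endpoint_map.erase(it) : std::next(it);
            co_await seastar::coroutine::maybe_yield();
        }
        // Install the new mappings.
        for (auto t : tokens) {
            _token_to_endpoint_map[t] = ep;
            co_await seastar::coroutine::maybe_yield();
        }
    }
};
```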
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
`ops` might be passed as a disengaged shared_ptr when called
from `decommission_with_repair`.
In this case we need to propagate to sync_data_using_repair a
disengaged std::optional<utils::UUID>.
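The essence of the fix, as a sketch (std::shared_ptr below stands in for seastar::lw_shared_ptr, and the member names are illustrative):

```cpp
#include <memory>
#include <optional>

// Stand-ins for utils::UUID and the node-ops metadata object.
struct uuid { unsigned long long msb = 0, lsb = 0; };
struct node_ops_info { uuid ops_uuid; };

// `ops` may be disengaged (e.g. when called from decommission_with_repair);
// in that case propagate a disengaged optional instead of dereferencing it.
std::optional<uuid> ops_uuid_for_repair(const std::shared_ptr<node_ops_info>& ops) {
    return ops ? std::make_optional(ops->ops_uuid) : std::nullopt;
}
```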
Fixes #7788
DTest: update_cluster_layout_tests:TestUpdateClusterLayout.verify_latest_copy_decommission_node_test(debug)
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20201213073743.331253-1-bhalevy@scylladb.com>
Currently removenode works like below:
- The coordinator node advertises the node to be removed in
REMOVING_TOKEN status in gossip
- Existing nodes learn the node in REMOVING_TOKEN status
- Existing nodes sync data for the range it owns
- Existing nodes send notification to the coordinator
- The coordinator node waits for notification and announce the node in
REMOVED_TOKEN
Current problems:
- Existing nodes do not tell the coordinator if the data sync is ok or failed.
- The coordinator cannot abort the removenode operation in case of error
- A failed removenode operation will leave the node to be removed in
REMOVING_TOKEN status forever.
- The removenode operation runs in best-effort mode, which may cause data
consistency issues.
It means that if a node that owns the range after the removenode
operation is down during the operation, the removenode operation
will continue to succeed without requiring that node to perform data
syncing. This can cause data consistency issues.
For example, with five nodes in the cluster and RF = 3, for a range n1, n2,
n3 are the old replicas and n2 is being removed; after the removenode
operation, the new replicas are n1, n5, n3. If n3 is down during the
removenode operation, only n1 will be used to sync data with the new
owner n5. This will break QUORUM read consistency if n1 happens to
miss some writes.
Improvements in this patch:
- This patch makes the removenode safe by default.
We require all nodes in the cluster to participate in the removenode operation and
sync data if needed. We fail the removenode operation if any of them is down or
fails.
If the user wants the removenode operation to succeed even if some of the nodes
are not available, the user has to explicitly pass a list of nodes that can be
skipped for the operation.
$ nodetool removenode --ignore-dead-nodes <list_of_dead_nodes_to_ignore> <host_id>
Example restful api:
$ curl -X POST "http://127.0.0.1:10000/storage_service/remove_node/?host_id=7bd303e9-4c7b-4915-84f6-343d0dbd9a49&ignore_nodes=127.0.0.3,127.0.0.5"
- The coordinator can abort data sync on existing nodes
For example, if one of the nodes fails to sync data, it makes no sense for
other nodes to continue to sync data because the whole operation will
fail anyway.
- The coordinator can decide which nodes to ignore and pass the decision
to other nodes
Previously, there was no way for the coordinator to tell existing nodes
to run in strict mode or best-effort mode. Users would have to modify the
config file or run a RESTful API command on all the nodes to select strict
or best-effort mode. With this patch, this cluster-wide configuration is
eliminated.
Fixes #7359
Closes #7626
This reverts commit dc77d128e9. It was reverted
due to a strange and unexplained diff, which is now explained. The
HEAD on the working directory being pulled from was set back, so git
thought it was merging the intended commits, plus all the work that was
committed from HEAD to master. So it is safe to restore it.
This reverts commit 0aa1f7c70a, reversing
changes made to 72c59e8000. The diff is
strange, including unrelated commits. The cause is not understood, so to
be safe, revert and try again.
"
This series adds maybe_yield called from
cleanup_compaction::get_ranges_for_invalidation
to avoid reactor stalls.
To achieve that, we first extract bool_class can_yield
to utils/maybe_yield.hh, and add a convenience helper:
utils::maybe_yield(can_yield) that conditionally calls
seastar::thread::maybe_yield if it can (when called in a
seastar thread).
With that, we add a can_yield parameter to dht::to_partition_ranges
and dht::partition_range::deoverlap (defaults to false), and
use it from cleanup_compaction::get_ranges_for_invalidation,
as the latter is always called from `consume_in_thread`.
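A sketch of the helper described above (the real header is utils/maybe_yield.hh; the exact body here is illustrative):

```cpp
#include <seastar/core/thread.hh>
#include <seastar/util/bool_class.hh>

namespace utils {

// Strongly-typed flag instead of a bare bool parameter.
using can_yield = seastar::bool_class<struct can_yield_tag>;

// Conditionally yield: only when the caller allows it and we are actually
// running inside a seastar::thread (thread::maybe_yield() itself only yields
// when preemption is needed).
inline void maybe_yield(can_yield yield) {
    if (yield == can_yield::yes && seastar::thread::running_in_thread()) {
        seastar::thread::maybe_yield();
    }
}

} // namespace utils
```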
Fixes #7674
Test: unit(dev)
"
* tag 'unstall-get_ranges_for_invalidation-v2' of github.com:bhalevy/scylla:
compaction: cleanup_compaction: get_ranges_for_invalidation: add yield points
dht/i_partitioner: to_partition_ranges: support yielding
locator: extract can_yield to utils/maybe_yield.hh
Move the definition of bool_class can_yield to a standalone
header file and define there a maybe_yield(can_yield) helper.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
In commit 9b28162f88 (repair: Use label
for node ops metrics), we switched to using labels for different node
operations. We should use the same description for the same metric name.
Fixes #7681
Closes #7682
Call the futurized clone_only_token_map and
remove the _leaving_endpoints from the cloned token_metadata_impl.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Does part of clone_async() using continuations to prevent stalls.
Rename synchronous variant to clone_only_token_map_sync
that is going to be deprecated once all its users are futurized.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Clone the input token_metadata asynchronously using
clone_async() before modifying it using update_normal_tokens.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Only replace_with_repair needs to clone the token_metadata
and update the local copy, so we can safely pass a read-only
snapshot of the token_metadata rather than copying it in all cases.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
repair: Use single writer for all followers
Currently, the repair master creates one writer for each follower to write
rows from followers to sstables. That is RF - 1 writers in total. Each
writer creates 1 sstable for the range repaired, usually a vnode range.
Those sstables for a given vnode range are disjoint.
To reduce the compaction work, we can create one writer for all the
followers. This reduces the number of sstables generated by repair
significantly to one per vnode range from RF - 1 per vnode range.
Fixes #7525
Closes #7528
* github.com:scylladb/scylla:
repair: No more vector for _writer_done and friends
repair: Use single writer for all followers
The shared_from_this lw_shared_ptr must not be accessed
across shards. Capturing it in the lambda passed to
mutation_writer::distribute_reader_and_consume_on_shards
causes exactly that since the captured lw_shared_ptr
is copied on other shards, and ends up in memory corruption
as seen in #7535 (probably due to lw_shared_ptr._count
going out-of-sync when incremented/decremented in parallel
on other shards with no synchronization).
This was introduced in 289a08072a.
The writer is not needed in the body of this lambda anyway,
so it doesn't need to capture it. It is already held
by the continuations until the end of the chain.
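A sketch of the difference (illustrative types; seastar::smp::submit_to stands in for the cross-shard hop that distribute_reader_and_consume_on_shards performs):

```cpp
#include <seastar/core/future.hh>
#include <seastar/core/shared_ptr.hh>
#include <seastar/core/smp.hh>

struct repair_writer_sketch { /* ... */ };

// What NOT to do: lw_shared_ptr's reference count is not atomic, so copying
// and destroying `w` on another shard corrupts the count.
seastar::future<> buggy(seastar::lw_shared_ptr<repair_writer_sketch> w) {
    return seastar::smp::submit_to(1, [w] { /* uses w on shard 1 -- unsafe */ });
}

// The fix: the lambda shipped to the other shard captures no shared pointer;
// the owning shard keeps the writer alive through its own continuation chain.
seastar::future<> fixed(seastar::lw_shared_ptr<repair_writer_sketch> w) {
    return seastar::smp::submit_to(1, [] { /* cross-shard work without w */ })
        .then([w] { /* back on the owning shard; safe to touch w here */ });
}
```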
Fixes #7535
Test: repair_additional_test:RepairAdditionalTest.repair_disjoint_row_3nodes_diff_shard_count_test (dev)
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20201104142216.125249-1-bhalevy@scylladb.com>
Now that both repair followers and the repair master use a single writer,
we can get rid of the vector associated with _writer_done and friends.
Fixes #7525
Currently, the repair master creates one writer for each follower to write
rows from followers to sstables. That is RF - 1 writers in total. Each
writer creates 1 sstable for the range repaired, usually a vnode range.
Those sstables for a given vnode range are disjoint.
To reduce the compaction work, we can create one writer for all the
followers. This reduces the number of sstables generated by repair
significantly to one per vnode range from RF - 1 per vnode range.
Fixes #7525
The future of the fiber that writes data into sstables inside
the repair_writer is stored in _writer_done like below:
class repair_writer {
_writer_done[node_idx] =
mutation_writer::distribute_reader_and_consume_on_shards().then([this] {
...
}).handle_exception([this] {
...
});
}
The fiber accesses the repair_writer object in the error handling path. We
wait for _writer_done to finish before we destroy the repair_meta
object, which contains the repair_writer object, to avoid the fiber
accessing an already-freed repair_writer object.
To be safer, we can make repair_writer a shared pointer and take a
reference in the distribute_reader_and_consume_on_shards code path.
Fixes #7406
Closes #7430
Require a schema and an operation name to be given to each permit when
created. The schema is of the table the read is executed against, and
the operation name is some name identifying the operation the
permit is part of. Ideally this should be different for each site the
permit is created at, to be able to discern not only different kinds of
reads, but also different code paths the read took.
As not all reads can be associated with one schema, the schema is allowed
to be null.
The name will be used for debugging purposes, both for coredump
debugging and runtime logging of permit-related diagnostics.
We want to start tracking the memory consumption of mutation fragments.
For this we need schema and permit during construction, and on each
modification, so the memory consumption can be recalculated and passed to
the permit.
In this patch we just add the new parameters and go through the insane
churn of updating all call sites. They will be used in the next patch.
Not used yet, this patch does all the churn of propagating a permit
to each impl.
In the next patch we will use it to track the memory
consumption of `_buffer`.
The verb is sent by repair code, so it should be registered
in the same place, not in main. Also -- the verb should be
unregistered on stop.
The global messaging service instance is made similarly to the
row-level one, as there's no ready-to-use repair service.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The goal is to make it possible to reg/unreg not only row-level
verbs. While at it -- equip the init call with a sharded<database>&
argument; it will be needed by the next patch.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
"
This series follows the suggestion from https://github.com/scylladb/scylla/pull/7203#issuecomment-689499773 discussion and deprecates a number of cluster features. The deprecation does not remove any features from the strings sent via gossip to other nodes, but it removes all checks for these features from code, assuming that the checks are always true. This assumption is quite safe for features introduced over 2 years ago, because the official upgrade path only allows upgrading from a previous official release, and these feature bits were introduced many release cycles ago.
All deprecated features were picked from a `git blame` output which indicated that they come from 2018:
```git
e46537b7d3 2016-05-31 11:44:17 +0200 RANGE_TOMBSTONES_FEATURE = "RANGE_TOMBSTONES";
85c092c56c 2016-07-11 10:59:40 +0100 LARGE_PARTITIONS_FEATURE = "LARGE_PARTITIONS";
02bc0d2ab3 2016-12-09 22:09:30 +0100 MATERIALIZED_VIEWS_FEATURE = "MATERIALIZED_VIEWS";
67ca6959bd 2017-01-30 19:50:13 +0000 COUNTERS_FEATURE = "COUNTERS";
815c91a1b8 2017-04-12 10:14:38 +0300 INDEXES_FEATURE = "INDEXES";
d2a2a6d471 2017-08-03 10:53:22 +0300 DIGEST_MULTIPARTITION_READ_FEATURE = "DIGEST_MULTIPARTITION_READ";
ecd2bf128b 2017-09-01 09:55:02 +0100 CORRECT_COUNTER_ORDER_FEATURE = "CORRECT_COUNTER_ORDER";
713d75fd51 2017-09-14 19:15:41 +0200 SCHEMA_TABLES_V3 = "SCHEMA_TABLES_V3";
2f513514cc 2017-11-29 11:57:09 +0000 CORRECT_NON_COMPOUND_RANGE_TOMBSTONES = "CORRECT_NON_COMPOUND_RANGE_TOMBSTONES";
0be3bd383b 2017-12-04 13:55:36 +0200 WRITE_FAILURE_REPLY_FEATURE = "WRITE_FAILURE_REPLY";
0bab3e59c2 2017-11-30 00:16:34 +0000 XXHASH_FEATURE = "XXHASH";
fbc97626c4 2018-01-14 21:28:58 -0500 ROLES_FEATURE = "ROLES";
802be72ca6 2018-03-18 06:25:52 +0100 LA_SSTABLE_FEATURE = "LA_SSTABLE_FORMAT";
71e22fe981 2018-05-25 10:37:54 +0800 STREAM_WITH_RPC_STREAM = "STREAM_WITH_RPC_STREAM";
```
Tests: unit(dev)
manual(verifying with cqlsh that the feature strings are indeed still set)
"
Closes #7234.
* psarna-clean_up_features:
gms: add comments for deprecated features
gms: remove unused feature bits
streaming: drop checks for RPC stream support
roles: drop checks for roles schema support
service: drop checks for xxhash support
service: drop checks for write failure reply support
sstables: drop checks for non-compound range tombstones support
service: drop checks for v3 schema support
repair: drop checks for large partitions support
service: drop checks for digest multipartition read support
sstables: drop checks for correct counter order support
cql3: drop checks for materialized views support
cql3: drop checks for counters support
cql3: drop checks for indexing support
from Asias.
This series follows "repair: Add progress metrics for node ops #6842"
and adds the metrics for the remaining node operations,
i.e., replace, decommission and removenode.
Fixes #1244, #6733
* asias-repair_progress_metrics_replace_decomm_removenode:
repair: Add progress metrics for removenode ops
repair: Add progress metrics for decommission ops
repair: Add progress metrics for replace ops
Change 94995acedb added yielding to abstract_replication_strategy::do_get_ranges.
And 07e253542d used get_ranges_in_thread in compaction_manager.
However, there is nothing to prevent token_metadata, and in particular its
`_sorted_tokens` from changing while iterating over them in do_get_ranges if the latter yields.
Therefore, copy the replication strategy `_token_metadata` in `get_ranges_in_thread(inet_address ep)`.
If the caller provides `token_metadata` to get_ranges_in_thread, then the caller
must make sure that we can safely yield while accessing token_metadata (like
in `do_rebuild_replace_with_repair`).
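A sketch of the defensive copy (stand-in types; the real code copies locator::token_metadata before computing ranges):

```cpp
#include <vector>

// Stand-ins for locator::token_metadata and the range computation.
using token = long;
struct token_metadata_sketch { std::vector<token> sorted_tokens; };

struct replication_strategy_sketch {
    token_metadata_sketch _token_metadata;

    // The range computation below may yield, so iterate over a private copy;
    // concurrent updates to the live token_metadata then cannot invalidate
    // the iteration.
    std::vector<token> get_ranges_in_thread() const {
        token_metadata_sketch tm = _token_metadata;   // copy taken before any yield
        std::vector<token> ranges;
        for (auto t : tm.sorted_tokens) {
            ranges.push_back(t);   // the real code builds vnode ranges and may yield here
        }
        return ranges;
    }
};
```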
Fixes #7044
Test: unit(dev)
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Message-Id: <20200915074555.431088-1-bhalevy@scylladb.com>
Large partitions have been supported for over 2 years and upgrades are only
allowed from versions which already have the support, so the checks
are hereby dropped.
The method extracts an element from the list, constructs
the desired object from it and frees the element. This is a common
usage of range_tombstone_list. Having a helper helps encapsulate
the exact collection inside the class.
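A sketch of such a pop-style helper (simplified stand-ins; the real class wraps an intrusive container of range_tombstone entries):

```cpp
#include <list>
#include <optional>

struct range_tombstone_sketch { long start; long end; };

class range_tombstone_list_sketch {
    std::list<range_tombstone_sketch> _tombstones;
public:
    void add(range_tombstone_sketch rt) { _tombstones.push_back(rt); }

    // Extract the front element: construct the returned object from it and
    // free the node, so callers never touch the underlying collection.
    std::optional<range_tombstone_sketch> pop_front() {
        if (_tombstones.empty()) {
            return std::nullopt;
        }
        range_tombstone_sketch rt = _tombstones.front();
        _tombstones.pop_front();
        return rt;
    }
};
```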
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The next patch will change the way range tombstones are
fed into the hasher. To make sure the code flow doesn't
become exception-unsafe, mark the relevant methods as
non-throwing.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The following metric is added:
scylla_node_maintenance_operations_removenode_finished_percentage{shard="0",type="gauge"} 0.650000
It is the percentage of the removenode operation that has finished so
far.
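A hedged sketch of how such a per-shard gauge can be registered with the Seastar metrics API (the group and metric names mirror the metric shown above; the class and member names are otherwise illustrative):

```cpp
#include <seastar/core/metrics.hh>
#include <seastar/core/metrics_registration.hh>

namespace sm = seastar::metrics;

// Illustrative only: a per-shard gauge exposing removenode progress in [0, 1].
class node_ops_metrics_sketch {
    double _removenode_finished = 0.0;
    sm::metric_groups _metrics;
public:
    node_ops_metrics_sketch() {
        _metrics.add_group("node_maintenance_operations", {
            sm::make_gauge("removenode_finished_percentage",
                           [this] { return _removenode_finished; },
                           sm::description("Finished percentage of the removenode operation on this shard")),
        });
    }
    // Called by the removenode code path as ranges complete.
    void set_removenode_progress(double fraction) { _removenode_finished = fraction; }
};
```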
Fixes #1244, #6733
The following metric is added:
scylla_node_maintenance_operations_decommission_finished_percentage{shard="0",type="gauge"}
0.650000
It is the percentage of the decommission operation that has finished so
far.
Fixes #1244, #6733
The following metric is added:
scylla_node_maintenance_operations_replace_finished_percentage{shard="0",type="gauge"} 0.650000
It is the percentage of the replace operation that has finished so far.
Fixes #1244, #6733