scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 18:40:38 +00:00

Author	SHA1	Message	Date
Gleb Natapov	94fcba5662	storage_proxy: remove unused variable	2023-06-22 15:26:20 +03:00
Tomasz Grabiec	10e05eec66	storage_proxy: Obtain shard from erm in the read path dht::shard_of() does not use the correct sharder for tablet-based tables. Code which is supposed to work with all kinds of tables should use erm::get_sharder().	2023-06-21 00:58:24 +02:00
Petr Gusev	d34da12240	storage_proxy: add fencing_token and related infrastructure A new stale_topology_exception was introduced, it's raised in apply_fence when an RPC comes with a stale fencing_token. An overload of apply_fence with future will be used to wrap the storage_proxy methods which need to be fenced.	2023-06-15 15:48:00 +04:00
Kamil Braun	a740fbf58a	storage_proxy: rename `init_messaging_service` to `start_remote` The function now has more responsibilities than before, rename it and add a comment to better illustrate this.	2023-06-14 11:41:36 +02:00
Kamil Braun	f26e98c3be	storage_proxy: don't pass `gossiper&` and `messaging_service&` during initialization These services are now passed during `init_messaging_service`, and that's when the `remote` object is constructed. The `remote` object is then destroyed in `uninit_messaging_service`. Also, `migration_manager*` became `migration_manager&` in `init_messaging_service`.	2023-06-14 11:41:36 +02:00
Kamil Braun	10f11b89ea	storage_proxy: prepare for missing `remote` Prepare the users of `remote` for the possibility that it's gone. The `remote()` accessor throws an error if it's gone. Observe that `remote()` is only used in places where it's verified that we really want to send a message to a remote node, with a small exception: `truncate_blocking`, which truncates locally by sending an RPC to ourselves (and truncate always sends RPC to the whole cluster; we might want to change this behavior in the future, see #11087). Other places are easy to check (it's either implementations of `apply_remotely` which is only called for remote nodes, or there's an `if` that checks we don't apply the operation to ourselves). There is one direct access to `_remote` which checks first if `_remote` is available: `storage_proxy::is_alive`. If `_remote` is unavailable, we consider nodes other than us dead. Indeed, if `gossiper` is unavailable, we didn't have a chance to gossip with other nodes and mark them alive.	2023-06-14 11:41:36 +02:00
Kamil Braun	ddcbade919	storage_proxy: don't access `remote` when calculating target replicas for local queries We only want to access `remote` when it's necessary - when we're performing a query that involves remote nodes. We want to support local queries when `remote` (in particular, `gossiper&`) is unavailable. Add a helper, `storage_proxy::filter_replicas_for_read`, which will check if it's a local query and return early in that case without accessing `remote`.	2023-06-14 11:41:34 +02:00
Kamil Braun	ff8d88a228	storage_proxy: introduce const version of `remote()` One version is implemented using the other (with `const_cast`) because some additional safety checks will be added in later commit.	2023-06-13 12:44:03 +02:00
Kamil Braun	0ef35ceed4	service: storage_proxy: make hint write handlers cancellable Whether a write handler should be cancellable is now controlled by a parameter passed to `create_write_response_handler`. We plumb it down from `send_to_endpoint` which is called by hints manager. This will cause hint write handlers to immediately timeout when we shutdown or when a destination node is marked as dead. Fixes #8079	2023-05-29 11:03:18 +02:00
Kamil Braun	eddb7406b4	service: storage_proxy: rename `view_update_handlers_list` The list will be used for non-view-update write handlers as well, so generalize the name. Also generalize some variable names used in the implementation. This commit only renames things + some comments were added, there are no logical changes.	2023-05-29 10:59:50 +02:00
Kamil Braun	c7ef9a12ee	service: storage_proxy: make it possible to cancel all write handler types The `view_update_write_response_handler` class, which is a subclass of `abstract_write_response_handler`, was created for a single purpose: to make it possible to cancel a handler for a view update write, which means we stop waiting for a response to the write, timing out the handler immediately. This was done to solve issue with node shutdown hanging because it was waiting for a view update to finish; view updates were configured with 5 minute timeout. See #3966, #4028. Now we're having a similar problem with hint updates causing shutdown to hang in tests (#8079). `view_update_write_response_handler` implements cancelling by adding itself to an intrusive list which we then iterate over to timeout each handler when we shutdown or when gossiper notifies `storage_proxy` that a node is down. To make it possible to reuse this algorithm for other handlers, move the functionality into `abstract_write_response_handler`. We inherit from `bi::list_base_hook` so it introduces small memory overhead to each write handler (2 pointers) which was only present for view update handlers before. But those handlers are already quite large, the overhead is small compared to their size. Not all handlers are added to the cancelling list, this is controlled by the `cancellable` parameter passed to the constructor. For now we're only cancelling view handlers as before. In following commits we'll also cancel hint handlers.	2023-05-29 10:42:57 +02:00
Petr Gusev	052b91fb1f	storage_proxy: rename get_live_sorted_endpoints->get_endpoints_for_reading We are going to use remapped_endpoints_for_reading, we need to make sure we use it in the right place. The get_live_sorted_endpoints function looks like what we need - it's used in all read code paths. From its name, however, this was not obvious. Also, we add the parameter ks_name as we'll need it to pass to remapped_endpoints_for_reading.	2023-05-09 18:42:03 +04:00
Pavel Emelyanov	739455c3aa	code: Remove global proxy No code needs global proxy anymore. Keep on-stack values in main and cql_test_env and keep the pointer on debug:: namespace. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-21 14:18:59 +03:00
Pavel Emelyanov	ab8fc0e166	proxy: Carry replication map with repair mutation(s) The create_write_response_handler() for read repair needs the e.r.m. from the caller, because it effectively accepts list of endpoints from it. So this patch equips all read_repair_mutation-s with the e.r.m. pointer so that the handler creation can use it. It's the same for all mutations, so it's a waste of space, but it's not bad -- there's typically few mutations in this range and the entry passed there is temporary, so even lots of them won't occupy lots of memory for long. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-14 14:03:39 +03:00
Pavel Emelyanov	140f373e15	proxy: Wrap read repair entries into read_repair_mutation The schedule_repair() operates on a map of endpoint:mutations pairs. Next patch will need to extend this entry and it's going to be easier if the entry is wrapped in a helper structure in advance. This is where the forwardable reference cursor from the previous patch gets its user. The schedule_repair() produces a range of rvalue wrappers, but the create_write_response_handler accepting it is OK, it copies mutations anyway. The printing operator is added to facilitate mutations logging from mutate_internal() method. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-12-14 14:01:12 +03:00
Michał Jadwiszczak	360dbf98f1	storage_proxy: add `describe_ring()` method In order to execute `DESC CLUSTER`, there has to be a way to describe ring. `storage_service` is not available at query execution. This patch adds `describe_ring()` as a method of `storage_proxy()` (using helper function from `locator/util.hh`).	2022-12-10 12:51:05 +01:00
Benny Halevy	731a74c71f	storage_proxy: pass topology& to sort_endpoints_by_proximity It mustn't use the latest topology that may differ from the one used by the query as it may be missing nodes (e.g. after concurrent decommission). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-22 15:02:40 +02:00
Benny Halevy	ab3fc1e069	storage_proxy: pass topology& to is_worth_merging_for_range_query It mustn't use the latest topology that may differ from the one used by the query as it may be missing nodes (e.g. after concurrent decommission). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-22 15:01:58 +02:00
Avi Kivity	a2da08f9f9	storage_proxy: hold effective_replication_map for the duration of a paxos transaction Luckily, all topology calculations are done in get_paxos_participants(), so all we have to do is it hold the effective_replication_map for the duration of the transaction, and pass it to get_paxos_participants(). This ensures that the coordinator knows about all in-flight requests and can fence them from topology changes.	2022-10-13 14:27:26 +03:00
Avi Kivity	69aaa5e131	storage_proxy: move paxos_response_handler class to .cc file It's not used elsewhere.	2022-10-13 14:27:26 +03:00
Avi Kivity	b2f3934e95	storage_proxy: deinline paxos_response_handler constructor/destructor They have no business being inline as it's a heavyweight object.	2022-10-13 14:27:26 +03:00
Avi Kivity	94e4ff11be	storage_proxy: use consistent effective_replication_map for counter coordinator Hold the effective_replication_map while talking to the counter leader, to allow for fencing in the future. The code is somewhat awkward because the API allows for multiple keyspaces to be in use. The error code generation, already broken as it doesn't use the correct table, continues to be broken in that it doesn't use the correct effective_replication_map, for the same reason.	2022-10-13 14:27:23 +03:00
Avi Kivity	406a046974	storage_proxy: improve consistency in query_partition_key_range{,_concurrent} query_partition_key_range captures a token_metadata_ptr and uses it consistently in sequential calls to query_partition_key_range_concurrent (via tail recursion), but each invocation of query_partition_key_range_concurrent captures its own effective_replication_map_ptr. Since these are captured at different times, they can be inconsistent after the first iteration. Fix by capturing it once in the caller and propagating it everywhere.	2022-10-13 13:56:52 +03:00
Avi Kivity	86a48cf12f	storage_proxy: use consistent token_metadata with rest of singular read query_singular() uses get_token_metadata_ptr() and later, in get_read_executor(), captures the effective_replication_map(). This isn't a bug, since the two are captured in the same continuation and are therefore consistent, but a way to ensure it stays so is to capture the effective_replication_map earlier and derive the token_metadata from it.	2022-10-13 13:46:04 +03:00
Pavel Emelyanov	2b8636a2a9	storage_proxy.hh: Remove unused headers Add needed forward declarations and fix indirect inclusions in some .ccs Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #11679	2022-10-02 20:48:50 +03:00
Benny Halevy	64140ccf05	cql3, storage_proxy: add support for TRUNCATE USING TIMEOUT Extend the cql3 truncate statement to accept attributes, similar to modification statements. To achieve that we define cql3::statements::raw::truncate_statement derived from raw::cf_statement, and implement its pure virtual prepare() method to make a prepared truncate_statement. The latter, statements::truncate_statement, is no longer derived from raw::cf_statement, and just stores a schema_ptr to get to the keyspace and column_family names. `test_truncate_using_timeout` cql-pytest was added to test the new USING TIMEOUT feature. Fixes #11408 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-09-26 18:30:39 +03:00
Pavel Emelyanov	b6fdea9a79	code: Call sort_endpoints_by_proximity() via topology The method is about to be moved from snitch to topology, this patch prepares the rest of the code to use the latter to call it. The topology's method just calls snitch, but it's going to change in the next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-09-05 15:14:01 +03:00
Pavel Emelyanov	642e50f3e3	snitch: Move is_worth_merging_for_range_query to proxy Proxy is the only place that calls this method. Also the method name suggests it's not something "generic", but rather an internal logic of proxy's query processing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-09-05 15:10:46 +03:00
Benny Halevy	d295d8e280	everywhere: define locator::host_id as a strong tagged_uuid type So it can be distinguished from other uuid-based identifiers in the system. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #11276	2022-08-12 06:01:44 +03:00
Avi Kivity	01a614fb4d	storage_proxy: use consistent replication map on write path Capture a replication map just once in abstract_write_handler::_effective_replication_map_ptr and use it in all write handlers. A few accesses to get the topology still remain, they will be fixed up in a later patch.	2022-08-11 17:58:42 +03:00
Avi Kivity	f1b0e3d58e	storage_proxy: convert get_live{,_sorted}_endpoints() to accept an effective_replication_map Allow callers to use consistent effective_replication_map:s across calls by letting the caller select the object to use.	2022-08-11 17:58:42 +03:00
Botond Dénes	2656968db2	service/storage_proxy: propagate last position on digest reads We want to transmit the last position as determined by the replica on both result and digest reads. Result reads already do that via the query::result, but digest reads don't yet as they don't return the full query::result structure, just the digest field from it. Add the last position to the digest read's return value and collect these in the digest resolver, along with the returned digests.	2022-08-10 06:03:37 +03:00
Botond Dénes	1b669cefed	service/storage_proxy: add get_tombstone_limit() To be used by coordinator side code to determine the correct tombstone limit to pass to read-command (tombstone limit field added in the next commit). When this limit is non-zero, the replica will start cutting pages after the tombstone limit is surpassed. This getter works similarly to `get_max_result_size()`: if the cluster feature for empty replica pages is set, it will return the value configured via db::config::query_tombstone_limit. System queries always use a limit of 0 (unlimited tombstones).	2022-08-09 10:00:40 +03:00
Kamil Braun	0a4e701b50	service: storage_proxy: pass `migration_manager*` to `init_messaging_service` `migration_manager` lifetime is longer than the lifetime of "storage proxy's messaging service part" - that is, `init_messaging_service` is called after `migration_manager` is started, and `uninit_messaging_service` is called before `migration_manager` is stopped. Thus we don't need to hold an owning pointer to `migration_manager` here. Later, when `init_messaging_service` will actually construct `remote`, this will be a reference, not a pointer. Also observe that `_mm` in `remote` is only used in handlers, and handlers are unregistered before `_mm` is nullified, which ensures that handlers are not running when `_mm` is nullified. (This argument shows why the code made sense regardless of our switch from shared_ptr to raw ptr).	2022-08-04 12:19:43 +02:00
Kamil Braun	078900042f	service: storage_proxy: pass `shared_ptr<gossiper>` to `start_hints_manager` No need to call `_remote->gossiper().shared_from_this()` from within storage_proxy now.	2022-08-04 12:16:09 +02:00
Kamil Braun	d9d10d87ec	service: storage_proxy: establish private section in `remote` Only the (un)init, send_*, and `is_alive` functions are public, plus a getter for gossiper.	2022-08-04 12:16:05 +02:00
Kamil Braun	7364d453dd	service: storage_proxy: remove `migration_manager` pointer The ownership is passed to `remote`, which now contains a `shared_ptr<migration_manager>`.	2022-08-04 12:15:36 +02:00
Kamil Braun	eddd3b8226	service: storage_proxy: remove `_gossiper` field Access `gossiper` through `_remote`. Later, all those accesses will handle missing `remote`. Note that there are also accesses through the `remote()` internal getter. The plan is as follows: - direct accesses through `_remote` will be modified to handle missing `_remote` (these won't cause an error) - `remote()` will throw if `_remote` is missing (`remote()` is only used for operations which actually need to send a message to a remote node).	2022-08-04 12:15:35 +02:00
Kamil Braun	ab946e392f	alternator: ttl: pass `gossiper&` to `expiration_service` This allows us to remove the `gossiper()` getter from `storage_proxy`.	2022-08-04 12:12:43 +02:00
Kamil Braun	3e73de9a40	service: storage_proxy: introduce `is_alive` helper A helper is introduced both in `remote` and in `storage_proxy`. The `storage_proxy` one calls the `remote` one. In the future it will also handle a missing `remote`. Then it will report only the local node to be alive and other nodes dead while `remote` is missing. The change reduces the number of functions using the `_gossiper` field in `storage_proxy`.	2022-08-04 12:12:41 +02:00
Kamil Braun	2aff2fea00	service: storage_proxy: remove `_messaging` reference All uses of `messaging_service&` have been moved to `remote`.	2022-08-02 19:55:12 +02:00
Kamil Braun	cf931c7863	service: storage_proxy: move `connection_dropped` to `remote`	2022-08-02 19:55:12 +02:00
Kamil Braun	2203d4fa09	service: storage_proxy: make `encode_replica_exception_for_rpc` a static function No need for this ugly template to be part of the `storage_proxy` header.	2022-08-02 19:55:12 +02:00
Kamil Braun	3499bc7731	service: storage_proxy: move `handle_write` to `remote` It is a helper used by `receive_mutation_handler` and `handle_paxos_learn`.	2022-08-02 19:55:12 +02:00
Kamil Braun	ba88ad8db0	service: storage_proxy: move `handle_paxos_prune` to `remote`	2022-08-02 19:55:12 +02:00
Kamil Braun	548767f91e	service: storage_proxy: move `handle_paxos_accept` to `remote`	2022-08-02 19:55:12 +02:00
Kamil Braun	807c7f32de	service: storage_proxy: move `handle_paxos_prepare` to `remote`	2022-08-02 19:55:12 +02:00
Kamil Braun	0e431e7c03	service: storage_proxy: move `handle_truncate` to `remote`	2022-08-02 19:55:12 +02:00
Kamil Braun	f8c1ba357f	service: storage_proxy: move `handle_read_digest` to `remote`	2022-08-02 19:55:12 +02:00
Kamil Braun	43997af40f	service: storage_proxy: move `handle_read_mutation_data` to `remote`	2022-08-02 19:55:12 +02:00

1 2 3 4 5 ...

394 Commits