An unavailable exception means that the operation was not started and can be
retried safely. If LWT fails in the learn stage, though, it most
certainly means that its effect is already observable. The patch
returns a timeout exception instead, which signals uncertainty.
Fixes #7258
Message-Id: <20201001130724.GA2283830@scylladb.com>
The backlog (current and max) in VIEW_BACKLOG is not updated, but the
nodes are updating VIEW_BACKLOG all the time. For example:
```
INFO 2020-03-06 17:13:46,761 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.3, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486026590,718)
INFO 2020-03-06 17:13:46,821 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.2, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486026531,742)
INFO 2020-03-06 17:13:47,765 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.3, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486027590,721)
INFO 2020-03-06 17:13:47,825 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.2, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486027531,745)
INFO 2020-03-06 17:13:48,772 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.3, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486028590,726)
INFO 2020-03-06 17:13:48,833 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.2, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486028531,750)
INFO 2020-03-06 17:13:49,772 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.3, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486029590,729)
INFO 2020-03-06 17:13:49,832 [shard 0] storage_service - Update system.peers table: endpoint=127.0.0.2, app_state=VIEW_BACKLOG, versioned_value=Value(0:18446744073709551615:1583486029531,753)
```
The downsides of such updates:
- Introduces more gossip exchange traffic
- Updates system.peers all the time
The extra, unnecessary gossip traffic is fine for a cluster in good
shape, but when some of the nodes or shards are loaded, such messages and
the handling of such messages can make the system even busier.
With this patch, VIEW_BACKLOG is updated only when the backlog has really
changed.
By the way, we could even make the update happen only when the backlog changes
by more than a threshold, e.g. 5%, which would reduce the traffic even
further; a sketch of that idea follows.
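A minimal sketch of that threshold idea (illustrative standard C++, not the actual Scylla code; the names and the 5% figure are assumptions):

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

// Illustrative sketch only; the real code lives in the view update backlog
// handling and publishes via gossiper application states.
struct backlog_publisher {
    double threshold = 0.05;       // publish only on a >5% relative change
    uint64_t last_published = 0;

    // Returns true if the new backlog value should be pushed to gossip.
    bool should_publish(uint64_t current) {
        if (current == last_published) {
            return false;          // nothing changed, skip the gossip update
        }
        double base = std::max<uint64_t>(last_published, 1);
        if (std::abs(double(current) - double(last_published)) / base < threshold) {
            return false;          // change is below the threshold, skip
        }
        last_published = current;
        return true;
    }
};
```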
Fixes #5970
Before updating the _last_[cp]key (for a subsequent .fetch_page())
the pager checks 'if the pager is not exhausted OR the result
has data'.
The check is broken: if the pager is not exhausted but the
result is empty, the call for the keys unconditionally tries to
reference the last element of an empty vector. The 'not exhausted'
condition with an empty result can happen if short_read is set,
which, in turn, unconditionally happens upon meeting the partition
end when visiting the partition with the result builder.
The correct check is 'if the pager is not exhausted AND
the result has data': the _last_[pc]key-s should be taken for
continuation (not exhausted), but can only be taken if the result is
not empty (has data).
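A schematic illustration of the corrected condition (names simplified and hypothetical; the real pager tracks the last partition/clustering keys and a paging state):

```cpp
#include <optional>
#include <vector>

struct row { /* clustering key, cells, ... */ };

// Sketch: keys for the next fetch_page() must only be taken when the pager
// is not exhausted AND the page actually contains rows; with the old OR
// condition an empty short-read page dereferenced back() on an empty vector.
void maybe_remember_last_keys(bool exhausted, const std::vector<row>& rows,
                              std::optional<row>& last_row) {
    if (!exhausted && !rows.empty()) {   // was: (!exhausted || !rows.empty())
        last_row = rows.back();          // safe: rows is known to be non-empty
    }
}
```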
fixes: #7263
tests: unit(dev), but tests don't trigger this corner case
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Message-Id: <20200921124329.21209-1-xemul@scylladb.com>
"
Based on https://github.com/scylladb/scylla/issues/7199,
it looks like storage_service::set_tables_autocompaction
may be called on shards other than 0.
Use run_with_api_lock to both serialize the action
and to check _initialized on shard 0.
Fixes #7199
Test: unit(dev), compaction_test:TestCompaction_with_SizeTieredCompactionStrategy.disable_autocompaction_nodetool_test
"
* tag 'set_tables_autocompaction-v1' of github.com:bhalevy/scylla:
storage_service: set_tables_autocompaction: fixup indentation
storage_service: set_tables_autocompaction: run_with_api_lock
storage_service: set_tables_autocompaction: use do_with to hold on to args
storage_service: set_tables_autocompaction: log message in info level
"
This series follows the suggestion from https://github.com/scylladb/scylla/pull/7203#issuecomment-689499773 discussion and deprecates a number of cluster features. The deprecation does not remove any features from the strings sent via gossip to other nodes, but it removes all checks for these features from code, assuming that the checks are always true. This assumption is quite safe for features introduced over 2 years ago, because the official upgrade path only allows upgrading from a previous official release, and these feature bits were introduced many release cycles ago.
All deprecated features were picked from a `git blame` output which indicated that they come from 2018:
```git
e46537b7d3 2016-05-31 11:44:17 +0200 RANGE_TOMBSTONES_FEATURE = "RANGE_TOMBSTONES";
85c092c56c 2016-07-11 10:59:40 +0100 LARGE_PARTITIONS_FEATURE = "LARGE_PARTITIONS";
02bc0d2ab3 2016-12-09 22:09:30 +0100 MATERIALIZED_VIEWS_FEATURE = "MATERIALIZED_VIEWS";
67ca6959bd 2017-01-30 19:50:13 +0000 COUNTERS_FEATURE = "COUNTERS";
815c91a1b8 2017-04-12 10:14:38 +0300 INDEXES_FEATURE = "INDEXES";
d2a2a6d471 2017-08-03 10:53:22 +0300 DIGEST_MULTIPARTITION_READ_FEATURE = "DIGEST_MULTIPARTITION_READ";
ecd2bf128b 2017-09-01 09:55:02 +0100 CORRECT_COUNTER_ORDER_FEATURE = "CORRECT_COUNTER_ORDER";
713d75fd51 2017-09-14 19:15:41 +0200 SCHEMA_TABLES_V3 = "SCHEMA_TABLES_V3";
2f513514cc 2017-11-29 11:57:09 +0000 CORRECT_NON_COMPOUND_RANGE_TOMBSTONES = "CORRECT_NON_COMPOUND_RANGE_TOMBSTONES";
0be3bd383b 2017-12-04 13:55:36 +0200 WRITE_FAILURE_REPLY_FEATURE = "WRITE_FAILURE_REPLY";
0bab3e59c2 2017-11-30 00:16:34 +0000 XXHASH_FEATURE = "XXHASH";
fbc97626c4 2018-01-14 21:28:58 -0500 ROLES_FEATURE = "ROLES";
802be72ca6 2018-03-18 06:25:52 +0100 LA_SSTABLE_FEATURE = "LA_SSTABLE_FORMAT";
71e22fe981 2018-05-25 10:37:54 +0800 STREAM_WITH_RPC_STREAM = "STREAM_WITH_RPC_STREAM";
```
Tests: unit(dev)
manual(verifying with cqlsh that the feature strings are indeed still set)
"
Closes #7234.
* psarna-clean_up_features:
gms: add comments for deprecated features
gms: remove unused feature bits
streaming: drop checks for RPC stream support
roles: drop checks for roles schema support
service: drop checks for xxhash support
service: drop checks for write failure reply support
sstables: drop checks for non-compound range tombstones support
service: drop checks for v3 schema support
repair: drop checks for large partitions support
service: drop checks for digest multipartition read support
sstables: drop checks for correct counter order support
cql3: drop checks for materialized views support
cql3: drop checks for counters support
cql3: drop checks for indexing support
- Remove get_pending_ranges and introduce has_pending_ranges, since the
caller only needs to know if there is a pending range for the keyspace
and the node.
- Remove print_pending_ranges, which is only used for logging. If we
really want to log the new pending token ranges, we can do so when we
set them.
This removal of _pending_ranges makes copying of token_metadata faster
and reduces the chance of causing a reactor stall.
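Roughly, the resulting interface answers a boolean question instead of handing out a container; a toy model (types simplified, not the real token_metadata API):

```cpp
#include <map>
#include <set>
#include <string>

// Illustrative model only: token_metadata keeps pending ranges per keyspace
// keyed by endpoint; callers that previously copied the whole container just
// to test emptiness now ask a yes/no question instead.
struct pending_ranges_model {
    // keyspace -> endpoints that have at least one pending range
    std::map<std::string, std::set<std::string>> pending;

    bool has_pending_ranges(const std::string& keyspace, const std::string& endpoint) const {
        auto it = pending.find(keyspace);
        return it != pending.end() && it->second.count(endpoint) > 0;
    }
};
```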
Refs: #7220
Based on https://github.com/scylladb/scylla/issues/7199,
it looks like storage_service::set_tables_autocompaction
may be called on shards other than 0.
Use run_with_api_lock to both serialize the action
and to check _initialized on shard 0.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
In preparation for calling is_initialized(), which may yield.
Also, the way the tables vector is currently captured is inefficient.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
The xxhash algorithm has been supported for over 2 years and upgrades are only
allowed from versions that already have the support, so the checks
are hereby dropped.
Write failure reply has been supported for over 2 years and upgrades are only
allowed from versions that already have the support, so the checks
are hereby dropped.
Digest multipartition read has been supported for over 2 years and upgrades are only
allowed from versions that already have the support, so the checks
are hereby dropped.
"
Migration manager installs several cluster feature change listeners.
The listeners will call update_schema_version_and_announce() when cluster
features are enabled, which does this:
```cpp
return update_schema_version(proxy, features).then([] (utils::UUID uuid) {
    return announce_schema_version(uuid);
});
```
It first updates the schema version and then publishes it via
gossip in announce_schema_version(). It is possible that the
announce_schema_version() part of the first schema change gets
deferred and executes after the other four calls to
update_schema_version_and_announce(). In that case it installs the old
schema version in gossip instead of the more recent one.
The fix is to serialize schema digest calculation and publishing.
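A minimal model of that serialization (plain C++ with a mutex; illustrative only — the real fix serializes the asynchronous calculation and announcement, but the invariant is the same: computing the version and publishing it form one critical section):

```cpp
#include <functional>
#include <mutex>
#include <string>

// Illustrative sketch: calculating the schema version and announcing it via
// gossip happen as one unit, so a delayed announce of an older version can
// no longer overwrite a newer one.
class schema_version_publisher {
    std::mutex _mutex;                                  // models the serialization added by the fix
    std::function<std::string()> _calculate;            // e.g. digest over schema tables
    std::function<void(const std::string&)> _announce;  // e.g. gossip update
public:
    schema_version_publisher(std::function<std::string()> calc,
                             std::function<void(const std::string&)> announce)
        : _calculate(std::move(calc)), _announce(std::move(announce)) {}

    void update_and_announce() {
        std::lock_guard<std::mutex> guard(_mutex);      // compute + publish atomically
        _announce(_calculate());
    }
};
```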
Refs #7200
This problem also brought my attention to the initialization code, which could be
prone to the same issue.
The storage service computes gossiper states before it starts the
gossiper; among them, the node's schema version. There are two problems with that.
The first is that computing the schema version and publishing it is not
atomic, so it is not safe against concurrent schema changes or schema
version recalculations. It is not mutually exclusive with
recalculate_schema_version() calls, and we could end up with the old
(and incorrect) schema version being advertised in gossip.
The second problem is that we should not allow the database layer to call
into the gossiper layer before it is fully initialized, as this may
produce undefined behavior.
Maybe we're not doing concurrent schema changes/recalculations now,
but it is easy to imagine that this could change for whatever reason
in the future.
The solution for both problems is to break the cyclic dependency
between the database layer and the storage_service layer by having the
database layer not use the gossiper at all. The database layer
publishes schema version inside the database class and allows
installing listeners on changes. The storage_service layer asks the
database layer for the current version when it initializes, and only
after that installs a listener which will update the gossiper.
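Approximately, the decoupling looks like this (a toy model, not the actual classes): the database owns the version and exposes an observer hook; the storage service reads the current value at initialization and subscribes, pushing later changes to gossip.

```cpp
#include <functional>
#include <string>
#include <vector>

// Illustrative model of the decoupling: the database layer owns the schema
// version and notifies listeners; it never talks to the gossiper directly.
class database_model {
    std::string _schema_version;
    std::vector<std::function<void(const std::string&)>> _observers;
public:
    const std::string& get_version() const { return _schema_version; }

    void observe_version(std::function<void(const std::string&)> f) {
        _observers.push_back(std::move(f));
    }

    void set_version(std::string v) {
        _schema_version = std::move(v);
        for (auto& f : _observers) {
            f(_schema_version);   // e.g. storage_service pushes it to gossip
        }
    }
};
```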
Tests:
- unit (dev)
- manual (3 node ccm)
"
* tag 'fix-schema-digest-calculation-race-v1' of github.com:tgrabiec/scylla:
db, schema: Hide update_schema_version_and_announce()
db, storage_service: Do not call into gossiper from the database layer
db: Make schema version observable
utils: updateable_value_source: Introduce as_observable()
schema: Fix race in schema version recalculation leading to stale schema version in gossip
The storage service computes gossiper states before it starts the
gossiper; among them, the node's schema version. There are two problems with that.
The first is that computing the schema version and publishing it is not
atomic, so it is not safe against concurrent schema changes or schema
version recalculations. It is not mutually exclusive with
recalculate_schema_version() calls, and we could end up with the old
(and incorrect) schema version being advertised in gossip.
The second problem is that we should not allow the database layer to call
into the gossiper layer before it is fully initialized, as this may
produce undefined behavior.
The solution for both problems is to break the cyclic dependency
between the database layer and the storage_service layer by having the
database layer not use the gossiper at all. The database layer
publishes schema version inside the database class and allows
installing listeners on changes. The storage_service layer asks the
database layer for the current version when it initializes, and only
after that installs a listener which will update the gossiper.
This also allows us to drop unsafe functions like update_schema_version().
Migration manager installs several feature change listeners:
```cpp
if (this_shard_id() == 0) {
    _feature_listeners.push_back(_feat.cluster_supports_view_virtual_columns().when_enabled(update_schema));
    _feature_listeners.push_back(_feat.cluster_supports_digest_insensitive_to_expiry().when_enabled(update_schema));
    _feature_listeners.push_back(_feat.cluster_supports_cdc().when_enabled(update_schema));
    _feature_listeners.push_back(_feat.cluster_supports_per_table_partitioners().when_enabled(update_schema));
}
```
They will call update_schema_version_and_announce() when features are enabled, which does this:
```cpp
return update_schema_version(proxy, features).then([] (utils::UUID uuid) {
    return announce_schema_version(uuid);
});
```
So it first updates the schema version and then publishes it via
gossip in announce_schema_version(). It is possible that the
announce_schema_version() part of the first schema change gets
deferred and executes after the other four calls to
update_schema_version_and_announce(). In that case it installs the old
schema version in gossip instead of the more recent one.
The fix is to serialize schema digest calculation and publishing.
Refs #7200
"
This series fixes a bug in `appending_hash<row>` that caused it to ignore any cells after the first NULL. It also adds a cluster feature which starts using the new hashing only after the whole cluster is aware of it. The series comes with tests, which reproduce the issue.
Fixes #4567
Based on #4574
"
* psarna-fix_ignoring_cells_after_null_in_appending_hash:
test: extend mutation_test for NULL values
tests/mutation: add reproducer for #4567
gms: add a cluster feature for fixed hashing
digest: add null values to row digest
mutation_partition: fix formatting
appending_hash<row>: make publicly visible
With the new hashing routine, null values are taken into account
when computing the row digest. The previous behavior had a regression
which stopped computing the hash after the first null value
was encountered, but the original behavior was also prone
to errors - e.g. the row [1, NULL, 2] was not distinguishable
from [1, 2, NULL], because their hashes were identical.
This hashing is not yet active - it will only be used after
the next commit introduces a proper cluster feature for it.
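A toy illustration of the idea (this is not the actual xxhash-based appending_hash<row>, just a sketch showing that a marker for every cell, including nulls, is fed into the digest):

```cpp
#include <cstdint>
#include <optional>
#include <vector>

// Toy digest: hash an explicit marker for each cell, so [1, NULL, 2] and
// [1, 2, NULL] no longer collide and cells after a null are not ignored.
uint64_t digest_row(const std::vector<std::optional<int64_t>>& cells) {
    uint64_t h = 1469598103934665603ull;    // FNV-1a offset basis
    auto mix = [&h](uint64_t v) {
        for (int i = 0; i < 8; ++i) {
            h ^= (v >> (i * 8)) & 0xff;
            h *= 1099511628211ull;          // FNV-1a prime
        }
    };
    for (const auto& cell : cells) {
        if (cell) {
            mix(1);                         // "cell present" marker
            mix(static_cast<uint64_t>(*cell));
        } else {
            mix(0);                         // the null marker is hashed too
        }
    }
    return h;
}
```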
In commit 7d86a3b208 (storage_service:
Make replacing node take writes), the TOKENS application state of the
replacing node is added into gossip and propagated to the cluster after
the initial start of the gossip service. This can cause the race below:
1. The replacing node replaces the old dead node with the same IP address.
2. The replacing node starts gossip without the TOKENS application state.
3. Other nodes in the cluster replace the old dead node's application state
version with the new replacing node's version.
4. The replacing node dies.
5. The replace operation is performed again; the TOKENS application state is
not present and the replace operation fails.
To fix this, we always add the TOKENS application state when the
gossip service starts.
Fixes: #7166
Backports: 4.1 and 4.2
check_and_repair_cdc_streams, in case it decides to create a new CDC
generation, updates the STATUS application state so that other nodes it
gossips with pick up the generation change.
The node which runs check_and_repair_cdc_streams also learns about the
generation change: the STATUS update causes a change notification.
This happens during the add_local_application_state call
which caused the STATUS update; it leads to calling
handle_cdc_generation, which detects a generation change and calls
add_local_application_state with the new generation's timestamp.
Thus, we get a recursive add_local_application_state call. Unfortunately,
the function takes a lock before doing on_change notifications, so we
get a deadlock.
This commit prevents the deadlock.
We update the local variable which stores the generation timestamp
before updating STATUS, so handle_cdc_generation won't consider
the observed generation to be new, hence it won't perform the recursive
add_local_application_state call.
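Schematically, the ordering that breaks the recursion looks like this (illustrative types only, not the actual Scylla code):

```cpp
#include <chrono>
#include <functional>

// Illustrative ordering only: remember the new generation timestamp *before*
// updating STATUS, so the on_change notification triggered by the update does
// not treat the generation as new and re-enter add_local_application_state.
struct cdc_generation_tracker {
    std::chrono::milliseconds _known_generation{0};

    void publish_new_generation(std::chrono::milliseconds ts,
                                const std::function<void(std::chrono::milliseconds)>& update_status) {
        _known_generation = ts;   // record first: this is what breaks the recursion
        update_status(ts);        // may synchronously trigger the generation-change handler
    }

    bool is_new(std::chrono::milliseconds ts) const {
        return ts > _known_generation;
    }
};
```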
We use mutate_locally to handle hint mutations that arrive
through RPC. The current implementation makes no distinction between
a mutation that came through the hint verb and one that came through a
mutation verb, so the same smp group is used for both. This commit adds
the ability to reference a different smp group in the private mutate_locally
calls and makes the handlers pass the correct smp group to mutate_locally.
Hints and regular writes currently use the same cross-shard
operation semaphore, which can lead to priority inversion, making
cross-shard writes wait for cross-shard hints. This commit adds
an smp_service_group for hints and uses it in the mutate_hint
function.
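A toy model of why a separate group helps (illustrative standard C++; the real code uses a dedicated Seastar smp_service_group rather than explicit counters, and the capacities below are made up):

```cpp
#include <cstddef>

// Toy model of the priority-inversion fix: hints and foreground writes draw
// from separate cross-shard concurrency budgets, so a flood of hint replays
// cannot exhaust the capacity that regular writes need.
struct smp_group_model {
    std::size_t capacity;
    std::size_t in_flight;

    bool try_start() {
        if (in_flight == capacity) {
            return false;          // this group is full; other groups are unaffected
        }
        ++in_flight;
        return true;
    }
    void finish() { --in_flight; }
};

struct write_paths {
    smp_group_model writes{128, 0};   // regular mutations keep their own budget
    smp_group_model hints{32, 0};     // hint replay gets a separate, smaller budget

    bool start_mutation(bool is_hint) {
        return (is_hint ? hints : writes).try_start();
    }
};
```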
Try to look up and use the schema from the local schema_registry
when there is a schema mismatch between the most recent schema
version and the one stored inside the `frozen_mutation` for
the accepted proposal.
When such a situation happens, the stored `frozen_mutation` can
be applied only if we are lucky enough and the column_mapping in
the mutation is "compatible" with the new table schema.
It wouldn't work if, for example, the columns are reordered, or
some columns which are referenced by an LWT query are dropped.
With this patch we are able to mitigate these cases as long as the
referenced schema is still present in the node's cache (e.g.
the node didn't restart/crash and the cache entry is not too old
to be evicted).
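A hypothetical sketch of the mitigation (all names below are made up for illustration; the real code uses Scylla's schema_registry, schema versions and column_mapping):

```cpp
#include <map>
#include <memory>

// Hypothetical stand-ins for schema objects and the local schema cache.
struct schema_stub { int version; };
using schema_ptr_stub = std::shared_ptr<const schema_stub>;

struct schema_cache {
    std::map<int, schema_ptr_stub> by_version;

    // Returns the cached schema for `version`, or nullptr if it was evicted.
    schema_ptr_stub lookup(int version) const {
        auto it = by_version.find(version);
        if (it == by_version.end()) {
            return nullptr;
        }
        return it->second;
    }
};

// When the mutation carries an older schema version, try to resolve that
// exact version locally instead of force-applying with the current schema.
schema_ptr_stub resolve_for_mutation(const schema_cache& cache,
                                     int mutation_version,
                                     schema_ptr_stub current) {
    if (current->version == mutation_version) {
        return current;             // versions match, nothing to do
    }
    if (auto cached = cache.lookup(mutation_version)) {
        return cached;              // apply with the schema the mutation was written against
    }
    return nullptr;                 // cache miss: the mismatch cannot be mitigated
}
```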
Tests: unit(dev, debug), dtest(paxos_tests.schema_mismatch_*_test)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20200827150844.624017-1-pa.solodovnikov@scylladb.com>
get_natural_endpoints returns the list of nodes holding a key.
There is a variation of the method that takes the key as a string; the
current implementation just casts the string to a bytes_view, which does
not work. Instead, this patch changes the implementation to use
from_nodetool_style_string to translate the key (in a nodetool-like
format) to a token.
Fixes #7134
Signed-off-by: Amnon Heiman <amnon@scylladb.com>
Consider:
1) Start n1,n2,n3
2) Stop n3
3) Start n4 to replace n3 but list n4 as seed node
4) Node n4 finishes replacing operation
5) Restart n2
6) Run SELECT * from system.peers on node n2 or node n1.
```
cqlsh> SELECT * from system.peers ;
 peer      | data_center | host_id | preferred_ip | rack | release_version | rpc_address | schema_version | supported_features | tokens
 127.0.0.3 | null        | null    | null         | null | null            | null        | null           | null               | {'-90410082611643223', '5874059110445936121'}
```
The replaced old node 127.0.0.3 shows in system.peers.
(Note, since commit 399d79fc6f (init: do not allow
replace-address for seeds), step 3 will be rejected. Assume we use a version without it)
The problem is that, after the restart, n2 sees n3 in gossip status SHUTDOWN.
The storage_service::handle_state_normal callback is called for
127.0.0.3. Since n4 uses different tokens than n3 (a seed node does not
bootstrap, so it uses new tokens instead of the tokens of n3, which is being
replaced), owned_tokens will be set. We see logs like:
```
[shard 0] storage_service - handle_state_normal: New node 127.0.0.3 at token 5874059110445936121
[shard 0] storage_service - Host ID collision for cbec60e5-4060-428e-8d40-9db154572df7 between 127.0.0.4 and 127.0.0.3; ignored 127.0.0.3
```
As a result, db::system_keyspace::update_tokens
will wrongly be called to write 127.0.0.3 into system.peers:
```cpp
if (!owned_tokens.empty()) {
    db::system_keyspace::update_tokens(endpoint, owned_tokens)
}
```
To fix, we should skip calling db::system_keyspace::update_tokens if the
node is present in endpoints_to_remove, as sketched below.
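Schematically, the fixed condition (types simplified, not the actual Scylla code):

```cpp
#include <functional>
#include <set>
#include <string>

// Sketch of the fix: do not persist tokens into system.peers for an endpoint
// that handle_state_normal has already queued for removal.
void maybe_update_tokens(const std::string& endpoint,
                         const std::set<long>& owned_tokens,
                         const std::set<std::string>& endpoints_to_remove,
                         const std::function<void(const std::string&, const std::set<long>&)>& update_tokens) {
    if (!owned_tokens.empty() && !endpoints_to_remove.count(endpoint)) {
        update_tokens(endpoint, owned_tokens);  // only for endpoints we are keeping
    }
}
```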
Refs: #4652
Refs: #6397
"
There's one last call to the global storage service left in the compaction code; it
comes from cleanup_compaction to get local token ranges for filtering.
The call in question is a pure wrapper over the database, so this set just
makes use of the database where it's already available (perform_cleanup)
and adds it where it's needed (perform_sstable_upgrade).
tests: unit(dev), nodetool upgradesstables
"
* 'br-remove-ss-from-compaction-3' of https://github.com/xemul/scylla:
storage_service: Remove get_local_ranges helper
compaction: Use database from options to get local ranges
compaction: Keep database reference on upgrade options
compaction: Keep database reference on cleanup options
db: Factor out get_local_ranges helper
This patch causes orphaned hints (hints that were written towards a node
that is no longer their replica) to be sent with a default write
timeout. This is what is currently done for non-orphaned hints.
Previously, the timeout was hardcoded to one hour. This could cause a
long delay while shutting down, as the hints manager waits until all ongoing
hint sending operations finish before stopping itself.
Fixes: #7051
"
We keep references to locator::token_metadata in many places.
Most of them are for read-only access and only a few want
to modify the token_metadata.
Recently, in 94995acedb,
we added yielding loops that access token_metadata in order
to avoid cpu stalls. To make that possible we need to make
sure the token_metadata object they are traversing won't change
mid-loop.
This series is a first step towards serializing updates to the
shared token metadata with respect to its readers.
Test: unit(dev)
Dtest: bootstrap_test:TestBootstrap.start_stop_test{,_node}, update_cluster_layout_tests.py -a next-gating(dev)
"
* tag 'constify-token-metadata-access-v2' of github.com:bhalevy/scylla:
api/http_context: keep a const sharded<locator::token_metadata>&
gossiper: keep a const token_metadata&
storage_service: separate get_mutable_token_metadata
range_streamer: keep a const token_metadata&
storage_proxy: delete unused get_restricted_ranges declaration
storage_proxy: keep a const token_metadata&
storage_proxy: get rid of mutable get_token_metadata getter
database: keep const token_metadata&
database: keyspace_metadata: pass const locator::token_metadata& around
everywhere_replication_strategy: move methods out of line
replication_strategy: keep a const token_metadata&
abstract_replication_strategy: get_ranges: accept const token_metadata&
token_metadata: rename calculate_pending_ranges to update_pending_ranges
token_metadata: mark const methods
token_ranges: pending_endpoints_for: return empty vector if keyspace not found
token_ranges: get_pending_ranges: return empty vector if keyspace not found
token_ranges: get rid of unused get_pending_ranges variant
replication_strategy: calculate_natural_endpoints: make token_metadata& param const
token_metadata: add get_datacenter_racks() const variant
Storage service and repair code have identical helpers to get local
ranges for a keyspace. Move this helper's code onto database; later it
will be reused in one more place.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Use a different getter for a token_metadata& that
may be changed so we can better synchronize readers
and writers of token_metadata and eventually allow
them to yield in asynchronous loops.
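The shape of the split, approximately (illustrative stub types; the real accessors return locator::token_metadata):

```cpp
// Readers get a const reference by default; only code that really mutates
// token metadata asks for the mutable variant, which keeps writers easy to
// audit and to serialize against the yielding read loops.
class token_metadata_holder {
    struct token_metadata_stub { /* tokens, pending ranges, topology, ... */ };
    token_metadata_stub _token_metadata;
public:
    const token_metadata_stub& get_token_metadata() const {
        return _token_metadata;            // the common, read-only path
    }
    token_metadata_stub& get_mutable_token_metadata() {
        return _token_metadata;            // explicit opt-in for writers
    }
};
```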
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
We'd like to strictly control who can modify token metadata
and nobody currently needs a mutable reference to storage_proxy::_token_metadata.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
All these places are (or have become, with the previous patches) storage
service methods using the global messaging service, so they can instead
access the messaging service through the local reference.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
There are 4 places that call this helper:
- storage proxy. Callers are rpc verb handlers and already have the proxy
at hand, from which they can get the messaging service instance
- repair. There's a local-global messaging instance at hand, and the caller
is in a verb handler too
- streaming. The caller is a verb handler, which is unregistered on stop, so
the messaging service instance can be captured
- migration manager itself. The caller already uses "this", so the messaging
service instance can be obtained from it
The better approach would be to make get_schema_definition a method of
migration_manager, but the manager is stopped for real on shutdown, thus
referencing it from the callers might not be safe and needs revisiting. At
the same time the messaging service is always alive, so using its reference
is safe.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Most of those places are non-static migration_manager methods,
plus one place where the local service instance is already at hand.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The local migration manager instance is already available at the caller, so
we can call a method on it. This is to facilitate the next patches.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The user of this verb is the migration manager, so the handler must be it as well.
The handler code now explicitly gets the global proxy. This call is safe, as the proxy
is not stopped nowadays. In the future we'll need to revisit the relation
between migration - proxy - stats anyway.
The use of the local migration manager is safe, as it happens in a verb handler, which
is unregistered and waited for completion on migration manager stop.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This function needs the messaging service inside, but the closest place it
can get one from is the storage_service API handlers. Temporarily move the call to the
global messaging service into the storage service; its turn for this cleanup will
come later.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
It is alive there, so it is safe to pass one to the lambda.
Once in forward_fn, it can be used to get the messaging service from.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Most of the places that need the messaging service in the proxy already use
the storage_proxy instance, so it is safe to get the local messaging service
from it too.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>