Currently, during group0 snapshot transfer, the node pulling
the snapshot will send the `raft_pull_topology_snapshot` verb even if
the cluster is not in topology-on-raft mode. The RPC handler returns an
empty snapshot in that case. However, using the verb outside topology on
raft causes problems:
- It can cause issues during rolling upgrade as the snapshot transfer
will keep failing on the upgraded nodes until the leader node is
upgraded,
- Topology changes on raft are still experimental, and using the RPC
outside experimental mode will prevent us from doing breaking changes
to it.
Solve the issue by passing the "topology changes on raft enabled" flag
to group0_state_machine and sending the RPC only in topology-on-raft mode.
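A minimal sketch of the gating, with illustrative names (the real state machine and verb wrapper take more parameters):

#include <seastar/core/future.hh>
#include <seastar/core/coroutine.hh>

class group0_state_machine_sketch {
    bool _topology_change_enabled; // the "topology changes on raft enabled" flag, passed at construction
public:
    explicit group0_state_machine_sketch(bool enabled) : _topology_change_enabled(enabled) {}

    seastar::future<> transfer_snapshot() {
        if (_topology_change_enabled) {
            // only pull the topology snapshot in topology-on-raft mode; outside it
            // the verb is never sent, so mixed/upgrading clusters never see it
            co_await send_raft_pull_topology_snapshot();
        }
        co_return;
    }
private:
    seastar::future<> send_raft_pull_topology_snapshot(); // stands in for the real RPC verb
};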
When a base write triggers a materialized-view write that needs to be
sent to another shard, the cross-shard call used the same SMP service
group as the base write, so we could end up in a deadlock.
This fix also affects Alternator's secondary indexes.
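A sketch of the fix idea using seastar's smp_service_group API; the group name, limit and call site below are illustrative, not the actual ScyllaDB code paths:

#include <seastar/core/smp.hh>
#include <seastar/core/future.hh>

// create a dedicated group for view/index updates so they don't compete with
// (and can't deadlock against) the group used by the base writes that spawned them
seastar::future<seastar::smp_service_group> make_mv_update_group() {
    seastar::smp_service_group_config cfg;
    cfg.max_nonlocal_requests = 5000; // illustrative value
    return seastar::create_smp_service_group(cfg);
}

seastar::future<> send_view_update_to_shard(unsigned shard, seastar::smp_service_group mv_ssg) {
    // submitting through the dedicated group keeps the base-write budget intact
    return seastar::smp::submit_to(shard, mv_ssg, [] {
        // apply the view/index update on the owning shard
        return seastar::make_ready_future<>();
    });
}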
Testing was done using a not-yet-committed framework for easy Alternator
performance testing: https://github.com/scylladb/scylladb/pull/13121.
I changed the hardcoded max_nonlocal_requests config in scylla from 5000 to 500 and
then ran:
./build/release/scylla perf-alternator-workloads --workdir /tmp/scylla-workdir/ --smp 2 \
--developer-mode 1 --alternator-port 8000 --alternator-write-isolation forbid --workload write_gsi \
--duration 60 --ring-delay-ms 0 --skip-wait-for-gossip-to-settle 0 --continue-after-error true --concurrency 2000
Without the patch, when scylla is overloaded (i.e. the number of scheduled futures is close to max_nonlocal_requests), after a couple of seconds
scylla hangs: CPU usage drops to zero and no progress is made. We can confirm we're hitting this issue by observing under gdb:
p seastar::get_smp_service_groups_semaphore(2,0)._count
$1 = 0
With the patch I wasn't able to observe the problem, even with 2x
concurrency. I was able to make the process hang with 10x concurrency,
but I think that hits a different limit, as there wasn't any depleted
SMP service group semaphore and it also happened on non-MV loads.
Fixes https://github.com/scylladb/scylladb/issues/15844
Closes scylladb/scylladb#15845
There are a few of them that don't need the storage service for anything
but getting the token metadata from it. Move them into their own .cc/.hh units.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This is the continuation of 8c03eeb85d
Registering API handlers for services needs to
* get the service to handle requests via an argument, not from the http context (the http context, in turn, is going to depend on nothing)
* unset the handlers on stop so that the service is not used after it's stopped (and before the API server is stopped)
This makes the task manager handlers work this way.
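A rough sketch of the pattern, with illustrative names (the real set/unset functions are per-module and take the http context plus the service):

#include <seastar/core/future.hh>

namespace tasks { class task_manager; } // the service the handlers operate on

namespace api {

struct http_context; // routes only; carries no service references

// handlers receive the service through the argument, not from the context...
seastar::future<> set_task_manager(http_context& ctx, tasks::task_manager& tm);

// ...and are unregistered before the service stops, so nothing can call into
// a stopped task_manager while the API server is still running
seastar::future<> unset_task_manager(http_context& ctx);

} // namespace api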
Closes scylladb/scylladb#15764
* github.com:scylladb/scylladb:
api: Unset task_manager test API handlers
api: Unset task_manager API handlers
api: Remove ctx->task_manager dependency
api: Use task_manager& argument in test API handlers
api: Push sharded<task_manager>& down the test API set calls
api: Use task_manager& argument in API handlers
api: Push sharded<task_manager>& down the API set calls
Now the task manager's API (and test API) uses the argument, so this
explicit dependency is no longer required.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This is a follow-up for #15279 and it fixes two problems.
First, we restore flushes on writes for the tables that were switched to the schema commitlog if `SCHEMA_COMMITLOG` feature is not yet enabled. Otherwise durability is not guaranteed.
Second, we address the problem with truncation records, which could refer to the old commitlog if any of the switched tables were truncated in the past. If the node crashes later, and we replay schema commitlog, we may skip some mutations since their `replay_position`s will be smaller than the `replay_position`s stored for the old commitlog in the `truncated` table.
It turned out that this problem exists even if we don't switch commitlogs for tables. If the node was rebooted, the segment ids will start from some small number - they use `steady_clock`, which is usually bound to boot time. This means that if the node crashed, we may skip mutations because their RPs will be smaller than the last truncation record's RP.
To address this problem we delete truncation records as soon as commitlog is replayed. We also include a test which demonstrates the problem.
Fixes #15354
Closes scylladb/scylladb#15532
* github.com:scylladb/scylladb:
add test_commitlog
system.truncated: Remove replay_position data from truncated on start
main.cc: flush only local memtables when replaying schema commitlog
main.cc: drop redundant supervisor::notify
system_keyspace: flush if schema commitlog is not available
Fixes #14870
(Originally suggested by @avikivity). Use commit-log-stored GC clock min positions to narrow compaction GC bounds.
(Still requires augmented manual flushes with extensive CL clearing to pass various dtests, but this does not affect "real" execution).
Records the lowest GC clock timestamp whenever a CF is added to a CL segment for the first time. Because the GC clock is wall
clock time and only connected to TTL (not cell/row timestamps), this gives a fairly accurate view of the GC low bounds
per segment. This is then (in a rather ugly way) propagated to tombstone_gc_state to narrow the allowed GC bounds for
a CF, based on what is currently left in the CL.
Note: this is a rather unoptimized version - no caching or anything. But even so, it should not be excessively expensive,
especially since various other code paths already cache the results.
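A sketch of the "narrowing" hook with simplified types; the real code works with gc_clock and tombstone_gc_state, and the names below are illustrative:

#include <algorithm>
#include <chrono>
#include <functional>
#include <optional>

using gc_time = std::chrono::system_clock::time_point;
struct table_id { unsigned long long uuid; };

// optional callback: the lowest GC clock value still referenced by the commitlog
// for this table (empty optional means the CL holds nothing for it)
using gc_bound_hook = std::function<std::optional<gc_time>(const table_id&)>;

// tombstones newer than the returned point must not be purged yet
gc_time narrow_gc_before(gc_time computed_gc_before, const table_id& id, const gc_bound_hook& hook) {
    if (hook) {
        if (auto cl_min = hook(id)) {
            return std::min(computed_gc_before, *cl_min);
        }
    }
    return computed_gc_before;
}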
Closes scylladb/scylladb#15060
* github.com:scylladb/scylladb:
main/cql_test_env: Augment compaction mgr tombstone_gc_state with CL GC info
tombstone_gc_state: Add optional callback to augment GC bounds
commitlog: Add keeping track of approximate lowest GC clock for CF entries
database: Force new commitlog segment on user initiated flush
commitlog: Add helper to force new active segment
Once we've started clean and all replaying is done, the commit log
replay positions in the truncation records are invalid. We should
exorcise them as soon as possible. Note that we cannot remove the
truncation data completely though, since the stored time stamps are used
by things like the batch log to determine whether it should use or
discard old batch data.
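A self-contained illustration of what is dropped versus kept (types simplified; the real data lives in system.truncated):

#include <chrono>
#include <cstdint>
#include <optional>

struct replay_position { uint64_t segment_id; uint32_t position; };

struct truncation_entry {
    std::chrono::system_clock::time_point truncated_at; // still needed, e.g. by the batch log
    std::optional<replay_position> position;            // stale once replay has finished
};

inline void scrub_after_replay(truncation_entry& e) {
    e.position.reset(); // drop the invalid replay position, keep the timestamp
}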
Later in the code we have 'replaying schema commit log',
which duplicates this one. Also,
maybe_init_schema_commitlog may skip schema commitlog
initialization if the SCHEMA_COMMITLOG feature is
not yet supported by the cluster, so this notification
can be misleading.
there is no need to explicitly cast an instance of
std::chrono::hours to std::chrono::milliseconds to feed it to a
function which expects std::chrono::milliseconds. the constructor
of std::chrono::milliseconds is able to perform this conversion and
create a new instance of std::chrono::milliseconds from another
std::chrono::duration<> instance.
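for illustration, a duration with a coarser period that converts exactly binds directly to a parameter with a finer one:

#include <chrono>

void set_timeout(std::chrono::milliseconds) {} // a stand-in for the real callee

void example() {
    using namespace std::chrono_literals;
    set_timeout(std::chrono::hours(1)); // implicit: the conversion is exact, no cast needed
    set_timeout(24h);                   // same with a literal
}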
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
Closes scylladb/scylladb#15734
Fixes #14870 (yet another alternative solution)
(Originally suggested by @avikivity). Use stored GC clock min positions from the CL
to narrow compaction GC bounds.
Note: not optimized with caches or anything at this point. That can easily be added,
though of course it is always somewhat risky.
This PR contains several refactorings related to truncation record handling in the `system_keyspace`, `commitlog_replayer` and `table` classes:
* drop map_reduce from `commitlog_replayer`; it's sufficient to load truncation records from the null shard;
* add a check that `table::_truncated_at` is properly initialized before it's accessed;
* move its initialization after `init_non_system_keyspaces`.
Closes scylladb/scylladb#15583
* github.com:scylladb/scylladb:
system_keyspace: drop truncation_record
system_keyspace: remove get_truncated_at method
table: get_truncation_time: check _truncated_at is initialized
database: add_column_family: initialize truncation_time for new tables
database: add_column_family: rename readonly parameter to is_new
system_keyspace: move load_truncation_times into distributed_loader::populate_keyspace
commitlog_replayer: refactor commitlog_replayer::impl::init
system_keyspace: drop redundant typedef
system_keyspace: drop redundant save_truncation_record overload
table: rename cache_truncation_record -> set_truncation_time
system_keyspace: get_truncated_position -> get_truncated_positions
The stall detector uses the glibc backtrace function to
collect backtraces, which causes ASAN failures on ARM.
For now we just disable the stall detector in this
configuration; the ticket about migrating
to libunwind is scylladb/seastar#1878.
We increase the value of blocked_reactor_notify_ms to
make sure the stall detector never fires.
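A sketch of how the gating could look at configuration time; the macro checks are the standard compiler ones, while the values and the place where this is applied are illustrative assumptions:

#include <chrono>

#if defined(__clang__)
#  if __has_feature(address_sanitizer)
#    define SKETCH_ASAN 1
#  endif
#elif defined(__SANITIZE_ADDRESS__)
#  define SKETCH_ASAN 1
#endif

inline std::chrono::milliseconds blocked_reactor_notify_ms_for_tests() {
#if defined(SKETCH_ASAN) && defined(__aarch64__)
    // glibc backtrace() trips ASAN on ARM (scylladb/seastar#1878): pick a value
    // so large that the stall detector effectively never fires
    return std::chrono::milliseconds(1'000'000);
#else
    return std::chrono::milliseconds(100); // illustrative default
#endif
}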
Fixes #15389
Fixes #15090
Closes scylladb/scylladb#15720
load_truncation_times() now works only for
schema tables since the rest is not loaded
until distributed_loader::init_non_system_keyspaces.
An attempt to call cf.set_truncation_time
for a non-system table just throws an exception,
which is caught and logged at debug level.
This means that the cf.get_truncation_time call in
paxos_state.cc has never worked as expected.
To fix that we move load_truncation_times()
closer to the point where the tables are loaded.
The function distributed_loader::populate_keyspace is
called for both system and non-system tables. Once
the tables are loaded, we use the 'truncated' table
to initialize _truncated_at field for them.
The truncation_time check for schema tables is also moved
into populate_keyspace since it seems like a more natural
place for it.
Tracing is one of the two global services left out there, and its starting and stopping is pretty hairy. In order to de-globalize it and keep its start-stop under control, the existing start-stop sequence is worth cleaning up. This PR
* removes the create_, start_ and stop_ wrappers to un-hide the global tracing_instance thing
* renames tracing::stop() to shutdown() as it's in fact a shutdown
* coroutinizes start/shutdown/stop while at it
Squeezed parts from #14156 that don't reorder start-stop calls.
Closes scylladb/scylladb#15611
* github.com:scylladb/scylladb:
main: Capture local tracing reference to stop tracing
tracing: Pack testing code
tracing: Remove stop_tracing() wrapper
tracing: Remove start_tracing() wrapper
tracing: Remove create_tracing() wrapper
tracing: Make shutdown() re-entrable
tracing: Coroutinize start/shutdown/stop
tracing: Rename helper's stop() to shutdown()
this is redundant code that should have been gone a long time ago.
the snippet (which lies above the code being deleted):
db.invoke_on_all([] (replica::database& db) {
    db.get_tables_metadata().for_each_table([] (table_id, lw_shared_ptr<replica::table> table) {
        replica::table& t = *table;
        t.enable_auto_compaction();
    });
}).get();
does the same thing as the code being deleted.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Closes scylladb/scylladb#15597
This is a refactoring commit without observable
changes in behaviour.
There is a truncation_record struct, but in this method we
only care about time, so rename it (and other related methods)
appropriately to avoid confusion.
Now it's confusing, as it doesn't stop tracing, but rather shuts it down
on all shards. The only caller of it can be more descriptive without the
wrapper.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The goal is to make handlers use the proxy argument instead of keeping
the proxy as a dependency on the http context (other handlers are mostly
like that already).
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The code setting up storage_proxy/ endpoints no longer needs
storage_service and related decoration
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Surprisingly, the dependency-less API server context is initialized
somewhere in the middle of main. By that time some "real" services have
already started and should have the ability to register their endpoints,
so the API context should be initialized way ahead. This patch places its
initialization next to the prometheus init.
One thing that's not nice here is that the API port listening remains where
it was before the patch, so for the external ... observer the API
initialization doesn't change. Likely the API should start listening for
connections earlier as well, but that's left for future patching.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Add a REST API to reload Raft topology state without having to restart a node and use it in `test_fence_hints`. Restarting the node has undesired side effects which cause test flakiness; more details provided in commit messages.
Refactor the test a bit while at it.
Fixes: #15285
Closes scylladb/scylladb#15523
* github.com:scylladb/scylladb:
test: test_fencing.py: enable hints_manager=trace logs in `test_fence_hints`
test: test_fencing.py: reload topology through REST API in `test_fence_hints`
test: refactor test_fencing.py
api: storage_service: add REST API to reload topology state
Some tests may want to modify system.topology table directly. Add a REST
API to reload the state into memory. An alternative would be restarting
the server, but that's slower and may have other side effects undesired
in the test.
The API can also be called outside tests; it should not have any
observable effects unless the user modifies `system.topology` table
directly (which they should never do, outside perhaps some disaster
recovery scenarios).
This PR implements a new procedure for joining nodes to group 0, based on the description in the "Cluster features on Raft (v2)" document. This is a continuation of the previous PRs related to cluster features on raft (https://github.com/scylladb/scylladb/pull/14722, https://github.com/scylladb/scylladb/pull/14232), and the last piece necessary to replace cluster feature checks in gossip.
The current implementation relies on a gossip shadow round to fetch the set of enabled features, determine whether the node supports all of the enabled features, and join only if it is safe. As we are moving management of cluster features to group 0, we encounter a problem: the contents of group 0 itself may depend on features, hence it is not safe to join it unless we perform the feature check, which depends on information in group 0. Hence, we have a dependency cycle.
In order to solve this problem, the algorithm for joining group 0 is modified, and verification of features and other parameters is offloaded to an existing node in group 0. Instead of directly asking the discovery leader to unconditionally add the node to the configuration with `GROUP0_MODIFY_CONFIG`, two different RPCs are added: `JOIN_NODE_REQUEST` and `JOIN_NODE_RESPONSE`. The main idea is as follows:
- The new node sends `JOIN_NODE_REQUEST` to the discovery leader. It sends a bunch of information describing the node, including supported cluster features. The discovery leader verifies some of the parameters and adds the node in the `none` state to `system.topology`.
- The topology coordinator picks up the request for the node to be joined (i.e. the node in `none` state), verifies its properties - including cluster features - and then:
- If the node is accepted, the coordinator transitions it to `bootstrap`/`replace` state and transitions the topology to `join_group0` state. The node is added to group 0 and then `JOIN_NODE_RESPONSE` is sent to it with information that the node was accepted.
- Otherwise, the node is moved to `left` state, told by the coordinator via `JOIN_NODE_RESPONSE` that it was rejected and it shuts down.
The procedure is not retryable - if a node fails to do it from start to end and crashes in between, it will not be allowed to retry it with the same host_id - `JOIN_NODE_REQUEST` will fail. The data directory must be cleared before attempting to add it again (so that a new host_id is generated).
More details about the procedure and the RPC are described in `topology-over-raft.md`.
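A simplified sketch of the joining side of the handshake; the struct fields and function names are illustrative (the real definitions live in the IDL and in storage_service / topology_coordinator):

#include <stdexcept>
#include <string>
#include <vector>

struct join_node_request {
    std::string host_id;                          // must be fresh on every attempt
    std::vector<std::string> supported_features;  // verified by the topology coordinator
};

struct join_node_response {
    bool accepted = false;
    std::string rejection_reason;
};

void send_join_node_request(const join_node_request&); // hypothetical RPC wrappers
join_node_response wait_for_join_node_response();

void join_group0(const join_node_request& req) {
    // ask the discovery leader to record us in `none` state in system.topology
    send_join_node_request(req);
    // the coordinator either moves us to bootstrap/replace and adds us to group 0,
    // or moves us to `left` and rejects us
    join_node_response resp = wait_for_join_node_response();
    if (!resp.accepted) {
        // a rejected (or crashed) node must wipe its data dir to get a new host_id
        throw std::runtime_error("join rejected: " + resp.rejection_reason);
    }
}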
Fixes: #15152
Closes scylladb/scylladb#15196
* github.com:scylladb/scylladb:
tests: mark test_blocked_bootstrap as skipped
storage_service: do not check features in shadow round
storage_service: remove raft_{boostrap,replace}
topology_coordinator: relax the check in enable_features
raft_group0: insert replaced node info before server setup
storage_service: use join node rpc to join the cluster
topology_coordinator: handle joining nodes
topology_state_machine: add join_group0 state
storage_service: add join node RPC handlers
raft: expose current_leader in raft::server
storage_service: extract wait_for_live_nodes_timeout constant
raft_group0: abstract out node joining handshake
storage_service: pass raft_topology_change_enabled on rpc init
rpc: add new join handshake verbs
docs: document the new join procedure
topology_state_machine: add supported_features to replica_state
storage_service: check destination host ID in raft verbs
group_state_machine: take reference to raft address map
raft_group0: expose joined_group0
Storage service API set/unset has two flaws.
First, unset doesn't happen, so after the storage service is stopped its handlers become "local is not initialized"-assertion and use-after-free landmines.
Second, setting the storage service API carries gossiper and system keyspace references, thus duplicating the knowledge about storage service dependencies.
This PR fixes both by adding the storage service API unsetting and by making the handlers use _only_ the storage service instance, not any externally provided references.
Closes scylladb/scylladb#15547
* github.com:scylladb/scylladb:
main, api: Set/Unset storage_service API in proper place
api/storage_service: Remove gossiper arg from API
api/storage_service: Remove system keyspace arg from API
api/storage_service: Get gossiper from storage service
api/storage_service: Get token_metadata from storage service
We will want to conditionally register some verbs based on whether we
are using raft topology or not. This commit serves as a preparation,
passing the `raft_topology_change_enabled` flag to the function which
initializes the verbs (there is a `_raft_topology_change_enabled`
field already, but it's only initialized on shard 0, and only later).
Currently the storage-service API handlers are set up in a "random" place.
It can happen earlier -- as soon as the storage service itself is ready.
Also, although the storage service is stopped on shutdown, the API handlers
continue to reference it, leading to potential use-after-frees or "local is
not initialized" assertions.
Fix both. Unsetting is pretty bulky; scylladb/seastar#1620 is to help.
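A sketch of the set/unset pairing, with illustrative function names, assuming (as in scylla's main) that this runs inside a seastar::async context:

#include <seastar/core/future.hh>
#include <seastar/util/defer.hh>

namespace service { class storage_service; }

namespace api {
struct http_context;
seastar::future<> set_storage_service(http_context&, service::storage_service&);
seastar::future<> unset_storage_service(http_context&);
}

void wire_up_storage_service_api(api::http_context& ctx, service::storage_service& ss) {
    api::set_storage_service(ctx, ss).get();
    // guarantee the handlers are gone before ss itself is stopped, so no request
    // can reach a stopped (or destroyed) storage_service
    auto unset = seastar::defer([&ctx] () noexcept {
        api::unset_storage_service(ctx).get();
    });
    // ... serve ...
}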
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
- s/aws_key/aws_access_key_id/
- s/aws_secret/aws_secret_access_key/
- s/aws_token/aws_session_token/
rename them to more popular names, these names are also used by
boto's API. this should improve the readability and consistency.
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
The S3 uploading sink needs to collect buffers internally before sending them out, because the minimal upload-able part size is 5MB. When the necessary amount of bytes is accumulated, a part-uploading fiber starts in the background. On flush, the sink waits for all the fibers to complete and handles the failure of any of them.
Uploading parallelism is nowadays limited by means of the http client's max-connections parameter. However, when a part-uploading fiber waits for its connection it keeps the 5MB+ buffer in the request's body, so even though the number of uploading parts is limited, the number of _waiting_ parts is effectively not.
This PR adds a shard-wide limiter on the number of background buffers S3 clients (and their http clients) may use.
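A sketch of the limiting idea with seastar's semaphore; the class, budget and flow are illustrative, not the actual s3::client code:

#include <seastar/core/coroutine.hh>
#include <seastar/core/future.hh>
#include <seastar/core/semaphore.hh>
#include <seastar/core/temporary_buffer.hh>

class upload_memory_limiter {
    seastar::semaphore _budget; // shard-wide byte budget shared by all clients
public:
    explicit upload_memory_limiter(size_t bytes) : _budget(bytes) {}

    seastar::future<> upload_part(seastar::temporary_buffer<char> part) {
        // a part (even one merely waiting for a connection) holds units for its
        // whole buffer, so the total memory kept by waiting parts stays bounded
        auto units = co_await seastar::get_units(_budget, part.size());
        co_await do_upload(std::move(part));
        // units are released here, returning the bytes to the budget
    }
private:
    seastar::future<> do_upload(seastar::temporary_buffer<char>); // the actual part PUT
};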
Closes scylladb/scylladb#15497
* github.com:scylladb/scylladb:
s3::client: Track memory in client uploads
code: Configure s3 clients' memory usage
s3::client: Construct client with shared semaphore
sstables::storage_manager: Introduce config
The v.u.g. start-stop is now spread heavily over the main() code.
1. sharded<v.u.g.>.start() happens early enough to allow depending services to register staging sstables on it
2. after the system is "more-or-less" alive, invoke_on_all(v.u.g.::start()) is called (conditionally) to activate the generator's background fiber. Not 100% sure why it happens _that_ late, but somehow it's required that the generation doesn't happen while scylla is joining the cluster
3. early on stop the v.u.g. is fully stopped
The 3rd step is pretty nasty. It may happen that the v.u.g. is not stopped if scylla's start aborts before the last action is defer-scheduled. Also, when it happens, it leaves the stopping dependencies with non-initialized v.u.g. local instances, which is not symmetrical to how they start.
That said, this PR fixes the stopping sequence to happen later, i.e. -- being defer-scheduled right after sharded<v.u.g.> is started. It also makes sure that terminating the background fiber happens as early as it does now. This is done compaction_manager-style -- the v.u.g. subscribes to the stop-signal abort source and kicks the fiber to stop when it fires.
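A sketch of the subscription, with illustrative member names:

#include <seastar/core/abort_source.hh>
#include <seastar/core/condition-variable.hh>

class view_update_generator_sketch {
    seastar::condition_variable _pending;  // wakes the background fiber
    bool _as_killed = false;               // the fiber exits once this is set
    seastar::optimized_optional<seastar::abort_source::subscription> _early_abort_sub;
public:
    explicit view_update_generator_sketch(seastar::abort_source& stop_signal) {
        // the sharded<> stop still happens late; this only kicks the fiber early
        _early_abort_sub = stop_signal.subscribe([this] () noexcept { do_abort(); });
    }
private:
    void do_abort() noexcept {
        _as_killed = true;
        _pending.broadcast(); // make the fiber notice and wind down
    }
};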
Closes scylladb/scylladb#15466
* github.com:scylladb/scylladb:
view_update_generator: Stop for real later
view_update_generator: Add logging to do_abort()
view_update_generator: Move abort kicking to do_abort()
view_update_generator: Add early abort subscription