scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-22 07:42:16 +00:00

Author	SHA1	Message	Date
Avi Kivity	0ae22a09d4	LICENSE: Update to version 1.1 Updated terms of non-commercial use (must be a never-customer).	2026-04-12 19:46:33 +03:00
Dawid Mędrek	ad8a263683	service: strong_consistency: Abort ongoing operations when shutting down These changes are complementary to those from a recent commit where we handled aborting ongoing operations during tablet events, such as tablet migration. In this commit, we consider the case of shutting down a node. When a node is shutting down, we eventually close the connections. When the client can no longer get a response from the server, it makes no sense to continue with the queries. We'd like to cancel them at that point. We leverage the abort source passed down via `client_state` down to the strongly consistent coordinator. This way, the transport layer can communicate with it and signal that the queries should be canceled. The abort source is triggered by the CQL server (cf. `generic_server::server::{stop,shutdown}`). --- Note that this is not an optional change. In fact, if we don't abort those requests, we might hang for an indefinite amount of time when executing the following code in `main.cc`: ``` // Register at_exit last, so that storage_service::drain_on_shutdown will be called first auto do_drain = defer_verbose_shutdown("local storage", [&ss] { ss.local().drain_on_shutdown().get(); }); ``` The problem boils down to the fact that `generic_server::server::stop` will wait for all connections to be closed, but that won't happen until all ongoing operations (at least those to strongly consistent tables) are finished. It's important to highlight that even though we hang on this, the client can no longer get any response. Thus, it's crucial that at that point we simply abort ongoing operations to proceed with the rest of shutdown. --- Two tests are added to verify that the implementation is correct: one focusing on local operations, the other -- on a forwarded write. Difference in time spent on the whole test file `test_strong_consistency.py` on my local machine, in dev mode: Before: ``` real 0m31.775s user 1m4.475s sys 0m22.615s ``` After: ``` real 0m32.024s user 1m10.751s sys 0m23.871s ``` Individual runs of the added tests: test_queries_when_shutting_down: ``` real 0m12.818s user 0m36.726s sys 0m4.577s ``` test_abort_forwarded_write_upon_shutdown: ``` real 0m12.930s user 0m36.622s sys 0m4.752s ```	2026-04-09 11:36:17 +02:00
Marcin Maliszkiewicz	aba5a8c37f	generic_server: fix waiters count in shed log Capture semaphore waiters count when blocking starts, not after the wait completes.	2026-02-20 17:04:23 +01:00
Marcin Maliszkiewicz	23bed55170	generic_server: scale connection concurrency semaphore by listener count The concurrency semaphore gates uninitialized connections across all do_accepts loops, but was initialized to a fixed value regardless of how many listeners exist. With multiple listeners competing for the same units, each effectively gets less than the configured concurrency. Initialize the semaphore to concurrency - 1 and signal 1 per listen() call, so total capacity is concurrency - 1 + nr_listeners. This guarantees each listener's accept loop can have at least one unit available.	2026-02-20 16:59:19 +01:00
Marcin Maliszkiewicz	0376d16ad3	transport: fix connection code to consume only initially taken semaphore units The connection's cpu_concurrency_t struct tracks the state of a connection to manage the admission of new requests and prevent CPU overload during connection storms. When a connection holds units (allowed only 0 or 1), it is considered to be in the "CPU state" and contributes to the concurrency limits used when accepting new connections. The bug stems from the fact that `counted_data_source_impl::get` and `counted_data_sink_impl::put` calls can interleave during execution. This occurs because of `should_parallelize` and `_ready_to_respond`, the latter being a future chain that can run in the background while requests are being read. Consequently, while reading request (N), the system may concurrently be writing the response for request (N-1) on the same connection. This interleaving allows `return_all()` to be called twice before the subsequent `consume_units()` is invoked. While the second `return_all()` call correctly returns 0 units, the matching `consume_units()` call would mistakenly take an extra unit from the semaphore. Over time, a connection blocked on a read operation could end up holding an unreturned semaphore unit. If this pattern repeats across multiple connections, the semaphore units are eventually depleted, preventing the server from accepting any new connections. The fix ensures that we always consume the exact number of units that were previously returned. With this change, interleaved operations behave as follows: get() return_all — returns 1 unit put() return_all — returns 0 units get() consume_units — takes back 1 unit put() consume_units — takes back 0 units Logically, the networking phase ends when the first network operation concludes. But more importantly, when a network operation starts, we no longer hold any units. Other solutions are possible but the chosen one seems to be the simplest and safest to backport. Fixes SCYLLADB-485	2026-02-17 17:55:48 +01:00
Botond Dénes	7b562bb185	Merge 'system.clients: Address SSL refactor review comments' from Piotr Smaron and Copilot Addresses outstanding review comments from PR #22961 where SSL field collection was refactored into generic_server::connection base class. This patch consists of minor cosmetic enhancements for increased readability, mainly, with some minor fixups explained in specific commits. Cosmetic changes, no need to backport. Closes scylladb/scylladb#27575 * github.com:scylladb/scylladb: test_ssl: fix indentation generic_server: improve logging broken TLS connection test_ssl: improve timeout and readability alternator/server: update SSL comment	2026-01-13 11:00:26 +02:00
Piotr Smaron	2bcbebe92d	generic_server: improve logging broken TLS connection Preiously we were logging a broken TLS connection and then this has been logged later again, so now instead of logging we're constructing an exception with a message extened with TLS info, which later will be catched with its full message still logged.	2026-01-09 10:24:55 +01:00
Avi Kivity	1382b47d45	config, transport: support proxy protocol v2 enhanced connections We have four native transport ports: two for plain/TLS, and two more for shard-aware (plain/TLS as well). Add four more that expect the proxy protocol v2 header. This allows nodes behind a reverse proxy to record the correct source address and port in system.clients, and the shard-aware port to see the correct source port selection made my the client.	2025-12-17 14:18:04 +02:00
Marcin Maliszkiewicz	d5b63df46e	transport: remove redundant futurize_invoke from counted data sink and source Closes scylladb/scylladb#27526	2025-12-11 10:32:16 +03:00
Pavel Emelyanov	a88a36f5b5	code: Switch to seastar API level 9 In the new API the biggest change is to implement the only data_sink_impl::put(span<temporary_buffer>) overload. Encrypted file impl and sstables compress sink use fallback_put() helper that generates a chain of continuations each holding a buffer. The counting_data_sink in transport had mostly been patched to correct implementation by the previous patch, the change here is to replace vector argument with span one. Most other sinks just re-implement their put(vector<temporary_buffer>) overload by iterating over span and non-preemptively grabbing buffers from it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-10-17 10:26:50 +03:00
Pavel Emelyanov	9ece535b5e	transport: Open-code invoke_with_counting into counting_data_sink::put The former helper is implemented like this: future<> invoke_with_counting(fn) { if (not_needed) return fn(); return futurize_invoke(something).then([fn] { return fn() }).finally(something_else); } and all put() overloads are like future<> put(arg) { return invoke_with_counting([this, arg] { return lower_sink.put(arg); }); } The problem is that with seastar API level 9, the put() overload will have to move the passed buffers into stable storage before preempting. In its current implementation, when counting is needed the invoke_with_counting will link lower_sink.put() invocation to the futurize_invoke(something) future. Despite "something" is non-preempting, and futurize_invoke() on it returns ready future, in debug mode ready_future.then() does preempt, and the API level 9 put() contract will be violated. To facilitate the switch to new API level, this patch rewrites one of put() overloads to look like future<> put(arg) { if (not_needed) { return lower_sink.put(arg); } something; return lower_sink(arg).finally(something_else); } Other put()-s will be removed by next patch anyway, but this put() will be patched and will call lower_sink.put() without preemption. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2025-10-17 10:26:30 +03:00
Andrzej Jackowski	14081d0727	generic_server: transport: start using `sl:driver` for new connections Before this change, new connections were handled in a default scheduling group (`main`), because before the user is authenticated we do not know which service level should be used. With the new `sl:driver` service level, creation of new connections can be moved to `sl:driver`. We switch the service level as early as possible, in `do_accepts`. There is a possibility, that `sl:driver` will not exist yet, for instance, in specific upgrade cases, or if it was removed. Therefore, we also switch to `sl:driver` after a connection is accepted. Refs: scylladb/scylladb#24411	2025-10-08 08:25:12 +02:00
Ernest Zaslavsky	debc756794	treewide: Move transport related files to a `transport` directory As requested in #22112 , moved the files and fixed other includes and build system. Moved files: - generic_server.hh - generic_server.cc - protocol_server.hh Fixes: #22112 This is a cleanup, no need to backport Closes scylladb/scylladb#25090	2025-09-29 11:46:06 +03:00

13 Commits