scylladb

Author	SHA1	Message	Date
Piotr Smaron	5afa3028a3	Introduce TABLET_KEYSPACE event to differentiate processing path of a vnode vs tablets ks	2024-05-30 08:33:15 +03:00
Avi Kivity	784c2f8ad2	Merge 'treewide: replace calls to future::get0() by calls to future::get()' from Kefu Chai get0() dates back from the days where Seastar futures carried tuples, and get0() was a way to get the first (and usually only) element. Now it's a distraction, and Seastar is likely to deprecate and remove it. Replace with seastar::future::get(), which does the same thing. Closes scylladb/scylladb#17130 * github.com:scylladb/scylladb: treewide: replace seastar::future::get0() with seastar::future::get() sstable: capture return value of get0() using auto utils: result_loop: define result_type with decayed type [avi: add another one that snuck in while this was cooking]	2024-02-04 15:23:33 +02:00
Avi Kivity	7cb1c10fed	treewide: replace seastar::future::get0() with seastar::future::get() get0() dates back from the days where Seastar futures carried tuples, and get0() was a way to get the first (and usually only) element. Now it's a distraction, and Seastar is likely to deprecate and remove it. Replace with seastar::future::get(), which does the same thing.	2024-02-02 22:12:57 +08:00
Kefu Chai	c7a01b9eb4	transport: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#17092	2024-02-02 11:20:24 +02:00
Mikołaj Grzebieluch	8b2f0e38d9	service/maintenance_mode: move maintenance_socket_enabled definition to seperate file	2024-01-25 15:27:53 +01:00
Mikołaj Grzebieluch	ac61d0f695	cql_controller: add option to start maintenance socket Add an option to listen on the maintenance socket. It is set up on an unix domain socket and the metrics are disabled. This enables having an independent authentication mechanism for this socket. To start the maintenance socket, a new cql_controller has to be created with `db::maintenance_socket_enabled::yes` argument. Creating maintenance socket will raise an exception if * the path is longer than 107 chars (due to linux limits), * a file or a directory already exists in the path. The indentation is fixed in the next commit.	2023-12-18 17:58:13 +01:00
Kefu Chai	0b69a1badc	transport: cast unaligned<T> to T for formatting it in fmt v10, it does not cast unaligned<T> to T when formatting it, instead it insists on finding a matched fmt::formatter<> specialization for it. that's why we have FTBFS with fmt v10 when printing these packed<T> variables with fmtlib v10. in this change, we just cast them to the underlying types before formatting them. because seastar::unaligned<T> does not provide a method for accessing the raw value, neither does it provide a type alias of the type of the underlying raw value, we have to cast to the type without deducing it from the printed value. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16167	2023-11-27 15:26:13 +02:00
sylwiaszunejko	75b3dbf7ea	transport: add support for setting custom payload A custom payload can now be added to response_message. If it is set, it will be sent to client and the custom_payload flag will be set. write_string_bytes_map method is added to response class and a missing custom_payload flag is added to cql_frame_flags.	2023-11-21 15:09:36 +01:00
Amnon Heiman	1abcd4bb11	transport/server.cc: mark metric counter with skip_when_empty This patch mark scylla_transport_cql_errors_total with skip_when_empty flag. It reduces the overhead for metrics for types that are never reported. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2023-08-23 09:30:35 -04:00
Gleb Natapov	4ffc39d885	cql3: Extend the scope of group0_guard during DDL statement execution Currently we hold group0_guard only during DDL statement's execute() function, but unfortunately some statements access underlying schema state also during check_access() and validate() calls which are called by the query_processor before it calls execute. We need to cover those calls with group0_guard as well and also move retry loop up. This patch does it by introducing new function to cql_statement class take_guard(). Schema altering statements return group0 guard while others do not return any guard. Query processor takes this guard at the beginning of a statement execution and retries if service::group0_concurrent_modification is thrown. The guard is passed to the execute in query_state structure. Fixes: #13942 Message-ID: <ZNsynXayKim2XAFr@scylladb.com>	2023-08-17 15:52:48 +03:00
Avi Kivity	d57a951d48	Revert "cql3: Extend the scope of group0_guard during DDL statement execution" This reverts commit `70b5360a73`. It generates a failure in group0_test .test_concurrent_group0_modifications in debug mode with about 4% probability. Fixes #15050	2023-08-15 00:26:45 +03:00
Gleb Natapov	70b5360a73	cql3: Extend the scope of group0_guard during DDL statement execution Currently we hold group0_guard only during DDL statement's execute() function, but unfortunately some statements access underlying schema state also during check_access() and validate() calls which are called by the query_processor before it calls execute. We need to cover those calls with group0_guard as well and also move retry loop up. This patch does it by introducing new function to cql_statement class take_guard(). Schema altering statements return group0 guard while others do not return any guard. Query processor takes this guard at the beginning of a statement execution and retries if service::group0_concurrent_modification is thrown. The guard is passed to the execute in query_state structure. Fixes: #13942 Message-ID: <ZNSWF/cHuvcd+g1t@scylladb.com>	2023-08-13 14:19:39 +03:00
Kefu Chai	565f5c7380	transport: correct format string when printing logging message we print the stream id in the logging messages, but in this case, we forgot to pass `stream` to `log::debug()`. but the placeholder for `stream` was added. if the underlying fmtlib actually formats the argument with the format string, it would throw. fortunately, we don't enable debug level logging often, guess that's why we haven't spotted this issue yet. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14620	2023-07-13 11:21:43 +03:00
Calle Wilund	20e9619bb1	transport: Try to do early, transport based auth if possible Bypassing the need for an AUTH message+response. I.e. do auth _without_ client having login specified.	2023-06-26 15:00:21 +00:00
Kefu Chai	c3d91f5190	tracing: drop trace(.., std::string&&) overload this change is a follow-up of `4f5fcb02fd`, the goal is to avoid the programming oversights like ```c++ trace(trace_ptr, "foo {} with {} but {} is {}"); ``` as `trace(const trace_state_ptr& p, const std::string& msg)` is a better match than the templated one, i.e., `trace(const trace_state_ptr& p, fmt::format_string<T...> fmt, T&&... args)`. so we cannot detect this with the compile-time format checking. so let's just drop this overload, and update its callers to use the other overload. The change was suggested by Avi. the example also came from him. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #14188	2023-06-10 20:09:35 +03:00
Avi Kivity	42a1ced73b	cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt The expression system uses managed_bytes_opt for values, but result_set uses bytes_opt. This means that processing values from the result set in expressions requires a copy. Out of the two, managed_bytes_opt is the better choice, since it prevents large contiguous allocations for large blobs. So we switch result_set to use managed_bytes_opt. Users of the result_set API are adjusted. The db::function interface is not modified to limit churn; instead we convert the types on entry and exit. This will be adjusted in a following patch.	2023-05-07 17:17:36 +03:00
Kefu Chai	b76877fd99	transport: capture reference to temp value by value `current_scheduling_group()` returns a temporary value, and `name()` returns a reference, so we cannot capture the return value by reference, and use the reference after this expression is evaluated. this would cause undefined behavior. so let's just capture it by value. this change also silence following warning from GCC-13: ``` /home/kefu/dev/scylladb/transport/server.cc:204:11: error: possibly dangling reference to a temporary [-Werror=dangling-reference] 204 \| auto& cur_sg_name = current_scheduling_group().name(); \| ^~~~~~~~~~~ /home/kefu/dev/scylladb/transport/server.cc:204:56: note: the temporary was destroyed at the end of the full expression ‘seastar::current_scheduling_group().seastar::scheduling_group::name()’ 204 \| auto& cur_sg_name = current_scheduling_group().name(); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~ ``` Fixes #13719 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13724	2023-05-01 22:40:36 +03:00
Botond Dénes	3e92bcaa20	Merge 'utils: redesign reusable_buffer' from Michał Chojnowski Common compression libraries work on contiguous buffers. Contiguous buffers are a problem for the allocator. However, as long as they are short-lived, we can avoid the expensive allocations by reusing buffers across tasks. This idea is already applied to the compression of CQL frames, but with some deficiencies. `utils: redesign reusable_buffer` attempts to improve upon it in a few ways. See its commit message for an extended discussion. Compression buffer reuse also happens in the zstd SSTable compressor, but the implementation is misguided. Every `zstd_processor` instance reuses a buffer, but each instance has its own buffer. This is very bad, because a healthy database might have thousands of concurrent instances (because there is one for each sstable reader). Together, the buffers might require gigabytes of memory, and the reuse actually increases memory pressure significantly, instead of reducing it. `zstd: share buffers between compressor instances` aims to improve that by letting a single buffer be shared across all instances on a shard. Closes #13324 * github.com:scylladb/scylladb: zstd: share buffers between compressor instances utils: redesign reusable_buffer	2023-04-27 09:09:09 +03:00
Michał Chojnowski	bf26a8c467	utils: redesign reusable_buffer Large contiguous buffers put large pressure on the allocator and are a common source of reactor stalls. Therefore, Scylla avoids their use, replacing it with fragmented buffers whenever possible. However, the use of large contiguous buffers is impossible to avoid when dealing with some external libraries (i.e. some compression libraries, like LZ4). Fortunately, calls to external libraries are synchronous, so we can minimize the allocator impact by reusing a single buffer between calls. An implementation of such a reusable buffer has two conflicting goals: to allocate as rarely as possible, and to waste as little memory as possible. The bigger the buffer, the more likely that it will be able to handle future requests without reallocation, but also the memory memory it ties up. If request sizes are repetitive, the near-optimal solution is to simply resize the buffer up to match the biggest seen request, and never resize down. However, if we anticipate pathologically large requests, which are caused by an application/configuration bug and are never repeated again after they are fixed, we might want to resize down after such pathological requests stop, so that the memory they took isn't tied up forever. The current implementation of reusable buffers handles this by resizing down to 0 every 100'000 requests. This patch attempts to solve a few shortcomings of the current implementation. 1. Resizing to 0 is too aggressive. During regular operation, we will surely need to resize it back to the previous size again. If something is allocated in the hole left by the old buffer, this might cause a stall. We prefer to resize down only after pathological requests. 2. When resizing, the current implementation allocates the new buffer before freeing the old one. This increases allocator pressure for no reason. 3. When resizing up, the buffer is resized to exactly the requested size. That is, if the current size is 1MiB, following requests of 1MiB+1B and 1MiB+2B will both cause a resize. It's preferable to limit the set of possible sizes so that every reset doesn't tend to cause multiple resizes of almost the same size. The natural set of sizes is powers of 2, because that's what the underlying buddy allocator uses. No waste is caused by rounding up the allocation to a power of 2. 4. The interval of 100'000 uses is both too low and too arbitrary. This is up for discussion, but I think that it's preferable to base the dynamics of the buffer on time, rather than the number of uses. It's more predictable to humans. The implementation proposed in this patch addresses these as follows: 1. Instead of resizing down to 0, we resize to the biggest size seen in the last period. As long as at least one maximal (up to a power of 2) "normal" request appears each period, the buffer will never have to be resized. 2. The capacity of the buffer is always rounded up to the nearest power of 2. 3. The resize down period is no longer measured in number of requests but in real time. Additionally, since a shared buffer in asynchronous code is quite a footgun, some rudimentary refcounting is added to assert that only one reference to the buffer exists at a time, and that the buffer isn't downsized while a reference to it exists. Fixes #13437	2023-04-26 22:09:17 +02:00
Kefu Chai	c642ca9e73	redis,thrift,transport: initialize _config with std::move(config) instead of copying the `config` parameter, move away from it. this change also prepares for a non-copyable config. if the class of `config` is not copyable, we will not be able to initialize the member variable by copying from the given `config` parameter. after the live-updateable config change, the `_config` member variable will contain instances of utils::observer<>, which is not copyable, but is move-constructable, hence in this change, we just move away from the give `config`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-29 20:06:02 +08:00
Botond Dénes	19560419d2	Merge 'treewide: improve compatibility with gcc 13' from Avi Kivity An assortment of patches that reduce our incompatibilities with the upcoming gcc 13. Closes #13243 * github.com:scylladb/scylladb: transport: correctly format unknown opcode treewide: catch by reference test: raft: avoid confusing string compare utils, types, test: extract lexicographical compare utilities test: raft: fsm_test: disambiguate raft::configuration construction test: reader_concurrency_semaphore_test: handle all enum values repair: fix signed/unsigned compare repair: fix incorrect signed/unsigned compare treewide: avoid unused variables in if statements keys: disambiguate construction from initializer_list<bytes> cql3: expr: fix serialize_listlike() reference-to-temporary with gcc compaction: error on invalid scrub type treewide: prevent redefining names api: task_manager: fix signed/unsigned compare alternator: streams: fix signed/unsigned comparison test: fix some mismatched signed/unsigned comparisons	2023-03-24 15:16:05 +02:00
Vlad Zolotarov	f94bbc5b34	transport: add per-scheduling-group CQL opcode-specific metrics This patch extends a previous patch that added these metrics globally: - cql_requests_count - cql_request_bytes - cql_response_bytes This patch adds a "scheduling_group_name" label to these metrics and changes corresponding counters to be accounted on a per-scheduling-group level. As a bonus this patch also marks all 3 metrics as 'skip_when_empty'. Ref #13061 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <20230321201412.3004845-1-vladz@scylladb.com>	2023-03-22 13:27:48 +02:00
Avi Kivity	19810cfc5e	transport: correctly format unknown opcode gcc allows an enum to contain values outside its members. For extra safety, as this can be user visible, format the unknown opcode and return it.	2023-03-21 15:43:00 +02:00
Avi Kivity	e75009cd49	treewide: catch by reference gcc rightly warns about capturing by value, so capture by reference.	2023-03-21 15:43:00 +02:00
Vlad Zolotarov	ae6724f155	transport: refactor CQL metrics This patch reorganizes and extends CQL related metrics. Before this patch we only had counters for specific CQL requests. However, many times we need to reason about the size of CQL queries: corresponding requests and response sizes. This patch adds corresponding metrics: - Arranges all 3 per-opcode statistics counters in a single struct. - Defines a vector of such structs for each CQL opcode. - Adjusts statistics updates accordingly - the code is much simpler now. - Removes old metrics that were accounting some CQL opcodes. - Adds new per-opcode metrics for requests number, request and response sizes: - New metrics are of a derived kind - rate() should be applied to them. - There are 3 new metrics names: - 'cql_requests_count' - 'cql_request_bytes' - 'cql_response_bytes' - New metrics have a per-opcode label - 'kind'. For example: A number of response bytes for an EXECUTE opcode on shard 0 looks as follows: scylla_transport_cql_response_bytes{kind="EXECUTE",shard="0"} Ref #13061 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <20230302154816.299721-1-vladz@scylladb.com>	2023-03-07 12:02:34 +02:00
Kefu Chai	412953fdd5	compress, transport: do not detect LZ4_compress_default() `LZ4_compress_default()` was introduced in liblz4 v1.7.3, despite that the release note (https://github.com/lz4/lz4/releases/tag/v1.7.3) of v1.7.3 didn't mention this. if we check the commit which added this API, we can find all releases including it: see ``` $ git tag --contains 1b17bf2ab8cf66dd2b740eca376e2d46f7ad7041 lz4-r130 r129 r130 r131 rc129v0 v1.7.3 v1.7.4 v1.7.4.2 v1.7.5 v1.8.0 v1.8.1 v1.8.1.2 v1.8.2 v1.8.3 v1.9.0 v1.9.1 v1.9.2 v1.9.3 v1.9.4 ``` and v1.7.3 was released in Nov 17, 2016. some popular distros releases also package new enough liblz4: - fedora 35 ships lz4-devel 1.9.3, - CentOS 7 ships lz4-devel 1.8.3 - debian 10 ships liblz4-dev 1.8.3 - ubuntu 18.04 ships liblz4-dev r131 so, in this change, we drop the support of liblz4 < 1.7.3 for better code readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #12971	2023-02-23 14:39:20 +02:00
Kefu Chai	0cb842797a	treewide: do not define/capture unused variables these warnings are found by Clang-17 after removing `-Wno-unused-lambda-capture` and '-Wno-unused-variable' from the list of disabled warnings in `configure.py`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-02-15 22:57:18 +02:00
Petr Gusev	3263523b54	transport server: fix "request size too large" handling Calling _read_buf.close() doesn't imply eof(), some data may have already been read into kernel or client buffers and will be returned next time read() is called. When the _server._max_request_size limit was exceeded and the _read_buf was closed, the process_request method finished and we started processing the next request in connection::process. The unread data from _read_buf was treated as the header of the next request frame, resulting in "Invalid or unsupported protocol version" error. The existing test_shed_too_large_request was adjusted. It was originally written with the assumption that the data of a large query would simply be dropped from the socket and the connection could be used to handle the next requests. This behaviour was changed in scylladb#8800, now the connection is closed on the Scylla side and can no longer be used. To check there are no errors in this case, we use Scylla metrics, getting them from the Scylla Prometheus API.	2023-02-08 00:07:08 +04:00
Petr Gusev	0904f98ebf	transport server: log failed requests with debug level These logs can be helpful for debugging, e.g. if an error was not handled correctly by the client driver, or another error occurred while handling it.	2023-02-08 00:07:08 +04:00
Petr Gusev	a4cf509c3d	transport server: fix unexpected server errors handling If request processing ended with an error, it is worth sending the error to the client through make_error/write_response. Previously in this case we just wrote a message to the log and didn't handle the client connection in any way. As a result, the only thing the client got in this case was timeout error. A new test_batch_with_error is added. It is quite difficult to reproduce error condition in a test, so we use error injection instead. Passing injection_key in the body of the request ensures that the exception will be thrown only for this test request and will not affect other requests that the driver may send in the background. Closes: scylladb#12104	2023-02-08 00:07:02 +04:00
Petr Gusev	bd80a449d5	transport server: log client errors with debug level Ideally, these errors should be transparently delivered to the client, but in practice, due to various flaws/bugs in scylla and/or the driver, they can be lost, which enormously complicates troubleshooting. const socket_address& get_remote_address() is needed for its convenient conversion to string, which includes ip and port.	2023-02-07 13:53:38 +04:00
Avi Kivity	0b418fa7cf	cql3, transport, tests: remove "unset" from value type system The CQL binary protocol introduced "unset" values in version 4 of the protocol. Unset values can be bound to variables, which cause certain CQL fragments to be skipped. For example, the fragment `SET a = :var` will not change the value of `a` if `:var` is bound to an unset value. Unsets, however, are very limited in where they can appear. They can only appear at the top-level of an expression, and any computation done with them is invalid. For example, `SET list_column = [3, :var]` is invalid if `:var` is bound to unset. This causes the code to be littered with checks for unset, and there are plenty of tests dedicated to catching unsets. However, a simpler way is possible - prevent the infiltration of unsets at the point of entry (when evaluating a bind variable expression), and introduce guards to check for the few cases where unsets are allowed. This is what this long patch does. It performs the following: (general) 1. unset is removed from the possible values of cql3::raw_value and cql3::raw_value_view. (external->cql3) 2. query_options is fortified with a vector of booleans, unset_bind_variable_vector, where each boolean corresponds to a bind variable index and is true when it is unset. 3. To avoid churn, two compatiblity structs are introduced: cql3::raw_value{,_view}_vector_with_unset, which can be constructed from a std::vector<raw_value{,_view/}>, which is what most callers have. They can also be constructed with explicit unset vectors, for the few cases they are needed. (cql3->variables) 4. query_options::get_value_at() now throws if the requested bind variable is unset. This replaces all the throwing checks in expression evaluation and statement execution, which are removed. 5. A new query_options::is_unset() is added for the users that can tolerate unset; though it is not used directly. 6. A new cql3::unset_operation_guard class guards against unsets. It accepts an expression, and can be queried whether an unset is present. Two conditions are checked: the expression must be a singleton bind variable, and at runtime it must be bound to an unset value. 7. The modification_statement operations are split into two, via two new subclasses of cql3::operation. cql3::operation_no_unset_support ignores unsets completely. cql3::operation_skip_if_unset checks if an operand is unset (luckily all operations have at most one operand that tolerates unset) and applies unset_operation_guard to it. 8. The various sites that accept expressions or operations are modified to check for should_skip_operation(). This are the loops around operations in update_statement and delete_statement, and the checks for unset in attributes (LIMIT and PER PARTITION LIMIT) (tests) 9. Many unset tests are removed. It's now impossible to enter an unset value into the expression evaluation machinery (there's just no unset value), so it's impossible to test for it. 10. Other unset tests now have to be invoked via bind variables, since there's no way to create an unset cql3::expr::constant. 11. Many tests have their exception message match strings relaxed. Since unsets are now checked very early, we don't know the context where they happen. It would be possible to reintroduce it (by adding a format string parameter to cql3::unset_operation_guard), but it seems not to be worth the effort. Usage of unsets is rare, and it is explicit (at least with the Python driver, an unset cannot be introduced by ommission). I tried as an alternative to wrap cql3::raw_value{,_view} (that doesn't recognize unsets) with cql3::maybe_unset_value (that does), but that caused huge amounts of churn, so I abandoned that in favor of the current approach. Closes #12517	2023-01-16 21:10:56 +02:00
Avi Kivity	7a8a442c1e	transport: drop some dead code around v1 and v2 protocols In `424dbf43f` ("transport: drop cql protocol versions 1 and 2"), we dropped support for protocols 1 and 2, but some code remains that checks for those versions. It is now dead code, so remove it. Closes #12497	2023-01-12 12:52:19 +02:00
Avi Kivity	2739ac66ed	treewide: drop cql_serialization_format Now that we don't accept cql protocol version 1 or 2, we can drop cql_serialization format everywhere, except when in the IDL (since it's part of the inter-node protocol). A few functions had duplicate versions, one with and one without a cql_serialization_format parameter. They are deduplicated. Care is taken that `partition_slice`, which communicates the cql_serialization_format across nodes, still presents a valid cql_serialization_format to other nodes when transmitting itself and rejects protocol 1 and 2 serialization\ format when receiving. The IDL is unchanged. One test checking the 16-bit serialization format is removed.	2023-01-03 19:54:13 +02:00
Avi Kivity	424dbf43f3	transport: drop cql protocol versions 1 and 2 Version 3 was introduced in 2014 (Cassandra 2.1) and was supported in the very first version of Scylla (`2a7da21481` "CQL binary protocol"). Cassandra 3.0 (2015) dropped protocols 1 and 2 as well. It's safe enough to drop it now, 9 years after introduction of v3 and 7 years after Cassandra stopped supporting it. Dropping it allows dropping cql_serialization_format, which causes quite a lot of pain, and is probably broken. This will be dropped in the following patch.	2023-01-03 19:47:49 +02:00
Eliran Sinvani	5a5514d052	cql server: Only parallelize relevant cql requests The cql server uses an execution stage to process and execute queries, however, processing stage is best utilized when having a recurrent flow that needs to be called repeatedly since it better utilizes the instruction cache. Up until now, every request was sent through the processing stage, but most requests are not meant to be executed repeatedly with high volume. This change processes and executes the data queries asynchronously, through an execution stage, and all of the rest are processed one by one, only continuing once the request has been done end to end. Tests: Unit tests in dev and debug. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com> Closes #12202	2022-12-05 21:06:58 +02:00
Avi Kivity	5dedf85288	transport: prevent signed and unsigned comparison This can lead to undefined behavior. Cast to unsigned, after we've verified the value is indeed positive.	2022-11-28 21:58:30 +02:00
Karol Baryła	1c2eef384d	transport/server.cc: Return correct size of decompressed lz4 buffer An incorrect size is returned from the function, which could lead to crashes or undefined behavior. Fix by erroring out in these cases. Fixes #11476	2022-09-07 10:58:23 +03:00
Piotr Dulikowski	efc3953c0a	transport: add rate_limit_error Adds a CQL protocol extension which introduces the rate_limit_error. The new error code will be used to indicate that the operation failed due to it exceeding the allowed per-partition rate limit. The error code is supposed to be returned only if the corresponding CQL extension is enabled by the client - if it's not enabled, then Config_error will be returned in its stead.	2022-06-22 20:07:58 +02:00
Avi Kivity	528ab5a502	treewide: change metric calls from make_derive to make_counter make_derive was recently deprecated in favor of make_counter, so make the change throughput the codebase. Closes #10564	2022-05-14 12:53:55 +02:00
Juliusz Stasiewicz	603dd72f9e	CQL: Replace assert by exception on invalid auth opcode One user observed this assertion fail, but it's an extremely rare event. The root cause - interlacing of processing STARTUP and OPTIONS messages - is still there, but now it's harmless enough to leave it as is. Fixes #10487 Closes #10503	2022-05-08 11:33:58 +03:00
Avi Kivity	987e6533d2	transport: return correct error codes when downgrading v4 {WRITE,READ}_FAILURE to {WRITE,READ}_TIMEOUT Protocol v4 added WRITE_FAILURE and READ_FAILURE. When running under v3 we downgrade these exceptions to WRITE_TIMEOUT and READ_TIMEOUT (since the client won't understand the v4 errors), but we still send the new error codes. This causes the client to become confused. Fix by updating the error codes. A better fix is to move the error code from the constructor parameter list and hard-code it in the constructor, but that is left for a follow-up after this minimal fix. Fixes #5610. Closes #10362	2022-04-12 19:19:52 +03:00
Pavel Emelyanov	de6c60c1c9	client_data: Sanitize connection_notifier Now the connection_notifier is all gone, only the client_data bits are left. To keep it consistent -- rename the files. Also, while at it, brush up the header dependencies and remove the not really used constexprs for client states. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-02-18 15:02:26 +03:00
Pavel Emelyanov	d63ba87266	transport: Indentation fix after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-02-18 15:02:26 +03:00
Pavel Emelyanov	971c431a23	code: Remove old on-disk version of system.clients table This includes most of the connection_notifier stuff as well as the auxiliary code from system_keyspace.cc and a bunch of updating calls from the client state changing. Other than less code and less disk updates on clients connection paths, this removes one usage of the nasty global qctx thing. Since the system.clients goes away rename the system.clients_v here too so the table is always present out there. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-02-18 15:02:26 +03:00
Pavel Emelyanov	7bc697ec99	protocol_server: Add get_client_data call The call returns a chunked_vector with client_data's. For now only the native transport implements it, others return empty vector. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-02-18 14:25:08 +03:00
Pavel Emelyanov	0046cdc6cb	transport: Track client state for real Right now when the client state changes the respective update is performed on the system.clients table. While doing it some bits from this state are lost from the in-memory structures. For the sake of exporting this information we need to track whether the connected client goes authenticating or is already ready. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-02-18 14:25:08 +03:00
Avi Kivity	7cc43f8aa8	Merge 'utils: add result_try and result_futurize_try' from Piotr Dulikowski Adds `utils::result_try` and `utils::result_futurize_try` - functions which allow to convert existing try..catch blocks into a version which handles C++ exceptions, failed results with exception containers and, depending on the function variant, exceptional futures using the same exception handling logic. For example, you can convert the following try..catch block: try { return a_function_that_may_throw(); } catch (const my_exception& ex) { return 123; } catch (...) { throw; } ...to this: return utils::result_try([&] { return a_function_that_may_throw_or_return_a_failed_result(); }, utils::result_catch<my_exception>([&] (const Ex&) { return 123; }), utils::result_catch_dots([&] (auto&& handle) { return handle.into_result(); }); Similarly, `utils::result_futurize_try` can be used to migrate `then_wrapped` or `f.handle_exception()` constructs. As an example of the usability of the new constructs, two places in the current code which need to simultaneously handle exceptions and failed results are converted to use `result_try` and `result_futurize_try`. Results of `perf_simple_query --smp 1 --operations-per-shard 1000000 --write`: ``` 127041.61 tps ( 67.2 allocs/op, 14.2 tasks/op, 52422 insns/op) 126958.60 tps ( 67.2 allocs/op, 14.2 tasks/op, 52409 insns/op) 127088.37 tps ( 67.2 allocs/op, 14.2 tasks/op, 52411 insns/op) 127560.84 tps ( 67.2 allocs/op, 14.2 tasks/op, 52424 insns/op) 127826.61 tps ( 67.2 allocs/op, 14.2 tasks/op, 52406 insns/op) 126801.02 tps ( 67.2 allocs/op, 14.2 tasks/op, 52420 insns/op) 125371.51 tps ( 67.2 allocs/op, 14.2 tasks/op, 52425 insns/op) 126498.51 tps ( 67.2 allocs/op, 14.2 tasks/op, 52427 insns/op) 126359.41 tps ( 67.2 allocs/op, 14.2 tasks/op, 52423 insns/op) 126298.27 tps ( 67.2 allocs/op, 14.2 tasks/op, 52410 insns/op) ``` The number of tasks and allocations is unchanged. The number of instructions per operations seems similar, it may have increased slightly (by 10-20) but it's hard to tell for sure because of the noisiness of the results. Tests: unit(dev) Closes #10045 * github.com:scylladb/scylla: transport: use result_try in process_request_one storage_proxy: use result_futurize_try in mutate_end storage_proxy: temporarily throw exception from result in mutate_end utils: add result_try and result_futurize_try	2022-02-13 19:38:13 +02:00
Piotr Dulikowski	049564bd2d	transport: use result_try in process_request_one Adapts the exception handling logic in process_request_one so that it uses utils::result_try to handle both C++ exceptions and failed results in a unified way.	2022-02-10 17:35:32 +01:00
Piotr Dulikowski	81968f2c3a	transport/server: handle exceptions from coordinator_result without throwing Instead of throwing the exception contained in failed `result<>`, it is now inspected with a visitor which avoids the need for throwing.	2022-02-08 11:08:42 +01:00

1 2 3 4 5 ...

443 Commits