scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-03 06:35:51 +00:00

Author	SHA1	Message	Date
Pavel Emelyanov	dcdd207349	storage_service: Drop memory limiter Nobody uses it now. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-18 11:28:45 +01:00
Pavel Emelyanov	f0a79574d4	memory_limiter: Use main-local instance everyehere The cql_server and alternator both need the limiter, so patch them to stop using storage service's one and use the main-local one. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-18 11:28:45 +01:00
Pavel Emelyanov	359e9caf54	main: Have local memory limiter and carry where needed Prepare memory limiters to have non-global instance of the service. For now the main-local instance is not used and (!) is not stopped for real, just like the storage_service's one is. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-18 11:28:45 +01:00
Pavel Emelyanov	c2f94fb527	cql_server: Remove semaphore getter fn from config The cql_server() need to get the memory limiter semaphore from local storage service instance. To make this happen a callback in introduced on the config structure. The same can be achieved in a simler manner -- by providing the local storage service instances directly. Actually, the storage service will be removed in further patches from this place, so this patch is mostly to get rid of the callback from the config. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-03-18 11:28:45 +01:00
Nadav Har'El	bd742f2951	Merge 'treewide: get rid of incorrect reinterpret casts' from Michał Chojnowski In some places we use the `reinterpret_cast<const net::packed<T>>(&x)` pattern to reinterpret memory. This is a violation of C++'s aliasing rules, which invokes undefined behaviour. The blessed way to correctly reinterpret memory is to copy it into a new object. Let's do that. Note: the reinterpret_cast way has no performance advantage. Compilers recognize the memory copy pattern and optimize it away. Closes #8241 * github.com:scylladb/scylla: treewide: get rid of unaligned_cast treewide: get rid of incorrect reinterpret casts	2021-03-18 11:24:18 +01:00
Piotr Sarna	ea096de1b4	service, transport: avoid using private storage_service fields ... in the transport controller. Instead, simple getters would suffice. Message-Id: <582a71d0c1b61edf0107f5a2ef96536c395972d0.1615988516.git.sarna@scylladb.com>	2021-03-18 11:15:59 +02:00
Michał Chojnowski	4e35befcf2	treewide: get rid of incorrect reinterpret casts In some places we use the `reinterpret_cast<const net::packed<T>>(&x)` pattern to reinterpret memory. This is a violation of C++'s aliasing rules, which invokes undefined behaviour. The blessed way to correctly reinterpret memory is to copy it into a new object. Let's do that. Note: the reinterpret_cast way has no performance advantage. Compilers recognize the memory copy pattern and optimize it away.	2021-03-17 17:00:38 +01:00
Piotr Sarna	8635094144	transport: return error on correct stream during size shedding When a request is shed due to being too large, its response was sent with stream id 0 instead of the stream id that matches the communication lane. That in turn confused the client, which is no longer the case.	2021-03-02 15:10:46 +01:00
Piotr Sarna	d6ea6937ee	transport: return error on correct stream during shedding When a request is shed due to exceeding the max number of concurrent requests, its response was sent with stream id 0 instead of the stream id that matches the communication lane. That in turn confused the client, which is no longer the case.	2021-03-02 15:10:46 +01:00
Piotr Sarna	4a24d7dca0	transport: skip the whole request if it is too large When a request is shed due to being too large, only the header was actually read, and the body was still stuck in the socket - and would be read in the next iteration, which would expect to actually read a new request header. Instead, the whole message is now skipped, so that a new request can be correctly read and parsed. Fixes #8193	2021-03-02 10:10:19 +01:00
Piotr Sarna	3eb7e768cb	transport: skip the whole request during shedding When a request is shed due to exceeding the number of max concurrent requests, only its header was actually read, and the body was still stuck in the socket - and would be read in the next iteration, which would expect to actually read a new request header. Instead, the whole message is now skipped, so that a new request can be correctly read and parsed. Refs #8193	2021-03-02 10:10:19 +01:00
Benny Halevy	baf5d05631	storage_service: use atomic_vector for lifecycle_subscribers So it can be modified while walked to dispatch subscribed event notifications. In #8143, there is a race between scylla shutdown and notify_down(), causing use-after-free of cql_server. Using an atomic vector itstead and futurizing unregister_subscriber allows deleting from _lifecycle_subscribers while walked using atomic_vector::for_each. Fixes #8143 Test: unit(release) DTest: update_cluster_layout_tests:TestUpdateClusterLayout.add_node_with_large_partition4_test(release) materialized_views_test.py:TestMaterializedViews.double_node_failure_during_mv_insert_4_nodes_test(release) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210224164647.561493-2-bhalevy@scylladb.com>	2021-03-01 20:34:42 +02:00
Benny Halevy	1ed04affab	cql_server: event_notifier: unregister_subscriber in stop Move unregister_subscriber from the destructor to stop as preparation for moving storage_service lifescyle_subscribers to atomic_vector and futurizing unregister_subscriber. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20210224164647.561493-1-bhalevy@scylladb.com>	2021-03-01 20:34:42 +02:00
Piotr Sarna	c5214eb096	treewide: remove timeout config from query options Timeout config is now stored in each connection, so there's no point in tracking it inside each query as well. This patch removes timeout_config from query_options and follows by removing now unnecessary parameters of many functions and constructors.	2021-02-25 17:20:27 +01:00
Piotr Sarna	7ceafda70a	service: add timeout config to client state Future patches will use this per-connection timeout config to allow setting different timeouts for each session, based on roles.	2021-02-25 17:20:26 +01:00
Piotr Sarna	25f47561cb	transport: fix an outdated comment The comment mentions calling a lambda in-place, but the lambda is no longer there since 2019! Message-Id: <3903c84d5c151415409f28935e328b552dd548f8.1614155567.git.sarna@scylladb.com>	2021-02-24 11:14:01 +02:00
Pavel Emelyanov	8490c9ff6a	transport: Remove global storage service reference On start the transport controller keeps the storage service on server config's lambda just to let the server grab a database config option. The same can be achieved by passing the sharded database reference to sharded<server>::start, so that each server instance get local database with config. As an nice side effect transport::server's config looks more like a config with simple values and without methods and/or lambdas on board. tests: unit(dev) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20210205175611.13464-1-xemul@scylladb.com>	2021-02-08 12:58:49 +01:00
Juliusz Stasiewicz	29e4737a9b	transport: Fix abort on certain configurations of native_transport_port(_ssl) The reason was accessing the `configs` table out of index. Also, native_transport_port-s can no longer be disabled by setting to 0, as per the table below. Rules for port/encryption (the same apply to shard_aware counterpart): np := native_transport_port.is_set() nps := native_transport_port_ssl.is_set() ceo := ceo.at("enabled") == "true" eq := native_transport_port_ssl() == native_transport_port() +-----+-----+-----+-----+ \| np \| nps \| ceo \| eq \| +-----+-----+-----+-----+ \| 0 \| 0 \| 0 \| * \| => listen on native_transport_port, unencrypted \| 0 \| 0 \| 1 \| * \| => listen on native_transport_port, encrypted \| 0 \| 1 \| 0 \| * \| => nonsense, don't listen \| 0 \| 1 \| 1 \| * \| => listen on native_transport_port_ssl, encrypted \| 1 \| 0 \| 0 \| * \| => listen on native_transport_port, unencrypted \| 1 \| 0 \| 1 \| * \| => listen on native_transport_port, encrypted \| 1 \| 1 \| 0 \| * \| => listen on native_transport_port, unencrypted \| 1 \| 1 \| 1 \| 0 \| => listen on native_transport_port, unencrypted + native_transport_port_ssl, encrypted \| 1 \| 1 \| 1 \| 1 \| => native_transport_port(_ssl), encrypted +-----+-----+-----+-----+ Fixes #7783 Fixes #7866 Closes #7992	2021-02-02 11:32:31 +02:00
Nadav Har'El	702b1b97bf	cql: fix error return from execution of fromJson() and other functions As reproduced in cql-pytest/test_json.py and reported in issue #7911, failing fromJson() calls should return a FUNCTION_FAILURE error, but currently produce a generic SERVER_ERROR, which can lead the client to think the server experienced some unknown internal error and the query can be retried on another server. This patch adds a new cassandra_exception subclass that we were missing - function_execution_exception - properly formats this error message (as described in the CQL protocol documentation), and uses this exception in two cases: 1. Parse errors in fromJson()'s parameters are converted into a function_execution_exception. 2. Any exceptions during the execute() of a native_scalar_function_for function is converted into a function_execution_exception. In particular, fromJson() uses a native_scalar_function_for. Note, however, that functions which already took care to produce a specific Cassandra error, this error is passed through and not converted to a function_execution_exception. An example is the blobAsText() which can return an invalid_request error, so it is left as such and not converted. This also happens in Cassandra. All relevant tests in cql-pytest/test_json.py now pass, and are no longer marked xfail. This patch also includes a few more improvements to test_json.py. Fixes #7911 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210118140114.4149997-1-nyh@scylladb.com>	2021-01-21 15:21:13 +01:00
Kamil Braun	1a8630e6a7	transport: silence "broken pipe" and "connection reset by peer" errors The code would already silence broken pipe exceptions since it's expected when the other side closes the connection or when we shutdown the socket during Scylla shutdown, but the code wouldn't handle the following: 1. "Connection reset by peer" errors: these can also happen in the aforementioned two scenarios; the conditions that determine which of the two types of errors occur are unclear. 2. The scenarios would sometimes result in a `seastar::nested_exception`, mainly during shutdown. The errors could happen once when trying to send a response to a request (`_write_buf.write(...)/flush(...)`) and then again when trying to close the connection in a `finally` block. These nested exceptions were not silenced. The commit handles each of these cases. Closes #7907. Closes #7931	2021-01-19 10:30:17 +02:00
Pekka Enberg	8d00c16feb	transport/server: Code cleanups Fix up some coding style issues spotted while reading the code: - Fix indentation to be 4 spaces - Remove superfluous semicolons Closes #7793	2020-12-14 12:48:05 +02:00
Piotr Wojtczak	3560acd311	cql_metrics: Add metrics for CQL errors This change adds tracking of all the CQL errors that can be raised in response to a CQL message from a client, as described in the CQL v4 protocol and with Scylla's CDC_WRITE_FAILUREs included. Fixes #5859 Closes #7604	2020-11-30 12:18:37 +02:00
Calle Wilund	ae4d5a60ca	transport::controller: Shut down distributed object on startup exception Fixes #7211 If we start a sharded<> object, then proceed to do potentially exceptional stuff, we should destroy it on said exception. Otherwise, the exception propagation will abort on RAII destruction of the sharded<>. And we get no exception logging.	2020-11-25 15:52:47 +00:00
Piotr Wojtczak	d9810ec8eb	cql_metrics: Add counters for CQL request messages This change adds metrics for counting request message types listed in the CQL v.4 spec under section 4.1 (https://github.com/apache/cassandra/blob/trunk/doc/native_protocol_v4.spec). To organize things properly, we introduce a new cql_server::transport_stats object type for aggregating the message and server statistics. Fixes #4888 Closes #7574	2020-11-11 20:00:17 +02:00
Pavel Emelyanov	699074bd48	transport: Keep sharded query processor reference on controller Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-31 15:44:21 +03:00
Avi Kivity	e2a02f15c2	Merge 'transport/system_ks: Add more info to `system.clients`' from Juliusz Stasiewicz This patch fills the following columns in `system.clients` table: * `connection_stage` * `driver_name` * `driver_version` * `protocol_version` It also improves: * `client_type` - distinguishes cql from thrift just in case * `username` - now it displays correct username iff `PasswordAuthenticator` is configured. What is still missing: * SSL params (I'll happily get some advice here) * `hostname` - I didn't find it in tested drivers Refs #6946 Closes #7349 * github.com:scylladb/scylla: transport: Update `connection_stage` in `system.clients` transport: Retrieve driver's name and version from STARTUP message transport: Notify `system.clients` about "protocol_version" transport: On successful authentication add `username` to system.clients	2020-10-27 22:44:02 +02:00
Juliusz Stasiewicz	0251cb9b31	transport: Update `connection_stage` in `system.clients`	2020-10-12 18:44:00 +02:00
Juliusz Stasiewicz	6abe1352ba	transport: Retrieve driver's name and version from STARTUP message	2020-10-12 18:37:19 +02:00
Juliusz Stasiewicz	d2d162ece3	transport: Notify `system.clients` about "protocol_version"	2020-10-12 18:32:00 +02:00
Piotr Grabowski	369895b80f	transport: Delay NEW_NODE until CQL listen started After adding a new node to the cluster, Scylla sends a NEW_NODE event to CQL clients. Some clients immediately try to connect to the new node, however it fails as the node has not yet started listening to CQL requests. In contrast, Apache Cassandra waits for the new node to start its CQL server before sending NEW_NODE event. In practice this means that NEW_NODE and UP events will be sent "jointly" after new node is UP. This change is implemented in the same manner as in Apache Cassandra code. Fixes #7301. Closes #7306	2020-10-07 09:57:27 +03:00
Juliusz Stasiewicz	acf0341e9b	transport: On successful authentication add `username` to system.clients The username becomes known in the course of resolving challenges from `PasswordAuthenticator`. That's why username is being set on successful authentication; until then all users are "anonymous". Meanwhile, `AllowAllAuthenticator` (the default) does not request username, so users logged with it will remain as "anonymous" in `system.clients`. Shuffling of code was necessary to unify existing infrastructure for INSERTing entries into `system.clients` with later UPDATEs.	2020-10-06 18:52:46 +02:00
Piotr Dulikowski	bfbf02a657	transport/config: fix cross-shard use of updateable_value Recently, the cql_server_config::max_concurrent_requests field was changed to be an updateable_value, so that it is updated when the corresponding option in Scylla's configuration is live-reloaded. Unfortunately, due to how cql_server is constructed, this caused cql_server instances on all shards to store an updateable_value which pointed to an updateable_value_source on shard 0. Unsynchronized cross-shard memory operations ensue. The fix changes the cql_server_config so that it holds a function which creates an updateable_value appropriate for the given shard. This pattern is similar to another, already existing option in the config: get_service_memory_limiter_semaphore. This fix can be reverted if updateable_value becomes safe to use across shards. Tests: unit(dev) Fixes: #7310	2020-10-01 14:10:56 +03:00
Piotr Sarna	876e9fe51a	transport: make _requests_serving param uint32_t It's not realistic for a shard to have over 4 billion concurrent requests, so this value can be safely represented in 32 bits. Also, since the current concurrency limit is represented in uint32_t, it makes sense for these two to have matching types.	2020-09-30 08:20:52 +02:00
Piotr Sarna	d18f68f1c1	transport: make overloaded error message more descriptive The message now mentions the config variable used to set the limit of max allowed concurrent requests.	2020-09-30 08:20:51 +02:00
Piotr Sarna	792ff3757a	transport: add requests_shed metrics The counter shows a total number of requests shed due to overload.	2020-09-30 08:20:50 +02:00
Piotr Sarna	4b856cf62d	transport: make max_concurrent_requests_per_shard reloadable This configuration entry is expected to be used as a quick fix for an overloaded node, so it should be possible to reload this value without having to restart the server.	2020-09-29 10:11:36 +02:00
Piotr Sarna	4da8957461	transport: return exceptional future instead of throwing Throwing bears an additional cost, so it's better to simply construct the error in place and return it.	2020-09-29 10:00:30 +02:00
Piotr Sarna	b4db6d2598	transport,config: add a param for max request concurrency The newly introduced parameter - max_concurrent_requests_per_shard - can be used to limit the number of in-flight requests a single coordinator shard can handle. Each surplus request will be immediately refused by returning OverloadedException error to the client. The default value for this parameter is large enough to never actually shed any requests. Currently, the limit is only applied to CQL requests - other frontends like alternator and redis are not throttled yet.	2020-09-29 09:59:30 +02:00
Piotr Grabowski	ffd8c8c505	utf8: Print invalid UTF-8 character position Add new validate_with_error_position function which returns -1 if data is a valid UTF-8 string or otherwise a byte position of first invalid character. The position is added to exception messages of all UTF-8 parsing errors in Scylla. validate_with_error_position is done in two passes in order to preserve the same performance in common case when the string is valid.	2020-09-07 18:11:21 +03:00
Rafael Ávila de Espíndola	d18af34205	everywhere: Use future::get0 when appropriate This works with current seastar and clears most of the way for updating to a version that doesn't use std::tuple in futures. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200826231947.1145890-1-espindola@scylladb.com>	2020-08-27 15:05:51 +03:00
Piotr Jastrzebski	c001374636	codebase wide: replace count with contains C++20 introduced `contains` member functions for maps and sets for checking whether an element is present in the collection. Previously `count` function was often used in various ways. `contains` does not only express the intend of the code better but also does it in more unified way. This commit replaces all the occurences of the `count` with the `contains`. Tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <b4ef3b4bc24f49abe04a2aba0ddd946009c9fcb2.1597314640.git.piotr@scylladb.com>	2020-08-15 20:26:02 +03:00
Avi Kivity	58104d17e0	Merge 'transport: Allow user to disable unencrypted native transport' from Pekka " Let users disable the unencrypted native transport too by setting the port to zero in the scylla.yaml configuration file. Fixes #6997 " * penberg-penberg/native-transport-disable: docs/protocol: Document CQL protocol port configuration options transport: Allow user to disable unencrypted native transport	2020-08-11 16:30:52 +03:00
Piotr Jastrzebski	80e3923b3c	codebase wide: replace find(...) != end() with contains C++20 introduced `contains` member functions for maps and sets for checking whether an element is present in the collection. Previously the code pattern looked like: <collection>.find(<element>) != <collection>.end() In C++20 the same can be expressed with: <collection>.contains(<element>) This is not only more concise but also expresses the intend of the code more clearly. This commit replaces all the occurences of the old pattern with the new approach. Tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <f001bbc356224f0c38f06ee2a90fb60a6e8e1980.1597132302.git.piotr@scylladb.com>	2020-08-11 13:28:50 +03:00
Pekka Enberg	e401a26701	transport: Allow user to disable unencrypted native transport Let users disable the unencrypted native transport too by setting the port to zero in the scylla.yaml configuration file. Fixes #6997	2020-08-11 13:15:17 +03:00
Nadav Har'El	936cf4cce0	merge: Increase row limits Merged pull request https://github.com/scylladb/scylla/pull/6910 by Wojciech Mitros: This patch enables selecting more than 2^32 rows from a table. The change becomes active after upgrading whole cluster - until then old limits are used. Tested reading 4.5*10^9 rows from a virtual table, manually upgrading a cluster with ccm and performing cql SELECT queries during the upgrade, ran unit tests in dev mode and cql and paging dtests. tests: add large paging state tests increase the maximum size of query results to 2^64	2020-08-04 19:52:30 +03:00
Wojciech Mitros	45215746fe	increase the maximum size of query results to 2^64 Currently, we cannot select more than 2^32 rows from a table because we are limited by types of variables containing the numbers of rows. This patch changes these types and sets new limits. The new limits take effect while selecting all rows from a table - custom limits of rows in a result stay the same (2^32-1). In classes which are being serialized and used in messaging, in order to be able to process queries originating from older nodes, the top 32 bits of new integers are optional and stay at the end of the class - if they're absent we assume they equal 0. The backward compatibility was tested by querying an older node for a paged selection, using the received paging_state with the same select statement on an upgraded node, and comparing the returned rows with the result generated for the same query by the older node, additionally checking if the paging_state returned by the upgraded node contained new fields with correct values. Also verified if the older node simply ignores the top 32 bits of the remaining rows number when handling a query with a paging_state originating from an upgraded node by generating and sending such a query to an older node and checking the paging_state in the reply(using python driver). Fixes #5101.	2020-08-03 17:32:49 +02:00
Juliusz Stasiewicz	1c11d8f4c4	transport: Added listener with port-based load balancing The new port is configurable from scylla.yaml and defaults to 19042 (unencrypted, unless client configures encryption options and omits `native_shard_aware_transport_port_ssl`). Two "SUPPORTED" tags are added: "SCYLLA_SHARD_AWARE_PORT" and "SCYLLA_SHARD_AWARE_PORT_SSL". For compatibility, "SCYLLA_SHARDING_ALGORITHM" is still kept. Fixes #5239	2020-07-31 13:02:13 +02:00
Tomasz Grabiec	8bd7359d93	Merge "lwt: introduce LWT flag in prepared statement metadata" from Pavel This patch set adds a few new features in order to fix issue The list of changes is briefly as follows: - Add a new `LWT` flag to `cql3::prepared_metadata`, which allows clients to clearly distinguish betwen lwt and non-lwt statements without need to execute some custom parsing logic (e.g. parsing the prepared query with regular expressions), which is obviously quite fragile. - Introduce the negotiation procedure for cql protocol extensions. This is done via `cql_protocol_extension` enum and is expected to have an appropriate mirroring implementation on the client driver side in order to work properly. - Implmenent a `LWT_ADD_METADATA_MARK` cql feature on top of the aforementioned algorithm to make the feature negotiable and use it conditionally (iff both server and client agrees with each other on the set of cql extensions). The feature is meant to be further utilized by client drivers to use primary replicas consistently when dealing with conditional statements. * git@github.com:ManManson/scylla feature/lwt_prepared_meta_flag_2: lwt: introduce "LWT" flag in prepared statement metadata transport: introduce `cql_protocol_extension` enum and cql protocol extensions negotiation	2020-06-30 12:40:19 +03:00
Pavel Solodovnikov	6c6f3dbe42	lwt: introduce "LWT" flag in prepared statement metadata This patch adds a new `LWT` flag to `cql3::prepared_metadata`. That allows clients to clearly distinguish betwen lwt and non-lwt statements without need to execute some custom parsing logic (e.g. parsing the prepared query with regular expressions), which is obviously quite fragile. The feature is meant to be further utilized by client drivers to use primary replicas consistently when dealing with conditional statements. Whether to use lwt optimization flag or not is handled by negotiation procedure between scylla server and client library via SUPPORTED/STARTUP messages (`LWT_ADD_METADATA_MARK` extension). Tests: unit(dev, debug), manual testing with modified scylla/gocql driver Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-06-29 12:30:37 +03:00
Gleb Natapov	7ca937778d	cql transport: do not log broken pipe error when a client closes its side of a connection abruptly Fixes #5661 Message-Id: <20200615075958.GL335449@scylladb.com>	2020-06-16 13:59:12 +02:00

1 2 3 4 5 ...

425 Commits