scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-02 13:06:57 +00:00

Author	SHA1	Message	Date
Kamil Braun	599393dcba	storage_service: return unordered_set from get_ignore_dead_nodes_for_replace	2023-03-24 12:44:37 +01:00
Kamil Braun	e8fb718e4a	Merge 'topology changes over raft' from Gleb Natapov The patch series introduces linearisable topology changes using raft protocol. The state machine driven by raft is described in "service: Introduce topology state machine". Some explanations about the implementation can be found in "storage_service: raft topology: implement topology management through raft". The code is not ready for production. There is not much in terms of error handling and integration with the rest of the system is not even started. For full integration request fencing will need to be implemented and token_metadata has to be extended to support not just "pending" nodes but concepts of "read replica set" and "write replica set". The code may be far from being usable, but it is hidden behind the "experimental raft" flag and having it in tree will relieve me from constant rebase burden. * 'raft-topology-v6' of github.com:scylladb/scylla-dev: storage_service: fix indentation from previous patch storage_service: raft topology: implement topology management through raft service: raft: make group0_guard move assignable service: raft: wire up apply() and snapshot transfer for topology in group0 state machine storage_service: raft topology: introduce a function that applies topology cmd to local state machine storage_service: raft topology: introduce a raft monitor and topology coordinator fibers storage_service: raft topology: introduce snapshot transfer code for the topology table raft topology: add RAFT_TOPOLOGY_CMD verb that will be used by topology coordinator to communicated with nodes bootstrapper: Add get_random_bootstrap_tokens function service: raft: add support for topology_change command into raft_group0_client service: raft: introduce topology_change group0 command system_keyspace: add a table to persist topology change state machine's state service: Introduce topology state machine data structures storage_proxy: not consult topology on local table write	2023-03-23 15:59:45 +01:00
Gleb Natapov	5a908c3f46	storage_service: fix indentation from previous patch	2023-03-23 16:29:56 +02:00
Gleb Natapov	f3bd7e9b8c	storage_service: raft topology: implement topology management through raft The code here implements the state machine described in "service: Introduce topology state machine". A topology operation is requested by writing into the topology_request field through raft. After that, the topology_change_transition() function running on a leader is responsible for driving the operation to completion. There is not much in terms of error handling here yet. If something fails, the code will just continue trying. topology_change_state_load(), which is (eventually) called on all nodes each time the state machine's state changes, is the glue between the raft view of the topology and the rest of the "legacy" system. The code there creates a token_metadata object from the raft view and fills in the peers table, which is needed for drivers. The gossiper is almost completely cut off from topology management, but the code still updates the node's state there to 'normal' and 'left' so that some legacy functionality continues working. Note that handlers for those states are disabled in raft mode. raft_topology_cmd_handler() is called by the topology coordinator and this is where the streaming happens. The kind of streaming depends on the state the node is in. The function is re-entrant. It can be called more than once and will either start a new operation, if it is the first invocation or the previous one failed, or wait for the previous operation to complete. The new code is hidden behind "experimental raft" and should not change how the system works if disabled. Some indentation here is intentionally left wrong and will be fixed by the next patch.	2023-03-23 16:29:56 +02:00
Gleb Natapov	8865d5cf13	service: raft: make group0_guard move assignable	2023-03-23 16:29:56 +02:00
Gleb Natapov	344b483425	service: raft: wire up apply() and snapshot transfer for topology in group0 state machine	2023-03-23 16:29:56 +02:00
Gleb Natapov	aca21d3318	storage_service: raft topology: introduce a function that applies topology cmd to local state machine The function applies the command to persistent storage and calls the stub function topology_change_state_load(), which will load the new state into memory in later patches.	2023-03-23 16:29:56 +02:00
Gleb Natapov	284afd9255	storage_service: raft topology: introduce a raft monitor and topology coordinator fibers The raft monitor fiber monitors the local server's raft state and starts the topology coordinator fiber when the server becomes a leader. It stops the coordinator fiber when the server is no longer a leader. The coordinator fiber waits for topology state changes, but there will be none yet.	2023-03-23 16:29:56 +02:00
Gleb Natapov	d69a887366	storage_service: raft topology: introduce snapshot transfer code for the topology table	2023-03-23 16:29:56 +02:00
Gleb Natapov	6a4d773b7e	raft topology: add RAFT_TOPOLOGY_CMD verb that will be used by topology coordinator to communicated with nodes Empty for now. Will be used later by the topology coordinator to communicate with other nodes to instruct them to start streaming, or start to fence read/writes.	2023-03-23 16:29:56 +02:00
Nadav Har'El	4fdcee8415	test/alternator: increase CQL connection timeout This patch increases the connection timeout in the get_cql_cluster() function in test/cql-pytest/run.py. This function is used to test that Scylla came up, and also test/alternator/run uses it to set up the authentication - which can only be done through CQL. The Python driver has 2-second and 5-second default timeouts that should have been more than enough for everybody (TM), but in #13239 we saw that in one case it apparently wasn't enough. So to be extra safe, let's increase the default connection-related timeouts to 60 seconds. Note this change only affects the Scylla boot in the test/*/run scripts, and it does not affect the actual tests - those have different code to connect to Scylla (see cql_session() in test/cql-pytest/util.py), and we already increased the timeouts there in #11289. Fixes #13239 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13291	2023-03-23 16:03:20 +02:00
Avi Kivity	afe6b0d8c9	Merge 'reader_concurrency_semaphore: add trace points for important events' from Botond Dénes Currently we have no visibility into what happens to a read in the reader concurrency semaphore as far as tracing is concerned. This series fixes that, storing a trace state pointer on the reader permit and using it to add trace messages to important semaphore related events: * admission decision * execution (execution stage functionality) * eviction This allows for seeing if the read suffered any delay in the semaphore. Example tracing (2 pages): ``` Tracing session: 8cc80d50-c72d-11ed-8427-14e21cc3ed56 activity \| timestamp \| source \| source_elapsed \| client -------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+-----------+----------------+----------- Execute CQL3 query \| 2023-03-20 10:43:16.773000 \| 127.0.0.1 \| 0 \| 127.0.0.1 Parsing a statement [shard 0] \| 2023-03-20 10:43:16.773754 \| 127.0.0.1 \| -- \| 127.0.0.1 Processing a statement [shard 0] \| 2023-03-20 10:43:16.773837 \| 127.0.0.1 \| 83 \| 127.0.0.1 Creating read executor for token -4911109968640856406 with all: {127.0.0.1} targets: {127.0.0.1} repair decision: NONE [shard 0] \| 2023-03-20 10:43:16.773874 \| 127.0.0.1 \| 121 \| 127.0.0.1 read_data: querying locally [shard 0] \| 2023-03-20 10:43:16.773877 \| 127.0.0.1 \| 123 \| 127.0.0.1 Start querying singular range {{-4911109968640856406, pk{000d73797374656d5f736368656d61}}} [shard 0] \| 2023-03-20 10:43:16.773881 \| 127.0.0.1 \| 128 \| 127.0.0.1 [reader concurrency semaphore] admitted immediately [shard 0] \| 2023-03-20 10:43:16.773884 \| 127.0.0.1 \| 130 \| 127.0.0.1 [reader concurrency semaphore] executing read [shard 0] \| 2023-03-20 10:43:16.773890 \| 127.0.0.1 \| 137 \| 127.0.0.1 Querying cache for range {{-4911109968640856406, pk{000d73797374656d5f736368656d61}}} and slice {(-inf, +inf)} [shard 0] \| 2023-03-20 10:43:16.773903 \| 127.0.0.1 \| 149 \| 127.0.0.1 Page stats: 1 partition(s), 0 static row(s) (0 live, 0 dead), 100 clustering row(s) (100 live, 0 dead) and 0 range tombstone(s) [shard 0] \| 2023-03-20 10:43:16.774674 \| 127.0.0.1 \| 920 \| 127.0.0.1 Caching querier with key 5eff94d2-e47a-43b2-8e3a-2d80a9cc3b3e [shard 0] \| 2023-03-20 10:43:16.774685 \| 127.0.0.1 \| 931 \| 127.0.0.1 Querying is done [shard 0] \| 2023-03-20 10:43:16.774688 \| 127.0.0.1 \| 934 \| 127.0.0.1 Done processing - preparing a result [shard 0] \| 2023-03-20 10:43:16.774706 \| 127.0.0.1 \| 953 \| 127.0.0.1 Request complete \| 2023-03-20 10:43:16.774225 \| 127.0.0.1 \| 1225 \| 127.0.0.1 Tracing session: 8d26f630-c72d-11ed-8427-14e21cc3ed56 activity \| timestamp \| source \| source_elapsed \| client ---------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+-----------+----------------+----------- Execute CQL3 query \| 2023-03-20 10:43:17.395000 \| 127.0.0.1 \| 0 \| 127.0.0.1 Parsing a statement [shard 0] \| 2023-03-20 10:43:17.395498 \| 127.0.0.1 \| -- \| 127.0.0.1 Processing a statement [shard 0] \| 2023-03-20 10:43:17.395558 \| 127.0.0.1 \| 60 \| 127.0.0.1 Creating read executor for token -4911109968640856406 with all: {127.0.0.1} targets: {127.0.0.1} repair decision: NONE [shard 0] \| 2023-03-20 10:43:17.395597 \| 127.0.0.1 \| 99 \| 127.0.0.1 read_data: querying locally [shard 0] \| 2023-03-20 10:43:17.395600 \| 127.0.0.1 \| 102 \| 127.0.0.1 Start querying singular range {{-4911109968640856406, pk{000d73797374656d5f736368656d61}}} [shard 0] \| 2023-03-20 10:43:17.395604 \| 127.0.0.1 \| 106 \| 127.0.0.1 Found cached querier for key 5eff94d2-e47a-43b2-8e3a-2d80a9cc3b3e and range(s) {{{-4911109968640856406, pk{000d73797374656d5f736368656d61}}}} [shard 0] \| 2023-03-20 10:43:17.395610 \| 127.0.0.1 \| 112 \| 127.0.0.1 Reusing querier [shard 0] \| 2023-03-20 10:43:17.395614 \| 127.0.0.1 \| 116 \| 127.0.0.1 [reader concurrency semaphore] executing read [shard 0] \| 2023-03-20 10:43:17.395622 \| 127.0.0.1 \| 125 \| 127.0.0.1 Page stats: 1 partition(s), 0 static row(s) (0 live, 0 dead), 11 clustering row(s) (11 live, 0 dead) and 0 range tombstone(s) [shard 0] \| 2023-03-20 10:43:17.395711 \| 127.0.0.1 \| 213 \| 127.0.0.1 Querying is done [shard 0] \| 2023-03-20 10:43:17.395718 \| 127.0.0.1 \| 221 \| 127.0.0.1 Done processing - preparing a result [shard 0] \| 2023-03-20 10:43:17.395734 \| 127.0.0.1 \| 236 \| 127.0.0.1 Request complete \| 2023-03-20 10:43:17.395276 \| 127.0.0.1 \| 276 \| 127.0.0.1 ``` Fixes: https://github.com/scylladb/scylladb/issues/12781 Closes #13255 * github.com:scylladb/scylladb: reader_concurrency_semaphore: add trace points for important events reader_permit: refresh trace_state on new pages reader_permit: keep trace_state pointer on permit test/perf/perf_collection: give more unique names to key comparators	2023-03-23 15:37:33 +02:00
Botond Dénes	7699904c54	Revert "repair: Reduce repair reader eviction with diff shard count" This reverts commit `c6087cf3a0`. Said commit can cause a deadlock when 2 or more repairs compete for locks on 2 or more nodes. Consider the following scenario: Node n1 and n2 in the cluster, 1 shard per node, rf = 2, each shard has 1 available unit for the reader lock n1 starts repair r1 r1-n1 (instance of r1 on node1) takes the reader lock on node1 n2 starts repair r2 r2-n2 (instance of r2 on node2) takes the reader lock on node2 r1-n2 will fail to take the reader lock on node2 r2-n1 will fail to take the reader lock on node1 As a result, neither r1 nor r2 can make progress and a deadlock happens. The complexity comes from the fact that a repair job needs a lock on more than one node. It is not guaranteed that all the participant nodes can take the lock in one shot. There is no simple solution to this so we have to revert this locking mechanism and look for another way to prevent reader thrashing when repairing nodes with mismatching shard count. Fixes: #12693 Closes #13266	2023-03-23 15:35:32 +02:00
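The scenario in the revert message can be replayed as a small single-threaded model. This is only a sketch under stated assumptions: `node_lock`, `outcome` and `simulate_cross_node_repairs` are hypothetical names, not Scylla code, and the real lock is a per-shard semaphore rather than a boolean.

```cpp
#include <cassert>
#include <string>

// Hypothetical model of a per-node reader lock with exactly one unit.
struct node_lock {
    bool taken = false;
    std::string owner;

    // Non-blocking acquire: succeeds only if the single unit is free.
    bool try_acquire(const std::string& repair) {
        if (taken) {
            return false;
        }
        taken = true;
        owner = repair;
        return true;
    }
};

// Whether each repair ended up holding the lock on both nodes.
struct outcome {
    bool r1_has_all;
    bool r2_has_all;
};

// Replays the interleaving from the commit message: each repair takes its
// local lock first, then tries the remote one.
inline outcome simulate_cross_node_repairs() {
    node_lock n1, n2;
    bool r1_n1 = n1.try_acquire("r1");  // r1 starts on n1
    bool r2_n2 = n2.try_acquire("r2");  // r2 starts on n2
    bool r1_n2 = n2.try_acquire("r1");  // fails: n2's unit is held by r2
    bool r2_n1 = n1.try_acquire("r2");  // fails: n1's unit is held by r1
    return {r1_n1 && r1_n2, r2_n2 && r2_n1};
}
```

Neither repair ends up holding both locks. If the acquire were blocking instead of a try, each would wait on the other indefinitely, which is the deadlock the revert avoids.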
Nadav Har'El	b5e61e1b83	test/cql-pytest, lwt: test for detection of contradicting batches Cassandra detects when a batch has both an IF EXISTS and IF NOT EXISTS on the same row, and complains this is not a useful request (after all, it can never succeed, because the batch can only succeed if both conditions are true, and that can't be if one checks IF EXISTS and the other IF NOT EXISTS). This patch adds a test, test_lwt_with_batch_conflict_1, which checks that this case results in an error. It passes on Cassandra, but xfails on Scylla which doesn't report an error in this case. A second test, test_lwt_with_batch_conflict_2, shows that the detection of the EXISTS / NOT EXISTS conflict is special, and other conflicts such as having both "r=1" and "r=2" for the same row, are NOT detected by Cassandra. Refs #13011. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13270	2023-03-23 13:35:21 +02:00
Pavel Emelyanov	b13ff5248c	sstables: Mark continuous_data_consumer::reader_position() const Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13285	2023-03-23 13:27:33 +02:00
Pavel Emelyanov	bee5593ba1	storage_service: Move node_ops_meta_data to .cc file It's declared in header, but is not used outside of .cc. Forward declaration in header would be enough. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13289	2023-03-23 13:22:39 +02:00
Tzach Livyatan	ea66c16818	Fix Enable Authorization doc page references a wrong CL used by a 'cassandra' user Fix https://github.com/scylladb/scylladb/issues/11633 Closes #11637	2023-03-23 13:20:36 +02:00
Kefu Chai	0421a82821	sstables: add type constraits right in parameter list for better readability. also, add `#include <concepts>`, as we should include what we use instead of relying on other headers to do this on our behalf. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13277	2023-03-23 13:57:22 +03:00
Anna Stuchlik	b54868c639	doc: disable the outdated banner This commit disables the banner that advertises the ScyllaDB University Live event, which already took place. Closes #13284	2023-03-23 08:57:45 +02:00
Kefu Chai	1197664f09	test: network_topology_strategy_test: silence warning clang warns when the implicit conversion changes the precision of the converted number. in this case, before being multiplied, `std::numeric_limits<unsigned long>::max() >> 1` is implicitly promoted to double so it can obtain the common type of double and unsigned long. and the compiler warns: ``` /home/kefu/dev/scylladb/test/boost/network_topology_strategy_test.cc:129:84: error: implicit conversion from 'unsigned long' to 'double' changes value from 9223372036854775807 to 9223372036854775808 [-Werror,-Wimplicit-const-int-float-conversion] return static_cast<unsigned long>(d(std::numeric_limits<unsigned long>::max() >> 1)) << 1; ~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~ ``` but 1. we don't really care about the precision here, we just want to map a double to a token represented by an int64_t 2. the maximum possible number being converted is less than 9223372036854775807, which is the maximum number of int64_t, which is in general an alias of `long long`, not to mention that LONG_MAX is always 2147483647, after shifting right, the result would be 1073741823 so this is a false alarm. in order to silence it, we explicitly cast the RHS of the `*` operator to double. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13221	2023-03-23 08:55:29 +02:00
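A minimal sketch of the kind of fix described, assuming the multiplied expression has the shape `d * (max >> 1)`. The exact expression in the test file is not fully recoverable from the quoted diagnostic, and `scale_fraction` is a hypothetical name.

```cpp
#include <cassert>
#include <limits>

// Hypothetical helper: scales a fraction d in [0, 1) onto the unsigned long
// range, producing an even value.
inline unsigned long scale_fraction(double d) {
    constexpr unsigned long half_max = std::numeric_limits<unsigned long>::max() >> 1;
    // The explicit cast states that rounding half_max to the nearest double is
    // intended, so clang's -Wimplicit-const-int-float-conversion does not fire.
    return static_cast<unsigned long>(d * static_cast<double>(half_max)) << 1;
}
```

The rounding itself is unchanged by the cast. Only the diagnostic goes away, because the conversion is no longer implicit.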
Botond Dénes	aee5dfaa84	Merge 'docs: Add card logos' from David Garcia Related issue https://github.com/scylladb/scylladb/issues/13119 Adds product logos to cards Preview: ![Welcome-to-ScyllaDB-Documentation-ScyllaDB-Docs (1)](https://user-images.githubusercontent.com/9107969/224996621-6c93676d-1427-4a28-a529-fd3cd2bc2d61.png) Closes #13167 * github.com:scylladb/scylladb: docs: Update custom styles docs: Update styles docs: Add card logos	2023-03-23 08:53:58 +02:00
Botond Dénes	0f5e845399	Merge 'docs: scylladb better php driver' from Daniel Reis Hey y'all! Me and @malusev998 are maintaining an updated version of the [PHP Driver ](https://github.com/he4rt/scylladb-php-driver) together with the @he4rt community and it has had a bunch of improvements over the last months. Before, it was working only on PHP 7.1 (DataStax branch), and on our branch we have it working on PHP 8.1 and 8.2. We are also using the ScyllaDB C++ Driver in this project and I think it is a good idea to point new users to this project since it's the most up-to-date PHP Driver maintained now. What do y'all think about that? Closes #13218 * github.com:scylladb/scylladb: fix: links to php driver fix: adding php versions into driver's description docs: scylladb better php driver	2023-03-23 08:53:30 +02:00
Tzach Livyatan	2d40952737	DOCS: remove invalid example from DML reference, WHERE clause section Closes #12596	2023-03-22 18:37:20 +02:00
Nadav Har'El	d1e6d9103a	Merge 'api: reference httpd::* symbols like 'httpd::'' from Kefu Chai this change is a leftover of `063b3be8a7`, which failed to include the changes in the header files. it turns out we have `using namespace httpd;` in seastar's `request_parser.rl`, and we should not rely on this statement to expose the symbols in `seastar::httpd` to `seastar` namespace. in this change, also, since `get_name()`, previously a non-static member function of `seastar_test`, is now a static member function, we need to update the tests which capture `this` for calling this function, so they don't capture `this` anymore. Closes #13202 * github.com:scylladb/scylladb: test: drop unused captured variables Update seastar submodule	2023-03-22 18:16:15 +02:00
Kefu Chai	596ea6d439	test: drop unused captured variables this should silence the warning like: ``` test/boost/multishard_mutation_query_test.cc:493:29: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] do_with_cql_env_thread([this] (cql_test_env& env) -> future<> { ^~~~ test/boost/multishard_mutation_query_test.cc:577:29: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] do_with_cql_env_thread([this] (cql_test_env& env) -> future<> { ^~~~ 2 errors generated. ``` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-22 21:21:04 +08:00
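The warning above arises when a lambda captures `this` but only calls a static member. A minimal sketch, assuming a hypothetical `fixture` type standing in for the test class (the real tests use `do_with_cql_env_thread` and `seastar_test`, neither of which is reproduced here):

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a test fixture whose get_name() became static.
struct fixture {
    static std::string get_name() { return "fixture"; }

    std::string run() {
        // No [this] capture: a static member function is reachable without
        // it, and an unneeded capture triggers -Wunused-lambda-capture.
        auto body = [] { return get_name() + "_case"; };
        return body();
    }
};
```

Had `get_name()` remained a non-static member, the empty capture list would not compile and `[this]` would be required.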
Avi Kivity	4a18ee87eb	Update seastar submodule * seastar 9cbc1fe889...1204efbc5e (14): > http: Add lost pragma once into client.hh > prometheus, http: do not expose httpd::* in seastar > build: add haswell support > ci: fix configuration to build checkheaders target. > core: map_reduce: Fix use-after-free in variant with futurized reducer > Merge 'tests: support boost::test decorators and tolerate failures in test_spawn_input' from Kefu Chai > memory: support reallocing foreign (non-Seastar) memory on a reactor thread > test: futures: disable -Wself-move for GCC>=13 > map_reduce: do not move a temporary object > doc/building-dpdk.md: drop extraneous '$' > http: url_decode: translate plus back into char > Merge 'seastar-json2code: cleanups' from Kefu Chai > Fix markdown formatting > Merge 'Minor abort on OOM changes' from Travis Downs	2023-03-22 21:21:04 +08:00
Vlad Zolotarov	f94bbc5b34	transport: add per-scheduling-group CQL opcode-specific metrics This patch extends a previous patch that added these metrics globally: - cql_requests_count - cql_request_bytes - cql_response_bytes This patch adds a "scheduling_group_name" label to these metrics and changes corresponding counters to be accounted on a per-scheduling-group level. As a bonus this patch also marks all 3 metrics as 'skip_when_empty'. Ref #13061 Signed-off-by: Vlad Zolotarov <vladz@scylladb.com> Message-Id: <20230321201412.3004845-1-vladz@scylladb.com>	2023-03-22 13:27:48 +02:00
Botond Dénes	ff87f95a26	reader_concurrency_semaphore: add trace points for important events Notably, to admission, execution and eviction. Registering/unregistering the permit as inactive is not traced, as this happens on every buffer-fill for range scans. Semaphore trace messages have a "[reader_concurrency_semaphore]" prefix to allow them to be clearly associated with the semaphore.	2023-03-22 04:58:18 -04:00
Botond Dénes	1f51f752cc	reader_permit: refresh trace_state on new pages To make sure all tracing done on a certain page will make its way into the appropriate trace session. This is a continuation of the previous patch (which added trace pointer to the permit).	2023-03-22 04:58:10 -04:00
Botond Dénes	156e5d346d	reader_permit: keep trace_state pointer on permit And propagate it down to where it is created. This will be used to add trace points for semaphore related events, but this will come in the next patches.	2023-03-22 04:58:01 -04:00
Botond Dénes	27a4c24522	test/perf/perf_collection: give more unique names to key comparators perf.cc has two key comparators: key_compare and key_tri_compare. These are very generic names; in fact, key_compare directly clashes with a comparator with the same name in types.hh. Avoid the clash by renaming both of these to more unique names.	2023-03-22 04:58:01 -04:00
Nadav Har'El	2038388268	cql-pytest: translate Cassandra's tests for multi-column relations This is a translation of Cassandra's CQL unit test source file validation/operations/SelectMultiColumnRelationTest.java into our cql-pytest framework. The tests reproduce four already-known Scylla bugs and three new bugs. All tests pass on Cassandra. Because of these bugs 9 of the 22 tests are marked xfail, and one is marked skip (it crashes Scylla). Already known issues: Refs #64: CQL Multi column restrictions are allowed only on a clustering key prefix Refs #4178: Not covered corner case for key prefix optimization in filtering Refs #4244: Add support for mixing token, multi- and single-column restrictions Refs #8627: Cleanly reject updates with indexed values where value > 64k New issues discovered by these tests: Refs #13217: Internal server error when null is used in multi-column relation Refs #13241: Multi-column IN restriction with tuples of different lengths crashes Scylla Refs #13250: One-element multi-column restriction should be handled like a single-column restriction Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #13265	2023-03-22 09:54:32 +02:00
Tzach Livyatan	083408723f	doc: Add Mumur term to the glossery Point to the difference between the official MurmurHash3 and Scylla / Cassandra implementation Update docs/glossary.rst Co-authored-by: Anna Stuchlik <37244380+annastuchlik@users.noreply.github.com> Closes #11369	2023-03-21 22:45:47 +02:00
Alejo Sanchez	da00052ad8	gms, service: replicate live endpoints on shard 0 Call replicate_live_endpoints on shard 0 to copy from 0 to the rest of the shards. And get the list of live members from shard 0. Move lock to the callers. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Closes #13240	2023-03-21 15:46:12 +01:00
Gleb Natapov	fd6d45e178	bootstrapper: Add get_random_bootstrap_tokens function Does the same as get_bootstrap_tokens() but does not consult initial token config option. Will be used later.	2023-03-21 16:06:43 +02:00
Gleb Natapov	fc84c69b7e	service: raft: add support for topology_change command into raft_group0_client Extend raft_group0_client::prepare_command with support of topology_change type of command.	2023-03-21 16:06:43 +02:00
Gleb Natapov	16d61e791f	service: raft: introduce topology_change group0 command Also extend group0_command to be able to send new command type. The command consists of a mutation array.	2023-03-21 16:06:43 +02:00
Gleb Natapov	5e232ebee5	system_keyspace: add a table to persist topology change state machine's state Add local table to store topology change state machine's state there. Also add a function that loads the state to memory.	2023-03-21 16:06:43 +02:00
Gleb Natapov	a2b7d2c1a1	service: Introduce topology state machine data structures The topology state machine will track all the nodes in a cluster, their state, properties (topology, tokens, etc) and requested actions. Node state can be one of these: none - the node is not yet in the cluster bootstrapping - the node is currently bootstrapping decommissioning - the node is being decommissioned removing - the node is being removed replacing - the node is replacing another node normal - the node is working normally rebuild - the node is being rebuilt left - the node has left the cluster Nodes in state left are never removed from the state. Tokens can also be in one of these states: write_both_read_old - writes are going to new and old replicas, but reads are still from old replicas write_both_read_new - writes are still going to old and new replicas but reads are from new replicas owner - tokens are owned by the node and reads and writes go to the new replica set only Tokens that need to be moved start in the 'write_both_read_old' state. After the entire cluster learns about it, streaming starts. After the streaming, tokens move to the 'write_both_read_new' state and again the whole cluster needs to learn about it and make sure no reads started before that point exist in the system. After that tokens may move to the 'owner' state. topology_request is the field through which a topology operation request can be issued to a node. A request is one of the topology operations currently supported: join, leave, replace or remove.	2023-03-21 16:06:43 +02:00
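The node and token states listed in the message can be sketched as plain enums with a transition helper. The enumerator names follow the commit message. The type names and the helper functions (`next_token_state`, `reads_use_new_replicas`, `writes_reach_old_replicas`) are hypothetical and are not taken from the Scylla sources.

```cpp
#include <cassert>
#include <optional>

// Enumerators mirror the commit message; the types are hypothetical.
enum class node_state {
    none, bootstrapping, decommissioning, removing,
    replacing, normal, rebuild, left
};

enum class token_state { write_both_read_old, write_both_read_new, owner };

// Next step of the token migration in the message; owner is terminal.
inline std::optional<token_state> next_token_state(token_state s) {
    switch (s) {
    case token_state::write_both_read_old:
        return token_state::write_both_read_new;  // after streaming completes
    case token_state::write_both_read_new:
        return token_state::owner;                // after no old reads remain
    case token_state::owner:
        return std::nullopt;
    }
    return std::nullopt;
}

// Reads switch to the new replicas once streaming is done.
inline bool reads_use_new_replicas(token_state s) {
    return s != token_state::write_both_read_old;
}

// Writes keep reaching the old replicas until ownership is final.
inline bool writes_reach_old_replicas(token_state s) {
    return s != token_state::owner;
}
```

The sketch leaves out the cluster-wide barrier between steps, in which every node must learn of the new state before the next transition is allowed.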
Gleb Natapov	dd1e27736e	storage_proxy: not consult topology on local table write Writes to tables with local replication strategies do not need to consult the topology. This is not only an optimization but it allows writing into the local tables before topology is known.	2023-03-21 16:06:43 +02:00
Anna Stuchlik	922f6ba3dd	doc: fix the service name in upgrade guides Fixes https://github.com/scylladb/scylladb/issues/13207 This commit fixes the service and package names in the upgrade guides 5.0-to-2022.1 and 5.1-to-2022.2. Service name: scylla-server Package name: scylla-enterprise Previous PRs to fix the same issue in other upgrade guides: https://github.com/scylladb/scylladb/pull/12679 https://github.com/scylladb/scylladb/pull/12698 This commit must be backported to branch-5.1 and branch 5.2. Closes #13225	2023-03-21 15:56:28 +02:00
Kefu Chai	124410c059	api: reference httpd::* symbols like 'httpd::' this change is a leftover of `063b3be`, which failed to include the changes in the header files. it turns out we have `using namespace httpd;` in seastar's `request_parser.rl`, and we should not rely on this statement to expose the symbols in `seastar::httpd` to `seastar` namespace. in this change, api/*.hh: all httpd symbols are referenced by `httpd::` instead of being referenced as if they are in `seastar`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-21 15:49:10 +02:00
Nadav Har'El	77bf90bf7d	Merge 'Sanitize {format_types\|version_types} to/from string converters' from Pavel Emelyanov There's a need to convert both -- version and format -- to string and back. Currently, there's a dispersed set of helpers in sstables/ code doing that and this PR brings some order to it - adds fmt::formatter<> specialization for both types - leaves one set of {format\|version}_from_string() helpers converting any string-ish object into value refs: #12523 Closes #13214 * github.com:scylladb/scylladb: sstables: Expell sstable_version_types from_string() helper sstables: Generalize ..._from_string helpers sstables: Implement fmt::formatter<sstable_format_types> sstables: Implement fmt::formatter<sstable_version_types> sstables: Move format maps to namespace scope	2023-03-21 13:39:24 +02:00
Wojciech Mitros	406ea34aba	build: add wasm compilation target for rust In the future, when testing WASM UDFs, we will only store the Rust source codes of them, and compile them to WASM. To be able to do that, we need rust standard library for the wasm32-wasi target, which is available as an RPM called rust-std-static-wasm32-wasi. Closes #12896 [avi: regenerate toolchain] Closes #13258	2023-03-21 10:30:08 +02:00
Pavel Emelyanov	fe7609865d	Merge 'reader_concurrency_semaphore: improve diagnostics printout' from Botond Dénes Remove redundant "Total: ..." line. Include the entire `reader_concurrency_semaphore::stats` in the printout. This includes a lot of metrics not exported to monitoring. These metrics are very valuable when debugging timeouts but are otherwise uninteresting. To avoid bloating our monitoring with such niche metrics, we dump them when they are interesting: when timeouts happen. To be really helpful, we do need historic values too, but this shouldn't be a problem: timeouts come in bursts, we usually get at least a handful of diagnostics dumps at a time. New stats are also added to record the reason why reads are queued on the semaphore. Printout before: ``` INFO 2023-03-14 12:43:54,496 [shard 0] reader_concurrency_semaphore - Semaphore test_reader_concurrency_semaphore_memory_limit_no_leaks with 4/4 count and 7168/4096 memory resources: kill limit triggered, dumping permit diagnostics: permits count memory table/description/state 4 4 7K ./reader/active/unused 2 0 0B ./reader/waiting_for_admission 6 4 7K total Total: 6 permits with 4 count and 7K memory resources ``` Printout after: ``` INFO 2023-03-16 04:23:41,791 [shard 0] reader_concurrency_semaphore - Semaphore test_reader_concurrency_semaphore_memory_limit_no_leaks with 3/4 count and 7168/4096 memory resources: kill limit triggered, dumping permit diagnostics: permits count memory table/description/state 2 2 6K ./reader/active/unused 1 1 1K ./reader/waiting_for_memory 2 0 0B ./reader/waiting_for_admission 5 3 7K total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 0 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 1 reads_admitted: 4 reads_enqueued_for_admission: 4 reads_enqueued_for_memory: 5 reads_admitted_immediately: 2 reads_queued_because_ready_list: 0 reads_queued_because_used_permits: 0 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 4 reads_queued_with_eviction: 0 total_permits: 6 current_permits: 5 used_permits: 0 blocked_permits: 0 disk_reads: 0 sstables_read: 0 ``` Closes #13173 * github.com:scylladb/scylladb: test/boost/reader_concurrency_semaphore_test: remove redundant stats printouts reader_concurrency_semaphore: do_dump_reader_permit_diagnostics(): print the stats reader_concurrency_semaphore: add stats to record reason for queueing permits reader_concurrency_semaphore: can_admit_read(): also return reason for rejection	2023-03-21 10:41:11 +03:00
Pavel Emelyanov	eecb9244dd	sstables: Expell sstable_version_types from_string() helper Its name is too generic despite its narrow specialization. Also, there's a version_from_string() method that does the same in a more convenient way. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-21 09:56:18 +03:00
Pavel Emelyanov	4e99637777	sstables: Generalize ..._from_string helpers There are two string->{version\|format} converters living on class sstable. It's better to have both in namespace scope. Surprisingly, there's only one caller of it. Also this patch makes both accept std::string_view so as not to limit the helpers to converting only sstring&-s. This change calls for updating the reverse_map template with "heterogeneous lookup". Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-21 09:56:18 +03:00
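Heterogeneous lookup lets a map keyed by an owning string type be searched with a `std::string_view` without building a temporary key. The sketch below uses `std::map` with the transparent comparator `std::less<>` as a stand-in technique. It is not the `reverse_map` template from the patch, and `version_kind` and its values are illustrative assumptions.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <stdexcept>
#include <string>
#include <string_view>

// Hypothetical enum; the values are illustrative only.
enum class version_kind { ka, la, mc };

// Namespace-scope converter accepting any string-ish argument.
inline version_kind version_from_string(std::string_view s) {
    // std::less<> is transparent, so find() accepts std::string_view
    // directly and compares it against the std::string keys.
    static const std::map<std::string, version_kind, std::less<>> by_name{
        {"ka", version_kind::ka},
        {"la", version_kind::la},
        {"mc", version_kind::mc},
    };
    auto it = by_name.find(s);
    if (it == by_name.end()) {
        throw std::invalid_argument("unknown version: " + std::string(s));
    }
    return it->second;
}
```

With the default `std::less<std::string>` comparator, the same `find(s)` call would not compile for a `std::string_view` argument, since the conversion to `std::string` is explicit.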
Pavel Emelyanov	bb59dc2ec1	sstables: Implement fmt::formatter<sstable_format_types> Same as in previous patch for another enum-class type. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-21 09:56:18 +03:00
Pavel Emelyanov	6b04eb74d6	sstables: Implement fmt::formatter<sstable_version_types> This way the version type can be fed as-is into fmt:: code, respectively the conversion to string is as simple as fmt::to_string(v). So also drop the explicit existing to_string() helper updating all callers. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-21 09:56:18 +03:00
Pavel Emelyanov	ea1c6fbf98	sstables: Move format maps to namespace scope They will be used by the fmt::formatter specializations for version and format types in the next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-21 09:56:18 +03:00

35747 Commits