scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-30 05:07:05 +00:00

Author	SHA1	Message	Date
Nadav Har'El	509a41db04	alternator: change name of Alternator's SSL options When Alternator is enabled over HTTPS - by setting the "alternator_https_port" option - it needs to know some SSL-related options, most importantly where to pick up the certificate and key. Before this patch, we used the "server_encryption_options" option for that. However, this was a mistake: Although it sounds like these are the "server's options", in fact prior to Alternator this option was only used when communicating with other servers - i.e., connections between Scylla nodes. For CQL connections with the client, we used a different option - "client_encryption_options". This patch introduces a third option "alternator_encryption_options", which controls only Alternator's HTTPS server. Making it separate from the existing CQL "client_encryption_options" allows both Alternator and CQL to be active at the same time but with different certificates (if the user so wishes). For backward compatibility, we temporarily continue to allow server_encryption_options to control the Alternator HTTPS server if alternator_encryption_options is not specified. However, this generates a warning in the log, urging the user to switch. This temporary workaround should be removed in a future version. This patch also: 1. fixes the test run code (which has an "--https" option to test over https) to use the new name of the option. 2. Adds documentation of the new option in alternator.md and protocols.md - previously the information on how to control the location of the certificate was missing from these documents. Fixes #7204. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200930123027.213587-1-nyh@scylladb.com>	2020-10-14 18:13:57 +03:00
Benny Halevy	57cc5f6ae1	sstable_directory: use an external load_semaphore Although each sstable_directory limits concurrency using max_concurrent_for_each, there could be a large number of calls to do_for_each_sstable running in parallel (e.g. per keyspace X per table in the distributed_loader). To cap parallelism across sstable_directory instances and concurrent calls to do_for_each_sstable, start a sharded<semaphore> and pass a shared semaphore& to the sstable_directory:s. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-10-08 11:57:06 +03:00
Pavel Emelyanov	e7f74449a6	tracing: Keep qp anchor on backend The query processor is required in table_helper's used by tracing. Now everything is ready to push the query processor reference from main down to the table helpers. Because of the current initialization sequence it's only possible to have the started query processor at the .start_tracing() time. Earlier, when the sharded<tracing> is started the query processor is not yet started, so tracing keeps a pointer on local query processor. When tracing is stopped, the pointer is null-ed. This is safe (but an assert is put when dereferencing it), because on stop trace writes' gate is closed and the query processor is only used in them. Also there's still a chance that tracing remains started in case of start abort, but this is on-par with the current code -- sharded query processor is not stopped, so the memory is not freed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:45:19 +03:00
Pavel Emelyanov	87f1223965	tracing: Push query processor through init methods The goal is to make tracing keyspace helper reference query processor, so this patch adds the needed arguments through the initialization stack. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:45:12 +03:00
Pavel Emelyanov	b5f136c651	main: Start tracing in main Move the tracing::start_tracing() out of the storage_service::join_cluster. It anyway happens at the end of the join, so the logic is not changed, but it becomes possible to patch tracing further. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-10-06 15:44:59 +03:00
Avi Kivity	2bd264ec6a	sstables: remove background_jobs(), await_background_jobs() There are no more users for registering background jobs, so remove the mechanism and the remaining calls.	2020-09-23 20:55:17 +03:00
Pavel Emelyanov	9a15ebfe6a	repair: Move CHECKSUM_RANGE verb into repair/ The verb is sent by repair code, so it should be registered in the same place, not in main. Also -- the verb should be unregistered on stop. The global messaging service instance is made similarly to the row-level one, as there's no ready to use repair service. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-09-17 09:52:48 +03:00
Pavel Emelyanov	d5769346d7	repair: Toss messaging init/uninit calls The goal is to make it possible to reg/unreg not only row-level verbs. While at it -- equip the init call with sharded<database>& argument, it will be needed by the next patch. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-09-17 09:52:48 +03:00
Pavel Emelyanov	949a258809	storage_service: Uninit RPC verbs The service does this on stop, which is never called, so do it separately. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-09-17 09:52:45 +03:00
Eliran Sinvani	342fc07bd6	Storage proxy: add a dedicated smp group for hints Hints and regular writes currently use the same cross shard operation semaphore, which can lead to priority inversion, making cross shard writes wait for cross shard hints. This commit adds an smp_service_group for hints and adds its usage in the mutate_hint function.	2020-09-07 15:46:12 +03:00
Pavel Emelyanov	623f61e63e	messaging_service: Unglobal messaging service instance Remove the global messaging_service, keep it on the main stack. But also store a pointer on it in debug namespace for debugging. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	4ea3c2797c	storage_service: Keep reference on sharded messaging service It is a bit of a step backward in the storage-service decomposition campaign, but... Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	6c49127d04	migration_manager: Keep reference on messaging That's another user of messaging service, init it with private reference. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	74494bac87	repair: Keep reference on messaging in row-level code The row-level repair keeps its statics for needed services, same as the streaming does. Treat the messaging service the same way to stop using the global one in the next patches. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	8b4820b520	repair: Keep sharded messaging service in API The reference will be needed in repair_start, so prepare one in advance Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	126dac8ad1	repair: Unset API endpoints on stop This unset is the roll-back of the corresponding _set-s. The messaging service will be (already is, but implicitly) used in repair API callbacks, so make sure they are unset before the messaging service is stopped. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:53 +03:00
Pavel Emelyanov	fe2c479c04	repair: Setup API endpoints in separate helper There will be the unset part soon, this is the preparation. No functional changes in api/storage_server.cc, just move the code. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:52 +03:00
Pavel Emelyanov	a6888e3ce3	streaming: Keep reference on messaging Streaming uses messaging, init it with its own reference. Nowadays the whole streaming subsystem uses global static references on the needed services. This is not nice, but still better than just using code-wide globals, so treat the messaging service here the same way. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:52 +03:00
Pavel Emelyanov	24cb1b781f	storage_proxy: Keep reference on messaging The proxy is another user of messaging, so keep the reference on it. Its real usage will come in next patches. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:52 +03:00
Pavel Emelyanov	65bd54604d	gossiper: Use messaging service by reference Gossiper needs messaging service, the messaging is started before the gossiper, so we can push the former reference into it. Gossiper is not stopped for real, neither the messaging service is, so the memory usage is still safe. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 20:50:52 +03:00
Pavel Emelyanov	b895c2971a	api: Use local reference to messaging_service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	d477bd562d	api: Unregister messaging endpoints on stop API is one of the subsystems that work with messaging service. To keep the dependencies correct the related API stuff should be stopped before the messaging service stops. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	78298ec776	init: Use local messaging reference in main There are few places that initialize db and system_ks and need the messaging service. Pass the reference to it from main instead of using the global helpers. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	878c50b9ad	main: Keep reference on global messaging service This is the preparation for moving the message service to main -- keep a reference and eventually pass one to subsystems depending on messaging. Once they are ready, the reference will be turned into an instance. For now only push the reference into the messaging service init/exit itself, other subsystems will be patched next. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	bdfb77492f	init: The messaging_service::stop is back (not really) Introduce back the .stop() method that will be used to really stop the service. For now do not do sharded::stop, as its users are not yet stopping, so this prevents use-after-free on messaging service. For now the .stop() is empty, but will be in charge of checking if all the other users had unregistered their handlers from rpc. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	15998e20ce	init: Move messaging service init up the main() The messaging service is a low-level one which doesn't need other services, so it can be started first. Nowadays it's indeed started before most of its users but one -- the gossiper. In current code gossiper doesn't do anything with messaging service until it starts, but very soon this dependency will be expressed in terms of a reference from gossiper to messaging_service, thus by the time the latter starts, the former should already exist. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	c28aeaee2e	messaging_service: Move initialization to messaging/ Now the init_messaging_service() only deals with messaging service and related internal stuff, so it can sit in its own module. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	41eee249d7	init: RIP init_scheduling_config This struct is nowadays only used to transport arguments from db::config to messaging_service::scheduling_config, things get simpler if dropping it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	ef6c75a732	init: Call init_messaging_service with its config only This makes the messaging service configuration completely independent from the db config. Next step would be to move the messaging service init code into its module. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Emelyanov	f7d99b4a06	init: Split messaging service and gossiper initialization The init_ms_fd_gossiper function initializes two services, but effectively consists of two independent parts, so declare them as such. The duplication of listen address resolution will go away soon. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-08-19 13:08:12 +03:00
Pavel Solodovnikov	9aa4712270	lwt: introduce `paxos_grace_seconds` per-table option to set paxos ttl Previously system.paxos TTL was set as max(3h, gc_grace_seconds). Introduce new per-table option named `paxos_grace_seconds` to set the amount of seconds which are used to TTL data in paxos tables when using LWT queries against the base table. Default value is equal to `DEFAULT_GC_GRACE_SECONDS`, which is 10 days. This change makes it easy to test various issues related to paxos TTL. Fixes #6284 Tests: unit (dev, debug) Co-authored-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20200816223935.919081-1-pa.solodovnikov@scylladb.com>	2020-08-17 16:44:14 +02:00
Piotr Jastrzebski	c001374636	codebase wide: replace count with contains C++20 introduced `contains` member functions for maps and sets for checking whether an element is present in the collection. Previously the `count` function was often used in various ways. `contains` not only expresses the intent of the code better but also does it in a more unified way. This commit replaces all the occurrences of the `count` with the `contains`. Tests: unit(dev) Signed-off-by: Piotr Jastrzebski <piotr@scylladb.com> Message-Id: <b4ef3b4bc24f49abe04a2aba0ddd946009c9fcb2.1597314640.git.piotr@scylladb.com>	2020-08-15 20:26:02 +03:00
Calle Wilund	05851578d4	alternator::streams: Report streams as not ready until CDC stream id:s are available Refs #6864 When booting a clean scylla, CDC stream ID:s will not be available until a ring delay time period has passed. Before this, writing to a CDC enabled table will fail hard. For alternator (and its tests), we can report the stream(s) for tables as not yet available (ENABLING) until such time as id:s are computed. v2: Keep storage service ref in executor	2020-08-03 20:34:15 +03:00
Avi Kivity	4edfdfa78d	Merge 'Build id cleanups' from Benny " Refs #5525 - main: add --build-id option - build_id: mv sources to utils/ - build_id: throw on errors rather than assert - build_id: simplify callback pointer type casting " * bhalevy-build-id-cleanups: build_id: simplify callback pointer type casting build_id: mv sources to utils/ main: add --build-id option	2020-08-03 17:18:09 +03:00
Calle Wilund	30a700c5b0	system_keyspace: Remove support for legacy truncation records Fixes #6341 Since scylla no longer supports upgrading from a version without the "new" (dedicated) truncation record table, we can remove support for these and the migration thereof. Make sure the above holds wherever this is committed. Note that this does not remove the "truncated_at" field in system.local.	2020-08-03 17:16:26 +03:00
Benny Halevy	bf6e8f66d9	build_id: mv sources to utils/ The root directory is already overcrowded. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-03 15:55:16 +03:00
Benny Halevy	46f7d01536	main: add --build-id option Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-08-03 15:52:08 +03:00
Avi Kivity	257c17a87a	Merge "Don't depend on seastar::make_(lw_)?shared idiosyncrasies" from Rafael " While working on another patch I was getting odd compiler errors saying that a call to ::make_shared was ambiguous. The reason was that seastar has both: template <typename T, typename... A> shared_ptr<T> make_shared(A&&... a); template <typename T> shared_ptr<T> make_shared(T&& a); The second variant doesn't exist in std::make_shared. This series drops the dependency in scylla, so that a future change can make seastar::make_shared a bit more like std::make_shared. " * 'espindola/make_shared' of https://github.com/espindola/scylla: Everywhere: Explicitly instantiate make_lw_shared Everywhere: Add a make_shared_schema helper Everywhere: Explicitly instantiate make_shared cql3: Add a create_multi_column_relation helper main: Return a shared_ptr from defer_verbose_shutdown	2020-08-02 19:51:24 +03:00
Pavel Emelyanov	50d07696e4	main: Add missing calls to unregister RPC handlers The gossiper's and migration_manager's unregistration is done on the services' stop, for the rest we need to call the recently introduced methods. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:35:07 +03:00
Pavel Emelyanov	cc070ceca0	main: Shorten call to storage_proxy::init_messaging_service Just for brevity Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:31:57 +03:00
Rafael Ávila de Espíndola	ad6d65dbbd	Everywhere: Explicitly instantiate make_shared seastar::make_shared has a constructor taking a T&&. There is no such constructor in std::make_shared: https://en.cppreference.com/w/cpp/memory/shared_ptr/make_shared This means that we have to move from make_shared(T(...)) to make_shared<T>(...) if we don't want to depend on the idiosyncrasies of seastar::make_shared. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-21 10:33:49 -07:00
Rafael Ávila de Espíndola	8858873d85	main: Return a shared_ptr from defer_verbose_shutdown This moves a few calls to make_shared to a single location. This makes it easier to drop a dependency on the differences between seastar::make_shared and std::make_shared. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-21 10:33:44 -07:00
Calle Wilund	0708a9971a	executor: Add system_distributed_keyspace as parameter/member Streams implementation will require querying system tables etc to do its work, thus will need access to this object.	2020-07-15 08:10:23 +00:00
Pavel Emelyanov	8d2e05778c	main: Stop http server Currently it's not stopped at all, so calling a REST request at shutdown time may crash things at random places. Fixes: #5702 But it's not the end of the story. Since the server stays up while we are shutting things down, each subsystem should carefully handle the cases when it's half-down, but a request comes. A better solution is to unregister rest verbs eventually, but httpd's rules cannot do it now. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:27:28 +03:00
Pavel Emelyanov	ba47ef0397	snapshots: Move ops gate from storage_service Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 20:17:21 +03:00
Pavel Emelyanov	d989d9c1c7	snapshots: Initial skeleton A placeholder for snapshotting code that will be moved into it from the storage_service. Also -- pass it through the API for future use. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 19:54:14 +03:00
Pavel Emelyanov	9a8a1635b7	snapshots: Properly shutdown API endpoints Now with the seastar httpd routes unset() at hands we can shut down individual API endpoints. Do this for snapshot calls, this will make snapshot controller stop safe. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-26 17:27:45 +03:00
Glauber Costa	e40aa042a7	distributed_loader: reshard before the node is made online This patch moves the resharding process to use the new directory_with_sstables_handler infrastructure. There is no longer a clear reshard step, and that just becomes a natural part of populate_column_family. In main.cc, a couple of changes are necessary to make that happen. The first one obviously is to stop calling reshard. We also need to make sure that: - The compaction manager is started much earlier, so we can register resharding jobs with it. - auto compactions are disabled in the populate method, so resharding doesn't have to fight for bandwidth with auto compactions. Now that we are resharding through the sstable_directory, the old resharding code can be deleted. There is also no need to deal with the resharding backlog either, because the SSTables are not yet added to the sstable set at this point. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-06-18 09:37:18 -04:00
Glauber Costa	9902af894a	compaction_manager: rename run_resharding_job It will be used to run any custom job where the caller provides a function. One such example is indeed resharding, but reshaping SSTables can also fall here. The semaphore is also renamed, and we'll allow only one custom job at a time (across all possible types). We also remove the assumption of the scheduling group. The caller has to have already placed the code in the correct CPU scheduling group. The I/O priority class comes from the descriptor. To make sure that we don't regress, we wrap the entire reshard-at-boot code in the compaction class. Currently the setup would be done in the main group, and the actual resharding in the compaction group. Note that this is temporary, as this code is about to change. Signed-off-by: Glauber Costa <glauber@scylladb.com>	2020-06-18 09:00:27 -04:00
Pavel Emelyanov	60e283b23e	auth: Move away from storage_service Now after the auth start/stop is standalone, we can remove reference from storage service to it. This frees some tests from the need to carry the auth service around for nothing. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-06-12 22:14:33 +03:00

522 Commits