scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 14:15:46 +00:00

Author	SHA1	Message	Date
Tomasz Grabiec	36d90e637e	Merge "Relax migration manager dependencies" from Pavel Emalyanov The set make dependencies between mm and other services cleaner, in particular, after the set: - the query processor no longer needs migration manager (which doesn't need query processor either) - the database no longer needs migration manager, thus the mutual dependency between these two is dropped, only migration manager -> database is left - the migration manager -> storage_service dependency is relaxed, one more patchset will be needed to remove it, thus dropping one more mutual dependency between them, only the storage_service -> migration manager will be left - the migration manager is stopped on drain, but several more services need it on stop, thus causing use after free problems, in particular there's a caught bug when view builder crashes when unregistering from notifier list on stop. Fixed. Tests: unit(dev) Fixes: #5404	2020-01-16 12:12:25 +01:00
Piotr Dulikowski	c383652061	gossip: allow for aborting on sleep This commit makes most sleeps in gossip.cc abortable. It is now possible to quickly shut down a node during startup, most notably during the phase while it waits for gossip to settle.	2020-01-16 12:05:50 +02:00
Pavel Emelyanov	5cf365d7e7	database: Explicitly pass migration_manager through init_non_system_keyspace This is the last place where database code needs the migration_manager instance to be alive, so now the mutual dependency between these two is gone, only the migration_manager needs the database, but not the vice-versa. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:29:21 +03:00
Pavel Emelyanov	d9edcb3f15	query_processor: Use migration_notifier This patch breaks one (probably harmless but still) dependency loop. The query_processor -> migration_manager -> storage_proxy -> tracing -> query_processor. The first link is not not needed, as the query_processor needs the migration_manager purely to (ub)subscribe on notifications. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	28f1250b8b	view_builder: Use migration notifier The migration manager itself is still needed on start to wait for schema agreement, but there's no longer the need for the life-time reference on it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	7cfab1de77	database: Switch on mnotifier from migration_manager Do not call for local migration manager instance to send notifications, call for the local migration notifier, it will always be alive. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	f45b23f088	storage_service: Keep migration_notifier The storage service will need this guy to initialize sub-services with. Also it registers itself with notifiers. That said, it's convenient to have the migration notifier on board. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:21 +03:00
Pavel Emelyanov	f240d5760c	migration_manager: Split notifier from main class The _listeners list on migration_manager class and the corresponding notify_xxx helpers have nothing to do with the its instances, they are just transport for notification delivery. At the same time some services need the migration manager to be alive at their stop time to unregister from it, while the manager itself may need them for its needs. The proposal is to move the migration notifier into a complete separate sharded "service". This service doesn't need anything, so it's started first and stopped last. While it's not effectively a "migration" notifier, we inherited the name from Cassandra and renaming it will "scramble neurons in the old-timers' brains but will make it easier for newcomers" as Avi says. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:28:19 +03:00
Pavel Emelyanov	1992755c72	storage_service: Kill initialization helper from init.cc The helper just makes further patching more complex, so drop it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-15 14:27:27 +03:00
Tomasz Grabiec	c8a5a27bd9	Merge "storage_service: Move load_broadcaster away" from Pavel E. The storage_service struct is a collection of diverse things, most of them requiring only on start and on stop and/or runing on shard 0 (but is nonetheless sharded). As a part of clearing this structure and generated by it inter- -componenes dependencies, here's the sanitation of load_broadcaster.	2020-01-14 19:26:06 +01:00
Piotr Sarna	36ec43a262	Merge "add table with connected cql clients" from Juliusz This change introduces system.clients table, which provides information about CQL clients connected. PK is the client's IP address, CK consists of outgoing port number and client_type (which will be extended in future to thrift/alternator/redis). Table supplies also shard_id and username. Other columns, like connection_stage, driver_name, driver_version..., are currently empty but exist for C* compatibility and future use. This is an ordinary table (i.e. non-virtual) and it's updated upon accepting connections. This is also why C*'s column request_count was not introduced. In case of abrupt DB stop, the table should not persist, so it's being truncated on startup. Resolves #4820	2020-01-14 10:01:07 +02:00
Juliusz Stasiewicz	27dfda0b9e	main/transport: using the infrastructure of system.clients Resolves #4820. Execution path in main.cc now cleans up system.clients table if it exists (this is done on startup). Also, server.cc now calls functions that notify about cql clients connecting/disconnecting.	2020-01-13 14:07:04 +01:00
Pavel Emelyanov	148da64a7e	storage_servce: Move load_broadcaster away This simplifies the storage_service API and fixes the complain about shared_ptr usage instead of unique_ptr. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-13 13:55:09 +03:00
Pavel Emelyanov	b6e1e6df64	misc_services: Introduce load_meter There's a lonely get_load_map() call on storage_service that needs only load broadcaster, always runs on shard 0 and that's it. Next patch will move this whole stuff into its own helper no-shard container and this is preparation for this. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-01-13 13:53:08 +03:00
Rafael Ávila de Espíndola	b80852c447	main: Explicitly allow scylla core dumps I have not looked into the security reason for disabling it when a program has file capabilities. Fixes #5560 [avi: remove extraneous semicolon] Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200106231836.99052-1-espindola@scylladb.com>	2020-01-07 11:15:59 +02:00
Pavel Emelyanov	f2b20e7083	cache_hitrate_calculator: Do not reinvent the peering_sharded_service The class in question wants to run its own instances on different shards, for this sake it keeps reference on sharded self to call invoke_on() on. There's a handy peering_sharded_service<> in seastar for the same, using it makes the code nicer and shorter. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191226112401.23960-1-xemul@scylladb.com>	2020-01-03 15:48:19 +02:00
Rafael Ávila de Espíndola	91c7f5bf44	Print build-id on startup Fixes #5426 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20191218031556.120089-1-espindola@scylladb.com>	2019-12-19 15:43:04 +02:00
Nadav Har'El	8157f530f5	merge: CDC: handle schema changes Merged pull request https://github.com/scylladb/scylla/pull/5366 from Calle Wilund: Moves schema creation/alter/drop awareness to use new "before" callbacks from migration manager, and adds/modifies log and streams table as part of the base table modification. Makes schema changes semi-atomic per node. While this does not deal with updates coming in before a schema change has propagated cluster, it now falls into the same pit as when this happens without CDC. Added side effect is also that now schemas are transparent across all subsystems, not just cql. Patches: cdc_test: Add small test for altering base schema (add column) cdc: Handle schema changes via migration manager callbacks migration_manager: Invoke "before" callbacks for table operations migration_listener: Add empty base class and "before" callbacks for tables cql_test_env: Include cdc service in cql tests cdc: Add sharded service that does nothing. cdc: Move "options" to separate header to avoid to much header inclusion cdc: Remove some code from header	2019-12-17 23:04:36 +02:00
Pavel Emelyanov	71a528d404	directories: Move the whole stuff into own .cc file In order not to pollute the root dir place the code in utils/ directory, "utils" namespace. While doing this -- move the touch_and_lock from the class declaration. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-12 19:52:01 +03:00
Pavel Emelyanov	f2b3c17e66	directories: Move all the dirs code into .init method The seastar::async usage is tempoarary, added for bisect-safety, soon it will go away. For this reason the indentation in the .init method is not "canonical", but is prepared for one-patch drop of the seastar::async. The hinted_handoff_enabled arg is there, as it's not just a parameter on config, it had been parsed in main.cc. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-12 17:33:11 +03:00
Pavel Emelyanov	82ef2a7730	file_lock: Work with fs::path, not sstring The main.cc code that converts sstring to fs::path will be patched soon, the file_desc::open belongs to seastar and works on sstrings. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-12-12 17:32:10 +03:00
Calle Wilund	a21e140169	cdc: Add sharded service that does nothing. But can be used to hang functionality into eventually.	2019-12-09 12:12:09 +00:00
fastio	039b83ad3b	Redis API: Rename options related to Redis API, describe them clearly, and remove unnecessary one. Rename option redis_transport_port to redis_port, which the redis transport listens on for clients. Rename option redis_transport_port_ssl to redis_ssl_port, which the redis TLS transport listens on for clients. Rename option redis_database_count. Set the redis dabase count. Rename option redis_keyspace_opitons to redis_keyspace_replication_strategy_options. Set the replication strategy for redis keyspace. Remove option enable_redis_protocol, which is unnecessary. Fixes: #5335 Signed-off-by: Peng Jian <pengjian.uestc@gmail.com>	2019-12-03 17:13:35 +08:00
Pavel Emelyanov	50a1ededde	main: Remove now unused defer-with-log helper Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	a0f92d40ee	main: Shut down sighup handler with verbose helper And (!) fix the misprinted variable name. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	2d64fc3a3e	main: Shut down database with verbose_shutdown helper Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	636c300db5	main: Shut down prometheus with verbose_shutdown() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> --- v2: - Have stop easrlier so that exception in start/listen do not prevent prometheu.stop from calling	2019-11-25 18:47:03 +03:00
Pavel Emelyanov	804b152527	main: Sanitize shutting down callbacks As suggested in issue #4586 here is the helper that prints "shutting down foo" message, then shuts the foo down, then prints the "shutting down foo was successfull". In between it catches the exception (if any) and warns this in logs. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-25 18:45:49 +03:00
Pavel Emelyanov	f6ac969f1e	mm: Stop migration manager Before stopping the db itself, stop the migration service. It must be stopped before RPC, but RPC is not stopped yet itself, so we should be safe here. Here's the tail of the resulting logs: INFO 2019-11-20 11:22:35,193 [shard 0] init - shutdown migration manager INFO 2019-11-20 11:22:35,193 [shard 0] migration_manager - stopping migration service INFO 2019-11-20 11:22:35,193 [shard 1] migration_manager - stopping migration service INFO 2019-11-20 11:22:35,193 [shard 0] init - Shutdown database started INFO 2019-11-20 11:22:35,193 [shard 0] init - Shutdown database finished INFO 2019-11-20 11:22:35,193 [shard 0] init - stopping prometheus API server INFO 2019-11-20 11:22:35,193 [shard 0] init - Scylla version 666.development-0.20191120.25820980f shutdown complete. Also -- stop the mm on drain before the commitlog it stopped. [Tomasz: mm needs the cl because pulling schema changes from other nodes involves applying them into the database. So cl/db needs to be stopped after mm is stopped.] The drain logs would look like ... INFO 2019-11-25 11:00:40,562 [shard 0] migration_manager - stopping migration service INFO 2019-11-25 11:00:40,562 [shard 1] migration_manager - stopping migration service INFO 2019-11-25 11:00:40,563 [shard 0] storage_service - DRAINED: and then on stop ... INFO 2019-11-25 11:00:46,427 [shard 0] init - shutdown migration manager INFO 2019-11-25 11:00:46,427 [shard 0] init - Shutdown database started INFO 2019-11-25 11:00:46,427 [shard 0] init - Shutdown database finished INFO 2019-11-25 11:00:46,427 [shard 0] init - stopping prometheus API server INFO 2019-11-25 11:00:46,427 [shard 0] init - Scylla version 666.development-0.20191125.3eab6cd54 shutdown complete. Fixes #5300 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191125080605.7661-1-xemul@scylladb.com>	2019-11-25 12:59:01 +01:00
Pavel Emelyanov	e0f40ed16a	cli: Add the --workdir\|-W option When starting scylla daemon as non-root the initialization fails because standard /var/lib/scylla is not accessible by regular users. Making the default dir accessible for user is not very convenient either, as it will cause conflicts if two or more instances of scylla are in use. This problem can be resolved by specifying --commitlog-directory, --data-file-directories, etc on start, but it's too much typing. I propose to revive Nadav's --home option that allows to move all the directories under the same prefix in one go. Unlike Nadav's approach the --workdir option doesn't do any tricky manipulations with existing directories. Insead, as Pekka suggested, the individual directories are placed under the workir if and only if the respective option is NOT provided. Otherwise the directory configuration is taken as is regardless of whether its absolute or relative path. The values substutution is done early on start. Avi suggested that this is unsafe wrt HUP config re-read and proper paths must be resolved on the fly, but this patch doesn't address that yet, here's why. First of all, the respective options are MustRestart now and the substitution is done before HUP handler is installed. Next, commitlog and data_file values are copied on start, so marking the options as LiveUpdate won't make any effect. Finally, the existing named_value::operator() returns a reference, so returning a calculated (and thus temporary) value is not possible (from my current understanding, correct me if I'm wrong). Thus if we want the _directory() to return calculated value all callers of them must be patched to call something different (e.g. _directory.get() ?) which will lead to more confusion and errors. Changes v3: - the option is --workdir back again - the existing *directory are only affected if unset - default config doesn't have any of these set - added the short -W alias Changes v2: - the option is --home now - all other paths are changed to be relative Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Message-Id: <20191119130059.18066-1-xemul@scylladb.com>	2019-11-21 15:07:39 +02:00
Peng Jian	f2801feb66	Redis API: graft redis module to Scylla In this document, the detailed design and implementation of Redis API in Scylla is provided. v2: build: work around ragel 7 generated code bug (suggested by Avi) Ragel 7 incorrectly emits some unused variables that don't compile. As a workaround, sed them away. Signed-off-by: Peng Jian <pengjian.uestc@gmail.com> Signed-off-by: Amos Kong <amos@scylladb.com>	2019-11-20 04:55:58 +08:00
Tomasz Grabiec	5e4abd75cc	main: Abort on EBADF and ENOTSOCK by default Those are typically symptoms of use-after-free or memory corruption in the program. It's better to catch such error sooner than later. That situation is also dangerous since if a valid descriptor would land under the invalid access, not the one which was intended for the operation, then the operation may be performed on the wrong file and result in corruption. Message-Id: <1565206788-31254-1-git-send-email-tgrabiec@scylladb.com>	2019-11-19 13:07:33 +02:00
Pavel Emelyanov	1dc490c81c	tracing: Move register_tracing_keyspace_backend forward decl into proper header Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	7e81df71ba	main: Shorten developer_mode() evaluation Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	1bd68d87fc	main: Do not carry pctx all over the code v2: - do not use struct initialization extention Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	655b6d0d1e	main: Hide start_thrift Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	26f2b2ce5e	main,db: Kill some unused .hh includes Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	f5b345604f	main: Factor out get_conf_sub Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Pavel Emelyanov	924d52573d	main: Remove unused return_value variable (and capture) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2019-11-14 19:59:03 +03:00
Avi Kivity	2970578677	config: add configuration option for 3.1.0 heritage clusters Scylla 3.1.0 broke the serialization format for TTLs. Later versions corrected it, but if a cluster was originally installed as 3.1.0, it will use the broken serialization forever. This configuration option allows upgrades from 3.1.0 to succeed, by enabling the broken format even for later versions.	2019-10-23 18:36:35 +03:00
Avi Kivity	f12feec2c9	Update seastar submodule * seastar 1f68be436f...e888b1df9c (8): > sharded: Make map work with mapper that returns a future > cmake: Remove FindBoost.cmake > Reduce noncopyable_function instruction cache footprint > doc: add Loops section to the tutorial > Merge "Move file related code out of reactor" from Asias > Merge "Move the io_queue code out of reactor" from Asias > cmake: expose seastar_perf_testing lib > future: class doc: explain why discarding a future is bad - main.cc now includes new file io_queue.hh - perf tests now include seastar perf utilities via user, not system, includes since those are not exported	2019-10-10 18:17:28 +03:00
Piotr Sarna	97cbb9a2c7	alternator: add verifying the auth signature The signature sent in the "Authorization:" header is now verified by computing the signature server-side with a matching secret key and confirming that the signatures match. Currently the secret key is hardcoded to be "whatever" in order to work with current tests, but it should be replaced by a proper key store. Refs #5046	2019-10-10 13:51:00 +02:00
Piotr Sarna	e1b0537149	alternator: add HTTPS support By providing a server based on a TLS socket, it's now possible to serve HTTPS requests in alternator. The HTTPS server is enabled by setting its port in scylla.yaml: alternator_tls_port=XXXX. Alternator TLS relies on the existing TLS configuration, which is provided by certificate, keyfile, truststore, priority_string options. Fixes #5042	2019-10-03 19:10:30 +02:00
Piotr Sarna	f912122072	main: log unexpected errors thrown on shutdown (#4993 ) Shutdown routines are usually implemented via the deferred_action mechanism, which runs a function in its destructor. We thus expect the function to be noexcept, but unfortunately it's not always the case. Throwing in the destructor results in terminating the program anyway, but before we do that, the exception can be logged so it's easier to investigate and pinpoint the issue. Example output before the patch: INFO 2019-09-10 12:49:05,858 [shard 0] view - Stopping view builder terminate called without an active exception Aborting on shard 0. Backtrace: 0x000000000184a9ad (...) Example output after the patch: INFO 2019-09-10 12:49:05,858 [shard 0] view - Stopping view builder ERROR 2019-09-10 12:49:05,858 [shard 0] init - Unexpected error on shutdown: std::runtime_error (Hello there!) terminate called without an active exception Aborting on shard 0. Backtrace: 0x000000000184a9ad (...)	2019-09-16 09:42:55 +03:00
Nadav Har'El	b2bd3bbc1f	alternator: add "--alternator-address" configuration parameter So far we had the "--alternator-port" option allowing to configure the port on which the Alternator server listens on, but the server always listened to any address. It is important to also be able to configure the listen address - it is useful in tests running several instances of Scylla on the same machine, and useful in multi-homed machines with several interfaces. So this patch adds the "--alternator-address" option, defaulting to 0.0.0.0 (to listen on all interfaces). It works like the many other "--*-address" options that Scylla already has. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190808204641.28648-1-nyh@scylladb.com>	2019-09-11 18:01:05 +03:00
Nadav Har'El	c0518183c2	alternator: require alternator-port configuration Until now, we always opened the Alternator port along with Scylla's regular ports (CQL etc.). This should really be made optional. With this patch, by default Alternator does NOT start and does not open a port. Run Scylla with --alternator-port=8000 to open an Alternator API port on port 8000, as was the default until now. It's also possible to set this in scylla.yaml. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 12:38:31 +03:00
Piotr Sarna	2ec78164bc	alternator: add minimal HTTP interface The interface works on port 8000 by default and provides the most basic alternator operations - it's an incomplete set without validation, meant to allow testing as early as possible.	2019-09-11 12:34:18 +03:00
Avi Kivity	9579946e72	main: extend CPU feature check to verify that PCLMUL is available Since `79136e895f`, we use the pclmul instruction set, so check it is there.	2019-08-29 15:13:32 +03:00
Botond Dénes	136fc856c5	treewide: silence discarded future warnings for questionable discards This patches silences the remaining discarded future warnings, those where it cannot be determined with reasonable confidence that this was indeed the actual intent of the author, or that the discarding of the future could lead to problems. For all those places a FIXME is added, with the intent that these will be soon followed-up with an actual fix. I deliberately haven't fixed any of these, even if the fix seems trivial. It is too easy to overlook a bad fix mixed in with so many mechanical changes.	2019-08-26 19:28:43 +03:00
Avi Kivity	67b0d379e0	main: add glue between db::config and cql3::cql_config Copy values between the flat db::config and the hierarchical cql_config, adding observers to keep the values updated.	2019-08-21 19:35:59 +02:00

1 2 3 4 5 ...

396 Commits