scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-19 16:15:07 +00:00

Author	SHA1	Message	Date
Botond Dénes	dc23736d0c	db/config: replace ad-hoc aliases with alias mechanism We already uses aliases for some configuration items, although these are created with an ad-hoc mechanism that only registers them on the command line. Replace this with the built-in alias mechanism in the previous patch, which has the benefit of conflict resolution and also working with YAML.	2020-07-28 18:00:29 +03:00
Botond Dénes	003f5e9e54	utils: config: add alias support Allow configuration items to also have an alias, besides the name. This allows easy replacement of configuration items, with newer names, while still supporting the old name for backward compatibility. The alias mechanism takes care of registering both the name and the alias as command line arguments, as well as parsing them from YAML. The command line documentation of the alias will just refer to the name for documentation.	2020-07-28 17:59:51 +03:00
Nadav Har'El	a7df8486b1	alternator test: add test for tracing In commit `8d27e1b`, we added tracing (see docs/tracing.md) support to Alternator requests. However, we never had a functional test that verifies this feature actually works as expected, and we recently noticed that for the GetItem and BatchGetItem requestd, the trace doesn't really work (it returns an empty list of events). So this patch adds a test, test/alternator/test_tracing.py, which verifies that the tracing feature works for the PutItem, GetItem, DeleteItem, UpdateItem, BatchGetItem, BatchWriteItem, Query and Scan operations. This test is very peculiar. It needs to use out-of-band REST API requests to enable and disable tracing (of course, the test is skipped when running against AWS - this is a Scylla-only feature). It also needs to read CQL-only system tables and does this using Alternator's ".scylla.alternator" interface for system tables - which came through for us here beautifully and demonstrated their usefulness. I paid a lot of attention for this test to remain reasonably fast - this entire test now runs in a little less than one second. Achieving this while testing eight different requests was a bit of a challenge, because traces take time until they are visible in the trace table. This is the main reason why in this patch the test for all eight request types are done in one test, instead of eight separate tests. Fixes #6891 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200727115401.1199024-1-nyh@scylladb.com>	2020-07-27 14:31:45 +02:00
Takuya ASADA	97fa17b17b	scylla_setup: remove square bracket from disk prompt selected list Selected list on disk prompt is looks like an alternatives, it's better to use single quote. Fixes #6760	2020-07-27 14:50:31 +03:00
Avi Kivity	3f84d41880	Merge "messaging: make verb handler registering independent of current scheduling group" from Botond " `0c6bbc8` refactored `get_rpc_client_idx()` to select different clients for statement verbs depending on the current scheduling group. The goal was to allow statement verbs to be sent on different connections depending on the current scheduling group. The new connections use per-connection isolation. For backward compatibility the already existing connections fall-back to per-handler isolation used previously. The old statement connection, called the default statement connection, also used this. `get_rpc_client_idx()` was changed to select the default statement connection when the current scheduling group is the statement group, and a non-default connection otherwise. This inadvertently broke `scheduling_group_for_verb()` which also used this method to get the scheduling group to be used to isolate a verb at handle register time. This method needs the default client idx for each verb, but if verb registering is run under the system group it instead got the non-default one, resulting in the per-handler isolation not being set-up for the default statement connection, resulting in default statement verb handlers running in whatever scheduling group the process loop of the rpc is running in, which is the system scheduling group. This caused all sorts of problems, even beyond user queries running in the system group. Also as of `0c6bbc8` queries on the replicas are classified based on the scheduling group they are running on, so user reads also ended up using the system concurrency semaphore. In particular this caused severe problems with ranges scans, which in some cases ended up using different semaphores per page resulting in a crash. This could happen because when the page was read locally the code would run in the statement scheduling group, but when the request arrived from a remote coordinator via rpc, it was read in a system scheduling group. This caused a mismatch between the semaphore the saved reader was created with and the one the new page was read with. The result was that in some cases when looking up a paused reader from the wrong semaphore, a reader belonging to another read was returned, creating a disconnect between the lifecycle between readers and that of the slice and range they were referencing. This series fixes the underlying problem of the scheduling group influencing the verb handler registration, as well as adding some additional defenses if this semaphore mismatch ever happens in the future. Inactive read handles are now unique across all semaphores, meaning that it is not possible anymore that a handle succeeds in looking up a reader when used with the wrong semaphore. The range scan algorithm now also makes sure there is no semaphore mismatch between the one used for the current page and that of the saved reader from the previous page. I manually checked that each individual defense added is already preventing the crash from happening. Fixes: #6613 Fixes: #6907 Fixes: #6908 Tests: unit(dev), manual(run the crash reproducer, observe no crash) " * 'query-classification-regressions/v1' of https://github.com/denesb/scylla: multishard_mutation_query: use cached semaphore messaging: make verb handler registering independent of current scheduling group multishard_mutation_query: validate the semaphore of the looked-up reader reader_concurrency_semaphore: make inactive read handles unique across semaphores reader_concurrency_semaphore: add name() accessor reader_concurrency_semaphore: allow passing name to no-limit constructor	2020-07-27 13:56:52 +03:00
Nadav Har'El	9080709c56	docs: add paragraph to tracing.md Issue #6919 was caused by an incorrect assumption: I assumed that we see the tracing session record, we can be sure that the event records for this session had already been written. In this patch we add a paragraph to the tracing documentation - docs/tracing.md, which explains that this assumption is in fact incorrect: 1. On a multi-node setup, replicas may continue to write tracing events after the coordinator "finished" (moved to background) the request and wrote the session record. 2. Even on a single-node setup, the writes of the session record and the individual events are asynchronous, and can happen in an unexpected order (which is what happened in issue #6919). Refs #6919. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200727102438.1194314-1-nyh@scylladb.com>	2020-07-27 13:38:57 +03:00
Takuya ASADA	0ffa0e8745	dist_util.py: use correct ID value to detect Amazon Linux 2 On `2d63acdd6a` we replaced 'ol' and 'amzn' to 'oracle' and 'amazon', but distro.id() actually returns 'amzn' for Amazon Linux 2, so we need to revert the change. Fixes #6882	2020-07-27 12:46:21 +03:00
Botond Dénes	eeeef0a0f1	multishard_mutation_query: use cached semaphore Instead of requesting the query class config from the database every time the semaphore is needed, use the cached one by calling `semaphore()`.	2020-07-27 12:17:22 +03:00
Nadav Har'El	65f75e3862	alternator test: enable test_get_records After issue #6864 was fixed, the test_streams.py::test_get_records test no longer fails, so its "xfail" marker can be removed. Refs #6864. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200722132518.1077882-1-nyh@scylladb.com>	2020-07-27 09:19:37 +02:00
Nadav Har'El	f488eaebaf	merge: db/view: view_update_generator: make staging reader evictable Merged patch set by Botond Dénes: The view update generation process creates two readers. One is used to read the staging sstables, the data which needs view updates to be generated for, and another reader for each processed mutation, which reads the current value (pre-image) of each row in said mutation. The staging reader is created first and is kept alive until all staging data is processed. The pre-image reader is created separately for each processed mutation. The staging reader is not restricted, meaning it does not wait for admission on the relevant reader concurrency semaphore, but it does register its resource usage on it. The pre-image reader however is restricted. This creates a situation, where the staging reader possibly consumes all resources from the semaphore, leaving none for the later created pre-image reader, which will not be able to start reading. This will block the view building process meaning that the staging reader will not be destroyed, causing a deadlock. This patch solves this by making the staging reader restricted and making it evictable. To prevent thrashing -- evicting the staging reader after reading only a really small partition -- we only make the staging reader evictable after we have read at least 1MB worth of data from it. test/boost: view_build_test: add test_view_update_generator_buffering test/boost: view_build_test: add test test_view_update_generator_deadlock reader_permit: reader_resources: add operator- and operator+ reader_concurrency_semaphore: add initial_resources() test: cql_test_env: allow overriding database_config mutation_reader: expose new_reader_base_cost db/view: view_updating_consumer: allow passing custom update pusher db/view: view_update_generator: make staging reader evictable db/view: view_updating_consumer: move implementation from table.cc to view.cc database: add make_restricted_range_sstable_reader() Signed-off-by: Botond Dénes <bdenes@scylladb.com> --- db/view/view_updating_consumer.hh \| 51 ++++++++++++++++++++++++++++--- db/view/view.cc \| 39 +++++++++++++++++------ db/view/view_update_generator.cc \| 19 +++++++++--- 3 files changed, 91 insertions(+), 18 deletions(-)	2020-07-27 09:19:37 +02:00
Botond Dénes	fe127a2155	sstables: clamp estimated_partitions to [1, +inf) in writers In some cases estimated number of partitions can be 0, which is albeit a legit estimation result, breaks many low-level sstable writer code, so some of these have assertions to ensure estimated partitions is > 0. To avoid hitting this assert all users of the sstable writers do the clamping, to ensure estimated partitions is at least 1. However leaving this to the callers is error prone as #6913 has shown it. As this clamping is standard practice, it is better to do it in the writers themselves, avoiding this problem altogether. This is exactly what this patch does. It also adds two unit tests, one that reproduces the crash in #6913, and another one that ensures all sstable writers are fine with estimated partitions being 0 now. Call sites previously doing the clamping are changed to not do it, it is unnecessary now as the writer does it itself. Fixes #6913 Tests: unit(dev) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <20200724120227.267184-1-bdenes@scylladb.com>	2020-07-27 09:19:37 +02:00
Avi Kivity	91619d77a1	Merge "Simplify the lifetime management of write monitors" from Raphael " This makes sure that monitors are always owned by the same struct that owns the monitored writer, simplifying the lifetime management. This hopefully fixes some of the crashes we have observed around this area. " * 'espindola/use-compaction_writer-v6' of https://github.com/espindola/scylla: sstables: Rename _writer to _compaction_writer sstables: Move compaction_write_monitor to compaction_writer sstables: Add couple of writer() getters to garbage_collected_sstable_writer sstables: Move compaction_write_monitor earlier in the file	2020-07-27 09:19:37 +02:00
Dejan Mircevski	c11b2de84c	cql3: Fix tombstone-range check for TRUE A DELETE statement checks that the deletion range is symmetrically bounded. This check was broken for expression TRUE. Test the fix by setting initial_key_restrictions::expression to TRUE, since CQL doesn't currently allow WHERE TRUE. That change has been proposed anyway in feedback to #5763: https://github.com/scylladb/scylla/pull/5763#discussion_r443213343 Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-07-27 09:19:37 +02:00
Dejan Mircevski	ba74659f5a	cql/restrictions: Constrain to_sorted_vector As requested in #5763 feedback, enforce the function's assumptions with concept asserts. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-07-27 09:19:37 +02:00
Botond Dénes	0df4c2fd3b	messaging: make verb handler registering independent of current scheduling group `0c6bbc8` refactored `get_rpc_client_idx()` to select different clients for statement verbs depending on the current scheduling group. The goal was to allow statement verbs to be sent on different connections depending on the current scheduling group. The new connections use per-connection isolation. For backward compatibility the already existing connections fall-back to per-handler isolation used previously. The old statement connection, called the default statement connection, also used this. `get_rpc_client_idx()` was changed to select the default statement connection when the current scheduling group is the statement group, and a non-default connection otherwise. This inadvertently broke `scheduling_group_for_verb()` which also used this method to get the scheduling group to be used to isolate a verb at handle register time. This method needs the default client idx for each verb, but if verb registering is run under the system group it instead got the non-default one, resulting in the per-handler isolation not being set-up for the default statement connection, resulting in default statement verb handlers running in whatever scheduling group the process loop of the rpc is running in, which is the system scheduling group. This caused all sorts of problems, even beyond user queries running in the system group. Also as of `0c6bbc8` queries on the replicas are classified based on the scheduling group they are running on, so user reads also ended up using the system concurrency semaphore.	2020-07-27 10:11:21 +03:00
Piotr Sarna	d08e22c4eb	alternator: fix tracing BatchGetItem The BatchGetItem request did not pass its trace state to lower layers in a correct manner, which resulted in losing tracing information. Refs #6891 Message-Id: <078f58a0f76b9f182f671a8d16e147ded489138c.1595515815.git.sarna@scylladb.com>	2020-07-23 20:05:10 +03:00
Piotr Sarna	7256572e41	alternator: fix tracing GetItem The GetItem request did not pass the trace state properly, which resulted in having almost empty traces. Refs #6891 Tests: manual: Before: session_id \| event_id \| activity \| scylla_parent_id \| scylla_span_id \| source \| source_elapsed \| thread --------------------------------------+--------------------------------------+------------------------------------------------------------------------------------------------------------------------+------------------+-----------------+-----------+----------------+--------- 57995da0-cce4-11ea-97ea-000000000000 \| 579971c4-cce4-11ea-97ea-000000000000 \| GetItem \| 0 \| 131309406144163 \| 127.0.0.1 \| 0 \| shard 0 After: session_id \| event_id \| activity \| scylla_parent_id \| scylla_span_id \| source \| source_elapsed \| thread --------------------------------------+--------------------------------------+------------------------------------------------------------------------------------------------------------------------+------------------+-----------------+-----------+----------------+--------- 57995da0-cce4-11ea-97ea-000000000000 \| 579971c4-cce4-11ea-97ea-000000000000 \| GetItem \| 0 \| 131309406144163 \| 127.0.0.1 \| 0 \| shard 0 57995da0-cce4-11ea-97ea-000000000000 \| 57997327-cce4-11ea-97ea-000000000000 \| Creating read executor for token -7535857341981351089 with all: {127.0.0.1} targets: {127.0.0.1} repair decision: NONE \| 0 \| 131309406144163 \| 127.0.0.1 \| 35 \| shard 0 57995da0-cce4-11ea-97ea-000000000000 \| 5799733d-cce4-11ea-97ea-000000000000 \| read_data: querying locally \| 0 \| 131309406144163 \| 127.0.0.1 \| 38 \| shard 0 57995da0-cce4-11ea-97ea-000000000000 \| 57997358-cce4-11ea-97ea-000000000000 \| Start querying the token range that starts with -7535857341981351089 \| 0 \| 131309406144163 \| 127.0.0.1 \| 40 \| shard 0 57995da0-cce4-11ea-97ea-000000000000 \| 57997579-cce4-11ea-97ea-000000000000 \| Querying is done \| 0 \| 131309406144163 \| 127.0.0.1 \| 95 \| shard 0 Message-Id: <d585ff7aaaeebf2050890643d40cdafb2efb8d98.1595509338.git.sarna@scylladb.com>	2020-07-23 20:05:06 +03:00
Avi Kivity	39db54a758	Merge "Use seastar::with_file_close_on_failure in commitlog" from Benny " `close_on_failure` was committed to seastar so use the library version. This requires making the lambda function passed to it nothrow move constructible, so this series also makes db::commitlog::descriptor move constructor noexcept and changes allocate_segment_ex and segment::segment to get a descriptor by value rather than by reference. Test: unit(dev), commitlog_test(debug) " * tag 'commit-log-use-with_file_close_on_failure-v1' of github.com:bhalevy/scylla: commitlog: use seastar::with_file_close_on_failure commitlog: descriptor: make nothrow move constructible commitlog: allocate_segment_ex, segment: pass descriptor by value commitlog: allocate_segment_ex: filename capture is unused	2020-07-23 19:23:23 +03:00
Rafael Ávila de Espíndola	bca4eb8b8c	Build: Garbage collect dead sections In another patch I noticed gcc producing dead functions. I am not sure why gcc is doing that. Some of those functions are already placed in independent sections, and so can be garbage collected by the linker. This is a 1% text section reduction in scylla, from 39363380 to 38974324 bytes. There is no difference in the tps reported by perf_simple_query. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200723152511.8214-1-espindola@scylladb.com>	2020-07-23 18:57:01 +03:00
Piotr Sarna	6cdc9f1a43	Merge 'alternator: refactor api_error class' from Nadav In the patch "Add exception overloads for Dynamo types", Alternator's single api_error exception type was replaced by a more complex hierarchy of types. The implementation was not only longer and more complex to understand - I believe it also negated an important observation: The "api_error" exception type is special. It is not an exception created by code for other code. It is not meant to be caught in Alternator code. Instead, it is supposed to contain an error message created for the user, containing one of the few supported exception exception "names" described in the DynamoDB documentation, and a user-readable text message. Throwing such an exception in Alternator code means the thrower wants the request to abort immediately, and this message to reach the user. These exceptions are not designed to be caught in Alternator code. Code should use other exceptions - or alternatives to exceptions (e.g., std::optional) for problems that should be handled before returning a different error to the user. Moreover, "api_error" isn't just thrown as an exception - it can also be returned-by-value in a executor::request_return_type) - which is another reason why it should not be subclassed. For these reasons, I believe we should have a single api_error type, and it's wrong to subclass it. So in this patch I am reverting the subclasses and template added in the aforementioned patch. Still, one correct observation made in that patch was that it is inconvenient to type in DynamoDB exception names (no help from the editor in completing those strings) and also error-prone. In this patch we propse a different - simpler - solution to the same problem: We add trivial factory functions, e.g., api_error::validation(std::string) as a shortcut to api_error("ValidationException"). The new implementation is easy to understand, and also more self explanatory to readers: It is now clear that "api_error::validation()" is actually a user-visible "api_error", something which was obscured by the name validation_exception() used before this patch. Finally, this patch also improves the comment in error.hh explaining the purpose of api_error and the fact it can be returned or thrown. The fact it should not be subclassed is legislated with a "finally". There is also no point of this class inheriting from std::exception or having virtual functions, or an empty constructor - so all these are dropped as well. Signed-off-by: Nadav Har'El <nyh@scylladb.com> * 'api-error-refactor' of https://github.com/nyh/scylla: alternator: use api_error factory functions in auth.cc alternator: use api_error::validation() alternator: use api_error factory functions in executor.cc alternator: use api_error factory functions in server.cc alternator: refactor api_error class	2020-07-23 17:35:56 +02:00
Piotr Sarna	e7c18963e4	test: check sizes before dereferencing the vector It's better to assert a certain vector size first and only then dereference its elements - otherwise, if a bug causes the size to be different, the test can crash with a segfault on an invalid dereference instead of graciously failing with a test assertion.	2020-07-23 16:49:35 +03:00
Piotr Sarna	6b04034566	cql3: fix multi column restriction bounds Generating bounds from multi-column restrictions used to create incorrect nonwrapping intervals, which only happened to work because they're implemented as wrapping intervals underneath. The following CQL restriction: WHERE (a, b) >= (1, 0) should translate to (a, b) >= (1, 0), no upper bound, while it incorrectly translates to (a, b) >= (1, 0) AND (a, b) < empty-prefix. Since empty prefix is smaller than any other clustering key, this range was in fact not correct, since the assumption was that starting bound was never greater than the ending bound. While the bug does not trigger any errors in tests right now, it starts to do so after the code is modified in order to correctly handle empty intervals (intervals with end > start).	2020-07-23 16:49:24 +03:00
Botond Dénes	b7cfa4ea97	multishard_mutation_query: validate the semaphore of the looked-up reader To make sure it belongs to the same semaphore that the database thinks is appropriate for the current query. Since a semaphore mismatch points to a serious bug, we use `on_internal_error()` to allow generating coredumps on-demand.	2020-07-23 16:43:37 +03:00
Botond Dénes	11105cbb78	reader_concurrency_semaphore: make inactive read handles unique across semaphores Currently inactive read handles are only unique within the same semaphore, allowing for an unregister against another semaphore to potentially succeed. This can lead to disasters ranging from crashes to data corruption. While a handle should never be used with another semaphore in the first place, we have recently seen a bug (#6613) causing exactly that, so in this patch we prevent such unregister operations from ever succeeding by making handles unique across all semaphores. This is achieved by adding a pointer to the semaphore to the handle.	2020-07-23 16:43:33 +03:00
Botond Dénes	d12540bfbf	reader_concurrency_semaphore: add name() accessor Allows identifying the semaphore in question in semaphore related error messages.	2020-07-23 16:42:54 +03:00
Botond Dénes	88129f500f	reader_concurrency_semaphore: allow passing name to no-limit constructor So tests can provide names for semaphores as well, making test output more clear.	2020-07-23 16:42:36 +03:00
Nadav Har'El	b661c1eae2	alternator: use api_error factory functions in auth.cc All the places in auth.cc where we constructed an api_error with inline strings now use api_error factory functions. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-23 15:36:39 +03:00
Nadav Har'El	bca88521ba	alternator: use api_error::validation() All the places in conditions.cc, expressions.cc and serialization.cc where we constructed an api_error, we always used the ValidationException type string, which the code repeated dozens of times. This patch converts all these places to use the factory function api_error::validation(). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-23 15:36:39 +03:00
Nadav Har'El	06ba0c0232	alternator: use api_error factory functions in executor.cc All the places in executor.cc where we constructed an api_error with inline strings now use api_error factory functions. Most of them, but not all of them, were api_error::validation(). We also needed to add a couple more of these factory functions. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-23 15:36:39 +03:00
Nadav Har'El	81589be00a	alternator: use api_error factory functions in server.cc All the places in server.cc where we constructed an api_error with inline strings now use api_error factory functions - we needed to add a few more. Interestingly, we had a wrong type string for "Internal Server Error", which we fix in this patch. We wrote the type string like that - with spaces - because this is how it was listed in the DynamoDB documentation at https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Programming.Errors.html But this was in fact wrong, and it should be without spaces: "InternalServerError". The botocore library (for example) recognizes it this way, and this string can also be seen in other online DynamoDB examples. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-23 15:36:39 +03:00
Nadav Har'El	5a35632cd3	alternator: refactor api_error class In the patch "Add exception overloads for Dynamo types", Alternator's single api_error exception type was replaced by a more complex hierarchy of types. The implementation was not only longer and more complex to understand - I believe it also negated an important observation: The "api_error" exception type is special. It is not an exception created by code for other code. It is not meant to be caught in Alternator code. Instead, it is supposed to contain an error message created for the user, containing one of the few supported exception exception "names" described in the DynamoDB documentation, and a user-readable text message. Throwing such an exception in Alternator code means the thrower wants the request to abort immediately, and this message to reach the user. These exceptions are not designed to be caught in Alternator code. Code should use other exceptions - or alternatives to exceptions (e.g., std::optional) for problems that should be handled before returning a different error to the user. Moreover, "api_error" isn't just thrown as an exception - it can also be returned-by-value in a executor::request_return_type) - which is another reason why it should not be subclassed. For these reasons, I believe we should have a single api_error type, and it's wrong to subclass it. So in this patch I am reverting the subclasses and template added in the aforementioned patch. Still, one correct observation made in that patch was that it is inconvenient to type in DynamoDB exception names (no help from the editor in completing those strings) and also error-prone. In this patch we propse a different - simpler - solution to the same problem: We add trivial factory functions, e.g., api_error::validation(std::string) as a shortcut to api_error("ValidationException"). The new implementation is easy to understand, and also more self explanatory to readers: It is now clear that "api_error::validation()" is actually a user-visible "api_error", something which was obscured by the name validation_exception() used before this patch. Finally, this patch also improves the comment in error.hh explaining the purpose of api_error and the fact it can be returned or thrown. The fact it should not be subclassed is legislated with a "finally". There is also no point of this class inheriting from std::exception or having virtual functions, or an empty constructor - so all these are dropped as well. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-23 15:36:39 +03:00
Avi Kivity	01b838e291	Merge "Unregister RPC verbs on stop" from Pavel E " There are 5 services, that register their RPC handlers in messaging service, but quite a few of them unregister them on stop. Unregistering is somewhat critical, not just because it makes the code look clean, but also because unregistration does wait for the message processing to complete, thus avoiding use-after-free's in the handlers. In particular, several handlers call service::get_schema_for_write() which, in turn, may end up in service::maybe_sync() calling for the local migration manager instance. All those handlers' processing must be waited for before stopping the migration manager. The set brings the RPC handlers unregistration in sync with the registration part. tests: unit (dev) dtest (dev: simple_boot_shutdown, repair) start-stop by hands (dev) fixes: #6904 " * 'br-rpc-unregister-verbs' of https://github.com/xemul/scylla: main: Add missing calls to unregister RPC hanlers messaging: Add missing per-service unregistering methods messaging: Add missing handlers unregistration helpers streaming: Do not use db->invoke_on_all in vain storage_proxy: Detach rpc unregistration from stop main: Shorten call to storage_proxy::init_messaging_service	2020-07-23 12:03:49 +03:00
Avi Kivity	b4b9deadf3	build: install jmx and tools-java submodule dependencies Let each submodule be responsible for its own dependencies, and call the submodule's dependency installation script. Reviewed-by: Piotr Jastrzebski <piotr@scylladb.com> Reviewed-by: Takuya ASADA <syuu@scylladb.com>	2020-07-22 20:13:50 +03:00
Avi Kivity	7fbe50a4e4	build: remove pystache from install-dependencies As of `d6165bc1c3` we do not depend on pystache, so don't install it. Reviewed-by: Takuya ASADA <syuu@scylladb.com>	2020-07-22 20:12:31 +03:00
Avi Kivity	19da4a5b8f	build: don't package tools/java and tools/jmx in relocatable pacakge tools/java and tools/jmx have their own relocatable packages (and rpm/deb), so they should not be part of the main relocatable package. Enforce this by enabling the filter parameter in reloc_add, and passing a filter that excludes tools/java and tools/jmx.	2020-07-22 20:03:18 +03:00
Avi Kivity	98a22e572a	dist: redhat: reduce log spam from unpacking sources when building rpm rpmbuild defaults to logging the name of every file it unpacks from the archive. Make it quiet with the %setup -q flag.	2020-07-22 20:02:04 +03:00
Rafael Ávila de Espíndola	87b261ab32	sstables: Rename _writer to _compaction_writer Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-22 08:15:55 -07:00
Rafael Ávila de Espíndola	97b7fee78e	sstables: Move compaction_write_monitor to compaction_writer There is one monitor per writer, so we new keep them together in the compaction_writer struct. This trivially guarantees that the monitor is always destroyed before the writer. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-22 08:15:53 -07:00
Rafael Ávila de Espíndola	f8cc582e4a	sstables: Add couple of writer() getters to garbage_collected_sstable_writer This just reduces the noise of an upcoming patch. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-22 07:46:05 -07:00
Rafael Ávila de Espíndola	c740c66840	sstables: Move compaction_write_monitor earlier in the file This will used by followup patches. Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com>	2020-07-22 07:46:05 -07:00
Pavel Emelyanov	50d07696e4	main: Add missing calls to unregister RPC hanlers The gossiper's and migration_manager's unregistration is done on the services' stopm, for the rest we need to call the recently introduced methods. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:35:07 +03:00
Pavel Emelyanov	5060063cd6	messaging: Add missing per-service unregistering methods 5 services register handlers in messaging, but not all of them have clear unregistration methods. Summary: migration_manager: everything is in place, no changes gossiper: ditto proxy: some verbs unregistration is missing repair: no unregistration at all streaming: ditto This patch adds the needed unregistration methods. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:34:00 +03:00
Pavel Emelyanov	7a7b1b3108	messaging: Add missing handlers unregistration helpers Handlers for each verb have both -- register and unregister helpers, but unregistration ones for some verbs are missing, so here they are. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:31:57 +03:00
Pavel Emelyanov	08e36ca77c	streaming: Do not use db->invoke_on_all in vain The db instance is not needed to initialize messages, so use plain smp::invoke_on_all Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:31:57 +03:00
Pavel Emelyanov	f845a78d9a	storage_proxy: Detach rpc unregistration from stop The proxy's stop method is not called (and unlikely will be soon), but stopping the message handlers is needed now, so prepare the existing method for this.' Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:31:57 +03:00
Pavel Emelyanov	cc070ceca0	main: Shorten call to storage_proxy::init_messaging_service Just for brevity Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-07-22 16:31:57 +03:00
Kamil Braun	12e2891c60	cdc: if ring_delay == 0, don't add delay to newly created generation If ring_delay == 0, something fishy is going on, e.g. single-node tests are being performed. In this case we want the CDC generation to start operating immediately. There is no need to wait until it propagates to the cluster. You should not use ring_delay == 0 in production. Fixes https://github.com/scylladb/scylla/issues/6864.	2020-07-22 16:06:09 +03:00
Avi Kivity	5e1fa13d08	Merge 'docker: Make I/O configuration setup configurable' from Pekka " This adds a '--io-setup N' command line option, which users can pass to specify whether they want to run the "scylla_io_setup" script or not. This is useful if users want to specify I/O settings themselves in environments such as Kubernetes, where running "iotune" is problematic. While at it, add the same option to "scylla_setup" to keep the interface between that script and Docker consistent. Fixes #6587 " * penberg-penberg/docker-no-io-setup: scylla_setup: Add '--io-setup ENABLE' command line option dist/docker: Add '--io-setup ENABLE' command line option	2020-07-22 14:17:53 +03:00
Rafael Ávila de Espíndola	e83e91e352	alternator: Fix use after return Avoid a copy of timeout so that we don't end up with a reference to a stack allocated variable. Fixes #6897 Signed-off-by: Rafael Ávila de Espíndola <espindola@scylladb.com> Message-Id: <20200721184939.111665-1-espindola@scylladb.com>	2020-07-21 22:06:13 +03:00
Avi Kivity	098d24fd6d	Update seastar submodule * seastar 4a99d56453...02ad74fa7d (5): > TLS: Use "known" (precalculated) DH parameters if available > tutorial: fix advanced service_loop examples > tutorial: further fix service_loop example text > linux-aio: make the RWF_NOWAIT support work again > locking_test: Fix a use after return	2020-07-21 19:08:36 +03:00

1 2 3 4 5 ...

22917 Commits