scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-31 20:16:43 +00:00

Author	SHA1	Message	Date
Benny Halevy	243dc2efce	hints: host_filter: check topology::has_endpoint if enabled_selectively Don't call get_datacenter(ep) without checking first has_endpoint(ep) since the former may abort on internal error if the endpoint is not listed in topology. Refs #11870 Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #12054	2022-11-24 14:33:06 +03:00
Benny Halevy	996eac9569	topology: add get_datacenters Returns an unordered set of datacenter names to be used by network_topology_replication_strategy and for ks_prop_defs. The set is kept in sync with _dc_endpoints. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Closes #12023	2022-11-23 18:39:36 +02:00
Takuya ASADA	9acdd3af23	dist: drop deprecated AMI parameters on setup scripts Since we moved all IaaS code to scylla-machine-image, we no longer need AMI variable on sysconfig file or --ami parameter on setup scripts, and also never used /etc/scylla/ami_disabled. So let's drop all of them from Scylla core. Related with scylladb/scylla-machine-image#61 Closes #12043	2022-11-23 17:56:13 +02:00
Avi Kivity	7c66fdcad1	Merge 'Simplify sstable_directory configuration' from Pavel Emelyanov When started the sstable_directory is constructed with a bunch of booleans that control the way its process_sstable_dir method works. It's shorter and simpler to pass these booleans into method directly, all the more so there's another flag that's already passed like this. Closes #12005 * github.com:scylladb/scylladb: sstable_directory: Move all RAII booleans onto flags sstable_directory: Convert sort-sstables argument to flags struct sstable_directory: Drop default filter	2022-11-23 16:16:04 +02:00
Avi Kivity	70bfa708f5	storage_proxy: coroutinize change_hints_host_filter() Trivial straight-line code, no performance implications. Closes #12056	2022-11-23 15:34:24 +02:00
Botond Dénes	602dfdaf98	Merge 'Task manager top level repair tasks' from Aleksandra Martyniuk The PR introduces top level repair tasks representing repair and node operations performed with repair. The actions performed as a part of these operations are moved to corresponding tasks' run methods. Also a small change to repair module is added. Closes #11869 * github.com:scylladb/scylladb: repair: define run for data_sync_repair_task_impl repair: add data_sync_repair_task_impl tasks: repair: add noexcept to task impl constructor repair: define run for user_requested_repair_task_impl repair: add user_requested_repair_task_impl repair: allow direct access to max_repair_memory_per_range	2022-11-23 14:02:30 +02:00
Aleksandra Martyniuk	a3016e652f	repair: define run for data_sync_repair_task_impl Operations performed as a part of data sync repair are moved to data_sync_repair_task_impl run method.	2022-11-23 10:44:19 +01:00
Aleksandra Martyniuk	42239c8fed	repair: add data_sync_repair_task_impl Create a task spanning over whole node operation. Tasks of that type are stored on shard 0.	2022-11-23 10:19:53 +01:00
Aleksandra Martyniuk	9e108a2490	tasks: repair: add noexcept to task impl constructor Add noexcept to constructor of tasks::task_manager::task::impl and inheriting classes.	2022-11-23 10:19:53 +01:00
Aleksandra Martyniuk	4a4e9c12df	repair: define run for user_requested_repair_task_impl Operations performed as a part of user requested repair are moved to user_requested_repair_task_impl run method.	2022-11-23 10:19:51 +01:00
Aleksandra Martyniuk	3800b771fc	repair: add user_requested_repair_task_impl Create a task spanning over whole user requested repair. Tasks of that type are stored on shard 0.	2022-11-23 10:11:09 +01:00
Aleksandra Martyniuk	0256ede089	repair: allow direct access to max_repair_memory_per_range Access specifier of constexpr value max_repair_memory_per_range in repair_module is changed to public and its getter is deleted.	2022-11-23 10:11:09 +01:00
Avi Kivity	d7310fd083	gdb: messaging: print tls servers too Many systems have most traffic on tls servers, so print them. Closes #12053	2022-11-23 07:59:02 +02:00
Avi Kivity	aec9faddb1	Merge 'storage_proxy: use erm topology' from Benny Halevy When processing a query, we keep a pointer to an effective_replication_map. In a couple places we used the latest topology instead of the one held by the effective_replication_map that the query uses and that might lead to inconsistencies if, for example, a node is removed from topology after decommission that happens concurrently to the query. This change gets the topology& from the e_r_m in those cases. Fixes #12050 Closes #12051 * github.com:scylladb/scylladb: storage_proxy: pass topology& to sort_endpoints_by_proximity storage_proxy: pass topology& to is_worth_merging_for_range_query	2022-11-22 20:04:41 +02:00
Botond Dénes	49ec7caf27	mutation_fragment_stream_validator: avoid allocation when stream is correct Currently the ctor of said class always allocates as it copies the provided name string and it creates a new name via format(). We want to avoid this, now that the validator is used on the read path. So defer creating the formatted name to when we actually want to log something, which is either when log level is debug or when an error is found. We don't care about performance in either case, but we do care about it on the happy path. Further to the above, provide a constructor for string literal names and when this is used, don't copy the name string, just save a view to it. Refs: #11174 Closes #12042	2022-11-22 19:19:18 +02:00
Nadav Har'El	ce7c1a6c52	Merge 'alternator: fix wrong 'where' condition for GSI range key' from Marcin Maliszkiewicz Contains fixes requested in the issue (and some tiny extras), together with analysis why they don't affect the users (see commit messages). Fixes [ #11800](https://github.com/scylladb/scylladb/issues/11800) Closes #11926 * github.com:scylladb/scylladb: alternator: add maybe_quote to secondary indexes 'where' condition test/alternator: correct xfail reason for test_gsi_backfill_empty_string test/alternator: correct indentation in test_lsi_describe alternator: fix wrong 'where' condition for GSI range key	2022-11-22 17:46:52 +02:00
Pavel Emelyanov	22133a3949	sstable_directory: Move all RAII booleans onto flags There's a bunch of booleans that control the behavior of sstable directory scanning. Currently they are described as verbose bool_class<>-es and are put into sstable_directory construction time. However, these are not used outside of .process_sstable_dir() method and moving them onto recently added flags struct makes the code much shorter (29 insertions(+), 121 deletions(-)) Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-22 18:30:00 +03:00
Pavel Emelyanov	7ca5e143d7	sstable_directory: Convert sort-sstables argument to flags struct The sstable_directory::process_sstable_dir() accepts a boolean to control its behavior when collecting sstables. Turn this boolean into a structure of flags. The intention is to extend this flags set in the future (next patch). This boolean is true all the time, but one place sets it to true in a "verbose" manner, like this: bool sort_sstables_according_to_owner = false; process_sstable_dir(directory, sort_sstables_according_to_owner).get(); the local variable is not used anymore. Using designated initializers solves the verbosity in a nicer manner. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-22 18:19:23 +03:00
Pavel Emelyanov	7c7017d726	sstable_directory: Drop default filter It's used as default argument for .reshape() method, but callers specify it explicitly. At the same time the filter is simple enough and is only used in one place so that the caller can just use explicit lambda. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-22 18:19:23 +03:00
Benny Halevy	731a74c71f	storage_proxy: pass topology& to sort_endpoints_by_proximity It mustn't use the latest topology that may differ from the one used by the query as it may be missing nodes (e.g. after concurrent decommission). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-22 15:02:40 +02:00
Benny Halevy	ab3fc1e069	storage_proxy: pass topology& to is_worth_merging_for_range_query It mustn't use the latest topology that may differ from the one used by the query as it may be missing nodes (e.g. after concurrent decommission). Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-22 15:01:58 +02:00
Marcin Maliszkiewicz	2bf2ffd3ed	alternator: add maybe_quote to secondary indexes 'where' condition This bug doesn't affect anything, the reason is described in the commit: 'alternator: fix wrong 'where' condition for GSI range key'. But it's theoretically correct to escape those key names and the difference can be observed via CQL's describe table. Before the patch 'where' condition is missing one double quote in variable name making it mismatched with corresponding column name.	2022-11-22 11:08:23 +01:00
Marcin Maliszkiewicz	4389baf0d9	test/alternator: correct xfail reason for test_gsi_backfill_empty_string Previously cited issue is closed already.	2022-11-22 11:08:23 +01:00
Marcin Maliszkiewicz	59eca20af1	test/alternator: correct indentation in test_lsi_describe Otherwise I think assert is not executed in a loop. And I am not sure why lsi variable can be bound to anything. As I tested it was pointing to the last element in lsis...	2022-11-22 11:08:23 +01:00
Marcin Maliszkiewicz	d6d20134de	alternator: fix wrong 'where' condition for GSI range key This bug doesn't manifest in a visible way to the user. Adding the index to an existing table via GlobalSecondaryIndexUpdates is not supported so we don't need to consider what could happen for empty values of index range key. After the index is added the only interesting value user can set is omitting the value (null or empty are not allowed, see test_gsi_empty_value and test_gsi_null_value). In practice, regardless of the 'where' condition, the underlying materialized view code skips row updates with missing keys as per this comment: 'If one of the key columns is missing, set has_new_row = false meaning that after the update there will be no view row'. That's why the added test passes both before and after the patch. But it's still useful to include it to exercise those code paths. Fixes #11800	2022-11-22 11:08:23 +01:00
Nadav Har'El	ff617c6950	cql-pytest: translate a few small Cassandra tests This patch includes a translation of several additional small test files from Cassandra's CQL unit test directory cql3/validation/operations. All tests included here pass on both Cassandra and Scylla, so they did not discover any new Scylla bugs, but can be useful in the future as regression tests. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12045	2022-11-22 07:54:13 +02:00
Botond Dénes	f3eecb47f6	Merge 'Optimize cleanup compaction get ranges for invalidation' from Benny Halevy Take advantage of the facts that both the owned ranges and the initial non_owned_ranges (derived from the set of sstables) are deoverlapped and sorted by start token to turn the calculation of the final non_owned_ranges from quadratic to linear. Fixes #11922 Closes #11903 * github.com:scylladb/scylladb: dht: optimize subtract_ranges compaction: refactor dht::subtract_ranges out of get_ranges_for_invalidation compaction_manager: needs_cleanup: get first/last tokens from sstable decorated keys	2022-11-22 06:45:01 +02:00
Avi Kivity	bf2e54ff85	Merge 'Move deletion log code to sstable_directory.cc' from Pavel Emelyanov In order to support different storage kinds for sstable files (e.g. -- s3) it's needed to localize all the places that manipulate files on a POSIX filesystem so that custom storage could implement them in its own way. This set moves the deletion log manipulations to the sstable_directory.cc, which already "knows" that it works over a directory. Closes #12020 * github.com:scylladb/scylladb: sstables: Delete log file in replay_pending_delete_log() sstables: Move deletion log manipulations to sstable_directory.cc sstables: Open-code delete_sstables() call sstables: Use fs::path in replay_pending_delete_log() sstables: Indentation fix after previous patch sstables: Coroutinize replay_pending_delete_log sstables: Read pending delete log with one line helper sstables: Dont write pending log with file_writer	2022-11-21 21:22:59 +02:00
Benny Halevy	57ff3f240f	dht: optimize subtract_ranges Take advantage of the fact that both ranges and ranges_to_subtract are deoverlapped and sorted by start token to reduce the calculation complexity from quadratic to linear. Fixes #11922 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-21 15:48:28 +02:00
Benny Halevy	8b81635d95	compaction: refactor dht::subtract_ranges out of get_ranges_for_invalidation The algorithm is generic and can be used elsewhere. Add a unit test for the function before it gets optimized in the following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-21 15:48:26 +02:00
Benny Halevy	7c6f60ae72	compaction_manager: needs_cleanup: get first/last tokens from sstable decorated keys Currently, the function is inefficient in two ways: 1. unnecessary copy of first/last keys to automatic variables 2. redecorating the partition keys with the schema passed to needs_cleanup. We can just use the tokens from the sstable first/last decorated keys. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2022-11-21 15:44:32 +02:00
Pavel Emelyanov	2f9b7931af	sstables: Delete log file in replay_pending_delete_log() It's natural that the replayer cleans up after itself Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:16:22 +03:00
Pavel Emelyanov	bdc47b7717	sstables: Move deletion log manipulations to sstable_directory.cc The deletion log concept uses the fact that files are on a POSIX filesystem. Support for another storage type will have to reimplement this place, so keep the FS-specific code in _directory.cc file. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:16:21 +03:00
Pavel Emelyanov	865c51c6cf	sstables: Open-code delete_sstables() call It's not used by any other code, but to be used it requires the caller to transform TOC file names by prepending sstable directory to them. Things get shorter and simpler if merging the helper code into the caller. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:15:25 +03:00
Pavel Emelyanov	a61c96a627	sstables: Use fs::path in replay_pending_delete_log() It's called by a code that has fs::path at hand and internally uses helpers that need fs::path too, so no need to convert it back and forth. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:15:25 +03:00
Pavel Emelyanov	f5684bcaf0	sstables: Indentation fix after previous patch Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:15:25 +03:00
Pavel Emelyanov	85a73ca9c6	sstables: Coroutinize replay_pending_delete_log Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:15:25 +03:00
Pavel Emelyanov	6f3fd94162	sstables: Read pending delete log with one line helper There's one in seastar since recently Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:15:25 +03:00
Pavel Emelyanov	2dedf4d03a	sstables: Dont write pending log with file_writer It's a wrapper over output_stream with offset tracking and the tracking is not needed to generate a log file. As a bonus of switching back we get a stream.write(sstring) sugar. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-11-21 13:15:24 +03:00
Botond Dénes	2d4439a739	Merge 'doc: add a troubleshooting article about the missing configuration files' from Anna Stuchlik Fix https://github.com/scylladb/scylladb/issues/11598 This PR adds the troubleshooting article submitted by @syuu1228 in the deprecated _scylla-docs_ repo, with https://github.com/scylladb/scylla-docs/pull/4152. I copied and reorganized the content and rewritten it a little according to the RST guidelines so that the page renders correctly. @syuu1228 Could you review this PR to make sure that my changes didn't distort the original meaning? Closes #11626 * github.com:scylladb/scylladb: doc: apply the feedback to improve clarity doc: add the link to the new Troubleshooting section and replace Scylla with ScyllaDB doc: add the new page to the toctree doc: add a troubleshooting article about the missing configuration files	2022-11-21 12:02:31 +02:00
Nadav Har'El	757d2a4c02	test/alternator: un-xfail a test which passes on modern Python We had an xfailing test that reproduced a case where Alternator tried to report an error when the request was too long, but the boto library didn't see this error and threw a "Broken Pipe" error instead. It turns out that this wasn't a Scylla bug but rather a bug in urllib3, which overzealously reported a "Broken Pipe" instead of trying to read the server's response. It turns out this issue was already fixed in https://github.com/urllib3/urllib3/pull/1524 and now, on modern installations, the test that used to fail now passes and reports "XPASS". So in this patch we remove the "xfail" tag, and skip the test if running an old version of urllib3. Fixes #8195 Closes #12038	2022-11-21 08:10:10 +02:00
Botond Dénes	ffc3697f2f	Merge 'storage_service api: handle dropped tables' from Benny Halevy Gracefully skip tables that were removed in the background. Fixes #12007 Closes #12013 * github.com:scylladb/scylladb: api: storage_service: fixup indentation api: storage_service: add run_on_existing_tables api: storage_service: add parse_table_infos api: storage_service: log errors from compaction related handlers api: storage_service: coroutinize compaction related handlers	2022-11-21 07:56:27 +02:00
Avi Kivity	994603171b	Merge 'Add validator to the mutation compactor' from Botond Dénes Fragment reordering and fragment dropping bugs have been plaguing us since forever. To fight them we added a validator to the sstable write path to prevent really messed up sstables from being written. This series adds validation to the mutation compactor. This will cover reads and compaction among others, hopefully ridding us of such bugs on the read path too. This series fixes some benign looking issues found by unit tests after the validator was added -- although how benign a producer emitting two partition-ends is depends entirely on how the consumer reacts to it, so no such bug is actually benign. Fixes: https://github.com/scylladb/scylladb/issues/11174 Closes #11532 * github.com:scylladb/scylladb: mutation_compactor: add validator mutation_fragment_stream_validator: add a 'none' validation level test/boost/mutation_query_test: test_partition_limit: sort input data querier: consume_page(): use partition_start as the sentinel value treewide: use ::for_partition_end() instead of ::end_of_partition_tag_t{} treewide: use ::for_partition_start() instead of ::partition_start_tag_t{} position_in_partition: add for_partition_{start,end}()	2022-11-20 20:33:26 +02:00
Avi Kivity	779b01106d	Merge 'cql3: expr: add unit tests for prepare_expression' from Jan Ciołek Adds unit tests for the function `expr::prepare_expression`. Three minor bugs were found by these tests, all fixed in this PR. 1. When preparing a map, the type for tuple constructor was taken from an unprepared tuple, which has `nullptr` as its type. 2. Preparing an empty nonfrozen list or set resulted in `null`, but preparing a map didn't. Fixed this inconsistency. 3. Preparing a `bind_variable` with `nullptr` receiver was allowed. The `bind_variable` ended up with a `nullptr` type, which is incorrect. Changed it to throw an exception. Closes #11941 * github.com:scylladb/scylladb: test preparing expr::usertype_constructor expr_test: test that prepare_expression checks style_type of collection_constructor expr_test: test preparing expr::collection_constructor for map prepare_expr: make preparing nonfrozen empty maps return null prepare_expr: fix a bug in map_prepare_expression expr_test: test preparing expr::collection_constructor for set expr_test: test preparing expr::collection_constructor for list expr_test: test preparing expr::tuple_constructor expr_test: test preparing expr::untyped_constant expr_test_utils: add make_bigint_raw/const expr_test_utils: add make_tinyint_raw/const expr_test: test preparing expr::bind_variable cql3: prepare_expr: forbid preparing bind_variable without a receiver expr_test: test preparing expr::null expr_test: test preparing expr::cast expr_test_utils: add make_receiver expr_test_utils: add make_smallint_raw/const expr_test: test preparing expr::token expr_test: test preparing expr::subscript expr_test: test preparing expr::column_value expr_test: test preparing expr::unresolved_identifier expr_test_utils: mock data_dictionary::database	2022-11-20 20:03:54 +02:00
Nadav Har'El	2ba8b8d625	test/cql-pytest: remove "xfail" from passing test testIndexOnFrozenCollectionOfUDT We had a test that used to fail because of issue #8745. But this issue was already fixed, and we forgot to remove the "xfail" marker. The test now passes, so let's remove the xfail marker. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #12039	2022-11-20 19:54:59 +02:00
Avi Kivity	40f61db120	Merge 'docs: describe the Raft upgrade and recovery procedures' from Kamil Braun Add new guide for upgrading 5.1 to 5.2. In this new upgrade doc, include additional steps for enabling Raft using the `consistent_cluster_management` flag. Note that we don't have this flag yet but it's planned to replace the experimental flag in 5.2. In the "Raft in ScyllaDB" document, add sections about: - enabling Raft in existing clusters in Scylla 5.2, - verifying that the internal Raft upgrade procedure finishes successfully, - recovering from a stuck Raft upgrade procedure or from a majority loss situation. Fix some problems in the documentation, e.g. it is not possible to enable Raft in an existing cluster in 5.0, but the documentation claimed that it is. Follow-up items: - if we decide for a different name for `consistent_cluster_management`, use that name in the docs instead - update the warnings in Scylla to link to the Raft doc - mention Enterprise versions once we know the numbers - update the appropriate upgrade docs for Enterprise versions once they exist Closes #11910 * github.com:scylladb/scylladb: docs: describe the Raft upgrade and recovery procedures docs: add upgrade guide 5.1 -> 5.2	2022-11-20 19:00:23 +02:00
Avi Kivity	15ee8cfc05	Merge 'reader_concurrency_semaphore: fix waiter/inactive race' from Botond Dénes We recently (in `7fbad8de87`) made sure all admission paths can trigger the eviction of inactive reads. As reader eviction happens in the background, a mechanism was added to make sure only a single eviction fiber was running at any given time. This mechanism however had a preemption point between stopping the fiber and releasing the evict lock. This gave an opportunity for either new waiters or inactive readers to be added, without the fiber acting on it. Since it still held onto the lock, it also prevented other eviction fibers from starting. This could create a situation where the semaphore could admit new reads by evicting inactive ones, but it still has waiters. Since an empty waitlist is also an admission criterion, once one waiter is wrongly added, many more can accumulate. This series fixes this by ensuring the lock is released in the instant the fiber decides there is no more work to do. It also fixes the assert failure on recursive eviction and adds a detection to the inactive/waiter contradiction. Fixes: #11923 Refs: #11770 Closes #12026 * github.com:scylladb/scylladb: reader_concurrency_semaphore: do_wait_admission(): detect admission-waiter anomaly reader_concurrency_semaphore: evict_readers_in_the_background(): eliminate blind spot reader_concurrency_semaphore: do_detach_inactive_read(): do a complete detach	2022-11-20 18:51:34 +02:00
Avi Kivity	895d721d5e	Merge 'scylla-sstable: data-dump improvements' from Botond Dénes This series contains a mixed bag of improvements to `scylla sstable dump-data`. These improvements are mostly aimed at making the json output clearer, getting rid of any ambiguities. Closes #12030 * github.com:scylladb/scylladb: tools/scylla-sstable: traverse sstables in argument order tools/scylla-sstable: dump-data docs: s/clustering_fragments/clustering_elements tools/scylla-sstable: dump-data/json: use Null instead of "<unknown>" tools/scylla-sstable: dump-data/json: use more uniform format for collections tools/scylla-sstable: dump-data/json: make cells easier to parse	2022-11-20 17:02:27 +02:00
Avi Kivity	2f9c53fbe4	Merge 'test/pylib: scylla_cluster: use server ID to name workdir and log file, not IP address' from Kamil Braun Since recently the framework uses a separate set of unique IDs to identify servers, but the log file and workdir is still named using the last part of the IP address. This is confusing: the test logs sometimes don't provide the IP addr (only the ID), and even if they do, the reader of the test log may not know that they need to look at the last part of the IP to find the node's log/workdir. Also using ID will be necessary if we want to reuse IP addresses (e.g. during node replace, or simply not to run out of IP addresses during testing). So use the ID instead to name the workdir and log file. Also, when starting a test case, print the used cluster. This will make it easier to map server IDs to their IP addresses when browsing through the test logs. Closes #12018 * github.com:scylladb/scylladb: test/pylib: manager_client: print used cluster when starting test case test/pylib: scylla_cluster: use server ID to name workdir and log file, not IP address	2022-11-20 16:56:19 +02:00
Avi Kivity	14218d82d6	Update tools/java submodule (serverless) * tools/java caf754f243...874e2d529b (2): > Add Scylla Cloud serverless support > Switch cqlsh to use scylla-driver	2022-11-20 16:41:36 +02:00


33918 Commits