Attempting to read a partition that the node doesn't own via `SELECT * FROM MUTATION_FRAGMENTS()`, on a table using tablets, causes a crash.
This is because with tablets, the replica side simply doesn't handle requests for un-owned tokens, and this triggers a crash.
We should probably improve how the replica side handles this (an exception is better than a crash), but that is outside the scope of this PR.
This PR fixes the crash and also adds a reproducer test.
Fixes: https://github.com/scylladb/scylladb/issues/18786
Fixes a regression introduced in 6.0, so needs backport to 6.0 and 6.1
Closes scylladb/scylladb#20109
* github.com:scylladb/scylladb:
test/tablets: Test that reading tablets' mutations from MUTATION_FRAGMENTS works
replica/mutation_dump: enforce pinning of effective replication map
replica/mutation_dump: handle un-owned tokens (with tablets)
When executing reversed queries, a native reversed format shall be used. Therefore, the table schema and the clustering key bounds are reversed before a partition slice and a read command are constructed.
It is, however, possible to run a reversed query passing the original (non-reversed) table schema, but only when there are no restrictions on the clustering keys. In this particular situation, the query returns correct results. Since the current alternator tests in test.py do not apply any restrictions, this situation was not caught during development of https://github.com/scylladb/scylladb/pull/18864.
Hence, additional tests are provided that add clustering key restrictions when executing reversed queries, to capture such errors earlier than in dtests.
Additional manual tests were performed to test a mixed-node cluster (with alternator API enabled in Scylla on each node):
1. 2-node cluster with one node upgraded: reverse read queries performed on an old node
2. 2-node cluster with one node upgraded: reverse read queries performed on a new node
3. 2-node cluster with one node upgraded and all its sstable files deleted to trigger repair: reverse read queries performed on an old node
4. 2-node cluster with one node upgraded and all its sstable files deleted to trigger repair: reverse read queries performed on a new node
All reverse read queries above consist of:
- single-partition reverse reads with no clustering key restrictions, with single-column restrictions, and with multi-column restrictions, both with and without paging turned on
The exact same tests were also performed on a fully upgraded cluster.
Fixes https://github.com/scylladb/scylladb/issues/20191
No backport is required as this is a complementary patch for the series https://github.com/scylladb/scylladb/pull/18864 that did not require backporting.
Closes scylladb/scylladb#20205
* github.com:scylladb/scylladb:
test_query.py: Test reverse queries with clustering key bounds
alternator::do_query Add additional trace log
alternator::do_query: Use native reversed format
alternator::do_query Rename schema to table_schema
Currently, when a view update backlog of one replica is full, the write is still sent by the coordinator to all replicas. Because of the backlog, the write fails on the replica, causing inconsistency that needs to be fixed by repair. To avoid these inconsistencies, this patch adds a check on the coordinator for overloaded replicas. As a result, a write may be rejected before being sent to any replicas and later retried by the user, when the replica is no longer overloaded.
This patch does not remove the replica write failures, because we still may reach a full backlog when more view updates are generated after the coordinator check is performed and before the write reaches the replica.
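As an illustration of the admission idea, here is a minimal sketch (not ScyllaDB's actual code; the types and names are made up for the example, and the real patch returns overloaded_exception to the client rather than throwing on the spot):
```
#include <cstdint>
#include <optional>
#include <string>
#include <vector>

struct replica_state {
    uint64_t view_backlog;      // current view-update backlog
    uint64_t view_backlog_max;  // backlog level considered full
};

// Run on the coordinator before dispatching a base-table write to its
// replicas: if any replica's view-update backlog is already full, reject
// the write up front so it is not sent to any replica.
std::optional<std::string> admit_write(const std::vector<replica_state>& replicas) {
    for (const auto& r : replicas) {
        if (r.view_backlog >= r.view_backlog_max) {
            // The client may retry once the replica is no longer overloaded.
            return "overloaded: view update backlog is full, retry later";
        }
    }
    return std::nullopt; // admitted: proceed with the write
}
```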
Fixes scylladb/scylladb#17426
Closes scylladb/scylladb#18334
* github.com:scylladb/scylladb:
mv: test the view update behavior
mv: add test for admission control
storage_proxy: return overloaded_exception instead of throwing
mv: reject user requests by coordinator when a replica is overloaded by MVs
This series fixes an issue where histogram Summaries return an infinite value.
It updates the quantile calculation logic to address cases where values fall into the infinite bucket of a histogram.
Now, instead of returning infinity (max int), the calculation returns the last bucket limit, ensuring finite outputs in all cases.
The series adds a test for summaries with a specific test case for this scenario.
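As an illustration of the new rule, here is a minimal sketch of a quantile estimator that clamps to the last finite bucket limit (this is not the actual utils/histogram.hh code; the bucket layout and names are assumed for the example):
```
#include <cstddef>
#include <cstdint>
#include <vector>

// limits holds the finite bucket upper bounds; counts has one extra
// trailing slot for the +inf overflow bucket.
double quantile(const std::vector<double>& limits,
                const std::vector<uint64_t>& counts, double q) {
    uint64_t total = 0;
    for (uint64_t c : counts) {
        total += c;
    }
    if (total == 0) {
        return 0.0;
    }
    const uint64_t rank = static_cast<uint64_t>(q * total);
    uint64_t seen = 0;
    for (size_t i = 0; i < limits.size(); ++i) {
        seen += counts[i];
        if (seen >= rank) {
            return limits[i];
        }
    }
    // The requested quantile falls into the infinite bucket: return the
    // last finite bucket limit instead of infinity.
    return limits.back();
}
```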
Fixes #20255
Needs backport to 6.0, 6.1, and 2023.1 and above
Closes scylladb/scylladb#20257
* github.com:scylladb/scylladb:
test/estimated_histogram_test Add summary tests
utils/histogram.hh: Make summary support infinite bucket.
To prevent stalls due to a large number of tokens.
For example, a large cluster with, say, 70 nodes can have
more than 16K tokens.
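Roughly, the idea looks like this (an illustrative sketch only, with made-up token types; the actual change makes abstract_replication_strategy::get_ranges async):
```
#include <seastar/core/future.hh>
#include <seastar/core/coroutine.hh>
#include <seastar/coroutine/maybe_yield.hh>
#include <cstdint>
#include <vector>

struct token { int64_t value; };
struct token_range { token start, end; };

// Build ranges from a sorted token list, yielding to the reactor between
// iterations so rings with many thousands of tokens don't cause stalls.
seastar::future<std::vector<token_range>> get_ranges(std::vector<token> tokens) {
    std::vector<token_range> ranges;
    for (size_t i = 1; i < tokens.size(); ++i) {
        ranges.push_back(token_range{tokens[i - 1], tokens[i]});
        co_await seastar::coroutine::maybe_yield();
    }
    co_return ranges;
}
```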
Fixes #19757
Closes scylladb/scylladb#19758
* github.com:scylladb/scylladb:
abstract_replication_strategy: make get_ranges async
database: get_keyspace_local_ranges: get vnode_effective_replication_map_ptr param
compaction: task_manager_module: open code maybe_get_keyspace_local_ranges
alternator: ttl: token_ranges_owned_by_this_shard: let caller make the ranges_holder
alternator: ttl: can pass const gms::gossiper& to ranges_holder
alternator: ttl: ranges_holder_primary: unconstify _token_ranges member
alternator: ttl: refactor token_ranges_owned_by_this_shard
Currently "table removal" is logged as a reason of compaction stop for table drop,
tablet cleanup and tablet split. Modify log to reflect the reason.
Closes scylladb/scylladb#20042
* github.com:scylladb/scylladb:
test: add test to check compaction stop log
compaction: fix compaction group stop reason
before this change, none of the targets generated by the CMake-based
build system ran `test.py`. but `build.ninja` generated directly
by `configure.py` provides a target named `test`, which runs
`test.py` with the options passed to `configure.py`.
to be more compatible with the rules generated by `configure.py`,
in this change we:
* do not include the "CTest" module, as we are not using CTest for
driving tests. we use the home-grown `test.py` for this purpose.
more importantly, the target named "test" is provided by "CTest",
so in order to add our own "test" target, we cannot use the "CTest"
module.
* add a target named "test" to run "test.py".
* add two CMake options so we can customize the behavior of "test.py";
this is to be compatible with the existing behavior of `configure.py`.
Refs scylladb/scylladb#2717
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
Closes scylladb/scylladb#20263
This adds a minimal implementation of the start-backup API call.
The method starts a task that uploads all files from the given keyspace's snapshot to the requested endpoint/bucket. Arguments are:
- endpoint -- the ID in object_store.yaml config file
- bucket -- the target bucket to put objects into
- keyspace -- the keyspace to work on
- snapshot -- the method assumes that the snapshot has already been taken and only copies sstables from it
The task runs in the background. Its task_id is returned from the method once it's spawned, and should be used via the /task_manager API to track the task's execution and completion (hint: it's good to have a non-zero TTL value to make sure fast backups don't finish before the caller manages to call the wait_task API).
Sstable components are scanned for all tables in the keyspace and are uploaded into the /bucket/${cf_name}/${snapshot_name}/ path.
refs: #18391
Closes scylladb/scylladb#19890
* github.com:scylladb/scylladb:
tools/scylla-nodetool: add backup integration
docs: Document the new backup method
test/object_store: Test that backup task is abortable
test/object_store: Add simple backup test
test/object_store: Move format_tuples()
test/pylib: Add more methods to rest client
backup-task: Make it abortable (almost)
code: Introduce backup API method
database: Export parse_table_directory_name() helper
database: Introduce format_table_directory_name() helper
snapshot-ctl: Add config to snapshot_ctl
snapshot-ctl: Add sstables::storage_manager dependency
snapshot-ctl: Maintain task manager module
snapshot-ctl: Add "snapshots" logger
snapshot-ctl: Outline stop() method and constructor
snapshot-ctl: Inline run_snapshot_list<>
test/cql_test_env: Export task manager from cql test env
task_manager: Print task ttl on start (for debugging)
docs: Update object_storage.md with AWS_ environment
docs: Restructure object_storage.md
The pool size increase was recently reverted because of flakiness in the test_gossip_boot test. The test started
to fail on adding a node to the cluster, without any issues in the Scylla log file. In the test logs it looked like the
installation process for the new node just hung. After investigating the problem, I found out that
test.py was draining the io_executor pool, which was set to eight workers, by cleaning the directory during install. So
to fix the issue, the io_executor pool should be increased to roughly the same ratio as before: double the cluster pool size.
Closes scylladb/scylladb#20276
To prevent stalls due to a large number of tokens.
For example, a large cluster with, say, 70 nodes can have
more than 16K tokens.
Fixes #19757
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Prepare for making the function async.
Then, it will need to hold on to the erm while getting
the token_ranges asynchronously.
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
The `keyspace_compaction` method incorrectly appends the column family
parameter to the URL using a regular string, `"?cf={table}"`, instead of
an f-string, `f"?cf={table}"`. As a result, the column family name is
sent as `{table}` to the server, causing the compaction request to fail.
Fix this issue by passing the parameter to the POST request using a
dictionary instead of appending it to the URL.
Fixes #20264
Signed-off-by: Lakshmi Narayanan Sreethar <lakshmi.sreethar@scylladb.com>
Closes scylladb/scylladb#20243
before this change, we used the default options when creating `test_env`,
and the default options enable `use_uuid`. but the modes of
`perf-sstables` involving reads assume that the identifiers are
deterministic, so that the sstables previously written using the "write"
mode can be read with modes like "index_read", which just uses
`test_env::make_sstable()` in `load_sstables()`; under the hood,
`test_env::make_sstable()` uses `test_env::new_generation()` for
retrieving the next sstable identifier. when using integer-based
identifiers, this works, as the sstable identifiers are generated
from a monotonically increasing integer sequence, so the identifiers
are deterministic. but this does not apply anymore when UUID-based
identifiers are used, as the identifiers are generated with a
pseudorandom generator of UUID v1.
in this change, to avoid relying on the determinism of integer-based
sstable identifier generation, we enumerate sstables by listing the
given directory, and parse each path for its identifier.
after this change, we are able to support UUID-based sstable
identifiers.
another option was to disable UUID-based sstable identifiers when
loading sstables. the upside is that this approach is minimal and
straightforward. but the downside is that it encodes the assumption
in the algorithm implicitly, and could be confusing -- we would create
a new generation for loading an existing sstable with this generation.
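a rough sketch of the enumeration approach (illustrative only; the
actual sstable file-name layout depends on the format version, so the
parsing below is simplified):
```
#include <filesystem>
#include <string>
#include <vector>

// list the sstable data files in a directory and extract each sstable's
// identifier from its file name, instead of predicting the next
// generation to be handed out.
std::vector<std::string> list_sstable_identifiers(const std::filesystem::path& dir) {
    std::vector<std::string> ids;
    for (const auto& entry : std::filesystem::directory_iterator(dir)) {
        const std::string name = entry.path().filename().string();
        if (!name.ends_with("-Data.db")) {
            continue; // not an sstable data component
        }
        // assume the identifier is the second dash-separated component,
        // e.g. "me-<identifier>-big-Data.db" (simplified)
        const auto first = name.find('-');
        const auto second = name.find('-', first + 1);
        ids.push_back(name.substr(first + 1, second - first - 1));
    }
    return ids;
}
```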
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
Closes scylladb/scylladb#20183
Currently the major compaction task impl grabs this (non-updateable)
value from db::config. That's not good: all services, including the
compaction manager, have their own configs from which they take options.
That said, this patch puts the said option onto
compaction_manager::config, makes use of it, and configures it from
db::config on start (and in tests).
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Closes scylladb/scylladb#20174
This is a prerequisite for the "restore from object storage" feature. In order to collect the sstables in a bucket, one needs to list the bucket contents with a given prefix. ListObjectsV2 provides a way to do that, and here's the respective s3::client extension.
Closes scylladb/scylladb#20120
* github.com:scylladb/scylladb:
test: Add test for s3::client::bucket_lister
s3_client: Add bucket lister
s3_client: Encode query parameter value for query-string
in 30e82a81, we added a constraint to the template parameter of
boost_test_print_type() to prevent it from being matched with
types which can be formatted with operator<<. but it failed to
work. we still see test failure reports like:
```
[Exception] - critical check ['s', 's', 't', '_', 'm', 'r', '.', 'i', 's', '_', 'e', 'n', 'd', '_', 'o', 'f', '_', 's', 't', 'r', 'e', 'a', 'm', '(', ')'] has failed
```
this is not what we expect. the reason is that we passed the template
parameters to the `has_left_shift` trait in the wrong order, see
https://live.boost.org/doc/libs/1_83_0/libs/type_traits/doc/html/boost_typetraits/reference/has_left_shift.html.
we should have passed the lhs of the operator<< expression as the first
parameter, and the rhs as the second.
so, in this change, we correct the type constraint by passing the
template parameters in the right order. now the error message looks
better, like:
```
test/boost/mutation_query_test.cc(110): error: in "test_partition_query_is_full": check !partition_slice_builder(*s) .with_range({}) .build() .is_full() has failed
```
it turns out boost::transformed_range<> is formattable with operator<<,
as it fulfills the constraints of `boost::has_left_shift<ostream, R>`,
but when printing it, the compiler fails when it tries to insert the
elements of the range into the output stream.
so, in order to work around this issue, we add a specialization for
`boost::transformed_range<F, R>`.
also, to improve readability, we reimplement `has_left_shift<>`
as a concept, so that it's obvious that the output stream needs to be
the first parameter.
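for illustration, a sketch of such a concept and of constraining the
fallback printer with it (illustrative signatures, not the exact
test-framework code):
```
#include <ostream>

// true when `os << value` is well-formed, i.e. the output stream is the
// left-hand side of the expression
template <typename T>
concept has_left_shift = requires(std::ostream& os, const T& value) {
    os << value;
};

// fallback printer, used only for types that are NOT already printable
// with operator<<
template <typename T>
requires (!has_left_shift<T>)
std::ostream& boost_test_print_type(std::ostream& os, const T&) {
    return os << "<not printable>";
}
```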
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
Closes scylladb/scylladb#20233
This patch adds tests for summary calculation. It adds two tests: the
first is a basic calculation of P50, P95, and P99 by adding 100 elements
into 20 buckets.
The second test checks that if elements are found in the infinite bucket,
the result is the bucket's lower limit (33s) and not infinite.
Relates to #20255
Signed-off-by: Amnon Heiman <amnon@scylladb.com>
as we have an API for backing up a keyspace, let's expose this feature
with nodetool, so we can exercise it without the help of scylla-manager
or 3rd-party tools, with a user-friendly interface.
in this change:
* add a new subcommand named "backup" to nodetool
* add a test to verify its interaction with the API server
* add two more routes to the REST API mock server, as
the test is using the /task_manager/wait_task/{task_id} API.
for the sake of completeness, the route for
/task_manager/{part1} is added as well.
* update the documentation accordingly.
* update the bash completion script accordingly.
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
It starts similarly to the simple backup test, but injects a pause into the
task once a single file is scheduled for upload, then aborts the task,
waits for it to fail, and checks that _not_ all files are uploaded.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The test shows how to back up a keyspace:
- flush
- take snapshot
- start backup with the new API method
- wait for the task to finish
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Namely:
- POST /storage_service/snapshots to take snapshot on a ks
- GET /task_manager/get_task_status/{id} to get status of a running task
- GET /task_manager/wait_task/{id} to wait for a task to finish
- POST /task_manager/abort_task/{id} to abort a running task
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The method starts a task that uploads all files from the given
keyspace's snapshot to the requested endpoint/bucket. The task runs in
the background. Its task_id is returned from the method once it's
spawned, and should be used via the /task_manager API to track the task's
execution and completion (hint: it's good to have a non-zero TTL value to
make sure fast backups don't finish before the caller manages to call the
wait_task API).
If the snapshot doesn't exist, nothing happens (FIXME, need to return
an error in that case).
If the endpoint is not configured locally, the API call resolves with
bad-request instantly.
Sstable components are scanned for all tables in the keyspace and are
uploaded into the /bucket/${cf_name}/${snapshot_name}/ path.
The task is not abortable (FIXME -- to be added) and doesn't really report
its progress other than running/done state (FIXME -- to be added too).
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Pretty much all services in Scylla have their own config. Add one to
snapshot-ctl too, it will be populated later.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
The storage_manager maintains a set of clients to the configured object
storage(s). The snapshot ctl is going to spawn tasks that will talk to
those storages, thus it needs the storage manager to get the clients
from.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
This service is going to start tasks managed by task manager. For that,
it should have its module set up and registered.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Currently it doesn't; one of the nodes crashes with a std::out_of_range
exception and a meaningless calltrace.
[Botond]: this test checks the case of reading a partition via
MUTATION_FRAGMENTS from a node which doesn't own said partition.
refs: #18786
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
Since a native reversed format is used for reversed queries,
additional tests with restrictions on clustering keys are required
to capture possible errors like https://github.com/scylladb/scylladb/issues/20191
earlier than in dtests.
Add parametrization to the following tests:
+ test_query_reverse
+ test_query_reverse_paging
to accept a comparison operator used in selection criteria for a Query
operation.
compaction_manager::remove passes "table removal" as the reason
for stopping ongoing compactions, but currently the remove method
is also called when a tablet is migrated or split.
Pass the actual reason for the compaction stop, so that logs aren't
misleading.
This reverts commit cc428e8a36. It causes
many spurious CI failures while nodes are being torn down. Revert it until
the root cause is fixed, after which it can be reinstated.
Fixes #20116.
Now, when each shard's storage_group_manager keeps
only the storage_groups for the tablet replicas it owns,
we can simply return the storage_group map size
instead of counting the number of tablet replicas
mapped to this shard.
Add a unit test that sums the tablet count
on all shards and tests that the sum is equal
to the configured default `initial_tablets`.
Fixes #18909
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
Closes scylladb/scylladb#20223
io_fiber/store_snapshot_descriptor now gets the actual number of items
preserved when the log is truncated, fixing extra entries remaining after
log snapshot creation. Also removes the incorrect check for the number of
truncated items in
raft_sys_table_storage::store_snapshot_descriptor.
Minor change: added an error_injection test API for changing snapshot threshold settings.
Fixes scylladb/scylladb#16817
Fixes scylladb/scylladb#20080
Closes scylladb/scylladb#20095
* github.com:scylladb/scylladb:
raft: Ensure const correctness in applier_fiber.
raft: Invoke store_snapshot_descriptor with actually preserved items.
raft: Use raft_server_set_snapshot_thresholds in tests.
raft: Fix indentation in server.cc
raft: Add a test to check log size after truncation.
raft: Add raft_server_set_snapshot_thresholds injection.
utils: Ensure const correctness of injection_handler::get().
It is unsafe to restrict the sync nodes for repair to the source data center if it has too low a replication factor in network_topology_replication_strategy, or if other nodes in that DC are ignored.
Also, this change restricts the usage of source_dc to the `network_topology` and `everywhere_topology` strategies, as with the simple replication strategy there is no guarantee that there would be any more replicas in that data center.
Fixes #16826
Reproducer submitted as https://github.com/scylladb/scylla-dtest/pull/3865
It fails without this fix and passes with it.
* Requires backport to live versions. Issue hit in the field with 2022.2.14
Closes scylladb/scylladb#16827
* github.com:scylladb/scylladb:
repair: do_rebuild_replace_with_repair: use source_dc only when safe
repair: replace_with_repair: pass the replace_node downstream
repair: replace_with_repair: pass ignore_nodes as a set of host_id:s
repair: replace_rebuild_with_repair: pass ks_erms from caller
nodetool: rebuild: add force option
Add and use utils::optional_param to pass source_dc
- raft_sys_table_storage::store_snapshot_descriptor now receives the number of
preserved items in the log, rather than the _config.snapshot_trailing value;
- The incorrect check for the number of truncated items in store_snapshot_descriptor
was removed.
Fixes scylladb/scylladb#16817
Fixes scylladb/scylladb#20080
Replace raft_server_snapshot_reduce_threshold with raft_server_set_snapshot_thresholds in tests
as raft_server_set_snapshot_thresholds fully covers the functionality of raft_server_snapshot_reduce_threshold.
before this change, `scylla sstable shard-of` didn't support tablets,
because:
- with tablets enabled, data distribution uses the scheduler
- this replaces the previous method of mapping based on vnodes and shard numbers
- as a result, we can no longer deduce sstable mapping from token ranges
in this change, we:
- read `system.tablets` table to retrieve tablet information
- print the tablet's replica set (list of <host, shard> pairs)
- this helps users determine where a given sstable is hosted
This approach provides the closest equivalent functionality of
`shard-of` in the tablet era.
Fixes scylladb/scylladb#16488
---
no need to backport, it's an improvement, not a critical fix.
Closes scylladb/scylladb#20002
* github.com:scylladb/scylladb:
tools: enhance `scylla sstable shard-of` to support tablets
replica/tablets: extract tablet_replica_set_from_cell()
tools: extract get_table_directory() out
tools: extract read_mutation out
build: split the list of source files across multiple lines
tools/scylla-sstable: print warning when running shard-of with tablets
~~~
utils/tagged_integer: remove conversion to underlying integer
Silently converting a tagged (i.e., "dimension-ful") integer to a naked
("dimensionless") integer defeats the purpose of having tagged integers,
and is a source of practical bugs, such as
<https://github.com/scylladb/scylladb/issues/20080>.
We could make the conversion operator explicit, for enforcing
static_cast<TAGGED_INTEGER_TYPE::value_type>(TAGGED_INTEGER_VALUE)
in every conversion location -- but that's a mouthful to write. Instead,
remove the conversion operator, and let clients call the (identically
behaving) value() member function.
~~~
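For illustration, a minimal sketch of the resulting interface (simplified; not the actual utils/tagged_integer.hh implementation):
```
#include <compare>
#include <cstdint>

// A "dimension-ful" integer: distinct Tag types can't be mixed, and the
// naked integer must be requested explicitly via value().
template <typename Tag, typename ValueType = uint64_t>
class tagged_integer {
    ValueType _value = 0;
public:
    tagged_integer() = default;
    explicit tagged_integer(ValueType v) : _value(v) {}
    // no "operator ValueType()": the implicit conversion is exactly what
    // the patch removes
    ValueType value() const { return _value; }
    auto operator<=>(const tagged_integer&) const = default;
};

struct index_tag {};
using index_t = tagged_integer<index_tag>;

// index_t idx{42};
// uint64_t raw = idx;          // error: no implicit conversion
// uint64_t ok  = idx.value();  // explicit and greppable
```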
No backport needed (refactoring).
The series is supposed to solve #20081.
Two patches in the series touch up code that is known to be (orthogonally) buggy; see
- `service/raft_sys_table_storage: tweak dead code` (#20080)
- `test/raft/replication: untag index_t in test_case::get_first_val()` (#20151)
Fixes for those (independent) issues will have to be rebased on this series, or this series will have to be rebased on those (due to context conflicts).
The series builds at every stage. The debug and release unit test suites pass at the end.
Closes scylladb/scylladb#20159
* github.com:scylladb/scylladb:
utils/tagged_integer: remove conversion to underlying integer
test/raft/randomized_nemesis_test: clean up remaining index_t usage
test/raft/randomized_nemesis_test: clean up index_t usage in store_snapshot()
test/raft/replication: clean up remaining index_t usage
test/raft/replication: take an "index_t start_idx" in create_log()
test/raft/replication: untag index_t in test_case::get_first_val()
test/raft/etcd_test: tag index_t and term_t for comparisons and subtractions
test/raft/fsm_test: tag index_t and term_t for comparisons and subtractions
test/raft/helpers: tighten compare_log_entries() param types
service/raft_sys_table_storage: tweak dead code
service/raft_sys_table_storage: simplify (snap.idx - preserve_log_entries)
service/raft_sys_table_storage: untag index_t and term_t for queries
raft/server: clean up index_t usage
raft/tracker: don't drop out of index_t space for subtraction
raft/fsm: clean up index_t and term_t usage
raft/log: clean up index_t usage
db/system_keyspace: promise a tagged integer from increment_and_get_generation()
gms/gossiper: return "strong_ordering" from compare_endpoint_startup()
gms/gossiper: get "int32_t" value of "gms::version_type" explicitly
To be used to force usage of source_dc, even
when it is unsafe for rebuild.
Update docs and add test/nodetool/test_rebuild.py
Signed-off-by: Benny Halevy <bhalevy@scylladb.com>
it would be more helpful if the output of the "--help" command line
could include the default values of options.
so, in this change, we include the default values in it.
Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>
Closes scylladb/scylladb#20170
Alternator already supports **authentication** - the ability to sign each request as a particular user. The users that can be used are the different "roles" that are created by CQL "CREATE ROLE" commands. This series adds support for **authorization**, i.e., the ability to determine that only some of these roles are allowed to read or write particular tables, to create new tables, and so on.
The way we chose to do this in this series is to support CQL's existing role-based access control (RBAC) commands - GRANT and REVOKE - on Alternator tables. For example, an Alternator table "xyz" is visible to CQL as "alternator_xyz.xyz", so a `GRANT SELECT ON alternator_xyz.xyz TO myrole` will allow read commands (e.g., GetItem) on that table, and without this GRANT, a GetItem will fail with `AccessDeniedException`.
This series adds the necessary checks to all relevant Alternator operations, and also adds extensive functional testing for this feature - i.e., that certain DynamoDB API operations are not allowed without the appropriate GRANTs.
The following permissions are needed for the following Alternator API operations:
* **SELECT**: `GetItem`, `Query`, `Scan`, `BatchGetItem`, `GetRecords`
* **MODIFY**: `PutItem`, `DeleteItem`, `UpdateItem`, `BatchWriteItem`
* **CREATE**: `CreateTable`
* **DROP**: `DeleteTable`
* **ALTER**: `UpdateTable`, `TagResource`, `UntagResource`, `UpdateTimeToLive`
* _none needed_: `ListTables`, `DescribeTable`, `DescribeEndpoints`, `ListTagsOfResource`, `DescribeTimeToLive`, `DescribeContinuousBackups`, `ListStreams`, `DescribeStream`, `GetShardIterator`
Currently, I decided that for consistency each operation requires one permission only. For example, PutItem only requires MODIFY permission. This is despite the fact that in some cases (namely, `ReturnValues=ALL_OLD`) it can also _read_ the item. We should perhaps discuss this decision - and compare how it was done in CQL - e.g., what happens in LWT writes that may return old values?
Different permissions can be granted for a base table, each of its views, and the CDC table (Alternator streams). This adds power - e.g., we can allow a role to read only a view but not the base table, or read the table but not its history. GRANTing permissions on views or CDC logs requires knowing their names, which are somewhat ugly (e.g., the name of GSI "abc" in table "xyz" is `alternator_xyz.xyz:abc`). But usefully, the error message when permissions are denied contains the full name of the table that was lacking permissions and which permissions were lacking, so users can easily add them.
In addition to permissions checking, this series also correctly supports _auto-grant_ (except #19798): When a role has permissions to `CreateTable`, any table it creates will automatically be granted all permissions for this role, so this role will be able to use the new table and eventually delete it. `DeleteTable` does the opposite - it removes permissions from tables being deleted, so that if later a second user re-creates a table with the same name, the first user will not have permissions over the new table.
The already-existing configuration parameter `alternator_enforce_authorization` (off by default), which previously only enabled authentication, now also enables authorization. Users that upgrade to the new version and already had `alternator_enforce_authorization=true` should verify that the users they use to authenticate either have the appropriate permissions or the "superuser" flag. Roles used to authenticate must also have the "login" flag.
Please note that although the new RBAC support implements the access control feature we asked for in #5047, this implementation is _not compatible_ with DynamoDB. In DynamoDB, the access control is configured through IAM operations or through the new `PutResourcePolicy` operation, not through CQL (obviously!). DynamoDB also offers finer access-control granularity than we support (Scylla's RBAC works on entire tables, DynamoDB allows setting permissions on key prefixes, on individual attributes, and more). Despite this non-compatibility, I believe this feature, as is, will already be useful to Alternator users.
Fixes #5047 (after closing that issue, a new clean issue should be opened about the DynamoDB-compatible APIs that we didn't do - just so we remember this wasn't done yet).
New feature, should not be backported.
Closes scylladb/scylladb#20135
* github.com:scylladb/scylladb:
tests: disable test_alternator_enforce_authorization_true
test, alternator: test for alternator_enforce_authorization config
test/pylib: allow setting driver_connect() options in servers_add()
test: fix test_localnodes_joining_nodes
alternator, RBAC: reproducer for missing CDC auto-grant
alternator: document the new RBAC support
alternator: add RBAC enforcement to GetRecords
test/alternator: additional tests for RBAC
test/alternator: reduce permissions-validity-in-ms
test/alternator: add test for BatchGetItem from multiple tables
alternator: test for operations that do not need any permissions
alternator: add RBAC enforcement to UpdateTimeToLive
alternator: add RBAC enforcement to TagResource and UntagResource
alternator: add RBAC enforcement to BatchGetItem
alternator: add RBAC enforcement to BatchWriteItem
alternator: add RBAC enforcement to UpdateTable
alternator: add RBAC enforcement to Query and Scan
alternator: add RBAC enforcement to CreateTable
alternator: add RBAC enforcement to DeleteTable
alternator: add RBAC enforcement to UpdateItem
alternator: add RBAC enforcement to DeleteItem
alternator: add RBAC enforcement to PutItem
alternator: add RBAC enforcement to GetItem
alternator: stop using an "internal" client_state
Consider the following:
```
T
0 split prepare starts
1 repair starts
2 split prepare finishes
3 repair adds unsplit sstables
4 repair ends
5 split executes
```
If repair produces sstables after the split prepare phase, the replica will not split those sstables later, as the prepare phase is considered completed already. That causes split execution to fail, since the replicas weren't really prepared. This can also be triggered with load-and-stream, which shares the same write (consumer) path.
The approach to fix this is the same as the one employed to prevent a race between split and migration. If migration happens during the prepare phase, the source may miss the split request, but the tablet will still be split on the destination (if needed). Similarly, the repair writer becomes responsible for splitting the data if the underlying table is in split mode. That's implemented in replica::table for correctness, so if the node crashes, a new sstable missing split is still split before being added to the set.
Fixes #19378.
Fixes #19416.
Closes scylladb/scylladb#19427
* github.com:scylladb/scylladb:
tablets: Fix race between repair and split
compaction: Allow "offline" sstable to be split
The test is flaky and needs to be fixed in order to not randomly break
our CI; OTOH it can be commented out for the time being, so that we can
merge the feature.
This patch adds tests that demonstrate the current way that Alternator's
authentication and authorization are both enabled or disabled by the
option "alternator_enforce_authorization".
If in the future we decide to change this option or eliminate it (e.g.,
remain just with the "authenticator" and "authorizer" options), we can
easily update these tests to fit the new configuration parameters and
check they work as expected.
Because the new tests want to start Scylla instances with different
configuration parameters, they are written in the "topology"
framework and not in the test/alternator framework. The test/alternator
framework still contains the vast majority of the functional testing of
the RBAC feature (test/alternator/test_cql_rbac.py), where all those
tests just assume that RBAC is enabled and needs to be tested.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
The manager.driver_connect() function allows passing parameters when
creating the connection (e.g., a special auth_provider), but unfortunately
right now the servers_add() function always calls driver_connect()
without parameters. So in this patch we just add a new optional
parameter to servers_add(), driver_connect_opts, that will be passed to
driver_connect().
In theory, instead of the new option to driver_connect(), a caller could
pass start=False to servers_add() and later call driver_connect()
manually with the right arguments. The problem is that start=False
avoids more than just calling driver_connect(), so it doesn't solve
the problem.
An example of using the new option is to run Scylla with authentication
enabled, and then connect to it using the correct default account
("cassandra"/"cassandra"):
    config = {
        'authenticator': 'PasswordAuthenticator',
        'authorizer': 'CassandraAuthorizer'
    }
    servers = await manager.servers_add(1, config=config,
        driver_connect_opts={'auth_provider':
            PlainTextAuthProvider(username='cassandra', password='cassandra')})
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
The existing test
topology_experimental_raft/test_alternator::test_localnodes_joining_nodes
tried to create a second server but *not* wait for it to complete, but
the trick it used (cancelling the task) doesn't work since commit 2ee063c
makes a list of unwaited tasks and waits for them anyway. The test
*appears* to work because it is the last test in the file, but if we
ever add another test in the same file (like I plan to do in the next
patch), that other test will find a "BROKEN" ScyllaClusterManager and
report that it failed :-(
Other tricks I tried to use (like killing the servers) also didn't work
because of various limitations and complications of the test framework
and all its layers.
So not wanting to fight the fragile testing framework any more at this
point, I just gave up and the test will *wait* for the second server
to come up. This adds 120 seconds (!) to the test, but since this whole
test file already takes more than 500 seconds to complete, let's bite
this bullet. Maybe in the future when the test framework improves, we can
avoid this 120 second wait.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>