scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-05 14:33:08 +00:00

Author	SHA1	Message	Date
Nadav Har'El	6496eab5ee	Merge 'Rename Alternator batch item count metrics' from Amnon Heiman This PR addresses multiple issues with alternator batch metrics: 1. Rename the metrics to scylla_alternator_batch_item_count with op=BatchGetItem/BatchWriteItem 2. The batch size calculation was wrong and didn't count all items in the batch. 3. Add a test to validate that the metrics values increase by the correct value (not just increase). This also requires an addition to the testing to validate ops of different metrics and an exact value change. Needs backporting to allow the monitoring to use the correct metrics names. Fixes #20571 Closes scylladb/scylladb#20646 * github.com:scylladb/scylladb: alternator:test_metrics test metrics for batch item count alternator:test_metrics Add validating the increased value alternator: Fix item counting in batch operations Alterntor rename batch item count metrics	2024-09-23 10:13:07 +03:00
Kefu Chai	fdc8773278	test/scylla_gdb: get table::_schema raw pointer with lw_shared_ptr This commit addresses an issue where accessing the raw pointer of the schema instance within `table::_schema` using `table.schema._p` was unreliable. before this change, `_p` was of type `lw_shared_ptr_counter_base`, a type-erased smart pointer, preventing direct casting to the underlying schema pointer. but we still cast it to `schema` anyway. this led to a gdb.MemoryError when dereferencing the deduced pointer: but the type of `_p` is `lw_shared_ptr_counter_base`, which is a type erased smart pointer, and it cannot be casted directly to the under pointer pointing to a `schema` instance. this results in: ``` Traceback (most recent call last): File "/home/avi/scylla/test/scylla_gdb/../../scylla-gdb.py", line 5554, in invoke self.print_key_type(seastar_lw_shared_ptr(schema['_clustering_key_type']).get().dereference(), 'clustering') File "/home/avi/scylla/test/scylla_gdb/../../scylla-gdb.py", line 5533, in print_key_type key_type = seastar_shared_ptr(key_type).get().dereference() ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ gdb.MemoryError: Cannot access memory at address 0x4000079656b0078 ``` when we are dereferencing the raw pointer deduced this way. in this change, we use the wrapper of `seastar_lw_shared_ptr` to safely obtain the raw pointer. * reenable this test previously disabled by `3d781c4f` tested using ```console $ SCYLLA=/home/kefu/dev/scylladb/master/build/release/scylla \ test/scylla_gdb/run -o junit_suite_name=scylla_gdb test_misc.py::test_schema ``` on an up-to-date fedora 40 installation. Refs `3d781c4f` Fixes scylladb/scylladb#20741 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20746	2024-09-22 18:30:16 +03:00
Avi Kivity	657848dcbb	cql3: statement_restrictions, expr: move restrictions-related expression utilities out of expression.cc Move all of the blatantly restriction-related expression utilities to statement_restrictions.cc. Some are so blatant as to include the word "restriction" in their name. Others are just so specialized that they cannot be used for anything else. The motivation is that further refactoring will be simplified if it can happen within the same module, as there will not be a need to prove it has no effect elsewhere. Most of the declarations are made non-public (in .cc file) to limit proliferation. A few are needed for tests or in select_statement.cc and so are kept public. Other than that, the only changes are namespace qualifications and removal of a now-duplicate definition ("inclusive"). Closes scylladb/scylladb#20732	2024-09-22 11:00:51 +03:00
Avi Kivity	3d781c4fc8	Update frozen toolchain * tools/java e505a6d3bb...5b0e274f12 (1): > Merge 'build.xml: install and use java-11 when building' from Kefu Chai Updates to clang 18.1.8 + LLVM patch to match Fedora 40. New optimized clang build generated and stored in https://devpkg.scylladb.com/clang/clang-18.1.8-x86_64.tar.gz https://devpkg.scylladb.com/clang/clang-18.1.8-aarch64.tar.gz Due to the loss of the jmx submodule, we no longer install java-11-openjdk. We add it in install-dependencies.sh here to compensate, pending a better solution. tools/java submodule updated to remove build failure where Java 8 was selected instead of Java 11. The scylla_gdb test suite was disabled due to a regression in gdb 15, which is brought in by the toolchain update [1]. [1] https://github.com/scylladb/scylladb/issues/20741.	2024-09-21 20:07:28 +03:00
Raphael S. Carvalho	999f1f1318	replica: Fix tablet split execute after restart let's assume there are 2 nodes, n1, n2. n1 is the coordinator. 1) n1 emits split 2) n1 and n2 complete split work 3) n1 becomes aware all replicas are ready for split 4) n2 restarts, but places split sstable into main group[1] 5) n1 executes split 6) n2 handles split completion, but see the main group is not empty [1]: During split, main group should only contain unsplit sstables. If all sstables are split, main must be empty. This is a result of replica not setting storage group to split mode on restart (using tablet map) and therefore sstables are incorrectly placed on main group. The fix is about looking at tablet map and setting group to split mode before sstables are populated into it. Refs #20626. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2024-09-20 22:28:09 -03:00
Dawid Mędrek	7b4f9c806c	treewide: Start using new overloads of describe We continue removing `data_dictionary::keyspace_element`. In this commit, we start using the overloads returning `cql3::description` in places where the methods specified by `data_dictionary::keyspace_element` were used.	2024-09-20 14:24:54 +02:00
Dawid Mędrek	39cf106151	treewide: Start using schema::ks_name() instead of schema::keyspace_name() We're going to remove the interface `data_dictionary::keyspace_element`. As `schema::keyspace_name()` is an implementation of one of the methods specified by that interface, we replace its uses by `schema::ks_name()`. `schema::keyspace_name()` was an alias for it, so no semantic change has occured.	2024-09-20 14:24:53 +02:00
Dawid Mędrek	78ab1ee8b7	test: Add tests for `CREATE ROLE WITH SALTED HASH`	2024-09-20 14:24:53 +02:00
Dawid Mędrek	35a92d189e	types: Introduce a function `cql3_type_name_without_frozen()` The introduced function returns the actual name of the type represented by `abstract_type`. It circumvents name processing like wrapping a type within `frozen<>` or using Cassandra's syntax. We add the function to be able to describe UDFs in the upcoming commits that require that their arguments not be `frozen<>`. We also test the implementation.	2024-09-20 14:24:53 +02:00
Nadav Har'El	3499c407f7	test: avoid silly "no_mode.1" labels when running tests outside test.py For the benefit of running test.py inside CI, we recently added to test/cql-pytest and test/alternator the knowledge of which "Scylla mode" (--mode) and "run number" is running (--run_id), although these concepts are alien to these two test frameworks (remember that those test frameworks can also run tests against unknown versions of Scylla or even our competitors' implementations). One unfortunate result of this change is that now if you run a test by using pytest directly (or test/*/run) instead of test.py, for example: $ cd test/alternator $ pytest --aws test_item.py::test_basic_string_put_and_get The test's success or failure reports the ugly name test_item.py::test_basic_string_put_and_get.no_mode.1 This unnecessary "no_mode.1" come from the the default values for --mode and --run_id, respectively. But there is no reason for these silly defaults. In this patch we change these defaults to None, and when they are None, they aren't tacked onto the test's name. This patch shouldn't affect running tests through test.py, because test.py always sets the --mode and --run_id options, and doesn't leave them as the default. Fixes #20512 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20513	2024-09-20 00:36:32 +03:00
Piotr Dulikowski	7e7701d436	Merge 'cql3/statements/select_statement: `SELECT ... USING SERVICE LEVEL`' from Michał Jadwiszczak Allow to specify service level used in select statement `SELECT ... USING SERVICE LEVEL sl_name`. In OSS, this only affects statement's timeout. In case both service level and timeout are specified `SELECT ... USING SERVICE LEVEL sl_name AND TIMEOUT 1h`, the timeout has higher priority as statement's timeout. Fixes scylladb/scylladb#18471 Closes scylladb/scylladb#20523 * github.com:scylladb/scylladb: test/cql-pytest: add test for `SELECT ... USING SERVICE LEVEL` cql3/Cql.g: extend grammar to allow `SELECT ... USING SERVICE LEVEL` cql3/statements/select_statement: use service level timeout cql3/attributes: add service level name field qos/service_level_controller: add method to check if service level exists in cache	2024-09-19 18:19:23 +02:00
Pavel Emelyanov	bd720dd2da	Merge 'cql3: statement_restrictions: adapt to functional style' from Avi Kivity The statement_restrictions class started life in the object-oriented style - an object that interacts with its environment via mutators and is observed via observers. This is however not suitable for its objective: to analyze the WHERE clause, select a query plan, and partition the WHERE clause atoms to the various parts demanded by the query plan (read_command and filters). Furthermore, the object oriented style makes it hard to work with as you can only call some observers after the related mutators were called. Fix this by transforming the code info a more functional style: we call a function that returns an immutable statement_restrictions object that can only be observed. This makes it easier to further change in the future, as changes will not have to consider interaction with the environment. No backport as this is a refactoring Closes scylladb/scylladb#20672 * github.com:scylladb/scylladb: cql3: statement_restrictions: use functional style cql3: statement_restrictions: calculate the index only once cql3: statement_restrictions: make it a const object	2024-09-19 18:18:28 +03:00
Kefu Chai	7985aa97b1	main, test: use seastar::handle_signal() instead use `seastar::handle_signal()` instead of `reactor::handle_signal()`. in a recent change in seastar (c3e826ad1197f2610138f3bcfaeb0b458f8fb799), the later was marked as deprecated in favor of the former, so let's use the recommended API. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20695	2024-09-19 18:10:07 +03:00
Kefu Chai	1fd1698a90	test: btree: use BOOST_DATA_TEST_CASE() when appropriate instead grouping tests with different parameters, let's parameterize them using `BOOST_DATA_TEST_CASE()`, simpler this way. and the tests can be more structured. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20697	2024-09-19 18:09:05 +03:00
Avi Kivity	6f7c2ce0aa	Merge 'cql_server::connection: Process rebounce message in case of multiple shard migrations' from Sergey Zolotukhin During a query execution, the query can be re-bounced to another shard if the requested data is located there. Previous implementation assumed that the shard cannot be changed after first re-bounce, however with the introduction of Tablets, data could be migrated to another shard after the query was already re-bounced, causing a failure of the query execution. To avoid this issue, the query is re-bounced as needed until it is executed on the correct shard. Fixes #15465 Closes scylladb/scylladb#20493 * github.com:scylladb/scylladb: cql_server: Add a test for multiple query msg rebounces. cql_server::connection: process: rebounce msg if needed cql_server::connection: process: co-routinize connection::process_on_shard cql_server: connection: process: fixup indentation cql_server: connection: process_on_shard: drop permit parameter transport: server: pass bounce_to_shard as foreign shared ptr cql_server: connection: process: add template concept for process_fn cql_server: move process_fn_return_type to class definition	2024-09-19 17:27:55 +03:00
Gleb Natapov	1b4c255ffd	test: amend test_replace_reuse_ip test to check that there is no stale writes after snapshot transfer starts	2024-09-19 15:24:59 +03:00
Pavel Emelyanov	8487f2fd93	treewide: Remove table::config::datadir It's write-only now, all the places than wanted to know where table's storage is, already use storage_options. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-19 13:06:39 +03:00
Pavel Emelyanov	5046cfab4b	test: Construct table_for_tests with table storage options The only place that constructs table_for_tests is make_table_for_tests helper. It can and should prepare the correct storage options, because that's the last place where the target directory is still known. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-19 13:06:39 +03:00
Pavel Emelyanov	eaad4f348b	test: Generalize pair of make_table_for_tests helpers They only differ in a way they get target directory from -- one via argument, andother from test_env. Respectively, the latter can call the former. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-19 13:06:39 +03:00
Pavel Emelyanov	d9ef9bdd3b	tests: Add helper to get snapshot directory from storage options There's a bunch of tests that check the contents of snapshot directory after creating one. Add a helper for those that gets this directory via storage options, not table config. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-19 13:06:39 +03:00
Kefu Chai	b0696bd842	test: btree: use BOOST_DATA_TEST_CASE to structure parameterized tests for better readability. and for more structured tests. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#20516	2024-09-18 14:16:28 +03:00
Pavel Emelyanov	eb22c2a8c8	Merge 'reader_concurrency_semaphore: improve the diagnostics dump' from Botond Dénes * Also dump diagnostics when a read times out while active (not queued). * Add the "Trigger permit" line, containing the details of the permit which caused the diagnostics dump (by e.g. timing out). * Add the "Identified bottleneck(s)" line, containing the identified bottlenecks which lead to permits being queued. This line is missing if no such bottleneck can be identified. * Document the new features, as well as the stat dump, which was added some time ago. Example of the new dump format: ``` INFO 2024-09-12 08:09:48,046 [shard 0:main] reader_concurrency_semaphore - Semaphore reader_concurrency_semaphore_dump_reader_diganostics with 8/10 count and 106192275/32768 memory resources: timed out, dumping permit diagnostics: Trigger permit: count=0, memory=0, table=ks.tbl0, operation=mutation-query, state=waiting_for_admission Identified bottleneck(s): memory permits count memory table/operation/state 3 2 26M ./push-view-updates-2/active 3 2 16M ks.tbl1/push-view-updates-1/active 1 1 15M ks.tbl2/push-view-updates-1/active 1 0 13M ks.tbl1/multishard-mutation-query/active 1 0 12M ks.tbl0/push-view-updates-1/active 1 1 10M ks.tbl3/push-view-updates-2/active 1 1 6060K ks.tbl3/multishard-mutation-query/active 2 1 1930K ks.tbl0/push-view-updates-2/active 1 0 1216K ks.tbl0/multishard-mutation-query/active 6 0 0B ks.tbl1/shard-reader/waiting_for_admission 3 0 0B ./data-query/waiting_for_admission 9 0 0B ks.tbl0/mutation-query/waiting_for_admission 2 0 0B ks.tbl2/shard-reader/waiting_for_admission 4 0 0B ks.tbl0/shard-reader/waiting_for_admission 9 0 0B ks.tbl0/data-query/waiting_for_admission 7 0 0B ks.tbl3/mutation-query/waiting_for_admission 5 0 0B ks.tbl1/mutation-query/waiting_for_admission 2 0 0B ks.tbl2/mutation-query/waiting_for_admission 8 0 0B ks.tbl1/data-query/waiting_for_admission 1 0 0B ./mutation-query/waiting_for_admission 26 0 0B permits omitted for brevity 96 8 101M total Stats: permit_based_evictions: 0 time_based_evictions: 0 inactive_reads: 0 total_successful_reads: 0 total_failed_reads: 0 total_reads_shed_due_to_overload: 0 total_reads_killed_due_to_kill_limit: 0 reads_admitted: 1 reads_enqueued_for_admission: 82 reads_enqueued_for_memory: 0 reads_admitted_immediately: 1 reads_queued_because_ready_list: 0 reads_queued_because_need_cpu_permits: 82 reads_queued_because_memory_resources: 0 reads_queued_because_count_resources: 0 reads_queued_with_eviction: 0 total_permits: 97 current_permits: 96 need_cpu_permits: 0 awaits_permits: 0 disk_reads: 0 sstables_read: 0 ``` Fixes: https://github.com/scylladb/scylladb/issues/19535 Improvement, no backport needed. Closes scylladb/scylladb#20545 * github.com:scylladb/scylladb: docs/dev/reader-concurrency-semaphore.md: update the documentation on diagnostics dumps test/boost/reader_concurrency_semaphore_test: test the new diagnostics functionality reader_concurrency_semaphore: add bottleneck self-diagnosis to diagnosis dump reader_concurrency_semaphore: include trigger permit in diagnostic dump reader_concurrency_semaphore: propagate permit to do_dump_reader_permit_diagnostics() reader_concurrency_semaphore: use consistent exception type for timeout reader_concurrency_semaphore: dump diagnostics when non-waiting reader times out	2024-09-18 14:06:05 +03:00
Pavel Emelyanov	2f4f0eb060	Merge 'Alternator: a few RBAC fixes' from Nadav Har'El The main goal of this PR is to fix a bug (#20619) in the alternator_enforce_authorization=false setting - which didn't do its job (i.e, _don't_ check permissions) when authorization is configured in CQL but not wanted in Alternator. The series also a few smaller bugs in the code that were discovered while debugging the main issue: 1. A potential use-after-free (that didn't seem to hit us in practice) is fixed. 2. A confusing error message (that was also reported in #20619) is improved. 3. Make the alternator_enforce_authorization live-updatable. There was no reason why it shouldn't be, and as this series needs to make this flag available to more code, let's just do it properly and assume the flag is live-updatable. Because the RBAC feature has not been backported to any open-source branches, neither should these fixes. But if some private branch received a backport of the RBAC feature, it should get these fixes too. Fixes #20619. Closes scylladb/scylladb#20640 * github.com:scylladb/scylladb: alternator: make alternator_enforce_authorization live-updateable alternator: fix alternator_enforce_authorization=false alternator: improve error message when unauthenticated alternator: avoid use-after-free in RBAC	2024-09-18 14:02:09 +03:00
Kefu Chai	cb1670b79b	Update seastar submodule * seastar ec5da7a6...69f88e2f (38): > build: s/Sanitizers_COMPILER_OPTIONS/Sanitizers_COMPILE_OPTIONS > test: Update httpd test with request/reply body writing sugar > http: Add sugar to request and response body writers > utils: Add util::write_to_stream() helper > seastar-addr2line: adjust llvm termination regex > README.md: add Crimson project > rpc: conditionally use fmt::runtime() based on SEASTAR_LOGGER_COMPILE_TIME_FMT > build: check the combination of Sanitizers > tls: clear session ticket before releasing > print: remove dead code > doc/lambda-coroutine-fiasco: reword for better readability > rpc: fix compilation error caused by fmt::runtime() > tutorial: explain the use case of rethrow_exception and coroutine::exception > reactor: print more informative error when io_submit fails > README.md: note GitHub discussions > prometheus: `fmt::print` to stringstream directly > doc: add document for testing with seastar > seastar/testing: only include used headers > test: Add abortable http client test cases > http/client: Add abortable make_request() API method > http/client: Abort established connections > http/client: Handle abort source in pool wait > http/client: Add abort source to factory::make() method > http/client: Pass abort_source here and there > http/client: Idnentation fix after previous patch > http/client: Merge some continuations explicitly > signal: add seastar signal api > httpd: remove unused prometheus structs > print: use fmtlib's fmt::format_string in format() > rpc: do not use seastar::format() in rpc logger > treewide: s/format/seastar::format/ > prometheus: sanitize label value for text protocol > tests: unit test prometheus wire format > io-tester: Introduce batches to rate-based submission > io-tester: Generalize issueing request and collecting its result > io-tester: Cancel intent once > io-tester: Dont carry rps/parallelism variables over lambdas > io-tester: Simplify in-flight management The breaking changes in the seastar submodule necessitate corresponding modifications in our code. These changes must be implemented together in a single commit to maintain consistency. So that each commit is buildable. following changes are included in addition to seastar submodule update: * instead of passing a `const char` for the format string, pass a templated `fmt::format_string<...>`, this depends on the `seastar::format()` change in seastar. explicitly call `fmt::runtime()` if the format string is not a consteval expression. this depends on the `seastar::format()` change in seastar. as `seastar::format()` does not accept a plain `const char` which is not constexpr anymore. pass abort_source to `dns_connection_factory::make()`. this depends on the change in seastar, which added a `abort_source` argument to the pure virtual member function of `connection_factory::make()`. call call {fmt,seastar}::format() explicitly. this is a follow up of `3e84d43f`, which takes care of all places where we should call `fmt::format()` and `seastar::format()` explicitly to disambiguate the `format()` call. but more `format()` call made their way into the source tree after `3e84d43f`. so we need fix them as well. * include used header in tests Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Update seastar submodule Please enter the commit message for your changes. Lines starting Closes scylladb/scylladb#20649	2024-09-18 13:59:22 +03:00
Amnon Heiman	8dec292698	alternator:test_metrics test metrics for batch item count This patch adds tests for the batch operations item count. The tests validate that the metrics tracking the number of items processed in a batch increase by the correct amount. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-09-18 11:31:06 +03:00
Amnon Heiman	4d57a43815	alternator:test_metrics Add validating the increased value The `check_increases_operation` now allows override the checked metric. Additionally, a custom validation value can now be passed, which make it possible to validate the amount by which a value has changed, rather than just validating that the value increased. The default behavior of validating that values have increased remains unchanged, ensuring backward compatibility. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-09-18 11:31:06 +03:00
Ernest Zaslavsky	924325fd25	treewide: add "prefix" parameter to backup API Allow the caller to pass the prefix when performing backup and restore Fixes scylladb/scylladb#20335 Closes scylladb/scylladb#20413	2024-09-18 08:25:00 +03:00
Nadav Har'El	24fb92c8ba	Merge 'cql3: simplify runtime component of selection filtering' from Avi Kivity Most of the analysis of the WHERE clause is done in statement_restrictions. It determines what parts to use for the primary or secondary index, and what parts to use for filtering. The difficult part is that it has a very wide interface. After construction, the user must pick the correct bits from many public functions. There are subtle interactions between them that are hard to untangle. This series simplifies the interface as it is used for selection filtering. In the end, only two public functions are used, both returning expressions: one for the partition-level filtering, one for the clustering row level filtering. In the end, the WHERE clause is factored into three parts: - one part goes into the read_command of the primary or secondary index - another part (that references only partition key columns and static key columns) is used to filter entire partitions - another part (that currently references only clustering key columns and regular columns, but one day may reference other columns) is used to filter clustering rows Refactoring, no backport. Closes scylladb/scylladb#20487 * github.com:scylladb/scylladb: cql3: statement_restrictions: drop accessors for single-column key restrictions cql3: selection: adjust indentation cql3: selection: delete empty loop cql3: statement_restrictions, selection: fold multi-column restrictions into row-level filter cql3: statement_restrictions, selection: merge clustering key filter and regular columns filter cql3: statement_restrictions, selection: merge partition key filter and static columns filter cql3: selection: filter regular and static rows as a single expression each cql3: statement_restrictions: collect regular column and static column filters into single expressions cql3: selection: filter clustering key as a single expression cql3: statement_restrictions: expose filter for clustering key cql3: selection: filter partition key as a single expression cql3: statement_restrictions: expose filter for partition key cql3: statement_restrictions: remove relations used for indexing from filtering cql3: statement_restrictions: bail out of find_idx if !_uses_secondary_index cql3: statement_restrictions, modification_statement: pass correct value of check_indexes cql3: statement_restrictions: correct mismatched clustering/partition restrictions references cql3: statement_restrictions: precalculate get_column_defs_for_filtering() cql3: selection: do_filter(): push static/regular row glue to higher level	2024-09-17 22:58:24 +03:00
Piotr Dulikowski	cc5c3aaae7	Merge 'message/messaging_service: guard adding maintenance tenant under cluster feature' from Michał Jadwiszczak In https://github.com/scylladb/scylladb/pull/18729, we introduced a new statement tenant `$maintenance`, but the change wasn't protected by any cluster feature. This wasn't a problem for OSS, since unknown isolation cookie just uses default scheduling group. However, in enterprise that leads to creating a service level on not-upgraded nodes, which may end up in an error if user create maximum number of service levels. This patch adds a cluster feature to guard adding the new tenant. It's done in the way to handle two upgrade scenarios: - version without `$maintenance` tenant -> version with `$maintenance` tenant guarded by a feature - version with `$maintenance` tenant but not guarded by a feature -> version with `$maintenance` tenant guarded by a feature The PR adds `enabled` flag to statement tenants. This way, when the tenant is disabled, it cannot be used to create a connection, but it can be used to accept an incoming connection. The `$maintenance` tenant is added to the config as disabled and it gets enabled once the corresponding feature is enabled. Fixes scylladb/scylladb#20070 Refs scylladb/scylla-enterprise#4403 Closes scylladb/scylladb#19802 * github.com:scylladb/scylladb: message/messaging_service: guard adding maintenance tenant under cluster feature message/messaging_service: add feature_service dependency message/messaging_service: add `enabled` flag to statement tenants	2024-09-17 18:24:34 +02:00
Avi Kivity	1663fbe717	cql3: statement_restrictions: use functional style Instead of a constructor, use a new function analyze_statement_restrictions() as the entry point. It returns an immutable statement_restrictions object. This opens the door to returning a variant, with each arm of the variant corresponding to a different query plan.	2024-09-17 17:13:27 +03:00
Sergey Zolotukhin	68740f57c2	cql_server: Add a test for multiple query msg rebounces. The test emulates several LWT(Lightweight Transaction) query rebounces. Currently, the code that processes queries does not expect that a query may be rebounced more than once. It was impossible with the VNodes, but with intruduction of the Tablets, data can be moved between shards by the balancer thus a query can be rebounced to different shards multiple times.	2024-09-17 15:19:56 +02:00
Nadav Har'El	00793059e1	alternator: fix alternator_enforce_authorization=false When the configuration has alternator_enforce_authorization=false, Alternator should not do authentication (check which user signed each request) nor authorization (check if that user has permissions to do each operation). Our implementation forgot to disable the authorization checks when it's configured to false. The (incorrect) assumption was that when alternator_enforce_authorization is configured to false, the CQL 'authenticator' and 'authorizer' configuration is also disabled - so the authorization checks will be no-ops. But we can't assume that: Users are free to configure 'authenticator' and 'authorizer' for use in CQL, and then set alternator_enforce_authorization=false just for Alternator. So this patch adds a new test for this case - when we have authenticator=PasswordAuthenticator, authorizer=CassandraAuthorizer but alternator_enforce_authorization=false, and fixes it to work correctly. The heart of the fix is trivial: the `verify_*_permission()` functions just need to check the alternator_enforce_authorization and return immediately when false. The bigger part of this change is to get the alternator_enforce_authorization into the "executor" object and then to pass it into the verify calls. Although alternator_enforce_authorization is not YET live updatable, this code is prepared for the future that it may become live updatable, so the executor object saves not the boolean value of this flag, but a live-updatable reference to it. Fixes #20619 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:50:00 +03:00
Andrei Chekun	bbb6c3c2ff	test.py: Add resource consumption metrics This PR adds the possibility to gather resource consumption metrics. The collected metrics can be used to compare performance before and after specific changes aimed at increasing performance. Currently, this functionality works only in manual mode, and this is just raw data. Later on, these metrics can be used in Jupyter notebook to analyze and visualize how the resources are used and can provide the insight on how to improve it. This PR is a first insight after gathering these metrics. Add the possibility to gather resource consumption for the test.py execution. SQLite DB will be created with different performance metrics that will allow comparing the resource consumption between changes. The DB will be in the tmp directory that by default set to testlog. Across the runs, the DB will not be deleted, so each new run will just add information to the existing DB. Parameter --get-metrics was added to switch on or off the metrics gathering. By default, it's switched on. Closes: scylladb/qa-tasks#1666 Closes: scylladb/qa-tasks#1707 Closes scylladb/scylladb#19881	2024-09-17 15:22:34 +03:00
Tomasz Grabiec	adf99402c5	Merge 'readers/flat_mutation_reader_v2: call set_close_required() from consume()' from Botond Dénes The `consume()` variants just forward the call to the `_impl` method with the same name. The latter, being a member of `::impl`, will bypass the top level `fill_buffer()`, etc. methods and thus will never call `set_close_required()`. Do this in the top-level `consume()` methods instead, to ensure a reader, on which only `consume()` is called, and then is destroyed, will complain as it should (and abort). Only one place was found in core code, which didn't close the reader: `split_mutation() in `mutation/mutation.cc` and this reader is the "from-mutation" one which has no real close routine. All other places were in tests. All this is to say, there were no real bugs uncovered by this PR. Fixes #16520 Improvement, no backport required. Closes scylladb/scylladb#16522 * github.com:scylladb/scylladb: readers/flat_mutation_reader_v2: call set_close_required() from consume*() test/boost/sstable_compaction_test: close reader after use test/boost/repair_test: close reader after use mutation/mutation: split_mutation(): close reader after use	2024-09-17 13:21:34 +02:00
Botond Dénes	6250ff18eb	Merge 'sstable: s/crawling_sstable_mutation_reader/sstable_full_scan_reader' from Kefu Chai "crawling" is a little bit obscure in this context. so let's rename this class to reflect the fact that this reader only reads the entire content of the sstable. both crawling reader for kl and mx formats are renamed. also, in order to be consistent, all "crawling reader" in variable names are updated as well. --- it's a cleanup, hence no need to backport. Closes scylladb/scylladb#20599 * github.com:scylladb/scylladb: sstable: s/crawling_sstable_mutation_reader/sstable_full_scan_reader sstable/mx/reader: add comment for mx_crawling_sstable_mutation_reader	2024-09-17 11:55:08 +03:00
Botond Dénes	f32e67cb9e	Merge 'Make sstables without on-disk path' from Pavel Emelyanov New sstables for a table are created by the table::make_sstable() method. The method then calls sstables_manager::make_sstable() and passes there a path to component files which, in turn, sits on table::config. Since some time ago having an on-disk path for an sstable had become optional, as sstables could be put on S3 storage without local paths involved. In that case the aforementioned "path" is ~~ab~~used as a key in the system.sstables registry, that references a record with information used to retrieve URLs of sstables' objects. This PR removes the "path" argument from sstables_manager::make_sstable() and its sstable_sdirectory peer. The details of sstables' location are moved onto storage_options and depend on storage type. For now in both storage types this location is still the good-old $datadir/$keyspace/$table-$uuid string. S3 storage needs to be patched more to use more elegant "location" value. Eventually the `table::config::{datadir\|all_datadirs}` will be removed, this PR is the step towards it. closes: #12707 Closes scylladb/scylladb#20542 * github.com:scylladb/scylladb: table: Use storage options to clean the storage sstables/storage: Re-use ocally generated vector of paths sstables/storage: Visit options once to initialize storage sstables_manager: Return table storage options when initalizing storage sstables/storage: Fix indentation after previous patch table: Move datadirs initialization parallelism to storage level sstables/storage: Split the visitor's overloaded functor restore: Don't use table_dir to construct sstable_directory sstable_directory: Remove table_dir field sstable_directory: Use options details in lister sstables_manager: Remove table_dir from make_sstable() sstables: Remove table_dir from sstable constructor sstables/storage: Remove sstring dir from make_storage() sstables/storage: Use options to construct tests: Properly initialize storage options with "dir" distributed_loader: Create S3 options with prefix for restore storage_options: Add special-purpose local options maker storage_options: Keep local path / s3 prefix onboard table: Get another options when initializing storage	2024-09-17 09:41:21 +03:00
Botond Dénes	a4a8cad97f	Merge 'atomic_delete: allow deletion of sstables from several prefixes' from Benny Halevy Allow create_pending_deletion_log to delete a bunch of sstables potentially resides in different prefixes (e.g. in the base directory and under staging/). The motivation arises from table::cleanup_tablet that calls compaction_group::cleanup on all cg:s via cleanup_compaction_groups. Cleanup, in turn, calls delete_sstables_atomically on all sstables in the compaction_group, in all states, including the normal state as well as staging - hence the requirement to support deleting sstables in different sub-directories. Also, apparently truncate calls delete_atomically for all sstables too, via table::discard_sstables, so if it happened to be executed during view update generation, i.e. when there are sstables in staging, it should hit the assertion failure reported in https://github.com/scylladb/scylladb/issues/18862 as well (although I haven't seen it yet, but I see no reason why it would happen). So the issue was apparently present since the initial implementation of the pending_delete_log. It's just that with tablet migration it is more likely to be hit. Fixes scylladb/scylladb#18862 Needs backport to 6.0 since tablets require this capability Closes scylladb/scylladb#19555 * github.com:scylladb/scylladb: sstable_directory: create_pending_deletion_log: place pending_delete log under the base directory sstables: storage: keep base directory in base class sstables: storage: define opened_directory in header file sstable_directory: use only dirlog	2024-09-17 08:30:40 +03:00
Kefu Chai	df7f332a58	sstable: s/crawling_sstable_mutation_reader/sstable_full_scan_reader "crawling" is a little bit obscure in this context. so let's rename this class to reflect the fact that this reader only reads the entire content of the sstable. both crawling reader for kl and mx formats are renamed. also, in order to be consistent, all "crawling reader" in variable names are updated as well. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-09-17 10:39:37 +08:00
Michał Jadwiszczak	71a03ef6b0	message/messaging_service: add feature_service dependency	2024-09-16 15:33:40 +02:00
Michał Jadwiszczak	de7acbad8b	test/cql-pytest: add test for `SELECT ... USING SERVICE LEVEL`	2024-09-16 14:31:43 +02:00
Pavel Emelyanov	bf5021e735	test: Remove sstables::test::binary_search() That's the most mysterious wrapper in this set as it doesn't need sstable itself at all, it just duplicates the existing non-class function out there. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-16 12:51:35 +03:00
Pavel Emelyanov	309d315af7	test: Remove sstables::test::move_summary() This one is a bit tricky, as it needs to modify the sstables's summary. However, the sstables::test::_summary() one returns mutable reference and the only caller can use it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-16 12:50:48 +03:00
Pavel Emelyanov	deec952111	test: Remove sstables::test::read_toc() The sstable::read_toc() is public method, use it directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-16 12:50:19 +03:00
Pavel Emelyanov	25cd8ccdd8	test: Remove sstables::test::get_summary() Same as previous patch -- callers can come with const reference to summary, so they can live with existing public sstable::get_summary(). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-16 12:49:39 +03:00
Pavel Emelyanov	f714ac9b48	test: Remove sstables::test::get_statistics() Just call the public sstable::get_statistics(). The callers would get const reference on it, but they don't need more than that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-16 12:48:43 +03:00
Pavel Emelyanov	53afa583e8	test: Remove sstables::test::data_read() The wrapper just changes the order of arguments for a public method. Drop it, and call the wrapee directly. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2024-09-16 12:47:59 +03:00
Botond Dénes	ea29fe579b	Merge 'replica: ignore cleanup of deallocated storage group' from Aleksandra Martyniuk Cleanup of a deallocated tablet throws an exception. Since failed cleanup is retried, we end up in an infinite loop. Ignore cleanup of deallocated storage groups. Fixes: #19752. Needs to be backported to all branches with tablets (6.0 and later) Closes scylladb/scylladb#20584 * github.com:scylladb/scylladb: test: check if cleanup of deallocated sg is ignored replica: ignore cleanup of deallocated storage group	2024-09-16 09:22:56 +03:00
Avi Kivity	65ba19323c	cql3: statement_restrictions, modification_statement: pass correct value of check_indexes Our UPDATE/INSERT/DELETE statements require a full primary/partition key and therefore never use indexes; fix the check_index parameter passed from modification_statement. So far the bug is benign as we did not take any action on the value. Make the parameter non-default to avoid such confusion in the future.	2024-09-15 14:33:56 +03:00
Avi Kivity	b9bc783418	cql3: selection: don't ignore regular column restriction if a regular row is not present If a regular row isn't present, no regular column restriction (say, r=3) can pass since all regular columns are presented as NULL, and we don't have an IS NULL predicate. Yet we just ignore it. Handle the restriction on a missing column by return false, signifying the row was filtered out. We have to move the check after the conditional checking whether there's any restriction at all, otherwise we exit early with a false failure. Unit test marked xfail on this issue are now unmarked. A subtest of test_tombstone_limit is adjusted since it depended on this bug. It tested a regular column which wasn't there, and this bug caused the filter to be ignored. Change to test a static column that is there. A test for a bug found while developing the patch is also added. It is also tested by test_tombstone_limit, but better to have a dedicated test. Fixes #10357 Closes scylladb/scylladb#20486	2024-09-15 13:44:16 +03:00
Botond Dénes	6d8e9645ce	test/*/run: restore --vnodes into working order This option was silently broken when --enable-tablet's default changed from false to true. The reason is that when --vnodes is passed, run only removes --enable-tablets=true from scylla's command line. With the new default this is not enough, we need to explicitely disable tablets to override the default. Closes scylladb/scylladb#20462	2024-09-13 17:10:09 +03:00

... 84 85 86 87 88 ...

11801 Commits