scylladb

Author	SHA1	Message	Date
Avi Kivity	847c850034	schema: add accessors for primary key columns and non-primary-key columns It's somewhat common to ask for the partition key and clustering key columns, or for the static and regular columsn. Provide accessors for them rather than requiring the user to glue them. Some callers are converted. Closes scylladb/scylladb#21191	2024-10-22 15:01:14 +02:00
Kefu Chai	6ead5a4696	treewide: move log.hh into utils/log.hh the log.hh under the root of the tree was created keep the backward compatibility when seastar was extracted into a separate library. so log.hh should belong to `utils` directory, as it is based solely on seastar, and can be used all subsystems. in this change, we move log.hh into utils/log.hh to that it is more modularized. and this also improves the readability, when one see `#include "utils/log.hh"`, it is obvious that this source file needs the logging system, instead of its own log facility -- please note, we do have two other `log.hh` in the tree. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-10-22 06:54:46 +03:00
Kefu Chai	5cd619a60c	treewide: s/boost::adaptors::map_keys/std::views::keys/ now that we are allowed to use C++23. we now have the luxury of using `std::views::keys`. in this change, we: - replace `boost::adaptors::map_keys` with `std::views::keys` - update affected code to work with `std::views::keys` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21198	2024-10-21 12:47:52 +03:00
Avi Kivity	c3be2489ce	treewide: drop includes of <boost/range/adaptors.hpp> This includes way too much, including <boost/regex.hpp>, which is huge. Drop includes of adaptors.hpp and replace by what is needed. Closes scylladb/scylladb#21187	2024-10-20 17:17:11 +03:00
Avi Kivity	820509026f	schema: replace boost ranges with std ranges To reduce dependency load, use std ranges instead of boost ranges. The std::ranges::{lower,upper}_bound don't support heterogeneous lookup, but a more natural solution is to use a projection to search for the name, so we use that and the custom comparator is removed. Many callers are converted as well due to poor interoperability between boost ranges and std ranges.	2024-10-15 16:42:54 +03:00
Nadav Har'El	45ccceb137	alternator: add "dc" and "rack" options to "/localnodes" request Before this patch, the "/localnodes" HTTP request to the Alternator server lists all the live nodes of the current DC. This patch adds two optional parameters to this query: dc: allows to list the live nodes of a specific named DC instead of the current DC of the server. rack: allows to restrict the results to just the nodes belonging to a specific named rack. For both options, if no live node exists in the given dc or rack (in particular, if such a dc or rack doesn't even exist), an empty list is returned - it's not an error. The default, if dc or rack is not specified - remains exactly as it is today - look at the current DC (the one of the node being request), and do not restrict the list to any specific rack. We expect the new options that we added here to be useful for two use cases: 1. A client that knows of some Scylla node (belonging to an unknown DC), but wants to list the nodes in its DC, which it knows by name. 2. A client in a multi-rack DC (e.g., multi-AZ region in AWS) that wants to send requests to nodes in its own rack (which it knows by name), to avoid cross-rack networking costs. Note that in both cases, this requires clients to know the names of DCs and AZs via some out-of-band means. The client can also get a list of DCs and racks using the system.local system table, as the tests included in this patch demonstrate. This patch includes two set of tests for these new options: One in the the single-node test/alternator framework that has a single dc and rack but can still check the case of an unknown dc or rack (in which case an empty list is returned). The second test is in the topology framework, and runs an 8-node cluster with two DCs, two racks, and two nodes in each, and checks all the combinations of "/localnodes" requests with and without dc and rack options. This test also resolves a longstanding TODO that asked for such a multi-DC test for "/localnodes" to be written. Fixes #12147 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20915	2024-10-07 20:53:47 +03:00
Avi Kivity	93afc77307	raft_group0_client: uninclude "mutation/mutation.hh" Lighten the dependency load. Some constructors and destructors are uninlined to avoid the header depending on the mutation class.	2024-09-28 16:31:53 +03:00
Nadav Har'El	7715abfc56	Merge 'Alternator store ProvisionedThroughput' from Amnon Heiman When users create a table using the Alternator API, they can decide if the billing is PROVISIONED of PAY_PER_REQUEST. If the billing is set to PROVISIONED, they need to set the ProvisionedThroughput ReadCapacityUnits (RCU) and WriteCapacityUnits (WCU). This series adds support for getting and setting the ProvisionedThroughput. The values will be stored as table extension tags. Following how TTL is stored within the Alternator, we will use ```system:rcu_attribute``` and ```system:wcu_attribute``` for the labels. The series adds a test that sets ProvisionedThroughput and validates that it gets the value back. It was tested with both Alternator and AWS. This series is part of the effort to monitor, limit, and bill Alternator operations. New code, no need to backport. Closes scylladb/scylladb#20056 * github.com:scylladb/scylladb: docs/alternator/compatibility.md: explain the consumed capacity provisioned Add test/alternator/test_provisioned_throughput.py test/alternator/util.py: Allow override BillingMode alternator/executor.cc: Store ProvisionedThroughput	2024-09-26 01:23:17 +03:00
Nadav Har'El	6496eab5ee	Merge 'Rename Alternator batch item count metrics' from Amnon Heiman This PR addresses multiple issues with alternator batch metrics: 1. Rename the metrics to scylla_alternator_batch_item_count with op=BatchGetItem/BatchWriteItem 2. The batch size calculation was wrong and didn't count all items in the batch. 3. Add a test to validate that the metrics values increase by the correct value (not just increase). This also requires an addition to the testing to validate ops of different metrics and an exact value change. Needs backporting to allow the monitoring to use the correct metrics names. Fixes #20571 Closes scylladb/scylladb#20646 * github.com:scylladb/scylladb: alternator:test_metrics test metrics for batch item count alternator:test_metrics Add validating the increased value alternator: Fix item counting in batch operations Alterntor rename batch item count metrics	2024-09-23 10:13:07 +03:00
Pavel Emelyanov	2f4f0eb060	Merge 'Alternator: a few RBAC fixes' from Nadav Har'El The main goal of this PR is to fix a bug (#20619) in the alternator_enforce_authorization=false setting - which didn't do its job (i.e, _don't_ check permissions) when authorization is configured in CQL but not wanted in Alternator. The series also a few smaller bugs in the code that were discovered while debugging the main issue: 1. A potential use-after-free (that didn't seem to hit us in practice) is fixed. 2. A confusing error message (that was also reported in #20619) is improved. 3. Make the alternator_enforce_authorization live-updatable. There was no reason why it shouldn't be, and as this series needs to make this flag available to more code, let's just do it properly and assume the flag is live-updatable. Because the RBAC feature has not been backported to any open-source branches, neither should these fixes. But if some private branch received a backport of the RBAC feature, it should get these fixes too. Fixes #20619. Closes scylladb/scylladb#20640 * github.com:scylladb/scylladb: alternator: make alternator_enforce_authorization live-updateable alternator: fix alternator_enforce_authorization=false alternator: improve error message when unauthenticated alternator: avoid use-after-free in RBAC	2024-09-18 14:02:09 +03:00
Kefu Chai	cb1670b79b	Update seastar submodule * seastar ec5da7a6...69f88e2f (38): > build: s/Sanitizers_COMPILER_OPTIONS/Sanitizers_COMPILE_OPTIONS > test: Update httpd test with request/reply body writing sugar > http: Add sugar to request and response body writers > utils: Add util::write_to_stream() helper > seastar-addr2line: adjust llvm termination regex > README.md: add Crimson project > rpc: conditionally use fmt::runtime() based on SEASTAR_LOGGER_COMPILE_TIME_FMT > build: check the combination of Sanitizers > tls: clear session ticket before releasing > print: remove dead code > doc/lambda-coroutine-fiasco: reword for better readability > rpc: fix compilation error caused by fmt::runtime() > tutorial: explain the use case of rethrow_exception and coroutine::exception > reactor: print more informative error when io_submit fails > README.md: note GitHub discussions > prometheus: `fmt::print` to stringstream directly > doc: add document for testing with seastar > seastar/testing: only include used headers > test: Add abortable http client test cases > http/client: Add abortable make_request() API method > http/client: Abort established connections > http/client: Handle abort source in pool wait > http/client: Add abort source to factory::make() method > http/client: Pass abort_source here and there > http/client: Idnentation fix after previous patch > http/client: Merge some continuations explicitly > signal: add seastar signal api > httpd: remove unused prometheus structs > print: use fmtlib's fmt::format_string in format() > rpc: do not use seastar::format() in rpc logger > treewide: s/format/seastar::format/ > prometheus: sanitize label value for text protocol > tests: unit test prometheus wire format > io-tester: Introduce batches to rate-based submission > io-tester: Generalize issueing request and collecting its result > io-tester: Cancel intent once > io-tester: Dont carry rps/parallelism variables over lambdas > io-tester: Simplify in-flight management The breaking changes in the seastar submodule necessitate corresponding modifications in our code. These changes must be implemented together in a single commit to maintain consistency. So that each commit is buildable. following changes are included in addition to seastar submodule update: * instead of passing a `const char` for the format string, pass a templated `fmt::format_string<...>`, this depends on the `seastar::format()` change in seastar. explicitly call `fmt::runtime()` if the format string is not a consteval expression. this depends on the `seastar::format()` change in seastar. as `seastar::format()` does not accept a plain `const char` which is not constexpr anymore. pass abort_source to `dns_connection_factory::make()`. this depends on the change in seastar, which added a `abort_source` argument to the pure virtual member function of `connection_factory::make()`. call call {fmt,seastar}::format() explicitly. this is a follow up of `3e84d43f`, which takes care of all places where we should call `fmt::format()` and `seastar::format()` explicitly to disambiguate the `format()` call. but more `format()` call made their way into the source tree after `3e84d43f`. so we need fix them as well. * include used header in tests Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Update seastar submodule Please enter the commit message for your changes. Lines starting Closes scylladb/scylladb#20649	2024-09-18 13:59:22 +03:00
Amnon Heiman	905408f764	alternator: Fix item counting in batch operations This patch fixes the logic for counting items in batch operations. Previously, the item count in requests was inaccurate, it count the number of tabels in get_item and the request_items in write_items. The new logic correctly counts each individual item in `BatchGetItem` and `BatchWriteItem` requests. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-09-18 11:30:59 +03:00
Amnon Heiman	515857a4a9	Alterntor rename batch item count metrics This patch renames metrics tracking the total number of items in a batch to `scylla_alternator_batch_item_count`. It uses the existing `op` label to differentiate between `BatchGetItem` and `BatchWriteItem` operations. Ensures better clarity and distinction for batch operations in monitoring. This an example of how it looks like: # HELP scylla_alternator_batch_item_count The total number of items processed across all batches # TYPE scylla_alternator_batch_item_count counter scylla_alternator_batch_item_count{op="BatchGetItem",shard="0"} 4 scylla_alternator_batch_item_count{op="BatchWriteItem",shard="0"} 4	2024-09-18 11:20:07 +03:00
Nadav Har'El	17deaae463	alternator: make alternator_enforce_authorization live-updateable For no good reason, the "alternator_enforce_authorization" flag (which chooses whether to enable authentication and authorization checks in Alternator) was not live-updatable, so make it so. Both "server" and "executor" objects use this configuration flag, the former is fixed in this patch (to hold a live-updatable reference instead of a copy of a boolean), the latter was already prepared for this change and already held a live-updatable reference. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:51:16 +03:00
Nadav Har'El	00793059e1	alternator: fix alternator_enforce_authorization=false When the configuration has alternator_enforce_authorization=false, Alternator should not do authentication (check which user signed each request) nor authorization (check if that user has permissions to do each operation). Our implementation forgot to disable the authorization checks when it's configured to false. The (incorrect) assumption was that when alternator_enforce_authorization is configured to false, the CQL 'authenticator' and 'authorizer' configuration is also disabled - so the authorization checks will be no-ops. But we can't assume that: Users are free to configure 'authenticator' and 'authorizer' for use in CQL, and then set alternator_enforce_authorization=false just for Alternator. So this patch adds a new test for this case - when we have authenticator=PasswordAuthenticator, authorizer=CassandraAuthorizer but alternator_enforce_authorization=false, and fixes it to work correctly. The heart of the fix is trivial: the `verify_*_permission()` functions just need to check the alternator_enforce_authorization and return immediately when false. The bigger part of this change is to get the alternator_enforce_authorization into the "executor" object and then to pass it into the verify calls. Although alternator_enforce_authorization is not YET live updatable, this code is prepared for the future that it may become live updatable, so the executor object saves not the boolean value of this flag, but a live-updatable reference to it. Fixes #20619 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:50:00 +03:00
Nadav Har'El	76af7c0389	alternator: improve error message when unauthenticated When access-control checks report permission denied, we want to report the name of the authenticated role (the role signing the request) which didn't have the permission. When authentication was disabled, and there is no authenticated role, we printed the fake name "anonymous", but this can confuse users (it confused me!) to think there's an actual role named "anonymous". So let's change that string to "<anonymous>" with angle brackets - it makes it more obvious that this isn't a real role, but actually an anonymous request. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:44:29 +03:00
Nadav Har'El	3543bf14e9	alternator: avoid use-after-free in RBAC While auditing the code, I noticed that the current Alternator access control checks have code like: ``` return client_state.check_has_permission(auth::command_desc( permission_to_check, auth::make_data_resource(schema->ks_name(), schema->cf_name()))).then( ``` There's a problem here - it turns out that, unfortunately, command_desc holds a reference to the "resource" object - not a copy. So the temporary object returned by make_data_resource may be freed and then used... Curiously, we've not seen a bug caused by this in practice (not even in debug build mode), but better safe than sorry, so this patch changes the code in one of two ways: 1. Code using coroutines can keep the "resource" as a variable on the stack. 2. Code using continuations needs to hold the "resource" with do_with(), but since this already incurs the cost of an extra allocation (even in the successful case), might as well just switch to using coroutines and have less ugly code. This patch does not change any functionality, and all the tests seem to work before and after it the same. Signed-off-by: Nadav Har'El <nyh@scylladb.com> hello	2024-09-17 15:41:09 +03:00
Nadav Har'El	930accad12	alternator: return error on unused AttributeDefinitions A CreateTable request defines the KeySchema of the base table and each of its GSIs and LSIs. It also needs to give an AttributeDefinition for each attribute used in a KeySchema - which among other things specifies this attribute's type (e.g., S, N, etc.). Other, non-key, attributes do not have a specified type, and accordingly must not be mentioned in AttributeDefinitions. Before this patch, Alternator just ignored unused AttributeDefinitions entries, whereas DynamoDB throws an error in this case. This patch fixes Alternator's behavior to match DynamoDB's - and adds a test to verify this. Besides being more error-path-compatible with DynamoDB, this extra check can also help users: We already had one user complaining that an AttributeDefinitions setting he was using was ignored, not realizing that it wasn't used by any KeySchema. A clear error message would have saved this user hours of investigation. Fixes #19784. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20378	2024-09-12 15:37:18 +03:00
Kefu Chai	3e84d43f93	treewide: use seastar::format() or fmt::format() explicitly before this change, we rely on `using namespace seastar` to use `seastar::format()` without qualifying the `format()` with its namespace. this works fine until we changed the parameter type of format string `seastar::format()` from `const char*` to `fmt::format_string<...>`. this change practically invited `seastar::format()` to the club of `std::format()` and `fmt::format()`, where all members accept a templated parameter as its `fmt` parameter. and `seastar::format()` is not the best candidate anymore. despite that argument-dependent lookup (ADT for short) favors the function which is in the same namespace as its parameter, but `using namespace` makes `seastar::format()` more competitive, so both `std::format()` and `seastar::format()` are considered as the condidates. that is what is happening scylladb in quite a few caller sites of `format()`, hence ADT is not able to tell which function the winner in the name lookup: ``` /__w/scylladb/scylladb/mutation/mutation_fragment_stream_validator.cc:265:12: error: call to 'format' is ambiguous 265 \| return format("{} ({}.{} {})", _name_view, s.ks_name(), s.cf_name(), s.id()); \| ^~~~~~ /usr/bin/../lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/format:4290:5: note: candidate function [with _Args = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 4290 \| format(format_string<_Args...> __fmt, _Args&&... __args) \| ^ /__w/scylladb/scylladb/seastar/include/seastar/core/print.hh:143:1: note: candidate function [with A = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 143 \| format(fmt::format_string<A...> fmt, A&&... a) { \| ^ ``` in this change, we change all `format()` to either `fmt::format()` or `seastar::format()` with following rules: - if the caller expects an `sstring` or `std::string_view`, change to `seastar::format()` - if the caller expects an `std::string`, change to `fmt::format()`. because, `sstring::operator std::basic_string` would incur a deep copy. we will need another change to enable scylladb to compile with the latest seastar. namely, to pass the format string as a templated parameter down to helper functions which format their parameters. to miminize the scope of this change, let's include that change when bumping up the seastar submodule. as that change will depend on the seastar change. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-09-11 23:21:40 +03:00
Amnon Heiman	c76347032d	alternator/executor.cc: Store ProvisionedThroughput This patch adds the ability to store and retrieve the ProvisionedThroughput in a table. The information is stored in the table tags. We use the TTL convention used in alternator, and the tags will be: system:provisioned_rcu and system:provisioned_wcu. verify_billing_mode function now return a struct with the billing mode information. The code of describe_table now check if the provision tags exists and return the RCU and WCU accordingly. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-09-10 17:06:40 -04:00
Nadav Har'El	80a0798e77	alternator: better error message in some cases of key type mismatch Alternator uses a common function get_typed_value() to read the values of key attribute and confirm they have the expected type (key attributes have a fixed type in the schema). If the type is wrong, we want to print a "Type mismatch" error message. But the current implementation did the checks in the wrong order, and as a result could print a "Malformed value object" message instead of a "Type mismatch". That could happen if the wrong type is a boolean, map, list, or basically any type whose JSON representation is not a string. The allowed key types - bytes), string and number - all have string representations in JSON, but still we should first report the mismatched type and only report the "Malformed object" if the type matches but the JSON is faulty. In addition to fixing the error message, we fix an existing test which complained in a comment (but ignored) that the error message in some case (when trying to use a map where a key is expected) the strange "Malformed value object" instead of the expected "Type mismatch". The next patch will add an additional reproducer for this problem and its fix. That test will do: ``` with pytest.raises(ClientError, match='ValidationException.*mismatch'): test_table_gsi_6.put_item(Item={'p': p, 's': True}) ``` I.e., it tries to set a boolean value for a string key column, and expect to get the "Type mismatch" error and not the ugly "Malformed value object". Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-09 13:14:49 +03:00
Nadav Har'El	cf5d7ce212	Alternator: drop unneeded "IS NOT NULL" clauses in MV of GSI/LSI Scylla's materialized views naturally skip any base rows where the view's key isn't set (is NULL), because we can't create a view row with a null key. To make the user aware that this is happening, the user is required to add "WHERE ... IS NOT NULL" for the view's key columns when defining the view. However, the only place that these extra IS NOT NULL clauses are checked are in the CQL "CREATE MATERIALIZED VIEWS" statement - they are completely ignored in all other places in the code. In particular, when we create a materialized view in Alternator (GSI or LSI), we don't have to add these "IS NOT NULL" clauses, as they are outright ignored. We didn't know they were ignored, and made an effort to add them - but no matter how incorrectly we did it, it didn't matter :-) In commit `2bf2ffd3ed` it turned out we had a typo that caused the wrong column name to be printed. Also, even today we are still missing base key columns that aren't listed as a view key in Alternator but still added as view clustering keys in Scylla - and again the fact these were missing also didn't matter. So I think it's time to stop pretending, and stop calculating these "IS NOT NULL" strings, so this patch outright removes them from the Alternator view-creation code. Beyond being a nice cleanup of unnecessary and inaccurate code, it will also be necessary when we allow in later patches to index for an Alternator attribute "x" not a real column x in the base table but rather an element in the ":attrs" map - so adding a "x IS NOT NULL" isn't only unnecessary, it is outright illegal: The expression evaluation code, even though it doesn't do anything with the "IS NOT NULL" expression, still verifies that "x" is a valid column, which it isn't. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-09 12:09:25 +03:00
Nadav Har'El	15f8046fcb	alternator ttl: fix use-after-free The Alternator TTL scanning code uses an object "scan_ranges_context" to hold the scanning context. One of the members of this object is a service::query_state, and that in turn holds a reference to a service::client_state. The existing constructor created a temporary client_state object and saved a reference to it - which can result in use after free as the temporary object is freed as soon as the constructor ends. The fix is to save a client_state in the scan_ranges_context object, instead of a temporary object. Fixes #19988 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20418	2024-09-03 22:15:18 +03:00
Nadav Har'El	dd030f8112	alternator: improve RBAC access denied error messages This patch address two requests made by reviewers of the original "Add CQL-based RBAC support to Alternator" series. Both requests were about the error messages produced when access is denied: 1. The error message is improved to use more proper English, and also to include the name of the role which was denied access. 2. The permission-check and error-message-formatting code is de-duplicated, using a common function verify_permission(). This de-duplication required moving the access-denied error path to throwing an exception instead of the previous exception-free implementation. However, it can be argued that this change is actually a good thing, because it makes the successful case, when access is allowed, faster. The de-duplicated code is shorter and simpler, and allowed changing the text of the error message in just one place. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20326	2024-09-03 14:39:30 +03:00
Avi Kivity	b13ab90448	Merge 'alternator/executor: Use native reversed format' from Łukasz Paszkowski When executing reversed queries, a native revered format shall be used. Therefore, the table schema and the clustering key bounds are reversed before a partition slice and a read command are constructed. It is, however, possible to run a reversed query passing a table schema but only when there are no restrictions on the clustering keys. In this particular situation, the query returns correct results. Since the current alternator tests in test.py do not imply any restrictions, this situation was not caught during development of https://github.com/scylladb/scylladb/pull/18864. Hence, additional tests are provided that add clustering keys restrictions when executing reversed queries to capture such errors earlier than in dtests. Additional manual tests were performed to test a mixed-node cluster (with alternator API enabled in Scylla on each node): 1. 2-node cluster with one node upgraded: reverse read queries performed on an old node 2. 2-node cluster with one node upgraded: reverse read queries performed on a new node 3. 2-node cluster with one node upgraded and all its sstable files deleted to trigger repair: reverse read queries performed on an old node 4. 2-node cluster with one node upgraded and all its sstable files deleted to trigger repair: reverse read queries performed on a new node All reverse read queries above consists of: - single-partition reverse reads with no clustering key restrictions, with single column restrictions and multi column restrictions both with and without paging turned on The exact same tests were also performed on a fully upgraded cluster. Fixes https://github.com/scylladb/scylladb/issues/20191 No backport is required as this is a complementary patch for the series https://github.com/scylladb/scylladb/pull/18864 that did not require backporting. Closes scylladb/scylladb#20205 * github.com:scylladb/scylladb: test_query.py: Test reverse queries with clustering key bounds alternator::do_query Add additional trace log alternator::do_query: Use native reversed format alternator::do_query Rename schema with table_schema	2024-08-27 20:40:49 +03:00
Benny Halevy	686a8f2939	abstract_replication_strategy: make get_ranges async To prevent stalls due to large number of tokens. For example, large cluster with say 70 nodes can have more than 16K tokens. Fixes #19757 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-08-25 10:57:34 +03:00
Benny Halevy	824bdf99d2	alternator: ttl: token_ranges_owned_by_this_shard: let caller make the ranges_holder Add static `make` methods to ranges_holder_{primary,secondary} and use them to make the ranges objects and pass them to `token_ranges_owned_by_this_shard`, rather than letting token_ranges_owned_by_this_shard invoke the right constructor of the ranges_holder class. Prepare for making `make` async. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-08-25 10:25:32 +03:00
Benny Halevy	b2abbae24b	alternator: ttl: can pass const gms::gossiper& to ranges_holder There's no need to pass a mutable reference to the gossiper. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-08-25 10:25:32 +03:00
Benny Halevy	333c0d7c88	alternator: ttl: ranges_holder_primary: unconstify _token_ranges member To allow the class to be nothrow_move_constructable. Prepare for returning it as a future value. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-08-25 10:25:32 +03:00
Benny Halevy	d385219a12	alternator: ttl: refactor token_ranges_owned_by_this_shard Rather than holding a variant member (and defining both ranges_holder_{primary,secondary} in both specilizations of the class, just make the internal ranges_holder class first-class citizens and parameterize the `token_ranges_owned_by_this_shard` template by this class type. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-08-25 10:25:32 +03:00
Łukasz Paszkowski	f29d7ffa81	alternator::do_query Add additional trace log Additional log prints information on the read query being executed. It lists information like whether the query is a reversed one or not, and table_schema and query_schema versions.	2024-08-20 20:56:15 +02:00
Łukasz Paszkowski	727cbd8151	alternator::do_query: Use native reversed format When executing reversed queries, a native revered format shall be used. Therefore the table schema and the clustering key bounds are reversed before a partition slice and a read command are constructed. Similarly as for cql3::statements::select_statement.	2024-08-20 20:56:15 +02:00
Łukasz Paszkowski	3720e8aabe	alternator::do_query Rename schema with table_schema In order to increase readability, a schema variable is renamed to a table_schema to emphesize a table schema is passed to the function and used across it. Allows us to introduce a query_schema variable in the next patch.	2024-08-20 20:56:06 +02:00
Nadav Har'El	f9ff475dfb	alternator: add RBAC enforcement to GetRecords This patch adds a requirement for the "SELECT" permission on a table to run a GetRecords on it (the DynamoDB Streams API, i.e., CDC). The grant is checked on the CDC log table - not on the base table, which allows giving a role the ability to read the base but not is change stream, or vice versa. The operations ListStreams, DescribeStreams, GetShardIterators do not require any permissions to run - they do not read any data, and are (in my opinion) similar in spirit to DescribeTable, so I think it's fine not to require any permissions for them. A test is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:57:53 +02:00
Nadav Har'El	9417cf8bcf	alternator: add RBAC enforcement to UpdateTimeToLive This patch adds a requirement for the "ALTER" permission on a table to run a UpdateTimeToLive on it. UpdateTimeToLive is similar in purpose to UpdateTable, so it makes sense to use the same permission "ALTER" as we do for UpdateTable. A tests is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:57:53 +02:00
Nadav Har'El	e76316495c	alternator: add RBAC enforcement to TagResource and UntagResource This patch adds a requirement for the "ALTER" permission on a table to run the TagResource or UntagResource operations on it. CQL does not have an exact parallel of DynamoDB's tagging feature, but our usual use of tags as an extension of UpdateTable to change non-standard options (e.g., write isolation policy or tablets setup), so it makes sense to require the same permissions we require for UpdateTable - namely "ALTER". A test for both operations is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:57:53 +02:00
Nadav Har'El	fda4a9fad8	alternator: add RBAC enforcement to BatchGetItem This patch adds a requirement for the "SELECT" permission on a table to run a BatchGetItem on it. A single batch may ask to write to several different tables, so we fail the entire batch with AccessDeniedException if any of the tables mentioned in the batch do not have SELECT permissions for this role. A tests is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:57:51 +02:00
Nadav Har'El	b02288785f	alternator: add RBAC enforcement to BatchWriteItem This patch adds a requirement for the "MODIFY" permission on a table to run a BatchWriteItem on it. A single batch may ask to write to several different tables, so we fail the entire batch with AccessDeniedException if any of the tables mentioned in the batch do not have MODIFY permissions for this role. A tests is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:56:28 +02:00
Nadav Har'El	445a5d57cd	alternator: add RBAC enforcement to UpdateTable This patch adds a requirement for the "ALTER" permission on a table to run a UpdateTable on it. A tests is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	b4484158e7	alternator: add RBAC enforcement to Query and Scan This patch adds a requirement for the "SELECT" permission on a table to run a Query or Scan on it. Both Query and Scan operations call the same do_query() function, so the permission checks are put there. Note that Query can read from either the base table or one of its views, and the permissions on the base and each of the views can be separate (so we can allow a role to only read one view, for example). Tests for all of the above are also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	82f7e55943	alternator: add RBAC enforcement to CreateTable This patch adds a requirement for the "CREATE" permission on ALL KEYSPACES to run a CreateTable operation. The CreateTable operation also performs so-called "auto-grant": When a role creates a table, it is automatically granted full permissions to read, write, change or delete that new table. A test for all these things is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	79dfb7b7d5	alternator: add RBAC enforcement to DeleteTable This patch adds a requirement for the "DROP" permission on a table to run a DeleteTable on it. Moreover, when a table and its views are deleted, any special permissions previously GRANTed on this table are removed. This is necessary because if a role creates a table it is automatically granted permissions on this table (this is known as "auto-grant" - see the CreateTable patch for details). If this role deletes this table and later a second role creates a table with the same name, we don't want the first role to have permissions on this new table. Tests for permission enforcements and revocation on delete are also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	2ebc0501b8	alternator: add RBAC enforcement to UpdateItem This patch adds a requirement for the "MODIFY" permission on a table to run a UpdateItem on it. Only the MODIFY permission is required, even if the operation may also read the old value of the item, such as a read-modify-write operation or even using ReturnValues='ALL_OLD'. A test is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	36d8aea654	alternator: add RBAC enforcement to DeleteItem This patch adds a requirement for the "MODIFY" permission on a table to run a DeleteItem on it. Only the MODIFY permission is required, even if the operation may also read the old value of the item (using ReturnValues='ALL_OLD'). A test is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	34c975854a	alternator: add RBAC enforcement to PutItem This patch adds a requirement for the "MODIFY" permission on a table to run a PutItem on it. Only the MODIFY permission is required, even if the operation may also read the old value of the item (using ReturnValues='ALL_OLD'). A test is also added. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	3008b8416c	alternator: add RBAC enforcement to GetItem In this patch, we begin to add role-based access control (RBAC) enforement to Alternator - in this patch only to GetItem. After the preparation of client_state correctly in the previous patch, the permission check itself in the get_item() function is very simple. The bigger part of this patch is a full functional test in test/alternator/test_cql_rbac.py. The test is quite self-explanatory and heavily commented. Basically we check that a new role cannot read with GetItem a pre-existing table, and we can add that ability by GRANTing (in CQL) the new role the ability to SELECT the table, the keyspace, all keyspaces, or add that ability to some other role that this role inherits. In the following patches, we will add role-based access control to the Alternator operations, but the functional tests will be shorter - we don't need to check the role inheritence, "all keyspaces" feature, and so on, for every operation separately since they all use the same underlying checking functions which handles these role inheritence issues in exactly the same way. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Nadav Har'El	583f060bd8	alternator: stop using an "internal" client_state Scylla uses a "client_state" object to encapsulate the information of who the client is - its IP address, which user was authenticated, and so on. For an unknown reason, Alternator created for each request an "internal" client_state, meaning that supposedly the client for each request was some sort of internal process (e.g., repair) rather than a real client. This was wrong, and we even had a FIXME about not putting the client's IP address in client_state. So in this patch, we start using a normal "external" client_state instead of an "internal" one. The client_state constructors are very different in the two cases, so a few lines of code had to change. I hope that this change will cause no functional changes. For example, Alternator was already setting its own timeouts explicitly and not relying on the default ones for external clients. However, we need to fix this for the following patches which introduce permissions checks (Role-Based Access Control - RBAC) - the client_state methods for checking permissions become no-ops for internal clients (even if the client_state contains an authenticated users). We need these functions to do their job - so we need an external variant of client_state. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-08-19 09:45:22 +02:00
Amnon Heiman	d20a333f51	alternator/executor: support batch latency and size metrics This patch Updated the get and write batch operations in Alternator to record latency using the newly added histogram metrics. It adds logic to increment the counters with the number of items processed in each batch. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-08-18 12:14:23 +03:00
Amnon Heiman	8bad4b44f8	Add metrics for Alternator get and write batch operations Introduced histogram metrics to track latency for Alternator's get and write batch operations. Added counters to record the number of items processed in each batch operation. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-08-18 12:09:46 +03:00
Avi Kivity	aa1270a00c	treewide: change assert() to SCYLLA_ASSERT() assert() is traditionally disabled in release builds, but not in scylladb. This hasn't caused problems so far, but the latest abseil release includes a commit [1] that causes a 1000 insn/op regression when NDEBUG is not defined. Clearly, we must move towards a build system where NDEBUG is defined in release builds. But we can't just define it blindly without vetting all the assert() calls, as some were written with the expectation that they are enabled in release mode. To solve the conundrum, change all assert() calls to a new SCYLLA_ASSERT() macro in utils/assert.hh. This macro is always defined and is not conditional on NDEBUG, so we can later (after vetting Seastar) enable NDEBUG in release mode. [1] `66ef711d68` Closes scylladb/scylladb#20006	2024-08-05 08:23:35 +03:00

1 2 3 4 5 ...

817 Commits