scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-27 11:55:15 +00:00

Author	SHA1	Message	Date
Piotr Smaron	200f0bb219	alternator: use get_datacenters() in get_network_topology_options() Currently, `get_network_topology_options()` is using gossip data and iterates over topology using IPs and not host IDs, which may result in operating on inconsistent data. This method's implemenations has been changed to instead use `get_datacenters()`, which should always return consistent data. Fixes: scylladb/scylladb#21490 Closes scylladb/scylladb#21940	2024-12-22 18:57:10 +02:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Gleb Natapov	38c13975ca	alternator: move ttl to work with host ids instead of ips	2024-12-15 11:31:11 +02:00
Emil Maskovsky	8191e57036	treewide: fix annotations reported by GH checks Clean up the unnecessary includes reported by the GitHub checks that are polluting the PR diffs. The "utils/assert.hh" report should be actually fixed by the #21739, but as the usage of `SEASTAR_ASSERT()` is protected by the `SEASTAR_DEBUG` check it makes sense to include the header conditionally as well. Closes scylladb/scylladb#21817	2024-12-09 13:44:12 +03:00
Amnon Heiman	c62cd08fbe	alternator/executor: Add WCU support for delete item Calculating the item length of WCU deleted Item depends on how the operations was performed. In a simple scenario it would be consider a 1 byte. With an unsafe Read-Before-Write the item is return by get_perious_item and with LWT the item is get from the apply method. This patch changes the calls to describe_single_item in the last two scenarios so that they would use the read item to determine the item length. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-12-03 15:55:41 +02:00
Amnon Heiman	b888ed84f7	alternator/executer use uint in describe_item Actions in rmw_operation can use describe_item to determine to get an existing value (Read before Write scenario) on those cases the existing item size can be bigger than the one we are storing (in the extreme case, when deleting an object we only have its keys) This modify the describe_item API so it would take a pointer to uint instead of the consumed_capacity_counter so we can use it to get the old value size and depends on that, determine the size that will be used for the WCU calculation.	2024-12-03 15:55:41 +02:00
Amnon Heiman	3c6594b26a	alternator/consumed_capacity.hh: Make the total_bytes public rmw operations needs to be able to modify consume_capacity total_bytes directly. Depends on the previous stored item the length on which the WCU will be calculated can be different than the length of the operation. This patch makes the total_bytes public so it will be possible to modify it directly. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-12-03 15:55:41 +02:00
Amnon Heiman	8f3dd877ff	Alternato: split WCU metrics into ops This patch add visibility to the WCU metrics. It uses a label 'ops' to split each of the operations that contribute to WCU into their operations. When summing over all ops value the result will be the same.	2024-12-03 15:55:41 +02:00
Kefu Chai	f436edfa22	mutation: remove unused "#include"s these unused includes are identified by clang-include-cleaner. after auditing the source files, all of the reports have been confirmed. please note, because `mutation/mutation.hh` does not include `seastar/coroutine/maybe_yield.hh` anymore, and quite a few source files were relying on this header to bring in the declaration of `maybe_yield()`, we have to include this header in the places where this symbol is used. the same applies to `seastar/core/when_all.hh`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-11-29 14:01:44 +08:00
Kefu Chai	2c9c654798	build: cmake: Enforce explicit library linkage visibility This change improves dependency management by explicitly specifying library linkage visibility in CMake targets. Previously, some ScyllaDB targets used `target_link_libraries()` without `PUBLIC` or `PRIVATE` keywords, which resulted in transitive library dependencies by default. This unintentionally exposed non-public dependencies to downstream targets. Changes: - Always use explicit `PRIVATE` or `PUBLIC` keywords with `target_link_libraries()` - Tighten build dependency tree - Enforce a more modular linkage model See: [CMake documentation on library dependencies](https://cmake.org/cmake/help/latest/command/target_link_libraries.html) Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21686	2024-11-28 18:15:23 +02:00
Kefu Chai	a5ee0c896b	treewide: migrate from boost::adaptors::filtered to std::views::filter Modernize the codebase by replacing Boost range adaptors with C++23 standard library views, reducing external dependencies and leveraging modern C++ language features. Key Changes: - Replace `boost::adaptors::filtered` with `std::views::filter` - Remove `#include <boost/range/adaptor/filtered.hpp>` - Utilize standard library range views Motivation: - Reduce project's external dependency footprint - Leverage standard library's range and view capabilities - Improve long-term code maintainability - Align with modern C++ best practices Implementation Challenges and Considerations: 1. Range Conversion and Move Semantics - `std::ranges::to` adaptor requires rvalue references - Necessitated updates to variable and parameter constness - Example: `cql3/restrictions/statement_restrictions.cc` modified to remove `const` from `common` to enable efficient range conversion 2. Range Iteration and Mutation - Range views may mutate internal state during iteration - Cannot pass ranges by const reference in some scenarios - Solution: Pass ranges by rvalue reference to explicitly indicate state invalidation Limitations: - One instance of `boost::adaptors::filtered` temporarily preserved due to lack of a C++23 alternative for `boost::join()` - A comprehensive replacement will be addressed in a follow-up change This change is part of our ongoing effort to modernize the codebase, reducing external dependencies and adopting modern C++ practices. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21648	2024-11-26 14:26:50 +02:00
Nadav Har'El	7014aec452	Merge 'Alternator measuring RCU and WCU' from Amnon Heiman Read and Write Consumed Capacity units are an abstract way of measuring Alternator actions. In general, they correspond to the read or write data. In the long run, the RCU/WCU adds a way of charging an operation and limiting usage. This series addresses two issues: consume capacity request API and metering. The Alternator (and DynmoDB) API has an optional parameter allowing users to check the number of units an operation consumes. When a user adds that parameter, the response will contain the number of units used for the operation. This series adds the consume capacity support to the get_item and put_item, adds a metric to collect the overall RCU and WCU used, and adds a test for the new functionality. Follow-up PRs will add support for more operations and GSI. Replaces #19811 Partially implement: #5027 Closes scylladb/scylladb#21543 * github.com:scylladb/scylladb: alternator/test_metrics: Add tests for table consumption units test_returnconsumedcapacity.py: Add putItem tests Alternator: add WCU support Add test/alternator/test_returnconsumedcapacity.py alternator/executor: Add consume capacity for get_item alsternator/stats: Add rcu and wcu metrics to stats alternator/executor.hh: white-space cleanup Add the consume_capacity helper class	2024-11-24 19:27:03 +02:00
Botond Dénes	d94591c260	Merge 'treewide: replace boost::find_if with std::ranges::find_if' from Kefu Chai now that we are allowed to use C++23. we now have the luxury of using `std::ranges::find_if`. in this change, we: - replace `boost::find_if` with `std::ranges::find_if` - remove all `#include <boost/range/algorithm/find_if.hpp>` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. --- it's a cleanup, hence no need to backport. Closes scylladb/scylladb#21495 * github.com:scylladb/scylladb: treewide: replace boost::find_if with std::ranges::find_if counters: replace boost::find_if with std::ranges::find_if combine.hh: use std::iter_const_reference_t when appropriate	2024-11-20 09:58:13 +02:00
Amnon Heiman	56dce5fe8a	Alternator: add WCU support This patch adds functionality to track Write Capacity Units (WCU). Currently for the put_item operation. This enhancement allows for standardized measurement of write operations, aligning with DynamoDB-like metrics. Additionally, the WCU value is now optionally included in the response to provide immediate feedback on the write capacity usage. The implementation adds a consumed_capacity_counter member to rmw_operation, this will allow to add WCU functionality to update_item and delete_item	2024-11-19 18:43:28 +02:00
Amnon Heiman	b8f7b2eb52	alternator/executor: Add consume capacity for get_item This patch adds functionality to track Read Capacity Units (RCU) for the get_item operation. This enhancement allows for standardized measurement of read operations, aligning with DynamoDB-like metrics. Additionally, the RCU value can now be included in the response to provide immediate feedback on the read capacity usage. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-11-19 18:43:28 +02:00
Amnon Heiman	2b10296a82	alsternator/stats: Add rcu and wcu metrics to stats Introduced `rcu` (Read Capacity Units) and `wcu` (Write Capacity Units) metrics to the `stats` object for enhanced capacity tracking. `rcu` and `wcu` provide a simplified way of measuring reads and writes, respectively, by representing capacity usage in standardized units. This patch adds these metrics to the existing alternator stats, enabling monitoring of the total consumed units.	2024-11-19 18:43:28 +02:00
Amnon Heiman	b0e699e7ec	alternator/executor.hh: white-space cleanup Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-11-19 18:43:28 +02:00
Amnon Heiman	eedf390196	Add the consume_capacity helper class Alternator API should support returning WCU and RCU when requested. The consumed capacity helper class serves multiple purposes: 1. Break the logic of calculating the RCU and WCU from the main code. 2. Add a helper class consumed_capacity_counter that can accumulate bytes. 3. Optionally update counters for RCU and WCU that will be used by the metric layer. 4. Update the response with the consumed units if needed. The consumed_capacity_counter is a base class with two implementations: A read and write implmenentation. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-11-19 18:42:56 +02:00
Nadav Har'El	7607f5e33e	alternator: fix "/localnodes" to not return down nodes Alternator's "/localnodes" HTTP requests is supposed to return the list of nodes in the local DC to which the user can send requests. Before commit `bac7c33313` we used the gossiper is_alive() method to determine if a node should be returned. That commit changed the check to is_normal() - because a node can be alive but in non-normal (e.g., joining) state and not ready for requests. However, it turns out that checking is_normal() is not enough, because if node is stopped abruptly, other nodes will still consider it "normal", but down (this is so-called "DN" state). So we need to check both is_alive() and is_normal(). This patch also adds a test reproducing this case, where a node is shut down abruptly. Before this patch, the test failed ("/localnodes" continued to return the dead node), and after it it passes. Fixes #21538 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#21540	2024-11-19 10:04:59 +02:00
Kefu Chai	33a0e5b892	treewide: replace boost::find_if with std::ranges::find_if now that we are allowed to use C++23. we now have the luxury of using `std::ranges::find_if`. in this change, we: - replace `boost::find_if` with `std::ranges::find_if` - remove all `#include <boost/range/algorithm/find_if.hpp>` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-11-19 10:50:01 +08:00
Nadav Har'El	f2b4a59ec7	alternator: change sstring_view to std::string_view Our "sstring_view" is an historic alias for the standard std::string_view. Alternator only used this alias in a couple of random names, let's change them to the standard type name. Refs #4062. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-11-18 15:44:49 +02:00
Nadav Har'El	da99dc3a7f	cross-tree: change to_sstring_view() to to_string_view() For historic reasons, we have (in bytes.hh) a type sstring_view which is an alias for std::string_view - since the same standard type can hold a pointer into both a seastar::sstring and std::string. This alias in unnecessary and misleading to new developers (who might assume it is somehow different from std::string_view). This patch doesn't yet remove all occurances of sstring_view (the request in #4062), but begins to do it by renaming one commonly-used function, to_sstring_view(bytes) to to_string_view() and of course changes all its uses to the new name. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-11-18 14:57:49 +02:00
Kefu Chai	00810e6a01	treewide: include seastar/core/format.hh instead of seastar/core/print.hh The later includes the former and in addition to `seastar::format()`, `print.hh` also provides helpers like `seastar::fprint()` and `seastar::print()`, which are deprecated and not used by scylladb. Previously, we include `seastar/core/print.hh` for using `seastar::format()`. and in seastar 5b04939e, we extracted `seastar::format()` into `seastar/core/format.hh`. this allows us to include a much smaller header. In this change, we just include `seastar/core/format.hh` in place of `seastar/core/print.hh`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21574	2024-11-14 17:45:07 +02:00
Benny Halevy	4cf3b683bc	alternator: create_keyspace_metadata: enable tablets using feature_service Rather than using the local configuration option on this node, check the cluster feature instead. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-11-07 13:56:59 +02:00
Kefu Chai	59eb2ab119	treewide: s/boost::algorithm::any_of/std::ranges::any_of/ now that we are allowed to use C++23. we now have the luxury of using `std::ranges::any_of`. in this change, we replace `boost::algorithm::any_of` with `std::ranges::any_of` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-11-05 14:06:09 +08:00
Kefu Chai	f8bb1c64f1	treewide: s/boost::algorithm::all_of/std::ranges::all_of/ now that we are allowed to use C++23. we now have the luxury of using `std::ranges::all_of`. in this change, we replace `boost::algorithm::all_of` with `std::ranges::all_of` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-11-05 14:05:24 +08:00
Avi Kivity	73b1f66b70	Revert "Merge 'Allow explicitly enabling or disabling tablets when creating a new keyspace' from Benny Halevy" This reverts commit `c286434e4c`, reversing changes made to `6712fcc316`. The commit causes memtable_test to be very flaky in debug mode. Specifically, subtests test_exceptions_in_flush_on_sstable_open and test_exceptions_in_flush_on_sstable_write).	2024-10-30 00:55:29 +02:00
Benny Halevy	9ef2dc2428	alternator: create_keyspace_metadata: enable tablets using feature_service Rather than using the local configuration option on this node, check the cluster feature instead. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2024-10-24 10:18:42 +03:00
Avi Kivity	847c850034	schema: add accessors for primary key columns and non-primary-key columns It's somewhat common to ask for the partition key and clustering key columns, or for the static and regular columsn. Provide accessors for them rather than requiring the user to glue them. Some callers are converted. Closes scylladb/scylladb#21191	2024-10-22 15:01:14 +02:00
Kefu Chai	6ead5a4696	treewide: move log.hh into utils/log.hh the log.hh under the root of the tree was created keep the backward compatibility when seastar was extracted into a separate library. so log.hh should belong to `utils` directory, as it is based solely on seastar, and can be used all subsystems. in this change, we move log.hh into utils/log.hh to that it is more modularized. and this also improves the readability, when one see `#include "utils/log.hh"`, it is obvious that this source file needs the logging system, instead of its own log facility -- please note, we do have two other `log.hh` in the tree. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-10-22 06:54:46 +03:00
Kefu Chai	5cd619a60c	treewide: s/boost::adaptors::map_keys/std::views::keys/ now that we are allowed to use C++23. we now have the luxury of using `std::views::keys`. in this change, we: - replace `boost::adaptors::map_keys` with `std::views::keys` - update affected code to work with `std::views::keys` to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21198	2024-10-21 12:47:52 +03:00
Avi Kivity	c3be2489ce	treewide: drop includes of <boost/range/adaptors.hpp> This includes way too much, including <boost/regex.hpp>, which is huge. Drop includes of adaptors.hpp and replace by what is needed. Closes scylladb/scylladb#21187	2024-10-20 17:17:11 +03:00
Avi Kivity	820509026f	schema: replace boost ranges with std ranges To reduce dependency load, use std ranges instead of boost ranges. The std::ranges::{lower,upper}_bound don't support heterogeneous lookup, but a more natural solution is to use a projection to search for the name, so we use that and the custom comparator is removed. Many callers are converted as well due to poor interoperability between boost ranges and std ranges.	2024-10-15 16:42:54 +03:00
Nadav Har'El	45ccceb137	alternator: add "dc" and "rack" options to "/localnodes" request Before this patch, the "/localnodes" HTTP request to the Alternator server lists all the live nodes of the current DC. This patch adds two optional parameters to this query: dc: allows to list the live nodes of a specific named DC instead of the current DC of the server. rack: allows to restrict the results to just the nodes belonging to a specific named rack. For both options, if no live node exists in the given dc or rack (in particular, if such a dc or rack doesn't even exist), an empty list is returned - it's not an error. The default, if dc or rack is not specified - remains exactly as it is today - look at the current DC (the one of the node being request), and do not restrict the list to any specific rack. We expect the new options that we added here to be useful for two use cases: 1. A client that knows of some Scylla node (belonging to an unknown DC), but wants to list the nodes in its DC, which it knows by name. 2. A client in a multi-rack DC (e.g., multi-AZ region in AWS) that wants to send requests to nodes in its own rack (which it knows by name), to avoid cross-rack networking costs. Note that in both cases, this requires clients to know the names of DCs and AZs via some out-of-band means. The client can also get a list of DCs and racks using the system.local system table, as the tests included in this patch demonstrate. This patch includes two set of tests for these new options: One in the the single-node test/alternator framework that has a single dc and rack but can still check the case of an unknown dc or rack (in which case an empty list is returned). The second test is in the topology framework, and runs an 8-node cluster with two DCs, two racks, and two nodes in each, and checks all the combinations of "/localnodes" requests with and without dc and rack options. This test also resolves a longstanding TODO that asked for such a multi-DC test for "/localnodes" to be written. Fixes #12147 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20915	2024-10-07 20:53:47 +03:00
Avi Kivity	93afc77307	raft_group0_client: uninclude "mutation/mutation.hh" Lighten the dependency load. Some constructors and destructors are uninlined to avoid the header depending on the mutation class.	2024-09-28 16:31:53 +03:00
Nadav Har'El	7715abfc56	Merge 'Alternator store ProvisionedThroughput' from Amnon Heiman When users create a table using the Alternator API, they can decide if the billing is PROVISIONED of PAY_PER_REQUEST. If the billing is set to PROVISIONED, they need to set the ProvisionedThroughput ReadCapacityUnits (RCU) and WriteCapacityUnits (WCU). This series adds support for getting and setting the ProvisionedThroughput. The values will be stored as table extension tags. Following how TTL is stored within the Alternator, we will use ```system:rcu_attribute``` and ```system:wcu_attribute``` for the labels. The series adds a test that sets ProvisionedThroughput and validates that it gets the value back. It was tested with both Alternator and AWS. This series is part of the effort to monitor, limit, and bill Alternator operations. New code, no need to backport. Closes scylladb/scylladb#20056 * github.com:scylladb/scylladb: docs/alternator/compatibility.md: explain the consumed capacity provisioned Add test/alternator/test_provisioned_throughput.py test/alternator/util.py: Allow override BillingMode alternator/executor.cc: Store ProvisionedThroughput	2024-09-26 01:23:17 +03:00
Nadav Har'El	6496eab5ee	Merge 'Rename Alternator batch item count metrics' from Amnon Heiman This PR addresses multiple issues with alternator batch metrics: 1. Rename the metrics to scylla_alternator_batch_item_count with op=BatchGetItem/BatchWriteItem 2. The batch size calculation was wrong and didn't count all items in the batch. 3. Add a test to validate that the metrics values increase by the correct value (not just increase). This also requires an addition to the testing to validate ops of different metrics and an exact value change. Needs backporting to allow the monitoring to use the correct metrics names. Fixes #20571 Closes scylladb/scylladb#20646 * github.com:scylladb/scylladb: alternator:test_metrics test metrics for batch item count alternator:test_metrics Add validating the increased value alternator: Fix item counting in batch operations Alterntor rename batch item count metrics	2024-09-23 10:13:07 +03:00
Pavel Emelyanov	2f4f0eb060	Merge 'Alternator: a few RBAC fixes' from Nadav Har'El The main goal of this PR is to fix a bug (#20619) in the alternator_enforce_authorization=false setting - which didn't do its job (i.e, _don't_ check permissions) when authorization is configured in CQL but not wanted in Alternator. The series also a few smaller bugs in the code that were discovered while debugging the main issue: 1. A potential use-after-free (that didn't seem to hit us in practice) is fixed. 2. A confusing error message (that was also reported in #20619) is improved. 3. Make the alternator_enforce_authorization live-updatable. There was no reason why it shouldn't be, and as this series needs to make this flag available to more code, let's just do it properly and assume the flag is live-updatable. Because the RBAC feature has not been backported to any open-source branches, neither should these fixes. But if some private branch received a backport of the RBAC feature, it should get these fixes too. Fixes #20619. Closes scylladb/scylladb#20640 * github.com:scylladb/scylladb: alternator: make alternator_enforce_authorization live-updateable alternator: fix alternator_enforce_authorization=false alternator: improve error message when unauthenticated alternator: avoid use-after-free in RBAC	2024-09-18 14:02:09 +03:00
Kefu Chai	cb1670b79b	Update seastar submodule * seastar ec5da7a6...69f88e2f (38): > build: s/Sanitizers_COMPILER_OPTIONS/Sanitizers_COMPILE_OPTIONS > test: Update httpd test with request/reply body writing sugar > http: Add sugar to request and response body writers > utils: Add util::write_to_stream() helper > seastar-addr2line: adjust llvm termination regex > README.md: add Crimson project > rpc: conditionally use fmt::runtime() based on SEASTAR_LOGGER_COMPILE_TIME_FMT > build: check the combination of Sanitizers > tls: clear session ticket before releasing > print: remove dead code > doc/lambda-coroutine-fiasco: reword for better readability > rpc: fix compilation error caused by fmt::runtime() > tutorial: explain the use case of rethrow_exception and coroutine::exception > reactor: print more informative error when io_submit fails > README.md: note GitHub discussions > prometheus: `fmt::print` to stringstream directly > doc: add document for testing with seastar > seastar/testing: only include used headers > test: Add abortable http client test cases > http/client: Add abortable make_request() API method > http/client: Abort established connections > http/client: Handle abort source in pool wait > http/client: Add abort source to factory::make() method > http/client: Pass abort_source here and there > http/client: Idnentation fix after previous patch > http/client: Merge some continuations explicitly > signal: add seastar signal api > httpd: remove unused prometheus structs > print: use fmtlib's fmt::format_string in format() > rpc: do not use seastar::format() in rpc logger > treewide: s/format/seastar::format/ > prometheus: sanitize label value for text protocol > tests: unit test prometheus wire format > io-tester: Introduce batches to rate-based submission > io-tester: Generalize issueing request and collecting its result > io-tester: Cancel intent once > io-tester: Dont carry rps/parallelism variables over lambdas > io-tester: Simplify in-flight management The breaking changes in the seastar submodule necessitate corresponding modifications in our code. These changes must be implemented together in a single commit to maintain consistency. So that each commit is buildable. following changes are included in addition to seastar submodule update: * instead of passing a `const char` for the format string, pass a templated `fmt::format_string<...>`, this depends on the `seastar::format()` change in seastar. explicitly call `fmt::runtime()` if the format string is not a consteval expression. this depends on the `seastar::format()` change in seastar. as `seastar::format()` does not accept a plain `const char` which is not constexpr anymore. pass abort_source to `dns_connection_factory::make()`. this depends on the change in seastar, which added a `abort_source` argument to the pure virtual member function of `connection_factory::make()`. call call {fmt,seastar}::format() explicitly. this is a follow up of `3e84d43f`, which takes care of all places where we should call `fmt::format()` and `seastar::format()` explicitly to disambiguate the `format()` call. but more `format()` call made their way into the source tree after `3e84d43f`. so we need fix them as well. * include used header in tests Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Update seastar submodule Please enter the commit message for your changes. Lines starting Closes scylladb/scylladb#20649	2024-09-18 13:59:22 +03:00
Amnon Heiman	905408f764	alternator: Fix item counting in batch operations This patch fixes the logic for counting items in batch operations. Previously, the item count in requests was inaccurate, it count the number of tabels in get_item and the request_items in write_items. The new logic correctly counts each individual item in `BatchGetItem` and `BatchWriteItem` requests. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-09-18 11:30:59 +03:00
Amnon Heiman	515857a4a9	Alterntor rename batch item count metrics This patch renames metrics tracking the total number of items in a batch to `scylla_alternator_batch_item_count`. It uses the existing `op` label to differentiate between `BatchGetItem` and `BatchWriteItem` operations. Ensures better clarity and distinction for batch operations in monitoring. This an example of how it looks like: # HELP scylla_alternator_batch_item_count The total number of items processed across all batches # TYPE scylla_alternator_batch_item_count counter scylla_alternator_batch_item_count{op="BatchGetItem",shard="0"} 4 scylla_alternator_batch_item_count{op="BatchWriteItem",shard="0"} 4	2024-09-18 11:20:07 +03:00
Nadav Har'El	17deaae463	alternator: make alternator_enforce_authorization live-updateable For no good reason, the "alternator_enforce_authorization" flag (which chooses whether to enable authentication and authorization checks in Alternator) was not live-updatable, so make it so. Both "server" and "executor" objects use this configuration flag, the former is fixed in this patch (to hold a live-updatable reference instead of a copy of a boolean), the latter was already prepared for this change and already held a live-updatable reference. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:51:16 +03:00
Nadav Har'El	00793059e1	alternator: fix alternator_enforce_authorization=false When the configuration has alternator_enforce_authorization=false, Alternator should not do authentication (check which user signed each request) nor authorization (check if that user has permissions to do each operation). Our implementation forgot to disable the authorization checks when it's configured to false. The (incorrect) assumption was that when alternator_enforce_authorization is configured to false, the CQL 'authenticator' and 'authorizer' configuration is also disabled - so the authorization checks will be no-ops. But we can't assume that: Users are free to configure 'authenticator' and 'authorizer' for use in CQL, and then set alternator_enforce_authorization=false just for Alternator. So this patch adds a new test for this case - when we have authenticator=PasswordAuthenticator, authorizer=CassandraAuthorizer but alternator_enforce_authorization=false, and fixes it to work correctly. The heart of the fix is trivial: the `verify_*_permission()` functions just need to check the alternator_enforce_authorization and return immediately when false. The bigger part of this change is to get the alternator_enforce_authorization into the "executor" object and then to pass it into the verify calls. Although alternator_enforce_authorization is not YET live updatable, this code is prepared for the future that it may become live updatable, so the executor object saves not the boolean value of this flag, but a live-updatable reference to it. Fixes #20619 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:50:00 +03:00
Nadav Har'El	76af7c0389	alternator: improve error message when unauthenticated When access-control checks report permission denied, we want to report the name of the authenticated role (the role signing the request) which didn't have the permission. When authentication was disabled, and there is no authenticated role, we printed the fake name "anonymous", but this can confuse users (it confused me!) to think there's an actual role named "anonymous". So let's change that string to "<anonymous>" with angle brackets - it makes it more obvious that this isn't a real role, but actually an anonymous request. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:44:29 +03:00
Nadav Har'El	3543bf14e9	alternator: avoid use-after-free in RBAC While auditing the code, I noticed that the current Alternator access control checks have code like: ``` return client_state.check_has_permission(auth::command_desc( permission_to_check, auth::make_data_resource(schema->ks_name(), schema->cf_name()))).then( ``` There's a problem here - it turns out that, unfortunately, command_desc holds a reference to the "resource" object - not a copy. So the temporary object returned by make_data_resource may be freed and then used... Curiously, we've not seen a bug caused by this in practice (not even in debug build mode), but better safe than sorry, so this patch changes the code in one of two ways: 1. Code using coroutines can keep the "resource" as a variable on the stack. 2. Code using continuations needs to hold the "resource" with do_with(), but since this already incurs the cost of an extra allocation (even in the successful case), might as well just switch to using coroutines and have less ugly code. This patch does not change any functionality, and all the tests seem to work before and after it the same. Signed-off-by: Nadav Har'El <nyh@scylladb.com> hello	2024-09-17 15:41:09 +03:00
Nadav Har'El	930accad12	alternator: return error on unused AttributeDefinitions A CreateTable request defines the KeySchema of the base table and each of its GSIs and LSIs. It also needs to give an AttributeDefinition for each attribute used in a KeySchema - which among other things specifies this attribute's type (e.g., S, N, etc.). Other, non-key, attributes do not have a specified type, and accordingly must not be mentioned in AttributeDefinitions. Before this patch, Alternator just ignored unused AttributeDefinitions entries, whereas DynamoDB throws an error in this case. This patch fixes Alternator's behavior to match DynamoDB's - and adds a test to verify this. Besides being more error-path-compatible with DynamoDB, this extra check can also help users: We already had one user complaining that an AttributeDefinitions setting he was using was ignored, not realizing that it wasn't used by any KeySchema. A clear error message would have saved this user hours of investigation. Fixes #19784. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20378	2024-09-12 15:37:18 +03:00
Kefu Chai	3e84d43f93	treewide: use seastar::format() or fmt::format() explicitly before this change, we rely on `using namespace seastar` to use `seastar::format()` without qualifying the `format()` with its namespace. this works fine until we changed the parameter type of format string `seastar::format()` from `const char*` to `fmt::format_string<...>`. this change practically invited `seastar::format()` to the club of `std::format()` and `fmt::format()`, where all members accept a templated parameter as its `fmt` parameter. and `seastar::format()` is not the best candidate anymore. despite that argument-dependent lookup (ADT for short) favors the function which is in the same namespace as its parameter, but `using namespace` makes `seastar::format()` more competitive, so both `std::format()` and `seastar::format()` are considered as the condidates. that is what is happening scylladb in quite a few caller sites of `format()`, hence ADT is not able to tell which function the winner in the name lookup: ``` /__w/scylladb/scylladb/mutation/mutation_fragment_stream_validator.cc:265:12: error: call to 'format' is ambiguous 265 \| return format("{} ({}.{} {})", _name_view, s.ks_name(), s.cf_name(), s.id()); \| ^~~~~~ /usr/bin/../lib/gcc/x86_64-redhat-linux/14/../../../../include/c++/14/format:4290:5: note: candidate function [with _Args = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 4290 \| format(format_string<_Args...> __fmt, _Args&&... __args) \| ^ /__w/scylladb/scylladb/seastar/include/seastar/core/print.hh:143:1: note: candidate function [with A = <const std::basic_string_view<char> &, const seastar::basic_sstring<char, unsigned int, 15> &, const seastar::basic_sstring<char, unsigned int, 15> &, const utils::tagged_uuid<table_id_tag> &>] 143 \| format(fmt::format_string<A...> fmt, A&&... a) { \| ^ ``` in this change, we change all `format()` to either `fmt::format()` or `seastar::format()` with following rules: - if the caller expects an `sstring` or `std::string_view`, change to `seastar::format()` - if the caller expects an `std::string`, change to `fmt::format()`. because, `sstring::operator std::basic_string` would incur a deep copy. we will need another change to enable scylladb to compile with the latest seastar. namely, to pass the format string as a templated parameter down to helper functions which format their parameters. to miminize the scope of this change, let's include that change when bumping up the seastar submodule. as that change will depend on the seastar change. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-09-11 23:21:40 +03:00
Amnon Heiman	c76347032d	alternator/executor.cc: Store ProvisionedThroughput This patch adds the ability to store and retrieve the ProvisionedThroughput in a table. The information is stored in the table tags. We use the TTL convention used in alternator, and the tags will be: system:provisioned_rcu and system:provisioned_wcu. verify_billing_mode function now return a struct with the billing mode information. The code of describe_table now check if the provision tags exists and return the RCU and WCU accordingly. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-09-10 17:06:40 -04:00
Nadav Har'El	80a0798e77	alternator: better error message in some cases of key type mismatch Alternator uses a common function get_typed_value() to read the values of key attribute and confirm they have the expected type (key attributes have a fixed type in the schema). If the type is wrong, we want to print a "Type mismatch" error message. But the current implementation did the checks in the wrong order, and as a result could print a "Malformed value object" message instead of a "Type mismatch". That could happen if the wrong type is a boolean, map, list, or basically any type whose JSON representation is not a string. The allowed key types - bytes), string and number - all have string representations in JSON, but still we should first report the mismatched type and only report the "Malformed object" if the type matches but the JSON is faulty. In addition to fixing the error message, we fix an existing test which complained in a comment (but ignored) that the error message in some case (when trying to use a map where a key is expected) the strange "Malformed value object" instead of the expected "Type mismatch". The next patch will add an additional reproducer for this problem and its fix. That test will do: ``` with pytest.raises(ClientError, match='ValidationException.*mismatch'): test_table_gsi_6.put_item(Item={'p': p, 's': True}) ``` I.e., it tries to set a boolean value for a string key column, and expect to get the "Type mismatch" error and not the ugly "Malformed value object". Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-09 13:14:49 +03:00
Nadav Har'El	cf5d7ce212	Alternator: drop unneeded "IS NOT NULL" clauses in MV of GSI/LSI Scylla's materialized views naturally skip any base rows where the view's key isn't set (is NULL), because we can't create a view row with a null key. To make the user aware that this is happening, the user is required to add "WHERE ... IS NOT NULL" for the view's key columns when defining the view. However, the only place that these extra IS NOT NULL clauses are checked are in the CQL "CREATE MATERIALIZED VIEWS" statement - they are completely ignored in all other places in the code. In particular, when we create a materialized view in Alternator (GSI or LSI), we don't have to add these "IS NOT NULL" clauses, as they are outright ignored. We didn't know they were ignored, and made an effort to add them - but no matter how incorrectly we did it, it didn't matter :-) In commit `2bf2ffd3ed` it turned out we had a typo that caused the wrong column name to be printed. Also, even today we are still missing base key columns that aren't listed as a view key in Alternator but still added as view clustering keys in Scylla - and again the fact these were missing also didn't matter. So I think it's time to stop pretending, and stop calculating these "IS NOT NULL" strings, so this patch outright removes them from the Alternator view-creation code. Beyond being a nice cleanup of unnecessary and inaccurate code, it will also be necessary when we allow in later patches to index for an Alternator attribute "x" not a real column x in the base table but rather an element in the ":attrs" map - so adding a "x IS NOT NULL" isn't only unnecessary, it is outright illegal: The expression evaluation code, even though it doesn't do anything with the "IS NOT NULL" expression, still verifies that "x" is a valid column, which it isn't. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-09 12:09:25 +03:00

1 2 3 4 5 ...

845 Commits