scylladb

Author	SHA1	Message	Date
Gleb Natapov	1d188f0394	auth: remove legacy auth mode and upgrade code A system needs to be upgraded to use v2 auth before moving to this ScyllaDB version otherwise the boot will fail.	2026-03-10 10:09:39 +02:00
Paweł Zakrzewski	98f5e49ea8	audit: Add support to CQL statements Integrates audit functionality into CQL statement processing to enable tracking of database operations. Key changes: - Add audit_info and statement_category to all CQL statements - Implement audit categories for different statement types: - DDL: Schema altering statements (CREATE/ALTER/DROP) - DML: Data manipulation (INSERT/UPDATE/DELETE/TRUNCATE/USE) - DCL: Access control (GRANT/REVOKE/CREATE ROLE) - QUERY: SELECT statements - ADMIN: Service level operations - Add audit inspection points in query processing: - Before statement execution - After access checks - After statement completion - On execution failures - Add password sanitization for role management statements - Mask plaintext passwords in audit logs - Handle both direct password parameters and options maps - Preserve query structure while hiding sensitive data - Modify prepared statement lifecycle to carry audit context - Pass audit info during statement preparation - Track audit info through statement execution - Support batch statement auditing This change enables comprehensive auditing of CQL operations while ensuring sensitive data is properly masked in audit logs.	2025-01-15 11:10:36 +01:00
Kefu Chai	e4463b11af	treewide: replace boost::algorithm::join() with fmt::join() Replace usages of `boost::algorithm::join()` with `fmt::join()` to improve performance and reduce dependency on Boost. `fmt::join()` allows direct formatting of ranges and tuples with custom separators without creating intermediate strings. When formatting comma-separated values into another string, fmt::join() avoids the overhead of temporary string creation that `boost::algorithm::join()` requires. This change also helps streamline our dependencies by leveraging the existing fmt library instead of Boost.Algorithm. To avoid the ambiguity, some caller sites were updated to call `seastar::format()` explicitly. See also - boost::algorithm::join(): https://www.boost.org/doc/libs/1_87_0/doc/html/string_algo/reference.html#doxygen.join_8hpp - fmt::join(): https://fmt.dev/11.0/api/#ranges-api Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#22082	2025-01-07 12:45:05 +02:00
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Kefu Chai	bab12e3a98	treewide: migrate from boost::adaptors::transformed to std::views::transform now that we are allowed to use C++23. we now have the luxury of using `std::views::transform`. in this change, we: - replace `boost::adaptors::transformed` with `std::views::transform` - use `fmt::join()` when appropriate where `boost::algorithm::join()` is not applicable to a range view returned by `std::view::transform`. - use `std::ranges::fold_left()` to accumulate the range returned by `std::view::transform` - use `std::ranges::fold_left()` to get the maximum element in the range returned by `std::view::transform` - use `std::ranges::min()` to get the minimal element in the range returned by `std::view::transform` - use `std::ranges::equal()` to compare the range views returned by `std::view::transform` - remove unused `#include <boost/range/adaptor/transformed.hpp>` - use `std::ranges::subrange()` instead of `boost::make_iterator_range()`, to feed `std::views::transform()` a view range. to reduce the dependency to boost for better maintainability, and leverage standard library features for better long-term support. this change is part of our ongoing effort to modernize our codebase and reduce external dependencies where possible. limitations: there are still a couple places where we are still using `boost::adaptors::transformed` due to the lack of a C++23 alternative for `boost::join()` and `boost::adaptors::uniqued`. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#21700	2024-12-03 09:41:32 +02:00
Michał Jadwiszczak	da82c5f0b0	cql3:statements: run service level statements on shard0 with raft guard To migrate service levels to be raft managed, obtain `group0_guard` to be able to pass it to service_level_controller's methods. Using this mechanism also automatically provides retries in case of concurrent group0 operation.	2024-03-21 23:14:57 +01:00
Marcin Maliszkiewicz	b482679857	cql3: run auth DML writes on shard 0 and with raft guard Because we'll be doing group0 operations we need to run on shard 0. Additional benefit is that with needs_guard set query_processor will also do automatic retries in case of concurrent group0 operations.	2024-03-01 16:25:14 +01:00
Kefu Chai	2dbf044b91	cql3: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16791	2024-01-16 16:43:17 +02:00
Gleb Natapov	45ce608117	cql3: remove empty statement::validate functions There are a lot of empty overloads for the function so lets remove them and use the one in the parent class instead.	2023-06-22 13:57:33 +03:00
Kefu Chai	108f20c684	cql3: capture reference to temporary value by value `data_dictionary::database::find_keyspace()` returns a temporary object, and `data_dictionary::keyspace::user_types()` returns a references pointing to a member of this temporary object. so we cannot use the reference after the expression is evaluated. in this change, we capture the return value of `find_keyspace()` using universal reference, and keep the return value of `user_types()` with a reference, to ensure us that we can use it later. this change silences the warning from GCC-13, like: ``` /home/kefu/dev/scylladb/cql3/statements/authorization_statement.cc:68:21: error: possibly dangling reference to a temporary [-Werror=dangling-reference] 68 \| const auto& utm = qp.db().find_keyspace(*keyspace).user_types(); \| ^~~ ``` Fixes #13725 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes #13726	2023-05-01 22:41:41 +03:00
Wojciech Mitros	6b8c1823a3	cql3: allow UDTs in permissions on UDFs Currently, when preparing an authorization statement on a specific function, we're trying to "prepare" all cql types that appear in the function signature while parsing the statement. We cannot do that for UDTs, because we don't know the UDTs that are present in the databse at parsing time. As a result, such authorization statements fail. To work around this problem, we postpone the "preparation" of cql types until the actual statement validation and execution time. Until then, we store all type strings in the resource object. The "preparation" happens in the `maybe_correct_resource` method, which is called before every `execute` during a `check_access` call. At that point, we have access to the `query_processor`, and as a result, to `user_types_metadata` which allows us to prepare the argument types even for UDTs.	2023-03-10 11:02:33 +01:00
Avi Kivity	5937b1fa23	treewide: remove empty comments in top-of-files After `fcb8d040` ("treewide: use Software Package Data Exchange (SPDX) license identifiers"), many dual-licensed files were left with empty comments on top. Remove them to avoid visual noise. Closes #10562	2022-05-13 07:11:58 +02:00
Eliran Sinvani	bf50dbd35b	cql3 statements: Change dependency test API to express better it's purpose Cql statements used to have two API functions, depends_on_keyspace and depends_on_column_family. The former, took as a parameter only a table name, which makes no sense. There could be multiple tables with the same name each in a different keyspace and it doesn't make sense to generalize the test - i.e to ask "Does a statement depend on any table named XXX?" In this change we unify the two calls to one - depends on that takes a keyspace name and optionally also a table name, that way every logical dependency tests that makes sense is supported by a single API call.	2022-02-27 11:48:03 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Pavel Emelyanov	b990ca5550	cql3: Make .validate() and .check_access() accept query_processor This is mostly a sed script that replaces methods' first argument plus fixes of compiler-generated errors. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-12-23 10:53:44 +03:00
Pavel Solodovnikov	76bea23174	treewide: reduce header interdependencies Use forward declarations wherever possible. Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Closes #8813	2021-06-07 15:58:35 +03:00
Avi Kivity	a55b434a2b	treewide: extent copyright statements to present day	2021-06-06 19:18:49 +03:00
Dejan Mircevski	df3ea2443b	cql3: Drop all uses_function methods No one seems to call them except for other uses_function methods. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-09-04 17:27:30 +02:00
Pavel Solodovnikov	adc6a98b59	cql3: return raw::parsed_statement as unique_ptr Change CQL parsing routine to return std::unique_ptr instead of seastar::shared_ptr. This can help reduce redundant shared_ptr copies even further. Make some supplementary changes necessary for this transition: * Remove enabled_shared_from_this base class from the following classes: truncate_statement, authorization_statement, authentication_statement: these were previously constructing prepared_statement instance in `prepare` method using `shared_from_this`. Make `prepare` methods implementation of inheriting classes mirror implementation from other statements (i.e. create a shallow copy of the object when prepairing into `prepared_statement`; this could be further refactored to avoid copies as much as possible). * Remove unused fields in create_role_statement which led to error while using compiler-generated copy ctor (copying uninitialied bool values via ctor). Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-03-23 23:19:21 +03:00
Pavel Emelyanov	6892dbdde7	cql3: Add storage_proxy argument to .check_access method Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-02-24 11:17:19 +03:00
Konstantin Osipov	d4866c1a28	cql3: remove prepared alias for prepared_statement cql3 has cql_statement, parsed_statement and prepared_statement classes, which, largely, stand for the same thing. prepared was an alias for prepared_statement which only required an extra tag jump in IDE and carried no meaning.	2020-02-12 16:44:43 +03:00
Konstantin Osipov	90346236ac	cql: propagate const property through prepared statement tree. cql_statement is a class representing a prepared statement in Scylla. It is used concurrently during execution, so it is important that its change is not changed by execution. Add const qualifier to the execution methods family, throghout the cql hierarchy. Mark a few places which do mutate prepared statement state during execution as mutable. While these are not affecting production today, as code ages, they may become a source of latent bugs and should be moved out of the prepared state or evaluated at prepare eventually: cf_property_defs::_compaction_strategy_class list_permissions_statement::_resource permission_altering_statement::_resource property_definitions::_properties select_statement::_opts	2019-11-26 14:18:17 +03:00
Avi Kivity	b70febe246	cql: cql_statement: remove execute_internal() With no callers, it can be safely removed.	2018-05-27 12:40:27 +03:00
Avi Kivity	f7b102238a	cql3: change cql_statement methods to accept a local storage_proxy The storage_proxy represents the entire cluster, so there's never a need to access it on a remote shard; the local shard instance will contact remote shard or remote nodes as needed. Simplify the API by passing storage_proxy references instead of seastar::sharded<storage_proxy> references. query_processor and other callers are adjusted to call seastar::sharded::local() first. Message-Id: <20180415142656.25370-2-avi@scylladb.com>	2018-04-16 10:18:28 +02:00
Jesse Haber-Kucharsky	e6363e15de	auth/resource: Construct from ctor The motivation behind this change is the idea that constructing a new instance of an object is the job of the constructor. One big benefit of this structure (with the addition of helpers for convenience) is that calls for emplacing instances (like `std::make_shared`, or `std::vector::emplace_back`) work without any difficulty. This would not be true for static construction functions.	2018-02-14 14:15:58 -05:00
Jesse Haber-Kucharsky	3665261a90	cql3/authorization_statement: Fix typo	2017-12-06 14:39:40 -05:00
Jesse Haber-Kucharsky	1bb22bb190	auth/resource: Generalize to different kinds This change generalizes the implementation of a `resource` to many different kinds of resources, though there is still only one kind (`data`). In the future, we also expect resource kinds for roles, user-defined functions (UDFs), and possibly on particular REST end-points. I considered several approaches to generalizing to different kinds of resources. One approach is to have a base class that is inherited from by different resource kinds. The common functionality would be accessed through virtual member functions and kind-specific functions would exist in sub-classes. I rejected this approach because dealing with different kinds of resources uniformly requires storage and life-time management through something like `std::unique_ptr<auth::resource>`, which means that we lose value semantics (including comparison) and must deal with complications around ownership. Another option was to use `boost::variant` (or, in future, `std::variant`). This is closer to what we want, since there a static set of resource kinds that we support. I rejected this approach for two reasons. The first is that all resource kinds share the same data (a list of segments and a root identifier), which would be duplicated in each type that composed the variant. The second is that the complexity and source-code overhead of `boost::variant` didn't seem warranted. The solution I ended up with is home-grown variant. All resources are described in the same `final` class: `auth::resource`. This class has value semantics, supports equality comparison, and has a strict ordering. All resources have in common a tag ("kind") and a list of parts. Most operations on resources don't care about the kind of resource (like getting its name, parsing a name, querying for the parent, etc). These are just member functions of the class. When we care about a kind-specific interpretation of a resource, we can produce a "view" of the resource. For example, `data_resource_view` allows for accessing the (optional) keyspace and table names. I anticipate in the future to add functions for creating role resources (`auth::resource::role`) and also `role_resource_view`. The functional behaviour of the system should be unchanged with this patch. I've added new unit tests in `auth_resource_test.cc` and removed the old test from `auth_test.cc`. Fixes #3027.	2017-12-06 14:37:56 -05:00
Jesse Haber-Kucharsky	8fe53ecf78	auth: Rename `data_resource` to `resource` The implementation and interface of `auth::resource` will change soon to support different kinds of resources beyond just data (keyspaces and tables).	2017-12-06 10:18:05 -05:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Vlad Zolotarov	ff55b76562	cql3::query_processor: use weak_ptr for passing the prepared statements around Use seastar::checked_ptr<weak_ptr<pepared_statement>> instead of shared_ptr for passing prepared statements around. This allows an easy tracking and handling of statements invalidation. This implementation will throw an exception every time an invalidated statement reference is dereferenced. Signed-off-by: Vlad Zolotarov <vladz@scylladb.com>	2017-04-12 12:24:03 -04:00
Vlad Zolotarov	7606588267	cql3::query_processor: add cql_stats - Add cql_stats member. - Pass it to cql3::raw::parsed_statement::prepare() virtual method. Signed-off-by: Vlad Zolotarov <vladz@cloudius-systems.com>	2016-11-03 11:48:57 -04:00
Avi Kivity	caf8d4f0e6	cql3: separate parsed_statement and parsed_statment::prepared cql3::statements::parsed_statement -> cql3::statements::raw::parsed_statement cql3::statements::parsed_statement::prepared -> cql3::statements::prepared_statement Message-Id: <1464609556-3756-2-git-send-email-avi@scylladb.com>	2016-05-31 09:09:10 +03:00
Calle Wilund	add2111c0a	cql3::statements::authorizarion_statement: Initial conversion Auth cql base type	2016-04-19 11:49:05 +00:00

33 Commits