scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-21 15:22:13 +00:00

Author	SHA1	Message	Date
Andrzej Jackowski	f8156702de	tree: add missing -present to copyright headers ~2076 files used "Copyright (C) YYYY-present ScyllaDB" while ~88 files used "Copyright (C) YYYY ScyllaDB". This inconsistency leads to unnecessary code review discussions and gradual spread of the less common format. Standardize all ScyllaDB copyright headers to use -present. Fixes SCYLLADB-1984 Closes scylladb/scylladb#29876	2026-05-21 10:57:42 +02:00
Pavel Emelyanov	98bea152a8	auth: Remove unused default_superuser() function All callers have been migrated to read the superuser name from auth::config directly. Remove the now-unused helper that fetched it from db::config via the query processor. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-05-15 18:55:02 +03:00
Pavel Emelyanov	9b58d2213b	auth: Switch role managers to use auth::config Convert all role manager implementations to receive their configuration from auth::config instead of accessing db::config through the query processor: - standard_role_manager: reads superuser name from config - ldap_role_manager: reads LDAP URL template, attribute, bind credentials, and permissions update interval from config; passes config to inner standard_role_manager - maintenance_socket_role_manager: keeps a const reference to service's config and passes it directly when lazily constructing standard_role_manager Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-05-15 18:55:02 +03:00
Pavel Emelyanov	14b36b3db1	auth: Switch authenticators to use auth::config Convert all authenticator implementations to receive their configuration from auth::config instead of accessing db::config through the query processor: - password_authenticator: reads superuser name and salted password from config, stores them as members - saslauthd_authenticator: reads socket path from config - certificate_authenticator: reads role queries from config - transitional_authenticator: passes config to inner password_authenticator - maintenance_socket_authenticator: inherits new constructor via using declaration Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-05-15 18:45:01 +03:00
Pavel Emelyanov	07ed557a2f	auth: Introduce auth::config and wire it through service Add a dedicated auth::config struct that carries all configuration options needed by auth modules. The config is created per-shard using sharded_parameter to ensure updateable_value fields are shard-local. The config is stored as a member in auth::service and passed by const reference to factories so that each auth module can receive its configuration when constructed. The modules themselves are not yet converted — they still read from db::config via the query processor. The stored config is also used in describe_roles() to read the superuser name, eliminating the default_superuser() call that reached into db::config via the query processor. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2026-05-15 18:44:37 +03:00
Marcin Maliszkiewicz	df69a5c79b	auth: tolerate missing permissions column in authorize() Ghost rows in role_permissions with a live row marker but no permissions column can occur when permissions created via INSERT (e.g. by the removed auth v2 migration) are later revoked. The row marker survives the revoke, leaving a row visible to queries but with permissions=null. Add a has() guard before accessing the permissions column, matching the pattern already used in list_all(). Return NONE permissions for such ghost rows instead of crashing.	2026-05-05 15:50:40 +02:00
Marcin Maliszkiewicz	c44625ebdf	auth: add defensive has() guard for role_attributes value column Add a has() check before accessing the value column in role_attributes to tolerate ghost rows with missing regular columns. In practice this is unlikely to be a problem since attributes are not typically revoked, but the guard is added for consistency and defensive programming.	2026-05-05 15:48:01 +02:00
Marcin Maliszkiewicz	797bc28aae	auth: remove unused permissions field from cache role_record The permissions field in role_record was populated by fetch_role() but never read. Authorization uses cached_permissions instead, which is loaded via the permission_loader callback. Remove the dead field and its fetch code. The removed code also did not check for missing columns before accessing the permissions set, which could crash on ghost rows left by the removed auth v2 migration. The migration used INSERT (creating row markers), and when permissions were later revoked, the row marker survived while the permissions column became null.	2026-05-05 15:48:01 +02:00
Andrzej Jackowski	8855e77465	auth: make shutdown the exact reverse of startup The previous parallel stop of the authenticator and authorizer was a micro-optimization that obscured the lifecycle invariant that shutdown should reverse startup. Refs SCYLLADB-1679	2026-04-24 13:34:09 +02:00
Andrzej Jackowski	37a547604f	auth: start authorizer and set permission loader before role manager LDAP role manager starts a pruner fiber that calls reload_all_permissions() which asserts _permission_loader is set. The permission loader calls _authorizer->authorize(), so the authorizer must be started before the loader is set. Start authorizer, then set the permission loader, then start the role manager, ensuring both dependencies are satisfied before the pruner can fire. Fixes SCYLLADB-1679	2026-04-24 13:34:09 +02:00
Andrzej Jackowski	c3e5285d45	auth: stop role manager before clearing permission loader service::stop() cleared the permission loader and stopped the role manager concurrently (via when_all_succeed). The LDAP pruner could be mid-reload at a yield point when the loader was set to null, causing it to call a null function. Stop the role manager first so the pruner is fully drained before the loader is cleared. Fixes SCYLLADB-1679	2026-04-24 13:34:09 +02:00
Andrzej Jackowski	f75e5ac65b	auth: reload LDAP permission cache on local shard only The LDAP role manager's _cache_pruner fiber used invoke_on_all() to reload permissions on every shard. Since auth::service::start() runs on all shards in parallel via invoke_on_all(), the pruner on shard X could call reload_all_permissions() on shard Y before shard Y finished start() and set its permission loader, hitting SCYLLA_ASSERT(_permission_loader). The same cross-shard race existed during shutdown. Each shard runs its own pruner instance, so reloading locally is sufficient — all shards are still covered. This also removes redundant N-squared reload calls. Refs SCYLLADB-1679	2026-04-24 13:06:58 +02:00
Botond Dénes	aecb6b1d76	Merge 'auth: sanitize {USER} substitution in LDAP URL template' from Piotr Smaron `LDAPRoleManager` interpolated usernames directly into `ldap_url_template`, allowing LDAP filter injection and URL structure manipulation via crafted usernames. This PR adds two layers of encoding when substituting `{USER}`: 1. RFC 4515 filter escaping -- neutralises `*`, `(`, `)`, `\`, NUL 2. URL percent-encoding -- prevents `%`, `?`, `#` from breaking `ldap_url_parse`'s component splitting or undoing the filter escaping It also adds `validate_query_template()` at startup to reject templates that place `{USER}` outside the filter component (e.g. in the host or base DN), where filter escaping would be the wrong defense. Compatibility note: Templates with `{USER}` in the host, base DN, attributes, or extensions were previously silently accepted. They are now rejected at startup with a descriptive error. Only templates with `{USER}` in the filter component (after the third `?`) are valid. Fixes: SCYLLADB-1309 Due to severity, should be backported to all maintained versions. Closes scylladb/scylladb#29388 * github.com:scylladb/scylladb: auth: sanitize {USER} substitution in LDAP URL templates test/ldap: add LDAP filter-injection reproducers	2026-04-15 14:40:15 +03:00
Avi Kivity	0ae22a09d4	LICENSE: Update to version 1.1 Updated terms of non-commercial use (must be a never-customer).	2026-04-12 19:46:33 +03:00
Piotr Smaron	477353b15c	auth: sanitize {USER} substitution in LDAP URL templates LDAPRoleManager interpolated usernames directly into ldap_url_template. That allowed LDAP filter metacharacters to change the query, and URL metacharacters such as %, ?, and # to change how ldap_url_parse() split the URL. Apply two layers of encoding when substituting {USER}: 1. RFC 4515 filter escaping -- neutralises filter operators. 2. URL percent-encoding -- prevents ldap_url_parse from misinterpreting %-sequences, ? delimiters, or # fragments. Add validate_query_template() (called from start()) which uses a sentinel round-trip through ldap_url_parse to reject templates that place {USER} outside the filter component. Templates that previously placed {USER} in the host or base DN were silently accepted; they are now rejected at startup with a descriptive error. Change parse_url() to take const sstring& instead of string_view to enforce the null-termination requirement of ldap_url_parse() at the type level. Add regression coverage for %2a, ?, #, and invalid {USER} placement in the base DN, host, attributes, and extensions. Update LDAP authorization docs to document the escaping behavior and the {USER} placement restriction. Fixes: SCYLLADB-1309	2026-04-10 14:00:47 +02:00
Pavel Emelyanov	da6fe14035	transport: test that connection_stage is READY after auth via all process_startup paths The cert-auth path in process_startup (introduced in `20e9619bb1`) was missing _ready = true, _authenticating = false, update_scheduling_group() and on_connection_ready(). The result is that connections authenticated via certificate show connection_stage = AUTHENTICATING in system.clients forever, run under the wrong service-level scheduling group, and hold the uninitialized-connections semaphore slot for the lifetime of the connection. Add a parametrized cluster test that verifies all three process_startup branches result in connection_stage = READY: - allow_all: AllowAllAuthenticator (no-auth path) - password: PasswordAuthenticator (SASL/process_auth_response path) - cert_bypass: CertificateAuthenticator with transport_early_auth_bypass error injection (cert-auth path -- the buggy one) The injection is added to certificate_authenticator::authenticate() so tests can bypass actual TLS certificate parsing while still exercising the cert-auth code path in process_startup. The cert_bypass case is marked xfail until the bug is fixed. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-03-24 18:01:28 +03:00
Piotr Dulikowski	171504c84f	Merge 'auth: migrate some standard role manager APIs to use cache' from Marcin Maliszkiewicz This patchset migrates: query_all_directly_granted, query_all, get_attribute, query_attribute_for_all functions to use cache instead of doing CQL queries. It also includes some preparatory work which fixes cache update order and triggering. Main motivation behind this is to make sure that all calls from service_level_controller::auth_integration are cached, which we achieve here. Alternative implementation could move the whole auth_integration data into auth cache but since auth_integration manages also lifetime and contains service levels specific logic such solution would be too complex for little (if any) gain. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-159 Backport: no, not a bug Closes scylladb/scylladb#28791 * github.com:scylladb/scylladb: auth: switch query_attribute_for_all to use cache auth: switch get_attribute to use cache auth: cache: add heterogeneous map lookups auth: switch query_all to use cache auth: switch query_all_directly_granted to use cache auth: cache: add ability to go over all roles raft: service: reload auth cache before service levels service: raft: move update_service_levels_effective_cache check	2026-03-19 14:37:22 +01:00
Marcin Maliszkiewicz	04bf631d7f	auth: switch query_attribute_for_all to use cache	2026-03-18 09:06:20 +01:00
Marcin Maliszkiewicz	cf578fd81a	auth: switch get_attribute to use cache	2026-03-18 09:06:20 +01:00
Marcin Maliszkiewicz	06d16b6ea2	auth: cache: add heterogeneous map lookups Some callers have only string_view role name, they shouldn't need to allocate sstring to do the lookup.	2026-03-18 09:06:20 +01:00
Marcin Maliszkiewicz	7fdb1118f5	auth: switch query_all to use cache	2026-03-18 09:06:20 +01:00
Marcin Maliszkiewicz	fca11c5a21	auth: switch query_all_directly_granted to use cache	2026-03-18 09:06:20 +01:00
Marcin Maliszkiewicz	6f682f7eb1	auth: cache: add ability to go over all roles This is needed to implement auth service api where we list all roles.	2026-03-18 09:06:20 +01:00
Dario Mirovic	2e4b72c6b9	auth: add maintenance_socket_authorizer GRANT/REVOKE fails on the maintenance socket connections, because maintenance_auth_service uses allow_all_authorizer. allow_all_authorizer allows all operations, but not GRANT/REVOKE, because they make no sense in its context. This has been observed during PGO run failure in operations from ./pgo/conf/auth.cql file. This patch introduces maintenance_socket_authorizer that supports the capabilities of default_authorizer ('CassandraAuthorizer') without needing authorization. Refs SCYLLADB-1070	2026-03-17 19:19:41 +01:00
Marcin Maliszkiewicz	54ef8fca57	auth: remove DEFAULT_SUPERUSER_NAME constant and dead DEFAULT_USER_PASSWORD DEFAULT_SUPERUSER_NAME is no longer referenced after removing the role_part special-casing in describe_roles. DEFAULT_USER_PASSWORD was dead code too.	2026-03-12 08:46:00 +01:00
Marcin Maliszkiewicz	029410e159	auth: use configurable default_superuser in describe_roles Replace the hardcoded meta::DEFAULT_SUPERUSER_NAME comparison with default_superuser(_qp) which reads from the auth_superuser_name config option. This makes the IF NOT EXISTS clause in DESCRIBE output correct for clusters with a non-default superuser name.	2026-03-12 08:45:47 +01:00
Marcin Maliszkiewicz	adc840919b	auth: move default_superuser to common, remove _superuser member Move default_superuser() to auth::meta in common.{hh,cc} and remove the cached _superuser member from both standard_role_manager and password_authenticator. The superuser name comes from config which is immutable at runtime, so caching it is unnecessary.	2026-03-11 16:28:38 +01:00
Marcin Maliszkiewicz	993e06c1ae	auth: use LOCAL_ONE for all auth queries Removes auth-v1 hack for cassandra superuser as auth-v1 code no longer exists. Also CL is not really used when querying raft replicated tables (like auth ones), but LOCAL_ONE is the least confusing one.	2026-03-11 16:27:15 +01:00
Marcin Maliszkiewicz	6d1153687a	auth: remove get_auth_ks_name indirection Replace get_auth_ks_name(qp) with db::system_keyspace::NAME directly. The function always returned the constant "system" and its qp parameter was unused.	2026-03-11 16:26:47 +01:00
Patryk Jędrzejczak	37aeba9c8c	Merge 'raft: add global read barrier to group0_batch::commit and switch auth and service levels' from Marcin Maliszkiewicz This series adds a global read barrier to raft_group0_client, ensuring that Raft group0 mutations are applied on all live nodes before returning to the caller. Currently, after a group0_batch::commit, the mutations are only guaranteed to be applied on the leader. Other nodes may still be catching up, leading to stale reads. This patch introduces a broadcast read barrier mechanism. Calling send_group0_read_barrier_to_live_members after committing will cause the coordinator to send a read barrier RPC to all live nodes (discovered via gossiper) and wait for them to complete. This is a best-effort attempt to get cluster-wide visibility of the committed state before the response is returned to the user. Auth and service levels write paths are switched to use this new mechanism. Fixes https://scylladb.atlassian.net/browse/SCYLLADB-650 Backport: no, new feature Closes scylladb/scylladb#28731 * https://github.com/scylladb/scylladb: test: add tests for global group0_batch barrier feature qos: switch service levels write paths to use global group0_batch barrier auth: switch write paths to use global group0_batch barrier raft: add function to broadcast read barrier request raft: add gossiper dependency to raft_group0_client raft: add read barrier RPC	2026-03-11 10:37:19 +01:00
Gleb Natapov	4660f908f9	auth: drop auth_migration_listener since it does nothing now	2026-03-10 10:46:48 +02:00
Gleb Natapov	1d188f0394	auth: remove legacy auth mode and upgrade code A system needs to be upgraded to use v2 auth before moving to this ScyllaDB version, otherwise the boot will fail.	2026-03-10 10:09:39 +02:00
Marcin Maliszkiewicz	fe79fdf090	auth: switch write paths to use global group0_batch barrier This ensures that auth functions return only after all live nodes have applied our mutations.	2026-03-09 15:15:59 +01:00
Dario Mirovic	fd17dcbec8	auth: do not create default 'cassandra:cassandra' superuser Changes the behavior of default superuser creation. Previously, without configuration 'cassandra:cassandra' credentials were used. Now default superuser creation is skipped if not configured. The two ways to create default superuser are: - Config file - auth_superuser_name and auth_superuser_salted_password fields - Maintenance socket - connect over maintenance socket and CREATE/ALTER ROLE ... Behavior changes: Old behavior: - No config - 'cassandra:cassandra' created - auth_superuser_name only - <name>:cassandra created - auth_superuser_salted_password only - 'cassandra:<password>' created - Both specified - '<name>:<password>' created New behavior: - No config - no default superuser - Requires maintenance socket setup - auth_superuser_name only - '<name>:' created WITHOUT password - Requires maintenance socket setup - auth_superuser_salted_password only - no default superuser - Both specified - '<name>:<password>' created Fixes SCYLLADB-409	2026-03-03 23:42:25 +01:00
Dario Mirovic	9dc1deccf3	auth: remove redundant DEFAULT_USER_NAME from password authenticator Remove redundant DEFAULT_USER_NAME from password_authenticator.cc file. It is just a copy of meta::DEFAULT_SUPERUSER_NAME. Refs SCYLLADB-409	2026-03-03 23:42:25 +01:00
Dario Mirovic	45628cf041	auth: enable role management operations via maintenance socket Introduce maintenance_socket_authenticator and rework maintenance_socket_role_manager to support role management operations. Maintenance auth service uses allow_all_authenticator. To allow role modification statements over the maintenance socket connections, we need to treat the maintenance socket connections as superusers and give them proper access rights. Possible approaches are: 1. Modify allow_all_authenticator with conditional logic that password_authenticator already does 2. Modify password_authenticator with conditional logic specific for the maintenance socket connections 3. Extend password_authenticator, overriding the methods that differ Option 3 is chosen: maintenance_socket_authenticator extends password_authenticator with authentication disabled. The maintenance_socket_role_manager is reworked to lazily create a standard_role_manager once the node joins the cluster, delegating role operations to it. In maintenance mode role operations remain disabled. Refs SCYLLADB-409	2026-03-03 23:41:05 +01:00
Dario Mirovic	b68656b59f	auth: let maintenance_socket_role_manager know if node is in maintenance mode This patch is part of preparations for dropping 'cassandra:cassandra' default superuser. When that is implemented, maintenance_socket_role_manager will have two modes of work: 1. in maintenance mode, where role operations are forbidden 2. in normal mode, where role operations are allowed To execute the role operations, the node has to join a cluster. In maintenance mode the node does not join a cluster. This patch lets maintenance_socket_role_manager know if it works under maintenance mode and returns appropriate error message when role operations execution is requested. Refs SCYLLADB-409	2026-03-03 22:31:35 +01:00
Dario Mirovic	3bef493a35	auth: remove class registrator usage This patch removes class registrator usage in auth module. It is not used after switching to factory functor initialization of auth service. Several role manager, authenticator, and authorizer name variables are removed as well, and hardcoded inside qualified_java_name method, since that is the only place they are ever used. Refs SCYLLADB-409	2026-03-03 22:31:35 +01:00
Dario Mirovic	bfff07eacb	auth: add service constructor with factory functors Auth service can be initialized: - [current] by passing instantiated authorizer, authenticator, role manager - [current] by passing service_config, which then uses class registrator to instantiate authorizer, authenticator, role manager - This approach is easy to use with sharded services - [new] by passing factory functors which instantiate authorizer, authenticator, role manager - This approach is also easy to use with sharded services Refs SCYLLADB-409	2026-03-03 22:31:35 +01:00
Dario Mirovic	e8e00c874b	auth: add transitional.hh file In a follow-up patch in this patch series class registrator will be removed. Adding transitional.hh file will be necessary to expose the authenticator and authorizer. Refs SCYLLADB-409	2026-03-03 22:31:35 +01:00
Marcin Maliszkiewicz	1293b94039	auth: cache: fix permissions iterator invalidation in reload_all_permissions The inner loops in reload_all_permissions iterate role's permissions and _anonymous_permissions maps across yield points. Concurrent load_permissions calls (which don't hold _loading_sem) can emplace into those same maps during a yield, potentially triggering a rehash that invalidates the active iterator. We want to avoid adding semaphore acquire in load_permissions because it's on a common path (get_permissions). Fixing by snapshotting the keys into a vector before iterating with yields, so no long-lived map iterator is held across suspension points.	2026-02-23 12:14:22 +01:00
Marcin Maliszkiewicz	75d4bc26d3	auth/cache: acquire _loading_sem in cross-shard callbacks distribute_role() modifies _roles on non-zero shards via invoke_on_others() without holding _loading_sem. Similarly, load_all()'s invoke_on_others() callback calls prune_all() without the semaphore. When these run concurrently with reload_all_permissions(), which iterates _roles across yield points, an insertion can trigger absl::flat_hash_map::resize(), freeing the backing storage while an iterator still references it. Fix by acquiring _loading_sem on the target shard in both distribute_role()'s and load_all()'s invoke_on_others callbacks, serializing all _roles mutations with coroutines that iterate the map.	2026-02-23 10:30:03 +01:00
Marcin Maliszkiewicz	c11eb73a59	auth: add cache size metrics	2026-02-17 18:18:40 +01:00
Marcin Maliszkiewicz	a23e503e7b	auth: remove old permissions cache	2026-02-17 17:56:27 +01:00
Marcin Maliszkiewicz	9d9184e5b7	auth: use unified cache for permissions	2026-02-17 17:56:27 +01:00
Marcin Maliszkiewicz	7eedf50c12	auth: ldap: add permissions reload to unified cache The LDAP server may change role-chain assignments without notifying Scylla. As a result, effective permissions can change, so some form of polling is required. Currently, this is handled via cache expiration. However, the unified cache is designed to be consistent and does not support expiration. To provide an equivalent mechanism for LDAP, we will periodically reload the permissions portion of the new cache at intervals matching the previously configured expiration time.	2026-02-17 17:56:27 +01:00
Marcin Maliszkiewicz	10996bd0fb	auth: add permissions cache to auth/cache We want to get rid of loading cache because its periodic refresh logic generates a lot of internal load when there are many entries. Also our operation procedures involve tweaking the config while the new unified cache is supposed to work out of the box.	2026-02-17 17:56:27 +01:00
Marcin Maliszkiewicz	03c4e4bb10	auth: add service::revoke_all as main entry point In the following commit we'll need to add some cache related logic (removing resource permissions). This logic doesn't depend on authorizer so it should be managed by the service itself.	2026-02-17 17:56:27 +01:00
Marcin Maliszkiewicz	070d0bfc4c	auth: explicitly life-extend resource in auth_migration_listener Otherwise it's easy to trigger use-after-free when code slightly changes.	2026-02-17 17:56:27 +01:00
Marcin Maliszkiewicz	55d246ce76	auth: bring back previous version of standard_role_manager::can_login Previously, we wanted to make minimal changes with regards to the new unified auth cache. However, as a result, some calls on the hot path were missed. Now we have switched the underlying find_record call to use the cache. Since caching is now at a lower level, we bring back the original code.	2026-01-26 16:04:11 +01:00
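
The two-layer {USER} encoding described in commit 477353b15c can be illustrated with a minimal sketch. This is not ScyllaDB's implementation (which is C++ and validates templates with a sentinel round-trip through ldap_url_parse); the function names below are hypothetical, and the placement check is a simplified assumption that splits the template on `?` and requires {USER} to appear only in the fourth component (the filter).

```python
from urllib.parse import quote

# RFC 4515 section 3: these characters are written as a backslash
# followed by two hex digits when they appear inside a filter value.
_FILTER_SPECIALS = {
    "\\": r"\5c",
    "*": r"\2a",
    "(": r"\28",
    ")": r"\29",
    "\x00": r"\00",
}


def escape_filter_value(value: str) -> str:
    """Layer 1: neutralise LDAP filter metacharacters."""
    return "".join(_FILTER_SPECIALS.get(ch, ch) for ch in value)


def user_in_filter_only(template: str) -> bool:
    """Simplified placement check (an assumption, not the real validator):
    an LDAP URL is host/dn?attributes?scope?filter?extensions, so the
    filter is the component after the third '?'."""
    parts = template.split("?")
    return (
        len(parts) >= 4
        and "{USER}" in parts[3]
        and all("{USER}" not in p for i, p in enumerate(parts) if i != 3)
    )


def substitute_user(template: str, username: str) -> str:
    """Apply filter escaping, then percent-encoding, then substitute."""
    if not user_in_filter_only(template):
        raise ValueError("{USER} must appear only in the filter component")
    # Layer 2: percent-encode everything outside the unreserved set so that
    # '%', '?' and '#' cannot change how the URL is split into components.
    encoded = quote(escape_filter_value(username), safe="")
    return template.replace("{USER}", encoded)
```

The order matters in this sketch: filter escaping runs first, so the backslashes it introduces are themselves percent-encoded, and a later URL decode yields the escaped filter rather than the raw username.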


553 Commits