scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-24 10:30:38 +00:00

Author	SHA1	Message	Date
Avi Kivity	eb74fe784d	auth: convert sprint() to format() sprint() recently became more strict, throwing on sprint("%s", 5). Replace with the more modern format(). Mechanically converted with https://github.com/avikivity/unsprint.	2018-11-01 13:16:17 +00:00
Jesse Haber-Kucharsky	9d27045c76	auth: Shorten `random_device` instance life-span On Fedora 28, creating an instance of `std::random_device` opens a file descriptor for `/dev/urandom` (observed via `strace`). By declaring static thread-local instances of `std::random_device`, these descriptors will be open (barring optimization by the compiler) for the entire duration of the Scylla process's life. However, the `std::random_device` instance is only necessary for initializing the `RandomNumberEngine` for generating salts. With this change, the file-descriptor is closed immediately after the engine is initialized. I considered generalizing this pattern of initialization into a function, but with only two uses (and simple ones) I think this would only obscure things. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Tests: unit (release) Message-Id: <f1b985d99f66e5e64d714fd0f087e235b71557d2.1536697368.git.jhaberku@scylladb.com>	2018-09-12 12:14:21 +01:00
Jesse Haber-Kucharsky	52d3ff057a	auth: Allow different random engines for salt This makes the function useable in more contexts due to flexibility (including in tests), since the state is not captured and the characteristics of salt generation can be customized to the caller's needs.	2018-08-13 13:24:45 -04:00
Jesse Haber-Kucharsky	fd60d61ebf	auth: Split out test for best supported scheme The `generate_salt` function invokes this function internally now. This change means that `generate_salt` is now thread-safe and therefore does not have to be invoked by a single thread only when starting the `password_authenticator`. This further means that `generate_salt` does not need to be part of the public interface of the module, and can be moved to the implementation file.	2018-08-13 13:24:45 -04:00
Jesse Haber-Kucharsky	adf058bd1f	auth: Rename function to use full words	2018-08-13 13:24:45 -04:00
Jesse Haber-Kucharsky	b272d622f8	auth: Move passsword stuff to its own namespace For clarity and nicer function names.	2018-08-13 13:24:45 -04:00
Jesse Haber-Kucharsky	2a40bcb281	auth: Move password handling to its own files While the `password_authenticator` is a complex component with lots of dependencies, password hashing and checking itself is a process with limited logical state and dependencies, which makes it easy to isolate and test.	2018-08-13 13:24:45 -04:00
Jesse Haber-Kucharsky	03cf57db62	auth: Construct `std::random_device` instances once `std::random_device` has a lot of implementation-specific behavior, and as a result we cannot assume much about its performance characteristics. We initialize thread-specific static instances of `std::random_device` once so that we don't have the overhead of invoking the ctor during every invocation of `gensalt`.	2018-08-13 13:24:45 -04:00
Jesse Haber-Kucharsky	fce10f2c6e	auth: Don't use unsupported hashing algorithms In previous versions of Fedora, the `crypt_r` function returned `nullptr` when a requested hashing algorithm was not supported. This is consistent with the documentation of the function in its man page. As of Fedora 28, the function's behavior changes so that the encrypted text is not `nullptr` on error, but instead the string "0". The info pages for `crypt_r` clarify somewhat (and contradict the man pages): Some implementations return `NULL` on failure, and others return an _invalid_ hashed passphrase, which will begin with a `` and will not be the same as SALT. Because of this change of behavior, users running Scylla on a Fedora 28 machine which was upgraded from a previous release would not be able to authenticate: an unsupported hashing algorithm would be selected, producing encrypted text that did not match the entry in the table. With this change, unsupported algorithms are correctly detected and users should be able to continue to authenticate themselves. Fixes #3637. Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <bcd708f3ec195870fa2b0d147c8910fb63db7e0e.1533322594.git.jhaberku@scylladb.com>	2018-08-05 08:57:36 +03:00
Jesse Haber-Kucharsky	e664f9b0c6	Use finite time-outs for internal auth. queries	2018-07-31 11:38:16 -04:00
Nadav Har'El	25bd139508	cross-tree: clean up use of std::random_device() std::random_device() uses the relatively slow /dev/urandom, and we rarely if ever intend to use it directly - we normally want to use it to seed a faster random_engine (a pseudo-random number generator). In many places in the code, we first created a random_device variable, and then using it created a random_engine variable. However, this practice created the risk of a programmer accidentally using the random_device object, instead of the random_engine object, because both have the same API; This hurts performance. This risk materialized in just two places in the code, utils/uuid.cc and gms/gossiper.cc. A patch for to uuid.cc was sent previously by Pawel and is not included in this patch, and the fix for gossiper.{cc,hh} is included here. To avoid risking the same mistake in the future, this patch switches across the code to an idiom where the random_device object is not named, so cannot be accidentally used. We use the following idiom: std::default_random_engine _engine{std::random_device{}()}; Here std::random_device{}() creates the random device (/dev/urandom) and pulls a random integer from it. It then uses this seed to create the random_engine (the pseudo-random number generator). The std::random_device{} object is temporary and unnamed, and cannot be unintentionally used directly. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20180726154958.4405-1-nyh@scylladb.com>	2018-07-26 16:54:58 +01:00
Avi Kivity	187ebdbe46	auth: fix possible use of disengaged optional in has_salted_hash() untyped_result_set_row's cell data type is bytes_opt, and the get_block() accessor accesses the value assuming it's engaged (relying on the caller to call has()). has_unsalted_hash() calls get_blob() without calling has() beforehand, potentially triggering undefined behavior. Fix by using get_or() instead, which also simplifies the caller. I observed failures in Jenkins in this area. It's hard to be sure this is the root cause, since the failures triggered an internal consistency assertion in asan rather than an asan report. However, the error is hard to reproduce and the fix makes sense even if it doesn't prevent the error. See #3480 for the asan error. Fixes #3480 (hopefully). Message-Id: <20180602181919.29204-1-avi@scylladb.com>	2018-06-02 19:46:32 +01:00
Avi Kivity	a99e820bb9	query_processor: require clients to specify timeout configuration Remove implicit timeouts and replace with caller-specified timeouts. This allows removing the ambiguity about what timeout a statement is executed with, and allows removing cql_statement::execute_internal(), which mostly overrode timeouts and consistency levels. Timeout selection is now as follows: query_processor::*_internal: infinite timeout, CL=ONE query_processor::process(), execute(): user-specified consisistency level and timeout All callers were adjusted to specify an infinite timeout. This can be further adjusted later to use the "other" timeout for DCL and the read or write timeout (as needed) for authentication in the normal query path. Note that infinite timeouts don't mean that the query will hang; as soon as the failure detector decides that the node is down, RPC responses will termiante with a failure and the query will fail.	2018-05-14 09:41:06 +03:00
Jesse Haber-Kucharsky	cd0553ca6a	auth: Query custom options from the `authenticator` None of the `authenticator` implementations we have support custom options, but we should support this operation to support the relevant CQL statements.	2018-05-09 21:12:50 -04:00
Jesse Haber-Kucharsky	00f7bc676d	auth: Remove ordering dependence If `auth::password_authenticator` also creates `system_auth.roles` and we fix the existence check for the default superuser in `auth::standard_role_manager` to only search for the columns that it owns (instead of the column itself), then both modules' initialization are independent of one another. Fixes #3319.	2018-03-25 22:38:11 -04:00
Jesse Haber-Kucharsky	881656cea4	auth: Wait for schema agreement Some modules of `auth` create a default superuser if it does not already exist. The existence check is through a SELECT query with quorum consistency level. If the schema for the applicable tables has not yet propagated to a peer node at the time that it processes this query, then the `storage_proxy` will print an error message to the log and the query will be retried. Eventually, the schema will propagate and the default superuser will be created. However, the error message in the log causes integration tests to fail (and is somewhat annoying). Now, prior to querying for existing data, we wait for all gossip peers to have the same schema version as we do. Fixes #2852.	2018-03-25 22:38:08 -04:00
Jesse Haber-Kucharsky	9117a689cf	auth: Fix `const` correctness This patch came about because of an important (and obvious, in hindsight) realization: instances of the authorizer, role manager, and authenticator are clients for access-control state and not the state itself. This is reflected directly in Scylla: `auth::service` is sharded across cores and this is possible because each instance queries and modifies the same global state. To give more examples, the value of an instance of `std::vector<int>` is the structure of the container and its contents. The value of `int file_descriptor` is an identifier for state maintained elsewhere. Having watched an excellent talk by Herb Sutter [1] and having read an informative blog post [2], it's clear that a member function marked `const` communicates that the observable state of the instance is not modified. Thus, the member functions of the role-manager, authenticator, and authorizer clients should not be marked `const` only if the state of the client itself is observably changed. By this principle, member functions which do not change the state of the client, but which mutate the global state the client is associated with (for example, by creating a role) are marked `const`. The `start` (and `stop`) functions of the client have the dual role of initializing (finalizing) both the local client state and the external state; they are not marked `const`. [1] https://herbsutter.com/2013/01/01/video-you-dont-know-const-and-mutable/ [2] http://talesofcpp.fusionfenix.com/post-2/episode-one-to-be-or-not-to-be-const	2018-03-14 01:32:43 -04:00
Jesse Haber-Kucharsky	fbc97626c4	auth: Migrate legacy data on boot This change allows for seamless migration of the legacy users metadata to the new role-based metadata tables. This process is summarized in `docs/migrating-from-users-to-roles.md`. In general, if any nondefault metadata exists in the new tables, then no migration happens. If, in this case, legacy metadata still exists then a warning is written to the log. If no nondefault metadata exists in the new tables and the legacy tables exist, then each node will copy the data from the legacy tables to the new tables, performing transformations as necessary. An informational message is written to the log when the migration process starts, and when the process ends. During the process of copying, data is overwritten so that multiple nodes racing to migrate data do not conflict. Since Apache Cassandra's auth. schema uses the same table for managing roles and authentication information, some useful functions in `roles-metadata.hh` have been added to avoid code duplication. Because a superuser should be able to drop the legacy users tables from `system_auth` once the cluster has migrated to roles and is functioning correctly, we remove the restriction on altering anything in the "system_auth" keyspace. Individual tables in `system_auth` are still protected later in the function. When a cluster is upgrading from one that does not support roles to one that does, some nodes will be running old code which accesses old metadata and some will be running new code which access new metadata. With the help of the gossiper `feature` mechanism, clients connecting to upgraded nodes will be notified (through code in the relevant CQL statements) that modifications are not allowed until the entire cluster has upgraded.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	cf5f6aa4c5	auth: Fix fragile variable life-times According to the Seastar convention, a parameter passed to a function taking a reference parameter must live for the duration of the execution of the returned future. When possible, variables are statically allocated. When this is not possible, we use `do_with`.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	45631604b0	auth: Use `string_view` for paramters	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	c4f686c10f	auth: Put definitions inside namespace	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	45c6d13812	auth: Remove useless try-catch block This looks to have been a typo in the original porting work.	2018-02-14 14:15:58 -05:00
Jesse Haber-Kucharsky	e6363e15de	auth/resource: Construct from ctor The motivation behind this change is the idea that constructing a new instance of an object is the job of the constructor. One big benefit of this structure (with the addition of helpers for convenience) is that calls for emplacing instances (like `std::make_shared`, or `std::vector::emplace_back`) work without any difficulty. This would not be true for static construction functions.	2018-02-14 14:15:58 -05:00
Jesse Haber-Kucharsky	053b6b4d04	auth: Unify formatting The goal is for all files in `auth/` to conform to the Seastar/Scylla `coding-style.md` document.	2018-02-14 14:15:58 -05:00
Jesse Haber-Kucharsky	a4c7aee238	auth: Fix includes	2018-02-14 14:15:58 -05:00
Jesse Haber-Kucharsky	de33124c39	Don't store `authenticated_user` in `shared_ptr` All we require are value semantics. `client_state` still stores `authenticated_user` in a `shared_ptr`, but the behavior of that class is complex enough to warrant its own discussion/design/refactor.	2018-02-14 14:15:58 -05:00
Jesse Haber-Kucharsky	741d215516	auth: Switch to roles from users This is a large change, but it's a necessary evil. This change brings us to a minimally-functional implementation of roles. There are many additional changes that are necessary, including refined grammar, bug fixes, code hygiene, and internal code structure changes. In the interest of keeping this patch somewhat read-able, those changes will come in subsequent patches. Until that time, roles are still marked "unimplemented". IMPORTANT: This code does not include any mechanism for transitioning a cluster from user-based access-control to role-based access control. All existing access-control metadata will be ignored (though not deleted). Specific changes: - All user-specific CQL statements now delegate to their roles equivalent. The statements are effectively the same, but CREATE USER will include LOGIN automatically. Also, LIST USERS only lists roles with LOGIN. - A call to LIST PERMISSIONS will now also list permissions of roles that have been granted to the caller, in addition to permissions which have been granted directly. - Much of the logic of creating, altering, and deleting roles has been moved to `auth::service`, since these operations require cooperation between the authenticator, authorizer, and role-manager. - LIST USERS actually works as expected now (fixes #2968).	2018-02-14 14:15:57 -05:00
Jesse Haber-Kucharsky	b3dc90d5d2	auth: Refactor authentication options The set of allowed options is quite small, so we benefit from a static representation (member variables) over a dynamic map. We also logically move the "OPTIONS" option to the domain of the authenticator (from user management), since this is where it is applied. This refactor also aims to reduce compilation time by moving `authentication_options` into its own header file. While changes to `user_options` were necessary to accommodate the new structure, that class will be deprecated shortly in the switch to roles. Therefore, the changes are strictly temporary.	2018-02-14 14:15:57 -05:00
Duarte Nunes	81b1455b22	auth: Replace delayed_tasks with sleep_abortable delayed_tasks has a bug that if the object is destroyed while a timer callback is queued, the callback will then try to access freed memory. This could be fixed by providing a stop() function that waits for pending callbacks, but we can just replace the whole thing by levering the abort_source-enabled exponential_backoff_retry.	2017-12-28 13:00:28 +00:00
Jesse Haber-Kucharsky	aea262cdc4	auth/resource.hh: Rename `resource_ids`	2017-12-06 14:39:40 -05:00
Jesse Haber-Kucharsky	1bb22bb190	auth/resource: Generalize to different kinds This change generalizes the implementation of a `resource` to many different kinds of resources, though there is still only one kind (`data`). In the future, we also expect resource kinds for roles, user-defined functions (UDFs), and possibly on particular REST end-points. I considered several approaches to generalizing to different kinds of resources. One approach is to have a base class that is inherited from by different resource kinds. The common functionality would be accessed through virtual member functions and kind-specific functions would exist in sub-classes. I rejected this approach because dealing with different kinds of resources uniformly requires storage and life-time management through something like `std::unique_ptr<auth::resource>`, which means that we lose value semantics (including comparison) and must deal with complications around ownership. Another option was to use `boost::variant` (or, in future, `std::variant`). This is closer to what we want, since there a static set of resource kinds that we support. I rejected this approach for two reasons. The first is that all resource kinds share the same data (a list of segments and a root identifier), which would be duplicated in each type that composed the variant. The second is that the complexity and source-code overhead of `boost::variant` didn't seem warranted. The solution I ended up with is home-grown variant. All resources are described in the same `final` class: `auth::resource`. This class has value semantics, supports equality comparison, and has a strict ordering. All resources have in common a tag ("kind") and a list of parts. Most operations on resources don't care about the kind of resource (like getting its name, parsing a name, querying for the parent, etc). These are just member functions of the class. When we care about a kind-specific interpretation of a resource, we can produce a "view" of the resource. For example, `data_resource_view` allows for accessing the (optional) keyspace and table names. I anticipate in the future to add functions for creating role resources (`auth::resource::role`) and also `role_resource_view`. The functional behaviour of the system should be unchanged with this patch. I've added new unit tests in `auth_resource_test.cc` and removed the old test from `auth_test.cc`. Fixes #3027.	2017-12-06 14:37:56 -05:00
Jesse Haber-Kucharsky	8fe53ecf78	auth: Rename `data_resource` to `resource` The implementation and interface of `auth::resource` will change soon to support different kinds of resources beyond just data (keyspaces and tables).	2017-12-06 10:18:05 -05:00
Jesse Haber-Kucharsky	ba6a41d397	auth: Switch to sharded service This change appears quite large, but is logically fairly simple. Previously, the `auth` module was structured around global state in a number of ways: - There existed global instances for the authenticator and the authorizer, which were accessed pervasively throughout the system through `auth::authenticator::get()` and `auth::authorizer::get()`, respectively. These instances needed to be initialized before they could be used with `auth::authenticator::setup(sstring type_name)` and `auth::authorizer::setup(sstring type_name)`. - The implementation of the `auth::auth` functions and the authenticator and authorizer depended on resources accessed globally through `cql3::get_local_query_processor()` and `service::get_local_migration_manager()`. - CQL statements would check for access and manage users through static functions in `auth::auth`. These functions would access the global authenticator and authorizer instances and depended on the necessary systems being started before they were used. This change eliminates global state from all of these. The specific changes are: - Move out `allow_all_authenticator` and `allow_all_authorizer` into their own files so that they're constructed like any other authenticator or authorizer. - Delete `auth.hh` and `auth.cc`. Constants and helper functions useful for implementing functionality in the `auth` module have moved to `common.hh`. - Remove silent global dependency in `auth::authenticated_user::is_super()` on the auth* service in favour of a new function `auth::is_super_user()` with an explicit auth* service argument. - Remove global authenticator and authorizer instances, as well as the `setup()` functions. - Expose dependency on the auth* service in `auth::authorizer::authorize()` and `auth::authorizer::list()`, which is necessary to check for superuser status. - Add an explicit `service::migration_manager` argument to the authenticators and authorizers so they can announce metadata tables. - The permissions cache now requires an auth* service reference instead of just an authorizer since authorizing also requires this. - The permissions cache configuration can now easily be created from the DB configuration. - Move the static functions in `auth::auth` to the new `auth::service`. Where possible, previously static resources like the `delayed_tasks` are now members. - Validating `cql3::user_options` requires an authenticator, which was previously accessed globally. - Instances of the auth* service are accessed through `external` instances of `client_state` instead of globally. This includes several CQL statements including `alter_user_statement`, `create_user_statement`, `drop_user_statement`, `grant_statement`, `list_permissions_statement`, `permissions_altering_statement`, and `revoke_statement`. For `internal` `client_state`, this is `nullptr`. - Since the `cql_server` is responsible for instantiating connections and each connection gets a new `client_state`, the `cql_server` is instantiated with a reference to the auth* service. - Similarly, the Thrift server is now also instantiated with a reference to the auth* service. - Since the storage service is responsible for instantiating and starting the sharded servers, it is instantiated with the sharded auth* service which it threads through. All relevant factory functions have been updated. - The storage service is still responsible for starting the auth* service it has been provided, and shutting it down. - The `cql_test_env` is now instantiated with an instance of the auth* service, and can be accessed through a member function. - All unit tests have been updated and pass. Fixes #2929.	2017-11-15 23:22:42 -05:00
Jesse Haber-Kucharsky	41612ee577	auth: Make the QP an explicit dependency Rather than have all uses of the QP in auth reference global variables, we supply a QP reference to both the authenticator and authorizer on construction. The caller still references a global variable when constructing the instances, but fixing this problem is a much larger task that is out of scope of this change.	2017-11-15 23:19:13 -05:00
Jesse Haber-Kucharsky	157e22a4f0	auth: Unify Java class name attributes	2017-11-15 23:19:00 -05:00
Jesse Haber-Kucharsky	9aff5d9a77	auth: Make life-time control more consistent	2017-11-15 23:18:44 -05:00
Jesse Haber-Kucharsky	5825e37310	auth: Move metadata constants This change is motivated partly be aesthetics, but more significantly due to the future work to refactor `auth` into a sharded service. Since doing so will require writing `auth::auth` from scratch, these constants (and other common functionality) need a new home.	2017-11-15 23:18:42 -05:00
Jesse Haber-Kucharsky	5c39a2cc15	auth: Fix static constant initialization Using "Meyer's singletons" eliminate the problem of static constant initialization order because static variables inside functions are initialized only the first time control flow passes over their declaration. Fixes #2966.	2017-11-15 23:16:52 -05:00
Calle Wilund	cc28cf838c	password_auth: Return actual generated salt from gensalt Fixes: 2898 Typo error in gensalt(). Only returned selected hash method, not the random salt bytes. Does not prevent the hash function from operating, but strength is ever so reduced. Message-Id: <20171016130505.25593-2-calle@scylladb.com>	2017-10-16 14:07:46 +01:00
Calle Wilund	57c5f13166	password_auth: Keep crypt_data as thread local Fixes: 2887 Speeds up password hashing ever so slightly. Message-Id: <20171016130505.25593-1-calle@scylladb.com>	2017-10-16 14:07:42 +01:00
Calle Wilund	b96a7ae656	auth: Make authenticator/authorizer use actual name based lookup Allowing for pluggable auth objects. Note: requires "class_registrator: Fix qualified name matching + provider helpers" patch previously sent.	2017-10-04 12:44:44 +02:00
Avi Kivity	e44517851e	untyped_result_set: reduce dependencies Forward-declare untyped_result_set and untyped_result_set_row, and remove the include from query_processor.hh. Message-Id: <20170916170859.27612-3-avi@scylladb.com>	2017-09-18 15:15:15 +02:00
Avi Kivity	ebaeefa02b	Merge seatar upstream (seastar namespace) - introcduced "seastarx.hh" header, which does a "using namespace seastar"; - 'net' namespace conflicts with seastar::net, renamed to 'netw'. - 'transport' namespace conflicts with seastar::transport, renamed to cql_transport. - "logger" global variables now conflict with logger global type, renamed to xlogger. - other minor changes	2017-05-21 12:26:15 +03:00
Avi Kivity	a592573491	Remove exception specifications C++17 removed exception specifications from the language, and gcc 7 warns about them even in C++14 mode. Remove them from the code base.	2017-05-05 17:02:31 +03:00
Calle Wilund	5b815b81b4	auth::password_authenticator: Ensure exceptions are processed in continuation Fixes #1718 (even more) Message-Id: <1475497389-27016-1-git-send-email-calle@scylladb.com>	2016-10-03 14:49:59 +02:00
Calle Wilund	d24d0f8f90	auth::password_authenticator: "authenticate" should not throw undeclared excpt Fixes #1718 Message-Id: <1475487331-25927-1-git-send-email-calle@scylladb.com>	2016-10-03 12:53:30 +03:00
Calle Wilund	956ee87e12	auth::authenticator: Change "protected_resources" to return reference It it an immutable static value anyway.	2016-04-19 11:49:05 +00:00
Pekka Enberg	38a54df863	Fix pre-ScyllaDB copyright statements People keep tripping over the old copyrights and copy-pasting them to new files. Search and replace "Cloudius Systems" with "ScyllaDB". Message-Id: <1460013664-25966-1-git-send-email-penberg@scylladb.com>	2016-04-08 08:12:47 +03:00
Calle Wilund	980681d28e	auth: Add a simplistic "schedule" for auth db setup Only difference from previous sleep is that we will explicitly delete the objects if the process terminates before tasks are run. I.e. make ASas happier. Message-Id: <1453295521-29580-1-git-send-email-calle@scylladb.com>	2016-01-20 19:31:14 +02:00
Calle Wilund	9a4d45e19d	auth::auth/authenticator: user storage and authentication User db storage + login/pwd db using system tables. Authenticator object is a global shard-shared singleton, assumed to be completely immutable, thus safe. Actual login authentication is done via locally created stateful object (sasl challenge), that queries db. Uses "crypt_r" for password hashing, vs. origins use of bcrypt. Main reason is that bcrypt does not exist as any consistent package that can be consumed, so to guarantee full compatibility we'd have to include the source. Not hard, but at least initially more work than worth.	2016-01-13 15:35:35 +00:00

50 Commits