scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-09 00:13:31 +00:00

Author	SHA1	Message	Date
Botond Dénes	c4b5249a46	backlog_controller::adjust(): fix heap-overflow Make sure idx will not be equal to _control_points.size() (and thus overflow the vector) when looking for the first control-point with a backlog not smaller then the current one, by stopping when it's equal to _control_points.size() - 1. Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <47841592792573d820650d570fa1ab7e58bdac2c.1518700405.git.bdenes@scylladb.com>	2018-02-26 13:47:38 +02:00
Avi Kivity	8fe2414b11	Merge seastar upstream * seastar 383ccd6...f841d2d (8): > Merge "Randomize task queue in debug mode" from Duarte > tutorial: document seastar::thread > tutorial: add missing seastar namespace > tutorial: note about asynchronous functions throwing exceptions > thread: stop backtraces on aarch64 from underflowing the stack > Revert "core:🧵 ARM64 version of annotating the frame" > core:🧵 ARM64 version of annotating the frame > core/future-util: Release exception in repeater	2018-02-26 12:54:35 +02:00
Paweł Dziepak	b103139e4f	configure.py: do not ignore optimisation flags Release mode flags are properly propagated through seastar --optflags flag, but debug mode flags aren't. This is problematic since they are used to enable additional debugging features. After this patch we will end up with some duplicate flags, but that's not really a problem. Message-Id: <20180223173617.15199-1-pdziepak@scylladb.com>	2018-02-25 17:09:07 +02:00
Botond Dénes	206e7d40d4	restricted_mutation_reader: switch to std::variant Tests: unit-tests(release) Signed-off-by: Botond Dénes <bdenes@scylladb.com> Message-Id: <a8930b764171db131d9d8d5fe4035014ecb452f4.1519391304.git.bdenes@scylladb.com>	2018-02-25 14:35:57 +02:00
Paweł Dziepak	6b66e4833b	mvcc: avoid ubsan warning about uninitialised boolean Message-Id: <20180223160133.21383-1-pdziepak@scylladb.com>	2018-02-23 16:54:23 +00:00
Jesse Haber-Kucharsky	82c8104c72	cql_test_env: Ignore error if user already exists When a `cql_test_env` points to a data directory that was previously populated with `cql_test_env`, then the "tester" user will already exist. This is not an error, so we can just ignore the exception. Fixes #3224. Tests: unit (debug) Signed-off-by: Jesse Haber-Kucharsky <jhaberku@scylladb.com> Message-Id: <7729e5a98d8020a7ed1b6d12d8726559f0850f9d.1519315698.git.jhaberku@scylladb.com>	2018-02-22 19:30:50 +01:00
Raphael S. Carvalho	f59f423f3c	Make sstable loading faster by not invoking all shards for each sstable Before `312bd9ce25`, boot had to call all shards for each sstable such that they would agree/disagree on their deletion, an atomic deletion manager requirement. After its removal, we can afford to call only the shards that own a given sstable. Reducing the operation on each sstable from (SSTABLES) * (SHARD_COUNT) to usually (SSTABLES). It may be the same as before after resharding, but resharding is an one-off operation. Boot time should be significantly reduced for nodes with a high smp count and column family using leveled strategy (which can end up with thousands of sstables). Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20180220032554.17776-1-raphaelsc@scylladb.com>	2018-02-22 09:39:56 +00:00
Amnon Heiman	edcfab3262	dist/docker: Add support for housekeeping This patch takes a modified version of the Ubuntu 14.04 housekeeping service script and uses it in Docker to validate the current version. To disable the version validation, pass the --disable-version-check flag when running the container. Message-Id: <20180220161231.1630-1-amnon@scylladb.com>	2018-02-21 09:26:02 +02:00
Duarte Nunes	e75f7c41d9	Merge 'Proper clean-up on closing index_reader' from Vladimir With the changes introduced in #2981 and #3189, the lifetime management of the objects used by index_reader became more complicated. This patchset addresses the immediate problems caused by lack of proper handling. The more holistic approach to this will take more time and is to be implemented under #3220. The current fix, however, should be good enought as a stop-gap solution. * 'issues/3213/v3' of https://github.com/argenet/scylla: Close promoted index streams when closing index_readers. Support proper closing of prepended_input_stream.	2018-02-21 01:02:16 +00:00
Vladimir Krivopalov	c996191411	Close promoted index streams when closing index_readers. Promoted index input streams must be explicitly closed when closing the index_reader in order to ensure all the pending read-aheads are completed. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-02-20 16:04:15 -08:00
Vladimir Krivopalov	8d52d809f7	Support proper closing of prepended_input_stream. When the stream is being closed, the call is forwarded to the stored data_source. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com>	2018-02-20 16:04:05 -08:00
Vladimir Krivopalov	721bd3eef6	Added missing 'override' to skip() in buffer_input_stream and prepended_input_stream. Signed-off-by: Vladimir Krivopalov <vladimir@scylladb.com> Message-Id: <4e91bead8de7f6fa9b3bfdab8bda73efdb22749d.1519152303.git.vladimir@scylladb.com>	2018-02-20 19:49:11 +00:00
Pekka Enberg	f1f691b555	Merge "Add the GoogleCloudSnitch" from Vlad "This series adds the GoogleCloudSnitch. Fixes #1619" * 'google-cloud-snitch-v4' of https://github.com/vladzcloudius/scylla: config: uncomment/add the supported snitches description tests: added gce_snitch_test locator::gce_snitch: implementation of the GoogleCloudSnitch locator::snitch_base: properly log the failure during the snitch startup	2018-02-19 15:58:56 +02:00
Paweł Dziepak	d97eebe82d	tests/cql3: increase TTL to avoid spurious failures The test inserts some values with a TTL of 1 second and then reads them back expecting them not to be expired yet. That may not always be the case if the machine is slow and we are running in the debug mode. Increasising the TTLs by x100 should help avoid these false positives. Message-Id: <20180219133816.17452-1-pdziepak@scylladb.com>	2018-02-19 15:40:19 +02:00
Pekka Enberg	bd365a10d3	Merge "Add an API to get all active repairs" from Amnon "This series adds an API to return the active repairs by their IDs. After this series a call to: curl -X GET --header "Accept: application/json" "http://localhost:10000/storage_service/active_repair/" Will return an array with the ids of the active repairs. Fixes #3193" * 'amnon/get_active_repairs_v3' of github.com:scylladb/seastar-dev: API: Add get active repair api repair: Add a get_active_repairs function to return the active repair	2018-02-19 15:32:17 +02:00
Amnon Heiman	4a8f67aa01	conf: Remove unsupported 'stream_throughput_outbound_megabits_per_sec' option stream_throughput_outbound_megabits_per_sec is not supported and is found in the unsupported part of scylla.yaml. This patch removes it from the supported part of the file. Fixes #2876 Signed-off-by: Amnon Heiman <amnon@scylladb.com> Message-Id: <20180219111421.30687-1-amnon@scylladb.com>	2018-02-19 15:16:23 +02:00
Duarte Nunes	d394b30882	tests/flush_queue_test: Ensure queue is closed before being destroyed Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180217172008.27551-1-duarte@scylladb.com>	2018-02-19 13:10:28 +00:00
Duarte Nunes	294326b5b1	tests/commitlog_test: Close file Operations on a append_challenged_posix_file_impl schedule asynchronous operations when they are executed, which capture the file object. To synchronize with them and prevent use-after-free, we need to call close() before destroying the file. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180217170556.27330-1-duarte@scylladb.com>	2018-02-19 13:10:14 +00:00
Duarte Nunes	ac55210677	tests/logalloc_test: Ensure regions are reclaimed in order This test relied on task execution order to work correctly. Namely, it relied on parent regions being reclaimed before child regions (reclaiming is an asynchronous process started by a call to start_reclaiming()). This order is necessary because child regions don't know about parent regions when calculating the biggest region that should be reclaimed. We fix this by forcing the reclaim order. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180217121655.26057-1-duarte@scylladb.com>	2018-02-19 13:09:59 +00:00
Duarte Nunes	f665f1ab97	db/commitlog: Close the segment file Operations on a segment's underlying append_challenged_posix_file_impl, such as truncate(), schedule asynchronous operations when they are executed, which capture the file object. To synchronize with them and prevent use-after-free, we need to call close() and only delete the segment and file when the returned future resolves. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180216235754.24257-1-duarte@scylladb.com>	2018-02-19 13:09:41 +00:00
Duarte Nunes	7004f6c7ff	db/commitlog: Actually prevent new requests during shutdown When shutting down the commitlog we try to block all new requests by acquiring all available resources. We were, however, letting go of the semaphore permits too early, before closing the gate and shutting down the active segments. Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180216234826.24111-1-duarte@scylladb.com>	2018-02-19 13:09:26 +00:00
Duarte Nunes	9ce0be60d4	utils/flush_queue: Remove unused function Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180216234502.23931-1-duarte@scylladb.com>	2018-02-19 13:09:11 +00:00
Duarte Nunes	4fdcd6c92f	tests/serialized_action_test: Don't rely on task execution order Signed-off-by: Duarte Nunes <duarte@scylladb.com> Message-Id: <20180216191050.21902-1-duarte@scylladb.com>	2018-02-19 13:08:58 +00:00
Duarte Nunes	03608d269e	Merge 'On the road to roles' from Jesse This series takes Scylla most of the way to supporting roles, and eliminates old user-based code. All the old user-based CQL statements and functionality should exist as they did before, except now they are backed internally by roles. While all the functionality for supporting roles should be present, role-specific features like granting a role to another role still warn as "unimplemented". This will continue until the next series addresses the final touches. These remaining items are: - A slightly revised CQL syntax consistent with Apache Cassandra's revised role syntax. - A user is automatically granted permissions on resources they create. Users running a previous version of Scylla should be able to seamlessly upgrade to a version of Scylla with this series merged. When a newly upgraded node starts, it detects the presence of old metadata and copies it to the new metadata tables if no nondefault new metadata yet exists. A new gossiper feature flag, ROLES, also ensures that access-control data is not modified while a cluster is in a partially-upgraded state. If, when the cluster is in a partially upgraded state, a client connects to an un-upgraded node then likely the change will not be propogated to the new metadata table. We will document that changes to access-control are not supported while upgrading in order to account for both cases (a client connecting to an upgraded and a non-upgraded node). All unit tests pass (except those which also fail on `master`). I've run auth-related dtests and they all pass, except for tests which depend on the old security model and which are therefore invalid. Upstream dtests have been updated to account for this new security model, and I will open an appropriate pull request to to similarly update our own version. I have also done a test-run cluster upgrade procedure with ccm consisting of a 3 node cluster. I began by creating the cluster from `master` and increasing the replication factor of the `system_auth` keyspace to 3 and repairing the nodes. I then created several users and granted them permissions on some resources. I then stopped a node, updated its hardlinked executable to Scylla built from this patch series , and restarted the node. I observed the migration of legacy data starting and finishing. Connecting to the node, I observed all the new roles functionality was working correctly. I verified that attempting to change access-control information failed with a message about an upgrading cluster. I repeated the process, node by node, with the remaining two nodes and finally observed that the entire cluster had upgraded and that I could modify access-control information freely. I will encapsulate this test into a dtest if possible. Fixes #1941. * 'jhk/switch_to_roles/v6' of https://github.com/hakuch/scylla: (83 commits) cql3: Remove some unimplemented warnings cql3: Prevent unhandled exception for anonymous user auth: Add alias for set of role names auth: Revoke permissions on dropped role resources auth: Move definition to corresponding .cc file cql3: Fix life-time of `user` from `client_state` auth: Migrate legacy data on boot auth: Check protected resources of the role-manager auth: Protect authenticator resources service/client_state: Correct erroneous comment client_state: Fix error message cql3: Fix error handling for GRANT and REVOKE auth: Remove unnecessary `sstring` allocation cql3: Rename variables to reflect roles auth: Decouple authorization and role management auth: Add code to expand a resource family cql: Also add `username` col. for LIST PERMISSIONS cql3: Fix error handling in LIST PERMISSIONS auth: Change error messages to pass dtests cql3: Handle errors more precisely for roles ...	2018-02-16 13:57:29 +00:00
Tomasz Grabiec	9c3e56fb16	tests: row_cache: Improve test for snapshot consistency on eviction Reproduces https://github.com/scylladb/scylla/issues/3215. Message-Id: <1518710592-21925-1-git-send-email-tgrabiec@scylladb.com>	2018-02-15 16:48:23 +00:00
Tomasz Grabiec	b0b57b8143	mvcc: Do not move unevictable snapshots to cache Commit `6ccd317` introduced a bug in partition_entry::evict() where a partition entry may be partially evicted if there are non-evictable snapshots in it. Partially evicting some of the versions may violate consistency of a snapshot which includes evicted versions. For one, continuity flags are interpreted realtive to the merged view, not within a version, so evicting from some of the versions may mark reanges as continuous when before they were discontinuous. Also, range tombtsones of the snapshot are taken from all versions, so we can't partially evict some of them without marking all affected ranges as discontinuous. The fix is to revert back to full eviciton, and avoid moving non-evictable snapshots to cache. When moving whole partition entry to cache, we first create a neutral empty partition entry and then merge the memtable entry into it just like we would if the entry already existed. Fixes #3215. Tests: unit (release) Message-Id: <1518710592-21925-2-git-send-email-tgrabiec@scylladb.com>	2018-02-15 16:48:07 +00:00
Paweł Dziepak	1e218e2b80	Merge "Fixes for exception safety in cache and LSA" from Tomasz "Fixes two issues: - update may abort if allocation of an empty partition_version fails - LSA region construction is not exception safe, it may leave the misconstructed region registered if allocation inside region_group::add() fails." * tag 'tgrabiec/exception-safety-cache-update-v2' of github.com:scylladb/seastar-dev: tests: row_cache: Add test for exception safety of updates from memtable tests: flat_reader_assertions: Improve failure message cache: Handle exceptions from make_evictable() tests: Disable failure injection around background compactor lsa: Disable allocation failure injection inside merge() lsa: Make region deregistration robust against duplicates lsa: Make region allocation exception safe	2018-02-15 10:32:08 +00:00
Tomasz Grabiec	b3415880b2	tests: row_cache: Add test for exception safety of updates from memtable	2018-02-15 10:13:02 +01:00
Jesse Haber-Kucharsky	2348c303df	cql3: Remove some unimplemented warnings While there are some small remaining features for roles, all the old user-based statements still exist as they did before (except now they're backed by roles) and should not log warnings.	2018-02-14 14:16:00 -05:00
Jesse Haber-Kucharsky	114cfd4e5a	cql3: Prevent unhandled exception for anonymous user Since `validate` is called after `check_access`, an anonymous user would not get the expected error message about restrictions on anonymous users.	2018-02-14 14:16:00 -05:00
Jesse Haber-Kucharsky	a83af20311	auth: Add alias for set of role names This shortens some type names considerably.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	39a44e3494	auth: Revoke permissions on dropped role resources Previously, when a table or keyspace was dropped, the authorizer (through a `migration_listener`) automatically dropped all permissions granted on that resource. Likewise, when a role is granted permissions and the role is dropped, all permissions granted to the role are dropped. In this change, we now treat role resources just like table and keyspace resources: if a permission is granted on a role (like "GRANT AUTHORIZE ON ROLE qa TO phil") and the "qa" role is dropped, then all permissions on the "qa" role resource are also dropped.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	e6d9d53eca	auth: Move definition to corresponding .cc file	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	89b5bf2d7a	cql3: Fix life-time of `user` from `client_state`	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	fbc97626c4	auth: Migrate legacy data on boot This change allows for seamless migration of the legacy users metadata to the new role-based metadata tables. This process is summarized in `docs/migrating-from-users-to-roles.md`. In general, if any nondefault metadata exists in the new tables, then no migration happens. If, in this case, legacy metadata still exists then a warning is written to the log. If no nondefault metadata exists in the new tables and the legacy tables exist, then each node will copy the data from the legacy tables to the new tables, performing transformations as necessary. An informational message is written to the log when the migration process starts, and when the process ends. During the process of copying, data is overwritten so that multiple nodes racing to migrate data do not conflict. Since Apache Cassandra's auth. schema uses the same table for managing roles and authentication information, some useful functions in `roles-metadata.hh` have been added to avoid code duplication. Because a superuser should be able to drop the legacy users tables from `system_auth` once the cluster has migrated to roles and is functioning correctly, we remove the restriction on altering anything in the "system_auth" keyspace. Individual tables in `system_auth` are still protected later in the function. When a cluster is upgrading from one that does not support roles to one that does, some nodes will be running old code which accesses old metadata and some will be running new code which access new metadata. With the help of the gossiper `feature` mechanism, clients connecting to upgraded nodes will be notified (through code in the relevant CQL statements) that modifications are not allowed until the entire cluster has upgraded.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	8be0165713	auth: Check protected resources of the role-manager A new function `auth::service::is_protected` checks the protected-resource set of all access-control modules (including the role-manager).	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	8440140465	auth: Protect authenticator resources A typo meant that only the authorizer resources were protected.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	617e432540	service/client_state: Correct erroneous comment	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	e27cfd4dda	client_state: Fix error message Now that resources are not just keyspaces and tables, the word "schema" doesn't make sense.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	f9f03bc2e1	cql3: Fix error handling for GRANT and REVOKE This change gets rid of duplicated code for checking if the grantee or revokee exist by moving this functionality to the auth. service.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	e18adbcb3e	auth: Remove unnecessary `sstring` allocation The authorizer now accepts parameters by `string_view`.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	c1a03dbf54	cql3: Rename variables to reflect roles	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	5be16247cc	auth: Decouple authorization and role management auth: Decouple authorization and role management Access control in Scylla consists of three main modules: authentication, authorization, and role-management. Each of these modules is intended to be interchangeable with alternative implementations. The `auth::service` class composes these modules together to perform all access-control functionality, including caching. This architecture implies two main properties of the individual access-control modules: - Independence of modules. An implementation of authentication should have no dependence or knowledge of authorization or role-management, for example. - Simplicity of implementing the interface. Functionality that is common to all implementations should not have to be duplicated in each implementation. The abstract interface for a module should capture only the differences between particular implementations. Previously, the authorization interface depended on an instance of `auth::service` for certain operations, since it required aggregation over all the roles granted to a particular role or required checking if a given role had superuser. This change decouples authorization entirely from role-management: the authorizer now manages only permissions granted directly to a role, and not those inherited through other roles. When a query needs to be authorized, `auth::service::get_permissions` first uses the role manager to check if the role has superuser. Then, it aggregates calls to `auth::authorizer::authorize` for each role granted to the role (again, from the role-manager) to determine the sum-total permission set. This information is cached for future queries. This structure allows for easier error handling and management (something I hope to improve in the future for both the authorizer and authenticator interfaces), easier system testing, easier implementation of the abstract interfaces, and clearer system boundaries (so the code is easier to grok). Some authorizers, like the "TransitionalAuthorizer", grant permissions to anonymous users. Therefore, we could not unconditionally authorize an empty permission set in `auth::service` for anonymous users. To account for this, the interface of the authorizer has changed to accept an optional name in `authorize`. One additional notable change to the authorizer is the `auth::authorizer::list`: previously, the filtering happened at the CQL query layer and depended on the roles granted to the role in question. I've changed the function to simply query for all roles and I do the filtering in `auth::system` in-memory with the STL. This was necessary to allow the authorizer to be decoupled from role-management. This function is only called for LIST PERMISSIONS (so performance is not a concern), and it significantly reduces demand on the implementation. Finally, we unconditionally create a user in `cql_test_env` since authorization requires its existence.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	0ac7d9922d	auth: Add code to expand a resource family This will be useful for the next change, where it is used for refactoring LIST PERMISSIONS.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	d0ddb354d0	cql: Also add `username` col. for LIST PERMISSIONS the value for the `role` column is equal to the value for the `username` column. This change makes LIST PERMISSIONS backwards compatible with clients that expect the `username` column to exist. This functionality also exists in Apache Cassandra.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	cccfe269cf	cql3: Fix error handling in LIST PERMISSIONS This patch replaces duplicated code for checking the existence of a user with the same mechanism for doing so as elsewhere: by checking for `auth::nonexistent_role` being thrown during the course of checking access-control. This patch also ensures that exceptions thrown while querying the list of permissions on a resource get handled correctly.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	13ba128967	auth: Change error messages to pass dtests The fixed dtests which only failed due to differences in wording and grammar for error messages are: - altering_nonexistent_user_throws_exception_test - cant_create_existing_user_test - dropping_nonexistent_user_throws_exception_test - users_cant_alter_their_superuser_status_test	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	f372bbb4bc	cql3: Handle errors more precisely for roles This patch ensures that all the CQL statements for managing roles correctly catch exceptions in the underlying `role_manager` and re-throw them as top-level exceptions (like "invalid request"). This patch also refines exception handling so that only the applicable errors are explicitly caught. This should allow easier auditing in the future and help to reveal faulty assumptions.	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	ce3be07556	auth: Move resource existence checks Previously, a "data" auth. resource knew how to check it's own existence by accessing a global variable. This patch accomplishes two things: it adds existence checking to all kinds of resources, and moves these checks outside of `auth::resource` itself and into `auth::service` (so that global variables are no longer accessed).	2018-02-14 14:15:59 -05:00
Jesse Haber-Kucharsky	cf5f6aa4c5	auth: Fix fragile variable life-times According to the Seastar convention, a parameter passed to a function taking a reference parameter must live for the duration of the execution of the returned future. When possible, variables are statically allocated. When this is not possible, we use `do_with`.	2018-02-14 14:15:59 -05:00

1 2 3 4 5 ...

14662 Commits