scylladb

Author	SHA1	Message	Date
Calle Wilund	f5c79d15a8	alternator: Include stream arn in table description if enabled Fixes #7157 When creating/altering/describing a table, if streams are enabled, the "latest active" stream arn should be included as LatestStreamArn. Not doing so breaks java kinesis.	2020-09-07 08:16:11 +00:00
Calle Wilund	a978e043c3	alternator::streams: Do not allow enabling streams when CDC is off Fixes #6866 If we try to create/alter an Alternator table to include streams, we must check that the cluster does in fact support CDC (experimental still). If not, throw a hopefully somewhat descriptive error. (Normal CQL table create goes through a similar check in cql_prop_defs) Note: no other operations are prohibited. The cluster could have had CDC enabled before, so streams could exist to list and even read. Any tables loaded from schema tables should be responsible for their own validation.	2020-08-03 21:01:31 +03:00
Calle Wilund	05851578d4	alternator::streams: Report streams as not ready until CDC stream id:s are available Refs #6864 When booting a clean scylla, CDC stream ID:s will not be available until a nring delay time period has passed. Before this, writing to a CDC enabled table will fail hard. For alternator (and its tests), we can report the stream(s) for tables as not yet available (ENABLING) until such time as id:s are computed. v2: Keep storage service ref in executor	2020-08-03 20:34:15 +03:00
Calle Wilund	cbb70f4af4	executor: "UpdateTable" support for streams Partial implementation of the "UpdateTable" command. Supports only enabling/disabling streams.	2020-07-15 08:21:34 +00:00
Calle Wilund	45ee73969d	executor: Allow streams specification in CreateTable schema	2020-07-15 08:21:34 +00:00
Calle Wilund	bbc544748f	alternator: Implement GetRecords Simplistic variant, using 1:1 mapping of scylla stream id <-> shard	2020-07-15 08:21:34 +00:00
Calle Wilund	3756febbf5	alternator: expose describe_single_item and default_timeout To be able to describe single alternator items from other files. And query with the default timeout.	2020-07-15 08:10:23 +00:00
Calle Wilund	c45781de1e	alternator: Implement GetShardIterator	2020-07-15 08:10:23 +00:00
Calle Wilund	8084b5a9b7	alternator: Implement DescribeStream	2020-07-15 08:10:23 +00:00
Calle Wilund	8fb9b32bd3	alternator: Implement ListStreams command	2020-07-15 08:10:23 +00:00
Calle Wilund	0708a9971a	executor: Add system_distributed_keyspace as parameter/member Streams implementation will require querying system tables etc to do its work, thus will need access to this object.	2020-07-15 08:10:23 +00:00
Calle Wilund	e382d79bcd	executor: Make some helper and subroutines class-visible Subroutines needed by (in this case) streams implementation moved from being file-static to class-static (exported). To make putting handler routines in separate sources possible. Because executor.cc is large and slow to compile. Separation is nice. Unfortunately, not all methods can be kept class-private, since unrelated types also use them. Reviewer suggested instead placing them in a top-level header for export, i.e. not class-private at all. I am skipping that for now, mainly because I can't come up with a good file name. Can be part of a general refactor of helper routine organization in executor.	2020-07-15 08:10:23 +00:00
Piotr Sarna	4de23d256e	alternator,utils: move rjson.hh to utils/ rjson is going to replace libjsoncpp, so it's moved from alternator to the common utils/ directory.	2020-07-03 08:30:01 +02:00
Piotr Sarna	53bbef1e6c	alternator: add a way of accessing system tables from alternator Scylla's system tables often provide interesting information for clients. In order to be able to access this information without CQL, a notion of virtual tables is introduced to alternator. When a table named .scylla.alternator.KS_NAME.TABLE_NAME is accessed with read-only operation - Query or Scan, Scylla's internal KS_NAME.TABLE_NAME table will be queried instead. For instance, if a user wants to read about system_auth.roles, the Scan request should target the following table: ".scylla.alternator.system_auth.roles". Fixes #6122	2020-04-09 09:41:30 +02:00
Piotr Sarna	781fbe8070	alternator: add service permit to callbacks As a first step towards introducing admission control, the API of alternator callbacks is extended with an additional 'permit' parameter.	2020-03-16 07:44:25 +01:00
Nadav Har'El	96ca5ac2c8	alternator: use separate smp_service_group for bouncing requests Until this patch, we used the default_smp_service_group() when bouncing Alternator requests between shards (which is needed for LWT). This patch creates a new smp_service_group for this purpose, which is limited to 5000 concurrent requests (the same limit used for CQL's bounce_request_smp_service_group). The purpose of this limit is to avoid many shards admitting a huge number of requests and bouncing all of them to the same shard, which now can't "unadmit" these requests. Fixes #5664. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200304170825.27226-1-nyh@scylladb.com>	2020-03-05 10:17:51 +01:00
Piotr Sarna	2402955d45	alternator: move parsing in front of executor Parsing a request string into JSON happens as a first thing in every request, so it can be performed before calling any executor callbacks. The most important thing however, is that making parsing a separate stage allows certain optimizations, e.g. running all parsing in a single seastar thread, which allows adding yields to rjson parsing later.	2020-02-28 07:57:12 +02:00
Piotr Sarna	14dfa3c0c3	alternator: change keyspace prefix to alternator_ The original idea of prefixing alternator keyspace names with 'a#' leveraged the fact that '#' is not a legal CQL character for keyspace names. The idea is flawed though, since '#' proved to confuse existing Scylla tools (e.g. nodetool). Thus, the prefix is changed to more orthodox 'alternator_'. It is possible to create such keyspaces with CQL as well, but then the alternator CreateTable request would simply fail, because the keyspace already exists, which is graceful enough. Hiding alternator keyspaces and tables from CQL is another issue, but there are other ways to distinguish them than a non-standard prefix, e.g. tags. Fixes #5883	2020-02-23 23:32:29 +02:00
Piotr Sarna	3eb6da224b	alternator: switch to keyspace-per-table approach Instead of a monolith alternator keyspace, each table creates its own keyspace, named in the following pattern: `a#TABLE_NAME`. The `a#` prefix contains an illegal CQL character in order to ensure that these keyspaces are never created via CQL.	2020-02-13 09:46:19 +01:00
Piotr Sarna	dcf54331ea	alternator: allow custom names for keyspaces The maybe_create_keyspace utility now accepts a parameter - the desired name for a newly created keyspace.	2020-02-13 09:16:37 +01:00
Gleb Natapov	38fcab3db4	alternator: pass tracing state explicitly instead of relying on it being in the client_state Multiple requests can use the same client_state simultaneously, so it is not safe to use it as a container for a tracing state which is per request. This is not yet an issue for the alternator since it creates a new client_state object for each request, but first of all it should not, and second, the trace state will be dropped from the client_state by a later patch.	2020-02-10 14:50:55 +02:00
Piotr Sarna	4c9f2f3c0a	alternator: implement tagging The following requests are implemented: - TagResource - UntagResource - ListTagsOfResource Also, more tests are added for validating inputs, for both arns, tag values and tag keys. Message-Id: <a7ce9534ca580736fea445813fafef75a6139e29.1579618972.git.sarna@scylladb.com>	2020-01-29 10:20:05 +01:00
Piotr Sarna	a6a65abc3c	alternator: change request return type to variant<value, error> In order to minimize the use of exceptions during normal operations, each request handler is now able to return either a proper JSON value, or an instance of api_error, which indicates that something went wrong, but without having to throw, catch and rethrow C++ exceptions. This is especially important for conditional updates, since it's expected to be common to return ConditionalCheckFailedException. Message-Id: <d8996a0a270eb0d9db8fdcfb7046930b96781e69.1579515640.git.sarna@scylladb.com>	2020-01-28 12:39:23 +02:00
Nadav Har'El	7dfd081e0d	alternator: make "executor" a peering_sharded_service Alternator uses a sharded<executor> for handling execution of Alternator requests on different shards. In this patch we make executor a subclass of peering_sharded_service, to allow one of these executors to run an executor method on a different shard: Any one of the shard-local executor instances can call container() to get the full sharded<executor>. We will need this capability later, when we need to bounce requests between shards because of requirements of the storage_proxy::cas (LWT) code. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-01-23 13:57:23 +02:00
Nadav Har'El	7b8917b5cb	alternator: rename reserved column name "attrs" We currently reserve the column name "attrs" for a map of attributes, so the user is not allowed to use this name as a name of a key. We plan to lift this reservation in a future patch, but until we do, let's at least choose a more obscure name to forbid - in this patch ":attrs". It is even less likely that a user will want to use this specific name as a column name. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190903133508.2033-1-nyh@scylladb.com>	2019-09-11 18:01:05 +03:00
Nadav Har'El	c9eb9d9c76	alternator: update license blurbs Update all the license blurbs to the one we use in the open-source Scylla project, licensed under the AGPL. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190825160321.10016-1-nyh@scylladb.com>	2019-09-11 18:01:05 +03:00
Piotr Sarna	cb791abb9d	alternator: enable query tracing Probabilistic tracing can be enabled via REST API. Alternator will from now on create tracing sessions for its operations as well. Examples: # trace around 0.1% of all requests curl -X POST http://localhost:10000/storage_service/trace_probability?probability=0.001 # trace everything curl -X POST http://localhost:10000/storage_service/trace_probability?probability=1	2019-09-11 18:01:05 +03:00
Piotr Sarna	6c8c31bfc9	alternator: add client state Keeping an instance of client_state is a convenient way of being able to use tracing for alternator. It's also currently used in paging, so adding a client state to executor removes the need of keeping a dummy value.	2019-09-11 18:01:05 +03:00
Nadav Har'El	2f53423a2f	alternator: automatically choose RF: 1 or 3 In CQL, before a user can create a table, they must create a keyspace to contain this table and, among other things, specify this keyspace's RF. But in the DynamoDB API, there is no "create keyspace" operation - the user just creates a table, and there is no way, and no opportunity, to specify the requested RF. Presumably, Amazon always uses the same RF for all tables, most likely 3, although this is not officially documented anywhere. The existing code creates the keyspace during Scylla boot, with RF=1. This RF=1 always works, and is a good choice for a one-node test run, but was a really bad choice for a real cluster with multiple nodes, so this patch fixes this choice: With this patch, the keyspace creation is delayed - it doesn't happen when the first node of the cluster boots, but only when the user creates the first table. Presumably, at that time, the cluster is already up, so at that point we can make the obvious choice automatically: a one-node cluster will get RF=1, a >=3 node cluster will get RF=3. The choice of RF is logged - and the choice of RF=1 is considered a warning. Note that with this patch, keyspace creation is still automatic as it was before. The user may manually create the keyspace via CQL, to override this automatic choice. In the future we may also add additional keyspace configuration options via configuration flags or new REST requests, and the keyspace management code will also likely change as we start to support clusters with multiple regions and global tables. But for now, I think the automatic method is easiest for users who want to test-drive Alternator without reading lengthy instructions on how to set up the keyspace. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190820180610.5341-1-nyh@scylladb.com>	2019-09-11 18:01:05 +03:00
Nadav Har'El	4d07e2b7c5	alternator: support BatchGetItem This patch adds to Alternator an implementation of the BatchGetItem operation, which allows starting a number of GetItem requests in parallel in a single request. The implementation is almost complete - the only missing feature is the ability to ask only for non-top-level attributes in ProjectionExpression. Everything else should work, and this patch also includes tests which, as usual, pass on DynamoDB and now also on Alternator. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 15:33:50 +03:00
Nadav Har'El	83b91d4b49	alternator: add DeleteItem Add support for the DeleteItem operation, which deletes an item. The basic deletion operation is supported. Still not supported are: 1. Parameters to conditionally delete (ConditionExpression or Expected) 2. Parameters to return pre-delete content 3. ReturnItemCollectionMetrics (statistics relevant for tables with LSI) Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 15:19:46 +03:00
Nadav Har'El	eb81b31132	alternator: add statistics This patch adds a statistics framework to Alternator: Executor has (for each shard) a _stats object which contains counters for various events, and also is in charge of making these counters visible via Scylla's regular metrics API (http://localhost:9180/metrics). This patch includes a counter for each of DynamoDB's operation types, and we increase the ones we support when handled. We also added counters for total operations and unsupported operations (operation types we don't yet handle). In the future we can easily add many more counters: Define the counter in stats.hh, export it in stats.cc, and increment it where relevant in executor.cc (or server.cc). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 14:36:26 +03:00
Piotr Sarna	b309c9d54b	alternator: implement basic Query The implementation covers the following restrictions - equality for hash key; - equality, <, <=, >, >=, between, begins_with for sort key. Message-Id: <021989f6d0803674cbd727f9b8b3815433ceeea5.1558356119.git.sarna@scylladb.com>	2019-09-11 14:36:16 +03:00
Piotr Sarna	c0ecd1a334	alternator: add basic BatchWriteItem The initial implementation only supports PutRequest requests, without serving DeleteRequest properly. Message-Id: <451bcbed61f7eb2307ff5722de33c2e883563643.1557914382.git.sarna@scylladb.com>	2019-09-11 14:29:50 +03:00
Nadav Har'El	9a0c13913d	alternator: improve where DescribeEndpoints gets its information Instead of blindly returning "localhost:8000" in response to DescribeEndpoints and for sure causing us problems in the future, the right thing to do is to return the same domain name which the user originally used to get to us, be it "localhost:8000" or "some.domain.name:1234". But how can we know what this domain name was? Easy - this is why HTTP 1.1 added a mandatory "Host:" header, and the DynamoDB driver I tested (boto3) adds it as expected, indeed with the expected value of "localhost:8000" on my local setup. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 14:25:22 +03:00
Nadav Har'El	29e0f68ee0	alternator: add initial implementation of DescribeEndpoints DescribeEndpoints is not a very important API (and by default, clients don't use it) but I wanted to understand how DynamoDB responds to it, and what better way than to write a test :-) And then, if we already have a test, let's implement this request in Scylla as well. This is a silly implementation, which always returns "localhost:8000". In the future, this will need to be configurable - we're not supposed here to return this server's IP address, but rather a domain name which can be used to get to all servers. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 14:22:47 +03:00
Piotr Sarna	4def674731	alternator: implement basic scan The most basic version of Scan request is implemented. It still contains a list of TODOs, among which the support for Segments parameter for scan parallelism. Message-Id: <5d1bfc086dbbe64b3674b0053e58a0439e64909b.1557757402.git.sarna@scylladb.com>	2019-09-11 14:21:39 +03:00
Piotr Sarna	b6dde25bcc	alternator: implement ListTables ListTables is used to extract all table names created so far. Message-Id: <04f4d804a40ff08a38125f36351e56d7426d2e3d.1557402320.git.sarna@scylladb.com>	2019-09-11 14:10:54 +03:00
Nadav Har'El	3ae0066aae	alternator: add support for UpdateItem's DELETE operation So far we supported UpdateItem only with PUT operations - this patch adds support for DELETE operations, to delete specific attributes from an item. Only the case of a missing value is supported. DynamoDB also provides the ability to pass the old value, and only perform the deletion if the value and/or its type is still up-to-date - but we don't support this yet and fail such a request if it is attempted. This patch also includes a test for this case in alternator-test/ Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 14:08:57 +03:00
Nadav Har'El	0c2a440f7f	alternator: add initial UpdateItem implementation Add an initial UpdateItem implementation. As PutItem and GetItem we are still limited to string attributes. This initial implementation of UpdateItem implements only the "PUT" action (not "DELETE" and certainly not "ADD") and not any of the more advanced options. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 14:03:00 +03:00
Piotr Sarna	6ad9b10317	alternator: make constant names more explicit KEYSPACE and ATTRS constants refer to their names, not objects, so they're named more explicitly. Message-Id: <14b1f00d625e041985efbc4cbde192bd447cbf03.1557223199.git.sarna@scylladb.com>	2019-09-11 13:07:14 +03:00
Piotr Sarna	6e8db5ac6a	alternator: inline keywords It was decided that all alternator-specific keywords can be inlined in code instead of defining them as constants. Message-Id: <6dffb9527cfab2a28b8b95ac0ad614c18027f679.1557223199.git.sarna@scylladb.com>	2019-09-11 13:04:38 +03:00
Nadav Har'El	8dec31d23b	alternator: add initial implementation of DeleteTable Add an initial implementation of DeleteTable, enough to make pytest --local test_table.py::test_create_and_delete_table pass. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 12:45:42 +03:00
Nadav Har'El	f66ec337f7	alternator: very initial implementation of DescribeTable This initial implementation is enough to pass a test of getting a failure for a non-existent table - test_table.py::test_describe_table_non_existent_table and to recognize an existing table. But it's still missing a lot of fields for an existing table (among others, the schema). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-09-11 12:41:32 +03:00
Piotr Sarna	2ec78164bc	alternator: add minimal HTTP interface The interface works on port 8000 by default and provides the most basic alternator operations - it's an incomplete set without validation, meant to allow testing as early as possible.	2019-09-11 12:34:18 +03:00
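
Commit 53bbef1e6c above describes virtual tables: a read-only Query or Scan against `.scylla.alternator.KS_NAME.TABLE_NAME` is redirected to Scylla's internal `KS_NAME.TABLE_NAME`. The name mapping might be sketched roughly as follows. This is an illustration only, not the code from that commit: the helper name `resolve_virtual_table` is hypothetical, and the sketch assumes the keyspace name contains no `.` character.

```python
from typing import Optional, Tuple

# Prefix as quoted in the commit message for 53bbef1e6c.
VIRTUAL_PREFIX = ".scylla.alternator."


def resolve_virtual_table(table_name: str) -> Optional[Tuple[str, str]]:
    """Map '.scylla.alternator.KS_NAME.TABLE_NAME' to (KS_NAME, TABLE_NAME).

    Hypothetical helper for illustration. Returns None for ordinary table
    names and for names that lack a keyspace or table part.
    Assumption: the keyspace name itself contains no '.'.
    """
    if not table_name.startswith(VIRTUAL_PREFIX):
        return None
    rest = table_name[len(VIRTUAL_PREFIX):]
    # Split on the first '.', so the keyspace is everything before it.
    keyspace, sep, table = rest.partition(".")
    if not sep or not keyspace or not table:
        return None
    return keyspace, table
```

With the example from the commit message, `resolve_virtual_table(".scylla.alternator.system_auth.roles")` yields the keyspace `system_auth` and the table `roles`, while an ordinary name such as `mytable` yields `None`.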

45 Commits