scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-04-28 04:06:59 +00:00

Author	SHA1	Message	Date
Piotr Sarna	fea29fdfd1	alternator: allow creating GSI with 2 base regular columns Creating an underlying materialized view with 2 regular base columns is risky in Scylla, as the second column's liveness will not be correctly taken into account when ensuring view row liveness. Still, if specific conditions are met (the regular base column value is always present in the base row, and no TTLs are involved), the materialized view will behave as expected. Creating a GSI with 2 base regular columns issues a warning, as it should be performed with care. Message-Id: <5ce8642c1576529d43ea05e5c4bab64d122df829.1567159633.git.sarna@scylladb.com>	2019-09-01 11:38:56 +03:00
Nadav Har'El	93f1072cee	alternator: fix default BillingMode It is important that BillingMode should default to PROVISIONED, as it does on DynamoDB. This allows old clients, which don't specify BillingMode at all, to specify ProvisionedThroughput as allowed with PROVISIONED. Also added a test case for this case (where BillingMode is absent). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190829193027.7982-1-nyh@scylladb.com>	2019-09-01 11:25:56 +03:00
Nadav Har'El	edd03312ed	alternator: correct error on missing index or table When querying on a missing index, DynamoDB returns different errors in case the entire table is missing (ResourceNotFoundException) or the table exists and just the index is missing (ValidationException). We didn't make this distinction, and always returned ValidationException, but this confuses clients that expect ResourceNotFoundException - e.g., Amazon's Tic-Tac-Toe demo. This patch adds a test for the first case (the completely missing table) - we already had a test for the second case - and returns the correct error codes. As usual the test passes against DynamoDB as well as Alternator, ensure they behave the same. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190829174113.5558-1-nyh@scylladb.com>	2019-09-01 11:18:59 +03:00
Piotr Sarna	b6662bac35	alternator: remove redundant key checks in UpdateItem Updating key columns is not allowed in UpdateItem requests, but the series introducing GSI support for regular columns also introduced redundant duplicates checks of this kind. This condition is already checked in resolve_update_path helper function and existing test_update_expression_cannot_modify_key test makes sure that the condition is checked. Message-Id: <00f83ab631f93b263003fb09cd7b055bee1565cd.1567086111.git.sarna@scylladb.com>	2019-08-29 19:55:15 +03:00
Piotr Sarna	5fc1e70751	alternator: use from_single_value instead of from_singular in ck The code previously used clustering_key::from_singular() to compute a clustering key value. It works fine, but has two issues: 1. involves one redundant deserialization stage compared to from_single_value 2. does not work with compound clustering keys, which can appear when using indexes	2019-08-28 16:02:20 +02:00
Piotr Sarna	86e80ea320	alternator: add describing GSI in DescribeTable The DescribeTable request now contains the list of index names as well. None of the attributes of the list are marked as 'required' in the documentation, so currently the implementation provides index names only.	2019-08-28 15:38:09 +02:00
Piotr Sarna	6d62ed213a	alternator: allow adding GSI-related regular columns to schema In order to be able to create a Global Secondary Index over a regular column, this column is upgraded from being a map entry to being a full member of the schema. As such, it's possible to use this column definition in the underlying materialized view's key.	2019-08-28 15:29:35 +02:00
Piotr Sarna	05920f7c3b	alternator: add handling regular columns with schema definitions In order to prepare alternator for adding regular columns to schema, i.e. in order to create a materialized view over them, the code is changed so that updating no longer assumes that only keys are included in the table schema.	2019-08-28 15:12:19 +02:00
Piotr Sarna	9390dd41d0	alternator: start fetching all regular columns Since in the future we may want to have more regular columns in alternator tables' schemas, the code is changed accordingly, so all regular columns will be fetched instead of just the attribute map.	2019-08-28 11:06:56 +02:00
Piotr Sarna	a795c290f3	alternator: avoid creating empty collection mutations If no regular column attributes are passed to PutItem, the attr collector serializes an empty collection mutation nonetheless and sends it. It's redundant, so instead, if the attr collector is empty, the collection does not get serialized and sent to replicas.	2019-08-28 11:06:56 +02:00
Nadav Har'El	5e8644413e	alternator: update license blurbs Update all the license blurbs to the one we use in the open-source Scylla project, licensed under the AGPL. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190825160321.10016-1-nyh@scylladb.com>	2019-08-26 14:22:41 +03:00
Piotr Sarna	76c3e4e0d6	alternator: add initial tracing to requests Each request provides basic tracing information about itself. Example output from tracing: cqlsh> select request, parameters from system_traces.sessions where session_id = 39813070-c4ea-11e9-8572-000000000000; request | parameters ------------------+----------------------------------------------------- Alternator Query | {'query': '{"TableName": "alternator_test_15664", "KeyConditions": {"p": {"AttributeValueList": [{"S": "T0FE0QCS0X"}], "ComparisonOperator": "EQ"}}}'} cqlsh> select session_id, activity from system_traces.events where session_id = 39813070-c4ea-11e9-8572-000000000000; session_id | activity --------------------------------------+----------------------------- 39813070-c4ea-11e9-8572-000000000000 | Querying 39813070-c4ea-11e9-8572-000000000000 | Performing a database query	2019-08-26 12:01:29 +02:00
Piotr Sarna	9567691551	alternator: enable query tracing Probabilistic tracing can be enabled via REST API. Alternator will from now on create tracing sessions for its operations as well. Examples: # trace around 0.1% of all requests curl -X POST http://localhost:10000/storage_service/trace_probability?probability=0.001 # trace everything curl -X POST http://localhost:10000/storage_service/trace_probability?probability=1	2019-08-26 11:57:44 +02:00
Piotr Sarna	2220c8c681	alternator: add client state Keeping an instance of client_state is a convenient way of being able to use tracing for alternator. It's also currently used in paging, so adding a client state to executor removes the need of keeping a dummy value.	2019-08-26 11:07:56 +02:00
Nadav Har'El	3044f71438	alternator: refuse CreateTable if uses unsupported features If a user tries to create a table with an unsupported feature - a local secondary index, a user-defined encryption key or supporting streams (CDC), let's refuse the table creation, so the application doesn't continue thinking this feature is available to it. The "Tags" feature is also not supported, but it is more harmless (it is used mostly for accounting purposes) so we do not fail the table creation because of it. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190818125528.9091-1-nyh@scylladb.com>	2019-08-21 17:25:01 +03:00
Nadav Har'El	c9345d8a0e	alternator: automatically choose RF: 1 or 3 In CQL, before a user can create a table, they must create a keyspace to contain this table and, among other things, specify this keyspace's RF. But in the DynamoDB API, there is no "create keyspace" operation - the user just creates a table, and there is no way, and no opportunity, to specify the requested RF. Presumably, Amazon always uses the same RF for all tables, most likely 3, although this is not officially documented anywhere. The existing code creates the keyspace during Scylla boot, with RF=1. This RF=1 always works, and is a good choice for a one-node test run, but was a really bad choice for a real cluster with multiple nodes, so this patch fixes this choice: With this patch, the keyspace creation is delayed - it doesn't happen when the first node of the cluster boots, but only when the user creates the first table. Presumably, at that time, the cluster is already up, so at that point we can make the obvious choice automatically: a one-node cluster will get RF=1, a >=3 node cluster will get RF=3. The choice of RF is logged - and the choice of RF=1 is considered a warning. Note that with this patch, keyspace creation is still automatic as it was before. The user may manually create the keyspace via CQL, to override this automatic choice. In the future we may also add additional keyspace configuration options via configuration flags or new REST requests, and the keyspace management code will also likely change as we start to support clusters with multiple regions and global tables. But for now, I think the automatic method is easiest for users who want to test-drive Alternator without reading lengthy instructions on how to set up the keyspace. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190820180610.5341-1-nyh@scylladb.com>	2019-08-20 21:24:01 +03:00
Piotr Sarna	7d68d5030d	alternator: replace is_byte_order_compatible in BEGINS WITH Checking if the type is byte-order compatible is more than enough for BEGINS WITH operator - actually, we just need to check if the type is compatible with a string. Message-Id: <27a867cc1fa907ff87e011914e4acbb4f7db0181.1566225556.git.sarna@scylladb.com>	2019-08-19 17:43:12 +03:00
Nadav Har'El	c49e009e3e	alternator: use empty_service_permit() In the new code, write and read queries take a "service permit" which they hold for the duration of the query, to help limit the load on the machine. Alternator doesn't yet participate in this feature, so for now let's just use empty_service_permit() meaning the queries don't hold on to any permit. We can fix this later to use real permits. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-08-19 17:12:08 +03:00
Nadav Har'El	eebb2f0a0f	alternator: add to CreateTable verification of BillingMode setting We allow BillingMode to be set to either PAY_PER_REQUEST (the default) or PROVISIONED, although neither mode is fully implemented: In the former case the payment isn't accounted, and in the latter case the throughput limits are not enforced. But other settings for BillingMode are now refused, and we add a new test to verify that. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190818122919.8431-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	656f62722b	alternator: enable timeouts on requests Currently Alternator starts all Scylla requests (including both reads and writes) without any timeout set. Because of bugs and/or network problems, requests can theoretically hang and waste Scylla resources for hours, long after the client has given up on them and closed their connection. The DynamoDB protocol doesn't let a user specify which timeout to use, so we should just use something "reasonable", in this patch 10 seconds. Remember that all DynamoDB read and write requests are small (even scans just scan a small piece), so 10 seconds should be above and beyond anything we actually expect to see in practice. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190812105132.18651-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	aaf559c4f9	alternator: fix indentation It turns out that recent rjson patches introduced some buggy tabs instead of spaces due to bad IDE configuration. The indentation is restored to spaces.	2019-08-19 15:49:52 +03:00
Piotr Sarna	b914ba11fa	alternator: add validation to QueryFilter QueryFilter, according to docs, can only contain non-key attributes.	2019-08-19 15:49:52 +03:00
Piotr Sarna	1b2b2c7009	alternator: add computing key bounds from filtering Alternator allows passing hash and sort key restrictions as filters - it is, however, better to incorporate these restrictions directly into partition and clustering ranges, if possible. It's also necessary, as optimizations inside restrictions_filter assume that it will not be fed unneeded rows - e.g. if filtering is not needed on partition key restrictions, they will not be checked.	2019-08-19 15:49:52 +03:00
Piotr Sarna	338b7e9e67	alternator: add bumping filtering stats When filtering is used in querying or scanning, the number of total filtered rows is added to stats.	2019-08-19 15:49:52 +03:00
Piotr Sarna	154d1649c6	alternator: add cql_stats to alternator stats Some underlying operations (e.g. paging) make use of cql_stats structure from CQL3. As such, cql_stats structure is added to alternator stats in order to gather and use these statistics.	2019-08-19 15:49:52 +03:00
Nadav Har'El	e0b01a0233	alternator: initial support for GSI This patch adds partial support for GSI (Global Secondary Index) in Alternator, implemented using a materialized view in Scylla. This initial version only supports the specific cases of the index indexing a column which was already part of the base table's key - e.g., indexing what used to be a sort key (clustering key) in the base table. Indexing of non-key attributes (which today live in a map) is not yet supported in this version. Creation of a table with GSIs is supported, and so is deleting the table. UpdateTable which adds a GSI to an existing table is not yet supported. Query and Scan operations on the index are supported. DescribeTable does not yet list the GSIs as it should. Seven previously-failing tests now pass, so their "xfail" tag is removed. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190808090256.12374-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	b3bf2fab2e	alternator: add stats for read-before-write A simple metric counting how many read-before-writes were executed is added. Message-Id: <d8cc1e9d77e832bbdeff8202a9f792ceb4f1e274.1565274797.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	6b145b59d3	alternator: remove missing key FIXME The case for missing key in update_item was already properly fixed along with migrating from libjsoncpp to rapidjson, but one FIXME remained in the code by mistake. Message-Id: <94b3cf53652aa932a661153c27aa2cb1207268c7.1565271432.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	76bc30a82d	alternator: remove decimal_type FIXME Decimal precision problems were already solved by commit d5a1854d93c9448b1d22c2d02eb1c46a286c5404, but one FIXME remained in the code by mistake. Message-Id: <381619e26f8362a8681b83e6920052919acf1142.1565271198.git.sarna@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	cd2c581c7c	alternator: remove a pointer-based workaround for future<json> With libjsoncpp we were forced to work around the problem of non-noexcept constructors by using an intermediate unique pointer. Objects provided by rapidjson have correct noexcept specifiers, so the workaround can be dropped.	2019-08-19 15:49:52 +03:00
Piotr Sarna	e19a7f908e	alternator: migrate to rapidjson library Profiling alternator implied that JSON parsing takes up a fair amount of CPU, and as such should be optimized. libjsoncpp is a standard library for handling JSON objects, but it also proves slower than rapidjson, which is hereby used instead. The results indicated that libjsoncpp used roughly 30% of CPU for a single-shard alternator instance under stress, while rapidjson dropped that usage to 18% without optimizations. Future optimizations should include eliding object copying, string copying and perhaps experimenting with different JSON allocators.	2019-08-19 15:49:52 +03:00
Nadav Har'El	d6a8626e90	alternator: correct catch table-already-exists exception Our CreateTable handler assumed that the function migration_manager::announce_new_column_family() returns a failed future if the table already exists. But in some of our code branches, this is not the case - the function itself throws instead of returning a failed future. The solution is to use seastar::futurize_apply() to handle both possibilities (direct exception or future holding an exception). This fixes a failure of the test_table.py::test_create_table_already_exists test case. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	105533c046	alternator: fix sharing of a seastar::shared_ptr between threads The function attrs_type() returns a supposed singleton, but because it is a seastar::shared_ptr we can't use the same one for multiple threads, and need to use a separate one per thread. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190804163933.13772-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Nadav Har'El	8d8baccdc4	alternator: ListTables should not list materialized views Our ListTables implementation uses get_column_families(), which lists both base tables and materialized views. We will use materialized views to implement DynamoDB's secondary indexes, and those should not be listed in the results of ListTables. The patch also includes a test for this. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190717133103.26321-2-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	99fd032b1f	alternator: make set_sum exception more user-friendly As in case of set_diff, an exception message in set_sum should include the user-provided request (ADD) rather than our internal helper function set_sum.	2019-08-19 15:49:52 +03:00
Piotr Sarna	d7f75b405b	alternator: implement set DELETE UpdateExpression's DELETE operation for set is implemented on top of set_diff helper function.	2019-08-19 15:49:52 +03:00
Piotr Sarna	1d19934bc6	alternator: add set difference helper function A function for computing the set difference of two sets represented as JSON is added.	2019-08-19 15:49:52 +03:00
Nadav Har'El	493890c6f6	alternator: fail attempt to create table with GSI Although we do not support GSI yet, until now we silently ignored CreateTable's GSI parameter, and the user wouldn't know the table wasn't created as intended. In this patch, GSI is still unsupported, but now CreateTable will fail with an error message that GSI is not supported. We need to change some of the tests which test the error path, and expect an error - but should not consider a table creation error as the expected error. After this patch, test_gsi.py still fails all the tests on Alternator, but much more quickly :-) Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190711161420.18547-1-nyh@scylladb.com>	2019-08-19 15:49:52 +03:00
Piotr Sarna	7ce0a30766	alternator: add ADD operation UpdateExpression is now able to perform ADD operation on both numbers and sets.	2019-08-19 15:49:52 +03:00
Piotr Sarna	d141c3b5bd	alternator: add helper function for adding sets A helper function that allows creating a set sum out of two sets represented in JSON is added.	2019-08-19 15:49:52 +03:00
Piotr Sarna	a6ca5e19e4	alternator: add unwrap_set It will be needed later to implement adding sets.	2019-08-19 15:49:51 +03:00
Piotr Sarna	482ac08a45	alternator: add get_item_type_string helper function It will be useful later for ensuring that parameters for various functions have matching types.	2019-08-19 15:49:51 +03:00
Nadav Har'El	7efa1aa48a	alternator: fix Query verification of appropriate key columns The Query operation's conditions can be used to search for a particular hash key or both hash and sort keys - but not any other combinations. We previously forgot to verify most errors, so in this patch we add missing verifications - and tests to confirm we fail the query when DynamoDB does. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190711132720.17248-1-nyh@scylladb.com>	2019-08-19 15:49:51 +03:00
Piotr Sarna	87626beaae	alternator: implement if_not_exists The if_not_exists function is implemented on the basis of recently added read-before write mechanism.	2019-08-19 15:49:51 +03:00
Piotr Sarna	b5bbdc18e4	alternator: rename holds_path to a more generic name The holds_path() utility function is actually used to check if a value needs read before write, so its name is changed to more fitting check_needs_read_before_write.	2019-08-19 15:49:51 +03:00
Nadav Har'El	34833707be	alternator: fix bug in collection mutations Alternator currently keeps an item's attributes inside a map, and we had a serious bug in the way we build mutations for this map: We didn't know there was a requirement to build this mutation sorted by the attribute's name. When we neglect to do this sorting, this confuses Scylla's merging algorithms, which assume collection cells are thus sorted, and the result can be duplicate cells in a collection, and the visible effect is a mutation that seems to be ignored - because both old and new values exist in the collection. So this patch includes a new helper class, "attribute_collector", which helps collect attribute updates (put and del) and extract them in correctly sorted order. This helper class also eliminates some duplication of arcane code to create collection cells or deletions of collection cells. This patch includes a simple test that previously failed, and one xfail test that failed just because of this bug (this was the test that exposed this bug). Both tests now succeed. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20190709160858.6316-1-nyh@scylladb.com>	2019-08-19 15:49:51 +03:00
Piotr Sarna	3212167ac4	alternator: fix indentation	2019-08-19 15:49:51 +03:00
Piotr Sarna	33300fd30c	alternator: add unsafe read-before-write to update_item In order to serve update requests that depend on read-before-write, a proper helper function which fetches the existing item with a given key from the database is added. This read-before-write mechanism is not considered safe, because it provides no linearizability guarantees and offers no synchronization protection. As such, it should be considered a placeholder that works fine on a single machine and/or with no concurrent access to the same key.	2019-08-19 15:49:51 +03:00
Piotr Sarna	0372ce0649	alternator: add context parameters to calculate_value The calculate_value utility function is going to need more context in order to resolve paths present in the right-hand side of update_item operators: update_info and schema.	2019-08-19 15:49:51 +03:00
Piotr Sarna	43049bbec0	alternator: add allowing key columns when resolving path Historically, resolving a path checked for key columns, which are not allowed to be on the left-hand side of the assignment. However, path resolving will now also be used for right-hand side, where it should be allowed to use the key value.	2019-08-19 15:49:51 +03:00
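
Several commits above (a6ca5e19e4, d141c3b5bd, 1d19934bc6, 7ce0a30766, d7f75b405b) describe set helpers behind UpdateExpression's ADD and DELETE. The following is only a rough Python sketch of that idea, not the Alternator code, which is C++ on top of rapidjson. The names unwrap_set, set_sum and set_diff come from the commit messages, but the signatures shown here are hypothetical, and returning None for an empty result is an assumption based on DynamoDB not allowing empty sets.

```python
SET_TYPES = ("SS", "NS", "BS")  # DynamoDB string, number and binary set tags


def unwrap_set(value):
    """Split a typed set such as {"SS": ["a", "b"]} into (tag, elements)."""
    if not isinstance(value, dict) or len(value) != 1:
        raise ValueError("expected a single-key typed value")
    (tag, elements), = value.items()
    if tag not in SET_TYPES:
        raise ValueError(f"not a set type: {tag}")
    return tag, elements


def set_sum(a, b):
    """Union of two sets of the same type, as used by ADD (hypothetical signature)."""
    tag_a, elems_a = unwrap_set(a)
    tag_b, elems_b = unwrap_set(b)
    if tag_a != tag_b:
        # Report the user-facing operation name rather than the helper name.
        raise ValueError("ADD: set types do not match")
    merged = list(elems_a)
    for element in elems_b:
        if element not in merged:
            merged.append(element)
    return {tag_a: merged}


def set_diff(a, b):
    """Elements of a that are not in b, as used by DELETE (hypothetical signature)."""
    tag_a, elems_a = unwrap_set(a)
    tag_b, elems_b = unwrap_set(b)
    if tag_a != tag_b:
        raise ValueError("DELETE: set types do not match")
    remaining = [element for element in elems_a if element not in elems_b]
    # Assumption: an empty result means the attribute should be removed.
    return {tag_a: remaining} if remaining else None
```

For example, set_sum({"SS": ["a", "b"]}, {"SS": ["b", "c"]}) yields {"SS": ["a", "b", "c"]}, and mismatched set types raise an error that names the user-facing operation, in the spirit of commit 99fd032b1f.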

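Commit c9345d8a0e describes choosing the keyspace replication factor automatically when the first table is created. The rule can be sketched in Python as below; this is an illustration under assumptions, not the actual implementation. The function name choose_replication_factor and the logger name are hypothetical, and the commit message only specifies the one-node and >=3-node cases, so treating a two-node cluster like a one-node cluster is an assumption.

```python
import logging

logger = logging.getLogger("alternator_sketch")  # hypothetical logger name


def choose_replication_factor(live_nodes: int) -> int:
    """Pick RF for the automatically created keyspace (hypothetical helper)."""
    if live_nodes < 1:
        raise ValueError("cluster must have at least one node")
    if live_nodes >= 3:
        # The choice of RF is logged, per the commit message.
        logger.info("Creating keyspace with RF=3 (%d nodes)", live_nodes)
        return 3
    # Assumption: anything below 3 nodes falls back to RF=1.
    # RF=1 is reported as a warning, per the commit message.
    logger.warning("Creating keyspace with RF=1 (%d nodes)", live_nodes)
    return 1
```

As the commit message notes, a user who wants a different RF can create the keyspace manually via CQL before creating the first table.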

123 Commits