scylladb

Author	SHA1	Message	Date
Avi Kivity	f3eade2f62	treewide: relicense to ScyllaDB-Source-Available-1.0 Drop the AGPL license in favor of a source-available license. See the blog post [1] for details. [1] https://www.scylladb.com/2024/12/18/why-were-moving-to-a-source-available-license/	2024-12-18 17:45:13 +02:00
Emil Maskovsky	8191e57036	treewide: fix annotations reported by GH checks Clean up the unnecessary includes reported by the GitHub checks that are polluting the PR diffs. The "utils/assert.hh" report should be actually fixed by the #21739, but as the usage of `SEASTAR_ASSERT()` is protected by the `SEASTAR_DEBUG` check it makes sense to include the header conditionally as well. Closes scylladb/scylladb#21817	2024-12-09 13:44:12 +03:00
Amnon Heiman	b888ed84f7	alternator/executor: use uint in describe_item Actions in rmw_operation can use describe_item to get an existing value (read-before-write scenario). In those cases the existing item size can be bigger than the one we are storing (in the extreme case, when deleting an object, we only have its keys). This modifies the describe_item API so it takes a pointer to uint instead of the consumed_capacity_counter, so we can use it to get the old value size and, depending on that, determine the size that will be used for the WCU calculation.	2024-12-03 15:55:41 +02:00
Amnon Heiman	b8f7b2eb52	alternator/executor: Add consume capacity for get_item This patch adds functionality to track Read Capacity Units (RCU) for the get_item operation. This enhancement allows for standardized measurement of read operations, aligning with DynamoDB-like metrics. Additionally, the RCU value can now be included in the response to provide immediate feedback on the read capacity usage. Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-11-19 18:43:28 +02:00
Amnon Heiman	b0e699e7ec	alternator/executor.hh: white-space cleanup Signed-off-by: Amnon Heiman <amnon@scylladb.com>	2024-11-19 18:43:28 +02:00
Avi Kivity	93afc77307	raft_group0_client: uninclude "mutation/mutation.hh" Lighten the dependency load. Some constructors and destructors are uninlined to avoid the header depending on the mutation class.	2024-09-28 16:31:53 +03:00
Nadav Har'El	00793059e1	alternator: fix alternator_enforce_authorization=false When the configuration has alternator_enforce_authorization=false, Alternator should not do authentication (check which user signed each request) nor authorization (check if that user has permissions to do each operation). Our implementation forgot to disable the authorization checks when it's configured to false. The (incorrect) assumption was that when alternator_enforce_authorization is configured to false, the CQL 'authenticator' and 'authorizer' configuration is also disabled - so the authorization checks will be no-ops. But we can't assume that: Users are free to configure 'authenticator' and 'authorizer' for use in CQL, and then set alternator_enforce_authorization=false just for Alternator. So this patch adds a new test for this case - when we have authenticator=PasswordAuthenticator, authorizer=CassandraAuthorizer but alternator_enforce_authorization=false, and fixes it to work correctly. The heart of the fix is trivial: the `verify_*_permission()` functions just need to check the alternator_enforce_authorization and return immediately when false. The bigger part of this change is to get the alternator_enforce_authorization into the "executor" object and then to pass it into the verify calls. Although alternator_enforce_authorization is not YET live updatable, this code is prepared for the future that it may become live updatable, so the executor object saves not the boolean value of this flag, but a live-updatable reference to it. Fixes #20619 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-09-17 15:50:00 +03:00
Nadav Har'El	dd030f8112	alternator: improve RBAC access denied error messages This patch addresses two requests made by reviewers of the original "Add CQL-based RBAC support to Alternator" series. Both requests were about the error messages produced when access is denied: 1. The error message is improved to use more proper English, and also to include the name of the role which was denied access. 2. The permission-check and error-message-formatting code is de-duplicated, using a common function verify_permission(). This de-duplication required moving the access-denied error path to throwing an exception instead of the previous exception-free implementation. However, it can be argued that this change is actually a good thing, because it makes the successful case, when access is allowed, faster. The de-duplicated code is shorter and simpler, and allowed changing the text of the error message in just one place. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#20326	2024-09-03 14:39:30 +03:00
Kefu Chai	a2f54ded80	alternator: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-06-07 07:45:00 +08:00
Nadav Har'El	d2e089777b	Merge 'Yield while building large results in Alternator - rjson::print, executor::batch_get_item' from Marcin Maliszkiewicz Adds preemption points used in Alternator when: - sending bigger json response - building results for BatchGetItem I've tested manually by inserting in preemptible sections (e.g. before `os.write`) code similar to: auto start = std::chrono::steady_clock::now(); do { } while ((std::chrono::steady_clock::now() - start) < 100ms); and seeing reactor stall times. After the patch they were not increasing while before they kept building up due to no preemption. Refs #7926 Fixes #13689 Closes #12351 * github.com:scylladb/scylladb: alternator: remove redundant flush call in make_streamed utils: yield when streaming json in print() alternator: yield during BatchGetItem operation	2023-06-04 23:22:51 +03:00
Alexey Novikov	ffd4fcceec	Alternator: return full table description on return of DeleteTable The DeleteTable operation in Alternator should return a TableDescription object describing the table which has just been deleted, similar to what DescribeTable returns. Fixes scylladb#11472 Closes #11628	2023-06-04 21:00:26 +03:00
Botond Dénes	0cff0ffa08	Merge 'alternator,config: make alternator_timeout_in_ms live-updateable' from Kefu Chai before this change, alternator_timeout_in_ms is not live-updatable, as after setting executor's default timeout right before creating sharded executor instances, they never get updated with this option anymore. but many users would like to set the driver timers based on server timers. we need to enable them to configure timeout even when the server is still running. in this change, * `alternator_timeout_in_ms` is marked as live-updateable * `executor::_s_default_timeout` is changed to a thread_local variable, so it can be updated by a per-shard updateable_value. and it is now a updateable_value, so its variable name is updated accordingly. this value is set in the ctor of executor, and it is disconnected from the corresponding named_value<> option in the dtor of executor. * alternator_timeout_in_ms is passed to the constructor of executor via sharded_parameter, so `executor::_timeout_in_ms` can be initialized on per-shard basis * `executor::set_default_timeout()` is dropped, as we already pass the option to executor in its ctor. Fixes #12232 Closes #13300 * github.com:scylladb/scylladb: alternator: split the param list of executor ctor into multi lines alternator,config: make alternator_timeout_in_ms live-updateable	2023-05-15 10:16:29 +03:00
Avi Kivity	42a1ced73b	cql3: result_set: switch cell data type from bytes_opt to managed_bytes_opt The expression system uses managed_bytes_opt for values, but result_set uses bytes_opt. This means that processing values from the result set in expressions requires a copy. Out of the two, managed_bytes_opt is the better choice, since it prevents large contiguous allocations for large blobs. So we switch result_set to use managed_bytes_opt. Users of the result_set API are adjusted. The db::function interface is not modified to limit churn; instead we convert the types on entry and exit. This will be adjusted in a following patch.	2023-05-07 17:17:36 +03:00
Kefu Chai	37cf04818e	alternator: split the param list of executor ctor into multi lines before this change, the line is 249 chars long, so split it into multiple lines for better readability. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-23 20:57:28 +08:00
Kefu Chai	69c21f490a	alternator,config: make alternator_timeout_in_ms live-updateable before this change, alternator_timeout_in_ms is not live-updatable, as after setting executor's default timeout right before creating sharded executor instances, they never get updated with this option anymore. in this change, * alternator_timeout_in_ms is marked as live-updateable * executor::_s_default_timeout is changed to a thread_local variable, so it can be updated by a per-shard updateable_value. and it is now a updateable_value, so its variable name is updated accordingly. this value is set in the ctor of executor, and it is disconnected from the corresponding named_value<> option in the dtor of executor. * alternator_timeout_in_ms is passed to the constructor of executor via sharded_parameter, so executor::_timeout_in_ms can be initialized on per-shard basis * executor::set_default_timeout() is dropped, as we already pass the option to executor in its ctor. please note, in the ctor of executor, we always update the cached value of `s_default_timeout` with the value of `_timeout_in_ms`, and we set the default timeout to 10s in `alternator_test_env`. this is a design decision to avoid bending the production code for testing, as in production, we always set the timeout with the value specified either by the default value of yaml conf file. Fixes #12232 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-03-23 20:57:08 +08:00
Marcin Maliszkiewicz	f2788f5391	alternator: yield during BatchGetItem operation	2023-01-20 14:44:24 +01:00
Marcin Maliszkiewicz	7230841431	alternator: unify json streaming heuristic Main assumption here is that if is_big is good enough for GetBatchItems operation it should work well also for Scan, Query and GetRecords. And it's easier to maintain more unified code. Additionally 'future<> print' documentation used for streaming suggests that there is quite big overhead so since it seems the only motivation for streaming was to reduce contiguous allocation size below some threshold we should not stream when this threshold is not exceeded. Closes #12164	2023-01-19 16:40:43 +02:00
Michał Sala	041cb77ad0	alternator, db: move the tag code to db/tags Tags are a useful mechanism that could be used outside of alternator namespace. My motivation to move tags_extension and other utilities to db/tags/ was that I wanted to use them to mark "synchronous mode" views. I have extracted `get_tags_of_table`, `find_tag` and `update_tags` method to db/tags/utils.cc and moved alternator/tags_extension.hh to db/tags/. The signature of `get_tags_of_table` was changed from `const std::map<sstring, sstring>&` to `const std::map<sstring, sstring>*` Original behavior of this function was to throw an `alternator::api_error` exception. This was undesirable, as it introduced a dependency on the alternator module. I chose to change it to return a potentially null value, and added a wrapper function to the alternator module - `get_tags_of_table_or_throw` to keep the previous throwing behavior.	2022-07-25 09:53:33 +02:00
Botond Dénes	52058ea974	alternator: make is_alternator_keyspace() a standalone method	2022-06-30 14:18:29 +03:00
Nadav Har'El	3aca1ca572	alternator: make BatchGetItem group reads by partition DynamoDB API's BatchGetItem invokes a number (up to 25) of read requests in parallel, returning when all results are available. Alternator naively implemented this by sending all read requests in parallel, no matter which requests these were. That implementation was inefficient when all the requests are to different items (clustering rows) of the same partition. In a multi-node setup this will end up sending 25 separate requests to the same remote node(s). Even on a single-node setup, this may result in reading from disk more than once, and even if the partition is cached - doing an O(logN) search in each multiple times. What we do in this patch, instead, is to group all the BatchGetItem requests aimed at the same partition into a single read request asking for a (sorted) list of clustering keys. This is similar to an "IN" request in CQL. As an example of the performance benefit of this patch, I tried a BatchGetItem request asking for 20 random items from a 10-million item partition. I measured the latency of this request on a single-node Scylla. Before this patch, I saw a latency of 17-21 ms (the lower number is when the request is retried and the requested items are already in the cache). After this patch, the latency is 10-14 ms. The performance improvement on multi-node clusters is expected to be even higher. Unfortunately the patch is less trivial than I hoped it would be, because some of the old code was organized under the assumption that each read request only returned one item (and if it failed, it means only one item failed), so this part of the code had to be reorganized (and, for making the code more readable, coroutinized). An unintended benefit of the code reorganization is that it also gave me an opportunity to fail an attempt to ask BatchGetItem the same item more than once (issue #10757). The patch also adds a few more corner cases in the tests, to be even more sure that the code reorganization doesn't introduce a regression in BatchGetItem. Fixes #10753 Fixes #10757 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-06-19 14:47:57 +03:00
Nadav Har'El	62b6179c88	alternator: remove dead code Remove the function make_keyspace_name() that was never used. We could have used this function, but we didn't, and it had had an inconvenient API. If we later want to de-duplicate the several copies of "executor::KEYSPACE_NAME_PREFIX + table_name" we have in the code, we can do it with a better API. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-05-26 15:21:14 +03:00
Nadav Har'El	d0ca09a925	alternator: implement DescribeContinuousBackups operation Although we don't yet support the DynamoDB API's backup features (see issue #5063), we can already implement the DescribeContinuousBackups operation. It should just say that continuous backups, and point-in-time restores, are disabled. This will be useful for client code which tries to inquire about continuous backups, even if not planning to use them in practice (e.g., see issue #10660). Refs #5063 Refs #10660 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2022-05-26 15:13:50 +03:00
Nadav Har'El	84143c2ee5	alternator: implement Select option of Query and Scan This patch implements the previously-unimplemented Select option of the Query and Scan operators. The most interesting use case of this option is Select=COUNT which means we should only count the items, without returning their actual content. But there are actually four different Select settings: COUNT, ALL_ATTRIBUTES, SPECIFIC_ATTRIBUTES, and ALL_PROJECTED_ATTRIBUTES. Five previously-failing tests now pass, and their xfail mark is removed: * test_query.py::test_query_select * test_scan.py::test_scan_select * test_query_filter.py::test_query_filter_and_select_count * test_filter_expression.py::test_filter_expression_and_select_count * test_gsi.py::test_gsi_query_select_1 These tests cover many different cases of successes and errors, including combination of Select and other options. E.g., combining Select=COUNT with filtering requires us to get the parts of the items needed for the filtering function - even if we don't need to return them to the user at the end. Because we do not yet support GSI/LSI projection (issue #5036), the support for ALL_PROJECTED_ATTRIBUTES is a bit simpler than it will need to be in the future, but we can only finish that after #5036 is done. Fixes #5058. The most intrusive part of this patch is a change from attrs_to_get - a map of top-level attributes that a read needs to fetch - to an optional<attrs_to_get>. This change is needed because we also need to support the case that we want to read no attributes (Select=COUNT), and attrs_to_get.empty() used to mean that we want to read all attributes, not no attributes. After this patch, an unset optional<attrs_to_get> means read all attributes, a set but empty attrs_to_get means read no attributes, and a set and non-empty attrs_to_get means read those specific attributes. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20220405113700.9768-2-nyh@scylladb.com>	2022-04-11 10:04:32 +02:00
Avi Kivity	fcb8d040e8	treewide: use Software Package Data Exchange (SPDX) license identifiers Instead of lengthy blurbs, switch to single-line, machine-readable standardized (https://spdx.dev) license identifiers. The Linux kernel switched long ago, so there is strong precedent. Three cases are handled: AGPL-only, Apache-only, and dual licensed. For the latter case, I chose (AGPL-3.0-or-later and Apache-2.0), reasoning that our changes are extensive enough to apply our license. The changes we applied mechanically with a script, except to licenses/README.md. Closes #9937	2022-01-18 12:15:18 +01:00
Nadav Har'El	8bcd23fa02	Merge: move rest of internal ddl users to use raft from Gleb The patch series moves the rest of internal ddl users to do schema change over raft (if enabled). After that series only tests are left using old API. * 'gleb/raft-schema-rest-v6' of github.com:scylladb/scylla-dev: (33 commits) migration_manager: drop no longer used functions system_distributed_keyspace: move schema creation code to use raft auth: move table creation code to use raft auth: move keyspace creation code to use raft table_helper: move schema creation code to use raft cql3: make query_processor inherit from peering_sharded_service table_helper: make setup_table() static table_helper: co-routinize setup_keyspace() redis: move schema creation code to go through raft thrift: move system_update_column_family() to raft thrift: authenticate a statement before verifying in system_update_column_family() thrift: co-routinize system_update_column_family() thrift: move system_update_keyspace() to raft thrift: authenticate a statement before verifying in system_update_keyspace() thrift: co-routinize system_update_keyspace() thrift: move system_drop_keyspace() to raft thrift: authenticate a statement before verifying in system_drop_keyspace() thrift: co-routinize system_drop_keyspace() thrift: move system_add_keyspace() to raft thrift: co-routinize system_add_keyspace() ...	2022-01-12 18:09:08 +02:00
Gleb Natapov	1491cc2906	alternator: move create_table() to raft	2022-01-12 16:33:16 +02:00
Gleb Natapov	0ac20b5494	alternator: make some functions static Make add_stream_options, supplement_table_info, supplement_table_stream_info static. They only need a pointer to storage_proxy, so pass it directly.	2022-01-12 16:33:15 +02:00
Calle Wilund	4a8a7ef8b4	executor/server: Add routine to make stream object return Simply retains result object and sets json::json_return_type to streaming callback.	2022-01-12 13:34:49 +00:00
Nadav Har'El	be969ff995	alternator: add find_tag() function find_tag() returns the value of a specific tag on a table, or nothing if it doesn't exist. Unlike the existing get_tags_of_table() above, if the table is missing the tags extension (e.g., is not an Alternator table) it's not an error - we return nothing, as in the case that tags exist but not this tag. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-11-25 22:01:36 +02:00
Nadav Har'El	4ffd8c1f2b	alternator: stub TTL operations This patch adds stubs for the UpdateTimeToLive and DescribeTimeToLive operations to Alternator. These operations can enable, disable, or inquire about, the chosen expiration-time attribute. Currently, the information about the chosen attribute is only saved, with no actual expiration of any items taking place. Some of the tests for the TTL feature start to pass, so their xfail tag is removed. Because this new feature is incomplete, it is not enabled unless the "alternator-ttl" experimental feature is enabled. Moreover, for these operations to be allowed, the entire cluster needs to support this experimental feature, because all nodes need to participate in the data expiration - if some old nodes don't support Alternator TTL, some of the data they hold won't get expired... So we don't allow enabling TTL until all the nodes in the cluster support this feature. The implementation is in a new source file, alternator/ttl.cc. This source file will continue to grow as we implement the expiration feature. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-09-19 21:05:21 +03:00
Nadav Har'El	7404c7a9c1	alternator: make three utility functions in executor.cc non-static Make three of the utility functions in alternator/executor.cc, which until now were static (local to the source files) external symbols (in the alternator namespace). This will allow using them in other Alternator source files - like the one in the next patch for TTL support, which we'll want to put in a separate source file. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-09-19 21:05:21 +03:00
Avi Kivity	aa68927873	gossiper: remove get_local_gossiper() from some inline helpers Some state accessors called get_local_gossiper(); this is removed and replaced with a parameter. Some callers (redis, alternators) now have the gossiper passed as a parameter during initialization so they can use the adjusted API.	2021-09-07 17:03:37 +03:00
Pavel Emelyanov	773d2fe2a4	alternator: Drop storage service from executor It's completely unused in it. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-06-11 18:05:11 +03:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Avi Kivity	daeddda7cc	treewide: remove inclusions of storage_proxy.hh from headers storage_proxy.hh is huge and includes many headers itself, so remove its inclusions from headers and re-add smaller headers where needed (and storage_proxy.hh itself in source files that need it). Ref #1.	2021-04-20 21:23:00 +03:00
Nadav Har'El	f41dac2a3a	alternator: avoid large contiguous allocation for request body Alternator request sizes can be up to 16 MB, but the current implementation had the Seastar HTTP server read the entire request as a contiguous string, and then processed it. We can't avoid reading the entire request up-front - we want to verify its integrity before doing any additional processing on it. But there is no reason why the entire request needs to be stored in one big contiguous allocation. This is always a bad idea. We should use a non-contiguous buffer, and that's the goal of this patch. We use a new Seastar HTTPD feature where we can ask for an input stream, instead of a string, for the request's body. We then begin the request handling by reading the content of this stream into a vector<temporary_buffer<char>> (which we alias "chunked_content"). We then use this non-contiguous buffer to verify the request's signature and if successful - parse the request JSON and finally execute it. Beyond avoiding contiguous allocations, another benefit of this patch is that while parsing a long request composed of chunks, we free each chunk as soon as its parsing completed. This reduces the peak amount of memory used by the query - we no longer need to store both unparsed and parsed versions of the request at the same time. Although we already had tests with requests of different lengths, most of them were short enough to only have one chunk, and only a few had 2 or 3 chunks. So we also add a test which makes a much longer request (a BatchWriteItem with large items), which in my experiment had 17 chunks. The goal of this test is to verify that the new signature and JSON parsing code which needs to cross chunk boundaries work as expected. Fixes #7213. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210309222525.1628234-1-nyh@scylladb.com>	2021-03-10 09:22:34 +01:00
Kamil Braun	e2f03e4aba	cdc: move (most of) CDC generation management code to the new service Currently all management of CDC generations happens in storage_service, which is a big ball of mud that does many unrelated things. Previous commits have introduced a new service for managing CDC generations. This code moves most of the relevant code to this new service. However, some part still remains in storage_service: the bootstrap procedure, which happens inside storage_service, must also do some initialization regarding CDC generations, for example: on restart it must retrieve the latest known generation timestamp from disk; on bootstrap it must create a new generation and announce it to other nodes. The order of these operations w.r.t the rest of the startup procedure is important, hence the startup procedure is the only right place for them. Still, what remains in storage_service is a small part of the entire CDC generation management logic; most of it has been moved to the new service. This includes listening for generation changes and updating the data structures for performing CDC log writes (cdc::metadata). Furthermore these functions now return futures (and are internally coroutines), where previously they required a seastar::async context.	2021-02-26 12:06:12 +01:00
Piotr Sarna	495b7b5596	alternator: use unique_ptr for storing attribute paths Previous commit eliminated the only copying of the attribute paths, so it's now safe to make the object noncopyable. Message-Id: <5468e8c17d3d42a03c1dd33706bbaac0c58959ce.1613398751.git.sarna@scylladb.com>	2021-02-15 18:22:59 +02:00
Nadav Har'El	33685a683e	alternator: implemented nested attribute paths in UpdateExpression This patch adds full support for nested attribute paths (e.g., a.b[3].c) in UpdateExpression. Since previous patches already added such support for ProjectionExpression, ConditionExpression and FilterExpression, this means the nested attribute paths feature is now complete, so we remove the warning from the documents. However, there is one last loose end to tie and we will do it in the next patch: After this patch, the combination of UpdateExpression with nested attributes and ReturnValues is still wrong, and the test for it in test_returnvalues.py still xfails. Note that previous patches already implemented support for attribute paths in expression evaluations - i.e., the right-hand side of UpdateExpression actions, and in this patch we just needed to implement the left-hand side: When an update action is on an attribute a.b we need to read the entire content of the top-level a (an RMW operation), modify just the b part of its json with the result of the action, and finally write back the entire content of a. Of course everything gets complicated by the fact that we can have multiple actions on multiple pieces of the same JSON, and we also need to detect overlapping and conflicting actions (we already have this detection in the attribute_path_map<> class we introduced in a previous patch). I decided to leave one small esoteric difference, reproduced by the xfailing test_update_expression.py::test_nested_attribute_remove_from_missing_item: As expected, "SET x.y = :val" fails for an item if its attribute x doesn't exist or the item itself does not exist. For the update expression "REMOVE x.y", DynamoDB fails if the attribute x doesn't exist, but oddly silently passes if the entire item doesn't exist. Alternator does not currently reproduce this oddity - it will fail this write as well. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	4c7e27c688	alternator: prepare for UpdateItem nested attribute paths This patch prepares UpdateItem for updating of nested attribute paths (e.g., "SET a.b = :val"), but does not yet support them. Instead of _update_expression holding an unsorted list of "actions", we change it to hold an attribute_path_map of actions. This will allow us to process all the actions on a top-level attribute together, and moreover gets us "for free" the correct checking for overlapping and conflicting updates - exactly the same checking we already had in attribute_path_map for ProjectionExpression. Other than this change, most of this patch is just code movement, not functional changes. After this patch, the tests for update path overlap and conflict pass: test_update_expression_multi_overlap_nested and test_update_expression_multi_conflict_nested. We can also mark test_update_expression_nested_attribute_rhs as passing - this test involves an attribute path in the right-hand-side of an update, but the left-hand-side is still a top-level attribute, so it works (it actually worked before this patch - it started working when we implemented attribute paths in expressions, for ConditionExpression and FilterExpression). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	7c5db2da83	alternator: overhaul ProjectionExpression hierarchy implementation For ProjectionExpression we implemented a hierarchical filter object which can be used to hold a tree of attribute paths grouped by the top-level attributes, and also detect overlapping and conflicting entries. For UpdateExpression, we need almost exactly the same object: We need to group update actions (e.g., SET a.b=3) by the top-level attribute, and also detect and fail overlapping or conflicting paths. So in this patch we rewrite the data structure we had for ProjectionExpression in a more generic manner, using the template attribute_path_map<T> - which holds data of type T for each attribute path. We also implement a template function attribute_path_map_add() to add a path/value pair to this map, which includes all the overlap and conflict detecting logic. There shouldn't be functional changes in this patch. The ProjectionExpression code uses the new generic code instead of the specific code, but should work the same. In the next patch we can use the new generic code to implement UpdateExpression as well. The only somewhat functional change is better error messages for conflicting or overlapping paths - which now include one of the conflicting paths. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	104ef5242b	alternator: support attribute paths in ProjectionExpression This patch fully implements support for attribute paths (e.g. a.b.c, a.d[3]) for the ProjectionExpression in the various operations where this parameter is supported - GetItem, BatchGetItem, Query and Scan. After this patch, all xfailing tests in test_projection_expression.py now pass. In the previous patch we remembered in the "attrs_to_get" object not only the top-level attributes to read from the table, but also how to filter from it only the desired pieces of the nested document. In this patch we add a filter() function to do this filtering, and call it in the right places to post-process the JSON objects we read from the table. We also had to fix reference resolution in paths to resolve all the components of the path (e.g., #name1.#name2) and not just the top-level attribute. This is not the end of attribute path support, there are still other expressions (ConditionExpression, UpdateExpression, FilterExpression, ReturnValues) where they are not yet supported. This will come in following patches. Refs #5024 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:40 +02:00
Nadav Har'El	6340619e69	alternator: overhaul attrs_to_get handling In the existing code, the variable "attrs_to_get" is a list of top-level attributes to fetch for an item. It is used to implement features like ProjectionExpression or AttributesToGet in GetItem and other places. However, to support attribute paths (e.g., a.b.c[2]) in ProjectionExpression, i.e., issue #5024, we need more than that. We still need to know the top-level attribute "a", because this is the granularity we have in the Scylla table (all the content inside "a" is serialized as a single JSON); But we also need to remember exactly which parts inside "a" we will need to extract and return. So in this patch we add a new type, "attrs_to_get", which is more than just a list of top-level attributes. Instead, it is a map, whose keys are the top-level attributes, and the value for each of them is a "hierarchy_filter", an object which describes which part of the attribute is needed. This patch includes the code which converts the AttributesToGet and ProjectionExpression into the new attrs_to_get structure. During this conversion, we recognize two kinds of errors which DynamoDB complains about: We recognize "overlapping" attributes (e.g., requesting both a.b and a.b.c) and "conflicting" attributes (e.g., requesting both a.b and a[1]). After this, two xfailing tests we had for detecting these overlap and conflicts finally pass and their "xfail" label is removed. After this patch, we have the attrs_to_get object which can allow us to filter only the requested pieces of the top-level attributes, but we don't use it yet - so this patch is not enough for complete support of attribute paths in ProjectionExpression. We will complete this support in the next patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:40 +02:00
Nadav Har'El	781f9d9aca	alternator: make default timeout configurable Whereas in CQL the client can pass a timeout parameter to the server, in the DynamoDB API there is no such feature; The server needs to choose reasonable timeouts for its own internal operations - e.g., writes to disk, querying other replicas, etc. Until now, Alternator had a fixed timeout of 10 seconds for its requests. This choice was reasonable - it is much higher than we expect during normal operations, and still lower than the client-side timeouts that some DynamoDB libraries have (boto3 has a one-minute timeout). However, there's nothing holy about this number of 10 seconds, some installations might want to change this default. So this patch adds a configuration option, "--alternator-timeout-in-ms", to choose this timeout. As before, it defaults to 10 seconds (10,000ms). In particular, some test runs are unusually slow - consider for example testing a debug build (which is already very slow) in an extremely over-committed test host. In some cases (see issue #7706) we noticed the 10 second timeout was not enough. So in this patch we increase the default timeout chosen in the "test/alternator/run" script to 30 seconds. Please note that as the code is structured today, this timeout only applies to some operations, such as GetItem, UpdateItem or Scan, but does not apply to CreateTable, for example. This is a pre-existing issue that this patch does not change. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201207122758.2570332-1-nyh@scylladb.com>	2020-12-09 14:30:43 +01:00
Calle Wilund	f5c79d15a8	alternator: Include stream arn in table description if enabled Fixes #7157 When creating/altering/describing a table, if streams are enabled, the "latest active" stream arn should be included as LatestStreamArn. Not doing so breaks java kinesis.	2020-09-07 08:16:11 +00:00
Calle Wilund	a978e043c3	alternator::streams: Do not allow enabling streams when CDC is off Fixes #6866 If we try to create/alter an Alternator table to include streams, we must check that the cluster does in fact support CDC (experimental still). If not, throw a hopefully somewhat descriptive error. (Normal CQL table create goes through a similar check in cql_prop_defs) Note: no other operations are prohibited. The cluster could have had CDC enabled before, so streams could exist to list and even read. Any tables loaded from schema tables should be responsible for their own validation.	2020-08-03 21:01:31 +03:00
Calle Wilund	05851578d4	alternator::streams: Report streams as not ready until CDC stream id:s are available Refs #6864 When booting a clean scylla, CDC stream ID:s will not be available until a ring delay time period has passed. Before this, writing to a CDC enabled table will fail hard. For alternator (and its tests), we can report the stream(s) for tables as not yet available (ENABLING) until such time as id:s are computed. v2: Keep storage service ref in executor	2020-08-03 20:34:15 +03:00
Calle Wilund	cbb70f4af4	executor: "UpdateTable" support for streams Partial implementation of the "UpdateTable" command. Supports only enabling/disabling streams.	2020-07-15 08:21:34 +00:00
Calle Wilund	45ee73969d	executor: Allow streams specification in CreateTable schema	2020-07-15 08:21:34 +00:00
Calle Wilund	bbc544748f	alternator: Implement GetRecords Simplistic variant, using 1:1 mapping of scylla stream id <-> shard	2020-07-15 08:21:34 +00:00

89 Commits