scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 14:15:46 +00:00

Author	SHA1	Message	Date
Nadav Har'El	3322328b21	alternator test: fix two tests that failed in HTTPS mode When the test suite is run with Scylla serving in HTTPS mode, using test/alternator/run --https, two Alternator Streams tests failed. With this patch fixing a bug in the test, the tests pass. The bug was in the is_local_java() function which was supposed to detect DynamoDB Local (which behaves in some things differently from the real DynamoDB). When that detection code makes an HTTPS request and does not disable checking the server's certificate (which on Alternator is self-signed), the request fails - but not in the way that the code expected. So we need to fix the is_local_java() to allow the failure mode of the self-signed certificate. Anyway, this case is not DynamoDB Local so the detection function would return false. Fixes #7214 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200910194738.125263-1-nyh@scylladb.com>	2020-09-11 08:10:13 +02:00
Nadav Har'El	02ee0483b2	alternator test: add reproducing tests for several issues This patch adds regression tests for four recently-fixed issues which did not yet have tests: Refs #7157 (LatestStreamArn) Refs #7158 (SequenceNumber should be numeric) Refs #7162 (LatestStreamLabel) Refs #7163 (StreamSpecification) I verified that all the new tests failed before these issues were fixed, but now pass. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200907155334.562844-1-nyh@scylladb.com>	2020-09-10 17:36:23 +02:00
Nadav Har'El	1d06da18fc	alternator test: test for the TRIM_HORIZON stream iterator This patch adds a test for the TRIM_HORIZON option of GetShardIterator in Alternator Streams. This option asks to fetch again all the available history in this shard stream. We had an implementation for it, but not a test - so this patch adds one. The test passes. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200830131458.381350-1-nyh@scylladb.com>	2020-09-02 18:37:06 +02:00
Nadav Har'El	3d4183863a	alternator test: add tests for sequence-number based iterators Alternator Streams already support the AT_SEQUENCE_NUMBER and AFTER_SEQUENCE_NUMBER options for iterators. These options allow to replay a stream of changes from a known position or after that known position. However, we never had a test verifying that these features actually work as intended, beyond just checking syntax. Having such tests is important because recently we changed the implementation of these iterators, but didn't have a test verifying that they still work. So in this patch we add such tests. The tests pass (as usual, on both Alternator and DynamoDB). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200830115817.380075-1-nyh@scylladb.com>	2020-09-02 18:36:59 +02:00
Nadav Har'El	c879b23b82	alternator test: add another passing Alternator Streams test We had a test, test_streams_last_result, that verifies that after reading from an Alternator Stream the last event, reading again will find nothing. But we didn't actually have a test which checks that if at that point a new event does arrive, we can read it. This test checks this case, and it passes (we don't have a bug there, but it's good as a regression test for NextShardIterator). This test also verifies that after reading an event for a particular key on a a specific stream "shard", the next event for the same key will arrive on the same shard. This test passes on both Alternator and DynamoDB. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200830105744.378790-1-nyh@scylladb.com>	2020-09-02 18:36:50 +02:00
Nadav Har'El	1da3af5420	alternator test: enable a passing test After issue #7107 was fixed (regarding the correctness of OldImage and NewImage in Alternator Streams) we forgot to remove the "xfail" tag from one of the tests for this issue. This test now passes, as expected, so in this patch we remove the xfail tag. Refs #7107 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200827103054.186555-1-nyh@scylladb.com>	2020-08-27 14:15:32 +03:00
Calle Wilund	678ecc7469	alternator_streams: Include keys in OldImage/NewImage Fixes #6935 Fixes #7107 DynamoDB streams for some reason duplicate the record keys into both the "Keys" and "OldImage"/"NewImage" sub-objects when doing GetRecords. This patch appends the pk/ck parts into old/new image, and also removes the previous restrictions on image generation since cdc now generates more consistent pre/post image data.	2020-08-26 18:14:09 +00:00
Nadav Har'El	d4b452002a	alternator test: tests for NewImage feature of Alternator Streams This patch adds tests for the "NewImage" attribute in Alternator Streams in NEW_IMAGE and NEW_AND_OLD_IMAGES mode. It reproduces issue #7107, that items' key attributes are missing in the NewImage. It also verifies the risky corner cases where the new item is "empty" and NewImage should include just the key, vs. the case where the item is deleted, so NewImage should be missing. This test currently passes on AWS DynamoDB, and xfails on Alternator. Refs #7107. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200825113857.106489-1-nyh@scylladb.com>	2020-08-26 11:33:23 +03:00
Nadav Har'El	a1453303f8	alternator test: test for OLD_IMAGE of an empty item We already have a test, test_streams.py::test_streams_updateitem_old_image, for issue #6935: It tests that the OLD_IMAGE in Alternator Streams should contain the item's key. However this test was missing one corner case, which is the first solution for this issue did incorrectly. So in this patch we add a test for this corner case, test_streams_updateitem_old_image_empty_item: This corner case about the item existing, but empty, i.e., having just the key but no other attribute. In this case, OLD_IMAGE should return that empty item - including its key. Not nothing. As usual, this test passes on DynamoDB and xfails on Alternator, and the "xfail" mark will be removed when issue #6935 is fixed. Refs #6935. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200819155229.34475-1-nyh@scylladb.com>	2020-08-20 13:22:40 +02:00
Nadav Har'El	4c73d43153	Alternator: allow CreateTable with SSESpecification explicitly disabled While Alternator doesn't yet support creating a table with a different "server-side encryption" (a.k.a. encryption-at-rest) parameters, the SSESpecification option with Enabled=false should still be allowed, as it is just the default, and means exactly the same as would a missing SSESpecification. This patch also adds a test for this case, which failed on Alternator before this patch. Fixes #7031. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200812205853.173846-1-nyh@scylladb.com>	2020-08-17 13:48:52 +02:00
Nadav Har'El	159a966949	alternator test, streams: add test LSI key attributes in OldImage This patch adds a test that attributes which serve as a key for a secondary index still appear in the OldImage in an Alternator Stream. This is a special case, because although usually Alternator attributes are saved as map elements, not stand-alone Scylla columns, in the special case of secondary-index keys they are saved as actual Scylla columns in the base table. And it turns out we produce wrong results in this case: CDC's "preimage" does not currently include these columns if they didn't change, while DynamoDB requires that all columns, not just the changed ones, appear in OldImage. So the test added in this patch xfails on Alternator (and as usual, passes on DynamoDB). Refs #7030. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200812144656.148315-1-nyh@scylladb.com>	2020-08-17 13:46:53 +02:00
Nadav Har'El	f8291500cf	alternator test: test for ConditionExpression on key columns While answering a stackoverflow question on how to create an item but only if we don't already have an item with the same key, I realized that we never actually tested that ConditionExpressions works on key columns: all the tests we had in test_condition_expression.py had conditions on non-key attributes. So in this patch we add two tests with a condition on the key attribute. Most examples of conditions on the key attributes would be silly, but in these two tests we demonstrate how a test on key attributes can be useful to solve the above need of creating an item if no such item exists yet. We demonstrate two ways to do this using a condition on the key - using either the "<>" (not equal) operator, or the "attribute_not_exists()" function. These tests pass - we don't have a bug in this. But it's nice to have a test that confirms that we don't (and don't regress in that area). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200806200322.1568103-1-nyh@scylladb.com>	2020-08-07 08:05:48 +02:00
Nadav Har'El	8f12ef3628	alternator test: faster stream tests by reducing number of vnodes The single-node test Scylla run by test/alternator/run uses, as the default, 256 vnodes. When we have 256 vnodes and two shards, our CDC implementation produces 512 separate "streams" (called "shards" in DynamoDB lingo). This causes each of the tests in test_streams.py which need to read data from the stream to need to do 1024 (!) API requests (512 calls to GetShardIterator and 512 calls to GetRecords) which takes significant time - about a second per test. In this patch, we reduce the number of vnodes to 16. We still have a non-negligible number of stream "shards" (32) so this part of the CDC code is still exercised. Moreover, to ensure we still routinely test the paging feature of DescribeStream (whose default page size is 100), the patch changes the request to use a Limit of 10, so paging will still be used to retrieve the list of 32 shards. The time to run the 27 tests in test_streams.py, on my laptop: Before this patch: 26 seconds After this patch: 6 seconds. Fixes #6979 Message-Id: <20200805093418.1490305-1-nyh@scylladb.com>	2020-08-05 19:34:18 +03:00
Nadav Har'El	59dff3226b	Alternator tests: more tests for Alternator Streams This patch adds additional tests for Alternator Streams, which helped uncover 9 new issues. The stream tests are noticibly slower than most other Alternator tests - test_streams.py now has 27 tests taking a total of 20 seconds. Much of this slowness is attributed to Alternator Stream's 512 "shards" per stream in the single-node test setup with 256 vnodes, meaning that we need over 1000 API requests per test using GetRecords. These tests could be made significantly faster (as little as 4 seconds) by setting a lower number of vnodes. Issue #6979 is about doing this in the future. The tests in this patch have comments explaining clearly (I hope) what they test, and also pointing to issues I opened about the problems discovered through these tests. In particular, the tests reproduce the following bugs: Refs #6918 Refs #6926 Refs #6930 Refs #6933 Refs #6935 Refs #6939 Refs #6942 The tests also work around the following issues (and can be changed to be more strict and reproduce these issues): Refs #6918 Refs #6931 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200804154755.1461309-1-nyh@scylladb.com>	2020-08-05 19:34:18 +03:00
Nadav Har'El	ae25661d9c	alternator test: set streams time window to zero Alternator Streams have a "alternator_streams_time_window_s" parameter which is used to allow for correct ordering in the stream in the face of clock differences between Scylla nodes and possibly network delays. This parameter currently defaults to 10 seconds, and there is a discussion on issue #6929 on whether it is perhaps too high. But in any case, for tests running on a single node there is no reason not to set this parameter to zero. Setting this parameter to zero greatly speeds up the Alternator Streams tests which use ReadRecords to read from the stream. Previously each such test took at least 10 seconds, because the data was only readable after a 10 second delay. With alternator_streams_time_window_s=0, these tests can finish in less than a second. Unfortunately they are still relatively slow because our Streams implementation has 512 shards, and thus we need over a thousand (!) API calls to read from the stream). Running "test/alternator/run test_streams.py" with 25 tests took before this patch 114 seconds, after this patch, it is down to 18 seconds. Refs #6929 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Reviewed-by: Calle Wilund <calle@scylladb.com> Message-Id: <20200728184612.1253178-1-nyh@scylladb.com>	2020-08-03 09:19:57 +03:00
Nadav Har'El	8b9da9c92a	alternator test: tests for combination of query filter and projection The tests in this patch, which pass on DynamoDB but fail on Alternator, reproduce a bug described in issue #6951. This bug makes it impossible for a Query (or Scan) to filter on an attribute if that attribute is not requested to be included in the output. This patch includes two xfailing tests of this type: One testing a combination of FilterExpression and ProjectionExpression, and the second testing a combination of QueryFilter and AttributesToGet; These two pairs are, respectively, DynamoDB's newer and older syntaxes to achieve the same thing. Additionally, we add two xfailing tests that demonstrates that combining old and new style syntax (e.g., FilterExpression with AttributesToGet) should not have been allowed (DynamoDB doesn't allow such combinations), but Alternator currently accepts these combinations. Refs #6951 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200729210346.1308461-1-nyh@scylladb.com>	2020-07-30 09:34:23 +02:00
Nadav Har'El	665b78253a	alternator test: reduce amount of Scylla logs saved The test/alternator/run script follows the pytest log with a full log of Scylla. This saved log can be useful in diagnosing problems, but most of it is filled with non-useful "INFO"-level messages. The two biggest offenders are compaction - which logs every single compaction happening, and the migration manager, which is just a second (and very long) message about schema change operations (e.g., table creations). Neither of these are interesting for Alternator's tests, which shouldn't care exactly when compaction of which sstable is happening. These two components alone are reponsible for 80% of the log lines, and 90% of the log bytes! In this patch we increase the log level of just these two components - compaction and migration_manager - to WARN, which reduces the log by the same percentages (80% by lines, 90% by bytes). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200728191420.1254961-1-nyh@scylladb.com>	2020-07-29 14:17:12 +03:00
Nadav Har'El	a7df8486b1	alternator test: add test for tracing In commit `8d27e1b`, we added tracing (see docs/tracing.md) support to Alternator requests. However, we never had a functional test that verifies this feature actually works as expected, and we recently noticed that for the GetItem and BatchGetItem requestd, the trace doesn't really work (it returns an empty list of events). So this patch adds a test, test/alternator/test_tracing.py, which verifies that the tracing feature works for the PutItem, GetItem, DeleteItem, UpdateItem, BatchGetItem, BatchWriteItem, Query and Scan operations. This test is very peculiar. It needs to use out-of-band REST API requests to enable and disable tracing (of course, the test is skipped when running against AWS - this is a Scylla-only feature). It also needs to read CQL-only system tables and does this using Alternator's ".scylla.alternator" interface for system tables - which came through for us here beautifully and demonstrated their usefulness. I paid a lot of attention for this test to remain reasonably fast - this entire test now runs in a little less than one second. Achieving this while testing eight different requests was a bit of a challenge, because traces take time until they are visible in the trace table. This is the main reason why in this patch the test for all eight request types are done in one test, instead of eight separate tests. Fixes #6891 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200727115401.1199024-1-nyh@scylladb.com>	2020-07-27 14:31:45 +02:00
Nadav Har'El	65f75e3862	alternator test: enable test_get_records After issue #6864 was fixed, the test_streams.py::test_get_records test no longer fails, so its "xfail" marker can be removed. Refs #6864. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200722132518.1077882-1-nyh@scylladb.com>	2020-07-27 09:19:37 +02:00
Nadav Har'El	e0693f19d0	alternator test: produce newer xunit format for test results test.py passes the "--junit-xml" option to test/alternator/run, which passes this option to pytest to get an xunit-format summary of the test results. However, unfortunately until very recent versions (which aren't yet in Linux distributions), pytest defaulted to a non-standard xunit format which tools like Jenkins couldn't properly parse. The more standard format can be chosen by passing the option "-o junit_family=xunit2", so this is what we do in this patch. Fixes #6767 (hopefully). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200719203414.985340-1-nyh@scylladb.com>	2020-07-20 09:24:50 +03:00
Nadav Har'El	3b5122fd04	alternator test: fix warning message in test_streams.py In test_streams.py, we had the line: assert desc['StreamDescription'].get('StreamLabel') In Alternator, the 'StreamLabel' attribute is missing, which the author of this test probably thought would cause this test to fail (which is expected, the test is marked with "xfail"). However, my version of pytest actually doesn't like that assert is given a value instead of a comparison, and we get the warning message: PytestAssertRewriteWarning: asserting the value None, please use "assert is None" I think that the nicest replacement for this line is assert 'StreamLabel' in desc['StreamDescription'] This is very readable, "pythonic", and checks the right thing - it checks that the JSON must include the 'StreamLabel' item, as the get() assertion was supposed to have been doing. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200716124621.906473-1-nyh@scylladb.com>	2020-07-17 14:36:23 +03:00
Nadav Har'El	dcf9c888a2	alternator test: disable test_streams.py::test_get_records This test usually fails, with the following error. Marking it "xfail" until we can get to the bottom of this. dynamodb = dynamodb.ServiceResource() dynamodbstreams = <botocore.client.DynamoDBStreams object at 0x7fa91e72de80> def test_get_records(dynamodb, dynamodbstreams): # TODO: add tests for storage/transactionable variations and global/local index with create_stream_test_table(dynamodb, StreamViewType='NEW_AND_OLD_IMAGES') as table: arn = wait_for_active_stream(dynamodbstreams, table) p = 'piglet' c = 'ninja' val = 'lucifers' val2 = 'flowers' > table.put_item(Item={'p': p, 'c': c, 'a1': val, 'a2': val2}) test_streams.py:316: ... E botocore.exceptions.ClientError: An error occurred (Internal Server Error) when calling the PutItem operation (reached max retries: 3): Internal server error: std::runtime_error (cdc::metadata::get_stream: could not find any CDC stream (current time: 2020/07/15 17:26:36). Are we in the middle of a cluster upgrade?) Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-16 08:24:25 +03:00
Nadav Har'El	61f52da9b1	merge: Alternator/CDC: Implement streams support Merged pull request https://github.com/scylladb/scylla/pull/6694 by Calle Wilund: Implementation of DynamoDB streams using Scylla CDC. Fixes #5065 Initial, naive implementation insofar that it uses 1:1 mapping CDC stream to DynamoDB shard. I.e. there are a lot of shards. Includes tests verified against both local DynamoDB server and actual AWS remote one. Note: Because of how data put is implemented in alternator, currently we do not get "proper" INSERT labels for first write of data, because to CDC it looks like an update. The test compensates for this, but actual users might not like it.	2020-07-16 08:18:25 +03:00
Nadav Har'El	c4497bf770	alternator test: enable experimental CDC In the script test/alternator/run, which runs Scylla for the Alternator tests, add the "--experimental-features=cdc" option, to allow us testing the streams API whose implementation requires the experimenal CDC feature. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-07-16 08:18:09 +03:00
Calle Wilund	76f6fe679a	alternator tests: Add streams test Small set of positive and negative tests of streams functionality. Verified against DynamoDB and Alternator.	2020-07-15 08:21:34 +00:00
Nadav Har'El	8e3be5e7d6	alternator test: configurable temporary directory The test/alternator/run script creates a temporary directory for the Scylla database in /tmp. The assumption was that this is the fastest disk (usually even a ramdisk) on the test machine, and we didn't need anything else from it. But it turns out that on some systems, /tmp is actually a slow disk, so this patch adds a way to configure the temporary directory - if the TMPDIR environment variable exists, it is used instead of /tmp. As before this patch, a temporary subdirectry is created in $TMPDIR, and this subdirectory is automatically deleted when the test ends. The test.py script already passes an appropriate TMPDIR (testlog/$mode), which after this patch the Alternator test will use instead of /tmp. Fixes #6750 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200713193023.788634-1-nyh@scylladb.com>	2020-07-14 08:52:22 +03:00
Nadav Har'El	35f7048228	alternator: CreateTable with bad Tags shouldn't create a table Currently, if a user tries to CreateTable with a forbidden set of tags, e.g., the Tags list is too long or contains an invalid value for system:write_isolation, then the CreateTable request fails but the table is still created. Without the tag of course. This patch fixes this bug, and adds two test cases for it that fail before this patch, and succeed with it. One of the test cases is scylla_only because it checks the Scylla-specific system:write_isolation tag, but the second test case works on DynamoDB as well. What this patch does is to split the update_tags() function into two parts - the first part just parses the Tags, validates them, and builds a map. Only the second part actually writes the tags to the schema. CreateTable now does the first part early, before creating the table, so failure in parsing or validating the Tags will not leave a created table behind. Fixes #6809. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200713120611.767736-1-nyh@scylladb.com>	2020-07-13 17:14:44 +03:00
Nadav Har'El	f549d147ea	alternator: fix Expected's "NULL" operator with missing AttributeValueList The "NULL" operator in Expected (old-style conditional operations) doesn't have any parameters, so we insisted that the AttributeValueList be empty. However, we forgot to allow it to also be missing - a possibility which DynamoDB allows. This patch adds a test to reproduce this case (the test passes on DyanmoDB, fails on Alternator before this patch, and succeeds after this patch), and a fix. Fixes #6816. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200709161254.618755-1-nyh@scylladb.com>	2020-07-10 07:45:02 +02:00
Piotr Sarna	75dbaa0834	test: add alternator test for incorrect numeric values The test case is put inside test_manual_requests suite, because boto3 validates numeric inputs and does not allow passing arbitrary incorrect values. Tests: unit(dev), alternator(local, remote) Message-Id: <ac2baedc2ea61f0d857e7c01839f34cd15f7e02d.1594289250.git.sarna@scylladb.com>	2020-07-09 13:58:33 +03:00
Nadav Har'El	9ff9cd37c3	alternator test: tests for the number type We had some tests for the number type in Alternator and how it can be stored, retrieved, calculated and sorted, but only had rudementary tests for the allowed magnitude and precision of numbers. This patch creates a new test file, test_number.py, with tests aiming to check exactly the supported magnitudes and precision of numbers. These tests verify two things: 1. That Alternator's number type supports the full precision and magnitude that DynamoDB's number type supports. We don't want to see precision or magnitude lost when storing and retrieving numbers, or when doing calculations on them. 2. That Alternator's number type does not have better precision or magnitude than DynamoDB does. If it did, users may be tempted to rely on that implementation detail. The three tests of the first type pass; But all four tests of the second type xfail: Alternator currently stores numbers using big_decimal which has unlimited precision and almost-unlimited magnitude, and is not yet limited by the precision and magnitude allowed by DynamoDB. This is a known issue - Refs #6794 - and these four new xfailing tests will can be used to reproduce that issue. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200707204824.504877-1-nyh@scylladb.com>	2020-07-09 07:38:36 +02:00
Nadav Har'El	23ce6864a3	alternator test: ProjectionExpression test for BatchGetItem The tests in test_projection_expression.py test that ProjectionExpression works - including attribute paths - for the GetItem, Query and Scan operations. There is a fourth read operation - BatchGetItem, and it supports ProjectionExpression too. We tested BatchGetItem + ProjectionExpression in test_batch.py, but this only tests the basic feature, with top-level attributes, and we were missing a test for nested document paths. This patch adds such a test. It is still xfailing on Alternator (and passing on DynamoDB), because attribute paths are still not supported (this is issue #5024). Refs #5024. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200629063244.287571-1-nyh@scylladb.com>	2020-06-29 08:51:05 +02:00
Nadav Har'El	b6fdd956bd	alternator test: ProjectionExpression tests for document paths This patch adds three more tests for the ProjectionExpression parameter of GetItem. They are tests for nested document paths like a.b[2].c. We don't support nested paths in Alternator yet (this is issue #5024), so the new tests all xfail (and pass on DynamoDB). We already had similar tests for UpdateExpression, which also needs to support document paths, but the tests were missing for ProjectionExpression. I am planning to start the implementation of document paths with ProjectionExpression (which is the simplest use of document paths), so I want the tests for this expression to be as complete as possible. Refs #5024. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200628213208.275050-1-nyh@scylladb.com>	2020-06-29 08:50:55 +02:00
Nadav Har'El	095ddf0d41	alternator test: use ConsistentRead=True where missing All tests that write some data and then read it back need to use ConsistentRead=True, otherwise the test may sporadically fail on a multi- node cluster. In the previous patch we fixed the full_query()/full_scan() convenience functions. In this patch, I audited the calls to the boto3 read methods - get_item(), batch_get_item(), query(), scan(), and although most of them did use ConsistentRead=True as needed, I found some missing and this patch fixes them. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200616080334.825893-1-nyh@scylladb.com>	2020-06-17 14:57:45 +02:00
Nadav Har'El	c298088375	alternator test: use ConsistentRead=True for full_query/scan Many of the Alternator tests use the convenience functions full_query()/ full_scan() to read from the table. Almost all these tests need to be able to read their own writes, i.e., want ConsistentRead=True, but none of them explicitly specified this parameter. Such tests may sporadically fail when running on cluster with multiple nodes. So this patch follows a TODO in the code, and makes ConsistentRead=True the default for the full_() functions. The caller can still override it with ConsistentRead=False - and this is necessary in the GSI tests, because ConsistentRead=True is not allowed in GSIs. Note that while ConsistentRead=True is now the default for the full_() convenience functions, but it is still not the default for the lower level boto3 functions scan(), query() and get_item() - so usages of those should be evaluated as well and missing ConsistentRead=True, if any, should be added. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200616073821.824784-1-nyh@scylladb.com>	2020-06-17 14:57:45 +02:00
Nadav Har'El	0b9f25ab50	alternator: implement FilterExpression This patch provides a complete implementation for the FilterExpression parameter - the newer syntax for filtering the results of the Query or Scan operations. The implementation is pretty straightforward - we already added earlier a result-filtering framework to Alternator, and used it for the older filtering syntax - QuryFilter and ScanFilter. All we had to do now was to run the FilterExpression (which has the same syntax as a ConditionExpression) on each individual items. The previous cleanup patches were important to reduce the friction of running these expressions on the items. After the previous patches fixing small esoteric bugs in a few expression functions, with this patch all the tests in test_filter_expression.py now pass, and so do the two FilterExpression tests in test_query.py and test_scan.py. As far as I know (and of course minus any bugs we'll discover later), this marks the FilterExpression feature complete. Fixes #5038. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-06-14 12:16:26 +03:00
Nadav Har'El	f87259a762	alternator: improve error path of attribute_type() function The attribute_type() function, which can be used in expressions like ConditionExpression and FilterExpression, is supposed to generate an error if its second parameter is not one of the known types. What we did until now was to just report a failed check in this case. We already had a reproducing test with FilterExpression, but in this patch we also add a test with ConditionExpression - which fails before this patch and passes afterwards (and of course, passes with DynamoDB). Fixes #6641. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-06-14 12:16:20 +03:00
Nadav Har'El	11d86dfb06	alternator: fix begins_with() error path The begins_with() function should report an error if a constant is passed to it which isn't one of the supported types - string or bytes (e.g., a number). The code we had to check this had wrong logic, though. If the item attribute was also a number, we silently returned false, and didn't go on to detect that the second parameter - a constant - was a number too and should generate an error - not be silent. Fixed and added a reproducing test case and another test to validate my understanding of the type of parameters that begins_with() accepts. Fixes #6640. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-06-14 12:13:23 +03:00
Nadav Har'El	f79a4e0e78	alternator: fix corner case of contains() function in conditions It turns out that the contains() functions in the new syntax of conditions (ConditionExpression, FilterExpression) is not identical to the CONTAINS operator in the old-syntax conditions (Expected). In the new syntax, one can check whether any constant object is contained in a list. In the old syntax, the constant object must be of specific types. So we need to move the testing out of the check_CONTAINS() functions that both implementations used, and into just the implementation of the old syntax (in conditions.cc). This bug broke one of the FilterExpression tests, but this patch also adds new tests for the different behaviour of ConditionExpression and Expected - tests which also reproduce this issue and verify its fix. Fixes #6639. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-06-14 12:02:14 +03:00
Nadav Har'El	13ef31f38b	alternator: refactor resolving of references in expressions In the DynamoDB API, expressions (e.g., ConditionExpression and many more) may contain references to column names ("#name") or to values (":val") given in a separate part of the request - ExpressionAttributeNames and ExpressionAttributeValues respectively. Before this patch, we resolved these references as part of the expression's evaluation. This approach had two downsides: 1. It often misdiagnosed (both false negatives and false positives) cases of unused names and values in expressions. We already had two xfailing tests with examples - which pass after this patch. This patch also adds two additional tests, which failed before this patch and pass with it. 2. In one of the following patches we will add support for FilterExpression, where the same expression is used repeatedly on many items. It is a waste (as well as makes the code uglier) to resolve the same references again and again each time the expression is evaluated. We should be able to do it just once. So this patch introduces an intermediate step between parsing and evaluating an expression - "resolving" the expression. The new resolve_() functions modify the already parsed expression, replacing references to attribute names and constant values by the actual names and values taken from the request. The resolve_() functions also keep track which references were used, making it very easy to check (as DynamoDB does) if there are any unused names or values, before starting the evaluation. The interface of evaluate() functions become much simpler - they no longer need to know the original request (which was previously needed for ExpressionAttributeNames/Values), the table's schema (which was previously needed only for some error checking), keep track of which references were used. This simplification is helpful for using the expressions in contexts where these things (request and schema) are no longer conveniently available, namely in FilterExpression. A small side-benefit of this patch is that it moves a bit of code, which handled resolving of references in expressions, from executor.cc to expressions.cc. This is just the first step in a bigger effort to reduce the size of executor.cc by moving code to smaller source files. There is no attempt in this patch to move as much code as we can. We will move more code in a separate patch in this series. Fixes #6572. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2020-06-14 11:57:13 +03:00
Nadav Har'El	65d3e3992f	alternator test: small fixes for test_key_condition_expression_multi The test test_key_condition_expression_multi() had a small typo, which was hidden by the fact that the request was expected to fail for other reasons, but nevertheless should be fixed. Moreover, it appears that the Amazon DynamoDB changed their error message for this case, so running the test with "--aws" failed. So this patch makes it work again by being more forgiving on the exact error message. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200609205628.562351-1-nyh@scylladb.com>	2020-06-10 07:34:20 +02:00
Nadav Har'El	ace1697aa9	alternator test: reproducer for unjustly refused condition expression This patch adds a test reproducing issue #6572, where the perfectly good condition expression: #name1 = :val1 OR #name2 = :val2 Gets refused because of the following combination in our implementation: 1. Short-circuit evaluation, i.e., after we discover #name1 = :val1 we don't evaluate the second half of the expression. 2. The list of "used" references is collected at evaluation time, instead of at parsing time. Because evaluation never reaches #name2 (or :val2) our implementation complains that they are not used, and refuses the request - which should have been allowed. This test xfails on Alternator. It passes on DynamoDB. Refs #6572 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200604171954.444291-1-nyh@scylladb.com>	2020-06-05 07:43:50 +02:00
Piotr Sarna	0ba23d2b40	test: add manual test for tagging return value While not very interesting by itself, the test case shows that in case of TagResource and UntagResource it's actually correct to return empty HTTP body instead of an empty JSON object, which was the case for PutItem. Message-Id: <6331963179c5174a695f0e9eeed17de6c9f9a3be.1591269516.git.sarna@scylladb.com>	2020-06-04 16:17:24 +03:00
Piotr Sarna	8fc3ca855e	alternator: fix the return type of PutItem Even if there are no attributes to return from PutItem requests, we should return a valid JSON object, not an empty string. Fixes #6568 Tests: unit(dev)	2020-06-03 16:03:13 +03:00
Piotr Sarna	3aff52f56e	alternator: fix returning UnprocessedKeys unconditionally Client libraries (e.g. PynamoDB) expect the UnprocessedKeys and UnprocessedItems attributes to appear in the response unconditionally - it's hereby added, along with a simple test case. Fixes #6569 Tests: unit(dev)	2020-06-03 15:48:16 +03:00
Nadav Har'El	bea9629031	alternator: implement remaining QueryFilter / ScanFilter functionality This patch implements the missing QueryFilter (and ScanFilter) functionality:` 1. All operators. Previously, only the "EQ" operator was implemented. 2. Either "OR" or "AND" of conditions (previously only "AND"). 3. Correctly returning Count and ScannedCount for post-filter and pre-filter item counts, respectively. All of the previously-xfailing tests in test_query_filter.py are now passing. The implementation in this patch abandons our previous attempts to translate the DynamoDB API filters into Scylla's CQL filters. Doing this correctly for all operators would have been exceedingly difficult (for reasons explained in #5028), and simply not worth the effort: CQL's filters receive a page of results and then filter them, and we can do exactly the same without CQL's filters: The new code just retrieves an unfiltered page of items, and then for each of these items checks whether it passes the filters. The great thing is that we already had code for this checking - the QueryFilter syntax is identical to the "Expected" syntax (for conditional operations) that we already supported, so we already had code for checking these conditions, including all the different operators. This patch prepares for the future need to support also the newer FilterExpression syntax (see issue #5038), and the "filter" class supports either type of filter - the implementation for the second syntax is just missing and can be added (fairly easily) later. Fixes #5028. Refs #5038. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200603110118.399325-1-nyh@scylladb.com>	2020-06-03 13:16:45 +02:00
Nadav Har'El	f6b1f45d69	alternator: fix order conditions on binary attributes We implemented the order operators (LT, GT, LE, GE, BETWEEN) incorrectly for binary attributes: DynamoDB requires that the bytes be treated as unsigned for the purpose of order (so byte 128 is higher than 127), but our implementation uses Scylla's "bytes" type which has signed bytes. The solution is simple - we can continue to use the "bytes" type, but we need to use its compare_unsigned() function, not its "<" operator. This bug affected conditional operations ("Expected" and "ConditionExpression") and also filters ("QueryFilter", "ScanFilter", "FilterExpression"). The bug did not affect Query's key conditions ("KeyConditions", "KeyConditionExpression") because those already used Scylla's key comparison functions - which correctly compare binary blobs as unsigned bytes (in fact, this is why we have the compare_unsigned() function). The patch also adds tests that reproduce the bugs in conditional operations, and show that the bug did not exist in key conditions. Fixes #6573 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200603084257.394136-1-nyh@scylladb.com>	2020-06-03 10:55:50 +02:00
Nadav Har'El	0d337a716b	alternator test: confirm understanding of query paging with filtering This test (which passes successfully on both Alternator and DynamoDB) was written to confirm our understanding of how the paging feature works. Our understanding, based on DynamoDB documentation, has been that the "Limit" parameter determines the number of pre-filtering items, not the actual number of items returned after having passed the filter. So the number of items actually returned may be lower than Limit - in some cases even zero. This test tries an extreme case: We scan a collection of 20 items with a filter matching only 10 (or so) of them, with Limit=1, and count the number of pages that we needed to request until collecting all these 10 (or so) matches. We note that the result is 21 - i.e., DynamoDB and Alternator really went through the 20 pre-filtering items one by one, and for the items which didn't match the filter returned an empty page. The last page (the 21st) is always empty: DynamoDB or Alternator doesn't know whether or not there is a 21st item, and it takes a 21st request to discover there isn't. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200602145015.361694-1-nyh@scylladb.com>	2020-06-02 16:57:49 +02:00
Nadav Har'El	43138c0e5e	alternator test: test Count/ScannedCount return of Query This test reproduces a bug in the current implementation of QueryFilter, which returns for ScannedCount the count of post-filter items, whereas it should return the pre-filter count. The test tests both ScannedCount and Count, when QueryFilter is used and when it isn't used. The test currently xfails on Alternator, passes on DynamoDB. Refs #5028 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200602125924.358636-1-nyh@scylladb.com>	2020-06-02 16:57:49 +02:00
Nadav Har'El	d200cde9d6	alternator: extract function for parsing ConditionalOperator The code for parsing the ConditionalOperator attribute was used once in for the "Expected" case, but we will also need it for the "QueryFilter" and "ScanFilter" cases, so let's extract it into a function, get_conditional_operator(). While doing this extraction, I also noticed a bug: when Expected is missing, ConditionalOperator should not be allowed. We correctly checked the case of an empty Expected, but forgot to also check the case of a missing Expected. So the new code also fixes this corner case, and we include a new test case for it (which passes on DynamoDB and used to fail in Alternator but passes after this patch). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200528214721.230587-1-nyh@scylladb.com>	2020-05-29 08:26:43 +02:00
Nadav Har'El	cd0fbb8d38	alternator test: add comprehensive tests for QueryFilter feature The QueryFilter parameter of Query is only partially implemented (issue tests for it. In this patch, we add comprehensive tests for this feature and all its various operators, types, and corner cases. The tests cover both the parts we already implemented, and the parts we did not yet. As usual, all tests succeed on DynamoDB, but many still xfail on Alternator pending the complete implementation. Refs #5028. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20200525141242.133710-1-nyh@scylladb.com>	2020-05-27 15:29:27 +02:00

1 2

74 Commits