scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-05-02 14:15:46 +00:00

Author	SHA1	Message	Date
Nadav Har'El	cf06b7cd40	test/alternator: correct some typos in comments Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210729125317.1610573-1-nyh@scylladb.com>	2021-08-24 19:43:29 +03:00
Nadav Har'El	65381bd155	test/alternator: add tests for expression length limits The DynamoDB documentation https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Limits.html describes several hard limits on the size of expressions (ProjectionExpression, ConditionExpression, UpdateExpression, FilterExpression) and various elements they contain. In this patch we begin testing those limits with a comprehensive test for the length of each of these four expressions: we test that lengths up to (and including) 4096 bytes are allowed but longer expressions are rejected. We also add TODOs for additional documented limits that should be tested in the future. Currently, this test passes on DynamoDB but xfails on Alternator because Alternator does not enforce any limits on the expression length. I don't think this is a real problem, and we may consider keeping it this way, but we should at least be aware that this difference exists and an xfailing test will remind us. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210810081948.2012120-2-nyh@scylladb.com>	2021-08-10 12:06:21 +02:00
Nadav Har'El	9d49a32486	test/alternator: add tests for attribute name limits DynamoDB limits attribute names in items to lengths of up to 65535 bytes, but in some cases (such as key attributes) the limit is lower - 255. This patch adds tests for many of these cases. All the new tests pass on DynamoDB, but some still xfail on Alternator because Alternator is too lenient - sometimes allowing longer attribute names than DynamoDB allows. While this may sound great, it also has downsides: The oversized attribute names perform badly, and as they grow, Alternator's internal limits will be reached as well, and result in an unsightly "internal server error" being reported instead of the expected user-friendly error. Refs #9169. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210810081948.2012120-1-nyh@scylladb.com>	2021-08-10 12:06:13 +02:00
Nadav Har'El	dba184039a	test/alternator: another test for Query's ExclusiveStartKey We already have tests for Query's ExclusiveStartKey option, but we only exercised it as a way for paging linearly through all the results. Now we add a test that confirms that ExclusiveStartKey can be used not just for paging through all the results - but also for jumping directly to the middle of a partition after any clustering key (existing or non-existing clustering key). The new test also for the first time verifies that ExclusiveStartKey with a specific format works (previous tests just copied LastEvaluatedKey to ExclusiveStartKey, so any opaque cookie could have worked). The test passes on both DynamoDB and Alternator so it did not find a new bug. But it's useful to have as a regression test, in case in the future we want to improve paging performance (see #6278) - and need to keep in mind that ExclusiveStartKey is not just for paging. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210729114703.1609058-1-nyh@scylladb.com>	2021-08-04 15:24:47 +03:00
Nadav Har'El	2acfee8118	test/alternator: add tests for the Alternator TTL feature This patch adds a comprehensive test suite for the DynamoDB API's TTL (item expiration) feature. The tests check the two new API commands added by this feature (UpdateTimeToLive and DescribeTimeToLive), and also how items are expired in practice, and how item expiration interacts with other features such as GSI, LSI and DynamoDB Streams. Because DynamoDB has extremely long delays until items are expired, or until expiration configuration may be changed, several of these tests take up to 30 minutes to complete. We mark these tests with the "veryslow" marker, so they are skipped in ordinary test runs - use the "--runveryslow" option to run them. All these tests currently pass on DynamoDB, but xfail on Alternator because the two commands UpdateTimeToLive and DescribeTimeToLive are currently rejected by Alternator. Refs #5060 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-07-14 14:08:55 +03:00
Nadav Har'El	9d07ce3cb6	test/alternator: add marker for "veryslow" tests Until now, Alternator tests have all been very fast, taking milliseconds or at worst seconds each - or a bit longer on DynamoDB. However, sometimes we need to write tests which take a huge amount of time - for example, tests for the TTL feature may take 10 minutes because the item expiration is delayed by that much. Because a 10 minute test is ridiculous (all 500 Alternator tests together take just one minute today!), we would normally run such a test once, and then mark it "skip" so it will never run again. One annoying thing about skipped tests is that there is no way to temporarily "unskip" them when we want to run such a test anyway. So in this patch, we introduce a better option for these very slow tests instead of the simple "skip": The patch introduces a marker "@pytest.mark.veryslow". By default, a test with this marker is skipped. However, a command-line option "--runveryslow" is introduced which causes tests with the veryslow mark to be run anyway, and not skipped. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-07-14 00:26:21 +03:00
Nadav Har'El	2fb379bb94	test/alternator: add new_test_table() utility function This patch adds a convenient function new_test_table() that Alternator tests can use to safely create a temporary table, and be sure it is deleted in any case. This function is used in a "with", as follows: with new_test_table(dynamodb, ...) as table: do_something(table) # at this point table has already been deleted. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-07-14 00:26:21 +03:00
Nadav Har'El	c174eeae06	alternator: do not allow LSI on base table with no sort key The purpose of an LSI (local secondary index) in Alternator is to allow a different sort key for the existing partitions, keeping the same division into partitions. So it doesn't make sense to create an LSI on a table that did not originally have a sort key (i.e., single-item partitions). DynamoDB indeed doesn't allow this case, and Alternator forgot to forbid it - so this patch adds the missing check to the CreateTable operation. This patch also adds a test case for this, test_lsi_wrong_no_sort_key, which failed before the patch and passes after it (and also passes on DynamoDB). Also, the existing test_lsi_wrong tests for bad LSI creation attempts by mistake used a base table without a sort key - so while they encountered an error as expected, it was not the right error! So we fix that test (and split it into two tests), adding the missing sort key and exposing the actual errors that the tests were meant to expose. That test passed before this patch and also afterwards - but at least after the patch it is actually testing what it was meant to be testing. Fixes #9018. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210713123747.1012954-1-nyh@scylladb.com>	2021-07-13 15:12:01 +02:00
Nadav Har'El	775a64b003	test/alternator: test for change in CDC preimage In pull request #8568, the CDC API changed slightly, with preimage data gaining extra "delete$k" values for columns whose preimage was missing. In this new test, we verify that this change did not break Alternator. We didn't expect it to break Alternator, because it just outputs the known base-table columns and ignores the columns which weren't a real base-table column - like this "delete$k". In the test we set up a stream with preimages, ensure that a real column (note that an LSI key is a real column instead of a map element) has a null preimage - and see that the preimage is returned as expected, without fake columns like "delete$k". The test passes, showing that PR #8568 was ok. The test also passes, as expected, on DynamoDB. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210504120121.915829-1-nyh@scylladb.com>	2021-07-06 14:53:42 +02:00
Nadav Har'El	b26fcf5567	test/alternator: increase timeouts in test_tracing.py The query tracing tests in test/alternator's test_tracing.py had one timeout of 30 seconds to find the trace, and one unclearly-coded timeout for finding the right content for the trace. We recently saw both timeouts exceeded in tests, but only rarely and only in debug mode, in a run 100 times slower than normal. This patch increases both timeouts to 100 seconds. Whatever happens then, we win: If the test stops failing, we know the new timeout was enough. If the test continues to fail, we will be able to conclude that we have a real bug - e.g., perhaps one of the LWT operations has a bug causing it to hang indefinitely. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210608205026.1600037-1-nyh@scylladb.com>	2021-06-10 09:19:01 +03:00
Piotr Sarna	cb17aa1e53	Merge 'test/alternator: rewrite run script to share code with cql-pytest's run script' from Nadav Har'El In this small series, I rewrite test/alternator/run to Python using the utility functions developed for test/cql-pytest. In the future, we should do the same to test/redis/run and test/scylla-gdb/run. The benefit of this rewrite is less code duplication (all run scripts start with the same duplicate code to deal with temporary directories, to run Scylla IP addresses, etc.), but most importantly - in the future fixes we do to cql-pytest (e.g., parameters needed to start Scylla efficiently, how to shut down Scylla, etc.) will appear automatically in alternator test without needing to remember to change both. Another benefit is that test/alternator/run will now be Python, not a shell script. This should make it easier to integrate it into test.py (refs #6212) in the future - if we want to. Closes #8792 * github.com:scylladb/scylla: test/alternator: rewrite test/alternator/run script in Python test/cql-pytest: make test run code more general	2021-06-06 19:18:49 +03:00
Avi Kivity	a55b434a2b	treewide: extend copyright statements to present day	2021-06-06 19:18:49 +03:00
Nadav Har'El	f22ed3ff5c	test/alternator: reduce very high timeout in one tracing test In test_tracing.py::test_slow_query_log, there was what looked like an innocent 30-second timeout, but this was in fact an 8 minute timeout - because it started with sleeping 1 second, then 2 seconds, then 3, ... until 30 seconds. Such a high timeout is frustrating when trying to debug failures in the test - which is only expected to take 2 seconds (and all of it because of an artificial timeout). So fix the loop to stop iterating after 60 seconds (a compromise between 30 seconds and 8 minutes...), sleeping a constant amount between iterations. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210601150631.1037158-1-nyh@scylladb.com>	2021-06-06 09:21:23 +03:00
Nadav Har'El	0bb2e010f5	test/alternator: rewrite test/alternator/run script in Python We already wrote the test/cql-pytest/run script in Python in a way it can be reusable for the other test/*/run scripts. So this patch replaces the test/alternator/run shell script with Python code which does the same thing (safely runs Scylla with Alternator and pytest on it in a temporary directory and IP address), but sharing most of the code that cql-pytest uses. The benefit of reusing the test/cql-pytest/run.py library goes beyond shorter code - the main benefit will be that we can't forget to fix one of the test/*/run scripts (e.g., add more command line options or fix a bug) when fixing another one. To make the test/cql-pytest/run.py library reusable for running Alternator, I needed to generalize a few things in this patch (e.g., the way we check and wait for Scylla to boot with the different APIs we intend to check). There is also one bug-fix on how interrupts are handled (they are now better guaranteed to kill pytest) - and now fixing this bug benefits all runners using run.py (cql-pytest/run, cql-pytest/run-cassandra and alternator/run). In the future, we can port the runners which are still duplicate shell scripts - test/redis/run and test/scylla-gdb/run - to Python in a similar manner to what we did here for test/alternator/run. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-06-03 11:23:00 +03:00
Nadav Har'El	a2379b96b1	alternator test: test for large BatchGetItem This patch adds an Alternator test, test_batch_get_item_large, which checks a BatchGetItem with a moderately large (1.5 MB) response. The test passes - we do not have a bug in BatchGetItem - but it does reproduce issue #8522 - the long response is stored in memory as one long contiguous string and causes a warning about an over-sized allocation: WARN ... seastar_memory - oversized allocation: 2281472 bytes. Incidentally, this test also reproduces a second contiguous allocation problem - issue #8183 (in BatchWriteItem which we use in this test to set up the item to read). Refs #8522 Refs #8183 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210520161619.110941-1-nyh@scylladb.com>	2021-05-21 08:38:53 +02:00
Nadav Har'El	7d2df8a9bc	test/alternator,cql-pytest: fix resource leak on failure In the alternator and cql-pytest test frameworks, we have some convenient contextmanager-based functions that allow us to create a temporary resource (e.g., a table) that will be automatically deleted, for example: with create_stream_test_table(...) as table: test_something(table) However, our implementation of these functions wasn't safe. We had code looking like: table = ... yield table table.delete() The thinking was that the cleanup part (the table.delete()) will be called after the user's code. However, if the user's code threw (i.e., a failed assertion), the cleanup wasn't called... When the user's code throws, it looks as if the "yield" throws. So the correct code should look like: table = ... try: yield table finally: table.delete() Python's contextmanager documentation indeed gives this idiom in its example. This patch fixes all contextmanager implementations in our tests to do the cleanup even if the user's "with" block throws. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210428083748.552203-1-nyh@scylladb.com>	2021-04-28 10:51:02 +02:00
Piotr Sarna	0779fa8428	test: add username verification to alternator tracing tests The test case now additionally checks if the username entry from found tracing events matches the username used by the test suite.	2021-04-26 11:54:02 +02:00
Nadav Har'El	50f3201ee2	alternator: fix inequality check of two sets In issue #5021 we noted that Alternator's equality operator needs to be fixed for the case of comparing two sets, because the equality check needs to take into account the possibility of different element order. Unfortunately, we fixed only the equality check operator, but forgot there is also an inequality operator! So in this patch we fix the inequality operator, and also add a test for it that was previously missing. The implementation of the inequality operator is trivial - it's just the negation of the equality test. Our pre-existing tests verify that this is the correct implementation (e.g., if attribute x doesn't exist, then "x = 3" is false but "x <> 3" is true). Refs #5021 Fixes #8513 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210419141450.464968-1-nyh@scylladb.com>	2021-04-20 13:14:19 +02:00
Nadav Har'El	dae7528fe5	alternator: fix equality check of nested document containing a set In issue #5021 we noticed that the equality check in Alternator's condition expressions needs to handle sets differently - we need to compare the set's elements ignoring their order. But the implementation we added to fix that issue was only correct when the entire attribute was a set... In the general case, an attribute can be a nested document, with only some inner set. The equality-checking function needs to traverse this nested document, and compare the sets inside it as appropriate. This is what we do in this patch. This patch also adds a new test comparing equality of a nested document with some inner sets. This test passes on DynamoDB, failed on Alternator before this patch, and passes with this patch. Refs #5021 Fixes #8514 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210419184840.471858-1-nyh@scylladb.com>	2021-04-20 13:14:10 +02:00
Nadav Har'El	46448b0983	alternator: fix equality check of two unset attributes When a condition expression (ConditionExpression, FilterExpression, etc.) checks for equality of two item attributes, i.e., "x = y", and when one of these attributes was missing we correctly returned false. However, we also need to return false when both attributes are missing in the item, because this is what DynamoDB does in this case. In other words an unset attribute is never equal to anything - not even to another unset attribute. This was not happening before this patch: When x and y were both missing attributes, Alternator incorrectly returned true for "x = y", and this patch fixes this case. It also fixes "x <> y" which should be true when both x and y are unset (but was false before this patch). The other comparison operators - <, <=, >, >=, BETWEEN, were all implemented correctly even before this patch. This patch also includes tests for all the two-unset-attribute cases of all the operators listed above. As usual, we check that these tests pass on both DynamoDB and Alternator to confirm our new behavior is the correct one - before this patch, two of the new tests failed on Alternator and passed on DynamoDB. Fixes #8511 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210419123911.462579-1-nyh@scylladb.com>	2021-04-20 13:14:00 +02:00
Nadav Har'El	989589b570	test/cql-pytest,alternator,redis: avoid an annoying warning This patch avoids an annoying warning Warning: Unknown config ini key: flake8-ignore when running one of the pytest-based test projects (cql-pytest, alternator and redis) on recent versions of pytest. In commit `2022da2405`, we added to the toplevel Scylla directory a "tox.ini" file with some intention to configure Python syntax checking. One of the configurations in this tox.ini is: [pytest] flake8-ignore = E501 It turns out that pytest, if a certain test directory does not have its own pytest.ini file, looks in ancestor directories for various configuration files (the configuration file precedence is described in https://docs.pytest.org/en/stable/customize.html), and this includes this tox.ini configuration section. Recent versions of pytest complain about the "flake8-ignore" configuration parameter, which they don't recognize. This parameter may be ok (?) if you install a flake8 pytest plugin, but we do not require users to do this for running these tests. Moreover, whatever noble intentions this commit and its tox.ini had, nobody ever followed up on it. The three pytest-based test directories never adhered to flake8's recommended syntax, and never intended to do so. None of the developers of these tests use flake8, or seem to wish to do so. If this ever changes, we can change the pytest.ini or undo this commit and go back to a top-level tox.ini, but I don't see this happening anytime soon. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210411085708.300851-1-nyh@scylladb.com>	2021-04-12 08:04:06 +02:00
Nadav Har'El	0d0db05cf3	test/alternator: speed up two slow xfailing tests By far the two slowest Alternator tests when running a development build on my laptop are test_gsi.py::test_gsi_projection_include and test_gsi.py::test_gsi_projection_keys_only Each of those takes around 3.2 seconds, and the sum of just these two tests is as much as 10% (!) of all other 600 tests. The reason why these tests are slow is that they check scanning a GSI with projection. Scylla currently ignores the projection, so the scan returns the wrong value. Because this is a GSI, which supports only eventually-consistent reads, we need to retry the read - and did it for up to 3 seconds! But this retry only makes sense if the GSI read did not yet return the expected data. But in these xfailing tests, we read a wrong item (with too many attributes) almost immediately, and this should indicate an immediate failure - no amount of retry would help. So in this patch we detect this case and fail the test immediately instead of wasting 3 seconds in retries. On my laptop with dev build, this patch reduces the time to run the entire Alternator test suite from 70 seconds to 63 seconds. Also, now that we never just waste time until the timeout, we can increase it to any number, and in this patch we increase it from 3 seconds to 5. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210317183918.1775383-1-nyh@scylladb.com>	2021-04-06 14:49:15 +02:00
Nadav Har'El	15cab90f7b	test/alternator: switch some fixture scopes from "session" to "module" In conftest.py we have several fixtures creating shared tables which many test files can share, so they are marked with the "session" scope - all the tests in the testing session may share the same instance. This is fine. Some of the test files have additional fixtures for creating special tables needed only in those files. Those were, unnecessarily, marked "session" scope as well. This means that these temporary tables are only deleted at the very end of the test suite, even though they can be deleted at the end of the test file which needed them. This is exactly what the "module" fixture scope is, so this patch changes all the fixtures private to one test file to be "module". After this patch, the teardown of the last test in the suite goes down from 4 seconds to just 1.5 seconds (it's still long because there are still plenty of session-scoped fixtures in conftest.py). Another small benefit is that the peak disk usage of the test suite is lower, because some of the temporary tables are deleted sooner. This patch does not change any test functionality, and also does not make any test faster - it just changes the order of the fixture teardowns. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210317175036.1773774-1-nyh@scylladb.com>	2021-04-06 14:43:36 +02:00
Avi Kivity	8785dd62cb	tests: use kernel page cache Tests are short-lived and use a small amount of data. They are also often run repeatedly, and the data is deleted immediately after the test. This is a good scenario for using the kernel page cache, as it can cache read-only data from test to test, and avoid spilling write data to disk if it is deleted quickly. Acknowledge this by using the new --kernel-page-cache option for tests. This is expected to help on large machines, where the disk can be overloaded. Smaller machines with NVMe disks probably will not see a difference. Closes #8347	2021-03-30 12:04:55 +02:00
Nadav Har'El	0b2cf21932	alternator-test: increase read timeout and avoid retries By default the boto3 library waits up to 60 seconds for a response, and if it gets no response, it sends the same request again, multiple times. We already noticed in the past that it retries too many times thus slowing down failures, so in our test configuration we lowered the number of retries to 3, but the setting of 60-second-timeout plus 3 retries still causes two problems: 1. When the test machine and the build are extremely slow, and the operation is long (usually, CreateTable or DeleteTable involving multiple views), the 60 second timeout might not be enough. 2. If the timeout is reached, boto3 silently retries the same operation. This retry may fail because the previous one really succeeded at least partially! The symptom is tests which report an error when creating a table which already exists, or deleting a table which doesn't exist. The solution in this patch is first of all to never do retries - if a query fails on internal server error, or times out, just report this failure immediately. We don't expect to see transient errors during local tests, so this is exactly the right behavior. The second thing we do is to increase the default timeout. If 1 minute was not enough, let's raise it to 5 minutes. 5 minutes should be enough for every operation (famous last words...). Even if 5 minutes is not enough for something, at least we'll now see the timeout errors instead of some weird errors caused by retrying an operation which was already almost done. Fixes #8135 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210222125630.1325011-1-nyh@scylladb.com>	2021-03-18 18:58:08 +02:00
Nadav Har'El	4a7d3175e9	test/alternator: make another test faster The slowest test in test_streams.py is test_list_streams_paged. It is meant to test the ListStreams operation with paging. The existing test repeated its test four times, for four different stream types. However, there is no reason to suspect that the ListStreams operation might somehow be different for the four stream types... We already have other tests which create streams of the four types, and use these streams - we don't need the test for ListStreams to also test creating the four types. By doing this test just once, not four times, we can save around 1.5 seconds of test time. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210318073755.1784349-1-nyh@scylladb.com>	2021-03-18 11:24:18 +01:00
Nadav Har'El	79af728335	test/alternator: make tracing test a bit faster In the test test_tracing.py::test_tracing_all, we do some operations and then need to wait until they appear in the tracing table. The current code used an exponentially-increasing delay during this wait, starting with 0.1 seconds and then doubling the delay until we find what we're looking for. However, it turns out that the delay until the data appears in the table is deliberately chosen by Scylla - and is always around 2 seconds. In this case, an exponential delay is really bad - we will usually wait for around 1 second too long after the needed wait of 2 seconds. So in this patch we replace the exponential delay by a constant delay - we wait 0.3 seconds between each retry. This change makes the test test_tracing.py::test_tracing_all finish in a little over 2 seconds, instead of a little over 3 seconds before this patch. We cannot reduce this 2 second time any further unless we make the 2-second tracing delay configurable. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210318000040.1782933-1-nyh@scylladb.com>	2021-03-18 11:24:18 +01:00
Nadav Har'El	4e87f95b42	test/alternator: remove slow and unhelpful test The test test_table.py::test_table_streams_on creates tables with various stream types, and then immediately deletes them without testing anything. This is a slow test (taking almost a full second on my laptop), and is redundant because in test_streams.py we have tests which create tables with streams in the same way - but then actually test that things work with these streams. So this test might as well be removed, and this is what we do in this patch. Removing this test shaves another second from the Alternator test suite's run time. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210317230530.1780849-1-nyh@scylladb.com>	2021-03-18 11:24:18 +01:00
Nadav Har'El	879656e3e0	test/alternator: make a test faster, safer and more correct The test test_condition_expression.py::test_condition_expression_with_forbidden_rmw takes half a second to run (dev build, on my laptop), one of the slowest tests in Alternator's test suite. Part of the reason was that it needlessly set the same table to forbidden_rmw, multiple times. Instead of doing that, we switch to using the test_table_s_forbid_rmw fixture, which is a table like test_table_s but created just once in forbid_rmw mode. The result is a faster test (0.05 seconds instead of 0.5 seconds), but also safer if we ever want to run tests in parallel. It also fixes a bug in the test: At the end of the test, we intended to double-check that although the forbid_rmw table forbids read-modify-write operations, it does allow pure writes. Yet the test did this after clearing the forbid_rmw mode... So after this patch the test verifies this on the forbid_rmw table, as intended. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210317222703.1779992-1-nyh@scylladb.com>	2021-03-18 11:24:18 +01:00
Nadav Har'El	1c2e473e62	test/alternator: make a test faster The test test_condition_expression.py::test_condition_expression_with_permissive_write_isolation currently takes (on my laptop, dev build) a full two seconds, one of the slowest tests. It is not surprising it is slow - it runs five other tests three times each (for three different write isolation modes), but it doesn't have to be this slow. Before this patch, for each of the five tests we switch the write isolation mode three times, and these switches involve schema changes and are fairly slow. So in this patch we reverse the loop - and switch the write isolation mode to the outer loop. This patch halves the runtime of this test - from two seconds to one. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210317221045.1779329-1-nyh@scylladb.com>	2021-03-18 11:24:18 +01:00
Piotr Sarna	efe734c575	alternator: add test for slow query logging The test checks whether slow queries are properly logged in the system_traces.node_slow_log system table. The test is deterministic because it uses the threshold of 0ms to qualify a query as slow, which effectively makes all queries "slow enough".	2021-03-17 13:24:26 +01:00
Nadav Har'El	f41dac2a3a	alternator: avoid large contiguous allocation for request body Alternator request sizes can be up to 16 MB, but the current implementation had the Seastar HTTP server read the entire request as a contiguous string, and then processed it. We can't avoid reading the entire request up-front - we want to verify its integrity before doing any additional processing on it. But there is no reason why the entire request needs to be stored in one big contiguous allocation. This is always a bad idea. We should use a non-contiguous buffer, and that's the goal of this patch. We use a new Seastar HTTPD feature where we can ask for an input stream, instead of a string, for the request's body. We then begin the request handling by reading the content of this stream into a vector<temporary_buffer<char>> (which we alias "chunked_content"). We then use this non-contiguous buffer to verify the request's signature and if successful - parse the request JSON and finally execute it. Beyond avoiding contiguous allocations, another benefit of this patch is that while parsing a long request composed of chunks, we free each chunk as soon as its parsing completed. This reduces the peak amount of memory used by the query - we no longer need to store both unparsed and parsed versions of the request at the same time. Although we already had tests with requests of different lengths, most of them were short enough to only have one chunk, and only a few had 2 or 3 chunks. So we also add a test which makes a much longer request (a BatchWriteItem with large items), which in my experiment had 17 chunks. The goal of this test is to verify that the new signature and JSON parsing code which needs to cross chunk boundaries work as expected. Fixes #7213. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210309222525.1628234-1-nyh@scylladb.com>	2021-03-10 09:22:34 +01:00
Nadav Har'El	28804a50f7	alternator-test: test that index can't be a name reference (#xyz) We already have a test which verifies that DynamoDB and Alternator do not allow an index in an attribute path - like a[0].b - to be a value reference - a[:xyz].b. We forgot to verify that the index also can't be a name reference - a[#xyz].b is a syntax error. So here we add a test which confirms that this is indeed the case - DynamoDB doesn't allow it, and neither does Alternator. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210219123310.1240271-1-nyh@scylladb.com>	2021-03-08 10:17:19 +01:00
Nadav Har'El	d6335b7fda	test/alternator: better tests of oversized requests Like DynamoDB, Alternator rejects requests larger than some fixed maximum size (16MB). We had a test for this feature - test_too_large_request, but it was too blunt, and missed two issues: Refs #8195 Refs #8196 So this patch adds two better tests that reproduce these two issues: First, test_too_large_request_chunked verifies that an oversized request is detected even if the body is sent with chunked encoding. Second, both tests - test_too_large_request_chunked and test_too_large_request_content_length - verify that the rather limited (and arguably buggy) Python HTTP client is able to read the 413 status code - and doesn't report some generic I/O error. Both tests pass on DynamoDB, but fail on Alternator because of these two open issues. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210302154555.1488812-1-nyh@scylladb.com>	2021-03-03 07:06:45 +01:00
Nadav Har'El	d905e71a90	Alternator: add support for CORS protocol This patch adds to Alternator support for the CORS (Cross-Origin Resource Sharing) protocol - a simple extension over the HTTP protocol which browsers use when Javascript code contacts HTTP-based servers. Although we usually think of Alternator as being used in a three-tier application, in some setups there is no middle layer and the user's browser, running Javascript code, wants to communicate directly with the database. However, for security reasons, by default Javascript loaded from domain X is not allowed to communicate with different domains Y. The CORS protocol is meant to allow this, and Alternator needs to participate in this protocol if it is to be used directly from Javascript in browsers. To implement CORS, Alternator needs to respond to the OPTIONS method which it didn't allow before - with certain headers based on the input headers. It also needs to do some of these things for the regular methods (mostly, POST). The patch includes a comprehensive test that runs against both Alternator and DynamoDB and shows that Alternator handles these headers and methods the same as DynamoDB. Additionally, I tested manually a Javascript DynamoDB client - which didn't work prior to this patch (the browser reported CORS errors), and works after this patch. Fixes #8025. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210217222027.1219319-1-nyh@scylladb.com>	2021-02-23 13:15:03 +01:00
Nadav Har'El	49cd9b3fd5	alternator: correct implementation of UpdateItem with nested attributes and ReturnValues This patch fixes the last missing part of nested attribute support in UpdateItem - returning the correct attributes when ReturnValues is requested. When the expression says "a.b = :val" and ReturnValues is set to UPDATED_OLD or UPDATED_NEW, only the actual updated attribute a.b should be returned, not the entire top-level attribute a as we did before this patch. This patch was made very simple because our existing hierarchy_filter() function already does exactly the right thing, and can trivially be made to accept any attribute_path_map<T> (in our case attribute_path_map<action>), not just attrs_to_get as it did until now. This patch also adds several more checks to the test in test_returnvalues.py to improve the test's coverage even more. Interestingly, I discovered two esoteric cases where DynamoDB does something which makes little sense, but apparently simplified their implementation - but the beautiful thing is that it also simplifies our implementation! See long comments about these two cases in the test code. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	33685a683e	alternator: implemented nested attribute paths in UpdateExpression This patch adds full support for nested attribute paths (e.g., a.b[3].c) in UpdateExpression. Since previous patches already added such support for ProjectionExpression, ConditionExpression and FilterExpression, this means the nested attribute paths feature is now complete, so we remove the warning from the documents. However, there is one last loose end to tie and we will do it in the next patch: After this patch, the combination of UpdateExpression with nested attributes and ReturnValues is still wrong, and the test for it in test_returnvalues.py still xfails. Note that previous patches already implemented support for attribute paths in expression evaluations - i.e., the right-hand side of UpdateExpression actions, and in this patch we just needed to implement the left hand side: When an update action is on an attribute a.b we need to read the entire content of the top-level a (an RWM operation), modify just the b part of its json with the result of the action, and finally write back the entire content of a. Of course everything gets complicated by the fact that we can have multiple actions on multiple pieces of the same JSON, and we also need to detect overlapping and conflicting actions (we already have this detection in the attribute_path_map<> class we introduced in a previous patch). I decided to leave one small esoteric difference, reproduced by the xfailing test_update_expression.py::test_nested_attribute_remove_from_missing_item: As expected, "SET x.y = :val" fails for an item if its attribute x doesn't exist or the item itself does not exist. For the update expression "REMOVE x.y", DynamoDB fails if the attribute x doesn't exist, but oddly silently passes if the entire item doesn't exist. Alternator does not currently reproduce this oddity - it will fail this write as well. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	7789606545	alternator: limit the depth of nested paths DynamoDB limits the depth of a nested path in expressions (e.g. "a.b.c.d") to 32 levels. This patch adds the same limit also to Alternator. The exact value of this limit is less important (although it did make sense to choose the same limit as DynamoDB does), but it's important to have some limit: It's often convenient to handle paths with a recursive algorithm, and if we allow unlimited path depth, it can result in unlimited recursion depth, and a crash. Let's avoid this possibility. We detect the over-long path while building the parsed::path object in the parser, and generate a parse error. This patch also includes a test that verifies that both Alternator and DynamoDB have the same 32-level nesting limit on paths. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	4c7e27c688	alternator: prepare for UpdateItem nested attribute paths This patch prepares UpdateItem for updating of nested attribute paths (e.g., "SET a.b = :val"), but does not yet support them. Instead of _update_expression holding an unsorted list of "actions", we change it to hold an attribute_path_map of actions. This will allow us to process all the actions on a top-level attribute together, and moreover gets us "for free" the correct checking for overlapping and conflicting updates - exactly the same checking we already had in attribute_path_map for ProjectionExpression. Other than this change, most of this patch is just code movement, not functional changes. After this patch, the tests for update path overlap and conflict pass: test_update_expression_multi_overlap_nested and test_update_expression_multi_conflict_nested. We can also mark test_update_expression_nested_attribute_rhs as passing - this test involves an attribute path in the right-hand-side of an update, but the left-hand-side is still a top-level attribute, so it works (it actually worked before this patch - it started working when we implemented attribute paths in expressions, for ConditionExpression and FilterExpression). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	c2f18e56ea	alternator-test: a few more ProjectionExpression conflict test cases Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:34 +02:00
Nadav Har'El	de62a8c2d3	alternator-test: improve tests for nested attributes in UpdateExpression We already had many tests for nested attributes in UpdateExpression, but this patch adds even more: * Test nested attribute in right-hand-side in assignment: z = a.c.x. * Test for making multiple changes to the same and different top-level attributes in the same update. * Additional cases of overlap between multiple changes. * Tests for conflict between multiple changes. * Tests for writing to a nested path on a non-existent attribute or item. * A stronger test for array append sorts the added items. As this feature was not yet implemented, these tests fail on Alternator, and pass on DynamoDB. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-14 12:21:24 +02:00
Nadav Har'El	e52785be08	alternator: support attribute paths in ConditionExpression, FilterExpression This patch fully implements support for attribute paths (e.g. a.b.c, a.d[3]) for the ConditionExpression in conditional updates, and FilterExpression in queries and scans. After this patch, all previously-xfailing tests in test_condition_expression.py and test_filter_expression.py now pass. The fix is simple: Both ConditionExpression and FilterExpression use the function calculate_value() to calculate the value of the expression. When this function calculates the value of a path, it mustn't just take the top-level attribute - it needs to walk into the specific sub-object as specified by the attribute path. This is not the end of attribute path support, UpdateExpression and ReturnValues are not yet fully supported. This will come in following patches. Refs #5024 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 19:19:09 +02:00
Nadav Har'El	579c7b8dae	alternator-test: improve tests for nested attributes in ConditionExpression Strengthen the tests in test_condition_expression.py for nested attribute paths (e.g., b.y[1]): 1. The test test_update_condition_nested_attributes only tested successful conditions involving nested attributes. Let's also add an unsuccessful condition, to verify we don't accidentally pass every condition involving a nested attribute. 2. Test a case where a non-existent nested attribute is involved in the condition. 3. In the test for an attribute path with references - "#name1.#name2", make sure the test doesn't pass if #name2 is silently ignored. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 19:19:09 +02:00
Nadav Har'El	104ef5242b	alternator: support attribute paths in ProjectionExpression This patch fully implements support for attribute paths (e.g. a.b.c, a.d[3]) for the ProjectionExpression in the various operations where this parameter is supported - GetItem, BatchGetItem, Query and Scan. After this patch, all xfailing tests in test_projection_expression.py now pass. In the previous patch we remembered in the "attrs_to_get" object not only the top-level attributes to read from the table, but also how to filter from it only the desired pieces of the nested document. In this patch we add a filter() function to do this filtering, and call it in the right places to post-process the JSON objects we read from the table. We also had to fix reference resolution in paths to resolve all the components of the path (e.g., #name1.#name2) and not just the top-level attribute. This is not the end of attribute path support, there are still other expressions (ConditionExpression, UpdateExpression, FilterExpression, ReturnValues) where they are not yet supported. This will come in following patches. Refs #5024 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:40 +02:00
Nadav Har'El	6340619e69	alternator: overhaul attrs_to_get handling In the existing code, the variable "attrs_to_get" is a list of top-level attributes to fetch for an item. It is used to implement features like ProjectionExpression or AttributesToGet in GetItem and other places. However, to support attribute paths (e.g., a.b.c[2]) in ProjectionExpression, i.e., issue #5024, we need more than that. We still need to know the top-level attribute "a", because this is the granularity we have in the Scylla table (all the content inside "a" is serialized as a single JSON); But we also need to remember exactly which parts inside "a" we will need to extract and return. So in this patch we add a new type, "attrs_to_get", which is more than just a list of top-level attributes. Instead, it is a map, whose keys are the top-level attributes, and the value for each of them is a "hierarchy_filter", an object which describes which part of the attribute is needed. This patch includes the code which converts the AttributesToGet and ProjectionExpression into the new attrs_to_get structure. During this conversion, we recognize two kinds of errors which DynamoDB complains about: We recognize "overlapping" attributes (e.g., requesting both a.b and a.b.c) and "conflicting" attributes (e.g., requesting both a.b and a[1]). After this, two xfailing tests we had for detecting these overlap and conflicts finally pass and their "xfail" label is removed. After this patch, we have the attrs_to_get object which can allow us to filter only the requested pieces of the top-level attributes, but we don't use it yet - so this patch is not enough for complete support of attribute paths in ProjectionExpression. We will complete this support in the next patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:40 +02:00
Nadav Har'El	b2dbd56a3a	alternator-test: additional tests for attribute paths in ProjectionExpression This patch adds more tests for attribute paths in ProjectionExpression, that deal with document paths which do not fit the content of the item - e.g., trying to ask for "a.b[3]" when a.b is not a list but rather an integer or a dictionary. Moreover, we note that if you try to ask for "a.b, a[2]", DynamoDB fails this request as a "conflict". The reasoning is that no single item can ever have both a.b and a[2] (the first is only valid for dictionaries, the second for lists). It's not clear to me why we still can't return whichever of the two actually is relevant, but the fact is that DynamoDB does not allow it. The new tests fail on Alternator (marked xfailed) and pass on DynamoDB. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:40 +02:00
Nadav Har'El	2a2c5563ba	alternator-test: harden attribute-path tests for ProjectionExpression We have 7 xfailing tests for usage of nested attribute paths (e.g., "a.b.c[7]") in a ProjectionExpression. But some of these tests were too "easy" to pass - a trivial and wrong implementation that just ignores the path and uses the top level attribute (in the above example, "a"), would cause some of them to start passing. So this patch strengthens these tests. They still pass on AWS DynamoDB, and now continue to fail with the aforementioned broken implementation. Refs #5024. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:40 +02:00
Nadav Har'El	653610f4bc	alternator: fix ValidationException in FilterExpression - and more The first condition expressions we implemented in Alternator were the old "Expected" syntax of conditional updates. That implementation had some specific assumptions on how it handles errors: For example, in the "LT" operator in "Expected", the second operand is always part of the query, so an error in it (e.g., an unsupported type) resulted in a ValidationException error. When we implemented ConditionExpression and FilterExpression, we wrongly used the same functions check_compare(), check_BETWEEN(), etc., to implement them. This results in some inaccurate error handling. The worst example is what happens when you use a FilterExpression with an expression such as "x < y" - this filter is supposed to silently skip items whose "x" and "y" attributes have unsupported or different types, but in our implementation a bad type (e.g., a list) for y resulted in a ValidationException which aborted the entire scan! Interestingly, in one case (that of BEGINS_WITH) we actually noticed the slightly different behavior needed and implemented the same operator twice - with ugly code duplication. But in other operators we missed this problem completely. This patch first adds extensive tests of how the different expressions (Expected, QueryFilter, FilterExpression, ConditionExpression) and the different operators handle various input errors - unsupported types, missing items, incompatible types, etc. Importantly, the tests demonstrate that there is often different behavior depending on whether the bad input comes from the query, or from the item. Some of the new tests fail before this patch, but others pass and were useful to verify that the patch doesn't break anything that already worked correctly previously. As usual, all the tests pass on DynamoDB. Finally, this patch fixes all these problems.
The comparison functions like check_compare() and check_BETWEEN() now not only take the operands, they also take booleans saying if each of the operands came from the query or from an item. The old-syntax caller (Expected or QueryFilter) always says that the first operand is from the item and the second is from the query - but in the new-syntax caller (ConditionExpression or FilterExpression) any or all of the operands can come from the query and need verification. The old duplicated code for check_BEGINS_WITH() - which had a TODO to remove it - is finally removed. Instead we use the same idea of passing booleans saying if each of its operands came from an item or from the query. Fixes #8043 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2021-02-08 14:16:30 +02:00
Nadav Har'El	1953b1b006	alternator-test: increase timeout in tracing test Our test for tracing Alternator requests can't be sure when tracing a request finished, because tracing is asynchronous and has no official ending signal. So before we can conclude that tracing failed, we need to wait until a timeout, which in the current code was roughly 6.4 seconds (the timeout logic is unnecessarily convoluted, but to make a long story short it has exponential sleeps starting with 0.1 second and ending with 3.2 seconds, totaling 6.4 seconds). It turns out that sporadically, in test runs on overcommitted test machines with the very slow debug build, we fail this test with this timeout. So this patch increases the timeout to 51.2 seconds. It should be more than enough for everyone. Famous last words :-) Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20210204151554.582260-1-nyh@scylladb.com>	2021-02-04 17:17:07 +02:00
Nadav Har'El	0516cd1609	alternator test: de-duplicate some duplicate code In test_streams.py we had some code to get a list of shards and iterators duplicated three times. Put it in a function, shards_and_latest_iterators(), to reduce this duplication. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201006112421.426096-1-nyh@scylladb.com>	2021-01-11 08:47:25 +01:00
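
The "overlapping" and "conflicting" attribute-path checks mentioned in the attrs_to_get and attribute_path_map commits above (6340619e69, 4c7e27c688) can be illustrated with a small Python sketch. This is not Alternator's C++ implementation: the names parse_path and classify are hypothetical, the parser accepts only plain names and numeric indexes (no #name references), and the exact rules DynamoDB applies may differ in corner cases.

```python
import re

# Hypothetical helpers for illustration only; not Alternator's attribute_path_map.
_NAME = r"[A-Za-z_][A-Za-z0-9_]*"
_FIRST = re.compile(_NAME)
_NEXT = re.compile(rf"\.({_NAME})|\[(\d+)\]")


def parse_path(path):
    """Split 'a.b[3].c' into ['a', 'b', 3, 'c'] (str = member, int = list index)."""
    m = _FIRST.match(path)
    if m is None:
        raise ValueError(f"bad path: {path!r}")
    parts = [m.group(0)]
    pos = m.end()
    while pos < len(path):
        m = _NEXT.match(path, pos)
        if m is None:
            raise ValueError(f"bad path: {path!r}")
        parts.append(m.group(1) if m.group(1) is not None else int(m.group(2)))
        pos = m.end()
    return parts


def classify(p, q):
    """Return 'overlap', 'conflict' or None for two parsed paths."""
    for x, y in zip(p, q):
        if x != y:
            # Same parent, but one path treats it as a map and the other as a list.
            if isinstance(x, str) != isinstance(y, str):
                return "conflict"
            return None  # disjoint siblings, e.g. a.b and a.c
    # One path is a prefix of (or equal to) the other, e.g. a.b and a.b.c.
    return "overlap"


if __name__ == "__main__":
    for left, right in [("a.b", "a.b.c"), ("a.b", "a[1]"), ("a.b", "a.c")]:
        print(left, right, classify(parse_path(left), parse_path(right)))
```

In this sketch a pair such as a.b and a.b.c is reported as an overlap, and a.b with a[1] as a conflict, matching the two examples given in the commit message.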


152 Commits