scylladb

Author	SHA1	Message	Date
Gleb Natapov	9daa109d2c	test: get rid of consistent_cluster_management usage in test consistent_cluster_management is deprecated since scylla-5.2 and no longer used by Scylladb, so it should not be used by test either. Closes scylladb/scylladb#28340	2026-01-27 11:31:30 +01:00
Nikos Dragazis	1e37781d86	schema: Add initializer for compression defaults In PR `5b6570be52` we introduced the config option `sstable_compression_user_table_options` to allow adjusting the default compression settings for user tables. However, the new option was hooked into the CQL layer and applied only to CQL base tables, not to the whole spectrum of user tables: CQL auxiliary tables (materialized views, secondary indexes, CDC log tables), Alternator base tables, Alternator auxiliary tables (GSIs, LSIs, Streams). Fix this by moving the logic into the `schema_builder` via a schema initializer. This ensures that the default compression settings are applied uniformly regardless of how the table is created, while also keeping the logic in a central place. Register the initializer at startup in all executables where schemas are being used (`scylla_main()`, `scylla_sstable_main()`, `cql_test_env`). Finally, remove the ad-hoc logic from `create_table_statement` (redundant as of this patch), remove the xfail markers from the relevant tests and adjust `test_describe_cdc_log_table_create_statement` to expect LZ4WithDicts as the default compressor. Fixes #26914. Signed-off-by: Nikos Dragazis <nikolaos.dragazis@scylladb.com>	2026-01-13 20:45:59 +02:00
Nadav Har'El	34191d8fd4	alternator: fix signature checking of headers with multiple spaces We have a test in test_compressed_response.py that reproduces a bug where in Alternator's signature checking code, if a header had multiple consecutive spaces its signature isn't checked correctly. This patch fixes this and that xfailing test begins to pass. But it turns out that the handling of multiple consecutive spaces in headers when calculating the authentication signature is just one example of "header canonization" that the AWS Signature V4 specification requires us to do. There are additional types of header canonization that Alternator must do, and this patch also adds new tests in test_authorization.py for checking all the types of canonization. Fortunately, for all other types of canonizations, we already handled them correctly - Alternator already lowercases header names, sorts them alphabetically and removes leading and trailing spaces before calculating the signature. So most of the new tests added pass also without this patch, and only one of them, test_canonization_middle_whitespace, needs this patch to pass. As usual, all the new tests also pass on DynamoDB. Fixes #27775 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#28102	2026-01-13 10:29:13 +02:00
Nadav Har'El	e7df03127b	alternator: support "deflate" encoding in request compression Currently Alternator supports compressed requests in the gzip format with "Content-Encoding: gzip". We did not support any other compression formats. It turns out that DynamoDB also supports the "deflate" encoding. The "deflate" format is just a small variant of gzip and also supported by the same zlib library that we already use, so it is very easy to add support for it as well. So this patch adds it. Beyond compatibility with DynamoDB, another benefit of this patch is symmetry with our response compression support (PR #27454), where we supported both gzip and deflate compression of responses - so we should support the same for requests. This patch also adds tests for Content-Encoding: deflate, which pass on DynamoDB (proving that "deflate" is indeed supported there). On Alternator the new tests failed before this patch and pass with this patch. Refs #27243 (which asks to support more compression formats). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27917	2026-01-13 09:58:12 +02:00
Botond Dénes	6bcc18e5c6	erge 'test.py: integrate python tests to be executed with pytest runner' from Andrei Chekun This will move responsibility for running tests with pytest in the same manner as it was done with boost tests. From this commit, test.py is not responsible anymore for running python tests and relies completely on pytest. This is another step for unification of test execution. Convert skip_mode function to `pytest.mark` to be able to use to annotate the whole module instead of each test explicitly. NOTE: this is a breaking change. From this commit, several directories with tests will require a path to the file to launch the test. Affected directories test/alternator test/broadcast_tables test/cql test/cqlpy test/rest_api Changes only in framework, so no backport. This PR will increase the amount of the tests by 30 test, due to the fact that how test.py and pytest discover tests. test.py count a file as a test, and when skip used in suite.yaml it will exclude the tests from discovery completely. While the pytest count test funstion as a test and uses skip_mode mark and will discover the tests, but it will skip them during execution, hence the difference test.py output before PR: ```bash > ./test.py --mode=release rest_api/test_compaction_task rest_api/test_task_manager --list --no-gather-metrics ``` test.py output in this PR: ```bash > ./test.py --mode=release test/rest_api/test_compaction_task.py test/rest_api/test_task_manager.py --list rest_api/test_compaction_task.py::test_global_major_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_major_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_cleanup_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_offstrategy_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_rewrite_sstables_keyspace_compaction_task.release.1 rest_api/test_compaction_task.py::test_reshaping_compaction_task.release.1 rest_api/test_compaction_task.py::test_resharding_compaction_task.release.1 rest_api/test_compaction_task.py::test_regular_compaction_task.release.1 rest_api/test_compaction_task.py::test_compaction_task_abort.release.1 rest_api/test_compaction_task.py::test_major_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_cleanup_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_offstrategy_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_rewrite_sstables_keyspace_compaction_task_async.release.1 rest_api/test_compaction_task.py::test_compaction_progress[major_keyspace_compaction_task_impl_run_fail].release.1 rest_api/test_compaction_task.py::test_compaction_progress[shard_major_keyspace_compaction_task_impl_run_fail].release.1 rest_api/test_compaction_task.py::test_compaction_progress[table_major_keyspace_compaction_task_impl_run_fail].release.1 rest_api/test_task_manager.py::test_task_manager_modules.release.1 rest_api/test_task_manager.py::test_task_manager_tasks.release.1 rest_api/test_task_manager.py::test_task_manager_status_running.release.1 rest_api/test_task_manager.py::test_task_manager_status_done.release.1 rest_api/test_task_manager.py::test_task_manager_status_failed.release.1 rest_api/test_task_manager.py::test_task_manager_not_abortable.release.1 rest_api/test_task_manager.py::test_task_manager_wait.release.1 rest_api/test_task_manager.py::test_task_manager_ttl.release.1 rest_api/test_task_manager.py::test_task_manager_user_ttl.release.1 rest_api/test_task_manager.py::test_task_manager_sequence_number.release.1 rest_api/test_task_manager.py::test_task_manager_recursive_status.release.1 rest_api/test_task_manager.py::test_module_not_exists.release.1 rest_api/test_task_manager.py::test_task_folding.release.1 rest_api/test_task_manager.py::test_abort_on_unregistered_task.release.1 ``` Fixes: https://github.com/scylladb/scylladb/issues/27716 Closes scylladb/scylladb#26395 * github.com:scylladb/scylladb: test.py: fix test_vector_similarity.py docs: add directories excluded from test.py test.py: prevent file descriptors leaking test.py: capture print inside the test test.py: do not print header for collection with test.py test.py: remove not supported functionality test.py: switch of execution of several test directories by test.py runner test.py: integrate python tests to be executed with pytest runner test.py: fix test/vector_search_validator to be able to run with pytest test.py: prepare base class for migration test.py: move environment preparation to one method test.py: introduce new environment variable TESTPY_PREPARED_ENVIRONMENT	2026-01-12 14:17:19 +02:00
Marcin Maliszkiewicz	03e0dd0841	Merge 'test/alternator: fix most tests to run on DynamoDB' from Nadav Har'El We can run Alternator's tests against DynamoDB with `test/alternator/run --aws`, and our intention is that all except a few specially marked should pass on DynamoDB - indicating that the test itself is correct and checks compatibility with DynamoDB and not with some misunderstood spec. Before this patch series, almost two dozen Alternator's tests failed on DynamoDB. This series fixes most of them. Refs #26079 (it fixes almost all the problems but probably not all of them so let's keep the issue open for a while longer) Closes scylladb/scylladb#27995 * github.com:scylladb/scylladb: test/alternator: fix some expected error messages to fit DynamoDB test/alternator: fix compressed request test on non-us-east1 test/alternator: fix test's expected error message on DynamoDB test/alternator: mark Alternator-only test scylla_only test/alternator: fix test on DynamoDB test/alternator: increase wait_for_gsi() timeout test/alternator: fix test passing a spurious parameter	2026-01-09 18:05:20 +01:00
Andrei Chekun	21a1ff3d5c	test.py: remove not supported functionality In the current state pytest do not support the order of execution, so this parameter is removed. There is no big need in this due to the differences what pytest and test.py counted test. pytest run test functions in the threads, while test.py executed test files in the threads. That's why pytest's way is more granular and allows to fill threads better. Remove skip node, since it already added as a pytest mark for each test in the file. Remove pool_size, since this is not used by pytest at all. Pytest uses xdist to set the amount of threads instead of pool_size used by test.py	2026-01-09 11:59:25 +01:00
Andrei Chekun	e8c50a5ad4	test.py: switch of execution of several test directories by test.py runner With this commit test.py will lose ability to run tests by itself always bypassing execution to the pytest. NOTE: this is a breaking change. From this commit, several directories with tests will require a path to the file to launch the test. Affected directories test/alternator test/broadcast_tables test/cql test/cqlpy test/rest_api	2026-01-09 11:59:25 +01:00
Nadav Har'El	f7eae50d98	test/alternator: fix some expected error messages to fit DynamoDB All tests I am fixing in this patch do pass for me on DynamoDB, but other developers report that they fail because some DynamoDB servers apparently use slightly different error messages, with less detail about the cause of an error. For example, some of our tests currently expect an error message that looks like: An error occurred (ValidationException) when calling the Query operation: Invalid operator used in KeyConditionExpression: attribute_exists But some servers don't report the ": attribute_exists" at the end, so we can't use the word "attribute_exists" it in the test to recognize the correct error, and needs to use a different word (which both versions of DynamoDB and Alternator all print). As another example, the good old DynamoDB error: An error occurred (ValidationException) when calling the Query operation: 1 validation error detected: Value 'DOG' at 'conditionalOperator' failed to satisfy constraint: Member must satisfy enum value set: [OR, AND] Got replaced by the following less informative message: An error occurred (ValidationException) when calling the Query operation: Failed to satisfy constraint: Member must satisfy enum value set: [ALL, OR]' So we need to fix the test to allow it too. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-01-07 14:06:33 +02:00
Nadav Har'El	e97fbc2d65	test/alternator: fix compressed request test on non-us-east1 The test test_compressed_request.py::test_compressed_request coerces boto3 to send a compressed request, and wrongly used region_name=us-east-1 to set up the connection. Theoretically, this doesn't matter because we also set the correct URL (for either Alternator or the desired region in AWS). But in fact it does matter, because region name is part of the request's signature, and DynamoDB refuses the request if it comes to a different region than it is signed for. So this test fails when run on DynamoDB on any other region except us-east-1. The fix is simple - don't use the constant "us-east-1", but pick up the correct region name from the original connection. The functions new_dynamodb_session(), new_dynamodb() and new_dynamodb_stremas() had the same bug and we fix it too, but it didn't break any test because the only tests using these functions were Scylla-only so the AWS region problem didn't apply to them.	2026-01-07 13:33:46 +02:00
Nadav Har'El	2c02e463ff	test/alternator: fix test's expected error message on DynamoDB The Alternator test test_tag.py::test_tag_lsi_gsi expects to see an error - it's not allowed to set a tag on a GSI or LSI - but the error message that DynamoDB prints recently changed - instead of saying "ResourceArn" the new error message says "resource arn". Change the test to allow both forms, so it will pass on both Alternator (which still uses the word ResourceArn - which is the name of the parameter) and on DynamoDB (which uses "resource arn"). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-01-07 12:51:10 +02:00
Nadav Har'El	4f3150c282	test/alternator: mark Alternator-only test scylla_only The test test_batch.py::test_batch_write_item_large_broken_connection failed on DynamoDB (Refs #26079). It turns out this test has many problems: 1. This test wrongly assumes a batch write needs to complete in one attempt - and this fails on DynamoDB with low WCU capacity where the batch needs to be resumed in multiple requests. Using boto3's batch_writer() fixes this problem. 2. This test has NOTHING to do with batches - so is mis-named and mis-placed. The batch write is just a way to prepare some data in the table, and the real test is about Query'ing the data back and observing the long response and reproducing issue #14454. I did not rename or move the test, but left a comment explaining the situation. 3. This test is written to assume the Query's response uses HTTP chunked encoding. Which isn't actually true for DynamoDB, at least not at the time of this writing. So the test fails on DynamoDB. For the last reason, I made this test scylla_only. This test can't really be run on DynamoDB without rewriting it. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-01-07 12:51:10 +02:00
Nadav Har'El	df6b347911	test/alternator: fix test on DynamoDB The test test_batch.py::test_batch_write_item_large often fails when running on DynamoDB, and this patch fixes it. The test checks that a large but not over-the-limits large batch works. However, "works" only means that the batch is not an error - it doesn't guarantee that all the items in the batch are performed. If the WCU limits of the table are exceeded DynamoDB may perform only part of the the batch and return the remaining items as UnprocessedItems. This not only can happen, it usually does happen on DynamoDB - because a new on-demand-billing table always start with a very low WCU capacity. So in this patch we update the test to recognize and perform the UnprocessedItems, instead of assuming it needs to be empty. The test continues to pass on Alternator, and finally passes on DynamoDB. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-01-07 12:51:10 +02:00
Nadav Har'El	9d6a463324	test/alternator: increase wait_for_gsi() timeout In Alternator tests, the wait_for_gsi() utility function is used in tests that add a GSI to an existing table, to wait for this new GSI to become ready. Although this takes a fraction of a second on Alternator, we noticed that this takes many minutes (!) on DynamoDB so we used an absurdly high 10 minute timeout to allow tests to also pass on DynamoDB. But it turns out that 10 minutes wasn't absurdly high enough, and tests using it in test_gsi_updatetable.py started to fail on DynamoDB. Empirically, 10 minutes was enough in the past but it seems that today adding a GSI to an empty table routinely takes as much as 20 minutes. So this patch increases the wait_for_gsi() timeout to a whopping 30 minutes. After this patch, the tests in test_gsi_updatetable.py which used to fail - test_gsi_backfill_with_lsi, test_gsi_backfill_with_real_column, test_gsi_creates_and_deletes and test_gsi_backfill_oversized_key now all pass on DynamoDB - but each takes more than 20 minutes to pass. To allow the test to fail much more quickly on Alternator (where creating a GSI takes a fraction of a second), we set a much lower but still very high timeout when running on Alternator - 60 seconds. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-01-07 12:50:54 +02:00
Nadav Har'El	5c2ca56adf	test/alternator: fix test passing a spurious parameter The test test_streams.py::test_streams_putitem_new_item_overrides_old_lsi failed on DynamoDB (Refs #26079) because we passed an unused parameter NonKeyAttributes to the Projection setting an LSI. NonKeyAttributes is only allowed when ProjectionType=INCLUDE, but we used ProjectionType=ALL. DynamoDB refuses to create an LSI with such inconsistent parameters, and we just need to remove this unnecessary parameter from this test. The reason why this test didn't fail on Alternator is that Alternator doesn't yet support or even parse the Projection parameter (Refs #5036). We also add an xfailing test (passes on DynamoDB, fails on Alternator) checking that a spurious NonKeyAttributes parameter is rejected. When we get around to implement the projection feature (#5036), this will be yet another acceptance test for this feature. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2026-01-05 13:51:01 +02:00
Szymon Malewski	1f658bb2e2	alternator/http_compression: Add compression of streamed response This patch adds compression of chunked responses. It adds intermediate stream to compress chunks of data that are provided to http sink. Fixes #27246	2026-01-05 10:14:42 +01:00
Szymon Malewski	b8afb173a6	alternator/http_compression: Add implementation od gzip/deflate of string response Previous commit added means to decide whether client asks for compression and with which algorithm. This patch adds actual compression of responses based on zlib library. For now only string (not chunked) responses are compressed. Several previously defined tests start to pass.	2026-01-05 10:14:42 +01:00
Szymon Malewski	08386ea959	test/alternator: add tests for compressed responses Adds set of tests that: 1. Show how DynamoDB handles response compression. It supports 'gzip' and 'deflate' compression, which can be selected by providing 'Accept-Encoding` header. It only encodes response above 4096B. - `test_compressed_response`, `test_compressed_response_large` show compression for various response sizes. - `test_accept_encoding_header` focuses on testing various values of Accept-Encoding header. - `test_multiple_accept_encoding_headers` verifies behaviour with repeted Accept-Encoding headers. 2. Will confirm implementation of response compression in Alternator (#27246) Additonally to above test, we check Altenator specific expectations: - `test_chunked_response_compression` makes sure that compression will work also for chunked responses. - `test_set_compression_options` checks config options to set response size threshold for compression and compression level 3. `test_signature_trims_accept_encoding_spaces` reveals Alternator's bug in signature verification (#27775)	2026-01-05 10:13:40 +01:00
Nadav Har'El	6c8ddfc018	test/alternator: fix typo in test_returnvalues.py Different DynamoDB operations have different settings allowed for their "ReturnValues" argument. In particular, some operations allow ReturnValues=UPDATED_OLD but the DeleteItem operation does not. We have a test, test_delete_item_returnvalues, aimed to verify this but it had a typo and didn't actually check "UPDATED_OLD". This patch fixes this typo. The test still passes because the code itself (executor.cc, delete_item_operation's constructor) has the correct check - it was just the test that was wrong. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27918	2026-01-01 19:33:23 +02:00
Radosław Cybulski	a31c8762ca	Update tests	2025-12-29 08:33:09 +01:00
Nadav Har'El	4ae45eb367	test/alternator: remove unused imports Remove many unused "import" statements or parts of import statement. All of them were detected by Copilot, but I verified each one manually and prepared this patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27676	2025-12-24 13:44:28 +02:00
Nadav Har'El	da00401b7d	test/alternator: rename test with duplicate name The file test/alternator/test_transact.py accidentally had two tests with the same name, test_transact_get_items_projection_expression. This means the first of the two tests was ignored and never run. This patch renames the second of the two to a more appropriate (and unique...) name. I verified that after this change the number of tests in this file grows by one, and that still all tests pass on DynamoDB and fail (as expected by xfail) on Alternator. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27702	2025-12-24 13:43:43 +02:00
Nadav Har'El	84df5cfaf8	test/alternator: delete unnecessary "pass" Fixing something that never bothered anyone but our automated "code quality" tool: there's an unnecessary call to "pass" in one of our tests. Just remove it. Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Closes scylladb/scylladb#27645	2025-12-16 19:29:23 +03:00
Pavel Emelyanov	31f90c089c	Merge 'test/alternator: remove unused variable assignments and statements' from Nadav Har'El Copilot found in test/alternator a bunch of places where we unnecessarily assign a variable that we don't use, or had a duplicated statement which doesn't do anything. This patch fixes all of them. AI still doesn't know how to prepare a patch that looks anything close to reasonable, so I did this part manually, and also carefully investigated each and every change (this took a lot of human time). These patches don't change anything in the functionality of any of the tests. It's all cosmetic. Closes scylladb/scylladb#27655 * github.com:scylladb/scylladb: test/alternator: remove unnecessary duplicate statement test/alternator: remove unused variable assignments	2025-12-16 19:23:34 +03:00
Nadav Har'El	64d9c370ee	test/alternator: remove unnecessary duplicate statement copilot noticed that test/alternator/test_scan.py had a duplicate statement (call to full_scan()). It doesn't break the test, but also adds nothing but confusion - so let's just remove it. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-12-15 18:07:45 +02:00
Nadav Har'El	a3959fe3db	test/alternator: remove unused variable assignments copilot noticed in that in in many of Alternator tests, we have some unnecessary assignments. For example, in a few places, we use the idiom: with pytest.raises(...): ret = ... The "ret=" part is unnecessary, as this test expects the statement to fail (hence the raises()), and ret is never assigned. The assignment was only there because we copied this statement from another place in the test, which does expect the statement to pass and wants to validate the returned value. So we should just drop the "ret=" from these tests. Another common occurance is that we used the idiom response = table.do_something() Without checking the response and no intention to check it (either we know it will work, or we just want to check it doesn't throw). So we can drop the "response=" here too. All of the unused variables in this patch were discovered by Copilot, but I reviewed each of them carefully myself and prepared this patch. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-12-15 18:07:05 +02:00
Nadav Har'El	b3b0860e7c	test/alternator: add reproducer for bug with storing invalid values This patch adds a reproducer for a long-known bug, #8070, where Alternator can store invalid values which are just blindly stored as JSON, and we will only see the failure when reading the item back - and either the client will fail to parse it, or sometimes even Alternator's own code (e.g., FilterExpression) will fail to parse it. The right behavior is to fail the write - not the read. The included test checks writing different kinds of invalid values using PutItem, UpdateItem, and BatchWriteItem. The new tests pass on DynamoDB, but fail on Alternator so marked as "xfail". Refs #8070. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-12-11 11:58:22 +02:00
Nadav Har'El	db15c212a6	test/alternator: reproducer for issue 27375 This patch adds a reproducer for issue #27375, where even with alternator_streams_increased_compatibility set to true, if an attribute is set to the same value it had but using a different JSON representation - a Alternator Streams event is unduly produced. For example, if a map {'dog': 1, 'cat': 2} is changed to {'cat': 2, 'dog': 1}, this non-change should not be reported. The new test added in this patch passes on DynamoDB (an event is not generated) but fails on Alternator (an event is generated), so the new test is marked with xfail. Refs #27375. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-12-11 11:34:19 +02:00
Nadav Har'El	3595941020	utils/rjson: fix error messages from rjson::parse() rjson::parse() when parsing JSON stored in a chunked_content (a vector of temporary buffers) failed to initialize its byte counter to 0, resulting in garbage positions in error messages like: Parsing JSON failed: Missing a name for object member. at 1452254 These error messages were most noticable in Alternator, which parses JSON requests using a chunked_content, and reports these errors back to the user. The fix is trivial: add the missing initialization of the counter. The patch also adds a regression test for this bug - it sends a JSON corrupt at position 1, and expect to see "at 1" and not some large random number. Fixes #27372 Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-12-11 11:17:01 +02:00
Nadav Har'El	102516a787	test/alternator: extract get_signed_request() to util.py get_signed_request() started in test_manual_requests.py as a way to sign a manually-created DynamoDB-API request - for sending requests that boto3 can't. Over time, we started to use this function in additional test files, and it's about time to move it to util.py - which is more natural to import from multiple files. This patch also adds a new function, manual_request(), which combines get_signed_request() and actually sending the request via requests.post(). New tests should prefer it, because it's easier to use. We'll use the new function in tests that we add in the next patches. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-12-11 11:16:42 +02:00
Nadav Har'El	350cbd1d66	alternator: fix typo of BatchWriteItem in comments The DynamoDB API's "BatchWriteItem" operation is spelled like this, in singular. Some comments incorrectly referred to as BatchWriteItems - in plural. This patch fixes those mistakes. There are no functional changes here or changes to user-facing documents - these mistakes were only in code comments. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27446	2025-12-05 15:08:58 +02:00
Avi Kivity	ce2a403f18	Merge 'alternator: implement gzip-compressed requests' from Nadav Har'El In this series we implement Alternator's support for gzip-compressed requests, i.e., requests with the "Content-Encoding: gzip" header, other uncompressed header, and a gzip-compressed body. The server needs to verify the signature of the compressed content, and then uncompress the body before running the request. We only support gzip compression because this is what DynamoDB supports. But in the future we can easily add support for other compression algorithms like lz4 or zstd. This series Refs #5041 but doesn't "Fixes" it because it only implements compressed requests (Content-Encoding), not compressed responses (Accept-Encoding). In addition to the code changes, the series also contains tests for this feature that make sure it behaves like DynamoDB. Note that while we will have now support in our server for compressed requests, just like DynamoDB does, the clients (AWS SDKs) will probably NOT make use of it because they do not enable request compression by default. For example, see the tests for some hoops one needs to jump through in boto3 (the Python SDK) to send compressed requests. However, we are hoping that in the future Alternator's modified clients will use compressed requests and enjoy this feature. Closes scylladb/scylladb#27080 * github.com:scylladb/scylladb: test/alternator: enable, and add, tests for gzip'ed requests alternator: implement gzip-compressed requests	2025-11-30 13:27:46 +02:00
Piotr Dulikowski	44c605e59c	Merge 'Fix the types of change events in Alternator Streams' from Piotr Wieczorek This patch increases the compatibility with DynamoDB Streams by integrating the DynamoDB's event type rules (described in https://github.com/scylladb/scylladb/issues/6918) into Alternator. The main changes are: - introduce a new flag `alternator_streams_strict_compatibility`, meant as a guard of performance-intensive operations that increase the compatibility with DynamoDB Streams. If enabled, Alternator always performs a RBW before a data-modifying operation, and propagates its result to CDC. Then, the old item is compared to the new one, to determine the mutation type (INSERT vs MODIFY). This option is a no-op for tables with disabled Alternator Streams, - reduce splitting of simple Alternator mutations, - correctly distinguish event types described in #6918, except for item deletes. Deleting a missing item with DeleteItem, BatchWriteItem, or a missing field with UpdateItem still emit REMOVEs. To summarize, the emitted events of the data manipulation operations should be as follows: - DeleteItem/BatchWriteItem.DeleteItem of existing item: REMOVE (OK) - DeleteItem of nonexistent item: nothing (OK) - BatchWriteItem.DeleteItem of nonexistent item: nothing (OK) - PutItem/UpdateItem/BatchWriteItem.PutItem of existing and not equal item: MODIFY (OK) - PutItem/UpdateItem/BatchWriteItem.PutItem of existing and equal item: nothing (OK) - PutItem/UpdateItem/BatchWriteItem.PutItem of nonexistent item: INSERT (OK) No backport is necessary. Refs https://github.com/scylladb/scylladb/pull/26149 Refs https://github.com/scylladb/scylladb/pull/26396 Refs https://github.com/scylladb/scylladb/issues/26382 Fixes https://github.com/scylladb/scylladb/issues/6918 Closes scylladb/scylladb#26121 * github.com:scylladb/scylladb: test/alternator: Enable the tests failing because of #6918 alternator, cdc: Don't emit events for no-op removes alternator, cdc: Don't emit an event for equal items alternator/streams, cdc: Differentiate item replace and item update in CDC alternator: Change the return type of rmw_operation_return config: Add alternator_streams_strict_compatibility flag cdc: Don't split a row marker away from row cells	2025-11-30 07:20:22 +01:00
Radosław Cybulski	b54a9f4613	Fix use-after-free in encode_paging_state in Alternator Fix unlikely use-after-free in `encode_paging_state`. The function incorrectly assumes that current position to encode will always have data for all clustering columns the schema defines. It's possible to encounter current position having less than all columns specified, for eample in case of range tombstone. Those don't happen in Alternator tables as DynamoDB doesn't allow range deletions and clustering key might be of size at most 1. Alternator api can be used to read scylla system tables and those do have range tombstones with more than single clustering column. The fix is to stop trying to encode columns, that don't have the value - they are not needed anyway, as there's no possible position with those values (range tombstone made sure of that). Fixes #27001 Fixes #27125 Closes scylladb/scylladb#26960	2025-11-28 16:51:15 +03:00
Nadav Har'El	32afcdbaf0	test/alternator: enable, and add, tests for gzip'ed requests After in the previous patch we implemented support in Alternator for gzip-compressed requests ("Content-Encoding: gzip"), here we enable an existing xfail-ing test for this feature, and also add more tests for more cases: * A test for longer compressed requests, or a short compressed request which expands to a longer request. Since the decompression uses small buffers, this test reaches additional code paths. * Check for various cases of a malformed gzip'ed request, and also an attempt to use an unsupported Content-Encoding. DynamoDB returns error 500 for both cases, so we want to test that we do to - and not silently ignore such errors. * Check that two concatenated gzip'ed streams is a valid request, and check that garbage at the end of the gzip - or a missing character at the end of the gzip - is recognized as an error. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-27 09:42:47 +02:00
Nadav Har'El	66bd3dc22c	test/alternator: tests for request compression DynamoDB's documentation https://docs.aws.amazon.com/sdkref/latest/guide/feature-compression.html suggests that DynamoDB allows request bodies to be compressed (currently only by gzip). The purpose of patch is to have a test reproducing this feature. The test shows us that indeed DynamoDB understands compressed requests using the "gzip" encoding, but Alternator does not, so the new test is xfail. As you can see in the test code, although the low-level SDK (botocore) can send compress requests, this is not actually enabled for DynamoDB and we need to resort to some trickery to send compressed requests. But the point is that once we do manage to send compressed requests, the test shows us that they work properly on AWS, but fail on Alternator. The failure of the compressed requests on Alternator is reported like: An error occurred (ValidationException) when calling the PutItem operation: Parsing JSON failed: Invalid value. at 70459088 This error message should probably be improved (what is that high number?!) but of course even better would be to make it really work. By enabling tracing on alternator-server (e.g., edit test/cqlpy/run.py and add `'--logger-log-level', 'alternator-server=trace',`) we can see exactly what request the SDK sends Alternator. What we can see in the request is: 1. The request headers are uncompressed (this is expected in HTTP) 2. There is a header "Content-Encoding: gzip" 3. The request's body is binary, a full-fleged gzip output complete with a gzip magic in the beginning. Refs #5041 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes scylladb/scylladb#27049	2025-11-21 10:48:33 +02:00
Radosław Cybulski	ce8db6e19e	Add table name to tracing in alternator Add a table name to Alternator's tracing output, as some clients would like to consistently receive this information. - add missing `tracing::add_table_name` in `executor::scan` - add emiting tables' names in `trace_state::build_parameters_map` - update tests, so when tracing is looked for it is filtered by table's name, which confirms table is being outputed. - change `struct one_session_records` declaration to `class one_session_records`, as `one_session_records` is later defined as class. Refs #26618 Fixes #24031 Closes scylladb/scylladb#26634	2025-11-21 09:33:40 +02:00
Botond Dénes	a084094c18	Merge 'alternator and cql: tests for default sstable compression' from Nadav Har'El The purpose of this two-patch series is to reproduce a previously unknown bug, Refs #26914. Recently we saw a lot of patches that change how we create new schemas (keyspaces and tables), sometimes changing various long-time defaults. We started to worry that perhaps some of these defaults were applied only to CQL base tables and perhaps not to Alternator or to CQL's auxiliary tables (materialized views, secondary indexes, or CDC logs). For example, in Refs #26307 we wondered if perhaps the default "speculative_retry" option is different in Alternator than in CQL. The first patch includes Alternator tests, and the second CQL tests. In both tests we discover that although recently (commit `adf9c42`, Refs #26610) we changed the default sstable compressor from LZ4Compressor to LZ4WithDictsCompressor, actually this change was only applied to CQL base tables. All Alternator tables and all CQL auxiliary tables (views, indexes, CDC) still use the old LZ4Compressor. This is issue #26914. Closes scylladb/scylladb#26915 * github.com:scylladb/scylladb: test/cqlpy: test compression setting for auxiliary table test/alternator: tests for schema of Alternator table	2025-11-20 10:24:31 +02:00
Nadav Har'El	11f6a25d44	test/alternator: tests for schema of Alternator table This patch introduces a new test that exposed a previously unknown bug, Refs #26914: Recently we saw a lot of patches that change how we create new schemas (keyspaces and tables), sometimes changing various long-time defaults. We started to worry that perhaps some of these defaults were applied only to CQL and not to Alternator. For example, in Refs #26307 we wondered if perhaps the default "speculative_retry" option is different in Alternator than in CQL. This patch includes a new test file test/alternator/test_cql_schema.py, with tests for verifying how Alternator configures the underlying tables it creates. This test shows that the "speculative_retry" doesn't have this suspected bug - it defaults to "99.0PERCENTILE" in both CQL and Alternator. But unfortunately, we do have this bug with the "compression" option: It turns out that recently (commit `adf9c42`, Refs #26610) we changed the default sstable compressor from LZ4Compressor to LZ4WithDictsCompressor, but the change was only applied to CQL, not Alternator. So the test that "compression" is the same in both fails - and marked "xfails" and I created a new issue to track it - #26914. Another test verifies that Alternators "auxiliary" tables - holding GSIs, LSIs and Streams - have the same default properties as the base table. This currently seems to hold (there is no bug). Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-19 09:18:37 +02:00
Nadav Har'El	5b78e1cebe	test/alternator: tests for ExclusiveStartKey in GSI After in the previous patches we added more exhaustive testing for the ExclusiveStartKey feature of Query and Scan, in this patch we add tests for this feature in the context of GSIs. Most interestingly, the ExclusiveStartKey when querying a GSI isn't just the key of the GSI, but also includes the key columns of the base - in other words, it is the key that Scylla uses for its materialized view. The tests here confirm that paging on GSI works - this paging uses ExclusiveStartKey of course - but also what is the specific structure and meaning of the content of ExclusiveStartKey. We also include two xfailing tests which again, like in the previous patches, show we don't do enough validation (issue #26988) and don't recognize wrong values or spurious columns in ExclusiveStartKey. As usual, all new tests pass on DynamoDB, and all except the xfailing ones pass on Alternator. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-17 22:07:28 +02:00
Nadav Har'El	65b364d94a	test/alternator: more tests for ExclusiveStartKey in Scan In the previous patch we added more tests for ExclusiveStartKey in the context of the "Query" request. Here we do a similar thing for "Scan". There are fewer error cases for Scan. In particular, while it didn't make sense to use ExclusiveStartKey on a Query on a table without a sort key (since a Query there always returns a single item), for Scan it's needed - for paging. So we add in this patch a test (that we didn't have before!) that Scan paging works correctly also in the case of a table without a sort key. This patch has one xfailing test reproducing #26988, that we don't recognize and refuse spurious columns (columns not in the key) in ExclusiveStartKey. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-17 22:07:28 +02:00
Nadav Har'El	c049992a93	test/alternator: more tests for ExclusiveStartKey in Query We already have in test/alternator/test_query.py a test - test_query_exclusivestartkey - for one successful uses of ExclusiveStartKey. But we missed testing quite a few edge cases of this parameter, so this patch adds more tests for it - see the comments on each individual test explaining its purpose. With the new tests, we actually identified three cases where we got the error handling wrong - cases of ExclusiveStartKey which DynamoDB refuses, but Alternator allows. So three of the tests included here pass on DynamoDB but fail on Alternator, so are marked with "xfail". Refs #26988 - which is a new issue about these three cases of missing validation. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-17 22:07:27 +02:00
Nadav Har'El	c03081eb12	alternator: improve error in tablets_mode_for_new_keyspaces=enforced When in tablets_mode_for_new_keyspaces=enforced mode, Alternator is supposed to fail when CreateTable asks explicitly for vnodes. Before this patch, this error was an ugly "Internal Server Error" (an exception thrown from deep inside the implementation), this patch checks for this case in the right place, to generate a proper ValidationException with a proper error message. We also enable the test test_tablets_tag_vs_config which should have caught this error, but didn't because it was marked xfail because tablets_mode_for_new_keyspaces had not been live-updatable. Now that it is, we can enable the test. I also improved the test to be slightly faster (no need to change the configuration so many times) and also check the ordinary case - where the schema doesn't choose neither vnodes nor tablets explicitly and we should just use the default. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2025-11-09 12:52:29 +02:00
Piotr Szymaniak	eeb3a40afb	alternator: Fix test_ttl_expiration_streams() The test is now aware of the new name of the `system:initial_tablets` tag.	2025-11-09 12:52:29 +02:00
Piotr Szymaniak	a659698c6d	alternator: Fix test_scan_paging_missing_limit() With tablets, the test begun failing. The failure was correlated with the number of initial tablets, which when kept at default, equals 4 tablets per shard in release build and 2 tablets per shard in dev build. In this patch we split the test into two - one with a more data in the table to check the original purpose of this test - that Scan doesn't return the entire table in one page if "Limit" is missing. The other test reproduces issue #10327 - that when the table is small, Scan's page size isn't strictly limited to 1MB as it is in DynamoDB. Experimentally, 8000 KB of data (compared to 6000 KB before this patch) is enough when we have up to 4 initial tablets per shard (so 8 initial tablets on a two-shard node as we typically run in tests). Original patch by Piotr Szymaniak <piotr.szymaniak@scylladb.com> modified by Nadav Har'El <nyh@scylladb.com>	2025-11-09 12:52:29 +02:00
Piotr Szymaniak	345747775b	alternator: Don't require vnodes for TTL tests Since #23662 Alternator supports TTL with tablets too. Let's clear some leftovers causing Alternator to test TTL with vnodes instead of with what is default for Alternator (tablets or vnodes).	2025-11-09 12:52:29 +02:00
Piotr Szymaniak	274d0b6d62	alternator: Remove obsolete test from test_table.py Since Alternator is capable of runnng with tablets according to the flag in config, remove the obsolete test that is making sure that Alternator runs with vnodes.	2025-11-09 12:52:29 +02:00
Piotr Szymaniak	63897370cb	alternator: Fix tag name to request vnodes The tag was lately renamed from `experimental:initial_tablets` to `system::initial_tablets`. This commit fixes both the tests as well as the exceptions sent to the user instructing how to create table with vnodes.	2025-11-09 12:52:29 +02:00
Piotr Szymaniak	c7de7e76f4	alternator: Fix test name clash in test_tablets.py	2025-11-09 12:52:28 +02:00
Piotr Szymaniak	7466325028	alternator: test_tablets.py handles new policy reg. tablets Adjust the tests so they are in-line with the config flag 'tablets_mode_for_new_keyspaces` that the Alternator learned to honour.	2025-11-09 12:52:28 +02:00

1 2 3 4 5 ...

553 Commits