scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 15:33:15 +00:00

Files

Nadav Har'El f41dac2a3a alternator: avoid large contiguous allocation for request body

Alternator request sizes can be up to 16 MB, but the current implementation
had the Seastar HTTP server read the entire request as a contiguous string,
and then processed it. We can't avoid reading the entire request up-front -
we want to verify its integrity before doing any additional processing on it.
But there is no reason why the entire request needs to be stored in one big
*contiguous* allocation. This always a bad idea. We should use a non-
contiguous buffer, and that's the goal of this patch.

We use a new Seastar HTTPD feature where we can ask for an input stream,
instead of a string, for the request's body. We then begin the request
handling by reading lthe content of this stream into a
vector<temporary_buffer<char>> (which we alias "chunked_content"). We then
use this non-contiguous buffer to verify the request's signature and
if successful - parse the request JSON and finally execute it.

Beyond avoiding contiguous allocations, another benefit of this patch is
that while parsing a long request composed of chunks, we free each chunk
as soon as its parsing completed. This reduces the peak amount of memory
used by the query - we no longer need to store both unparsed and parsed
versions of the request at the same time.

Although we already had tests with requests of different lengths, most
of them were short enough to only have one chunk, and only a few had
2 or 3 chunks. So we also add a test which makes a much longer request
(a BatchWriteItem with large items), which in my experiment had 17 chunks.
The goal of this test is to verify that the new signature and JSON parsing
code which needs to cross chunk boundaries work as expected.

Fixes #7213.

Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210309222525.1628234-1-nyh@scylladb.com>

2021-03-10 09:22:34 +01:00

conftest.py

alternator: implemented nested attribute paths in UpdateExpression

2021-02-14 12:21:34 +02:00

README.md

…

run

alternator: make default timeout configurable

2020-12-09 14:30:43 +01:00

suite.yaml

…

test_authorization.py

alternator: use a more specific error when Authorization header is missing

2020-12-14 09:18:24 +01:00

test_batch.py

alternator: avoid large contiguous allocation for request body

2021-03-10 09:22:34 +01:00

test_condition_expression.py

alternator: support attribute paths in ConditionExpression, FilterExpression

2021-02-08 19:19:09 +02:00

test_cors.py

Alternator: add support for CORS protocol

2021-02-23 13:15:03 +01:00

test_describe_endpoints.py

…

test_describe_table.py

alternator: add missing TableId field to DescribeTable response

2020-11-09 20:21:47 +01:00

test_expected.py

alternator: fix ValidationException in FilterExpression - and more

2021-02-08 14:16:30 +02:00

test_filter_expression.py

alternator: support attribute paths in ConditionExpression, FilterExpression

2021-02-08 19:19:09 +02:00

test_gsi.py

alternator: implemented nested attribute paths in UpdateExpression

2021-02-14 12:21:34 +02:00

test_health.py

…

test_item.py

alternator: overhaul attrs_to_get handling

2021-02-08 14:16:40 +02:00

test_key_condition_expression.py

alternator test: small fixes for test_key_condition_expression_multi

2020-06-10 07:34:20 +02:00

test_key_conditions.py

alternator: fix order conditions on binary attributes

2020-06-03 10:55:50 +02:00

test_lsi.py

alternator test: fix comment

2020-10-05 02:19:22 +03:00

test_manual_requests.py

test/alternator: better tests of oversized requests

2021-03-03 07:06:45 +01:00

test_nested.py

…

test_number.py

test: add alternator test for incorrect numeric values

2020-07-09 13:58:33 +03:00

test_projection_expression.py

alternator: implemented nested attribute paths in UpdateExpression

2021-02-14 12:21:34 +02:00

test_query_filter.py

alternator test: fix test wrongly failing on AWS

2020-12-14 09:18:31 +01:00

test_query.py

alternator: fix broken Scan/Query paging with bytes keys

2020-12-08 09:38:23 +01:00

test_returnvalues.py

alternator: correct implemention of UpdateItem with nested attributes and ReturnValues

2021-02-14 12:21:34 +02:00

test_scan.py

alternator: fix broken Scan/Query paging with bytes keys

2020-12-08 09:38:23 +01:00

test_scylla.py

…

test_streams.py

alternator test: de-duplicate some duplicate code

2021-01-11 08:47:25 +01:00

test_system_tables.py

alternator, test: make test_fetch_from_system_tables faster

2020-12-07 08:52:31 +01:00

test_table.py

Alternator: allow CreateTable with SSESpecification explicitly disabled

2020-08-17 13:48:52 +02:00

test_tag.py

alternator: CreateTable with bad Tags shouldn't create a table

2020-07-13 17:14:44 +03:00

test_tracing.py

alternator-test: increase timeout in tracing test

2021-02-04 17:17:07 +02:00

test_update_expression.py

alternator-test: test that index can't be a name reference (#xyz)

2021-03-08 10:17:19 +01:00

util.py

alternator test: use ConsistentRead=True for full_query/scan

2020-06-17 14:57:45 +02:00

README.md

Tests for Alternator that should also pass, identically, against DynamoDB.

Tests use the boto3 library for AWS API, and the pytest frameworks (both are available from Linux distributions, or with "pip install").

To run all tests against the local installation of Alternator on http://localhost:8000, just run pytest.

Some additional pytest options:

To run all tests in a single file, do pytest test_table.py.
To run a single specific test, do pytest test_table.py::test_create_table_unsupported_names.
Additional useful pytest options, especially useful for debugging tests:
- -v: show the names of each individual test running instead of just dots.
- -s: show the full output of running tests (by default, pytest captures the test's output and only displays it if a test fails)

Add the --aws option to test against AWS instead of the local installation. For example - pytest --aws test_item.py or pytest --aws.

If you plan to run tests against AWS and not just a local Scylla installation, the files ~/.aws/credentials should be configured with your AWS key:

[default]
aws_access_key_id = XXXXXXXXXXXXXXXXXXXX
aws_secret_access_key = xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

and ~/.aws/config with the default region to use in the test:

[default]
region = us-east-1

HTTPS support

In order to run tests with HTTPS, run pytest with --https parameter. Note that the Scylla cluster needs to be provided with alternator_https_port configuration option in order to initialize a HTTPS server. Moreover, running an instance of a HTTPS server requires a certificate. Here's how to easily generate a key and a self-signed certificate, which is sufficient to run --https tests:

openssl genrsa 2048 > scylla.key
openssl req -new -x509 -nodes -sha256 -days 365 -key scylla.key -out scylla.crt

If this pair is put into conf/ directory, it will be enough to allow the alternator HTTPS server to think it's been authorized and properly certified. Still, boto3 library issues warnings that the certificate used for communication is self-signed, and thus should not be trusted. For the sake of running local tests this warning is explicitly ignored.

Authorization

By default, boto3 prepares a properly signed Authorization header with every request. In order to confirm the authorization, the server recomputes the signature by using user credentials (user-provided username + a secret key known by the server), and then checks if it matches the signature from the header. Early alternator code did not verify signatures at all, which is also allowed by the protocol. A partial implementation of the authorization verification can be allowed by providing a Scylla configuration parameter:

  alternator_enforce_authorization: true

The implementation is currently coupled with Scylla's system_auth.roles table, which means that an additional step needs to be performed when setting up Scylla as the test environment. Tests will use the following credentials: Username: alternator Secret key: secret_pass

With CQLSH, it can be achieved by executing this snipped:

cqlsh -x "INSERT INTO system_auth.roles (role, salted_hash) VALUES ('alternator', 'secret_pass')"

Most tests expect the authorization to succeed, so they will pass even with alternator_enforce_authorization turned off. However, test cases from test_authorization.py may require this option to be turned on, so it's advised.