scylladb

Files

Nadav Har'El 099145fe9a materialized view: fix bug in some large modifications to base partitions

Sometimes a single modification to a base partition requires updates to
a large number of view rows. A common example is deletion of a base
partition containing many rows. A large BATCH is also possible.

To avoid large allocations, we split the large amount of work into
batch of 100 (max_rows_for_view_updates) rows each. The existing code
assumed an empty result from one of these batches meant that we are
done. But this assumption was incorrect: There are several cases when
a base-table update may not need a view update to be generated (see
can_skip_view_updates()) so if all 100 rows in a batch were skipped,
the view update stopped prematurely. This patch includes two tests
showing when this bug can happen - one test using a partition deletion
with a USING TIMESTAMP causing the deletion to not affect the first
100 rows, and a second test using a specially-crafed large BATCH.
These use cases are fairly esoteric, but in fact hit a user in the
wild, which led to the discovery of this bug.

The fix is fairly simple: To detect when build_some() is done it is no
longer enough to check if it returned zero view-update rows; Rather,
it explicitly returns whether or not it is done as an std::optional.

The patch includes several tests for this bug, which pass on Cassandra,
failed on Scylla before this patch, and pass with this patch.

Fixes #12297.

Signed-off-by: Nadav Har'El <nyh@scylladb.com>

Closes #12305

(cherry picked from commit 92d03be37b)

2023-01-04 10:05:18 +02:00

cassandra_tests

cql: fix column-name aliases in SELECT JSON

2022-12-05 20:12:44 +02:00

conftest.py

cql-pytest: add new_user and new_session utils

2022-07-11 10:49:15 +02:00

nodetool.py

test/cql-pytest: nicer error message if a test can't find nodetool

2022-04-05 20:29:02 +03:00

pytest.ini

…

README.md

cql-pytest: add to README an example of repeating a test

2021-10-07 15:30:41 +03:00

rest_api.py

CQL3: Bloom filter efficacy test

2022-03-23 16:51:50 +02:00

run

test/cql-pytest: implement test_tools.py without run-script cooperation

2022-03-14 20:25:22 +02:00

run-cassandra

cql-pytest: speed up permissions refresh period for tests

2022-07-11 10:30:01 +02:00

run.py

cql-pytest: speed up permissions refresh period for tests

2022-07-11 10:30:01 +02:00

suite.yaml

test.py: switch cql-pytest and rest_api suites to PythonTestSuite

2022-05-25 20:26:42 +03:00

test_allow_filtering.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_batch.py

Add big batch logs tests

2022-04-04 17:25:13 +03:00

test_bloom_filter.py

cql: validate bloom_filter_fp_chance up-front

2022-10-04 16:21:48 +03:00

test_cdc.py

cdc: Ensure columns removed from log table are registered as dropped

2022-05-04 14:19:39 +02:00

test_clustering_order.py

test/cql-pytest: add test for default clustering order of SELECT

2022-05-16 11:52:02 +02:00

test_empty.py

test/cql-pytest: tests for assigning an empty string to non-string

2022-05-27 16:37:01 +02:00

test_filtering.py

test/cql-pytest: de-duplicate code checking for an old buggy driver

2022-06-09 14:23:40 +03:00

test_frozen_collection.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_json.py

cql: fix column-name aliases in SELECT JSON

2022-12-05 20:12:44 +02:00

test_keyspace.py

test: add test cases for keyspace storage options

2022-04-08 09:17:01 +02:00

test_large_cells_rows.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_logs.py

Add big batch logs tests

2022-04-04 17:25:13 +03:00

test_lwt.py

cql-pytest: remove "xfail" mark from two passing tests

2022-07-11 08:34:19 +03:00

test_materialized_view.py

materialized view: fix bug in some large modifications to base partitions

2023-01-04 10:05:18 +02:00

test_minmax.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_non_deterministic_functions.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_null.py

cql: batch statement, inserting a row with a null key column should be forbidden

2022-12-28 18:15:40 +02:00

test_paging.py

mutation_compactor: reset stop flag on page start

2022-12-25 09:45:30 +02:00

test_permissions.py

cql-pytest: add a case for granting/revoking data permissions

2022-07-12 13:44:21 +02:00

test_range_and_slice.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_scan.py

Merge 'cql3: don't ignore other restrictions when a multi column restriction is present during filtering' from Jan Ciołek

2022-11-21 14:02:33 +02:00

test_secondary_index.py

test/cql-pytest: avoid deprecation message

2022-07-11 08:01:23 +03:00

test_service_levels.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_shedding.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_ssl.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_sstable.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_static.py

test/cql-pytest: add a couple of tests for static columns

2022-02-21 16:04:57 +02:00

test_system_tables.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test_tools.py

test/cql-pytest: test_tools.py: add test for sstable write

2022-08-03 14:00:50 +03:00

test_ttl.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_type_duration.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_type_string.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_type_time.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_uda.py

test: cql3: Add UDA REDUCEFUNC test

2022-07-18 15:25:41 +02:00

test_using_timeout.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

test_utf8.py

test/cql-pytest: confirm that table names cannot include non-Latin letters

2022-02-22 20:58:25 +03:00

test_validation.py

test/cql-pytest: add test for blobAsInt() et al for various blob lengths

2022-04-11 12:44:22 +03:00

test_virtual_tables.py

config: fix printing of experimental feature list

2022-07-11 09:17:30 +02:00

test_wasm.py

wasm: test instances reuse

2022-07-20 18:19:25 +02:00

test-timestamp.py

test/cql-pytest: use unique keys instead of random keys

2022-01-31 09:01:23 +02:00

util.py

test: move scylla_inject_error from alternator/ to cql-pytest/

2022-07-29 09:35:20 +02:00

README.md

Single-node functional tests for Scylla's CQL features.

These tests use the Python CQL library and the pytest frameworks. By using an actual CQL library for the tests, they can be run against any implementation of CQL - both Scylla and Cassandra. Most tests - except in rare cases - should pass on both, to ensure that Scylla is compatible with Cassandra in most features.

To run all tests against an already-running local installation of Scylla or Cassandra on localhost, just run pytest. The "--host" and "--port" can be used to give a different location for the running Scylla or Cassanra. The "--ssl" option can be used to use an encrypted (TLSv1.2) connection.

More conveniently, we have two scripts - "run" and "run-cassandra" - which do all the work necessary to start Scylla or Cassandra (respectively), and run the tests on them. The Scylla or Cassandra process is run in a temporary directory which is automatically deleted when the test ends.

"run" automatically picks the most recently compiled version of Scylla in build/*/scylla - but this choice of Scylla executable can be overridden with the SCYLLA environment variable. "run-cassandra" defaults to running the command cassandra from the user's path, but this can be overriden by setting the CASSANDRA environment variable to the path of the cassandra script, e.g., export CASSANDRA=$HOME/apache-cassandra-3.11.10/bin/cassandra. A few of the tests also require the nodetool when running on Cassandra - this tool is again expected to be in the user's path, or be overridden with the NODETOOL environment variable. Nodetool is not needed to test Scylla.

Additional options can be passed to "pytest" or to "run" / "run-cassandra" to control which tests to run:

To run all tests in a single file, do pytest test_table.py.
To run a single specific test, do pytest test_table.py::test_create_table_unsupported_names.
To run the same test or tests 100 times, add the --count=100 option. This is faster than running run 100 times, because Scylla is only run once, and also counts for you how many of the runs failed. For pytest to support the --count option, you need to install a pytest extension: pip install pytest-repeat

Additional useful pytest options, especially useful for debugging tests:

-v: show the names of each individual test running instead of just dots.
-s: show the full output of running tests (by default, pytest captures the test's output and only displays it if a test fails)