Operations and generators can be composed to create more complex
operations and generators. There are certain composition patterns useful
for many different test scenarios.
We implement a couple of such patterns. For example:
- Given multiple different operation types, we can create a new
operation type - `either_of` - which is a "union" of the original
operation types. Executing an `either_of` operation means executing an
operation of one of the original types; the specific type
can be chosen at runtime.
- Given a generator `g`, `op_limit(n, g)` is a new generator which
limits the number of operations produced by `g`.
- Given a generator `g` and a time duration of `d` ticks, `stagger(g, d)` is a
new generator which spreads the operations from `g` roughly every `d`
ticks. (The actual definition in code is more general and complex but
the idea is similar.)
Some of these patterns have corresponding notions in Jepsen, e.g. our
`stagger` has a corresponding `stagger` in Jepsen (although our
`stagger` is more general).
Finally, we implement a test that uses this new infrastructure.
Two `Executable` operations are implemented:
- `raft_call` is for calling a Raft cluster with a given state
machine command,
- `network_majority_grudge` partitions the network in half,
putting the leader in the minority.
We run a workload of these operations against a cluster of 5 nodes with
6 threads for executing the operations: one "nemesis thread" for
`network_majority_grudge` and 5 "client threads" for `raft_call`.
Each client thread randomly chooses a contact point which it tries first
when executing a `raft_call`, but it can also "bounce" - call a
different server when the previous one returned "not_a_leader" (we use the
generic "bouncing" wrapper to do this).
For now we only print the resulting history. In a follow-up patchset
we will analyze it for consistency anomalies.
* kbr/raft-test-generator-v4:
test: raft: randomized_nemesis_test: a basic generator test
test: raft: generator: a library of basic generators
test: raft: introduce generators
test: raft: introduce `future_set`
test: raft: randomized_nemesis_test: handle `raft::stopped_error` in timeout futures
The previous commits introduced the basic generator concept and a
library of the most common composition patterns.
In this commit we implement a test that uses this new infrastructure.
Two `Executable` operations are implemented:
- `raft_call` is for calling a Raft cluster with a given state
machine command,
- `network_majority_grudge` partitions the network in half,
putting the leader in the minority.
We run a workload of these operations against a cluster of 5 nodes with
6 threads for executing the operations: one "nemesis thread" for
`network_majority_grudge` and 5 "client threads" for `raft_call`.
Each client thread randomly chooses a contact point which it tries first
when executing a `raft_call`, but it can also "bounce" - call a
different server when the previous one returned "not_a_leader" (we use the
generic "bouncing" wrapper to do this).
For now we only print the resulting history. In a follow-up patchset
we will analyze it for consistency anomalies.
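To illustrate the "bouncing" idea in isolation, here is a rough sketch;
all names here (`server_id`, `not_a_leader::leader_hint`, `call_ok`) are
hypothetical, the real wrapper presumably bounds the number of retries
and applies timeouts:

    #include <cstdint>
    #include <variant>

    using server_id = uint64_t;
    struct not_a_leader { server_id leader_hint; };  // hypothetical reply
    struct call_ok {};                               // hypothetical reply
    using call_result = std::variant<call_ok, not_a_leader>;

    // Try the thread's preferred contact point first; when the reply is
    // "not_a_leader", retry against a different server (here: the one
    // hinted at in the reply).
    template <typename Call>
    call_result bounce(server_id preferred, Call do_call) {
        server_id target = preferred;
        for (;;) {
            call_result r = do_call(target);
            if (auto* nal = std::get_if<not_a_leader>(&r)) {
                target = nal->leader_hint;  // "bounce" and try again
                continue;
            }
            return r;
        }
    }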
Operations and generators can be composed to create more complex
operations and generators. There are certain composition patterns useful
for many different test scenarios.
This commit introduces a couple of such patterns. For example:
- Given multiple different operation types, we can create a new
operation type - `either_of` - which is a "union" of the original
operation types. Executing an `either_of` operation means executing an
operation of one of the original types; the specific type
can be chosen at runtime.
- Given a generator `g`, `op_limit(n, g)` is a new generator which
limits the number of operations produced by `g`.
- Given a generator `g` and a time duration of `d` ticks, `stagger(g, d)` is a
new generator which spreads the operations from `g` roughly every `d`
ticks. (The actual definition in code is more general and complex but
the idea is similar.)
And so on.
Some of these patterns have corresponding notions in Jepsen, e.g. our
`stagger` has a corresponding `stagger` in Jepsen (although our
`stagger` is more general).
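As a flavor of what such a combinator can look like, here is a minimal
sketch of an `op_limit`-style generator wrapper, assuming the purely
functional `op()` protocol described in the "introduce generators"
commit below; this is illustrative, not the actual implementation:

    #include <cstddef>
    #include <optional>
    #include <type_traits>
    #include <utility>

    template <typename Gen>
    struct op_limit_gen {
        size_t remaining;
        Gen inner;

        auto op() const {
            using op_t = std::remove_reference_t<decltype(inner.op()->first)>;
            std::optional<std::pair<op_t, op_limit_gen>> r;
            if (remaining > 0) {
                if (auto next = inner.op()) {
                    // Pass the operation through, decrementing the budget.
                    r.emplace(std::move(next->first),
                              op_limit_gen{remaining - 1, std::move(next->second)});
                }
            }
            return r;  // nullopt: limit reached or inner generator exhausted
        }
    };

    template <typename Gen>
    op_limit_gen<Gen> op_limit(size_t n, Gen g) {
        return {n, std::move(g)};
    }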
We introduce the concepts of "operations" and "generators", basic
building blocks that will allow us to declaratively write randomized
tests for torturing simulated Raft clusters.
An "operation" is a data structure representing a computation which
may cause side effects such as calling a Raft cluster or partitioning
the network, represented in the code with the `Executable` concept.
It has an `execute` function which performs the computation and returns
a result of type `result_type`. Different computations of the same type
share state of type `state_type`. The state can, for example, contain
database handles.
Each execution is performed on an abstract `thread` (represented by a `thread_id`)
and has a logical starting time point. The thread and start point together form
the execution's `context`, which is passed by reference to `execute`.
Two operations may be called in parallel only if they are on different threads.
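A rough sketch of the shape of this concept (names and the exact
signature are illustrative; in particular, the real `execute` is
presumably asynchronous and returns a future):

    #include <concepts>
    #include <cstdint>

    using thread_id = uint64_t;    // identifies an abstract thread
    using logical_time = int64_t;  // hypothetical logical clock type

    struct execution_context {
        thread_id thread;          // the thread this execution runs on
        logical_time start;        // the execution's logical start point
    };

    template <typename Op>
    concept Executable = requires (Op op, typename Op::state_type& s,
                                   const execution_context& ctx) {
        typename Op::state_type;   // shared by computations of one type
        typename Op::result_type;
        { op.execute(s, ctx) } -> std::same_as<typename Op::result_type>;
    };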
A generator, represented through the `Generator` concept, produces a
sequence of operations. An operation can be fetched from a generator
using the `op` function, which also returns the next state of the
generator (generators are purely functional data structures).
The generator concept is inspired by the generators in the Jepsen
testing library for distributed systems.
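For illustration, a toy generator in this purely functional style might
look like this (not actual code from the patch):

    #include <optional>
    #include <utility>

    // Produces the integers [next, limit) as "operations". Fetching an
    // operation does not mutate the generator; `op` returns the
    // operation together with the generator's next state.
    struct int_gen {
        int next;
        int limit;

        std::optional<std::pair<int, int_gen>> op() const {
            if (next == limit) {
                return std::nullopt;  // exhausted
            }
            return std::pair{next, int_gen{next + 1, limit}};
        }
    };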
We also implement `interpreter` which "interprets", or "runs", a given
generator, by fetching operations from the generator and executing them
with concurrency controlled by the abstract threads.
The algorithm used in the interpreter is also similar to the interpreter
algorithm in Jepsen, although there are differences. Most notably we don't
have a "worker" concept - everything runs on a single shard; but we use
"abstract threads" combined with futures for concurrency.
There is also no notion of "process". Finally, the interpreter doesn't
keep an explicit history, but instead uses a callback `Recorder` to notify
the user about operation invocations and completions. The user can
decide to save these events in a history, or perhaps they can analyze
them on the fly using constant memory.
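In very rough, synchronous form (the real interpreter drives multiple
abstract threads concurrently with futures), the core loop looks
something like this sketch, reusing the illustrative types from above:

    // Single-threaded sketch only: fetch, record invocation, execute,
    // record completion, advance to the generator's next state.
    template <typename Gen, typename State, typename Recorder>
    void interpret(Gen g, State& state, Recorder& rec) {
        execution_context ctx{0 /* thread */, 0 /* start time */};
        while (auto next = g.op()) {
            auto [op, g2] = std::move(*next);
            rec.invoke(op, ctx);          // user callback: invocation
            auto res = op.execute(state, ctx);
            rec.complete(op, res, ctx);   // user callback: completion
            g = std::move(g2);
            ++ctx.start;                  // advance logical time (sketch)
        }
    }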
A set of futures that can be polled.
Polling the set (`poll` function) returns the value of one of
the futures that became available, or `std::nullopt` if the given
logical duration passes (according to the given timer), whichever
event happens first. The current implementation assumes sequential
polling.
New futures can be added to the set with `add`.
All futures can be removed from the set with `release`.
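The interface, roughly (a declaration-only sketch; the real class is
templated over the timer/clock and duration types, and the exact shape
of `release` may differ):

    #include <optional>
    #include <vector>
    #include <seastar/core/future.hh>

    template <typename T>
    class future_set {
    public:
        // Add a future to the set.
        void add(seastar::future<T> f);

        // Resolve with the value of one of the futures that became
        // available, or with std::nullopt once `d` logical ticks pass,
        // whichever happens first. Assumes sequential polling.
        seastar::future<std::optional<T>> poll(int64_t d);

        // Remove all futures from the set, handing them back.
        std::vector<seastar::future<T>> release();
    };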
The timeout futures in `call` and `reconfigure` may be discarded after
Raft servers were `abort()`ed, which would result in
`raft::stopped_error`, and the test complained about discarded
exceptional futures. Discard these errors explicitly.
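The idiom amounts to something like this sketch (the exact shape of the
change in the test may differ; the include path is a guess):

    #include <seastar/core/future.hh>
    #include "raft/raft.hh"  // for raft::stopped_error (path is a guess)

    // Explicitly consume the expected exception from a timeout future
    // that may outlive an abort()ed server, instead of leaving a
    // discarded exceptional future behind.
    void discard_stopped_error(seastar::future<> f) {
        (void)std::move(f).handle_exception_type([] (const raft::stopped_error&) {
            // expected when the server was stopped; swallow it
        });
    }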
The test suite now consists of a single user aggregate -
a custom implementation of the existing avg() built-in function -
as well as a couple of cases for catching incorrect operations,
like using wrong function signatures or dropping used functions.
What should the following pair of statements do?
CREATE INDEX xyz ON tbl(a)
CREATE INDEX IF NOT EXISTS xyz ON tbl(b)
There are two reasonable choices:
1. An index with the name xyz already exists, so the second command should
do nothing, because of the "IF NOT EXISTS".
2. The index on tbl(b) does *not* yet exist, so the command should try to
create it. And when it can't (because the name xyz is already taken),
it should produce an error message.
Currently, Cassandra went with choice 1, and Scylla went with choice 2.
After some discussions on the mailing list, we agreed that Scylla's
choice is the better one and Cassandra's choice could be considered a
bug: The "IF NOT EXIST" feature is meant to allow idempotent creation of
an index - and not to make it easy to make mistakes without not noticing.
The second command listed above is most likely a mistake by the user,
not anything intentional: The command intended to ensure that an index
on column b exists, but after the silent success of the command, no such
index exists.
So this patch doesn't change any Scylla code (it just adds a comment),
and rather it adds a test which "enshrines" the current behavior.
The test passes on Scylla and fails on Cassandra so we tag it
"cassandra_bug", meaning that we consider this difference to be
intentional and we consider Cassandra's behavior in this case to be wrong.
Fixes #9182.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210811113906.2105644-1-nyh@scylladb.com>
Since key_compare does not conform to SimpleLessCompare, the benchmark
tests the non-optimized version of bptree (without SIMD key search).
We want to test the optimized version.
Closes #9180
The DynamoDB documentation
https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Limits.html
describes several hard limits on the size of expressions
(ProjectionExpression, ConditionExpression, UpdateExpression,
FilterExpression) and various elements they contain.
In this patch we begin testing those limits with a comprehensive test for
the *length* of each of these four expressions: we test that lengths up to
(and including) 4096 bytes are allowed but longer expressions are rejected.
We also add TODOs for additional documented limits that should be tested
in the future.
Currently, this test passes on DynamoDB but xfails on Alternator because
Alternator does *not* enforce any limits on the expression length. I don't
think this is a real problem, and we may consider keeping it this way,
but we should at least be aware that this difference exists and an
xfailing test will remind us.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210810081948.2012120-2-nyh@scylladb.com>
DynamoDB limits attribute names in items to lengths of up to 65535 bytes,
but in some cases (such as key attributes) the limit is lower - 255.
This patch adds tests for many of these cases.
All the new tests pass on DynamoDB, but some still xfail on Alternator
because Alternator is too lenient - sometimes allowing longer attribute
names than DynamoDB allows. While this may sound great, it also has
downsides: The oversized attribute names perform badly, and as they
grow, Alternator's internal limits will be reached as well, and result
in an unsightly "internal server error" being reported instead of the
expected user-friendly error.
Refs #9169.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210810081948.2012120-1-nyh@scylladb.com>
"
* Make `sstable::make_reader()` return `flat_mutation_reader_v2`,
retain the old one as `sstable::make_reader_v1()`
* Start weaning tests off `sstable::make_reader_v1()` (done all the
easy ones, i.e. those not involving range tombstones)
"
* tag 'sstable-make-reader-v2-v1' of github.com:cmm/scylla:
tests: use flat_mutation_reader_v2 in the easier part of sstable_3_x_test
tests: upgrade the "buffer_overflow" test to flat_mutation_reader_v2
tests: get rid of sstable::make_reader_v1() in broken_sstable_test
tests: get rid of sstable::make_reader_v1() in the trivial cases
sstables: make sstable::make_reader() return flat_mutation_reader_v2
"
Validation compaction -- although I still maintain that it is a good
descriptive name -- was an unfortunate choice for the underlying
functionality because Origin has burned the name already as it uses it
for a compaction type used during repair. This opens the door for
confusion for users coming from Cassandra who will associate Validation
compaction with the purpose it is used for in Origin.
Additionally, since Origin's validation compaction was not user
initiated, it didn't have a corresponding `nodetool` command to start
it. Adding such a command would create an operational difference between
us and Origin.
To avoid all this we fold validation compaction into scrub compaction,
under a new "validation" mode. I decided against using the also
suggested `--dry-mode` flag, as I feel that a new mode is a more natural
choice: we don't have to define how it interacts with all the other
modes, unlike with a `--dry-mode` flag.
Fixes: #7736
Tests: unit(dev), manual(REST API)
"
* 'scrub-validation-mode/v2' of https://github.com/denesb/scylla:
compaction/compaction_descriptor: add comment to Validation compaction type
compaction/compaction_descriptor: compaction_options: remove validate
api: storage_service: validate_keyspace -> scrub_keyspace (validate mode)
compaction/compaction_manager: hide perform_sstable_validation()
compaction: validation compaction -> scrub compaction (validate mode)
compaction/compaction_descriptor: compaction_options: add options() accessor
compaction/compaction_descriptor: compaction_options::scrub::mode: add validate
Rename the old version to `sstables::make_reader_v1()`, to have a
nicely searchable eradication target.
Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>
There are situations where a node outside the current configuration is
the only node that can become a leader. We become candidates in such
cases. But there is an easy check for when we don't need to; a comment was
added explaining that.
* kbr/candidate-outside-config-v3:
raft: sometimes become a candidate even if outside the configuration
raft: fsm: update _commit_idx when applying snapshot
Adds a sync_point structure. A sync point is a (possibly incomplete)
mapping from hint queues to replay positions within them. Users will be
able to create sync points consisting of the last written positions of
some hint queues, so that they can wait until hint replay in all of
those queues reaches that point.
The sync point supports serialization - first it is serialized with the
help of IDL to a binary form, and then converted to a hexadecimal
string. Deserialization is also possible.
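Schematically (field and type names here are illustrative, not the
actual ones from the hints subsystem):

    #include <cstdint>
    #include <map>
    #include <string>

    using endpoint_id = std::string;   // identifies a hint queue's target
    using replay_position = uint64_t;  // position within a hint queue

    // A sync point: a possibly incomplete mapping from hint queues to
    // the replay position that hint replay must reach in each of them.
    struct sync_point {
        std::map<endpoint_id, replay_position> positions;
    };

    // Serialization: IDL to a binary form, then to a hexadecimal string
    // (declarations only, as a sketch).
    std::string encode(const sync_point&);
    sync_point decode(const std::string&);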
The base64 encoding/decoding functions will be used for serialization of
hint sync point descriptions. Base64 format is not specific to
Alternator, so it can be moved to utils.
There are situations where a node outside the current configuration is
the only node that can become a leader. We become candidates in such
cases. But there is an easy check for when we don't need to; a comment was
added explaining that.
Before the fix introduced in the previous patch, the cluster would
forget its configuration when taking a snapshot, making it unable to
reelect a leader. This regression test catches that.
Calculating clustering ranges on a local index has been rewritten to use the new `expression` variant.
This allows us to finally remove the old `bounds_ranges` function.
Closes #9080
* github.com:scylladb/scylla:
cql3: Remove unused functions like bounds_ranges
cql3: Use expressions to calculate the local-index clustering ranges
statement_restrictions_test: tests for extracting column restrictions
expression: add a function to extract restrictions for a column
We must not apply remote snapshots with commit indexes smaller than our
local commit index; this could result in out-of-order command
application to the local state machine replica, leading to
serializability violations.
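The guard amounts to something like this (names illustrative):

    // Must not apply a remote snapshot whose index is smaller than the
    // local commit index: that could re-apply commands out of order.
    if (snapshot_idx < _commit_idx) {
        return;  // ignore the stale snapshot
    }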
Message-Id: <20210805112736.35059-1-kbraun@scylladb.com>
In issue #9083 a user noted that whereas Cassandra's partition-count
estimation is accurate, Scylla's (rewritten in commit b93cc21) is very
inaccurate. The tests introduced here, which all xfail on Scylla, confirm
this suspicion.
The most important tests are the "simple" tests, involving a workload
which writes N *distinct* partitions and then asks for the estimated
partition count. Cassandra provides accurate estimates, which grow
more accurate with more partitions, so it passes these tests, while
Scylla provides bad estimates and fails them.
Additional tests demonstrate that neither Scylla nor Cassandra
can handle anything beyond the "simple" case of distinct partitions.
Two tests which xfail on both Cassandra and Scylla demonstrate that
if we write the same partitions to multiple sstables - or also delete
partitions - the estimated partition counts will be way off.
Refs #9083
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210726211315.1515856-1-nyh@scylladb.com>
We are folding validation compaction into scrub (at least on the
interface level), so remove the validation entry point accordingly and
have users go through `perform_sstable_scrub()` instead.
Fold validation compaction into scrub compaction (validate mode). Only
on the interface level though: to initiate validation compaction one now
has to use `compaction_options::make_scrub(compaction_options::scrub::mode::validate)`.
The implementation code stays as-is -- separate.
In this patch we add a test for another case where ALLOW FILTERING
should not be required (and Cassandra doesn't require it) but Scylla
does.
This problem was introduced by pull request #9122. The pull request
fixed an incorrect query (see issue #9085) involving both an index and
a multi-column restriction on a compound clustering key - and the fix is
using filtering. However, in one specific case involving a full prefix,
it shouldn't require filtering. This test reproduces this case.
The new test passes on Cassandra (and, theoretically, should pass),
but fails on Scylla - the check_af_optional() call fails because Scylla
made ALLOW FILTERING mandatory for that case.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210803092046.1677584-1-nyh@scylladb.com>
We already have tests for Query's ExclusiveStartKey option, but we
only exercised it as a way for paging linearly through all the results.
Now we add a test that confirms that ExclusiveStartKey can be used not
just for paging through all the results - but also for jumping directly to
the middle of a partition after any clustering key (existing or non-
existing clustering key). The new test also verifies, for the first time,
that an ExclusiveStartKey with a specific format works (previous tests just
copied LastEvaluatedKey to ExclusiveStartKey, so any opaque cookie could
have worked).
The test passes on both DynamoDB and Alternator so it did not find a new
bug. But it's useful to have as a regression test, in case in the future
we want to improve paging performance (see #6278) - and need to keep in
mind that ExclusiveStartKey is not just for paging.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210729114703.1609058-1-nyh@scylladb.com>
They were already correctly returned to the caller, but we had a
leftover discarded future that would sometimes end up with a
broken_promise exception. Ignore the exception explicitly.
Message-Id: <20210803122207.78406-1-kbraun@scylladb.com>
With commit 1924e8d2b6, compaction code was moved into a
top level dir as compaction is layered on top of sstables.
Let's continue this work by moving all compaction unit tests
into its own test file. This also makes things much more
organized.
sstable_datafile_test, as its name implies, will now only contain
sstable data tests. Perhaps it should be renamed to just
sstable_data_test, as the file also contains tests involving
other components, not only the data one.
BEFORE
$ cat test/boost/sstable_datafile_test.cc | grep TEST_CASE | wc -l
105
AFTER
$ cat test/boost/sstable_compaction_test.cc | grep TEST_CASE | wc -l
57
$ cat test/boost/sstable_datafile_test.cc | grep TEST_CASE | wc -l
48
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
Message-Id: <20210802192120.148583-1-raphaelsc@scylladb.com>
We should not use the current term; we should use the term of the
snapshot's index, which may be lower.
* https://github.com/kbr-/scylla/tree/snapshot-right-term-fix:
test: raft: regression test for using the correct term when taking a snapshot
test: raft: randomized_nemesis_test: server configuration parameter
raft: use the correct term when storing a snapshot
When a WHERE clause contains a multi-column restriction and an indexed
regular column, we must filter the results. It is generally not
possible to craft the index-table query so it fetches only the
matching rows, because that table's clustering key doesn't match up
with the column tuple.
Fixes #9085.
Tests: unit (dev, debug)
Signed-off-by: Dejan Mircevski <dejan@scylladb.com>
Closes #9122
It was observed that since fce124bd90 ('Merge "Introduce
flat_mutation_reader_v2" from Tomasz') database_test takes much longer.
This is expected since it now runs the upgrade/downgrade reader tests
on all existing tests. It was also observed that in a similar time frame
database_test sometimes times out on test machines, taking much
longer than usual, even with the extra work for testing reader
upgrade/downgrade.
In an attempt to reproduce, I noticed it failing on EMFILE (too many
open file descriptors). I saw that tests usually use ~100 open file
descriptors, while the default limit is 1024.
I suspect we have runaway concurrency, but I was not able to pinpoint the
cause. It could be compaction lagging behind, or cleanup work for
deleting tables (the test
test_database_with_data_in_sstables_is_a_mutation_source creates and
deletes many tables).
As a stopgap solution to unblock the tests, this patch raises the file
descriptor limit in the way recommended by [1]. While tests shouldn't
use so many descriptors, I ran out of ideas about how to plug the hole.
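The recommended idiom boils down to raising the soft limit to the hard
limit, roughly like this (sketch only; the actual patch differs in
details):

    #include <sys/resource.h>

    static void raise_fd_limit() {
        struct rlimit rl;
        if (getrlimit(RLIMIT_NOFILE, &rl) == 0) {
            rl.rlim_cur = rl.rlim_max;  // soft limit -> hard limit
            setrlimit(RLIMIT_NOFILE, &rl);
        }
    }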
Note that main() does something similar, though more elaborate, since
it needs to communicate to users. See ec60f44b64 ("main: improve
process file limit handling").
[1] http://0pointer.net/blog/file-descriptor-limits.html
Closes #9121
Restrict the expected exception message to filter only the relevant
exception, matching both Scylla and Cassandra.
For example, the former has this message:
Cannot use query parameters in CREATE MATERIALIZED VIEW statements
While the latter throws this:
Bind variables are not allowed in CREATE MATERIALIZED VIEW statements
Also, place cleanup code in a try-finally clause.
Tests: cql-pytest:test_materialized_view.py(dev)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
Message-Id: <20210802083912.229886-1-pa.solodovnikov@scylladb.com>
This is a translation of Cassandra's CQL unit test source file
validation/entities/TimestampTest.java into our cql-pytest framework.
This test file has a few tests (8) on various features of
cell timestamps. All these tests pass on Cassandra and on Scylla - i.e.,
no new Scylla bug was detected by these tests :-)
Two of the new tests are very slow (6 seconds each) and check a trivial
feature that was already checked elsewhere more efficiently (the fact
that TTL expiration works), so I marked them "skip" after verifying they
really pass.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Message-Id: <20210801142738.1633126-1-nyh@scylladb.com>
"
Previously, the following functions were
incorrectly marked as pure, meaning that the
function is executed at "prepare" step:
* `currenttimestamp()`
* `currenttime()`
* `currentdate()`
* `currenttimeuuid()`
For functions that possibly depend on timing and random seed,
this is clearly a bug. Cassandra doesn't have a notion of pure
functions, so they are lazily evaluated.
Make Scylla match Cassandra's behavior for these functions.
Add a unit test for the fix (excluding the `currentdate()` function,
because there is no way to use synthetic clock with query
processor and sleeping for a whole day to demonstrate correct
behavior is clearly not an option).
Also, extend the cql-pytest test for #8604: since there are now more
non-deterministic CQL functions, they are all subject to the test
now.
Fixes: #8816
"
* 'timeuuid_function_pure_annotation_v3' of https://github.com/ManManson/scylla:
test: test_non_deterministic_functions: test more non-pure functions
cql3: change `current*()` CQL functions to be non-pure
Check that all existing non-pure functions (except for
`currentdate()`) work correctly with or without prepared
statements.
Tests: cql-pytest/test_non_deterministic_functions.py(dev)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
These include the following:
* `currenttimestamp()`
* `currenttime()`
* `currentdate()`
* `currenttimeuuid()`
Previously, they were incorrectly marked as pure, meaning
that the function is executed at the "prepare" step.
For functions that possibly depend on timing and random seed,
this is clearly a bug. Cassandra doesn't have a notion of pure
functions, so they are lazily evaluated.
Make Scylla match Cassandra's behavior for these functions.
Add a unit test for the fix (excluding the `currentdate()` function,
because there is no way to use synthetic clock with query
processor and sleeping for a whole day to demonstrate correct
behavior is clearly not an option).
Tests: unit(dev, debug)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
The request should fail with an `InvalidRequest` exception and shouldn't
crash the database.
Don't check for actual error messages, because they are different
between Scylla and Cassandra.
The former has this message:
Cannot use query parameters in CREATE MATERIALIZED VIEW statements
While the latter throws this:
Bind variables are not allowed in CREATE MATERIALIZED VIEW statements
Tests: cql-pytest/test_materialized_view.py(scylla dev, cassandra trunk)
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>
"
`function_call` AST nodes are created for each function
with side effects in a CQL query, i.e. non-deterministic
functions (`uuid()`, `now()` and some other timeuuid-related ones).
These nodes are evaluated either when a query itself is executed
or query restrictions are computed (e.g. partition/clustering
key ranges for LWT requests).
We need to cache the calls since otherwise when handling a
`bounce_to_shard` request for an LWT query, we can possibly
enter an infinite bouncing loop (in case a function is used
to calculate partition key ranges for a query), since the
results can be different each time.
Furthermore, we don't support bouncing more than once.
Returning a `bounce_to_shard` message more than once
will result in a crash.
Caching works only for LWT statements and only for the function
calls that affect partition key range computation for the query.
The `variable_specifications` class is renamed to `prepare_context`
and generalized to record information about each `function_call`
AST node and modify them, as needed:
* Check whether a given function call is a part of partition key
statement restriction.
* Assign ids for caching if the above is true and the call is a part
of an LWT statement.
Function calls are indexed by the order in which they appear
within a statement while parsing. There is no need to include
any kind of statement identifier in the cache key since
`query_options` (which holds the cache) is limited to a single
statement, anyway.
Note that `function_call::raw` AST nodes are not created
for selection clauses of a SELECT statement, hence they
can accept only one of the following things as parameters:
* Other function calls.
* Literal values.
* Parameter markers.
In other words, only parameters that can be immediately reduced
to a byte buffer are allowed and we don't need to handle
database inputs to non-pure functions separately since they
are not possible in this context. Anyhow, we don't even have
a single non-pure function that accepts arguments, so precautions
are not needed at the moment.
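The caching scheme, schematically (illustrative names; in the actual
series the cached values live in `query_options`):

    #include <cstddef>
    #include <optional>
    #include <vector>

    using cached_value = std::vector<char>;  // an evaluated byte buffer

    struct fn_call_cache {
        // Slot i holds the result of the i-th function call, in the
        // order the calls were seen while parsing the statement.
        std::vector<std::optional<cached_value>> slots;

        template <typename Eval>
        const cached_value& get_or_eval(size_t id, Eval eval) {
            if (id >= slots.size()) {
                slots.resize(id + 1);
            }
            if (!slots[id]) {
                slots[id] = eval();  // first evaluation: remember it
            }
            return *slots[id];       // a bounced retry reuses the value
        }
    };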
Add a test written in the `cql-pytest` framework to verify
that both prepared and unprepared LWT statements handle
`bounce_to_shard` messages correctly in such a scenario.
Fixes: #8604
Tests: unit(dev, debug)
NOTE: the patchset uses `query_options` as a container for
cached values. This doesn't look clean and `service::query_state`
seems to be a better place to store them. But it's not
forwarded to most of the CQL code, and forwarding it would mean
that a huge number of places would have to be amended.
The series presents a trade-off to avoid forwarding `query_state`
everywhere (but maybe it's the thing that needs to be done, nonetheless).
"
* 'lwt_bounce_to_shard_cached_fn_v6' of https://github.com/ManManson/scylla:
cql-pytest: add a test for non-pure CQL functions
cql3: cache function calls evaluation for non-deterministic functions
cql3: rename `variable_specifications` to `prepare_context`
Introduce a test using the `cql-pytest` framework to assert that
both prepared and unprepared LWT statements (insert with
`IF NOT EXISTS`) with a non-deterministic function call
work correctly in case its evaluation affects partition
key range computation (hence the choice of `cas_shard()`
for the LWT query).
Tests: cql-pytest/test_non_deterministic_functions.py
Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>