Marking a test as flaky allows it to keep running in CI rather than being disabled once it is discovered to be flaky.
Flaky tests that fail show up as flaky in the output, but don't fail the CI run.
```
kostja@hulk:~/work/scylla/scylla$ ./test.py cdc_with --repeat=30 --verbose
Found 30 tests.
================================================================================
[N/TOTAL] SUITE MODE RESULT TEST
------------------------------------------------------------------------------
[1/30] cql debug [ FLKY ] cdc_with_lwt_test.2 9.36s
[2/30] cql debug [ FLKY ] cdc_with_lwt_test.1 9.53s
[3/30] cql debug [ PASS ] cdc_with_lwt_test.7 9.37s
[4/30] cql debug [ PASS ] cdc_with_lwt_test.8 9.41s
[5/30] cql debug [ PASS ] cdc_with_lwt_test.10 9.76s
[6/30] cql debug [ FLKY ] cdc_with_lwt_test.9 9.71s
```
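The retry logic behind a FLKY result can be sketched like this (a minimal Python model, not the actual test.py implementation; names are made up):

```python
def run_with_flaky_support(test_fn, is_flaky, max_retries=2):
    """Run a test; a test marked flaky that eventually passes is
    reported as FLKY instead of FAIL, and does not fail the run."""
    attempts = 1 + (max_retries if is_flaky else 0)
    for attempt in range(attempts):
        if test_fn():
            # Passed on a retry: report FLKY, but the run stays green.
            return "FLKY" if attempt > 0 else "PASS"
    return "FAIL"
```

A test that fails once and then passes would be reported as FLKY; a non-flaky test gets no retries.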
Closes #10721
* github.com:scylladb/scylla:
test.py: add support for flaky tests
test.py: make Test hierarchy resettable
test.py: proper suite name in the log
test.py: shutdown cassandra-python connection before exit
Use a nice suite name rather than an internal Python
object key in the log. Fixes a regression introduced
when addressing a style-related review remark.
Shutdown cassandra-python connections before exit, to avoid
warnings/exceptions at shutdown.
Cassandra-python runs a thread pool and if
connections are not shut down before exit, there could
be a warning that the thread pool is not destroyed
before exiting main.
Change port type passed to Cassandra Python driver to int to avoid format errors in exceptions.
Manually shutdown connections to avoid reconnects after tests are done (required by upcoming async pytests).
Tests: (dev)
Closes #10722
* github.com:scylladb/scylla:
test.py: shutdown connection manually
test.py: fix port type passed to Cassandra driver
"
This series is a consequence of the work started by:
"compaction: LCS: Fix inefficiency when pushing SSTables to higher levels"
9de7abdc80
"Redefine Compaction Backlog to tame compaction aggressiveness" d8833de3bb
The backlog definition for leveled is incorrectly built on the assumption that
the world must reach the state of zero amplification, i.e. everything in the
last level. The actual goal is space amplification of 1.1.
In reality, LCS just wants every level L to be fan_out=10 times larger than
level L-1. See more in commit 9de7abdc80, which adjusts LCS to conform
to this goal.
If level 3 = 1000G, level 2 = 100G, level 1 = 10G, level 0 = 1G, that should
return zero backlog as space amplification is (1000+100+10+1)/1000 = ~1.1
But today, LCS calculates high backlog for the layout above, as it will only be
satisfied once everything is promoted to the maximum level. That's completely
disconnected from what the strategy actually wants. Therefore, a mismatch.
With today's definition, the backlog for any SSTable is:
sizeof(sstable) * (Lmax - levelof(sstable)) * fan_out
where Lmax = maximum level,
and fan_out = LCS' fan out which is 10 by default
That's essentially calculating the total cost for data in the SSTable to climb
up to the maximum level. Of course, if a SSTable is at the maximum level,
(Lmax - levelof(sstable)) returns zero, therefore backlog for it is zero.
Take a look at this example:
If an L0 sstable is 0.16G (LCS' default fragment size), then its
backlog = 0.16G * (3 - 0) * 10 = 4.8G.
The maximum level (Lmax in the formula) can easily be 3, as:
log10(30G / 0.16G = ~187 sstables) = ~2.27
~2.27 means that the data has exceeded level 2 capacity and so needs 3 levels.
So 3 L0 sstables could add ~15G of backlog. With 1G of memory per shard (30:1
disk:memory ratio), that's a normalized backlog of ~15, which translates into
an additional ~500 shares. That's halfway to full compaction speed.
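The arithmetic above can be reproduced directly from the old formula (a Python sketch of the numbers in this commit message, not the actual controller code):

```python
import math

FAN_OUT = 10
FRAGMENT_SIZE_GB = 0.16   # LCS' default fragment size

def old_lcs_backlog(sstable_size_gb, level, max_level):
    # Total cost for the data in the sstable to climb up to the maximum level.
    return sstable_size_gb * (max_level - level) * FAN_OUT

# Lmax = 3, since log10(30G / 0.16G) = log10(~187 sstables) = ~2.27
lmax = math.ceil(math.log10(30 / FRAGMENT_SIZE_GB))

# One L0 sstable: 0.16G * (3 - 0) * 10 = 4.8G of backlog.
one = old_lcs_backlog(FRAGMENT_SIZE_GB, 0, lmax)

# Three L0 sstables, normalized by 1G of memory per shard: ~15.
normalized = 3 * one / 1.0
```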
With more files in higher levels, we can easily get to a normalized backlog
above 30, resulting in 1k shares.
The suboptimal backlog definition causes either table using LCS or coexisting
tables to run with more shares than needed, causing compaction to steal
resources, resulting in higher latency and reduced throughput.
To solve this problem, a new formula is used which basically calculates the
amount of work needed to achieve the layout goal. We no longer want to
promote everything to the last level; instead we incrementally calculate the
backlog at each level L as the amount of work needed for the next level L+1
to be at least fan_out times bigger.
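The new per-level definition can be sketched as follows (a simplified Python model of the layout goal only, not the actual C++ backlog tracker):

```python
FAN_OUT = 10

def new_lcs_backlog(level_sizes):
    """level_sizes[i] = total size of level i, lowest level first.
    The backlog of level L is the amount that must be promoted out of
    it so that level L+1 is at least FAN_OUT times bigger."""
    backlog = 0.0
    for lower, upper in zip(level_sizes, level_sizes[1:]):
        # Data in `lower` beyond upper/FAN_OUT still needs promoting.
        backlog += max(0.0, lower - upper / FAN_OUT)
    return backlog

# The layout L0=1G, L1=10G, L2=100G, L3=1000G already satisfies the
# fan-out goal, so its backlog is zero and compaction stays quiet.
satisfied = new_lcs_backlog([1, 10, 100, 1000])
```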
Fixes #10583.
Results
=====
image:
https://user-images.githubusercontent.com/1409139/168713675-d5987d09-7011-417c-9f91-70831c069382.png
The patched version correctly clears the backlog, meaning that once LCS is
satisfied, backlog is 0. Therefore, next compaction either from this table or
another won't run unnecessarily aggressive.
p99 read and write latency have clearly improved. Throughput is also more
stable.
"
* 'LCS_backlog_revamp' of https://github.com/raphaelsc/scylla:
tests: sstable_compaction_test: Adjust controller unit test for LCS
compaction: Redefine Leveled compaction backlog
The controller unit test for LCS was only creating level 0 SSTables.
As level 0 falls back to STCS controller, it means that we weren't actually
testing LCS controller.
So let's adjust the unit test to account for LCS fan_out, which is 10
instead of 4, and also allow creation of SSTables on higher levels.
Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>
It turns out that DynamoDB forbids requesting the same item more than
once in a BatchGetItem request. Doing so would obviously be a
waste, but DynamoDB outright refuses it - and Alternator currently
doesn't (refs #10757).
The test currently passes on DynamoDB and fails on Alternator, so it
is marked xfail.
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Closes #10758
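The duplicate check Alternator needs can be sketched as (a hypothetical helper, not the actual Alternator code; key dicts are simplified):

```python
def validate_batch_get_keys(keys):
    """DynamoDB rejects a BatchGetItem request that names the same
    item twice. Keys are dicts, so compare a canonical form that is
    insensitive to attribute ordering."""
    seen = set()
    for key in keys:
        canonical = tuple(sorted(key.items()))
        if canonical in seen:
            raise ValueError(
                "Provided list of item keys contains duplicates")
        seen.add(canonical)
```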
The issue is about handling errors when the user specifies something strange instead of a type, e.g. CREATE TABLE try1 (a int PRIMARY KEY, b list<zzz>):
* the error message only talks about collections, while zzz could also be a UDT;
* the same error message is given even when zzz is not a valid collection or UDT name.
The first point has already been fixed; now Scylla says 'Non-frozen user types or collections are not allowed inside collections: list<zzz>'. This commit fixes the second.
Whether the type is a valid UDT is checked in cql3_type::raw_ut::prepare_internal, but the 'non-frozen' check triggers first, in cql3_type::raw_collection::prepare_internal, before we recursively get to the argument types of the collection. The patch reverses the order: first we recurse and ensure that the collection argument types are valid, and only then do we apply the collection checks. A side effect of this is that the error messages of the checks in raw_collection now include the keyspace name, because it is assigned in raw_ut::prepare_internal before they run.
The patch affects the validation order, so in the case of list<zzz<xxx>> the message could be different, but such input doesn't seem to be possible according to the CQL grammar.
Examples:
create type ut2 (a int, b list<ut1>); --> error('Unknown type ks.ut1')
create type ut1 (a int);
create type ut2 (a int, b list<ut1>); --> error('Non-frozen user types or collections are not allowed inside collections: list<ks.ut1>')
create type ut2 (a int, b list<frozen<ut1>>); --> OK
Fixes: scylladb#3541
Closes #10726
When a memtable receives a tombstone, under some workloads it can cover
data which is still in the memtable: some workloads insert and delete
data within a short time frame. We could reduce the rate of memtable
flushes if we eagerly dropped tombstoned data.
One workload which benefits is the raft log. It stores a row for each
uncommitted raft entry. When entries are committed they are
deleted. So the live set is expected to be short under normal
conditions.
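The intended behaviour can be illustrated with a toy memtable (a Python sketch under the simplifying assumption of one value per key; not Scylla code):

```python
class ToyMemtable:
    """Toy single-partition memtable that eagerly drops data covered by
    a newer tombstone, instead of keeping both until flush. Tombstones
    are kept, since they must still shadow older data on disk."""
    def __init__(self):
        self.rows = {}  # key -> (timestamp, value); value None = tombstone

    def insert(self, key, timestamp, value):
        cur = self.rows.get(key)
        if cur is None or timestamp >= cur[0]:
            self.rows[key] = (timestamp, value)

    def delete(self, key, timestamp):
        cur = self.rows.get(key)
        if cur is None or timestamp >= cur[0]:
            # The tombstone covers data still in the memtable: drop the
            # data eagerly, keep only the tombstone.
            self.rows[key] = (timestamp, None)

    def live_size(self):
        return sum(1 for _, v in self.rows.values() if v is not None)
```

In a raft-log-like workload (insert an entry, then delete it once committed) the live set stays small, so the memtable fills up much more slowly.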
Fixes #652.
This patch prevents virtual dirty from going negative during memtable
flush in case partition version merging erases data previously
accounted by the flush reader. There is an assert in
~flush_memory_accounter which guards for this.
This will start happening after tombstones are compacted with rows on
partition version merging.
This problem is prevented by the patch by having the cleaner notify
the memtable layer via callback about the amount of dirty memory released
during merging, so that the memtable layer can adjust its accounting.
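The accounting protocol can be sketched as (a hypothetical Python model of the callback; names are made up):

```python
class FlushAccounter:
    """Tracks how much dirty memory a flush still has to release.
    If the cleaner compacts away data the flush reader had already
    accounted, it must notify us, or the counter would go negative."""
    def __init__(self, dirty):
        self.remaining = dirty

    def on_flushed(self, amount):
        if amount > self.remaining:
            raise AssertionError("virtual dirty went negative")
        self.remaining -= amount

    def on_cleaner_released(self, amount):
        # Callback from the cleaner: memory it freed during partition
        # version merging no longer needs to be released by the flush.
        self.remaining -= min(amount, self.remaining)
```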
Memtables and cache will compact eagerly, so tests should not expect
readers to produce the exact mutations written, only ones which are
equivalent after applying compaction.
The compacting reader created using make_compacting_reader() was not
dropping range_tombstone_change fragments which were shadowed by the
partition tombstones. As a result the output fragment stream was not
minimal.
Lack of this change would cause problems in unit tests later in the
series after the change which makes memtables lazily compact partition
versions. In test_reverse_reader_reads_in_native_reverse_order we
compare output of two readers, and assume that compacted streams are
the same. If compacting reader doesn't produce minimal output, then
the streams could differ if one of them went through the compaction in
the memtable (which is minimal).
Prerequisite for eagerly applying tombstones, which we want to be
preemptible. Before this patch, the apply path to the memtable was not
preemptible.
Because merging can now be deferred, we need to involve snapshots to
kick off background merging in case of preemption. This requires us to
propagate the region and cleaner objects, in order to create a snapshot.
Before this change, the test artificially set the soft pressure
condition, hoping that the background flusher would flush the
memtable. That won't happen if, by the time the background flusher runs,
the LSA region has been updated and the soft pressure (which is not really
there) is lifted. Once apply() becomes preemptible, background partition
version merging can lift the soft pressure, making the memtable flush
not occur and the test fail.
Fix by triggering soft pressure on retries.
To prevent async scheduling issues of reconnection after tests are done,
manually close the connection after fixture ends.
Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>
- Introduce a simpler substitute for `flat_mutation_reader`-resulting-from-a-downgrade that is adequate for the remaining uses but is _not_ a full-fledged reader (does not redirect all logic to an `::impl`, does not buffer, does not really have `::peek()`), so hopefully carries a smaller performance overhead. The name `mutation_fragment_v1_stream` is kind of a mouthful but it's the best I have
- (not tests) Use the above instead of `downgrade_to_v1()`
- Plug it in as another option in `mutation_source`, in and out
- (tests) Substitute deliberate uses of `downgrade_to_v1()` with `mutation_fragment_v1_stream()`
- (tests) Replace all the previously-overlooked occurrences of `mutation_source::make_reader()` with `mutation_source::make_reader_v2()`, or with `mutation_source::make_fragment_v1_stream()` where deliberate or still required (see below)
- (tests) This series still leaves some tests with `mutation_fragment_v1_stream` (i.e. at v1) where not called for by the test logic per se, because another missing piece of work is figuring out how to properly feed `mutation_fragment_v2` (i.e. range tombstone changes) to `mutation_partition`. While that is not done (and I think it's better to punt on it in this PR), we have to produce `mutation_fragment` instances in tests that `apply()` them to `mutation_partition`, thus we still use downgraded readers in those tests
- Remove the `flat_mutation_reader` class and things downstream of it
Fixes #10586
Closes #10654
* github.com:scylladb/scylla:
fix "ninja dev-headers"
flat_mutation_reader is dead
tests: downgrade_to_v1() -> mutation_fragment_v1_stream()
tests: flat_reader_assertions: refactor out match_compacted_mutation()
tests: ms.make_reader() -> ms.make_fragment_v1_stream()
repair/row_level: mutation_fragment_v1_stream() instead of downgrade_to_v1()
stream_transfer_task: mutation_fragment_v1_stream() instead of downgrade_to_v1()
sstables_loader: mutation_fragment_v1_stream() instead of downgrade_to_v1()
mutation_source: add ::make_fragment_v1_stream()
introduce mutation_fragment_v1_stream
tests: ms.make_reader() -> ms.make_reader_v2()
tests: remove test_downgrade_to_v1_clear_buffer()
mutation_source_test: fix indentation
tests: remove some redundant calls to downgrade_to_v1()
tests: remove some to-become-pointless ms.make_reader()-using tests
tests: remove some to-become-pointless reader downgrade tests
The test sometimes fails because the order of rows in the SELECT results
depends on how stream IDs for the different partition keys get generated.
In some runs the stream ID for pk=1 may go before the stream ID for
pk=4, in some runs the other way.
The fix is to use the same partition key but different clustering keys
for the different rows.
Refs: #10601
Closes #10718
The planned limited replacement of the downgraded v1 mutation reader
will not do its own buffering, so this test will become pointless.
Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>
mutation_source objects are going to be created only from v2 readers,
and the ::make_reader() method family is scheduled for removal.
Signed-off-by: Michael Livshin <michael.livshin@scylladb.com>
Fixes#10489
Killing the CDC log table on CDC disable is unhelpful in many ways,
partly because it can cause random exceptions on nodes trying to
do a CDC-enabled write at the same time as log table is dropped,
but also because it makes it impossible to collect data generated
before CDC was turned off, but which is not yet consumed.
Since the data should be TTL'ed anyway, retaining the table should not
really add any overhead beyond the compaction needed to eventually clear
it. And if the user set TTL=0 (disabled), they are already responsible
for clearing out the data.
This also has the nice feature of meshing with the alternator streams
semantics.
Closes #10601
The change
- adds a test which exposes a problem of a peculiar setup of
tombstones that trigger a mutation fragment stream validation exception
- fixes the problem
Applying tombstones in the order:
range_tombstone_change pos(ck1), after_all_prefixed, tombstone_timestamp=1
range_tombstone_change pos(ck2), before_all_prefixed, tombstone=NONE
range_tombstone_change pos(NONE), after_all_prefixed, tombstone=NONE
Leads to swapping the order of mutations when they are written to and
read from disk via the sstable writer. This is caused by the conversion
of range_tombstone_change (the in-memory representation) to range
tombstone marker (the on-disk representation) and back.
When this mutation stream is written to disk, each range tombstone
marker's kind is calculated based on the relationship between
range_tombstone_changes. The RTC series above produces markers
(start, end, start). When the last marker is loaded from disk, its kind
is incorrectly loaded as before_all_prefixed instead of
after_all_prefixed. This leads to an incorrect order of mutations.
The solution is to skip writing a new range_tombstone_change with empty
tombstone if the last range_tombstone_change already has empty
tombstone. This is redundant information and can be safely removed,
while the logic of encoding RTCs as markers doesn't handle such
redundancy well.
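The skip can be sketched as (a toy Python model; positions and tombstones are simplified placeholders, not Scylla types):

```python
def drop_redundant_rtcs(changes):
    """changes: list of (position, tombstone) pairs, where tombstone is
    None for 'no tombstone'. A change to 'no tombstone' is redundant if
    the effective tombstone is already absent, so skip writing it."""
    out = []
    prev_tomb = None  # the stream starts with no active tombstone
    for pos, tomb in changes:
        if tomb is None and prev_tomb is None:
            continue  # redundant: no change in the effective tombstone
        out.append((pos, tomb))
        prev_tomb = tomb
    return out
```

Applied to the sequence in this commit message, the third change (position NONE, tombstone NONE) is dropped, so the marker encoding never sees the problematic (start, end, start) shape.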
Closes #10643
I noticed that `column_condition` (used in LWT `IF` clause) supports lists.
As part of the Grand Expression Unification we'll need to migrate that to
expressions, so we'll need to support list subscripts.
Use the opportunity to relax the normal filtering to allow filtering on
list subscripts: `WHERE my_list[:index] = :value`.
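The filtering semantics of `my_list[:index] = :value` can be modeled as (a Python sketch for illustration only; the exact NULL and out-of-range semantics follow the CQL rules):

```python
def matches_list_subscript(row_list, index, value):
    """ALLOW FILTERING semantics for my_list[index] = value: a row
    whose list is null or too short simply doesn't match."""
    if row_list is None or index < 0 or index >= len(row_list):
        return False
    return row_list[index] == value
```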
Closes #10645
* github.com:scylladb/scylla:
test: cql-pytest: add test for list subscript filtering
doc: document list subscripts usable in WHERE clause
cql3: expr: drop restrictions on list subscripts
cql3: expr: prepare_expr: support subscripted lists
cql3: expressions: reindent get_value()
cql3: expression: evaluate() support subscripting lists
coroutine::parallel_for_each avoids an allocation and is therefore preferred.
The lifetime of the function object is less ambiguous, so it is also safer.
Replace all eligible occurrences (i.e. where the caller is a coroutine).
One case (storage_service::node_ops_cmd_heartbeat_updater()) needed a little extra
attention since there was a handle_exception() continuation attached. It is converted
to a try/catch.
Closes #10699
We modify the `reconfigure` and `modify_config` APIs to take a vector of
<server_id, bool> pairs (instead of just a vector of server_ids), where
the bool indicates whether the server is a voter in the modified config.
The `reconfiguration` operation would previously shuffle the set of
servers and split it into two parts: members and non-members. Now it
partitions it into three parts: voters, non-voters, and non-members.
The PR also includes fixes for some liveness problems stumbled upon
during testing.
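The three-way partitioning can be sketched as (a simplified Python model of the test generator; names are hypothetical, not the actual C++ test code):

```python
import random

def pick_reconfiguration(servers, rng=random):
    """Shuffle the servers and partition them into voters, non-voters
    and non-members; members are expressed as (server_id, is_voter)
    pairs, matching the new reconfigure/modify_config API shape."""
    servers = list(servers)
    rng.shuffle(servers)
    i = rng.randint(1, len(servers))   # at least one voter
    j = rng.randint(i, len(servers))
    voters, non_voters = servers[:i], servers[i:j]
    # servers[j:] are left out of the new configuration entirely.
    return [(s, True) for s in voters] + [(s, False) for s in non_voters]
```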
Closes #10640
* github.com:scylladb/scylla:
test: raft: randomized_nemesis_test: include non-voters during reconfigurations
raft: server: if `add_entry` with `wait_type::applied` successfully returns, ensure `state_machine::apply` is called for this entry
raft: tracker: fix the definition of `voters()`
raft: when printing `raft::server_address`, include `can_vote`
Infer the type of a list index as int32_type.
The error message when a non-subscriptable type is provided is
changed, so the corresponding test is changed too.
is not supported' from Nadav Har'El
This small series implements the DescribeTimeToLive and
DescribeContinuousBackups operations in Alternator. Even though the
corresponding features aren't implemented, having just the Describe
operations can help applications, by reporting that these features are
currently disabled.
Fixes #10660
Closes #10670
* github.com:scylladb/scylla:
alternator: remove dead code
alternator: implement DescribeContinuousBackups operation
alternator: allow DescribeTimeToLive even without TTL enabled
This patch contains five tests which reproduce three old bugs in
Scylla's handling of multi-column restrictions like (c1,c2)<(1,2).
These old bugs are:
Refs #64 (yes, a two-digit issue!)
Refs #4244
Refs #6200
The three github issues are closely intertwined, exposing the same
or similar bugs in our internal implementation, and I suspect that
eventually most of them could be fixed together.
In writing these tests, I carefully read all three issues and the
various failure scenarios described in them, tried to distill and
simplify the scenarios, and also consider various other broken
variants of the scenarios. The resulting tests are heavily commented,
explaining the motivation of each test and exactly which of the
above bugs it reproduces.
All five tests included in this patch pass on Cassandra and currently
fail on Scylla, so are marked "xfail".
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Closes #10675
"
The table keeps references to the sstables and compaction managers
(among other things), but the former sits as a pointer on the table's
config while the latter is a direct on-table reference.
This set unifies the two by turning the sstables manager's on-config
pointer into an on-table reference.
branch: https://github.com/xemul/scylla/tree/br-table-vs-sstables-manager
tests: https://jenkins.scylladb.com/job/releng/job/Scylla-CI/574/
"
* 'br-table-vs-sstables-manager' of https://github.com/xemul/scylla:
tests: Remove sstables_manager& from column_family_test_config()
table: Move sstables_manager from config onto table itself
table, db, tests: Pass sstables_manager& into table constructor
In issues #7944 and #10625 it was noticed that assigning an empty
string to a non-string type (int, date, etc.) using INSERT or
INSERT JSON can, in some combinations, create "empty" values
where a clear error should be produced.
The tests added in this patch explore the different combinations of
types and insert modes, and reproduce several buggy cases in Scylla
(resulting in xfail'ing tests) as well as Cassandra.
We feared that there might be a way using those buggy statements to
create a partition with an empty key - something which used to kill
older versions of Scylla. But the tests show that this is not possible -
while a user can use the buggy statements to create an empty value,
Scylla refuses it when it is used as a single-column partition key.
Refs #10625
Refs #7944
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Closes #10628
The purpose of this series is to introduce infrastructure
for managed scylla processes into test.py,
switch some existing suites to use test.py managed processes
and introduce cluster tests.
All of this is expected to make it possible to test Raft topology
changes and schema changes using an easy-to-use and fast tool
such as test.py. In general this will allow testing Scylla clusters
from within the development test harness.
Branch URL: kostja/test.py.v5
Closes #10406
* github.com:scylladb/scylla:
test: disable topology/test_null
test.py: disable cdc_with_lwt_test it's flaky in debug mode
test.py: workaround for a python bug
test: cleanup (drop keyspace) in two cql tests to support --repeat
test.py: respect --verbose even if output is a tty
test: remove tools/cql_repl
test.py: switch cql/ suite to pytest/tabular output
test: remove a flaky test case
test.py: implement CQL approval tests over pytest
test.py: implement cql_repl
test.py: add topology suite
test.py: add common utility functions to test/pylib
test.py: switch cql-pytest and rest_api suites to PythonTestSuite
test.py: introduce PythonTest and PythonTestSuite
test.py: use artifact registry
test.py: temporarily disable raft
test.py: (pylib) add Scylla Server and Artifact Registry
test.py: (pylib) add Host Registry to track used server hosts
test.py: (pylib) add a pool of scylla servers (or clusters)
The manager reference is already available in the constructor and thus
can be copied to an on-table member.
The code that chooses the manager (the user or system one) should be moved
from make_column_family_config() into the add_column_family() method.
Once this happens, get_sstables_manager() should be fixed to
return the reference from its new location. While at it, mark the
method in question noexcept and add its mutable overload.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
In core code there's only one place that constructs a table -- in
database.cc -- and this place currently has the sstables_manager pointer
sitting on the table config (despite being a pointer, it's always non-null).
All the tests use the manager from one of the _env's out there.
For now the new constructor arg is unused.
Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>
We modify the `reconfigure` and `modify_config` APIs to take a vector of
<server_id, bool> pairs (instead of just a vector of server_ids), where
the bool indicates whether the server is a voter in the modified config.
The `reconfiguration` operation would previously shuffle the set of
servers and split it into two parts: members and non-members. Now it
partitions it into three parts: voters, non-voters, and non-members.
This patch adds reproducing tests for wrong handling of LIMIT in a query
which uses a secondary index *and* filtering, described in issue #10649.
In that case, Scylla incorrectly limits the number of rows found in the
index *before* the filtering, while it should limit the number of rows
*after* the filtering.
The tests in this patch (which xfail on Scylla, and pass on Cassandra)
go beyond the minimum required to reproduce this bug. It turns out that
there are different sub-cases of this problem that go through different
code paths, namely whether the base table has clustering keys or just
partition keys, and whether the overall LIMITed result spans more than
one page. So these tests attempt to also cover all these sub-cases.
Without all these test sub-cases, an incomplete and incorrect fix of this
bug may, by chance, cause the original test to succeed.
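The correct and buggy orders of applying LIMIT can be contrasted in a few lines (a Python sketch of the semantics, unrelated to the actual code paths):

```python
from itertools import islice

def limit_after_filter(rows, predicate, limit):
    """Correct order: filter first, then apply LIMIT."""
    return list(islice(filter(predicate, rows), limit))

def limit_before_filter(rows, predicate, limit):
    """Buggy order (issue #10649): limit the index matches before
    filtering, which can return fewer rows than LIMIT allows."""
    return [r for r in rows[:limit] if predicate(r)]
```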
Refs #10649
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Closes #10658
Introduce a new operation, `raft_read`, which calls `read_barrier`
on a server, reads the state of the server's state machine, and returns
that state.
Extend the generator in `basic_generator_test` to generate `raft_read`s.
Only do it if forwarding is enabled (although it may make sense to test
read barriers in non-forwarding scenario as well - we may think about it
and do it in a follow-up).
Check the consistency of the read results by comparing them with the model
and using the result to extend the model with any newly observed elements.
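Assuming the replicated state machine is a grow-only set of elements (a toy model, not the actual test's state machine), the consistency check can be sketched as:

```python
def check_and_extend(model, observed):
    """A linearizable read (after read_barrier) must see at least
    everything already known to be in the state (the model); any new
    elements it reveals extend the model."""
    missing = model - observed
    if missing:
        raise AssertionError(f"read lost known elements: {missing}")
    model |= observed
    return model
```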
The patchset includes some fixes for correctness (#10578)
and liveness (handling aborts correctly).
Closes #10561
* github.com:scylladb/scylla:
test: raft: randomized_nemesis_test: check consistency of reads
test: raft: randomized_nemesis_test: perform linearizable reads using read_barriers
test: raft: randomized_nemesis_test: add flags for disabling nemeses
raft: server: in `abort()`, abort read barriers before waiting for rpc abort
raft: server: handle aborts correctly in `read_barrier`
raft: fsm: don't advance commit index further than match_idx during read_quorum
Although we don't yet support the DynamoDB API's backup features (see
issue #5063), we can already implement the DescribeContinuousBackups
operation. It should just say that continuous backups, and point-in-time
restores, are disabled.
This will be useful for client code which tries to inquire about
continuous backups, even if not planning to use them in practice
(e.g., see issue #10660).
Refs #5063
Refs #10660
Signed-off-by: Nadav Har'El <nyh@scylladb.com>
Scylla has a bug that fires only once in a hundred runs in debug mode,
where a schema change running in parallel with a topology change leads
to a lost keyspace and an internal error. Disable the tests until Raft
is enabled for schema.