scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-08 07:53:20 +00:00

Author	SHA1	Message	Date
Avi Kivity	e2dfa24540	Merge "token_metadata: add clear_gently" from Benny " We've encountered a number of reactor stalls related to token_metadata that were fixed in `052a8d036d`. This is a follow-up series that adds a clear_gently method to token_metadata that uses continuations to prevent reactor stalls when destroying token_metadata objects. Test: unit(dev), {network_topology_strategy,storage_proxy}_test(debug) " * tag 'token_metadata_clear_gently-v3' of github.com:bhalevy/scylla: token_metadata: add clear_gently token_metadata: shared_token_metadata: add mutate_token_metadata token_metdata: futurize update_normal_tokens abstract_replication_strategy: get_pending_address_ranges: invoke clone_only_token_map if can_yield repair: replace_with_repair: convert to coroutine	2020-12-22 13:23:31 +02:00
Nadav Har'El	f2978e1873	cql-pytest: port Cassandra's collection_test.py A previous patch added test/cql-pytest/cassandra_tests - a framework for porting Cassandra's unit tests to Python - but only ported two tiny test files with just 3 tests. In this patch, we finally port a much larger test file validation/entities/collection_test.java. This file includes 50 separate tests, which cover a lot of aspects of collection support, as well as how other stuff interact with collections. As of now, 23 (!) of these 50 tests fail, and exposed six new issues in Scylla which I carefully documented: Refs #7735: CQL parser missing support for Cassandra 3.10's new "+=" syntax Refs #7740: CQL prepared statements incomplete support for "unset" values Refs #7743: Restrictions missing support for "IN" on tables with collections, added in Cassandra 4.0 Refs #7745: Length of map keys and set items are incorrectly limited to 64K in unprepared CQL Refs #7747: Handling of multiple list updates in a single request differs from recent Cassandra Refs #7751: Allow selecting map values and set elements, like in Cassandra 4.0 These issues vary in severity - some are simply new Cassandra 4.0 features that Scylla never implemented, but one (#7740) is an old Cassandra 2.2 feature which it seems we did not implement correctly in some cases that involve collections. Note that there are some things that the ported tests do not include. In a handful of places there are things which the Python driver checks, before sending a request - not giving us an opportunity to check how the server handles such errors. Another notable change in this port is that the original tests repeated a lot of tests with and without a "nodetool flush". In this port I chose to stub the flush() function - it does NOT flush. I think the point of these tests is to check the correctness of the CQL features - not to verify that memtable flush works correctly. 
Doing a real memtable flush is not only slow, it also doesn't really check much (Scylla may still serve data from cache, not sstables). So I decided it is pointless. An important goal of this patch is that all 50 tests (except three tests skipped because Python has client-side checking) pass when run on Cassandra (with test/cql-pytest/run-cassandra). This is very important: It was very easy to make mistakes while porting the tests, and I did make many such mistakes; but running them against Cassandra allowed me to fix those mistakes - because the correct tests should pass on Cassandra. And now they do. Unfortunately, the new tests are significantly slower than what we've been accustomed to in Alternator/CQL tests. The 50 tests create more than a hundred tables, udfs, udts, and similar slow operations - they do not reuse anything via fixtures. The total time for these 50 tests (in dev build mode) is around 18 seconds. Just one test - testMapWithLargePartition - is responsible for almost half (!) of that time - we should consider in the future whether it's worth it or can be made smaller. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201215155802.2867386-1-nyh@scylladb.com>	2020-12-22 13:22:09 +02:00
Benny Halevy	56aa49ca81	token_metadata: shared_token_metadata: add mutate_token_metadata mutate_token_metadata acquires the shared_token_metadata lock, clones the token_metadata (using clone_async) and calls an asynchronous functor on the cloned copy of the token_metadata to mutate it. If the functor is successful, the mutated clone is set back to the shared_token_metadata; otherwise, the clone is destroyed. With that, get rid of shared_token_metadata::clone Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-22 11:22:19 +02:00
Benny Halevy	e089c22ec1	token_metdata: futurize update_normal_tokens The function complexity is O(#tokens) in the worst case, as for each endpoint token it traverses _token_to_endpoint_map linearly to erase the endpoint mapping if it exists. This change renames the current implementation of update_normal_tokens to update_normal_tokens_sync and clones the code as a coroutine that returns a future and may yield if needed. Eventually we should futurize the whole token_metadata and abstract_replication_strategy interface and get rid of the synchronous functions. Until then the sync version is still required from call sites that are neither returning a future nor run in a seastar thread. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2020-12-22 10:35:15 +02:00
Piotr Sarna	da7e87dc56	test: add cases for using timeout with bind markers The test suite for USING TIMEOUT already included binding the timeout value, but only for wildcard (?). The test case is now extended with named bind markers. Tests: unit(dev) Message-Id: <b5344f40d26d90b36e90a04c2474127728535eaa.1608573624.git.sarna@scylladb.com>	2020-12-22 09:03:56 +02:00
Avi Kivity	b3e39d81aa	Merge 'Avoid scanning sstables in parallel for TWCS single-partition queries' from Kamil Braun We introduce a new single-key sstable reader for sstables created by `TimeWindowCompactionStrategy`. The reader uses the fact that sstables created by TWCS are mostly disjoint with respect to the contained `position_in_partition`s in order to avoid having multiple sstable readers opened at the same time unnecessarily. In case there are overlapping ranges (for example, in the current time-window), it performs the necessary merging (it uses `clustering_order_reader_merger`, introduced recently). The reader uses min/max clustering key metadata present in `md` sstables in order to decide when to open or close a sstable reader. The following experiment was performed: 1. create a TWCS table with 1 minute windows 2. fill the table with 8 equal windows of data (each window flushed to a separate sstable) 3. perform `select * from ks.t where pk = 0 limit 1` query with and without the change The expectation is that with the commit, only one sstable will be opened to fetch that one row; without the commit all 8 sstables would be opened at once. The difference in the value of `scylla_reactor_aio_bytes_read` was measured (value after the query minus value before the query), both with and without the commit. With the commit, the difference was 67584. Without the commit, the difference was 528384. 528384 / 67584 ~= 7.8. Fixes #6418. 
Closes #7437 * github.com:scylladb/scylla: sstables: gather clustering key filtering statistics in TWCS single key reader sstables: use time_series_sstable_set in time_window_compaction_strategy sstable_set: new reader for TWCS single partition queries mutation_reader_test: test clustering_order_reader_merger with time_series_sstable_set sstable_set: introduce min_position_reader_queue sstable_set: introduce time_series_sstable_set sstables: add min_position and max_position accessors sstable_set: make create_single_key_sstable_reader a virtual method clustering_order_reader_merger: fix the 0 readers case	2020-12-19 23:53:18 +02:00
Kamil Braun	b41139a07f	mutation_reader_test: test clustering_order_reader_merger with time_series_sstable_set	2020-12-18 16:33:27 +01:00
Kamil Braun	5e846b33b8	clustering_order_reader_merger: fix the 0 readers case With 0 readers the merger would produce a `partition_end` fragment when it should immediately return `end_of_stream` instead.	2020-12-18 12:30:40 +01:00
Avi Kivity	a60c81b615	Merge 'cql3: Fix handling of impossible restrictions on a primary-key column' from Dejan Mircevski There were two problems with handling conflicting equalities on the same PK column (eg, c=1 AND c=0): 1. When the column is indexed, Scylla crashed (#7772) 2. Computing ranges and slices was throwing an exception This series fixes them both; it also happens to resolve some old TODOs from restriction_test. Tests: unit (dev, debug) Closes #7804 * github.com:scylladb/scylla: cql3: Fix value_for when restriction is impossible cql3: Fix range computation for p=1 AND p=1	2020-12-17 12:01:36 +02:00
Dejan Mircevski	46b4b59945	cql3: Fix value_for when restriction is impossible Previously, single_column_restrictions::value_for() assumed that a column's restriction specifies exactly one value for the column. But since `37ebe521e3`, multiple equalities on the same column are allowed, so the restriction could be a conjunction of conflicting equalities (eg, c=1 AND c=0). That violates an assert and crashes Scylla. This patch fixes value_for() by gracefully handling the impossible-restriction case. Fixes #7772 Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-12-16 15:00:29 -05:00
Dejan Mircevski	4bb1107652	cql3: Fix range computation for p=1 AND p=1 Previously compute_bounds was assuming that primary-key columns are restricted by exactly one equality, resulting in the following error: query 'select p from t where p=1 and p=1' failed: std::bad_variant_access (std::get: wrong index for variant) This patch removes that assumption and deals correctly with the multiple-equalities case. As a byproduct, it also stops raising "invalid null value" exceptions for null RHS values. Signed-off-by: Dejan Mircevski <dejan@scylladb.com>	2020-12-16 14:46:48 -05:00
Avi Kivity	fdb47c954d	Merge "idl: allow IDL compiler to parse `const` specifiers for template arguments" from Pavel S " This patch series consists of the following patches: 1. The first one turned out to be a massive rewrite of almost everything in `idl-compiler.py`. It aims to decouple parser structures from the internal representation which is used in the code-generation itself. Prior to the patch everything was working with raw token lists and the code was extremely fragile and hard to understand and modify. Moreover, every change in the parser code caused a cascade effect of breaking things at many different places, since they were relying on the exact format of output produced by parsing rules. Now there is a bunch of supplementary AST structures which provide hierarchical and strongly typed structure as the output of parsing routine. It is much easier to verify (by the means of `isinstance`, for example) and extend since the internal structures used in code-generation are decoupled from the structure of parsing rules, which are now controlled by custom parse actions providing high-level abstractions. It is tested manually by checking that the old code produces exactly the same autogenerated sources for all Scylla IDLs as the new one. 2 and 3. Cosmetics changes only: fixed a few typos and moved from old-fashioned `string.Template` to python f-strings. This improves readability of the idl-compiler code by a lot. Only one non-functional whitespace change introduced. 4. This patch adds a very basic support for the parser to understand `const` specifier in case it's used with a template parameter for a data member in a class, e.g. struct my_struct { std::vector<const raft::log_entry> entries; }; It actually does two things: * Adjusts `static_asserts` in corresponding serializer methods to match const-ness of fields. * Defines a second serializer specialization for const type in `.dist.hh` right next to non-const one. 
This seems to be sufficient for raft-related uses for now. Please note there is no support for the following cases, though: const std::vector<raft::log_entry> entries; const raft::term_t term; None of the existing IDLs are affected by the change, so that we can gradually improve on the feature and write the idl unit-tests to increase test coverage with time. 5. A basic unit-test that writes a test struct with an `std::vector<S<const T>>` field and reads it back to verify that serialization works correctly. 6. Basic documentation for AST classes. TODO: should also update the docs in `docs/IDL.md`. But it is already quite outdated, and some changes would even be out of scope for this patch set. " * 'idl-compiler-refactor-v5' of https://github.com/ManManson/scylla: idl: add docstrings for AST classes idl: add unit-test for `const` specifiers feature idl: allow to parse `const` specifiers for template arguments idl: fix a few typos in idl-compiler idl: switch from `string.Template` to python f-strings and format string in idl-compiler idl: Decouple idl-compiler data structures from grammar structure	2020-12-16 14:05:33 +02:00
Pavel Solodovnikov	1e6df841a5	idl: add unit-test for `const` specifiers feature Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com>	2020-12-15 16:03:18 +03:00
Piotr Sarna	13317f7698	alternator: ensure correct isolation level in tracing tests Taking advantage of the fact that isolation level can be defined for a table with a tag, the tracing test that relies on CAS can now be sure to have a correct isolation level. Message-Id: <43f005ab9d566c7d3d55ce93c553127b1df9e87f.1607954739.git.sarna@scylladb.com>	2020-12-14 17:37:55 +02:00
Piotr Sarna	7081e361cc	test: add isolation level requirement message to tracing tests Alternator tracing tests require the cluster to have the 'always' isolation level configured to work properly. If that's not the case, the tests will fail due to not having CAS-related traces present in the logs. In order to help the users fix their configuration, a helper message is printed before the test case is performed. Automatic tests do not need this, because they are all run with a matching isolation level, but this message could greatly improve the user experience for manual tests. Message-Id: <62bcbf60e674f57a55c9573852b6a28f99cbf408.1607949754.git.sarna@scylladb.com>	2020-12-14 14:53:58 +02:00
Piotr Sarna	4b0303d8ae	tests: make alternator tracing tests idempotent The outcome of alternator tracing tests was that tracing probability was always set to 0 after the test was finished. That makes sense for most test runs, but manual tests can work on existing clusters with tracing probability set to some other value. In order to preserve the previous trace probability, the value is now extracted and stored, so that it can be restored after the test is done. Message-Id: <94f829b63f92847b4abb3b16f228bf9870f90c2e.1607949754.git.sarna@scylladb.com>	2020-12-14 14:53:23 +02:00
Benny Halevy	0e79e0f215	test: mutation_diff: extend section markers When the different mutations are printed via BOOST_REQUIRE_EQUAL, we don't get the "expect {} but got {}" section markers. Instead, the parts we're interested in are bracketed like "critical check X == Y has failed [{} != {}]" Test: with both formats: - https://github.com/scylladb/scylla/files/3890627/test_concurrent_reads_and_eviction.log - https://github.com/scylladb/scylla/files/4303117/flat_mutation_reader_test.118.log - https://github.com/scylladb/scylla/files/5687372/flat_mutation_reader_test.172.log.gz Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20201214100521.3814909-1-bhalevy@scylladb.com>	2020-12-14 12:11:34 +02:00
Nadav Har'El	72cb3e9255	alternator test: add missing wait for update_table to finish Three tests in test_streams.py run update_table() on a table without waiting for it to complete, and then call update_table() on the same table or delete it. This always works in Scylla, and usually works in AWS, but if we reach the second call, it may fail because the previous update_table() did not take effect yet. We sometimes see these failures when running the Alternator test suite against AWS. So in this patch, after each update_table() we wait for the table to return from UPDATING to ACTIVE status. The entire Alternator test suite now passes (or is skipped) on AWS, so: Fixes #7778. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201213164931.2767236-1-nyh@scylladb.com>	2020-12-14 09:18:38 +01:00
Nadav Har'El	43ce0aef3d	alternator test: fix test wrongly failing on AWS The test test_query_filter.py::test_query_filter_paging fails on AWS and shouldn't fail, so this patch fixes the test. Note that this is only a test problem - no fix is needed for Alternator itself. The test reads 20 results with 1-result pages, and assumed that 21 pages are returned. The 21st page may happen because when the server returns the 20th, it might not yet know there will be no additional results, so another page is needed - and will be empty. Still a different implementation might notice that the last page completed the iteration, and not return an extra empty page. This is perfectly fine, and this is what AWS DynamoDB does today - and should not be considered an error. Refs #7778 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201213143612.2761943-1-nyh@scylladb.com>	2020-12-14 09:18:31 +01:00
Nadav Har'El	4ab98a4c68	alternator: use a more specific error when Authorization header is missing When request signature checking is enabled in Alternator, each request should come with the appropriate Authorization header. Most errors in preparing this header will result in an InvalidSignatureException response; but DynamoDB returns a more specific error when this header is completely missing: MissingAuthenticationTokenException. We should do the same, but before this patch we returned InvalidSignatureException also for a missing header. The test test_authorization.py::test_no_authorization_header used to enshrine our wrong error message, and failed when run against AWS. After this patch, we fix the error message and the test - which now passes against both Alternator and AWS. Refs #7778. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201213133825.2759357-1-nyh@scylladb.com>	2020-12-14 09:18:24 +01:00
Piotr Sarna	d6e7e36280	test: add prepared statement tests to USING TIMEOUT suite	2020-12-14 07:50:40 +01:00
Piotr Sarna	0148b41a02	test: add a test suite for USING TIMEOUT The test suite is based on cql-pytest and checks if USING TIMEOUT works as expected.	2020-12-14 07:50:40 +01:00
Avi Kivity	19aaf8eb83	Merge "Remove global storage service from index manager" from Pavel E " The initial intent was to remove call for global storage service from secondary index manager's create_view_for_index(), but while fixing it one of intermediate schema table's helper managed to benefit from it by re-using the database reference flying by. The cleanup is done by simply pushing the database reference along the stack from the code that already has it down the create_view_for_index(). tests: unit(dev) " * 'br-no-storages-in-index-and-schema' of https://github.com/xemul/scylla: schema-tables: Use db from make_update_table_mutations in make_update_indices_mutations schema-tables: Add database argument to make_update_table_mutations schema-tables: Factor out calls getting database instance index-manager: Move feature evaluation one level up	2020-12-13 12:41:51 +02:00
Pekka Enberg	c990f2bd34	Merge 'Reinstate [[nodiscard]] support' from Avi Kivity The switch to clang disabled the clang-specific -Wunused-value since it generated some harmless warnings. Unfortunately, that also prevented [[nodiscard]] violations from warning. Fix by clearing all instances of the warning (including [[nodiscard]] violations that crept in while it was disabled) and reinstating the warning. Closes #7767 * github.com:scylladb/scylla: build: reinstate -Wunused-value warning for [[nodiscard]] test: lib: don't ignore future in compare_readers() test: mutation_test: check both ranges when comparing summaries serialializer: silence unused value warning in variant deserializer	2020-12-12 09:54:05 +02:00
Pavel Emelyanov	89fd524c5a	schema-tables: Add database argument to make_update_table_mutations There are 3 callers of this helper (cdc, migration manager and tests) and all of them already have the database object at hands. The argument will be used by next patch to remove call for global storage proxy instance from make_update_indices_mutations. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-11 21:21:22 +03:00
Raphael S. Carvalho	e4b55f40f3	sstables: Fix sstable reshaping for STCS The heuristic of STCS reshape is correct, and it built the compaction descriptor correctly, but forgot to return it to the caller, so no reshape was ever done on behalf of STCS even when the strategy needed it. Fixes #7774. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20201209175044.1609102-1-raphaelsc@scylladb.com>	2020-12-10 12:45:25 +02:00
Nadav Har'El	a8fdbf31cd	alternator: fix UpdateItem ADD for non-existent attribute UpdateItem's "ADD" operation usually adds elements to an existing set or adds a number to an existing counter. But it can also be used to create a new set or counter (as if adding to an empty set or zero). We unfortunately did not have a test for this case (creating a new set or counter), and when I wrote such a test now, I discovered the implementation was missing. So this patch adds both the test and the implementation. The new test used to fail before this patch, and passes with it - and passes on DynamoDB. Note that we only had this bug for the newer UpdateItem syntax. For the old AttributeUpdates syntax, we already support ADD actions on missing attributes, and already tested it in test_update_item_add(). I just forgot to test the same thing for the newer syntax, so I missed this bug :-( Fixes #7763. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201207085135.2551845-1-nyh@scylladb.com>	2020-12-09 18:44:30 +01:00
Nadav Har'El	781f9d9aca	alternator: make default timeout configurable Whereas in CQL the client can pass a timeout parameter to the server, in the DynamoDB API there is no such feature; the server needs to choose reasonable timeouts for its own internal operations - e.g., writes to disk, querying other replicas, etc. Until now, Alternator had a fixed timeout of 10 seconds for its requests. This choice was reasonable - it is much higher than we expect during normal operations, and still lower than the client-side timeouts that some DynamoDB libraries have (boto3 has a one-minute timeout). However, there's nothing holy about this number of 10 seconds; some installations might want to change this default. So this patch adds a configuration option, "--alternator-timeout-in-ms", to choose this timeout. As before, it defaults to 10 seconds (10,000ms). In particular, some test runs are unusually slow - consider for example testing a debug build (which is already very slow) in an extremely over-committed test host. In some cases (see issue #7706) we noticed the 10 second timeout was not enough. So in this patch we increase the default timeout chosen in the "test/alternator/run" script to 30 seconds. Please note that as the code is structured today, this timeout only applies to some operations, such as GetItem, UpdateItem or Scan, but does not apply to CreateTable, for example. This is a pre-existing issue that this patch does not change. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201207122758.2570332-1-nyh@scylladb.com>	2020-12-09 14:30:43 +01:00
Avi Kivity	f802356572	Revert "Revert "Merge "raft: fix replication if existing log on leader" from Gleb"" This reverts commit `dc77d128e9`. It was reverted due to a strange and unexplained diff, which is now explained. The HEAD on the working directory being pulled from was set back, so git thought it was merging the intended commits, plus all the work that was committed from HEAD to master. So it is safe to restore it.	2020-12-08 19:19:55 +02:00
Avi Kivity	1badd315ef	Merge "Speed up devel tests 10 times" from Pavel E " The multishard_mutation_query test is toooo slow when built with clang in dev mode. By reducing the number of scans it's possible to shrink the full suite run time from half an hour down to ~3 minutes. tests: unit(dev) " * 'br-devel-mode-tests' of https://github.com/xemul/scylla: test: Make multishard_mutation_query test do less scans configure: Add -DDEVEL to dev build flags	2020-12-08 15:42:12 +02:00
Pavel Emelyanov	b837cf25b1	test: Make multishard_mutation_query test do less scans When built by clang this dev-mode test takes ~30 minutes to complete. Let's reduce this time by reducing the scale of the test if DEVEL is set. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-08 15:55:04 +03:00
Avi Kivity	98271a5c57	Merge 'types: don't linearize in serialize_for_cql()' from Michał Chojnowski A sequel to #7692. This series gets rid of linearization in `serialize_for_cql`, which serializes collections and user types from `collection_mutation_view` to CQL. We switch from `bytes` to `bytes_ostream` as the intermediate buffer type. The only user of `serialize_for_cql` immediately copies the result to another `bytes_ostream`. We could avoid some copies and allocations by writing to the final `bytes_ostream` directly, but it's currently hidden behind a template. Before this series, `serialize_for_cql_aux()` delegated the actual writing to `collection_type_impl::pack` and `tuple_type_impl::build_value`, by passing them an intermediate `vector`. After this patch, the writing is done directly in `serialize_for_cql_aux()`. Pros: we avoid the overhead of creating an intermediate vector, without bloating the source code (because creating that intermediate vector requires just as much code as serializing the values right away). Cons: we duplicate the CQL collection format knowledge contained in `collection_type_impl::pack` and `tuple_type_impl::build_value`. Refs: #6138 Closes #7771 * github.com:scylladb/scylla: types: switch serialize_for_cql from bytes to bytes_ostream types: switch serialize_for_cql_aux from bytes to bytes_ostream types: serialize user types to bytes_ostream types: serialize lists to bytes_ostream types: serialize sets to bytes_ostream types: serialize maps to bytes_ostream utils: fragment_range: use range-based for loop instead of boost::for_each types: add write_collection_value() overload for bytes_ostream and value_view	2020-12-08 12:38:36 +02:00
Nadav Har'El	86779664f4	alternator: fix broken Scan/Query paging with bytes keys When an Alternator table has partition keys or sort keys of type "bytes" (blobs), a Scan or Query which required paging used to fail - we used an incorrect function to output LastEvaluatedKey (which tells the user where to continue at the next page), and this incorrect function was correct for strings and numbers - but NOT for bytes (for bytes, we need to encode them as base-64). This patch also includes two tests - for bytes partition key and for bytes sort key - that failed before this patch and now pass. The test test_fetch_from_system_tables also used to fail after a Limit was added to it, because one of the tables it scans had a bytes key. That test is also fixed by this patch. Fixes #7768 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201207175957.2585456-1-nyh@scylladb.com>	2020-12-08 09:38:23 +01:00
Michał Chojnowski	d43fd456cd	types: switch serialize_for_cql from bytes to bytes_ostream Now we can serialize collections from collection_mutation_view_description without linearizations.	2020-12-07 17:55:36 +01:00
Avi Kivity	8fc0bbd487	test: lib: don't ignore future in compare_readers() A fast_forward_to() call is not waited on in compare_readers(). Since this is called in a thread, add a future::get() call to wait for it.	2020-12-07 16:50:20 +02:00
Avi Kivity	732d83dc0e	test: mutation_test: check both ranges when comparing summaries A copy/paste error means we ignore the termination of one of the ranges. Change the comma expression to a disjunction to avoid the unused value warning from clang. The code is not perfect, since if the two ranges are not the same size we'll invoke undefined behavior, but it is no worse than before (where we ignored the comparison completely).	2020-12-07 16:47:52 +02:00
Nadav Har'El	220d6dde17	alternator, test: make test_fetch_from_system_tables faster The test test_fetch_from_system_tables tests Alternator's system-table feature by reading from all system tables. The intention was to confirm we don't crash reading any of them - as they have different schemas and can run into different problems (we had such problems in the initial implementation). The intention was not to read a lot from each table - we only make a single "Scan" call on each, to read one page of data. However, the Scan call did not set a Limit, so the single page can get pretty big. This is not normally a problem, but in extremely slow runs - such as when running the debug build on an extremely overcommitted test machine (e.g., issue #7706) reading this large page may take longer than our default timeout. I'll send a separate patch for the timeout issue, but for now, there is really no reason why we need to read a big page. It is good enough to just read 50 rows (with Limit=50). This will still read all the different types and make the test faster. As an example, in the debug run on my laptop, this test spent 2.4 seconds to read the "compaction_history" table before this patch, and only 0.1 seconds after this patch. 2.4 seconds is close to our default timeout (10 seconds), 0.1 is very far. Fixes #7706 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201207075112.2548178-1-nyh@scylladb.com>	2020-12-07 08:52:31 +01:00
Nadav Har'El	0cd05dd0fd	cql-pytest: add tests for ALLOW FILTERING The original goal of this patch was to replace the two single-node dtests allow_filtering_test and allow_filtering_secondary_indexes_test, which recently caused us problems when we wanted to change the ALLOW FILTERING behavior but the tests were outside the tree. I'm hoping that after this patch, those two tests could be removed from dtest. But this patch actually tests more cases than those original dtests, and moreover tests not just whether ALLOW FILTERING is required or not, but also that the results of the filtering are correct. Currently, four of the included tests are expected to fail ("xfail") on Scylla, reproducing two issues: 1. Refs #5545: "WHERE x IN ..." on indexed column x wrongly requires ALLOW FILTERING 2. Refs #7608: "WHERE c=1" on clustering key c should require ALLOW FILTERING, but doesn't. All tests, except the one for issue #5545, pass on Cassandra. That one fails on Cassandra because it doesn't support IN on an indexed column at all (regardless of whether ALLOW FILTERING is used or not). Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201115124631.1224888-1-nyh@scylladb.com>	2020-12-06 19:51:25 +02:00
Pavel Solodovnikov	56c0fcfcb2	cql_query_test: handle `bounce_to_shard` msg in `test_null_value_tuple_floating_types_and_uuids` Use `prepared_on_shard` helper function to handle `bounce_to_shard` messages that can happen when using LWT statements. Fixes: #7757 Tests: unit(dev) Signed-off-by: Pavel Solodovnikov <pa.solodovnikov@scylladb.com> Message-Id: <20201204172944.601730-1-pa.solodovnikov@scylladb.com>	2020-12-06 19:34:13 +02:00
Avi Kivity	ca950e6f08	Merge "Remove get_local_storage_service() from counters" from Pavel E " The storage service is called there to get the cached value of db::system_keyspace::get_local_host_id(). Keeping the value on database decouples it from storage service and kills one more global storage service reference. tests: unit(dev) " * 'br-remove-storage-service-from-counters-2' of https://github.com/xemul/scylla: counters: Drop call to get_local_storage_service and related counters: Use local id arg in transform_counter_update_to_shards database: Have local id arg in transform_counter_updates_to_shards() storage_service: Keep local host id to database	2020-12-06 16:15:21 +02:00
Avi Kivity	dc77d128e9	Revert "Merge "raft: fix replication if existing log on leader" from Gleb" This reverts commit `0aa1f7c70a`, reversing changes made to `72c59e8000`. The diff is strange, including unrelated commits. There is no understanding of the cause, so to be safe, revert and try again.	2020-12-06 11:34:19 +02:00
Pavel Emelyanov	62214e2258	database: Have local id arg in transform_counter_updates_to_shards() There are two places that call it -- database code itself and tests. The former already has the local host id, so just pass one. The latter are a bit trickier. Currently they use the value from storage_service created by storage_service_for_tests, but since this version of service doesn't pass through prepare_to_join() the local_host_id value there is default-initialized, so just default-initialize the needed argument in place. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2020-12-04 15:09:30 +03:00
Avi Kivity	a95c2a946c	Merge 'mutation_reader: introduce clustering_order_reader_merger' from Kamil Braun This abstraction is used to merge the output of multiple readers, each opened for a single partition query, into a non-decreasing stream of mutation_fragments. It is similar to `mutation_reader_merger`, but an important difference is that the new merger may select new readers in the middle of a partition after it already returned some fragments from that partition. It uses the new `position_reader_queue` abstraction to select new readers. It doesn't support multi-partition (ring range) queries. The new merger will be later used when reading from sstable sets created by TimeWindowCompactionStrategy. This strategy creates many sstables that are mostly disjoint w.r.t the contained clustering keys, so we can delay opening sstable readers when querying a partition until after we have processed all mutation fragments with positions before the keys contained by these sstables. A microbenchmark was added that compares the existing combining reader (which uses `mutation_reader_merger` underneath) with a new combining reader built using the new `clustering_order_reader_merger` and a simple queue of readers that returns readers from some supplied set. The used set of readers is built from the following ranges of keys (each range corresponds to a single reader): `[0, 31]`, `[30, 61]`, `[60, 91]`, `[90, 121]`, `[120, 151]`. The microbenchmark runs the reader and divides the result by the number of mutation fragments. 
The results on my laptop were: ``` $ build/release/test/perf/perf_mutation_readers -t clustering_combined.* -r 10 single run iterations: 0 single run duration: 1.000s number of runs: 10 test iterations median mad min max clustering_combined.ranges_generic 2911678 117.598ns 0.685ns 116.175ns 119.482ns clustering_combined.ranges_specialized 3005618 111.015ns 0.349ns 110.063ns 111.840ns ``` `ranges_generic` denotes the existing combining reader, `ranges_specialized` denotes the new reader. Split from https://github.com/scylladb/scylla/pull/7437. Closes #7688 * github.com:scylladb/scylla: tests: mutation_source_test for clustering_order_reader_merger perf: microbenchmark for clustering_order_reader_merger mutation_reader_test: test clustering_order_reader_merger in memory test: generalize `random_subset` and move to header mutation_reader: introduce clustering_order_reader_merger	2020-12-02 12:15:35 +02:00
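The lazy-opening idea described in the merge above can be sketched in Python. This is only an illustration under stated assumptions, not the C++ implementation: `merge_lazily` is a hypothetical name, each reader is modelled as a sorted iterable of integer positions, and every reader comes with a known lower bound on the positions it can return (standing in for what `position_reader_queue` would supply).

```python
import heapq
from typing import Iterable, Iterator, List, Tuple


def merge_lazily(readers: List[Tuple[int, Iterable[int]]]) -> Iterator[int]:
    """Merge sorted readers into one non-decreasing stream.

    Each entry is (lower_bound, fragments). A reader is opened only when
    the merged stream is about to reach its lower bound. All names here
    are hypothetical and chosen for illustration.
    """
    pending = sorted(readers, key=lambda r: r[0])  # stand-in for the reader queue
    heap = []          # (position, reader id, iterator) for opened readers
    next_pending = 0   # index of the first reader not opened yet
    opened = 0         # unique id, so iterators are never compared
    while heap or next_pending < len(pending):
        # Open every pending reader whose lower bound is not after the
        # smallest position that could be emitted next.
        while next_pending < len(pending) and (
            not heap or pending[next_pending][0] <= heap[0][0]
        ):
            it = iter(pending[next_pending][1])
            next_pending += 1
            first = next(it, None)
            if first is not None:
                heapq.heappush(heap, (first, opened, it))
                opened += 1
        if not heap:
            continue  # only empty readers were left
        pos, rid, it = heapq.heappop(heap)
        yield pos
        following = next(it, None)
        if following is not None:
            heapq.heappush(heap, (following, rid, it))


# Overlapping key ranges like the ones used by the microbenchmark.
ranges = [(0, range(0, 32)), (30, range(30, 62)), (60, range(60, 92))]
merged = list(merge_lazily(ranges))
```

With these ranges the second reader is not opened until the stream reaches position 30, which is the saving the merge description attributes to mostly disjoint sstables.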
Kamil Braun	502ed2e9f7	tests: mutation_source_test for clustering_order_reader_merger	2020-12-02 11:13:58 +01:00
Nadav Har'El	fae2ba60e9	cql-pytest: start to port Cassandra's CQL unit tests In issue #7722, it was suggested that we should port Cassandra's CQL unit tests into our own repository, by translating the Java tests into Python using the new cql-pytest framework. Cassandra's CQL unit test framework is orders of magnitude faster than dtest, and in-tree, so Cassandra have been moving many CQL correctness tests there, and we can also benefit from their test cases. In this patch, we take the first step in a long journey: 1. I created a subdirectory, test/cql-pytest/cassandra_tests, where all the translated Cassandra tests will reside. The structure of this directory will mirror that of the test/unit/org/apache/cassandra/cql3 directory in the Cassandra repository. pytest conveniently looks for test files recursively, so when all the cql-pytest tests are run, the cassandra_tests files will be run as well. As usual, one can also run only a subset of all the tests, e.g., "test/cql-pytest/run -vs cassandra_tests" runs only the tests in the cassandra_tests subdirectory (and its subdirectories). 2. I translated into Python two of the smallest test files - validation/entities/{TimeuuidTest,DataTypeTest}.java - containing just three test functions. The plan is to translate entire Java test files one by one, and to mirror their original location in our own repository, so it will be easier to remember what we already translated and what remains to be done. 3. I created a small library, porting.py, of functions which resemble the common functions of the Java tests (CQLTester.java). These functions aim to make porting the tests easier. Despite the resemblance, the ported code is not 100% identical (of course) and some effort is still required in this porting. As we continue this porting effort, we'll probably need more of these functions, and can also continue to improve them to reduce the porting effort. Refs #7722. 
Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201201192142.2285582-1-nyh@scylladb.com>	2020-12-02 09:29:22 +01:00
Nadav Har'El	5c08489569	cql-pytest: don't run tests if Scylla boot timed out In test/cql-pytest/run.py we have a 200 second timeout to boot Scylla. I never expected to reach this timeout - it normally takes (in dev build mode) around 2 seconds, but in one run on Jenkins we did reach it. It turns out that the code did not recognize this timeout correctly: it assumed that Scylla had booted, and then every subtest failed when it could not connect to Scylla. This patch fixes the timeout logic. After the timeout, if Scylla's CQL port is still not responsive, the test run is failed - without trying to run many individual tests. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20201201150927.2272077-1-nyh@scylladb.com>	2020-12-02 08:48:44 +02:00
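The timeout behavior described in the commit above might look roughly like the following. This is a minimal sketch, not the code in test/cql-pytest/run.py: the function names, the injected `is_responsive` probe, and the polling interval are all assumptions made for illustration.

```python
import time
from typing import Callable


def wait_until_responsive(is_responsive: Callable[[], bool],
                          timeout: float,
                          poll_interval: float = 0.1) -> bool:
    """Poll until the server answers or the timeout expires.

    `is_responsive` is a hypothetical probe, e.g. an attempt to connect
    to the CQL port. Returns True only if the server answered.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if is_responsive():
            return True
        time.sleep(poll_interval)
    # One last probe after the deadline, so that a timeout is never
    # mistaken for a successful boot.
    return is_responsive()


def run_tests_if_booted(is_responsive: Callable[[], bool],
                        run_tests: Callable[[], int],
                        timeout: float = 200.0) -> int:
    """Fail the whole run at once instead of failing every test."""
    if not wait_until_responsive(is_responsive, timeout):
        raise RuntimeError("server did not become responsive before the timeout")
    return run_tests()
```

The point of returning an explicit boolean is that the caller has to decide what a timeout means, rather than falling through to the tests by default.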
Kamil Braun	2da723b9c8	cdc: produce postimage when inserting with no regular columns When a row was inserted into a table with no regular columns, and no such row existed in the first place, postimage would not be produced. Fix this. Fixes #7716. Closes #7723	2020-12-01 18:01:23 +02:00
Dejan Mircevski	e45af3b9b8	index: Ensure restriction is supported in find_idx Previously, statement_restrictions::find_idx() would happily return an index for a non-EQ restriction (because it checked only the column name, not the operator). This is incorrect: when the selected index is for a non-EQ restriction, it is impossible to query that index table. Fixes #7659. Tests: unit (dev) Signed-off-by: Dejan Mircevski <dejan@scylladb.com> Closes #7665	2020-12-01 15:16:48 +02:00
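The fix described in the commit above amounts to matching on the operator as well as the column name. A rough Python sketch of that check follows; the types and the function `find_supported_index` are hypothetical and are not the `statement_restrictions` API, and the assumption that an index supports only `=` is made purely for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Restriction:
    column: str
    op: str  # e.g. "=", ">", "IN"


@dataclass(frozen=True)
class Index:
    name: str
    column: str
    # Assumption for this sketch: an index can serve only EQ restrictions.
    supported_ops: frozenset = frozenset({"="})


def find_supported_index(restrictions: List[Restriction],
                         indexes: List[Index]) -> Optional[Index]:
    """Return an index only if it can serve one of the restrictions.

    Checking the column name alone would select an index for a
    restriction such as "x > 1", which the index cannot answer.
    """
    for restriction in restrictions:
        for index in indexes:
            if (index.column == restriction.column
                    and restriction.op in index.supported_ops):
                return index
    return None


indexes = [Index(name="x_idx", column="x")]
eq_choice = find_supported_index([Restriction("x", "=")], indexes)
gt_choice = find_supported_index([Restriction("x", ">")], indexes)
```

Here `eq_choice` is the index on `x`, while `gt_choice` is `None`, so the caller has to fall back to a path that does not use the index.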
Alejo Sanchez	72a64b05ea	raft: replication test: fix total entries for initial snapshot Since now total expected entries are updated by load snapshot, do not trim the total entries expected values with the initial snapshot on test state machine initialization. reported by @gleb Branch URL: https://github.com/alecco/scylla/tree/raft-ale-tests-06-snapshot-total-entries Tests: unit ({dev}), unit ({debug}), unit ({release}) Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com> Message-Id: <20201125171232.321992-1-alejo.sanchez@scylladb.com>	2020-11-30 21:34:31 +01:00
Kamil Braun	af49a95627	perf: microbenchmark for clustering_order_reader_merger	2020-11-30 11:55:44 +01:00


1079 Commits