scylladb

mirror of https://github.com/scylladb/scylladb.git synced 2026-06-07 07:23:15 +00:00

Author	SHA1	Message	Date
Benny Halevy	4062cd17e0	test: hashers_test: mutation_fragment_sanity_check: stop semaphore To stop the semaphore as required we need run the test in a seastar thread. Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211024053402.990142-1-bhalevy@scylladb.com>	2021-10-24 11:29:23 +03:00
Benny Halevy	0746b5add6	storage_service: replicate_to_all_cores: update all keyspaces Currently we update the effective_replication_map only on non-system keyspace, leaving the system keyspace, that uses the local replication strategy, with the empty replication_map, as it was first initialized. This may lead to a crash when get_ranges is called later as seen in #9494 where get_ranges was called from the perform_sstable_upgrade path. This change updates the effective_replication_map on all keyspaces rather than just on the non-system ones and adds a unit test that reproduces #9494 without the fix and passes with it. Fixes #9494 Test: unit(dev), database_test(debug) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211020143217.243949-1-bhalevy@scylladb.com>	2021-10-20 17:54:23 +03:00
Nadav Har'El	e4a6569258	config: experimental flag UNUSED_CDC shouldn't be distinct from UNUSED When an experimental feature graduates from being experimental, we want to continue allow the old "--experimental-features=..." option to work, in case some user's configuration uses it - just do nothing. The way we do it is to map in db::experimental_features_t::map() the feature's name to the UNUSED value - this way the feature's name is accepted, but doesn't change anything. When the CDC feature graduated from being experimental, a new bit UNUSED_CDC was introduced to do the same thing. This separate bit was not actually necessary - if we ever check for UNUSED_CDC bit anywhere in the code it means the flag isn't actually unused ;-) And we don't check it. So simplify the code by conflating UNUSED_CDC into UNUSED. This will also make it easy to build from db::experimental_features_t::map() a list of current experimental features - now it will simply be those that do not map to UNUSED. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211013105107.123544-1-nyh@scylladb.com>	2021-10-20 17:54:17 +03:00
Nadav Har'El	88afcc7fe3	Merge 'cql-pytest: Forbid deletions based on secondary index' from Piotr Sarna This series fixes a bug which allowed using a secondary index in a restriction for a DELETE statement, which resulted in generating incorrect slices and deleting the whole partition instead. Secondary indexes are not meant to be used for deletes, which this series enforces by marking the indexes as not queriable. It also comes with a reproducing test case, originally provided by @fee-mendes (thanks!). Fixes #9495 Tests: unit(release) Closes #9496 * github.com:scylladb/scylla: cql-pytest: add reproducer for deleting based on secondary index cql3: forbid querying indexes for deletions	2021-10-20 17:54:17 +03:00
Botond Dénes	995a41d422	test/perf/perf_sstable: add support for compaction strategies So the compaction perf of different compaction strategies can be compared. Data timestamps are diversified such that they fall into four different bucket if TWCS is used, in order to be able to stress the timestamp based splitting code path. Closes #9488	2021-10-20 17:54:17 +03:00
Piotr Sarna	83722b5563	cql-pytest: add reproducer for deleting based on secondary index This commit adds a test case for a bug reported by Felipe <felipemendes@scylladb.com>. The bug involves trying to delete an entry from a partition based on a secondary index created on a column which is part of the compound clustering key, and the unfortunate result is that the whole partition gets wiped. Cassandra's behavior is in this case correct - deletion based on a secondary index column is not allowed. Refs #9495	2021-10-19 08:50:20 +02:00
Kamil Braun	22061831c1	Merge 'cql3: keyspace prepare_options: expand replication_factor also for fully qualified NetworkTopologyStrategy' from Benny Halevy It was auto-expanded only if the strategy name was the short "NetworkTopologyStrategy" name. Fixes #9302. Closes #9304. * 'prepare_options' of https://github.com/bhalevy/scylla: cql3: keyspace prepare_options: expand replication_factor also for fully qualified NetworkTopologyStrategy abstract_replication_strategy: add to_qualified_class_name	2021-10-18 16:40:57 +03:00
Raphael S. Carvalho	062436829c	compaction/TWCS: optimize reshape for disjoint sstables spanning multiple windows After `a4053dbb72`, data segregation is postponed to offstrategy, so reshape procedure is called with disjoint sstables which belong to different windows, so let's extend the optimization for disjoint sstables which span more than one window. In this way, write amplification is reduced for offstrategy compaction, as all disjoint sstables will be compacted at once. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211013203046.233540-2-raphaelsc@scylladb.com>	2021-10-18 16:40:57 +03:00
Raphael S. Carvalho	aa4aba40aa	sstables: sstable_run: introduce estimate_droppable_tombstone_ratio Make it possible to estimate dropppable tombstones for sstable runs. The result is averaged by number of fragments composing the run. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Message-Id: <20211014143424.353357-1-raphaelsc@scylladb.com>	2021-10-18 12:24:08 +03:00
Benny Halevy	b9aa92edd4	cql3: keyspace prepare_options: expand replication_factor also for fully qualified NetworkTopologyStrategy It was auto-expanded only if the strategy name was the short "NetworkTopologyStrategy" name. Fixes #9302 Test: cql_query_test.test_rf_expand(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-18 12:18:07 +03:00
Piotr Sarna	4bfaa7d9fc	Merge 'Service levels: fix undefined behaviours' from Eliran Sinvani This mini series contains two fixes that are bundled together since the second one assumes that the first one exists (or it will not fix anything really...), the two problems were: 1. When certain operations are called on a service level controller which doesn't have it's data accessor set, it can lead to a crash since some operations will still try to dereference the accessor pointer. 2. The cql environment test initialized the accessor with a sharded<system_distributed_data>& however this sharded class as itself is not initialized (sharded::start wasn't called), so for the same that were unsafe for null dereference the accessor will now crash for trying to access uninitialized sharded instance. Closes #9468 * github.com:scylladb/scylla: CQL test environment: Fix bad initialization order Service Level Controller: Fix possible dereference of a null pointer	2021-10-18 08:53:53 +02:00
Nadav Har'El	1d751491a3	test/alternator: recognize when Scylla crashes Before this patch, if Scylla crashes during some test in test/alternator, all tests after it will fail because they can't connect to Scylla - and we can get a report on hundreds of failures without a clear sign of where the real problem was. This patch introduces an autouse fixture (i.e., a fixture automatically used by every test) which tries to run a do-nothing health-check request after each test. If this health-check request fails, we conclude that Scylla crashed and report the test in which this happened - and exit pytest instead of failing a hundred more tests. The failure report looks something like this: ``` ! _pytest.outcomes.Exit: Scylla appears to have crashed in test test_batch.py::test_batch_get_item ! ``` And the entire test run fails. These extra health checks are not free, but they come fairly close to being free: In my tests I measured less than 0.1 seconds slowdown of the entire test suite (which has 618 tests) caused by the extra health checks. Fixes #9489 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211017123222.217559-1-nyh@scylladb.com>	2021-10-17 20:45:30 +03:00
Nadav Har'El	86e8979ff2	test/alternator, test/cql-pytest: enable specific experimental features Issue #9467 deprecated the blanket "--experimental" option which we used to enable all experimental Scylla features for testing, and suggests that individual experimental features should be enabled instead. So this is what we do in this patch for the Scylla-running scripts in test/alternator and test/cql-pytest: We need to enable UDF for the CQL tests, and to enable Alternator Streams and Alternator TTL for the Alternator tests. Refs #9467 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211012110312.719654-2-nyh@scylladb.com>	2021-10-15 16:36:35 +03:00
Avi Kivity	4f3b8f38e2	Merge "Add effective_replication_map" from Benny " The current api design of abstract_replication_strategy provides a can_yield parameter to calls that may stall when traversing the token metadata in O(n^2) and even in O(n) for a large number of token ranges. But, to use this option the caller must run in a seastar thread. It can't be used if the caller runs a coroutine or plain async tasks. Rather than keep adding threads (e.g. in storage_service::load_and_stream or storage_service::describe_ring), the series offers an infrastructure change: precalculating the token->endpoints map once, using an async task, and keeping the results in a `effective_replication_map` object. The latter can be used for efficient and stall-free calls, like get_natural_endpoints, or get_ranges/get_primary_range, replacing their equivalents in abstract_replication_strategy, and dropping the public abstract_replication_strategy::calculate_natural_endpoints and its internal cached_endpoints map. Other than the performance benefits of: 1. The current calls require running a thread to yield. Precalculating the map (using async task) allows us to use synchronous calls without stalling the rector. 2. The replication maps can and should be shared between keyspaces that use the same replication strategy. (Will be sent as a follow-up to the series) The bigger benefits (courtesy of Avi Kivity) are laying the groundwork for: 1. atomic replication metadata - an operation can capture a replication map once, and then use consistent information from the map without worrying that it changes under its feet. We may even be able to s/inet_address/replica_ptr/ later. 2. establish boundaries on the use of replication information - by making a replication map not visible, and observing when its reference count drops to zero, we can tell when the new replication map is fully in use. When we start writing to a new node we'll be able to locate a point in time where all writes that were not aware of the new node were completed (this is the point where we should start streaming). Notes: * The get_natural_endpoints method that uses the effective_replication_map is still provided as a abstract_replication_strategy virtual method so that local_strategy can override it and privide natural endpoints for any search token, even in the absence of token_metadata, when\ called early-on, before token_metadata has been established. The effective_replication_map materializes the replication strategy over a given replication strategy options and token_metadata. Whenever either of those change for a keyspace, we make a new effective_replication_map and keep it in the keyspace for latter use. Methods that depend on an ad-hoc token_metadata (e.g. during node operations like bootstrap or replace) are still provided by abstract_replication_strategy. TODO: - effective_replication_map registry - Move pending ranges from token_metadata to replication map - get rid of abstract_replication_strategy::get_range_addresses(token_metadata&) - calculate replication map and use it instead. Test: unit(dev, debug) Dtest: next-gating, bootstrap_test.py update_cluster_layout_tests.py alternator_tests.py -a 'dtest-full,!dtest-heavy' (release) " * tag 'effective_replication_strategy-v6' of github.com:bhalevy/scylla: (44 commits) effective_replication_map: add get_range_addresses abstract_replication_strategy: get rid of shared_token_metadata member and ctor param abstract_replication_strategy: recognized_options: pass const topology& abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map token_metadata: get rid of now-unused sync methods abstract_replication_strategy: get rid of do_calculate_natural_endpoints abstract_replication_strategy: futurize get_address_ranges abstract_replication_strategy: futurize get_range_addresses abstract_replication_strategy: futurize get_ranges(inet_address ep, token_metadata_ptr) abstract_replication_strategy: move get_ranges and get_primary_ranges to effective_replication_map compaction_manager: pass owned_ranges via cleanup/upgrade options abstract_replication_strategy: get rid of cached_endpoints all replication strategies: get rid of do_get_natural_endpoints storage_proxy: use effective_replication_map token_metadata_ptr along with endpoints abstract_replication_strategy: move get_natural_endpoints_without_node_being_replaced to effective_replication_map storage_service: bootstrap: add log messages storage_service: get_mutable_token_metadata_ptr: always invalidate_cached_rings shared_token_metadata: set: check version monotonicity token_metadata: use static ring version token_metadata: get rid of copy constructor and assignment operator ...	2021-10-13 20:28:30 +03:00
Tomasz Grabiec	d8832b9fd8	Merge 'Memtable make reversing reader' from Michał Radwański Make a reader that reads from memtable in reverse order. This draft PR includes two commits, out of which only the second is relevant for review. Described in #9133. Refs #1413. Closes #9174 * github.com:scylladb/scylla: partition_snapshot_reader: pop_range_tombstone returns reference (instead of value) when possible. memtable: enable native reversing partition_snapshot_reader: reverse ck_range when needed by Reversing memtable, partition_snapshot_reader: read from partition in reverse partition_snapshot_reader: rows_position and rows_iter_type supporting reverse iteration partition_snapshot_reader: split responsibility of ck_range partition_snapshot_reader: separate _schema into _query_schema and _partition_schema query: reverse clustering_range test: cql_query_test: fix test_query_limit for reversed queries	2021-10-13 20:24:02 +03:00
Benny Halevy	8c85197c6c	abstract_replication_strategy: get rid of shared_token_metadata member and ctor param It is not used any more. Methods either use the token_metadata_ptr in the effective_replication_map, or receive an ad-hoc token_metadata. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 16:10:06 +03:00
Benny Halevy	4d2561ff75	abstract_replication_strategy: precacluate get_replication_factor for effective_replication_map Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 16:10:06 +03:00
Benny Halevy	dfdc8d4ddb	abstract_replication_strategy: move get_ranges and get_primary_ranges* to effective_replication_map Provide a sync get_ranges method by effective_replication_map that uses the precalculated map to get all token ranges owned by or replicated on a given endpoint. Reuse do_get_ranges as common infrastructure for all 3 cases: get_ranges, get_primary_ranges, and get_primary_ranges_within_dc. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 16:09:51 +03:00
Benny Halevy	5483269dfb	compaction_manager: pass owned_ranges via cleanup/upgrade options So they can be easily computed using an async task before constructing the compaction object in a following patch. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 14:17:46 +03:00
Benny Halevy	3393df45eb	token_metadata, storage_service: unify token_metadata_lock and merge_lock. Serialize the metadata changes with keyspace create, update, or drop. This will become necessary in the following patch when we update the effective_replication_map on all keyspaces and we want instances on all shards end up with the same replication map. Note that storage_service::keyspace_changed is called from the scheme_merge path so it already holds the merge_lock. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 13:01:25 +03:00
Benny Halevy	eb752c3f69	test: network_topology_strategy_test: use effective_replication_map to get_natural_endpoints Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 12:53:09 +03:00
Benny Halevy	a1c573e6d3	abstract_replication_strategy: make calculate_natural_endpoints_sync private And with that rename calculate_natural_endpoints(const token& search_token, const token_metadata&, can_yield) to do_calculate_natural_endpoints and make it protected, With this patch, all its external users call the async version, so rename it back to calculate_natural_endpoints, and make calculate_natural_endpoints_sync private since it's being called only within abstract_replication_strategy. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2021-10-13 12:39:36 +03:00
Eliran Sinvani	56981f2259	CQL test environment: Fix bad initialization order The service level controller was initialized with a data accessor that uses the system distributed keyspace before the later have been initialized. If there is a use of this accessor (for example by calling to: service_level_controller::get_distributed_service_levels()) if will fail miserably and crash. Not initializing the data accessor doesn't mean the same thing since we can deal with such call when the accessor is not initialized. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2021-10-12 13:27:59 +03:00
Nadav Har'El	e4bc97349c	cql-pytest: XFAILing test was fixed by a Python driver fix Issue #8203 describes a bug in a long scan which returns a lot of empty pages (e.g., because most of the results are filtered out). We have two cql-pytest test cases that reproduced this bug - one for a whole-table scan and one for a single-partition scan. It turned out that the bug was not in the Scylla server, but actually in the Python driver which incorrectly stopped the iteration after an empty page even though this page did contain the "more pages" flag. This driver bug was already fixed in the Datastax driver (see `6ed53d9f70`, and in the Scylla fork of the driver: `1d9077d3f4` So in this patch we drop the XFAIL, and if the driver is not new enough to contain this fix - the test is skipped. Since our Jenkins machines have the latest Scylla fork of the driver and it already contains this fix, these tests will not be skipped - and will run and should pass. Developers who run these tests on their development machine will see these tests either passing or skipped - depending on which version of the driver they have installed. Closes #8203 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211011113848.698935-1-nyh@scylladb.com>	2021-10-12 10:04:02 +02:00
Nadav Har'El	33f8ec09df	Merge 'treewide: improve compatibility with gcc 11' from Avi Kivity Our source base drifted away from gcc compatibility; this mostly restores the ability to build with gcc. An important exception is coroutines that have an initializer list [1]; this still doesn't work. We aim to switch back to gcc 11 if/when this gives us better C++ compatibility and performance. Test: unit (dev) [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98056 Closes #9459 * github.com:scylladb/scylla: test: radix_tree_printer: avoid template specialization in class context test: raft: avoid ignored variable errors test: reader_concurrency_semaphore_test: isolate from namespace of source_location test: cql_query_test: drop unused lambda assert_replication_not_contains test: commitlog_test: don't use deprecated seastar::unaligned_cast test: adjust signed/unsigned comparisons in loops and boost tests build: silence some gcc 11 warnings sstables: processing_result_generator: make coroutine support palatable for C++20 compilers managed_bytes: avoid compile-time loop in converting constructor service: service_level_controller: drop unused variable sl_compare raft: disambiguate promise name in raft::active_read locator: azure_snitch: use full type name in definition of globals cql3: statements: create_service_level_statement: don't ignore replace_defaults() cql3: statement_restrictions: adjust call to std::vector deduction guide types: remove recursive constraint in deserialize_value cql3: restrictions: relax constraint on visitor_with_binary_operator_content treewide: handle switch statements that return cql3: expr: correct type of captured map value_type cdc: adjust type of streams_count alternator: disambiguate attrs_to_get in table_requests	2021-10-11 16:54:01 +03:00
Pavel Emelyanov	99d8994835	storage_service: Remove view update generator from It's not used by storage service any longer. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-11 11:09:02 +03:00
Pavel Emelyanov	e106e0571a	distributed_loader: Fix methods visibility Most of the methods are marked public, but only few of them should. Test needs a bit more, however, so the distributed_loader_for_tests is declared as friend class. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-11 11:03:29 +03:00
Michał Radwański	771f3b12bd	memtable: enable native reversing This commit consists of changes, which need to reside in a single commit, so that the tests pass on each of the commits. 1. Remove do_make_flat_reader which disabled reverse reads by making the slice a forward one. Remove call to get_ranges which would do superfluous reversal of clustering ranges. 2. test: cql_query_test: remove expectation that the test_query_limit fails for reversed queries, since reversed queries no longer require linear memory wrt. the result size, when paginated.	2021-10-10 20:38:18 +02:00
Avi Kivity	ef45a208ef	test: radix_tree_printer: avoid template specialization in class context gcc complains that it's illegal. It's unnecessary too - we can replace it with a simple overload.	2021-10-10 18:17:53 +03:00
Avi Kivity	11cc772388	test: raft: avoid ignored variable errors Avoid instantiating unused variables, and in one case ignore it, to avoid a gcc warning.	2021-10-10 18:17:53 +03:00
Avi Kivity	cdb50b1972	test: reader_concurrency_semaphore_test: isolate from namespace of source_location More modern gcc uses std::source_location instead of std::experimental::source_location. Rely on seastar::compat to get it right for us.	2021-10-10 18:17:53 +03:00
Avi Kivity	a08bcc0528	test: cql_query_test: drop unused lambda assert_replication_not_contains gcc complains that it exists.	2021-10-10 18:17:53 +03:00
Avi Kivity	9166d1ab1d	test: commitlog_test: don't use deprecated seastar::unaligned_cast unaligned_cast is deprecated, and gcc complains that it violates strict aliasing rules. Switch to std::copy_n() instead.	2021-10-10 18:17:53 +03:00
Avi Kivity	9907303bf5	test: adjust signed/unsigned comparisons in loops and boost tests gcc complains about comparing a signed loop induction variable with an unsigned limit, or comparing an expected value and measured value. Fix by using unsigned throughout, except in one case where the signed value was needed for the data_value constructor.	2021-10-10 18:16:50 +03:00
Mikołaj Sielużycki	235c38e78f	sstables, gdb: Retire usage of sstable_tracker sstables_manager superseeds previous implementation of sstables_tracker for tracking lifetime of the tables. Update scylla-gdb.py to use sstables_manager in a backwards compatible way, as sstables_manager is not available in Scylla Enterprise 2020.1. Add explicit test for "scylla sstables" command, as previously only "scylla active-sstables" was tested. Closes #9439	2021-10-07 14:40:47 +02:00
Nadav Har'El	d1505762df	cql-pytest: add to README an example of repeating a test pytest supports - if the "repeat" extension is installed - a convenient and efficient way to repeat the same test (or all of them) multiple times. Since it's very useful, let's document it in cql-pytest/README.md. By the way, our test.py also has a "--repeat" option, but it can only run all cql-pytest tests, not just repeat a single small test, and it is also slower (and arguably, different) because it restarts Scylla instead of running a test 100 times on the same Scylla. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211007122146.624210-1-nyh@scylladb.com>	2021-10-07 15:30:41 +03:00
Kamil Braun	96f18c4bb0	test: test_sstable_reversing_reader_random_schema: fix the workaround for #9352 The test generates random mutations and eliminates mutations whose keys tokenize to 0, in particular it eliminates mutations with empty partition keys (which should not end up in sstables). However it would do that after using the randomly generated mutations to create their reversed versions. So the reversed versions of mutations with empty partition keys would stay. Fix by placing the workaround earlier in the test. Closes #9447	2021-10-07 14:01:43 +03:00
Benny Halevy	90fd4d5ed7	test: sstable_conforms_to_mutation_source_test: test_sstable_reversing_reader_random_schema: auto-close reader on exception I stumbled upon this failure in dev mode: ``` test/boost/sstable_conforms_to_mutation_source_test.cc(0): Entering test case "test_sstable_reversing_reader_random_schema" sstable_conforms_to_mutation_source_test: ./seastar/src/core/fstream.cc:205: virtual seastar::file_data_source_impl::~file_data_source_impl(): Assertion `_reads_in_progress == 0' failed. Aborting on shard 0. ``` Since dev mode has no debug symbols I can't decode the stack trace so I'm not 100% sure about the root cause and I couldn't reproduce it in release or debug modes yet. One vulnerability in the current code is that r1 won't be closed if an exception is thrown before r1 and r2 are moved to `compare_readers` so this change adds a deferred close of r1 in this case. Test: sstable_conforms_to_mutation_source_test(dev) Signed-off-by: Benny Halevy <bhalevy@scylladb.com> Message-Id: <20211006144009.696412-1-bhalevy@scylladb.com>	2021-10-06 17:53:49 +03:00
Nadav Har'El	0f8d3ea459	cql-pytest: translate Cassandra's tests for ORDER BY This is a translation of Cassandra's CQL unit test source file validation/operations/SelectOrderByTest.java into our our cql-pytest framework. This test file includes 17 tests for various features and corners of SELECT's "ORDER BY" feature. All these tests pass on Cassandra, but three fail on Scylla and are marked as xfail: One previously-unknown Scylla bug: Refs #9435: SELECT with IN, ORDER BY and function call does not obey the ORDER BY And two new reproducers for already known bugs: Refs #2247: ORDER BY should allow skipping equality-restricted clustering columns Refs #7751: Allow selecting map values and set elements, like in Cassandra 4.0 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211005174140.571056-1-nyh@scylladb.com>	2021-10-06 12:31:38 +03:00
Nadav Har'El	77bd4afda7	test/alternator: avoid client-side validation Ever since we started testing Alternator with tests written in Python and using Amazon's "boto3" library, one limitation kept annoying us: Boto3 verifies the validity of the request parameters before passing them on to the server. It verifies that mandatory parameters are not missing, that parameters have the right types, and sometimes even the right ranges - all in the library before ever sending the request. This meant that in many cases, we couldn't get good test coverage for Alternator's server-side handling of wrong parameters. As it turns out, it is trivial to tell boto3 to not do its client-side request validation, with the `parameter_validation=False` config flag. We just never noticed that such a flag existed :-) So this patch adds this flag. It then fixes a few tests which expected ParameterValidationError - this error is the client-side validation failure, but should now be replaced by checking the server-side error. The patch also adds a couple of invalid parameter checks that we couldn't do before because of boto3's eagerness to check them on the client side. We can add a lot more of these error tests in the future, now that we got rid of client-side validation. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211005095514.537226-1-nyh@scylladb.com>	2021-10-05 13:26:51 +02:00
Nadav Har'El	6dee86eade	test/alternator: another test for adding a GSI to an existing table This patch adds yet another test for Alternator's unimplemented feature of adding a GSI to an already existing table (issue #5022), but this test is for a very specific corner case - tables which contain string attributes with an empty value - the corner case described in issue #9424: DynamoDB used to forbid any string attributes from being set to an empty string, but this changed in May 2020, and since then empty strings are allowed - but NOT as keys. So although it is legal to set a string attribute to an empty string, if this table has a GSI whose key is that specific attribute, the update command is refused. We already had a test for this - test_gsi_empty_value. However, the case in this patch is the case where a GSI is added to a table after the table already has data. In this case (as this test demonstrates), we are supposed to drop the items which have the empty string key from the GSI. Even when #5022 (the ability to add GSIs to existing tables) will be done, this test will continue to fail. The unique problem of this test is that Scylla's materialized views do allow empty strings as clustering keys (right now) and even partition keys (after #9375 will be solved), while we don't want them to enter the GSI. We will probably need to add to the view's filter, which right now contains (as required) "x IS NOT NULL" also the filter "x != ''" (when x's type is a string or binary) so that items with empty-string keys will be dropped. Refs #5022 Refs #9375 Refs #9424 Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211003170636.477582-1-nyh@scylladb.com>	2021-10-05 13:26:43 +02:00
Nadav Har'El	b136104298	alternator/test: test for invalid numeric values DynamoDB has a rather baroque definition of numbers, and in particular it does not allow numeric attributes to be set to infinity or NaN. Although I did check invalid numbers in the past, manually, I was never able to write a unit test for this in the past - because the boto3 library catches such errors on the client side, and prevents the test from sending broken requests to the server. So in this patch, I finally came up with a solution - a context manager client_no_transform() which yields a client which does NOT do any transformation or validation on the request's parameters, allowing us to use boto3 to create improper requests - and test the server's handling of them. The test in this patch passes - it did not discover a new bug, but it is a useful regression test and the client_no_transform() trick can be used in more error-case tests which until now we were unable to write. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Message-Id: <20211004161809.520236-1-nyh@scylladb.com>	2021-10-05 13:13:45 +02:00
Avi Kivity	2d25705db0	cql3: deinline non-trivial methods in selection.hh This allows us to forward-declare raw_selector, which in turn reduces indirect inclusions of expression.hh from 147 to 58, reducing rebuilds when anything in that area changes. Includes that were lost due to the change are restored in individual translation units. Closes #9434	2021-10-05 12:58:55 +02:00
Avi Kivity	d3f8148807	utils: untie rjson.hh from base64.hh base64.hh pulls in the huge rjson.hh, so if someone just wants a base64 codec they have to pull in the entire rapidjson library. Move the json related parts of base64.hh to rjson.hh and adjust includes and namespaces. In practice it doesn't make much difference, as all users of base64 appear to want json too. But it's cleaner not to mix the two. Closes #9433	2021-10-05 12:57:54 +02:00
Avi Kivity	3a67c661d4	Merge "Improve parallelizm of mutation source tests" from Pavel E " There's a run_mutation_source_tests lib helper that runs a bunch of tests sequentially. The problem is that it does 4 different flavors of it each being a certain decoration over provided reader. This amplification makes some test cases run enormous amount of time without any chance for parallelizm. The simplest way to help running those cases in parallel is to teach the slowest cases to run different flavors of mutation source tests in dedicated cases. This patch makes it so. The resulting timings are dev debug sequential run: 2m1s 53m50s --parallel-cases (+ this patch): 1m3s 31m15s tests: unit(dev, debug) " * 'br-parallel-mutation-source-tests' of https://github.com/xemul/scylla: test: Split multishard combining reader case test: Split database test case test: Split run_mutation_source_tests	2021-10-05 12:22:52 +03:00
Kamil Braun	0c24c18d0c	test: cql_query_test: fix test_query_limit for reversed queries (Single-partition) reversed queries are no longer unlimited but some places still treat them as such. This causes, for example, shorter pages for such queries, which breaks a test that expects certain results to come in a single page.	2021-10-05 11:22:39 +02:00
Kamil Braun	c9a7778497	test: raft: randomized_nemesis_test: remove an obsolete comment	2021-10-05 11:04:11 +02:00
Kamil Braun	961f5a904c	test: raft: randomized_nemesis_test: handle missing snapshot in `rpc::send_snapshot` It's possible that the server drops the snapshot in the same iteration of `io_fiber` loop as it tries to send it (the sending of messages happens after snapshot dropping). Handle this case. Refs #9407.	2021-10-05 11:04:11 +02:00
Pavel Emelyanov	b742e6cbb6	test: Split multishard combining reader case All the cases in this test also run mutation source tests and the case with single-fragment buffer takes times more time to execute than the others. Splitting this single case so that it runs mutation source tests flavours in different cases improves the test parallelizm. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-05 11:57:02 +03:00
Pavel Emelyanov	30075094ac	test: Split database test case The test_database_with_data_in_sstables_is_a_mutation_source case runs the mutation source tests in one go. The problem is that on each step a whole new ks:cf is created which takes the majority of the tests time. In the end of the day this case is the slowest one in the suite being up to two times longer (depending on mode) than the #2 on this list. This patch splits the case into 4 so that each mutation source flavor is run in separate case. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2021-10-05 11:53:18 +03:00

1 2 3 4 5 ...

2376 Commits